Pentaho Data Integration with Kettle
 


A course on Pentaho Data Integration with Kettle. A related course on Talend is available at http://www.slideshare.net/melphi_/talend-open-studio-data-integration




    Presentation Transcript

    • Pentaho Data Integration (Kettle)
    • PDI Overview (Kettle)
      ● An entry-level tool for data manipulation (ETL)
      ● PDI (Kettle) reads procedures stored in XML format
      ● Spoon is the graphical tool used to develop those procedures
      ● Procedures are designed by linking components
      ● Many data sources can be used: JDBC, files, web services
      ● JavaScript and Java support for complex routines
      www.robertomarchetto.com
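As a rough illustration of "procedures stored in XML format", the sketch below reads a tiny hand-written stand-in for a transformation file and lists its components and the links between them. The element names follow the general layout of a Kettle `.ktr` file, but the sample is heavily simplified and the step names are hypothetical; real files carry many more elements.

```python
import xml.etree.ElementTree as ET

# Simplified, hand-written stand-in for a transformation file (assumption:
# real .ktr files contain many more elements than the few shown here).
SAMPLE_KTR = """
<transformation>
  <info><name>users_dimension</name></info>
  <order>
    <hop><from>Query users</from><to>Write users_dimension</to><enabled>Y</enabled></hop>
  </order>
  <step><name>Query users</name><type>TableInput</type></step>
  <step><name>Write users_dimension</name><type>TableOutput</type></step>
</transformation>
"""


def summarize(ktr_xml):
    """Return the procedure name, its steps, and the links (hops) between steps."""
    root = ET.fromstring(ktr_xml)
    name = root.findtext("info/name")
    steps = [(s.findtext("name"), s.findtext("type")) for s in root.findall("step")]
    hops = [(h.findtext("from"), h.findtext("to")) for h in root.findall("order/hop")]
    return name, steps, hops


name, steps, hops = summarize(SAMPLE_KTR)
print(name)
for step_name, step_type in steps:
    print(f"  step: {step_name} ({step_type})")
for src, dst in hops:
    print(f"  hop: {src} -> {dst}")
```

Because the procedure is plain XML, it can be versioned, diffed, and inspected with ordinary tools, which is what makes running it outside Spoon possible.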
    • Development environment
    • Example: source database
    • Example: destination database
    • Schema comparison
    • Procedure users_dimension
      Query users:
      SELECT u.id, CONCAT(u.first_name, ' ', u.last_name) AS fullname, u.title
      FROM users u
      WHERE u.first_name IS NOT NULL AND u.last_name IS NOT NULL
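To see what the users query returns, here is a minimal sketch that runs it against an in-memory SQLite database. Several things are assumptions: SQLite stands in for the real source database, `CONCAT` is registered by hand to mimic MySQL-style behaviour (NULL if any argument is NULL), the separator is taken to be a single space, and the sample rows are invented for illustration.

```python
import sqlite3


def mysql_style_concat(*parts):
    """Mimic CONCAT as assumed here: NULL if any argument is NULL."""
    if any(p is None for p in parts):
        return None
    return "".join(str(p) for p in parts)


conn = sqlite3.connect(":memory:")
# Register CONCAT ourselves so the query runs on any SQLite version.
conn.create_function("CONCAT", -1, mysql_style_concat)

conn.execute("CREATE TABLE users (id INTEGER, first_name TEXT, last_name TEXT, title TEXT)")
# Hypothetical sample rows; two of them have a missing name part.
conn.executemany(
    "INSERT INTO users VALUES (?, ?, ?, ?)",
    [
        (1, "Ada", "Lovelace", "Analyst"),
        (2, None, "Smith", "Clerk"),
        (3, "Grace", "Hopper", "Admiral"),
        (4, "Linus", None, "Engineer"),
    ],
)

QUERY = """
SELECT u.id, CONCAT(u.first_name, ' ', u.last_name) AS fullname, u.title
FROM users u
WHERE u.first_name IS NOT NULL AND u.last_name IS NOT NULL
"""

rows = conn.execute(QUERY).fetchall()
for row in rows:
    print(row)
```

Only users with both name parts reach the dimension, so `fullname` is never built from a missing value.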
    • Testing
    • Procedure accounts_dimension
      Query accounts:
      SELECT a.id, a.name, a.industry, a.billing_address_postalcode,
             a.billing_address_city, a.billing_address_country
      FROM accounts a
    • Procedure opportunities_fact
      Query opportunities:
      SELECT o.id, o.date_entered, o.date_closed, o.assigned_user_id,
             o.sales_stage, o.name, o.amount
      FROM opportunities o
      WHERE o.sales_stage IN ('Closed Won', 'Closed Lost')
      ORDER BY o.id
    • Procedure dates_dimension
    • Collect procedures in a job
    • Using JNDI
      ● Edit the JNDI configuration in /simple-jndi/jdbc.properties or C:/Documents and Settings/<user>/.pentaho/simple-jndi/default.properties
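The slide does not show the file contents, so the following is only a sketch of what an entry might look like. It renders the `name/key=value` lines that simple-jndi property files use to describe one data source. The connection name, driver class, URL, and credentials are all placeholders chosen for illustration, not values from the course.

```python
def jndi_entries(name, driver, url, user, password):
    """Render the five simple-jndi properties that describe one data source."""
    fields = {
        "type": "javax.sql.DataSource",
        "driver": driver,
        "url": url,
        "user": user,
        "password": password,
    }
    return "\n".join(f"{name}/{key}={value}" for key, value in fields.items())


# Hypothetical connection: every value below is a placeholder.
snippet = jndi_entries(
    name="crm_source",
    driver="com.mysql.jdbc.Driver",
    url="jdbc:mysql://localhost:3306/crm",
    user="etl_user",
    password="change_me",
)
print(snippet)
```

Defining the connection once under a JNDI name lets the same procedure run against different databases by editing only the properties file.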
    • Running procedures
      ● Directly from Spoon
      ● From Pentaho BI Suite
      ● Using the command line (Kitchen, Pan):
        kitchen.bat /file:D:\Jobs\jobname.kjb /level:Basic
      ● In a clustered environment
      ● Using a web service (Carte)
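The command-line option can also be driven from a script. The sketch below builds the Windows-style Kitchen call from the slide as an argument list and, optionally, runs it. It assumes the job path on the slide was `D:\Jobs\jobname.kjb` (the backslashes appear to have been lost in transcription), that `kitchen.bat` is on the PATH, and that an exit code of 0 means the job succeeded.

```python
import subprocess


def kitchen_command(job_file, level="Basic", launcher="kitchen.bat"):
    """Build the argument list for the Windows-style Kitchen call on the slide."""
    return [launcher, f"/file:{job_file}", f"/level:{level}"]


def run_job(job_file, level="Basic"):
    """Run the job and return Kitchen's exit code (0 is assumed to mean success)."""
    return subprocess.run(kitchen_command(job_file, level)).returncode


# Only build and print the command here; call run_job() where PDI is installed.
cmd = kitchen_command(r"D:\Jobs\jobname.kjb")
print(" ".join(cmd))
```

Passing the arguments as a list rather than a single string avoids quoting problems when the job path contains spaces.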
    • Publishing on Pentaho
    • Running from Pentaho
    • Scheduling
      ● Using Pentaho's scheduler
      ● Using an external scheduler (cron)
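For the external-scheduler route, a crontab entry that calls Kitchen on a fixed schedule is one possible setup. The sketch below assembles such a line for a daily run. The installation directory, job path, and log path are hypothetical, and the `-file=` and `-level=` options are the Unix-style counterparts of the Windows options, as assumed here.

```python
def crontab_line(minute, hour, job_file, log_file,
                 kitchen="/opt/pentaho/data-integration/kitchen.sh"):
    """Build a crontab entry that runs a job every day at hour:minute."""
    schedule = f"{minute} {hour} * * *"
    command = f"{kitchen} -file={job_file} -level=Basic"
    # Append stdout and stderr to a log so failed runs can be diagnosed later.
    return f"{schedule} {command} >> {log_file} 2>&1"


# Hypothetical paths; adjust to the actual installation.
line = crontab_line(30, 2, "/opt/etl/jobs/jobname.kjb", "/var/log/etl/jobname.log")
print(line)
```

The printed line would be added with `crontab -e`; redirecting output to a log file matters because cron jobs have no console.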