• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Pentaho Data Integration with Kettle
 

Pentaho Data Integration with Kettle

on

  • 3,236 views

A course on Pentaho Data Integration with Kettle. Another interesting course on Talend is on http://www.slideshare.net/melphi_/talend-open-studio-data-integration

A course on Pentaho Data Integration with Kettle. Another interesting course on Talend is on http://www.slideshare.net/melphi_/talend-open-studio-data-integration

Statistics

Views

Total Views
3,236
Views on SlideShare
2,767
Embed Views
469

Actions

Likes
1
Downloads
111
Comments
0

5 Embeds 469

http://www.robertomarchetto.com 464
http://www.linkedin.com 2
http://79.125.10.144 1
http://www.docshut.com 1
http://www.slashdocs.com 1

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Pentaho Data Integration with Kettle Pentaho Data Integration with Kettle Presentation Transcript

    • Pentaho Data Integration (Kettle)
    • PDI Overview (Kettle) ● An entry-level tool for data manipulation (ETL) ● PDI (Kettle) reads procedures stored in XML format ● Spoon is a graphical tool used to develop that procedures ● Procedures are designed linking components ● Many data sources can be used, JDBC, files, web services ● JavaScript and Java support for complex routineswww.robertomarchetto.com
    • Development enviromentwww.robertomarchetto.com
    • Example, Source databasewww.robertomarchetto.com
    • Example, destination databasewww.robertomarchetto.com
    • Schema comparisonwww.robertomarchetto.com
    • Procedure users_dimensionQuery users:SELECT u.id, CONCAT(u.first_name, , u.last_name) as fullname, u.titleFROM users uWHERE u.first_name is not null and u.last_name is not nullwww.robertomarchetto.com
    • Testingwww.robertomarchetto.com
    • Procedure accounts_dimensionQuery accounts:select a.id, a.name, a.industry, a.billing_address_postalcode,a.billing_address_city, a.billing_address_countryfrom accounts awww.robertomarchetto.com
    • Procedure opportunities_factQuery opportunities:SELECT o.id, o.date_entered, o.date_closed, o.assigned_user_id,o.sales_stage, o.name, o.amountFROM opportunities oWHERE o.sales_stage in (Closed Won, Closed Lost) ORDER BY o.idwww.robertomarchetto.com
    • Procedure dates_dimensionwww.robertomarchetto.com
    • Collect procedures in a jobwww.robertomarchetto.com
    • Using JNDI ● Edit JNDI /simple-jndi/jdbc.properties or C:/Documents and Settings/<user>/.pentaho/simple- jndi/default.propertieswww.robertomarchetto.com
    • Running procedures ● Directly from Spoon ● From Pentaho BI Suite ● Using command line (Kitchen, Pan) kitchen.bat /file:D:Jobsjobname.kjb /level:Basic ● In a clustered enviroment ● Using a web services (Carte)www.robertomarchetto.com
    • Publishing on Pentahowww.robertomarchetto.com
    • Running from Pentahowww.robertomarchetto.com
    • Scheduling ● Using Pentahos scheduler ● Using an external scheduler (cron)www.robertomarchetto.com