DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

2,441 views
2,333 views

Published on

7. ETL Project by Himanshu Joshi, Sandhya Narayan and Greta Alvarez

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
2,441
On SlideShare
0
From Embeds
0
Number of Embeds
716
Actions
Shares
0
Downloads
29
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

  1. 1. ETL<br />Himanshu Joshi<br />Greta Alvarez<br />Sandhya Narayan<br />
  2. 2. What is ETL?<br />Extracting data from outside sources<br />Transforming it to fit operational needs<br />Loading it into the end target (DB)<br />
  3. 3. Extracting the Excel<br />Errors:<br />Nulll entry<br />Spelling errors<br />Space at the end of entries<br />Start at the end of entries<br />Duplicity<br />
  4. 4. Normalize/Transform??<br />GOAL:<br />Decompose relations with anomalies in order to produce smaller, well-structured relations. Involves dividing large tables into smaller (and less redundant) tables and defining relationships between them. <br />
  5. 5. Tools tried & USED<br />MS SQL SERVER MY SQL<br />Clover ETL Designer<br />Advanced ETL Processor<br />Pentaho DI<br />
  6. 6. Pentaho Data Integration <br />Power Extraction, Transformation and Loading (ETL) capabilities using an innovative, metadata-driven approach. With an intuitive, graphical, drag and drop design environment, and a proven, scalable, standards-based architecture.<br />http://kettle.pentaho.com/<br />
  7. 7. Why Pentaho Data Integration? <br />Open Source<br />ETL supported<br /> User Friendly<br />Easy to use<br />
  8. 8.
  9. 9. DEMO<br />
  10. 10. Don‘t try to reinvent the wheel!!Just Use it ;)<br />

×