Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)


Published on

7. ETL Project by Himanshu Joshi, Sandhya Narayan and Greta Alvarez

Published in: Technology
  • Be the first to comment

  • Be the first to like this

DMDW 7. Student Presentation - Pentaho Data Integration (Kettle)

  1. 1. ETL<br />Himanshu Joshi<br />Greta Alvarez<br />Sandhya Narayan<br />
  2. 2. What is ETL?<br />Extracting data from outside sources<br />Transforming it to fit operational needs<br />Loading it into the end target (DB)<br />
  3. 3. Extracting the Excel<br />Errors:<br />Nulll entry<br />Spelling errors<br />Space at the end of entries<br />Start at the end of entries<br />Duplicity<br />
  4. 4. Normalize/Transform??<br />GOAL:<br />Decompose relations with anomalies in order to produce smaller, well-structured relations. Involves dividing large tables into smaller (and less redundant) tables and defining relationships between them. <br />
  5. 5. Tools tried & USED<br />MS SQL SERVER MY SQL<br />Clover ETL Designer<br />Advanced ETL Processor<br />Pentaho DI<br />
  6. 6. Pentaho Data Integration <br />Power Extraction, Transformation and Loading (ETL) capabilities using an innovative, metadata-driven approach. With an intuitive, graphical, drag and drop design environment, and a proven, scalable, standards-based architecture.<br /><br />
  7. 7. Why Pentaho Data Integration? <br />Open Source<br />ETL supported<br /> User Friendly<br />Easy to use<br />
  8. 8.
  9. 9. DEMO<br />
  10. 10. Don‘t try to reinvent the wheel!!Just Use it ;)<br />