SlideShare a Scribd company logo
ETL Process in Data
Warehouse
Babjee Reddy
babjee@gmail.com
BI-Gems Technology
OLTP
 Objective of OLTP is to process data as
quickly as possible
 Support client Server technoligy
 Support Large Amount of Data
 Data is secure
OLAP/DSS/DWH
 Objective of OLAP database to process
data as Quickly as possible with less
complexity
 It is use for decision making purpose
 Used by Management People
Difference between OLTP, OLAP
Transactional oriented Analytical Oriented
Normalized tables De-normailzed tables
Used by clerical people Use by management people
Current data Historical data
Insert , Updates, deletes Bulk load
Select query retrieves less no of records Retrieves large no of records
ETL
---------------------------------------------------------------------------------------------------------------------
• This Section Covers
• What is ETL is
• Motivation
• Where to Use ETL
• House To Implement ETL
• Key ETL Aspect
Motivation
---------------------------------------------------------------------------------------------------------------------
• Is ETL is Interesting area?
70 to 80% BI(DI or DW) projects is reliable ETL process
• Let’s have a look on the DW & DI market size
• In 2003, DI was USD 9.3 billion market
• In 2008, DI was USD 13 billion market
• By 2015, yearly grow estimated to USD 20 Billon
• The more systems in the world, the more work in Data Integration!
What is ETL?
• ETL = Extract – Transform – Load
• Extract
• Get the data from source system as efficiently as
possible
• Transform
• Perform calculations on data
• Load
• Load the data in the target storage
Why is ETL (System) Important?
 Adds value to data
Removes mistakes and corrects data
Documented measures of confidence in data
Captures the flow of transactional data
Adjusts data from multiple sources to be used together (conforming)
Structures data to be usable by BI tools
Enables subsequent business / analytical data procesing
ETL Disambiguation
•ETL = Extract – Transform – Load
›Not tight specifically to DW anymore
•Process/System
›A complete process including
•Data extraction
•Enforcing DQ and consistency standards
•Conforming data from disparate systems
•Delivering data to target
•People, HW, Documentation, Support, etc.
•Tool
›A piece of software implementing the
•three (four) E-(C)-T-L steps.
•A tool designed specifically to perform data transformations
ETLProcess
ETL Tool:true Datata Intigration
ETL Data Integration Solutions
Where is ETL used?
How to implement ETL system
How to implement ETL
•Scripting (shell, perl, python)
•PL/SQL, sqlldr
•Transformation hardcoded in Java, C#
•Develop (universal) ETL tool in-house
•Using off-the-shelf ETL tool
ETL tool Key Feture
Extract, Load => flexible on interfaces
›Flat files, DBMS, XML data, XLS,
›MQ, web services, LDAP
›Semi-structured data (emails, web logs, wiki pages)
›Unstructured data (blogs, documents)
›Extensibility with custom connectors
›Local data, remote data FTP(S), SFTP, SCP, http(s)
•Clean
›Lookups, Validations, Filters, Translations
•Transform
›Changing data structure, Joins, (De)Normalization, Aggregation, RollUp,
Sorting, Partitioning, Data De-duplication
›Ability to call external tools
•Performance
›Symmetric Multiprocessing (SMP)
•Pipeline processing
•Multithreaded processing
›Massively Parallel Processing (MPP)
•Clustering
•MapReduce
›Load balancing
•User friendliness
›GUI
›Metadata capture
›Training time
•Development
›Reusable components
›Impact Analysis / Data Lineage
•Manageability
›Team collaboration
›Transformation repository
›Metadata repository
›Development process (Dev -> Test -> Prod)
›Security
•Runtime
›Scheduler Automation
›Recovery and Restart
›Workflow
•Others
›Vendor stability
›Release cycle
›Support
Well Known ETL Tools
•Commercial
›Ab Initio
›IBM DataStage
›Informatica PowerCenter
›Microsoft Data Integration Services
›Oracle Data Integrator
›SAP Business Objects – Data Integrator
›SAS Data Integration Studio
•Open-source based
›Adeptia Integration Suite
›Apatar
›CloverETL
›Pentaho Data Integration (Kettle)
›Talend Open Studio/Integration Suite

More Related Content

Similar to ETL (1).ppt

ETL Tools Ankita Dubey
ETL Tools Ankita DubeyETL Tools Ankita Dubey
ETL Tools Ankita Dubey
Ankita Dubey
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
RTTS
 
Designing modern dw and data lake
Designing modern dw and data lakeDesigning modern dw and data lake
Designing modern dw and data lake
punedevscom
 
Datastage ppt
Datastage pptDatastage ppt
Datastage ppt
Newyorksys.com
 
Creating a Project Plan for a Data Warehouse Testing Assignment
Creating a Project Plan for a Data Warehouse Testing AssignmentCreating a Project Plan for a Data Warehouse Testing Assignment
Creating a Project Plan for a Data Warehouse Testing Assignment
RTTS
 
Test labs 2016. Тестирование data warehouse
Test labs 2016. Тестирование data warehouse Test labs 2016. Тестирование data warehouse
Test labs 2016. Тестирование data warehouse
Sasha Soleev
 
Pentaho etl-tool
Pentaho etl-toolPentaho etl-tool
Pentaho etl-tool
Sreenivas Kappala
 
Capacity Management of an ETL System
Capacity Management of an ETL SystemCapacity Management of an ETL System
Capacity Management of an ETL System
ASHOK BHATLA
 
Capacity management for ETL System
Capacity management for ETL SystemCapacity management for ETL System
Capacity management for ETL System
ASHOK BHATLA
 
Extract, Transform and Load.pptx
Extract, Transform and Load.pptxExtract, Transform and Load.pptx
Extract, Transform and Load.pptx
JesusaEspeleta
 
ETL Process
ETL ProcessETL Process
ETL Process
Rohin Rangnekar
 
Big data analytics beyond beer and diapers
Big data analytics   beyond beer and diapersBig data analytics   beyond beer and diapers
Big data analytics beyond beer and diapers
Kai Zhao
 
What is ETL?
What is ETL?What is ETL?
What is ETL?
Ismail El Gayar
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
Zalpa Rathod
 
Building the DW - ETL
Building the DW - ETLBuilding the DW - ETL
Building the DW - ETL
ganblues
 
ETL DW-RealTime
ETL DW-RealTimeETL DW-RealTime
ETL DW-RealTime
Adriano Patrick Cunha
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
James Serra
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
Zalpa Rathod
 
Part 3 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 3 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Part 3 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 3 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Andreas Buckenhofer
 
Etl overview training
Etl overview trainingEtl overview training
Etl overview training
Mondy Holten
 

Similar to ETL (1).ppt (20)

ETL Tools Ankita Dubey
ETL Tools Ankita DubeyETL Tools Ankita Dubey
ETL Tools Ankita Dubey
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
Designing modern dw and data lake
Designing modern dw and data lakeDesigning modern dw and data lake
Designing modern dw and data lake
 
Datastage ppt
Datastage pptDatastage ppt
Datastage ppt
 
Creating a Project Plan for a Data Warehouse Testing Assignment
Creating a Project Plan for a Data Warehouse Testing AssignmentCreating a Project Plan for a Data Warehouse Testing Assignment
Creating a Project Plan for a Data Warehouse Testing Assignment
 
Test labs 2016. Тестирование data warehouse
Test labs 2016. Тестирование data warehouse Test labs 2016. Тестирование data warehouse
Test labs 2016. Тестирование data warehouse
 
Pentaho etl-tool
Pentaho etl-toolPentaho etl-tool
Pentaho etl-tool
 
Capacity Management of an ETL System
Capacity Management of an ETL SystemCapacity Management of an ETL System
Capacity Management of an ETL System
 
Capacity management for ETL System
Capacity management for ETL SystemCapacity management for ETL System
Capacity management for ETL System
 
Extract, Transform and Load.pptx
Extract, Transform and Load.pptxExtract, Transform and Load.pptx
Extract, Transform and Load.pptx
 
ETL Process
ETL ProcessETL Process
ETL Process
 
Big data analytics beyond beer and diapers
Big data analytics   beyond beer and diapersBig data analytics   beyond beer and diapers
Big data analytics beyond beer and diapers
 
What is ETL?
What is ETL?What is ETL?
What is ETL?
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
Building the DW - ETL
Building the DW - ETLBuilding the DW - ETL
Building the DW - ETL
 
ETL DW-RealTime
ETL DW-RealTimeETL DW-RealTime
ETL DW-RealTime
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
 
Part 3 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 3 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Part 3 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 3 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
 
Etl overview training
Etl overview trainingEtl overview training
Etl overview training
 

Recently uploaded

Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
imrankhan141184
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
Katrina Pritchard
 
Constructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective CommunicationConstructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective Communication
Chevonnese Chevers Whyte, MBA, B.Sc.
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
adhitya5119
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
mulvey2
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
Amin Marwan
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
Celine George
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
Colégio Santa Teresinha
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
PsychoTech Services
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
adhitya5119
 
Solutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptxSolutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptx
spdendr
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Fajar Baskoro
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
BoudhayanBhattachari
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
EduSkills OECD
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
Krassimira Luka
 

Recently uploaded (20)

Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
 
Constructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective CommunicationConstructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective Communication
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
 
Solutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptxSolutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptx
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 

ETL (1).ppt

  • 1. ETL Process in Data Warehouse Babjee Reddy babjee@gmail.com BI-Gems Technology
  • 2. OLTP  Objective of OLTP is to process data as quickly as possible  Support client Server technoligy  Support Large Amount of Data  Data is secure
  • 3. OLAP/DSS/DWH  Objective of OLAP database to process data as Quickly as possible with less complexity  It is use for decision making purpose  Used by Management People
  • 4. Difference between OLTP, OLAP Transactional oriented Analytical Oriented Normalized tables De-normailzed tables Used by clerical people Use by management people Current data Historical data Insert , Updates, deletes Bulk load Select query retrieves less no of records Retrieves large no of records
  • 5. ETL --------------------------------------------------------------------------------------------------------------------- • This Section Covers • What is ETL is • Motivation • Where to Use ETL • House To Implement ETL • Key ETL Aspect
  • 6. Motivation --------------------------------------------------------------------------------------------------------------------- • Is ETL is Interesting area? 70 to 80% BI(DI or DW) projects is reliable ETL process • Let’s have a look on the DW & DI market size • In 2003, DI was USD 9.3 billion market • In 2008, DI was USD 13 billion market • By 2015, yearly grow estimated to USD 20 Billon • The more systems in the world, the more work in Data Integration!
  • 7. What is ETL? • ETL = Extract – Transform – Load • Extract • Get the data from source system as efficiently as possible • Transform • Perform calculations on data • Load • Load the data in the target storage
  • 8. Why is ETL (System) Important?  Adds value to data Removes mistakes and corrects data Documented measures of confidence in data Captures the flow of transactional data Adjusts data from multiple sources to be used together (conforming) Structures data to be usable by BI tools Enables subsequent business / analytical data procesing
  • 9. ETL Disambiguation •ETL = Extract – Transform – Load ›Not tight specifically to DW anymore •Process/System ›A complete process including •Data extraction •Enforcing DQ and consistency standards •Conforming data from disparate systems •Delivering data to target •People, HW, Documentation, Support, etc. •Tool ›A piece of software implementing the •three (four) E-(C)-T-L steps. •A tool designed specifically to perform data transformations
  • 11. ETL Tool:true Datata Intigration
  • 12. ETL Data Integration Solutions
  • 13.
  • 14. Where is ETL used?
  • 15. How to implement ETL system
  • 16. How to implement ETL •Scripting (shell, perl, python) •PL/SQL, sqlldr •Transformation hardcoded in Java, C# •Develop (universal) ETL tool in-house •Using off-the-shelf ETL tool
  • 17. ETL tool Key Feture Extract, Load => flexible on interfaces ›Flat files, DBMS, XML data, XLS, ›MQ, web services, LDAP ›Semi-structured data (emails, web logs, wiki pages) ›Unstructured data (blogs, documents) ›Extensibility with custom connectors ›Local data, remote data FTP(S), SFTP, SCP, http(s) •Clean ›Lookups, Validations, Filters, Translations •Transform ›Changing data structure, Joins, (De)Normalization, Aggregation, RollUp, Sorting, Partitioning, Data De-duplication ›Ability to call external tools
  • 18. •Performance ›Symmetric Multiprocessing (SMP) •Pipeline processing •Multithreaded processing ›Massively Parallel Processing (MPP) •Clustering •MapReduce ›Load balancing •User friendliness ›GUI ›Metadata capture ›Training time •Development ›Reusable components ›Impact Analysis / Data Lineage
  • 19. •Manageability ›Team collaboration ›Transformation repository ›Metadata repository ›Development process (Dev -> Test -> Prod) ›Security •Runtime ›Scheduler Automation ›Recovery and Restart ›Workflow •Others ›Vendor stability ›Release cycle ›Support
  • 20.
  • 21. Well Known ETL Tools •Commercial ›Ab Initio ›IBM DataStage ›Informatica PowerCenter ›Microsoft Data Integration Services ›Oracle Data Integrator ›SAP Business Objects – Data Integrator ›SAS Data Integration Studio •Open-source based ›Adeptia Integration Suite ›Apatar ›CloverETL ›Pentaho Data Integration (Kettle) ›Talend Open Studio/Integration Suite