SlideShare a Scribd company logo
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
•      Use Cases
•      Conclusion




    Page 2 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
        • What do ETL tools do?

        • Why use an ETL tool?

•      ETL tools
•      Comparison
•      Use Cases
•      Conclusion




    Page 3 | 20 March 2008 | ETL Tools Comparison
What do ETLs tool do?


An ETL tool is a tool that:

•    Extracts data from various data sources (usually legacy)
•    Transforms data
       • from -> being optimized for transaction

         to -> being optimized for reporting and analysis
       • synchronizes the data coming from different databases
       • data cleanses to remove errors

•    Loads data into a data warehouse




    Page 4 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
        • What do ETL tools do?

        • Why use an ETL tool?

•      ETL tools
•      Comparison
•      Use Cases
•      Conclusion




    Page 5 | 20 March 2008 | ETL Tools Comparison
Why use an ETL tool?


 ETL tools save time and money when developing a data
 warehouse by removing the need for “hand-coding”.

 “Hand Coding” is still the most common way of integrating data
 today. It requires hours and hours of development and expertise
 to create a Business-Intelligence-System.

 It is very difficult for data base administrators to connect
 between different brands of databases without using an external
 tool.

 In the event that databases are altered or new databases need
 to be integrated, a lot of “hand-coded” work needs to be
 completely redone.




Page 6 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
         •     Pentaho Kettle
         •     Talend
         •     Informatica PowerCenter
         •     Inaplex Inaport
•      Comparison
•      Use Cases
•      Conclusion




    Page 7 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
         •     Pentaho Kettle
         •     Talend
         •     Informatica PowerCenter
         •     Inaplex Inaport
•      Comparison
•      Use Cases
•      Conclusion




    Page 8 | 20 March 2008 | ETL Tools Comparison
ETL Tools
    Pentaho Kettle

•    Pentaho is a commercial open-source BI suite that has a
     product called Kettle for data integration.
•    It uses an innovative meta-driven approach and has a strong
     and very easy-to-use GUI
•    The company started around 2001
•    It has a strong community of 13,500 registered users
•    It uses a stand-alone java engine that process the tasks for
     moving data between many different databases and files




    Page 9 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
         •     Pentaho Kettle
         •     Talend
         •     Informatica PowerCenter
         •     Inaplex Inaport
•      Comparison
•      Use Cases
•      Conclusion




    Page 10 | 20 March 2008 | ETL Tools Comparison
ETL Tools
Talend

•   Talend is an open-source data integration tool
•   It uses a code-generating approach and uses a GUI
    (implemented in Eclipse RC)
•   It started around October 2006
•   It has a much smaller community then Pentaho, but is
    supported by 2 finance companies
•   It generates Java code or Perl code which can later be run
    on a server




Page 11 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
         •     Pentaho Kettle
         •     Talend
         •     Informatica PowerCenter
         •     Inaplex Inaport
•      Comparison
•      Use Cases
•      Conclusion




    Page 12 | 20 March 2008 | ETL Tools Comparison
ETL Tools
    Informatica PowerCenter

•    Informatica has a very good commercial data integration suite
•    It was founded in 1993
•    It is the market share leader in data integration (Gartner
     Dataquest)
•    It has 2600 customers. Of those, there are fortune 100
     companies, companies listed on the Dow Jones and government
     organization
•    The company's sole focus is data integration
•    It has quite a big package for enterprises to integrate their
     systems, cleanse their data and can connect to a vast number of
     current and legacy systems




    Page 13 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
         •     Pentaho Kettle
         •     Talend
         •     Informatica PowerCenter
         •     Inaplex Inaport
•      Comparison
•      Use Cases
•      Conclusion




    Page 14 | 20 March 2008 | ETL Tools Comparison
ETL Tools
    Inaplex Inaport

•    Inaplex is a small UK company
•    InaPlex is a producer of Customer Data Integration products
     for mid-market CRM solutions
•    Inaplex mainly focuses on providing simple solutions for it’s
     customers to integrate their data into CRM and accounting
     software like Sage and Goldmine




    Page 15 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 16 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 17 | 20 March 2008 | ETL Tools Comparison
Comparison
ETL Tool Comparison Chart
                                                 Pentaho    Informatica   Inaplex
                                    Talend
                                                  Kettle   PowerCenter    Inaport

Cost

Risk

Ease of Use

Support

Deployment

Speed

Data Quality

Monitoring

Connectivity



Page 18 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 19 | 20 March 2008 | ETL Tools Comparison
Comparison
Total Cost of Ownership

Total Cost of Ownership means the
over all cost for a certain product.

This can mean initial ordering, licensing
servicing, support, training, consulting,
and any other additional payments that
need to be made before the product is
in full use.

 Commercial Open Source products are typically free to use, but the support,
 training and consulting are what companies need to pay for.




                                                                Pentaho    Informatica   Inaplex
                                                       Talend    Kettle   PowerCenter    Inaport

Page 20 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 21 | 20 March 2008 | ETL Tools Comparison
Comparison
Risk
There are always risks with projects, especially big projects.

The risks for projects failing are:
   • Going over budget
   • Going over schedule

   • Not completing the requirements or expectations of the customers




 Open Source products have much lower risk then Commercial ones since
 they do not restrict the use of their products by pricey licenses.




                                                                  Pentaho    Informatica   Inaplex
                                                         Talend
                                                                   Kettle   PowerCenter    Inaport


Page 22 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 23 | 20 March 2008 | ETL Tools Comparison
Comparison
Ease of Use

 All of the ETL tools, apart from Inaport, have GUI to simplify the
 development process. Having a good GUI also reduces the time to train
 and use the tools.

 Talend – Does have a GUI but is an add-on inside Eclipse RC.

 Pentaho Kettle – Has the most easy to use GUI out of all the tools.
 Training can also be found online or within the community.

 Informatica PowerCenter – Has an easy to use GUI, but requires some
 training to make full use of it.

 Inaplex Inaport – Does not have a “drag and drop” GUI.


                                                     Talend   Pentaho    Informatica   Inaplex
                                                               Kettle   PowerCenter    Inaport


Page 24 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 25 | 20 March 2008 | ETL Tools Comparison
Comparison
Support

 Nowadays, all software products have support and all of the ETL tool
 providers offer support.

 Talend – Offers support, but mainly resides in the US.

 Pentaho Kettle – Offers support from US, UK and has a partner consultant
 in Hong Kong.

 Informatica PowerCenter – Offers world-wide support.

 Inaplex Inaport – Offers support, but mainly resides in the UK.




Page 26 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 27 | 20 March 2008 | ETL Tools Comparison
Comparison
Deployment
 Talend – Creates a java file or perl file that can be run with an external
 scheduler on any machine with very little resource.
 Recommended one 1Ghz CPU and 512mbs ram

 Pentaho Kettle – Is a stand-alone java engine that can run on any machine
 that can run java. Needs an external scheduler to run automatically.
 It can be deployed on many different machines and used as “slave servers” to
 help with transformation processing.
 Recommended one 1Ghz CPU and 512mbs ram

 Informatica PowerCenter – Requires a server with platforms: Windows,
 Solaris, HP-UX, IBM-UX, Redhat, SUSE linux.
 Recommended to use two CPUs with 1Gb ram for Standard Edition Server

 Inaplex Inaport – Can run on any windows platform that has .NET 2.0 installed
 Recommended one CPU with 50mbs ram.

Page 28 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 29 | 20 March 2008 | ETL Tools Comparison
Comparison
Speed
 The speed of ETL tools depends largely on the data that needs to be
 transferred over the network and the processing power involved in
 transforming the data.

 Talend – Is slower then Pentaho. It requires manual tweaking and prior
 knowledge of the specific data source to reduce network traffic and
 processing.

 Pentaho Kettle – Is faster then Talend, but the Java-connector slows it
 down somewhat. Also requires manual tweaking like Talend. Can be
 clustered by placed on many machines to reduce network traffic.

 Informatica PowerCenter – Is the fastest tool. It has an advanced
 “PushDown” option that localizes transformation tasks depending on how
 busy the machine is.

 Inaplex Inaport – does not use any special techniques to improve speed.
                                                     Talend   Pentaho    Informatica   Inaplex
                                                               Kettle   PowerCenter    Inaport
Page 30 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 31 | 20 March 2008 | ETL Tools Comparison
Comparison
Data Quality

 Data Quality is fast becoming the most important feature in any data
 integration tool.

 Talend – has DQ features in its GUI, allows for customized SQL
 statements and by using Java.

 Pentaho – has DQ features in its GUI, allows for customized SQL
 statements, by using JavaScript and Regular Expressions. It also has some
 additional modules after subscribing.

 Informatica PowerCenter – does not have that many DQ features, but there
 is another product called Informatica Data Quality which has many DQ
 features.

 Inaplex Inaport – does have DQ features. Because of the very specific data
 that Inaport can integrate, it is relatively easy to clean that data.
                                                             Pentaho    Informatica   Inaplex
                                                    Talend    Kettle   PowerCenter    Inaport
Page 32 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 33 | 20 March 2008 | ETL Tools Comparison
Comparison
Monitoring

Monitoring allows to find problems and debug them during and after the
development stage.

Talend – has practical monitoring tools and logging.

Pentaho Kettle – has practical monitoring tools and logging.

Informatica PowerCenter – has extensive monitoring tools and logging.

Inaplex Inaport - has practical monitoring tools and logging.




                                                            Pentaho    Informatica   Inaplex
                                                   Talend    Kettle   PowerCenter    Inaport

Page 34 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
         •     ETL Tools Comparison Chart
         •     Total Cost of Ownership
         •     Risk
         •     Ease of Use
         •     Support
         •     Deployment
         •     Speed
         •     Data Quality
         •     Monitoring
         •     Connectivity
•      Use Cases
•      Conclusion
    Page 35 | 20 March 2008 | ETL Tools Comparison
Comparison
Connectivity

In most cases, ETL tools transfer data from legacy systems. Their connectivity
is very important to the usefulness of the ETL tools.

Talend – Can connect to all the current databases, flat files, xml files, excel
files and web services, but is reliant on Java drivers to connect to those data
sources.

Pentaho Kettle – Can connect to a very wide variety of databases, flat files,
xml files, excel files and web services.

Informatica PowerCenter – Can connect to a huge number of databases,
mainframes, flat files, excel files and web services. It can also export as a web
service.

Inaplex Inaport – Can connect to any ODBC (windows) connection. It usually
gets its data from current databases, outlook, ACT and excel files.
                                                                    Pentaho    Informatica   Inaplex
                                                           Talend    Kettle   PowerCenter    Inaport
Page 36 | 20 March 2008 | ETL Tools Comparison
Comparison
ETL Tool Comparison Chart
                                                 Pentaho    Informatica   Inaplex
                                    Talend
                                                  Kettle   PowerCenter    Inaport

Cost

Risk

Ease of Use

Support

Deployment

Speed

Data Quality

Monitoring

Connectivity



Page 37 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
•      Use Cases
         •     MySQL
         •     Loma Linda University Health Care
         •     BNSF Logistics
         •     U.S. Naval Air Systems Command
•      Conclusion




    Page 38 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
•      Use Cases
         •     MySQL
         •     Loma Linda University Health Care
         •     BNSF Logistics
         •     U.S. Naval Air Systems Command
•      Conclusion




    Page 39 | 20 March 2008 | ETL Tools Comparison
Use Cases
MySQL

 quot;We selected Pentaho for its ease-of-use. Pentaho addressed many of our
 requirements -- from reporting and analysis to dashboards, OLAP and ETL,
 and offered our business users the Excel-based access that they wanted.quot;

Key Challenges
• Reporting and analysis of operational expenses by department and cost
  center
• Multiple data sources including Microsoft Excel (for cost-center rollups)

Results
• Centralized view of spending by department
• Easy access to information from Excel

Why Pentaho
• Ease of use
• Breadth of solution

• Cost of ownership



Page 40 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
•      Use Cases
         •     MySQL
         •     Loma Linda University Health Care
         •     BNSF Logistics
         •     U.S. Naval Air Systems Command
•      Conclusion




    Page 41 | 20 March 2008 | ETL Tools Comparison
Use Cases
 Loma Linda University Health Care

 quot;Pentaho Customer Support has been exceptional. This is a strategic
 application at LLUHC, and working with Pentaho has accelerated our
 deployment and improved our overall application delivery.quot;

Key Challenges
• Providing analytics for billing and operations supporting 500,000 patients and
  600 doctors
Results
• Comprehensive analysis of time periods, services provided, billing groups,
  physicians
• Centralized, secured, consistent information delivery (versus prior Excel-
  based system)
• Ability to drill and analyze down to the individual patient level

Why Pentaho
• Open standards support and ease of integration

• Cost of ownership


 Page 42 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
•      Use Cases
         •     MySQL
         •     Loma Linda University Health Care
         •     BNSF Logistics
         •     U.S. Naval Air Systems Command
•      Conclusion




    Page 43 | 20 March 2008 | ETL Tools Comparison
Use Cases
 BNSF Logistics

 quot;Using Pentaho for our business intelligence platform, along with the expert
 support and knowledge provided by OpenBI, BNSF Logistics was able to
 implement our initial data warehouse with web-based reporting and analytics
 in just six weeks. Not only did we deliver a powerful business intelligence tool
 set for our organization in short order, but were able to do so at a fraction of
 the cost of proprietary alternatives.quot;

Key Challenges
• Cumbersome, manual process for creation and distribution of reports

• Inconsistent data accuracy because of semi-automated preparation processes

Results
• Initial data warehouse with web-based reporting and analytics in 6 weeks

• 75% lower acquisition costs, 50% lower ongoing ownership costs compared to
  proprietary BI
• Ability to monitor operational business health Why Pentaho
• Faster, better decisions in sales processes    Open standards support and
                                                 ease of integration
  Page 44 | 20 March 2008 | ETL Tools Comparison
                                                 Cost of ownership
Table of Contents


•      Introduction
•      ETL tools
•      Comparison
•      Use Cases
         •     MySQL
         •     Loma Linda University Health Care
         •     BNSF Logistics
         •     U.S. Naval Air Systems Command
•      Conclusion




    Page 45 | 20 March 2008 | ETL Tools Comparison
Use Cases
 U.S. Naval Air Systems Command
 quot;[Open technologies] reduce the cost of software development and they
 reduce the time in which innovations in software can be incorporated in
 systems. 'If the project is of a sufficient scale, you cannot get there without an
 open-source approach,' said Dewey Houck, a senior engineer at Boeing, who
 spoke at a conference last month about DOD's use of open source.quot;
 (Government Computer News, Jan. 2008)quot;

Key Challenges
• Analyzing flight data to reduce operational risk and improve training (human
  error is a causal factor in 70% of aviation mishaps)
Results
• Ability to leverage recorded electronic sensor data to reduce risk and improve
  crew performance
Why Pentaho
• Breadth of capabilities

• Proven success and large-scale referenceable deployments

• Successful proof-of-concept

• Dramatically lower costs
 Page 46 | 20 March 2008 | ETL Tools Comparison
Table of Contents


•      Introduction
•      What do ETL tools do?
•      Why use an ETL tools?
•      ETL tools
•      Comparison
•      Use Cases
•      Conclusion




    Page 47 | 20 March 2008 | ETL Tools Comparison
Conclusion


•   Informatica and Pentaho have very good products.

•   Informatica has a far more extensive range of products, but compared to
    Pentaho is very expensive.

•   Pentaho has proved that it can handle small to large scale systems.

•   Pentaho is gaining fast momentum with businesses that would not have
    considered using open source products before.




    Page 48 | 20 March 2008 | ETL Tools Comparison

More Related Content

What's hot

Databricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With DataDatabricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With Data
Databricks
 
Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data Integration
Roberto Marchetto
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
Databricks
 
Enterprise manager 13c
Enterprise manager 13cEnterprise manager 13c
Enterprise manager 13c
MarketingArrowECS_CZ
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
CloverDX (formerly known as CloverETL)
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
Dalibor Wijas
 
High Availability & Disaster Recovery on Oracle Cloud Infrastructure
High Availability & Disaster Recovery on Oracle Cloud InfrastructureHigh Availability & Disaster Recovery on Oracle Cloud Infrastructure
High Availability & Disaster Recovery on Oracle Cloud Infrastructure
SinanPetrusToma
 
OCI Overview
OCI OverviewOCI Overview
OCI Overview
Kamil Wieczorek
 
Finit one small step - tips and tricks for transitioning from fdm to fdmee
Finit   one small step - tips and tricks for transitioning from fdm to fdmeeFinit   one small step - tips and tricks for transitioning from fdm to fdmee
Finit one small step - tips and tricks for transitioning from fdm to fdmee
finitsolutions
 
dbt Python models - GoDataFest by Guillermo Sanchez
dbt Python models - GoDataFest by Guillermo Sanchezdbt Python models - GoDataFest by Guillermo Sanchez
dbt Python models - GoDataFest by Guillermo Sanchez
GoDataDriven
 
Snowflake: The most cost-effective agile and scalable data warehouse ever!
Snowflake: The most cost-effective agile and scalable data warehouse ever!Snowflake: The most cost-effective agile and scalable data warehouse ever!
Snowflake: The most cost-effective agile and scalable data warehouse ever!
Visual_BI
 
What is ETL?
What is ETL?What is ETL?
What is ETL?
Ismail El Gayar
 
1- Introduction of Azure data factory.pptx
1- Introduction of Azure data factory.pptx1- Introduction of Azure data factory.pptx
1- Introduction of Azure data factory.pptx
BRIJESH KUMAR
 
Informatica PowerCenter
Informatica PowerCenterInformatica PowerCenter
Informatica PowerCenter
Ramy Mahrous
 
Data pipelines observability: OpenLineage & Marquez
Data pipelines observability:  OpenLineage & MarquezData pipelines observability:  OpenLineage & Marquez
Data pipelines observability: OpenLineage & Marquez
Julien Le Dem
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
chennakesava44
 
ETL Testing Training Presentation
ETL Testing Training PresentationETL Testing Training Presentation
ETL Testing Training Presentation
Apurba Biswas
 
Oracle flashback
Oracle flashbackOracle flashback
Oracle flashback
Cambodia
 
Making Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse TechnologyMaking Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse Technology
Matei Zaharia
 
Snowflake Overview
Snowflake OverviewSnowflake Overview
Snowflake Overview
Snowflake Computing
 

What's hot (20)

Databricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With DataDatabricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With Data
 
Talend Open Studio Data Integration
Talend Open Studio Data IntegrationTalend Open Studio Data Integration
Talend Open Studio Data Integration
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 
Enterprise manager 13c
Enterprise manager 13cEnterprise manager 13c
Enterprise manager 13c
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
 
High Availability & Disaster Recovery on Oracle Cloud Infrastructure
High Availability & Disaster Recovery on Oracle Cloud InfrastructureHigh Availability & Disaster Recovery on Oracle Cloud Infrastructure
High Availability & Disaster Recovery on Oracle Cloud Infrastructure
 
OCI Overview
OCI OverviewOCI Overview
OCI Overview
 
Finit one small step - tips and tricks for transitioning from fdm to fdmee
Finit   one small step - tips and tricks for transitioning from fdm to fdmeeFinit   one small step - tips and tricks for transitioning from fdm to fdmee
Finit one small step - tips and tricks for transitioning from fdm to fdmee
 
dbt Python models - GoDataFest by Guillermo Sanchez
dbt Python models - GoDataFest by Guillermo Sanchezdbt Python models - GoDataFest by Guillermo Sanchez
dbt Python models - GoDataFest by Guillermo Sanchez
 
Snowflake: The most cost-effective agile and scalable data warehouse ever!
Snowflake: The most cost-effective agile and scalable data warehouse ever!Snowflake: The most cost-effective agile and scalable data warehouse ever!
Snowflake: The most cost-effective agile and scalable data warehouse ever!
 
What is ETL?
What is ETL?What is ETL?
What is ETL?
 
1- Introduction of Azure data factory.pptx
1- Introduction of Azure data factory.pptx1- Introduction of Azure data factory.pptx
1- Introduction of Azure data factory.pptx
 
Informatica PowerCenter
Informatica PowerCenterInformatica PowerCenter
Informatica PowerCenter
 
Data pipelines observability: OpenLineage & Marquez
Data pipelines observability:  OpenLineage & MarquezData pipelines observability:  OpenLineage & Marquez
Data pipelines observability: OpenLineage & Marquez
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
 
ETL Testing Training Presentation
ETL Testing Training PresentationETL Testing Training Presentation
ETL Testing Training Presentation
 
Oracle flashback
Oracle flashbackOracle flashback
Oracle flashback
 
Making Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse TechnologyMaking Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse Technology
 
Snowflake Overview
Snowflake OverviewSnowflake Overview
Snowflake Overview
 

Viewers also liked

ETL tool evaluation criteria
ETL tool evaluation criteriaETL tool evaluation criteria
ETL tool evaluation criteria
Asis Mohanty
 
Comparativa herramientas ETL
Comparativa herramientas ETLComparativa herramientas ETL
Comparativa herramientas ETL
Jorge Bustillos
 
ETL Market Webcast
ETL Market WebcastETL Market Webcast
ETL Market Webcast
mark madsen
 
Scaling MySQL Strategies for Developers
Scaling MySQL Strategies for DevelopersScaling MySQL Strategies for Developers
Scaling MySQL Strategies for Developers
Jonathan Levin
 
Cleanliness is next to Godliness
Cleanliness is next to GodlinessCleanliness is next to Godliness
Cleanliness is next to Godliness
Jonathan Levin
 
Informatica Capabilities As An ETL Tool
Informatica Capabilities As An ETL ToolInformatica Capabilities As An ETL Tool
Informatica Capabilities As An ETL Tool
Edureka!
 
Kettle – Etl Tool
Kettle – Etl ToolKettle – Etl Tool
Kettle – Etl Tool
Dr Anjan Krishnamurthy
 
L Aquila Earthquake
L  Aquila EarthquakeL  Aquila Earthquake
L Aquila Earthquake
Mr Cornish
 
Open Source ETL using Talend Open Studio
Open Source ETL using Talend Open StudioOpen Source ETL using Talend Open Studio
Open Source ETL using Talend Open Studio
santosluis87
 
Clase Patrimonio
Clase PatrimonioClase Patrimonio
Clase Patrimonio
guestc2c089
 
Etl extracción transformación y carga de datos
Etl extracción transformación y carga de datosEtl extracción transformación y carga de datos
Etl extracción transformación y carga de datos
Leonel Ibarra
 
Ukrainian pharmaceutical market
Ukrainian pharmaceutical marketUkrainian pharmaceutical market
Ukrainian pharmaceutical market
vyazyonova
 
Alimento artesanal para aves
Alimento artesanal para avesAlimento artesanal para aves
Alimento artesanal para aves
armando
 
Eeg wave pattern
Eeg wave patternEeg wave pattern
Eeg wave pattern
Roopchand Ps
 
raj Textile project
raj Textile projectraj Textile project
raj Textile project
rajendra vasavani
 
Time Management
Time ManagementTime Management
Time Management
John Oluwagbemiga
 
Pharma Plan Presentation Powerpoint
Pharma Plan Presentation PowerpointPharma Plan Presentation Powerpoint
Pharma Plan Presentation Powerpoint
waschmaschine
 
My research proposal.ppt
My research proposal.pptMy research proposal.ppt
My research proposal.ppt
nanimamat
 
Tea-presentation-1
Tea-presentation-1Tea-presentation-1
Tea-presentation-1
mattbdes
 
Dw product comparison
Dw product comparisonDw product comparison
Dw product comparison
Shantanu Gokhale
 

Viewers also liked (20)

ETL tool evaluation criteria
ETL tool evaluation criteriaETL tool evaluation criteria
ETL tool evaluation criteria
 
Comparativa herramientas ETL
Comparativa herramientas ETLComparativa herramientas ETL
Comparativa herramientas ETL
 
ETL Market Webcast
ETL Market WebcastETL Market Webcast
ETL Market Webcast
 
Scaling MySQL Strategies for Developers
Scaling MySQL Strategies for DevelopersScaling MySQL Strategies for Developers
Scaling MySQL Strategies for Developers
 
Cleanliness is next to Godliness
Cleanliness is next to GodlinessCleanliness is next to Godliness
Cleanliness is next to Godliness
 
Informatica Capabilities As An ETL Tool
Informatica Capabilities As An ETL ToolInformatica Capabilities As An ETL Tool
Informatica Capabilities As An ETL Tool
 
Kettle – Etl Tool
Kettle – Etl ToolKettle – Etl Tool
Kettle – Etl Tool
 
L Aquila Earthquake
L  Aquila EarthquakeL  Aquila Earthquake
L Aquila Earthquake
 
Open Source ETL using Talend Open Studio
Open Source ETL using Talend Open StudioOpen Source ETL using Talend Open Studio
Open Source ETL using Talend Open Studio
 
Clase Patrimonio
Clase PatrimonioClase Patrimonio
Clase Patrimonio
 
Etl extracción transformación y carga de datos
Etl extracción transformación y carga de datosEtl extracción transformación y carga de datos
Etl extracción transformación y carga de datos
 
Ukrainian pharmaceutical market
Ukrainian pharmaceutical marketUkrainian pharmaceutical market
Ukrainian pharmaceutical market
 
Alimento artesanal para aves
Alimento artesanal para avesAlimento artesanal para aves
Alimento artesanal para aves
 
Eeg wave pattern
Eeg wave patternEeg wave pattern
Eeg wave pattern
 
raj Textile project
raj Textile projectraj Textile project
raj Textile project
 
Time Management
Time ManagementTime Management
Time Management
 
Pharma Plan Presentation Powerpoint
Pharma Plan Presentation PowerpointPharma Plan Presentation Powerpoint
Pharma Plan Presentation Powerpoint
 
My research proposal.ppt
My research proposal.pptMy research proposal.ppt
My research proposal.ppt
 
Tea-presentation-1
Tea-presentation-1Tea-presentation-1
Tea-presentation-1
 
Dw product comparison
Dw product comparisonDw product comparison
Dw product comparison
 

Similar to Open Source ETL vs Commercial ETL

Kettle: Pentaho Data Integration tool
Kettle: Pentaho Data Integration toolKettle: Pentaho Data Integration tool
Kettle: Pentaho Data Integration tool
Alex Rayón Jerez
 
Big data analytics beyond beer and diapers
Big data analytics   beyond beer and diapersBig data analytics   beyond beer and diapers
Big data analytics beyond beer and diapers
Kai Zhao
 
Revolutionising Storage for your Future Business Requirements
Revolutionising Storage for your Future Business RequirementsRevolutionising Storage for your Future Business Requirements
Revolutionising Storage for your Future Business Requirements
NetApp
 
Pentaho Data Integration: Extrayendo, integrando, normalizando y preparando m...
Pentaho Data Integration: Extrayendo, integrando, normalizando y preparando m...Pentaho Data Integration: Extrayendo, integrando, normalizando y preparando m...
Pentaho Data Integration: Extrayendo, integrando, normalizando y preparando m...
Alex Rayón Jerez
 
DevOps monitoring: Feedback loops in enterprise environments
DevOps monitoring: Feedback loops in enterprise environmentsDevOps monitoring: Feedback loops in enterprise environments
DevOps monitoring: Feedback loops in enterprise environments
Jonah Kowall
 
Pentaho ppt up
Pentaho ppt upPentaho ppt up
Pentaho ppt up
03446940736
 
Deep Dive: More Oracle Data Pump Performance Tips and Tricks
Deep Dive: More Oracle Data Pump Performance Tips and TricksDeep Dive: More Oracle Data Pump Performance Tips and Tricks
Deep Dive: More Oracle Data Pump Performance Tips and Tricks
Guatemala User Group
 
ETL with WSO2 Enterprise Middleware Platform
ETL with WSO2 Enterprise Middleware Platform ETL with WSO2 Enterprise Middleware Platform
ETL with WSO2 Enterprise Middleware Platform
WSO2
 
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Hortonworks
 
Accelerating AI Adoption with Partners
Accelerating AI Adoption with PartnersAccelerating AI Adoption with Partners
Accelerating AI Adoption with Partners
Sri Ambati
 
Integrated Business Intelligence and Data Warehouse
Integrated Business Intelligence and Data WarehouseIntegrated Business Intelligence and Data Warehouse
Integrated Business Intelligence and Data Warehouse
Arie Sutiarso
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
Hortonworks
 
Kettleetltool 090522005630-phpapp01
Kettleetltool 090522005630-phpapp01Kettleetltool 090522005630-phpapp01
Kettleetltool 090522005630-phpapp01
jade_22
 
IdealNet Data Integration ETL vs Cloud
IdealNet Data Integration ETL vs CloudIdealNet Data Integration ETL vs Cloud
IdealNet Data Integration ETL vs Cloud
cbiddle2
 
World Domination with Pentaho EE?
World Domination with Pentaho EE?World Domination with Pentaho EE?
World Domination with Pentaho EE?
Jos van Dongen
 
Machine Learning - Eine Challenge für Architekten
Machine Learning - Eine Challenge für ArchitektenMachine Learning - Eine Challenge für Architekten
Machine Learning - Eine Challenge für Architekten
Harald Erb
 
Intel-altoweb-caseStudy
Intel-altoweb-caseStudyIntel-altoweb-caseStudy
TopConf : DevOps Monitoring: Feedback Loops in Enterprise Environments
TopConf : DevOps Monitoring: Feedback Loops in Enterprise EnvironmentsTopConf : DevOps Monitoring: Feedback Loops in Enterprise Environments
TopConf : DevOps Monitoring: Feedback Loops in Enterprise Environments
Jonah Kowall
 
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
dclsocialmedia
 
How to Become a Tableau Certified Professional?
How to Become a Tableau Certified Professional?How to Become a Tableau Certified Professional?
How to Become a Tableau Certified Professional?
Intellipaat
 

Similar to Open Source ETL vs Commercial ETL (20)

Kettle: Pentaho Data Integration tool
Kettle: Pentaho Data Integration toolKettle: Pentaho Data Integration tool
Kettle: Pentaho Data Integration tool
 
Big data analytics beyond beer and diapers
Big data analytics   beyond beer and diapersBig data analytics   beyond beer and diapers
Big data analytics beyond beer and diapers
 
Revolutionising Storage for your Future Business Requirements
Revolutionising Storage for your Future Business RequirementsRevolutionising Storage for your Future Business Requirements
Revolutionising Storage for your Future Business Requirements
 
Pentaho Data Integration: Extrayendo, integrando, normalizando y preparando m...
Pentaho Data Integration: Extrayendo, integrando, normalizando y preparando m...Pentaho Data Integration: Extrayendo, integrando, normalizando y preparando m...
Pentaho Data Integration: Extrayendo, integrando, normalizando y preparando m...
 
DevOps monitoring: Feedback loops in enterprise environments
DevOps monitoring: Feedback loops in enterprise environmentsDevOps monitoring: Feedback loops in enterprise environments
DevOps monitoring: Feedback loops in enterprise environments
 
Pentaho ppt up
Pentaho ppt upPentaho ppt up
Pentaho ppt up
 
Deep Dive: More Oracle Data Pump Performance Tips and Tricks
Deep Dive: More Oracle Data Pump Performance Tips and TricksDeep Dive: More Oracle Data Pump Performance Tips and Tricks
Deep Dive: More Oracle Data Pump Performance Tips and Tricks
 
ETL with WSO2 Enterprise Middleware Platform
ETL with WSO2 Enterprise Middleware Platform ETL with WSO2 Enterprise Middleware Platform
ETL with WSO2 Enterprise Middleware Platform
 
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
 
Accelerating AI Adoption with Partners
Accelerating AI Adoption with PartnersAccelerating AI Adoption with Partners
Accelerating AI Adoption with Partners
 
Integrated Business Intelligence and Data Warehouse
Integrated Business Intelligence and Data WarehouseIntegrated Business Intelligence and Data Warehouse
Integrated Business Intelligence and Data Warehouse
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
Kettleetltool 090522005630-phpapp01
Kettleetltool 090522005630-phpapp01Kettleetltool 090522005630-phpapp01
Kettleetltool 090522005630-phpapp01
 
IdealNet Data Integration ETL vs Cloud
IdealNet Data Integration ETL vs CloudIdealNet Data Integration ETL vs Cloud
IdealNet Data Integration ETL vs Cloud
 
World Domination with Pentaho EE?
World Domination with Pentaho EE?World Domination with Pentaho EE?
World Domination with Pentaho EE?
 
Machine Learning - Eine Challenge für Architekten
Machine Learning - Eine Challenge für ArchitektenMachine Learning - Eine Challenge für Architekten
Machine Learning - Eine Challenge für Architekten
 
Intel-altoweb-caseStudy
Intel-altoweb-caseStudyIntel-altoweb-caseStudy
Intel-altoweb-caseStudy
 
TopConf : DevOps Monitoring: Feedback Loops in Enterprise Environments
TopConf : DevOps Monitoring: Feedback Loops in Enterprise EnvironmentsTopConf : DevOps Monitoring: Feedback Loops in Enterprise Environments
TopConf : DevOps Monitoring: Feedback Loops in Enterprise Environments
 
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
 
How to Become a Tableau Certified Professional?
How to Become a Tableau Certified Professional?How to Become a Tableau Certified Professional?
How to Become a Tableau Certified Professional?
 

Recently uploaded

Recent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS InfrastructureRecent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS Infrastructure
KAMAL CHOUDHARY
 
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
Priyanka Aash
 
How to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdfHow to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdf
ChristopherTHyatt
 
Evolution of iPaaS - simplify IT workloads to provide a unified view of data...
Evolution of iPaaS - simplify IT workloads to provide a unified view of  data...Evolution of iPaaS - simplify IT workloads to provide a unified view of  data...
Evolution of iPaaS - simplify IT workloads to provide a unified view of data...
Torry Harris
 
EuroPython 2024 - Streamlining Testing in a Large Python Codebase
EuroPython 2024 - Streamlining Testing in a Large Python CodebaseEuroPython 2024 - Streamlining Testing in a Large Python Codebase
EuroPython 2024 - Streamlining Testing in a Large Python Codebase
Jimmy Lai
 
Salesforce AI & Einstein Copilot Workshop
Salesforce AI & Einstein Copilot WorkshopSalesforce AI & Einstein Copilot Workshop
Salesforce AI & Einstein Copilot Workshop
CEPTES Software Inc
 
How Social Media Hackers Help You to See Your Wife's Message.pdf
How Social Media Hackers Help You to See Your Wife's Message.pdfHow Social Media Hackers Help You to See Your Wife's Message.pdf
How Social Media Hackers Help You to See Your Wife's Message.pdf
HackersList
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
Emerging Tech
 
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdfAcumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
BrainSell Technologies
 
leewayhertz.com-AI agents for healthcare Applications benefits and implementa...
leewayhertz.com-AI agents for healthcare Applications benefits and implementa...leewayhertz.com-AI agents for healthcare Applications benefits and implementa...
leewayhertz.com-AI agents for healthcare Applications benefits and implementa...
alexjohnson7307
 
Using LLM Agents with Llama 3, LangGraph and Milvus
Using LLM Agents with Llama 3, LangGraph and MilvusUsing LLM Agents with Llama 3, LangGraph and Milvus
Using LLM Agents with Llama 3, LangGraph and Milvus
Zilliz
 
Dublin_mulesoft_meetup_Mulesoft_Salesforce_Integration (1).pptx
Dublin_mulesoft_meetup_Mulesoft_Salesforce_Integration (1).pptxDublin_mulesoft_meetup_Mulesoft_Salesforce_Integration (1).pptx
Dublin_mulesoft_meetup_Mulesoft_Salesforce_Integration (1).pptx
Kunal Gupta
 
WPRiders Company Presentation Slide Deck
WPRiders Company Presentation Slide DeckWPRiders Company Presentation Slide Deck
WPRiders Company Presentation Slide Deck
Lidia A.
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
Feature sql server terbaru performance.pptx
Feature sql server terbaru performance.pptxFeature sql server terbaru performance.pptx
Feature sql server terbaru performance.pptx
ssuser1915fe1
 
July Patch Tuesday
July Patch TuesdayJuly Patch Tuesday
July Patch Tuesday
Ivanti
 
Calgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptxCalgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptx
ishalveerrandhawa1
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
aslasdfmkhan4750
 
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - MydbopsScaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Mydbops
 
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
bhumivarma35300
 

Recently uploaded (20)

Recent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS InfrastructureRecent Advancements in the NIST-JARVIS Infrastructure
Recent Advancements in the NIST-JARVIS Infrastructure
 
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
(CISOPlatform Summit & SACON 2024) Keynote _ Power Digital Identities With AI...
 
How to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdfHow to build a generative AI solution A step-by-step guide (2).pdf
How to build a generative AI solution A step-by-step guide (2).pdf
 
Evolution of iPaaS - simplify IT workloads to provide a unified view of data...
Evolution of iPaaS - simplify IT workloads to provide a unified view of  data...Evolution of iPaaS - simplify IT workloads to provide a unified view of  data...
Evolution of iPaaS - simplify IT workloads to provide a unified view of data...
 
EuroPython 2024 - Streamlining Testing in a Large Python Codebase
EuroPython 2024 - Streamlining Testing in a Large Python CodebaseEuroPython 2024 - Streamlining Testing in a Large Python Codebase
EuroPython 2024 - Streamlining Testing in a Large Python Codebase
 
Salesforce AI & Einstein Copilot Workshop
Salesforce AI & Einstein Copilot WorkshopSalesforce AI & Einstein Copilot Workshop
Salesforce AI & Einstein Copilot Workshop
 
How Social Media Hackers Help You to See Your Wife's Message.pdf
How Social Media Hackers Help You to See Your Wife's Message.pdfHow Social Media Hackers Help You to See Your Wife's Message.pdf
How Social Media Hackers Help You to See Your Wife's Message.pdf
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
 
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdfAcumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
Acumatica vs. Sage Intacct vs. NetSuite _ NOW CFO.pdf
 
leewayhertz.com-AI agents for healthcare Applications benefits and implementa...
leewayhertz.com-AI agents for healthcare Applications benefits and implementa...leewayhertz.com-AI agents for healthcare Applications benefits and implementa...
leewayhertz.com-AI agents for healthcare Applications benefits and implementa...
 
Using LLM Agents with Llama 3, LangGraph and Milvus
Using LLM Agents with Llama 3, LangGraph and MilvusUsing LLM Agents with Llama 3, LangGraph and Milvus
Using LLM Agents with Llama 3, LangGraph and Milvus
 
Dublin_mulesoft_meetup_Mulesoft_Salesforce_Integration (1).pptx
Dublin_mulesoft_meetup_Mulesoft_Salesforce_Integration (1).pptxDublin_mulesoft_meetup_Mulesoft_Salesforce_Integration (1).pptx
Dublin_mulesoft_meetup_Mulesoft_Salesforce_Integration (1).pptx
 
WPRiders Company Presentation Slide Deck
WPRiders Company Presentation Slide DeckWPRiders Company Presentation Slide Deck
WPRiders Company Presentation Slide Deck
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
 
Feature sql server terbaru performance.pptx
Feature sql server terbaru performance.pptxFeature sql server terbaru performance.pptx
Feature sql server terbaru performance.pptx
 
July Patch Tuesday
July Patch TuesdayJuly Patch Tuesday
July Patch Tuesday
 
Calgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptxCalgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptx
 
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
High Profile Girls Call ServiCe Hyderabad 0000000000 Tanisha Best High Class ...
 
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - MydbopsScaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
 
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
High Profile Girls call Service Pune 000XX00000 Provide Best And Top Girl Ser...
 

Open Source ETL vs Commercial ETL

  • 1. Table of Contents • Introduction • ETL tools • Comparison • Use Cases • Conclusion Page 2 | 20 March 2008 | ETL Tools Comparison
  • 2. Table of Contents • Introduction • What do ETL tools do? • Why use an ETL tool? • ETL tools • Comparison • Use Cases • Conclusion Page 3 | 20 March 2008 | ETL Tools Comparison
  • 3. What do ETLs tool do? An ETL tool is a tool that: • Extracts data from various data sources (usually legacy) • Transforms data • from -> being optimized for transaction to -> being optimized for reporting and analysis • synchronizes the data coming from different databases • data cleanses to remove errors • Loads data into a data warehouse Page 4 | 20 March 2008 | ETL Tools Comparison
  • 4. Table of Contents • Introduction • What do ETL tools do? • Why use an ETL tool? • ETL tools • Comparison • Use Cases • Conclusion Page 5 | 20 March 2008 | ETL Tools Comparison
  • 5. Why use an ETL tool? ETL tools save time and money when developing a data warehouse by removing the need for “hand-coding”. “Hand Coding” is still the most common way of integrating data today. It requires hours and hours of development and expertise to create a Business-Intelligence-System. It is very difficult for data base administrators to connect between different brands of databases without using an external tool. In the event that databases are altered or new databases need to be integrated, a lot of “hand-coded” work needs to be completely redone. Page 6 | 20 March 2008 | ETL Tools Comparison
  • 6. Table of Contents • Introduction • ETL tools • Pentaho Kettle • Talend • Informatica PowerCenter • Inaplex Inaport • Comparison • Use Cases • Conclusion Page 7 | 20 March 2008 | ETL Tools Comparison
  • 7. Table of Contents • Introduction • ETL tools • Pentaho Kettle • Talend • Informatica PowerCenter • Inaplex Inaport • Comparison • Use Cases • Conclusion Page 8 | 20 March 2008 | ETL Tools Comparison
  • 8. ETL Tools Pentaho Kettle • Pentaho is a commercial open-source BI suite that has a product called Kettle for data integration. • It uses an innovative meta-driven approach and has a strong and very easy-to-use GUI • The company started around 2001 • It has a strong community of 13,500 registered users • It uses a stand-alone java engine that process the tasks for moving data between many different databases and files Page 9 | 20 March 2008 | ETL Tools Comparison
  • 9. Table of Contents • Introduction • ETL tools • Pentaho Kettle • Talend • Informatica PowerCenter • Inaplex Inaport • Comparison • Use Cases • Conclusion Page 10 | 20 March 2008 | ETL Tools Comparison
  • 10. ETL Tools Talend • Talend is an open-source data integration tool • It uses a code-generating approach and uses a GUI (implemented in Eclipse RC) • It started around October 2006 • It has a much smaller community then Pentaho, but is supported by 2 finance companies • It generates Java code or Perl code which can later be run on a server Page 11 | 20 March 2008 | ETL Tools Comparison
  • 11. Table of Contents • Introduction • ETL tools • Pentaho Kettle • Talend • Informatica PowerCenter • Inaplex Inaport • Comparison • Use Cases • Conclusion Page 12 | 20 March 2008 | ETL Tools Comparison
  • 12. ETL Tools Informatica PowerCenter • Informatica has a very good commercial data integration suite • It was founded in 1993 • It is the market share leader in data integration (Gartner Dataquest) • It has 2600 customers. Of those, there are fortune 100 companies, companies listed on the Dow Jones and government organization • The company's sole focus is data integration • It has quite a big package for enterprises to integrate their systems, cleanse their data and can connect to a vast number of current and legacy systems Page 13 | 20 March 2008 | ETL Tools Comparison
  • 13. Table of Contents • Introduction • ETL tools • Pentaho Kettle • Talend • Informatica PowerCenter • Inaplex Inaport • Comparison • Use Cases • Conclusion Page 14 | 20 March 2008 | ETL Tools Comparison
  • 14. ETL Tools Inaplex Inaport • Inaplex is a small UK company • InaPlex is a producer of Customer Data Integration products for mid-market CRM solutions • Inaplex mainly focuses on providing simple solutions for it’s customers to integrate their data into CRM and accounting software like Sage and Goldmine Page 15 | 20 March 2008 | ETL Tools Comparison
  • 15. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 16 | 20 March 2008 | ETL Tools Comparison
  • 16. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 17 | 20 March 2008 | ETL Tools Comparison
  • 17. Comparison ETL Tool Comparison Chart Pentaho Informatica Inaplex Talend Kettle PowerCenter Inaport Cost Risk Ease of Use Support Deployment Speed Data Quality Monitoring Connectivity Page 18 | 20 March 2008 | ETL Tools Comparison
  • 18. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 19 | 20 March 2008 | ETL Tools Comparison
  • 19. Comparison Total Cost of Ownership Total Cost of Ownership means the over all cost for a certain product. This can mean initial ordering, licensing servicing, support, training, consulting, and any other additional payments that need to be made before the product is in full use. Commercial Open Source products are typically free to use, but the support, training and consulting are what companies need to pay for. Pentaho Informatica Inaplex Talend Kettle PowerCenter Inaport Page 20 | 20 March 2008 | ETL Tools Comparison
  • 20. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 21 | 20 March 2008 | ETL Tools Comparison
  • 21. Comparison Risk There are always risks with projects, especially big projects. The risks for projects failing are: • Going over budget • Going over schedule • Not completing the requirements or expectations of the customers Open Source products have much lower risk then Commercial ones since they do not restrict the use of their products by pricey licenses. Pentaho Informatica Inaplex Talend Kettle PowerCenter Inaport Page 22 | 20 March 2008 | ETL Tools Comparison
  • 22. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 23 | 20 March 2008 | ETL Tools Comparison
  • 23. Comparison Ease of Use All of the ETL tools, apart from Inaport, have GUI to simplify the development process. Having a good GUI also reduces the time to train and use the tools. Talend – Does have a GUI but is an add-on inside Eclipse RC. Pentaho Kettle – Has the most easy to use GUI out of all the tools. Training can also be found online or within the community. Informatica PowerCenter – Has an easy to use GUI, but requires some training to make full use of it. Inaplex Inaport – Does not have a “drag and drop” GUI. Talend Pentaho Informatica Inaplex Kettle PowerCenter Inaport Page 24 | 20 March 2008 | ETL Tools Comparison
  • 24. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 25 | 20 March 2008 | ETL Tools Comparison
  • 25. Comparison Support Nowadays, all software products have support and all of the ETL tool providers offer support. Talend – Offers support, but mainly resides in the US. Pentaho Kettle – Offers support from US, UK and has a partner consultant in Hong Kong. Informatica PowerCenter – Offers world-wide support. Inaplex Inaport – Offers support, but mainly resides in the UK. Page 26 | 20 March 2008 | ETL Tools Comparison
  • 26. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 27 | 20 March 2008 | ETL Tools Comparison
  • 27. Comparison Deployment Talend – Creates a java file or perl file that can be run with an external scheduler on any machine with very little resource. Recommended one 1Ghz CPU and 512mbs ram Pentaho Kettle – Is a stand-alone java engine that can run on any machine that can run java. Needs an external scheduler to run automatically. It can be deployed on many different machines and used as “slave servers” to help with transformation processing. Recommended one 1Ghz CPU and 512mbs ram Informatica PowerCenter – Requires a server with platforms: Windows, Solaris, HP-UX, IBM-UX, Redhat, SUSE linux. Recommended to use two CPUs with 1Gb ram for Standard Edition Server Inaplex Inaport – Can run on any windows platform that has .NET 2.0 installed Recommended one CPU with 50mbs ram. Page 28 | 20 March 2008 | ETL Tools Comparison
  • 28. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 29 | 20 March 2008 | ETL Tools Comparison
  • 29. Comparison Speed The speed of ETL tools depends largely on the data that needs to be transferred over the network and the processing power involved in transforming the data. Talend – Is slower then Pentaho. It requires manual tweaking and prior knowledge of the specific data source to reduce network traffic and processing. Pentaho Kettle – Is faster then Talend, but the Java-connector slows it down somewhat. Also requires manual tweaking like Talend. Can be clustered by placed on many machines to reduce network traffic. Informatica PowerCenter – Is the fastest tool. It has an advanced “PushDown” option that localizes transformation tasks depending on how busy the machine is. Inaplex Inaport – does not use any special techniques to improve speed. Talend Pentaho Informatica Inaplex Kettle PowerCenter Inaport Page 30 | 20 March 2008 | ETL Tools Comparison
  • 30. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 31 | 20 March 2008 | ETL Tools Comparison
  • 31. Comparison Data Quality Data Quality is fast becoming the most important feature in any data integration tool. Talend – has DQ features in its GUI, allows for customized SQL statements and by using Java. Pentaho – has DQ features in its GUI, allows for customized SQL statements, by using JavaScript and Regular Expressions. It also has some additional modules after subscribing. Informatica PowerCenter – does not have that many DQ features, but there is another product called Informatica Data Quality which has many DQ features. Inaplex Inaport – does have DQ features. Because of the very specific data that Inaport can integrate, it is relatively easy to clean that data. Pentaho Informatica Inaplex Talend Kettle PowerCenter Inaport Page 32 | 20 March 2008 | ETL Tools Comparison
  • 32. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 33 | 20 March 2008 | ETL Tools Comparison
  • 33. Comparison Monitoring Monitoring allows to find problems and debug them during and after the development stage. Talend – has practical monitoring tools and logging. Pentaho Kettle – has practical monitoring tools and logging. Informatica PowerCenter – has extensive monitoring tools and logging. Inaplex Inaport - has practical monitoring tools and logging. Pentaho Informatica Inaplex Talend Kettle PowerCenter Inaport Page 34 | 20 March 2008 | ETL Tools Comparison
  • 34. Table of Contents • Introduction • ETL tools • Comparison • ETL Tools Comparison Chart • Total Cost of Ownership • Risk • Ease of Use • Support • Deployment • Speed • Data Quality • Monitoring • Connectivity • Use Cases • Conclusion Page 35 | 20 March 2008 | ETL Tools Comparison
  • 35. Comparison Connectivity In most cases, ETL tools transfer data from legacy systems. Their connectivity is very important to the usefulness of the ETL tools. Talend – Can connect to all the current databases, flat files, xml files, excel files and web services, but is reliant on Java drivers to connect to those data sources. Pentaho Kettle – Can connect to a very wide variety of databases, flat files, xml files, excel files and web services. Informatica PowerCenter – Can connect to a huge number of databases, mainframes, flat files, excel files and web services. It can also export as a web service. Inaplex Inaport – Can connect to any ODBC (windows) connection. It usually gets its data from current databases, outlook, ACT and excel files. Pentaho Informatica Inaplex Talend Kettle PowerCenter Inaport Page 36 | 20 March 2008 | ETL Tools Comparison
  • 36. Comparison ETL Tool Comparison Chart Pentaho Informatica Inaplex Talend Kettle PowerCenter Inaport Cost Risk Ease of Use Support Deployment Speed Data Quality Monitoring Connectivity Page 37 | 20 March 2008 | ETL Tools Comparison
  • 37. Table of Contents • Introduction • ETL tools • Comparison • Use Cases • MySQL • Loma Linda University Health Care • BNSF Logistics • U.S. Naval Air Systems Command • Conclusion Page 38 | 20 March 2008 | ETL Tools Comparison
  • 38. Table of Contents • Introduction • ETL tools • Comparison • Use Cases • MySQL • Loma Linda University Health Care • BNSF Logistics • U.S. Naval Air Systems Command • Conclusion Page 39 | 20 March 2008 | ETL Tools Comparison
  • 39. Use Cases MySQL quot;We selected Pentaho for its ease-of-use. Pentaho addressed many of our requirements -- from reporting and analysis to dashboards, OLAP and ETL, and offered our business users the Excel-based access that they wanted.quot; Key Challenges • Reporting and analysis of operational expenses by department and cost center • Multiple data sources including Microsoft Excel (for cost-center rollups) Results • Centralized view of spending by department • Easy access to information from Excel Why Pentaho • Ease of use • Breadth of solution • Cost of ownership Page 40 | 20 March 2008 | ETL Tools Comparison
  • 40. Table of Contents • Introduction • ETL tools • Comparison • Use Cases • MySQL • Loma Linda University Health Care • BNSF Logistics • U.S. Naval Air Systems Command • Conclusion Page 41 | 20 March 2008 | ETL Tools Comparison
  • 41. Use Cases Loma Linda University Health Care quot;Pentaho Customer Support has been exceptional. This is a strategic application at LLUHC, and working with Pentaho has accelerated our deployment and improved our overall application delivery.quot; Key Challenges • Providing analytics for billing and operations supporting 500,000 patients and 600 doctors Results • Comprehensive analysis of time periods, services provided, billing groups, physicians • Centralized, secured, consistent information delivery (versus prior Excel- based system) • Ability to drill and analyze down to the individual patient level Why Pentaho • Open standards support and ease of integration • Cost of ownership Page 42 | 20 March 2008 | ETL Tools Comparison
  • 42. Table of Contents • Introduction • ETL tools • Comparison • Use Cases • MySQL • Loma Linda University Health Care • BNSF Logistics • U.S. Naval Air Systems Command • Conclusion Page 43 | 20 March 2008 | ETL Tools Comparison
  • 43. Use Cases BNSF Logistics quot;Using Pentaho for our business intelligence platform, along with the expert support and knowledge provided by OpenBI, BNSF Logistics was able to implement our initial data warehouse with web-based reporting and analytics in just six weeks. Not only did we deliver a powerful business intelligence tool set for our organization in short order, but were able to do so at a fraction of the cost of proprietary alternatives.quot; Key Challenges • Cumbersome, manual process for creation and distribution of reports • Inconsistent data accuracy because of semi-automated preparation processes Results • Initial data warehouse with web-based reporting and analytics in 6 weeks • 75% lower acquisition costs, 50% lower ongoing ownership costs compared to proprietary BI • Ability to monitor operational business health Why Pentaho • Faster, better decisions in sales processes Open standards support and ease of integration Page 44 | 20 March 2008 | ETL Tools Comparison Cost of ownership
  • 44. Table of Contents • Introduction • ETL tools • Comparison • Use Cases • MySQL • Loma Linda University Health Care • BNSF Logistics • U.S. Naval Air Systems Command • Conclusion Page 45 | 20 March 2008 | ETL Tools Comparison
  • 45. Use Cases U.S. Naval Air Systems Command quot;[Open technologies] reduce the cost of software development and they reduce the time in which innovations in software can be incorporated in systems. 'If the project is of a sufficient scale, you cannot get there without an open-source approach,' said Dewey Houck, a senior engineer at Boeing, who spoke at a conference last month about DOD's use of open source.quot; (Government Computer News, Jan. 2008)quot; Key Challenges • Analyzing flight data to reduce operational risk and improve training (human error is a causal factor in 70% of aviation mishaps) Results • Ability to leverage recorded electronic sensor data to reduce risk and improve crew performance Why Pentaho • Breadth of capabilities • Proven success and large-scale referenceable deployments • Successful proof-of-concept • Dramatically lower costs Page 46 | 20 March 2008 | ETL Tools Comparison
  • 46. Table of Contents • Introduction • What do ETL tools do? • Why use an ETL tools? • ETL tools • Comparison • Use Cases • Conclusion Page 47 | 20 March 2008 | ETL Tools Comparison
  • 47. Conclusion • Informatica and Pentaho have very good products. • Informatica has a far more extensive range of products, but compared to Pentaho is very expensive. • Pentaho has proved that it can handle small to large scale systems. • Pentaho is gaining fast momentum with businesses that would not have considered using open source products before. Page 48 | 20 March 2008 | ETL Tools Comparison