Submit Search
Upload
Pentaho Data Integration Introduction
•
48 likes
•
32,603 views
M
mattcasters
Follow
A gentle and short introduction into Pentaho Data Integration a.k.a. Kettle
Read less
Read more
Technology
Report
Share
Report
Share
1 of 18
Recommended
Introduction To Pentaho
Introduction To Pentaho
DataminingTools Inc
Introduction To Pentaho
Introduction To Pentaho
pentaho Content
Kettle: Pentaho Data Integration tool
Kettle: Pentaho Data Integration tool
Alex Rayón Jerez
Pentaho
Pentaho
teza123
Pentaho etl-tool
Pentaho etl-tool
Sreenivas Kappala
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Edureka!
ETL
ETL
Mallikarjuna G D
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Roland Bouman
Recommended
Introduction To Pentaho
Introduction To Pentaho
DataminingTools Inc
Introduction To Pentaho
Introduction To Pentaho
pentaho Content
Kettle: Pentaho Data Integration tool
Kettle: Pentaho Data Integration tool
Alex Rayón Jerez
Pentaho
Pentaho
teza123
Pentaho etl-tool
Pentaho etl-tool
Sreenivas Kappala
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Edureka!
ETL
ETL
Mallikarjuna G D
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Roland Bouman
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
James Serra
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
Databricks
Data Lake Overview
Data Lake Overview
James Serra
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
Power BI Architecture
Power BI Architecture
Arthur Graus
Pentaho-BI
Pentaho-BI
Edureka!
Demystifying data engineering
Demystifying data engineering
Thang Bui (Bob)
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
Databricks
Future of Data Engineering
Future of Data Engineering
C4Media
Data Engineering Basics
Data Engineering Basics
Catherine Kimani
Intro to Delta Lake
Intro to Delta Lake
Databricks
ETL VS ELT.pdf
ETL VS ELT.pdf
BOSupport
Build Large-Scale Data Analytics and AI Pipeline Using RayDP
Build Large-Scale Data Analytics and AI Pipeline Using RayDP
Databricks
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
Databricks
Pentaho | Data Integration & Report designer
Pentaho | Data Integration & Report designer
Hamdi Hmidi
Introduction to ETL and Data Integration
Introduction to ETL and Data Integration
CloverDX (formerly known as CloverETL)
Building End-to-End Delta Pipelines on GCP
Building End-to-End Delta Pipelines on GCP
Databricks
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
Power BI visuals
Power BI visuals
Aldis Ērglis
Summary introduction to data engineering
Summary introduction to data engineering
Novita Sari
Pentaho data integration 4.0 and my sql
Pentaho data integration 4.0 and my sql
AHMED ENNAJI
5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake
MetroStar
More Related Content
What's hot
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
James Serra
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
Databricks
Data Lake Overview
Data Lake Overview
James Serra
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
Power BI Architecture
Power BI Architecture
Arthur Graus
Pentaho-BI
Pentaho-BI
Edureka!
Demystifying data engineering
Demystifying data engineering
Thang Bui (Bob)
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
Databricks
Future of Data Engineering
Future of Data Engineering
C4Media
Data Engineering Basics
Data Engineering Basics
Catherine Kimani
Intro to Delta Lake
Intro to Delta Lake
Databricks
ETL VS ELT.pdf
ETL VS ELT.pdf
BOSupport
Build Large-Scale Data Analytics and AI Pipeline Using RayDP
Build Large-Scale Data Analytics and AI Pipeline Using RayDP
Databricks
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
Databricks
Pentaho | Data Integration & Report designer
Pentaho | Data Integration & Report designer
Hamdi Hmidi
Introduction to ETL and Data Integration
Introduction to ETL and Data Integration
CloverDX (formerly known as CloverETL)
Building End-to-End Delta Pipelines on GCP
Building End-to-End Delta Pipelines on GCP
Databricks
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
Power BI visuals
Power BI visuals
Aldis Ērglis
Summary introduction to data engineering
Summary introduction to data engineering
Novita Sari
What's hot
(20)
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
Data Lake Overview
Data Lake Overview
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
Power BI Architecture
Power BI Architecture
Pentaho-BI
Pentaho-BI
Demystifying data engineering
Demystifying data engineering
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
Future of Data Engineering
Future of Data Engineering
Data Engineering Basics
Data Engineering Basics
Intro to Delta Lake
Intro to Delta Lake
ETL VS ELT.pdf
ETL VS ELT.pdf
Build Large-Scale Data Analytics and AI Pipeline Using RayDP
Build Large-Scale Data Analytics and AI Pipeline Using RayDP
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
Pentaho | Data Integration & Report designer
Pentaho | Data Integration & Report designer
Introduction to ETL and Data Integration
Introduction to ETL and Data Integration
Building End-to-End Delta Pipelines on GCP
Building End-to-End Delta Pipelines on GCP
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Power BI visuals
Power BI visuals
Summary introduction to data engineering
Summary introduction to data engineering
Similar to Pentaho Data Integration Introduction
Pentaho data integration 4.0 and my sql
Pentaho data integration 4.0 and my sql
AHMED ENNAJI
5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake
MetroStar
Big Data Session 1.pptx
Big Data Session 1.pptx
ElsonPaul2
Datalake Architecture
Datalake Architecture
TechYugadi IT Solutions & Consulting
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Yuanyuan Tian
Trivadis Azure Data Lake
Trivadis Azure Data Lake
Trivadis
Introduction Big Data
Introduction Big Data
Frank Kienle
INF2190_W1_2016_public
INF2190_W1_2016_public
Attila Barta
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Khalid Salama
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
James Serra
Building big data solutions on azure
Building big data solutions on azure
Eyal Ben Ivri
Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...
Jonathan Challener
Big data and oracle
Big data and oracle
Sourabh Saxena
Qo Introduction V2
Qo Introduction V2
Joe_F
Hd insight overview
Hd insight overview
vhrocca
Eclipse day Sydney 2014 BIG data presentation
Eclipse day Sydney 2014 BIG data presentation
Sai Paravastu
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
Moacyr Passador
An Overview of VIEW
An Overview of VIEW
Shiyong Lu
INFOGOV14 - Trusting Your KM & ECM Strategy to SharePoint
INFOGOV14 - Trusting Your KM & ECM Strategy to SharePoint
Jonathan Ralton
Modernizing Your Data Warehouse using APS
Modernizing Your Data Warehouse using APS
Stéphane Fréchette
Similar to Pentaho Data Integration Introduction
(20)
Pentaho data integration 4.0 and my sql
Pentaho data integration 4.0 and my sql
5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake
Big Data Session 1.pptx
Big Data Session 1.pptx
Datalake Architecture
Datalake Architecture
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Trivadis Azure Data Lake
Trivadis Azure Data Lake
Introduction Big Data
Introduction Big Data
INF2190_W1_2016_public
INF2190_W1_2016_public
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
Building big data solutions on azure
Building big data solutions on azure
Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...
Big data and oracle
Big data and oracle
Qo Introduction V2
Qo Introduction V2
Hd insight overview
Hd insight overview
Eclipse day Sydney 2014 BIG data presentation
Eclipse day Sydney 2014 BIG data presentation
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
An Overview of VIEW
An Overview of VIEW
INFOGOV14 - Trusting Your KM & ECM Strategy to SharePoint
INFOGOV14 - Trusting Your KM & ECM Strategy to SharePoint
Modernizing Your Data Warehouse using APS
Modernizing Your Data Warehouse using APS
Recently uploaded
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Miguel Araújo
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Neo4j
🐬 The future of MySQL is Postgres 🐘
🐬 The future of MySQL is Postgres 🐘
RTylerCroy
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
Results
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
Sinan KOZAK
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
Pooja Nehwal
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
wesley chun
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Drew Madelung
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Radu Cotescu
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Martijn de Jong
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
naman860154
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Katpro Technologies
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Anna Loughnan Colquhoun
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Allon Mureinik
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
V3cube
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
gurkirankumar98700
Recently uploaded
(20)
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
🐬 The future of MySQL is Postgres 🐘
🐬 The future of MySQL is Postgres 🐘
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Pentaho Data Integration Introduction
1.
2.
3.
Project manager
4.
5.
6.
650 pages
7.
Pentaho Data Integration
for BI Business Intelligence! That's what we do.
8.
Pentaho Data Integration
– Kettle K ettle E xtraction T ransportation T ransformation L oading E nvironment
9.
10.
11.
XML files
12.
XLS files
13.
Xbase files (dBase,
Foxpro, etc)
14.
File systems information
15.
Generated data
16.
MS Access files
17.
LDAP
18.
Geo-data
19.
...
20.
21.
22.
partitioning
23.
merging
24.
joining
25.
duplicating
26.
clustering (MPP)
27.
28.
files
29.
30.
31.
Mapping
32.
Selecting
33.
Filtering
34.
Pivotting ...
35.
36.
Data warehouse population
37.
Partitioned loading
38.
Bulk loading
39.
Parallel loading
40.
Clustering
41.
42.
Debugger
43.
44.
45.
46.
Plugin eco-system
47.
...
48.
49.
50.
All regions on
Earth
51.
Meet on our
Forum : +40,000 posts in 10,000 threads in 4 years
52.
Use our JIRA
case tracking systems
53.
Download more than
10,000 copies of Kettle per month http://www.ohloh.net/projects/3624?p=Kettle http://www.softpedia.com/progClean/Kettle-Clean-80094.html
54.
55.
Export data from
database to text-file or more other databases
56.
Data migration between
database applications
57.
Exploration of data
in existing databases (tables, views, etc.)
58.
Information improvement using
lookups
59.
Data cleaning
60.
Application integration
61.
Data warehouse population
62.
Application integration
63.
Report data generation
64.
...
65.
66.
67.
68.
Natural fit for
additional data sources, targets and transformations
69.
70.
Download free study
at pentaho.com
71.
72.
73.
From Tera-bytes to
Peta-bytes
74.
Big Data stored
in Hadoop (MapReduce) / HDFS / Hive
75.
Reduces complexity for
developers
76.
Leverages standard components
like Pentaho Data Integration
77.
Drag & drop
creation of map and reduce transformations
78.
Cooperation with Apache
79.
Presentation + Demo
: http://vimeo.com/14641559
80.
81.
Forum: http://forums.pentaho.org/forumdisplay.php?f=69
82.
Case tracker:
http://jira.pentaho.org/browse/PDI
83.
Continuous Integration Server:
http://ci.pentaho.com/job/Kettle
84.
Wiki :
http://wiki.pentaho.org/ display/EAI
85.
IRC Channel: ##pentaho
(on Freenode)
86.
Mailing list:
http://groups.google.com/group/kettle-developers
87.
My blog:
http://www.ibridge.be
88.
My coordinates: mcasters
at pentaho dot org
89.
Pentaho Books
90.