Pentaho Data Integration Introduction

A short, gentle introduction to Pentaho Data Integration, a.k.a. Kettle

    Presentation Transcript

      • Pentaho Introduction
        Matt Casters
    • Matt Casters
      • Chief of Data Integration at Pentaho
        • Lead Development
        • Project manager
        • Community liaison
      • Kettle Project Founder
      • Author of Pentaho Kettle Solutions
        • Published by Wiley
        • 650 pages
    • Pentaho Data Integration for BI
      • Business Intelligence! That's what we do.
    • Pentaho Data Integration – Kettle
      • Kettle
      • Extraction
      • Transportation
      • Transformation
      • Loading
      • Environment
    • Pentaho Data Integration – Extraction
      • Extract data from:
        • 35+ database types
          • MySQL, PostgreSQL, SQLite, ...
          • Oracle, SQL Server, etc.
        • Text files
        • XML files
        • XLS files
        • Xbase files (dBase, FoxPro, etc.)
        • File system information
        • Generated data
        • MS Access files
        • LDAP
        • Geo-data
        • ...
    • Pentaho Data Integration – Transportation
      • Transportation of data
        • Engine based data transfer (no code generator)
        • Very flexible pathways:
          • splitting
          • partitioning
          • merging
          • joining
          • duplicating
          • clustering (MPP)
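
The pathways listed above can be pictured as rows flowing from one step to the next. The following is a minimal plain-Python sketch of that idea, not Kettle code: the function names (`generate_rows`, `split_round_robin`, `partition_by`, `merge`) are hypothetical and exist only for illustration.

```python
from itertools import cycle


def generate_rows(n):
    """Source step: emit n rows as dictionaries."""
    for i in range(n):
        yield {"id": i, "value": i * 10}


def split_round_robin(rows, copies):
    """Splitting: hand incoming rows to `copies` outputs in turn."""
    outputs = [[] for _ in range(copies)]
    for target, row in zip(cycle(outputs), rows):
        target.append(row)
    return outputs


def partition_by(rows, key, partitions):
    """Partitioning: route rows by key so equal keys land in the same output."""
    outputs = [[] for _ in range(partitions)]
    for row in rows:
        outputs[row[key] % partitions].append(row)
    return outputs


def merge(*streams):
    """Merging: append several row streams into a single stream."""
    for stream in streams:
        yield from stream


if __name__ == "__main__":
    first, second = split_round_robin(generate_rows(5), 2)
    print([row["id"] for row in merge(first, second)])
```

Each function stands in for one step; an engine-based tool wires such steps together at run time instead of generating code for them.
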
    • Pentaho Data Integration – Transformation
      • Flexibly transform data
        • Looking up data
          • databases
          • files
          • memory...
        • Calculating
        • Scripting
          • JavaScript, SQL, RegExp
        • Splitting
        • Mapping
        • Selecting
        • Filtering
        • Pivoting ...
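
To make a few of these transformation types concrete, here is a small plain-Python sketch that chains a lookup, a calculation, a filter and a field selection over a stream of rows. It illustrates the concepts only and is not how Kettle implements them; the function names, the sample rows and the country table are all assumptions made up for the example.

```python
def lookup(rows, table, key, new_field, default=None):
    """Looking up: add a field by looking up row[key] in an in-memory table."""
    for row in rows:
        yield {**row, new_field: table.get(row[key], default)}


def calculate(rows, new_field, func):
    """Calculating: add a field computed from the other fields of the row."""
    for row in rows:
        yield {**row, new_field: func(row)}


def filter_rows(rows, predicate):
    """Filtering: keep only the rows for which the predicate holds."""
    return (row for row in rows if predicate(row))


def select(rows, fields):
    """Selecting: keep only the named fields, in the given order."""
    for row in rows:
        yield {field: row[field] for field in fields}


def run_example():
    # Hypothetical sample data, invented for this sketch.
    orders = [
        {"order_id": 1, "country_code": "BE", "quantity": 3, "unit_price": 2.5},
        {"order_id": 2, "country_code": "US", "quantity": 1, "unit_price": 10.0},
        {"order_id": 3, "country_code": "XX", "quantity": 4, "unit_price": 1.0},
    ]
    countries = {"BE": "Belgium", "US": "United States"}

    stream = lookup(orders, countries, "country_code", "country")
    stream = calculate(stream, "total", lambda r: r["quantity"] * r["unit_price"])
    stream = filter_rows(stream, lambda r: r["country"] is not None)
    stream = select(stream, ["order_id", "country", "total"])
    return list(stream)


if __name__ == "__main__":
    for row in run_example():
        print(row)
```

The order with the unknown country code is dropped by the filter, since its lookup returns the default value.
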
    • Pentaho Data Integration – Loading
      • Load data into a target format
        • Database loads
        • Data warehouse population
        • Partitioned loading
        • Bulk loading
        • Parallel loading
        • Clustering
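
As a rough illustration of loading rows in batches, the sketch below writes rows to an SQLite table with Python's standard `sqlite3` module, committing once per batch. It is a generic example, not Kettle's loader: the `sales` table, its columns and the batch size are assumptions chosen for the example.

```python
import sqlite3
from itertools import islice


def batches(rows, size):
    """Group an iterable of rows into lists of at most `size` rows."""
    iterator = iter(rows)
    while True:
        chunk = list(islice(iterator, size))
        if not chunk:
            return
        yield chunk


def load_rows(conn, rows, batch_size=1000):
    """Insert (id, amount) rows into a hypothetical `sales` table, batch by batch."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (id INTEGER PRIMARY KEY, amount REAL)"
    )
    loaded = 0
    for chunk in batches(rows, batch_size):
        with conn:  # commits the batch, or rolls it back if an error occurs
            conn.executemany(
                "INSERT INTO sales (id, amount) VALUES (?, ?)", chunk
            )
        loaded += len(chunk)
    return loaded


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    count = load_rows(connection, ((i, i * 1.5) for i in range(2500)))
    print(count, "rows loaded")
```

Committing per batch rather than per row is a common way to speed up loads; true bulk loaders usually go further and use database-specific load utilities.
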
    • Pentaho Data Integration – Environment
      • Full GUI called “Spoon” to edit every option in Kettle
        • Drag & Drop
        • Debugger
        • Rich GUI
      • Command line tools
        • execute jobs
        • execute transformations
      • Web server
        • clustering
        • remote execution
      • Programming API for Java
      • Plugin eco-system
      • ...
    • Pentaho Data Integration – Community
      • Paying Pentaho customers
      • Large and small corporations
        • All possible sectors
      • Lone rangers & hobbyists
      • All regions on Earth
      • Meet on our forum: 40,000+ posts in 10,000 threads in 4 years
      • Use our JIRA case tracking system
      • Download more than 10,000 copies of Kettle per month
      http://www.ohloh.net/projects/3624?p=Kettle
      http://www.softpedia.com/progClean/Kettle-Clean-80094.html
    • Pentaho Data Integration – use-cases
      • Load data from text files and store it in a database
      • Export data from a database to text files or to other databases
      • Data migration between database applications
      • Exploration of data in existing databases (tables, views, etc.)
      • Information improvement using lookups
      • Data cleaning
      • Application integration
      • Data warehouse population
      • Report data generation
      • ...
    • Pentaho Data Integration – Adoption
      • Wide range of production deployments
        • Small and medium-sized companies
        • Large enterprises
      • Rapid product evolution
        • Driven by Pentaho investment
        • Includes significant community contributions
          • “Contribution-friendly” architecture
          • Natural fit for additional data sources, targets and transformations
    • Pentaho Data Integration – Adoption
      • Most deployed open source data integration solution. Independent study by Mark Madsen of Third Nature and the BeyeNETWORK
      • Download free study at pentaho.com
      • Big Data
    • Pentaho – Big Data
      • Enabling BI on top of big data
      • From terabytes to petabytes
      • Big Data stored in Hadoop (MapReduce) / HDFS / Hive
      • Reduces complexity for developers
      • Leverages standard components like Pentaho Data Integration
      • Drag & drop creation of map and reduce transformations
      • Cooperation with Apache
      • Presentation + Demo: http://vimeo.com/14641559
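
For readers new to the map and reduce pattern mentioned above, here is a tiny in-memory word count in plain Python. It only sketches the pattern; it does not use Hadoop or Kettle, and the function names are hypothetical.

```python
from collections import defaultdict


def map_step(line):
    """Map: emit a (word, 1) pair for every word in a line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group the emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_step(key, values):
    """Reduce: collapse all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    pairs = (pair for line in lines for pair in map_step(line))
    return dict(reduce_step(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big Kettle"]))
```

On a real cluster the map and reduce steps run in parallel across many nodes, and the grouping between them is handled by the framework.
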
    • Pentaho Data Integration – Links
      • Homepage: http://kettle.pentaho.org
      • Forum: http://forums.pentaho.org/forumdisplay.php?f=69
      • Case tracker: http://jira.pentaho.org/browse/PDI
      • Continuous Integration Server: http://ci.pentaho.com/job/Kettle
      • Wiki: http://wiki.pentaho.org/display/EAI
      • IRC Channel: ##pentaho (on Freenode)
      • Mailing list: http://groups.google.com/group/kettle-developers
      • My blog: http://www.ibridge.be
      • My coordinates: mcasters at pentaho dot org
    • Pentaho Books
    • Q&A
        Thank you for listening!