• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Talend
 

Talend

on

  • 3,147 views

 

Statistics

Views

Total Views
3,147
Views on SlideShare
3,006
Embed Views
141

Actions

Likes
1
Downloads
118
Comments
0

2 Embeds 141

http://www.cloudera.com 137
http://blog.cloudera.com 4

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Talend Talend Presentation Transcript

    • Solution Spotlight Presents
    • © Talend 2011
      2
      Integration with CDH in Talend
      Talend, Global Leader in Open Source Integration Solutions
      Connect external data to Hadoop/HDFS
      Leverage MapReduce in Talend job design
    • Market Positioning – Products
      © Talend 2011
      3
      Application Integration
      Connect applications & services
      MDM
      Reference data management
      Data Quality
      Data profiling & data cleansing
      Data Integration
      Analytics (ETL)
      Operational Integration
      Data replication & synchronization,
      data migration & capture,
      application upgrade, etc.
      Extract, Transform & Load for decision support systems
    • Solution Positioning
      © Talend 2011
      4
      Talend ESB
      Industrialize deployment of Apache-based ESB
      - Free, Apache-based ESB
      - Fully functional Enterprise Service Bus
      Talend ASF
      Deploy large-scale enterprise SOA
      - Governance & security
      - Advanced monitoring
      Talend MDM Enterprise Edition
      Deploy large scale MDM
      - Full permissions management
      - Validation rules
      - Complex workflows
      Talend Open Profiler
      Identify data quality problems
      - Free, GPL, no limitations
      - Custom indicators
      Talend MDM Community Edition
      Manage master data
      • Free, GPL, no limitations
      • Active data model
      - Lightweight business user UI
      Talend Data Quality
      Cleanse & track
      - Specific components
      - Reports
      • Data Quality Portal
      Talend Unified Platform
      Common, unified environment
      - Front end: UI (Eclipse, Web)
      - Back end: repository
      Talend Open Studio
      Create data flows
      • Free, GPL, no limitations
      • Unlimited data flows
      - 450+ components included
      Talend Integration Suite
      Deploy data integration
      • Teamwork
      • Automated deployment & load balancing
      - Scheduling & Monitoring
      Talend LCp
      Manage best practices
      • Testing Platform
      • Repository Manager
      • Project Audit
      Hadoop Integration
    • Hadoop Integration Overview
      TalendIntegration Suite (TIS) key features:
      • Graphical flow design
      • Connecting 450+ set of connectors toHadoop
      • Providing HDFS input/output
      • Read/Write form any source to HDFS, Hive, HBase, and Sequence Files
      • Processing Data inside Hadoop Using ELT with HiveQL Unleashing the Pig
      • Aggregation and cleansing inside Hadoop
      • Mass import/export btw Hadoop and RDBMS
      • Automated deployment
      • Time & Event based scheduler
      • Fail Over / Load Balancing
      • Centralized monitoring of integration processes
      • Shared repository / Metadata management
      © Talend 2011
      5
    • Graphical Interface
      © Talend 2011
      6
      • Graphical flow design
      • Java code generation from flow design
      • Native Hadoop code gen (Java API)
      • Metadata integration with Hive
      • Connect external sources into HDFS (450+ connectors)
      • Aggregate, Cleanse, Transform data in HDFS
      • Leverage MR, Pig, Hive for data processing
      • Real time debugger & direct job deployment on Hadoop cluster
      • Sqoop connectors for mass export/import to RDBMS
    • Typical Use Case Scenario
      • Landing RAW data into Hadoop with tHDFS components
      • Processing & transformation with Hive ELT or tPigSeries
      • Load to traditional RDBMS or Hive for Analysis and Reporting
      © Talend 2011
      7
    • Hadoop Connectors
      © Talend 2011
      8
    • Resources
      © Talend 2011
      9
      • Visit our website at http://www.talend.com
      • Watch pre-recorded webinar about our Hadoop Integration:http://www.talend.com/webinar/archive/
      • Send questions to sales@talend.com or info@talend.com
      • Download Talend Software:http://www.talend.com/download.php
      • Join Talend community to ask technical questions and connect: http://www.talendforge.org
    • http://www.cloudera.com/partners