Transcript

  • 1. Solution Spotlight Presents
  • 2. Integration with CDH in Talend
    © Talend 2011
    Talend, Global Leader in Open Source Integration Solutions
    Connect external data to Hadoop/HDFS
    Leverage MapReduce in Talend job design
  • 3. Market Positioning – Products
    • Application Integration: connect applications & services
    • MDM: reference data management
    • Data Quality: data profiling & data cleansing
    • Data Integration
      - Analytics (ETL): Extract, Transform & Load for decision support systems
      - Operational Integration: data replication & synchronization, data migration & capture, application upgrade, etc.
  • 4. Solution Positioning
    Talend ESB
    Industrialize deployment of Apache-based ESB
    - Free, Apache-based ESB
    - Fully functional Enterprise Service Bus
    Talend ASF
    Deploy large-scale enterprise SOA
    - Governance & security
    - Advanced monitoring
    Talend MDM Enterprise Edition
    Deploy large-scale MDM
    - Full permissions management
    - Validation rules
    - Complex workflows
    Talend Open Profiler
    Identify data quality problems
    - Free, GPL, no limitations
    - Custom indicators
    Talend MDM Community Edition
    Manage master data
    - Free, GPL, no limitations
    - Active data model
    - Lightweight business user UI
    Talend Data Quality
    Cleanse & track
    - Specific components
    - Reports
    - Data Quality Portal
    Talend Unified Platform
    Common, unified environment
    - Front end: UI (Eclipse, Web)
    - Back end: repository
    Talend Open Studio
    Create data flows
    - Free, GPL, no limitations
    - Unlimited data flows
    - 450+ components included
    Talend Integration Suite
    Deploy data integration
    - Teamwork
    - Automated deployment & load balancing
    - Scheduling & monitoring
    Talend LCp
    Manage best practices
    - Testing Platform
    - Repository Manager
    - Project Audit
    Hadoop Integration
  • 5. Hadoop Integration Overview
    Talend Integration Suite (TIS) key features:
    • Graphical flow design
    • Connecting a set of 450+ connectors to Hadoop
    • Providing HDFS input/output
    • Read/write from any source to HDFS, Hive, HBase, and Sequence Files
    • Processing data inside Hadoop: using ELT with HiveQL, unleashing the Pig
    • Aggregation and cleansing inside Hadoop
    • Mass import/export between Hadoop and RDBMS
    • Automated deployment
    • Time- and event-based scheduler
    • Failover / load balancing
    • Centralized monitoring of integration processes
    • Shared repository / metadata management
  • 6. Graphical Interface
    • Graphical flow design
    • Java code generation from flow design
    • Native Hadoop code generation (Java API)
    • Metadata integration with Hive
    • Connect external sources into HDFS (450+ connectors)
    • Aggregate, cleanse, and transform data in HDFS
    • Leverage MapReduce, Pig, and Hive for data processing
    • Real-time debugger & direct job deployment on the Hadoop cluster
    • Sqoop connectors for mass export/import to RDBMS
  • 7. Typical Use Case Scenario
    • Landing raw data into Hadoop with tHDFS components
    • Processing & transformation with Hive ELT or tPigSeries
    • Load to traditional RDBMS or Hive for analysis and reporting
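The scenario above keeps the transformation inside the cluster: raw files land in HDFS, Hive runs the aggregation as SQL (the "T" of ELT happens after loading), and only the result moves on to the RDBMS or reporting layer. Below is a minimal sketch of that middle step. It only builds the HiveQL text; the table and column names (`raw_events`, `daily_totals`, `event_date`, `amount`) and the helper `build_elt_statement` are hypothetical, not taken from the slides, and this is not the code Talend generates.

```python
def build_elt_statement(source_table, target_table, group_col, sum_col):
    """Return a HiveQL statement that aggregates source_table into target_table.

    All table and column names passed in are illustrative assumptions.
    """
    for name in (source_table, target_table, group_col, sum_col):
        # Accept only plain identifiers so the generated SQL stays well-formed.
        if not name.isidentifier():
            raise ValueError(f"not a plain identifier: {name!r}")
    return (
        f"INSERT OVERWRITE TABLE {target_table} "
        f"SELECT {group_col}, SUM({sum_col}) AS total_{sum_col} "
        f"FROM {source_table} "
        f"GROUP BY {group_col}"
    )


if __name__ == "__main__":
    # Hypothetical names: a raw landing table and a summarized target table.
    print(build_elt_statement("raw_events", "daily_totals", "event_date", "amount"))
```

Because the statement would run inside Hive, the data is aggregated where it already lives, and only the summarized rows need to be exported afterwards.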
  • 8. Hadoop Connectors
  • 9. Resources
    • Visit our website at http://www.talend.com
    • Watch the pre-recorded webinar about our Hadoop integration: http://www.talend.com/webinar/archive/
    • Send questions to sales@talend.com or info@talend.com
    • Download Talend software: http://www.talend.com/download.php
    • Join the Talend community to ask technical questions and connect: http://www.talendforge.org
  • http://www.cloudera.com/partners