
Hadoop World 2011: Data Ingestion, Egression, and Preparation for Hadoop - Sanjay Kaluskar, Informatica

Uploaded on Nov 10, 2011

  • 4,281 views

One of the first challenges Hadoop developers face is accessing all the data they need and getting it into Hadoop for analysis. Informatica PowerExchange accesses a variety of data types and structures at different latencies (e.g., batch, real-time, or near real-time) and ingests the data directly into Hadoop. The next step is to parse the data in preparation for analysis in Hadoop. Informatica provides a visual IDE for deploying pre-built parsers, or designing custom parsers for complex data formats, and running them on Hadoop. Once the analysis is complete, Informatica PowerExchange delivers the resulting output to other information management systems, such as a data warehouse. In this session, learn from Informatica and one of its customers how to get all the data you need into Hadoop, parse a variety of data formats and structures, and egress the resulting output to other systems.

Statistics

Views
  • Total views: 4,281
  • Views on SlideShare: 4,039
  • Embed views: 242

Actions
  • Likes: 3
  • Downloads: 0
  • Comments: 0

Embeds (4 sources, 242 views)
  • http://www.cloudera.com: 235
  • http://blog.cloudera.com: 4
  • https://www.cloudera.com: 2
  • http://cloudera.matt.dev: 1

Upload Details

Uploaded via SlideShare as Microsoft PowerPoint

Usage Rights

© All Rights Reserved


Presentation Transcript