www.edureka.co/talend-for-big-data
Simplifying Big Data Using Talend
Slide 2
Objectives
At the end of this session, you will be able to:
» Understand how ETL complements the Hadoop ecosystem
» Adapt to the ETL and Big Data industry
» Understand why Talend is used with Big Data
» Learn Big Data not in months but in minutes
» Find out why Talend is important for data enthusiasts
» Understand the use case: banking industry
» Implement a Talend job with Hadoop
Slide 3
ETL with Big Data
A graphical abstraction layer on top of Hadoop applications makes life much easier in the Big Data buzz world.
The typical assertion is that "Hadoop eliminates the need for ETL". Seriously?
» What no one seems to question in response to these sorts of comments are the naive assumptions on which these statements are based.
» Is it realistic for most companies to move all of their data into Hadoop?
Slide 4
ETL with Big Data
[Diagram: machine data, transactional data, and business application data feed an ETL workflow, which extracts and loads into Big Data]
Slide 5
ETL with Big Data (Contd.)
» Is writing ETL scripts in MapReduce code still ETL? Yes
» Is ETL running faster on Hadoop (in a few cases, and slower in others) eliminating ETL? No
» Is the introduction of Hadoop changing when, where, and how ETL happens? Yes
The question isn't really whether we are eliminating ETL, but where ETL takes place and how we are changing its definition.
Slide 6
Defining ETL
E • Extract: the ability to consistently and reliably extract data with high performance and minimal impact on the source system
T • Transform: the ability to transform one or more data sets, in batch or real time, into a consumable format
L • Load: loading data into a persistent or virtual data store
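The three letters above can be sketched in a few lines of plain Python. This is only a minimal illustration under stated assumptions, not how Talend implements a job: the CSV sample, the `payments` table, and the function names are all hypothetical, and an in-memory SQLite database stands in for the "persistent or virtual data store".

```python
import csv
import io
import sqlite3

# Hypothetical source data: a small CSV standing in for a source-system export.
SOURCE_CSV = """customer_id,city,amount
1, pune ,120.50
2,Mumbai,80
3,PUNE,not_a_number
"""


def extract(text):
    """E: read rows from the source without modifying it."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """T: normalise city names, parse amounts, and drop rows that fail parsing."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount is not numeric
        city = row["city"].strip().title()
        clean.append((int(row["customer_id"]), city, amount))
    return clean


def load(rows, conn):
    """L: write the consumable rows into the target store."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS payments "
        "(customer_id INTEGER, city TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO payments VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def run_etl(text=SOURCE_CSV):
    """Chain the three stages against an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")
    return load(transform(extract(text)), conn)
```

Calling `run_etl()` returns the SQLite connection, so the loaded rows can be queried with ordinary SQL.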
Slide 7
Why ETL + Hadoop?
How is learning ETL (along with Big Data) addressing major business problems?
[Diagram: Talend Unified Platform, covering Big Data, Data Integration, Data Quality, MDM, ESB, and BPM]
Slide 8
One Stop Solution!
» Improves the efficiency of big data job design with a graphical interface
» Abstracts and generates code
» Runs transforms inside Hadoop
» Native support for HDFS, Sqoop, HBase, Mahout, Pig, and Hive, plus MapReduce code generation
» Apache License 2.0
» Embedded in Hortonworks Data Platform
» Certified with Cloudera, MapR, and Greenplum
» An open source ecosystem
Slide 9
Talend
Q. Why Talend?
Ans. Because the more connected the world becomes, the more quickly a business must adapt.
Slide 10
Why Talend?
Talend is the only graphical user interface tool capable of "translating" an ETL job into a MapReduce job. A Talend ETL job is thus executed as a MapReduce job on Hadoop, getting the big data work done in minutes.
» This is a key innovation that helps lower the entry barriers to Big Data technology and allows ETL job developers (beginners and advanced) to carry out data warehouse offloading to a greater extent
» With its Eclipse-based graphical workspace, Talend Open Studio for Big Data enables developers and data scientists to leverage Hadoop loading and processing technologies like HDFS, HBase, Hive, and Pig without having to write Hadoop application code
» Hadoop applications can be integrated seamlessly within minutes using Talend
Slide 11
Why Talend? (Contd.)
» By simply selecting graphical components from a palette, then arranging and configuring them, you can create Hadoop jobs
For example:
1. Load data into HDFS (Hadoop Distributed File System)
2. Use Hadoop Pig to transform data in HDFS
3. Load data into a Hadoop Hive-based data warehouse
4. Perform ELT (extract, load, transform) aggregations in Hive
5. Leverage Sqoop to integrate relational databases and Hadoop
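For comparison, the five example steps can be lined up against the command-line tools a developer would otherwise drive by hand. The sketch below only builds the argument lists and does not run anything. It does not show what Talend generates. The function name, every path, the table names, the `city` column, and the JDBC URL are hypothetical placeholders.

```python
def build_hadoop_steps(local_file, raw_dir, pig_script, clean_dir,
                       hive_table, jdbc_url, rdbms_table):
    """Return one argument list per step; nothing is executed here."""
    return [
        # 1. Load data into HDFS
        ["hdfs", "dfs", "-put", local_file, raw_dir],
        # 2. Use Pig to transform data in HDFS
        #    (the script is assumed to read raw_dir and write clean_dir)
        ["pig", "-f", pig_script],
        # 3. Load the transformed data into a Hive table
        ["hive", "-e",
         f"LOAD DATA INPATH '{clean_dir}' INTO TABLE {hive_table}"],
        # 4. Perform an ELT aggregation inside Hive (hypothetical 'city' column)
        ["hive", "-e",
         f"SELECT city, COUNT(*) FROM {hive_table} GROUP BY city"],
        # 5. Use Sqoop to pull a relational table into Hadoop
        ["sqoop", "import", "--connect", jdbc_url,
         "--table", rdbms_table, "--target-dir", f"{raw_dir}/{rdbms_table}"],
    ]
```

On a machine with the Hadoop tools installed, each list could be passed to `subprocess.run`. The point of the slide is that a graphical job spares the developer from writing and sequencing such calls.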
Slide 12
Talend Hadoop Integration
Slide 13
Talend Hadoop Integration (Contd.)
» For Hadoop applications to be truly accessible to your organization, they need to be smoothly integrated into your overall data flows
» Talend Open Studio for Big Data is the ideal tool for integrating Hadoop applications into your broader data architecture
» Talend provides more built-in connector components than any other data integration solution available, with more than 800 connectors that make it easy to read from or write to any major file format, database, or packaged enterprise application
For example, in Talend Open Studio for Big Data, you can use drag-and-drop configurable components to create data integration flows that move data from delimited log files into Hadoop Hive, perform operations in Hive, and extract data from Hive into a MySQL database (or Oracle, Sybase, SQL Server, and so on).
Slide 14
Talend Hadoop Integration (Contd.)
» More and more enterprises want to scale up in Hadoop/Big Data technologies using their existing pool of talent and to reduce overspending on MapReduce programmers (a skill that is still new and expensive)
» Job trends for data scientists and data analysts are rising steeply (Talend also comes with basic BI transformations, which reduce your dependency on simple Excel dashboards and BI tools)
» Gartner features Talend as the best technology in the market for Data Integration and Big Data
» Three major players in the Big Data industry, Hortonworks, Cloudera, and MapR, have already tied up with Talend for big data solutions
» People at almost any level in the industry can get started quickly, without many prerequisites
Myth: "I don't know Java programming, so how would this course help me learn and excel in Big Data?" The biggest advantage of Talend for Big Data is that there is no prerequisite to learning it. Whether or not you come with prior knowledge of Hadoop, this course has something to offer.
Slide 15
Big Data in 10 minutes
Learn Big Data not in months but in minutes! Sounds too good? But it's true.
Go from zero to big data in under 10 minutes
Get big data without coding. The Talend Big Data Sandbox is a ready-to-run virtual environment that includes Talend Platform for Big Data, popular Hadoop distributions, and data examples.
[Logos: Hadoop, Hortonworks, MapR, Cloudera]
Slide 16
Who can use "Talend for Big Data"?
Slide 17
We are just about to see the Bigger Picture
Let us quickly see what Talend can do in minutes, reducing the man-hours spent on MapReduce programming in Hadoop, shall we?
Slide 18
Real-time Use Case: ETL + Big Data
A banking industry use case:
"Addressing the challenges in growing the business with the use of Big Data". We will use customer-filled web-log data (collected by the bank) and, with the help of a Pig ETL job, answer the question "Where should the bank hold marketing campaigns for a new product launch to get more business?", in ETL-Big Data analytics style.
In this section, you will be able to sense the true power of Talend + Big Data.
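To make the question concrete, here is a rough sketch of the kind of computation such a job performs: count web-log entries per city and rank the cities. It is written as explicit map, shuffle, and reduce phases in plain Python to mirror the MapReduce model. It is not the Pig script or the Talend job from the webinar, and the log layout, field order, and sample records are invented purely for illustration.

```python
from collections import defaultdict

# Hypothetical web-log lines: "timestamp,customer_id,city,product_interest".
SAMPLE_LOG = [
    "2014-01-05T10:00:00,c1,Pune,home_loan",
    "2014-01-05T10:02:00,c2,Mumbai,credit_card",
    "2014-01-05T10:05:00,c3,Pune,credit_card",
    "malformed line",
    "2014-01-05T10:09:00,c4,Delhi,home_loan",
    "2014-01-05T10:11:00,c5,Pune,home_loan",
]


def map_phase(lines):
    """Emit a (city, 1) pair for every well-formed log line."""
    for line in lines:
        fields = line.split(",")
        if len(fields) != 4:
            continue  # ignore lines that do not match the assumed layout
        yield fields[2].strip(), 1


def shuffle(pairs):
    """Group the emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the values for each city."""
    return {city: sum(values) for city, values in groups.items()}


def rank_cities(lines):
    """Return (city, count) pairs, highest count first, ties broken by name."""
    counts = reduce_phase(shuffle(map_phase(lines)))
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
```

With the sample lines above, `rank_cities(SAMPLE_LOG)` puts Pune first with three entries, which in this toy data would make it the first candidate for a campaign.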
Slide 19
Project
» Use Case
A leading bank has initiated a new product launch campaign across cities. After the campaign, the bank wants to analyze the collected data to increase business and attract more customers.
How quickly can the huge log files be analyzed and turned into business value? Within seconds?
Want to know? Explore "Talend for Big Data" and join us in the next exciting webinar to see how beautifully Talend does the trick without any complex programming (because seeing is believing).
If that is not enough, Talend can also generate graphical interpretations of the business data, giving Business Analytics tools a tough time.
Slide 20
Environment Setup
Our use case setup uses the following:
» Hortonworks Sandbox 1.3
» Talend Open Studio for Big Data 5.5
» Windows 7 (64-bit OS)
» Machine: 4 GB RAM, i3 processor
Slide 21
Use-case Snapshot
A combination of integration, HDFS, Pig, and BI graphs. Yes, it's true.
Slide 22
Salary Trend
Slide 23
References
» https://www.talend.com/resource/hadoop-applications.html
» http://www.edureka.co/blog/big-data-and-etl-are-family/
Questions
Slide 24
Slide 25
Course URL

Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 

Recently uploaded (20)

Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 

Simplifying Big Data ETL with Talend

  • 2. Objectives: At the end of this session, you will be able to: understand how ETL complements the Hadoop ecosystem; adapt to the ETL-Big Data industry; understand why Talend is used with Big Data; learn Big Data not in months but in minutes; find out why Talend is important for data enthusiasts; understand the use case (banking industry); implement a Talend job with Hadoop.
  • 3. ETL with Big Data: The typical assertion is that "Hadoop eliminates the need for ETL". Seriously? What no one seems to question in response to such comments is the naive assumptions these statements are based on. Is it realistic for most companies to move all of their data into Hadoop? A graphical abstraction layer on top of Hadoop applications makes life much easier in the Big Data buzz world.
  • 4. ETL with Big Data (diagram): Machine data, transactional data, and business apps data feed an ETL workflow (extract and load) into Big Data.
  • 5. ETL with Big Data (Contd.): Is writing ETL scripts in MapReduce code still ETL? Yes. Is ETL running faster (in a few cases, and slower in others) on Hadoop eliminating ETL? No. Is the introduction of Hadoop changing when, where, and how ETL happens? Yes. The question isn't really whether we are eliminating ETL, but where ETL takes place and how we are changing its definition.
  • 6. Defining ETL: E (Extract) represents the ability to consistently and reliably extract data with high performance and minimal impact on the source system. T (Transform) represents the ability to transform one or more data sets, in batch or real time, into a consumable format. L (Load) stands for loading data into a persistent or virtual data store.
  • 7. Why ETL + Hadoop? How is learning ETL (along with Big Data) addressing major business problems? Talend Unified Platform: Big Data, Data Integration, Data Quality, MDM, ESB, BPM.
  • 8. One-Stop Solution: Improves the efficiency of big data job design with a graphical interface; abstracts and generates code; runs transforms inside Hadoop; native support for HDFS, Sqoop, HBase, Mahout, Pig, Hive, and MapReduce code generation. An open source ecosystem: Apache License 2.0; embedded in Hortonworks Data Platform; certified with Cloudera, MapR, and Greenplum.
  • 9. Talend: Q. Why Talend? A. Because the more connected the world becomes, the more quickly a business must adapt.
  • 10. Why Talend? Talend is the only graphical user interface tool capable of "translating" an ETL job into a MapReduce job. A Talend ETL job is thus executed as a MapReduce job on Hadoop and gets the big data work done in minutes. This is a key innovation that lowers the entry barrier to Big Data technology and allows ETL job developers (beginners and advanced) to carry out data warehouse offloading to a greater extent. With its Eclipse-based graphical workspace, Talend Open Studio for Big Data enables developers and data scientists to leverage Hadoop loading and processing technologies such as HDFS, HBase, Hive, and Pig without having to write Hadoop application code. Hadoop applications are integrated seamlessly within minutes using Talend.
  • 11. Why Talend? (Contd.): By simply selecting graphical components from a palette, then arranging and configuring them, you can create Hadoop jobs. For example: 1. Load data into HDFS (Hadoop Distributed File System). 2. Use Hadoop Pig to transform data in HDFS. 3. Load data into a Hadoop Hive-based data warehouse. 4. Perform ELT (extract, load, transform) aggregations in Hive. 5. Leverage Sqoop to integrate relational databases and Hadoop.
  • 13. Talend Hadoop Integration (Contd.): For Hadoop applications to be truly accessible to your organization, they need to be smoothly integrated into your overall data flows. Talend Open Studio for Big Data is the ideal tool for integrating Hadoop applications into your broader data architecture. Talend provides more built-in connector components than any other data integration solution available, with more than 800 connectors that make it easy to read from or write to any major file format, database, or packaged enterprise application. For example, in Talend Open Studio for Big Data you can use drag-and-drop configurable components to create data integration flows that move data from delimited log files into Hadoop Hive, perform operations in Hive, and extract data from Hive into a MySQL database (or Oracle, Sybase, SQL Server, and so on).
  • 14. Talend Hadoop Integration (Contd.): More and more enterprises want to scale up in Hadoop/Big Data technologies using their existing pool of talent, and to reduce overspending on MapReduce programmers (a fairly new and expensive skill set). Job trends for data scientists and data analysts are rising sharply (Talend also comes with basic BI transformations, which reduce your dependency on simple Excel dashboards and BI tools). Gartner is featuring Talend as the best technology in the market for Data Integration and Big Data. Three major players in the Big Data industry (Hortonworks, Cloudera, and MapR) have already tied up with Talend for big data solutions. People at almost any level in the industry can get started quickly without many prerequisites. Myth: "I don't know Java programming, so how would this course help me learn and excel in Big Data?" The biggest advantage of Talend for Big Data is that there is no prerequisite for learning it. Whether or not you come with prior knowledge of Hadoop, this course has something valuable to offer.
  • 15. Big Data in 10 minutes: Learn Big Data not in months but in minutes! Sounds too good? But it's true. Go from zero to big data in under 10 minutes. Get big data without coding. The Talend Big Data Sandbox is a ready-to-run virtual environment that includes Talend Platform for Big Data, popular Hadoop distributions, and data examples. (Logos: Hadoop, Hortonworks, MapR, Cloudera.)
  • 16. Who can use "Talend for Big Data"?
  • 17. We are just about to see the bigger picture: Let us quickly see what Talend can do in minutes, reducing the man-hours spent on MapReduce programming in Hadoop, shall we?
  • 18. Real-time Use Case: ETL + Big Data. A banking industry use case: "Addressing the challenges in growing the business with the use of Big Data". We will use customer-filled web-log data (collected by the bank) and, with the help of a Pig ETL job, answer the question "Where should the bank hold marketing campaigns for a new product launch to get more business?" in ETL-Big Data analytics style. In this section, you will be able to sense the true power of Talend + Big Data.
  • 19. Project: Use Case. A leading bank has initiated a new product launch campaign across cities. After the campaign, the bank wants to analyze the collected data to increase business and attract more customers. How quickly can the huge log files be analyzed and turned into business value? Within seconds? To find out, explore "Talend for Big Data" and join us in the next webinar to see how Talend does the trick without any complex programming (because seeing is believing). If that is not enough, Talend can also generate graphical interpretations of the business data, giving Business Analytics tools tough competition.
  • 20. Environment Setup: Our use case setup uses the following: Hortonworks Sandbox 1.3; Talend Open Studio for Big Data 5.5; Windows 7 (64-bit OS); machine with 4 GB RAM and an i3 processor.
  • 21. Use-case Snapshot: A combination of integration, HDFS, Pig, and BI graphs. Yes, it's true.
  • 23. References: https://www.talend.com/resource/hadoop-applications.html ; http://www.edureka.co/blog/big-data-and-etl-are-family/
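
The E, T, and L stages defined on slide 6, applied to the banking question on slide 18, can be illustrated with a small sketch. This is not the Talend or Pig job from the webinar: it is plain Python, and the log columns (`customer_id`, `city`, `product_interest`), the sample rows, and the `campaign_targets` table are all hypothetical names invented for illustration. An in-memory SQLite database stands in for the target data store.

```python
import csv
import io
import sqlite3
from collections import Counter

# Hypothetical web-log sample; columns and rows are invented for illustration.
SAMPLE_LOG = """customer_id,city,product_interest
1,Mumbai,yes
2,Delhi,no
3,Mumbai,yes
4,Pune,yes
5,Delhi,yes
"""


def extract(text):
    """Extract: read delimited log records into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: count interested customers per city, highest count first."""
    counts = Counter(r["city"] for r in rows if r["product_interest"] == "yes")
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


def load(city_counts, conn):
    """Load: persist the aggregate into a relational table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS campaign_targets "
        "(city TEXT PRIMARY KEY, interested INTEGER)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO campaign_targets VALUES (?, ?)", city_counts
    )
    conn.commit()


def run_etl(text=SAMPLE_LOG):
    """Run the three stages in order and return the loaded rows."""
    conn = sqlite3.connect(":memory:")  # stand-in for the real data store
    load(transform(extract(text)), conn)
    return conn.execute(
        "SELECT city, interested FROM campaign_targets "
        "ORDER BY interested DESC, city"
    ).fetchall()


if __name__ == "__main__":
    for city, interested in run_etl():
        print(city, interested)
```

With the sample rows above, the city with the most interested customers comes first, which is the kind of answer the use case is after. In the webinar setup, the transform step would run as a Pig job on Hadoop rather than in local Python.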
