Offloading ETL processes to Hadoop is often one of the first Big Data efforts because of the obvious ROI benefits. However, you have hundreds, maybe thousands, of legacy ETL processes to migrate which makes achieving the benefits of Hadoop and ROI a distant goal.
What if you could automatically convert up to 70% of your existing ETL processes to run on Hadoop with no code changes?
In this presentation you will see:
- A detailed walk-through of migrating existing ETL processes to Hadoop without changing anything
- How you can cut development time of new ETL process on Hadoop by up to 50%
- How you can leverage your existing developers’ Java skills to turn them into Hadoop developers
- Best practices for monitoring the performance of your ETL processes to ensure you meet your service level agreements
2. Confidential
2
TRUSTED
by over 10,000
companies as their big
data app platform
BACKED
by top Silicon Valley
investors True Ventures,
Rembrandt VP, Bain
Capital
FOUNDED
in 2008, with
headquarters in San
Francisco
4. Confidential
4
Java, Scala (Scalding), SQL
SIMPLE
Ensure best practices
at any scale thanks to
easy-to-learn design
principles
FLEXIBLE
Leverage existing Java,
Scala, and SQL skills
and easily adapt to new
systems
WE ARE THEDEVELOPERS BEHIND CASCADING
RELIABLE
Always get optimal
performance and
reliability for big data
applications
5. Confidential
• Use Hadoop for ETL / ELT
• Ensure quality and manageability
of our ELT / ELT applications
• Translate existing ETL work to
Hadoop
• GUI ETL tool for developers that
don’t know Java, Scala, SQL
5
MIGRATING TO HADOOPFOR ETLATENTERPRISE SCALE.
Cascading
Driven
?
?
6. Confidential6
TODAY’S SPEAKERS
Shahab Kamal
Vice President at BitWise Inc.
Shahab is responsible for strategy, growth and client relations. Shahab works with client executives on IT
Strategy for Business Intelligence, Big Data, Data Warehousing and Enterprise Applications. Shahab has
worked at Ford Motors, Aon Hewitt and Tribune Company on their PeopleSoft ERP implementation and support.
His expertise has been around retrofitting data from legacy applications without loss of data integrity.
Mark Castillo
Driven, Inc.
Mark is a Solutions Architect with 15+ years of software engineering background. He has worked in the
finance, security, healthcare, streaming music, marketing, and social networking industries. His technical
knowledge and skills are focused on distributed systems, data processing, networking, Linux appliances and
Big Data.