Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
1 ©2014Cloudera, Inc. All rights reserved.1
©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
©2014Cloudera, Sync...
2
Agenda
©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
• Data Warehouse Vision & Reality
• What is legacy dat...
3
What is this?
©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.3
4
Real-Time
Mainframe
Oracle
ERP
ETL ETL
Data Mart
Data
Warehouse
File
XML
The Data Warehouse Vision -1998
4
Data Integrat...
5
Data Warehouse Reality 2014
5
Real-Time
Mainframe
Oracle
ERP
ETL ETL
Data Mart
File
XML
Data Integration & ETL Tools wou...
6
The Data Warehouse Vision vs Reality
Fresher data
Longer history data
Faster analytics
More data sources
Lower costs
Lon...
7
Mainframes | A Critical Source of Big Data
7
Top 25
World Banks
9 of World’s
Top Insurers
23 of Top 25 US
Retailers
71%
...
8
Suits & Hoodies – Working Together
8
Integration
Gaps
Expertise
Gaps
• COBOL appeared in 1959, Hadoop in 2005
• Mainfram...
9
Expanding Data Requires A New Approach
9
1980s
Bring Data to Compute
Now
Bring Compute to Data
Relative size & complexit...
10
From Apache Hadoop to an enterprise data
hub
10
Open Source
Scalable
Flexible
Cost-Effective
✔
Managed
Open
Architectur...
11
From Apache Hadoop to an enterprise data
hub
11
Open Source
Scalable
Flexible
Cost-Effective
✔
Managed
Open
Architectur...
12
From Apache Hadoop to an enterprise data
hub
12
Open Source
Scalable
Flexible
Cost-Effective
✔
Managed
Open
Architectur...
13
From Apache Hadoop to an enterprise data
hub
13
Open Source
Scalable
Flexible
Cost-Effective
✔
Managed
Open
Architectur...
14
From Apache Hadoop to an enterprise data
hub
14
Open Source
Scalable
Flexible
Cost-Effective
✔
Managed
Open
Architectur...
15
Partners
Proactive &
Predictive Support
Professional
Services
Training
Cloudera: Your Trusted Advisor for Big Data
15
A...
16 ©2014Cloudera, Inc. All rights reserved.16 ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
17
The Impact of ELT & Dormant Data on the EDW
17 ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
 ELT drives ...
1818 ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
19
Where to Start?
19
How to identify dormant data?
What workloads will deliver the biggest impact?
How will you access &
...
2020
Offload Legacy Data & Workloads to The Enterprise Data Hub
Phase III:
Optimize & Secure
Phase II:
Offload
Phase I:
Id...
21
22
The Problem: Volume of DataBusinesses are struggling to unlock exploding data
©2014Cloudera, Syncsort, Tableau Inc. All...
24
The Problem: Old School
Software
Traditional technologies are complicated, inflexible and slow moving
©2014Cloudera, Sy...
25
The Tableau RevolutionFast and easy analytics for everyone
©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
26
FlexibleTransform all types of data into self-service analytics
©2014Cloudera, Syncsort, Tableau Inc. All rights reserv...
27
For EveryoneEase of use leads to adoption across all departments and use cases
©2014Cloudera, Syncsort, Tableau Inc. Al...
28
•LIVE DEMO
©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
29
Case Study: Optimize EDW Leading Financial Org
29
0
50
100
150
200
250
ElapsedTime(m)
HiveQL
217 min
Syncsort
DMX-h
9 m...
3030
Final Thoughts..
Rusty Sears
©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
Vice President of Enterprise ...
31 ©2014Cloudera, Inc. All rights reserved.31
QUESTIONS?
©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
Upcoming SlideShare
Loading in …5
×

Syncsort, Tableau, & Cloudera present: Break the Barriers to Big Data Insight

913 views

Published on

Demand for quicker access to multiple integrated sources of data continues to rise. Immediate access to data stored in a variety of systems - such as mainframes, data warehouses, and data marts - to mine visually for business intelligence is the competitive differentiation enterprises need to win in today’s economy.

Stop playing the waiting game and learn about a new end-to-end solution for combining, analyzing, and visualizing data from practically any source in your enterprise environment.

Leading organizations are already taking advantage of this architectural innovation to gain modern insights while reducing costs and propelling their businesses ahead of the competition.

Are you tired of waiting? Don't let your architecture hold you back. Access this webinar and hear from a team of industry experts on how you can Break the Barriers to Big Data Insight.

Published in: Data & Analytics, Technology
  • Be the first to comment

Syncsort, Tableau, & Cloudera present: Break the Barriers to Big Data Insight

  1. 1. 1 ©2014Cloudera, Inc. All rights reserved.1 ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved. ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved. Cloudera Sessions Dallas
  2. 2. 2 Agenda ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved. • Data Warehouse Vision & Reality • What is legacy data & why an Enterprise Data Hub • Offloading legacy data and workloads to Hadoop • Transform all types of data into self-service analytics • Live Demonstration • Customer case study • Q&A
  3. 3. 3 What is this? ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.3
  4. 4. 4 Real-Time Mainframe Oracle ERP ETL ETL Data Mart Data Warehouse File XML The Data Warehouse Vision -1998 4 Data Integration & ETL Tools would enable a Single, Consistent Version of the Truth Data Mart Data Mart ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  5. 5. 5 Data Warehouse Reality 2014 5 Real-Time Mainframe Oracle ERP ETL ETL Data Mart File XML Data Integration & ETL Tools would enable a Single, Consistent Version of the Truth Data Mart Data Mart Dormant Data Staging / ELT New Reports SLA’s New Column Complete History ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  6. 6. 6 The Data Warehouse Vision vs Reality Fresher data Longer history data Faster analytics More data sources Lower costs Longer ELT batch windows Shorter data retention Slower queries Weeks/months just to add new data fields Growing costs Vision Reality ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  7. 7. 7 Mainframes | A Critical Source of Big Data 7 Top 25 World Banks 9 of World’s Top Insurers 23 of Top 25 US Retailers 71% Fortune 500 30 Billion Bus. Transactions / day ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  8. 8. 8 Suits & Hoodies – Working Together 8 Integration Gaps Expertise Gaps • COBOL appeared in 1959, Hadoop in 2005 • Mainframe & Hadoop skills shortage Security Gaps • Hosts mission critical sensitive data • Very difficult to install new software on MF Costs Gaps • Mainframe data is (expensive) Big Data • Even FTP costs CPU cycles (MIPS) • Connectivity • Data conversion (EBCDIC vs ASCII) Suits & Hoodies idea: Merv Adrian, Gartner Research. ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  9. 9. 9 Expanding Data Requires A New Approach 9 1980s Bring Data to Compute Now Bring Compute to Data Relative size & complexity Data Information-centric businesses use all data: Multi-structured, internal & external data of all types Compute Compute Compute Process-centric businesses use: • Structured data mainly • Internal data only • “Important” data only Compute Compute Compute Data Data Data Data ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  10. 10. 10 From Apache Hadoop to an enterprise data hub 10 Open Source Scalable Flexible Cost-Effective ✔ Managed Open Architecture Secure and Governed ✖ ✖ ✖ BATCH PROCESSING STORAGE FOR ANY TYPE OF DATA UNIFIED, ELASTIC, RESILIENT, SECURE FILESYSTEM MAPREDUCE HDFS Core Apache Hadoop is great, but… 1) Hard to use and manage. 2) Only supports batch processing. 3) Not comprehensively secure. ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  11. 11. 11 From Apache Hadoop to an enterprise data hub 11 Open Source Scalable Flexible Cost-Effective ✔ Managed Open Architecture Secure and Governed ✔ BATCH PROCESSING STORAGE FOR ANY TYPE OF DATA UNIFIED, ELASTIC, RESILIENT, SECURE SYSTEM MANAGEMENT FILESYSTEM MAPREDUCE HDFS CLOUDERAMANAGER ✖ ✖ ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  12. 12. 12 From Apache Hadoop to an enterprise data hub 12 Open Source Scalable Flexible Cost-Effective ✔ Managed Open Architecture Secure and Governed ✔ ✔ BATCH PROCESSING ANALYTIC SQL SEARCH ENGINE MACHINE LEARNING STREAM PROCESSING 3RD PARTY APPS WORKLOAD MANAGEMENT STORAGE FOR ANY TYPE OF DATA UNIFIED, ELASTIC, RESILIENT, SECURE SYSTEM MANAGEMENT FILESYSTEM ONLINE NOSQL MAPREDUCE IMPALA SOLR SPARK SPARK STREAMING YARN HDFS HBASE CLOUDERAMANAGER ✖ ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  13. 13. 13 From Apache Hadoop to an enterprise data hub 13 Open Source Scalable Flexible Cost-Effective ✔ Managed Open Architecture Secure and Governed ✔ ✔ ✔ BATCH PROCESSING ANALYTIC SQL SEARCH ENGINE MACHINE LEARNING STREAM PROCESSING 3RD PARTY APPS WORKLOAD MANAGEMENT STORAGE FOR ANY TYPE OF DATA UNIFIED, ELASTIC, RESILIENT, SECURE DATA MANAGEMENT SYSTEM MANAGEMENT FILESYSTEM ONLINE NOSQL MAPREDUCE IMPALA SOLR SPARK SPARK STREAMING YARN HDFS HBASE CLOUDERANAVIGATORCLOUDERAMANAGER SENTRY ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  14. 14. 14 From Apache Hadoop to an enterprise data hub 14 Open Source Scalable Flexible Cost-Effective ✔ Managed Open Architecture Secure and Governed ✔ ✔ ✔ BATCH PROCESSING ANALYTIC SQL SEARCH ENGINE MACHINE LEARNING STREAM PROCESSING 3RD PARTY APPS WORKLOAD MANAGEMENT STORAGE FOR ANY TYPE OF DATA UNIFIED, ELASTIC, RESILIENT, SECURE DATA MANAGEMENT SYSTEM MANAGEMENT CLOUDERA’S ENTERPRISE DATA HUB FILESYSTEM ONLINE NOSQL MAPREDUCE IMPALA SOLR SPARK SPARK STREAMING YARN HDFS HBASE CLOUDERANAVIGATORCLOUDERAMANAGER SENTRY ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  15. 15. 15 Partners Proactive & Predictive Support Professional Services Training Cloudera: Your Trusted Advisor for Big Data 15 Advance from Strategy to ROI with Best Practices and Peak Performance ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  16. 16. 16 ©2014Cloudera, Inc. All rights reserved.16 ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  17. 17. 17 The Impact of ELT & Dormant Data on the EDW 17 ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.  ELT drives up to 80% of database capacity  Dormant – rarely used data – waste premium storage  ETL/ELT processes on dormant data waste premium CPU cycles Hot Warm Cold Data Transformations (ELT) of unused data
  18. 18. 1818 ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  19. 19. 19 Where to Start? 19 How to identify dormant data? What workloads will deliver the biggest impact? How will you access & move all your data? Can you secure the new environment? How do you optimize it? How do you manage it? How do you make it business-class? What tools do you need? How will you leverage all your data, including mainframes? ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  20. 20. 2020 Offload Legacy Data & Workloads to The Enterprise Data Hub Phase III: Optimize & Secure Phase II: Offload Phase I: Identify One Framework. Blazing Performance, Iron-Clad Security, Disruptive Economics • Identify data & workloads most suitable for offload • Focus on those that will deliver maximum savings & performance • Access and move virtually any data e.g. mainframe to Enterprise Data Hub with one tool • Easily replicate existing staging workloads in Hadoop using a graphical user interface • Deploy on premises and in Cloud • Optimize the new environment • Manage & secure all your data with business class tools • Deliver self-service reporting ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  21. 21. 21
  22. 22. 22 The Problem: Volume of DataBusinesses are struggling to unlock exploding data ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  23. 23. 24 The Problem: Old School Software Traditional technologies are complicated, inflexible and slow moving ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  24. 24. 25 The Tableau RevolutionFast and easy analytics for everyone ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  25. 25. 26 FlexibleTransform all types of data into self-service analytics ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  26. 26. 27 For EveryoneEase of use leads to adoption across all departments and use cases ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  27. 27. 28 •LIVE DEMO ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  28. 28. 29 Case Study: Optimize EDW Leading Financial Org 29 0 50 100 150 200 250 ElapsedTime(m) HiveQL 217 min Syncsort DMX-h 9 min HiveQL 217 min Mainframe Offload (74-page COBOL copybook) Development Effort Syncsort DMX-h: 4 hrs. Manual Coding: Weeks! Benefits:  Cut development time from weeks to hours  Reduced complexity 47 HiveQL scripts to 4 DMX-h graphical jobs  Easily validate COBOL copybooks and find errors  Mainframe Data available to business for analytics  Staging & ELT moved out of RDBMS – Queries run faster ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.
  29. 29. 3030 Final Thoughts.. Rusty Sears ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved. Vice President of Enterprise Data Services and Big Data at Regions Financial Corporation
  30. 30. 31 ©2014Cloudera, Inc. All rights reserved.31 QUESTIONS? ©2014Cloudera, Syncsort, Tableau Inc. All rights reserved.

×