© Cloudera, Inc. All rights reserved.
Data Driven With the Cloudera Data
Warehouse
David Dichmann | ddichmann@cloudera.com
© Cloudera, Inc. All rights reserved. 2© Cloudera, Inc. All rights reserved.
What’s YOUR Data Strategy?
© Cloudera, Inc. All rights reserved. 3
OUTCOMES
• Curated Data and Agile Discovery
with HIPAA compliance
• Accelerated new Drug
Development
NEW PRODUCT DEVELOPMENT
GLOBAL
PHARMACEUTICAL
Use Cases
Users
Fewer Silos
Diverse Data
© Cloudera, Inc. All rights reserved. 4
OUTCOMES
• LoB Data Analysts access all data
• Saved $4M+ in deposit fraud
FRAUD PREVENTION
LARGE NORTH
AMERICAN BANK
Terabytes
Users
Databases
Queries / Month
© Cloudera, Inc. All rights reserved. 5
OUTCOMES
• $10 M new revenue
• $30 M+ price optimization
• $100K+ weather correlation
BUSINESS OPTIMIZATION
MAJOR TELCO
MANUFACTURER
Query
Responses
New Sources
Data Sets
Users
© Cloudera, Inc. All rights reserved.6 © Cloudera, Inc. All rights reserved.
Quickly enable business analytics by sharing petabytes of verified data
across thousands of users while surpassing demands of SLAs and costs
Massive, Diverse Data Security, Governance
User Profiles, Use Cases Self Service EverythingAutomation, Consistency
Experiments, Time To Value
© Cloudera, Inc. All rights reserved. 7
TRADITIONAL CHANGES MODERN
Users Internal Transparency +External
Curation Planned ETLs Flexibility On-Demand ELTs
Exploration Constrained Self-Service Freeform
Volume Finite Correlations Virtually Infinite
© Cloudera, Inc. All rights reserved. 8© Cloudera, Inc. All rights reserved.
TRADITIONAL DATA WAREHOUSE
Structured Data
Sources
(ERP, CRM, SCM)
Transformations
EDW
Advanced
Analytics
Dashboards
Ad Hoc
Canned
Reports
Staging
Data Marts
Seceral Months
Master Schema
ETLODS
2 3
4
1 5
Struggle to handle volume
and variety
Limited
Access
© Cloudera, Inc. All rights reserved. 9© Cloudera, Inc. All rights reserved.
MODERN DATA WAREHOUSE
Advanced
Analytics
Dashboards
Ad Hoc
Canned
Reports
Data Store
Within Days
Data Marts
1
2
Ingest & Store All Data
At Scale
Self-service /
On-demand
Variety of Data
Sources/Types
© Cloudera, Inc. All rights reserved. 10© Cloudera, Inc. All rights reserved.
MODERN DATA WAREHOUSE
Fixed
Reports
DATA SOURCES
Flexible
Reporting
Advanced
Analytics
Self-Service
BI/Ad Hoc
Dashboards/
Analytic Apps
EDW
COMPLIMENTING A TRADITIONAL EDW
© Cloudera, Inc. All rights reserved. 11© Cloudera, Inc. All rights reserved.
CLOUD NATIVE WITH ALTUS DW
Multi-Cloud PaaS for Agile Analytics
● Quick time to value for analytics - no
software or clusters to manage
● Bring the warehouse to the data with
zero copy simplicity
● Use your security policies with your
data - no proprietary stacks
● Apply enterprise governance to
transient workloads
● Shared data experience with SDX, for
analytic workloads
● Optimized for Azure & AWS
DATA WAREHOUSE
GOVERNANCESECURITY
ALTUS CONTROL
PLANE
LIFECYCLE
MANAGEMENT
MULTI-CLOUD
Amazon
S3
Microsoft
ADLS
© Cloudera, Inc. All rights reserved. 12© Cloudera, Inc. All rights reserved.
Traditional Data
Warehouse Optimization
Transform Status Quo
TRANSFORMATIONAL AREAS OF DATA WAREHOUSING
Operations & Events Data
Warehouse
Run Business Better
Research & Discovery
Data Warehouse
Change the Culture
© Cloudera, Inc. All rights reserved. 13
DRIVERS FOR MODERNIZATION
Deeper Business Insights
Grow
• Customer Sentiment
• Fault Prevention
• Improve Product Quality
• New Revenue Streams
Experimentation and
collaboration at scale
Protect
• Proactive Fraud Prevention
• Keep up with Regulatory
Compliance
• Preempt Cyberthreats
Real-time response on
massive data volume and
variety
Connect
• Improve Operational
Efficiency
• Support Internet of Things
(IoT)
New analytics techniques
democratized to all users
© Cloudera, Inc. All rights reserved. 14
CHALLENGES OF A MODERN DATA WAREHOUSE
Extreme Speed and Scale
More Data
• Massive amounts handled
faster at scale
• More variety from new
sources (social media, IoT)
• Insight within minutes of
new data arrival
Performance and
flexibility at scale
More Workloads
• 100’s of production grade
deployments
• Enterprise grade
dependability
• Strict security and
governance
On-demand scale out,
discovery, collaboration
More People
• 1,000’s of new users and
new user types
• 1,000’s of new use cases
• All skill levels: Analytics,
Data Science, and Machine
Learning
All workloads with a
shared data experience
© Cloudera, Inc. All rights reserved. 15
Optimize Core
Processes
● Versatile Solution
● Broaden Data Reach
● Reduce IT Burden or Costs
Dynamic
Consumption
● Transient, Short-lived, Long-lived
● Public, Private, Hybrid Multi-Cloud
● Adaptive Compute & Storage
Self-Service
Everything
● Resource Provisioning
● Workload Development
● Optimizing & Troubleshooting
CLOUDERA MODERN DATA WAREHOUSE
Optimize Processes, Consumption and Costs
https://www.cloudera.com/about/customers/xl-axiata.html
https://blog.cloudera.com/blog/2018/03/automated-provisioning-
of-cdh-in-the-cloud-with-cloudera-director-and-ansible/
https://www.cloudera.com/about/customers/komatsu-mining.html
© Cloudera, Inc. All rights reserved. 16© Cloudera, Inc. All rights reserved.
Financial Services Telecom Government Healthcare Manufacturing
Customer 360
Personalized Medicine
Supply Chain Analysis
Operational Efficiencies
Network Quality Analysis
Equipment Health (IoT)
Fraud
Compliance
Cyber Threat Analysis
Regulatory Reporting
TOP 10 DATA WAREHOUSE USE CASES BY INDUSTRYGROWCONNECTPROTECT
© Cloudera, Inc. All rights reserved.17
A MODERN DATA WAREHOUSE FROM CLOUDERA
HYBRID
Storage
Preferred BI & ELT ToolsHue Analytic Workbench,
Superset Dashboards, CDSW
Workload XM,
Data Analytics Studio
Navigator & Sentry,
Atlas & Ranger
Impala / Hive LLAP
Query Engine
Hive on Tez / Spark
ELT Processing
KUDU | HDFS | Druid
Local Storage
AWS S3 | ADLS
Object Storage
Shared Data Experience (SDX)
Optimized File Formats
(ORC, Parquet, Avro, JSON)
Solr
Search Analytics
Cloudera Manager,
Ambari, Altus, Data Plane
HYBRID
Controls
HYBRID
Compute
HYBRID
Storage
HYBRID
Reporting
© Cloudera, Inc. All rights reserved.18 © Cloudera, Inc. All rights reserved.
EXTREME SPEED & SCALE
Fastest ELT at Scale
for Data Engineers
● Fast data with distributed, in-memory
processing
● Curated data, metadata instantly
available
Fastest Self-Service BI at Scale
for Analysts & Developers
● Interactive multi-user queries without rigid
modeling for exploration
● Elastic scalability for more users/data
Impala
LLAP
© Cloudera, Inc. All rights reserved. 19
EXTENSIVE PARTNER ECOSYSTEM
System
Integrators
ISV IHV
Alliances
Cloud
Alliances
OEM
Alliances
Market Expansion
© Cloudera, Inc. All rights reserved.20 © Cloudera, Inc. All rights reserved.
CLOUDERA DW - PARTING THOUGHTS
Hybrid Optimized Shared Data ExperiencePerformance @Scale
Shared Data
Exponential Use Cases, Successful Outcomes
© Cloudera, Inc. All rights reserved.
THANK YOU

Data Driven With the Cloudera Modern Data Warehouse 3.19.19

  • 1.
    © Cloudera, Inc.All rights reserved. Data Driven With the Cloudera Data Warehouse David Dichmann | ddichmann@cloudera.com
  • 2.
    © Cloudera, Inc.All rights reserved. 2© Cloudera, Inc. All rights reserved. What’s YOUR Data Strategy?
  • 3.
    © Cloudera, Inc.All rights reserved. 3 OUTCOMES • Curated Data and Agile Discovery with HIPAA compliance • Accelerated new Drug Development NEW PRODUCT DEVELOPMENT GLOBAL PHARMACEUTICAL Use Cases Users Fewer Silos Diverse Data
  • 4.
    © Cloudera, Inc.All rights reserved. 4 OUTCOMES • LoB Data Analysts access all data • Saved $4M+ in deposit fraud FRAUD PREVENTION LARGE NORTH AMERICAN BANK Terabytes Users Databases Queries / Month
  • 5.
    © Cloudera, Inc.All rights reserved. 5 OUTCOMES • $10 M new revenue • $30 M+ price optimization • $100K+ weather correlation BUSINESS OPTIMIZATION MAJOR TELCO MANUFACTURER Query Responses New Sources Data Sets Users
  • 6.
    © Cloudera, Inc.All rights reserved.6 © Cloudera, Inc. All rights reserved. Quickly enable business analytics by sharing petabytes of verified data across thousands of users while surpassing demands of SLAs and costs Massive, Diverse Data Security, Governance User Profiles, Use Cases Self Service EverythingAutomation, Consistency Experiments, Time To Value
  • 7.
    © Cloudera, Inc.All rights reserved. 7 TRADITIONAL CHANGES MODERN Users Internal Transparency +External Curation Planned ETLs Flexibility On-Demand ELTs Exploration Constrained Self-Service Freeform Volume Finite Correlations Virtually Infinite
  • 8.
    © Cloudera, Inc.All rights reserved. 8© Cloudera, Inc. All rights reserved. TRADITIONAL DATA WAREHOUSE Structured Data Sources (ERP, CRM, SCM) Transformations EDW Advanced Analytics Dashboards Ad Hoc Canned Reports Staging Data Marts Seceral Months Master Schema ETLODS 2 3 4 1 5 Struggle to handle volume and variety Limited Access
  • 9.
    © Cloudera, Inc.All rights reserved. 9© Cloudera, Inc. All rights reserved. MODERN DATA WAREHOUSE Advanced Analytics Dashboards Ad Hoc Canned Reports Data Store Within Days Data Marts 1 2 Ingest & Store All Data At Scale Self-service / On-demand Variety of Data Sources/Types
  • 10.
    © Cloudera, Inc.All rights reserved. 10© Cloudera, Inc. All rights reserved. MODERN DATA WAREHOUSE Fixed Reports DATA SOURCES Flexible Reporting Advanced Analytics Self-Service BI/Ad Hoc Dashboards/ Analytic Apps EDW COMPLIMENTING A TRADITIONAL EDW
  • 11.
    © Cloudera, Inc.All rights reserved. 11© Cloudera, Inc. All rights reserved. CLOUD NATIVE WITH ALTUS DW Multi-Cloud PaaS for Agile Analytics ● Quick time to value for analytics - no software or clusters to manage ● Bring the warehouse to the data with zero copy simplicity ● Use your security policies with your data - no proprietary stacks ● Apply enterprise governance to transient workloads ● Shared data experience with SDX, for analytic workloads ● Optimized for Azure & AWS DATA WAREHOUSE GOVERNANCESECURITY ALTUS CONTROL PLANE LIFECYCLE MANAGEMENT MULTI-CLOUD Amazon S3 Microsoft ADLS
  • 12.
    © Cloudera, Inc.All rights reserved. 12© Cloudera, Inc. All rights reserved. Traditional Data Warehouse Optimization Transform Status Quo TRANSFORMATIONAL AREAS OF DATA WAREHOUSING Operations & Events Data Warehouse Run Business Better Research & Discovery Data Warehouse Change the Culture
  • 13.
    © Cloudera, Inc.All rights reserved. 13 DRIVERS FOR MODERNIZATION Deeper Business Insights Grow • Customer Sentiment • Fault Prevention • Improve Product Quality • New Revenue Streams Experimentation and collaboration at scale Protect • Proactive Fraud Prevention • Keep up with Regulatory Compliance • Preempt Cyberthreats Real-time response on massive data volume and variety Connect • Improve Operational Efficiency • Support Internet of Things (IoT) New analytics techniques democratized to all users
  • 14.
    © Cloudera, Inc.All rights reserved. 14 CHALLENGES OF A MODERN DATA WAREHOUSE Extreme Speed and Scale More Data • Massive amounts handled faster at scale • More variety from new sources (social media, IoT) • Insight within minutes of new data arrival Performance and flexibility at scale More Workloads • 100’s of production grade deployments • Enterprise grade dependability • Strict security and governance On-demand scale out, discovery, collaboration More People • 1,000’s of new users and new user types • 1,000’s of new use cases • All skill levels: Analytics, Data Science, and Machine Learning All workloads with a shared data experience
  • 15.
    © Cloudera, Inc.All rights reserved. 15 Optimize Core Processes ● Versatile Solution ● Broaden Data Reach ● Reduce IT Burden or Costs Dynamic Consumption ● Transient, Short-lived, Long-lived ● Public, Private, Hybrid Multi-Cloud ● Adaptive Compute & Storage Self-Service Everything ● Resource Provisioning ● Workload Development ● Optimizing & Troubleshooting CLOUDERA MODERN DATA WAREHOUSE Optimize Processes, Consumption and Costs https://www.cloudera.com/about/customers/xl-axiata.html https://blog.cloudera.com/blog/2018/03/automated-provisioning- of-cdh-in-the-cloud-with-cloudera-director-and-ansible/ https://www.cloudera.com/about/customers/komatsu-mining.html
  • 16.
    © Cloudera, Inc.All rights reserved. 16© Cloudera, Inc. All rights reserved. Financial Services Telecom Government Healthcare Manufacturing Customer 360 Personalized Medicine Supply Chain Analysis Operational Efficiencies Network Quality Analysis Equipment Health (IoT) Fraud Compliance Cyber Threat Analysis Regulatory Reporting TOP 10 DATA WAREHOUSE USE CASES BY INDUSTRYGROWCONNECTPROTECT
  • 17.
    © Cloudera, Inc.All rights reserved.17 A MODERN DATA WAREHOUSE FROM CLOUDERA HYBRID Storage Preferred BI & ELT ToolsHue Analytic Workbench, Superset Dashboards, CDSW Workload XM, Data Analytics Studio Navigator & Sentry, Atlas & Ranger Impala / Hive LLAP Query Engine Hive on Tez / Spark ELT Processing KUDU | HDFS | Druid Local Storage AWS S3 | ADLS Object Storage Shared Data Experience (SDX) Optimized File Formats (ORC, Parquet, Avro, JSON) Solr Search Analytics Cloudera Manager, Ambari, Altus, Data Plane HYBRID Controls HYBRID Compute HYBRID Storage HYBRID Reporting
  • 18.
    © Cloudera, Inc.All rights reserved.18 © Cloudera, Inc. All rights reserved. EXTREME SPEED & SCALE Fastest ELT at Scale for Data Engineers ● Fast data with distributed, in-memory processing ● Curated data, metadata instantly available Fastest Self-Service BI at Scale for Analysts & Developers ● Interactive multi-user queries without rigid modeling for exploration ● Elastic scalability for more users/data Impala LLAP
  • 19.
    © Cloudera, Inc.All rights reserved. 19 EXTENSIVE PARTNER ECOSYSTEM System Integrators ISV IHV Alliances Cloud Alliances OEM Alliances Market Expansion
  • 20.
    © Cloudera, Inc.All rights reserved.20 © Cloudera, Inc. All rights reserved. CLOUDERA DW - PARTING THOUGHTS Hybrid Optimized Shared Data ExperiencePerformance @Scale Shared Data Exponential Use Cases, Successful Outcomes
  • 21.
    © Cloudera, Inc.All rights reserved. THANK YOU