© Cloudera, Inc. All rights reserved.
FLEXIBLE CLOUD WHATEVER THE WORKLOAD
Colm Moynihan, Partner SE Manager EMEA, Cloudera
Brett Cooper, Data Solution Architect, Microsoft
Nov 2018
© Cloudera, Inc. All rights reserved. 2© Cloudera, Inc. All rights reserved.
CLOUDERA & MICROSOFT PARTNERSHIP
Microsoft and Cloudera are collaborating to help customers realize big data insights with the
cloud. Now Azure customers can deploy Cloudera Enterprise with a few clicks, visualize their
data with Microsoft Power BI and gain insights to transform their business – all within minutes.
Scott Guthrie, Executive Vice President of Cloud + Enterprise, Microsoft
© Cloudera, Inc. All rights reserved. 3© Cloudera, Inc. All rights reserved.
MICROSOFT AZURE: THE RIGHT CHOICE
TrustedProductive IntelligentHybrid
•
•
•
•
•
•
•
•
•
•
•
•
© Cloudera, Inc. All rights reserved.4 © Cloudera, Inc. All rights reserved.
ENTERPRISE CUSTOMER REQUIREMENTS FOR JOURNEY TO CLOUD
Enterprise Customer Needs
• Avoid lock-in (multi-cloud)
• Same solution as on-premises
and intelligent edge
• Open source standards
• Hybrid or multi-cloud
• Enterprise Grade Security
• Governance - Easy to track data
lineage for audits/compliance
• GDPR Ready
• Unified data services
• Eliminate data copies
• Easy to troubleshoot workloads
• Universally shared metadata
Flexibility Manageability Security
Only Cloudera can fulfill all of these customer requirements
© Cloudera, Inc. All rights reserved. 5© Cloudera, Inc. All rights reserved.
Sparks Loyalty 360
Campaigns
Recommendations
All Data one CDW
Connected Vessel
Advanced Analytics
Tidal patterns
Customer360
Complete Warehouse
All data feeds
Real-Time
Enhanced Analytics
Predict human flow
Pred maintenance
Travel Satisfaction
Staffing levels
NYSE
Barclays
BT
Siemens
BMW
Navistar
Adecco
Cox Automotive
Emirates
GSK
RBS
Deutsche Boerse
Warehouse IoT
Advanced
Analytics
Data Science
JOINT CUSTOMER SUCCESS ON AZURE
© Cloudera, Inc. All rights reserved. 6
SHARED DATA EXPERIENCEDEPLOYMENT OPTIONSMODERN PLATFORM
Amazon
S3
LOCATION
STORAGE
MANAGEABILITY
Microsoft
ADLS
HDFS KUDU
Data Center
Self Managed Managed Service
DATA
ENGINEERING
DATA
WAREHOUSE
DATA
SCIENCE
OPERATIONAL
DATABASE
SECURITY
GOVERNANCE
LIFECYCLE MANAGEMENT
DATA CATALOG
CLOUDERA ON CLOUD – ALTUS DIRECTOR & ALTUS
SERVICES
© Cloudera, Inc. All rights reserved. 7© Cloudera, Inc. All rights reserved.
Azure
VMs
Oracle Netezza Teradata
Tableau, BO
Qlik, Arcadia
PowerBI
Workload
XM
Impala / HIVE / Spark
Cloudera Data Warehouse
SDX
Security
Governance
Catalog
Offload Traditional Warehouses
Microsoft
Cloudera
Kafka, Flume
SQOOP
Migrate
Navigator
Optimizer
AZ
Data Factory
ISVs
Use Cases
HUE
(SQL)
DB2
Mainframe
Kudu
Active
Directory
Cloudera Data Warehouse on Azure – SQL, Search and new workloads
Structured
Reporting
Ad Hoc
Analytics
Customer
360
Discovery
Search
Big Data
Applications
HDFS
Web logs Unstructured Data
ETL
ETL Vendors
© Cloudera, Inc. All rights reserved. 8© Cloudera, Inc. All rights reserved.
CLOUDERA DATA WAREHOUSE – SQL and SEARCH
Decoupled architecture on-prem & cloud-native
Analytic Workbench:
SQL Developers
HUE
Preferred BI Tools:
Analysts
Workload XM:
Migrate, Analyze,
Optimize, Scale
Cloudera Navigator:
Trust & Stewardship
Apache Impala
Query Engine
Hive-on-Spark
ETL Processing
Amazon
S3
Microsoft
ADLSHDFS KUDU
© Cloudera, Inc. All rights reserved. 9© Cloudera, Inc. All rights reserved.
Azure
VMs Active
Directory
HDFS
Cloudera Enterprise Data Hub
Cloudera Data Science Workbench
SDX
Security
Governance
Catalog
Open Source Libraries
Models to Production
Data Scientists
Github,
BitBucket
Microsoft
Cloudera
Use Cases
Enterprise Data Science at Scale – Secure and Governed
Kudu
GPU CPU
NLP - ChatBot
Customer
Churn
Predictive
Maintenance
Recommend
Engine
Image / Facial
Recognition
Threat
Detection
HBase
Real-Time Model REST API
© Cloudera, Inc. All rights reserved. 10© Cloudera, Inc. All rights reserved.
CLOUDERA DATA SCIENCE WORKBENCH
Accelerate Machine Learning from Research to Production
For data scientists
• Experiment faster
Use R, Python, or Scala with on-
demand compute and secure
CDH data access
• Work together
Share reproducible research
with your whole team
• Deploy with confidence
Get to production repeatably
and without recoding
For IT professionals
• Bring data science to the data
Give your data science team
more freedom while reducing
the risk and cost of silos
• Secure by default
Leverage common security and
governance across workloads
• Run anywhere
On-premises or in the cloud
© Cloudera, Inc. All rights reserved. 11© Cloudera, Inc. All rights reserved.
WHY CLOUDERA?
Hybrid Runs anywhere Faster Migration to Azure
Security and
Governance
RBAC, Audit,
Lineage
Remove Infosec blockers to
Cloud migration
Productionise
ML/AI
Deploy models as
REST API
Production on Azure, not
just Dev and Test
Native ADLS
Support
No data movement Move Workload to the data
Certified MSFT
Products
PowerBI, AD Existing investments works
Partner
Ecosystem
Qlik, Informatica,
Microsoft
Accelerate deployment and
consumption
© Cloudera, Inc. All rights reserved.
QUESTIONS
© Cloudera, Inc. All rights reserved. 13© Cloudera, Inc. All rights reserved.
TALK TO US AFTERWARDS
STAND 533

Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER THE WORKLOAD

  • 1.
    © Cloudera, Inc.All rights reserved. FLEXIBLE CLOUD WHATEVER THE WORKLOAD Colm Moynihan, Partner SE Manager EMEA, Cloudera Brett Cooper, Data Solution Architect, Microsoft Nov 2018
  • 2.
    © Cloudera, Inc.All rights reserved. 2© Cloudera, Inc. All rights reserved. CLOUDERA & MICROSOFT PARTNERSHIP Microsoft and Cloudera are collaborating to help customers realize big data insights with the cloud. Now Azure customers can deploy Cloudera Enterprise with a few clicks, visualize their data with Microsoft Power BI and gain insights to transform their business – all within minutes. Scott Guthrie, Executive Vice President of Cloud + Enterprise, Microsoft
  • 3.
    © Cloudera, Inc.All rights reserved. 3© Cloudera, Inc. All rights reserved. MICROSOFT AZURE: THE RIGHT CHOICE TrustedProductive IntelligentHybrid • • • • • • • • • • • •
  • 4.
    © Cloudera, Inc.All rights reserved.4 © Cloudera, Inc. All rights reserved. ENTERPRISE CUSTOMER REQUIREMENTS FOR JOURNEY TO CLOUD Enterprise Customer Needs • Avoid lock-in (multi-cloud) • Same solution as on-premises and intelligent edge • Open source standards • Hybrid or multi-cloud • Enterprise Grade Security • Governance - Easy to track data lineage for audits/compliance • GDPR Ready • Unified data services • Eliminate data copies • Easy to troubleshoot workloads • Universally shared metadata Flexibility Manageability Security Only Cloudera can fulfill all of these customer requirements
  • 5.
    © Cloudera, Inc.All rights reserved. 5© Cloudera, Inc. All rights reserved. Sparks Loyalty 360 Campaigns Recommendations All Data one CDW Connected Vessel Advanced Analytics Tidal patterns Customer360 Complete Warehouse All data feeds Real-Time Enhanced Analytics Predict human flow Pred maintenance Travel Satisfaction Staffing levels NYSE Barclays BT Siemens BMW Navistar Adecco Cox Automotive Emirates GSK RBS Deutsche Boerse Warehouse IoT Advanced Analytics Data Science JOINT CUSTOMER SUCCESS ON AZURE
  • 6.
    © Cloudera, Inc.All rights reserved. 6 SHARED DATA EXPERIENCEDEPLOYMENT OPTIONSMODERN PLATFORM Amazon S3 LOCATION STORAGE MANAGEABILITY Microsoft ADLS HDFS KUDU Data Center Self Managed Managed Service DATA ENGINEERING DATA WAREHOUSE DATA SCIENCE OPERATIONAL DATABASE SECURITY GOVERNANCE LIFECYCLE MANAGEMENT DATA CATALOG CLOUDERA ON CLOUD – ALTUS DIRECTOR & ALTUS SERVICES
  • 7.
    © Cloudera, Inc.All rights reserved. 7© Cloudera, Inc. All rights reserved. Azure VMs Oracle Netezza Teradata Tableau, BO Qlik, Arcadia PowerBI Workload XM Impala / HIVE / Spark Cloudera Data Warehouse SDX Security Governance Catalog Offload Traditional Warehouses Microsoft Cloudera Kafka, Flume SQOOP Migrate Navigator Optimizer AZ Data Factory ISVs Use Cases HUE (SQL) DB2 Mainframe Kudu Active Directory Cloudera Data Warehouse on Azure – SQL, Search and new workloads Structured Reporting Ad Hoc Analytics Customer 360 Discovery Search Big Data Applications HDFS Web logs Unstructured Data ETL ETL Vendors
  • 8.
    © Cloudera, Inc.All rights reserved. 8© Cloudera, Inc. All rights reserved. CLOUDERA DATA WAREHOUSE – SQL and SEARCH Decoupled architecture on-prem & cloud-native Analytic Workbench: SQL Developers HUE Preferred BI Tools: Analysts Workload XM: Migrate, Analyze, Optimize, Scale Cloudera Navigator: Trust & Stewardship Apache Impala Query Engine Hive-on-Spark ETL Processing Amazon S3 Microsoft ADLSHDFS KUDU
  • 9.
    © Cloudera, Inc.All rights reserved. 9© Cloudera, Inc. All rights reserved. Azure VMs Active Directory HDFS Cloudera Enterprise Data Hub Cloudera Data Science Workbench SDX Security Governance Catalog Open Source Libraries Models to Production Data Scientists Github, BitBucket Microsoft Cloudera Use Cases Enterprise Data Science at Scale – Secure and Governed Kudu GPU CPU NLP - ChatBot Customer Churn Predictive Maintenance Recommend Engine Image / Facial Recognition Threat Detection HBase Real-Time Model REST API
  • 10.
    © Cloudera, Inc.All rights reserved. 10© Cloudera, Inc. All rights reserved. CLOUDERA DATA SCIENCE WORKBENCH Accelerate Machine Learning from Research to Production For data scientists • Experiment faster Use R, Python, or Scala with on- demand compute and secure CDH data access • Work together Share reproducible research with your whole team • Deploy with confidence Get to production repeatably and without recoding For IT professionals • Bring data science to the data Give your data science team more freedom while reducing the risk and cost of silos • Secure by default Leverage common security and governance across workloads • Run anywhere On-premises or in the cloud
  • 11.
    © Cloudera, Inc.All rights reserved. 11© Cloudera, Inc. All rights reserved. WHY CLOUDERA? Hybrid Runs anywhere Faster Migration to Azure Security and Governance RBAC, Audit, Lineage Remove Infosec blockers to Cloud migration Productionise ML/AI Deploy models as REST API Production on Azure, not just Dev and Test Native ADLS Support No data movement Move Workload to the data Certified MSFT Products PowerBI, AD Existing investments works Partner Ecosystem Qlik, Informatica, Microsoft Accelerate deployment and consumption
  • 12.
    © Cloudera, Inc.All rights reserved. QUESTIONS
  • 13.
    © Cloudera, Inc.All rights reserved. 13© Cloudera, Inc. All rights reserved. TALK TO US AFTERWARDS STAND 533