1© Cloudera, Inc. All rights reserved.
Machine Learning, AI & the Future of Data Analytics
Dr. Amr Awadallah | Co-Founder, CTO | Cloudera
2© Cloudera, Inc. All rights reserved.
We believe
data can make what is impossible
today, possible tomorrow
3© Cloudera, Inc. All rights reserved.
Today
log(time)
100,000 years ago 10,000 1,000 100 10
Wave 1
4© Cloudera, Inc. All rights reserved.
Today
log(time)
100,000 years 10,000 1,000 100 10
Wave 2
5© Cloudera, Inc. All rights reserved.
Today
log(time)
100,000 years 10,000 1,000 100 10
Wave 3
6© Cloudera, Inc. All rights reserved.
Today
log(time)
100,000 years 10,000 1,000 100 10
Wave 4
7© Cloudera, Inc. All rights reserved.
Today
log(time)
100,000 years 10,000 1,000 100 10
Wave 5
8© Cloudera, Inc. All rights reserved.
Today
log(time)
100,000 years 10,000 1,000 100 10
Wave 6
9© Cloudera, Inc. All rights reserved.
Cost of compute
Data volume
Time
Machine
Learning
NO
Machine
Learning
1950s 1960s 1970s 1980s 1990s 2000s 2010s
Age of Machine Learning
10© Cloudera, Inc. All rights reserved.
We deliver
the modern platform for machine learning and
analytics optimized for the cloud
RUNS ANYWHERE
Cloud
Multi-cloud
On-premises
SCALABLE
Elastic
Cost-effective
Lower TCO
ENTERPRISE GRADE
Secure
Performant
Compliant
11© Cloudera, Inc. All rights reserved. 11
The modern platform for machine learning and analytics optimized for the cloud
EXTENSIBLE
SERVICES
CORE
SERVICES DATA
ENGINEERING
OPERATIONAL
DATABASE
ANALYTIC
DATABASE
DATA CATALOG
INGEST &
REPLICATION
SECURITY GOVERNANCE
WORKLOAD
MANAGEMENT
DATA
SCIENCE
NEW
OFFERINGS
Cloudera Enterprise
Amazon S3 Microsoft ADLS HDFS KUDU
STORAGE
SERVICES
12© Cloudera, Inc. All rights reserved.
• Unified security – protects sensitive data with consistent
controls, even for transient and recurring workloads
• Consistent governance – enables secure self-service access
to all relevant data and increases compliance
• Easy workload management – increases user productivity and
boosts job predictability
• Flexible ingest and replication – aggregates a single copy of
all data, provides disaster recovery, and eases migration
• Shared catalog – defines and preserves structure and
business context of data for new applications and partner
solutions
Open platform services
Built for multi-function analytics | Optimized for cloud
13© Cloudera, Inc. All rights reserved.
PATTERN
RECOGNITION
ANOMALY
DETECTION
PREDICTION
700+CUSTOMERS RUN
ON
DRIVE CUSTOMER INSIGHTS
Market segmentation
Customer 360
Next best offer
Churn analysis & prevention
PROTECT BUSINESS
Cybersecurity
Fraud
Anti-money laundering
Risk modeling & assessment
SPAM detection
CONNECT PRODUCTS & SERVICES
(IoT)
Predictive maintenance
Genomics & personalized medicine
Predicting and preventing disease
Natural language
The enterprise platform for machine learning
14© Cloudera, Inc. All rights reserved. 14© Cloudera, Inc. All rights reserved.
Supports multiple languages — TensorFlow, R,
Python
Direct, secure access to production data
with Impala & Spark
Collaborative and reproducible data science
Accelerating data science and
ML from exploration to production
Continued machine
learning innovation
15© Cloudera, Inc. All rights reserved.
Published research subscription service
Delivers cutting edge advances in applied ML / AI
Accelerates adoption in large enterprises
Drives demand for our platform
Applied research for machine
learning and data science
Continued machine
learning innovation
15© Cloudera, Inc. All rights reserved.
16© Cloudera, Inc. All rights reserved.
Multi-cloud
Platform-as-a-Service
Powered by
16© Cloudera, Inc. All rights reserved.
17© Cloudera, Inc. All rights reserved.
5 keys to success
1) Build a data-driven culture
2) Develop the right team and skills
3) Be agile/lean in development
4) Leverage DevOps for production
5) Right-size data governance
17© Cloudera, Inc. All rights reserved.
18© Cloudera, Inc. All rights reserved.
Adoption driven by large enterprises
1000+ customers
across all verticals
~600 Global 8000
customers
7/10 9/10 29 6/10 8/10
Top Global Top Global Top Global Top GlobalGovernment
customers
BANKING TELCO PUBLIC HEALTHCARE TECHNOLOGY
19© Cloudera, Inc. All rights reserved.
DRIVE CUSTOMER INSIGHTS CONNECT PRODUCTS & SERVICES
(IoT)
PROTECT
BUSINESS
Powering predictive analytics to increase
performance and reduce fleet downtime
Creating new revenue streams with an
advanced anti-fraud solution
Cloudera powering data-driven customers
Applying Predictive Analytics to Retain
Human Capital & Monetize Data
20© Cloudera, Inc. All rights reserved.
DRIVE CUSTOMER INSIGHTS CONNECT PRODUCTS & SERVICES
(IoT)
PROTECT
BUSINESS
21© Cloudera, Inc. All rights reserved.
Thank you
@awadallah

Big Data LDN 2017: Machine Learning, AI & The Future of Data Analytics

  • 1.
    1© Cloudera, Inc.All rights reserved. Machine Learning, AI & the Future of Data Analytics Dr. Amr Awadallah | Co-Founder, CTO | Cloudera
  • 2.
    2© Cloudera, Inc.All rights reserved. We believe data can make what is impossible today, possible tomorrow
  • 3.
    3© Cloudera, Inc.All rights reserved. Today log(time) 100,000 years ago 10,000 1,000 100 10 Wave 1
  • 4.
    4© Cloudera, Inc.All rights reserved. Today log(time) 100,000 years 10,000 1,000 100 10 Wave 2
  • 5.
    5© Cloudera, Inc.All rights reserved. Today log(time) 100,000 years 10,000 1,000 100 10 Wave 3
  • 6.
    6© Cloudera, Inc.All rights reserved. Today log(time) 100,000 years 10,000 1,000 100 10 Wave 4
  • 7.
    7© Cloudera, Inc.All rights reserved. Today log(time) 100,000 years 10,000 1,000 100 10 Wave 5
  • 8.
    8© Cloudera, Inc.All rights reserved. Today log(time) 100,000 years 10,000 1,000 100 10 Wave 6
  • 9.
    9© Cloudera, Inc.All rights reserved. Cost of compute Data volume Time Machine Learning NO Machine Learning 1950s 1960s 1970s 1980s 1990s 2000s 2010s Age of Machine Learning
  • 10.
    10© Cloudera, Inc.All rights reserved. We deliver the modern platform for machine learning and analytics optimized for the cloud RUNS ANYWHERE Cloud Multi-cloud On-premises SCALABLE Elastic Cost-effective Lower TCO ENTERPRISE GRADE Secure Performant Compliant
  • 11.
    11© Cloudera, Inc.All rights reserved. 11 The modern platform for machine learning and analytics optimized for the cloud EXTENSIBLE SERVICES CORE SERVICES DATA ENGINEERING OPERATIONAL DATABASE ANALYTIC DATABASE DATA CATALOG INGEST & REPLICATION SECURITY GOVERNANCE WORKLOAD MANAGEMENT DATA SCIENCE NEW OFFERINGS Cloudera Enterprise Amazon S3 Microsoft ADLS HDFS KUDU STORAGE SERVICES
  • 12.
    12© Cloudera, Inc.All rights reserved. • Unified security – protects sensitive data with consistent controls, even for transient and recurring workloads • Consistent governance – enables secure self-service access to all relevant data and increases compliance • Easy workload management – increases user productivity and boosts job predictability • Flexible ingest and replication – aggregates a single copy of all data, provides disaster recovery, and eases migration • Shared catalog – defines and preserves structure and business context of data for new applications and partner solutions Open platform services Built for multi-function analytics | Optimized for cloud
  • 13.
    13© Cloudera, Inc.All rights reserved. PATTERN RECOGNITION ANOMALY DETECTION PREDICTION 700+CUSTOMERS RUN ON DRIVE CUSTOMER INSIGHTS Market segmentation Customer 360 Next best offer Churn analysis & prevention PROTECT BUSINESS Cybersecurity Fraud Anti-money laundering Risk modeling & assessment SPAM detection CONNECT PRODUCTS & SERVICES (IoT) Predictive maintenance Genomics & personalized medicine Predicting and preventing disease Natural language The enterprise platform for machine learning
  • 14.
    14© Cloudera, Inc.All rights reserved. 14© Cloudera, Inc. All rights reserved. Supports multiple languages — TensorFlow, R, Python Direct, secure access to production data with Impala & Spark Collaborative and reproducible data science Accelerating data science and ML from exploration to production Continued machine learning innovation
  • 15.
    15© Cloudera, Inc.All rights reserved. Published research subscription service Delivers cutting edge advances in applied ML / AI Accelerates adoption in large enterprises Drives demand for our platform Applied research for machine learning and data science Continued machine learning innovation 15© Cloudera, Inc. All rights reserved.
  • 16.
    16© Cloudera, Inc.All rights reserved. Multi-cloud Platform-as-a-Service Powered by 16© Cloudera, Inc. All rights reserved.
  • 17.
    17© Cloudera, Inc.All rights reserved. 5 keys to success 1) Build a data-driven culture 2) Develop the right team and skills 3) Be agile/lean in development 4) Leverage DevOps for production 5) Right-size data governance 17© Cloudera, Inc. All rights reserved.
  • 18.
    18© Cloudera, Inc.All rights reserved. Adoption driven by large enterprises 1000+ customers across all verticals ~600 Global 8000 customers 7/10 9/10 29 6/10 8/10 Top Global Top Global Top Global Top GlobalGovernment customers BANKING TELCO PUBLIC HEALTHCARE TECHNOLOGY
  • 19.
    19© Cloudera, Inc.All rights reserved. DRIVE CUSTOMER INSIGHTS CONNECT PRODUCTS & SERVICES (IoT) PROTECT BUSINESS Powering predictive analytics to increase performance and reduce fleet downtime Creating new revenue streams with an advanced anti-fraud solution Cloudera powering data-driven customers Applying Predictive Analytics to Retain Human Capital & Monetize Data
  • 20.
    20© Cloudera, Inc.All rights reserved. DRIVE CUSTOMER INSIGHTS CONNECT PRODUCTS & SERVICES (IoT) PROTECT BUSINESS
  • 21.
    21© Cloudera, Inc.All rights reserved. Thank you @awadallah