EXTENDING CLOUDERA SDX
BEYOND THE PLATFORM
© Cloudera, Inc. All rights reserved.
TODAY’S SPEAKERS
Wim Stoop
Product Marketing Manager
wim@cloudera.com
Philip Duplisey
Sr. Director Consulting Services
pduplisey@bardess.com
Daniel Parton
Lead Data Scientist
dparton@bardess.com
David Freriks
Emerging Technology Evangelist
david.freriks@qlik.com
MULTI-DISCIPLINARY ANALYTICS
WE ALL HAVE BAGGAGE
TRADITIONAL APPLICATION SILOS
[Diagram: five application silos, each with its own storage and its own copy of the context layer]
APPLICATION: DATA SCIENCE, SQL ANALYTIC DATABASE, NOSQL & RT DATABASE, ETL & DATA ENGINEERING, DATA WAREHOUSE/MART
STORAGE: FS or RDBMS, separate for each silo
CONTEXT (repeated in every silo): SECURITY, GOVERNANCE, LIFECYCLE, CATALOG
BUSINESS IMPACT OF SILOED SYSTEMS
Average lost revenue
Inaccurate and duplicated data directly impacts bottom line of 88% of all companies.
Suffer from legacy
Legacy technology is holding back four in five organizations from taking advantage of data-driven opportunities.
Costly compliance
Average annual cost for financial services organizations to achieve compliance. By 2023, 35% of organizations in the sector will spend more than 5% of their revenue on achieving and maintaining it.
NEGATIVE BUSINESS IMPACT
• Increased operational costs: many distinct environments to buy and build
• Increased staff overhead: many distinct tools to learn and support
• Increased security risks: many distinct frameworks to enforce
• Decreased business insights: narrow data sets and analytics rigidity
• Decreased business agility: outdated and limiting for applications
• Decreased governance capability: no common visibility across stores
DATA CONTEXT CHALLENGE
Data: stateful
Compute: stateless
Context: stateless
CLOUDERA ENTERPRISE WITH SDX
Benefits for IT infra & ops
● Central control and security
● Focus on curating, not firefighting
Benefits for users
● Value from single source of truth
● Bring the best tools for each job
[Diagram: platform layers]
WORKLOADS: DATA ENGINEERING, DATA SCIENCE, DATA WAREHOUSE, OPERATIONAL DATABASE
3RD PARTY SERVICES
COMMON SERVICES: DATA CATALOG, SECURITY, GOVERNANCE, LIFECYCLE MANAGEMENT
STORAGE: HDFS, KUDU, Amazon S3, Microsoft ADLS
SHARED DATA EXPERIENCE
Built for multi-function analytics anywhere
• Security: role-based access control applied consistently across the platform. Includes full stack encryption and key management
• Governance: enterprise-grade auditing, lineage, and governance capabilities applied across the platform with rich extensibility for partner integrations
• Lifecycle Management: comprehensive ingest-to-purge management of data set lifecycle activities
• Data Catalog: a comprehensive catalog of all data sets, spanning on-premises, cloud object stores, structured, unstructured, and semi-structured
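The idea of one set of security and governance services shared by every workload can be illustrated with a small Python sketch. This is a conceptual toy only: `SharedContext`, `Workload`, the role and dataset names, and the audit tuple layout are all hypothetical and do not represent any Cloudera API.

```python
from dataclasses import dataclass, field


@dataclass
class SharedContext:
    """Hypothetical stand-in for shared services: one policy store, one audit trail."""
    # role -> set of (dataset, action) pairs that the role may perform
    grants: dict = field(default_factory=dict)
    # (workload, role, dataset, action, allowed) for every decision made
    audit_log: list = field(default_factory=list)

    def grant(self, role: str, dataset: str, action: str) -> None:
        self.grants.setdefault(role, set()).add((dataset, action))

    def authorize(self, workload: str, role: str, dataset: str, action: str) -> bool:
        allowed = (dataset, action) in self.grants.get(role, set())
        # Every decision is recorded, whichever workload asked.
        self.audit_log.append((workload, role, dataset, action, allowed))
        return allowed


@dataclass
class Workload:
    """Hypothetical workload that delegates access decisions to the shared context."""
    name: str
    context: SharedContext

    def read(self, role: str, dataset: str) -> str:
        if not self.context.authorize(self.name, role, dataset, "read"):
            raise PermissionError(f"{role} may not read {dataset} via {self.name}")
        return f"{self.name} read {dataset} as {role}"


# One policy, defined once, is enforced for both workloads.
context = SharedContext()
context.grant("analyst", "sales", "read")

warehouse = Workload("data_warehouse", context)
science = Workload("data_science", context)
```

Because both workloads hold a reference to the same context object, a grant made once applies to each of them, and the audit log gives a single view of access across workloads. In the siloed picture, each workload would instead carry its own copy of the policy store and its own log.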
WE’RE NOT AN ISLAND
PARTNER ECOSYSTEM
ISVs & SOLUTIONS
CLOUD & PLATFORM
SYSTEM INTEGRATORS
RESELLERS
SOLUTION GALLERY
BARDESS – FROM DATA TO INSIGHTS. EVERYDAY
Zero2Hero™: made possible by SDX.
• Bardess Group and Zero2Hero intro
• Importance of data context in solutions involving multiple partners
• SDX orchestrating context between moving parts
• Demo, also highlighting SDX elements
We transform data into insights and action, every day.
Bardess is a consulting company focused on designing and implementing data analytics solutions.
We are a team of data and business professionals who ask insightful questions, extend boundaries, and take action.
• Bardess is organized around the modern data analytics ecosystem to identify strategic business opportunities at every step.
• We build our teams to uncover insights hidden deep within data.
DATA PRACTICES: MANAGEMENT CONSULTING, DATA ENGINEERING, DATA SCIENCE, DATA ANALYTICS
Zero2Hero (z2H) Stack Summary
Stack Function: Description
STORAGE & PROCESSING: Any data platform must start with the storage and processing layer, where modern "schema-on-read" architectures and Big Data processing frameworks provide a high-performance, scalable base.
DATA SHAPING: Data prep is the most time-consuming aspect of an analytics project. Modern tools make it easy to democratize this workflow and keep it scalable and integrated with data governance systems.
VISUAL ANALYTICS: Visual analytics is key in exposing patterns, relationships, and outliers in the data to users, because data is only useful to the extent that it can be successfully interpreted and analyzed.
ADVANCED ANALYTICS: Apply artificial intelligence, machine learning, predictive, prescriptive and geospatial capabilities to create meaningful insights that drive additional value.
Zero2Hero (z2H) Stack Summary
[Table: the four stack functions (STORAGE & PROCESSING, DATA SHAPING, VISUAL ANALYTICS, ADVANCED ANALYTICS) and their descriptions, repeated from the stack summary, with added columns: Bardess Accelerator, Partner, Value Creation]
Accelerator and value creation entries:
• Cluster prerequisites installed
• Metadata cataloged and indexed
• Data lineage reporting
• Workload management
• Hadoop explorer
• Integration with external analytics engines
• Integrated model training
• Model performance monitoring
• Real-time predictions
Zero2Hero Stack Solution
A pre-built data processing and analysis stack of exceptional tools and Bardess accelerators, preloaded with relevant industry data and designed to solve modern-scale problems and deliver rapid value.
Demo
Zero2Hero™ Live in Action
Daniel Parton – PhD – Lead Data Scientist – Bardess Group
Demo
Zero2Hero™ Live in Action
David Freriks – Emerging Technology Evangelist – Qlik
SDX – TRULY SHARED, BEYOND THE PLATFORM
DATA-DRIVEN JOURNEY
USE CASES
VISIBILITY
• Preventive & Proactive Maintenance
• IoT Hub for Industry 4.0
• Advanced Threat Detection
• Risk Modelling & Analysis
• Marketing Systems Integration
• Customer 360 Insights
• Exploratory Data Science
• Data Warehouse
• Applied Machine Learning
GROW: Sales & Marketing
CONNECT: Operations & Product
PROTECT: Security & Compliance
MODERNIZE: IT, Tech, Data Science & Analytics
POSITIVE BUSINESS OUTCOMES
• Increased business insights: diverse data together with analytics flexibility
• Increased business agility: modern and nimble application innovation
• Increased governance capability: one common viewpoint and store
• Decreased operational costs: one environment for all needs
• Decreased staff overhead: one set of controls for everything
• Decreased security risks: comprehensive controls everywhere
EVEN BETTER WITH SDX
Cloudera and partners
• Tight integration
• Differentiated, repeatable solutions
For customer benefits
• Complete, use-case-specific solutions
• Delivered at lower risk
• For faster time to value
THANK YOU
