Meetup Roadmap Cloudera 2020 - January
Thiago Santiago
What is coming in the new Cloudera data platform
1.
MeetUp ROADMAP CLOUDERA 2020
What is coming in the new Cloudera data platform
linkedin.com/company/cloudera/ | twitter.com/cloudera | facebook.com/cloudera | instagram.com/cloudera
#MeetUpClouderaBrasil
2.
© 2020 Cloudera, Inc. All rights reserved.
Thiago Santiago
Solution Engineering
linkedin.com/in/thiagosantiago/
thiago@cloudera.com
3.
Agenda
• Why are we here today?
• CDP Cloud
• CDP Data Center
• CDF for CDP
• CML for CDP
• Data Driven Journey and Use Cases
4.
Why are you here now?
You could be...
...Netflix? ...Soap opera? ...Soccer?
5.
Why are you here now?
...Make 2020 your BigData Year!
6.
Why do you want BigData?
7.
BigData Salaries? (*per year)
• BigData Architect: $138,918
• Data Scientist: $122,306
• Data Warehouse Architect: $137,054
• Senior Software Engineer: $113,222
Source: https://www.indeed.com/salaries/Big-Data-Salaries
8.
Technology Trends? Shifting the Data Paradigm
• Artificial Intelligence
• Internet of Things
• Cloud Computing
• Streaming Data
Industrial Internet, Connected Business, Consumer Devices, Smart Devices, Autonomy, Prescriptive Analytics, SaaS/PaaS Applications, Ephemeral Use Cases, Operational Efficiency, Collaboration, Real-time Applications, Targeted Retail Recommendations, Industrial Applications
9.
The Way BigData is Changing the World
Security
Big data is heavily used by law enforcement, particularly by national organisations such as the NSA. These organisations have access to vast amounts of data, which they use to catch criminals and foil terrorist plots.
• Surveillance video from currently deployed Border Patrol assets such as fixed and mobile towers, imaging unattended ground sensors (UGS), and unmanned air systems (drones)
• Analytical techniques applied to detect anomalies and/or trends leading to actionable intelligence
• Machine learning used to help with predictive and proactive deployment of resources
10.
The Way BigData is Changing the World
Decline in Corruption
Better monitoring of assets through foolproof big data analytics will help governments to track economies and facilitate a better allocation of those assets among everyone in society. It also tackles unwieldy bureaucracy, misinformation and other obstacles to transparent economics.
11.
The Way BigData is Changing the World
Environmental Health
Increased carbon emissions, greenhouse gases, global warming and other climate changes can be better monitored and fought with the help of Big Data. The easiest example is wearable devices connected through the internet, which provide awareness of local environmental challenges and the means to stand against them.
12.
The Way BigData is Changing the World
Fighting Poverty
By taking data from developing nations, non-profit organizations are able to find areas where people can benefit the most from having access to better education, financial services, developed infrastructure, and health services. Having this information on hand can aid in efforts to get help to areas that are struck by natural disasters or health catastrophes. Big data may also help developing nations fight government corruption, which can cause extreme levels of poverty and impede relief efforts. Similarly, access to large amounts of information from various sources can help organizations identify and react better to health epidemics, natural disasters (earthquakes, cyclones, etc.) and agriculture-related trends (drought, famine, etc.).
13.
The Way BigData is Changing the World
HealthCare
Big data analytics is accelerating the speed at which researchers can work. For example, DNA strings can now be decoded in minutes, which can lead to the faster creation of cures and the ability to predict disease patterns. Big data is being used to monitor premature and sick babies in some specialist units, with the techniques allowing doctors to analyse every heartbeat and breathing pattern. This has led to algorithms that can predict infections 24 hours before any physical symptoms occur.
14.
The Way BigData is Changing the World
Science
CERN's Large Hadron Collider produces astronomical amounts of data in experiments designed to unlock the secrets of the universe. Substantial processing power is necessary to analyse the 30 petabytes of data that the collider produces annually. Big data is also aiding space exploration: the Square Kilometre Array generates 700 terabytes of data a second, and the NASA Exchange Platform will help to manage the data. The technology, which can detect radar on a planet 50 light years away, could eventually help to discover life on another planet.
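To put the two figures quoted on this slide side by side, the following is a small back-of-the-envelope sketch. It assumes decimal (SI) units, which the slide does not specify, and the variable names are illustrative only.

```python
# Back-of-the-envelope comparison of the two data rates quoted on the slide.
# ASSUMPTION: decimal (SI) units; the slide does not say which convention it uses.
TB = 10**12
PB = 10**15
SECONDS_PER_YEAR = 365 * 24 * 3600

lhc_bytes_per_year = 30 * PB       # "30 petabytes ... annually" (from the slide)
ska_bytes_per_second = 700 * TB    # "700 terabytes of data a second" (from the slide)

# Average sustained rate if the LHC volume were spread evenly over a year.
lhc_avg_bytes_per_second = lhc_bytes_per_year / SECONDS_PER_YEAR

# Time the SKA would need, at the quoted rate, to produce the LHC's annual volume.
seconds_for_ska_to_match_lhc_year = lhc_bytes_per_year / ska_bytes_per_second

if __name__ == "__main__":
    print(f"LHC average rate: {lhc_avg_bytes_per_second / 10**9:.2f} GB/s")
    print(f"SKA time to produce 30 PB: {seconds_for_ska_to_match_lhc_year:.1f} s")
```

At the quoted rates, the array would produce the collider's annual volume in under a minute, which is the scale gap the slide is pointing at.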
16.
We believe data can make what was impossible yesterday, possible today.
19.
THE ENTERPRISE DATA CLOUD COMPANY
• We believe that data can make what was impossible yesterday, possible today
• We empower people to transform complex data into clear and actionable insights
• We deliver an enterprise data cloud for any data, anywhere, from the Edge to AI
20.
SNAPSHOT OF THE "NEW" CLOUDERA
• Countries: 85
• Employees: 3,000+
• Customers: 2,000+
21.
LEADING IN TOP INDUSTRIES
• BANKING: 8/10 top global
• TELCO: 10/10 top global
• PHARMA: 9/10 top global
• PUBLIC: 40+ government customers
• TECHNOLOGY: 8/10 top global
• AUTOMOTIVE: 10/10 top global
22.
WORLD-CLASS TRAINING, SERVICES & SUPPORT
• PROFESSIONAL SERVICES: Fastest route from zero to production
• CLOUDERA SUPPORT: SCP-certified support anywhere in the world
• CLOUDERA UNIVERSITY: 3 top big data certifications
23.
ENTERPRISE DATA CLOUD ARCHITECTURE
• Multi-function analytics
• Hybrid and multi-cloud
• Secure and governed
• Open platform
[Diagram: IOT, INGEST & STREAMING | DATA WAREHOUSING | SECURITY & GOVERNANCE | ML / AI DATA SCIENCE | PUBLIC CLOUDS compute & storage | DATACENTER compute & storage]
24.
THE ENTERPRISE DATA CLOUD COMPANY
Any Cloud | Multi-Function | Open | Secure & Governed
25.
CLOUDERA DATA PLATFORM
• Public, private & hybrid clouds
• Shared data experience
• Powered by open source
• Analytics from the Edge to AI
• Unified data control plane
[Diagram]
Analytic experiences: Data Flow & Streaming | Data Engineering | Data Warehouse | Operational Database | Machine Learning
Control plane: Identity | Orchestration | Management | Operations
Management Console
Data Hub & Cloudera Runtime
Data anywhere: Catalog | Schema | Migration | Security | Governance
Any Infrastructure: Edge | Public Multi-Cloud | Hybrid Cloud | Private Cloud
26.
CLOUDERA DATA PLATFORM - FORM FACTORS
Data Center, Public cloud, Private cloud, hybrid
CDP Public Cloud (Public Multi-Cloud)
• Control plane
• DW, ML, OD, DE, DF
• Data Hub & Cloudera Runtime
• SDX: security, governance & metadata
• Storage, Compute
• Edge to AI
CDP Private Cloud (Private)
• Control plane
• DW, ML, OD, DE, DF
• Data Hub & Cloudera Runtime
• SDX: security, governance & metadata
• Datacenter Storage, Container Cloud
• Edge to AI
CDP Data Center
• DW, DS/ML, DF, OpDB, DE
• Control plane
• SDX: security, governance & metadata
• Storage & Compute
Also shown: SDX: security, governance & metadata | Control plane
27.
CDP Data Center
EDH: Cloudera Enterprise Data Hub, The Most Comprehensive Data Analytics Platform
+ + New Features = CDP Data Center
28.
CDP Public Cloud
29.
KEY CONCEPTS & COMPONENTS: ENVIRONMENTS
Environment
• 1 Template
• 1 Region
• 1 VPC
• Multiple Roles/Buckets
1:1
Data Lake
• SDX: Atlas, Ranger, Knox, IdBroker, CM
• Associated with groups/users
1:N
Data Hub Clusters / Experiences
• DH templates
• ML Env
• DW Database Catalogs/Virtual Compute
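The cardinalities on this slide can be read as a small data model: one environment owns exactly one data lake and any number of Data Hub clusters or experiences. The sketch below illustrates that reading only. The class names, field names and the `kind` labels are hypothetical and are not part of any CDP API.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataLake:
    # Shared SDX services named on the slide.
    services: List[str] = field(
        default_factory=lambda: ["Atlas", "Ranger", "Knox", "IdBroker", "CM"]
    )


@dataclass
class Workload:
    # A Data Hub cluster or an experience; `kind` values are hypothetical labels.
    name: str
    kind: str  # e.g. "data_hub", "ml", "dw"


@dataclass
class Environment:
    name: str
    region: str                    # 1 region per environment
    vpc: str                       # 1 VPC per environment
    data_lake: DataLake            # 1:1 with the environment
    workloads: List[Workload] = field(default_factory=list)  # 1:N

    def add_workload(self, name: str, kind: str) -> Workload:
        """Attach another cluster/experience to this environment's data lake."""
        workload = Workload(name=name, kind=kind)
        self.workloads.append(workload)
        return workload


if __name__ == "__main__":
    env = Environment("demo", "region-1", "vpc-1", DataLake())
    env.add_workload("etl-team-cluster", "data_hub")
    env.add_workload("ml-workspace", "ml")
    print(len(env.workloads), "workloads share", env.data_lake.services)
```

Every workload added to the environment sees the same `data_lake`, which mirrors the slide's point that security and governance services are shared rather than duplicated per cluster.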
30.
KEY CONCEPTS & COMPONENTS: Typical user flow
Step 1: User connects to CDP with their enterprise identity
Step 2: They create an environment and data lake for their enterprise
Step 3: They create data hub clusters for traditional workloads
Step 4: They create access points for containerized analytic experiences
[Diagram: Enterprise IT | CDP Control Plane (Management Console) | Enterprise Cloud Resources (IAM, Network, VMs, Buckets, etc.) | Environment | Data Lake (Atlas, Ranger, Knox, IdBroker, FreeIPA, CM, HMS) | BI Team Cluster, ETL Team Cluster (Node 1, Node 2, Node 3) | Data Warehouse Experience, Machine Learning Experience]
31.
CONSISTENT SECURITY AND GOVERNANCE
Built for multi-functional analytics anywhere
• Data Catalog: a comprehensive catalog of all data sets, spanning on-premises, cloud object stores, structured, unstructured, and semi-structured
• Schema: automatic capture and storage of any and all schema and metadata definitions as they are used and created by platform workloads
• Replication: deliver data, as well as data policies, wherever the enterprise needs to work, with complete consistency and security
• Security: role-based access control applied consistently across the platform. Includes full stack encryption and key management
• Governance: enterprise-grade auditing, lineage, and governance capabilities applied across the platform with rich extensibility for partner integrations
32.
CDP HOME
A single login to access the full platform, documentation, and support - all controlled through corporate SSO
33.
MANAGEMENT CONSOLE
A single pane of glass to manage 100s of clusters all with different lifecycles - across multiple environments
34.
DATA LAKE
What is a Data Lake?
A common set of Services (SDX) within an Environment that are shared across multiple Clusters/Experiences. These include Services for:
• Security
• Auditing
• Governance
• Data Discovery
35.
DATA HUB CLUSTERS AND EXPERIENCES
What are the consumption options?
A Data Hub Cluster is a customizable environment that runs like a traditional Hadoop cluster, but is designed to leverage Cloud Storage.
An Experience is a container-based compute environment for specific purposes: ML, DW, DE, OD, DF
36.
DATA HUB
A familiar and highly customizable cluster service optimized for the separation of storage and compute
37.
DATA WAREHOUSE
A data warehousing service optimized for concurrency, caching, and isolation
38.
DATA CATALOG
A centralized data stewardship tool for searching, organizing, securing, and governing data across environments
39.
WORKLOAD MANAGER
A centralized management tool for analyzing and optimizing workloads within and across environments
40.
REPLICATION MANAGER
A centralized management tool for replicating and migrating data, metadata, and policies between environments
41.
MACHINE LEARNING
A machine learning workspace service to connect teams of data scientists to enterprise data
42.
Tour CDP Public Cloud
https://console.cdp.cloudera.com/#/
43.
CDP Data Center
44.
New Features for everyone...

New features for CDH 6 customers
• Ranger 2.0
  - Dynamic row filtering & column masking
  - Attribute-based access control
  - SparkSQL fine-grained access control
• Atlas 2.0
  - Advanced data discovery
  - Improved performance and scalability
• Hive 3
  - Hive-on-Tez for better ETL performance
  - ACID transactions
• Ozone (Preview)
  - 10x scalability of HDFS
• Knox
  - Gateway-based SSO
• Druid
  - Low-latency DataMart for real-time and aggregate data
• Spark on Docker
  - Simplified dependency management

New features for HDP 3 customers
• Cloudera Manager
  - Virtual private clusters
  - Automated wire encryption setup
  - Fine-grained RBAC for administrators
  - Streamlined maintenance workflows
• Atlas 2.0
  - Advanced data lineage
  - Faceted search
• Solr 7
  - Relevance-based text search over unstructured data (text, pdf, .jpg, ...)
• Impala
  - Better fit for Data Mart migration use cases (interactive, BI style queries)
• Hue
  - Built-in SQL editor
• Kudu
  - Better performance for fast changing / updateable data
• Better at-rest Encryption
  - Key Trustee Server, NavEncrypt
45.
What's in the box?
CDP Data Center 7.0 (2H 2019)
Coming soon...
• Cloudera Manager 7.0 • Hadoop 3.1 • Spark 2.4 • Hive 3.1 • Impala 3.2 • Oozie 5.1 • Hue 4.3 • Ranger 2.0 • Atlas 2.0 • Solr 7.4 • Tez 0.9 • HBase 2.2 • Phoenix 5.0 • Kudu 1.11 • Sqoop 1.4.7 • Parquet 1.10 • Avro 1.8 • ORC 1.5 • Zookeeper 3.5 • Kafka 2.3 • Key Trustee Server • Ozone (Tech Preview) • LLAP • Livy • Druid • Ranger KMS • Key HSM • Navigator Encrypt • Zeppelin • Knox • Accumulo
46.
Foundation for Containerized Applications
[Diagram: upgrade paths. CDH 5 / HDP 2 Cluster (existing apps, existing data, existing hardware) and CDH 6 / HDP 3 Cluster (existing apps, existing data, existing hardware) lead via "Upgrade" and "Direct Upgrade" to a CDP Data Center Cluster (existing apps, SDX, storage), and from there to CDP Private Cloud (Management Console, container cloud, Data Hub, DW, ML, more). Labels: "Latest upstream features", "Best of CDH and HDP features".]
CDP Data Center provides the stateful elements for the new wave of containerized applications:
• Storage
• Table Schema
• Authentication & Authorization
• Governance
Plan your path to CDP-DC now, expand to new experiences this year
47.
New for CDH Customers
48.
Ranger Authorization
• Standard CDP authorization model across services
  - Replaces Sentry
• Better fine-grained access controls
  - Dynamic Row Filtering
  - Dynamic Column Masking
  - Attribute-based Access Control
  - SparkSQL fine-grained access control
• Rich policy features
  - Allow/Deny constructs, custom policy conditions/context enrichers, time-bound policies, Atlas integration (for tag-based policies)
• Extensive Access Auditing with rich event metadata
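To make the "Dynamic Row Filtering" and "Dynamic Column Masking" bullets concrete, here is a minimal Python sketch of the idea: the same query result is narrowed and masked per policy at read time, without changing the stored data. This is a toy model only. `ToyPolicy`, `apply_policy`, and `mask_show_last_4` are hypothetical names invented for illustration and are not part of Apache Ranger's API or policy model.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Row = Dict[str, object]


def allow_all(row: Row) -> bool:
    """Default row filter: every row is visible."""
    return True


@dataclass
class ToyPolicy:
    """Hypothetical stand-in for an access policy (not Ranger's real model)."""
    row_filter: Callable[[Row], bool] = allow_all
    column_masks: Dict[str, Callable[[object], object]] = field(default_factory=dict)


def mask_show_last_4(value: object) -> str:
    """Replace all but the last four characters with 'x'."""
    text = str(value)
    return "x" * max(len(text) - 4, 0) + text[-4:]


def apply_policy(rows: List[Row], policy: ToyPolicy) -> List[Row]:
    """Return the rows a user may see, with masked columns, leaving input untouched."""
    visible = []
    for row in rows:
        if not policy.row_filter(row):
            continue  # row filtering: drop rows the policy hides
        masked = dict(row)  # copy so the stored data is never modified
        for column, mask in policy.column_masks.items():
            if column in masked:
                masked[column] = mask(masked[column])  # column masking
        visible.append(masked)
    return visible


customers = [
    {"name": "Ana", "region": "EU", "card": "4111111111111111"},
    {"name": "Bob", "region": "US", "card": "5500005555555559"},
]

# Assumed example policy: an EU analyst sees only EU rows and masked card numbers.
eu_analyst = ToyPolicy(
    row_filter=lambda row: row["region"] == "EU",
    column_masks={"card": mask_show_last_4},
)

result = apply_policy(customers, eu_analyst)
print(result)
```

In a real deployment the filter and mask are defined as policies in Ranger and enforced by the service plugin, so applications issue ordinary queries and never call anything like `apply_policy` themselves.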
49.
New in Ranger
New for both Cloudera and Hortonworks customers
• Ranger AuthZ for Impala, HMS, Solr (doc level), Ozone (TP)
• Security Zones
• RBAC in Ranger
50.
Apache Ranger - Impala Support
- Single policy store for Hive and Impala to enable consistent policy authoring
- Independent AuthZ plugin to enforce policies locally
- Resource and tag based policies supported
- Masking/Row filtering on roadmap for Impala
51.
Apache Ranger - Security Zones
- Resource isolation (especially for multi-tenancy)
- Policy administration isolation
- Cross-service logical grouping
52.
Apache Ranger - Roles
53.
Apache Ranger Roadmap
• AuthZ integration with more services
  - Kudu, NiFi Registry, Schema Registry, etc.
• Incremental policy/tag downloads
• Ranger audit extensions
• REST-based AuthZ server
• RangerKMS-KeyTrustee integration
• Row filtering capability extension (to HBase etc.)
• Ranger AuthZ for Ranger
• Supporting multiple versions of plugins
54.
Apache Atlas
• Metadata catalog & search
• Lineage & chain of custody
• Business glossary
• Metadata audits & security
55.
Apache Atlas: Overview
• A catalog for metadata of enterprise assets
• Large number of integrations to gather metadata and lineage
56.
Apache Atlas: Overview (cont.)
• Rich, dynamic type system makes it easy to onboard new components
• APIs to define types: entity, classification, struct, relationship, enum
57.
Apache Atlas: Metadata - Hive Column
58.
Apache Atlas: Lineage - Hive Table
- Propagation of tags
- Filter and search
- Export lineage
59.
Apache Atlas: Search
60.
Apache Atlas: What's New in CDP-DC?
• Impala and HMS new hooks
• Spark-Atlas connector
• Lineage improvements
• Runtime stats
• Optimized search
• Improvements to address Navigator metadata import
61.
HIVE 3 FOR DATA WAREHOUSING IN CDP-DC - OVERVIEW
• Comprehensive ANSI SQL 2016 coverage
  - Use cases: pre-built reports, more efficient SQL constructs, BI tool compatibility
  - Capabilities:
    - Implements 120/163 SQL 2016 mandatory features and > 70 optional features
    - Runs all 99 TPC-DS queries without modifications
    - Additional SQL-friendly capabilities, e.g. surrogate keys, information_schema, …
• ACID support: transactions and INSERT/UPDATE/DELETE/MERGE
  - Use cases: delete individual rows (GDPR), data cleansing/correction, merge for CDC data, ...
  - Capabilities:
    - SQL 2011 compliant, transactional (snapshot isolation), set-based insert/update/delete
    - Managed tables (ACID default) on ORC; external tables (non-ACID) on ORC/Parquet
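The "merge for CDC data" use case can be illustrated with a small Python model of MERGE semantics: each change record either updates a matching row, deletes it, or is inserted when no match exists. This is a sketch of the semantics only, not Hive code. The function `merge_changes`, the `(op, key, values)` record shape, and the sample data are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Table = Dict[int, Dict[str, str]]        # primary key -> row
Change = Tuple[str, int, Dict[str, str]]  # (op, key, values); op "D" means delete


def merge_changes(target: Table, changes: List[Change]) -> Table:
    """Apply change records to a copy of the target, MERGE-style."""
    merged = {key: dict(row) for key, row in target.items()}
    for op, key, values in changes:
        if op == "D":
            merged.pop(key, None)        # matched and flagged as delete: remove row
        elif key in merged:
            merged[key].update(values)   # matched: update the listed columns
        else:
            merged[key] = dict(values)   # not matched: insert a new row
    return merged


target = {
    1: {"name": "Ana", "city": "Lisbon"},
    2: {"name": "Bob", "city": "Austin"},
}
changes = [
    ("U", 1, {"city": "Porto"}),                 # update existing row
    ("D", 2, {}),                                # delete existing row
    ("I", 3, {"name": "Cy", "city": "Recife"}),  # insert new row
]

merged = merge_changes(target, changes)
print(merged)
```

Unlike this toy, which applies changes one at a time in order, a SQL MERGE is a single set-based statement that runs as one transaction, so readers see either the table before the merge or after it.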
62.
New for HDP Customers
63.
IMPALA AND KUDU FOR DATA WAREHOUSING IN CDP-DC
• Apache Impala: leading MPP SQL engine for DW, optimized for Parquet/Kudu
  - Ideal for: Data Mart implementations that require interactive/ad-hoc BI
  - 1000+ enterprise customers, many running on 10s of PBs and 100s of nodes
  - Certified with leading BI tools, with broad SQL coverage
  - Latest release adds improvements for resiliency, concurrency, and metadata
• Apache Kudu: leading columnar storage engine for fast analytics on fast data
  - Ideal for: low-latency time series data ingest and analytics (with the Impala SQL engine)
  - Combines fast single-row ingest like HBase with large scans like HDFS
  - ACID (insert/update/delete) semantics for single rows
64.
HUE FOR DATA WAREHOUSING IN CDP-DC
• Apache Hue: leading SQL workbench for ad-hoc BI
  - Ideal for: ad-hoc queries/exploration on Data Marts/HDFS files using Impala and/or Hive
  - Very high adoption rate across Hadoop landscapes, with thousands of active users
• Key features:
  - SQL editor: autocomplete, query history, query plans
  - File browser: object stores (S3, ADLS), HDFS
  - Document handling: sharing, downloading, importing, exporting
  - Load balancing for large-scale deployments with hundreds of concurrent users
65.
Cloudera Manager 7 - What's new for HDP Users
• Single pane of glass
  - Multiple clusters! (up to 3,000 nodes total)
  - "Compute" clusters & "Base" clusters ("VPCs")
• Security
  - Automated wire encryption (TLS 1.2)
  - HDFS encryption-at-rest wizard (KTS/KMS)
  - Fine-grained access control for admins
• Ease of administration
  - Global configuration search / config "diff" before restart
  - Edge/"gateway" node configuration
  - Proper rolling restart (HA-sensitive)
  - View of YARN/Impala workloads
• Performance
  - BitTorrent-based distribution of binaries
66.
Cloudera Manager 7 - What's new for Everyone!
• Management of new services
  - Ranger, Atlas, Hive-on-Tez, DAS
• CDP look-and-feel
• Cluster-level configuration history
• Improved global search
• Resume errors in enabling Kerberos
• Minor scalability improvements (hosts page)
• Improved alerts configuration
• jQuery 3.4 (improved security)
67.
YARN
68.
Capacity Scheduler & Queue Manager UI
• Capacity Scheduler is now the default scheduler in YARN
  - GPU support
  - Node Labels
  - Global scheduling support
  - Better placement support
• A new Queue management UI experience for better usability
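As background for the queue configuration that the Capacity Scheduler slides refer to: queues form a hierarchy, and each queue's capacity is expressed as a percentage of its parent. The sketch below computes the resulting share of the whole cluster for each queue. The queue names and percentages are made-up examples, and `absolute_capacities` is a hypothetical helper, not a YARN API.

```python
from typing import Dict


def absolute_capacities(relative: Dict[str, float]) -> Dict[str, float]:
    """Map each queue path to its percentage of the whole cluster.

    `relative` maps a dotted queue path (e.g. "root.dev.etl") to that queue's
    capacity as a percentage of its parent queue.
    """
    result = {}
    for path in relative:
        parts = path.split(".")
        share = 100.0
        # Multiply the percentages of every ancestor down to the queue itself.
        for depth in range(1, len(parts) + 1):
            prefix = ".".join(parts[:depth])
            share *= relative[prefix] / 100.0
        result[path] = share
    return result


# Assumed example hierarchy: sibling capacities add up to 100 under each parent.
queues = {
    "root": 100.0,
    "root.prod": 70.0,
    "root.dev": 30.0,
    "root.dev.etl": 50.0,
    "root.dev.adhoc": 50.0,
}

caps = absolute_capacities(queues)
for path, percent in caps.items():
    print(f"{path}: {percent:.1f}% of cluster")
```

Here `root.dev.etl` ends up with 15% of the cluster (50% of the 30% given to `root.dev`), which is the kind of relationship a queue management UI has to make visible when an administrator edits one level of the tree.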
69.
Capacity Scheduler & Queue Manager UI
• New Queue Manager UI in CM to configure resources and queues
[Screenshot: list of all queues in cluster]
70.
Spark Dependency Management
• Simplify dependency management with Spark-on-Docker support
• No need to install dependencies on individual cluster hosts
Enable Docker on YARN with a click from CM for Spark workloads
71.
CDF for CDP-DC
72.
Cloudera DataFlow (CDF) Platform - When will it be Supported on CDP-DC?
Deployment spectrum: CDP DataHub, CDP DataCenter, CDP DataFlow Service, CDH On-Premise
73.
CFM 2.0 Highlights & Platform Integration
• Based on Apache NiFi 1.10
  - Allows parameterization of all processor properties
  - Support for "public" ports (accessible to remote site-to-site clients) for any processor
  - Queue length and time to backpressure are now predicted
• First release to include K8s Operator (Tech Preview)
  - Allows customers to try out NiFi clusters on Kubernetes
  - Operator takes care of NiFi cluster installation, configuration, and scaling
  - OpenShift certified (pending)
  - Goal is to gather feedback from customers about requirements
• First release to include Stateless NiFi Runtime
  - New NiFi runtime
  - Flow files stored in memory, not persisted on disk
  - Data durability provided by source/target systems
  - Allows for abstraction of "jobs"
  - Allows for flows to be "triggered"
CFM 2.0 will be available as an add-on to CDP-DC
Available post GA - Target Q4
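The "time to backpressure are now predicted" bullet can be pictured as an extrapolation problem: given recent observations of a queue's size, estimate when it will reach its backpressure threshold. The sketch below uses an ordinary least-squares line through the samples. It is an assumption-laden illustration of the idea, not NiFi's implementation, and `time_to_backpressure` and the sample values are hypothetical.

```python
from typing import List, Optional, Tuple


def time_to_backpressure(
    samples: List[Tuple[float, float]], threshold: float
) -> Optional[float]:
    """Estimate seconds from the last sample until the queue hits `threshold`.

    `samples` is a list of (timestamp_seconds, queued_count) observations.
    Returns None when the queue is not growing or there is too little data.
    """
    n = len(samples)
    if n < 2:
        return None
    mean_t = sum(t for t, _ in samples) / n
    mean_q = sum(q for _, q in samples) / n
    var_t = sum((t - mean_t) ** 2 for t, _ in samples)
    if var_t == 0:
        return None  # all samples share one timestamp; no trend can be fitted
    # Least-squares slope and intercept of queued_count as a function of time.
    slope = sum((t - mean_t) * (q - mean_q) for t, q in samples) / var_t
    if slope <= 0:
        return None  # queue is flat or draining; threshold is never reached
    intercept = mean_q - slope * mean_t
    hit_time = (threshold - intercept) / slope
    last_time = samples[-1][0]
    return max(hit_time - last_time, 0.0)


# Assumed observations: the queue grows by 1,000 flow files per minute.
observed = [(0.0, 1000.0), (60.0, 2000.0), (120.0, 3000.0)]
eta = time_to_backpressure(observed, threshold=10000.0)
print(eta)
```

A linear fit is the simplest possible model; it works for steadily growing queues and is misleading for bursty traffic, which is why a prediction like this is best treated as an early warning rather than a guarantee.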
74.
Flume to CEM / CFM migration
Flume Offload Sales Play
Yes, Flume is really gone.
Opportunity
- We now have a powerful data distribution / ingest tool in our stack
- Door opener for new analytics use cases
  - Flexible data movement architecture
  - Foundation for real-time stream processing
Flume Use Case Migration
- Identify your customer's use cases
- Most common use cases:
  - Hadoop ingest (HDFS, HBase)
  - "Flafka" (read/write Kafka)
  - HTTP, file sources/sinks
- Flume used as agent -> CEM
- Flume used for central ingest -> CFM
Questions? Need help for a specific customer use case?
- We will host a deep-dive Flume Offload enablement
- Check out Flume Offload collateral (decks, example flows, migration strategy)
- Reach out to dim-field@cloudera.com, mkohs@cloudera.com, fce_streaming / fce_nifi
75.
Kafka 2.3 Available in CDP-DC 7.0
Secure and Governed Kafka Clusters with New Ranger & Atlas Integration
What's New?
• Kafka 2.3 available in CR 7.0 Parcel
• Kafka / Ranger integration
• Kafka / Atlas integration
• Support Hive 3.X / Kafka Storage Handler
• Support LDAP Base Auth
• Support multiple Kafka compute clusters using shared Security Data Lake with Ranger & Atlas
[Diagram: Shared Security Context from Data Lake consisting of Ranger and Atlas; Kafka Compute Cluster using Shared Security Context]
76.
Kafka Management Services Support on CDP-DC
SR, SMM & SRM available as add-on to CDP-DC
Available post GA - Target Q4
• Schema Registry: new Kafka schema governance
• Streams Replication Manager (SRM): new Kafka replication engine powered by MirrorMaker2
• Streams Messaging Manager (SMM): new Kafka monitoring service
77.
New Flink Support on CDP-DC
Flink YARN support available as add-on to CDP-DC
Available post GA - Target Q4
Why Flink
• Next-gen streaming engine that offers a superior solution to Storm
• Flink runs as a YARN app
• Key features
  - Ultra-low latency (< 100 ms)
  - Advanced features (late-arriving data, checkpointing, event-time processing)
  - Exactly-once processing
  - Complex stateful stream processing
  - Growing / vibrant community
78.
CLOUDERA MACHINE LEARNING
83.
GETTING TO PRODUCTION
86.
Tour CML for CDP
https://console.cdp.cloudera.com/#/
87.
DATA-DRIVEN JOURNEY
88.
DATA-DRIVEN JOURNEY USE CASES
Stages: VISIBILITY, PRODUCTIVITY, TRANSFORMATION
Use cases: Preventive & Proactive Maintenance; IoT Hub for Industry 4.0; Advanced Threat Detection; Risk Modelling & Analysis; Marketing Systems Integration; Customer 360 Insights; Exploratory Data Science; Data Warehouse; Applied Machine Learning
Goals:
• GROW: Sales & Marketing
• CONNECT: Operations & Product
• PROTECT: Security & Compliance
• MODERNIZE: IT, Tech, Data Science & Analytics
89.
HIERARCHY OF NEEDS FOR THE DATA-DRIVEN ENTERPRISE
The "AI Ladder": AI, MACHINE LEARNING, DATA SCIENCE, ANALYTICS, "BIG DATA"
90.
Actionable Intelligence Powers Today's Financial Services
Data sources: OFAC Lists, Credit Records, ATM Streams, Transactions & Wires, Stock Tickers, Trade Settlements, Mobile App Data, Trade Data, Web Logs, Banker Notes, Demographic Data, Customer Transaction Data
Use cases: DIGITAL CUSTOMER 360, RISK DATA AGGREGATION, ANTI-MONEY LAUNDERING, FRAUD DETECTION, TRADE SURVEILLANCE
91.
Connected Data Drives Success in Telecommunications
Data sources: Call Detail Records, Product Catalogs, Cyber Threat Metadata, Sensor Data, Server Logs, Voice-to-Text, Clickstream, ERP System Data, Social Media, Billing Data, Subscriber Profiles, CRM Records
Use cases: SINGLE VIEW OF THE CUSTOMER, CHURN REDUCTION, CDR ANALYSIS, NETWORK OPTIMIZATION, DYNAMIC BANDWIDTH ALLOCATION
92.
Actionable Intelligence Drives Retail Sales Growth
Data sources: Product Catalogs, Sales Forecasts, Beacons & RFID, Server Logs, In-Store WiFi Logs, Store Communications, Clickstream, ERP Data, Social Media, Staffing Plans, Store Reporting, CRM Records
Use cases: SINGLE VIEW OF THE CUSTOMER, PRODUCT RECOMMENDATIONS, INVENTORY & SUPPLY CHAIN, PRICING OPTIMIZATION, TARGETED PROMOTIONS
93.
Actionable Intelligence Makes Healthcare Precise and Personal
Data sources: Patient Records, Lab Data, Pharmacy Data, Patient Locations, Wearables, Intra-Network Data, Sensor Data, Claims Data, Social Media, Physician Notes, Patient Satisfaction Data, Clinical (EMR) Data
Use cases: SINGLE VIEW OF PATIENT, REAL-TIME VITAL SIGN MONITORING, BILLING & REIMBURSEMENTS, EMR OPTIMIZATION, SUPPLY CHAIN OPTIMIZATION
94.
Actionable Intelligence Makes Pharmaceuticals Safe & Effective
Data sources: Research Cohort Data, Molecular Data, RFID Data, Social Media, Biometrics, Sensor Data, Supply Chain Geo-location Data, Scientific Studies, Manufacturing Machine Data, Clinical Records, Sales Reports, Genomic Data
Use cases: DRUG TRIAL COHORT SELECTION, YIELD OPTIMIZATION, RAW MATERIAL WASTE REDUCTION, SEARCHABLE RESEARCH REPOS, NEXT-GEN SEQUENCING (NGS)
95.
Actionable Intelligence Powers Modern Manufacturing
Data sources: Defect Testing Data, Product Designs, MES Systems, RFID Streams, SCADA Systems, Shop Floor Sensors, ERP Systems, Supplier Receipts, Machine Data, Assembly Line Sensors, Data Historians, Work Orders
Use cases: PREVENTATIVE MAINTENANCE, SUPPLY CHAIN OPTIMIZATION, YIELD MAXIMIZATION, QUALITY CONTROL, RECALL AVOIDANCE
96.
Actionable Intelligence Enhances Public Sector Efficiency
Data sources: Historical Archives, Cyber Threat Metadata, Vehicle Telemetry Data, Disease Outbreaks, Natural Disasters, Social Media, Work Orders, Meeting Notes, Voter Rolls, Public Benefits Claims, Financial Audits, Extreme Weather Alerts
Use cases: PUBLIC TRANSPORTATION, INFRASTRUCTURE MAINTENANCE, PUBLIC HEALTH, NATIONAL DEFENSE, HOMELAND SECURITY
97.
Why are you here now?
98.
THANK YOU!
Because This is Your BigData Year! 2020