SlideShare a Scribd company logo
Transforming Insurance
Analytics with Big Data and
Automated Machine Learning
A formula for higher ROI
Agenda
Mihaela Risca
Sr. Solutions Marketing Manager
Financial Services
Cloudera
Unlocking the Value of Insurance Data
Satadru Sengupta
Gen Mgr. Insurance
DataRobot
Automated Machine Learning – A Formula
for Higher ROI for Insurers
There are two different
alignments of these components
in the market:
• When data and analytics
capability are bundled with
capital, we have an insurance
company.
• When it is bundled with
demand, we have an advisor or
broker
Data is at the center of the Insurance market
Explosion of Data
Why Machine Learning?
• Analytics return $13 for every $1 invested (Nucleus Research)
• Only 12% of data is leveraged for analytics (Forrester)
What is Machine Learning?
Why Big Data + Machine Learning?
• Machine learning thrives on
growing data sets
• Bring disparate data
sources together
• Real time streaming
Machine Learning Use Cases in Insurance
Pricing
Customer Acquisition Underwriting
Marketing, customer
retention, prioritization.
Equating risk and price,
driving life-time value
(LTV)
Prevent Claim Fraud
Underwriting triage:
select the top 10% of the
available risk for further
analysis .
Identifying claims with
highest likelihood of being
fraudulent.
Poll the Audience
Where in your organization you see the most value for introducing
machine learning?
1. Customer acquisition and retention
2. Underwriting/Actuarial
3. Quoting/Claims management
4. Fraud detection and prevention
5. Other
Key Data Management Challenges for Insurers
Fragmented Systems
and Data Silos
Limited Access to
Right Data at the
Right Time
Strategic Decisions
Based on Subsets of
Data
Unable to Tap into
New Data Sources or
Correlate Data from
Multiple Sources
Simultaneously
Disparate View of
Customers, Markets
and Risks
Poor Data Quality
and Lack of
Governance
One Data Platform for Many Applications
Handle real-time
data ingest from
diverse sources
Governance and
Security
Data Streams
Deployment Flexibility
Machine Learning
Capabilities
Diverse Analytical
Options
Combine Data from Different Sources
Data Mgmt. Hub
Scale easily & Cost
effectively
Batch or Real- time
Data Streams
Data Sources
Data Sources
Data Storage &
Processing
Reporting, Analytics &
Auditing
Data Ingest
Other
Data Governance (Data Lineage, Data Protection)
Fitness Car Telematics
Applications
"New technology is transforming the
way we work, and it is allowing the
competition to do better than what we
can. The strange thing is we know the
urgency, and yet there is inertia."
Inga Beale, CEO of Lloyd's of London
February 2017
1. Technology
2. Consumer & Market Economics
3. Data Science & Machine Learning
… and they are interconnected.
Three Strategic Areas of Focus
Machine Learning Applications in Insurance
1. Risk Selection & Pricing
2. Claims, Fraud and Litigation Management
3. Operations and Expenses Management
“machine learning is the secret sauce for the product of
tomorrow.” Google, 2015
Profitable Growth & Managing Expenses
Becoming a 21st Century Insurance Company
Life Insurance Example 1
Underwriting Triage
• Predicted low risk to fast track
process
• Predicted high risk to traditional
underwriting for manual review
Business Impact
• Cost reduction through automation of
reviews of applicants
• Increased likelihood of acquisition
due to fast track underwriting
• Higher underwriting profitability by
targeting the review process on
underwriting loss avoidance
Specific examples from clients
• Predict the likelihood of an insured being in a preferred class or not – as
determined by risk factors such as smoking status, existing condition, terminal
disease
• Predict the most likely class among several classes
Predict mortality risks among patients in remission of cancer:
○ Simplify Underwriting Process: Patients with good health prospects don’t need to go
through a manual medical verification and avoid adverse selection
○ Reduce Costs of Claim by identifying high-risk patients and create more accurate
underwriting rules
ML model predicts patients with
a very high risk of mortality
● 5 times more risky than
average
● Around 10% of patients
Life Insurance Example 2
… InsurTech and Future of Insurance
Machine Learning Strategy: Where It Is Failing?
• A lack of data vision
• Hiring and retaining good data scientists is impossible
• Lack of Inclusiveness: Targeted end-users are not included in
the machine learning problem solving process.
HBR Article : “Stop searching for that elusive Data Scientist”
New Technology Opens Up New Possibilities To Executives
Artificial Intelligence & Automation
makes Machine Learning Affordable,
Pervasive and Inclusive
Poll the Audience
How do you primarily develop and deploy machine learning solutions
in your organization today?
1. Multiple, small data science teams
2. One, big enterprise data science team
3. Outsource to consulting
4. We use automated machine learning
5. We currently don’t use machine learning
Elements of Automated Machine Learning
Smart
● Accurate
● Appeal to experienced data scientists
● Control buttons are accessible to the users
Easy to Use
● Intuitive, fully automated workflow
● Needs minimum inputs but has guardrails
● Interpretable & transparent
● Deployment focused
A 10 min journey to Automated Machine
Learning (AML) using DataRobot Platform
can we predict which patient is coming back to
hospital within the first 30 days?
Demo
What capabilities for DataRobot on Cloudera?
HDFS ingest: DR can utilize data stored in HDFS directly
Hadoop Modeling: Train ML models on the Cloudera data nodes
directly
Hadoop scoring: Any model can then be deployed on Hadoop directly
Distributed (each node scores a data split)
Uses Spark
Cloudera/DataRobot Integration Details
DataRobot has the highest level of integration with Cloudera
Cloudera Parcels A few click to install DR in Cloudera
Manager!
Cloudera CSDs Can use all the functionalities of Cloudera
Manager (monitoring, resource mgmt…)
Kerberos / Sentry Secured authentication
YARN All the resources consumed by DataRobot
are managed by YARN
Spark DataRobot uses Spark for Hadoop scoring
Cloudera/DataRobot Integration Details
Apache Spark Ecosystem with Spark ML lib
Spark MLlib API is available in Scala, Java, and Python programming
languages
Training from Cloudera and DataRobot
● Introduction to Machine Learning - Cloudera Training
https://www.cloudera.com/more/training/courses/intro-machine-learning.html
● Data Science for Executives - DataRobot Training
https://www.datarobot.com/education/for-executives/
● Machine Learning with DataRobot - DataRobot Training
https://www.datarobot.com/education/for-business-analysts/
Learn More & Contact Us
https://www.cloudera.com/solutions/insurance.html
Cloudera
Follow us: @Cloudera
mihaela@cloudera.com
Taneja Group Spark Market Adoption Report : LINK
DataRobot Overview: LINK
https://www.datarobot.com/go/insurance/
Follow us: @DataRobot
satadru@datarobot.com
DataRobot
Executive Briefing: LINK
The Machine Learning Renaissance: LINK
Register for Wrangle Conference: July 20, San Francisco
http://wrangleconf.com/
Thank you
Appendices
Some screenshots
Cloudera - DataRobot Integration
DataRobot - Ease of Deployment on Cloudera
● Deployment
● Mgmt/Monitoring
The DataRobot Service on Cloudera
DataRobot – HDFS Ingest
Copyright © DataRobot, Inc. - All Rights Reserved
DataRobot Modeling on Hadoop
Storage
Application
DR Edge Node
… …
Worker 2
Worker 1
Worker 3
Hadoop Data Node 1
Hadoop Data Node 2
YARN
container
60GB
(Worker 2)
YARN
container
60GB
(Worker 3)
YARN
container
60GB
(Worker 1)
• YARN allocates memory on a data node when a worker wants to train a model
• Each model is trained in memory on an available data node
DataRobot – Cloudera “in-place” Scoring
DataRobot & Cloudera – Seamless LDAP Authentication

More Related Content

What's hot

Intro to SageMaker
Intro to SageMakerIntro to SageMaker
Intro to SageMaker
Soji Adeshina
 
Building a Modern Data Platform in the Cloud
Building a Modern Data Platform in the CloudBuilding a Modern Data Platform in the Cloud
Building a Modern Data Platform in the Cloud
Amazon Web Services
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Big Data analytics
Big Data analyticsBig Data analytics
Big Data analytics
ArunKumar5524
 
Microsoft Data Platform - What's included
Microsoft Data Platform - What's includedMicrosoft Data Platform - What's included
Microsoft Data Platform - What's included
James Serra
 
What is big data?
What is big data?What is big data?
What is big data?
David Wellman
 
Implementing a Data Lake
Implementing a Data LakeImplementing a Data Lake
Implementing a Data Lake
Amazon Web Services
 
Audrey Chia - Supercharge Your Growth.pdf
Audrey Chia - Supercharge Your Growth.pdfAudrey Chia - Supercharge Your Growth.pdf
Audrey Chia - Supercharge Your Growth.pdf
SOLTUIONSpeople, THINKubators, THINKathons
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
Amazon Web Services
 
Scaling and Modernizing Data Platform with Databricks
Scaling and Modernizing Data Platform with DatabricksScaling and Modernizing Data Platform with Databricks
Scaling and Modernizing Data Platform with Databricks
Databricks
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
Alex Ivy
 
Data Led Migration
Data Led Migration Data Led Migration
Data Led Migration
Sandy Carter
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
DATAVERSITY
 
Big data
Big dataBig data
Data Governance for Enterprises
Data Governance for EnterprisesData Governance for Enterprises
Data Governance for Enterprises
Chaitanya Avasarala
 
Ai in financial services
Ai in financial servicesAi in financial services
Ai in financial services
Seldon
 
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future VisionMLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
BATbern
 
Exploring Generative AI with GAN Models
Exploring Generative AI with GAN ModelsExploring Generative AI with GAN Models
Exploring Generative AI with GAN Models
KonfHubTechConferenc
 
Talking to your CEO about the Chief Data Officer Role
Talking to your CEO about the Chief Data Officer Role Talking to your CEO about the Chief Data Officer Role
Talking to your CEO about the Chief Data Officer Role
Craig Milroy
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Simplilearn
 

What's hot (20)

Intro to SageMaker
Intro to SageMakerIntro to SageMaker
Intro to SageMaker
 
Building a Modern Data Platform in the Cloud
Building a Modern Data Platform in the CloudBuilding a Modern Data Platform in the Cloud
Building a Modern Data Platform in the Cloud
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Big Data analytics
Big Data analyticsBig Data analytics
Big Data analytics
 
Microsoft Data Platform - What's included
Microsoft Data Platform - What's includedMicrosoft Data Platform - What's included
Microsoft Data Platform - What's included
 
What is big data?
What is big data?What is big data?
What is big data?
 
Implementing a Data Lake
Implementing a Data LakeImplementing a Data Lake
Implementing a Data Lake
 
Audrey Chia - Supercharge Your Growth.pdf
Audrey Chia - Supercharge Your Growth.pdfAudrey Chia - Supercharge Your Growth.pdf
Audrey Chia - Supercharge Your Growth.pdf
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
 
Scaling and Modernizing Data Platform with Databricks
Scaling and Modernizing Data Platform with DatabricksScaling and Modernizing Data Platform with Databricks
Scaling and Modernizing Data Platform with Databricks
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
 
Data Led Migration
Data Led Migration Data Led Migration
Data Led Migration
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Big data
Big dataBig data
Big data
 
Data Governance for Enterprises
Data Governance for EnterprisesData Governance for Enterprises
Data Governance for Enterprises
 
Ai in financial services
Ai in financial servicesAi in financial services
Ai in financial services
 
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future VisionMLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
 
Exploring Generative AI with GAN Models
Exploring Generative AI with GAN ModelsExploring Generative AI with GAN Models
Exploring Generative AI with GAN Models
 
Talking to your CEO about the Chief Data Officer Role
Talking to your CEO about the Chief Data Officer Role Talking to your CEO about the Chief Data Officer Role
Talking to your CEO about the Chief Data Officer Role
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
 

Similar to Transforming Insurance Analytics with Big Data and Automated Machine Learning


ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty ComputationISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
UlfMattsson7
 
New technologies for data protection
New technologies for data protectionNew technologies for data protection
New technologies for data protection
Ulf Mattsson
 
Advanced Analytics for Investment Firms and Machine Learning
Advanced Analytics for Investment Firms and Machine LearningAdvanced Analytics for Investment Firms and Machine Learning
Advanced Analytics for Investment Firms and Machine Learning
Cloudera, Inc.
 
How Insurers Can Tame Data to Drive Innovation
How Insurers Can Tame Data to Drive InnovationHow Insurers Can Tame Data to Drive Innovation
How Insurers Can Tame Data to Drive Innovation
Cognizant
 
Ai in insurance how to automate insurance claim processing with machine lear...
Ai in insurance  how to automate insurance claim processing with machine lear...Ai in insurance  how to automate insurance claim processing with machine lear...
Ai in insurance how to automate insurance claim processing with machine lear...
Skyl.ai
 
Business Intelligence, Data Analytics, and AI
Business Intelligence, Data Analytics, and AIBusiness Intelligence, Data Analytics, and AI
Business Intelligence, Data Analytics, and AI
Johnny Jepp
 
Accenture Insurance Data Capture
Accenture Insurance Data Capture Accenture Insurance Data Capture
Accenture Insurance Data Capture
Accenture Insurance
 
Modernizing Insurance Data to Drive Intelligent Decisions
Modernizing Insurance Data to Drive Intelligent DecisionsModernizing Insurance Data to Drive Intelligent Decisions
Modernizing Insurance Data to Drive Intelligent Decisions
Cognizant
 
Cloud-Based IoT Analytics and Machine Learning
Cloud-Based IoT Analytics and Machine LearningCloud-Based IoT Analytics and Machine Learning
Cloud-Based IoT Analytics and Machine Learning
SatyaKVivek
 
Data Leaders Summit Barcelona 2018
Data Leaders Summit Barcelona 2018Data Leaders Summit Barcelona 2018
Data Leaders Summit Barcelona 2018
Harvinder Atwal
 
Why machine learning is the best way to reduce fraud
Why machine learning is the best way to reduce fraud Why machine learning is the best way to reduce fraud
Why machine learning is the best way to reduce fraud
GlobalTechCouncil
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
Course5i
 
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
Jon Mead
 
Machine Learning In Insurance
Machine Learning In InsuranceMachine Learning In Insurance
Machine Learning In Insurance
Accenture Insurance
 
Machine Leaning Insurance
Machine Leaning InsuranceMachine Leaning Insurance
Machine Leaning Insurance
Federico Katsicas
 
Internet of things, Big Data and Analytics 101
Internet of things, Big Data and Analytics 101Internet of things, Big Data and Analytics 101
Internet of things, Big Data and Analytics 101
Mukul Krishna
 
eMStream
eMStreameMStream
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UK
Ulf Mattsson
 
Machine Learning in Banking
Machine Learning in BankingMachine Learning in Banking
Machine Learning in Banking
accenture
 
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
Skyl.ai
 

Similar to Transforming Insurance Analytics with Big Data and Automated Machine Learning
 (20)

ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty ComputationISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
 
New technologies for data protection
New technologies for data protectionNew technologies for data protection
New technologies for data protection
 
Advanced Analytics for Investment Firms and Machine Learning
Advanced Analytics for Investment Firms and Machine LearningAdvanced Analytics for Investment Firms and Machine Learning
Advanced Analytics for Investment Firms and Machine Learning
 
How Insurers Can Tame Data to Drive Innovation
How Insurers Can Tame Data to Drive InnovationHow Insurers Can Tame Data to Drive Innovation
How Insurers Can Tame Data to Drive Innovation
 
Ai in insurance how to automate insurance claim processing with machine lear...
Ai in insurance  how to automate insurance claim processing with machine lear...Ai in insurance  how to automate insurance claim processing with machine lear...
Ai in insurance how to automate insurance claim processing with machine lear...
 
Business Intelligence, Data Analytics, and AI
Business Intelligence, Data Analytics, and AIBusiness Intelligence, Data Analytics, and AI
Business Intelligence, Data Analytics, and AI
 
Accenture Insurance Data Capture
Accenture Insurance Data Capture Accenture Insurance Data Capture
Accenture Insurance Data Capture
 
Modernizing Insurance Data to Drive Intelligent Decisions
Modernizing Insurance Data to Drive Intelligent DecisionsModernizing Insurance Data to Drive Intelligent Decisions
Modernizing Insurance Data to Drive Intelligent Decisions
 
Cloud-Based IoT Analytics and Machine Learning
Cloud-Based IoT Analytics and Machine LearningCloud-Based IoT Analytics and Machine Learning
Cloud-Based IoT Analytics and Machine Learning
 
Data Leaders Summit Barcelona 2018
Data Leaders Summit Barcelona 2018Data Leaders Summit Barcelona 2018
Data Leaders Summit Barcelona 2018
 
Why machine learning is the best way to reduce fraud
Why machine learning is the best way to reduce fraud Why machine learning is the best way to reduce fraud
Why machine learning is the best way to reduce fraud
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
 
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
Machine Learning: Addressing the Disillusionment to Bring Actual Business Ben...
 
Machine Learning In Insurance
Machine Learning In InsuranceMachine Learning In Insurance
Machine Learning In Insurance
 
Machine Leaning Insurance
Machine Leaning InsuranceMachine Leaning Insurance
Machine Leaning Insurance
 
Internet of things, Big Data and Analytics 101
Internet of things, Big Data and Analytics 101Internet of things, Big Data and Analytics 101
Internet of things, Big Data and Analytics 101
 
eMStream
eMStreameMStream
eMStream
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UK
 
Machine Learning in Banking
Machine Learning in BankingMachine Learning in Banking
Machine Learning in Banking
 
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
AI in Insurance: How to Automate Insurance Claim Processing with Machine Lear...
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
Cloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
Cloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
Cloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Recently uploaded

SMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API ServiceSMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API Service
Yara Milbes
 
Microservice Teams - How the cloud changes the way we work
Microservice Teams - How the cloud changes the way we workMicroservice Teams - How the cloud changes the way we work
Microservice Teams - How the cloud changes the way we work
Sven Peters
 
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
Peter Muessig
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
Aftab Hussain
 
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit ParisNeo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j
 
Revolutionizing Visual Effects Mastering AI Face Swaps.pdf
Revolutionizing Visual Effects Mastering AI Face Swaps.pdfRevolutionizing Visual Effects Mastering AI Face Swaps.pdf
Revolutionizing Visual Effects Mastering AI Face Swaps.pdf
Undress Baby
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
Remote DBA Services
 
Fundamentals of Programming and Language Processors
Fundamentals of Programming and Language ProcessorsFundamentals of Programming and Language Processors
Fundamentals of Programming and Language Processors
Rakesh Kumar R
 
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
mz5nrf0n
 
Energy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina JonuziEnergy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina Jonuzi
Green Software Development
 
Webinar On-Demand: Using Flutter for Embedded
Webinar On-Demand: Using Flutter for EmbeddedWebinar On-Demand: Using Flutter for Embedded
Webinar On-Demand: Using Flutter for Embedded
ICS
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata
 
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
kalichargn70th171
 
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
Łukasz Chruściel
 
Using Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional SafetyUsing Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional Safety
Ayan Halder
 
UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
UI5con 2024 - Keynote: Latest News about UI5 and it’s EcosystemUI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
Peter Muessig
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
timtebeek1
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
Shane Coughlan
 
Unveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdfUnveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdf
brainerhub1
 

Recently uploaded (20)

SMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API ServiceSMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API Service
 
Microservice Teams - How the cloud changes the way we work
Microservice Teams - How the cloud changes the way we workMicroservice Teams - How the cloud changes the way we work
Microservice Teams - How the cloud changes the way we work
 
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
 
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit ParisNeo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
Neo4j - Product Vision and Knowledge Graphs - GraphSummit Paris
 
Revolutionizing Visual Effects Mastering AI Face Swaps.pdf
Revolutionizing Visual Effects Mastering AI Face Swaps.pdfRevolutionizing Visual Effects Mastering AI Face Swaps.pdf
Revolutionizing Visual Effects Mastering AI Face Swaps.pdf
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
 
Fundamentals of Programming and Language Processors
Fundamentals of Programming and Language ProcessorsFundamentals of Programming and Language Processors
Fundamentals of Programming and Language Processors
 
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
 
Energy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina JonuziEnergy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina Jonuzi
 
Webinar On-Demand: Using Flutter for Embedded
Webinar On-Demand: Using Flutter for EmbeddedWebinar On-Demand: Using Flutter for Embedded
Webinar On-Demand: Using Flutter for Embedded
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
 
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
 
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
 
Using Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional SafetyUsing Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional Safety
 
UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
UI5con 2024 - Keynote: Latest News about UI5 and it’s EcosystemUI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
 
Unveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdfUnveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdf
 

Transforming Insurance Analytics with Big Data and Automated Machine Learning


  • 1. Transforming Insurance Analytics with Big Data and Automated Machine Learning A formula for higher ROI
  • 2. Agenda Mihaela Risca Sr. Solutions Marketing Manager Financial Services Cloudera Unlocking the Value of Insurance Data Satadru Sengupta Gen Mgr. Insurance DataRobot Automated Machine Learning – A Formula for Higher ROI for Insurers
  • 3. There are two different alignments of these components in the market: • When data and analytics capability are bundled with capital, we have an insurance company. • When it is bundled with demand, we have an advisor or broker Data is at the center of the Insurance market
  • 5.
  • 6. Why Machine Learning? • Analytics return $13 for every $1 invested (Nucleus Research) • Only 12% of data is leveraged for analytics (Forrester)
  • 7. What is Machine Learning?
  • 8. Why Big Data + Machine Learning? • Machine learning thrives on growing data sets • Bring disparate data sources together • Real time streaming
  • 9. Machine Learning Use Cases in Insurance Pricing Customer Acquisition Underwriting Marketing, customer retention, prioritization. Equating risk and price, driving life-time value (LTV) Prevent Claim Fraud Underwriting triage: select the top 10% of the available risk for further analysis . Identifying claims with highest likelihood of being fraudulent.
  • 10. Poll the Audience Where in your organization you see the most value for introducing machine learning? 1. Customer acquisition and retention 2. Underwriting/Actuarial 3. Quoting/Claims management 4. Fraud detection and prevention 5. Other
  • 11. Key Data Management Challenges for Insurers Fragmented Systems and Data Silos Limited Access to Right Data at the Right Time Strategic Decisions Based on Subsets of Data Unable to Tap into New Data Sources or Correlate Data from Multiple Sources Simultaneously Disparate View of Customers, Markets and Risks Poor Data Quality and Lack of Governance
  • 12. One Data Platform for Many Applications Handle real-time data ingest from diverse sources Governance and Security Data Streams Deployment Flexibility Machine Learning Capabilities Diverse Analytical Options Combine Data from Different Sources Data Mgmt. Hub Scale easily & Cost effectively Batch or Real- time Data Streams Data Sources Data Sources Data Storage & Processing Reporting, Analytics & Auditing Data Ingest Other Data Governance (Data Lineage, Data Protection) Fitness Car Telematics Applications
  • 13. "New technology is transforming the way we work, and it is allowing the competition to do better than what we can. The strange thing is we know the urgency, and yet there is inertia." Inga Beale, CEO of Lloyd's of London February 2017
  • 14. 1. Technology 2. Consumer & Market Economics 3. Data Science & Machine Learning … and they are interconnected. Three Strategic Areas of Focus
  • 15. Machine Learning Applications in Insurance 1. Risk Selection & Pricing 2. Claims, Fraud and Litigation Management 3. Operations and Expenses Management “machine learning is the secret sauce for the product of tomorrow.” Google, 2015
  • 16. Profitable Growth & Managing Expenses Becoming a 21st Century Insurance Company
  • 17. Life Insurance Example 1 Underwriting Triage • Predicted low risk to fast track process • Predicted high risk to traditional underwriting for manual review Business Impact • Cost reduction through automation of reviews of applicants • Increased likelihood of acquisition due to fast track underwriting • Higher underwriting profitability by targeting the review process on underwriting loss avoidance Specific examples from clients • Predict the likelihood of an insured being in a preferred class or not – as determined by risk factors such as smoking status, existing condition, terminal disease • Predict the most likely class among several classes
  • 18. Predict mortality risks among patients in remission of cancer: ○ Simplify Underwriting Process: Patients with good health prospects don’t need to go through a manual medical verification and avoid adverse selection ○ Reduce Costs of Claim by identifying high-risk patients and create more accurate underwriting rules ML model predicts patients with a very high risk of mortality ● 5 times more risky than average ● Around 10% of patients Life Insurance Example 2
  • 19. … InsurTech and Future of Insurance
  • 20. Machine Learning Strategy: Where It Is Failing? • A lack of data vision • Hiring and retaining good data scientists is impossible • Lack of Inclusiveness: Targeted end-users are not included in the machine learning problem solving process. HBR Article : “Stop searching for that elusive Data Scientist”
  • 21. New Technology Opens Up New Possibilities To Executives Artificial Intelligence & Automation makes Machine Learning Affordable, Pervasive and Inclusive
  • 22. Poll the Audience How do you primarily develop and deploy machine learning solutions in your organization today? 1. Multiple, small data science teams 2. One, big enterprise data science team 3. Outsource to consulting 4. We use automated machine learning 5. We currently don’t use machine learning
  • 23. Elements of Automated Machine Learning Smart ● Accurate ● Appeal to experienced data scientists ● Control buttons are accessible to the users Easy to Use ● Intuitive, fully automated workflow ● Needs minimum inputs but has guardrails ● Interpretable & transparent ● Deployment focused
  • 24. A 10 min journey to Automated Machine Learning (AML) using DataRobot Platform can we predict which patient is coming back to hospital within the first 30 days? Demo
  • 25. What capabilities for DataRobot on Cloudera? HDFS ingest: DR can utilize data stored in HDFS directly Hadoop Modeling: Train ML models on the Cloudera data nodes directly Hadoop scoring: Any model can then be deployed on Hadoop directly Distributed (each node scores a data split) Uses Spark
  • 26. Cloudera/DataRobot Integration Details DataRobot has the highest level of integration with Cloudera Cloudera Parcels A few click to install DR in Cloudera Manager! Cloudera CSDs Can use all the functionalities of Cloudera Manager (monitoring, resource mgmt…) Kerberos / Sentry Secured authentication YARN All the resources consumed by DataRobot are managed by YARN Spark DataRobot uses Spark for Hadoop scoring
  • 28. Apache Spark Ecosystem with Spark ML lib Spark MLlib API is available in Scala, Java, and Python programming languages
  • 29. Training from Cloudera and DataRobot ● Introduction to Machine Learning - Cloudera Training https://www.cloudera.com/more/training/courses/intro-machine-learning.html ● Data Science for Executives - DataRobot Training https://www.datarobot.com/education/for-executives/ ● Machine Learning with DataRobot - DataRobot Training https://www.datarobot.com/education/for-business-analysts/
  • 30. Learn More & Contact Us https://www.cloudera.com/solutions/insurance.html Cloudera Follow us: @Cloudera mihaela@cloudera.com Taneja Group Spark Market Adoption Report : LINK DataRobot Overview: LINK https://www.datarobot.com/go/insurance/ Follow us: @DataRobot satadru@datarobot.com DataRobot Executive Briefing: LINK The Machine Learning Renaissance: LINK Register for Wrangle Conference: July 20, San Francisco http://wrangleconf.com/
  • 32. Appendices Some screenshots Cloudera - DataRobot Integration
  • 33. DataRobot - Ease of Deployment on Cloudera ● Deployment ● Mgmt/Monitoring
  • 34. The DataRobot Service on Cloudera
  • 35.
  • 37. Copyright © DataRobot, Inc. - All Rights Reserved DataRobot Modeling on Hadoop Storage Application DR Edge Node … … Worker 2 Worker 1 Worker 3 Hadoop Data Node 1 Hadoop Data Node 2 YARN container 60GB (Worker 2) YARN container 60GB (Worker 3) YARN container 60GB (Worker 1) • YARN allocates memory on a data node when a worker wants to train a model • Each model is trained in memory on an available data node
  • 38. DataRobot – Cloudera “in-place” Scoring
  • 39. DataRobot & Cloudera – Seamless LDAP Authentication