Agile Data Science
Alexander Bauer
Lead Data Scientist @ Lidl
Frankfurt Analytics Meetup, 2017/02/24
Agenda
• Data Science
• Challenges
• Agile Data Science Projects
• Case Study
What is Data Science?
• Data science, also known as data-driven
science, is an interdisciplinary field about
scientific methods, processes and systems
to extract knowledge or insights from data
in various forms, either structured or
unstructured – Wikipedia
Business Goals
Why do companies hire data scientists?
• Reduce costs
• Increase revenue
• Reduce risk
• Create innovation
Deliverables
How do data scientists deliver?
• Actionable insights (reports)
• Data products
• New product features
• Trials, A/B Testing
Challenges
Why do many data science projects fail?
• Lack of Business Understanding
• Data Access (Security, Privacy)
• Deployment and Operation (Scalability,
Acceptance)
• Time to market (Competition, Budget)
Case Study: Data Science for Sales Department
I want a
recommender
system for my
Sales Reps
Sure, we can use
Alternating Least
Squares Singular
Value
Decomposition!
Case Study: Data Science for Sales Department
Show me what you
can do with Deep
Learning
Cool, we can do
something with
TensorFlow on
your data
Case Study: Data Science for Sales Department
I want a
dashboard of
sales by country
and product
Well, we can do
visualizations - but
that's actually not my
job!
Typical pitfalls during project execution
Modeling
Trial/Pilot
Operationalization
No access to data
Model does
not scale
Users don't
accept solution
Fails to meet business objective
Not enough signal
12 months
Out of budget
Solution: Iterative Approach
CRISP-DM
Agile Data Science
How can we implement CRISP-DM in practice?
• Agile Product Management
• Agile Development
• Data Science Platform / Data Lake
Agile Product Management – The Product Vision Statement¹
"Leverage data science to increase sales team productivity"
Target Group
• Sales Reps
• Sales Manager
Needs
• Close deals
• Prioritize leads
• Prevent churn
• Acquire new leads
• Up-sell
• Cross-sell
Product
• ?
Business Goals
• Increase conversion rate
• Increase average basket size
• Reduce churn rate
• Grow customer base
¹ Roman Pichler: Agile Product Management with Scrum
User Stories – Bridging the gap between
algorithms and business needs
Association Rules:
As a sales rep, I need to understand which products are often bought together, so that I
can recommend additional products during sales calls and increase upselling.
Churn Factor Analysis:
As a sales rep, I need to understand the factors that drive churn so that I can select
customers to call, make sure they are satisfied with our products and reduce churn.
Recommender system:
As a sales rep, for each customer I need to understand which products were bought by
customers with similar purchase history, so that I can make personalized
recommendations and increase upselling.
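The "Association Rules" story can be illustrated with a minimal sketch that counts which product pairs appear together in baskets and scores them by support, confidence and lift. This is not the speaker's implementation: the function name `pair_rules`, the `min_support` threshold and the sample baskets are all assumptions made up for illustration, and only pairs (not larger itemsets) are considered.

```python
from collections import Counter
from itertools import combinations


def pair_rules(baskets, min_support=0.3):
    """Return (antecedent, consequent, support, confidence, lift) for item pairs.

    Hypothetical helper for illustration; only rules between two single items.
    """
    n = len(baskets)
    item_counts = Counter()
    pair_counts = Counter()
    for basket in baskets:
        items = sorted(set(basket))  # de-duplicate and fix the pair order
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # share of baskets containing both items
        if support < min_support:
            continue
        for x, y in ((a, b), (b, a)):
            confidence = count / item_counts[x]        # P(y | x)
            lift = confidence / (item_counts[y] / n)   # P(y | x) / P(y)
            rules.append((x, y, support, confidence, lift))
    return sorted(rules, key=lambda r: r[4], reverse=True)


# Made-up sample data, not taken from the talk.
baskets = [
    ["coffee", "filters", "milk"],
    ["coffee", "filters"],
    ["coffee", "milk"],
    ["tea", "milk"],
    ["coffee", "filters", "sugar"],
]
rules = pair_rules(baskets, min_support=0.4)
for x, y, support, confidence, lift in rules:
    print(f"{x} -> {y}: support={support:.2f} confidence={confidence:.2f} lift={lift:.2f}")
```

A rule with lift above 1 suggests the two products are bought together more often than chance would predict, which is the kind of hint a sales rep could use during a call.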
Story Mapping and Release Planning
Up/Cross-Selling Churn Prevention Leads Prioritization
User
Interface/Deployment
Association Rules Factor Analysis
Conversion - Factor
Analysis
Item-Item
Recommender
Viz: Top N Items per
customer
A/B Testing
Simple Predictive
Model for Churn
(sales history data)
Improved predictive
model for churn
(incl. CRM data)
Content-based
recommender for cold-
start (incl. CRM data)
Release 1
Release 2
Release 3
A/B Testing
Viz: Top N customers
likely to churn
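As a rough illustration of the "Item-Item Recommender" and "Top N Items per customer" entries on the story map, the sketch below scores unseen items by their cosine similarity to items a customer already bought. It is an assumption-laden example, not the method used in the case study: the function names `item_similarities` and `top_n`, the binary purchase representation and the sample data are all hypothetical.

```python
from collections import defaultdict
from math import sqrt


def item_similarities(purchases):
    """Cosine similarity between items, treating purchases as binary sets."""
    buyers = defaultdict(set)
    for customer, items in purchases.items():
        for item in items:
            buyers[item].add(customer)
    sims = defaultdict(dict)
    for a in buyers:
        for b in buyers:
            if a != b:
                overlap = len(buyers[a] & buyers[b])
                if overlap:
                    sims[a][b] = overlap / sqrt(len(buyers[a]) * len(buyers[b]))
    return sims


def top_n(customer, purchases, sims, n=3):
    """Rank items the customer has not bought by summed similarity."""
    owned = purchases[customer]
    scores = defaultdict(float)
    for item in owned:
        for other, sim in sims.get(item, {}).items():
            if other not in owned:
                scores[other] += sim
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scores, key=lambda i: (-scores[i], i))[:n]


# Made-up purchase history, not taken from the talk.
purchases = {
    "c1": {"coffee", "filters"},
    "c2": {"coffee", "filters", "milk"},
    "c3": {"coffee", "milk"},
    "c4": {"tea", "milk"},
}
sims = item_similarities(purchases)
recs = top_n("c1", purchases, sims, n=2)
print(recs)
```

A model this simple fits the "good enough first" idea of an early release; a customer with no purchase history gets no recommendations here, which is the cold-start gap the content-based recommender in a later release is meant to address.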
Agile Development with Scrum
Data Science is a Team Sport
Data Lake/
Agile Platform
CRM Purchase Data Call Center Tickets
Platform Layer
Application
Layer
Docker/VMs
App
Security/Auth
Auditing
Monitoring
Unstructured Data Structured Data
Scalable Job Execution / Query Engine
App REST
ETL
Query Interface
/Notebooks
Visualization Tools
Scheduling
Legacy
Systems
Business Users
Analysts/
Data Scientists
Summary / Call for Action
• Data science projects rarely fail because of insufficient modeling skills
• Focus on business value, deliver "good enough" models first
• Deliver in small increments that already provide value end-to-end, present
in Sprint Reviews to all stakeholders
• Manage stakeholders using a clear product vision, a user story backlog and
release plans
• Deploy as early as possible to ensure user acceptance, declaring the release
as "beta"
• Build an infrastructure that enables agile development
Thank you! Questions?
