SlideShare a Scribd company logo
How Starbucks Forecasts Demand
at Scale with Facebook Prophet
and Databricks
Rob Saker, Retail Industry Leader at Databricks
Brendan O’Shaughnessy, Data Science Manager at Starbucks
Bilal Obeidat, Solutions Architect, Databricks
Housekeeping
• Your connection will be muted
• Submit questions via the Q&A panel
• Questions will be answered at the end of the webinar
• Any outstanding questions will be answered in the Databricks Forum
(https://forums.databricks.com)
• Webinar will be recorded and attachments will be made available via
www.databricks.com
Introducing Our Speakers
3
Bilal Obeidat, Lead Solutions Architect
● 15+ years of Software Development at Microsoft, Hitachi
● MS, Computer Science, Bradley University
Brendan O’Shaughnessy, Data Science Manager
● 10+ years of Analytics and Data Science at Starbucks, NRPC
● MS, Spatial Information Science and Engineering, University of Maine
Rob Saker, Retail Industry Leader
● Formerly Retail industry Chief Data Officer (CDO)
● 15+ years in Data & Analytic leadership across Retail & CPG
● MBA, University of Nebraska
Unified data analytics platform for accelerating innovation across
data science, data engineering, and business analytics
Original creators of popular data and machine learning open source projects
Global company with 5,000 customers and 450+ partners
Databricks Customers - Over Half of Top 25 Retailers by Revenue
Media & Entertainment Technology
Public Sector Retail & CPG Consumer Services Energy & IndustrialMarketing & AdTech
Data & Analytics ServicesHealthcare & Pharma Financial Services
Agenda
➔ A Consumer-Driven Supply Chain
◆ Why Demand Forecasting is Relevant?
◆ The Need for Granular Demand Forecasting at Scale
◆ How Starbucks does Demand Forecasting?
➔ Demo: ML based Forecasting With Facebook Prophet
➔ Conclusion
Customers want what they want,
where they want it, when they want it
Consumer Behavior is Changing Supply Networks
DIRECT TO CONSUMER
PERSONALIZATION
40% of Best Buy’s online purchases are picked up at the store
50% of all restaurant prepared food is consumed away from restaurant
CUSTOMER ENGAGEMENT
FROM ANYWHERE
CONVENIENCE
IS KEY
Your Supply Chain Needs to be
Consumer-Driven
A Consumer-Driven Supply Chain Starts with Demand
Forecasting
What is Demand Forecasting?
Demand forecasting is the
process where we use
historical sales, promotions,
shopper and causal data to
understand and predict
customer demand.
Shipments
CPG Direct-to-consumer shipments
Manufacturers
Orders Store shipments
Distribution
Centers
E-commerce shipments to consumers
Shoppers
In-store purchases
Buy-online, pickup
in store (BOPIS)
E-commerce
delivery
Subscription
Retail Stores
Sales
Retail Supply Chain
Orders Store shipments
Shipments
CPG Direct-to-consumer shipments
Manufacturers Distribution
Centers
Retail Stores Shoppers
E-commerce shipments to consumers
In-store purchases
Buy-online, pickup
in store (BOPIS)
E-commerce
delivery
Subscription
Sales
Retail Supply Chain Demand Forecast
Orders Store shipments
Shipments
CPG Direct-to-consumer shipments
Manufacturers Distribution
Centers
Retail Stores Shoppers
E-commerce shipments to consumers
In-store purchases
Buy-online, pickup
in store (BOPIS)
E-commerce
delivery
Subscription
Sales
Retail Supply Chain Demand Forecast
Intrastore
shipments
Store shipments
Inventory levels
Reorder quantity
DC shipments
Production plan
Home delivery logistics
Home delivery logistics
Retail Capability Model
Supply Chain Management Merchandising Store Operations
Customer
Management
Planning Freight & Logistics Planning Operations
Supplier Management
Distribution
Replenishment
Inventory
Purchasing
Sourcing
Advanced Planning
Demand Forecasting
Warehouse
Management
Reverse Logistics
Transportation
Procurement
Logistics
Allocation
Demand Planning
Revenue/Price
Management
Promotions Planning
Assortment
Category Management
Space Planning
Fresh Planning
Demand Planning
Joint planning
Trade management
Merchandising
Store Channel
Promotions
Digital
Store Auditing
Broker Management
Store Operations
Workforce
Point-of-Sale
Shelf tags/Signage
Smart Store Devices
Store Inventory
Management
Returns Management
Fresh Production
Loss Prevention
Loyalty
Digital/Ecommerce
Mobile
Customer Support
Customer
Segmentation/CRM
Voice AI/Call Center
Back Office
People Management Recruiting Finance Asset Management Real Estate
Energy & UtilitiesIT
Industry Standards &
Compliance
Legal
Master Data
Management
Retail Capability Model
Supply Chain Management Merchandising Store Operations
Customer
Management
Planning Freight & Logistics Planning Operations
Supplier Management
Distribution
Replenishment
Inventory
Purchasing
Sourcing
Advanced Planning
Demand Forecasting
Warehouse
Management
Reverse Logistics
Transportation
Procurement
Logistics
Allocation
Demand Planning
Revenue/Price
Management
Promotions Planning
Assortment
Category Management
Space Planning
Fresh Planning
Demand Planning
Joint planning
Trade management
Merchandising
Store Channel
Promotions
Digital
Store Auditing
Broker Management
Store Operations
Workforce
Point-of-Sale
Shelf tags/Signage
Smart Store Devices
Store Inventory
Management
Returns Management
Fresh Production
Loss Prevention
Loyalty
Digital/Ecommerce
Mobile
Customer Support
Customer
Segmentation/CRM
Voice AI/Call Center
Back Office
People Management Recruiting Finance Asset Management Real Estate
Energy & UtilitiesIT
Industry Standards &
Compliance
Legal
Master Data
Management
Using Data and ML Across the Supply Chain is Challenging
FORECASTING NOT
ACCURATE OR GRANULAR
LARGE VOLUMES OF
RAPIDLY CHANGING DATA
LIMITED REAL-TIME AND
CAUSAL DATA
NOT EASY TO GET TO
ACTIONABLE INSIGHTS
Managers unable to get per
day /store/SKU forecast
Data is constantly shifting and
changing. Eg. Revised data to
account for shoplifting
Omnichannel is making local
(weather), real-time (IOT),
causal (competitor pricing)
data more important
Store/Distribution managers
get BI tools with lots of data
that they have no time to
explore
Traditional Demand Forecasting with Allocations
Market AreaPromo Group Week
Day Day Day
Traditional tools can’t scale to fine-grain
DC
Promo Group
Week
105,000
1 billion
500 million
2 billion
1.5 billion
DMA
Promo Group
Week
525,000
Store
Promo Group
Week
11.922,500
Store
SKU
Week
238,450,000
Store
SKU
Day
1,669,150,000
Traditional
Forecasting
Tools
Fine grained
forecast with
Databricks
Special run/few
times a year
Demand Forecasting is Complicated
You Need Forecasts by Product by Store
Stores SKUs
You Have a Large # of Store-SKU Combinations
Processing each
model sequentially is
slow
Each model may have
local causal influences
Distribute Model Training with Apache SparkTM
Apache Spark: De-Facto Unified Analytics Engine
Runtime
Delta
Spark Core Engine
Big Data Processing
ETL + SQL + Streaming
Machine Learning
MLlib + SparkR
Uniquely combines Data & AI technologies
R Python Scala Java
Use a Unified Data Analytics Platform Across the Data and
ML Lifecycle
DO GRANULAR AND
ACCURATE FORECASTS
KEEP UP WITH CHANGING
DATA
USE REAL-TIME AND
CAUSAL DATA
ACTIONABLE AND EASY
INSIGHTS FOR MANAGERS
Point BI tools that store managers
use directly at ML insights stored
on Delta Lake tables
Use and track 100s of ML
models to forecast demand by
day/store/SKU using MLflow
Use Delta Lake UPSERTS to keep
data consistent
Single streamlined pipeline for
real time and streaming data with
Delta Lake and Apache SparkTM
Unified Data Analytics for Consumer-Driven Supply Chain
Forecast
Demand
INVENTORY
DATA
IOT DATA
COMPETITOR
DATA
PRICING
DATA
SKU
DATA
DEMOGRAPHICS
DATA
GEO-LOCATION
DATA
PO0
DATAS
VIDEO
DATA
SHIPMEN
DATAT
Optimize
Inventory
Faster
Freight and
Logistics
Databricks Delivers Fine-Grained Demand Forecasting
Traditional Analysis Suites Databricks
Fine grained forecasting Aggregate level Day, store & SKU
Real-time data No Streaming data
Custom causal Data Limited
Integrate weather, online &
mobile interactions
Multi-modal data for training No
Structured, unstructured,
image, video, sensor data.
Localize models for greater accuracy No Yes
Push predictions to the edge No Yes
Starbucks Data Science
Forecasting Framework
“Flexible forecasting at scale”
Brendan O’Shaughnessy
Data Science Manager at
Starbucks
Delivery
Forecasting has utility across business
functions
Promotions Market Planning
Operations
Equipment Usage
Labor
Challenges
▪ Many metrics and granularities
▪ Timely delivery
▪ Easy accessibility for business stakeholders
▪ Iterative nature of data science
▪ Accuracy
▪ Quick diagnostics
Forecasting framework
▪ Built-in diagnostics
▪ Status monitoring
▪ Leverages Delta Lake and ADLS for data ingress and
egress
▪ Executable from Databricks CLI
Robust and reliable
▪ Agnostic to metric and granularity
▪ Leverages diverse forecasting algorithms using
grouped pandas UDFs
▪ Modular structure for straightforward additions
and enhancements
▪ Apache SparkTM enables quick runtimes regardless
of the number of individual forecasts
Flexible
Example structure
Create custom forecasts
for any use case with
relative ease
Enterprise data
(DB Delta)
Historical
Metric
Custom data
(ADLS)
Build dataset
Compile results
To stakeholders
Model 1 Model 2 Model n
To data products
…
https://www.linkedin.com/company/starbucks/jobs/
Demand Forecasting
Demo
Key Takeaways Slide
ü A Consumer-driven supply chain is indispensable
ü Start with demand forecasting at scale
ü Localization of models is slow and tedious
ü Using Data and ML across supply chain is challenging
Databricks provides Unified Data Analytics to bring Data and ML
together for accurate and granular demand forecasting
Thank you! Questions?
38
Sign up for a Free Trial : databricks.com/trial
Register For Spark + AI Summit: https://databricks.com/sparkaisummit/
Read Blog: https://dbricks.co/DemFcst

More Related Content

What's hot

Customer 360
Customer 360Customer 360
Customer 360
Dave Birckhead
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
James Serra
 
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
HostedbyConfluent
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Building Reliable Data Lakes at Scale with Delta Lake
Building Reliable Data Lakes at Scale with Delta LakeBuilding Reliable Data Lakes at Scale with Delta Lake
Building Reliable Data Lakes at Scale with Delta Lake
Databricks
 
Building a Data Streaming Center of Excellence With Steve Gonzalez and Derek ...
Building a Data Streaming Center of Excellence With Steve Gonzalez and Derek ...Building a Data Streaming Center of Excellence With Steve Gonzalez and Derek ...
Building a Data Streaming Center of Excellence With Steve Gonzalez and Derek ...
HostedbyConfluent
 
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Kai Wähner
 
Summary introduction to data engineering
Summary introduction to data engineeringSummary introduction to data engineering
Summary introduction to data engineering
Novita Sari
 
Continuous Data Replication into Cloud Storage with Oracle GoldenGate
Continuous Data Replication into Cloud Storage with Oracle GoldenGateContinuous Data Replication into Cloud Storage with Oracle GoldenGate
Continuous Data Replication into Cloud Storage with Oracle GoldenGate
Michael Rainey
 
CHARACTERISTICS OF SUPPLY CHAIN DRIVERS AND MANAGEMENT IN STARBUCKS
CHARACTERISTICS OF SUPPLY CHAIN DRIVERS AND MANAGEMENT IN STARBUCKSCHARACTERISTICS OF SUPPLY CHAIN DRIVERS AND MANAGEMENT IN STARBUCKS
CHARACTERISTICS OF SUPPLY CHAIN DRIVERS AND MANAGEMENT IN STARBUCKS
JerishAmul
 
Webinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin Knauf
Webinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin KnaufWebinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin Knauf
Webinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin Knauf
Ververica
 
How Uber scaled its Real Time Infrastructure to Trillion events per day
How Uber scaled its Real Time Infrastructure to Trillion events per dayHow Uber scaled its Real Time Infrastructure to Trillion events per day
How Uber scaled its Real Time Infrastructure to Trillion events per day
DataWorks Summit
 
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes EverythingThe Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
Kai Wähner
 
Apache Kafka for Automotive Industry, Mobility Services & Smart City
Apache Kafka for Automotive Industry, Mobility Services & Smart CityApache Kafka for Automotive Industry, Mobility Services & Smart City
Apache Kafka for Automotive Industry, Mobility Services & Smart City
Kai Wähner
 
Building Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueBuilding Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS Glue
Amazon Web Services
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse Architecture
Databricks
 
Demystifying data engineering
Demystifying data engineeringDemystifying data engineering
Demystifying data engineering
Thang Bui (Bob)
 
ETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure DatabricksETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure Databricks
Databricks
 
The digital transformation of CPG and manufacturing
The digital transformation of CPG and manufacturingThe digital transformation of CPG and manufacturing
The digital transformation of CPG and manufacturing
Cloudera, Inc.
 
Oracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer IntroductionOracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer Introduction
Jeffrey T. Pollock
 

What's hot (20)

Customer 360
Customer 360Customer 360
Customer 360
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Building Reliable Data Lakes at Scale with Delta Lake
Building Reliable Data Lakes at Scale with Delta LakeBuilding Reliable Data Lakes at Scale with Delta Lake
Building Reliable Data Lakes at Scale with Delta Lake
 
Building a Data Streaming Center of Excellence With Steve Gonzalez and Derek ...
Building a Data Streaming Center of Excellence With Steve Gonzalez and Derek ...Building a Data Streaming Center of Excellence With Steve Gonzalez and Derek ...
Building a Data Streaming Center of Excellence With Steve Gonzalez and Derek ...
 
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
 
Summary introduction to data engineering
Summary introduction to data engineeringSummary introduction to data engineering
Summary introduction to data engineering
 
Continuous Data Replication into Cloud Storage with Oracle GoldenGate
Continuous Data Replication into Cloud Storage with Oracle GoldenGateContinuous Data Replication into Cloud Storage with Oracle GoldenGate
Continuous Data Replication into Cloud Storage with Oracle GoldenGate
 
CHARACTERISTICS OF SUPPLY CHAIN DRIVERS AND MANAGEMENT IN STARBUCKS
CHARACTERISTICS OF SUPPLY CHAIN DRIVERS AND MANAGEMENT IN STARBUCKSCHARACTERISTICS OF SUPPLY CHAIN DRIVERS AND MANAGEMENT IN STARBUCKS
CHARACTERISTICS OF SUPPLY CHAIN DRIVERS AND MANAGEMENT IN STARBUCKS
 
Webinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin Knauf
Webinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin KnaufWebinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin Knauf
Webinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin Knauf
 
How Uber scaled its Real Time Infrastructure to Trillion events per day
How Uber scaled its Real Time Infrastructure to Trillion events per dayHow Uber scaled its Real Time Infrastructure to Trillion events per day
How Uber scaled its Real Time Infrastructure to Trillion events per day
 
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes EverythingThe Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
 
Apache Kafka for Automotive Industry, Mobility Services & Smart City
Apache Kafka for Automotive Industry, Mobility Services & Smart CityApache Kafka for Automotive Industry, Mobility Services & Smart City
Apache Kafka for Automotive Industry, Mobility Services & Smart City
 
Building Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueBuilding Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS Glue
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse Architecture
 
Demystifying data engineering
Demystifying data engineeringDemystifying data engineering
Demystifying data engineering
 
ETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure DatabricksETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure Databricks
 
The digital transformation of CPG and manufacturing
The digital transformation of CPG and manufacturingThe digital transformation of CPG and manufacturing
The digital transformation of CPG and manufacturing
 
Oracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer IntroductionOracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer Introduction
 

Similar to How Starbucks Forecasts Demand at Scale with Facebook Prophet and Databricks

Knowledge Graphs for Supply Chain Operations.pdf
Knowledge Graphs for Supply Chain Operations.pdfKnowledge Graphs for Supply Chain Operations.pdf
Knowledge Graphs for Supply Chain Operations.pdf
Vaticle
 
Using ML and Azure to improve Customer Lifetime Value
Using ML and Azure to improve Customer Lifetime ValueUsing ML and Azure to improve Customer Lifetime Value
Using ML and Azure to improve Customer Lifetime Value
Navin Albert
 
Big Data & Analytics 101: How Customer Lifetime Value Enhances Predictive Mar...
Big Data & Analytics 101: How Customer Lifetime Value Enhances Predictive Mar...Big Data & Analytics 101: How Customer Lifetime Value Enhances Predictive Mar...
Big Data & Analytics 101: How Customer Lifetime Value Enhances Predictive Mar...
Big Cloud Analytics, Inc.
 
Turning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesTurning Big Data into Better Business Outcomes
Turning Big Data into Better Business Outcomes
Cisco Canada
 
Data Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely
 
Data Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely
 
Webinar: Know Where, Why, What: Big Data’s Role In Predictive And Location A...
Webinar:  Know Where, Why, What: Big Data’s Role In Predictive And Location A...Webinar:  Know Where, Why, What: Big Data’s Role In Predictive And Location A...
Webinar: Know Where, Why, What: Big Data’s Role In Predictive And Location A...
G3 Communications
 
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha JarettRetail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Daniel Zivkovic
 
Gain a Holistic View of your Customer's Journey
Gain a Holistic View of your Customer's JourneyGain a Holistic View of your Customer's Journey
Gain a Holistic View of your Customer's Journey
Platfora
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
Inside Analysis
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Denodo
 
Use of Analytics to recover from COVID19 hit economy
Use of Analytics to recover from COVID19 hit economyUse of Analytics to recover from COVID19 hit economy
Use of Analytics to recover from COVID19 hit economy
Amit Parija
 
Customer Intelligence_ Harnessing Elephants at Transamerica Presentation (1)
Customer Intelligence_ Harnessing Elephants at Transamerica    Presentation (1)Customer Intelligence_ Harnessing Elephants at Transamerica    Presentation (1)
Customer Intelligence_ Harnessing Elephants at Transamerica Presentation (1)
Vishal Bamba
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
BigDataEverywhere
 
Big Data: The Road to Know More About Your Business
Big Data:  The Road to Know More About Your BusinessBig Data:  The Road to Know More About Your Business
Big Data: The Road to Know More About Your Business
OAUGNJ
 
Finding Data at Risk for CCPA Compliance
Finding Data at Risk for CCPA ComplianceFinding Data at Risk for CCPA Compliance
Finding Data at Risk for CCPA Compliance
Precisely
 
Supply Chain 2030: Presentation by Lora Cecere at CLX Conference
Supply Chain 2030: Presentation by Lora Cecere at CLX ConferenceSupply Chain 2030: Presentation by Lora Cecere at CLX Conference
Supply Chain 2030: Presentation by Lora Cecere at CLX Conference
Lora Cecere
 
Journey to a Modern Data Architecture
Journey to a Modern Data ArchitectureJourney to a Modern Data Architecture
Journey to a Modern Data Architecture
Precisely
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
Do You Trust Your Machine Learning Outcomes?
 Do You Trust Your Machine Learning Outcomes?  Do You Trust Your Machine Learning Outcomes?
Do You Trust Your Machine Learning Outcomes?
Precisely
 

Similar to How Starbucks Forecasts Demand at Scale with Facebook Prophet and Databricks (20)

Knowledge Graphs for Supply Chain Operations.pdf
Knowledge Graphs for Supply Chain Operations.pdfKnowledge Graphs for Supply Chain Operations.pdf
Knowledge Graphs for Supply Chain Operations.pdf
 
Using ML and Azure to improve Customer Lifetime Value
Using ML and Azure to improve Customer Lifetime ValueUsing ML and Azure to improve Customer Lifetime Value
Using ML and Azure to improve Customer Lifetime Value
 
Big Data & Analytics 101: How Customer Lifetime Value Enhances Predictive Mar...
Big Data & Analytics 101: How Customer Lifetime Value Enhances Predictive Mar...Big Data & Analytics 101: How Customer Lifetime Value Enhances Predictive Mar...
Big Data & Analytics 101: How Customer Lifetime Value Enhances Predictive Mar...
 
Turning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesTurning Big Data into Better Business Outcomes
Turning Big Data into Better Business Outcomes
 
Data Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
 
Data Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
 
Webinar: Know Where, Why, What: Big Data’s Role In Predictive And Location A...
Webinar:  Know Where, Why, What: Big Data’s Role In Predictive And Location A...Webinar:  Know Where, Why, What: Big Data’s Role In Predictive And Location A...
Webinar: Know Where, Why, What: Big Data’s Role In Predictive And Location A...
 
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha JarettRetail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
 
Gain a Holistic View of your Customer's Journey
Gain a Holistic View of your Customer's JourneyGain a Holistic View of your Customer's Journey
Gain a Holistic View of your Customer's Journey
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
 
Use of Analytics to recover from COVID19 hit economy
Use of Analytics to recover from COVID19 hit economyUse of Analytics to recover from COVID19 hit economy
Use of Analytics to recover from COVID19 hit economy
 
Customer Intelligence_ Harnessing Elephants at Transamerica Presentation (1)
Customer Intelligence_ Harnessing Elephants at Transamerica    Presentation (1)Customer Intelligence_ Harnessing Elephants at Transamerica    Presentation (1)
Customer Intelligence_ Harnessing Elephants at Transamerica Presentation (1)
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
 
Big Data: The Road to Know More About Your Business
Big Data:  The Road to Know More About Your BusinessBig Data:  The Road to Know More About Your Business
Big Data: The Road to Know More About Your Business
 
Finding Data at Risk for CCPA Compliance
Finding Data at Risk for CCPA ComplianceFinding Data at Risk for CCPA Compliance
Finding Data at Risk for CCPA Compliance
 
Supply Chain 2030: Presentation by Lora Cecere at CLX Conference
Supply Chain 2030: Presentation by Lora Cecere at CLX ConferenceSupply Chain 2030: Presentation by Lora Cecere at CLX Conference
Supply Chain 2030: Presentation by Lora Cecere at CLX Conference
 
Journey to a Modern Data Architecture
Journey to a Modern Data ArchitectureJourney to a Modern Data Architecture
Journey to a Modern Data Architecture
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Do You Trust Your Machine Learning Outcomes?
 Do You Trust Your Machine Learning Outcomes?  Do You Trust Your Machine Learning Outcomes?
Do You Trust Your Machine Learning Outcomes?
 

Recently uploaded

原版复制澳洲莫纳什大学毕业证硕士学历原版一模一样
原版复制澳洲莫纳什大学毕业证硕士学历原版一模一样原版复制澳洲莫纳什大学毕业证硕士学历原版一模一样
原版复制澳洲莫纳什大学毕业证硕士学历原版一模一样
w6zyq7uj
 
A Guide to Planting and Caring for Your Land
A Guide to Planting and Caring for Your LandA Guide to Planting and Caring for Your Land
A Guide to Planting and Caring for Your Land
The Green Corner
 
Supermarket Floral Ad Roundup- Week 23 2024.pdf
Supermarket Floral Ad Roundup- Week 23  2024.pdfSupermarket Floral Ad Roundup- Week 23  2024.pdf
Supermarket Floral Ad Roundup- Week 23 2024.pdf
KarliNelson4
 
The Future of E-commerce: first-hands insights.
The Future of E-commerce: first-hands insights.The Future of E-commerce: first-hands insights.
The Future of E-commerce: first-hands insights.
Solvd, Inc.
 
Introducing Bliss Point by Tinuiti Webinar
Introducing Bliss Point by Tinuiti WebinarIntroducing Bliss Point by Tinuiti Webinar
Introducing Bliss Point by Tinuiti Webinar
Tinuiti
 
快速办理(BC毕业证书)波士顿学院毕业证PDF成绩单一模一样
快速办理(BC毕业证书)波士顿学院毕业证PDF成绩单一模一样快速办理(BC毕业证书)波士顿学院毕业证PDF成绩单一模一样
快速办理(BC毕业证书)波士顿学院毕业证PDF成绩单一模一样
aezncfe
 
Social Media is Eating Retail- The $Trillion Trend You Can't Ignore.pdf
Social Media is Eating Retail- The $Trillion Trend You Can't Ignore.pdfSocial Media is Eating Retail- The $Trillion Trend You Can't Ignore.pdf
Social Media is Eating Retail- The $Trillion Trend You Can't Ignore.pdf
Jasper Colin
 
怎么购买澳洲南十字星大学毕业证文凭毕业证原版一模一样
怎么购买澳洲南十字星大学毕业证文凭毕业证原版一模一样怎么购买澳洲南十字星大学毕业证文凭毕业证原版一模一样
怎么购买澳洲南十字星大学毕业证文凭毕业证原版一模一样
w6zyq7uj
 
Supermarket Floral Ad Roundup- Week 22 2024.pdf
Supermarket Floral Ad Roundup- Week 22 2024.pdfSupermarket Floral Ad Roundup- Week 22 2024.pdf
Supermarket Floral Ad Roundup- Week 22 2024.pdf
KarliNelson4
 

Recently uploaded (9)

原版复制澳洲莫纳什大学毕业证硕士学历原版一模一样
原版复制澳洲莫纳什大学毕业证硕士学历原版一模一样原版复制澳洲莫纳什大学毕业证硕士学历原版一模一样
原版复制澳洲莫纳什大学毕业证硕士学历原版一模一样
 
A Guide to Planting and Caring for Your Land
A Guide to Planting and Caring for Your LandA Guide to Planting and Caring for Your Land
A Guide to Planting and Caring for Your Land
 
Supermarket Floral Ad Roundup- Week 23 2024.pdf
Supermarket Floral Ad Roundup- Week 23  2024.pdfSupermarket Floral Ad Roundup- Week 23  2024.pdf
Supermarket Floral Ad Roundup- Week 23 2024.pdf
 
The Future of E-commerce: first-hands insights.
The Future of E-commerce: first-hands insights.The Future of E-commerce: first-hands insights.
The Future of E-commerce: first-hands insights.
 
Introducing Bliss Point by Tinuiti Webinar
Introducing Bliss Point by Tinuiti WebinarIntroducing Bliss Point by Tinuiti Webinar
Introducing Bliss Point by Tinuiti Webinar
 
快速办理(BC毕业证书)波士顿学院毕业证PDF成绩单一模一样
快速办理(BC毕业证书)波士顿学院毕业证PDF成绩单一模一样快速办理(BC毕业证书)波士顿学院毕业证PDF成绩单一模一样
快速办理(BC毕业证书)波士顿学院毕业证PDF成绩单一模一样
 
Social Media is Eating Retail- The $Trillion Trend You Can't Ignore.pdf
Social Media is Eating Retail- The $Trillion Trend You Can't Ignore.pdfSocial Media is Eating Retail- The $Trillion Trend You Can't Ignore.pdf
Social Media is Eating Retail- The $Trillion Trend You Can't Ignore.pdf
 
怎么购买澳洲南十字星大学毕业证文凭毕业证原版一模一样
怎么购买澳洲南十字星大学毕业证文凭毕业证原版一模一样怎么购买澳洲南十字星大学毕业证文凭毕业证原版一模一样
怎么购买澳洲南十字星大学毕业证文凭毕业证原版一模一样
 
Supermarket Floral Ad Roundup- Week 22 2024.pdf
Supermarket Floral Ad Roundup- Week 22 2024.pdfSupermarket Floral Ad Roundup- Week 22 2024.pdf
Supermarket Floral Ad Roundup- Week 22 2024.pdf
 

How Starbucks Forecasts Demand at Scale with Facebook Prophet and Databricks

  • 1. How Starbucks Forecasts Demand at Scale with Facebook Prophet and Databricks Rob Saker, Retail Industry Leader at Databricks Brendan O’Shaughnessy, Data Science Manager at Starbucks Bilal Obeidat, Solutions Architect, Databricks
  • 2. Housekeeping • Your connection will be muted • Submit questions via the Q&A panel • Questions will be answered at the end of the webinar • Any outstanding questions will be answered in the Databricks Forum (https://forums.databricks.com) • Webinar will be recorded and attachments will be made available via www.databricks.com
  • 3. Introducing Our Speakers 3 Bilal Obeidat, Lead Solutions Architect ● 15+ years of Software Development at Microsoft, Hitachi ● MS, Computer Science, Bradley University Brendan O’Shaughnessy, Data Science Manager ● 10+ years of Analytics and Data Science at Starbucks, NRPC ● MS, Spatial Information Science and Engineering, University of Maine Rob Saker, Retail Industry Leader ● Formerly Retail industry Chief Data Officer (CDO) ● 15+ years in Data & Analytic leadership across Retail & CPG ● MBA, University of Nebraska
  • 4. Unified data analytics platform for accelerating innovation across data science, data engineering, and business analytics Original creators of popular data and machine learning open source projects Global company with 5,000 customers and 450+ partners
  • 5. Databricks Customers - Over Half of Top 25 Retailers by Revenue Media & Entertainment Technology Public Sector Retail & CPG Consumer Services Energy & IndustrialMarketing & AdTech Data & Analytics ServicesHealthcare & Pharma Financial Services
  • 6. Agenda ➔ A Consumer-Driven Supply Chain ◆ Why Demand Forecasting is Relevant? ◆ The Need for Granular Demand Forecasting at Scale ◆ How Starbucks does Demand Forecasting? ➔ Demo: ML based Forecasting With Facebook Prophet ➔ Conclusion
  • 7. Customers want what they want, where they want it, when they want it Consumer Behavior is Changing Supply Networks DIRECT TO CONSUMER PERSONALIZATION 40% of Best Buy’s online purchases are picked up at the store 50% of all restaurant prepared food is consumed away from restaurant CUSTOMER ENGAGEMENT FROM ANYWHERE CONVENIENCE IS KEY
  • 8. Your Supply Chain Needs to be Consumer-Driven
  • 9. A Consumer-Driven Supply Chain Starts with Demand Forecasting What is Demand Forecasting? Demand forecasting is the process where we use historical sales, promotions, shopper and causal data to understand and predict customer demand.
  • 10. Shipments CPG Direct-to-consumer shipments Manufacturers Orders Store shipments Distribution Centers E-commerce shipments to consumers Shoppers In-store purchases Buy-online, pickup in store (BOPIS) E-commerce delivery Subscription Retail Stores Sales Retail Supply Chain
  • 11. Orders Store shipments Shipments CPG Direct-to-consumer shipments Manufacturers Distribution Centers Retail Stores Shoppers E-commerce shipments to consumers In-store purchases Buy-online, pickup in store (BOPIS) E-commerce delivery Subscription Sales Retail Supply Chain Demand Forecast
  • 12. Orders Store shipments Shipments CPG Direct-to-consumer shipments Manufacturers Distribution Centers Retail Stores Shoppers E-commerce shipments to consumers In-store purchases Buy-online, pickup in store (BOPIS) E-commerce delivery Subscription Sales Retail Supply Chain Demand Forecast Intrastore shipments Store shipments Inventory levels Reorder quantity DC shipments Production plan Home delivery logistics Home delivery logistics
  • 13. Retail Capability Model Supply Chain Management Merchandising Store Operations Customer Management Planning Freight & Logistics Planning Operations Supplier Management Distribution Replenishment Inventory Purchasing Sourcing Advanced Planning Demand Forecasting Warehouse Management Reverse Logistics Transportation Procurement Logistics Allocation Demand Planning Revenue/Price Management Promotions Planning Assortment Category Management Space Planning Fresh Planning Demand Planning Joint planning Trade management Merchandising Store Channel Promotions Digital Store Auditing Broker Management Store Operations Workforce Point-of-Sale Shelf tags/Signage Smart Store Devices Store Inventory Management Returns Management Fresh Production Loss Prevention Loyalty Digital/Ecommerce Mobile Customer Support Customer Segmentation/CRM Voice AI/Call Center Back Office People Management Recruiting Finance Asset Management Real Estate Energy & UtilitiesIT Industry Standards & Compliance Legal Master Data Management
  • 14. Retail Capability Model Supply Chain Management Merchandising Store Operations Customer Management Planning Freight & Logistics Planning Operations Supplier Management Distribution Replenishment Inventory Purchasing Sourcing Advanced Planning Demand Forecasting Warehouse Management Reverse Logistics Transportation Procurement Logistics Allocation Demand Planning Revenue/Price Management Promotions Planning Assortment Category Management Space Planning Fresh Planning Demand Planning Joint planning Trade management Merchandising Store Channel Promotions Digital Store Auditing Broker Management Store Operations Workforce Point-of-Sale Shelf tags/Signage Smart Store Devices Store Inventory Management Returns Management Fresh Production Loss Prevention Loyalty Digital/Ecommerce Mobile Customer Support Customer Segmentation/CRM Voice AI/Call Center Back Office People Management Recruiting Finance Asset Management Real Estate Energy & UtilitiesIT Industry Standards & Compliance Legal Master Data Management
  • 15. Using Data and ML Across the Supply Chain is Challenging FORECASTING NOT ACCURATE OR GRANULAR LARGE VOLUMES OF RAPIDLY CHANGING DATA LIMITED REAL-TIME AND CAUSAL DATA NOT EASY TO GET TO ACTIONABLE INSIGHTS Managers unable to get per day /store/SKU forecast Data is constantly shifting and changing. Eg. Revised data to account for shoplifting Omnichannel is making local (weather), real-time (IOT), causal (competitor pricing) data more important Store/Distribution managers get BI tools with lots of data that they have no time to explore
  • 16. Traditional Demand Forecasting with Allocations Market AreaPromo Group Week Day Day Day
  • 17. Traditional tools can’t scale to fine-grain DC Promo Group Week 105,000 1 billion 500 million 2 billion 1.5 billion DMA Promo Group Week 525,000 Store Promo Group Week 11.922,500 Store SKU Week 238,450,000 Store SKU Day 1,669,150,000 Traditional Forecasting Tools Fine grained forecast with Databricks Special run/few times a year
  • 18. Demand Forecasting is Complicated You Need Forecasts by Product by Store Stores SKUs
  • 19. You Have a Large # of Store-SKU Combinations
  • 20.
  • 21.
  • 23. Each model may have local causal influences
  • 24. Distribute Model Training with Apache SparkTM
  • 25. Apache Spark: De-Facto Unified Analytics Engine Runtime Delta Spark Core Engine Big Data Processing ETL + SQL + Streaming Machine Learning MLlib + SparkR Uniquely combines Data & AI technologies R Python Scala Java
  • 26. Use a Unified Data Analytics Platform Across the Data and ML Lifecycle DO GRANULAR AND ACCURATE FORECASTS KEEP UP WITH CHANGING DATA USE REAL-TIME AND CAUSAL DATA ACTIONABLE AND EASY INSIGHTS FOR MANAGERS Point BI tools that store managers use directly at ML insights stored on Delta Lake tables Use and track 100s of ML models to forecast demand by day/store/SKU using MLflow Use Delta Lake UPSERTS to keep data consistent Single streamlined pipeline for real time and streaming data with Delta Lake and Apache SparkTM
  • 27. Unified Data Analytics for Consumer-Driven Supply Chain Forecast Demand INVENTORY DATA IOT DATA COMPETITOR DATA PRICING DATA SKU DATA DEMOGRAPHICS DATA GEO-LOCATION DATA PO0 DATAS VIDEO DATA SHIPMEN DATAT Optimize Inventory Faster Freight and Logistics
  • 28. Databricks Delivers Fine-Grained Demand Forecasting Traditional Analysis Suites Databricks Fine grained forecasting Aggregate level Day, store & SKU Real-time data No Streaming data Custom causal Data Limited Integrate weather, online & mobile interactions Multi-modal data for training No Structured, unstructured, image, video, sensor data. Localize models for greater accuracy No Yes Push predictions to the edge No Yes
  • 29. Starbucks Data Science Forecasting Framework “Flexible forecasting at scale” Brendan O’Shaughnessy Data Science Manager at Starbucks
  • 30. Delivery Forecasting has utility across business functions Promotions Market Planning Operations Equipment Usage Labor
  • 31. Challenges ▪ Many metrics and granularities ▪ Timely delivery ▪ Easy accessibility for business stakeholders ▪ Iterative nature of data science ▪ Accuracy ▪ Quick diagnostics
  • 32. Forecasting framework ▪ Built-in diagnostics ▪ Status monitoring ▪ Leverages Delta Lake and ADLS for data ingress and egress ▪ Executable from Databricks CLI Robust and reliable ▪ Agnostic to metric and granularity ▪ Leverages diverse forecasting algorithms using grouped pandas UDFs ▪ Modular structure for straightforward additions and enhancements ▪ Apache SparkTM enables quick runtimes regardless of the number of individual forecasts Flexible
  • 33. Example structure Create custom forecasts for any use case with relative ease Enterprise data (DB Delta) Historical Metric Custom data (ADLS) Build dataset Compile results To stakeholders Model 1 Model 2 Model n To data products …
  • 36. Key Takeaways Slide ü A Consumer-driven supply chain is indispensable ü Start with demand forecasting at scale ü Localization of models is slow and tedious ü Using Data and ML across supply chain is challenging Databricks provides Unified Data Analytics to bring Data and ML together for accurate and granular demand forecasting
  • 37. Thank you! Questions? 38 Sign up for a Free Trial : databricks.com/trial Register For Spark + AI Summit: https://databricks.com/sparkaisummit/ Read Blog: https://dbricks.co/DemFcst