SlideShare a Scribd company logo
Applying data science to sales pipelines –
for fun and profit!
!
Andy Twigg!
CTO!
Data!
science!
Domain!
expertise!
Machine!
learning! Data!
•  62B sales pipeline records!
•  Structured, unstructured!
•  3rd party public data!
•  Fine-grained temporal data!
Deep expertise:!
•  sales!
•  forecasting!
•  revenue models!
•  Automated ML infrastructure!
•  ML models tuned for specific
problems!
CUSTOMERS!
DATA SCIENCE @ C9!
•  Opportunity Scoring!
•  What is Pr(win) for this deal?!
•  What is Pr(win in quarter) for this deal?!
•  How does this compare to sales team commits?!
•  Which deals can we influence most?!
•  Forecasting!
•  How much will we close this quarter?!
SALES PIPELINES & OPPORTUNITIES!
•  Opportunities are temporal creatures; while ‘open’ they proceed through a number of observations
and terminate in one of a discrete set of ‘closed’ states – typically won or lost!
•  Usually they proceed through ‘stages’, except:!
•  An opportunity can be entered into the CRM system as closed (no open observations)!
•  Stages are only a partial order - can skip / revisit stages!
•  Can be re-opened after closed!
•  As the opportunity evolves, we get more and more data about the opportunity!
•  A pipeline is a set of open opportunities!
Lead
created!
Stage:
Qualifying!
Email
sent!
Email sent:
response!
Amount=
$1000! Call!
Stage:
demo!
Meeting! Demo!
Push
close
date!
Stage:
negotiation!
Closed/
won!
Reopened
Amount=
$2000!
Closed/won!
ANATOMY OF AN OPPTY!
ANATOMY OF AN OPPTY!
Pushed out
Pulled back
in
Final outcome:
won
Committed
here (by the
sales rep)
ANATOMY OF AN OPPTY!
Pushed out
Pulled back
in
Final outcome:
won
Committed
here (by the
sales rep)
Predicted
won from
the start
Predicted won
in the correct
quarter
PREDICTIVE ENGINE!
Build a fine-grained
history of closed
opportunities
•  Cleaning!
•  Preprocessing!
•  Featurizing!
•  ~10 GB/customer!
•  ~ 1M training rows!
!
1,000s of raw
signals per
opportunity
•  Structured (CRM, ERP)!
•  Unstructured (NLP)!
•  Firmographic!
•  Gov sources!
•  SEC filings!
•  Crunchbase!
•  …!
!
Identify historic deals
with similar behavior
Continuously re-score
opportunities as they
evolve
Update model as
opportunities close
•  Fully-automated model rebuilding and scoring platform!
•  Model input features:!
•  Historic observations of opportunity!
•  Sales-specific features e.g. momentum!
•  Temporal features e.g. std(amount over last 30 days)!
•  Industry-wide features e.g. avg_sales_cycle(target)!
•  Continuously cross-validated model tuning!
•  Extensible, scalable platform using Hadoop (HDFS), Python!
Win/Loss Model (Random Forest)
Estimate Pr(win)
Duration Model (Poisson Regression)
Estimate Pr(win in quarter)
Influencer Model (Linear)
Positive/Negative Drivers
•  Standard Features
•  Temporal Features
•  Derived Features
BEHIND THE SCORES!
©2014 All Rights Reserved
©2014 All Rights Reserved
sales team: good precision (~70-80%) but poor recall (~10-40%)!
C9 won precision ~ sales team won precision!
C9 won recall ~ 3 x sales team won recall!
First observation Last observation
precision recall f1 precision recall f1
C9 scoring 0.65 0.86 0.74 0.75 0.93 0.83
Commit 0.70 0.07 0.13 0.87 0.45 0.59
FORECASTING: TOP-DOWN VS BOTTOM-UP!
Top-down: Predict current quarter based on
previous quarters!
!
Accounts for seasonality and trending!
!
Ignores state of current pipeline!
0.0e+002.5e+08
observed
5.0e+072.5e+08
trend
−5e+065e+06
seasonal
−1e+075e+06
2013.0 2013.2 2013.4 2013.6 2013.8 2014.0 2014.2 2014.4
random
Time
Decomposition of additive time series
Bottom-up: Predict current quarter based
on currently open pipeline!
!
Considers quality of deals in pipe!
!
Ignores trends, deals not in pipe!
$265,410!
$157,000
77%
$200,000
37%
$82,000
86%
+!
-!
+!
-!
HYBRID FORECASTING!
top down + bottom up!
20
40
60
11 10 9 8 7 6 5 4 3 2 1
Weeks to EOQ
Amount($M)
C9
Final Amount
Actual Amount
Amount Forecast
•  Augment time-series model with
side information from bottom-up
model, e.g.:!
•  Amount predicted to close
in current quarter!
•  Average score of currently
open opportunities!
•  Average predicted days to
close!
!
•  Sometimes known as ARIMAX!
log(yt) ∼
12
i=1
log(yt−i) + log(x
(1)
t−12) + log(x
(2)
t−12) + log(x
(3)
t−12)
Applying data science to sales pipelines - for fun and profit

More Related Content

What's hot

OCCLUSION IN COMPLETE DENTURES.pptx
OCCLUSION IN COMPLETE DENTURES.pptxOCCLUSION IN COMPLETE DENTURES.pptx
OCCLUSION IN COMPLETE DENTURES.pptx
drbiniravi
 
Clasps in orthodontics /certified fixed orthodontic courses by Indian dental ...
Clasps in orthodontics /certified fixed orthodontic courses by Indian dental ...Clasps in orthodontics /certified fixed orthodontic courses by Indian dental ...
Clasps in orthodontics /certified fixed orthodontic courses by Indian dental ...
Indian dental academy
 
Mechanical and physical properties of Prosthodontic materials
Mechanical and physical properties of Prosthodontic materialsMechanical and physical properties of Prosthodontic materials
Mechanical and physical properties of Prosthodontic materials
Subuhi Siddiqui
 
Anatomy and physiology of denture bearing areas /certified fixed orthodontic...
Anatomy and physiology of denture bearing areas  /certified fixed orthodontic...Anatomy and physiology of denture bearing areas  /certified fixed orthodontic...
Anatomy and physiology of denture bearing areas /certified fixed orthodontic...
Indian dental academy
 
Properties of orthodontic wires /certified fixed orthodontic courses by India...
Properties of orthodontic wires /certified fixed orthodontic courses by India...Properties of orthodontic wires /certified fixed orthodontic courses by India...
Properties of orthodontic wires /certified fixed orthodontic courses by India...
Indian dental academy
 
Ppt for spotters
Ppt for spottersPpt for spotters
Physical properties of dental materials
Physical properties of dental materialsPhysical properties of dental materials
Physical properties of dental materials
Shruti MISHRA
 
Corticosteroids in dentistry
Corticosteroids in dentistryCorticosteroids in dentistry
Corticosteroids in dentistry
DrUshaVyasBohra
 
Activator , bionator /certified fixed orthodontic courses by Indian dental ac...
Activator , bionator /certified fixed orthodontic courses by Indian dental ac...Activator , bionator /certified fixed orthodontic courses by Indian dental ac...
Activator , bionator /certified fixed orthodontic courses by Indian dental ac...
Indian dental academy
 
Elastics in orthodontics /certified fixed orthodontic courses by Indian denta...
Elastics in orthodontics /certified fixed orthodontic courses by Indian denta...Elastics in orthodontics /certified fixed orthodontic courses by Indian denta...
Elastics in orthodontics /certified fixed orthodontic courses by Indian denta...
Indian dental academy
 
Elastic impression materials /certified fixed orthodontic courses by Indian d...
Elastic impression materials /certified fixed orthodontic courses by Indian d...Elastic impression materials /certified fixed orthodontic courses by Indian d...
Elastic impression materials /certified fixed orthodontic courses by Indian d...
Indian dental academy
 
Calcium metabolism/ oral surgery courses /certified fixed orthodontic courses...
Calcium metabolism/ oral surgery courses /certified fixed orthodontic courses...Calcium metabolism/ oral surgery courses /certified fixed orthodontic courses...
Calcium metabolism/ oral surgery courses /certified fixed orthodontic courses...
Indian dental academy
 
Mouth preperation
Mouth preperationMouth preperation
Mouth preperation
IAU Dent
 
Nerve injuries seen in oral and maxillofacial surgery
Nerve injuries seen in oral and maxillofacial surgeryNerve injuries seen in oral and maxillofacial surgery
Nerve injuries seen in oral and maxillofacial surgery
VinodThangaswamyS
 
Mandibular movements
Mandibular movementsMandibular movements
Mandibular movements
SwetaAnand13
 
Trajectories and rotations
Trajectories and rotationsTrajectories and rotations
Trajectories and rotations
Indian dental academy
 
Alloys in prosthodontics
Alloys in prosthodonticsAlloys in prosthodontics
Alloys in prosthodontics
AniketShinde102
 
Composite resin /certified fixed orthodontic courses by Indian dental academy
Composite resin /certified fixed orthodontic courses by Indian dental academy Composite resin /certified fixed orthodontic courses by Indian dental academy
Composite resin /certified fixed orthodontic courses by Indian dental academy
Indian dental academy
 
Fixed functional appliances /certified fixed orthodontic courses by Indian de...
Fixed functional appliances /certified fixed orthodontic courses by Indian de...Fixed functional appliances /certified fixed orthodontic courses by Indian de...
Fixed functional appliances /certified fixed orthodontic courses by Indian de...
Indian dental academy
 
Local Anaesthetics
Local AnaestheticsLocal Anaesthetics
Local Anaesthetics
http://neigrihms.gov.in/
 

What's hot (20)

OCCLUSION IN COMPLETE DENTURES.pptx
OCCLUSION IN COMPLETE DENTURES.pptxOCCLUSION IN COMPLETE DENTURES.pptx
OCCLUSION IN COMPLETE DENTURES.pptx
 
Clasps in orthodontics /certified fixed orthodontic courses by Indian dental ...
Clasps in orthodontics /certified fixed orthodontic courses by Indian dental ...Clasps in orthodontics /certified fixed orthodontic courses by Indian dental ...
Clasps in orthodontics /certified fixed orthodontic courses by Indian dental ...
 
Mechanical and physical properties of Prosthodontic materials
Mechanical and physical properties of Prosthodontic materialsMechanical and physical properties of Prosthodontic materials
Mechanical and physical properties of Prosthodontic materials
 
Anatomy and physiology of denture bearing areas /certified fixed orthodontic...
Anatomy and physiology of denture bearing areas  /certified fixed orthodontic...Anatomy and physiology of denture bearing areas  /certified fixed orthodontic...
Anatomy and physiology of denture bearing areas /certified fixed orthodontic...
 
Properties of orthodontic wires /certified fixed orthodontic courses by India...
Properties of orthodontic wires /certified fixed orthodontic courses by India...Properties of orthodontic wires /certified fixed orthodontic courses by India...
Properties of orthodontic wires /certified fixed orthodontic courses by India...
 
Ppt for spotters
Ppt for spottersPpt for spotters
Ppt for spotters
 
Physical properties of dental materials
Physical properties of dental materialsPhysical properties of dental materials
Physical properties of dental materials
 
Corticosteroids in dentistry
Corticosteroids in dentistryCorticosteroids in dentistry
Corticosteroids in dentistry
 
Activator , bionator /certified fixed orthodontic courses by Indian dental ac...
Activator , bionator /certified fixed orthodontic courses by Indian dental ac...Activator , bionator /certified fixed orthodontic courses by Indian dental ac...
Activator , bionator /certified fixed orthodontic courses by Indian dental ac...
 
Elastics in orthodontics /certified fixed orthodontic courses by Indian denta...
Elastics in orthodontics /certified fixed orthodontic courses by Indian denta...Elastics in orthodontics /certified fixed orthodontic courses by Indian denta...
Elastics in orthodontics /certified fixed orthodontic courses by Indian denta...
 
Elastic impression materials /certified fixed orthodontic courses by Indian d...
Elastic impression materials /certified fixed orthodontic courses by Indian d...Elastic impression materials /certified fixed orthodontic courses by Indian d...
Elastic impression materials /certified fixed orthodontic courses by Indian d...
 
Calcium metabolism/ oral surgery courses /certified fixed orthodontic courses...
Calcium metabolism/ oral surgery courses /certified fixed orthodontic courses...Calcium metabolism/ oral surgery courses /certified fixed orthodontic courses...
Calcium metabolism/ oral surgery courses /certified fixed orthodontic courses...
 
Mouth preperation
Mouth preperationMouth preperation
Mouth preperation
 
Nerve injuries seen in oral and maxillofacial surgery
Nerve injuries seen in oral and maxillofacial surgeryNerve injuries seen in oral and maxillofacial surgery
Nerve injuries seen in oral and maxillofacial surgery
 
Mandibular movements
Mandibular movementsMandibular movements
Mandibular movements
 
Trajectories and rotations
Trajectories and rotationsTrajectories and rotations
Trajectories and rotations
 
Alloys in prosthodontics
Alloys in prosthodonticsAlloys in prosthodontics
Alloys in prosthodontics
 
Composite resin /certified fixed orthodontic courses by Indian dental academy
Composite resin /certified fixed orthodontic courses by Indian dental academy Composite resin /certified fixed orthodontic courses by Indian dental academy
Composite resin /certified fixed orthodontic courses by Indian dental academy
 
Fixed functional appliances /certified fixed orthodontic courses by Indian de...
Fixed functional appliances /certified fixed orthodontic courses by Indian de...Fixed functional appliances /certified fixed orthodontic courses by Indian de...
Fixed functional appliances /certified fixed orthodontic courses by Indian de...
 
Local Anaesthetics
Local AnaestheticsLocal Anaesthetics
Local Anaesthetics
 

Similar to Applying data science to sales pipelines - for fun and profit

MLSEV. Machine Learning: Business Perspective
MLSEV. Machine Learning: Business PerspectiveMLSEV. Machine Learning: Business Perspective
MLSEV. Machine Learning: Business Perspective
BigML, Inc
 
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
MLconf
 
Applying data science to sales pipelines -- for fun and profit
Applying data science to sales pipelines -- for fun and profitApplying data science to sales pipelines -- for fun and profit
Applying data science to sales pipelines -- for fun and profit
Andy Twigg
 
Applying Data Science - for Fun and Profit
Applying Data Science - for Fun and ProfitApplying Data Science - for Fun and Profit
Applying Data Science - for Fun and Profit
C9 Inc
 
Embracing a Taxonomy of Types to Simplify Machine Learning with Leah McGuire
Embracing a Taxonomy of Types to Simplify Machine Learning with Leah McGuireEmbracing a Taxonomy of Types to Simplify Machine Learning with Leah McGuire
Embracing a Taxonomy of Types to Simplify Machine Learning with Leah McGuire
Databricks
 
Marketing Concepts - Go-to-Market Templates
Marketing Concepts - Go-to-Market TemplatesMarketing Concepts - Go-to-Market Templates
Marketing Concepts - Go-to-Market Templates
Four Quadrant LLC
 
H2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
H2O World - Solving Customer Churn with Machine Learning - Julian BharadwajH2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
H2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
Sri Ambati
 
AI Builder Deepdive DynamicsPower! Brussels 2019
AI Builder Deepdive DynamicsPower! Brussels 2019AI Builder Deepdive DynamicsPower! Brussels 2019
AI Builder Deepdive DynamicsPower! Brussels 2019
Rebekka Aalbers-de Jong
 
Transforming B2B Sales with Spark-Powered Sales Intelligence with Songtao Guo...
Transforming B2B Sales with Spark-Powered Sales Intelligence with Songtao Guo...Transforming B2B Sales with Spark-Powered Sales Intelligence with Songtao Guo...
Transforming B2B Sales with Spark-Powered Sales Intelligence with Songtao Guo...
Databricks
 
Spark summit 2017- Transforming B2B sales with Spark powered sales intelligence
Spark summit 2017- Transforming B2B sales with Spark powered sales intelligenceSpark summit 2017- Transforming B2B sales with Spark powered sales intelligence
Spark summit 2017- Transforming B2B sales with Spark powered sales intelligence
Wei Di
 
Building Wall St Risk Systems with Apache Geode
Building Wall St Risk Systems with Apache GeodeBuilding Wall St Risk Systems with Apache Geode
Building Wall St Risk Systems with Apache Geode
Andre Langevin
 
#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode
#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode
#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode
PivotalOpenSourceHub
 
REX Hadoop et R
REX Hadoop et RREX Hadoop et R
REX Hadoop et R
pkernevez
 
Transforming B2B Sales with Spark Powered Sales Intelligence
Transforming B2B Sales with Spark Powered Sales IntelligenceTransforming B2B Sales with Spark Powered Sales Intelligence
Transforming B2B Sales with Spark Powered Sales Intelligence
Songtao Guo
 
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
Dataconomy Media
 
Big Data Berlin - Criteo
Big Data Berlin - CriteoBig Data Berlin - Criteo
Big Data Berlin - Criteo
Sofian Djamaa
 
Data science at InsideSales.com
Data science at InsideSales.comData science at InsideSales.com
Data science at InsideSales.com
Andy Twigg
 
Applying data science to sales pipelines — for fun and profit
 Applying data science to sales pipelines — for fun and profit Applying data science to sales pipelines — for fun and profit
Applying data science to sales pipelines — for fun and profit
Turi, Inc.
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsVivastream
 

Similar to Applying data science to sales pipelines - for fun and profit (20)

MLSEV. Machine Learning: Business Perspective
MLSEV. Machine Learning: Business PerspectiveMLSEV. Machine Learning: Business Perspective
MLSEV. Machine Learning: Business Perspective
 
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
 
Applying data science to sales pipelines -- for fun and profit
Applying data science to sales pipelines -- for fun and profitApplying data science to sales pipelines -- for fun and profit
Applying data science to sales pipelines -- for fun and profit
 
Applying Data Science - for Fun and Profit
Applying Data Science - for Fun and ProfitApplying Data Science - for Fun and Profit
Applying Data Science - for Fun and Profit
 
Embracing a Taxonomy of Types to Simplify Machine Learning with Leah McGuire
Embracing a Taxonomy of Types to Simplify Machine Learning with Leah McGuireEmbracing a Taxonomy of Types to Simplify Machine Learning with Leah McGuire
Embracing a Taxonomy of Types to Simplify Machine Learning with Leah McGuire
 
Marketing Concepts - Go-to-Market Templates
Marketing Concepts - Go-to-Market TemplatesMarketing Concepts - Go-to-Market Templates
Marketing Concepts - Go-to-Market Templates
 
Sharp electronics corporation
Sharp electronics corporationSharp electronics corporation
Sharp electronics corporation
 
H2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
H2O World - Solving Customer Churn with Machine Learning - Julian BharadwajH2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
H2O World - Solving Customer Churn with Machine Learning - Julian Bharadwaj
 
AI Builder Deepdive DynamicsPower! Brussels 2019
AI Builder Deepdive DynamicsPower! Brussels 2019AI Builder Deepdive DynamicsPower! Brussels 2019
AI Builder Deepdive DynamicsPower! Brussels 2019
 
Transforming B2B Sales with Spark-Powered Sales Intelligence with Songtao Guo...
Transforming B2B Sales with Spark-Powered Sales Intelligence with Songtao Guo...Transforming B2B Sales with Spark-Powered Sales Intelligence with Songtao Guo...
Transforming B2B Sales with Spark-Powered Sales Intelligence with Songtao Guo...
 
Spark summit 2017- Transforming B2B sales with Spark powered sales intelligence
Spark summit 2017- Transforming B2B sales with Spark powered sales intelligenceSpark summit 2017- Transforming B2B sales with Spark powered sales intelligence
Spark summit 2017- Transforming B2B sales with Spark powered sales intelligence
 
Building Wall St Risk Systems with Apache Geode
Building Wall St Risk Systems with Apache GeodeBuilding Wall St Risk Systems with Apache Geode
Building Wall St Risk Systems with Apache Geode
 
#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode
#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode
#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode
 
REX Hadoop et R
REX Hadoop et RREX Hadoop et R
REX Hadoop et R
 
Transforming B2B Sales with Spark Powered Sales Intelligence
Transforming B2B Sales with Spark Powered Sales IntelligenceTransforming B2B Sales with Spark Powered Sales Intelligence
Transforming B2B Sales with Spark Powered Sales Intelligence
 
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
 
Big Data Berlin - Criteo
Big Data Berlin - CriteoBig Data Berlin - Criteo
Big Data Berlin - Criteo
 
Data science at InsideSales.com
Data science at InsideSales.comData science at InsideSales.com
Data science at InsideSales.com
 
Applying data science to sales pipelines — for fun and profit
 Applying data science to sales pipelines — for fun and profit Applying data science to sales pipelines — for fun and profit
Applying data science to sales pipelines — for fun and profit
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisions
 

Recently uploaded

Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 

Recently uploaded (20)

Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 

Applying data science to sales pipelines - for fun and profit

  • 1. Applying data science to sales pipelines – for fun and profit! ! Andy Twigg! CTO!
  • 2. Data! science! Domain! expertise! Machine! learning! Data! •  62B sales pipeline records! •  Structured, unstructured! •  3rd party public data! •  Fine-grained temporal data! Deep expertise:! •  sales! •  forecasting! •  revenue models! •  Automated ML infrastructure! •  ML models tuned for specific problems!
  • 4. DATA SCIENCE @ C9! •  Opportunity Scoring! •  What is Pr(win) for this deal?! •  What is Pr(win in quarter) for this deal?! •  How does this compare to sales team commits?! •  Which deals can we influence most?! •  Forecasting! •  How much will we close this quarter?!
  • 5. SALES PIPELINES & OPPORTUNITIES! •  Opportunities are temporal creatures; while ‘open’ they proceed through a number of observations and terminate in one of a discrete set of ‘closed’ states – typically won or lost! •  Usually they proceed through ‘stages’, except:! •  An opportunity can be entered into the CRM system as closed (no open observations)! •  Stages are only a partial order - can skip / revisit stages! •  Can be re-opened after closed! •  As the opportunity evolves, we get more and more data about the opportunity! •  A pipeline is a set of open opportunities! Lead created! Stage: Qualifying! Email sent! Email sent: response! Amount= $1000! Call! Stage: demo! Meeting! Demo! Push close date! Stage: negotiation! Closed/ won! Reopened Amount= $2000! Closed/won!
  • 6. ANATOMY OF AN OPPTY!
  • 7. ANATOMY OF AN OPPTY! Pushed out Pulled back in Final outcome: won Committed here (by the sales rep)
  • 8. ANATOMY OF AN OPPTY! Pushed out Pulled back in Final outcome: won Committed here (by the sales rep) Predicted won from the start Predicted won in the correct quarter
  • 9. PREDICTIVE ENGINE! Build a fine-grained history of closed opportunities •  Cleaning! •  Preprocessing! •  Featurizing! •  ~10 GB/customer! •  ~ 1M training rows! ! 1,000s of raw signals per opportunity •  Structured (CRM, ERP)! •  Unstructured (NLP)! •  Firmographic! •  Gov sources! •  SEC filings! •  Crunchbase! •  …! ! Identify historic deals with similar behavior Continuously re-score opportunities as they evolve Update model as opportunities close •  Fully-automated model rebuilding and scoring platform! •  Model input features:! •  Historic observations of opportunity! •  Sales-specific features e.g. momentum! •  Temporal features e.g. std(amount over last 30 days)! •  Industry-wide features e.g. avg_sales_cycle(target)! •  Continuously cross-validated model tuning! •  Extensible, scalable platform using Hadoop (HDFS), Python!
  • 10. Win/Loss Model (Random Forest) Estimate Pr(win) Duration Model (Poisson Regression) Estimate Pr(win in quarter) Influencer Model (Linear) Positive/Negative Drivers •  Standard Features •  Temporal Features •  Derived Features BEHIND THE SCORES!
  • 11. ©2014 All Rights Reserved
  • 12. ©2014 All Rights Reserved
  • 13.
  • 14. sales team: good precision (~70-80%) but poor recall (~10-40%)! C9 won precision ~ sales team won precision! C9 won recall ~ 3 x sales team won recall! First observation Last observation precision recall f1 precision recall f1 C9 scoring 0.65 0.86 0.74 0.75 0.93 0.83 Commit 0.70 0.07 0.13 0.87 0.45 0.59
  • 15.
  • 16. FORECASTING: TOP-DOWN VS BOTTOM-UP! Top-down: Predict current quarter based on previous quarters! ! Accounts for seasonality and trending! ! Ignores state of current pipeline! 0.0e+002.5e+08 observed 5.0e+072.5e+08 trend −5e+065e+06 seasonal −1e+075e+06 2013.0 2013.2 2013.4 2013.6 2013.8 2014.0 2014.2 2014.4 random Time Decomposition of additive time series Bottom-up: Predict current quarter based on currently open pipeline! ! Considers quality of deals in pipe! ! Ignores trends, deals not in pipe! $265,410! $157,000 77% $200,000 37% $82,000 86% +! -! +! -!
  • 17. HYBRID FORECASTING! top down + bottom up! 20 40 60 11 10 9 8 7 6 5 4 3 2 1 Weeks to EOQ Amount($M) C9 Final Amount Actual Amount Amount Forecast •  Augment time-series model with side information from bottom-up model, e.g.:! •  Amount predicted to close in current quarter! •  Average score of currently open opportunities! •  Average predicted days to close! ! •  Sometimes known as ARIMAX! log(yt) ∼ 12 i=1 log(yt−i) + log(x (1) t−12) + log(x (2) t−12) + log(x (3) t−12)

Editor's Notes

  1. why is data science work @ c9 different to other MLaaS/BI/SaaS companies? it’s ML put to work for a specific application, using domain knowledge
  2. Answering each of these questions uses different techniques
  3. Let’s dig deeper into what goes into the scores. We take more than 1,000 raw signals per opportunity – this is structured CRM data, unstructured text data (NLP), firmographic data including government sources (registration filings, credit unions, SEC filings), and other sources such as job postings, crunchbase, etc. This data is noisy so a lot of effort goes into cleaning and preparing this data, removing highly correlated subsets of fields, matching across tables, etc. Finally, this preprocessed data is transformed into a form that can be used for machine learning. For each customer, we have a large, fine-grained history of closed opportunities, stored in a temporal database. Each customer will typically have around several GB of raw historical data. The predictive model pipeline is fully automated and runs in production using HDFS, Java and Python. The This shows some of the detail surfaced by our predictive models for a specific opportunity. We surface three main results: Is the deal likely to win? Is it likely to be win this quarter? What are the main positive/negative influencers? Can I win: Random forest model, estimating Pr(win) regardless of time horizon # signals ~ 200-500 raw fields, covering > 1000 signals after cleaning Structured CRM data, NLP on unstructured data, ~30 government sources (registration filings, credit unions, SEC filings - # employees, revenue), crunchbase, NLP Can I win this quarter: We construct a duration model that attempts to estimate the number of days until the opportunity will close, assuming the eventual outcome is a win. We then transform this into an estimate of Pr(win in quarter). Key indicators: We use a model to that takes in specific influencing features, such as ‘budget secured, deal momentum’ etc. It includes both standard features in CRM data (eg industry), but also domain-specific and temporal features (eg ‘momentum’, ‘amount is fluctuating’) It then surfaces the 3 features that most strongly positively or negatively influence the overall score of the deal Constructing these fields requires domain expertise – in knowing what measures are importance, and which are also actionable by the managers and sales reps
  4. This shows clearly the temporal nature of an opportunity Opportunities are (usually) either open/closed Usually proceed through stages, but: Can go straight to closed Stages are only a partial order - can skip / revisit stages Can be re-opened after closed As the opportunity evolves, we get more and more data A pipeline is a set of open opportunities
  5. Let’s dig deeper into what goes into the scores. We take more than 1,000 raw signals per opportunity – this is structured CRM data, unstructured text data (NLP), firmographic data including government sources (registration filings, credit unions, SEC filings), and other sources such as job postings, crunchbase, etc. This data is noisy so a lot of effort goes into cleaning and preparing this data, removing highly correlated subsets of fields, matching across tables, etc. Finally, this preprocessed data is transformed into a form that can be used for machine learning. For each customer, we have a large, fine-grained history of closed opportunities, stored in a temporal database. Each customer will typically have around several GB of raw historical data. The predictive model pipeline is fully automated and runs in production using HDFS, Java and Python. The This shows some of the detail surfaced by our predictive models for a specific opportunity. We surface three main results: Is the deal likely to win? Is it likely to be win this quarter? What are the main positive/negative influencers? Can I win: Random forest model, estimating Pr(win) regardless of time horizon # signals ~ 200-500 raw fields, covering > 1000 signals after cleaning Structured CRM data, NLP on unstructured data, ~30 government sources (registration filings, credit unions, SEC filings - # employees, revenue), crunchbase, NLP Can I win this quarter: We construct a duration model that attempts to estimate the number of days until the opportunity will close, assuming the eventual outcome is a win. We then transform this into an estimate of Pr(win in quarter). Key indicators: We use a model to that takes in specific influencing features, such as ‘budget secured, deal momentum’ etc. It includes both standard features in CRM data (eg industry), but also domain-specific and temporal features (eg ‘momentum’, ‘amount is fluctuating’) It then surfaces the 3 features that most strongly positively or negatively influence the overall score of the deal Constructing these fields requires domain expertise – in knowing what measures are importance, and which are also actionable by the managers and sales reps
  6. Let’s dig deeper into what goes into the scores. We take more than 1,000 raw signals per opportunity – this is structured CRM data, unstructured text data (NLP), firmographic data including government sources (registration filings, credit unions, SEC filings), and other sources such as job postings, crunchbase, etc. This data is noisy so a lot of effort goes into cleaning and preparing this data, removing highly correlated subsets of fields, matching across tables, etc. Finally, this preprocessed data is transformed into a form that can be used for machine learning. For each customer, we have a large, fine-grained history of closed opportunities, stored in a temporal database. Each customer will typically have around several GB of raw historical data. The predictive model pipeline is fully automated and runs in production using HDFS, Java and Python. The This shows some of the detail surfaced by our predictive models for a specific opportunity. We surface three main results: Is the deal likely to win? Is it likely to be win this quarter? What are the main positive/negative influencers? Can I win: Random forest model, estimating Pr(win) regardless of time horizon # signals ~ 200-500 raw fields, covering > 1000 signals after cleaning Structured CRM data, NLP on unstructured data, ~30 government sources (registration filings, credit unions, SEC filings - # employees, revenue), crunchbase, NLP Can I win this quarter: We construct a duration model that attempts to estimate the number of days until the opportunity will close, assuming the eventual outcome is a win. We then transform this into an estimate of Pr(win in quarter). Key indicators: We use a model to that takes in specific influencing features, such as ‘budget secured, deal momentum’ etc. It includes both standard features in CRM data (eg industry), but also domain-specific and temporal features (eg ‘momentum’, ‘amount is fluctuating’) It then surfaces the 3 features that most strongly positively or negatively influence the overall score of the deal Constructing these fields requires domain expertise – in knowing what measures are importance, and which are also actionable by the managers and sales reps
  7. what’s the competition? reps’ commits we have found that typically, sales people have good won precision (~70%) but poor recall (30-45%) but they aren’t good at identifying _which_ deals will close we can identify about 2x as many deals that will eventually win, at the start of the quarter def f1(p,r): return 2.0*(p*r)/(p+r) Many customers follow this pattern.
  8. world’s most interesting salesman
  9. Slide Objective: Present the unique capabilities of C9 OppScore, show those capabilities in the demo and then connect those capabilities to business benefit. Questions: Let’s dive into the product. We’ll start by talking about how C9 OppScore helps reps determine which deals they should focus on and the specific actions they can take to improve the probability of winning. How much quality coaching time do managers have with reps each week? How do your reps prioritize which deals to pursue and how do they determine when to walk away? How long does it take a new rep to ramp and what are you doing to enable new reps? How do you ensure they reps are identify and execute on the sales strategies that have had the highest rate of success? Talking Points: C9 can help you address all of these points. In the next few minutes you’re going to see how C9 allows you understand how data science can be used to get reps focused on the right deals and applying the best actions to close those deals. After the demo recap the business benefits
  10. Slide Objective: Present the unique capabilities of C9 OppScore, show those capabilities in the demo and then connect those capabilities to business benefit. Questions: Let’s dive into the product. We’ll start by talking about how C9 OppScore helps reps determine which deals they should focus on and the specific actions they can take to improve the probability of winning. How much quality coaching time do managers have with reps each week? How do your reps prioritize which deals to pursue and how do they determine when to walk away? How long does it take a new rep to ramp and what are you doing to enable new reps? How do you ensure they reps are identify and execute on the sales strategies that have had the highest rate of success? Talking Points: C9 can help you address all of these points. In the next few minutes you’re going to see how C9 allows you understand how data science can be used to get reps focused on the right deals and applying the best actions to close those deals. After the demo recap the business benefits