SlideShare a Scribd company logo
1 of 17
Download to read offline
© 2013, PAYCHEX, Inc. All rights reserved.
Retention Modeling in
Uncertain Economic
Times
Mark Heiler
Data Scientist
Paychex, Inc.
© 2013, PAYCHEX, Inc. All rights reserved.
History of Paychex
1983
1971
Market Cap
$18 Billion
Total Revenue
$2.7 Billion
A leading provider of integrated
human capital management,
providing businesses the
freedom to succeed.
Today
Data Science @ Paychex
3
Data
Science
Team
Marketing
Sales
Information
Technology
FinanceRisk
Human
Resources
Service
Strategic Client Retention
 2009: developed and deployed first predictive retention model
4
2009 2010 201620152014201320122011
V1 V2 V3 V4
 2012: Clients with retention strategy applied we’re 80% less likely to leave
Controllable Losses
 Price
 Service Issue
 Dissatisfaction w/ Product
Uncontrollable Losses Out of Business
Business Acquired
Business Change
Out of Business
Business Acquired
Business Change
 Out of Business
 Business Acquired
 Business Change
𝑃 𝐶𝑙𝑖𝑒𝑛𝑡 𝐿𝑜𝑠𝑠 𝑿𝑖 ) =
1
1 + 𝑒−(β𝑿 𝑖)
Retention Tracking System
F
• Business 1: Annual Est. Revenue
• Business 2: Annual Est. Revenue
• …
D
• Business 3: Annual Est. Revenue
• Business 4: Annual Est. Revenue
• …
- Retention specialists proactively
call clients based on retention
score to mitigate potential issues
- Client payroll liability
- Employee counts
- ...
- Business Type
- Geographic
- Credit Info
- …
Model
Scores
Clients
Monthly
Inputs Output
- Probability
of Client
leaving
Paychex
600K clients; 8 full-time equivalent retention specialists
National Retention Team
Problem: Concept Drift
6
The underlying assumptions and distributions change over time
with respect to predicting losses
time
 Population Stability Index (PSI) scores help us identify shifts in the
distributions of the variables in our models
 What happens when the relationship with the target changes?
Example of Concept Drift
7
-0.8
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
1.2
1.4
Date
11/1/2012
12/1/2012
1/1/2013
2/1/2013
3/1/2013
4/1/2013
5/1/2013
6/1/2013
7/1/2013
8/1/2013
9/1/2013
10/1/2013
11/1/2013
12/1/2013
1/1/2014
2/1/2014
3/1/2014
4/1/2014
5/1/2014
6/1/2014
7/1/2014
8/1/2014
9/1/2014
10/1/2014
11/1/2014
12/1/2014
1/1/2015
2/1/2015
3/1/2015
4/1/2015
5/1/2015
6/1/2015
7/1/2015
8/1/2015
9/1/2015
10/1/2015
11/1/2015
12/1/2015
1/1/2016
2/1/2016
3/1/2016
4/1/2016
5/1/2016
6/1/2016
7/1/2016
8/1/2016
9/1/2016
10/1/2016
11/1/2016
12/1/2016
1/1/2017
2/1/2017
3/1/2017
California Accommodation & Food Services:
Year over Year Change in Losses
yoy 6 per. Mov. Avg. (yoy)
Solution: Dynamic Model Averaging*
8
Dynamic
 Adapts to account for concept drift
 Allows for the inclusion of new variables as they become available
Model Averaging
 Use an ensemble of historical models to estimate probability of client loss
 Establishing a weighting heuristic to apply each model
* Not Associated w/ Raftery et al. (2010)
Model Averaging
9
 Tested four alternatives:
Dynamic Model Averaging
Regression Combiner
Use a weighted average base on ridge regression
,t-12 t-3*
t-12 t-11 t-4 t-3…
Average Model Scores From Each Model
Baseline Dynamic
,t-3*
Use Most Recent Model
One Year Prior Dynamic
,t-12*
Use Model from One Year Prior
 The best performance
achieved by the averaged
scores of a trend model (t-3)
and seasonal model (t-12)
Dynamic Variables Inclusion
 Access to data is continually improving. How can we adapt to this?
 Just join new variables to production data set!
 What if variables are discontinued?
 We can identify which variables were used in our model 12 months ago
but aren’t in our current scoring data set.
 Add zero-filled column (not optimal… can retrain if needed)
10
0
0
⋮
Model Fitting: Logistic Regression via
LASSO (aka Elastic Net)
 Least Absolute Shrinkage and Selection Operator(LASSO)
 Regularize logistic regression
 Standardize variables (scale them to mean = 0 and sd = 1)
 In traditional modeling, we choose to either include or exclude a variable in a
model (domain knowledge, AIC/BIC)
 LASSO automatically includes all variables in the model, but shrinks
their impact on the model if it is not important
 By shrinking the coefficients in the model, we can drive some
coefficients to zero.
11
Example LASSO Plot
12
• A vertical line represents a cross-
sectional view of the coefficients for
a model.
• The red line indicates a chosen
parameter/model. It is a regression
with 3 variables (3 coefficients)
 Lcavol: 0.55
 Weight: 0.17
 Svi : 0.09
• With a very large shrinkage factor,
the sum of usual regression
coefficients are less than the
constraint, so this is exactly the
same as using a regular regression
• With a very small shrinkage factor,
the coefficients must sum to zero, so
there are no coefficients
Variables*
13
Continuous Variables
 Trend: Mann-Kendall Trend Test Statistic
 Standard Deviation
 Average
 Demand Index
 Length of Service transformed using a Spline
Categorical Variables
 # of Discount Products > # of Billed Products – {0,1}
 Has Penalty in last 6 months & LOS < 2 years – {0,1}
 Has Case in last 6 months & LOS < 2 years – {0,1}
 Has Return in last 6 months & LOS < 2 years – {0,1}
 Start Month - {1,…,12}
 401k, Taxpay, Time & Attendance Indicators – {0,1}
* Final variables(w/ non Zero Coefficients) will change month by month
Performance Tracking
14
Model
Performance
Technical
Discriminative
Ability of Model
 ROC Curve
 Brier score
Diagnostic
 Variable
Importance
 Client Score
Shifts
Business
Strategic
Diagnostics
 Calls / Save
Model Valuation
 $$$
 Proper
resource
allocation
Technical Performance Tracking
15
Next Steps
16
 Experiment with scoring
 Update weighting heuristic
 Apply strategy to other model developments
© 2013, PAYCHEX, Inc. All rights reserved.
Thank You
Mark Heiler
Data Scientist
mheiler@paychex.com

More Related Content

What's hot

1555 track 1 huang_using his mac
1555 track 1 huang_using his mac1555 track 1 huang_using his mac
1555 track 1 huang_using his macRising Media, Inc.
 
840 plenary elder_using his laptop
840 plenary elder_using his laptop840 plenary elder_using his laptop
840 plenary elder_using his laptopRising Media, Inc.
 
1530 track 3 gunther_using our laptop
1530 track 3 gunther_using our laptop1530 track 3 gunther_using our laptop
1530 track 3 gunther_using our laptopRising Media, Inc.
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptopRising Media, Inc.
 
The Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision ModelingThe Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision ModelingDecision Management Solutions
 
1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptop1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptopRising Media, Inc.
 
Predictive Analysis PowerPoint Presentation Slides
Predictive Analysis PowerPoint Presentation SlidesPredictive Analysis PowerPoint Presentation Slides
Predictive Analysis PowerPoint Presentation SlidesSlideTeam
 

What's hot (20)

1315 keynote jopia_shareable
1315 keynote jopia_shareable1315 keynote jopia_shareable
1315 keynote jopia_shareable
 
1555 track 1 huang_using his mac
1555 track 1 huang_using his mac1555 track 1 huang_using his mac
1555 track 1 huang_using his mac
 
1120 track1 taylor
1120 track1 taylor1120 track1 taylor
1120 track1 taylor
 
1055 track3 soules
1055 track3 soules1055 track3 soules
1055 track3 soules
 
1645 track2 short
1645 track2 short1645 track2 short
1645 track2 short
 
840 plenary elder_using his laptop
840 plenary elder_using his laptop840 plenary elder_using his laptop
840 plenary elder_using his laptop
 
1000 track1 gland_sims
1000 track1 gland_sims1000 track1 gland_sims
1000 track1 gland_sims
 
1530 track1 rosenbaum
1530 track1 rosenbaum1530 track1 rosenbaum
1530 track1 rosenbaum
 
1530 track 3 gunther_using our laptop
1530 track 3 gunther_using our laptop1530 track 3 gunther_using our laptop
1530 track 3 gunther_using our laptop
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop
 
1645 track 3 porter
1645 track 3 porter1645 track 3 porter
1645 track 3 porter
 
The Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision ModelingThe Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision Modeling
 
1440 track3 galusha
1440 track3 galusha1440 track3 galusha
1440 track3 galusha
 
1030 track3 boire
1030 track3 boire1030 track3 boire
1030 track3 boire
 
Causal Inference in Marketing
Causal Inference in MarketingCausal Inference in Marketing
Causal Inference in Marketing
 
Data driven decision making
Data driven decision makingData driven decision making
Data driven decision making
 
Predictive Model
Predictive ModelPredictive Model
Predictive Model
 
1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptop1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptop
 
Master Of Science Dissertation
Master Of Science DissertationMaster Of Science Dissertation
Master Of Science Dissertation
 
Predictive Analysis PowerPoint Presentation Slides
Predictive Analysis PowerPoint Presentation SlidesPredictive Analysis PowerPoint Presentation Slides
Predictive Analysis PowerPoint Presentation Slides
 

Similar to Retention Modeling in Uncertain Economic Times

Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPranov Mishra
 
Useful Tools for Problem Solving by Operational Excellence Consulting
Useful Tools for Problem Solving by Operational Excellence ConsultingUseful Tools for Problem Solving by Operational Excellence Consulting
Useful Tools for Problem Solving by Operational Excellence ConsultingOperational Excellence Consulting
 
Data Mining Problems in Retail
Data Mining Problems in RetailData Mining Problems in Retail
Data Mining Problems in RetailIlya Katsov
 
six-sigma2040.ppt
six-sigma2040.pptsix-sigma2040.ppt
six-sigma2040.pptOnurUyar6
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network ModelEric Esajian
 
Improving the performance of Telco Churn Predictive Model with SPSS & 6 Sigma
Improving the performance of Telco Churn Predictive Model with SPSS & 6 SigmaImproving the performance of Telco Churn Predictive Model with SPSS & 6 Sigma
Improving the performance of Telco Churn Predictive Model with SPSS & 6 SigmaGeetha Nadarajan
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industryskewdlogix
 
Operations Management VTU BE Mechanical 2015 Solved paper
Operations Management VTU BE Mechanical 2015 Solved paperOperations Management VTU BE Mechanical 2015 Solved paper
Operations Management VTU BE Mechanical 2015 Solved paperSomashekar S.M
 
[MPKD1] Introduction to business analytics and simulation
[MPKD1] Introduction to business analytics and simulation[MPKD1] Introduction to business analytics and simulation
[MPKD1] Introduction to business analytics and simulationNguyen Ngoc Binh Phuong
 

Similar to Retention Modeling in Uncertain Economic Times (18)

Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom Industry
 
Useful Tools for Problem Solving by Operational Excellence Consulting
Useful Tools for Problem Solving by Operational Excellence ConsultingUseful Tools for Problem Solving by Operational Excellence Consulting
Useful Tools for Problem Solving by Operational Excellence Consulting
 
Data Mining Problems in Retail
Data Mining Problems in RetailData Mining Problems in Retail
Data Mining Problems in Retail
 
six-sigma2040.ppt
six-sigma2040.pptsix-sigma2040.ppt
six-sigma2040.ppt
 
Analytics
AnalyticsAnalytics
Analytics
 
6 sigma what is it
6 sigma what is it6 sigma what is it
6 sigma what is it
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network Model
 
Improving the performance of Telco Churn Predictive Model with SPSS & 6 Sigma
Improving the performance of Telco Churn Predictive Model with SPSS & 6 SigmaImproving the performance of Telco Churn Predictive Model with SPSS & 6 Sigma
Improving the performance of Telco Churn Predictive Model with SPSS & 6 Sigma
 
Machine learning project
Machine learning project Machine learning project
Machine learning project
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industry
 
Operations Management VTU BE Mechanical 2015 Solved paper
Operations Management VTU BE Mechanical 2015 Solved paperOperations Management VTU BE Mechanical 2015 Solved paper
Operations Management VTU BE Mechanical 2015 Solved paper
 
[MPKD1] Introduction to business analytics and simulation
[MPKD1] Introduction to business analytics and simulation[MPKD1] Introduction to business analytics and simulation
[MPKD1] Introduction to business analytics and simulation
 
Ch3. Demand Forecasting.ppt
Ch3. Demand Forecasting.pptCh3. Demand Forecasting.ppt
Ch3. Demand Forecasting.ppt
 
Six sigma
Six sigma Six sigma
Six sigma
 
Six sigma pedagogy
Six sigma pedagogySix sigma pedagogy
Six sigma pedagogy
 
6 sigma
6 sigma6 sigma
6 sigma
 
Project selection
Project selectionProject selection
Project selection
 

More from Rising Media, Inc.

1415 track 1 wu_using his laptop
1415 track 1 wu_using his laptop1415 track 1 wu_using his laptop
1415 track 1 wu_using his laptopRising Media, Inc.
 
1620 keynote olson_using our laptop
1620 keynote olson_using our laptop1620 keynote olson_using our laptop
1620 keynote olson_using our laptopRising Media, Inc.
 
1530 track 2 stuart_using our laptop
1530 track 2 stuart_using our laptop1530 track 2 stuart_using our laptop
1530 track 2 stuart_using our laptopRising Media, Inc.
 
1530 track 1 fader_using our laptop
1530 track 1 fader_using our laptop1530 track 1 fader_using our laptop
1530 track 1 fader_using our laptopRising Media, Inc.
 
1215 daa lunch owusu_using our laptop
1215 daa lunch owusu_using our laptop1215 daa lunch owusu_using our laptop
1215 daa lunch owusu_using our laptopRising Media, Inc.
 
1215 daa lunch a bos intro slides_using our laptop
1215 daa lunch a bos intro slides_using our laptop1215 daa lunch a bos intro slides_using our laptop
1215 daa lunch a bos intro slides_using our laptopRising Media, Inc.
 
855 sponsor movassate_using our laptop
855 sponsor movassate_using our laptop855 sponsor movassate_using our laptop
855 sponsor movassate_using our laptopRising Media, Inc.
 
1325 keynote yale_pdf shareable
1325 keynote yale_pdf shareable1325 keynote yale_pdf shareable
1325 keynote yale_pdf shareableRising Media, Inc.
 
905 keynote peele_using our laptop
905 keynote peele_using our laptop905 keynote peele_using our laptop
905 keynote peele_using our laptopRising Media, Inc.
 

More from Rising Media, Inc. (20)

1415 track 1 wu_using his laptop
1415 track 1 wu_using his laptop1415 track 1 wu_using his laptop
1415 track 1 wu_using his laptop
 
Matt gershoff
Matt gershoffMatt gershoff
Matt gershoff
 
Keynote adam greco
Keynote adam grecoKeynote adam greco
Keynote adam greco
 
1620 keynote olson_using our laptop
1620 keynote olson_using our laptop1620 keynote olson_using our laptop
1620 keynote olson_using our laptop
 
1530 track 2 stuart_using our laptop
1530 track 2 stuart_using our laptop1530 track 2 stuart_using our laptop
1530 track 2 stuart_using our laptop
 
1530 track 1 fader_using our laptop
1530 track 1 fader_using our laptop1530 track 1 fader_using our laptop
1530 track 1 fader_using our laptop
 
1415 track 2 richardson
1415 track 2 richardson1415 track 2 richardson
1415 track 2 richardson
 
1215 daa lunch owusu_using our laptop
1215 daa lunch owusu_using our laptop1215 daa lunch owusu_using our laptop
1215 daa lunch owusu_using our laptop
 
1215 daa lunch a bos intro slides_using our laptop
1215 daa lunch a bos intro slides_using our laptop1215 daa lunch a bos intro slides_using our laptop
1215 daa lunch a bos intro slides_using our laptop
 
915 e metrics_claudia perlich
915 e metrics_claudia perlich915 e metrics_claudia perlich
915 e metrics_claudia perlich
 
855 sponsor movassate_using our laptop
855 sponsor movassate_using our laptop855 sponsor movassate_using our laptop
855 sponsor movassate_using our laptop
 
1615 plack using our laptop
1615 plack using our laptop1615 plack using our laptop
1615 plack using our laptop
 
1530 rimmele do not share
1530 rimmele do not share1530 rimmele do not share
1530 rimmele do not share
 
1325 keynote yale_pdf shareable
1325 keynote yale_pdf shareable1325 keynote yale_pdf shareable
1325 keynote yale_pdf shareable
 
1115 fiztgerald schuchardt
1115 fiztgerald schuchardt1115 fiztgerald schuchardt
1115 fiztgerald schuchardt
 
1000 kondic do not share
1000 kondic do not share1000 kondic do not share
1000 kondic do not share
 
905 keynote peele_using our laptop
905 keynote peele_using our laptop905 keynote peele_using our laptop
905 keynote peele_using our laptop
 
Stephen morse sharable
Stephen morse sharableStephen morse sharable
Stephen morse sharable
 
Elder shareable
Elder shareableElder shareable
Elder shareable
 
1115 ramirez using our laptop
1115 ramirez using our laptop1115 ramirez using our laptop
1115 ramirez using our laptop
 

Recently uploaded

Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 

Recently uploaded (20)

Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 

Retention Modeling in Uncertain Economic Times

  • 1. © 2013, PAYCHEX, Inc. All rights reserved. Retention Modeling in Uncertain Economic Times Mark Heiler Data Scientist Paychex, Inc.
  • 2. © 2013, PAYCHEX, Inc. All rights reserved. History of Paychex 1983 1971 Market Cap $18 Billion Total Revenue $2.7 Billion A leading provider of integrated human capital management, providing businesses the freedom to succeed. Today
  • 3. Data Science @ Paychex 3 Data Science Team Marketing Sales Information Technology FinanceRisk Human Resources Service
  • 4. Strategic Client Retention  2009: developed and deployed first predictive retention model 4 2009 2010 201620152014201320122011 V1 V2 V3 V4  2012: Clients with retention strategy applied we’re 80% less likely to leave Controllable Losses  Price  Service Issue  Dissatisfaction w/ Product Uncontrollable Losses Out of Business Business Acquired Business Change Out of Business Business Acquired Business Change  Out of Business  Business Acquired  Business Change 𝑃 𝐶𝑙𝑖𝑒𝑛𝑡 𝐿𝑜𝑠𝑠 𝑿𝑖 ) = 1 1 + 𝑒−(β𝑿 𝑖)
  • 5. Retention Tracking System F • Business 1: Annual Est. Revenue • Business 2: Annual Est. Revenue • … D • Business 3: Annual Est. Revenue • Business 4: Annual Est. Revenue • … - Retention specialists proactively call clients based on retention score to mitigate potential issues - Client payroll liability - Employee counts - ... - Business Type - Geographic - Credit Info - … Model Scores Clients Monthly Inputs Output - Probability of Client leaving Paychex 600K clients; 8 full-time equivalent retention specialists National Retention Team
  • 6. Problem: Concept Drift 6 The underlying assumptions and distributions change over time with respect to predicting losses time  Population Stability Index (PSI) scores help us identify shifts in the distributions of the variables in our models  What happens when the relationship with the target changes?
  • 7. Example of Concept Drift 7 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1 1.2 1.4 Date 11/1/2012 12/1/2012 1/1/2013 2/1/2013 3/1/2013 4/1/2013 5/1/2013 6/1/2013 7/1/2013 8/1/2013 9/1/2013 10/1/2013 11/1/2013 12/1/2013 1/1/2014 2/1/2014 3/1/2014 4/1/2014 5/1/2014 6/1/2014 7/1/2014 8/1/2014 9/1/2014 10/1/2014 11/1/2014 12/1/2014 1/1/2015 2/1/2015 3/1/2015 4/1/2015 5/1/2015 6/1/2015 7/1/2015 8/1/2015 9/1/2015 10/1/2015 11/1/2015 12/1/2015 1/1/2016 2/1/2016 3/1/2016 4/1/2016 5/1/2016 6/1/2016 7/1/2016 8/1/2016 9/1/2016 10/1/2016 11/1/2016 12/1/2016 1/1/2017 2/1/2017 3/1/2017 California Accommodation & Food Services: Year over Year Change in Losses yoy 6 per. Mov. Avg. (yoy)
  • 8. Solution: Dynamic Model Averaging* 8 Dynamic  Adapts to account for concept drift  Allows for the inclusion of new variables as they become available Model Averaging  Use an ensemble of historical models to estimate probability of client loss  Establishing a weighting heuristic to apply each model * Not Associated w/ Raftery et al. (2010)
  • 9. Model Averaging 9  Tested four alternatives: Dynamic Model Averaging Regression Combiner Use a weighted average base on ridge regression ,t-12 t-3* t-12 t-11 t-4 t-3… Average Model Scores From Each Model Baseline Dynamic ,t-3* Use Most Recent Model One Year Prior Dynamic ,t-12* Use Model from One Year Prior  The best performance achieved by the averaged scores of a trend model (t-3) and seasonal model (t-12)
  • 10. Dynamic Variables Inclusion  Access to data is continually improving. How can we adapt to this?  Just join new variables to production data set!  What if variables are discontinued?  We can identify which variables were used in our model 12 months ago but aren’t in our current scoring data set.  Add zero-filled column (not optimal… can retrain if needed) 10 0 0 ⋮
  • 11. Model Fitting: Logistic Regression via LASSO (aka Elastic Net)  Least Absolute Shrinkage and Selection Operator(LASSO)  Regularize logistic regression  Standardize variables (scale them to mean = 0 and sd = 1)  In traditional modeling, we choose to either include or exclude a variable in a model (domain knowledge, AIC/BIC)  LASSO automatically includes all variables in the model, but shrinks their impact on the model if it is not important  By shrinking the coefficients in the model, we can drive some coefficients to zero. 11
  • 12. Example LASSO Plot 12 • A vertical line represents a cross- sectional view of the coefficients for a model. • The red line indicates a chosen parameter/model. It is a regression with 3 variables (3 coefficients)  Lcavol: 0.55  Weight: 0.17  Svi : 0.09 • With a very large shrinkage factor, the sum of usual regression coefficients are less than the constraint, so this is exactly the same as using a regular regression • With a very small shrinkage factor, the coefficients must sum to zero, so there are no coefficients
  • 13. Variables* 13 Continuous Variables  Trend: Mann-Kendall Trend Test Statistic  Standard Deviation  Average  Demand Index  Length of Service transformed using a Spline Categorical Variables  # of Discount Products > # of Billed Products – {0,1}  Has Penalty in last 6 months & LOS < 2 years – {0,1}  Has Case in last 6 months & LOS < 2 years – {0,1}  Has Return in last 6 months & LOS < 2 years – {0,1}  Start Month - {1,…,12}  401k, Taxpay, Time & Attendance Indicators – {0,1} * Final variables(w/ non Zero Coefficients) will change month by month
  • 14. Performance Tracking 14 Model Performance Technical Discriminative Ability of Model  ROC Curve  Brier score Diagnostic  Variable Importance  Client Score Shifts Business Strategic Diagnostics  Calls / Save Model Valuation  $$$  Proper resource allocation
  • 16. Next Steps 16  Experiment with scoring  Update weighting heuristic  Apply strategy to other model developments
  • 17. © 2013, PAYCHEX, Inc. All rights reserved. Thank You Mark Heiler Data Scientist mheiler@paychex.com