SlideShare a Scribd company logo
Subscription Survival Analysis

                  SF Data Mining Meetup
                          San Francisco
                              June 2012




          Jim Porzak – Senior Director, Business Intelligence
What We’ll Cover…
•   Quick introduction to survival analysis.
•   Rational for marketing & business.
•   Subscription businesses past & present.
•   The example data set. How real is it?
•   Calculating survival, average tenure & LTV.
•   Applications of Survival & Hazard curves.
•   Discussion.
•   Appendix (links & details).


                                                  2
Traditional Survival Analysis
• Modeling of “time to event” data
   – Death in biological organisms (medicine)
   – Failure of machines (reliability engineering)
   – Political or sociological change (duration
     analysis)
• These kinds of data share common gotcha:
   – If an “individual” has not yet died, failed, or
     changed, what can we say about the
     expected time to the event?


                                                       3
Trick: Use what we know – everything!




• All have started, some have stopped, & some continue on.
    – “Right censored”
• From this kind of data we can calculate probability of stopping.
    – “Hazard Ratio” = f(tenure, … )
• Probability of survival at tenure, T, is just ( 1 – cumulative hazard(T))

                                                                              4
Classical “Bathtub” Hazard Curve




Source: http://en.wikipedia.org/wiki/Bathtub_curve



                                                     5
The Subscription Business Model
1st Visit   Acquisition    Conversion    Lapse   Re-activation   Today’s #’s
                                                 or Win-back

                                           Re-
                                           activate
                 Engage                          Win-Back
    Acquire
                                Retain                 Retain
                 Convert




                                                                               6
Aside: Subscription Stints – Rational & Method

• For each member we need to know subscription start &, if
  terminated, stop dates for each member:
         Subscriber A


• But data often has the accounting view with gaps in the subscription:
                        CC Issue   Prod Chg   Whatever
         Subscriber A




• From the customer’s perspective, he had one continuous
  subscription. It’s important to take that view since, in general, the
  longer a subscriber is active the lower the risk of churn.
• A subscription stint algorithm typically ignores any gaps up
  to, say, 30 days – which is also taken as the boundary between a
  lapsed subscriber in the “re-activate” vrs “win-back” pools.



                                                                          7
Why Subscription Survival?
• Expected churn probability of new subscribers
   – Project future subscriber count
       • Especially after a successful campaign
   – Define tenure segment boundaries
• Expected tenure of a new subscriber & LTV
   – As function of duration, product, channel, offer
   – Evaluate marketing & site tests by value not rates
• Projected value of retention efforts
   – How improved renewal rates increase value
• A new way of thinking about subscribers
   – Retention metrics, as used in marketing, are deceptive
     (see Appendix slide)


                                                              8
Typical Subscription Survival Curves




Linoff, Gordon (2004) Link in appendix.

                                          9
Assumptions

• Homogeneity of strata
  – Each includes only one segment of customers
• Stability over time
  – Hardest to satisfy?
  – Propensity to renew a function of:
     •   General economic conditions
     •   Offer(s) at start of subscription
     •   Engagement efforts by marketing group
     •   Perceived long term value


                                                  10
Example Data Set
• 100k records:
    –   CustomerID
    –   SubStartDate
    –   SubEndDate (NULL if subscription active)
    –   Duration = [A, Q, M] for Annual, Quarterly, or Monthly
    –   Product = [B, P] for Basic or Professional
    –   Channel = [R, A, S, B, O] for Referral, Affiliate, Search, Banner, or Other

• Randomly generated with day-of-week and week-in-year
  seasonality; 15% year over year growth for starts.
• Randomly generated number of renewals
    –   With different churn rates for A, Q, & M
• Mimics general properties of subscription data from
  Playboy.com, LA Times, 24 Hour Fitness, Chicago Sun
  Times, Ancestry.com, & Viadeo.com
    – Additional parameters could be Offer, Promotion Cohort, …
• Assumes “no-return” policy, i.e. no early refunds.


                                                                                      11
Algorithm – Part 1: Hazard Ratio




          (# terminated during day i)
        ______________________________________
HRi =
        (# active at beginning of day i)

    = 1/4       = 0.25

Remember N is typically huge in a
subscription business so we tend to get
smooth curves.
                                                 12
Algorithm – Part 2 Survival Curve
                                    Hazard Ratio Rate
                                     USDeluxe Domestic Acom Hazard
              0.14




                                                                                                   Duration
                                                                                                   A (N = 204,538)
                                                                                                   M (N = 196,345)
              0.12
              0.10
Hazard Rate

              0.08
              0.06
              0.04
              0.02
              0.00




                     0      200                 400                       600                800              1000

                                                        Tenure (days)
                                    Subscribers since 2006-01-01 (N = 400,883 subscribers)




                                  Average 2-year Tenure                                                                                                              Survival Probability
                                                                                                                                                                      USDeluxe Domestic Acom Subscriber Survival




                                                                                                                                               1.0
                                                                                                                                                                                                                                        Duration
                                                                                                                                                                                                                                        A (N = 204,538)
                                                                                                                                                                                                                                        M (N = 196,345)




                                                                                                                                               0.8
                                                                                                                     Probability of Survival

                                                                                                                                               0.6
                                                                                                                                                         120                        404

                                                                                                                                               0.4
                                                                                                                                               0.2
                                                                                                                                               0.0




                                                                                                                                                     0         200                 400                         600                800               1000

                                                                                                                                                                                             Tenure (days)
                                                                                                                                                                         Subscribers since 2006-01-01 (N = 400,883 subscribers)



                     LTR = (Daily Sales Price) * (Average Tenure)
                                                                                                                                                                                                                                                   13
Application: Average Tenure of Renewal Durations
                          Subscription Stint Survival by Duration




                            1-Year       1-year     2-Year       2-year     3-Year       3-year
   Duration   Half Life    Average   Probability   Average   Probability   Average   Probability
   (Months)     (days)      Tenure   of Survival    Tenure   of Survival    Tenure   of Survival
      1            149       180.9        0.217        NA            NA        NA            NA
      3            273       260.9        0.380      354.4        0.131      387.7        0.058
     12            456       364.7        0.880      557.8        0.452      629.9        0.159
     24            822       363.8        0.991      723.1        0.978      840.3        0.125




                                                                                                   14
Application: Evaluating a Conversion Test
We test a great new design for the web site conversion
page flow & get 1050 conversions for new flow vs. 1000 for
old flow - a 5% lift!

                              A: Control Page Flow     B: New Page Flow

         2-Yr Daily    2-Yr             Projected 2-            Projected 2-
       Tenure ASP     Value     # Conv.    Yr Value     # Conv.    Yr Value

  M       220 0.15 $33.00           500     $16,500        650     $21,450
  Q       350 0.12 $42.00           300     $12,600        275     $11,550
   A      560 0.09 $50.40           200     $10,080        125      $6,300
Total                              1000     $39,180       1050     $39,300
Revenue Per Unit (2-Year)                    $39.18                 $37.43



                                                                          15
Application: Projected Value of Base
So far we have looked at survival from start of subscription. Now we
need remaining survival for current subscribers. For example a 1-year
subscriber:
               Just reset
               Ps to 1.0




Projected n-year value of your current subscriber base is:
For all subscribers, Sum their
          (daily sales price) *
          (Area under appropriate survival curve
                    from their current tenure over the next n years) /
          (Ps (current tenure))

                                                                         16
Application: Thinking in Hazard Space
Picking Tenure Segment Boundaries
Simple minded customer segmentation. But very useful!
   •   Newbie: 0 - 100 days
                   ___
   •   Builder: 101 - ___ days
                ___ 380
   •   Established: 381 - 735 days
                    ___ ___
   •   Core: 736 days & up
             ___

                                             Hazard Ratio
                                         USDeluxe Domestic Acom Hazard Rate
                      0.14




                                                                                                      Duration
                                                                                                      A (N = 204,538)
                                                                                                      M (N = 196,345)
                      0.12
                      0.10
        Hazard Rate

                      0.08
                      0.06
                      0.04
                      0.02
                      0.00




                             0   200               400                       600                800              1000

                                                           Tenure (days)
                                       Subscribers since 2006-01-01 (N = 400,883 subscribers)




                                                                                                                        17
Application: Modeling in Hazard Space

What-if Newbie Churn Could be Reduced by 10%?


                Red: Newbie
                Hazard down by                  Red: Newbie
                10%                             2-year value up
                                                by $10




                                                              18
Wrap-up. What we saw.
• Introduction to classical survival
• Subscribers & calculating their hazard & survival
• Applying survival & hazard to real-world
  questions

• See Appendix for links & some more details.




                                                      19
Questions?




And, ping me later with questions or
comments: JPorzak@gmail.com




                                       20
Appendix
• Links to learn more
• Retention vs Survival Curves
Subscription Survival Background
• On subscription survival for customer intelligence:
    – Gordon Linoff’s 2004 Article from Intelligent Enterprise (now
      InformationWeek) http://www.data-miners.com/resources/Customer-
      Insight-Article.pdf
    – Will Potts’ technical white paper http://www.data-
      miners.com/resources/Will%20Survival.pdf
    – Berry & Linoff’s white paper for SAS, emphasizes forecasting
      application http://www.data-
      miners.com/companion/sas/forecastingWP_001.pdf
    – Chapter 10 in http://amzn.to/mRAVpp
• Background on “classical” survival analysis:
    – http://en.wikipedia.org/wiki/Survival_analysis
• R packages used
    – survival by Terry Therneau
    – ggplot2, plyr, lubridate by Hadley Wickham, et al
    – zoo, xts by Achim Zeileis, Jeffrey Ryan, et al

                                                                        22
Retention vs Survival Curves

• Retention Rate = 1 – Churn Rate
• Churn Rate (typically) defined as:
           (# Subscribers leaving)
          (Average # subscribers)
    over some period.
• Which means:
  – Ignores information out of period
  – Not monotonically decreasing


                                        23
Traditional vs. Subscription Survival

Propertity          Traditional              Subscription
N                   Smallish (10^1 – 10^3)   Large to huge (10^5 &
                                             up)
Hazard              Assumed continuous       Usually spiky
Survival curves     Often Modeled            Empirical
Survival Curve(s)   Assumed continuous       Usually stair step




                                                                     24

More Related Content

Viewers also liked

Survival analysis
Survival analysisSurvival analysis
Survival analysis
Sanjaya Sahoo
 
A gentle introduction to survival analysis
A gentle introduction to survival analysisA gentle introduction to survival analysis
A gentle introduction to survival analysis
Angelo Tinazzi
 
Survival Analysis of Web Users
Survival Analysis of Web UsersSurvival Analysis of Web Users
Survival Analysis of Web Users
Data Science London
 
Survival analysis
Survival analysisSurvival analysis
Survival analysis
Har Jindal
 
Survival analysis
Survival  analysisSurvival  analysis
Introduction To Survival Analysis
Introduction To Survival AnalysisIntroduction To Survival Analysis
Introduction To Survival Analysis
federicorotolo
 

Viewers also liked (6)

Survival analysis
Survival analysisSurvival analysis
Survival analysis
 
A gentle introduction to survival analysis
A gentle introduction to survival analysisA gentle introduction to survival analysis
A gentle introduction to survival analysis
 
Survival Analysis of Web Users
Survival Analysis of Web UsersSurvival Analysis of Web Users
Survival Analysis of Web Users
 
Survival analysis
Survival analysisSurvival analysis
Survival analysis
 
Survival analysis
Survival  analysisSurvival  analysis
Survival analysis
 
Introduction To Survival Analysis
Introduction To Survival AnalysisIntroduction To Survival Analysis
Introduction To Survival Analysis
 

Similar to Subscription Survival Analysis

How to apply CRM using data mining techniques.
How to apply CRM using data mining techniques.How to apply CRM using data mining techniques.
How to apply CRM using data mining techniques.
customersforever
 
GeoLivi_T24_Banking (2)
GeoLivi_T24_Banking (2)GeoLivi_T24_Banking (2)
GeoLivi_T24_Banking (2)
GEO LIVI L
 
OA predicta net week data360 predictive analytics ralph overbeck published
OA predicta net week data360 predictive analytics ralph overbeck publishedOA predicta net week data360 predictive analytics ralph overbeck published
OA predicta net week data360 predictive analytics ralph overbeck published
Ralph Overbeck
 
Building Innovative Subscription-based Businesses: Lecture 2
Building Innovative Subscription-based Businesses: Lecture 2Building Innovative Subscription-based Businesses: Lecture 2
Building Innovative Subscription-based Businesses: Lecture 2
Martin Westhead
 
Survival_Analysis
Survival_AnalysisSurvival_Analysis
Survival_Analysis
Rushil Goyal
 
TopQuants2014_JokTang_DenysSemagin
TopQuants2014_JokTang_DenysSemaginTopQuants2014_JokTang_DenysSemagin
TopQuants2014_JokTang_DenysSemagin
Denys Semagin
 
Hedge_Your_Insurance_Company-Liu
Hedge_Your_Insurance_Company-LiuHedge_Your_Insurance_Company-Liu
Hedge_Your_Insurance_Company-Liu
Winter Liu
 
Project Pre-Assessment Using a Min/Max Analysis with Matt Hansen at StatStuff
Project Pre-Assessment Using a Min/Max Analysis with Matt Hansen at StatStuffProject Pre-Assessment Using a Min/Max Analysis with Matt Hansen at StatStuff
Project Pre-Assessment Using a Min/Max Analysis with Matt Hansen at StatStuff
Matt Hansen
 
Modelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription BusinessModelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription Business
Databricks
 
D05 Define VOC, VOB and CTQ
D05 Define VOC, VOB and CTQD05 Define VOC, VOB and CTQ
D05 Define VOC, VOB and CTQ
Leanleaders.org
 
Primer on angel investing 1000 angels
Primer on angel investing 1000 angelsPrimer on angel investing 1000 angels
Primer on angel investing 1000 angels
Diego Jose Ramos
 
Metrics: Talking about Churn and Retention in the Board Room
Metrics: Talking about Churn and Retention in the Board RoomMetrics: Talking about Churn and Retention in the Board Room
Metrics: Talking about Churn and Retention in the Board Room
Gainsight
 
Bank Customer Churn Prediction- Saurav Singh.pptx
Bank Customer Churn Prediction- Saurav Singh.pptxBank Customer Churn Prediction- Saurav Singh.pptx
Bank Customer Churn Prediction- Saurav Singh.pptx
Boston Institute of Analytics
 
Pitch deck templates for seed capital
Pitch deck templates for seed capital Pitch deck templates for seed capital
Pitch deck templates for seed capital
Tech in Asia ID
 
Revenue Cycle: Tracking Reimbursement for DRGs, APCs and MPFS
Revenue Cycle: Tracking Reimbursement for DRGs, APCs and MPFSRevenue Cycle: Tracking Reimbursement for DRGs, APCs and MPFS
Revenue Cycle: Tracking Reimbursement for DRGs, APCs and MPFS
AudioEducator
 
Attrition Calculation
Attrition CalculationAttrition Calculation
Attrition Calculation
swati18
 
Analysing corporate performance with system dynamics
Analysing corporate performance with system dynamicsAnalysing corporate performance with system dynamics
Analysing corporate performance with system dynamics
Daniel Jarosch
 
205250 crystall ball
205250 crystall ball205250 crystall ball
205250 crystall ball
p6academy
 
10minute pitch clinic_w_examples_v2
10minute pitch clinic_w_examples_v210minute pitch clinic_w_examples_v2
10minute pitch clinic_w_examples_v2
e527766
 
Research and Advicemonster Deep Dive - July 2019: Session Two
Research and Advicemonster Deep Dive - July 2019: Session TwoResearch and Advicemonster Deep Dive - July 2019: Session Two
Research and Advicemonster Deep Dive - July 2019: Session Two
Quality Product Research Limited
 

Similar to Subscription Survival Analysis (20)

How to apply CRM using data mining techniques.
How to apply CRM using data mining techniques.How to apply CRM using data mining techniques.
How to apply CRM using data mining techniques.
 
GeoLivi_T24_Banking (2)
GeoLivi_T24_Banking (2)GeoLivi_T24_Banking (2)
GeoLivi_T24_Banking (2)
 
OA predicta net week data360 predictive analytics ralph overbeck published
OA predicta net week data360 predictive analytics ralph overbeck publishedOA predicta net week data360 predictive analytics ralph overbeck published
OA predicta net week data360 predictive analytics ralph overbeck published
 
Building Innovative Subscription-based Businesses: Lecture 2
Building Innovative Subscription-based Businesses: Lecture 2Building Innovative Subscription-based Businesses: Lecture 2
Building Innovative Subscription-based Businesses: Lecture 2
 
Survival_Analysis
Survival_AnalysisSurvival_Analysis
Survival_Analysis
 
TopQuants2014_JokTang_DenysSemagin
TopQuants2014_JokTang_DenysSemaginTopQuants2014_JokTang_DenysSemagin
TopQuants2014_JokTang_DenysSemagin
 
Hedge_Your_Insurance_Company-Liu
Hedge_Your_Insurance_Company-LiuHedge_Your_Insurance_Company-Liu
Hedge_Your_Insurance_Company-Liu
 
Project Pre-Assessment Using a Min/Max Analysis with Matt Hansen at StatStuff
Project Pre-Assessment Using a Min/Max Analysis with Matt Hansen at StatStuffProject Pre-Assessment Using a Min/Max Analysis with Matt Hansen at StatStuff
Project Pre-Assessment Using a Min/Max Analysis with Matt Hansen at StatStuff
 
Modelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription BusinessModelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription Business
 
D05 Define VOC, VOB and CTQ
D05 Define VOC, VOB and CTQD05 Define VOC, VOB and CTQ
D05 Define VOC, VOB and CTQ
 
Primer on angel investing 1000 angels
Primer on angel investing 1000 angelsPrimer on angel investing 1000 angels
Primer on angel investing 1000 angels
 
Metrics: Talking about Churn and Retention in the Board Room
Metrics: Talking about Churn and Retention in the Board RoomMetrics: Talking about Churn and Retention in the Board Room
Metrics: Talking about Churn and Retention in the Board Room
 
Bank Customer Churn Prediction- Saurav Singh.pptx
Bank Customer Churn Prediction- Saurav Singh.pptxBank Customer Churn Prediction- Saurav Singh.pptx
Bank Customer Churn Prediction- Saurav Singh.pptx
 
Pitch deck templates for seed capital
Pitch deck templates for seed capital Pitch deck templates for seed capital
Pitch deck templates for seed capital
 
Revenue Cycle: Tracking Reimbursement for DRGs, APCs and MPFS
Revenue Cycle: Tracking Reimbursement for DRGs, APCs and MPFSRevenue Cycle: Tracking Reimbursement for DRGs, APCs and MPFS
Revenue Cycle: Tracking Reimbursement for DRGs, APCs and MPFS
 
Attrition Calculation
Attrition CalculationAttrition Calculation
Attrition Calculation
 
Analysing corporate performance with system dynamics
Analysing corporate performance with system dynamicsAnalysing corporate performance with system dynamics
Analysing corporate performance with system dynamics
 
205250 crystall ball
205250 crystall ball205250 crystall ball
205250 crystall ball
 
10minute pitch clinic_w_examples_v2
10minute pitch clinic_w_examples_v210minute pitch clinic_w_examples_v2
10minute pitch clinic_w_examples_v2
 
Research and Advicemonster Deep Dive - July 2019: Session Two
Research and Advicemonster Deep Dive - July 2019: Session TwoResearch and Advicemonster Deep Dive - July 2019: Session Two
Research and Advicemonster Deep Dive - July 2019: Session Two
 

Recently uploaded

Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
FODUU
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 

Recently uploaded (20)

Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 

Subscription Survival Analysis

  • 1. Subscription Survival Analysis SF Data Mining Meetup San Francisco June 2012 Jim Porzak – Senior Director, Business Intelligence
  • 2. What We’ll Cover… • Quick introduction to survival analysis. • Rational for marketing & business. • Subscription businesses past & present. • The example data set. How real is it? • Calculating survival, average tenure & LTV. • Applications of Survival & Hazard curves. • Discussion. • Appendix (links & details). 2
  • 3. Traditional Survival Analysis • Modeling of “time to event” data – Death in biological organisms (medicine) – Failure of machines (reliability engineering) – Political or sociological change (duration analysis) • These kinds of data share common gotcha: – If an “individual” has not yet died, failed, or changed, what can we say about the expected time to the event? 3
  • 4. Trick: Use what we know – everything! • All have started, some have stopped, & some continue on. – “Right censored” • From this kind of data we can calculate probability of stopping. – “Hazard Ratio” = f(tenure, … ) • Probability of survival at tenure, T, is just ( 1 – cumulative hazard(T)) 4
  • 5. Classical “Bathtub” Hazard Curve Source: http://en.wikipedia.org/wiki/Bathtub_curve 5
  • 6. The Subscription Business Model 1st Visit Acquisition Conversion Lapse Re-activation Today’s #’s or Win-back Re- activate Engage Win-Back Acquire Retain Retain Convert 6
  • 7. Aside: Subscription Stints – Rational & Method • For each member we need to know subscription start &, if terminated, stop dates for each member: Subscriber A • But data often has the accounting view with gaps in the subscription: CC Issue Prod Chg Whatever Subscriber A • From the customer’s perspective, he had one continuous subscription. It’s important to take that view since, in general, the longer a subscriber is active the lower the risk of churn. • A subscription stint algorithm typically ignores any gaps up to, say, 30 days – which is also taken as the boundary between a lapsed subscriber in the “re-activate” vrs “win-back” pools. 7
  • 8. Why Subscription Survival? • Expected churn probability of new subscribers – Project future subscriber count • Especially after a successful campaign – Define tenure segment boundaries • Expected tenure of a new subscriber & LTV – As function of duration, product, channel, offer – Evaluate marketing & site tests by value not rates • Projected value of retention efforts – How improved renewal rates increase value • A new way of thinking about subscribers – Retention metrics, as used in marketing, are deceptive (see Appendix slide) 8
  • 9. Typical Subscription Survival Curves Linoff, Gordon (2004) Link in appendix. 9
  • 10. Assumptions • Homogeneity of strata – Each includes only one segment of customers • Stability over time – Hardest to satisfy? – Propensity to renew a function of: • General economic conditions • Offer(s) at start of subscription • Engagement efforts by marketing group • Perceived long term value 10
  • 11. Example Data Set • 100k records: – CustomerID – SubStartDate – SubEndDate (NULL if subscription active) – Duration = [A, Q, M] for Annual, Quarterly, or Monthly – Product = [B, P] for Basic or Professional – Channel = [R, A, S, B, O] for Referral, Affiliate, Search, Banner, or Other • Randomly generated with day-of-week and week-in-year seasonality; 15% year over year growth for starts. • Randomly generated number of renewals – With different churn rates for A, Q, & M • Mimics general properties of subscription data from Playboy.com, LA Times, 24 Hour Fitness, Chicago Sun Times, Ancestry.com, & Viadeo.com – Additional parameters could be Offer, Promotion Cohort, … • Assumes “no-return” policy, i.e. no early refunds. 11
  • 12. Algorithm – Part 1: Hazard Ratio (# terminated during day i) ______________________________________ HRi = (# active at beginning of day i) = 1/4 = 0.25 Remember N is typically huge in a subscription business so we tend to get smooth curves. 12
  • 13. Algorithm – Part 2 Survival Curve Hazard Ratio Rate USDeluxe Domestic Acom Hazard 0.14 Duration A (N = 204,538) M (N = 196,345) 0.12 0.10 Hazard Rate 0.08 0.06 0.04 0.02 0.00 0 200 400 600 800 1000 Tenure (days) Subscribers since 2006-01-01 (N = 400,883 subscribers) Average 2-year Tenure Survival Probability USDeluxe Domestic Acom Subscriber Survival 1.0 Duration A (N = 204,538) M (N = 196,345) 0.8 Probability of Survival 0.6 120 404 0.4 0.2 0.0 0 200 400 600 800 1000 Tenure (days) Subscribers since 2006-01-01 (N = 400,883 subscribers) LTR = (Daily Sales Price) * (Average Tenure) 13
  • 14. Application: Average Tenure of Renewal Durations Subscription Stint Survival by Duration 1-Year 1-year 2-Year 2-year 3-Year 3-year Duration Half Life Average Probability Average Probability Average Probability (Months) (days) Tenure of Survival Tenure of Survival Tenure of Survival 1 149 180.9 0.217 NA NA NA NA 3 273 260.9 0.380 354.4 0.131 387.7 0.058 12 456 364.7 0.880 557.8 0.452 629.9 0.159 24 822 363.8 0.991 723.1 0.978 840.3 0.125 14
  • 15. Application: Evaluating a Conversion Test We test a great new design for the web site conversion page flow & get 1050 conversions for new flow vs. 1000 for old flow - a 5% lift! A: Control Page Flow B: New Page Flow 2-Yr Daily 2-Yr Projected 2- Projected 2- Tenure ASP Value # Conv. Yr Value # Conv. Yr Value M 220 0.15 $33.00 500 $16,500 650 $21,450 Q 350 0.12 $42.00 300 $12,600 275 $11,550 A 560 0.09 $50.40 200 $10,080 125 $6,300 Total 1000 $39,180 1050 $39,300 Revenue Per Unit (2-Year) $39.18 $37.43 15
  • 16. Application: Projected Value of Base So far we have looked at survival from start of subscription. Now we need remaining survival for current subscribers. For example a 1-year subscriber: Just reset Ps to 1.0 Projected n-year value of your current subscriber base is: For all subscribers, Sum their (daily sales price) * (Area under appropriate survival curve from their current tenure over the next n years) / (Ps (current tenure)) 16
  • 17. Application: Thinking in Hazard Space Picking Tenure Segment Boundaries Simple minded customer segmentation. But very useful! • Newbie: 0 - 100 days ___ • Builder: 101 - ___ days ___ 380 • Established: 381 - 735 days ___ ___ • Core: 736 days & up ___ Hazard Ratio USDeluxe Domestic Acom Hazard Rate 0.14 Duration A (N = 204,538) M (N = 196,345) 0.12 0.10 Hazard Rate 0.08 0.06 0.04 0.02 0.00 0 200 400 600 800 1000 Tenure (days) Subscribers since 2006-01-01 (N = 400,883 subscribers) 17
  • 18. Application: Modeling in Hazard Space What-if Newbie Churn Could be Reduced by 10%? Red: Newbie Hazard down by Red: Newbie 10% 2-year value up by $10 18
  • 19. Wrap-up. What we saw. • Introduction to classical survival • Subscribers & calculating their hazard & survival • Applying survival & hazard to real-world questions • See Appendix for links & some more details. 19
  • 20. Questions? And, ping me later with questions or comments: JPorzak@gmail.com 20
  • 21. Appendix • Links to learn more • Retention vs Survival Curves
  • 22. Subscription Survival Background • On subscription survival for customer intelligence: – Gordon Linoff’s 2004 Article from Intelligent Enterprise (now InformationWeek) http://www.data-miners.com/resources/Customer- Insight-Article.pdf – Will Potts’ technical white paper http://www.data- miners.com/resources/Will%20Survival.pdf – Berry & Linoff’s white paper for SAS, emphasizes forecasting application http://www.data- miners.com/companion/sas/forecastingWP_001.pdf – Chapter 10 in http://amzn.to/mRAVpp • Background on “classical” survival analysis: – http://en.wikipedia.org/wiki/Survival_analysis • R packages used – survival by Terry Therneau – ggplot2, plyr, lubridate by Hadley Wickham, et al – zoo, xts by Achim Zeileis, Jeffrey Ryan, et al 22
  • 23. Retention vs Survival Curves • Retention Rate = 1 – Churn Rate • Churn Rate (typically) defined as: (# Subscribers leaving) (Average # subscribers) over some period. • Which means: – Ignores information out of period – Not monotonically decreasing 23
  • 24. Traditional vs. Subscription Survival Propertity Traditional Subscription N Smallish (10^1 – 10^3) Large to huge (10^5 & up) Hazard Assumed continuous Usually spiky Survival curves Often Modeled Empirical Survival Curve(s) Assumed continuous Usually stair step 24

Editor's Notes

  1. Point out the two strata: CC & No-CC. Then rolled up to All.