Churn Prediction in Mobile Social Games: Towards a Complete Assessment Using Survival Ensembles
África Periáñez, Alain Saas, Anna Guitart and Colin Magne
IEEE/ACM DSAA 2016
Montreal, October 19th, 2016
About us
Who are we?
● Game and technology company based in Tokyo (spin-off of Silicon Graphics)
● Research project to provide Game Data Science as a Service
● Goals: predict player behavior, scale to big data, and visualize results intuitively
Our data
● Free-to-play mobile social games
● In-app purchases and activity behavioral data
Churn prediction in Free-To-Play games
We focus on the top spenders: the whales
➔ 0.2% of the players, 50% of the revenue
➔ Their high engagement makes them more likely to respond positively to actions taken to retain them
➔ For this group, we can define churn as 10 days of inactivity
◆ The definition of churn in F2P games is not straightforward
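As an illustration of the 10-day inactivity rule, here is a minimal Python sketch that turns activity dates into survival-style labels. The function name, argument names, and the choice to measure duration from registration are assumptions made for this example, not details taken from the slides.

```python
from datetime import date

CHURN_WINDOW_DAYS = 10  # inactivity threshold stated on the slide


def label_players(registered, last_active, observed_on):
    """Map player id -> (duration_days, churned).

    The argument names and dict layout are hypothetical.
    """
    labels = {}
    for player_id, last_seen in last_active.items():
        inactive_days = (observed_on - last_seen).days
        churned = inactive_days >= CHURN_WINDOW_DAYS
        # Churned players stop at their last activity; players who are
        # still active are censored at the observation date.
        end = last_seen if churned else observed_on
        labels[player_id] = ((end - registered[player_id]).days, churned)
    return labels


labels = label_players(
    registered={"a": date(2016, 1, 1), "b": date(2016, 2, 1)},
    last_active={"a": date(2016, 3, 1), "b": date(2016, 3, 28)},
    observed_on=date(2016, 3, 31),
)
```

Player "a" has been inactive for 30 days and counts as churned; player "b" was seen 3 days before the observation date and is treated as still active.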
Feature selection
◎ Game-independent features:
○ player attention: time spent per day, lifetime
○ player loyalty: number of days connecting, loyalty index (number of days played over lifetime), days from registration to first purchase, days since last purchase
○ player intensity: number of actions, sessions, amount of in-app purchases, action activity distance (total average actions compared to the last days' behaviour)
○ player level (a concept common to most games)
◎ Game-dependent features researched but ultimately not part of our model:
○ participation in a guild (social feature)
○ actions measured by categories
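Two of the features above are defined by a short formula, so a small sketch may help. The slides give the loyalty index as days played over lifetime; the exact form of the action activity distance is not given, so the difference between the lifetime average and the recent average, and the 7-day window, are assumptions for illustration only.

```python
def loyalty_index(days_played, lifetime_days):
    """Number of days played divided by lifetime in days."""
    if lifetime_days <= 0:
        return 0.0
    return days_played / lifetime_days


def action_activity_distance(daily_actions, recent_days=7):
    """Lifetime average of daily actions minus the average over the last days.

    Assumed form: a positive value means the player has recently been less
    active than usual. `recent_days` is a hypothetical parameter.
    """
    if not daily_actions:
        raise ValueError("daily_actions must not be empty")
    overall = sum(daily_actions) / len(daily_actions)
    recent = daily_actions[-recent_days:]
    return overall - sum(recent) / len(recent)


example_loyalty = loyalty_index(days_played=30, lifetime_days=60)
example_distance = action_activity_distance([10] * 7 + [4] * 7, recent_days=7)
```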
The model
Survival Ensembles
Challenge: modeling churn
◎ Survival analysis focuses on predicting the time-to-event, e.g. churn
○ when will a player stop playing?
◎ Classical methods, like regressions, are appropriate only when all players have left the game
◎ Censoring problem: the dataset contains incomplete churn information, because many players are still active
◎ Censoring is inherent to churn
➔ Survival analysis is used in biology and medicine to deal with this problem
➔ Ensemble learning techniques provide high-accuracy predictions
Output of the model
◎ We focus on whales
◎ Cumulative survival probability (Kaplan-Meier estimates)
◎ Step function that changes every time that a player churns
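The Kaplan-Meier estimate mentioned above is the product, over all churn times t_i up to t, of (1 - d_i / n_i), where d_i is the number of players churning at t_i and n_i the number still at risk just before t_i. A minimal pure-Python sketch follows; the function name and the toy data are assumptions for illustration, not the authors' implementation.

```python
def kaplan_meier(durations, events):
    """Return [(time, survival)] with one step per distinct churn time.

    durations: observed days per player; events: True if the player churned,
    False if the observation is censored (player still active).
    """
    data = sorted(zip(durations, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        churned = 0
        leaving = 0
        # Group all players observed for exactly t days.
        while i < len(data) and data[i][0] == t:
            churned += 1 if data[i][1] else 0
            leaving += 1
            i += 1
        if churned:
            survival *= 1 - churned / at_risk
            curve.append((t, survival))
        # Churned and censored players both leave the risk set.
        at_risk -= leaving
    return curve


curve = kaplan_meier([5, 8, 8, 12, 20], [True, True, False, True, False])
```

Censored players never produce a step themselves, but they shrink the risk set, which is why the curve only changes when a player churns.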
Challenge: modeling churn
◎ Two approaches:
○ Churn as a binary classification
○ Churn as a censored data problem
◎ One model: Conditional Inference Survival Ensembles (1)
○ deals with censoring
○ high accuracy due to ensemble learning
Survival Analysis
➔ Survival analysis methods (e.g. Cox regression) do not assume any particular statistical distribution: they are fitted from data
➔ Fixed link between output and features: effort goes into model selection and evaluation
1) Hothorn et al., 2006. Unbiased recursive partitioning: A conditional inference framework
Conditional inference survival ensembles
Survival Tree
➔ Split the feature space recursively
➔ Based on a survival statistical criterion, the root node is divided into two daughter nodes
➔ Maximize the survival difference between nodes
➔ A single tree produces unstable predictions
Conditional Survival Ensembles
➔ Outstanding predictions
➔ Make use of hundreds of trees
➔ Conditional inference survival ensembles use a Kaplan-Meier function as splitting criterion
➔ Resistant to overfitting
➔ Robust information about variable importance
➔ Unbiased approach
Survival tree
Conditional inference survival tree partition with Kaplan-Meier estimates of the survival time which characterizes the players placed in every terminal node group
Linear rank statistics as splitting criterion
Conditional inference survival ensembles
◎ Two-step algorithm:
○ 1) the optimal split variable is selected: association between covariates and response
○ 2) the optimal split point is determined by comparing two-sample linear statistics for all possible partitions of the split variable
Random Survival Forest
➔ RSF is based on the original random forest algorithm (1)
➔ RSF favors variables with many possible split points over variables with fewer, a bias the conditional inference approach is designed to avoid
1) Breiman L. 2001. Random Forests.
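The two-step split selection described above can be sketched in plain Python. This is a simplified illustration under stated assumptions, not the authors' code or a library API: it uses log-rank scores as the response transformation, picks the variable by the largest standardized linear statistic rather than by permutation-test p-values, and omits the stopping rule. All function names and the toy data are hypothetical.

```python
def logrank_scores(durations, events):
    """Log-rank score: event indicator minus cumulative hazard at that time."""
    cum_hazard = {}
    total = 0.0
    for t in sorted({d for d, e in zip(durations, events) if e}):
        at_risk = sum(1 for d in durations if d >= t)
        churned = sum(1 for d, e in zip(durations, events) if e and d == t)
        total += churned / at_risk
        cum_hazard[t] = total
    scores = []
    for d, e in zip(durations, events):
        hazard = max((h for t, h in cum_hazard.items() if t <= d), default=0.0)
        scores.append((1.0 if e else 0.0) - hazard)
    return scores


def standardized_stat(g, scores):
    """Squared linear statistic sum(g_i * a_i), standardized under permutation."""
    n = len(g)
    g_mean = sum(g) / n
    a_mean = sum(scores) / n
    s_gg = sum((v - g_mean) ** 2 for v in g)
    s_aa = sum((a - a_mean) ** 2 for a in scores)
    if s_gg == 0 or s_aa == 0:
        return 0.0
    linear = sum(v * a for v, a in zip(g, scores))
    expected = n * g_mean * a_mean
    variance = s_gg * s_aa / (n - 1)
    return (linear - expected) ** 2 / variance


def choose_split(features, durations, events):
    scores = logrank_scores(durations, events)
    # Step 1: variable with the strongest association with the response.
    name = max(features, key=lambda k: standardized_stat(features[k], scores))
    x = features[name]
    # Step 2: best binary partition of that variable (two-sample statistic).
    cuts = sorted(set(x))[:-1]
    if not cuts:
        return name, None
    cut = max(
        cuts,
        key=lambda c: standardized_stat([1.0 if v <= c else 0.0 for v in x], scores),
    )
    return name, cut


split = choose_split(
    features={"level": [1, 2, 3, 4, 10, 11, 12, 13],
              "noise": [5, 1, 4, 2, 3, 5, 1, 4]},
    durations=[2, 3, 4, 5, 20, 22, 25, 30],
    events=[True, True, True, True, False, False, False, False],
)
```

Separating variable selection from split-point search is what removes the advantage that variables with many candidate split points would otherwise have.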
The Results
With “Age of Ishtaria” Game Data
Binary classification results and comparison with other models
Censored data problem results
Predicted Kaplan-Meier survival curves as a function of time (days) for new or existing players
Validation -- Churn prediction
1000 bootstrap cross-validation error curves for the survival ensemble model and Cox regression
Survival ensembles approach
◎ Treating churn as a censored data problem is the right approach
○ the median survival time, i.e. the time at which the probability of surviving in the game is 50%, can be used as a time threshold to categorize a player as at risk of churning
◎ Binary problem -- static model
○ also brings relevant information
○ useful insight for short-term prediction
◎ SVM, ANN, Decision Trees, etc. are useful tools for regression or classification problems
○ in their original form they cannot handle censored data
○ they require either 1) modification of the algorithm or 2) transformation of the data
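The median survival time threshold described above can be read directly off a predicted survival curve. The sketch below assumes the curve is a list of (day, survival probability) steps sorted by day; the function names and the `horizon_days` cut-off are hypothetical choices for illustration.

```python
def median_survival_time(curve):
    """First day at which the survival probability drops to 50% or below.

    Returns None when the curve never reaches 50%.
    """
    for day, survival in curve:
        if survival <= 0.5:
            return day
    return None


def at_risk_of_churning(curve, horizon_days):
    """Flag a player whose median survival time falls within the horizon."""
    median = median_survival_time(curve)
    return median is not None and median <= horizon_days


player_curve = [(5, 0.8), (8, 0.6), (12, 0.3)]
median_day = median_survival_time(player_curve)
flagged = at_risk_of_churning(player_curve, horizon_days=14)
```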
Summary and conclusion
◎ Application of the state-of-the-art algorithm “conditional inference survival ensembles”
○ to predict churn
○ and the survival probability of players in social games
◎ Model able to make daily predictions in an operational environment
◎ Adapts to other games' data: Democratize Game Data Science
◎ Relevant information about whale behaviour
○ discovering new playing patterns as a function of time
○ classifying gamers by risk factors of survival experience
◎ A step towards the challenging goal of a comprehensive understanding of players
Other work of the authors related to Game Data Science
Discovering Playing Patterns:
Time Series Clustering of Free-To-Play Game Data
Alain Saas, Anna Guitart and África Periáñez
IEEE CIG 2016
Special Session on Game Data Science
Chaired by Alain Saas and África Periáñez
IEEE/ACM DSAA 2016
www.gamedatascience.org
