SlideShare a Scribd company logo
Predicting Churn in Telecom
Outline
• Business Problem
• Variable Description
• Exploratory Data Analysis
• Feature Selection
• Data Pre-Processing
• Model Development
• Model Validation
Business Problem
• Consumers today go through a complex decision
making process before subscribing to any one of the
numerous Telecom service options.
• The services provided by the Telecom vendors are not
highly differentiated and number portability is
commonplace.
• customer loyalty becomes an issue. Hence, it is
becoming increasingly important for
telecommunications companies to proactively
identify factors that have a tendency to unsubscribe
and take preventive measures to retain customers.
Variable Description
• State : categorical, for the 50 states and the District of Columbia
• Account Length : integer-valued, how long account has been active
• Area Code : categorical
• Phone : Phone number of customer
• Int'l Plan : International plan activated ( yes , no)
• VMail Plan : Voice Mail plan activated ( yes , no )
• VMail Message : No. of voice mail messages
• Day Mins : Total day minutes used
• Day Calls : Total day calls made
• Day Charge : Total day charge
• Eve Mins : Total evening minutes
• Eve Calls : Total evening calls
• Eve Charge : Total evening charge
• Night Mins : Total night minutes
• Night Calls : Total night calls
• Night Charge : Total night charge
• Intl Mins : Total International minutes used
• Intl Calls : Total International calls made
• Intl Charge : Total International charge
• CustServ Calls : Number of customer service calls made
• Churn : Customer churn (Target Variable 1= churn , 0= not churned )
Exploratory Data Analysis
Summary statistics
Visualizing statistics
Plot 1
Plot 2:
Plot 3
Few observation from exploratory
analysis
• Customers with the International Plan tend to
churn more frequently
• Customers with the Voice Mail Plan tend to
churn less frequently.
• Customers with four or more customer service
calls churn more than four times as often as
do the other customers.
Feature Selection
• Important features were identified during
model building process for ex:
– Stepwise regression indicates important variable
to consider
– Variable importance graph has been generated
using random forest and so on
Data Pre-Processing
• Dataset considered for this project is already
cleaned
• We have partitioned our dataset into training and
testing set using simple random sampling
• We have dropped following four variables as they
are not adding any meaning for modelling
purpose
– State
– Area.code
– Account.length
– Phone number
Model 1: Decision Tree
• Easy to interpret
• Generates if-else business rules
• Recursive partitioning and classification
technique is used
• Tree build
– Fully grown (results in overfitting of data)
– Pruned tree (optimal tree)
• R packages used:
– Rpart
– Caret
Tree 1: Full Tree
Performance measure of full tree :
ROC Curve
Performance measure of full tree :
Confusion Matrix and other statistics
Tree 2: Pruned Tree
Performance Measure of Pruned Tree:
ROC Curve
Performance measure of Pruned tree :
Confusion Matrix and other statistics
Comparing Performance of both the
tree: ROC Curve
Compare : Confusion Matrix and other
statistics
Full Tree Pruned Tree
Model 2: Logistic Regression
• Widely used across industry
• R packages used
– Glm for model building
– Caret for model evaluation
Model Summary on all variable as
Input
Model summary on statistically
significant variables
Model Evaluation-Confusion Matrix
Model 3: Support Vector Machine
• Widely used black box technique for binary
classification
• R packages used
– e1071 (for model building)
– Caret (for model evaluation)
Model performance: Confusion Matrix
Model Evaluation: SVM Roc Curve
Model 4: Ensemble (Random Forest)
• Ensembling of decision trees will be done
• R packages used:
– randomForest (model development)
– caret (model evaluation)
Variable Importance Plot : Random
Forest
Model Evaluation : Confusion Matrix
Model Evaluation : ROC curve
(Random Forest)
Models Comparison: ROC curve
CUSTMER SEGMENTATION & CLTV
CALCULATION
• Different techniques are available for
customer segmentation.
• Customer can be segmented into different
kind of profiles like high value, low value,
warm, cold and so on.
• RFM analaysis, CLTV based segmentation,
clustering based segmentation are few
techniques to name
CLTV( customer life time value)
• CLTV (Customer LifeTime Value) refers to the
amount of revenues that you expect to
generate from a customer during the period
over which your service will be of value.
• On the basis of above values we segment
customer profiles and treat them accordingly
Assumptions
• Due to limitation in our dataset we performed CLTV
analysis on the basis of the following assumptions:
– Given data contains one year of transaction details
– Unit of amount is dollars
– following are the margins that company is getting from
their customer
• 5% of day charge
• 10% of evening hours
• 20% of night and international calls
– Monthly churn rate of telecom industry is 4%
Note: above numbers are for illustration purpose only and it depends on domain knowledge of analyst.
CLTV calculation
• On the basis of this assumptions net profit
from any customer can be calculated as:
-> Net profit = 0.05*daycharge + 0.10* eve.charge + 0.15 *night charge + 0.20 * Intnl charge
->Churnrate = 0.04
->Customer_cltv = (netprofit-0.5*cust_serv_call)/churnrate
• For illustration purpose in our case customers
whose cltv is less than mean(cltv) are
considered as LVC and other are HVC
Note: Above segmentation can be done in a better way with the help of
business domain expert
• THANK YOU

More Related Content

What's hot

churn prediction in telecom
churn prediction in telecom churn prediction in telecom
churn prediction in telecom Hong Bui Van
 
Churn Prediction in Practice
Churn Prediction in PracticeChurn Prediction in Practice
Churn Prediction in PracticeBigData Republic
 
Churn prediction data modeling
Churn prediction data modelingChurn prediction data modeling
Churn prediction data modelingPierre Gutierrez
 
Customer Churn Analysis and Prediction
Customer Churn Analysis and PredictionCustomer Churn Analysis and Prediction
Customer Churn Analysis and PredictionSOUMIT KAR
 
Data mining and analysis of customer churn dataset
Data mining and analysis of customer churn datasetData mining and analysis of customer churn dataset
Data mining and analysis of customer churn datasetRohan Choksi
 
Customer Churn Prevention Powerpoint Presentation Slides
Customer Churn Prevention Powerpoint Presentation SlidesCustomer Churn Prevention Powerpoint Presentation Slides
Customer Churn Prevention Powerpoint Presentation SlidesSlideTeam
 
Telco churn presentation
Telco churn presentationTelco churn presentation
Telco churn presentationAditya Bahl
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industryskewdlogix
 
Customer attrition and churn modeling
Customer attrition and churn modelingCustomer attrition and churn modeling
Customer attrition and churn modelingMariya Korsakova
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPranov Mishra
 
A case study on churn analysis1
A case study on churn analysis1A case study on churn analysis1
A case study on churn analysis1Amit Kumar
 
Churn customer analysis
Churn customer analysisChurn customer analysis
Churn customer analysisDr.Bechoo Lal
 
Telecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticsTelecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticssheetal sharma
 
Beyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modelingBeyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modelingPierre Gutierrez
 

What's hot (20)

churn prediction in telecom
churn prediction in telecom churn prediction in telecom
churn prediction in telecom
 
Churn Prediction in Practice
Churn Prediction in PracticeChurn Prediction in Practice
Churn Prediction in Practice
 
Churn prediction data modeling
Churn prediction data modelingChurn prediction data modeling
Churn prediction data modeling
 
Customer Churn Analysis and Prediction
Customer Churn Analysis and PredictionCustomer Churn Analysis and Prediction
Customer Churn Analysis and Prediction
 
Data mining and analysis of customer churn dataset
Data mining and analysis of customer churn datasetData mining and analysis of customer churn dataset
Data mining and analysis of customer churn dataset
 
Customer Churn Prevention Powerpoint Presentation Slides
Customer Churn Prevention Powerpoint Presentation SlidesCustomer Churn Prevention Powerpoint Presentation Slides
Customer Churn Prevention Powerpoint Presentation Slides
 
Telecom Churn Prediction
Telecom Churn PredictionTelecom Churn Prediction
Telecom Churn Prediction
 
Telco churn presentation
Telco churn presentationTelco churn presentation
Telco churn presentation
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industry
 
Telcom churn .pptx
Telcom churn .pptxTelcom churn .pptx
Telcom churn .pptx
 
Customer attrition and churn modeling
Customer attrition and churn modelingCustomer attrition and churn modeling
Customer attrition and churn modeling
 
Predicting the e-commerce churn
Predicting the e-commerce churnPredicting the e-commerce churn
Predicting the e-commerce churn
 
Churn Predictive Modelling
Churn Predictive ModellingChurn Predictive Modelling
Churn Predictive Modelling
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom Industry
 
A case study on churn analysis1
A case study on churn analysis1A case study on churn analysis1
A case study on churn analysis1
 
Churn customer analysis
Churn customer analysisChurn customer analysis
Churn customer analysis
 
Customer churn prediction in banking
Customer churn prediction in bankingCustomer churn prediction in banking
Customer churn prediction in banking
 
Telecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticsTelecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analytics
 
Beyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modelingBeyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modeling
 
Three Big Data Case Studies
Three Big Data Case StudiesThree Big Data Case Studies
Three Big Data Case Studies
 

Viewers also liked

Telco Churn Roi V3
Telco Churn Roi V3Telco Churn Roi V3
Telco Churn Roi V3hkaul
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...BAINIDA
 
Deriving economic value for CSPs with Big Data [read-only]
Deriving economic value for CSPs with Big Data [read-only]Deriving economic value for CSPs with Big Data [read-only]
Deriving economic value for CSPs with Big Data [read-only]Flytxt
 
Idiro Analytics - What is Rotational Churn and how can we tackle it?
Idiro Analytics - What is Rotational Churn and how can we tackle it?Idiro Analytics - What is Rotational Churn and how can we tackle it?
Idiro Analytics - What is Rotational Churn and how can we tackle it?Idiro Analytics
 
Big Telco - Yousun Jeong
Big Telco - Yousun JeongBig Telco - Yousun Jeong
Big Telco - Yousun JeongSpark Summit
 
Idiro Analytics - Social Network Analysis for Online Gaming
Idiro Analytics - Social Network Analysis for Online GamingIdiro Analytics - Social Network Analysis for Online Gaming
Idiro Analytics - Social Network Analysis for Online GamingIdiro Analytics
 
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...Idiro Analytics
 
Idiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics
 
Social Network Analysis for Telecoms
Social Network Analysis for TelecomsSocial Network Analysis for Telecoms
Social Network Analysis for TelecomsDataspora
 
Predicting churn in telco industry: machine learning approach - Marko Mitić
 Predicting churn in telco industry: machine learning approach - Marko Mitić Predicting churn in telco industry: machine learning approach - Marko Mitić
Predicting churn in telco industry: machine learning approach - Marko MitićInstitute of Contemporary Sciences
 
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...Helena Edelson
 
Decide on technology stack & data architecture
Decide on technology stack & data architectureDecide on technology stack & data architecture
Decide on technology stack & data architectureSV.CO
 
How to use your CRM for upselling and cross-selling
How to use your CRM for upselling and cross-sellingHow to use your CRM for upselling and cross-selling
How to use your CRM for upselling and cross-sellingRedspire Ltd
 
Big Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachBig Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachAndry Alamsyah
 
Big Data: Social Network Analysis
Big Data: Social Network AnalysisBig Data: Social Network Analysis
Big Data: Social Network AnalysisMichel Bruley
 

Viewers also liked (15)

Telco Churn Roi V3
Telco Churn Roi V3Telco Churn Roi V3
Telco Churn Roi V3
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
 
Deriving economic value for CSPs with Big Data [read-only]
Deriving economic value for CSPs with Big Data [read-only]Deriving economic value for CSPs with Big Data [read-only]
Deriving economic value for CSPs with Big Data [read-only]
 
Idiro Analytics - What is Rotational Churn and how can we tackle it?
Idiro Analytics - What is Rotational Churn and how can we tackle it?Idiro Analytics - What is Rotational Churn and how can we tackle it?
Idiro Analytics - What is Rotational Churn and how can we tackle it?
 
Big Telco - Yousun Jeong
Big Telco - Yousun JeongBig Telco - Yousun Jeong
Big Telco - Yousun Jeong
 
Idiro Analytics - Social Network Analysis for Online Gaming
Idiro Analytics - Social Network Analysis for Online GamingIdiro Analytics - Social Network Analysis for Online Gaming
Idiro Analytics - Social Network Analysis for Online Gaming
 
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
 
Idiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big Data
 
Social Network Analysis for Telecoms
Social Network Analysis for TelecomsSocial Network Analysis for Telecoms
Social Network Analysis for Telecoms
 
Predicting churn in telco industry: machine learning approach - Marko Mitić
 Predicting churn in telco industry: machine learning approach - Marko Mitić Predicting churn in telco industry: machine learning approach - Marko Mitić
Predicting churn in telco industry: machine learning approach - Marko Mitić
 
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
 
Decide on technology stack & data architecture
Decide on technology stack & data architectureDecide on technology stack & data architecture
Decide on technology stack & data architecture
 
How to use your CRM for upselling and cross-selling
How to use your CRM for upselling and cross-sellingHow to use your CRM for upselling and cross-selling
How to use your CRM for upselling and cross-selling
 
Big Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachBig Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network Approach
 
Big Data: Social Network Analysis
Big Data: Social Network AnalysisBig Data: Social Network Analysis
Big Data: Social Network Analysis
 

Similar to Churn modelling

Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptxAniket Patil
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptxpatilaniket2418
 
Revenue assurance 102
Revenue assurance 102Revenue assurance 102
Revenue assurance 102ntel
 
NJTC Workshop on Financial Services Technology and Application Development Pr...
NJTC Workshop on Financial Services Technology and Application Development Pr...NJTC Workshop on Financial Services Technology and Application Development Pr...
NJTC Workshop on Financial Services Technology and Application Development Pr...DATA Inc.
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformArvind Sathi
 
Insight-2015-Session-3193
Insight-2015-Session-3193Insight-2015-Session-3193
Insight-2015-Session-3193Michal Miklas
 
Business Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeBusiness Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeTatiana Ivanova
 
Acqueon's LCM - for Cisco Unified CCE Dialer - Presentation
Acqueon's LCM - for Cisco Unified CCE Dialer - PresentationAcqueon's LCM - for Cisco Unified CCE Dialer - Presentation
Acqueon's LCM - for Cisco Unified CCE Dialer - PresentationAcqueon Technologies Inc.
 
Mathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementMathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementZehra Kendir
 
Inspire2015 Bank of America Merrill Lynch
Inspire2015 Bank of America Merrill LynchInspire2015 Bank of America Merrill Lynch
Inspire2015 Bank of America Merrill LynchAltan Atabarut, MSc.
 
ASUG Utilities Presentation
ASUG Utilities PresentationASUG Utilities Presentation
ASUG Utilities PresentationMichael Robinson
 
Modelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription BusinessModelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription BusinessDatabricks
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
Key findings when upgrading your sap crm system
Key findings when upgrading your sap crm systemKey findings when upgrading your sap crm system
Key findings when upgrading your sap crm systemrobgirvan
 
Calculating Client Profitability: Analysis to Action
Calculating Client Profitability: Analysis to ActionCalculating Client Profitability: Analysis to Action
Calculating Client Profitability: Analysis to ActionPerformLaw
 

Similar to Churn modelling (20)

Customer Life Time Value
Customer Life Time ValueCustomer Life Time Value
Customer Life Time Value
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Revenue assurance 102
Revenue assurance 102Revenue assurance 102
Revenue assurance 102
 
NJTC Workshop on Financial Services Technology and Application Development Pr...
NJTC Workshop on Financial Services Technology and Application Development Pr...NJTC Workshop on Financial Services Technology and Application Development Pr...
NJTC Workshop on Financial Services Technology and Application Development Pr...
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics Platform
 
Final mis power
Final mis powerFinal mis power
Final mis power
 
Insight-2015-Session-3193
Insight-2015-Session-3193Insight-2015-Session-3193
Insight-2015-Session-3193
 
Business Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeBusiness Intelligence and OLAP Practice
Business Intelligence and OLAP Practice
 
Krrushnan Resume - Mainframe (2)
Krrushnan Resume - Mainframe (2)Krrushnan Resume - Mainframe (2)
Krrushnan Resume - Mainframe (2)
 
Acqueon's LCM - for Cisco Unified CCE Dialer - Presentation
Acqueon's LCM - for Cisco Unified CCE Dialer - PresentationAcqueon's LCM - for Cisco Unified CCE Dialer - Presentation
Acqueon's LCM - for Cisco Unified CCE Dialer - Presentation
 
Mathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementMathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer Management
 
Inspire2015 Bank of America Merrill Lynch
Inspire2015 Bank of America Merrill LynchInspire2015 Bank of America Merrill Lynch
Inspire2015 Bank of America Merrill Lynch
 
ASUG Utilities Presentation
ASUG Utilities PresentationASUG Utilities Presentation
ASUG Utilities Presentation
 
Modelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription BusinessModelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription Business
 
VAN
VANVAN
VAN
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
1710 track3 zhu
1710 track3 zhu1710 track3 zhu
1710 track3 zhu
 
Key findings when upgrading your sap crm system
Key findings when upgrading your sap crm systemKey findings when upgrading your sap crm system
Key findings when upgrading your sap crm system
 
Calculating Client Profitability: Analysis to Action
Calculating Client Profitability: Analysis to ActionCalculating Client Profitability: Analysis to Action
Calculating Client Profitability: Analysis to Action
 

Recently uploaded

Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundOppotus
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单ewymefz
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...elinavihriala
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsalex933524
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBAlireza Kamrani
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?DOT TECH
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictJack Cole
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxDilipVasan
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单enxupq
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .NABLAS株式会社
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...correoyaya
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxStephen266013
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIAlejandraGmez176757
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsCEPTES Software Inc
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sMAQIB18
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单yhkoc
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单enxupq
 

Recently uploaded (20)

Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 

Churn modelling

  • 2. Outline • Business Problem • Variable Description • Exploratory Data Analysis • Feature Selection • Data Pre-Processing • Model Development • Model Validation
  • 3. Business Problem • Consumers today go through a complex decision making process before subscribing to any one of the numerous Telecom service options. • The services provided by the Telecom vendors are not highly differentiated and number portability is commonplace. • customer loyalty becomes an issue. Hence, it is becoming increasingly important for telecommunications companies to proactively identify factors that have a tendency to unsubscribe and take preventive measures to retain customers.
  • 4. Variable Description • State : categorical, for the 50 states and the District of Columbia • Account Length : integer-valued, how long account has been active • Area Code : categorical • Phone : Phone number of customer • Int'l Plan : International plan activated ( yes , no) • VMail Plan : Voice Mail plan activated ( yes , no ) • VMail Message : No. of voice mail messages • Day Mins : Total day minutes used • Day Calls : Total day calls made • Day Charge : Total day charge • Eve Mins : Total evening minutes • Eve Calls : Total evening calls • Eve Charge : Total evening charge • Night Mins : Total night minutes • Night Calls : Total night calls • Night Charge : Total night charge • Intl Mins : Total International minutes used • Intl Calls : Total International calls made • Intl Charge : Total International charge • CustServ Calls : Number of customer service calls made • Churn : Customer churn (Target Variable 1= churn , 0= not churned )
  • 8.
  • 12. Few observation from exploratory analysis • Customers with the International Plan tend to churn more frequently • Customers with the Voice Mail Plan tend to churn less frequently. • Customers with four or more customer service calls churn more than four times as often as do the other customers.
  • 13. Feature Selection • Important features were identified during model building process for ex: – Stepwise regression indicates important variable to consider – Variable importance graph has been generated using random forest and so on
  • 14. Data Pre-Processing • Dataset considered for this project is already cleaned • We have partitioned our dataset into training and testing set using simple random sampling • We have dropped following four variables as they are not adding any meaning for modelling purpose – State – Area.code – Account.length – Phone number
  • 15. Model 1: Decision Tree • Easy to interpret • Generates if-else business rules • Recursive partitioning and classification technique is used • Tree build – Fully grown (results in overfitting of data) – Pruned tree (optimal tree) • R packages used: – Rpart – Caret
  • 16. Tree 1: Full Tree
  • 17. Performance measure of full tree : ROC Curve
  • 18. Performance measure of full tree : Confusion Matrix and other statistics
  • 20. Performance Measure of Pruned Tree: ROC Curve
  • 21. Performance measure of Pruned tree : Confusion Matrix and other statistics
  • 22. Comparing Performance of both the tree: ROC Curve
  • 23. Compare : Confusion Matrix and other statistics Full Tree Pruned Tree
  • 24. Model 2: Logistic Regression • Widely used across industry • R packages used – Glm for model building – Caret for model evaluation
  • 25. Model Summary on all variable as Input
  • 26. Model summary on statistically significant variables
  • 28. Model 3: Support Vector Machine • Widely used black box technique for binary classification • R packages used – e1071 (for model building) – Caret (for model evaluation)
  • 31. Model 4: Ensemble (Random Forest) • Ensembling of decision trees will be done • R packages used: – randomForest (model development) – caret (model evaluation)
  • 32. Variable Importance Plot : Random Forest
  • 33. Model Evaluation : Confusion Matrix
  • 34. Model Evaluation : ROC curve (Random Forest)
  • 36. CUSTMER SEGMENTATION & CLTV CALCULATION • Different techniques are available for customer segmentation. • Customer can be segmented into different kind of profiles like high value, low value, warm, cold and so on. • RFM analaysis, CLTV based segmentation, clustering based segmentation are few techniques to name
  • 37. CLTV( customer life time value) • CLTV (Customer LifeTime Value) refers to the amount of revenues that you expect to generate from a customer during the period over which your service will be of value. • On the basis of above values we segment customer profiles and treat them accordingly
  • 38. Assumptions • Due to limitation in our dataset we performed CLTV analysis on the basis of the following assumptions: – Given data contains one year of transaction details – Unit of amount is dollars – following are the margins that company is getting from their customer • 5% of day charge • 10% of evening hours • 20% of night and international calls – Monthly churn rate of telecom industry is 4% Note: above numbers are for illustration purpose only and it depends on domain knowledge of analyst.
  • 39. CLTV calculation • On the basis of this assumptions net profit from any customer can be calculated as: -> Net profit = 0.05*daycharge + 0.10* eve.charge + 0.15 *night charge + 0.20 * Intnl charge ->Churnrate = 0.04 ->Customer_cltv = (netprofit-0.5*cust_serv_call)/churnrate • For illustration purpose in our case customers whose cltv is less than mean(cltv) are considered as LVC and other are HVC Note: Above segmentation can be done in a better way with the help of business domain expert

Editor's Notes

  1. We can also add transaction data and demographic data of the customer for better insights . Since we are limited to these dummy data for our analysis we will try to explain different machine learning algorithm on this dataset.
  2. Hypotheses: No of calls to customer care results in more churning & graph b/w both variables is also indicating the same
  3. Hypotheses 2: International call subscriber has more chances of churning figure also reflecting the same
  4. Hypotheses 3: subscribing to voice mail service has no significant impact on churning
  5. More conclusions can be drawn by plotting differnet graphs between different variables as per hypotheses
  6. Since we are already dealing with small number of features we are not going into specific feature selection techniques
  7. There is little difference in sensitivity and specificity which clearly indicates that our full tree has overfitted data in model building therefore one should go for pruned tree where less number of rules are generated as compared to full tree
  8. Underlined variables are statistically significant
  9. _