SlideShare a Scribd company logo
1 of 7
Marketing Analytics
Group2:Logan Moore, Jennifer Eickert, MadelineRynkiewicz, LaurynJashinski
Model Overview
Special Considerations
The main factors affecting the performance of our model was how we optimized the attributes
selected and the parameters within the decision tree.
1. Weight by Gini Index: We ran six different weighting operators and Gini Index provided
the most balanced results.
2. Select by Weight: This easily allowed us to choose the top 10 attributes to base the model
off of.
3. Replace Missing Values: After trial and error, the best model was predicting after
changing missing attribute values to averages.
4. Filter Examples: A rigorous process of examining the weights of each of the attribute, the
mean and standard deviation of each attribute, and the overall effect of outliers on the
model ensued to find the best prediction. (*Important note: ‘Custom Filters’ can only be
applied in RapidMiner 6, downloaded on Logan’s personal computer)
Trial and Error Process for Filtering
eqpdays 1 eqpdays 1 eqpdays 1
months 0.644121 months 0.797907 months 0.790851
retcalls 0.45587 mou 0.248054 mou 0.232883
webcap 0.37495 retcalls 0.19691 retcalls 0.193697
creditde 0.28094 webcap 0.160545 webcap 0.160489
changem 0.266191 incalls 0.136913 incalls 0.137575
changer 0.243359 creditde 0.120039 creditde 0.12025
mou 0.156781 changem 0.110988 changem 0.111546
retaccpt 0.135313 changer 0.106593 outcalls 0.105707
phones 0.079818 outcalls 0.105258 unansvce 0.104361
Chi-Square Info Gain Gini
eqpdays 1 mou 1 retcalls 1
retcalls 0.623101 changem 0.494791 eqpdays 0.804925
webcap 0.59594 eqpdays 0.489207 webcap 0.684583
creditde 0.515741 revenue 0.084119 changer 0.60161
mou 0.500321 changer 0.075769 creditde 0.498742
incalls 0.372892 unansvce 0.072891 months 0.487047
retaccpt 0.353905 outcalls 0.066249 changem 0.467689
phones 0.334674 incalls 0.031656 retaccpt 0.326418
outcalls 0.331368 blckvce 0.02 mou 0.215435
changem 0.327312 months 0.018341 callwait 0.182593
Deviation UncertaintyCorrelation
eqpdays < 907
months < 44
mou < 1000
retcalls < 1
webcap NA NA
creditde NA NA
incalls < 42
outcalls < 95
changem > -500 < 500
retaccpt < 1
changer < 204
MASTER FILTER @95%
5. Decision Tree: The Gini Index was used within the decision tree. This corresponds to the
weighting measure by the Gini Index. Decision trees are the least restrictive of all models
and do not assume normal distributions. This is especially useful since some attributes
had shown that the distribution of their values was subject to skewnewss. A trial and error
process was used to maximize the parameters (shown below).
Base Optimized
Model1 Performance
Training
Validation
Scoring
Filters
This model has a solid performance because the ‘No’ validation is well above 40% and the ‘Yes’
Validation has a relatively high validation of 76.95%. The balanced prediction of ‘Yes’ and ‘No’
in the scoring data can be held with reasonable confidence for ‘Yes’. The validating model
actually performs better than the training data, which is an anomaly, but does further indicate its
solid all-around performance. Five filters were chosen that removed outliers of highly weighted
attributes. This process adequately scrubbed the data. More research could be conducted into
individual responses that contain outlier values, which may boost both ‘Yes’ and ‘No’ validation
performances. This is a very rigorous process, even with adequate RapidMiner operators, which
pervades the scope of this course.
Model2 Performance
Training
Validation
Scoring
Filters
By simply removing ‘outcalls’ from the custom filter, the performance of the model drastically
changed. This model can predict churn with 86% confidence, a 9% increase from the previous
model. However, the retention prediction drops considerably (21%). Ultimately, the marginal
gain in validation performance for churn skews this model and it appears that too many
customers are predicted to will now churn.
Model3 Performance
Training
Validation
Scoring
Filters
When only selecting the 6 most important attributes and filtering them for them accordingly, the
performance validation of this model reflects the higher ‘yes’ of Model 2 and the higher ‘no’ of
Model 1, in relation to whether or not a customer will churn. Once again, the validation
performance of retention seems to be too low, where the prediction is too far out of balance.
Profileof ChurningCustomers
Customers that churn are expected to have fewer days with their equipment and less months than
loyal customers. They will not place calls to the retention team or accept retention offers. They
are also more likely to have lower/poor credit.

More Related Content

Similar to Marketing Analytics RM Report

Leveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataLeveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataMRS
 
Failure Modes FMEA-&-Measurement_Systems_Analysis.ppt
Failure Modes FMEA-&-Measurement_Systems_Analysis.pptFailure Modes FMEA-&-Measurement_Systems_Analysis.ppt
Failure Modes FMEA-&-Measurement_Systems_Analysis.pptMadan Karki
 
LSSGB_Project_SimpliLearn.ppt
LSSGB_Project_SimpliLearn.pptLSSGB_Project_SimpliLearn.ppt
LSSGB_Project_SimpliLearn.pptMash92
 
OM2_Lecture 11vvvhhbbjjbjdjjeebjrhvhuuhh
OM2_Lecture 11vvvhhbbjjbjdjjeebjrhvhuuhhOM2_Lecture 11vvvhhbbjjbjdjjeebjrhvhuuhh
OM2_Lecture 11vvvhhbbjjbjdjjeebjrhvhuuhhrammanoharjharupnaga
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPranov Mishra
 
Six sigma.pptx
Six sigma.pptxSix sigma.pptx
Six sigma.pptxGulaHangus
 
Six sigma.pptx
Six sigma.pptxSix sigma.pptx
Six sigma.pptxGulaHangus
 
Credit Risk Evaluation Model
Credit Risk Evaluation ModelCredit Risk Evaluation Model
Credit Risk Evaluation ModelMihai Enescu
 
Six sigma-measure-phase2505
Six sigma-measure-phase2505Six sigma-measure-phase2505
Six sigma-measure-phase2505densongco
 
Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...
Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...
Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...Sri Ambati
 
Forward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative AdjustmentsForward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative AdjustmentsLibby Bierman
 
Cpk indispensable index or misleading measure? by PQ Systems
Cpk indispensable index or misleading measure? by PQ SystemsCpk indispensable index or misleading measure? by PQ Systems
Cpk indispensable index or misleading measure? by PQ SystemsBlackberry&Cross
 
How To Reduce Costs And Improve Financial Performance using NextGen Practice ...
How To Reduce Costs And Improve Financial Performance using NextGen Practice ...How To Reduce Costs And Improve Financial Performance using NextGen Practice ...
How To Reduce Costs And Improve Financial Performance using NextGen Practice ...James Muir
 

Similar to Marketing Analytics RM Report (20)

Six Sigma
Six SigmaSix Sigma
Six Sigma
 
Leveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataLeveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic data
 
Failure Modes FMEA-&-Measurement_Systems_Analysis.ppt
Failure Modes FMEA-&-Measurement_Systems_Analysis.pptFailure Modes FMEA-&-Measurement_Systems_Analysis.ppt
Failure Modes FMEA-&-Measurement_Systems_Analysis.ppt
 
Lean Six Sigma
Lean Six SigmaLean Six Sigma
Lean Six Sigma
 
LSSGB_Project_SimpliLearn.ppt
LSSGB_Project_SimpliLearn.pptLSSGB_Project_SimpliLearn.ppt
LSSGB_Project_SimpliLearn.ppt
 
OM2_Lecture 11vvvhhbbjjbjdjjeebjrhvhuuhh
OM2_Lecture 11vvvhhbbjjbjdjjeebjrhvhuuhhOM2_Lecture 11vvvhhbbjjbjdjjeebjrhvhuuhh
OM2_Lecture 11vvvhhbbjjbjdjjeebjrhvhuuhh
 
PQMS.pptx
PQMS.pptxPQMS.pptx
PQMS.pptx
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom Industry
 
Six sigma.pptx
Six sigma.pptxSix sigma.pptx
Six sigma.pptx
 
Six sigma.pptx
Six sigma.pptxSix sigma.pptx
Six sigma.pptx
 
Model Validation
Model Validation Model Validation
Model Validation
 
cost of quality
cost of qualitycost of quality
cost of quality
 
Credit Risk Evaluation Model
Credit Risk Evaluation ModelCredit Risk Evaluation Model
Credit Risk Evaluation Model
 
Six sigma-measure-phase2505
Six sigma-measure-phase2505Six sigma-measure-phase2505
Six sigma-measure-phase2505
 
Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...
Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...
Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...
 
Forward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative AdjustmentsForward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative Adjustments
 
machineLearningTypingTool_Rev1
machineLearningTypingTool_Rev1machineLearningTypingTool_Rev1
machineLearningTypingTool_Rev1
 
Cpk indispensable index or misleading measure? by PQ Systems
Cpk indispensable index or misleading measure? by PQ SystemsCpk indispensable index or misleading measure? by PQ Systems
Cpk indispensable index or misleading measure? by PQ Systems
 
1030 track3 boire
1030 track3 boire1030 track3 boire
1030 track3 boire
 
How To Reduce Costs And Improve Financial Performance using NextGen Practice ...
How To Reduce Costs And Improve Financial Performance using NextGen Practice ...How To Reduce Costs And Improve Financial Performance using NextGen Practice ...
How To Reduce Costs And Improve Financial Performance using NextGen Practice ...
 

Marketing Analytics RM Report

  • 1. Marketing Analytics Group2:Logan Moore, Jennifer Eickert, MadelineRynkiewicz, LaurynJashinski Model Overview Special Considerations The main factors affecting the performance of our model was how we optimized the attributes selected and the parameters within the decision tree. 1. Weight by Gini Index: We ran six different weighting operators and Gini Index provided the most balanced results. 2. Select by Weight: This easily allowed us to choose the top 10 attributes to base the model off of. 3. Replace Missing Values: After trial and error, the best model was predicting after changing missing attribute values to averages. 4. Filter Examples: A rigorous process of examining the weights of each of the attribute, the mean and standard deviation of each attribute, and the overall effect of outliers on the model ensued to find the best prediction. (*Important note: ‘Custom Filters’ can only be applied in RapidMiner 6, downloaded on Logan’s personal computer)
  • 2. Trial and Error Process for Filtering eqpdays 1 eqpdays 1 eqpdays 1 months 0.644121 months 0.797907 months 0.790851 retcalls 0.45587 mou 0.248054 mou 0.232883 webcap 0.37495 retcalls 0.19691 retcalls 0.193697 creditde 0.28094 webcap 0.160545 webcap 0.160489 changem 0.266191 incalls 0.136913 incalls 0.137575 changer 0.243359 creditde 0.120039 creditde 0.12025 mou 0.156781 changem 0.110988 changem 0.111546 retaccpt 0.135313 changer 0.106593 outcalls 0.105707 phones 0.079818 outcalls 0.105258 unansvce 0.104361 Chi-Square Info Gain Gini eqpdays 1 mou 1 retcalls 1 retcalls 0.623101 changem 0.494791 eqpdays 0.804925 webcap 0.59594 eqpdays 0.489207 webcap 0.684583 creditde 0.515741 revenue 0.084119 changer 0.60161 mou 0.500321 changer 0.075769 creditde 0.498742 incalls 0.372892 unansvce 0.072891 months 0.487047 retaccpt 0.353905 outcalls 0.066249 changem 0.467689 phones 0.334674 incalls 0.031656 retaccpt 0.326418 outcalls 0.331368 blckvce 0.02 mou 0.215435 changem 0.327312 months 0.018341 callwait 0.182593 Deviation UncertaintyCorrelation eqpdays < 907 months < 44 mou < 1000 retcalls < 1 webcap NA NA creditde NA NA incalls < 42 outcalls < 95 changem > -500 < 500 retaccpt < 1 changer < 204 MASTER FILTER @95%
  • 3. 5. Decision Tree: The Gini Index was used within the decision tree. This corresponds to the weighting measure by the Gini Index. Decision trees are the least restrictive of all models and do not assume normal distributions. This is especially useful since some attributes had shown that the distribution of their values was subject to skewnewss. A trial and error process was used to maximize the parameters (shown below). Base Optimized
  • 4. Model1 Performance Training Validation Scoring Filters This model has a solid performance because the ‘No’ validation is well above 40% and the ‘Yes’ Validation has a relatively high validation of 76.95%. The balanced prediction of ‘Yes’ and ‘No’ in the scoring data can be held with reasonable confidence for ‘Yes’. The validating model actually performs better than the training data, which is an anomaly, but does further indicate its solid all-around performance. Five filters were chosen that removed outliers of highly weighted attributes. This process adequately scrubbed the data. More research could be conducted into individual responses that contain outlier values, which may boost both ‘Yes’ and ‘No’ validation performances. This is a very rigorous process, even with adequate RapidMiner operators, which pervades the scope of this course.
  • 5. Model2 Performance Training Validation Scoring Filters By simply removing ‘outcalls’ from the custom filter, the performance of the model drastically changed. This model can predict churn with 86% confidence, a 9% increase from the previous model. However, the retention prediction drops considerably (21%). Ultimately, the marginal gain in validation performance for churn skews this model and it appears that too many customers are predicted to will now churn.
  • 6. Model3 Performance Training Validation Scoring Filters When only selecting the 6 most important attributes and filtering them for them accordingly, the performance validation of this model reflects the higher ‘yes’ of Model 2 and the higher ‘no’ of Model 1, in relation to whether or not a customer will churn. Once again, the validation performance of retention seems to be too low, where the prediction is too far out of balance.
  • 7. Profileof ChurningCustomers Customers that churn are expected to have fewer days with their equipment and less months than loyal customers. They will not place calls to the retention team or accept retention offers. They are also more likely to have lower/poor credit.