1. Name – Sanket V. Butoliya
UID – U95365115
Major - Business Analytics & Information Systems
Predicting Customer churn using WEKA 3.8
2. Introduction:
Data set Analysis Weka 3.8 Classification Accuracy Accuracy Plot
Problem Statement :
A Cellular service provider wants to analyze customer data to predict whether a
customer is going to churn and also identify what are the critical factors that are causing customers to
churn so that preventive actions could be taken based on these factors.
Flow of Presentation :
• Introduction
• Problem Statement
• Dataset Overview
• Data Analysis and Baseline deduction
• ZeroR
• Predictive model
• Decision Tree
• Neural networks
• Naive Bayes
• Ibk (KNN = 50)
• Visualization in Excel
• Conclusion
3. No. Attributes Information Values
1 College? Zero, One
2 Income Numeric
3 Overage Numeric
4 Leftover Numeric
5 House value Numeric
6 Handset price Numeric
7 Avg long calls Numeric
8 Avg duration Numeric
9 Satisfaction Avg, Sat, Unsat, Very_sat, Very_unsat
10 Usage level Avg, high, little, very_high, Very_little
11 Considering change?
No, considering, perhaps, never_thought,
actively_looking_into_it
12 Retained? STAY, LEAVE
Predicted attribute = Customer
Retained?
Number of Instances = 5000
Number of Attributes = 12
Missing Attribute Values = None
Data set Analysis Weka 3.8 Classification Accuracy Accuracy Plot