SlideShare a Scribd company logo
Accurate Campaign Targeting Using Classification 
Algorithms 
CS 229 Project: Final Report 
Jieming Wei Sharon Zhang 
Introduction 
Contact 
<your name> 
<your organization> 
Email: 
Website: 
Phone: 
Methods 
misclassification errors with different nets 
misclassification errors with different decay level 
1. 
2. 
3. 
4. 
5. 
6. 
7. 
8. 
9. 
Many organizations prospect for loyal supporters and 
donors by sending direct mail appeals. This is an 
effective way to build a large base, but can be very 
expensive and have a low efficiency. Eliminating likely 
non-donors is the key to running an efficient 
Prospecting program and ultimately to pursuing the 
mission of the organization, especially when there’s 
limited funding (which is common in most campaigns). 
We built a binary classification model to help 
organizations to build their donor bases in the most 
efficient way possible. We aimed to help them separate 
the likely donors from the non-likely donors and target 
at the most receptive donors by sending direct mail 
campaigns to the likely donors. 
We choose to use Neural Network for this experiment, 
since it has the lowest false positive rate. There are three 
steps – first choose the number of neural nets and the 
threshold for the structural model, then choose the optimal 
regularization parameter under this optimized structural 
model, and lastly choose the threshold for false positive 
minimization. 
We did a looping experiment with 10 nets, with 20 threshold 
values ranging from -0.2 to 0.2. Applying averages to rows 
and columns, we choose the lowest average in both 
column and row to be the number of nets and threshold 
value. We chose optimal threshold of -0.14, 8 layers, decay 
parameter of 0.5. 
Data Preparation 
We tried the following approaches to classify mailings into 
two classes, namely donors who are likely to donate and 
those who are not: Multivariate Regression, SVM, Decision 
Tree, Neural Network, and Random Forest. Training errors , 
test errors, false positive error rates, false negative error 
rates are calculated for each model. 
We noticed that test errors for most models are relatively 
high, with MVR and Random Forest models having high 
variance problems, and SVM having high bias problems. 
False positive errors are high for most models, with Neural 
Net model having the least false positive error. We want to 
minimize false positive error so as to save costs for 
campaign companies. 
To find a threshold between false positice and false 
negative errors, we use the following formula: 
Net Collected $=Donation $*Donors M-Package Cost 
$*Packages Sent (N) 
Where Donation ($) and Package Cost($) are assumptions, 
Donors (M) corresponds to the number of positive 
occurrences, and Packages Sent (N) corresponds to the 
sum of positive and false-positive occurrences since the 
model predicts those prospects will donate. When we use 
the assumptions Donation = $50 and Package Cost = $20, 
we observe the following Net Collected amount trend in 
chart #. 
Thus, the most cost-effective threshold is r=-0.1, with false-positive 
rate 18%. 
Result Savings 
The original dataset is 13-dimensional with 1048575 
observations on demographic features of prospects and 
campaign attributes. In order to serve the purpose of this 
study, we adapted the dataset, reducing it to a 10- 
dimensional dataset to keep the relevant features and 
selected training set and test set. 
Feature selection 
We selected features based on their relevance to the result 
of the campaign, estimated by the correlation of a given 
feature to the result. We selected 7 features with relatively 
large correlation with the result. 
Neural Nets Algorithm 
Chart 1. 
REPLACE THIS BOX 
WITH YOUR 
ORGANIZATION’S 
HIGH RESOLUTION 
LOGO 
REPLACE THIS BOX 
WITH YOUR 
ORGANIZATION’S 
HIGH RESOLUTION 
LOGO 
Training 
error 
Overall 
test error 
False 
positive 
False 
negative 
MVR 0 0.31 0.25 0.06 
SVM 0.3233 0.54 0.18 0.36 
Neural 
Nets 
-- 0.3787 0.1742 0.2045 
Random 
Forest 
0.0033 0.47 0.23 0.24 
company 
type 
project 
type 
cause1 cause2 phase zip donor 
income 
result 
-0.016 -0.0008 -0.0765 -0.0416 0.0149 0.071 0.248 1 
Company Type Histogram 
company type 
Frequency 
1 2 3 4 5 6 
0 20 40 60 80 100 
Cause1 Histogram 
candidate/non-profit 
Frequency 
1.0 1.2 1.4 1.6 1.8 2.0 
0 50 100 150 200 
Project Type Histogram 
project type 
Frequency 
0 2 4 6 8 10 
0 10 30 50 
Cause2 Histogram 
causes 
Frequency 
2 4 6 8 10 
0 20 40 60 80 100 
Phase Histogram 
phase 
Frequency 
0 5 10 15 20 
0 20 40 60 80 100 
Income Histogram 
natural log of income 
Frequency 
10 11 12 13 14 15 16 17 
0 10 20 30 40 50 60 
-0.2 -0.1 0.0 0.1 
40 45 50 55 60 
misclassification errors with different thresholds 
thresholds 
percentage of misclassification error 
threshold selected 
2 4 6 8 10 
44 46 48 50 
nets 
percentage of misclassification error 
nets selected 
0.0 0.2 0.4 0.6 0.8 1.0 
38 40 42 44 
decay 
percentage of misclassification error 
decay level selected 
70 
60 
50 
40 
30 
20 
10 
100 
80 
60 
40 
20 
0 
-20 
-40 
-60 
Classification Results with Different Threshold Values 
r=-0.3, r=-0.28, r=-0.26, r=-0.24, r=-0.22, r=-0.2, r=-0.18, r=-0.16, r=-0.14, r=-0.12, r=-0.1, r=-0.08, r=-0.06, r=-0.04, r=-0.02, r=0 
Net Collected $ with Different Threshold Values 
0 
2 
7 
10 10 11 
13 14 
17 17 18 19 
21 22 23 23 
0 
r=-0.3, r=-0.28, r=-0.26, r=-0.24, r=-0.22, r=-0.2, r=-0.18, r=-0.16, r=-0.14, r=-0.12, r=-0.1, r=-0.08, r=-0.06, r=-0.04, r=-0.02, r=0 
positive false-negative false-positive negative

More Related Content

What's hot

Anomaly Detection in Seasonal Time Series
Anomaly Detection in Seasonal Time SeriesAnomaly Detection in Seasonal Time Series
Anomaly Detection in Seasonal Time Series
Humberto Marchezi
 
Fraud Detection by Stacking Cost-Sensitive Decision Trees
Fraud Detection by Stacking Cost-Sensitive Decision TreesFraud Detection by Stacking Cost-Sensitive Decision Trees
Fraud Detection by Stacking Cost-Sensitive Decision Trees
Alejandro Correa Bahnsen, PhD
 
Uplift Modeling Workshop
Uplift Modeling WorkshopUplift Modeling Workshop
Uplift Modeling Workshop
odsc
 
PredictingYelpReviews
PredictingYelpReviewsPredictingYelpReviews
PredictingYelpReviews
Gary Giust
 
Maximizing a churn campaign’s profitability with cost sensitive predictive an...
Maximizing a churn campaign’s profitability with cost sensitive predictive an...Maximizing a churn campaign’s profitability with cost sensitive predictive an...
Maximizing a churn campaign’s profitability with cost sensitive predictive an...
Alejandro Correa Bahnsen, PhD
 
Outlier analysis and anomaly detection
Outlier analysis and anomaly detectionOutlier analysis and anomaly detection
Outlier analysis and anomaly detection
ShantanuDeosthale
 
Handling Imbalanced Data: SMOTE vs. Random Undersampling
Handling Imbalanced Data: SMOTE vs. Random UndersamplingHandling Imbalanced Data: SMOTE vs. Random Undersampling
Handling Imbalanced Data: SMOTE vs. Random Undersampling
IRJET Journal
 
A review of net lift models
A review of net lift modelsA review of net lift models
A review of net lift models
Zixia Wang
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Md. Main Uddin Rony
 
Anomaly detection workshop
Anomaly detection workshopAnomaly detection workshop
Anomaly detection workshop
gforgovind
 
Outliers
OutliersOutliers
Uplift Modelling as a Tool for Making Causal Inferences at Shopify - Mojan Hamed
Uplift Modelling as a Tool for Making Causal Inferences at Shopify - Mojan HamedUplift Modelling as a Tool for Making Causal Inferences at Shopify - Mojan Hamed
Uplift Modelling as a Tool for Making Causal Inferences at Shopify - Mojan Hamed
Rising Media Ltd.
 
Module 2: Machine Learning Deep Dive
Module 2:  Machine Learning Deep DiveModule 2:  Machine Learning Deep Dive
Module 2: Machine Learning Deep Dive
Sara Hooker
 
Counterfactual Learning for Recommendation
Counterfactual Learning for RecommendationCounterfactual Learning for Recommendation
Counterfactual Learning for Recommendation
Olivier Jeunen
 
L2. Evaluating Machine Learning Algorithms I
L2. Evaluating Machine Learning Algorithms IL2. Evaluating Machine Learning Algorithms I
L2. Evaluating Machine Learning Algorithms I
Machine Learning Valencia
 
Rouault sfn2014
Rouault sfn2014 Rouault sfn2014
Rouault sfn2014
Clemente Thorez
 
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROBOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
Anthony Kilili
 
Introduction to Uplift Modelling
Introduction to Uplift ModellingIntroduction to Uplift Modelling
Introduction to Uplift Modelling
Pierre Gutierrez
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision Trees
Sara Hooker
 
Outlier Detection
Outlier DetectionOutlier Detection
Outlier Detection
Dr. Abdul Ahad Abro
 

What's hot (20)

Anomaly Detection in Seasonal Time Series
Anomaly Detection in Seasonal Time SeriesAnomaly Detection in Seasonal Time Series
Anomaly Detection in Seasonal Time Series
 
Fraud Detection by Stacking Cost-Sensitive Decision Trees
Fraud Detection by Stacking Cost-Sensitive Decision TreesFraud Detection by Stacking Cost-Sensitive Decision Trees
Fraud Detection by Stacking Cost-Sensitive Decision Trees
 
Uplift Modeling Workshop
Uplift Modeling WorkshopUplift Modeling Workshop
Uplift Modeling Workshop
 
PredictingYelpReviews
PredictingYelpReviewsPredictingYelpReviews
PredictingYelpReviews
 
Maximizing a churn campaign’s profitability with cost sensitive predictive an...
Maximizing a churn campaign’s profitability with cost sensitive predictive an...Maximizing a churn campaign’s profitability with cost sensitive predictive an...
Maximizing a churn campaign’s profitability with cost sensitive predictive an...
 
Outlier analysis and anomaly detection
Outlier analysis and anomaly detectionOutlier analysis and anomaly detection
Outlier analysis and anomaly detection
 
Handling Imbalanced Data: SMOTE vs. Random Undersampling
Handling Imbalanced Data: SMOTE vs. Random UndersamplingHandling Imbalanced Data: SMOTE vs. Random Undersampling
Handling Imbalanced Data: SMOTE vs. Random Undersampling
 
A review of net lift models
A review of net lift modelsA review of net lift models
A review of net lift models
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
 
Anomaly detection workshop
Anomaly detection workshopAnomaly detection workshop
Anomaly detection workshop
 
Outliers
OutliersOutliers
Outliers
 
Uplift Modelling as a Tool for Making Causal Inferences at Shopify - Mojan Hamed
Uplift Modelling as a Tool for Making Causal Inferences at Shopify - Mojan HamedUplift Modelling as a Tool for Making Causal Inferences at Shopify - Mojan Hamed
Uplift Modelling as a Tool for Making Causal Inferences at Shopify - Mojan Hamed
 
Module 2: Machine Learning Deep Dive
Module 2:  Machine Learning Deep DiveModule 2:  Machine Learning Deep Dive
Module 2: Machine Learning Deep Dive
 
Counterfactual Learning for Recommendation
Counterfactual Learning for RecommendationCounterfactual Learning for Recommendation
Counterfactual Learning for Recommendation
 
L2. Evaluating Machine Learning Algorithms I
L2. Evaluating Machine Learning Algorithms IL2. Evaluating Machine Learning Algorithms I
L2. Evaluating Machine Learning Algorithms I
 
Rouault sfn2014
Rouault sfn2014 Rouault sfn2014
Rouault sfn2014
 
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROBOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
 
Introduction to Uplift Modelling
Introduction to Uplift ModellingIntroduction to Uplift Modelling
Introduction to Uplift Modelling
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision Trees
 
Outlier Detection
Outlier DetectionOutlier Detection
Outlier Detection
 

Similar to Accurate Campaign Targeting Using Classification - Poster

MIS637_Final_Project_Rahul_Bhatia
MIS637_Final_Project_Rahul_BhatiaMIS637_Final_Project_Rahul_Bhatia
MIS637_Final_Project_Rahul_Bhatia
Rahul Bhatia
 
MyPoster1
MyPoster1MyPoster1
MyPoster1
Jamie Schreader
 
Project crm submission sonali
Project crm submission sonaliProject crm submission sonali
Project crm submission sonali
Sonali Gupta
 
Credit Risk Evaluation Model
Credit Risk Evaluation ModelCredit Risk Evaluation Model
Credit Risk Evaluation Model
Mihai Enescu
 
IRJET- Credit Card Fraud Detection Analysis
IRJET- Credit Card Fraud Detection AnalysisIRJET- Credit Card Fraud Detection Analysis
IRJET- Credit Card Fraud Detection Analysis
IRJET Journal
 
Top 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdfTop 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdf
Datacademy.ai
 
Mining Credit Card Defults
Mining Credit Card DefultsMining Credit Card Defults
Mining Credit Card Defults
Krunal Khatri
 
Insurance Optimization
Insurance OptimizationInsurance Optimization
Insurance Optimization
Albert Chu
 
Calculating a Sample Size
Calculating a Sample SizeCalculating a Sample Size
Calculating a Sample Size
Matt Hansen
 
Summer 2015 Internship
Summer 2015 InternshipSummer 2015 Internship
Summer 2015 Internship
Taylor Martell
 
Predict Backorder on a supply chain data for an Organization
Predict Backorder on a supply chain data for an OrganizationPredict Backorder on a supply chain data for an Organization
Predict Backorder on a supply chain data for an Organization
Piyush Srivastava
 
A04 Sample Size
A04 Sample SizeA04 Sample Size
A04 Sample Size
Leanleaders.org
 
A04 Sample Size
A04 Sample SizeA04 Sample Size
A04 Sample Size
Leanleaders.org
 
Heart disease classification
Heart disease classificationHeart disease classification
Heart disease classification
SnehaDey21
 
CREDIT_CARD.ppt
CREDIT_CARD.pptCREDIT_CARD.ppt
CREDIT_CARD.ppt
Balasubramani Manickam
 
LSSGB_Project_SimpliLearn.ppt
LSSGB_Project_SimpliLearn.pptLSSGB_Project_SimpliLearn.ppt
LSSGB_Project_SimpliLearn.ppt
Mash92
 
Application of Machine Learning in Agriculture
Application of Machine  Learning in AgricultureApplication of Machine  Learning in Agriculture
Application of Machine Learning in Agriculture
Aman Vasisht
 
An Introduction to boosting
An Introduction to boostingAn Introduction to boosting
An Introduction to boosting
butest
 
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
55296
 
Artelnics
ArtelnicsArtelnics
Artelnics
Neural Designer
 

Similar to Accurate Campaign Targeting Using Classification - Poster (20)

MIS637_Final_Project_Rahul_Bhatia
MIS637_Final_Project_Rahul_BhatiaMIS637_Final_Project_Rahul_Bhatia
MIS637_Final_Project_Rahul_Bhatia
 
MyPoster1
MyPoster1MyPoster1
MyPoster1
 
Project crm submission sonali
Project crm submission sonaliProject crm submission sonali
Project crm submission sonali
 
Credit Risk Evaluation Model
Credit Risk Evaluation ModelCredit Risk Evaluation Model
Credit Risk Evaluation Model
 
IRJET- Credit Card Fraud Detection Analysis
IRJET- Credit Card Fraud Detection AnalysisIRJET- Credit Card Fraud Detection Analysis
IRJET- Credit Card Fraud Detection Analysis
 
Top 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdfTop 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdf
 
Mining Credit Card Defults
Mining Credit Card DefultsMining Credit Card Defults
Mining Credit Card Defults
 
Insurance Optimization
Insurance OptimizationInsurance Optimization
Insurance Optimization
 
Calculating a Sample Size
Calculating a Sample SizeCalculating a Sample Size
Calculating a Sample Size
 
Summer 2015 Internship
Summer 2015 InternshipSummer 2015 Internship
Summer 2015 Internship
 
Predict Backorder on a supply chain data for an Organization
Predict Backorder on a supply chain data for an OrganizationPredict Backorder on a supply chain data for an Organization
Predict Backorder on a supply chain data for an Organization
 
A04 Sample Size
A04 Sample SizeA04 Sample Size
A04 Sample Size
 
A04 Sample Size
A04 Sample SizeA04 Sample Size
A04 Sample Size
 
Heart disease classification
Heart disease classificationHeart disease classification
Heart disease classification
 
CREDIT_CARD.ppt
CREDIT_CARD.pptCREDIT_CARD.ppt
CREDIT_CARD.ppt
 
LSSGB_Project_SimpliLearn.ppt
LSSGB_Project_SimpliLearn.pptLSSGB_Project_SimpliLearn.ppt
LSSGB_Project_SimpliLearn.ppt
 
Application of Machine Learning in Agriculture
Application of Machine  Learning in AgricultureApplication of Machine  Learning in Agriculture
Application of Machine Learning in Agriculture
 
An Introduction to boosting
An Introduction to boostingAn Introduction to boosting
An Introduction to boosting
 
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
 
Artelnics
ArtelnicsArtelnics
Artelnics
 

Recently uploaded

Embedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoringEmbedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoring
IJECEIAES
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
nooriasukmaningtyas
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
ihlasbinance2003
 
Question paper of renewable energy sources
Question paper of renewable energy sourcesQuestion paper of renewable energy sources
Question paper of renewable energy sources
mahammadsalmanmech
 
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdfIron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
RadiNasr
 
digital fundamental by Thomas L.floydl.pdf
digital fundamental by Thomas L.floydl.pdfdigital fundamental by Thomas L.floydl.pdf
digital fundamental by Thomas L.floydl.pdf
drwaing
 
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
awadeshbabu
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
IJNSA Journal
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
IJECEIAES
 
Exception Handling notes in java exception
Exception Handling notes in java exceptionException Handling notes in java exception
Exception Handling notes in java exception
Ratnakar Mikkili
 
Low power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniquesLow power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniques
nooriasukmaningtyas
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
gestioneergodomus
 
bank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdfbank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdf
Divyam548318
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
kandramariana6
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
camseq
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
ClaraZara1
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
anoopmanoharan2
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 

Recently uploaded (20)

Embedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoringEmbedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoring
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
 
Question paper of renewable energy sources
Question paper of renewable energy sourcesQuestion paper of renewable energy sources
Question paper of renewable energy sources
 
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdfIron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
 
digital fundamental by Thomas L.floydl.pdf
digital fundamental by Thomas L.floydl.pdfdigital fundamental by Thomas L.floydl.pdf
digital fundamental by Thomas L.floydl.pdf
 
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
 
Exception Handling notes in java exception
Exception Handling notes in java exceptionException Handling notes in java exception
Exception Handling notes in java exception
 
Low power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniquesLow power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniques
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
 
bank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdfbank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdf
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 

Accurate Campaign Targeting Using Classification - Poster

  • 1. Accurate Campaign Targeting Using Classification Algorithms CS 229 Project: Final Report Jieming Wei Sharon Zhang Introduction Contact <your name> <your organization> Email: Website: Phone: Methods misclassification errors with different nets misclassification errors with different decay level 1. 2. 3. 4. 5. 6. 7. 8. 9. Many organizations prospect for loyal supporters and donors by sending direct mail appeals. This is an effective way to build a large base, but can be very expensive and have a low efficiency. Eliminating likely non-donors is the key to running an efficient Prospecting program and ultimately to pursuing the mission of the organization, especially when there’s limited funding (which is common in most campaigns). We built a binary classification model to help organizations to build their donor bases in the most efficient way possible. We aimed to help them separate the likely donors from the non-likely donors and target at the most receptive donors by sending direct mail campaigns to the likely donors. We choose to use Neural Network for this experiment, since it has the lowest false positive rate. There are three steps – first choose the number of neural nets and the threshold for the structural model, then choose the optimal regularization parameter under this optimized structural model, and lastly choose the threshold for false positive minimization. We did a looping experiment with 10 nets, with 20 threshold values ranging from -0.2 to 0.2. Applying averages to rows and columns, we choose the lowest average in both column and row to be the number of nets and threshold value. We chose optimal threshold of -0.14, 8 layers, decay parameter of 0.5. Data Preparation We tried the following approaches to classify mailings into two classes, namely donors who are likely to donate and those who are not: Multivariate Regression, SVM, Decision Tree, Neural Network, and Random Forest. Training errors , test errors, false positive error rates, false negative error rates are calculated for each model. We noticed that test errors for most models are relatively high, with MVR and Random Forest models having high variance problems, and SVM having high bias problems. False positive errors are high for most models, with Neural Net model having the least false positive error. We want to minimize false positive error so as to save costs for campaign companies. To find a threshold between false positice and false negative errors, we use the following formula: Net Collected $=Donation $*Donors M-Package Cost $*Packages Sent (N) Where Donation ($) and Package Cost($) are assumptions, Donors (M) corresponds to the number of positive occurrences, and Packages Sent (N) corresponds to the sum of positive and false-positive occurrences since the model predicts those prospects will donate. When we use the assumptions Donation = $50 and Package Cost = $20, we observe the following Net Collected amount trend in chart #. Thus, the most cost-effective threshold is r=-0.1, with false-positive rate 18%. Result Savings The original dataset is 13-dimensional with 1048575 observations on demographic features of prospects and campaign attributes. In order to serve the purpose of this study, we adapted the dataset, reducing it to a 10- dimensional dataset to keep the relevant features and selected training set and test set. Feature selection We selected features based on their relevance to the result of the campaign, estimated by the correlation of a given feature to the result. We selected 7 features with relatively large correlation with the result. Neural Nets Algorithm Chart 1. REPLACE THIS BOX WITH YOUR ORGANIZATION’S HIGH RESOLUTION LOGO REPLACE THIS BOX WITH YOUR ORGANIZATION’S HIGH RESOLUTION LOGO Training error Overall test error False positive False negative MVR 0 0.31 0.25 0.06 SVM 0.3233 0.54 0.18 0.36 Neural Nets -- 0.3787 0.1742 0.2045 Random Forest 0.0033 0.47 0.23 0.24 company type project type cause1 cause2 phase zip donor income result -0.016 -0.0008 -0.0765 -0.0416 0.0149 0.071 0.248 1 Company Type Histogram company type Frequency 1 2 3 4 5 6 0 20 40 60 80 100 Cause1 Histogram candidate/non-profit Frequency 1.0 1.2 1.4 1.6 1.8 2.0 0 50 100 150 200 Project Type Histogram project type Frequency 0 2 4 6 8 10 0 10 30 50 Cause2 Histogram causes Frequency 2 4 6 8 10 0 20 40 60 80 100 Phase Histogram phase Frequency 0 5 10 15 20 0 20 40 60 80 100 Income Histogram natural log of income Frequency 10 11 12 13 14 15 16 17 0 10 20 30 40 50 60 -0.2 -0.1 0.0 0.1 40 45 50 55 60 misclassification errors with different thresholds thresholds percentage of misclassification error threshold selected 2 4 6 8 10 44 46 48 50 nets percentage of misclassification error nets selected 0.0 0.2 0.4 0.6 0.8 1.0 38 40 42 44 decay percentage of misclassification error decay level selected 70 60 50 40 30 20 10 100 80 60 40 20 0 -20 -40 -60 Classification Results with Different Threshold Values r=-0.3, r=-0.28, r=-0.26, r=-0.24, r=-0.22, r=-0.2, r=-0.18, r=-0.16, r=-0.14, r=-0.12, r=-0.1, r=-0.08, r=-0.06, r=-0.04, r=-0.02, r=0 Net Collected $ with Different Threshold Values 0 2 7 10 10 11 13 14 17 17 18 19 21 22 23 23 0 r=-0.3, r=-0.28, r=-0.26, r=-0.24, r=-0.22, r=-0.2, r=-0.18, r=-0.16, r=-0.14, r=-0.12, r=-0.1, r=-0.08, r=-0.06, r=-0.04, r=-0.02, r=0 positive false-negative false-positive negative