SlideShare a Scribd company logo
Database	
  Marketing	
  and	
  CRM	
  –	
  
Analyzing	
  DONOR	
  data	
  set	
  	
  
Akanksha	
  Jain	
  
Project	
  Goals	
  
•  Goal:	
  Using	
  historical	
  data	
  set	
  DONOR_RAW,	
  develop	
  a	
  

model	
  which	
  can	
  predict	
  whether	
  the	
  prospect	
  will	
  
donate/	
  not	
  donate	
  

•  Scope:	
  DONOR_RAW	
  data	
  set	
  
•  50	
  Variables	
  
•  19,372	
  observaKons	
  
•  Dependent	
  Variable:	
  TARGET_B(Binary)	
  
•  Responder:	
  1	
  
•  Non-­‐Responder:	
  0	
  
TOOLS	
  
•  SAS	
  Enterprise	
  Miner	
  4.3	
  
•  SAS	
  9.3_M1	
  
Diagram 	
  	
  
Diagram	
  (con’t)	
  
Data	
  Source	
  
•  Reject	
  Variables:	
  
•  TARGET_D	
  (using	
  TARGET_B	
  as	
  target)	
  
•  ID	
  (an	
  id	
  number)	
  
•  WEALTH_RATING	
  (huge	
  no.	
  of	
  missing	
  values)	
  
•  Variable	
  TARGET_B	
  
•  Change	
  Role	
  to	
  TARGET	
  
•  Change	
  Order	
  to	
  DESCENDING	
  

•  Select	
  complete	
  data	
  set	
  as	
  Sample	
  
•  Set	
  Prior	
  ProbabiliKes	
  	
  
•  Responder:	
  0.05	
  
•  Non-­‐Responder:	
  0.95	
  
Data	
  Partition	
  
•  Train	
  –	
  60%	
  
•  Validate	
  –	
  25%	
  
•  Test	
  –	
  15%	
  
Variable	
  Transformation	
  
Taking	
  Log	
  TransformaKon	
  to	
  reduce	
  Skewness	
  
•  LIFETIME_GIFT_RANGE	
  
•  LIFETIME_MAX_GIFT_AMT	
  
•  LIFETIME_MIN_GIFT_AMT	
  
•  MOR_HIT_RATE	
  
•  FILE_AVG_GIFT	
  
•  LIFETIME_AVG_GIFT_AMT	
  
•  PCT_ATTRIBUTE1	
  
•  LAST_GIFT_AMT	
  
•  RECENT_AVG_GIFT_AMT	
  
	
  
Keep	
  all	
  variables,	
  original	
  and	
  log	
  transformaKons	
  
Model:	
  CHAID	
  
•  Nominal	
  Criterion:	
  Chi	
  Square	
  
•  Significance	
  Level:	
  0.1	
  
•  Minimum	
  number	
  of	
  observaKons	
  in	
  a	
  leaf	
  =	
  25	
  
•  ObservaKons	
  required	
  for	
  a	
  split	
  search	
  =	
  55	
  
•  Model	
  assessment	
  measure:	
  Total	
  Leaf	
  Impurity	
  (Gini	
  
Index)	
  
Model:	
  CHAID	
  (con’t)	
  
Model:	
  CHAID	
  (con’t)	
  
Inference:	
  
	
  FREQUENCY_STATUS_97NK	
  =	
  3	
  or	
  4;	
  
MONTHS_SINCE_LAST_GIFT	
  <	
  8.5	
  	
  
	
  1%	
  =	
  56%	
  

Less	
  MarkeKng	
  Effort	
  needed	
  as	
  most	
  likely	
  that	
  
they	
  will	
  donate	
  anyways	
  

FREQUENCY_STATUS_97NK	
  =	
  3	
  or	
  4;	
  
MONTHS_SINCE_LAST_GIFT	
  >=	
  8.5;	
  
NUMBER_PROM_12	
  <11.5	
  
1%	
  =	
  43%	
  
	
  

Will	
  also	
  donate	
  but	
  the	
  company	
  should	
  be	
  
careful	
  and	
  not	
  send	
  them	
  too	
  many	
  promoKons	
  

FREQUENCY_STATUS_97NK	
  =	
  3	
  or	
  4;	
  
MONTHS_SINCE_LAST_GIFT	
  >=	
  8.5;	
  
NUMBER_PROM_12	
  >=	
  11.5	
  
1%	
  =	
  30%	
  
	
  

Are	
  geong	
  too	
  many	
  promoKons;	
  and	
  hence	
  
company	
  should	
  cut	
  on	
  sending	
  them	
  
promoKons	
  

FREQUENCY_STATUS_97NK	
  =	
  1,	
  2	
  or	
  Missing	
  
1%	
  =	
  21%	
  

Study	
  them	
  more	
  closely	
  as	
  in	
  why	
  they	
  are	
  not	
  
donaKng,	
  what	
  other	
  factors	
  are	
  responsible	
  and	
  
then	
  decide	
  how	
  to	
  design	
  a	
  markeKng	
  
campaign	
  for	
  them.	
  
	
  
Variable	
  Selection	
  
•  Target	
  AssociaKons:	
  Select	
  Chi	
  Square	
  
Model:	
  Forward	
  Regression	
  
MODEL	
  OPTIONS	
  -­‐>	
  INPUT	
  CODING	
  -­‐>DEVIATION	
  	
  
SELECTION	
  METHOD	
  -­‐>	
  FORWARD	
  
CRITERIA	
  -­‐>	
  CROSS	
  VALIDATION	
  MISCLASSIFICATION	
  
ADVANCED	
  -­‐>	
  OPTIMIZATION	
  METHOD	
  -­‐>	
  NEWTON-­‐RAPHSON	
  
w/	
  LINE	
  SEARCH	
  
•  SL	
  Entry:	
  0.05	
  
• 
• 
• 
• 
Model:	
  Forward	
  Regression	
  
(con’t)	
  
Model:	
  Forward	
  Regression	
  
(con’t)	
  
Model:	
  Backward	
  Regression	
  
MODEL	
  OPTIONS	
  -­‐>	
  INPUT	
  CODING	
  -­‐>DEVIATION	
  	
  
SELECTION	
  METHOD	
  -­‐>	
  BACKWARD	
  
CRITERIA	
  -­‐>	
  CROSS	
  VALIDATION	
  MISCLASSIFICATION	
  
ADVANCED	
  -­‐>	
  OPTIMIZATION	
  METHOD	
  -­‐>	
  NEWTON-­‐RAPHSON	
  
w/	
  LINE	
  SEARCH	
  
•  SL	
  Stay:	
  0.05	
  
• 
• 
• 
• 
Model:	
  Backward	
  Regression	
  
(con’t)	
  
Model:	
  Backward	
  Regression	
  
(con’t)	
  
Model:	
  Stepwise	
  Regression	
  
MODEL	
  OPTIONS	
  -­‐>	
  INPUT	
  CODING	
  -­‐>DEVIATION	
  	
  
SELECTION	
  METHOD	
  -­‐>	
  STEPWISE	
  
CRITERIA	
  -­‐>	
  CROSS	
  VALIDATION	
  MISCLASSIFICATION	
  
ADVANCED	
  -­‐>	
  OPTIMIZATION	
  METHOD	
  -­‐>	
  NEWTON-­‐RAPHSON	
  
w/	
  LINE	
  SEARCH	
  
•  SL	
  Entry:	
  0.15	
  
•  SL	
  Stay:	
  0.05	
  
• 
• 
• 
• 
Model:	
  Stepwise	
  Regression	
  
(con’t)	
  
Model:	
  Stepwise	
  Regression	
  
(con’t)	
  
Variable	
  Comparison	
  
Forward	
  

Backward	
  

Stepwise	
  

FILE_CARD_GIFT	
  

FILE_CARD_GIFT	
  

FILE_CARD_GIFT	
  

FREQUENCY_STATUS_97NK	
  

FREQUENCY_STATUS_97NK	
  

FREQUENCY_STATUS_97NK	
  

INCOME_GROUP	
  

INCOME_GROUP	
  

INCOME_GROUP	
  

LIFE_AV9*	
  

LIFE_AV9*	
  

LIFE_AV9*	
  

MONTHS_SINCE_LAST_GIFT	
  

MONTHS_SINCE_LAST_GIFT	
  

MONTHS_SINCE_LAST_GIFT	
  

PEP_STAR	
  

PEP_STAR	
  

PEP_STAR	
  

LIFETIME_GIFT_AMOUNT	
  
MEDIAN_HOUSEHOLD_INCOME	
  
RECENT_RESPONSE_PROP	
  

*LIFE_AV9	
  is	
  the	
  log(LIFETIME_AVG_GIFT_AMOUNT)	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>FORWARD	
  	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>BACKWARD	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>FORWARD	
  	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  ANY	
  
Forward	
  versus	
  Backward	
  	
  
•  Variables:	
  

•  LIFETIME_GIFT_AMOUNT	
  
•  MEDIAN_HOUSEHOLD_INCOME	
  
•  RECENT_RESPONSE_PROP	
  

•  CorrelaKons:	
  
•  MEDIAN_HOUSEHOLD_INCOME	
  and	
  INCOME_GROUP	
  =	
  43%	
  
•  LIFE_AV9	
  and	
  LIFETIME_AVG_GIFT_AMT	
  =	
  83%	
  	
  
•  FILE_CARD_GIFT	
  and	
  RECENT_RESPONSE_PROP	
  =	
  30%	
  
Model:	
  Forward	
  +	
  
RECENT_RESPONSE_PROP	
  	
  	
  
•  Variable	
  SelecKon	
  (call	
  it	
  Variable_1Extra):	
  
• 
• 
• 
• 
• 
• 
• 

FILE_CARD_GIFT	
  
FREQUENCY_STATUS_97NK	
  
INCOME_GROUP	
  
LIFE_AV9	
  
MONTHS_SINCE_LAST_GIFT	
  
PEP_STAR	
  
RECENT_RESPONSE_PROP	
  

•  Reject	
  other	
  variables	
  manually	
  
•  Call	
  this	
  model	
  For_1Extra	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>For_1Extra	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  BACKWARD	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>For_1Extra	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  ANY	
  
	
  
Model:	
  Decision	
  
Final	
  Model:	
  FOR_1EXTRA	
  
	
  
Variables:	
  
• 
• 
• 
• 
• 
• 
• 

FILE_CARD_GIFT	
  
FREQUENCY_STATUS_97NK	
  
INCOME_GROUP	
  
LIFE_AV9	
  
MONTHS_SINCE_LAST_GIFT	
  
PEP_STAR	
  
RECENT_RESPONSE_PROP	
  
Interaction	
  Terms	
  
•  FREQ_PEP	
  =	
  FREQUENCY_STATUS_97NK	
  *	
  PEP_STAR	
  
•  FREQ_MONTH	
  =	
  FREQUENCY_STATUS_97NK	
  *	
  
MONTHS_SINCE_LAST_GIFT	
  	
  
•  FREQ_INCOME	
  =	
  FREQUENCY_STATUS_97NK	
  *	
  
INCOME_GROUP	
  
Model:	
  Forward	
  Regression	
  
with	
  Interaction	
  Terms	
  
Rename	
  model	
  as	
  FOR_1E_INT	
  
MODEL	
  OPTIONS	
  -­‐>	
  INPUT	
  CODING	
  -­‐>DEVIATION	
  	
  
SELECTION	
  METHOD	
  -­‐>	
  FORWARD	
  
CRITERIA	
  -­‐>	
  CROSS	
  VALIDATION	
  MISCLASSIFICATION	
  
ADVANCED	
  -­‐>	
  OPTIMIZATION	
  METHOD	
  -­‐>	
  NEWTON-­‐RAPHSON	
  
w/	
  LINE	
  SEARCH	
  
•  SL	
  Entry:	
  0.05	
  
• 
• 
• 
• 
• 
Model	
  FOR_1E_INT:	
  
Cumulative	
  LIFT	
  
Model	
  FOR_1E_INT:	
  Variable	
  
List	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>FOR_1E_INT	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  FOR_1EXTRA	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>ANY	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  FOR_1EXTRA	
  
Model:	
  For_1EXTRA	
  +	
  
Interaction	
  terms	
  	
  	
  
•  Variable	
  SelecKon	
  (call	
  it	
  Variable_UNION):	
  
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
	
  

FILE_CARD_GIFT	
  
FREQENCY_STATUS_97NK	
  
INCOME_GROUP	
  
LIFE_AV9	
  
MONTHS_SINCE_LAST_GIFT	
  
PEP_STAR	
  
RECENT_RESPONSE_PROP	
  
FREQ_PEP	
  	
  
FREQ_MONTH	
  	
  
FREQ_INCOME	
  

•  Reject	
  other	
  variables	
  manually	
  
•  Call	
  this	
  model	
  For_Union	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>	
  FOR_1E_INT	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  FOR_UNION	
  
Model	
  Comparison	
  (Test):	
  
Cumulative	
  LIFT	
  
Model	
  Comparison	
  (Test):	
  
Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>	
  ANY	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  FOR_UNION	
  
Model:	
  Decision	
  
Final	
  Model:	
  FOR_1EXTRA	
  because	
  	
  
•  No	
  significant	
  improvement	
  with	
  other	
  models	
  
•  InteracKon	
  terms	
  bring	
  along	
  complexity	
  
	
  
Variables:	
  
• 
• 
• 
• 
• 
• 
• 

FILE_CARD_GIFT	
  
FREQUENCY_STATUS_97NK	
  
INCOME_GROUP	
  
LIFE_AV9	
  
MONTHS_SINCE_LAST_GIFT	
  
PEP_STAR	
  
RECENT_RESPONSE_PROP	
  
Score:	
  On	
  Donor_Raw_Data	
  
THANK	
  YOU	
  

More Related Content

Similar to Predictive Model for Customer Segmentation using Database Marketing Techniques

Presentation top tips for getting optimal sql execution
Presentation    top tips for getting optimal sql executionPresentation    top tips for getting optimal sql execution
Presentation top tips for getting optimal sql execution
xKinAnx
 
Vertical Recommendation Using Collaborative Filtering
Vertical Recommendation Using Collaborative FilteringVertical Recommendation Using Collaborative Filtering
Vertical Recommendation Using Collaborative Filtering
gorass
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
ThinkInnovation
 
Company segmentation - an approach with R
Company segmentation - an approach with RCompany segmentation - an approach with R
Company segmentation - an approach with R
Casper Crause
 
Deepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn WayDeepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn Way
yingfeng
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
Aniket Patil
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
patilaniket2418
 
Maximizing a churn campaigns profitability with cost sensitive machine learning
Maximizing a churn campaigns profitability with cost sensitive machine learningMaximizing a churn campaigns profitability with cost sensitive machine learning
Maximizing a churn campaigns profitability with cost sensitive machine learning
Alejandro Correa Bahnsen, PhD
 
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
Big-Data-Summit
 
Lead Scoring Group Case Study Presentation.pdf
Lead Scoring Group Case Study Presentation.pdfLead Scoring Group Case Study Presentation.pdf
Lead Scoring Group Case Study Presentation.pdf
KrishP2
 
Monte Carlo Simulation for Trading System in AmiBroker
Monte Carlo Simulation for Trading System in AmiBrokerMonte Carlo Simulation for Trading System in AmiBroker
Monte Carlo Simulation for Trading System in AmiBroker
ThaiQuants
 
Phase 2 of Predicting Payment default on Vehicle Loan EMI
Phase 2 of Predicting Payment default on Vehicle Loan EMIPhase 2 of Predicting Payment default on Vehicle Loan EMI
Phase 2 of Predicting Payment default on Vehicle Loan EMI
Vikas Virani
 
Supply Chain Basic
Supply Chain BasicSupply Chain Basic
Supply Chain Basic
PARTHA DAS, Dip.SCM℗(ITC)
 
Machine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APACMachine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APAC
Minitab, LLC
 
1000 track2 boire
1000 track2 boire1000 track2 boire
1000 track2 boire
Rising Media, Inc.
 
The metrics that matter using scalability metrics for project planning of a d...
The metrics that matter using scalability metrics for project planning of a d...The metrics that matter using scalability metrics for project planning of a d...
The metrics that matter using scalability metrics for project planning of a d...
Mary Chan
 
Virtual segment brief
Virtual segment briefVirtual segment brief
Virtual segment brief
Ahmed Shaaban
 
1030 track1 heiler
1030 track1 heiler1030 track1 heiler
1030 track1 heiler
Rising Media, Inc.
 
Telecom Churn Analysis
Telecom Churn AnalysisTelecom Churn Analysis
Telecom Churn Analysis
Vasudev pendyala
 
Wooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit CustomersWooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit Customers
Lucinda Linde
 

Similar to Predictive Model for Customer Segmentation using Database Marketing Techniques (20)

Presentation top tips for getting optimal sql execution
Presentation    top tips for getting optimal sql executionPresentation    top tips for getting optimal sql execution
Presentation top tips for getting optimal sql execution
 
Vertical Recommendation Using Collaborative Filtering
Vertical Recommendation Using Collaborative FilteringVertical Recommendation Using Collaborative Filtering
Vertical Recommendation Using Collaborative Filtering
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Company segmentation - an approach with R
Company segmentation - an approach with RCompany segmentation - an approach with R
Company segmentation - an approach with R
 
Deepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn WayDeepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn Way
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Maximizing a churn campaigns profitability with cost sensitive machine learning
Maximizing a churn campaigns profitability with cost sensitive machine learningMaximizing a churn campaigns profitability with cost sensitive machine learning
Maximizing a churn campaigns profitability with cost sensitive machine learning
 
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
 
Lead Scoring Group Case Study Presentation.pdf
Lead Scoring Group Case Study Presentation.pdfLead Scoring Group Case Study Presentation.pdf
Lead Scoring Group Case Study Presentation.pdf
 
Monte Carlo Simulation for Trading System in AmiBroker
Monte Carlo Simulation for Trading System in AmiBrokerMonte Carlo Simulation for Trading System in AmiBroker
Monte Carlo Simulation for Trading System in AmiBroker
 
Phase 2 of Predicting Payment default on Vehicle Loan EMI
Phase 2 of Predicting Payment default on Vehicle Loan EMIPhase 2 of Predicting Payment default on Vehicle Loan EMI
Phase 2 of Predicting Payment default on Vehicle Loan EMI
 
Supply Chain Basic
Supply Chain BasicSupply Chain Basic
Supply Chain Basic
 
Machine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APACMachine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APAC
 
1000 track2 boire
1000 track2 boire1000 track2 boire
1000 track2 boire
 
The metrics that matter using scalability metrics for project planning of a d...
The metrics that matter using scalability metrics for project planning of a d...The metrics that matter using scalability metrics for project planning of a d...
The metrics that matter using scalability metrics for project planning of a d...
 
Virtual segment brief
Virtual segment briefVirtual segment brief
Virtual segment brief
 
1030 track1 heiler
1030 track1 heiler1030 track1 heiler
1030 track1 heiler
 
Telecom Churn Analysis
Telecom Churn AnalysisTelecom Churn Analysis
Telecom Churn Analysis
 
Wooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit CustomersWooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit Customers
 

Recently uploaded

QuickBooks Sync Manager Repair Tool- What You Need to Know
QuickBooks Sync Manager Repair Tool- What You Need to KnowQuickBooks Sync Manager Repair Tool- What You Need to Know
QuickBooks Sync Manager Repair Tool- What You Need to Know
markmargaret23
 
Influencer Marketing Master Class - Alexis Andreasik
Influencer Marketing Master Class - Alexis AndreasikInfluencer Marketing Master Class - Alexis Andreasik
Influencer Marketing Master Class - Alexis Andreasik
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
How To Navigate AI - The Future is Yours to Define - Tim Hayden
How To Navigate AI - The Future is Yours to Define - Tim HaydenHow To Navigate AI - The Future is Yours to Define - Tim Hayden
How To Navigate AI - The Future is Yours to Define - Tim Hayden
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
Story Telling Master Class - Jennifer Morilla
Story Telling Master Class - Jennifer MorillaStory Telling Master Class - Jennifer Morilla
No Cookies, No Problem - Steve Krull, Be Found Online
No Cookies, No Problem - Steve Krull, Be Found OnlineNo Cookies, No Problem - Steve Krull, Be Found Online
No Cookies, No Problem - Steve Krull, Be Found Online
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
janani Digital Marketer|Digital Marketing consultant|Marketing Promotion|Coim...
janani Digital Marketer|Digital Marketing consultant|Marketing Promotion|Coim...janani Digital Marketer|Digital Marketing consultant|Marketing Promotion|Coim...
janani Digital Marketer|Digital Marketing consultant|Marketing Promotion|Coim...
janudm24
 
Mastering The Best Restaurant Advertising Campaigns Detailed Guide
Mastering The Best Restaurant Advertising Campaigns Detailed GuideMastering The Best Restaurant Advertising Campaigns Detailed Guide
Mastering The Best Restaurant Advertising Campaigns Detailed Guide
Kopa Global Technologies
 
Mastering Your Online Visibility - Fernando Angulo
Mastering Your Online Visibility - Fernando AnguloMastering Your Online Visibility - Fernando Angulo
Etsy Marketing Guide - Tips For Selling Digital Products
Etsy Marketing Guide - Tips For Selling Digital ProductsEtsy Marketing Guide - Tips For Selling Digital Products
Etsy Marketing Guide - Tips For Selling Digital Products
kcblog21
 
Future-Proof Like Beyoncé - Syncing Email and Social Media for Iconic Brand L...
Future-Proof Like Beyoncé - Syncing Email and Social Media for Iconic Brand L...Future-Proof Like Beyoncé - Syncing Email and Social Media for Iconic Brand L...
Future-Proof Like Beyoncé - Syncing Email and Social Media for Iconic Brand L...
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
Efficient Website Management for Digital Marketing Pros
Efficient Website Management for Digital Marketing ProsEfficient Website Management for Digital Marketing Pros
Efficient Website Management for Digital Marketing Pros
Lauren Polinsky
 
Mastering SEO for Google in the AI Era - Dennis Yu
Mastering SEO for Google in the AI Era - Dennis YuMastering SEO for Google in the AI Era - Dennis Yu
Global Growth Starts With Translation - How To Unlock Global Markets - Tim Kirby
Global Growth Starts With Translation - How To Unlock Global Markets - Tim KirbyGlobal Growth Starts With Translation - How To Unlock Global Markets - Tim Kirby
Global Growth Starts With Translation - How To Unlock Global Markets - Tim Kirby
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
Get Off the Bandwagon - Separating Digital Marketing Myths from Truth - Scott...
Get Off the Bandwagon - Separating Digital Marketing Myths from Truth - Scott...Get Off the Bandwagon - Separating Digital Marketing Myths from Truth - Scott...
Get Off the Bandwagon - Separating Digital Marketing Myths from Truth - Scott...
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
How American Bath Group Leveraged Kontent
How American Bath Group Leveraged KontentHow American Bath Group Leveraged Kontent
PickUp_conversational AI_Capex, Inc._20240610
PickUp_conversational AI_Capex, Inc._20240610PickUp_conversational AI_Capex, Inc._20240610
PickUp_conversational AI_Capex, Inc._20240610
Shuntaro Kogame
 
Luxury Hanloom Saree Brand ,Capstone Project_Kiran Bansal.pdf
Luxury Hanloom Saree Brand ,Capstone Project_Kiran Bansal.pdfLuxury Hanloom Saree Brand ,Capstone Project_Kiran Bansal.pdf
Luxury Hanloom Saree Brand ,Capstone Project_Kiran Bansal.pdf
KiranRai75
 
Pillar-Based Marketing Master Class - Ryan Brock
Pillar-Based Marketing Master Class - Ryan BrockPillar-Based Marketing Master Class - Ryan Brock
Breaking Silos To Break Bank: Shattering The Divide Between Search And Social
Breaking Silos To Break Bank: Shattering The Divide Between Search And SocialBreaking Silos To Break Bank: Shattering The Divide Between Search And Social
Breaking Silos To Break Bank: Shattering The Divide Between Search And Social
Navah Hopkins
 
From Hope to Despair The Top 10 Reasons Businesses Ditch SEO Tactics.pptx
From Hope to Despair The Top 10 Reasons Businesses Ditch SEO Tactics.pptxFrom Hope to Despair The Top 10 Reasons Businesses Ditch SEO Tactics.pptx
From Hope to Despair The Top 10 Reasons Businesses Ditch SEO Tactics.pptx
Boston SEO Services
 

Recently uploaded (20)

QuickBooks Sync Manager Repair Tool- What You Need to Know
QuickBooks Sync Manager Repair Tool- What You Need to KnowQuickBooks Sync Manager Repair Tool- What You Need to Know
QuickBooks Sync Manager Repair Tool- What You Need to Know
 
Influencer Marketing Master Class - Alexis Andreasik
Influencer Marketing Master Class - Alexis AndreasikInfluencer Marketing Master Class - Alexis Andreasik
Influencer Marketing Master Class - Alexis Andreasik
 
How To Navigate AI - The Future is Yours to Define - Tim Hayden
How To Navigate AI - The Future is Yours to Define - Tim HaydenHow To Navigate AI - The Future is Yours to Define - Tim Hayden
How To Navigate AI - The Future is Yours to Define - Tim Hayden
 
Story Telling Master Class - Jennifer Morilla
Story Telling Master Class - Jennifer MorillaStory Telling Master Class - Jennifer Morilla
Story Telling Master Class - Jennifer Morilla
 
No Cookies, No Problem - Steve Krull, Be Found Online
No Cookies, No Problem - Steve Krull, Be Found OnlineNo Cookies, No Problem - Steve Krull, Be Found Online
No Cookies, No Problem - Steve Krull, Be Found Online
 
janani Digital Marketer|Digital Marketing consultant|Marketing Promotion|Coim...
janani Digital Marketer|Digital Marketing consultant|Marketing Promotion|Coim...janani Digital Marketer|Digital Marketing consultant|Marketing Promotion|Coim...
janani Digital Marketer|Digital Marketing consultant|Marketing Promotion|Coim...
 
Mastering The Best Restaurant Advertising Campaigns Detailed Guide
Mastering The Best Restaurant Advertising Campaigns Detailed GuideMastering The Best Restaurant Advertising Campaigns Detailed Guide
Mastering The Best Restaurant Advertising Campaigns Detailed Guide
 
Mastering Your Online Visibility - Fernando Angulo
Mastering Your Online Visibility - Fernando AnguloMastering Your Online Visibility - Fernando Angulo
Mastering Your Online Visibility - Fernando Angulo
 
Etsy Marketing Guide - Tips For Selling Digital Products
Etsy Marketing Guide - Tips For Selling Digital ProductsEtsy Marketing Guide - Tips For Selling Digital Products
Etsy Marketing Guide - Tips For Selling Digital Products
 
Future-Proof Like Beyoncé - Syncing Email and Social Media for Iconic Brand L...
Future-Proof Like Beyoncé - Syncing Email and Social Media for Iconic Brand L...Future-Proof Like Beyoncé - Syncing Email and Social Media for Iconic Brand L...
Future-Proof Like Beyoncé - Syncing Email and Social Media for Iconic Brand L...
 
Efficient Website Management for Digital Marketing Pros
Efficient Website Management for Digital Marketing ProsEfficient Website Management for Digital Marketing Pros
Efficient Website Management for Digital Marketing Pros
 
Mastering SEO for Google in the AI Era - Dennis Yu
Mastering SEO for Google in the AI Era - Dennis YuMastering SEO for Google in the AI Era - Dennis Yu
Mastering SEO for Google in the AI Era - Dennis Yu
 
Global Growth Starts With Translation - How To Unlock Global Markets - Tim Kirby
Global Growth Starts With Translation - How To Unlock Global Markets - Tim KirbyGlobal Growth Starts With Translation - How To Unlock Global Markets - Tim Kirby
Global Growth Starts With Translation - How To Unlock Global Markets - Tim Kirby
 
Get Off the Bandwagon - Separating Digital Marketing Myths from Truth - Scott...
Get Off the Bandwagon - Separating Digital Marketing Myths from Truth - Scott...Get Off the Bandwagon - Separating Digital Marketing Myths from Truth - Scott...
Get Off the Bandwagon - Separating Digital Marketing Myths from Truth - Scott...
 
How American Bath Group Leveraged Kontent
How American Bath Group Leveraged KontentHow American Bath Group Leveraged Kontent
How American Bath Group Leveraged Kontent
 
PickUp_conversational AI_Capex, Inc._20240610
PickUp_conversational AI_Capex, Inc._20240610PickUp_conversational AI_Capex, Inc._20240610
PickUp_conversational AI_Capex, Inc._20240610
 
Luxury Hanloom Saree Brand ,Capstone Project_Kiran Bansal.pdf
Luxury Hanloom Saree Brand ,Capstone Project_Kiran Bansal.pdfLuxury Hanloom Saree Brand ,Capstone Project_Kiran Bansal.pdf
Luxury Hanloom Saree Brand ,Capstone Project_Kiran Bansal.pdf
 
Pillar-Based Marketing Master Class - Ryan Brock
Pillar-Based Marketing Master Class - Ryan BrockPillar-Based Marketing Master Class - Ryan Brock
Pillar-Based Marketing Master Class - Ryan Brock
 
Breaking Silos To Break Bank: Shattering The Divide Between Search And Social
Breaking Silos To Break Bank: Shattering The Divide Between Search And SocialBreaking Silos To Break Bank: Shattering The Divide Between Search And Social
Breaking Silos To Break Bank: Shattering The Divide Between Search And Social
 
From Hope to Despair The Top 10 Reasons Businesses Ditch SEO Tactics.pptx
From Hope to Despair The Top 10 Reasons Businesses Ditch SEO Tactics.pptxFrom Hope to Despair The Top 10 Reasons Businesses Ditch SEO Tactics.pptx
From Hope to Despair The Top 10 Reasons Businesses Ditch SEO Tactics.pptx
 

Predictive Model for Customer Segmentation using Database Marketing Techniques

  • 1. Database  Marketing  and  CRM  –   Analyzing  DONOR  data  set     Akanksha  Jain  
  • 2. Project  Goals   •  Goal:  Using  historical  data  set  DONOR_RAW,  develop  a   model  which  can  predict  whether  the  prospect  will   donate/  not  donate   •  Scope:  DONOR_RAW  data  set   •  50  Variables   •  19,372  observaKons   •  Dependent  Variable:  TARGET_B(Binary)   •  Responder:  1   •  Non-­‐Responder:  0  
  • 3. TOOLS   •  SAS  Enterprise  Miner  4.3   •  SAS  9.3_M1  
  • 6. Data  Source   •  Reject  Variables:   •  TARGET_D  (using  TARGET_B  as  target)   •  ID  (an  id  number)   •  WEALTH_RATING  (huge  no.  of  missing  values)   •  Variable  TARGET_B   •  Change  Role  to  TARGET   •  Change  Order  to  DESCENDING   •  Select  complete  data  set  as  Sample   •  Set  Prior  ProbabiliKes     •  Responder:  0.05   •  Non-­‐Responder:  0.95  
  • 7. Data  Partition   •  Train  –  60%   •  Validate  –  25%   •  Test  –  15%  
  • 8. Variable  Transformation   Taking  Log  TransformaKon  to  reduce  Skewness   •  LIFETIME_GIFT_RANGE   •  LIFETIME_MAX_GIFT_AMT   •  LIFETIME_MIN_GIFT_AMT   •  MOR_HIT_RATE   •  FILE_AVG_GIFT   •  LIFETIME_AVG_GIFT_AMT   •  PCT_ATTRIBUTE1   •  LAST_GIFT_AMT   •  RECENT_AVG_GIFT_AMT     Keep  all  variables,  original  and  log  transformaKons  
  • 9. Model:  CHAID   •  Nominal  Criterion:  Chi  Square   •  Significance  Level:  0.1   •  Minimum  number  of  observaKons  in  a  leaf  =  25   •  ObservaKons  required  for  a  split  search  =  55   •  Model  assessment  measure:  Total  Leaf  Impurity  (Gini   Index)  
  • 11. Model:  CHAID  (con’t)   Inference:    FREQUENCY_STATUS_97NK  =  3  or  4;   MONTHS_SINCE_LAST_GIFT  <  8.5      1%  =  56%   Less  MarkeKng  Effort  needed  as  most  likely  that   they  will  donate  anyways   FREQUENCY_STATUS_97NK  =  3  or  4;   MONTHS_SINCE_LAST_GIFT  >=  8.5;   NUMBER_PROM_12  <11.5   1%  =  43%     Will  also  donate  but  the  company  should  be   careful  and  not  send  them  too  many  promoKons   FREQUENCY_STATUS_97NK  =  3  or  4;   MONTHS_SINCE_LAST_GIFT  >=  8.5;   NUMBER_PROM_12  >=  11.5   1%  =  30%     Are  geong  too  many  promoKons;  and  hence   company  should  cut  on  sending  them   promoKons   FREQUENCY_STATUS_97NK  =  1,  2  or  Missing   1%  =  21%   Study  them  more  closely  as  in  why  they  are  not   donaKng,  what  other  factors  are  responsible  and   then  decide  how  to  design  a  markeKng   campaign  for  them.    
  • 12. Variable  Selection   •  Target  AssociaKons:  Select  Chi  Square  
  • 13. Model:  Forward  Regression   MODEL  OPTIONS  -­‐>  INPUT  CODING  -­‐>DEVIATION     SELECTION  METHOD  -­‐>  FORWARD   CRITERIA  -­‐>  CROSS  VALIDATION  MISCLASSIFICATION   ADVANCED  -­‐>  OPTIMIZATION  METHOD  -­‐>  NEWTON-­‐RAPHSON   w/  LINE  SEARCH   •  SL  Entry:  0.05   •  •  •  • 
  • 14. Model:  Forward  Regression   (con’t)  
  • 15. Model:  Forward  Regression   (con’t)  
  • 16. Model:  Backward  Regression   MODEL  OPTIONS  -­‐>  INPUT  CODING  -­‐>DEVIATION     SELECTION  METHOD  -­‐>  BACKWARD   CRITERIA  -­‐>  CROSS  VALIDATION  MISCLASSIFICATION   ADVANCED  -­‐>  OPTIMIZATION  METHOD  -­‐>  NEWTON-­‐RAPHSON   w/  LINE  SEARCH   •  SL  Stay:  0.05   •  •  •  • 
  • 19. Model:  Stepwise  Regression   MODEL  OPTIONS  -­‐>  INPUT  CODING  -­‐>DEVIATION     SELECTION  METHOD  -­‐>  STEPWISE   CRITERIA  -­‐>  CROSS  VALIDATION  MISCLASSIFICATION   ADVANCED  -­‐>  OPTIMIZATION  METHOD  -­‐>  NEWTON-­‐RAPHSON   w/  LINE  SEARCH   •  SL  Entry:  0.15   •  SL  Stay:  0.05   •  •  •  • 
  • 22. Variable  Comparison   Forward   Backward   Stepwise   FILE_CARD_GIFT   FILE_CARD_GIFT   FILE_CARD_GIFT   FREQUENCY_STATUS_97NK   FREQUENCY_STATUS_97NK   FREQUENCY_STATUS_97NK   INCOME_GROUP   INCOME_GROUP   INCOME_GROUP   LIFE_AV9*   LIFE_AV9*   LIFE_AV9*   MONTHS_SINCE_LAST_GIFT   MONTHS_SINCE_LAST_GIFT   MONTHS_SINCE_LAST_GIFT   PEP_STAR   PEP_STAR   PEP_STAR   LIFETIME_GIFT_AMOUNT   MEDIAN_HOUSEHOLD_INCOME   RECENT_RESPONSE_PROP   *LIFE_AV9  is  the  log(LIFETIME_AVG_GIFT_AMOUNT)  
  • 23. Model  Comparison   (Validation):  Cumulative  LIFT    
  • 24. Model  Comparison   (Validation):  Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>FORWARD     •  Capture  top  30%  of  the  market  -­‐>BACKWARD  
  • 25. Model  Comparison  (TEST):   Cumulative  LIFT  
  • 26. Model  Comparison  (TEST):   Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>FORWARD     •  Capture  top  30%  of  the  market  -­‐>  ANY  
  • 27. Forward  versus  Backward     •  Variables:   •  LIFETIME_GIFT_AMOUNT   •  MEDIAN_HOUSEHOLD_INCOME   •  RECENT_RESPONSE_PROP   •  CorrelaKons:   •  MEDIAN_HOUSEHOLD_INCOME  and  INCOME_GROUP  =  43%   •  LIFE_AV9  and  LIFETIME_AVG_GIFT_AMT  =  83%     •  FILE_CARD_GIFT  and  RECENT_RESPONSE_PROP  =  30%  
  • 28. Model:  Forward  +   RECENT_RESPONSE_PROP       •  Variable  SelecKon  (call  it  Variable_1Extra):   •  •  •  •  •  •  •  FILE_CARD_GIFT   FREQUENCY_STATUS_97NK   INCOME_GROUP   LIFE_AV9   MONTHS_SINCE_LAST_GIFT   PEP_STAR   RECENT_RESPONSE_PROP   •  Reject  other  variables  manually   •  Call  this  model  For_1Extra  
  • 29. Model  Comparison   (Validation):  Cumulative  LIFT  
  • 30. Model  Comparison   (Validation):  Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>For_1Extra   •  Capture  top  30%  of  the  market  -­‐>  BACKWARD  
  • 31. Model  Comparison  (TEST):   Cumulative  LIFT  
  • 32. Model  Comparison  (TEST):   Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>For_1Extra   •  Capture  top  30%  of  the  market  -­‐>  ANY    
  • 33. Model:  Decision   Final  Model:  FOR_1EXTRA     Variables:   •  •  •  •  •  •  •  FILE_CARD_GIFT   FREQUENCY_STATUS_97NK   INCOME_GROUP   LIFE_AV9   MONTHS_SINCE_LAST_GIFT   PEP_STAR   RECENT_RESPONSE_PROP  
  • 34. Interaction  Terms   •  FREQ_PEP  =  FREQUENCY_STATUS_97NK  *  PEP_STAR   •  FREQ_MONTH  =  FREQUENCY_STATUS_97NK  *   MONTHS_SINCE_LAST_GIFT     •  FREQ_INCOME  =  FREQUENCY_STATUS_97NK  *   INCOME_GROUP  
  • 35. Model:  Forward  Regression   with  Interaction  Terms   Rename  model  as  FOR_1E_INT   MODEL  OPTIONS  -­‐>  INPUT  CODING  -­‐>DEVIATION     SELECTION  METHOD  -­‐>  FORWARD   CRITERIA  -­‐>  CROSS  VALIDATION  MISCLASSIFICATION   ADVANCED  -­‐>  OPTIMIZATION  METHOD  -­‐>  NEWTON-­‐RAPHSON   w/  LINE  SEARCH   •  SL  Entry:  0.05   •  •  •  •  • 
  • 38. Model  Comparison   (Validation):  Cumulative  LIFT  
  • 39. Model  Comparison   (Validation):  Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>FOR_1E_INT   •  Capture  top  30%  of  the  market  -­‐>  FOR_1EXTRA  
  • 40. Model  Comparison  (TEST):   Cumulative  LIFT  
  • 41. Model  Comparison  (TEST):   Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>ANY   •  Capture  top  30%  of  the  market  -­‐>  FOR_1EXTRA  
  • 42. Model:  For_1EXTRA  +   Interaction  terms       •  Variable  SelecKon  (call  it  Variable_UNION):   •  •  •  •  •  •  •  •  •  •    FILE_CARD_GIFT   FREQENCY_STATUS_97NK   INCOME_GROUP   LIFE_AV9   MONTHS_SINCE_LAST_GIFT   PEP_STAR   RECENT_RESPONSE_PROP   FREQ_PEP     FREQ_MONTH     FREQ_INCOME   •  Reject  other  variables  manually   •  Call  this  model  For_Union  
  • 43. Model  Comparison   (Validation):  Cumulative  LIFT  
  • 44. Model  Comparison   (Validation):  Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>  FOR_1E_INT   •  Capture  top  30%  of  the  market  -­‐>  FOR_UNION  
  • 45. Model  Comparison  (Test):   Cumulative  LIFT  
  • 46. Model  Comparison  (Test):   Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>  ANY   •  Capture  top  30%  of  the  market  -­‐>  FOR_UNION  
  • 47. Model:  Decision   Final  Model:  FOR_1EXTRA  because     •  No  significant  improvement  with  other  models   •  InteracKon  terms  bring  along  complexity     Variables:   •  •  •  •  •  •  •  FILE_CARD_GIFT   FREQUENCY_STATUS_97NK   INCOME_GROUP   LIFE_AV9   MONTHS_SINCE_LAST_GIFT   PEP_STAR   RECENT_RESPONSE_PROP