SlideShare a Scribd company logo
Discriminant Analysis
Database Marketing
Instructor:Nanda Kumar
Multiple Regression
 Y = b0 + b1 X1 + b2 X2 + …+ bn Xn
 Same as Simple Regression in principle
 New Issues:
– Each Xi must represent something unique
– Variable selection
Multiple Regression
 Example 1:
– Spending = a + b income + c age
 Example 2:
– weight = a + b height + c sex + d age
Real Estate Example
 How is price related to the characteristics
of the house?
SAS Code
proc reg;
model price = section lotsize
bed bath age other;
run;
Interpreting the Regression Output
 Parameter Estimates or Slope Coefficients
capture the marginal impact of explanatory
variable on price
 Example: the coefficient of the variable
beds represents the impact of increasing
the number of bedrooms by one on price
Significance of the Coefficients
 Are they significantly different from zero?
– Look at the T values and p values
• T value higher than 1.8 or p<0.05 good
• Sometimes p<0.10 is considered reasonably
significant
 Overall Goodness of Fit
– Look at R2 (also refer to note in Session 1)
Where are we Now?
Behavior
Segment 1
Segment 2
Secondary
Data
Distinguishing
Characteristics
Targeting
Factor
Analysis Cluster
Analysis
Discriminant
/Logit
Analysis
Web Browsing
 Identified two groups of consumers
– One that visits your website frequently
– One that doesn’t
 Can the differences in behavior be related
to socio-demographic variables?
 Can we use these discriminators to classify
prospects into one of these two groups?
Catalog Business
 Identified two consumer segments
– One which buys a lot
– Other which does not buy as much
 Can we find variables that help discriminate the
behavior of these two groups?
 Can we use these discriminators to classify other
consumers into one of these two groups?
Promotional Campaigns
 Identify groups based on their response to
promotional campaigns
– One group purchases a lot on promotion
– Other does not
 Identify characteristics that distinguish these two
groups
 Can we use these discriminators to identify price
sensitive prospects from the not so price sensitive
ones?
Segmentation Analysis
 General Problem
– Identified segments in the population based on
behavior
– Want to find targetable characteristics that
discriminate these groups
– Classify prospects into different groups
Data
Stock # GE/A ROI Stock # GE/A ROI
1 0.158 0.182 13 -0.012 -0.031
2 0.21 0.206 14 0.036 0.053
3 0.207 0.188 15 0.038 0.036
4 0.28 0.236 16 -0.063 -0.074
5 0.197 0.193 17 -0.054 -0.119
6 0.227 0.173 18 0 -0.005
7 0.148 0.196 19 0.005 0.039
8 0.254 0.212 20 0.091 0.122
9 0.079 0.147 21 -0.036 -0.072
10 0.149 0.128 22 0.045 0.064
11 0.2 0.15 23 -0.026 -0.024
12 0.187 0.191 24 0.016 0.026
Good Stocks
Good Stocks
0
0.05
0.1
0.15
0.2
0.25
0 0.05 0.1 0.15 0.2 0.25 0.3
GE/A
ROI
ROI
Bad Stocks
Bad Stocks
-0.15
-0.1
-0.05
0
0.05
0.1
0.15
-0.1 -0.05 0 0.05 0.1
GE/A
ROI
ROI
All Stocks
All Stocks
-0.15
-0.1
-0.05
0
0.05
0.1
0.15
0.2
0.25
0.3
-0.1 0 0.1 0.2 0.3
GE/A
ROI
Identifying the Best Discriminators
 Two groups appear to be well separated on
each ratio: ROI and GE/A
 Also well separated in two dimensional
space
 But this need not always be the case!
Discriminating Variables
X1
X2
Discriminant Analysis
 Identify a set of variables that best
discriminate between the two groups
 Does so by choosing a new line that
maximizes the similarity between members
of the same group and minimizing the
similarity between members belonging to
different groups
Discriminant Function
Z = w1 GEA + w2 ROI
Between-Group Sum of Squares – SSb
Within-Group Sum of Squares – SSw
 = (SSb/SSw)
More on the Criterion
 For Z to provide maximum separation
between the groups, the following must be
satisfied:
– The means of Z for the two groups should be
as far apart as possible (or high SSb)
– Values of Z for each group should be as
homogenous as possible (or low SSw)
Classification
 Discriminant Function: The line that
separates the members of the two groups
 Methods of Classification
– Cut-Off Value Method
– Decision Theory Approach
– Classification Function Approach
– Mahalanobis Distance Method
Cut-Off Value Method
 Uses the Discriminant Function line to
score new observations (prospects) and
classify them into one of two groups based
on a cut-off value
Classification
Z
Cut-off
Value
R2 R1
Classification Function Approach
 Classifications based on this approach are
identical to those done by Decision Theory
approach
 Classification functions are computed for
each group:
C1 = -7.87 + 61.237*GEA + 21.027*ROI
C2 = -0.004 + 2.551*GEA – 1.404*ROI
Basic Idea
 Score each new observation using these
two scoring functions
 The observation gets assigned to the group
with the higher score
What To Look For In The Results?
 Significance of the Discriminating
Variables
– Idea is to test whether the means of the
discriminating variables are statistically
different across the two groups
– Statistic: Wilks’Lamda must be small (Look
for the p value/significance level)
Estimate of The Discriminant
Function
 Canonical Discriminant Function
Z = -2.0018 + 15.0919*GEA + 5.769*ROI
 It is possible that the group means are statistically
different even though for all practical purposes,
the differences between the groups may not be
large
 Look at the squared Canonical Correlation: ratio
of between group SS/Total SS (High is good)
Importance of the Discriminant Variables
and the Discriminant Function
 How important is a variable to the Discriminant
Function?
 Look at the structure loadings: Pooled Within
Canonical Structure
– Variable with the higher loading is relatively more
important
– Caution: If the variables are highly correlated relative
importance of the variables can change with sample
Classification Summary
 Look at Cross-Validation results
Web Browsing
 Can use the Discriminant function to
classify prospects into one of these two
groups
 Target Appropriately
Catalog Business
 Classify other consumers into one of these
two groups
 Do stuff!
Promotional Campaigns
 Classify Prospects into price sensitive and
not so price sensitive segments
 Target appropriately
Summary
 Discriminant Analysis
 Extremely Useful Segmentation Analysis
tool
 Intermediate step in the overall picture –
helps classify prospects and devise the
appropriate targeting strategies

More Related Content

Similar to DBMS4.ppt

Chotu scaling techniques
Chotu scaling techniquesChotu scaling techniques
Chotu scaling techniques
Pruseth Abhisek
 
Discriminant function analysis (DFA)
Discriminant function analysis (DFA)Discriminant function analysis (DFA)
Discriminant function analysis (DFA)
benazeer fathima
 
Measurement and scaling techniques
Measurement and scaling techniquesMeasurement and scaling techniques
Measurement and scaling techniques
Sarfaraz Ahmad
 
Discriminant analysis tehnik analisa Statistika Chap11.ppt
Discriminant analysis tehnik analisa Statistika  Chap11.pptDiscriminant analysis tehnik analisa Statistika  Chap11.ppt
Discriminant analysis tehnik analisa Statistika Chap11.ppt
wisnuragil2
 
Measurement and scaling techniques
Measurement  and  scaling  techniquesMeasurement  and  scaling  techniques
Measurement and scaling techniques
Ujjwal 'Shanu'
 
Measurement and scaling techniques
Measurement and scaling techniquesMeasurement and scaling techniques
Measurement and scaling techniques
Kritika Jain
 
ch 13.pptx
ch 13.pptxch 13.pptx
ch 13.pptx
ArfanAhmed22
 
A high level overview of all that is Analytics
A high level overview of all that is AnalyticsA high level overview of all that is Analytics
A high level overview of all that is Analytics
Ramkumar Ravichandran
 
discriminant analysis
discriminant analysisdiscriminant analysis
discriminant analysis
krishnadk
 
Unit 3 ppt.pptx
Unit 3 ppt.pptxUnit 3 ppt.pptx
Unit 3 ppt.pptx
Nirmalavenkatachalam
 
Machine learning session 10
Machine learning session 10Machine learning session 10
Machine learning session 10
NirsandhG
 
Measurement & Scaling
Measurement & ScalingMeasurement & Scaling
Measurement & Scaling
HareerahShoaib
 
Lecture 1: NBERMetrics
Lecture 1: NBERMetricsLecture 1: NBERMetrics
Lecture 1: NBERMetrics
NBER
 
Store segmentation progresso
Store segmentation progressoStore segmentation progresso
Store segmentation progresso
veesingh
 
Measurement and scaling
Measurement and scalingMeasurement and scaling
Machine learning Mind Map
Machine learning Mind MapMachine learning Mind Map
Machine learning Mind Map
Ashish Patel
 
Multiple discriminant analysis
Multiple discriminant analysisMultiple discriminant analysis
Multiple discriminant analysis
MUHAMMAD HASRATH
 
Spss software
Spss softwareSpss software
Spss software
ManonmaniA3
 
1645 track2 brandenburger_lempola
1645 track2 brandenburger_lempola1645 track2 brandenburger_lempola
1645 track2 brandenburger_lempola
Rising Media, Inc.
 
T21 conjoint analysis
T21 conjoint analysisT21 conjoint analysis
T21 conjoint analysis
kompellark
 

Similar to DBMS4.ppt (20)

Chotu scaling techniques
Chotu scaling techniquesChotu scaling techniques
Chotu scaling techniques
 
Discriminant function analysis (DFA)
Discriminant function analysis (DFA)Discriminant function analysis (DFA)
Discriminant function analysis (DFA)
 
Measurement and scaling techniques
Measurement and scaling techniquesMeasurement and scaling techniques
Measurement and scaling techniques
 
Discriminant analysis tehnik analisa Statistika Chap11.ppt
Discriminant analysis tehnik analisa Statistika  Chap11.pptDiscriminant analysis tehnik analisa Statistika  Chap11.ppt
Discriminant analysis tehnik analisa Statistika Chap11.ppt
 
Measurement and scaling techniques
Measurement  and  scaling  techniquesMeasurement  and  scaling  techniques
Measurement and scaling techniques
 
Measurement and scaling techniques
Measurement and scaling techniquesMeasurement and scaling techniques
Measurement and scaling techniques
 
ch 13.pptx
ch 13.pptxch 13.pptx
ch 13.pptx
 
A high level overview of all that is Analytics
A high level overview of all that is AnalyticsA high level overview of all that is Analytics
A high level overview of all that is Analytics
 
discriminant analysis
discriminant analysisdiscriminant analysis
discriminant analysis
 
Unit 3 ppt.pptx
Unit 3 ppt.pptxUnit 3 ppt.pptx
Unit 3 ppt.pptx
 
Machine learning session 10
Machine learning session 10Machine learning session 10
Machine learning session 10
 
Measurement & Scaling
Measurement & ScalingMeasurement & Scaling
Measurement & Scaling
 
Lecture 1: NBERMetrics
Lecture 1: NBERMetricsLecture 1: NBERMetrics
Lecture 1: NBERMetrics
 
Store segmentation progresso
Store segmentation progressoStore segmentation progresso
Store segmentation progresso
 
Measurement and scaling
Measurement and scalingMeasurement and scaling
Measurement and scaling
 
Machine learning Mind Map
Machine learning Mind MapMachine learning Mind Map
Machine learning Mind Map
 
Multiple discriminant analysis
Multiple discriminant analysisMultiple discriminant analysis
Multiple discriminant analysis
 
Spss software
Spss softwareSpss software
Spss software
 
1645 track2 brandenburger_lempola
1645 track2 brandenburger_lempola1645 track2 brandenburger_lempola
1645 track2 brandenburger_lempola
 
T21 conjoint analysis
T21 conjoint analysisT21 conjoint analysis
T21 conjoint analysis
 

Recently uploaded

原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
y3i0qsdzb
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
a9qfiubqu
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens""Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
sameer shah
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
Márton Kodok
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
AlessioFois2
 

Recently uploaded (20)

原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens""Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
 

DBMS4.ppt

  • 2. Multiple Regression  Y = b0 + b1 X1 + b2 X2 + …+ bn Xn  Same as Simple Regression in principle  New Issues: – Each Xi must represent something unique – Variable selection
  • 3. Multiple Regression  Example 1: – Spending = a + b income + c age  Example 2: – weight = a + b height + c sex + d age
  • 4. Real Estate Example  How is price related to the characteristics of the house?
  • 5. SAS Code proc reg; model price = section lotsize bed bath age other; run;
  • 6. Interpreting the Regression Output  Parameter Estimates or Slope Coefficients capture the marginal impact of explanatory variable on price  Example: the coefficient of the variable beds represents the impact of increasing the number of bedrooms by one on price
  • 7. Significance of the Coefficients  Are they significantly different from zero? – Look at the T values and p values • T value higher than 1.8 or p<0.05 good • Sometimes p<0.10 is considered reasonably significant  Overall Goodness of Fit – Look at R2 (also refer to note in Session 1)
  • 8. Where are we Now? Behavior Segment 1 Segment 2 Secondary Data Distinguishing Characteristics Targeting Factor Analysis Cluster Analysis Discriminant /Logit Analysis
  • 9. Web Browsing  Identified two groups of consumers – One that visits your website frequently – One that doesn’t  Can the differences in behavior be related to socio-demographic variables?  Can we use these discriminators to classify prospects into one of these two groups?
  • 10. Catalog Business  Identified two consumer segments – One which buys a lot – Other which does not buy as much  Can we find variables that help discriminate the behavior of these two groups?  Can we use these discriminators to classify other consumers into one of these two groups?
  • 11. Promotional Campaigns  Identify groups based on their response to promotional campaigns – One group purchases a lot on promotion – Other does not  Identify characteristics that distinguish these two groups  Can we use these discriminators to identify price sensitive prospects from the not so price sensitive ones?
  • 12. Segmentation Analysis  General Problem – Identified segments in the population based on behavior – Want to find targetable characteristics that discriminate these groups – Classify prospects into different groups
  • 13. Data Stock # GE/A ROI Stock # GE/A ROI 1 0.158 0.182 13 -0.012 -0.031 2 0.21 0.206 14 0.036 0.053 3 0.207 0.188 15 0.038 0.036 4 0.28 0.236 16 -0.063 -0.074 5 0.197 0.193 17 -0.054 -0.119 6 0.227 0.173 18 0 -0.005 7 0.148 0.196 19 0.005 0.039 8 0.254 0.212 20 0.091 0.122 9 0.079 0.147 21 -0.036 -0.072 10 0.149 0.128 22 0.045 0.064 11 0.2 0.15 23 -0.026 -0.024 12 0.187 0.191 24 0.016 0.026
  • 14. Good Stocks Good Stocks 0 0.05 0.1 0.15 0.2 0.25 0 0.05 0.1 0.15 0.2 0.25 0.3 GE/A ROI ROI
  • 17. Identifying the Best Discriminators  Two groups appear to be well separated on each ratio: ROI and GE/A  Also well separated in two dimensional space  But this need not always be the case!
  • 19. Discriminant Analysis  Identify a set of variables that best discriminate between the two groups  Does so by choosing a new line that maximizes the similarity between members of the same group and minimizing the similarity between members belonging to different groups
  • 20. Discriminant Function Z = w1 GEA + w2 ROI Between-Group Sum of Squares – SSb Within-Group Sum of Squares – SSw  = (SSb/SSw)
  • 21. More on the Criterion  For Z to provide maximum separation between the groups, the following must be satisfied: – The means of Z for the two groups should be as far apart as possible (or high SSb) – Values of Z for each group should be as homogenous as possible (or low SSw)
  • 22. Classification  Discriminant Function: The line that separates the members of the two groups  Methods of Classification – Cut-Off Value Method – Decision Theory Approach – Classification Function Approach – Mahalanobis Distance Method
  • 23. Cut-Off Value Method  Uses the Discriminant Function line to score new observations (prospects) and classify them into one of two groups based on a cut-off value
  • 25. Classification Function Approach  Classifications based on this approach are identical to those done by Decision Theory approach  Classification functions are computed for each group: C1 = -7.87 + 61.237*GEA + 21.027*ROI C2 = -0.004 + 2.551*GEA – 1.404*ROI
  • 26. Basic Idea  Score each new observation using these two scoring functions  The observation gets assigned to the group with the higher score
  • 27. What To Look For In The Results?  Significance of the Discriminating Variables – Idea is to test whether the means of the discriminating variables are statistically different across the two groups – Statistic: Wilks’Lamda must be small (Look for the p value/significance level)
  • 28. Estimate of The Discriminant Function  Canonical Discriminant Function Z = -2.0018 + 15.0919*GEA + 5.769*ROI  It is possible that the group means are statistically different even though for all practical purposes, the differences between the groups may not be large  Look at the squared Canonical Correlation: ratio of between group SS/Total SS (High is good)
  • 29. Importance of the Discriminant Variables and the Discriminant Function  How important is a variable to the Discriminant Function?  Look at the structure loadings: Pooled Within Canonical Structure – Variable with the higher loading is relatively more important – Caution: If the variables are highly correlated relative importance of the variables can change with sample
  • 30. Classification Summary  Look at Cross-Validation results
  • 31. Web Browsing  Can use the Discriminant function to classify prospects into one of these two groups  Target Appropriately
  • 32. Catalog Business  Classify other consumers into one of these two groups  Do stuff!
  • 33. Promotional Campaigns  Classify Prospects into price sensitive and not so price sensitive segments  Target appropriately
  • 34. Summary  Discriminant Analysis  Extremely Useful Segmentation Analysis tool  Intermediate step in the overall picture – helps classify prospects and devise the appropriate targeting strategies