SlideShare a Scribd company logo
1 of 20
Linear Discriminant Analysis and Its
Variations
Abu Minhajuddin
CSE 8331
Department of Statistical Science
Southern Methodist University
April 27, 2002
Plan…
The Problem
Linear Discriminant Analysis
Quadratic Discriminant Analysis
Other Extensions
Evaluation of the Method
An Example
Summary
The Problem…
Classification Problem:
Data on p predictors
Unknown group membership
Training Situation:
Data on p predictors,
Membership of one of g groups
The Problem…
Sepal Width
PetalWidth
2.0 2.5 3.0 3.5 4.0
0.51.01.52.02.5
Fisher’s Iris Data: Identify the three species?
Linear Discriminant Analysis
Classify the item x at hand to one of J groups based on
measurements on p predictors.
Rule: Assign x to group j that has the closest mean
j = 1, 2, …, J
Distance Measure: Mahalanobis Distance….
Takes the spread of the data into Consideration
Linear Discriminant Analysis
Distance Measure:
For j = 1, 2, …, J, compute
( ) ( ) ( )xSxd jpl
T
jj
xxx −−=
−1
Assign x to the group for which dj is minimum
Spl is the pooled estimate of the covariance
matrix
Linear Discriminant AnalysisLinear Discriminant Analysis
…or equivalently, assign x to the group
for which
( ) xSxSxL jpl
T
jpl
T
jj
xx
11
2
1 −−
−=
is a maximum.
(Notice the linear form of the equation!)
Linear Discriminant Analysis
…optimal if….
• Multivariate normal distribution for the
observation in each of the groups
• Equal covariance matrix for all groups
• Equal prior probability for each group
• Equal costs for misclassification
Linear Discriminant Analysis
Relaxing the assumption of equal prior
probabilities…
( ) xSxSxpL jpl
T
jpl
T
j
j
xjx
11
2
1
ln −−
−= +
p j being the prior probability for the jth
group.
Linear Discriminant Analysis
Relaxing the assumption of equal covariance
matrices…
( )
( ) ( )jx
jj
x
xxSx
SpQ
j
T
j
j
−
−
−
−−
=
1
ln
2
1
ln
result?…Quadratic Discriminant Analysis
Quadratic Discriminant Analysis
Rule: assign to group j if is the
largest.
( )xQj
Optimal if the J groups of
measurements are multivariate normal
Other Extensions & Related Methods
Relaxing the assumption of normality…
Kernel density based LDA and QDA
Other extensions…..
Regularized discriminant analysis
Penalized discriminant analysis
Flexible discriminant analysis
Other Extensions & Related Methods
Related Methods:
Logistic regression for binary classification
Multinomial logistic regression
These methods models the probability of
being in a class as a linear function of the
predictor.
Evaluations of the Methods
Classification Table (confusion matrix)
Actual
group
Number of
observations
Predicted group
A B
A
B
nA
nB
n11
n21
n12
n22
Evaluations of the Methods
Apparent Error Rate (APER):
APER = # misclassified/Total # of cases
….underestimates the actual error rate.
Improved estimate of APER:
Holdout Method or cross validation
An Example: Fisher’s Iris Data
Actual
Group
Number of
Observations
Predicted Group
Setosa Versicolo
r
Virginica
Setosa
Versicolor
Virginica
50
50
50
50
0
0
0
48
1
0
2
49
Table 1: Linear Discriminant Analysis
(APER = 0.0200)
An Example: Fisher’s Iris DataAn Example: Fisher’s Iris Data
Actual
Group
Number of
Observations
Predicted Group
Setosa Versicolo
r
Virginica
Setosa
Versicolor
Virginica
50
50
50
50
0
0
0
47
1
0
3
49
Table 1: Quadratic Discriminant Analysis
(APER = 0.0267)
An Example: Fisher’s Iris Data
Sepal Width
PetalWidth
2.0 2.5 3.0 3.5 4.0
0.51.01.52.02.5
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
v
vv
v
v
vv v
v
v
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
v
v
vvv
v
v
vv
v
v
v
vv
v
vvvv
v
vvv
v
vv
v
v
v
v
vv
v
vvv
v
vv
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
c
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
v
v
vvv
v
v
vv
v
v
v
vv
v
vvvv
v
vvv
v
vv
v
v
v
v
vv
v
vvv
v
vv
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
c
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
v
vv
v
v
vv v
v
v
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
v
v
vvv
v
v
vv
v
v
v
vv
v
vvvv
v
vvv
v
vv
v
v
v
v
vv
v
vvv
v
vv
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
c
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
v
vv
v
v
vv v
v
v
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
v
v
vvv
v
v
vv
v
v
v
vv
v
vvvv
v
vvv
v
vv
v
v
v
v
vv
v
vvv
v
vv
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
c
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
v
vv
v
v
vv v
v
v
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
An Example: Fisher’s Iris Data
Sepal Width
PetalWidth
2.0 2.5 3.0 3.5 4.0
0.51.01.52.02.5
ss ss s
s
s
ss
s
ss
ss
s
ss
s ss
s
s
s
s
ss
s
ssss
s
s
ss s s
s
s s
ss
s
s
s
s
ss ss
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
v
vv
v
v
vv v
v
v
v
v
v
v
v
v
v
vv
v
v
v
v
v
v
v
v
v
v
v
c
cc
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
c
cc
c
c
c
c
c
c
c
c
c
c
c cc
c
c
c
c
c
c
cc
c
c
Sepal Width
PetalWidth
2.0 2.5 3.0 3.5 4.0
0.51.01.52.02.5
++ ++ +
+
+
++
+
++
++
+
++
+ ++
+
+
+
+
++
+
++++
+
+
++ + +
+
+ +
++
+
+
+
+
++ ++
o
o
o
o
o
o
o
o
o
ooo
o
o
oo
o
o
o
oo
o
oooo
o
ooo
o
oo
o
o
o
o
oo
o
ooo
o
oo
o
o
o
o
o
x
xx
x
x
x
x
x
x
x
x
x
x
x
x
x
x
x
x
x
x
x
x
x
xx
x
x
x
x
x
x
x
x
x
x xx
x
x
x
x
x
x
xx
x
x
x
Summary
LDA is a powerful tool available for
classification.
Widely implemented through various
software
Theoretical properties well researched
SAS implementation available for
large data sets.

More Related Content

What's hot

Data Mining: Concepts and Techniques — Chapter 2 —
Data Mining:  Concepts and Techniques — Chapter 2 —Data Mining:  Concepts and Techniques — Chapter 2 —
Data Mining: Concepts and Techniques — Chapter 2 —Salah Amean
 
NON-PARAMETRIC TESTS by Prajakta Sawant
NON-PARAMETRIC TESTS by Prajakta SawantNON-PARAMETRIC TESTS by Prajakta Sawant
NON-PARAMETRIC TESTS by Prajakta SawantPRAJAKTASAWANT33
 
Biostatistical methods
Biostatistical methodsBiostatistical methods
Biostatistical methodsPrakkan Hillol
 
T test^jsample size^j ethics
T test^jsample size^j ethicsT test^jsample size^j ethics
T test^jsample size^j ethicsAbhishek Thakur
 
Data mining technique (decision tree)
Data mining technique (decision tree)Data mining technique (decision tree)
Data mining technique (decision tree)Shweta Ghate
 
PG STAT 531 Lecture 4 Exploratory Data Analysis
PG STAT 531 Lecture 4 Exploratory Data AnalysisPG STAT 531 Lecture 4 Exploratory Data Analysis
PG STAT 531 Lecture 4 Exploratory Data AnalysisAashish Patel
 
Performance evaluation of hepatitis diagnosis using single and multi classifi...
Performance evaluation of hepatitis diagnosis using single and multi classifi...Performance evaluation of hepatitis diagnosis using single and multi classifi...
Performance evaluation of hepatitis diagnosis using single and multi classifi...ahmedbohy
 
Cluster spss week7
Cluster spss week7Cluster spss week7
Cluster spss week7Birat Sharma
 
Sample size calculation - a brief overview
Sample size calculation - a brief overviewSample size calculation - a brief overview
Sample size calculation - a brief overviewAzmi Mohd Tamil
 
Cluster Analysis Assignment 2013-2014(2)
Cluster Analysis Assignment 2013-2014(2)Cluster Analysis Assignment 2013-2014(2)
Cluster Analysis Assignment 2013-2014(2)TIEZHENG YUAN
 
Evaluating clinical librarian services: a systematic review
Evaluating clinical librarian services: a systematic reviewEvaluating clinical librarian services: a systematic review
Evaluating clinical librarian services: a systematic reviewAlison Brettle
 
Cluster analysis using spss
Cluster analysis using spssCluster analysis using spss
Cluster analysis using spssDr Nisha Arora
 
Comparative study of ksvdd and fsvm for classification of mislabeled data
Comparative study of ksvdd and fsvm for classification of mislabeled dataComparative study of ksvdd and fsvm for classification of mislabeled data
Comparative study of ksvdd and fsvm for classification of mislabeled dataeSAT Journals
 
PMM23 Week 3 Lectures
PMM23 Week 3 LecturesPMM23 Week 3 Lectures
PMM23 Week 3 Lecturespdiddyboy2
 
Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015pdiddyboy2
 

What's hot (20)

Data Mining: Concepts and Techniques — Chapter 2 —
Data Mining:  Concepts and Techniques — Chapter 2 —Data Mining:  Concepts and Techniques — Chapter 2 —
Data Mining: Concepts and Techniques — Chapter 2 —
 
NON-PARAMETRIC TESTS by Prajakta Sawant
NON-PARAMETRIC TESTS by Prajakta SawantNON-PARAMETRIC TESTS by Prajakta Sawant
NON-PARAMETRIC TESTS by Prajakta Sawant
 
Biostatistical methods
Biostatistical methodsBiostatistical methods
Biostatistical methods
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
T test^jsample size^j ethics
T test^jsample size^j ethicsT test^jsample size^j ethics
T test^jsample size^j ethics
 
Student t test
Student t testStudent t test
Student t test
 
Cluster Validation
Cluster ValidationCluster Validation
Cluster Validation
 
Data mining technique (decision tree)
Data mining technique (decision tree)Data mining technique (decision tree)
Data mining technique (decision tree)
 
Parametric tests
Parametric testsParametric tests
Parametric tests
 
PG STAT 531 Lecture 4 Exploratory Data Analysis
PG STAT 531 Lecture 4 Exploratory Data AnalysisPG STAT 531 Lecture 4 Exploratory Data Analysis
PG STAT 531 Lecture 4 Exploratory Data Analysis
 
Performance evaluation of hepatitis diagnosis using single and multi classifi...
Performance evaluation of hepatitis diagnosis using single and multi classifi...Performance evaluation of hepatitis diagnosis using single and multi classifi...
Performance evaluation of hepatitis diagnosis using single and multi classifi...
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Cluster spss week7
Cluster spss week7Cluster spss week7
Cluster spss week7
 
Sample size calculation - a brief overview
Sample size calculation - a brief overviewSample size calculation - a brief overview
Sample size calculation - a brief overview
 
Cluster Analysis Assignment 2013-2014(2)
Cluster Analysis Assignment 2013-2014(2)Cluster Analysis Assignment 2013-2014(2)
Cluster Analysis Assignment 2013-2014(2)
 
Evaluating clinical librarian services: a systematic review
Evaluating clinical librarian services: a systematic reviewEvaluating clinical librarian services: a systematic review
Evaluating clinical librarian services: a systematic review
 
Cluster analysis using spss
Cluster analysis using spssCluster analysis using spss
Cluster analysis using spss
 
Comparative study of ksvdd and fsvm for classification of mislabeled data
Comparative study of ksvdd and fsvm for classification of mislabeled dataComparative study of ksvdd and fsvm for classification of mislabeled data
Comparative study of ksvdd and fsvm for classification of mislabeled data
 
PMM23 Week 3 Lectures
PMM23 Week 3 LecturesPMM23 Week 3 Lectures
PMM23 Week 3 Lectures
 
Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015
 

Similar to Abu

Discriminant Analysis-lecture 8
Discriminant Analysis-lecture 8Discriminant Analysis-lecture 8
Discriminant Analysis-lecture 8Laila Fatehy
 
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...IIRindia
 
Basic course for computer based methods
Basic course for computer based methodsBasic course for computer based methods
Basic course for computer based methodsimprovemed
 
Basic course on computer-based methods
Basic course on computer-based methodsBasic course on computer-based methods
Basic course on computer-based methodsimprovemed
 
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
Maxim Kazantsev
 
Data mining classifiers.
Data mining classifiers.Data mining classifiers.
Data mining classifiers.ShwetaPatil174
 
An Overview and Application of Discriminant Analysis in Data Analysis
An Overview and Application of Discriminant Analysis in Data AnalysisAn Overview and Application of Discriminant Analysis in Data Analysis
An Overview and Application of Discriminant Analysis in Data AnalysisIOSR Journals
 
Anova n metaanalysis
Anova n metaanalysisAnova n metaanalysis
Anova n metaanalysisutpal sharma
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selectionchenhm
 
Decision Support Analyss for Software Effort Estimation by Analogy
Decision Support Analyss for Software Effort Estimation by AnalogyDecision Support Analyss for Software Effort Estimation by Analogy
Decision Support Analyss for Software Effort Estimation by AnalogyTim Menzies
 
Multiple comparison - Descriptive Statistic
Multiple comparison - Descriptive StatisticMultiple comparison - Descriptive Statistic
Multiple comparison - Descriptive StatisticAbegailSotto
 

Similar to Abu (20)

Discriminant Analysis-lecture 8
Discriminant Analysis-lecture 8Discriminant Analysis-lecture 8
Discriminant Analysis-lecture 8
 
Gene expression profiling ii
Gene expression profiling  iiGene expression profiling  ii
Gene expression profiling ii
 
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy C...
 
Classification Continued
Classification ContinuedClassification Continued
Classification Continued
 
Classification Continued
Classification ContinuedClassification Continued
Classification Continued
 
Basic course for computer based methods
Basic course for computer based methodsBasic course for computer based methods
Basic course for computer based methods
 
Basic course on computer-based methods
Basic course on computer-based methodsBasic course on computer-based methods
Basic course on computer-based methods
 
Clustering
ClusteringClustering
Clustering
 
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

 
Igert glmm
Igert glmmIgert glmm
Igert glmm
 
Bayes ML.ppt
Bayes ML.pptBayes ML.ppt
Bayes ML.ppt
 
Data mining classifiers.
Data mining classifiers.Data mining classifiers.
Data mining classifiers.
 
An Overview and Application of Discriminant Analysis in Data Analysis
An Overview and Application of Discriminant Analysis in Data AnalysisAn Overview and Application of Discriminant Analysis in Data Analysis
An Overview and Application of Discriminant Analysis in Data Analysis
 
Anova n metaanalysis
Anova n metaanalysisAnova n metaanalysis
Anova n metaanalysis
 
Discriminant analysis.pptx
Discriminant analysis.pptxDiscriminant analysis.pptx
Discriminant analysis.pptx
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
 
Di35605610
Di35605610Di35605610
Di35605610
 
Decision Support Analyss for Software Effort Estimation by Analogy
Decision Support Analyss for Software Effort Estimation by AnalogyDecision Support Analyss for Software Effort Estimation by Analogy
Decision Support Analyss for Software Effort Estimation by Analogy
 
Multiple comparison - Descriptive Statistic
Multiple comparison - Descriptive StatisticMultiple comparison - Descriptive Statistic
Multiple comparison - Descriptive Statistic
 
KNN.pptx
KNN.pptxKNN.pptx
KNN.pptx
 

Recently uploaded

Research Methodolgy & Intellectual Property Rights Series 2
Research Methodolgy & Intellectual Property Rights Series 2Research Methodolgy & Intellectual Property Rights Series 2
Research Methodolgy & Intellectual Property Rights Series 2T.D. Shashikala
 
Fabrication Of Automatic Star Delta Starter Using Relay And GSM Module By Utk...
Fabrication Of Automatic Star Delta Starter Using Relay And GSM Module By Utk...Fabrication Of Automatic Star Delta Starter Using Relay And GSM Module By Utk...
Fabrication Of Automatic Star Delta Starter Using Relay And GSM Module By Utk...ShivamTiwari995432
 
Electrical shop management system project report.pdf
Electrical shop management system project report.pdfElectrical shop management system project report.pdf
Electrical shop management system project report.pdfKamal Acharya
 
Dairy management system project report..pdf
Dairy management system project report..pdfDairy management system project report..pdf
Dairy management system project report..pdfKamal Acharya
 
Introduction to Heat Exchangers: Principle, Types and Applications
Introduction to Heat Exchangers: Principle, Types and ApplicationsIntroduction to Heat Exchangers: Principle, Types and Applications
Introduction to Heat Exchangers: Principle, Types and ApplicationsKineticEngineeringCo
 
2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edgePaco Orozco
 
Lesson no16 application of Induction Generator in Wind.ppsx
Lesson no16 application of Induction Generator in Wind.ppsxLesson no16 application of Induction Generator in Wind.ppsx
Lesson no16 application of Induction Generator in Wind.ppsxmichaelprrior
 
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxSLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxCHAIRMAN M
 
Supermarket billing system project report..pdf
Supermarket billing system project report..pdfSupermarket billing system project report..pdf
Supermarket billing system project report..pdfKamal Acharya
 
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisSeismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisDr.Costas Sachpazis
 
ROAD CONSTRUCTION PRESENTATION.PPTX.pptx
ROAD CONSTRUCTION PRESENTATION.PPTX.pptxROAD CONSTRUCTION PRESENTATION.PPTX.pptx
ROAD CONSTRUCTION PRESENTATION.PPTX.pptxGagandeepKaur617299
 
E-Commerce Shopping using MERN Stack where different modules are present
E-Commerce Shopping using MERN Stack where different modules are presentE-Commerce Shopping using MERN Stack where different modules are present
E-Commerce Shopping using MERN Stack where different modules are presentjatinraor66
 
Introduction to Artificial Intelligence and History of AI
Introduction to Artificial Intelligence and History of AIIntroduction to Artificial Intelligence and History of AI
Introduction to Artificial Intelligence and History of AISheetal Jain
 
15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon15-Minute City: A Completely New Horizon
15-Minute City: A Completely New HorizonMorshed Ahmed Rahath
 
DR PROF ING GURUDUTT SAHNI WIKIPEDIA.pdf
DR PROF ING GURUDUTT SAHNI WIKIPEDIA.pdfDR PROF ING GURUDUTT SAHNI WIKIPEDIA.pdf
DR PROF ING GURUDUTT SAHNI WIKIPEDIA.pdfDrGurudutt
 
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdfONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdfKamal Acharya
 
RM&IPR M5 notes.pdfResearch Methodolgy & Intellectual Property Rights Series 5
RM&IPR M5 notes.pdfResearch Methodolgy & Intellectual Property Rights Series 5RM&IPR M5 notes.pdfResearch Methodolgy & Intellectual Property Rights Series 5
RM&IPR M5 notes.pdfResearch Methodolgy & Intellectual Property Rights Series 5T.D. Shashikala
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfJNTUA
 
Intelligent Agents, A discovery on How A Rational Agent Acts
Intelligent Agents, A discovery on How A Rational Agent ActsIntelligent Agents, A discovery on How A Rational Agent Acts
Intelligent Agents, A discovery on How A Rational Agent ActsSheetal Jain
 
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdfRESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdfKamal Acharya
 

Recently uploaded (20)

Research Methodolgy & Intellectual Property Rights Series 2
Research Methodolgy & Intellectual Property Rights Series 2Research Methodolgy & Intellectual Property Rights Series 2
Research Methodolgy & Intellectual Property Rights Series 2
 
Fabrication Of Automatic Star Delta Starter Using Relay And GSM Module By Utk...
Fabrication Of Automatic Star Delta Starter Using Relay And GSM Module By Utk...Fabrication Of Automatic Star Delta Starter Using Relay And GSM Module By Utk...
Fabrication Of Automatic Star Delta Starter Using Relay And GSM Module By Utk...
 
Electrical shop management system project report.pdf
Electrical shop management system project report.pdfElectrical shop management system project report.pdf
Electrical shop management system project report.pdf
 
Dairy management system project report..pdf
Dairy management system project report..pdfDairy management system project report..pdf
Dairy management system project report..pdf
 
Introduction to Heat Exchangers: Principle, Types and Applications
Introduction to Heat Exchangers: Principle, Types and ApplicationsIntroduction to Heat Exchangers: Principle, Types and Applications
Introduction to Heat Exchangers: Principle, Types and Applications
 
2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge2024 DevOps Pro Europe - Growing at the edge
2024 DevOps Pro Europe - Growing at the edge
 
Lesson no16 application of Induction Generator in Wind.ppsx
Lesson no16 application of Induction Generator in Wind.ppsxLesson no16 application of Induction Generator in Wind.ppsx
Lesson no16 application of Induction Generator in Wind.ppsx
 
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxSLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
 
Supermarket billing system project report..pdf
Supermarket billing system project report..pdfSupermarket billing system project report..pdf
Supermarket billing system project report..pdf
 
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisSeismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
 
ROAD CONSTRUCTION PRESENTATION.PPTX.pptx
ROAD CONSTRUCTION PRESENTATION.PPTX.pptxROAD CONSTRUCTION PRESENTATION.PPTX.pptx
ROAD CONSTRUCTION PRESENTATION.PPTX.pptx
 
E-Commerce Shopping using MERN Stack where different modules are present
E-Commerce Shopping using MERN Stack where different modules are presentE-Commerce Shopping using MERN Stack where different modules are present
E-Commerce Shopping using MERN Stack where different modules are present
 
Introduction to Artificial Intelligence and History of AI
Introduction to Artificial Intelligence and History of AIIntroduction to Artificial Intelligence and History of AI
Introduction to Artificial Intelligence and History of AI
 
15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon
 
DR PROF ING GURUDUTT SAHNI WIKIPEDIA.pdf
DR PROF ING GURUDUTT SAHNI WIKIPEDIA.pdfDR PROF ING GURUDUTT SAHNI WIKIPEDIA.pdf
DR PROF ING GURUDUTT SAHNI WIKIPEDIA.pdf
 
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdfONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
 
RM&IPR M5 notes.pdfResearch Methodolgy & Intellectual Property Rights Series 5
RM&IPR M5 notes.pdfResearch Methodolgy & Intellectual Property Rights Series 5RM&IPR M5 notes.pdfResearch Methodolgy & Intellectual Property Rights Series 5
RM&IPR M5 notes.pdfResearch Methodolgy & Intellectual Property Rights Series 5
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdf
 
Intelligent Agents, A discovery on How A Rational Agent Acts
Intelligent Agents, A discovery on How A Rational Agent ActsIntelligent Agents, A discovery on How A Rational Agent Acts
Intelligent Agents, A discovery on How A Rational Agent Acts
 
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdfRESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
 

Abu

  • 1. Linear Discriminant Analysis and Its Variations Abu Minhajuddin CSE 8331 Department of Statistical Science Southern Methodist University April 27, 2002
  • 2. Plan… The Problem Linear Discriminant Analysis Quadratic Discriminant Analysis Other Extensions Evaluation of the Method An Example Summary
  • 3. The Problem… Classification Problem: Data on p predictors Unknown group membership Training Situation: Data on p predictors, Membership of one of g groups
  • 4. The Problem… Sepal Width PetalWidth 2.0 2.5 3.0 3.5 4.0 0.51.01.52.02.5 Fisher’s Iris Data: Identify the three species?
  • 5. Linear Discriminant Analysis Classify the item x at hand to one of J groups based on measurements on p predictors. Rule: Assign x to group j that has the closest mean j = 1, 2, …, J Distance Measure: Mahalanobis Distance…. Takes the spread of the data into Consideration
  • 6. Linear Discriminant Analysis Distance Measure: For j = 1, 2, …, J, compute ( ) ( ) ( )xSxd jpl T jj xxx −−= −1 Assign x to the group for which dj is minimum Spl is the pooled estimate of the covariance matrix
  • 7. Linear Discriminant AnalysisLinear Discriminant Analysis …or equivalently, assign x to the group for which ( ) xSxSxL jpl T jpl T jj xx 11 2 1 −− −= is a maximum. (Notice the linear form of the equation!)
  • 8. Linear Discriminant Analysis …optimal if…. • Multivariate normal distribution for the observation in each of the groups • Equal covariance matrix for all groups • Equal prior probability for each group • Equal costs for misclassification
  • 9. Linear Discriminant Analysis Relaxing the assumption of equal prior probabilities… ( ) xSxSxpL jpl T jpl T j j xjx 11 2 1 ln −− −= + p j being the prior probability for the jth group.
  • 10. Linear Discriminant Analysis Relaxing the assumption of equal covariance matrices… ( ) ( ) ( )jx jj x xxSx SpQ j T j j − − − −− = 1 ln 2 1 ln result?…Quadratic Discriminant Analysis
  • 11. Quadratic Discriminant Analysis Rule: assign to group j if is the largest. ( )xQj Optimal if the J groups of measurements are multivariate normal
  • 12. Other Extensions & Related Methods Relaxing the assumption of normality… Kernel density based LDA and QDA Other extensions….. Regularized discriminant analysis Penalized discriminant analysis Flexible discriminant analysis
  • 13. Other Extensions & Related Methods Related Methods: Logistic regression for binary classification Multinomial logistic regression These methods models the probability of being in a class as a linear function of the predictor.
  • 14. Evaluations of the Methods Classification Table (confusion matrix) Actual group Number of observations Predicted group A B A B nA nB n11 n21 n12 n22
  • 15. Evaluations of the Methods Apparent Error Rate (APER): APER = # misclassified/Total # of cases ….underestimates the actual error rate. Improved estimate of APER: Holdout Method or cross validation
  • 16. An Example: Fisher’s Iris Data Actual Group Number of Observations Predicted Group Setosa Versicolo r Virginica Setosa Versicolor Virginica 50 50 50 50 0 0 0 48 1 0 2 49 Table 1: Linear Discriminant Analysis (APER = 0.0200)
  • 17. An Example: Fisher’s Iris DataAn Example: Fisher’s Iris Data Actual Group Number of Observations Predicted Group Setosa Versicolo r Virginica Setosa Versicolor Virginica 50 50 50 50 0 0 0 47 1 0 3 49 Table 1: Quadratic Discriminant Analysis (APER = 0.0267)
  • 18. An Example: Fisher’s Iris Data Sepal Width PetalWidth 2.0 2.5 3.0 3.5 4.0 0.51.01.52.02.5 ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v vv v v v v v v v v v v v v vv v v vv v v v v v v v v v v vv v v v v v v v v v v v c cc c c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c c cc c c c c c c cc c c ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v v v vvv v v vv v v v vv v vvvv v vvv v vv v v v v vv v vvv v vv v v v v v c cc c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c cc c c c c c c cc c c c ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v v v vvv v v vv v v v vv v vvvv v vvv v vv v v v v vv v vvv v vv v v v v v c cc c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c cc c c c c c c cc c c c ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v vv v v v v v v v v v v v v vv v v vv v v v v v v v v v v vv v v v v v v v v v v v c cc c c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c c cc c c c c c c cc c c ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v v v vvv v v vv v v v vv v vvvv v vvv v vv v v v v vv v vvv v vv v v v v v c cc c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c cc c c c c c c cc c c c ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v vv v v v v v v v v v v v v vv v v vv v v v v v v v v v v vv v v v v v v v v v v v c cc c c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c c cc c c c c c c cc c c ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v v v vvv v v vv v v v vv v vvvv v vvv v vv v v v v vv v vvv v vv v v v v v c cc c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c cc c c c c c c cc c c c ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v vv v v v v v v v v v v v v vv v v vv v v v v v v v v v v vv v v v v v v v v v v v c cc c c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c c cc c c c c c c cc c c
  • 19. An Example: Fisher’s Iris Data Sepal Width PetalWidth 2.0 2.5 3.0 3.5 4.0 0.51.01.52.02.5 ss ss s s s ss s ss ss s ss s ss s s s s ss s ssss s s ss s s s s s ss s s s s ss ss v v v v v v v vv v v v v v v v v v v v v vv v v vv v v v v v v v v v v vv v v v v v v v v v v v c cc c c c c c c c c c c c c c c c c c c c c c c cc c c c c c c c c c c c cc c c c c c c cc c c Sepal Width PetalWidth 2.0 2.5 3.0 3.5 4.0 0.51.01.52.02.5 ++ ++ + + + ++ + ++ ++ + ++ + ++ + + + + ++ + ++++ + + ++ + + + + + ++ + + + + ++ ++ o o o o o o o o o ooo o o oo o o o oo o oooo o ooo o oo o o o o oo o ooo o oo o o o o o x xx x x x x x x x x x x x x x x x x x x x x x xx x x x x x x x x x x xx x x x x x x xx x x x
  • 20. Summary LDA is a powerful tool available for classification. Widely implemented through various software Theoretical properties well researched SAS implementation available for large data sets.