SlideShare a Scribd company logo
1 of 24
Dimension Reduction & PCA
Prof. A.L. Yuille
Stat 231. Fall 2004.
Curse of Dimensionality.
• A major problem is the curse of dimensionality.
• If the data x lies in high dimensional space, then
an enormous amount of data is required to learn
distributions or decision rules.
• Example: 50 dimensions. Each dimension has
20 levels. This gives a total of cells. But
the no. of data samples will be far less. There
will not be enough data samples to learn.
Curse of Dimensionality
• One way to deal with dimensionality is to
assume that we know the form of the
probability distribution.
• For example, a Gaussian model in N
dimensions has N + N(N-1)/2 parameters
to estimate.
• Requires data to learn reliably.
This may be practical.
Dimension Reduction
• One way to avoid the curse of
dimensionality is by projecting the data
onto a lower-dimensional space.
• Techniques for dimension reduction:
• Principal Component Analysis (PCA)
• Fisher’s Linear Discriminant
• Multi-dimensional Scaling.
• Independent Component Analysis.
Principal Component Analysis
• PCA is the most commonly used
dimension reduction technique.
• (Also called the Karhunen-Loeve
transform).
• PCA – data samples
• Compute the mean
• Computer the covariance:
Principal Component Analysis
• Compute the eigenvalues
and eigenvectors of the matrix
• Solve
• Order them by magnitude:
• PCA reduces the dimension by keeping
direction such that
Principal Component Analysis
• For many datasets, most of the eigenvalues
lambda are negligible and can be discarded.
The eigenvalue measures the variation
In the direction e
Example:
Principal Component Analysis
• Project the data onto the selected
eigenvectors:
• Where
• is the proportion of data covered by the
first M eigenvalues.
PCA Example
• The images of an object under different lighting
lie in a low-dimensional space.
• The original images are 256x 256. But the data
lies mostly in 3-5 dimensions.
• First we show the PCA for a face under a range
of lighting conditions. The PCA components
have simple interpretations.
• Then we plot as a function of M for
several objects under a range of lighting.
PCA on Faces.
5 plus or minus 2.
Most Objects project to
Cost Function for PCA
• Minimize the sum of squared error:
• Can verify that the solutions are
• The eigenvectors of K are
• The are the projection coefficients of the
datavectors onto the eigenvectors
PCA & Gaussian Distributions.
• PCA is similar to learning a Gaussian
distribution for the data.
• is the mean of the distribution.
• K is the estimate of the covariance.
• Dimension reduction occurs by ignoring
the directions in which the covariance is
small.
Limitations of PCA
• PCA is not effective for some datasets.
• For example, if the data is a set of strings
• (1,0,0,0,…), (0,1,0,0…),…,(0,0,0,…,1)
then the eigenvalues do not fall off as PCA
requires.
PCA and Discrimination
• PCA may not find the best directions for
discriminating between two classes.
• Example: suppose the two classes have 2D
Gaussian densities as ellipsoids.
• 1st
eigenvector is best for representing the
probabilities.
• 2nd
eigenvector is best for discrimination.
Fisher’s Linear Discriminant.
• 2-class classification. Given samples in
class 1 and samples in class 2.
• Goal: to find a vector w, project data onto this
axis so that data is well separated.
Fisher’s Linear Discriminant
• Sample means
• Scatter matrices:
•
• Between-class scatter matrix:
• Within-class scatter matrix:
Fisher’s Linear Discriminant
• The sample means of the projected
points:
• The scatter of the projected points is:
• These are both one-dimensional variables.
Fisher’s Linear Discriminant
• Choose the projection direction w to
maximize:
•
• Maximize the ratio of the between-class
distance to the within-class scatter.
Fisher’s Linear Discriminant
• Proposition. The vector that maximizes
• Proof.
• Maximize
• is a constant, a Lagrange multiplier.
• Now
Fisher’s Linear Discriminant
• Example: two Gaussians with the same
covariance and means
• The Bayes classifier is a straight line whose
normal is the Fisher Linear Discriminant
direction w.
•
Multiple Classes
• For c classes, compute c-1 discriminants,
project d-dimensional features into c-1 space.
Multiple Classes
• Within-class scatter:
•
• Between-class scatter:
• is scatter matrix from all classes.
Multiple Discriminant Analysis
• Seek vectors and project
samples to c-1 dimensional space:
• Criterion is:
• where |.| is the determinant.
• Solution is the eigenvectors whose eigenvalues
are the c-1 largest in

More Related Content

What's hot

Principal Component Analysis and Clustering
Principal Component Analysis and ClusteringPrincipal Component Analysis and Clustering
Principal Component Analysis and ClusteringUsha Vijay
 
An intro to applied multi stat with r by everitt et al
An intro to applied multi stat with r by everitt et alAn intro to applied multi stat with r by everitt et al
An intro to applied multi stat with r by everitt et alRazzaqe
 
Pca(principal components analysis)
Pca(principal components analysis)Pca(principal components analysis)
Pca(principal components analysis)kalung0313
 
Principal component analysis, Code and Time Complexity
Principal component analysis, Code and Time ComplexityPrincipal component analysis, Code and Time Complexity
Principal component analysis, Code and Time ComplexityYounesCharfaoui
 
Statistical Methods
Statistical MethodsStatistical Methods
Statistical Methodsguest2137aa
 
Power System State Estimation Using Weighted Least Squares (WLS) and Regulari...
Power System State Estimation Using Weighted Least Squares (WLS) and Regulari...Power System State Estimation Using Weighted Least Squares (WLS) and Regulari...
Power System State Estimation Using Weighted Least Squares (WLS) and Regulari...IJERA Editor
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyPrithwis Mukerjee
 
Matematika terapan week 6
Matematika terapan week 6 Matematika terapan week 6
Matematika terapan week 6 nellylawar
 
Measures of central tendency dispersion
Measures of central tendency dispersionMeasures of central tendency dispersion
Measures of central tendency dispersionAbhinav yadav
 
CABT Math 8 measures of central tendency and dispersion
CABT Math 8   measures of central tendency and dispersionCABT Math 8   measures of central tendency and dispersion
CABT Math 8 measures of central tendency and dispersionGilbert Joseph Abueg
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendencyRichard Paulino
 
Eigen value and eigen vectors shwetak
Eigen value and eigen vectors shwetakEigen value and eigen vectors shwetak
Eigen value and eigen vectors shwetakMrsShwetaBanait1
 
Cmc chapter 02
Cmc chapter 02Cmc chapter 02
Cmc chapter 02Jane Hamze
 
Quantum generalized linear models
Quantum generalized linear modelsQuantum generalized linear models
Quantum generalized linear modelsColleen Farrelly
 
Chapter 3 mathematical modeling
Chapter 3 mathematical modelingChapter 3 mathematical modeling
Chapter 3 mathematical modelingBin Biny Bino
 
Statisticsforbiologists colstons
Statisticsforbiologists colstonsStatisticsforbiologists colstons
Statisticsforbiologists colstonsandymartin
 
Measure of central tendency
Measure of central tendency Measure of central tendency
Measure of central tendency Kannan Iyanar
 
Alg II Unit 3-6-solvingsystemsmatrices
Alg II Unit 3-6-solvingsystemsmatricesAlg II Unit 3-6-solvingsystemsmatrices
Alg II Unit 3-6-solvingsystemsmatricesjtentinger
 

What's hot (20)

Principal Component Analysis and Clustering
Principal Component Analysis and ClusteringPrincipal Component Analysis and Clustering
Principal Component Analysis and Clustering
 
An intro to applied multi stat with r by everitt et al
An intro to applied multi stat with r by everitt et alAn intro to applied multi stat with r by everitt et al
An intro to applied multi stat with r by everitt et al
 
Pca(principal components analysis)
Pca(principal components analysis)Pca(principal components analysis)
Pca(principal components analysis)
 
Principal component analysis, Code and Time Complexity
Principal component analysis, Code and Time ComplexityPrincipal component analysis, Code and Time Complexity
Principal component analysis, Code and Time Complexity
 
Statistical Methods
Statistical MethodsStatistical Methods
Statistical Methods
 
Scaling and Normalization
Scaling and NormalizationScaling and Normalization
Scaling and Normalization
 
Power System State Estimation Using Weighted Least Squares (WLS) and Regulari...
Power System State Estimation Using Weighted Least Squares (WLS) and Regulari...Power System State Estimation Using Weighted Least Squares (WLS) and Regulari...
Power System State Estimation Using Weighted Least Squares (WLS) and Regulari...
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
Matematika terapan week 6
Matematika terapan week 6 Matematika terapan week 6
Matematika terapan week 6
 
Measures of central tendency dispersion
Measures of central tendency dispersionMeasures of central tendency dispersion
Measures of central tendency dispersion
 
CABT Math 8 measures of central tendency and dispersion
CABT Math 8   measures of central tendency and dispersionCABT Math 8   measures of central tendency and dispersion
CABT Math 8 measures of central tendency and dispersion
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Eigen value and eigen vectors shwetak
Eigen value and eigen vectors shwetakEigen value and eigen vectors shwetak
Eigen value and eigen vectors shwetak
 
Cmc chapter 02
Cmc chapter 02Cmc chapter 02
Cmc chapter 02
 
Quantum generalized linear models
Quantum generalized linear modelsQuantum generalized linear models
Quantum generalized linear models
 
Chapter 3 mathematical modeling
Chapter 3 mathematical modelingChapter 3 mathematical modeling
Chapter 3 mathematical modeling
 
Statisticsforbiologists colstons
Statisticsforbiologists colstonsStatisticsforbiologists colstons
Statisticsforbiologists colstons
 
Feature scaling
Feature scalingFeature scaling
Feature scaling
 
Measure of central tendency
Measure of central tendency Measure of central tendency
Measure of central tendency
 
Alg II Unit 3-6-solvingsystemsmatrices
Alg II Unit 3-6-solvingsystemsmatricesAlg II Unit 3-6-solvingsystemsmatrices
Alg II Unit 3-6-solvingsystemsmatrices
 

Similar to Lecture6 pca

Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersAlbert Y. C. Chen
 
k-Nearest Neighbors with brief explanation.pptx
k-Nearest Neighbors with brief explanation.pptxk-Nearest Neighbors with brief explanation.pptx
k-Nearest Neighbors with brief explanation.pptxgamingzonedead880
 
Excel and research
Excel and researchExcel and research
Excel and researchNursing Path
 
Excel and research
Excel and researchExcel and research
Excel and researchNursing Path
 
pcappt-140121072949-phpapp01.pptx
pcappt-140121072949-phpapp01.pptxpcappt-140121072949-phpapp01.pptx
pcappt-140121072949-phpapp01.pptxABINASHPADHY6
 
ODSC India 2018: Topological space creation & Clustering at BigData scale
ODSC India 2018: Topological space creation & Clustering at BigData scaleODSC India 2018: Topological space creation & Clustering at BigData scale
ODSC India 2018: Topological space creation & Clustering at BigData scaleKuldeep Jiwani
 
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...Maninda Edirisooriya
 
Uncertainty aware multidimensional ensemble data visualization and exploration
Uncertainty aware multidimensional ensemble data visualization and explorationUncertainty aware multidimensional ensemble data visualization and exploration
Uncertainty aware multidimensional ensemble data visualization and explorationSubhashis Hazarika
 
Computational Giants_nhom.pptx
Computational Giants_nhom.pptxComputational Giants_nhom.pptx
Computational Giants_nhom.pptxThAnhonc
 
probability.pptx
probability.pptxprobability.pptx
probability.pptxbisan3
 
How to analyse bulk transcriptomic data using Deseq2
How to analyse bulk transcriptomic data using Deseq2How to analyse bulk transcriptomic data using Deseq2
How to analyse bulk transcriptomic data using Deseq2AdamCribbs1
 
Advanced database and data mining & clustering concepts
Advanced database and data mining & clustering conceptsAdvanced database and data mining & clustering concepts
Advanced database and data mining & clustering conceptsNithyananthSengottai
 
CSA 3702 machine learning module 3
CSA 3702 machine learning module 3CSA 3702 machine learning module 3
CSA 3702 machine learning module 3Nandhini S
 

Similar to Lecture6 pca (20)

Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
 
Covariance.pdf
Covariance.pdfCovariance.pdf
Covariance.pdf
 
k-Nearest Neighbors with brief explanation.pptx
k-Nearest Neighbors with brief explanation.pptxk-Nearest Neighbors with brief explanation.pptx
k-Nearest Neighbors with brief explanation.pptx
 
Excel and research
Excel and researchExcel and research
Excel and research
 
Excel and research
Excel and researchExcel and research
Excel and research
 
Machine Learning (Decisoion Trees)
Machine Learning (Decisoion Trees)Machine Learning (Decisoion Trees)
Machine Learning (Decisoion Trees)
 
pcappt-140121072949-phpapp01.pptx
pcappt-140121072949-phpapp01.pptxpcappt-140121072949-phpapp01.pptx
pcappt-140121072949-phpapp01.pptx
 
STAN_MS_PPT.pptx
STAN_MS_PPT.pptxSTAN_MS_PPT.pptx
STAN_MS_PPT.pptx
 
KNN
KNNKNN
KNN
 
Understandig PCA and LDA
Understandig PCA and LDAUnderstandig PCA and LDA
Understandig PCA and LDA
 
Pca ppt
Pca pptPca ppt
Pca ppt
 
ODSC India 2018: Topological space creation & Clustering at BigData scale
ODSC India 2018: Topological space creation & Clustering at BigData scaleODSC India 2018: Topological space creation & Clustering at BigData scale
ODSC India 2018: Topological space creation & Clustering at BigData scale
 
4646150.ppt
4646150.ppt4646150.ppt
4646150.ppt
 
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
 
Uncertainty aware multidimensional ensemble data visualization and exploration
Uncertainty aware multidimensional ensemble data visualization and explorationUncertainty aware multidimensional ensemble data visualization and exploration
Uncertainty aware multidimensional ensemble data visualization and exploration
 
Computational Giants_nhom.pptx
Computational Giants_nhom.pptxComputational Giants_nhom.pptx
Computational Giants_nhom.pptx
 
probability.pptx
probability.pptxprobability.pptx
probability.pptx
 
How to analyse bulk transcriptomic data using Deseq2
How to analyse bulk transcriptomic data using Deseq2How to analyse bulk transcriptomic data using Deseq2
How to analyse bulk transcriptomic data using Deseq2
 
Advanced database and data mining & clustering concepts
Advanced database and data mining & clustering conceptsAdvanced database and data mining & clustering concepts
Advanced database and data mining & clustering concepts
 
CSA 3702 machine learning module 3
CSA 3702 machine learning module 3CSA 3702 machine learning module 3
CSA 3702 machine learning module 3
 

Recently uploaded

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 

Recently uploaded (20)

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 

Lecture6 pca

  • 1. Dimension Reduction & PCA Prof. A.L. Yuille Stat 231. Fall 2004.
  • 2. Curse of Dimensionality. • A major problem is the curse of dimensionality. • If the data x lies in high dimensional space, then an enormous amount of data is required to learn distributions or decision rules. • Example: 50 dimensions. Each dimension has 20 levels. This gives a total of cells. But the no. of data samples will be far less. There will not be enough data samples to learn.
  • 3. Curse of Dimensionality • One way to deal with dimensionality is to assume that we know the form of the probability distribution. • For example, a Gaussian model in N dimensions has N + N(N-1)/2 parameters to estimate. • Requires data to learn reliably. This may be practical.
  • 4. Dimension Reduction • One way to avoid the curse of dimensionality is by projecting the data onto a lower-dimensional space. • Techniques for dimension reduction: • Principal Component Analysis (PCA) • Fisher’s Linear Discriminant • Multi-dimensional Scaling. • Independent Component Analysis.
  • 5. Principal Component Analysis • PCA is the most commonly used dimension reduction technique. • (Also called the Karhunen-Loeve transform). • PCA – data samples • Compute the mean • Computer the covariance:
  • 6. Principal Component Analysis • Compute the eigenvalues and eigenvectors of the matrix • Solve • Order them by magnitude: • PCA reduces the dimension by keeping direction such that
  • 7. Principal Component Analysis • For many datasets, most of the eigenvalues lambda are negligible and can be discarded. The eigenvalue measures the variation In the direction e Example:
  • 8. Principal Component Analysis • Project the data onto the selected eigenvectors: • Where • is the proportion of data covered by the first M eigenvalues.
  • 9. PCA Example • The images of an object under different lighting lie in a low-dimensional space. • The original images are 256x 256. But the data lies mostly in 3-5 dimensions. • First we show the PCA for a face under a range of lighting conditions. The PCA components have simple interpretations. • Then we plot as a function of M for several objects under a range of lighting.
  • 11. 5 plus or minus 2. Most Objects project to
  • 12. Cost Function for PCA • Minimize the sum of squared error: • Can verify that the solutions are • The eigenvectors of K are • The are the projection coefficients of the datavectors onto the eigenvectors
  • 13. PCA & Gaussian Distributions. • PCA is similar to learning a Gaussian distribution for the data. • is the mean of the distribution. • K is the estimate of the covariance. • Dimension reduction occurs by ignoring the directions in which the covariance is small.
  • 14. Limitations of PCA • PCA is not effective for some datasets. • For example, if the data is a set of strings • (1,0,0,0,…), (0,1,0,0…),…,(0,0,0,…,1) then the eigenvalues do not fall off as PCA requires.
  • 15. PCA and Discrimination • PCA may not find the best directions for discriminating between two classes. • Example: suppose the two classes have 2D Gaussian densities as ellipsoids. • 1st eigenvector is best for representing the probabilities. • 2nd eigenvector is best for discrimination.
  • 16. Fisher’s Linear Discriminant. • 2-class classification. Given samples in class 1 and samples in class 2. • Goal: to find a vector w, project data onto this axis so that data is well separated.
  • 17. Fisher’s Linear Discriminant • Sample means • Scatter matrices: • • Between-class scatter matrix: • Within-class scatter matrix:
  • 18. Fisher’s Linear Discriminant • The sample means of the projected points: • The scatter of the projected points is: • These are both one-dimensional variables.
  • 19. Fisher’s Linear Discriminant • Choose the projection direction w to maximize: • • Maximize the ratio of the between-class distance to the within-class scatter.
  • 20. Fisher’s Linear Discriminant • Proposition. The vector that maximizes • Proof. • Maximize • is a constant, a Lagrange multiplier. • Now
  • 21. Fisher’s Linear Discriminant • Example: two Gaussians with the same covariance and means • The Bayes classifier is a straight line whose normal is the Fisher Linear Discriminant direction w. •
  • 22. Multiple Classes • For c classes, compute c-1 discriminants, project d-dimensional features into c-1 space.
  • 23. Multiple Classes • Within-class scatter: • • Between-class scatter: • is scatter matrix from all classes.
  • 24. Multiple Discriminant Analysis • Seek vectors and project samples to c-1 dimensional space: • Criterion is: • where |.| is the determinant. • Solution is the eigenvectors whose eigenvalues are the c-1 largest in