SlideShare a Scribd company logo
Seoul National University2/25/2017 1
PRINCIPAL COMPONENT ANALYSIS (PCA)
주성분분석
박정호 박사과정*
서울대학교 기계항공공학부
시스템 건전성 및 리스크 관리 연구실
*hihijung@snu.ac.kr
Seoul National University
Principles of PCA
2/25/2017 2
1) Maximum variance 2) Minimum error
• To maximize the variance of the 
projected data on the certain dimension.
Var1
PC1
PC2
PC1
PC2Var2
PC1
PC2
SSE1 SSE2
• To minimize the mean squared distance 
between the data and their projections.
SSE : Sum or squared errors
Seoul National University
Maximum variance – (1)
2/25/2017 3
x
1
xSample set mean : 
1
u x u xVariance of the projected data :  maximize
where  is the data covariance matrix defined by 
x x x x
전개하면
똑같음
 Projected data의 variance 를 maximize 하는 것은 결국 를 maximize 하는 것과 동일하게 됨.
* 은 data 가 projection 되는 vector 를 말한다. (unit vector 임. 즉,  1) 
Seoul National University
Maximum variance – (2)
2/25/2017 4
Formulation of maximization using Lagrange multiplier
x, 1
원래식 Constraint
(derivative w.r.t.  )
• 은 의 eigenvector,  은 eigenvalue 이다.
• 1 라는 사실을 이용하면,  =  이 된다. 즉, variance 의
maximization 문제가 eigenvalue 의 maximum을 구하는 문제와 같아진다. 
Summary
1
u x u x 													→										 										→										
Variance
Different expression of 
the variance using 
covariance matrix
Eigenvalue of the 
covariance matrix
Seoul National University
Minimum error – (1)
2/25/2017 5
x u
Representation of each data point by a 
linear combination of the basis vectors :
Where  δ , 
i.e. D‐dimensional basis vectors {u } 
x x u u
x u u u → x u
Approximation of each data point by a 
restricted number M < D : 
x u u
Seoul National University
Minimum error – (2)
2/25/2017 6
Distortion measure : 1
x x Need to be minimized
x u u
x u x u
Derivative w.r.t. 
& orthonormality Derivative w.r.t. 
& orthonormality
x x x x u u
1
x u x u  
• 최소의 distortion measure, J를
구하기 위해서는 1~M 까지의
eigenvalue 들의 최대값을 가져야함
↔ Variance의 maximization 문제와
같은 결론

More Related Content

What's hot

Principal component analysis
Principal component analysisPrincipal component analysis
Principal component analysis
Partha Sarathi Kar
 
Introduction to Principle Component Analysis
Introduction to Principle Component AnalysisIntroduction to Principle Component Analysis
Introduction to Principle Component Analysis
Sunjeet Jena
 
Implement principal component analysis (PCA) in python from scratch
Implement principal component analysis (PCA) in python from scratchImplement principal component analysis (PCA) in python from scratch
Implement principal component analysis (PCA) in python from scratch
EshanAgarwal4
 
"Principal Component Analysis - the original paper" presentation @ Papers We ...
"Principal Component Analysis - the original paper" presentation @ Papers We ..."Principal Component Analysis - the original paper" presentation @ Papers We ...
"Principal Component Analysis - the original paper" presentation @ Papers We ...
Adrian Florea
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
saba khan
 
Goodness of Fit Notation
Goodness of Fit NotationGoodness of Fit Notation
Goodness of Fit Notation
Long Beach City College
 
Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlation
domsr
 
Linear models for data science
Linear models for data scienceLinear models for data science
Linear models for data science
Brad Klingenberg
 
Lect5 principal component analysis
Lect5 principal component analysisLect5 principal component analysis
Lect5 principal component analysis
hktripathy
 
Missing data handling
Missing data handlingMissing data handling
Missing data handling
QuantUniversity
 
Complements and Conditional Probability, and Bayes' Theorem
 Complements and Conditional Probability, and Bayes' Theorem Complements and Conditional Probability, and Bayes' Theorem
Complements and Conditional Probability, and Bayes' Theorem
Long Beach City College
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpoint
kellula
 
Counting
CountingCounting
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-I
hktripathy
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression Trees
Hemant Chetwani
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
VARUN KUMAR
 
Pca
PcaPca
ForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model StatisticsForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model Statistics
DeepThought, Inc.
 
Pca analysis
Pca analysisPca analysis
Logistic regression
Logistic regressionLogistic regression
Logistic regression
DrZahid Khan
 

What's hot (20)

Principal component analysis
Principal component analysisPrincipal component analysis
Principal component analysis
 
Introduction to Principle Component Analysis
Introduction to Principle Component AnalysisIntroduction to Principle Component Analysis
Introduction to Principle Component Analysis
 
Implement principal component analysis (PCA) in python from scratch
Implement principal component analysis (PCA) in python from scratchImplement principal component analysis (PCA) in python from scratch
Implement principal component analysis (PCA) in python from scratch
 
"Principal Component Analysis - the original paper" presentation @ Papers We ...
"Principal Component Analysis - the original paper" presentation @ Papers We ..."Principal Component Analysis - the original paper" presentation @ Papers We ...
"Principal Component Analysis - the original paper" presentation @ Papers We ...
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Goodness of Fit Notation
Goodness of Fit NotationGoodness of Fit Notation
Goodness of Fit Notation
 
Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlation
 
Linear models for data science
Linear models for data scienceLinear models for data science
Linear models for data science
 
Lect5 principal component analysis
Lect5 principal component analysisLect5 principal component analysis
Lect5 principal component analysis
 
Missing data handling
Missing data handlingMissing data handling
Missing data handling
 
Complements and Conditional Probability, and Bayes' Theorem
 Complements and Conditional Probability, and Bayes' Theorem Complements and Conditional Probability, and Bayes' Theorem
Complements and Conditional Probability, and Bayes' Theorem
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpoint
 
Counting
CountingCounting
Counting
 
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-I
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression Trees
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Pca
PcaPca
Pca
 
ForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model StatisticsForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model Statistics
 
Pca analysis
Pca analysisPca analysis
Pca analysis
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 

PCA (Principal Component Analysis)