Principal component analysis

•Download as PPTX, PDF•

12 likes•7,128 views

Partha Sarathi Kar

Engineering

CONTENTS
• WHAT IS PCA
• HOW IT WORKS
• HISTORY OF PCA
• PCA IMPLEMENTATION
• USES of PCA
• LIMITATION OF PCA
2/28

WHAT IS PCA
PCA takes a dataset with a
lots of dimension (i.e. Lots
of Cells) and flattens it to
2 or 3 dimensions so we
can look on it.
3/28
Principal component analysis (PCA) is a technique used to
emphasize variation and bring out strong patterns in a dataset.
It's often used to make data easy to explore and visualize.

HOW IT WORKS
4/28
Eating in the UK (a 17D example) Here's the plot of the data along the first
principal component. Already we can see
something is different about Northern
Ireland.
Northern Irish eat way more grams of fresh
potatoes and way fewer of fresh fruits, cheese,
fish and alcoholic drinks

HISTORY OF PCA
• PCA was invented in 1901
by Karl Pearson
• as an analogue of the principal
axis theorem in mechanics
5/28
src: https://commons.wikimedia.org/wiki/File:GaussianScatterPCA.svg

HISTORY OF PCA
Depending on the field of application, it is also
named:
6/28
• discrete Kosambi-Karhunen–Loève transform (KLT) in signal
processing,
• the Hotelling transform in multivariate quality control,
• proper orthogonal decomposition (POD) in mechanical engineering,
• singular value decomposition (SVD) of X (Golub and Van Loan, 1983),
• eigenvalue decomposition (EVD) of XTX in linear algebra,
• Eckart–Young theorem (Harman, 1960), or Schmidt–Mirsky theorem in
psychometrics,
• empirical orthogonal functions (EOF) in meteorological science,
• empirical eigenfunction decomposition (Sirovich, 1987) etc

PCA IMPLEMENTATION
PCA could have different implementations.
But most popular ones are
• eigenvalue decomposition (EVD) and
• singular value decomposition (SVD).
7/28

PCA IMPLEMENTATION
Eigenvalue decomposition
(EVD)
8/28
NxM > NxK (K<=M)
X = original data matrix
W and D new Matrix from
W contains all principal
component vectors, while D
contains all ranks of those
vectors (ordered from the largest
variance to the least one
XXTW=WD

PCA IMPLEMENTATION
Singular Value decomposition
(SVD)
9/28
NxM > NxK (K<=M)
X = original data matrix
X=UΣVT
XV=UΣ

PCA IMPLEMENTATION
14/28
EXAMPLE
Dot are spread out along a diagonal line and
maximum variation of data is between the two
end points of line

PCA IMPLEMENTATION
15/28
EXAMPLE
Dot are spread out along a
diagonal line and maximum
variation of data is between the
two end points of line
Dots are also spread out a little above
and below the first line and 2nd largest
amount of variation is at the endpoints
of the new line

PCA IMPLEMENTATION
17/28
EXAMPLE
These two new axes that describe the variation in the data
are “Principal Components”

USES OF PCA
26/28
PCA is mostly used as a tool for Compression and Simplifying
data for easier learning in exploratory data analysis and for
making predictive models.
1- Better Perspective and less Complexity
2 - Better visualization
3- Reduce size
4- Different perspective:

LIMITATION OF PCA
27/28
If the data does not
follow a
multidimensional
normal (Gaussian)
distribution, PCA
may not give the
best principal
components

REFERENCES
28/28
Figure: pixel representation
Information and Image Credit :
• http://www.mit.edu/~gari/teaching/6.555/LECTURE_NOTES/ch28_bss.pdf
• https://www.quora.com/What-are-some-of-the-limitations-of-principal-component-analysis
• http://mengnote.blogspot.com.ee/2013/05/an-intuitive-explanation-of-pca.html
• https://www.youtube.com/watch?v=_UVHneBUBW0&t=2s
• http://setosa.io/ev/principal-component-analysis/

What's hot

PCAmathurnidhi

Pca(principal components analysis)kalung0313

Principal component analysis and ldaSuresh Pokharel

Principal component analysisFarah M. Altufaili

Principal Component Analysis (PCA) and LDA PPT SlidesAbhishekKumar4995

Principal Component Analysis and ClusteringUsha Vijay

Implement principal component analysis (PCA) in python from scratchEshanAgarwal4

Clusters techniquesrajshreemuthiah

PCA (Principal component analysis)Learnbay Datascience

PCA (Principal Component Analysis)Luis Serrano

PcaNalini. Yadav

Bias and variance trade offVARUN KUMAR

Exploratory data analysis data visualizationDr. Hamdan Al-Sabri

The Method Of Maximum LikelihoodMax Chipulu

ML - Multiple Linear RegressionAndrew Ferlitsch

Self-organizing mapTarat Diloksawatdikul

Decision TreesStudent

Dimensionality Reduction | Machine Learning | CloudxLabCloudxLab

Decision treeR A Akerkar

Unsupervised learningamalalhait

What's hot (20)

PCA

Pca(principal components analysis)

Principal component analysis and lda

Principal component analysis

Principal Component Analysis (PCA) and LDA PPT Slides

Principal Component Analysis and Clustering

Implement principal component analysis (PCA) in python from scratch

Clusters techniques

PCA (Principal component analysis)

PCA (Principal Component Analysis)

Pca

Bias and variance trade off

Exploratory data analysis data visualization

The Method Of Maximum Likelihood

ML - Multiple Linear Regression

Self-organizing map

Decision Trees

Dimensionality Reduction | Machine Learning | CloudxLab

Decision tree

Unsupervised learning

Similar to Principal component analysis

Getting started with chemometric classificationAlex Henderson

prescalers and dual modulus prescalersSakshi Bhargava

Daamen r 2010scwr-cpaperJohn B. Cook, PE, CEO

The effect of microscale spatial variability of wind on estimation of technic...IEA-ETSAP

Search for Diboson Resonances in CMSJose Cupertino Ruiz Vargas

SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...VLSICS Design

Matrix Factorization In Recommender SystemsYONG ZHENG

SpectrumEstimation.pptMaryanne678733

Photovoltaic thermal (PV/T) collectors with nanofluids and nano-Phase Change ...Ali Al-Waeli

ECP Application Developmentinside-BigData.com

CP2K: How to use the constrained DFT moduleNico Holmberg

Battery Energy Storage Systems - TEPCO.pptxDr. Rama Rao Pokanati V V

How to validate your modelAlex Henderson

Mars STEM poster final draftAlexander Reedy

23AFMC_Beamer.pdfLorenzoCampoli1

Automated Generation of High-accuracy Interatomic Potentials Using Quantum Dataaimsnist

Improving Genetic Algorithm (GA) based NoC mapping algorithm using a formal ...Vinita Palaniveloo

Similar to Principal component analysis (20)

Getting started with chemometric classification

prescalers and dual modulus prescalers

Daamen r 2010scwr-cpaper

The effect of microscale spatial variability of wind on estimation of technic...

Search for Diboson Resonances in CMS

SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...

Matrix Factorization In Recommender Systems

SpectrumEstimation.ppt

Photovoltaic thermal (PV/T) collectors with nanofluids and nano-Phase Change ...

ECP Application Development

CP2K: How to use the constrained DFT module

Battery Energy Storage Systems - TEPCO.pptx

How to validate your model

Mars STEM poster final draft

23AFMC_Beamer.pdf

Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data

Improving Genetic Algorithm (GA) based NoC mapping algorithm using a formal ...

Recently uploaded

VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha

CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani

Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff

Oxy acetylene welding presentation note.eptoze12

Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis

SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome

VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar

ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE

IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst

Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort

young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1

Application of Residue Theorem to evaluate real integrations.pptx959SahilShah

Artificial-Intelligence-in-Electronics (K).pptxbritheesh05

9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Low Rate Call Girls In Saket, Delhi NCR

microprocessor 8085 and its interfacingjaychoudhary37

High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat

Recently uploaded (20)

VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...

High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts

Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx

CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf

Call Girls Narol 7397865700 Independent Call Girls

Oxy acetylene welding presentation note.

Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction

SPICE PARK APR2024 ( 6,793 SPICE Models )

VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger

ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...

IVE Industry Focused Event - Defence Sector 2024

Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service

young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service

Introduction to Microprocesso programming and interfacing.pptx

Application of Residue Theorem to evaluate real integrations.pptx

Artificial-Intelligence-in-Electronics (K).pptx

9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf

microprocessor 8085 and its interfacing

High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts

Principal component analysis

1. PRINCIPAL COMPONENT ANALYSIS Partha Sarathi Kar IVSM 166777 1

2. CONTENTS • WHAT IS PCA • HOW IT WORKS • HISTORY OF PCA • PCA IMPLEMENTATION • USES of PCA • LIMITATION OF PCA 2/28

3. WHAT IS PCA PCA takes a dataset with a lots of dimension (i.e. Lots of Cells) and flattens it to 2 or 3 dimensions so we can look on it. 3/28 Principal component analysis (PCA) is a technique used to emphasize variation and bring out strong patterns in a dataset. It's often used to make data easy to explore and visualize.

4. HOW IT WORKS 4/28 Eating in the UK (a 17D example) Here's the plot of the data along the first principal component. Already we can see something is different about Northern Ireland. Northern Irish eat way more grams of fresh potatoes and way fewer of fresh fruits, cheese, fish and alcoholic drinks

5. HISTORY OF PCA • PCA was invented in 1901 by Karl Pearson • as an analogue of the principal axis theorem in mechanics 5/28 src: https://commons.wikimedia.org/wiki/File:GaussianScatterPCA.svg

6. HISTORY OF PCA Depending on the field of application, it is also named: 6/28 • discrete Kosambi-Karhunen–Loève transform (KLT) in signal processing, • the Hotelling transform in multivariate quality control, • proper orthogonal decomposition (POD) in mechanical engineering, • singular value decomposition (SVD) of X (Golub and Van Loan, 1983), • eigenvalue decomposition (EVD) of XTX in linear algebra, • Eckart–Young theorem (Harman, 1960), or Schmidt–Mirsky theorem in psychometrics, • empirical orthogonal functions (EOF) in meteorological science, • empirical eigenfunction decomposition (Sirovich, 1987) etc

7. PCA IMPLEMENTATION PCA could have different implementations. But most popular ones are • eigenvalue decomposition (EVD) and • singular value decomposition (SVD). 7/28

8. PCA IMPLEMENTATION Eigenvalue decomposition (EVD) 8/28 NxM > NxK (K<=M) X = original data matrix W and D new Matrix from W contains all principal component vectors, while D contains all ranks of those vectors (ordered from the largest variance to the least one XXTW=WD

9. PCA IMPLEMENTATION Singular Value decomposition (SVD) 9/28 NxM > NxK (K<=M) X = original data matrix X=UΣVT XV=UΣ

10. PCA IMPLEMENTATION 10/28

11. PCA IMPLEMENTATION 11/28 EXAMPLE

12. PCA IMPLEMENTATION 12/28 EXAMPLE

13. PCA IMPLEMENTATION 13/28 EXAMPLE

14. PCA IMPLEMENTATION 14/28 EXAMPLE Dot are spread out along a diagonal line and maximum variation of data is between the two end points of line

15. PCA IMPLEMENTATION 15/28 EXAMPLE Dot are spread out along a diagonal line and maximum variation of data is between the two end points of line Dots are also spread out a little above and below the first line and 2nd largest amount of variation is at the endpoints of the new line

16. PCA IMPLEMENTATION 16/28 EXAMPLE

17. PCA IMPLEMENTATION 17/28 EXAMPLE These two new axes that describe the variation in the data are “Principal Components”

18. PCA IMPLEMENTATION 18/28 EXAMPLE

19. PCA IMPLEMENTATION 19/28 EXAMPLE

20. PCA IMPLEMENTATION 20/28 EXAMPLE

21. PCA IMPLEMENTATION 21/28 EXAMPLE

22. PCA IMPLEMENTATION 22/28 EXAMPLE

23. PCA IMPLEMENTATION 23/28 EXAMPLE

24. PCA IMPLEMENTATION 24/28 EXAMPLE

25. PCA IMPLEMENTATION 25/28 EXAMPLE

26. USES OF PCA 26/28 PCA is mostly used as a tool for Compression and Simplifying data for easier learning in exploratory data analysis and for making predictive models. 1- Better Perspective and less Complexity 2 - Better visualization 3- Reduce size 4- Different perspective:

27. LIMITATION OF PCA 27/28 If the data does not follow a multidimensional normal (Gaussian) distribution, PCA may not give the best principal components

28. REFERENCES 28/28 Figure: pixel representation Information and Image Credit : • http://www.mit.edu/~gari/teaching/6.555/LECTURE_NOTES/ch28_bss.pdf • https://www.quora.com/What-are-some-of-the-limitations-of-principal-component-analysis • http://mengnote.blogspot.com.ee/2013/05/an-intuitive-explanation-of-pca.html • https://www.youtube.com/watch?v=_UVHneBUBW0&t=2s • http://setosa.io/ev/principal-component-analysis/

29. 29 THANKS

Principal component analysis

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Principal component analysis

Similar to Principal component analysis (20)

Recently uploaded

Recently uploaded (20)

Principal component analysis