SlideShare a Scribd company logo
1 of 51
PRINCIPAL COMPONENT ANALYSIS
Mohammed Sameer
2021-19-002
Department of Agricultural Statistics
Kerala Agricultural University
Data reduction technique developed by
Hotelling H
• Main Aim
• Lower the dimensions
• Orthogonality of new (transformed) dimensions
(principal components)
If correlated
Why only ellipse?
x1
x2
Scatter plot of the data with
original axis X1 and X2
(original data)
Shift the original axis to the
center of the data(mean)
x1
x2
Rotate the original axis
• Rotate X1(axis 1) by some
angle such that variability
of the data along that axis
is maximum
• Rotate X2(axis 2) such that
it is perpendicular to the
first axis and variability of
the data along that axis is
second maximum
Z1
Z2
• Scatter plot of transformed data
• Transformed axis Z1 and Z2
Variability of x1
Variability
of
x2
Original
Variance of x1 and x2 are large
X1 and x2 are correlated
Z1
Z2
Variability of z1
Variability
of
z2
Transformed axis
Variance of z2 is much smaller than
variance of z1
Z1 and z2 are uncorrelated
Red dots (projection of the original data points onto the rotating line)
The spread of the red dots will be maximum when it aligns with the pink mark(line)
Projection of points on to a line, the line is such that
The projected points has the greatest variability.
Projection of points on to a plane, the plane is such that
the spread of the points onto that plane is the greatest.
Image compression
Principal Components
* First principal component is the direction of greatest
variability (covariance) in the data
* Second is the next orthogonal (uncorrelated) direction
of greatest variability
— So first remove all the variability along the first
component, and then find the next direction of
greatest variability and so on…
Principal Components Analysis
(PCA)
Principle
— Linear projection method to reduce the number of parameters
— Transfer a set of correlated variables into a new set of uncorrelated
variabies
— Map the data into a space of lower dimensionality
— Form of unsupervised learning
Properties
— It can be viewed as a rotation of the existing axes to new positions in the
space defined by original variables
— New axes are orthogonal and represent the directions with maximum
variability
Computing the components
• First center the data points
• Project the data points(vectors) onto an axis such that the variability
of the projected data points onto that axis is greatest.
• It turns out that the variability of x along the transformed axis is the
eigen values of cov(x) and the direction of the new axis is along the
eigen vectors of cov(x)
Dimensionality reduction
Choose only first p eigenvectors, based
On their eigenvalues
Final data set has only p dimensions
Bartlets test of sphericity
• H0: R=I
• H1: R not equal to I
In other words
H0: scatter plot is sort of sphere centered at origin
H1: scatter plot is not a sphere
• If scatterplot is a sphere, then no use of PCA
• If scatter plot is not a sphere( is ellipse/ellipsoid) then go
for PCA
• The results of the principal component analysis in milk production of
the state of Tamil Nadu revealed that milk production was having
positive relationship with the indigenous cattle population, she-
buffalo population, number of veterinary institutions, gross cropped
area, area under paddy. area under groundnut, native purebred cattle
population, graded and indigenous buffalo population, agricultural
labour population, crossbred cattle population, no. of financial
institutions and graded buffalo population.
• This suggests that effecting a shift in herd structure in favour of cross-
bred cows and graded buffalos can augment the milk production
potential.
Results
CLUSTER ANALYSIS
Introduction
• Cluster is a number of things of the same kind growing or joined
together
• A group of homogeneous things
The principle:
• Objects in the same group are similar to each other
• Objects in the different group are as dissimilar as possible
Cluster 1
Cluster 2
Cluster 3
Cluster Analysis Model
partition
Obtain similarity or
dissimilarity
Objects to be
clustered
output
Cluster 1
Cluster 2
Cluster 3
Cluster k
Distance measures
Euclidean distance
Manhattan distance Manhattan(A,B)=
Clustering algorithms
• Hierarchical clustering
• Centroid-based clustering
• Graph-based clustering
• Density-based clustering
Single(nearest neighbour): distance between two clusters =
distance between two members of the two clusters
Farthest(complete) neighbour
Nearest distance
Farthest distance
Centroid : distance between multivariate means
of each clusters
OTHER JOINING ALGORITHMS
• AVERAGE
• MEDIAN
• WARD
objects 1 2 3 4 5
1 0
2 9 0
3 3 7 0
4 6 5 9 0
5 11 10 2 8 0
2
3,5 is made as one cluster
objects 5,3 1 2 4
5,3 0
1 3 0
2 7 9 0
4 8 6 5 0
3
1,3,5 is made as one cluster
Object 5 object 3 object 1 object 2 object 4
Dendogram
• Graph based clustering
• HCS (Highly Connected Subgraphs) clustering algorithm
• Points which are highly connected are clustered
Similarity graph
Density based clustering
The algorithms work via sliding windows moving toward the high density of points
How many clusters to retain?
At what stage I have to stop the algorithm.
Scree plot
• The cluster analysis was carried out based on area, production, and
productivity of different agricultural and horticultural crops which
were predominantly grown in the districts of Rajasthan
• calculated for two different periods 1980-1995 and 1996-2014
independently.
• Crop cluster based on area during 1980- 1995
• Crop cluster based on area during 1996- 2014
• Crop cluster based on production during 1980-1995
• Crop cluster based on production during 1996-2014
• Crop cluster based on productivity during 1990-1995
• Crop cluster based on productivity during 1996-2014
Conclusions
• From the present study we concluded that when the performance of
crop clusters based on area between two periods was compared, it
was evident that gram and cotton has shifted over the years in the
second period of study.
• When comparison of the performances of crop clusters based on
production between two periods was observed that gram, mustard &
rapeseed and cotton production shifted over the period.
• It means these crops were made cluster in the first period but not in
the second period. While wheat and bajra were the crops which
made clusters or had similarity in production across all the districts
of Rajasthan from first period to second period.
• The present study also concluded that horticultural crops had
similarity in productivity across all the districts of Rajasthan during
the both period.
• It means coriander, garlic and pea productivity included over the
years in the second period of the study. Only wheat and bajra were
the crops which had similarity in productivity across all the districts of
Rajasthan from first period to second period.
THANKYOU

More Related Content

What's hot

What's hot (20)

Concepts of Correlation and Path Analysis
Concepts of Correlation and Path AnalysisConcepts of Correlation and Path Analysis
Concepts of Correlation and Path Analysis
 
Exploring Data
Exploring DataExploring Data
Exploring Data
 
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-I
 
D-Square statistic
D-Square statisticD-Square statistic
D-Square statistic
 
PCA
PCAPCA
PCA
 
Principal component analysis
Principal component analysisPrincipal component analysis
Principal component analysis
 
Genome wide association studies seminar
Genome wide association studies seminarGenome wide association studies seminar
Genome wide association studies seminar
 
Lecture 6. univariate and bivariate analysis
Lecture 6. univariate and bivariate analysisLecture 6. univariate and bivariate analysis
Lecture 6. univariate and bivariate analysis
 
Spatial data for GIS
Spatial data for GISSpatial data for GIS
Spatial data for GIS
 
Design of Experiment
Design of ExperimentDesign of Experiment
Design of Experiment
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Introduction to Multivariate analysis
Introduction to Multivariate analysisIntroduction to Multivariate analysis
Introduction to Multivariate analysis
 
Split-plot Designs
Split-plot DesignsSplit-plot Designs
Split-plot Designs
 
Spatial databases
Spatial databasesSpatial databases
Spatial databases
 
Multivariate analysis - Multiple regression analysis
Multivariate analysis -  Multiple regression analysisMultivariate analysis -  Multiple regression analysis
Multivariate analysis - Multiple regression analysis
 
Introduction to R Graphics with ggplot2
Introduction to R Graphics with ggplot2Introduction to R Graphics with ggplot2
Introduction to R Graphics with ggplot2
 
Univariate Analysis
Univariate AnalysisUnivariate Analysis
Univariate Analysis
 
PCA (Principal component analysis)
PCA (Principal component analysis)PCA (Principal component analysis)
PCA (Principal component analysis)
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
ppt spatial data
ppt spatial datappt spatial data
ppt spatial data
 

Similar to Principal Component Analysis and Cluster Analysis

Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis Baivab Nag
 
Biostatistics_descriptive stats.pptx
Biostatistics_descriptive stats.pptxBiostatistics_descriptive stats.pptx
Biostatistics_descriptive stats.pptxMohammedAbdela7
 
ODSC India 2018: Topological space creation & Clustering at BigData scale
ODSC India 2018: Topological space creation & Clustering at BigData scaleODSC India 2018: Topological space creation & Clustering at BigData scale
ODSC India 2018: Topological space creation & Clustering at BigData scaleKuldeep Jiwani
 
Data presentation.pptx
Data presentation.pptxData presentation.pptx
Data presentation.pptxssusera0e0e9
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptxjeyanthisivakumar
 
Excel and research
Excel and researchExcel and research
Excel and researchNursing Path
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsAnand Thokal
 
Statisticsforbiologists colstons
Statisticsforbiologists colstonsStatisticsforbiologists colstons
Statisticsforbiologists colstonsandymartin
 
Frequency Distributions
Frequency DistributionsFrequency Distributions
Frequency Distributionsjasondroesch
 
Lecture 3 Dispersion(1).pptx
Lecture 3 Dispersion(1).pptxLecture 3 Dispersion(1).pptx
Lecture 3 Dispersion(1).pptxssuser378d7c
 
Enhancing Parallel Coordinates with Curves
Enhancing Parallel Coordinates with CurvesEnhancing Parallel Coordinates with Curves
Enhancing Parallel Coordinates with Curvesmartinjgraham
 
Topic 3 Measures of Central Tendency -Grouped Data.pptx
Topic 3 Measures of Central Tendency -Grouped Data.pptxTopic 3 Measures of Central Tendency -Grouped Data.pptx
Topic 3 Measures of Central Tendency -Grouped Data.pptxCallplanetsDeveloper
 
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...hpaocec
 
DATA GRAPHICS 8th Sem.pdf
DATA GRAPHICS 8th Sem.pdfDATA GRAPHICS 8th Sem.pdf
DATA GRAPHICS 8th Sem.pdfRavinandan A P
 
Excel and research
Excel and researchExcel and research
Excel and researchNursing Path
 

Similar to Principal Component Analysis and Cluster Analysis (20)

Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
GEOSTATISTICAL_ANALYST
GEOSTATISTICAL_ANALYSTGEOSTATISTICAL_ANALYST
GEOSTATISTICAL_ANALYST
 
Statistics
StatisticsStatistics
Statistics
 
Biostatistics_descriptive stats.pptx
Biostatistics_descriptive stats.pptxBiostatistics_descriptive stats.pptx
Biostatistics_descriptive stats.pptx
 
ODSC India 2018: Topological space creation & Clustering at BigData scale
ODSC India 2018: Topological space creation & Clustering at BigData scaleODSC India 2018: Topological space creation & Clustering at BigData scale
ODSC India 2018: Topological space creation & Clustering at BigData scale
 
Data presentation.pptx
Data presentation.pptxData presentation.pptx
Data presentation.pptx
 
Presenting scientific data graphing
Presenting scientific data graphingPresenting scientific data graphing
Presenting scientific data graphing
 
Types of graphs
Types of graphsTypes of graphs
Types of graphs
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Excel and research
Excel and researchExcel and research
Excel and research
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Statisticsforbiologists colstons
Statisticsforbiologists colstonsStatisticsforbiologists colstons
Statisticsforbiologists colstons
 
Frequency Distributions
Frequency DistributionsFrequency Distributions
Frequency Distributions
 
Lecture 3 Dispersion(1).pptx
Lecture 3 Dispersion(1).pptxLecture 3 Dispersion(1).pptx
Lecture 3 Dispersion(1).pptx
 
Enhancing Parallel Coordinates with Curves
Enhancing Parallel Coordinates with CurvesEnhancing Parallel Coordinates with Curves
Enhancing Parallel Coordinates with Curves
 
Topic 3 Measures of Central Tendency -Grouped Data.pptx
Topic 3 Measures of Central Tendency -Grouped Data.pptxTopic 3 Measures of Central Tendency -Grouped Data.pptx
Topic 3 Measures of Central Tendency -Grouped Data.pptx
 
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
 
DATA GRAPHICS 8th Sem.pdf
DATA GRAPHICS 8th Sem.pdfDATA GRAPHICS 8th Sem.pdf
DATA GRAPHICS 8th Sem.pdf
 
Excel and research
Excel and researchExcel and research
Excel and research
 

More from Muhammed Ameer

Degradome sequencing and small RNA targets
Degradome sequencing and small RNA targetsDegradome sequencing and small RNA targets
Degradome sequencing and small RNA targetsMuhammed Ameer
 
Rural Agricultural Work Experience Programme (RAWEP) Final Report ALB 7108.pdf
Rural Agricultural Work Experience Programme (RAWEP) Final Report ALB 7108.pdfRural Agricultural Work Experience Programme (RAWEP) Final Report ALB 7108.pdf
Rural Agricultural Work Experience Programme (RAWEP) Final Report ALB 7108.pdfMuhammed Ameer
 
Climate Change and Climate Scenario
Climate Change and Climate ScenarioClimate Change and Climate Scenario
Climate Change and Climate ScenarioMuhammed Ameer
 
Case studies in Disaster Management (Four)
Case studies in Disaster Management (Four)Case studies in Disaster Management (Four)
Case studies in Disaster Management (Four)Muhammed Ameer
 
Disaster Preparedness.pptx
Disaster Preparedness.pptxDisaster Preparedness.pptx
Disaster Preparedness.pptxMuhammed Ameer
 
Animal Cell Culture.pptx
Animal Cell Culture.pptxAnimal Cell Culture.pptx
Animal Cell Culture.pptxMuhammed Ameer
 
Gibson Assembly in Cloning
Gibson Assembly in CloningGibson Assembly in Cloning
Gibson Assembly in CloningMuhammed Ameer
 
Label for Honey bottle
Label for Honey bottleLabel for Honey bottle
Label for Honey bottleMuhammed Ameer
 
STUDY ON FORAGING BEHAVIOUR OF STINGLESS BEE, Trigona iridipennis IN NATURAL...
STUDY ON FORAGING BEHAVIOUR OF STINGLESS BEE, Trigona  iridipennis IN NATURAL...STUDY ON FORAGING BEHAVIOUR OF STINGLESS BEE, Trigona  iridipennis IN NATURAL...
STUDY ON FORAGING BEHAVIOUR OF STINGLESS BEE, Trigona iridipennis IN NATURAL...Muhammed Ameer
 
Label Design for strawberry honey
Label Design for strawberry honeyLabel Design for strawberry honey
Label Design for strawberry honeyMuhammed Ameer
 
Digital herbarium of plants
Digital herbarium of plantsDigital herbarium of plants
Digital herbarium of plantsMuhammed Ameer
 
Bee floral plants in India
Bee floral plants in IndiaBee floral plants in India
Bee floral plants in IndiaMuhammed Ameer
 
Herbarium of Bee floral Plants
Herbarium of Bee floral PlantsHerbarium of Bee floral Plants
Herbarium of Bee floral PlantsMuhammed Ameer
 
Stingless bee Box making (Apiculture)
Stingless bee Box making (Apiculture)Stingless bee Box making (Apiculture)
Stingless bee Box making (Apiculture)Muhammed Ameer
 
Vaccination and medication in poultry
Vaccination and medication in poultryVaccination and medication in poultry
Vaccination and medication in poultryMuhammed Ameer
 

More from Muhammed Ameer (20)

Degradome sequencing and small RNA targets
Degradome sequencing and small RNA targetsDegradome sequencing and small RNA targets
Degradome sequencing and small RNA targets
 
Blood Typing.pptx
Blood Typing.pptxBlood Typing.pptx
Blood Typing.pptx
 
Rural Agricultural Work Experience Programme (RAWEP) Final Report ALB 7108.pdf
Rural Agricultural Work Experience Programme (RAWEP) Final Report ALB 7108.pdfRural Agricultural Work Experience Programme (RAWEP) Final Report ALB 7108.pdf
Rural Agricultural Work Experience Programme (RAWEP) Final Report ALB 7108.pdf
 
Goal Setting
Goal SettingGoal Setting
Goal Setting
 
Climate Change and Climate Scenario
Climate Change and Climate ScenarioClimate Change and Climate Scenario
Climate Change and Climate Scenario
 
Case studies in Disaster Management (Four)
Case studies in Disaster Management (Four)Case studies in Disaster Management (Four)
Case studies in Disaster Management (Four)
 
Disaster Preparedness.pptx
Disaster Preparedness.pptxDisaster Preparedness.pptx
Disaster Preparedness.pptx
 
Animal Cell Culture.pptx
Animal Cell Culture.pptxAnimal Cell Culture.pptx
Animal Cell Culture.pptx
 
Gibson Assembly in Cloning
Gibson Assembly in CloningGibson Assembly in Cloning
Gibson Assembly in Cloning
 
Label for Honey bottle
Label for Honey bottleLabel for Honey bottle
Label for Honey bottle
 
STUDY ON FORAGING BEHAVIOUR OF STINGLESS BEE, Trigona iridipennis IN NATURAL...
STUDY ON FORAGING BEHAVIOUR OF STINGLESS BEE, Trigona  iridipennis IN NATURAL...STUDY ON FORAGING BEHAVIOUR OF STINGLESS BEE, Trigona  iridipennis IN NATURAL...
STUDY ON FORAGING BEHAVIOUR OF STINGLESS BEE, Trigona iridipennis IN NATURAL...
 
Label Design for strawberry honey
Label Design for strawberry honeyLabel Design for strawberry honey
Label Design for strawberry honey
 
Bee flora
Bee floraBee flora
Bee flora
 
Digital herbarium of plants
Digital herbarium of plantsDigital herbarium of plants
Digital herbarium of plants
 
Label Honey Bottle
Label Honey BottleLabel Honey Bottle
Label Honey Bottle
 
Bee floral plants in India
Bee floral plants in IndiaBee floral plants in India
Bee floral plants in India
 
Herbarium of Bee floral Plants
Herbarium of Bee floral PlantsHerbarium of Bee floral Plants
Herbarium of Bee floral Plants
 
Stingless bee Box making (Apiculture)
Stingless bee Box making (Apiculture)Stingless bee Box making (Apiculture)
Stingless bee Box making (Apiculture)
 
Emu Bird Production
Emu Bird ProductionEmu Bird Production
Emu Bird Production
 
Vaccination and medication in poultry
Vaccination and medication in poultryVaccination and medication in poultry
Vaccination and medication in poultry
 

Recently uploaded

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknowmakika9823
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 

Recently uploaded (20)

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 

Principal Component Analysis and Cluster Analysis

  • 1. PRINCIPAL COMPONENT ANALYSIS Mohammed Sameer 2021-19-002 Department of Agricultural Statistics Kerala Agricultural University
  • 2. Data reduction technique developed by Hotelling H • Main Aim • Lower the dimensions • Orthogonality of new (transformed) dimensions (principal components)
  • 3.
  • 5.
  • 6. x1 x2 Scatter plot of the data with original axis X1 and X2 (original data) Shift the original axis to the center of the data(mean)
  • 7. x1 x2 Rotate the original axis • Rotate X1(axis 1) by some angle such that variability of the data along that axis is maximum • Rotate X2(axis 2) such that it is perpendicular to the first axis and variability of the data along that axis is second maximum
  • 8. Z1 Z2 • Scatter plot of transformed data • Transformed axis Z1 and Z2
  • 9. Variability of x1 Variability of x2 Original Variance of x1 and x2 are large X1 and x2 are correlated
  • 10. Z1 Z2 Variability of z1 Variability of z2 Transformed axis Variance of z2 is much smaller than variance of z1 Z1 and z2 are uncorrelated
  • 11. Red dots (projection of the original data points onto the rotating line) The spread of the red dots will be maximum when it aligns with the pink mark(line)
  • 12. Projection of points on to a line, the line is such that The projected points has the greatest variability. Projection of points on to a plane, the plane is such that the spread of the points onto that plane is the greatest.
  • 13.
  • 15. Principal Components * First principal component is the direction of greatest variability (covariance) in the data * Second is the next orthogonal (uncorrelated) direction of greatest variability — So first remove all the variability along the first component, and then find the next direction of greatest variability and so on…
  • 16. Principal Components Analysis (PCA) Principle — Linear projection method to reduce the number of parameters — Transfer a set of correlated variables into a new set of uncorrelated variabies — Map the data into a space of lower dimensionality — Form of unsupervised learning Properties — It can be viewed as a rotation of the existing axes to new positions in the space defined by original variables — New axes are orthogonal and represent the directions with maximum variability
  • 17. Computing the components • First center the data points • Project the data points(vectors) onto an axis such that the variability of the projected data points onto that axis is greatest. • It turns out that the variability of x along the transformed axis is the eigen values of cov(x) and the direction of the new axis is along the eigen vectors of cov(x)
  • 18. Dimensionality reduction Choose only first p eigenvectors, based On their eigenvalues Final data set has only p dimensions
  • 19. Bartlets test of sphericity • H0: R=I • H1: R not equal to I In other words H0: scatter plot is sort of sphere centered at origin H1: scatter plot is not a sphere • If scatterplot is a sphere, then no use of PCA • If scatter plot is not a sphere( is ellipse/ellipsoid) then go for PCA
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. • The results of the principal component analysis in milk production of the state of Tamil Nadu revealed that milk production was having positive relationship with the indigenous cattle population, she- buffalo population, number of veterinary institutions, gross cropped area, area under paddy. area under groundnut, native purebred cattle population, graded and indigenous buffalo population, agricultural labour population, crossbred cattle population, no. of financial institutions and graded buffalo population. • This suggests that effecting a shift in herd structure in favour of cross- bred cows and graded buffalos can augment the milk production potential. Results
  • 26. Introduction • Cluster is a number of things of the same kind growing or joined together • A group of homogeneous things The principle: • Objects in the same group are similar to each other • Objects in the different group are as dissimilar as possible
  • 28. Cluster Analysis Model partition Obtain similarity or dissimilarity Objects to be clustered output Cluster 1 Cluster 2 Cluster 3 Cluster k
  • 31.
  • 32. Clustering algorithms • Hierarchical clustering • Centroid-based clustering • Graph-based clustering • Density-based clustering
  • 33. Single(nearest neighbour): distance between two clusters = distance between two members of the two clusters
  • 35. Centroid : distance between multivariate means of each clusters
  • 36. OTHER JOINING ALGORITHMS • AVERAGE • MEDIAN • WARD
  • 37. objects 1 2 3 4 5 1 0 2 9 0 3 3 7 0 4 6 5 9 0 5 11 10 2 8 0 2 3,5 is made as one cluster
  • 38. objects 5,3 1 2 4 5,3 0 1 3 0 2 7 9 0 4 8 6 5 0 3 1,3,5 is made as one cluster
  • 39. Object 5 object 3 object 1 object 2 object 4 Dendogram
  • 40. • Graph based clustering • HCS (Highly Connected Subgraphs) clustering algorithm • Points which are highly connected are clustered Similarity graph
  • 41. Density based clustering The algorithms work via sliding windows moving toward the high density of points
  • 42. How many clusters to retain? At what stage I have to stop the algorithm. Scree plot
  • 43.
  • 44. • The cluster analysis was carried out based on area, production, and productivity of different agricultural and horticultural crops which were predominantly grown in the districts of Rajasthan • calculated for two different periods 1980-1995 and 1996-2014 independently.
  • 45. • Crop cluster based on area during 1980- 1995 • Crop cluster based on area during 1996- 2014 • Crop cluster based on production during 1980-1995 • Crop cluster based on production during 1996-2014 • Crop cluster based on productivity during 1990-1995 • Crop cluster based on productivity during 1996-2014
  • 46.
  • 47.
  • 48.
  • 49. Conclusions • From the present study we concluded that when the performance of crop clusters based on area between two periods was compared, it was evident that gram and cotton has shifted over the years in the second period of study. • When comparison of the performances of crop clusters based on production between two periods was observed that gram, mustard & rapeseed and cotton production shifted over the period. • It means these crops were made cluster in the first period but not in the second period. While wheat and bajra were the crops which made clusters or had similarity in production across all the districts of Rajasthan from first period to second period.
  • 50. • The present study also concluded that horticultural crops had similarity in productivity across all the districts of Rajasthan during the both period. • It means coriander, garlic and pea productivity included over the years in the second period of the study. Only wheat and bajra were the crops which had similarity in productivity across all the districts of Rajasthan from first period to second period.

Editor's Notes

  1. noting