K-MEANS CLUSTER ANALYSIS
SUBMITTED TO:
PROF. SOMEN SAHU
DEPT. OF FES
-AGNIVA PRADHAN
M.F.Sc 2ND SEMESTER
DEPT. OF FNT
M/F/2021/03
• K-means clustering is a simple unsupervised learning algorithm
for solving clustering problems. It partitions a given data set into
a number of clusters, k, which is fixed beforehand. Each cluster is
represented by a point (its centroid), and every observation is
assigned to the nearest centroid. The centroids are then recomputed
from their assigned observations, and the assign-and-recompute cycle
repeats until the assignments stop changing.
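
The assign-and-recompute cycle described above can be sketched in plain Python. This is a minimal sketch, not a reference implementation: the function name `kmeans`, the `max_iter` cap, and the toy points are assumptions made for illustration and do not come from the slides.

```python
import math

def kmeans(points, centroids, max_iter=100):
    """Batch k-means sketch: returns (labels, centroids).

    points    -- list of (x, y) tuples
    centroids -- initial centroids, one (x, y) tuple per cluster (k is fixed here)
    """
    centroids = list(centroids)
    labels = None
    for _ in range(max_iter):
        # Assignment step: index of the nearest centroid for every point.
        new_labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:  # no point changed cluster: converged
            break
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids

# Made-up points for illustration only (not from the slides).
toy_points = [(1.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0)]
labels, centroids = kmeans(toy_points, [(1.0, 1.0), (9.0, 8.0)])
print(labels)     # cluster index per point
print(centroids)  # final centroid per cluster
```

The loop stops as soon as an assignment pass leaves every label unchanged, which is one common reading of "until a desired result is reached".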
This is a versatile algorithm that can be used for many kinds of
grouping tasks. Some example use cases are:
Behavioural segmentation:
  • Segment by purchase history
  • Segment by activities on application, website, or platform
  • Define personas based on interests
  • Create profiles based on activity monitoring
Inventory categorization:
  • Group inventory by sales activity
  • Group inventory by manufacturing metrics
Sorting sensor measurements:
  • Detect activity types in motion sensors
  • Group images
  • Separate audio
  • Identify groups in health monitoring
Detecting bots or anomalies:
  • Separate valid activity groups from bots
  • Group valid activity to clean up outlier detection
In addition, monitoring whether a tracked data point switches
between groups over time can be used to detect meaningful
changes in the data.
• Suppose we have the following data on the height and weight of
students:
No.   Height (cm)   Weight (kg)
1     185           72
2     170           56
3     168           60
4     179           68
5     182           72
6     188           77
7     180           71
8     180           70
9     183           84
10    180           88
11    180           67
12    177           76
[Figure: scatter plot of weight (y-axis, 0 to 100) against height (x-axis, 165 to 190)]
• Now I need to classify the data points into 2 clusters, named K1 and K2,
using the K-means algorithm.
• Here I am using the centroid concept, i.e. every cluster has a centroid
value associated with it.
• The centroid is the reference value against which the remaining data
points are clustered.
• Then we need to calculate the distance of each data point from each
centroid.
• Here the distance is the Euclidean distance:
• $ED = \sqrt{(X_o - X_c)^2 + (Y_o - Y_c)^2}$
• $X_o$, $Y_o$: observed values
• $X_c$, $Y_c$: centroid values
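
The Euclidean distance between an observed point and a centroid is the square root of the sum of the squared coordinate differences. A small sketch of that calculation follows; the function name `euclidean_distance` and the 3-4-5 check values are assumptions chosen for illustration.

```python
import math

def euclidean_distance(observed, centroid):
    """Straight-line distance between an observed point and a centroid."""
    x_o, y_o = observed
    x_c, y_c = centroid
    return math.sqrt((x_o - x_c) ** 2 + (y_o - y_c) ** 2)

# A 3-4-5 right triangle makes the result easy to check by hand.
print(euclidean_distance((3, 4), (0, 0)))  # 5.0
```

Both differences are squared, so the order of subtraction does not affect the result.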
• I have taken the 1st row as the centroid for K1, i.e. (185, 72), and the
2nd row as the centroid for K2, i.e. (170, 56).
• Now we need to cluster the data into two clusters by measuring the
Euclidean distance.
• ED for the 3rd row (168, 60):
K1: $\sqrt{(168 - 185)^2 + (60 - 72)^2} = \sqrt{433} \approx 20.81$
K2: $\sqrt{(168 - 170)^2 + (60 - 56)^2} = \sqrt{20} \approx 4.47$
• As the 3rd row is nearer to K2 [ED for K2 < ED for K1], the 3rd row
goes into K2.
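
The assignment of the 3rd row can be checked with a few lines of Python. This is only a sketch of the comparison step; the variable names are illustrative, while the centroid and row values are the ones given in the slides.

```python
import math

centroids = {"K1": (185, 72), "K2": (170, 56)}  # initial centroids from the slides
row_3 = (168, 60)                                # (height cm, weight kg)

# Distance from row 3 to every centroid.
distances = {name: math.dist(row_3, c) for name, c in centroids.items()}
nearest = min(distances, key=distances.get)

for name, d in distances.items():
    print(f"{name}: {d:.2f}")   # K1: 20.81, K2: 4.47
print("row 3 goes to", nearest)  # row 3 goes to K2
```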
So our new clusters are:
K1 – 1st row
K2 – 2nd and 3rd rows
Centroids at this point: K1 (185, 72), K2 (170, 56)
• Now we need to recalculate the centroid for K2 [as the 3rd row has
joined K2].
• So the new centroid of K2 =
$\left(\frac{170 + 168}{2}, \frac{56 + 60}{2}\right) = (169, 58)$
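
The centroid update is a coordinate-wise mean over the cluster's members. A minimal sketch follows; the helper name `centroid` is an assumption, and the two member points are rows 2 and 3 from the table.

```python
def centroid(points):
    """Mean of each coordinate over the cluster's members."""
    n = len(points)
    return tuple(sum(coord) / n for coord in zip(*points))

k2_members = [(170, 56), (168, 60)]  # rows 2 and 3 (height cm, weight kg)
print(centroid(k2_members))  # (169.0, 58.0)
```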
• Now we need to calculate the ED for the 4th row as before, using the
updated centroid, and continue in the same way for the remaining rows.
• Thus, we get the final K1 and K2 clusters as:
• K1: {1, 4, 5, 6, 7, 8, 9, 10, 11, 12}
• K2: {2, 3}
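
The row-by-row pass can be reproduced on all twelve rows with the sketch below. It assumes that the centroid of a cluster is recomputed every time that cluster gains a member; the slides show this explicitly only for K2 after row 3, so treat that rule and the helper name `mean_point` as assumptions.

```python
import math

# (height cm, weight kg) for rows 1..12, copied from the table in the slides.
rows = [(185, 72), (170, 56), (168, 60), (179, 68), (182, 72), (188, 77),
        (180, 71), (180, 70), (183, 84), (180, 88), (180, 67), (177, 76)]

def mean_point(points):
    """Coordinate-wise mean of a list of (x, y) points."""
    return tuple(sum(c) / len(points) for c in zip(*points))

# Row 1 seeds K1 and row 2 seeds K2, as in the slides.
clusters = {"K1": [1], "K2": [2]}
centroids = {"K1": rows[0], "K2": rows[1]}

for number in range(3, len(rows) + 1):
    point = rows[number - 1]
    nearest = min(centroids, key=lambda name: math.dist(point, centroids[name]))
    clusters[nearest].append(number)
    # Assumption: recompute the centroid of the cluster that just gained a member.
    centroids[nearest] = mean_point([rows[i - 1] for i in clusters[nearest]])

print(clusters)   # row numbers per cluster
print(centroids)  # centroids after the last row
```

Under this assumption the memberships match the final clusters listed above, and K1's centroid moves as it gains members, ending at about (181.4, 74.5), while K2 stays at (169, 58).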
Centroid labels shown in the figure: K1 (185, 72), K2 (169, 58)
Got 2 new variables