SlideShare a Scribd company logo
1 of 4
Rough K-Means DR. E. N. SATHISHKUMAR
Department of Computer Science, Periyar University, Salem - 11
ROUGH K-MEANS CLUSTERING ALGORITHM
Algorithm: Rough K-Means
Input: Dataset of n objects with d features, number of clusters K and values of
parameters Wlower, Wupper, and Epsilon.
Output: Lower approximation 𝑈(𝐾) and Upper approximation 𝑈(𝐾) of K
Clusters.
Step1: Randomly assign each data object one lower approximation𝑈(𝐾). By
definition (property 2) the data object also belongs to upper approximation
𝑈𝐾 of the same Cluster.
Step 2: Compute Cluster Centroids Cj
If 𝑈(𝐾) ≠ ∅ and 𝑈(𝐾) - 𝑈(𝐾) = ∅
Cj =
𝑥 𝑖
𝑈(𝐾)𝑥∈ 𝑈(𝐾)
Else If 𝑈(𝐾) = ∅ and 𝑈(𝐾) - 𝑈(𝐾) ≠ ∅
Cj =
𝑥 𝑖
𝑈(𝐾) − 𝑈(𝐾)𝑥∈ 𝑈(𝐾) − 𝑈(𝐾)
Else
Cj = Wlower ×
𝑥 𝑖
𝑈(𝐾)𝑥∈ 𝑈(𝐾) + Wupper ×
𝑥 𝑖
𝑈(𝐾) − 𝑈(𝐾)𝑥∈ 𝑈(𝐾) − 𝑈(𝐾)
Step 3: Assign each object to the lower approximation 𝑈 𝐾 or upper
approximation 𝑈(𝐾) of cluster i respectively. For each object vector x, let
d(X, Cj) be the distance between itself and the centroid of cluster Cj.
d(X, Cj) = 𝑚𝑖𝑛1 ≤j ≤K d(X, Cj).
The ratio d(X, Ci) / d(X, Cj), 1≤ i, j ≤ K is used to determine the membership
of x as follow: If d(X, Ci) / d(X, Cj) ≤ epsilon, for any pair (i, j), the
𝑥 ∈ 𝑈(𝐶𝑖) and 𝑥 ∈ 𝑈(𝐶𝑗 ) and x will not be a part of any lower
approximation. Otherwise, 𝑥 ∈ 𝑈(𝐶𝑖), such that d(X, Ci) is the minimum of
1 ≤ i ≤ K. In addition 𝑥 ∈ 𝑈(𝐶𝑖).
Step 4: Repeat Steps 2 and 3 until convergence.
Rough K-Means DR. E. N. SATHISHKUMAR
Department of Computer Science, Periyar University, Salem - 11
Illustrative Example
Table 1 shows example information system with real-valued conditional attributes.
It consists of six objects/genes, and two features/samples. k = 2, which is the
number of clusters. Weight of the lower approximation Wlower=0.7, Weight of the
upper approximation Wupper = 0.3 and Relative threshold = 2.
Table 1 Example dataset for Rough K-Means
U X Y
1 0 3
2 1 3
3 3 1
4 3 0.5
5 5 0
6 6 0
Step1: Randomly assign each data objects to exactly one lower approximation
K1 = {(0, 3), (1, 3), (3, 1)}
K2 = {(3, 0.5), (5, 0), (6, 0)}
Step 2: In this case 𝑈(𝐾) ≠ ∅ and 𝑈(𝐾) - 𝑈(𝐾) = ∅, so we compute the centroid
using Cj =
𝑥 𝑖
𝑈(𝐾)𝑥∈ 𝑈(𝐾) ,
C1 =
0+1+3
3
,
3+3+1
3
= (1.33, 2.33)
C2 =
3+5+6
3
,
0.5+0+0
3
= (4.67, 0.17)
Find the distance from centroid to each point using equlidean distance,
𝐷𝑖 = (𝑥2 − 𝑥1 )2 + (𝑦2 − 𝑦1 )2
Rough K-Means DR. E. N. SATHISHKUMAR
Department of Computer Science, Periyar University, Salem - 11
d1(X, C1):
(0, 3)(1.33, 2.33) ⇒ (1.33 − 0)2 + (2.33 − 3)2 = 1.49
(1, 3)(1.33, 2.33) ⇒ (1.33 − 1)2 + (2.33 − 3)2 = 0.75
(3, 1)(1.33, 2.33) ⇒ (1.33 − 3)2 + (2.33 − 1)2 = 2.13
(3, 0.5)(1.33, 2.33) ⇒ (1.33 − 3)2 + (2.33 − 0.5)2 = 2.48
(5, 0)(1.33, 2.33) ⇒ (1.33 − 5)2 + (2.33 − 0)2 = 4.45
(6, 0)(1.33, 2.33) ⇒ (1.33 − 6)2 + (2.33 − 0)2 = 5.22
d2(X, C2):
(0, 3)(4.67, 0.17) ⇒ (4.67 − 0)2 + (0.17 − 3)2 = 5.46
(1, 3)(4.67, 0.17) ⇒ (4.67 − 1)2 + (0.17 − 3)2 = 4.63
(3, 1)(4.67, 0.17) ⇒ (4.67 − 3)2 + (0.17 − 1)2 = 1.86
(3, 0.5)(4.67, 0.17) ⇒ (4.67 − 3)2 + (0.17 − 0.5)2 = 1.70
(5, 0)(4.67, 0.17) ⇒ (4.67 − 5)2 + (0.17 − 0)2 = 0.37
(6, 0)(4.67, 0.17) ⇒ (4.67 − 6)2 + (0.17 − 0)2 = 1.34
Step 3: Assign each object to the lower approximation 𝑈 𝐾 or upper
approximation 𝑈(𝐾) of cluster i respectively. Check If d(X, Ci) /d(X, Cj) ≤ epsilon.
1. (0, 3) ⇒ d2 / d1 = 5.46/1.49 = 3.66443 ≰ 2. So, x1 will be a part of 𝐾1
2. (1, 3) ⇒ 4.63/0.75 = 6.173 ≰ 2. So, x2 will be a part of 𝐾1
3. (3, 1) ⇒ 2.13/1.86 = 1.145 < 2 , so x3 will not be a part of 𝐾1& 𝐾2
4. (3, 0.5) ⇒ 2.48/1.70 = 1.458 < 2 , so x4 will not be a part of 𝐾1& 𝐾2
5. (5, 0) ⇒ 4.35/0.37 = 11.756 ≰ 2. So, x5 will be a part of 𝐾2
6. (6, 0) ⇒ 5.22/1.34 = 3.895 ≰ 2. So, x6 will be a part of 𝐾2
Rough K-Means DR. E. N. SATHISHKUMAR
Department of Computer Science, Periyar University, Salem - 11
Now, we have clusters
𝐾1= {(0, 3), (1, 3)} 𝐾1 = {(0, 3), (1, 3), (3, 1), (3, 0.5)}
𝐾2= {(5, 0), (6, 0)} 𝐾2 = {(5, 0), (6, 0), (3, 1), (3, 0.5)}
Here, 𝑈(𝐾) ≠ ∅ and 𝑈(𝐾) - 𝑈(𝐾) ≠ ∅ then find out the new centroid by using
below equation,
Cj = Wlower ×
𝑥 𝑖
𝑈(𝐾)𝑥∈ 𝑈(𝐾) + Wupper ×
𝑥 𝑖
𝑈(𝐾) − 𝑈(𝐾)𝑥∈ 𝑈(𝐾) − 𝑈(𝐾)
C1 = 0.7 ×
0+1
2
,
3+3
2
+ 0.3 ×
3+3
2
,
1+0.5
2
= (1.25, 2.325)
C2 = 0.7 ×
5+6
2
,
0+0
2
+ 0.3 ×
3+3
2
,
1+0.5
2
= (4.75, 0.225)
Step 4: Repeat Steps 2 and 3 until convergence (Old Centroid = New Centroid).

More Related Content

What's hot

Predicate logic_2(Artificial Intelligence)
Predicate logic_2(Artificial Intelligence)Predicate logic_2(Artificial Intelligence)
Predicate logic_2(Artificial Intelligence)SHUBHAM KUMAR GUPTA
 
Knowledge Engineering in FOL.
Knowledge Engineering in FOL.Knowledge Engineering in FOL.
Knowledge Engineering in FOL.Megha Sharma
 
Knowledge representation and Predicate logic
Knowledge representation and Predicate logicKnowledge representation and Predicate logic
Knowledge representation and Predicate logicAmey Kerkar
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Suraj Aavula
 
Birch Algorithm With Solved Example
Birch Algorithm With Solved ExampleBirch Algorithm With Solved Example
Birch Algorithm With Solved Examplekailash shaw
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetSungminYou
 
Unit I & II in Principles of Soft computing
Unit I & II in Principles of Soft computing Unit I & II in Principles of Soft computing
Unit I & II in Principles of Soft computing Sivagowry Shathesh
 
Diabetes prediction using machine learning
Diabetes prediction using machine learningDiabetes prediction using machine learning
Diabetes prediction using machine learningdataalcott
 
Performance analysis(Time & Space Complexity)
Performance analysis(Time & Space Complexity)Performance analysis(Time & Space Complexity)
Performance analysis(Time & Space Complexity)swapnac12
 
K means clustering
K means clusteringK means clustering
K means clusteringkeshav goyal
 
backpropagation in neural networks
backpropagation in neural networksbackpropagation in neural networks
backpropagation in neural networksAkash Goel
 
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...Simplilearn
 
Feedforward neural network
Feedforward neural networkFeedforward neural network
Feedforward neural networkSopheaktra YONG
 

What's hot (20)

Predicate logic
 Predicate logic Predicate logic
Predicate logic
 
Predicate logic_2(Artificial Intelligence)
Predicate logic_2(Artificial Intelligence)Predicate logic_2(Artificial Intelligence)
Predicate logic_2(Artificial Intelligence)
 
L7 fuzzy relations
L7 fuzzy relationsL7 fuzzy relations
L7 fuzzy relations
 
Knowledge Engineering in FOL.
Knowledge Engineering in FOL.Knowledge Engineering in FOL.
Knowledge Engineering in FOL.
 
Knowledge representation and Predicate logic
Knowledge representation and Predicate logicKnowledge representation and Predicate logic
Knowledge representation and Predicate logic
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)
 
Optimal binary search tree dynamic programming
Optimal binary search tree   dynamic programmingOptimal binary search tree   dynamic programming
Optimal binary search tree dynamic programming
 
Birch Algorithm With Solved Example
Birch Algorithm With Solved ExampleBirch Algorithm With Solved Example
Birch Algorithm With Solved Example
 
Support Vector Machines ( SVM )
Support Vector Machines ( SVM ) Support Vector Machines ( SVM )
Support Vector Machines ( SVM )
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
 
Unit I & II in Principles of Soft computing
Unit I & II in Principles of Soft computing Unit I & II in Principles of Soft computing
Unit I & II in Principles of Soft computing
 
Diabetes prediction using machine learning
Diabetes prediction using machine learningDiabetes prediction using machine learning
Diabetes prediction using machine learning
 
Performance analysis(Time & Space Complexity)
Performance analysis(Time & Space Complexity)Performance analysis(Time & Space Complexity)
Performance analysis(Time & Space Complexity)
 
Knn 160904075605-converted
Knn 160904075605-convertedKnn 160904075605-converted
Knn 160904075605-converted
 
Dempster shafer theory
Dempster shafer theoryDempster shafer theory
Dempster shafer theory
 
Soft computing
Soft computingSoft computing
Soft computing
 
K means clustering
K means clusteringK means clustering
K means clustering
 
backpropagation in neural networks
backpropagation in neural networksbackpropagation in neural networks
backpropagation in neural networks
 
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
 
Feedforward neural network
Feedforward neural networkFeedforward neural network
Feedforward neural network
 

Similar to Rough K-Means Clustering Algorithm Explained

Dynamic Programing_LCS.ppt
Dynamic Programing_LCS.pptDynamic Programing_LCS.ppt
Dynamic Programing_LCS.pptmanasgaming4
 
Rosser's theorem
Rosser's theoremRosser's theorem
Rosser's theoremWathna
 
On approximate bounds of zeros of polynomials within
On approximate bounds of zeros of polynomials withinOn approximate bounds of zeros of polynomials within
On approximate bounds of zeros of polynomials withineSAT Publishing House
 
Response Surface in Tensor Train format for Uncertainty Quantification
Response Surface in Tensor Train format for Uncertainty QuantificationResponse Surface in Tensor Train format for Uncertainty Quantification
Response Surface in Tensor Train format for Uncertainty QuantificationAlexander Litvinenko
 
Some properties of two-fuzzy Nor med spaces
Some properties of two-fuzzy Nor med spacesSome properties of two-fuzzy Nor med spaces
Some properties of two-fuzzy Nor med spacesIOSR Journals
 
machinelearning project
machinelearning projectmachinelearning project
machinelearning projectLianli Liu
 
Decoding BCH-Code.pdf
Decoding BCH-Code.pdfDecoding BCH-Code.pdf
Decoding BCH-Code.pdfKundanSasi
 
Identification of the Mathematical Models of Complex Relaxation Processes in ...
Identification of the Mathematical Models of Complex Relaxation Processes in ...Identification of the Mathematical Models of Complex Relaxation Processes in ...
Identification of the Mathematical Models of Complex Relaxation Processes in ...Vladimir Bakhrushin
 
1 illustrating limit of a function
1 illustrating limit of a function1 illustrating limit of a function
1 illustrating limit of a functionJRCatador
 
51554 0131469657 ism-13
51554 0131469657 ism-1351554 0131469657 ism-13
51554 0131469657 ism-13Carlos Fuentes
 
Random variables.pptx
Random variables.pptxRandom variables.pptx
Random variables.pptxSKalyani_cbe
 
Applied Mathematics Multiple Integration by Mrs. Geetanjali P.Kale.pdf
Applied Mathematics Multiple Integration by Mrs. Geetanjali P.Kale.pdfApplied Mathematics Multiple Integration by Mrs. Geetanjali P.Kale.pdf
Applied Mathematics Multiple Integration by Mrs. Geetanjali P.Kale.pdfGeetanjaliRao6
 
Higher order derivatives for N -body simulations
Higher order derivatives for N -body simulationsHigher order derivatives for N -body simulations
Higher order derivatives for N -body simulationsKeigo Nitadori
 
A Probabilistic Algorithm for Computation of Polynomial Greatest Common with ...
A Probabilistic Algorithm for Computation of Polynomial Greatest Common with ...A Probabilistic Algorithm for Computation of Polynomial Greatest Common with ...
A Probabilistic Algorithm for Computation of Polynomial Greatest Common with ...mathsjournal
 
Bayesian inference on mixtures
Bayesian inference on mixturesBayesian inference on mixtures
Bayesian inference on mixturesChristian Robert
 

Similar to Rough K-Means Clustering Algorithm Explained (20)

Dynamic Programing_LCS.ppt
Dynamic Programing_LCS.pptDynamic Programing_LCS.ppt
Dynamic Programing_LCS.ppt
 
Rosser's theorem
Rosser's theoremRosser's theorem
Rosser's theorem
 
On approximate bounds of zeros of polynomials within
On approximate bounds of zeros of polynomials withinOn approximate bounds of zeros of polynomials within
On approximate bounds of zeros of polynomials within
 
Response Surface in Tensor Train format for Uncertainty Quantification
Response Surface in Tensor Train format for Uncertainty QuantificationResponse Surface in Tensor Train format for Uncertainty Quantification
Response Surface in Tensor Train format for Uncertainty Quantification
 
Some properties of two-fuzzy Nor med spaces
Some properties of two-fuzzy Nor med spacesSome properties of two-fuzzy Nor med spaces
Some properties of two-fuzzy Nor med spaces
 
machinelearning project
machinelearning projectmachinelearning project
machinelearning project
 
Decoding BCH-Code.pdf
Decoding BCH-Code.pdfDecoding BCH-Code.pdf
Decoding BCH-Code.pdf
 
Identification of the Mathematical Models of Complex Relaxation Processes in ...
Identification of the Mathematical Models of Complex Relaxation Processes in ...Identification of the Mathematical Models of Complex Relaxation Processes in ...
Identification of the Mathematical Models of Complex Relaxation Processes in ...
 
1 illustrating limit of a function
1 illustrating limit of a function1 illustrating limit of a function
1 illustrating limit of a function
 
51554 0131469657 ism-13
51554 0131469657 ism-1351554 0131469657 ism-13
51554 0131469657 ism-13
 
Random variables.pptx
Random variables.pptxRandom variables.pptx
Random variables.pptx
 
Applied Mathematics Multiple Integration by Mrs. Geetanjali P.Kale.pdf
Applied Mathematics Multiple Integration by Mrs. Geetanjali P.Kale.pdfApplied Mathematics Multiple Integration by Mrs. Geetanjali P.Kale.pdf
Applied Mathematics Multiple Integration by Mrs. Geetanjali P.Kale.pdf
 
Higher order derivatives for N -body simulations
Higher order derivatives for N -body simulationsHigher order derivatives for N -body simulations
Higher order derivatives for N -body simulations
 
Cluster
ClusterCluster
Cluster
 
CBSE Mathematics sample question paper with marking scheme
CBSE Mathematics sample question paper with marking schemeCBSE Mathematics sample question paper with marking scheme
CBSE Mathematics sample question paper with marking scheme
 
lec33.ppt
lec33.pptlec33.ppt
lec33.ppt
 
A05330107
A05330107A05330107
A05330107
 
A Probabilistic Algorithm for Computation of Polynomial Greatest Common with ...
A Probabilistic Algorithm for Computation of Polynomial Greatest Common with ...A Probabilistic Algorithm for Computation of Polynomial Greatest Common with ...
A Probabilistic Algorithm for Computation of Polynomial Greatest Common with ...
 
maths 12th.pdf
maths 12th.pdfmaths 12th.pdf
maths 12th.pdf
 
Bayesian inference on mixtures
Bayesian inference on mixturesBayesian inference on mixtures
Bayesian inference on mixtures
 

Recently uploaded

Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 

Rough K-Means Clustering Algorithm Explained

  • 1. Rough K-Means DR. E. N. SATHISHKUMAR Department of Computer Science, Periyar University, Salem - 11 ROUGH K-MEANS CLUSTERING ALGORITHM Algorithm: Rough K-Means Input: Dataset of n objects with d features, number of clusters K and values of parameters Wlower, Wupper, and Epsilon. Output: Lower approximation 𝑈(𝐾) and Upper approximation 𝑈(𝐾) of K Clusters. Step1: Randomly assign each data object one lower approximation𝑈(𝐾). By definition (property 2) the data object also belongs to upper approximation 𝑈𝐾 of the same Cluster. Step 2: Compute Cluster Centroids Cj If 𝑈(𝐾) ≠ ∅ and 𝑈(𝐾) - 𝑈(𝐾) = ∅ Cj = 𝑥 𝑖 𝑈(𝐾)𝑥∈ 𝑈(𝐾) Else If 𝑈(𝐾) = ∅ and 𝑈(𝐾) - 𝑈(𝐾) ≠ ∅ Cj = 𝑥 𝑖 𝑈(𝐾) − 𝑈(𝐾)𝑥∈ 𝑈(𝐾) − 𝑈(𝐾) Else Cj = Wlower × 𝑥 𝑖 𝑈(𝐾)𝑥∈ 𝑈(𝐾) + Wupper × 𝑥 𝑖 𝑈(𝐾) − 𝑈(𝐾)𝑥∈ 𝑈(𝐾) − 𝑈(𝐾) Step 3: Assign each object to the lower approximation 𝑈 𝐾 or upper approximation 𝑈(𝐾) of cluster i respectively. For each object vector x, let d(X, Cj) be the distance between itself and the centroid of cluster Cj. d(X, Cj) = 𝑚𝑖𝑛1 ≤j ≤K d(X, Cj). The ratio d(X, Ci) / d(X, Cj), 1≤ i, j ≤ K is used to determine the membership of x as follow: If d(X, Ci) / d(X, Cj) ≤ epsilon, for any pair (i, j), the 𝑥 ∈ 𝑈(𝐶𝑖) and 𝑥 ∈ 𝑈(𝐶𝑗 ) and x will not be a part of any lower approximation. Otherwise, 𝑥 ∈ 𝑈(𝐶𝑖), such that d(X, Ci) is the minimum of 1 ≤ i ≤ K. In addition 𝑥 ∈ 𝑈(𝐶𝑖). Step 4: Repeat Steps 2 and 3 until convergence.
  • 2. Rough K-Means DR. E. N. SATHISHKUMAR Department of Computer Science, Periyar University, Salem - 11 Illustrative Example Table 1 shows example information system with real-valued conditional attributes. It consists of six objects/genes, and two features/samples. k = 2, which is the number of clusters. Weight of the lower approximation Wlower=0.7, Weight of the upper approximation Wupper = 0.3 and Relative threshold = 2. Table 1 Example dataset for Rough K-Means U X Y 1 0 3 2 1 3 3 3 1 4 3 0.5 5 5 0 6 6 0 Step1: Randomly assign each data objects to exactly one lower approximation K1 = {(0, 3), (1, 3), (3, 1)} K2 = {(3, 0.5), (5, 0), (6, 0)} Step 2: In this case 𝑈(𝐾) ≠ ∅ and 𝑈(𝐾) - 𝑈(𝐾) = ∅, so we compute the centroid using Cj = 𝑥 𝑖 𝑈(𝐾)𝑥∈ 𝑈(𝐾) , C1 = 0+1+3 3 , 3+3+1 3 = (1.33, 2.33) C2 = 3+5+6 3 , 0.5+0+0 3 = (4.67, 0.17) Find the distance from centroid to each point using equlidean distance, 𝐷𝑖 = (𝑥2 − 𝑥1 )2 + (𝑦2 − 𝑦1 )2
  • 3. Rough K-Means DR. E. N. SATHISHKUMAR Department of Computer Science, Periyar University, Salem - 11 d1(X, C1): (0, 3)(1.33, 2.33) ⇒ (1.33 − 0)2 + (2.33 − 3)2 = 1.49 (1, 3)(1.33, 2.33) ⇒ (1.33 − 1)2 + (2.33 − 3)2 = 0.75 (3, 1)(1.33, 2.33) ⇒ (1.33 − 3)2 + (2.33 − 1)2 = 2.13 (3, 0.5)(1.33, 2.33) ⇒ (1.33 − 3)2 + (2.33 − 0.5)2 = 2.48 (5, 0)(1.33, 2.33) ⇒ (1.33 − 5)2 + (2.33 − 0)2 = 4.45 (6, 0)(1.33, 2.33) ⇒ (1.33 − 6)2 + (2.33 − 0)2 = 5.22 d2(X, C2): (0, 3)(4.67, 0.17) ⇒ (4.67 − 0)2 + (0.17 − 3)2 = 5.46 (1, 3)(4.67, 0.17) ⇒ (4.67 − 1)2 + (0.17 − 3)2 = 4.63 (3, 1)(4.67, 0.17) ⇒ (4.67 − 3)2 + (0.17 − 1)2 = 1.86 (3, 0.5)(4.67, 0.17) ⇒ (4.67 − 3)2 + (0.17 − 0.5)2 = 1.70 (5, 0)(4.67, 0.17) ⇒ (4.67 − 5)2 + (0.17 − 0)2 = 0.37 (6, 0)(4.67, 0.17) ⇒ (4.67 − 6)2 + (0.17 − 0)2 = 1.34 Step 3: Assign each object to the lower approximation 𝑈 𝐾 or upper approximation 𝑈(𝐾) of cluster i respectively. Check If d(X, Ci) /d(X, Cj) ≤ epsilon. 1. (0, 3) ⇒ d2 / d1 = 5.46/1.49 = 3.66443 ≰ 2. So, x1 will be a part of 𝐾1 2. (1, 3) ⇒ 4.63/0.75 = 6.173 ≰ 2. So, x2 will be a part of 𝐾1 3. (3, 1) ⇒ 2.13/1.86 = 1.145 < 2 , so x3 will not be a part of 𝐾1& 𝐾2 4. (3, 0.5) ⇒ 2.48/1.70 = 1.458 < 2 , so x4 will not be a part of 𝐾1& 𝐾2 5. (5, 0) ⇒ 4.35/0.37 = 11.756 ≰ 2. So, x5 will be a part of 𝐾2 6. (6, 0) ⇒ 5.22/1.34 = 3.895 ≰ 2. So, x6 will be a part of 𝐾2
  • 4. Rough K-Means DR. E. N. SATHISHKUMAR Department of Computer Science, Periyar University, Salem - 11 Now, we have clusters 𝐾1= {(0, 3), (1, 3)} 𝐾1 = {(0, 3), (1, 3), (3, 1), (3, 0.5)} 𝐾2= {(5, 0), (6, 0)} 𝐾2 = {(5, 0), (6, 0), (3, 1), (3, 0.5)} Here, 𝑈(𝐾) ≠ ∅ and 𝑈(𝐾) - 𝑈(𝐾) ≠ ∅ then find out the new centroid by using below equation, Cj = Wlower × 𝑥 𝑖 𝑈(𝐾)𝑥∈ 𝑈(𝐾) + Wupper × 𝑥 𝑖 𝑈(𝐾) − 𝑈(𝐾)𝑥∈ 𝑈(𝐾) − 𝑈(𝐾) C1 = 0.7 × 0+1 2 , 3+3 2 + 0.3 × 3+3 2 , 1+0.5 2 = (1.25, 2.325) C2 = 0.7 × 5+6 2 , 0+0 2 + 0.3 × 3+3 2 , 1+0.5 2 = (4.75, 0.225) Step 4: Repeat Steps 2 and 3 until convergence (Old Centroid = New Centroid).