SlideShare a Scribd company logo
1 of 35
Download to read offline
Towards Cluster Validity Index Evaluation and
Selection
Andrey Filchenkov, Sergey Muravyov, Vladimir Parfenov
ITMO University St. Petersburg, Russia
{afilchenkov,smuravyov}@corp.ifmo.ru, parfenov@mail.ifmo.ru
11.11.2016
What is clustering?
Clustering means grouping the objects based on the
information found in the data describing the objects or their
relationships.
The goal is that the objects in a group will be similar (or
related) to one other and different from (or unrelated to) the
objects in other groups.
2 / 35
Examples of clustering
3 / 35
Applications of clustering
Biology
Medicine
Business and marketing
Social science
Computer science
4 / 35
Clustering Evaluation
No target field
Two ways of partition evaluation:
• External
• Internal
5 / 35
External measures
Based on known class labels and external benchmarks
Examples:
• F-measure
• Rand measure
• Jaccard index
Not applicable for real-life tasks!
6 / 35
Internal measures
Usually assign the best score to the algorithm that produces
clusters with high similarity within a cluster and low similarity
between clusters
No need for extra information
7 / 35
List of the most popular existing internal measures
(clustering validity indicies)
Dunn index (D) COP-index (COP)
Davies-Bouldin index (DB) OS-Index (OS)
Silhouette index (Sil) Generalized Dunn indices (6 indicies: GD*)
Calinski–Harabasz (CH) C-Index (CI)
CS index (CS) Sym-index (Symm)
Modified Davies-Bouldin index (DB*) S Dbw index (SDbw)
Score function (SF)
8 / 35
Problem of choice
There are a lot of CVIs
Which one should be chosen for a hard clustering problem?
• How to compare the existing ones?
• How to evaluate each one?
9 / 35
Approaches for CVI comparison
Visual-based comparison
Comparison with known labels
Purely theoretical comparison that is based on studying CVI
properties
Comparison based on stability, robustness of structure, or
other desired properties
10 / 35
Approaches for CVI comparison
Visual-based comparison
Comparison with known labels
Purely theoretical comparison that is based on studying CVI
properties
Comparison based on stability, robustness of structure, or
other desired properties
11 / 35
Practical approach for partition evaluation
It is stated in the classical book [1], ”Understanding our world
requires conceptualizing the similarities and differences
between the entities that compose it”
M. Ackerman showed that there is no difference in partition
evaluations made by clustering experts and non-experts
The only way of obtaining ground truth for partition quality
estimation is human estimation
12 / 35
[1] - R.C. Tryon and D. E. Bailey, Cluster analysis, 1970
Formalized ground truth for partitions
Let 𝐶 denote the space of clustering tasks
Weak ordering:
• 𝐻𝑟 𝑐 : 𝐶 → 𝑃𝑎𝑟𝑡𝑖𝑡𝑖𝑜𝑛 𝑐 × 𝑃𝑎𝑟𝑡𝑖𝑡𝑖𝑜𝑛 𝑐 , 𝑐 ∈ 𝐶
Binary estimation scale
• 𝐻 𝑏: 𝑃𝑎𝑟𝑡𝑖𝑡𝑖𝑜𝑛 𝑐 → {0,1}
13 / 35
Measure evaluation framework
Procedure for one assessor
Procedure for multiple assessors
14 / 35
Procedure for one assessor
Evaluate all possible partitions
Set partial order, compare all pairs of partitions
15 / 35
Constraints on one-assessors procedure
Choose finite set of partitions
Mark each partition with a number
• There is a score function that can be put into
correspondence to any weak strict order
16 / 35
Procedure for multiple assessors
Four criteria for each CVI:
• Binarized adequacy
• Weighted adequacy
• Adequacy of the best
• Aggregated ranking
17 / 35
Binarized adequacy
𝑅 𝐵𝐴
=
𝜌 𝐾𝜏(𝑏 𝑏𝑒𝑠𝑡,𝑏 𝐶𝑉𝐼)
𝜌 𝐾𝜏(𝑏 𝑏𝑒𝑠𝑡,𝑏 𝑤𝑜𝑟𝑠𝑡)
• 𝑏 𝐶𝑉𝐼 — permutation on binary marks with respect to the
CVIs estimations of these partitions, contains of + and −
• 𝑏 𝑏𝑒𝑠𝑡 — best permutation: + + ⋯ + − ⋯ −
• 𝑏 𝑤𝑜𝑟𝑠𝑡— worst permutation: − ⋯ − + ⋯ +
• 𝜌 𝐾𝜏 — modified Kendall tau distance
18 / 35
Weighted adequacy
𝑅 𝑊𝐴
=
𝜌 𝐾𝜏(𝑟 𝑏𝑒𝑠𝑡,𝑟 𝐶𝑉𝐼)
𝜌 𝐾𝜏(𝑟 𝑏𝑒𝑠𝑡,𝑟 𝑤𝑜𝑟𝑠𝑡)
• Instead of + and − we take the amount of assessors that
gave an adequate mark to CVI: 𝑤𝑖
• 𝑟𝐶𝑉𝐼 — permutation on weighted marks with respect to the
CVIs estimations of these partitions
• 𝑟𝑏𝑒𝑠𝑡 — best permutation: 𝑤1 ≥ ⋯ ≥ 𝑤 𝑛
• 𝑟 𝑤𝑜𝑟𝑠𝑡— worst permutation:𝑤1 ≤ ⋯ ≤ 𝑤 𝑛
• 𝜌 𝐾𝜏 — modified Kendall tau distance
19 / 35
Adequacy of the best
Chose the adequacy mark, which was assigned to the
partition having the highest value of CVI being estimated.
20 / 35
Aggregated ranking
Represents how many orderings produced by the assessors
and by each CVI differ
Weak order aggregation algorithm and distance measure 𝜌 J.
L. Garcia-Lapresta and D. Perez-Roman
𝑅 𝐴𝑅 =
𝜌(𝑎𝑟 𝑏𝑒𝑠𝑡,𝑎𝑟 𝐶𝑉𝐼)
𝜌(𝑎𝑟 𝑏𝑒𝑠𝑡,𝑎𝑟 𝑤𝑜𝑟𝑠𝑡)
• 𝑎𝑟∗— aggregated ordering
21 / 35
Experimental evaluation
19 of the most popular CVIs were taken to find out if any of
them matches the real quality of resulting clusters
22 / 35
Experimental data
41 datasets:
• Artificial
• 2D
23 / 35
Clustering algorithms
6 algorithms:
• 𝑘-Means
• X-Means
• EM
• DBSCAN
• FarthestFirst
• Hierarchical
14 configurations totally
24 / 35
Example of partitions (densed clusters)
25 / 35
Initial K-Means
EM Hierarchical
Example of partitions (curves topology)
26 / 35
Initial K-Means
EM Hierarchical
Summarization for binarized adequacy
CVI Adequacy CVI Adequacy
DB 0.151 Symm 0.333
D 0.515 CI 0.121
Syl 0.393 DB* 0.121
CH 0.352 GD31 0.333
SDBW 0.454 GD41 0.435
SF 0.393 GD51 0.393
CS 0.303 GD33 0.352
COP 0.575 GD43 0.272
SV 0.272 GD53 0.333
OS 0.575 threshold 0.7
27 / 35
Summarization for weighted adequacy
CVI Adequacy CVI Adequacy
DB 0.121 Symm 0.353
D 0.317 CI 0.073
Syl 0.121 DB* 0.000
CH 0.097 GD31 0.097
SDBW 0.195 GD41 0.097
SF 0.170 GD51 0.121
CS 0.146 GD33 0.073
COP 0.414 GD43 0.035
SV 0.195 GD53 0.035
OS 0.317 threshold 0.7
28 / 35
Adequacy of the cluster partitions, claimed to be
the best by CVIs
CVI # of the best CVI # of the best
DB 4 (9.7%) Symm 5 (12.1%)
D 4 (9.7%) CI 4 (9.7%)
Syl 4 (9.7%) DB* 4 (9.7%)
CH 4 (9.7%) GD31 5 (12.1%)
SDBW 6 (14.6%) GD41 5 (12.1%)
SF 4 (9.7%) GD51 6 (14.6%)
CS 6 (14.6%) GD33 4 (9.7%)
COP 6 (14.6%) GD43 4 (9.7%)
SV 6 (14.6%) GD53 5 (12.1%)
OS 5 (12.1%) threshold 70%
29 / 35
Summarization for weighted adequacy
CVI Rank CVI Rank
DB 0.390 Symm 0.292
D 0.353 CI 0.353
Syl 0.170 DB* 0.317
CH 0.073 GD31 0.073
SDBW 0.353 GD41 0.146
SF 0.146 GD51 0.195
CS 0.146 GD33 0.035
COP 0.414 GD43 0.035
SV 0.414 GD53 0.073
OS 0.073 threshold 0.7
30 / 35
Results
None of the metrics fits the requirements described above
No perfectly applicable, universal metric
CVIs should be chosen for problems specifically
31 / 35
Main disadvantage of proposed measure evaluation
framework
Expensive: each new dataset has to be evaluated by assessors
Improvement:
• Meta-learning
32 / 35
Meta-learning approach
Selecting a CVI based on the set of meta-features
Classification task:
• Train on datasets by using framework described above
33 / 35
Conclusion
Assessors’ estimates should be used as the ground truth (and
measure of CVI quality)
No universal clustering validity index existing at the moment
Cluster validity indices should be chosen for problems
specifically (meta-leaning approach)
34 / 35
QA Session
35 / 35

More Related Content

Viewers also liked

Viewers also liked (20)

AINL 2016: Bodrunova, Blekanov, Maksimov
AINL 2016: Bodrunova, Blekanov, MaksimovAINL 2016: Bodrunova, Blekanov, Maksimov
AINL 2016: Bodrunova, Blekanov, Maksimov
 
AINL 2016: Castro, Lopez, Cavalcante, Couto
AINL 2016: Castro, Lopez, Cavalcante, CoutoAINL 2016: Castro, Lopez, Cavalcante, Couto
AINL 2016: Castro, Lopez, Cavalcante, Couto
 
AINL 2016: Bastrakova, Ledesma, Millan, Zighed
AINL 2016: Bastrakova, Ledesma, Millan, ZighedAINL 2016: Bastrakova, Ledesma, Millan, Zighed
AINL 2016: Bastrakova, Ledesma, Millan, Zighed
 
AINL 2016: Kuznetsova
AINL 2016: KuznetsovaAINL 2016: Kuznetsova
AINL 2016: Kuznetsova
 
AINL 2016: Romanova, Nefedov
AINL 2016: Romanova, NefedovAINL 2016: Romanova, Nefedov
AINL 2016: Romanova, Nefedov
 
AINL 2016: Proncheva
AINL 2016: PronchevaAINL 2016: Proncheva
AINL 2016: Proncheva
 
AINL 2016: Ustalov
AINL 2016: Ustalov AINL 2016: Ustalov
AINL 2016: Ustalov
 
AINL 2016: Yagunova
AINL 2016: YagunovaAINL 2016: Yagunova
AINL 2016: Yagunova
 
AINL 2016: Goncharov
AINL 2016: GoncharovAINL 2016: Goncharov
AINL 2016: Goncharov
 
AINL 2016: Nikolenko
AINL 2016: NikolenkoAINL 2016: Nikolenko
AINL 2016: Nikolenko
 
AINL 2016: Fenogenova, Karpov, Kazorin
AINL 2016: Fenogenova, Karpov, KazorinAINL 2016: Fenogenova, Karpov, Kazorin
AINL 2016: Fenogenova, Karpov, Kazorin
 
AINL 2016: Boldyreva
AINL 2016: BoldyrevaAINL 2016: Boldyreva
AINL 2016: Boldyreva
 
AINL 2016: Alekseev, Nikolenko
AINL 2016: Alekseev, NikolenkoAINL 2016: Alekseev, Nikolenko
AINL 2016: Alekseev, Nikolenko
 
AINL 2016: Moskvichev
AINL 2016: MoskvichevAINL 2016: Moskvichev
AINL 2016: Moskvichev
 
AINL 2016: Strijov
AINL 2016: StrijovAINL 2016: Strijov
AINL 2016: Strijov
 
AINL 2016: Maraev
AINL 2016: MaraevAINL 2016: Maraev
AINL 2016: Maraev
 
AINL 2016: Khudobakhshov
AINL 2016: KhudobakhshovAINL 2016: Khudobakhshov
AINL 2016: Khudobakhshov
 
AINL 2016: Malykh
AINL 2016: MalykhAINL 2016: Malykh
AINL 2016: Malykh
 
AINL 2016: Filchenkov
AINL 2016: FilchenkovAINL 2016: Filchenkov
AINL 2016: Filchenkov
 
AINL 2016: Just AI
AINL 2016: Just AIAINL 2016: Just AI
AINL 2016: Just AI
 

Similar to AINL 2016: Muravyov

A new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataA new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataMark Heckmann
 
Analytic Hierarchy Process AHP
Analytic Hierarchy Process AHPAnalytic Hierarchy Process AHP
Analytic Hierarchy Process AHPadcom2015
 
multi criteria decision making
multi criteria decision makingmulti criteria decision making
multi criteria decision makingShankha Goswami
 
Decision Forests and discriminant analysis
Decision Forests and discriminant analysisDecision Forests and discriminant analysis
Decision Forests and discriminant analysispotaters
 
Xmeasures - Accuracy evaluation of overlapping and multi-resolution clusterin...
Xmeasures - Accuracy evaluation of overlapping and multi-resolution clusterin...Xmeasures - Accuracy evaluation of overlapping and multi-resolution clusterin...
Xmeasures - Accuracy evaluation of overlapping and multi-resolution clusterin...Artem Lutov
 
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM RecommendersYONG ZHENG
 
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
Maxim Kazantsev
 
Lazy Association Classification
Lazy Association ClassificationLazy Association Classification
Lazy Association ClassificationJason Yang
 
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...Ismail BADACHE
 
Clustering introduction
Clustering introductionClustering introduction
Clustering introductionYan Xu
 
CDAC 2018 Pellegrini clustering ppi networks
CDAC 2018 Pellegrini clustering ppi networksCDAC 2018 Pellegrini clustering ppi networks
CDAC 2018 Pellegrini clustering ppi networksMarco Antoniotti
 
Decision tree for data mining and computer
Decision tree for data mining and computerDecision tree for data mining and computer
Decision tree for data mining and computertttiba
 
Feature recognition and classification
Feature recognition and classificationFeature recognition and classification
Feature recognition and classificationSooraz Sresta
 
Unsupervised learning clustering
Unsupervised learning clusteringUnsupervised learning clustering
Unsupervised learning clusteringDr Nisha Arora
 
Test Case Design Techniques
Test Case Design TechniquesTest Case Design Techniques
Test Case Design TechniquesMurageppa-QA
 

Similar to AINL 2016: Muravyov (20)

A new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataA new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid data
 
Clusters (4).pptx
Clusters (4).pptxClusters (4).pptx
Clusters (4).pptx
 
Analytic Hierarchy Process AHP
Analytic Hierarchy Process AHPAnalytic Hierarchy Process AHP
Analytic Hierarchy Process AHP
 
multi criteria decision making
multi criteria decision makingmulti criteria decision making
multi criteria decision making
 
Decision Forests and discriminant analysis
Decision Forests and discriminant analysisDecision Forests and discriminant analysis
Decision Forests and discriminant analysis
 
Xmeasures - Accuracy evaluation of overlapping and multi-resolution clusterin...
Xmeasures - Accuracy evaluation of overlapping and multi-resolution clusterin...Xmeasures - Accuracy evaluation of overlapping and multi-resolution clusterin...
Xmeasures - Accuracy evaluation of overlapping and multi-resolution clusterin...
 
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
 
Cluster Validation
Cluster ValidationCluster Validation
Cluster Validation
 
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

 
Practice test1 solution
Practice test1 solutionPractice test1 solution
Practice test1 solution
 
Lazy Association Classification
Lazy Association ClassificationLazy Association Classification
Lazy Association Classification
 
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
 
Clustering introduction
Clustering introductionClustering introduction
Clustering introduction
 
CDAC 2018 Pellegrini clustering ppi networks
CDAC 2018 Pellegrini clustering ppi networksCDAC 2018 Pellegrini clustering ppi networks
CDAC 2018 Pellegrini clustering ppi networks
 
Decision tree for data mining and computer
Decision tree for data mining and computerDecision tree for data mining and computer
Decision tree for data mining and computer
 
Feature recognition and classification
Feature recognition and classificationFeature recognition and classification
Feature recognition and classification
 
Unsupervised learning clustering
Unsupervised learning clusteringUnsupervised learning clustering
Unsupervised learning clustering
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
 
Test Case Design Techniques
Test Case Design TechniquesTest Case Design Techniques
Test Case Design Techniques
 
Unit 4 rm
Unit 4  rmUnit 4  rm
Unit 4 rm
 

More from Lidia Pivovarova

Classification and clustering in media monitoring: from knowledge engineering...
Classification and clustering in media monitoring: from knowledge engineering...Classification and clustering in media monitoring: from knowledge engineering...
Classification and clustering in media monitoring: from knowledge engineering...Lidia Pivovarova
 
Convolutional neural networks for text classification
Convolutional neural networks for text classificationConvolutional neural networks for text classification
Convolutional neural networks for text classificationLidia Pivovarova
 
Grouping business news stories based on salience of named entities
Grouping business news stories based on salience of named entitiesGrouping business news stories based on salience of named entities
Grouping business news stories based on salience of named entitiesLidia Pivovarova
 
Интеллектуальный анализ текста
Интеллектуальный анализ текстаИнтеллектуальный анализ текста
Интеллектуальный анализ текстаLidia Pivovarova
 
AINL 2016: Shavrina, Selegey
AINL 2016: Shavrina, SelegeyAINL 2016: Shavrina, Selegey
AINL 2016: Shavrina, SelegeyLidia Pivovarova
 

More from Lidia Pivovarova (7)

Classification and clustering in media monitoring: from knowledge engineering...
Classification and clustering in media monitoring: from knowledge engineering...Classification and clustering in media monitoring: from knowledge engineering...
Classification and clustering in media monitoring: from knowledge engineering...
 
Convolutional neural networks for text classification
Convolutional neural networks for text classificationConvolutional neural networks for text classification
Convolutional neural networks for text classification
 
Grouping business news stories based on salience of named entities
Grouping business news stories based on salience of named entitiesGrouping business news stories based on salience of named entities
Grouping business news stories based on salience of named entities
 
Интеллектуальный анализ текста
Интеллектуальный анализ текстаИнтеллектуальный анализ текста
Интеллектуальный анализ текста
 
AINL 2016: Shavrina, Selegey
AINL 2016: Shavrina, SelegeyAINL 2016: Shavrina, Selegey
AINL 2016: Shavrina, Selegey
 
AINL 2016:
AINL 2016: AINL 2016:
AINL 2016:
 
AINL 2016: Grigorieva
AINL 2016: GrigorievaAINL 2016: Grigorieva
AINL 2016: Grigorieva
 

Recently uploaded

Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
Functional group interconversions(oxidation reduction)
Functional group interconversions(oxidation reduction)Functional group interconversions(oxidation reduction)
Functional group interconversions(oxidation reduction)itwameryclare
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxMurugaveni B
 
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxGood agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxSimeonChristian
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxFarihaAbdulRasheed
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx023NiWayanAnggiSriWa
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 

Recently uploaded (20)

Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
Functional group interconversions(oxidation reduction)
Functional group interconversions(oxidation reduction)Functional group interconversions(oxidation reduction)
Functional group interconversions(oxidation reduction)
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
 
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxGood agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 

AINL 2016: Muravyov

  • 1. Towards Cluster Validity Index Evaluation and Selection Andrey Filchenkov, Sergey Muravyov, Vladimir Parfenov ITMO University St. Petersburg, Russia {afilchenkov,smuravyov}@corp.ifmo.ru, parfenov@mail.ifmo.ru 11.11.2016
  • 2. What is clustering? Clustering means grouping the objects based on the information found in the data describing the objects or their relationships. The goal is that the objects in a group will be similar (or related) to one other and different from (or unrelated to) the objects in other groups. 2 / 35
  • 4. Applications of clustering Biology Medicine Business and marketing Social science Computer science 4 / 35
  • 5. Clustering Evaluation No target field Two ways of partition evaluation: • External • Internal 5 / 35
  • 6. External measures Based on known class labels and external benchmarks Examples: • F-measure • Rand measure • Jaccard index Not applicable for real-life tasks! 6 / 35
  • 7. Internal measures Usually assign the best score to the algorithm that produces clusters with high similarity within a cluster and low similarity between clusters No need for extra information 7 / 35
  • 8. List of the most popular existing internal measures (clustering validity indicies) Dunn index (D) COP-index (COP) Davies-Bouldin index (DB) OS-Index (OS) Silhouette index (Sil) Generalized Dunn indices (6 indicies: GD*) Calinski–Harabasz (CH) C-Index (CI) CS index (CS) Sym-index (Symm) Modified Davies-Bouldin index (DB*) S Dbw index (SDbw) Score function (SF) 8 / 35
  • 9. Problem of choice There are a lot of CVIs Which one should be chosen for a hard clustering problem? • How to compare the existing ones? • How to evaluate each one? 9 / 35
  • 10. Approaches for CVI comparison Visual-based comparison Comparison with known labels Purely theoretical comparison that is based on studying CVI properties Comparison based on stability, robustness of structure, or other desired properties 10 / 35
  • 11. Approaches for CVI comparison Visual-based comparison Comparison with known labels Purely theoretical comparison that is based on studying CVI properties Comparison based on stability, robustness of structure, or other desired properties 11 / 35
  • 12. Practical approach for partition evaluation It is stated in the classical book [1], ”Understanding our world requires conceptualizing the similarities and differences between the entities that compose it” M. Ackerman showed that there is no difference in partition evaluations made by clustering experts and non-experts The only way of obtaining ground truth for partition quality estimation is human estimation 12 / 35 [1] - R.C. Tryon and D. E. Bailey, Cluster analysis, 1970
  • 13. Formalized ground truth for partitions Let 𝐶 denote the space of clustering tasks Weak ordering: • 𝐻𝑟 𝑐 : 𝐶 → 𝑃𝑎𝑟𝑡𝑖𝑡𝑖𝑜𝑛 𝑐 × 𝑃𝑎𝑟𝑡𝑖𝑡𝑖𝑜𝑛 𝑐 , 𝑐 ∈ 𝐶 Binary estimation scale • 𝐻 𝑏: 𝑃𝑎𝑟𝑡𝑖𝑡𝑖𝑜𝑛 𝑐 → {0,1} 13 / 35
  • 14. Measure evaluation framework Procedure for one assessor Procedure for multiple assessors 14 / 35
  • 15. Procedure for one assessor Evaluate all possible partitions Set partial order, compare all pairs of partitions 15 / 35
  • 16. Constraints on one-assessors procedure Choose finite set of partitions Mark each partition with a number • There is a score function that can be put into correspondence to any weak strict order 16 / 35
  • 17. Procedure for multiple assessors Four criteria for each CVI: • Binarized adequacy • Weighted adequacy • Adequacy of the best • Aggregated ranking 17 / 35
  • 18. Binarized adequacy 𝑅 𝐵𝐴 = 𝜌 𝐾𝜏(𝑏 𝑏𝑒𝑠𝑡,𝑏 𝐶𝑉𝐼) 𝜌 𝐾𝜏(𝑏 𝑏𝑒𝑠𝑡,𝑏 𝑤𝑜𝑟𝑠𝑡) • 𝑏 𝐶𝑉𝐼 — permutation on binary marks with respect to the CVIs estimations of these partitions, contains of + and − • 𝑏 𝑏𝑒𝑠𝑡 — best permutation: + + ⋯ + − ⋯ − • 𝑏 𝑤𝑜𝑟𝑠𝑡— worst permutation: − ⋯ − + ⋯ + • 𝜌 𝐾𝜏 — modified Kendall tau distance 18 / 35
  • 19. Weighted adequacy 𝑅 𝑊𝐴 = 𝜌 𝐾𝜏(𝑟 𝑏𝑒𝑠𝑡,𝑟 𝐶𝑉𝐼) 𝜌 𝐾𝜏(𝑟 𝑏𝑒𝑠𝑡,𝑟 𝑤𝑜𝑟𝑠𝑡) • Instead of + and − we take the amount of assessors that gave an adequate mark to CVI: 𝑤𝑖 • 𝑟𝐶𝑉𝐼 — permutation on weighted marks with respect to the CVIs estimations of these partitions • 𝑟𝑏𝑒𝑠𝑡 — best permutation: 𝑤1 ≥ ⋯ ≥ 𝑤 𝑛 • 𝑟 𝑤𝑜𝑟𝑠𝑡— worst permutation:𝑤1 ≤ ⋯ ≤ 𝑤 𝑛 • 𝜌 𝐾𝜏 — modified Kendall tau distance 19 / 35
  • 20. Adequacy of the best Chose the adequacy mark, which was assigned to the partition having the highest value of CVI being estimated. 20 / 35
  • 21. Aggregated ranking Represents how many orderings produced by the assessors and by each CVI differ Weak order aggregation algorithm and distance measure 𝜌 J. L. Garcia-Lapresta and D. Perez-Roman 𝑅 𝐴𝑅 = 𝜌(𝑎𝑟 𝑏𝑒𝑠𝑡,𝑎𝑟 𝐶𝑉𝐼) 𝜌(𝑎𝑟 𝑏𝑒𝑠𝑡,𝑎𝑟 𝑤𝑜𝑟𝑠𝑡) • 𝑎𝑟∗— aggregated ordering 21 / 35
  • 22. Experimental evaluation 19 of the most popular CVIs were taken to find out if any of them matches the real quality of resulting clusters 22 / 35
  • 23. Experimental data 41 datasets: • Artificial • 2D 23 / 35
  • 24. Clustering algorithms 6 algorithms: • 𝑘-Means • X-Means • EM • DBSCAN • FarthestFirst • Hierarchical 14 configurations totally 24 / 35
  • 25. Example of partitions (densed clusters) 25 / 35 Initial K-Means EM Hierarchical
  • 26. Example of partitions (curves topology) 26 / 35 Initial K-Means EM Hierarchical
  • 27. Summarization for binarized adequacy CVI Adequacy CVI Adequacy DB 0.151 Symm 0.333 D 0.515 CI 0.121 Syl 0.393 DB* 0.121 CH 0.352 GD31 0.333 SDBW 0.454 GD41 0.435 SF 0.393 GD51 0.393 CS 0.303 GD33 0.352 COP 0.575 GD43 0.272 SV 0.272 GD53 0.333 OS 0.575 threshold 0.7 27 / 35
  • 28. Summarization for weighted adequacy CVI Adequacy CVI Adequacy DB 0.121 Symm 0.353 D 0.317 CI 0.073 Syl 0.121 DB* 0.000 CH 0.097 GD31 0.097 SDBW 0.195 GD41 0.097 SF 0.170 GD51 0.121 CS 0.146 GD33 0.073 COP 0.414 GD43 0.035 SV 0.195 GD53 0.035 OS 0.317 threshold 0.7 28 / 35
  • 29. Adequacy of the cluster partitions, claimed to be the best by CVIs CVI # of the best CVI # of the best DB 4 (9.7%) Symm 5 (12.1%) D 4 (9.7%) CI 4 (9.7%) Syl 4 (9.7%) DB* 4 (9.7%) CH 4 (9.7%) GD31 5 (12.1%) SDBW 6 (14.6%) GD41 5 (12.1%) SF 4 (9.7%) GD51 6 (14.6%) CS 6 (14.6%) GD33 4 (9.7%) COP 6 (14.6%) GD43 4 (9.7%) SV 6 (14.6%) GD53 5 (12.1%) OS 5 (12.1%) threshold 70% 29 / 35
  • 30. Summarization for weighted adequacy CVI Rank CVI Rank DB 0.390 Symm 0.292 D 0.353 CI 0.353 Syl 0.170 DB* 0.317 CH 0.073 GD31 0.073 SDBW 0.353 GD41 0.146 SF 0.146 GD51 0.195 CS 0.146 GD33 0.035 COP 0.414 GD43 0.035 SV 0.414 GD53 0.073 OS 0.073 threshold 0.7 30 / 35
  • 31. Results None of the metrics fits the requirements described above No perfectly applicable, universal metric CVIs should be chosen for problems specifically 31 / 35
  • 32. Main disadvantage of proposed measure evaluation framework Expensive: each new dataset has to be evaluated by assessors Improvement: • Meta-learning 32 / 35
  • 33. Meta-learning approach Selecting a CVI based on the set of meta-features Classification task: • Train on datasets by using framework described above 33 / 35
  • 34. Conclusion Assessors’ estimates should be used as the ground truth (and measure of CVI quality) No universal clustering validity index existing at the moment Cluster validity indices should be chosen for problems specifically (meta-leaning approach) 34 / 35