35. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. .. .. .. .
Introduction
. .. .. .. .. .. .. .
Hierarchical clustering
. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .
Nonhierarchical clustering
. .. .. .
Feature importances
. .. .
External validation
. .. .. .
Conclusions
Conclusions:
1 The silhouette plots and the Calinski index suggest k = 2 for
all the clusterings.
2 All the clustering methods give the same output → our results
are reasonably reliable.
3 We can use PCA to improve efficiency and get equally good
clustering results!
4 The feature importances extraction proves that there are more
core-like issues and fundamental incompantibilities than
expected, which could be related to the data source (Turkey).
Paula Robles López Universidad Politécnica de Madrid (UPM)
Machine Learning: unsupervised classifiers 35 / 36