DIVISIVE APPROACH
• The divisive approach is a top-down approach.
• Start with one, all-inclusive cluster.
• Smaller clusters are created by splitting the group through
continuous iteration (see the sketch after this list).
• Split until each cluster contains a single point.
– A split cannot be undone once it is made, which is why this
method is not very flexible.
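A minimal sketch of this top-down splitting in Python, assuming
scikit-learn's KMeans performs each 2-way split (the divisive_split
helper and the toy data are illustrative, not from the original slides):

import numpy as np
from sklearn.cluster import KMeans

def divisive_split(points):
    # Recursively split with 2-means until every cluster is a single point.
    if len(points) <= 1:
        return [points]                      # cannot split a singleton
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(points)
    left, right = points[labels == 0], points[labels == 1]
    if len(left) == 0 or len(right) == 0:    # degenerate split; stop here
        return [points]
    return divisive_split(left) + divisive_split(right)

X = np.array([[0.0, 0.0], [0.1, 0.2], [5.0, 5.1], [5.2, 4.9]])  # toy data
print([c.tolist() for c in divisive_split(X)])

Note that, as the last bullet says, once a split is made the points in
the two halves are never regrouped: the recursion only ever subdivides.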
HIERARCHICAL CLUSTERING METHODS
Minimum or Single-Linkage (Nearest Neighbor)
Uses the shortest distance between a pair of observations in the two
clusters. It tends to produce long, “loose” clusters.
Maximum or Complete-Linkage (Farthest Neighbor)
Uses the distance between the farthest pair of observations in the two
clusters. This method usually produces “tighter” clusters than
single-linkage.
Mean or Average-Linkage
Computes all pairwise dissimilarities between the elements in cluster 1
and the elements in cluster 2, and takes the average of these
dissimilarities as the distance between the two clusters.
Centroid-Linkage
Computes the dissimilarity between the centroid of cluster 1 (a mean
vector of length p, one entry per variable) and the centroid of cluster 2.
Ward’s Minimum Variance Method
Minimizes the total within-cluster variance. At each step, the pair of
clusters with the minimum between-cluster distance is merged. (Each
criterion is sketched in code after this list.)
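The linkage criteria above map directly onto SciPy's
hierarchical-clustering API; a hedged sketch comparing them on a small
made-up dataset (the data values are arbitrary):

import numpy as np
from scipy.cluster.hierarchy import linkage

X = np.array([[0, 0], [0, 1], [5, 5], [5, 6], [10, 0]], dtype=float)
for method in ("single", "complete", "average", "centroid", "ward"):
    Z = linkage(X, method=method)       # (n-1) x 4 table of merges
    print(method, "-> height of final merge:", Z[-1, 2])

The height of the final merge differs across methods because each
criterion measures the distance between the same two clusters differently.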
HIERARCHICAL CLUSTERING METHODS
• Hierarchical Agglomerative Clustering (HAC) starts with each
individual item in its own cluster and iteratively merges clusters
until all the items belong to one cluster.
• A bottom-up approach is followed to merge the clusters together.
• Dendrograms are used to pictorially represent Hierarchical
Agglomerative Clustering (HAC).
• HAC groups similar data points together step-by-step to form a
hierarchy of clusters: it starts with each point as its own cluster
and merges the closest clusters one by one until all points are in a
single big cluster (see the sketch below).
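A minimal HAC run, assuming scikit-learn's AgglomerativeClustering is
available (the two-cluster cut and the toy data are illustrative):

import numpy as np
from sklearn.cluster import AgglomerativeClustering

X = np.array([[0, 0], [0, 1], [5, 5], [5, 6], [10, 0]], dtype=float)
# Merge bottom-up until only two clusters remain, using single linkage.
labels = AgglomerativeClustering(n_clusters=2, linkage="single").fit_predict(X)
print(labels)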
(Figure: Single Linkage Clustering)
1. Agglomerative Algorithm: Single Link
• Single-nearest distance, or single linkage, is the agglomerative
method that uses the distance between the closest members of the two
clusters (sketched below).
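A sketch of the single-link cluster distance in NumPy (the helper name
and the sample clusters are illustrative):

import numpy as np

def single_link(a, b):
    # Minimum pairwise Euclidean distance between clusters a and b.
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return d.min()

a = np.array([[0.0, 0.0], [1.0, 0.0]])
b = np.array([[4.0, 0.0], [9.0, 0.0]])
print(single_link(a, b))   # 3.0: closest members are (1, 0) and (4, 0)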
2. Agglomerative Algorithm: Complete Link
• In this algorithm, complete-farthest distance, or complete linkage,
is the agglomerative method that uses the distance between the
members that are farthest apart (sketched below).
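The complete-link counterpart uses the same pairwise distances but
takes the maximum instead of the minimum (again an illustrative sketch):

import numpy as np

def complete_link(a, b):
    # Maximum pairwise Euclidean distance between clusters a and b.
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return d.max()

a = np.array([[0.0, 0.0], [1.0, 0.0]])
b = np.array([[4.0, 0.0], [9.0, 0.0]])
print(complete_link(a, b))  # 9.0: farthest members are (0, 0) and (9, 0)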
HIERARCHICAL CLUSTERING (1/2)
• Hierarchical clustering, as the name suggests, is an algorithm that
builds a hierarchy of clusters.
• The height in the dendrogram at which two clusters are merged
represents the distance between those two clusters in the data space.
• The decision to merge two clusters is taken on the basis of the
closeness of these clusters; there are multiple metrics (distances)
for deciding how close two clusters are.
• The red horizontal line in the dendrogram covers the maximum
vertical distance AB (a sketch for drawing such a dendrogram follows).
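A sketch of drawing a dendrogram with SciPy and Matplotlib; the y-axis
heights are exactly the inter-cluster merge distances described above
(the dataset is made up):

import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

X = np.array([[0, 0], [0, 1], [5, 5], [5, 6], [10, 0]], dtype=float)
Z = linkage(X, method="complete")
dendrogram(Z)                 # each U-shaped link sits at its merge distance
plt.ylabel("merge distance")
plt.show()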
HIERARCHICAL CLUSTERING (2/2)
• This algorithm starts with all the data points assigned to a cluster
of their own.
• Then the two nearest clusters are merged into the same cluster; in
the end, the algorithm terminates when there is only a single cluster
left (the loop is sketched below).
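A naive, illustrative version of that loop using single linkage
(O(n^3) and purely didactic; real libraries are far more efficient):

import numpy as np

X = np.array([[0, 0], [0, 1], [5, 5], [5, 6], [10, 0]], dtype=float)
clusters = [[i] for i in range(len(X))]     # every point starts alone

while len(clusters) > 1:
    best = None
    for i in range(len(clusters)):
        for j in range(i + 1, len(clusters)):
            d = min(np.linalg.norm(X[p] - X[q])   # single-link distance
                    for p in clusters[i] for q in clusters[j])
            if best is None or d < best[0]:
                best = (d, i, j)
    d, i, j = best
    print(f"merge {clusters[i]} + {clusters[j]} at distance {d:.2f}")
    clusters[i] += clusters[j]              # merge j into i ...
    del clusters[j]                         # ... and drop the emptied slot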
HIERARCHICAL CLUSTERING
Advantages:
• We can obtain the desired number of clusters from the model itself,
by cutting the dendrogram at the proper level; human intervention is
not required (see the sketch after this list).
• Dendrograms give a clear visualization, which is practical and easy
to understand.
Disadvantages:
• Not suitable for large datasets due to high time and space
complexity.
• In hierarchical clustering, once a decision is made to combine two
clusters, it cannot be undone.
• The time complexity of the clustering can result in very long
computation times.
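A sketch of that dendrogram cut with SciPy's fcluster (the
three-cluster target and the data are illustrative):

import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

X = np.array([[0, 0], [0, 1], [5, 5], [5, 6], [10, 0]], dtype=float)
Z = linkage(X, method="ward")
labels = fcluster(Z, t=3, criterion="maxclust")  # cut into at most 3 clusters
print(labels)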
APPLICATIONS OF CLUSTERING
a. Search Engine Result Grouping.
b. Document Clustering.
c. Banking and Insurance fraud detection.
d. Image Segmentation.
e. Customer Segmentation.
f. Recommendation Engines.
g. Social Network Analysis.
h. Network Traffic Analysis.
Editor's Notes
#2 Hence, iteratively, we are splitting the data, which was once grouped as a single large cluster, into “n” smaller clusters to which the data points now belong.
It must be taken into account that this algorithm is highly “rigid” when splitting the clusters:
once a split is made inside a loop, there is no way that the task can be undone.
#37 Any desired number of clusters can be obtained by cutting the dendrogram at the proper level.
All the approaches to calculating the similarity between clusters have their own disadvantages.
A time complexity of at least O(n² log n) is required, where ‘n’ is the number of data points.
#38 It is the backbone of search engine algorithms – where objects that are similar to each other must be presented together and dissimilar objects should be ignored. Also, it is required to fetch objects that are closely related to a search term, if not completely related.
A similar application of text clustering can be seen in academics, where clustering can help in the associative analysis of various documents; this can in turn be used for plagiarism detection, copyright-infringement analysis, patent analysis, etc.
Clustering is used in image segmentation in bioinformatics, where clustering algorithms have proven their worth in detecting cancerous cells in various medical imagery, reducing prevalent human error and other biases.
Netflix has used clustering in implementing movie recommendations for its users.
News summarization can be performed using Cluster analysis where articles can be divided into a group of related topics.
Clustering is used to generate sports-training recommendations for athletes based on their goals and various body-related metrics, and to assign training regimens to the players accordingly.
Marketing and sales applications use clustering to identify the demand-supply gap based on various past metrics, giving definitive meaning to huge amounts of scattered data.
Various job-search portals use clustering to divide job-posting requirements into organized groups, which makes it easier for a job-seeker to target and apply for a suitable job.
Resumes of job-seekers can be segmented into groups based on various factors like skill sets, experience, strengths, types of projects, expertise, etc., which helps potential employers connect with the correct resources.
Clustering effectively detects hidden patterns, rules, constraints, flows, etc. based on various metrics of traffic density from GPS data, and can be used for segmenting routes, suggesting the best routes to users, locating essential services, and searching for objects on a map.
Satellite imagery can be segmented to find suitable and arable lands for agriculture.
Pizza Hut very famously used clustering to perform Customer Segmentation which helped them to target their campaigns effectively and helped increase their customer engagement across various channels.
Clustering can help in customer-persona analysis based on Recency, Frequency, and Monetary (RFM) metrics to build an effective user profile; in turn, this can be used in customer-loyalty programs to curb customer churn.
Document clustering is effectively being used in preventing the spread of fake news on Social Media.
Website network traffic can be divided into various segments, so that requests can be heuristically prioritized; this also helps in detecting and preventing malicious activities.
Fantasy sports have become a part of popular culture across the globe and clustering algorithms can be used in identifying team trends, aggregating expert ranking data, player similarities, and other strategies and recommendations for the users.