More Related Content
Similar to Clustering Algorithms Hierarchical k-Means Distance Measures
Similar to Clustering Algorithms Hierarchical k-Means Distance Measures (20)
Clustering Algorithms Hierarchical k-Means Distance Measures
- 3. Example x x x x x x x x x x x x x x x x xx x x x x x x x x x x x x x x x x x x x x x x x
- 15. Examples of Euclidean Distances x = (5,5) y = (9,8) L 2 -norm: dist(x,y) = (4 2 +3 2 ) = 5 L 1 -norm: dist(x,y) = 4+3 = 7 4 3 5
- 24. Why? p 1 = (x 1 ,y 1 ) p 2 = (x 2 ,0) x 1 Dot product is invariant under rotation, so pick convenient coordinate system. x 1 = p 1 .p 2 /|p 2 | p 1 .p 2 = x 1 *x 2 . |p 2 | = x 2 .
- 32. Example (5,3) o (1,2) o o (2,1) o (4,1) o (0,0) o (5,0) x (1.5,1.5) x (4.5,0.5) x (1,1) x (4.7,1.3)
- 35. Example 1 2 3 4 5 6 intercluster distance clustroid clustroid
- 39. Example 1 2 3 4 5 6 7 8 x x Clusters after first round Reassigned points
- 41. Example x x x x x x x x x x x x x x x x xx x x x x x x x x x x x x x x x x x x x x x x x Too few; many long distances to centroid.
- 42. Example x x x x x x x x x x x x x x x x xx x x x x x x x x x x x x x x x x x x x x x x x Just right; distances rather short.
- 43. Example x x x x x x x x x x x x x x x x xx x x x x x x x x x x x x x x x x x x x x x x x Too many; little improvement in average distance.
- 51. “Galaxies” Picture A cluster. Its points are in the DS. The centroid Compressed sets. Their points are in the CS. Points in the RS