SlideShare a Scribd company logo
1 of 92
Download to read offline
Machine Learning for Data Mining
Measure Properties for Clustering
Andres Mendez-Vazquez
July 27, 2015
1 / 37
Images/cinvestav-
Outline
1 Introduction
Introduction
2 Features
Types of Features
3 Similarity and Dissimilarity Measures
Basic Deļ¬nition
4 Proximity Measures between Two Points
Real-Valued Vectors
Dissimilarity Measures
Similarity Measures
Discrete-Valued Vectors
Dissimilarity Measures
Similarity Measures
5 Other Examples
Between Sets
2 / 37
Images/cinvestav-
Outline
1 Introduction
Introduction
2 Features
Types of Features
3 Similarity and Dissimilarity Measures
Basic Deļ¬nition
4 Proximity Measures between Two Points
Real-Valued Vectors
Dissimilarity Measures
Similarity Measures
Discrete-Valued Vectors
Dissimilarity Measures
Similarity Measures
5 Other Examples
Between Sets
3 / 37
Images/cinvestav-
Introduction
Observation
Clusters are considered as groups containing data objects that are similar
to each other.
When they are not
We assume there is some way to measure that diļ¬€erence!!!
4 / 37
Images/cinvestav-
Introduction
Observation
Clusters are considered as groups containing data objects that are similar
to each other.
When they are not
We assume there is some way to measure that diļ¬€erence!!!
4 / 37
Images/cinvestav-
Therefore
It is possible to give a mathematical deļ¬nition of two important types
of clustering
Partitional Clustering
Hierarchical Clustering
Given
Given a set of input patterns X = {x1, x2, ..., xN }, where
xj = (x1j, x2j, ..., xdj)T
āˆˆ Rd, with each measure xij called a feature
(attribute, dimension, or variable).
5 / 37
Images/cinvestav-
Therefore
It is possible to give a mathematical deļ¬nition of two important types
of clustering
Partitional Clustering
Hierarchical Clustering
Given
Given a set of input patterns X = {x1, x2, ..., xN }, where
xj = (x1j, x2j, ..., xdj)T
āˆˆ Rd, with each measure xij called a feature
(attribute, dimension, or variable).
5 / 37
Images/cinvestav-
Therefore
It is possible to give a mathematical deļ¬nition of two important types
of clustering
Partitional Clustering
Hierarchical Clustering
Given
Given a set of input patterns X = {x1, x2, ..., xN }, where
xj = (x1j, x2j, ..., xdj)T
āˆˆ Rd, with each measure xij called a feature
(attribute, dimension, or variable).
5 / 37
Images/cinvestav-
Hard Partitional Clustering
Deļ¬nition
Hard partitional clustering attempts to seek a K-partition of X,
C = {C1, C2, ..., CK } with K ā‰¤ N such that
1 Ci = āˆ… for i = 1, ..., K.
2 āˆŖK
i=1Ci = X.
3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j.
Clearly
The third property can be relaxed to obtain the mathematical deļ¬nition of
fuzzy and possibilistic clustering.
6 / 37
Images/cinvestav-
Hard Partitional Clustering
Deļ¬nition
Hard partitional clustering attempts to seek a K-partition of X,
C = {C1, C2, ..., CK } with K ā‰¤ N such that
1 Ci = āˆ… for i = 1, ..., K.
2 āˆŖK
i=1Ci = X.
3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j.
Clearly
The third property can be relaxed to obtain the mathematical deļ¬nition of
fuzzy and possibilistic clustering.
6 / 37
Images/cinvestav-
Hard Partitional Clustering
Deļ¬nition
Hard partitional clustering attempts to seek a K-partition of X,
C = {C1, C2, ..., CK } with K ā‰¤ N such that
1 Ci = āˆ… for i = 1, ..., K.
2 āˆŖK
i=1Ci = X.
3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j.
Clearly
The third property can be relaxed to obtain the mathematical deļ¬nition of
fuzzy and possibilistic clustering.
6 / 37
Images/cinvestav-
Hard Partitional Clustering
Deļ¬nition
Hard partitional clustering attempts to seek a K-partition of X,
C = {C1, C2, ..., CK } with K ā‰¤ N such that
1 Ci = āˆ… for i = 1, ..., K.
2 āˆŖK
i=1Ci = X.
3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j.
Clearly
The third property can be relaxed to obtain the mathematical deļ¬nition of
fuzzy and possibilistic clustering.
6 / 37
Images/cinvestav-
Hard Partitional Clustering
Deļ¬nition
Hard partitional clustering attempts to seek a K-partition of X,
C = {C1, C2, ..., CK } with K ā‰¤ N such that
1 Ci = āˆ… for i = 1, ..., K.
2 āˆŖK
i=1Ci = X.
3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j.
Clearly
The third property can be relaxed to obtain the mathematical deļ¬nition of
fuzzy and possibilistic clustering.
6 / 37
Images/cinvestav-
For example
Given the membership Ai (xj) āˆˆ [0, 1]
We can have K
i=1 Ai (xj) = 1, āˆ€j and N
j=1 Ai (xj) < N, āˆ€i.
7 / 37
Images/cinvestav-
Hierarchical Clustering
Deļ¬nition
Hierarchical clustering attempts to construct a tree-like, nested structure
partition of X, H = {H1, ..., HQ} with Q ā‰¤ N such that
If Ci āˆˆ Hm and Cj āˆˆ Hl and m > l imply Ci āŠ‚ Cj or Ci āˆ© Cj = āˆ… for
all i, j = 1, ..., K, i = j and m, l = 1, ..., Q.
8 / 37
Images/cinvestav-
Hierarchical Clustering
Deļ¬nition
Hierarchical clustering attempts to construct a tree-like, nested structure
partition of X, H = {H1, ..., HQ} with Q ā‰¤ N such that
If Ci āˆˆ Hm and Cj āˆˆ Hl and m > l imply Ci āŠ‚ Cj or Ci āˆ© Cj = āˆ… for
all i, j = 1, ..., K, i = j and m, l = 1, ..., Q.
8 / 37
Images/cinvestav-
Outline
1 Introduction
Introduction
2 Features
Types of Features
3 Similarity and Dissimilarity Measures
Basic Deļ¬nition
4 Proximity Measures between Two Points
Real-Valued Vectors
Dissimilarity Measures
Similarity Measures
Discrete-Valued Vectors
Dissimilarity Measures
Similarity Measures
5 Other Examples
Between Sets
9 / 37
Images/cinvestav-
Features
Usually
A data object is described by a set of features or variables, usually
represented as a multidimensional vector.
Thus
We tend to collect all that information as a pattern matrix of
dimensionality N Ɨ d.
Then
A feature can be classiļ¬ed as continuous, discrete, or binary.
10 / 37
Images/cinvestav-
Features
Usually
A data object is described by a set of features or variables, usually
represented as a multidimensional vector.
Thus
We tend to collect all that information as a pattern matrix of
dimensionality N Ɨ d.
Then
A feature can be classiļ¬ed as continuous, discrete, or binary.
10 / 37
Images/cinvestav-
Features
Usually
A data object is described by a set of features or variables, usually
represented as a multidimensional vector.
Thus
We tend to collect all that information as a pattern matrix of
dimensionality N Ɨ d.
Then
A feature can be classiļ¬ed as continuous, discrete, or binary.
10 / 37
Images/cinvestav-
Thus
Continuous
A continuous feature takes values from an uncountably inļ¬nite range set.
Discrete
Discrete features only have ļ¬nite, or at most, a countably inļ¬nite number
of values.
Binary
Binary or dichotomous features are a special case of discrete features when
they have exactly two values.
11 / 37
Images/cinvestav-
Thus
Continuous
A continuous feature takes values from an uncountably inļ¬nite range set.
Discrete
Discrete features only have ļ¬nite, or at most, a countably inļ¬nite number
of values.
Binary
Binary or dichotomous features are a special case of discrete features when
they have exactly two values.
11 / 37
Images/cinvestav-
Thus
Continuous
A continuous feature takes values from an uncountably inļ¬nite range set.
Discrete
Discrete features only have ļ¬nite, or at most, a countably inļ¬nite number
of values.
Binary
Binary or dichotomous features are a special case of discrete features when
they have exactly two values.
11 / 37
Images/cinvestav-
Measurement Level
Meaning
Another property of features is the measurement level, which reļ¬‚ects the
relative signiļ¬cance of numbers.
We have four from lowest to highest
Nominal, ordinal, interval, and ratio.
12 / 37
Images/cinvestav-
Measurement Level
Meaning
Another property of features is the measurement level, which reļ¬‚ects the
relative signiļ¬cance of numbers.
We have four from lowest to highest
Nominal, ordinal, interval, and ratio.
12 / 37
Images/cinvestav-
Measurement Level
Nominal
Features at this level are represented with labels, states, or names.
The numbers are meaningless in any mathematical sense and no
mathematical calculation can be made.
For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}.
Ordinal
Features at this level are also names, but with a certain order implied.
However, the diļ¬€erence between the values is again meaningless.
For example, abdominal distension āˆˆ {slight, moderate, high}.
13 / 37
Images/cinvestav-
Measurement Level
Nominal
Features at this level are represented with labels, states, or names.
The numbers are meaningless in any mathematical sense and no
mathematical calculation can be made.
For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}.
Ordinal
Features at this level are also names, but with a certain order implied.
However, the diļ¬€erence between the values is again meaningless.
For example, abdominal distension āˆˆ {slight, moderate, high}.
13 / 37
Images/cinvestav-
Measurement Level
Nominal
Features at this level are represented with labels, states, or names.
The numbers are meaningless in any mathematical sense and no
mathematical calculation can be made.
For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}.
Ordinal
Features at this level are also names, but with a certain order implied.
However, the diļ¬€erence between the values is again meaningless.
For example, abdominal distension āˆˆ {slight, moderate, high}.
13 / 37
Images/cinvestav-
Measurement Level
Nominal
Features at this level are represented with labels, states, or names.
The numbers are meaningless in any mathematical sense and no
mathematical calculation can be made.
For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}.
Ordinal
Features at this level are also names, but with a certain order implied.
However, the diļ¬€erence between the values is again meaningless.
For example, abdominal distension āˆˆ {slight, moderate, high}.
13 / 37
Images/cinvestav-
Measurement Level
Nominal
Features at this level are represented with labels, states, or names.
The numbers are meaningless in any mathematical sense and no
mathematical calculation can be made.
For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}.
Ordinal
Features at this level are also names, but with a certain order implied.
However, the diļ¬€erence between the values is again meaningless.
For example, abdominal distension āˆˆ {slight, moderate, high}.
13 / 37
Images/cinvestav-
Measurement Level
Nominal
Features at this level are represented with labels, states, or names.
The numbers are meaningless in any mathematical sense and no
mathematical calculation can be made.
For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}.
Ordinal
Features at this level are also names, but with a certain order implied.
However, the diļ¬€erence between the values is again meaningless.
For example, abdominal distension āˆˆ {slight, moderate, high}.
13 / 37
Images/cinvestav-
Measurement Level
Interval
Features at this level oļ¬€er a meaningful interpretation of the
diļ¬€erence between two values.
However, there exists no true zero and the ratio between two values is
meaningless.
For example, the concept cold can be seen as an interval.
Ratio
Features at this level possess all the properties of the above levels.
There is an absolute zero.
Thus, we have the meaning of ratio!!!
14 / 37
Images/cinvestav-
Measurement Level
Interval
Features at this level oļ¬€er a meaningful interpretation of the
diļ¬€erence between two values.
However, there exists no true zero and the ratio between two values is
meaningless.
For example, the concept cold can be seen as an interval.
Ratio
Features at this level possess all the properties of the above levels.
There is an absolute zero.
Thus, we have the meaning of ratio!!!
14 / 37
Images/cinvestav-
Measurement Level
Interval
Features at this level oļ¬€er a meaningful interpretation of the
diļ¬€erence between two values.
However, there exists no true zero and the ratio between two values is
meaningless.
For example, the concept cold can be seen as an interval.
Ratio
Features at this level possess all the properties of the above levels.
There is an absolute zero.
Thus, we have the meaning of ratio!!!
14 / 37
Images/cinvestav-
Measurement Level
Interval
Features at this level oļ¬€er a meaningful interpretation of the
diļ¬€erence between two values.
However, there exists no true zero and the ratio between two values is
meaningless.
For example, the concept cold can be seen as an interval.
Ratio
Features at this level possess all the properties of the above levels.
There is an absolute zero.
Thus, we have the meaning of ratio!!!
14 / 37
Images/cinvestav-
Measurement Level
Interval
Features at this level oļ¬€er a meaningful interpretation of the
diļ¬€erence between two values.
However, there exists no true zero and the ratio between two values is
meaningless.
For example, the concept cold can be seen as an interval.
Ratio
Features at this level possess all the properties of the above levels.
There is an absolute zero.
Thus, we have the meaning of ratio!!!
14 / 37
Images/cinvestav-
Measurement Level
Interval
Features at this level oļ¬€er a meaningful interpretation of the
diļ¬€erence between two values.
However, there exists no true zero and the ratio between two values is
meaningless.
For example, the concept cold can be seen as an interval.
Ratio
Features at this level possess all the properties of the above levels.
There is an absolute zero.
Thus, we have the meaning of ratio!!!
14 / 37
Images/cinvestav-
Outline
1 Introduction
Introduction
2 Features
Types of Features
3 Similarity and Dissimilarity Measures
Basic Deļ¬nition
4 Proximity Measures between Two Points
Real-Valued Vectors
Dissimilarity Measures
Similarity Measures
Discrete-Valued Vectors
Dissimilarity Measures
Similarity Measures
5 Other Examples
Between Sets
15 / 37
Images/cinvestav-
Similarity Measures
Deļ¬nition
A Similarity Measure s on X is a function
s : X Ɨ X ā†’ R (1)
Such that
1 s (x, y) ā‰¤ s0.
2 s (x, x) = s0 for all x āˆˆ X.
3 s (x, y) = s (y, x) for all x, y āˆˆ X.
4 If s (x, y) = s0 ā‡ā‡’ x = y.
5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X.
16 / 37
Images/cinvestav-
Similarity Measures
Deļ¬nition
A Similarity Measure s on X is a function
s : X Ɨ X ā†’ R (1)
Such that
1 s (x, y) ā‰¤ s0.
2 s (x, x) = s0 for all x āˆˆ X.
3 s (x, y) = s (y, x) for all x, y āˆˆ X.
4 If s (x, y) = s0 ā‡ā‡’ x = y.
5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X.
16 / 37
Images/cinvestav-
Similarity Measures
Deļ¬nition
A Similarity Measure s on X is a function
s : X Ɨ X ā†’ R (1)
Such that
1 s (x, y) ā‰¤ s0.
2 s (x, x) = s0 for all x āˆˆ X.
3 s (x, y) = s (y, x) for all x, y āˆˆ X.
4 If s (x, y) = s0 ā‡ā‡’ x = y.
5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X.
16 / 37
Images/cinvestav-
Similarity Measures
Deļ¬nition
A Similarity Measure s on X is a function
s : X Ɨ X ā†’ R (1)
Such that
1 s (x, y) ā‰¤ s0.
2 s (x, x) = s0 for all x āˆˆ X.
3 s (x, y) = s (y, x) for all x, y āˆˆ X.
4 If s (x, y) = s0 ā‡ā‡’ x = y.
5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X.
16 / 37
Images/cinvestav-
Similarity Measures
Deļ¬nition
A Similarity Measure s on X is a function
s : X Ɨ X ā†’ R (1)
Such that
1 s (x, y) ā‰¤ s0.
2 s (x, x) = s0 for all x āˆˆ X.
3 s (x, y) = s (y, x) for all x, y āˆˆ X.
4 If s (x, y) = s0 ā‡ā‡’ x = y.
5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X.
16 / 37
Images/cinvestav-
Similarity Measures
Deļ¬nition
A Similarity Measure s on X is a function
s : X Ɨ X ā†’ R (1)
Such that
1 s (x, y) ā‰¤ s0.
2 s (x, x) = s0 for all x āˆˆ X.
3 s (x, y) = s (y, x) for all x, y āˆˆ X.
4 If s (x, y) = s0 ā‡ā‡’ x = y.
5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X.
16 / 37
Images/cinvestav-
Dissimilarity Measures
Deļ¬ntion
A Dissimilarity Measure d on X is a function
d : X Ɨ X ā†’ R (2)
Such that
1 d (x, y) ā‰„ d0 for all x, y āˆˆ X.
2 d (x, x) = d0 for all x āˆˆ X.
3 d (x, y) = d (y, x) for all x, y āˆˆ X.
4 If d (x, y) = d0 ā‡ā‡’ x = y.
5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X.
17 / 37
Images/cinvestav-
Dissimilarity Measures
Deļ¬ntion
A Dissimilarity Measure d on X is a function
d : X Ɨ X ā†’ R (2)
Such that
1 d (x, y) ā‰„ d0 for all x, y āˆˆ X.
2 d (x, x) = d0 for all x āˆˆ X.
3 d (x, y) = d (y, x) for all x, y āˆˆ X.
4 If d (x, y) = d0 ā‡ā‡’ x = y.
5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X.
17 / 37
Images/cinvestav-
Dissimilarity Measures
Deļ¬ntion
A Dissimilarity Measure d on X is a function
d : X Ɨ X ā†’ R (2)
Such that
1 d (x, y) ā‰„ d0 for all x, y āˆˆ X.
2 d (x, x) = d0 for all x āˆˆ X.
3 d (x, y) = d (y, x) for all x, y āˆˆ X.
4 If d (x, y) = d0 ā‡ā‡’ x = y.
5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X.
17 / 37
Images/cinvestav-
Dissimilarity Measures
Deļ¬ntion
A Dissimilarity Measure d on X is a function
d : X Ɨ X ā†’ R (2)
Such that
1 d (x, y) ā‰„ d0 for all x, y āˆˆ X.
2 d (x, x) = d0 for all x āˆˆ X.
3 d (x, y) = d (y, x) for all x, y āˆˆ X.
4 If d (x, y) = d0 ā‡ā‡’ x = y.
5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X.
17 / 37
Images/cinvestav-
Dissimilarity Measures
Deļ¬ntion
A Dissimilarity Measure d on X is a function
d : X Ɨ X ā†’ R (2)
Such that
1 d (x, y) ā‰„ d0 for all x, y āˆˆ X.
2 d (x, x) = d0 for all x āˆˆ X.
3 d (x, y) = d (y, x) for all x, y āˆˆ X.
4 If d (x, y) = d0 ā‡ā‡’ x = y.
5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X.
17 / 37
Images/cinvestav-
Dissimilarity Measures
Deļ¬ntion
A Dissimilarity Measure d on X is a function
d : X Ɨ X ā†’ R (2)
Such that
1 d (x, y) ā‰„ d0 for all x, y āˆˆ X.
2 d (x, x) = d0 for all x āˆˆ X.
3 d (x, y) = d (y, x) for all x, y āˆˆ X.
4 If d (x, y) = d0 ā‡ā‡’ x = y.
5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X.
17 / 37
Images/cinvestav-
Outline
1 Introduction
Introduction
2 Features
Types of Features
3 Similarity and Dissimilarity Measures
Basic Deļ¬nition
4 Proximity Measures between Two Points
Real-Valued Vectors
Dissimilarity Measures
Similarity Measures
Discrete-Valued Vectors
Dissimilarity Measures
Similarity Measures
5 Other Examples
Between Sets
18 / 37
Images/cinvestav-
Dissimilarity Measures
The Hamming distance for {0, 1}
The vectors here are collections of zeros and ones. Thus,
dH (x, y) =
d
i=1
(xi āˆ’ yi)2
(3)
Euclidean distance
d (x, y) =
d
i=1
|xi āˆ’ yi|
1
2
2
(4)
19 / 37
Images/cinvestav-
Dissimilarity Measures
The Hamming distance for {0, 1}
The vectors here are collections of zeros and ones. Thus,
dH (x, y) =
d
i=1
(xi āˆ’ yi)2
(3)
Euclidean distance
d (x, y) =
d
i=1
|xi āˆ’ yi|
1
2
2
(4)
19 / 37
Images/cinvestav-
Dissimilarity Measures
The weighted lp metric DMs
dp (x, y) =
d
i=1
wi |xi āˆ’ yi|p
1
p
(5)
Where
Where xi, yi are the ith coordinates of x and y, i = 1, ..., d and wi ā‰„ 0 is
the ith weight coeļ¬ƒcient.
Important
A well-known representative of the latter category of measures is the
Euclidean distance.
20 / 37
Images/cinvestav-
Dissimilarity Measures
The weighted lp metric DMs
dp (x, y) =
d
i=1
wi |xi āˆ’ yi|p
1
p
(5)
Where
Where xi, yi are the ith coordinates of x and y, i = 1, ..., d and wi ā‰„ 0 is
the ith weight coeļ¬ƒcient.
Important
A well-known representative of the latter category of measures is the
Euclidean distance.
20 / 37
Images/cinvestav-
Dissimilarity Measures
The weighted lp metric DMs
dp (x, y) =
d
i=1
wi |xi āˆ’ yi|p
1
p
(5)
Where
Where xi, yi are the ith coordinates of x and y, i = 1, ..., d and wi ā‰„ 0 is
the ith weight coeļ¬ƒcient.
Important
A well-known representative of the latter category of measures is the
Euclidean distance.
20 / 37
Images/cinvestav-
Dissimilarity Measures
The weighted l2 metric DM can be further generalized as follows
d (x, y) = (x āˆ’ y)T
B (x āˆ’ y) (6)
Where B is a symmetric, positive deļ¬nite matrix
A special case
This includes the Mahalanobis distance as a special case
21 / 37
Images/cinvestav-
Dissimilarity Measures
The weighted l2 metric DM can be further generalized as follows
d (x, y) = (x āˆ’ y)T
B (x āˆ’ y) (6)
Where B is a symmetric, positive deļ¬nite matrix
A special case
This includes the Mahalanobis distance as a special case
21 / 37
Images/cinvestav-
Dissimilarity Measures
Special lp metric DMs that are also encountered in practice are
d1 (x, y) =
d
i=1
wi |xi āˆ’ yi| (Weighted Manhattan Norm)
dāˆž (x, y) = max
1ā‰¤iā‰¤d
wi |xi āˆ’ yi| (Weighted lāˆž Norm)
22 / 37
Images/cinvestav-
Dissimilarity Measures
Special lp metric DMs that are also encountered in practice are
d1 (x, y) =
d
i=1
wi |xi āˆ’ yi| (Weighted Manhattan Norm)
dāˆž (x, y) = max
1ā‰¤iā‰¤d
wi |xi āˆ’ yi| (Weighted lāˆž Norm)
22 / 37
Images/cinvestav-
Similarity Measures
The inner product
sinner (x, y) = xT
y =
d
i=1
xiyi (7)
Closely related to the inner product is the cosine similarity measure
scosine (x, y) =
xT y
x y
(8)
Where x = d
i=1 x2
i and y = d
i=1 y2
i , the length of the
vectors!!!
23 / 37
Images/cinvestav-
Similarity Measures
The inner product
sinner (x, y) = xT
y =
d
i=1
xiyi (7)
Closely related to the inner product is the cosine similarity measure
scosine (x, y) =
xT y
x y
(8)
Where x = d
i=1 x2
i and y = d
i=1 y2
i , the length of the
vectors!!!
23 / 37
Images/cinvestav-
Similarity Measures
Pearsonā€™s correlation coeļ¬ƒcient
rPearson (x, y) =
(x āˆ’ x)T
(y āˆ’ y)
x y
(9)
Where
x =
1
d
d
i=1
xi and y =
1
d
d
i=1
yi (10)
Properties
It is clear that rPearson (x, y) takes values between -1 and +1.
The diļ¬€erence between sinner and rPearson does not depend directly
on x and y but on their corresponding diļ¬€erence vectors.
24 / 37
Images/cinvestav-
Similarity Measures
Pearsonā€™s correlation coeļ¬ƒcient
rPearson (x, y) =
(x āˆ’ x)T
(y āˆ’ y)
x y
(9)
Where
x =
1
d
d
i=1
xi and y =
1
d
d
i=1
yi (10)
Properties
It is clear that rPearson (x, y) takes values between -1 and +1.
The diļ¬€erence between sinner and rPearson does not depend directly
on x and y but on their corresponding diļ¬€erence vectors.
24 / 37
Images/cinvestav-
Similarity Measures
Pearsonā€™s correlation coeļ¬ƒcient
rPearson (x, y) =
(x āˆ’ x)T
(y āˆ’ y)
x y
(9)
Where
x =
1
d
d
i=1
xi and y =
1
d
d
i=1
yi (10)
Properties
It is clear that rPearson (x, y) takes values between -1 and +1.
The diļ¬€erence between sinner and rPearson does not depend directly
on x and y but on their corresponding diļ¬€erence vectors.
24 / 37
Images/cinvestav-
Similarity Measures
Pearsonā€™s correlation coeļ¬ƒcient
rPearson (x, y) =
(x āˆ’ x)T
(y āˆ’ y)
x y
(9)
Where
x =
1
d
d
i=1
xi and y =
1
d
d
i=1
yi (10)
Properties
It is clear that rPearson (x, y) takes values between -1 and +1.
The diļ¬€erence between sinner and rPearson does not depend directly
on x and y but on their corresponding diļ¬€erence vectors.
24 / 37
Images/cinvestav-
Similarity Measures
The Tanimoto measure or distance
sT (x, y) =
xT y
x 2
+ y 2
āˆ’ xT y
(11)
Something Notable
sT (x, y) =
1
1 + (xāˆ’y)T
(xāˆ’y)
xT y
(12)
Meaning
The Tanimoto measure is inversely proportional to the squared Euclidean
distance.
25 / 37
Images/cinvestav-
Similarity Measures
The Tanimoto measure or distance
sT (x, y) =
xT y
x 2
+ y 2
āˆ’ xT y
(11)
Something Notable
sT (x, y) =
1
1 + (xāˆ’y)T
(xāˆ’y)
xT y
(12)
Meaning
The Tanimoto measure is inversely proportional to the squared Euclidean
distance.
25 / 37
Images/cinvestav-
Similarity Measures
The Tanimoto measure or distance
sT (x, y) =
xT y
x 2
+ y 2
āˆ’ xT y
(11)
Something Notable
sT (x, y) =
1
1 + (xāˆ’y)T
(xāˆ’y)
xT y
(12)
Meaning
The Tanimoto measure is inversely proportional to the squared Euclidean
distance.
25 / 37
Images/cinvestav-
Outline
1 Introduction
Introduction
2 Features
Types of Features
3 Similarity and Dissimilarity Measures
Basic Deļ¬nition
4 Proximity Measures between Two Points
Real-Valued Vectors
Dissimilarity Measures
Similarity Measures
Discrete-Valued Vectors
Dissimilarity Measures
Similarity Measures
5 Other Examples
Between Sets
26 / 37
Images/cinvestav-
We consider now vectors whose coordinates belong to a
ļ¬nite set
Given F = {0, 1, 2, ..., k āˆ’ 1} where k is a positive integer
There are exactly kd vectors x āˆˆ Fd
Now consider x, y āˆˆ Fd
With A (x, y) = [aij], i, j = 0, 1, ..., k āˆ’ 1 be a k Ɨ k matrix
Where
The element aij is the number of places where the ļ¬rst vector has the i
symbol and the corresponding element of the second vector has the j
symbol.
27 / 37
Images/cinvestav-
We consider now vectors whose coordinates belong to a
ļ¬nite set
Given F = {0, 1, 2, ..., k āˆ’ 1} where k is a positive integer
There are exactly kd vectors x āˆˆ Fd
Now consider x, y āˆˆ Fd
With A (x, y) = [aij], i, j = 0, 1, ..., k āˆ’ 1 be a k Ɨ k matrix
Where
The element aij is the number of places where the ļ¬rst vector has the i
symbol and the corresponding element of the second vector has the j
symbol.
27 / 37
Images/cinvestav-
We consider now vectors whose coordinates belong to a
ļ¬nite set
Given F = {0, 1, 2, ..., k āˆ’ 1} where k is a positive integer
There are exactly kd vectors x āˆˆ Fd
Now consider x, y āˆˆ Fd
With A (x, y) = [aij], i, j = 0, 1, ..., k āˆ’ 1 be a k Ɨ k matrix
Where
The element aij is the number of places where the ļ¬rst vector has the i
symbol and the corresponding element of the second vector has the j
symbol.
27 / 37
Images/cinvestav-
Using Indicator functions
We can think of each element aij
aij =
d
k=
I [(i, j) == (xk, yk)] (13)
Thanks to Ricardo Llamas and Gibran Felix!!! Class Summer 2015.
28 / 37
Images/cinvestav-
Contingency Matrix
Thus
This matrix is called a contingency table.
For example if d = 6 and k = 3
With x = [0, 1, 2, 1, 2, 1]T
and y = [1, 0, 2, 1, 0, 1]T
Then
A (x, y) =
ļ£®
ļ£Æ
ļ£°
0 1 0
1 2 0
1 0 1
ļ£¹
ļ£ŗ
ļ£» (14)
29 / 37
Images/cinvestav-
Contingency Matrix
Thus
This matrix is called a contingency table.
For example if d = 6 and k = 3
With x = [0, 1, 2, 1, 2, 1]T
and y = [1, 0, 2, 1, 0, 1]T
Then
A (x, y) =
ļ£®
ļ£Æ
ļ£°
0 1 0
1 2 0
1 0 1
ļ£¹
ļ£ŗ
ļ£» (14)
29 / 37
Images/cinvestav-
Contingency Matrix
Thus
This matrix is called a contingency table.
For example if d = 6 and k = 3
With x = [0, 1, 2, 1, 2, 1]T
and y = [1, 0, 2, 1, 0, 1]T
Then
A (x, y) =
ļ£®
ļ£Æ
ļ£°
0 1 0
1 2 0
1 0 1
ļ£¹
ļ£ŗ
ļ£» (14)
29 / 37
Images/cinvestav-
Thus, we have that
Easy to verify that
kāˆ’1
i=0
kāˆ’1
j=0
aij = d (15)
Something Notable
Most of the proximity measures between two discrete-valued vectors may
be expressed as combinations of elements of matrix A (x, y).
30 / 37
Images/cinvestav-
Thus, we have that
Easy to verify that
kāˆ’1
i=0
kāˆ’1
j=0
aij = d (15)
Something Notable
Most of the proximity measures between two discrete-valued vectors may
be expressed as combinations of elements of matrix A (x, y).
30 / 37
Images/cinvestav-
Dissimilarity Measures
The Hamming distance
It is deļ¬ned as the number of places where two vectors diļ¬€er:
d (x, y) =
kāˆ’1
i=0
kāˆ’1
j=0,j=i
aij (16)
The summation of all the oļ¬€-diagonal elements of A, which indicate the
positions where x and y diļ¬€er.
Special case k = 2, thus vectors are binary valued
The vectors x āˆˆ Fd are binary valued and the Hamming distance becomes
dH (x, y) =
d
i=1
(xi + yi āˆ’ 2xiyi)
=
d
i=1
(xi āˆ’ yi)2
31 / 37
Images/cinvestav-
Dissimilarity Measures
The Hamming distance
It is deļ¬ned as the number of places where two vectors diļ¬€er:
d (x, y) =
kāˆ’1
i=0
kāˆ’1
j=0,j=i
aij (16)
The summation of all the oļ¬€-diagonal elements of A, which indicate the
positions where x and y diļ¬€er.
Special case k = 2, thus vectors are binary valued
The vectors x āˆˆ Fd are binary valued and the Hamming distance becomes
dH (x, y) =
d
i=1
(xi + yi āˆ’ 2xiyi)
=
d
i=1
(xi āˆ’ yi)2
31 / 37
Images/cinvestav-
Dissimilarity Measures
The l1 distance
It is deļ¬ned as in the case of the continuous-valued vectors,
d1 (x, y) =
d
i=1
wi |xi āˆ’ yi| (17)
32 / 37
Images/cinvestav-
Similarity Measures
The Tanimoto measure
Given two sets X and Y , we have that
sT (X, Y ) =
nXāˆ©Y
nX + nY āˆ’ nXāˆ©Y
(18)
where nX = |X|, nY = |Y |, nXāˆ©Y = |X āˆ© Y |.
Thus, if x, y are discrete-valued vectors
The measure takes into account all pairs of corresponding coordinates,
except those whose corresponding coordinates (xi, yi) are both 0.
33 / 37
Images/cinvestav-
Similarity Measures
The Tanimoto measure
Given two sets X and Y , we have that
sT (X, Y ) =
nXāˆ©Y
nX + nY āˆ’ nXāˆ©Y
(18)
where nX = |X|, nY = |Y |, nXāˆ©Y = |X āˆ© Y |.
Thus, if x, y are discrete-valued vectors
The measure takes into account all pairs of corresponding coordinates,
except those whose corresponding coordinates (xi, yi) are both 0.
33 / 37
Images/cinvestav-
Similarity Measures
The Tanimoto measure
Given two sets X and Y , we have that
sT (X, Y ) =
nXāˆ©Y
nX + nY āˆ’ nXāˆ©Y
(18)
where nX = |X|, nY = |Y |, nXāˆ©Y = |X āˆ© Y |.
Thus, if x, y are discrete-valued vectors
The measure takes into account all pairs of corresponding coordinates,
except those whose corresponding coordinates (xi, yi) are both 0.
33 / 37
Images/cinvestav-
Thus
We now deļ¬ne
nx = kāˆ’1
i=1
kāˆ’1
j=0 aij and ny = kāˆ’1
i=0
kāˆ’1
j=1 aij
In other words
nx and ny denotes the number of the nonzero coordinate of x and y.
Thus, we have
sT (x, y) ==
kāˆ’1
i=1
kāˆ’1
j=i aii
nX + nY āˆ’ kāˆ’1
i=1
kāˆ’1
j=i aij
(19)
34 / 37
Images/cinvestav-
Thus
We now deļ¬ne
nx = kāˆ’1
i=1
kāˆ’1
j=0 aij and ny = kāˆ’1
i=0
kāˆ’1
j=1 aij
In other words
nx and ny denotes the number of the nonzero coordinate of x and y.
Thus, we have
sT (x, y) ==
kāˆ’1
i=1
kāˆ’1
j=i aii
nX + nY āˆ’ kāˆ’1
i=1
kāˆ’1
j=i aij
(19)
34 / 37
Images/cinvestav-
Thus
We now deļ¬ne
nx = kāˆ’1
i=1
kāˆ’1
j=0 aij and ny = kāˆ’1
i=0
kāˆ’1
j=1 aij
In other words
nx and ny denotes the number of the nonzero coordinate of x and y.
Thus, we have
sT (x, y) ==
kāˆ’1
i=1
kāˆ’1
j=i aii
nX + nY āˆ’ kāˆ’1
i=1
kāˆ’1
j=i aij
(19)
34 / 37
Images/cinvestav-
Outline
1 Introduction
Introduction
2 Features
Types of Features
3 Similarity and Dissimilarity Measures
Basic Deļ¬nition
4 Proximity Measures between Two Points
Real-Valued Vectors
Dissimilarity Measures
Similarity Measures
Discrete-Valued Vectors
Dissimilarity Measures
Similarity Measures
5 Other Examples
Between Sets
35 / 37
Images/cinvestav-
Between Sets
Jaccard Similarity
Given two sets X and Y , we have that
J (X, Y ) =
|X āˆ© Y |
|X āˆŖ Y |
(20)
36 / 37
Images/cinvestav-
Between Sets
Jaccard Similarity
Given two sets X and Y , we have that
J (X, Y ) =
|X āˆ© Y |
|X āˆŖ Y |
(20)
36 / 37
Images/cinvestav-
Although there are more stuļ¬€ to look for
Please Refer to
1 Theodoridisā€™ Book Chapter 11.
2 Rui Xu and Don Wunsch. 2009. Clustering. Wiley-IEEE Press.
37 / 37

More Related Content

What's hot

Introduction to logistic regression
Introduction to logistic regressionIntroduction to logistic regression
Introduction to logistic regressionAndres Mendez-Vazquez
Ā 
Machine learning in science and industry ā€” day 2
Machine learning in science and industry ā€” day 2Machine learning in science and industry ā€” day 2
Machine learning in science and industry ā€” day 2arogozhnikov
Ā 
Machine learning in science and industry ā€” day 1
Machine learning in science and industry ā€” day 1Machine learning in science and industry ā€” day 1
Machine learning in science and industry ā€” day 1arogozhnikov
Ā 
Machine learning in science and industry ā€” day 3
Machine learning in science and industry ā€” day 3Machine learning in science and industry ā€” day 3
Machine learning in science and industry ā€” day 3arogozhnikov
Ā 
Tree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptionsTree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptionsGilles Louppe
Ā 
Evaluating classifierperformance ml-cs6923
Evaluating classifierperformance ml-cs6923Evaluating classifierperformance ml-cs6923
Evaluating classifierperformance ml-cs6923Raman Kannan
Ā 
Ridge regression, lasso and elastic net
Ridge regression, lasso and elastic netRidge regression, lasso and elastic net
Ridge regression, lasso and elastic netVivian S. Zhang
Ā 
(DL hacksč¼ŖčŖ­) How to Train Deep Variational Autoencoders and Probabilistic Lad...
(DL hacksč¼ŖčŖ­) How to Train Deep Variational Autoencoders and Probabilistic Lad...(DL hacksč¼ŖčŖ­) How to Train Deep Variational Autoencoders and Probabilistic Lad...
(DL hacksč¼ŖčŖ­) How to Train Deep Variational Autoencoders and Probabilistic Lad...Masahiro Suzuki
Ā 
Preparation Data Structures 10 trees
Preparation Data Structures 10 treesPreparation Data Structures 10 trees
Preparation Data Structures 10 treesAndres Mendez-Vazquez
Ā 
Cluster Analysis
Cluster AnalysisCluster Analysis
Cluster AnalysisSSA KPI
Ā 
15857 cse422 unsupervised-learning
15857 cse422 unsupervised-learning15857 cse422 unsupervised-learning
15857 cse422 unsupervised-learningAnil Yadav
Ā 
Scaling Multinomial Logistic Regression via Hybrid Parallelism
Scaling Multinomial Logistic Regression via Hybrid ParallelismScaling Multinomial Logistic Regression via Hybrid Parallelism
Scaling Multinomial Logistic Regression via Hybrid ParallelismParameswaran Raman
Ā 
Statistical Pattern recognition(1)
Statistical Pattern recognition(1)Statistical Pattern recognition(1)
Statistical Pattern recognition(1)Syed Atif Naseem
Ā 
06 Machine Learning - Naive Bayes
06 Machine Learning - Naive Bayes06 Machine Learning - Naive Bayes
06 Machine Learning - Naive BayesAndres Mendez-Vazquez
Ā 
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its authorKaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its authorVivian S. Zhang
Ā 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysisAcad
Ā 

What's hot (20)

Introduction to logistic regression
Introduction to logistic regressionIntroduction to logistic regression
Introduction to logistic regression
Ā 
Machine learning in science and industry ā€” day 2
Machine learning in science and industry ā€” day 2Machine learning in science and industry ā€” day 2
Machine learning in science and industry ā€” day 2
Ā 
18.1 combining models
18.1 combining models18.1 combining models
18.1 combining models
Ā 
Machine learning in science and industry ā€” day 1
Machine learning in science and industry ā€” day 1Machine learning in science and industry ā€” day 1
Machine learning in science and industry ā€” day 1
Ā 
Machine learning in science and industry ā€” day 3
Machine learning in science and industry ā€” day 3Machine learning in science and industry ā€” day 3
Machine learning in science and industry ā€” day 3
Ā 
Tree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptionsTree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptions
Ā 
Evaluating classifierperformance ml-cs6923
Evaluating classifierperformance ml-cs6923Evaluating classifierperformance ml-cs6923
Evaluating classifierperformance ml-cs6923
Ā 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
Ā 
M03 nb-02
M03 nb-02M03 nb-02
M03 nb-02
Ā 
Ridge regression, lasso and elastic net
Ridge regression, lasso and elastic netRidge regression, lasso and elastic net
Ridge regression, lasso and elastic net
Ā 
(DL hacksč¼ŖčŖ­) How to Train Deep Variational Autoencoders and Probabilistic Lad...
(DL hacksč¼ŖčŖ­) How to Train Deep Variational Autoencoders and Probabilistic Lad...(DL hacksč¼ŖčŖ­) How to Train Deep Variational Autoencoders and Probabilistic Lad...
(DL hacksč¼ŖčŖ­) How to Train Deep Variational Autoencoders and Probabilistic Lad...
Ā 
Preparation Data Structures 10 trees
Preparation Data Structures 10 treesPreparation Data Structures 10 trees
Preparation Data Structures 10 trees
Ā 
Cluster Analysis
Cluster AnalysisCluster Analysis
Cluster Analysis
Ā 
15857 cse422 unsupervised-learning
15857 cse422 unsupervised-learning15857 cse422 unsupervised-learning
15857 cse422 unsupervised-learning
Ā 
Scaling Multinomial Logistic Regression via Hybrid Parallelism
Scaling Multinomial Logistic Regression via Hybrid ParallelismScaling Multinomial Logistic Regression via Hybrid Parallelism
Scaling Multinomial Logistic Regression via Hybrid Parallelism
Ā 
Statistical Pattern recognition(1)
Statistical Pattern recognition(1)Statistical Pattern recognition(1)
Statistical Pattern recognition(1)
Ā 
06 Machine Learning - Naive Bayes
06 Machine Learning - Naive Bayes06 Machine Learning - Naive Bayes
06 Machine Learning - Naive Bayes
Ā 
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its authorKaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Ā 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
Ā 
Machine Learning
Machine LearningMachine Learning
Machine Learning
Ā 

Viewers also liked

What is cluster analysis
What is cluster analysisWhat is cluster analysis
What is cluster analysisPrabhat gangwar
Ā 
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Salah Amean
Ā 
Data mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, dataData mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, dataSalah Amean
Ā 
similarity measure
similarity measure similarity measure
similarity measure ZHAO Sam
Ā 
Cluster analysis for market segmentation
Cluster analysis for market segmentationCluster analysis for market segmentation
Cluster analysis for market segmentationVishal Tandel
Ā 

Viewers also liked (7)

What is cluster analysis
What is cluster analysisWhat is cluster analysis
What is cluster analysis
Ā 
Collaborative filtering
Collaborative filteringCollaborative filtering
Collaborative filtering
Ā 
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Ā 
Data mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, dataData mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, data
Ā 
similarity measure
similarity measure similarity measure
similarity measure
Ā 
Dataa miining
Dataa miiningDataa miining
Dataa miining
Ā 
Cluster analysis for market segmentation
Cluster analysis for market segmentationCluster analysis for market segmentation
Cluster analysis for market segmentation
Ā 

Similar to 27 Machine Learning Unsupervised Measure Properties

Lect4
Lect4Lect4
Lect4sumit621
Ā 
20 k-means, k-center, k-meoids and variations
20 k-means, k-center, k-meoids and variations20 k-means, k-center, k-meoids and variations
20 k-means, k-center, k-meoids and variationsAndres Mendez-Vazquez
Ā 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning BasicsHumberto Marchezi
Ā 
Similarity Measures (pptx)
Similarity Measures (pptx)Similarity Measures (pptx)
Similarity Measures (pptx)JackDi2
Ā 
Preparation Data Structures 11 graphs
Preparation Data Structures 11 graphsPreparation Data Structures 11 graphs
Preparation Data Structures 11 graphsAndres Mendez-Vazquez
Ā 
cluster analysis
cluster analysiscluster analysis
cluster analysissudesh regmi
Ā 
Decision tree
Decision treeDecision tree
Decision treeAmi_Surati
Ā 
Decision tree
Decision treeDecision tree
Decision treeSoujanya V
Ā 
Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Zihui Li
Ā 
Cs345 cl
Cs345 clCs345 cl
Cs345 clmattriley
Ā 
Decision Trees and Bayes Classifiers
Decision Trees and Bayes ClassifiersDecision Trees and Bayes Classifiers
Decision Trees and Bayes ClassifiersAlexander Jung
Ā 
PR07.pdf
PR07.pdfPR07.pdf
PR07.pdfRadhwan2
Ā 
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...Simplilearn
Ā 
kmean_naivebayes.pptx
kmean_naivebayes.pptxkmean_naivebayes.pptx
kmean_naivebayes.pptxAryanhayaran
Ā 
2_9_asset-v1-ColumbiaX+CSMM.101x+2T2017+type@asset+block@AI_edx_ml_unsupervis...
2_9_asset-v1-ColumbiaX+CSMM.101x+2T2017+type@asset+block@AI_edx_ml_unsupervis...2_9_asset-v1-ColumbiaX+CSMM.101x+2T2017+type@asset+block@AI_edx_ml_unsupervis...
2_9_asset-v1-ColumbiaX+CSMM.101x+2T2017+type@asset+block@AI_edx_ml_unsupervis...MostafaHazemMostafaa
Ā 
Cheatsheet unsupervised-learning
Cheatsheet unsupervised-learningCheatsheet unsupervised-learning
Cheatsheet unsupervised-learningSteve Nouri
Ā 
Output Units and Cost Function in FNN
Output Units and Cost Function in FNNOutput Units and Cost Function in FNN
Output Units and Cost Function in FNNLin JiaMing
Ā 
[PPT]
[PPT][PPT]
[PPT]butest
Ā 
Cantor Infinity theorems
Cantor Infinity theoremsCantor Infinity theorems
Cantor Infinity theoremsOren Ish-Am
Ā 
TunUp final presentation
TunUp final presentationTunUp final presentation
TunUp final presentationGianmario Spacagna
Ā 

Similar to 27 Machine Learning Unsupervised Measure Properties (20)

Lect4
Lect4Lect4
Lect4
Ā 
20 k-means, k-center, k-meoids and variations
20 k-means, k-center, k-meoids and variations20 k-means, k-center, k-meoids and variations
20 k-means, k-center, k-meoids and variations
Ā 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
Ā 
Similarity Measures (pptx)
Similarity Measures (pptx)Similarity Measures (pptx)
Similarity Measures (pptx)
Ā 
Preparation Data Structures 11 graphs
Preparation Data Structures 11 graphsPreparation Data Structures 11 graphs
Preparation Data Structures 11 graphs
Ā 
cluster analysis
cluster analysiscluster analysis
cluster analysis
Ā 
Decision tree
Decision treeDecision tree
Decision tree
Ā 
Decision tree
Decision treeDecision tree
Decision tree
Ā 
Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)
Ā 
Cs345 cl
Cs345 clCs345 cl
Cs345 cl
Ā 
Decision Trees and Bayes Classifiers
Decision Trees and Bayes ClassifiersDecision Trees and Bayes Classifiers
Decision Trees and Bayes Classifiers
Ā 
PR07.pdf
PR07.pdfPR07.pdf
PR07.pdf
Ā 
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
Ā 
kmean_naivebayes.pptx
kmean_naivebayes.pptxkmean_naivebayes.pptx
kmean_naivebayes.pptx
Ā 
2_9_asset-v1-ColumbiaX+CSMM.101x+2T2017+type@asset+block@AI_edx_ml_unsupervis...
2_9_asset-v1-ColumbiaX+CSMM.101x+2T2017+type@asset+block@AI_edx_ml_unsupervis...2_9_asset-v1-ColumbiaX+CSMM.101x+2T2017+type@asset+block@AI_edx_ml_unsupervis...
2_9_asset-v1-ColumbiaX+CSMM.101x+2T2017+type@asset+block@AI_edx_ml_unsupervis...
Ā 
Cheatsheet unsupervised-learning
Cheatsheet unsupervised-learningCheatsheet unsupervised-learning
Cheatsheet unsupervised-learning
Ā 
Output Units and Cost Function in FNN
Output Units and Cost Function in FNNOutput Units and Cost Function in FNN
Output Units and Cost Function in FNN
Ā 
[PPT]
[PPT][PPT]
[PPT]
Ā 
Cantor Infinity theorems
Cantor Infinity theoremsCantor Infinity theorems
Cantor Infinity theorems
Ā 
TunUp final presentation
TunUp final presentationTunUp final presentation
TunUp final presentation
Ā 

More from Andres Mendez-Vazquez

01.04 orthonormal basis_eigen_vectors
01.04 orthonormal basis_eigen_vectors01.04 orthonormal basis_eigen_vectors
01.04 orthonormal basis_eigen_vectorsAndres Mendez-Vazquez
Ā 
01.03 squared matrices_and_other_issues
01.03 squared matrices_and_other_issues01.03 squared matrices_and_other_issues
01.03 squared matrices_and_other_issuesAndres Mendez-Vazquez
Ā 
05 backpropagation automatic_differentiation
05 backpropagation automatic_differentiation05 backpropagation automatic_differentiation
05 backpropagation automatic_differentiationAndres Mendez-Vazquez
Ā 
01 Introduction to Neural Networks and Deep Learning
01 Introduction to Neural Networks and Deep Learning01 Introduction to Neural Networks and Deep Learning
01 Introduction to Neural Networks and Deep LearningAndres Mendez-Vazquez
Ā 
25 introduction reinforcement_learning
25 introduction reinforcement_learning25 introduction reinforcement_learning
25 introduction reinforcement_learningAndres Mendez-Vazquez
Ā 
Neural Networks and Deep Learning Syllabus
Neural Networks and Deep Learning SyllabusNeural Networks and Deep Learning Syllabus
Neural Networks and Deep Learning SyllabusAndres Mendez-Vazquez
Ā 
Introduction to artificial_intelligence_syllabus
Introduction to artificial_intelligence_syllabusIntroduction to artificial_intelligence_syllabus
Introduction to artificial_intelligence_syllabusAndres Mendez-Vazquez
Ā 
Ideas about a Bachelor in Machine Learning/Data Sciences
Ideas about a Bachelor in Machine Learning/Data SciencesIdeas about a Bachelor in Machine Learning/Data Sciences
Ideas about a Bachelor in Machine Learning/Data SciencesAndres Mendez-Vazquez
Ā 
Analysis of Algorithms Syllabus
Analysis of Algorithms  SyllabusAnalysis of Algorithms  Syllabus
Analysis of Algorithms SyllabusAndres Mendez-Vazquez
Ā 
17 vapnik chervonenkis dimension
17 vapnik chervonenkis dimension17 vapnik chervonenkis dimension
17 vapnik chervonenkis dimensionAndres Mendez-Vazquez
Ā 
A basic introduction to learning
A basic introduction to learningA basic introduction to learning
A basic introduction to learningAndres Mendez-Vazquez
Ā 
Introduction Mathematics Intelligent Systems Syllabus
Introduction Mathematics Intelligent Systems SyllabusIntroduction Mathematics Intelligent Systems Syllabus
Introduction Mathematics Intelligent Systems SyllabusAndres Mendez-Vazquez
Ā 
Introduction Machine Learning Syllabus
Introduction Machine Learning SyllabusIntroduction Machine Learning Syllabus
Introduction Machine Learning SyllabusAndres Mendez-Vazquez
Ā 

More from Andres Mendez-Vazquez (20)

2.03 bayesian estimation
2.03 bayesian estimation2.03 bayesian estimation
2.03 bayesian estimation
Ā 
05 linear transformations
05 linear transformations05 linear transformations
05 linear transformations
Ā 
01.04 orthonormal basis_eigen_vectors
01.04 orthonormal basis_eigen_vectors01.04 orthonormal basis_eigen_vectors
01.04 orthonormal basis_eigen_vectors
Ā 
01.03 squared matrices_and_other_issues
01.03 squared matrices_and_other_issues01.03 squared matrices_and_other_issues
01.03 squared matrices_and_other_issues
Ā 
01.02 linear equations
01.02 linear equations01.02 linear equations
01.02 linear equations
Ā 
01.01 vector spaces
01.01 vector spaces01.01 vector spaces
01.01 vector spaces
Ā 
06 recurrent neural_networks
06 recurrent neural_networks06 recurrent neural_networks
06 recurrent neural_networks
Ā 
05 backpropagation automatic_differentiation
05 backpropagation automatic_differentiation05 backpropagation automatic_differentiation
05 backpropagation automatic_differentiation
Ā 
Zetta global
Zetta globalZetta global
Zetta global
Ā 
01 Introduction to Neural Networks and Deep Learning
01 Introduction to Neural Networks and Deep Learning01 Introduction to Neural Networks and Deep Learning
01 Introduction to Neural Networks and Deep Learning
Ā 
25 introduction reinforcement_learning
25 introduction reinforcement_learning25 introduction reinforcement_learning
25 introduction reinforcement_learning
Ā 
Neural Networks and Deep Learning Syllabus
Neural Networks and Deep Learning SyllabusNeural Networks and Deep Learning Syllabus
Neural Networks and Deep Learning Syllabus
Ā 
Introduction to artificial_intelligence_syllabus
Introduction to artificial_intelligence_syllabusIntroduction to artificial_intelligence_syllabus
Introduction to artificial_intelligence_syllabus
Ā 
Ideas 09 22_2018
Ideas 09 22_2018Ideas 09 22_2018
Ideas 09 22_2018
Ā 
Ideas about a Bachelor in Machine Learning/Data Sciences
Ideas about a Bachelor in Machine Learning/Data SciencesIdeas about a Bachelor in Machine Learning/Data Sciences
Ideas about a Bachelor in Machine Learning/Data Sciences
Ā 
Analysis of Algorithms Syllabus
Analysis of Algorithms  SyllabusAnalysis of Algorithms  Syllabus
Analysis of Algorithms Syllabus
Ā 
17 vapnik chervonenkis dimension
17 vapnik chervonenkis dimension17 vapnik chervonenkis dimension
17 vapnik chervonenkis dimension
Ā 
A basic introduction to learning
A basic introduction to learningA basic introduction to learning
A basic introduction to learning
Ā 
Introduction Mathematics Intelligent Systems Syllabus
Introduction Mathematics Intelligent Systems SyllabusIntroduction Mathematics Intelligent Systems Syllabus
Introduction Mathematics Intelligent Systems Syllabus
Ā 
Introduction Machine Learning Syllabus
Introduction Machine Learning SyllabusIntroduction Machine Learning Syllabus
Introduction Machine Learning Syllabus
Ā 

Recently uploaded

chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
Ā 
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”soniya singh
Ā 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
Ā 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
Ā 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
Ā 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
Ā 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
Ā 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
Ā 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
Ā 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
Ā 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
Ā 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
Ā 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
Ā 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
Ā 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
Ā 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
Ā 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
Ā 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
Ā 

Recently uploaded (20)

chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
Ā 
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Model Call Girl in Narela Delhi reach out to us at šŸ”8264348440šŸ”
Ā 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
Ā 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
Ā 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
Ā 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
Ā 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
Ā 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Ā 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
Ā 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
Ā 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
Ā 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
Ā 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
Ā 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
Ā 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
Ā 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
Ā 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
Ā 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
Ā 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Ā 
young call girls in Rajiv ChowkšŸ” 9953056974 šŸ” Delhi escort Service
young call girls in Rajiv ChowkšŸ” 9953056974 šŸ” Delhi escort Serviceyoung call girls in Rajiv ChowkšŸ” 9953056974 šŸ” Delhi escort Service
young call girls in Rajiv ChowkšŸ” 9953056974 šŸ” Delhi escort Service
Ā 

27 Machine Learning Unsupervised Measure Properties

  • 1. Machine Learning for Data Mining Measure Properties for Clustering Andres Mendez-Vazquez July 27, 2015 1 / 37
  • 2. Images/cinvestav- Outline 1 Introduction Introduction 2 Features Types of Features 3 Similarity and Dissimilarity Measures Basic Deļ¬nition 4 Proximity Measures between Two Points Real-Valued Vectors Dissimilarity Measures Similarity Measures Discrete-Valued Vectors Dissimilarity Measures Similarity Measures 5 Other Examples Between Sets 2 / 37
  • 3. Images/cinvestav- Outline 1 Introduction Introduction 2 Features Types of Features 3 Similarity and Dissimilarity Measures Basic Deļ¬nition 4 Proximity Measures between Two Points Real-Valued Vectors Dissimilarity Measures Similarity Measures Discrete-Valued Vectors Dissimilarity Measures Similarity Measures 5 Other Examples Between Sets 3 / 37
  • 4. Images/cinvestav- Introduction Observation Clusters are considered as groups containing data objects that are similar to each other. When they are not We assume there is some way to measure that diļ¬€erence!!! 4 / 37
  • 5. Images/cinvestav- Introduction Observation Clusters are considered as groups containing data objects that are similar to each other. When they are not We assume there is some way to measure that diļ¬€erence!!! 4 / 37
  • 6. Images/cinvestav- Therefore It is possible to give a mathematical deļ¬nition of two important types of clustering Partitional Clustering Hierarchical Clustering Given Given a set of input patterns X = {x1, x2, ..., xN }, where xj = (x1j, x2j, ..., xdj)T āˆˆ Rd, with each measure xij called a feature (attribute, dimension, or variable). 5 / 37
  • 7. Images/cinvestav- Therefore It is possible to give a mathematical deļ¬nition of two important types of clustering Partitional Clustering Hierarchical Clustering Given Given a set of input patterns X = {x1, x2, ..., xN }, where xj = (x1j, x2j, ..., xdj)T āˆˆ Rd, with each measure xij called a feature (attribute, dimension, or variable). 5 / 37
  • 8. Images/cinvestav- Therefore It is possible to give a mathematical deļ¬nition of two important types of clustering Partitional Clustering Hierarchical Clustering Given Given a set of input patterns X = {x1, x2, ..., xN }, where xj = (x1j, x2j, ..., xdj)T āˆˆ Rd, with each measure xij called a feature (attribute, dimension, or variable). 5 / 37
  • 9. Images/cinvestav- Hard Partitional Clustering Deļ¬nition Hard partitional clustering attempts to seek a K-partition of X, C = {C1, C2, ..., CK } with K ā‰¤ N such that 1 Ci = āˆ… for i = 1, ..., K. 2 āˆŖK i=1Ci = X. 3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j. Clearly The third property can be relaxed to obtain the mathematical deļ¬nition of fuzzy and possibilistic clustering. 6 / 37
  • 10. Images/cinvestav- Hard Partitional Clustering Deļ¬nition Hard partitional clustering attempts to seek a K-partition of X, C = {C1, C2, ..., CK } with K ā‰¤ N such that 1 Ci = āˆ… for i = 1, ..., K. 2 āˆŖK i=1Ci = X. 3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j. Clearly The third property can be relaxed to obtain the mathematical deļ¬nition of fuzzy and possibilistic clustering. 6 / 37
  • 11. Images/cinvestav- Hard Partitional Clustering Deļ¬nition Hard partitional clustering attempts to seek a K-partition of X, C = {C1, C2, ..., CK } with K ā‰¤ N such that 1 Ci = āˆ… for i = 1, ..., K. 2 āˆŖK i=1Ci = X. 3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j. Clearly The third property can be relaxed to obtain the mathematical deļ¬nition of fuzzy and possibilistic clustering. 6 / 37
  • 12. Images/cinvestav- Hard Partitional Clustering Deļ¬nition Hard partitional clustering attempts to seek a K-partition of X, C = {C1, C2, ..., CK } with K ā‰¤ N such that 1 Ci = āˆ… for i = 1, ..., K. 2 āˆŖK i=1Ci = X. 3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j. Clearly The third property can be relaxed to obtain the mathematical deļ¬nition of fuzzy and possibilistic clustering. 6 / 37
  • 13. Images/cinvestav- Hard Partitional Clustering Deļ¬nition Hard partitional clustering attempts to seek a K-partition of X, C = {C1, C2, ..., CK } with K ā‰¤ N such that 1 Ci = āˆ… for i = 1, ..., K. 2 āˆŖK i=1Ci = X. 3 Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K and i = j. Clearly The third property can be relaxed to obtain the mathematical deļ¬nition of fuzzy and possibilistic clustering. 6 / 37
  • 14. Images/cinvestav- For example Given the membership Ai (xj) āˆˆ [0, 1] We can have K i=1 Ai (xj) = 1, āˆ€j and N j=1 Ai (xj) < N, āˆ€i. 7 / 37
  • 15. Images/cinvestav- Hierarchical Clustering Deļ¬nition Hierarchical clustering attempts to construct a tree-like, nested structure partition of X, H = {H1, ..., HQ} with Q ā‰¤ N such that If Ci āˆˆ Hm and Cj āˆˆ Hl and m > l imply Ci āŠ‚ Cj or Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K, i = j and m, l = 1, ..., Q. 8 / 37
  • 16. Images/cinvestav- Hierarchical Clustering Deļ¬nition Hierarchical clustering attempts to construct a tree-like, nested structure partition of X, H = {H1, ..., HQ} with Q ā‰¤ N such that If Ci āˆˆ Hm and Cj āˆˆ Hl and m > l imply Ci āŠ‚ Cj or Ci āˆ© Cj = āˆ… for all i, j = 1, ..., K, i = j and m, l = 1, ..., Q. 8 / 37
  • 17. Images/cinvestav- Outline 1 Introduction Introduction 2 Features Types of Features 3 Similarity and Dissimilarity Measures Basic Deļ¬nition 4 Proximity Measures between Two Points Real-Valued Vectors Dissimilarity Measures Similarity Measures Discrete-Valued Vectors Dissimilarity Measures Similarity Measures 5 Other Examples Between Sets 9 / 37
  • 18. Images/cinvestav- Features Usually A data object is described by a set of features or variables, usually represented as a multidimensional vector. Thus We tend to collect all that information as a pattern matrix of dimensionality N Ɨ d. Then A feature can be classiļ¬ed as continuous, discrete, or binary. 10 / 37
  • 19. Images/cinvestav- Features Usually A data object is described by a set of features or variables, usually represented as a multidimensional vector. Thus We tend to collect all that information as a pattern matrix of dimensionality N Ɨ d. Then A feature can be classiļ¬ed as continuous, discrete, or binary. 10 / 37
  • 20. Images/cinvestav- Features Usually A data object is described by a set of features or variables, usually represented as a multidimensional vector. Thus We tend to collect all that information as a pattern matrix of dimensionality N Ɨ d. Then A feature can be classiļ¬ed as continuous, discrete, or binary. 10 / 37
  • 21. Images/cinvestav- Thus Continuous A continuous feature takes values from an uncountably inļ¬nite range set. Discrete Discrete features only have ļ¬nite, or at most, a countably inļ¬nite number of values. Binary Binary or dichotomous features are a special case of discrete features when they have exactly two values. 11 / 37
  • 22. Images/cinvestav- Thus Continuous A continuous feature takes values from an uncountably inļ¬nite range set. Discrete Discrete features only have ļ¬nite, or at most, a countably inļ¬nite number of values. Binary Binary or dichotomous features are a special case of discrete features when they have exactly two values. 11 / 37
  • 23. Images/cinvestav- Thus Continuous A continuous feature takes values from an uncountably inļ¬nite range set. Discrete Discrete features only have ļ¬nite, or at most, a countably inļ¬nite number of values. Binary Binary or dichotomous features are a special case of discrete features when they have exactly two values. 11 / 37
  • 24. Images/cinvestav- Measurement Level Meaning Another property of features is the measurement level, which reļ¬‚ects the relative signiļ¬cance of numbers. We have four from lowest to highest Nominal, ordinal, interval, and ratio. 12 / 37
  • 25. Images/cinvestav- Measurement Level Meaning Another property of features is the measurement level, which reļ¬‚ects the relative signiļ¬cance of numbers. We have four from lowest to highest Nominal, ordinal, interval, and ratio. 12 / 37
  • 26. Images/cinvestav- Measurement Level Nominal Features at this level are represented with labels, states, or names. The numbers are meaningless in any mathematical sense and no mathematical calculation can be made. For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}. Ordinal Features at this level are also names, but with a certain order implied. However, the diļ¬€erence between the values is again meaningless. For example, abdominal distension āˆˆ {slight, moderate, high}. 13 / 37
  • 27. Images/cinvestav- Measurement Level Nominal Features at this level are represented with labels, states, or names. The numbers are meaningless in any mathematical sense and no mathematical calculation can be made. For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}. Ordinal Features at this level are also names, but with a certain order implied. However, the diļ¬€erence between the values is again meaningless. For example, abdominal distension āˆˆ {slight, moderate, high}. 13 / 37
  • 28. Images/cinvestav- Measurement Level Nominal Features at this level are represented with labels, states, or names. The numbers are meaningless in any mathematical sense and no mathematical calculation can be made. For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}. Ordinal Features at this level are also names, but with a certain order implied. However, the diļ¬€erence between the values is again meaningless. For example, abdominal distension āˆˆ {slight, moderate, high}. 13 / 37
  • 29. Images/cinvestav- Measurement Level Nominal Features at this level are represented with labels, states, or names. The numbers are meaningless in any mathematical sense and no mathematical calculation can be made. For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}. Ordinal Features at this level are also names, but with a certain order implied. However, the diļ¬€erence between the values is again meaningless. For example, abdominal distension āˆˆ {slight, moderate, high}. 13 / 37
  • 30. Images/cinvestav- Measurement Level Nominal Features at this level are represented with labels, states, or names. The numbers are meaningless in any mathematical sense and no mathematical calculation can be made. For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}. Ordinal Features at this level are also names, but with a certain order implied. However, the diļ¬€erence between the values is again meaningless. For example, abdominal distension āˆˆ {slight, moderate, high}. 13 / 37
  • 31. Images/cinvestav- Measurement Level Nominal Features at this level are represented with labels, states, or names. The numbers are meaningless in any mathematical sense and no mathematical calculation can be made. For example, peristalsis āˆˆ {hypermotile, normal, hypomotile, absent}. Ordinal Features at this level are also names, but with a certain order implied. However, the diļ¬€erence between the values is again meaningless. For example, abdominal distension āˆˆ {slight, moderate, high}. 13 / 37
  • 32. Images/cinvestav- Measurement Level Interval Features at this level oļ¬€er a meaningful interpretation of the diļ¬€erence between two values. However, there exists no true zero and the ratio between two values is meaningless. For example, the concept cold can be seen as an interval. Ratio Features at this level possess all the properties of the above levels. There is an absolute zero. Thus, we have the meaning of ratio!!! 14 / 37
  • 33. Images/cinvestav- Measurement Level Interval Features at this level oļ¬€er a meaningful interpretation of the diļ¬€erence between two values. However, there exists no true zero and the ratio between two values is meaningless. For example, the concept cold can be seen as an interval. Ratio Features at this level possess all the properties of the above levels. There is an absolute zero. Thus, we have the meaning of ratio!!! 14 / 37
  • 34. Images/cinvestav- Measurement Level Interval Features at this level oļ¬€er a meaningful interpretation of the diļ¬€erence between two values. However, there exists no true zero and the ratio between two values is meaningless. For example, the concept cold can be seen as an interval. Ratio Features at this level possess all the properties of the above levels. There is an absolute zero. Thus, we have the meaning of ratio!!! 14 / 37
  • 35. Images/cinvestav- Measurement Level Interval Features at this level oļ¬€er a meaningful interpretation of the diļ¬€erence between two values. However, there exists no true zero and the ratio between two values is meaningless. For example, the concept cold can be seen as an interval. Ratio Features at this level possess all the properties of the above levels. There is an absolute zero. Thus, we have the meaning of ratio!!! 14 / 37
  • 36. Images/cinvestav- Measurement Level Interval Features at this level oļ¬€er a meaningful interpretation of the diļ¬€erence between two values. However, there exists no true zero and the ratio between two values is meaningless. For example, the concept cold can be seen as an interval. Ratio Features at this level possess all the properties of the above levels. There is an absolute zero. Thus, we have the meaning of ratio!!! 14 / 37
  • 37. Images/cinvestav- Measurement Level Interval Features at this level oļ¬€er a meaningful interpretation of the diļ¬€erence between two values. However, there exists no true zero and the ratio between two values is meaningless. For example, the concept cold can be seen as an interval. Ratio Features at this level possess all the properties of the above levels. There is an absolute zero. Thus, we have the meaning of ratio!!! 14 / 37
  • 38. Images/cinvestav- Outline 1 Introduction Introduction 2 Features Types of Features 3 Similarity and Dissimilarity Measures Basic Deļ¬nition 4 Proximity Measures between Two Points Real-Valued Vectors Dissimilarity Measures Similarity Measures Discrete-Valued Vectors Dissimilarity Measures Similarity Measures 5 Other Examples Between Sets 15 / 37
  • 39. Images/cinvestav- Similarity Measures Deļ¬nition A Similarity Measure s on X is a function s : X Ɨ X ā†’ R (1) Such that 1 s (x, y) ā‰¤ s0. 2 s (x, x) = s0 for all x āˆˆ X. 3 s (x, y) = s (y, x) for all x, y āˆˆ X. 4 If s (x, y) = s0 ā‡ā‡’ x = y. 5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X. 16 / 37
  • 40. Images/cinvestav- Similarity Measures Deļ¬nition A Similarity Measure s on X is a function s : X Ɨ X ā†’ R (1) Such that 1 s (x, y) ā‰¤ s0. 2 s (x, x) = s0 for all x āˆˆ X. 3 s (x, y) = s (y, x) for all x, y āˆˆ X. 4 If s (x, y) = s0 ā‡ā‡’ x = y. 5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X. 16 / 37
  • 41. Images/cinvestav- Similarity Measures Deļ¬nition A Similarity Measure s on X is a function s : X Ɨ X ā†’ R (1) Such that 1 s (x, y) ā‰¤ s0. 2 s (x, x) = s0 for all x āˆˆ X. 3 s (x, y) = s (y, x) for all x, y āˆˆ X. 4 If s (x, y) = s0 ā‡ā‡’ x = y. 5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X. 16 / 37
  • 42. Images/cinvestav- Similarity Measures Deļ¬nition A Similarity Measure s on X is a function s : X Ɨ X ā†’ R (1) Such that 1 s (x, y) ā‰¤ s0. 2 s (x, x) = s0 for all x āˆˆ X. 3 s (x, y) = s (y, x) for all x, y āˆˆ X. 4 If s (x, y) = s0 ā‡ā‡’ x = y. 5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X. 16 / 37
  • 43. Images/cinvestav- Similarity Measures Deļ¬nition A Similarity Measure s on X is a function s : X Ɨ X ā†’ R (1) Such that 1 s (x, y) ā‰¤ s0. 2 s (x, x) = s0 for all x āˆˆ X. 3 s (x, y) = s (y, x) for all x, y āˆˆ X. 4 If s (x, y) = s0 ā‡ā‡’ x = y. 5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X. 16 / 37
  • 44. Images/cinvestav- Similarity Measures Deļ¬nition A Similarity Measure s on X is a function s : X Ɨ X ā†’ R (1) Such that 1 s (x, y) ā‰¤ s0. 2 s (x, x) = s0 for all x āˆˆ X. 3 s (x, y) = s (y, x) for all x, y āˆˆ X. 4 If s (x, y) = s0 ā‡ā‡’ x = y. 5 s (x, y) s (y, z) ā‰¤ s (x, z) [s (x, y) + s (y, z)] for all x, y, z āˆˆ X. 16 / 37
  • 45. Images/cinvestav- Dissimilarity Measures Deļ¬ntion A Dissimilarity Measure d on X is a function d : X Ɨ X ā†’ R (2) Such that 1 d (x, y) ā‰„ d0 for all x, y āˆˆ X. 2 d (x, x) = d0 for all x āˆˆ X. 3 d (x, y) = d (y, x) for all x, y āˆˆ X. 4 If d (x, y) = d0 ā‡ā‡’ x = y. 5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X. 17 / 37
  • 46. Images/cinvestav- Dissimilarity Measures Deļ¬ntion A Dissimilarity Measure d on X is a function d : X Ɨ X ā†’ R (2) Such that 1 d (x, y) ā‰„ d0 for all x, y āˆˆ X. 2 d (x, x) = d0 for all x āˆˆ X. 3 d (x, y) = d (y, x) for all x, y āˆˆ X. 4 If d (x, y) = d0 ā‡ā‡’ x = y. 5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X. 17 / 37
  • 47. Images/cinvestav- Dissimilarity Measures Deļ¬ntion A Dissimilarity Measure d on X is a function d : X Ɨ X ā†’ R (2) Such that 1 d (x, y) ā‰„ d0 for all x, y āˆˆ X. 2 d (x, x) = d0 for all x āˆˆ X. 3 d (x, y) = d (y, x) for all x, y āˆˆ X. 4 If d (x, y) = d0 ā‡ā‡’ x = y. 5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X. 17 / 37
  • 48. Images/cinvestav- Dissimilarity Measures Deļ¬ntion A Dissimilarity Measure d on X is a function d : X Ɨ X ā†’ R (2) Such that 1 d (x, y) ā‰„ d0 for all x, y āˆˆ X. 2 d (x, x) = d0 for all x āˆˆ X. 3 d (x, y) = d (y, x) for all x, y āˆˆ X. 4 If d (x, y) = d0 ā‡ā‡’ x = y. 5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X. 17 / 37
  • 49. Images/cinvestav- Dissimilarity Measures Deļ¬ntion A Dissimilarity Measure d on X is a function d : X Ɨ X ā†’ R (2) Such that 1 d (x, y) ā‰„ d0 for all x, y āˆˆ X. 2 d (x, x) = d0 for all x āˆˆ X. 3 d (x, y) = d (y, x) for all x, y āˆˆ X. 4 If d (x, y) = d0 ā‡ā‡’ x = y. 5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X. 17 / 37
  • 50. Images/cinvestav- Dissimilarity Measures Deļ¬ntion A Dissimilarity Measure d on X is a function d : X Ɨ X ā†’ R (2) Such that 1 d (x, y) ā‰„ d0 for all x, y āˆˆ X. 2 d (x, x) = d0 for all x āˆˆ X. 3 d (x, y) = d (y, x) for all x, y āˆˆ X. 4 If d (x, y) = d0 ā‡ā‡’ x = y. 5 d (x, z) ā‰¤ d (x, y) + d (y, z) for all x, y, z āˆˆ X. 17 / 37
  • 51. Images/cinvestav- Outline 1 Introduction Introduction 2 Features Types of Features 3 Similarity and Dissimilarity Measures Basic Deļ¬nition 4 Proximity Measures between Two Points Real-Valued Vectors Dissimilarity Measures Similarity Measures Discrete-Valued Vectors Dissimilarity Measures Similarity Measures 5 Other Examples Between Sets 18 / 37
  • 52. Images/cinvestav- Dissimilarity Measures The Hamming distance for {0, 1} The vectors here are collections of zeros and ones. Thus, dH (x, y) = d i=1 (xi āˆ’ yi)2 (3) Euclidean distance d (x, y) = d i=1 |xi āˆ’ yi| 1 2 2 (4) 19 / 37
  • 53. Images/cinvestav- Dissimilarity Measures The Hamming distance for {0, 1} The vectors here are collections of zeros and ones. Thus, dH (x, y) = d i=1 (xi āˆ’ yi)2 (3) Euclidean distance d (x, y) = d i=1 |xi āˆ’ yi| 1 2 2 (4) 19 / 37
  • 54. Images/cinvestav- Dissimilarity Measures The weighted lp metric DMs dp (x, y) = d i=1 wi |xi āˆ’ yi|p 1 p (5) Where Where xi, yi are the ith coordinates of x and y, i = 1, ..., d and wi ā‰„ 0 is the ith weight coeļ¬ƒcient. Important A well-known representative of the latter category of measures is the Euclidean distance. 20 / 37
  • 55. Images/cinvestav- Dissimilarity Measures The weighted lp metric DMs dp (x, y) = d i=1 wi |xi āˆ’ yi|p 1 p (5) Where Where xi, yi are the ith coordinates of x and y, i = 1, ..., d and wi ā‰„ 0 is the ith weight coeļ¬ƒcient. Important A well-known representative of the latter category of measures is the Euclidean distance. 20 / 37
  • 56. Images/cinvestav- Dissimilarity Measures The weighted lp metric DMs dp (x, y) = d i=1 wi |xi āˆ’ yi|p 1 p (5) Where Where xi, yi are the ith coordinates of x and y, i = 1, ..., d and wi ā‰„ 0 is the ith weight coeļ¬ƒcient. Important A well-known representative of the latter category of measures is the Euclidean distance. 20 / 37
  • 57. Images/cinvestav- Dissimilarity Measures The weighted l2 metric DM can be further generalized as follows d (x, y) = (x āˆ’ y)T B (x āˆ’ y) (6) Where B is a symmetric, positive deļ¬nite matrix A special case This includes the Mahalanobis distance as a special case 21 / 37
  • 58. Images/cinvestav- Dissimilarity Measures The weighted l2 metric DM can be further generalized as follows d (x, y) = (x āˆ’ y)T B (x āˆ’ y) (6) Where B is a symmetric, positive deļ¬nite matrix A special case This includes the Mahalanobis distance as a special case 21 / 37
  • 59. Images/cinvestav- Dissimilarity Measures Special lp metric DMs that are also encountered in practice are d1 (x, y) = d i=1 wi |xi āˆ’ yi| (Weighted Manhattan Norm) dāˆž (x, y) = max 1ā‰¤iā‰¤d wi |xi āˆ’ yi| (Weighted lāˆž Norm) 22 / 37
  • 60. Images/cinvestav- Dissimilarity Measures Special lp metric DMs that are also encountered in practice are d1 (x, y) = d i=1 wi |xi āˆ’ yi| (Weighted Manhattan Norm) dāˆž (x, y) = max 1ā‰¤iā‰¤d wi |xi āˆ’ yi| (Weighted lāˆž Norm) 22 / 37
  • 61. Images/cinvestav- Similarity Measures The inner product sinner (x, y) = xT y = d i=1 xiyi (7) Closely related to the inner product is the cosine similarity measure scosine (x, y) = xT y x y (8) Where x = d i=1 x2 i and y = d i=1 y2 i , the length of the vectors!!! 23 / 37
  • 62. Images/cinvestav- Similarity Measures The inner product sinner (x, y) = xT y = d i=1 xiyi (7) Closely related to the inner product is the cosine similarity measure scosine (x, y) = xT y x y (8) Where x = d i=1 x2 i and y = d i=1 y2 i , the length of the vectors!!! 23 / 37
  • 63. Images/cinvestav- Similarity Measures Pearsonā€™s correlation coeļ¬ƒcient rPearson (x, y) = (x āˆ’ x)T (y āˆ’ y) x y (9) Where x = 1 d d i=1 xi and y = 1 d d i=1 yi (10) Properties It is clear that rPearson (x, y) takes values between -1 and +1. The diļ¬€erence between sinner and rPearson does not depend directly on x and y but on their corresponding diļ¬€erence vectors. 24 / 37
  • 64. Images/cinvestav- Similarity Measures Pearsonā€™s correlation coeļ¬ƒcient rPearson (x, y) = (x āˆ’ x)T (y āˆ’ y) x y (9) Where x = 1 d d i=1 xi and y = 1 d d i=1 yi (10) Properties It is clear that rPearson (x, y) takes values between -1 and +1. The diļ¬€erence between sinner and rPearson does not depend directly on x and y but on their corresponding diļ¬€erence vectors. 24 / 37
  • 65. Images/cinvestav- Similarity Measures Pearsonā€™s correlation coeļ¬ƒcient rPearson (x, y) = (x āˆ’ x)T (y āˆ’ y) x y (9) Where x = 1 d d i=1 xi and y = 1 d d i=1 yi (10) Properties It is clear that rPearson (x, y) takes values between -1 and +1. The diļ¬€erence between sinner and rPearson does not depend directly on x and y but on their corresponding diļ¬€erence vectors. 24 / 37
  • 66. Images/cinvestav- Similarity Measures Pearsonā€™s correlation coeļ¬ƒcient rPearson (x, y) = (x āˆ’ x)T (y āˆ’ y) x y (9) Where x = 1 d d i=1 xi and y = 1 d d i=1 yi (10) Properties It is clear that rPearson (x, y) takes values between -1 and +1. The diļ¬€erence between sinner and rPearson does not depend directly on x and y but on their corresponding diļ¬€erence vectors. 24 / 37
  • 67. Images/cinvestav- Similarity Measures The Tanimoto measure or distance sT (x, y) = xT y x 2 + y 2 āˆ’ xT y (11) Something Notable sT (x, y) = 1 1 + (xāˆ’y)T (xāˆ’y) xT y (12) Meaning The Tanimoto measure is inversely proportional to the squared Euclidean distance. 25 / 37
  • 68. Images/cinvestav- Similarity Measures The Tanimoto measure or distance sT (x, y) = xT y x 2 + y 2 āˆ’ xT y (11) Something Notable sT (x, y) = 1 1 + (xāˆ’y)T (xāˆ’y) xT y (12) Meaning The Tanimoto measure is inversely proportional to the squared Euclidean distance. 25 / 37
  • 69. Images/cinvestav- Similarity Measures The Tanimoto measure or distance sT (x, y) = xT y x 2 + y 2 āˆ’ xT y (11) Something Notable sT (x, y) = 1 1 + (xāˆ’y)T (xāˆ’y) xT y (12) Meaning The Tanimoto measure is inversely proportional to the squared Euclidean distance. 25 / 37
  • 70. Images/cinvestav- Outline 1 Introduction Introduction 2 Features Types of Features 3 Similarity and Dissimilarity Measures Basic Deļ¬nition 4 Proximity Measures between Two Points Real-Valued Vectors Dissimilarity Measures Similarity Measures Discrete-Valued Vectors Dissimilarity Measures Similarity Measures 5 Other Examples Between Sets 26 / 37
  • 71. Images/cinvestav- We consider now vectors whose coordinates belong to a ļ¬nite set Given F = {0, 1, 2, ..., k āˆ’ 1} where k is a positive integer There are exactly kd vectors x āˆˆ Fd Now consider x, y āˆˆ Fd With A (x, y) = [aij], i, j = 0, 1, ..., k āˆ’ 1 be a k Ɨ k matrix Where The element aij is the number of places where the ļ¬rst vector has the i symbol and the corresponding element of the second vector has the j symbol. 27 / 37
  • 72. Images/cinvestav- We consider now vectors whose coordinates belong to a ļ¬nite set Given F = {0, 1, 2, ..., k āˆ’ 1} where k is a positive integer There are exactly kd vectors x āˆˆ Fd Now consider x, y āˆˆ Fd With A (x, y) = [aij], i, j = 0, 1, ..., k āˆ’ 1 be a k Ɨ k matrix Where The element aij is the number of places where the ļ¬rst vector has the i symbol and the corresponding element of the second vector has the j symbol. 27 / 37
  • 73. Images/cinvestav- We consider now vectors whose coordinates belong to a ļ¬nite set Given F = {0, 1, 2, ..., k āˆ’ 1} where k is a positive integer There are exactly kd vectors x āˆˆ Fd Now consider x, y āˆˆ Fd With A (x, y) = [aij], i, j = 0, 1, ..., k āˆ’ 1 be a k Ɨ k matrix Where The element aij is the number of places where the ļ¬rst vector has the i symbol and the corresponding element of the second vector has the j symbol. 27 / 37
  • 74. Images/cinvestav- Using Indicator functions We can think of each element aij aij = d k= I [(i, j) == (xk, yk)] (13) Thanks to Ricardo Llamas and Gibran Felix!!! Class Summer 2015. 28 / 37
  • 75. Images/cinvestav- Contingency Matrix Thus This matrix is called a contingency table. For example if d = 6 and k = 3 With x = [0, 1, 2, 1, 2, 1]T and y = [1, 0, 2, 1, 0, 1]T Then A (x, y) = ļ£® ļ£Æ ļ£° 0 1 0 1 2 0 1 0 1 ļ£¹ ļ£ŗ ļ£» (14) 29 / 37
  • 76. Images/cinvestav- Contingency Matrix Thus This matrix is called a contingency table. For example if d = 6 and k = 3 With x = [0, 1, 2, 1, 2, 1]T and y = [1, 0, 2, 1, 0, 1]T Then A (x, y) = ļ£® ļ£Æ ļ£° 0 1 0 1 2 0 1 0 1 ļ£¹ ļ£ŗ ļ£» (14) 29 / 37
  • 77. Images/cinvestav- Contingency Matrix Thus This matrix is called a contingency table. For example if d = 6 and k = 3 With x = [0, 1, 2, 1, 2, 1]T and y = [1, 0, 2, 1, 0, 1]T Then A (x, y) = ļ£® ļ£Æ ļ£° 0 1 0 1 2 0 1 0 1 ļ£¹ ļ£ŗ ļ£» (14) 29 / 37
  • 78. Images/cinvestav- Thus, we have that Easy to verify that kāˆ’1 i=0 kāˆ’1 j=0 aij = d (15) Something Notable Most of the proximity measures between two discrete-valued vectors may be expressed as combinations of elements of matrix A (x, y). 30 / 37
  • 79. Images/cinvestav- Thus, we have that Easy to verify that kāˆ’1 i=0 kāˆ’1 j=0 aij = d (15) Something Notable Most of the proximity measures between two discrete-valued vectors may be expressed as combinations of elements of matrix A (x, y). 30 / 37
  • 80. Images/cinvestav- Dissimilarity Measures The Hamming distance It is deļ¬ned as the number of places where two vectors diļ¬€er: d (x, y) = kāˆ’1 i=0 kāˆ’1 j=0,j=i aij (16) The summation of all the oļ¬€-diagonal elements of A, which indicate the positions where x and y diļ¬€er. Special case k = 2, thus vectors are binary valued The vectors x āˆˆ Fd are binary valued and the Hamming distance becomes dH (x, y) = d i=1 (xi + yi āˆ’ 2xiyi) = d i=1 (xi āˆ’ yi)2 31 / 37
  • 81. Images/cinvestav- Dissimilarity Measures The Hamming distance It is deļ¬ned as the number of places where two vectors diļ¬€er: d (x, y) = kāˆ’1 i=0 kāˆ’1 j=0,j=i aij (16) The summation of all the oļ¬€-diagonal elements of A, which indicate the positions where x and y diļ¬€er. Special case k = 2, thus vectors are binary valued The vectors x āˆˆ Fd are binary valued and the Hamming distance becomes dH (x, y) = d i=1 (xi + yi āˆ’ 2xiyi) = d i=1 (xi āˆ’ yi)2 31 / 37
  • 82. Images/cinvestav- Dissimilarity Measures The l1 distance It is deļ¬ned as in the case of the continuous-valued vectors, d1 (x, y) = d i=1 wi |xi āˆ’ yi| (17) 32 / 37
  • 83. Images/cinvestav- Similarity Measures The Tanimoto measure Given two sets X and Y , we have that sT (X, Y ) = nXāˆ©Y nX + nY āˆ’ nXāˆ©Y (18) where nX = |X|, nY = |Y |, nXāˆ©Y = |X āˆ© Y |. Thus, if x, y are discrete-valued vectors The measure takes into account all pairs of corresponding coordinates, except those whose corresponding coordinates (xi, yi) are both 0. 33 / 37
  • 84. Images/cinvestav- Similarity Measures The Tanimoto measure Given two sets X and Y , we have that sT (X, Y ) = nXāˆ©Y nX + nY āˆ’ nXāˆ©Y (18) where nX = |X|, nY = |Y |, nXāˆ©Y = |X āˆ© Y |. Thus, if x, y are discrete-valued vectors The measure takes into account all pairs of corresponding coordinates, except those whose corresponding coordinates (xi, yi) are both 0. 33 / 37
  • 85. Images/cinvestav- Similarity Measures The Tanimoto measure Given two sets X and Y , we have that sT (X, Y ) = nXāˆ©Y nX + nY āˆ’ nXāˆ©Y (18) where nX = |X|, nY = |Y |, nXāˆ©Y = |X āˆ© Y |. Thus, if x, y are discrete-valued vectors The measure takes into account all pairs of corresponding coordinates, except those whose corresponding coordinates (xi, yi) are both 0. 33 / 37
  • 86. Images/cinvestav- Thus We now deļ¬ne nx = kāˆ’1 i=1 kāˆ’1 j=0 aij and ny = kāˆ’1 i=0 kāˆ’1 j=1 aij In other words nx and ny denotes the number of the nonzero coordinate of x and y. Thus, we have sT (x, y) == kāˆ’1 i=1 kāˆ’1 j=i aii nX + nY āˆ’ kāˆ’1 i=1 kāˆ’1 j=i aij (19) 34 / 37
  • 87. Images/cinvestav- Thus We now deļ¬ne nx = kāˆ’1 i=1 kāˆ’1 j=0 aij and ny = kāˆ’1 i=0 kāˆ’1 j=1 aij In other words nx and ny denotes the number of the nonzero coordinate of x and y. Thus, we have sT (x, y) == kāˆ’1 i=1 kāˆ’1 j=i aii nX + nY āˆ’ kāˆ’1 i=1 kāˆ’1 j=i aij (19) 34 / 37
  • 88. Images/cinvestav- Thus We now deļ¬ne nx = kāˆ’1 i=1 kāˆ’1 j=0 aij and ny = kāˆ’1 i=0 kāˆ’1 j=1 aij In other words nx and ny denotes the number of the nonzero coordinate of x and y. Thus, we have sT (x, y) == kāˆ’1 i=1 kāˆ’1 j=i aii nX + nY āˆ’ kāˆ’1 i=1 kāˆ’1 j=i aij (19) 34 / 37
  • 89. Images/cinvestav- Outline 1 Introduction Introduction 2 Features Types of Features 3 Similarity and Dissimilarity Measures Basic Deļ¬nition 4 Proximity Measures between Two Points Real-Valued Vectors Dissimilarity Measures Similarity Measures Discrete-Valued Vectors Dissimilarity Measures Similarity Measures 5 Other Examples Between Sets 35 / 37
  • 90. Images/cinvestav- Between Sets Jaccard Similarity Given two sets X and Y , we have that J (X, Y ) = |X āˆ© Y | |X āˆŖ Y | (20) 36 / 37
  • 91. Images/cinvestav- Between Sets Jaccard Similarity Given two sets X and Y , we have that J (X, Y ) = |X āˆ© Y | |X āˆŖ Y | (20) 36 / 37
  • 92. Images/cinvestav- Although there are more stuļ¬€ to look for Please Refer to 1 Theodoridisā€™ Book Chapter 11. 2 Rui Xu and Don Wunsch. 2009. Clustering. Wiley-IEEE Press. 37 / 37