11. Unsupervised Classification:
Cluster seeking by the Maximin Algorithm:
1. Choose any vector Xl as the first cluster centre.
2. The vector Xm that maximises the distance to the first cluster centre becomes the second cluster centre.
3. Given k cluster centres, the vector Xn that maximises the minimum distance to those k centres becomes the (k+1)th cluster centre.
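A minimal Python sketch of this seeking rule follows (illustration only; the function name, the choice of the first point, and the use of squared Euclidean distance are assumptions, not prescribed by the slides):

```python
import numpy as np

def maximin_centres(X, k):
    """Pick k initial cluster centres from the rows of X using the Maximin rule."""
    X = np.asarray(X, dtype=float)
    centres = [X[0]]                                   # step 1: any vector as the first centre
    d_min = np.sum((X - centres[0]) ** 2, axis=1)      # distance of each vector to its nearest centre
    while len(centres) < k:
        idx = np.argmax(d_min)                         # steps 2-3: the vector farthest from its
        centres.append(X[idx])                         # nearest centre becomes the next centre
        d_min = np.minimum(d_min, np.sum((X - X[idx]) ** 2, axis=1))
    return np.array(centres)

# e.g. on the eight points used in the following slides:
X = np.array([[0, 4], [0, 5], [1, 5], [4, 0], [5, 0], [5, 1], [4, 5], [5, 5]])
print(maximin_centres(X, 3))   # (0,4), (5,0), (5,5); the exact result depends on the first point chosen
```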
12. Refinement of Cluster Centres using the K-Means Algorithm
[Figure: 2D feature space, x and y from 0 to 6, showing the example points and the cluster centres C0, C1, C2]
Examples:
#   x   y
1   0   4
2   0   5
3   1   5
4   4   0
5   5   0
6   5   1
7   4   5
8   5   5

Initial Cluster Centres:
C0i = (0,5), C1i = (5,0), C2i = (5,5)
13. Refinement of Cluster Centres using the K-Means Algorithm
2D feature space, 3-cluster problem. K-Means Algorithm, first iteration.

No.  Pi (x,y)  D(Pi,C0)  D(Pi,C1)  D(Pi,C2)  min(D)  k = argmin over k=0,1,2
1    (0,4)
2    (0,5)
3    (1,5)
4    (4,0)
5    (5,0)
6    (5,1)
7    (4,5)
8    (5,5)
(The distance columns, the cluster populations Nk and the refined centres Ckf, for k = 0,1,2, are filled in on the next slide.)

Initial cluster centres: C0i = (0,5), C1i = (5,0), C2i = (5,5)
Refined cluster centres: C0f = (0.33, 4.67), C1f = (4.67, 0.33), C2f = (4.5, 5)
14. Refinement of Cluster Centres using the K-Means Algorithm
2D feature space, 3-cluster problem. K-Means Algorithm, first iteration (distances are squared Euclidean).

No.  Pi (x,y)  D(Pi,C0)  D(Pi,C1)  D(Pi,C2)  min(D)  k = argmin over k=0,1,2
1    (0,4)     1         41        26        1       0
2    (0,5)     0         50        25        0       0
3    (1,5)     1         41        16        1       0
4    (4,0)     41        1         26        1       1
5    (5,0)     50        0         25        0       1
6    (5,1)     41        1         16        1       1
7    (4,5)     16        26        1         1       2
8    (5,5)     25        25        0         0       2

Cluster populations: N0 = 3, N1 = 3, N2 = 2
Initial cluster centres: C0i = (0,5), C1i = (5,0), C2i = (5,5)
Refined cluster centres: C0f = (0.33, 4.67), C1f = (4.67, 0.33), C2f = (4.5, 5)
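The table can be reproduced with a few lines of Python; the sketch below (my own illustration, not from the slides) computes the squared Euclidean distances D, the nearest-centre index k, the populations Nk and the refined centres Ckf for this first iteration:

```python
import numpy as np

points  = np.array([[0, 4], [0, 5], [1, 5], [4, 0], [5, 0], [5, 1], [4, 5], [5, 5]], dtype=float)
centres = np.array([[0, 5], [5, 0], [5, 5]], dtype=float)     # C0i, C1i, C2i

# squared Euclidean distance of every point to every centre: shape (8, 3)
D = ((points[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
k = D.argmin(axis=1)                                          # nearest centre per point

for j in range(len(centres)):
    members = points[k == j]
    print(f"N{j} = {len(members)}, C{j}f = {members.mean(axis=0)}")
# N0 = 3, C0f = (0.33, 4.67);  N1 = 3, C1f = (4.67, 0.33);  N2 = 2, C2f = (4.5, 5)
```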
17. Cluster Refinement by the K-Means Algorithm
1. Determine the initial K cluster centres using the Maximin algorithm.
2. Assign each object to the group that has the closest centroid.
3. When all objects have been assigned, recalculate the positions of the K centroids.
4. Repeat steps 2 and 3 until the centroids no longer move.
This produces a separation of the objects into groups from which the metric to be minimised can be calculated.
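A minimal iterative sketch of steps 2-4 in Python (my own illustration; it uses squared Euclidean distance and assumes no cluster becomes empty):

```python
import numpy as np

def kmeans(X, centres, max_iter=100):
    """Refine the given initial centres by repeating the assign / recompute steps."""
    X, centres = np.asarray(X, float), np.asarray(centres, float)
    for _ in range(max_iter):
        # step 2: assign every object to the group with the closest centroid
        D = ((X[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
        labels = D.argmin(axis=1)
        # step 3: recalculate each centroid as the mean of its assigned objects
        new_centres = np.array([X[labels == j].mean(axis=0) for j in range(len(centres))])
        # step 4: stop when the centroids no longer move
        if np.allclose(new_centres, centres):
            break
        centres = new_centres
    return centres, labels
```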
18. Classification:
1. Supervised Classification
• Training samples have known labels, which makes it possible to estimate the characteristics of each class as a set of parameters representing that class.
• The goal is to assign each pixel vector a label by computing the distance of the pixel vector to each class and choosing the class to which it has minimum distance, using a Euclidean, City Block, Mahalanobis or other distance measure (a sketch follows this list).
2. Unsupervised Classification
• Initially the samples have no labels.
• A clustering technique is employed to partition the n samples of the dataset into k clusters, with each sample belonging to exactly one cluster.
• K-means is a clustering algorithm whose initial cluster centres can be obtained randomly, with a cluster-seeking algorithm such as Maximin, or with a neural network such as a Self-Organising Map.
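To illustrate the supervised case in point 1, the sketch below assigns a pixel vector to the class whose mean is nearest under a chosen distance measure (the class means, covariances and pixel values here are made-up illustration data, not from the slides):

```python
import numpy as np

def minimum_distance_label(x, class_means, class_covs=None, metric="euclidean"):
    """Index of the class whose mean is closest to pixel vector x under the given metric."""
    x = np.asarray(x, dtype=float)
    dists = []
    for i, mu in enumerate(class_means):
        diff = x - np.asarray(mu, dtype=float)
        if metric == "euclidean":
            d = np.sqrt(diff @ diff)
        elif metric == "cityblock":
            d = np.abs(diff).sum()
        elif metric == "mahalanobis":
            d = np.sqrt(diff @ np.linalg.inv(class_covs[i]) @ diff)
        else:
            raise ValueError(f"unknown metric: {metric}")
        dists.append(d)
    return int(np.argmin(dists))

means = [np.array([0.0, 5.0]), np.array([5.0, 0.0])]   # hypothetical class means
covs  = [np.eye(2), np.eye(2)]                         # hypothetical class covariances
print(minimum_distance_label([1.0, 4.0], means, covs, metric="mahalanobis"))   # -> 0
```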
19. ISODATA Algorithms – Extension of K-Means Algorithm
ISODATA – Iterative Self-Organising Data Analysis Technique
[Figure: 2D feature space (x1, x2) showing seven clusters C1 to C7]
Cluster  #Examples  Mean Vector  Variance Vector  Principal Eigenvector v1ᵀ  Principal Eigenvalue
C1       100        (2, 8)       (-0.25, 0.25)
C2       100        (2, 7.5)     (-0.25, 0.25)
C3       100        (2.5, 8)     (-0.25, 0.25)
C4       1000       (5, 5)       (7, 7)           (0.707, 0.707)             10
C5       100        (6, 3)       (0.25, 0.25)
C6       300        (8, 3)       (1, 1)           (1, 0)
C7       20         (9, 6)       (0.12, 0.12)
Need for
a. Splitting
b. Merging
c. Rejecting
d. Refining (K-Means)
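The per-cluster statistics in the table (variance vector, principal eigenvector and eigenvalue) drive the decisions listed above. A sketch of how they could be computed, assuming `members` is an (N, 2) array holding one cluster's samples (the slides give only the resulting statistics, not the raw samples):

```python
import numpy as np

def cluster_statistics(members):
    """Mean vector, variance vector, principal eigenvector and principal eigenvalue of one cluster."""
    members = np.asarray(members, dtype=float)
    mean = members.mean(axis=0)
    var  = members.var(axis=0)                    # per-feature variance vector
    cov  = np.cov(members, rowvar=False)          # sample covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)        # eigenvalues in ascending order
    return mean, var, eigvecs[:, -1], eigvals[-1] # principal axis and its eigenvalue
```

A cluster whose principal eigenvalue exceeds the splitting threshold (7 units in the later slides) is a candidate for splitting; clusters whose mean vectors lie close together are candidates for merging; clusters with too few members are candidates for rejection.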
20. ISODATA Algorithms – Extension of K-Means Algorithm
Rejecting a Cluster

Before rejection:
Cluster  #Examples  Mean Vector
C1       100        (2, 8)
C2       100        (2, 7.5)
C3       100        (2.5, 8)
C4       1000       (5, 5)
C5       100        (6, 3)
C6       300        (8, 3)
C7       20         (9, 6)

Reject a cluster if its population ≤ 50.

After rejection:
Cluster  #Examples  Mean Vector
C1       100        (2, 8)
C2       100        (2, 7.5)
C3       100        (2.5, 8)
C4       1000       (5, 5)
C5       100        (6, 3)
C6       300        (8, 3)

Rejecting Cluster 7: the examples of Cluster 7 are redistributed to the other clusters. 7 clusters to 6 clusters.
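A sketch of the rejection step, assuming `labels` holds each example's current cluster index and `centres` the current mean vectors (variable names and the reassignment-by-nearest-centre detail are my own assumptions):

```python
import numpy as np

def reject_small_clusters(X, labels, centres, min_population=50):
    """Drop clusters with population <= min_population and reassign their examples."""
    X, centres = np.asarray(X, dtype=float), np.asarray(centres, dtype=float)
    counts = np.bincount(labels, minlength=len(centres))
    kept_centres = centres[counts > min_population]          # surviving clusters only
    # redistribute every example to its nearest surviving centre
    D = ((X[:, None, :] - kept_centres[None, :, :]) ** 2).sum(axis=2)
    return D.argmin(axis=1), kept_centres
```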
24. ISODATA Algorithms – Extension of K-Means Algorithm
Merging of Clusters: D(C1,C2), D(C1,C3), D(C2,C3) < 1, so clusters C1, C2 and C3 are merged into CN1.

Mean of merged cluster CN1 = (100×(2,8) + 100×(2,7.5) + 100×(2.5,8)) / 300 = (2.16, 7.83)
Population of merged cluster CN1 = 100 + 100 + 100 = 300

Before merging:
Cluster  #Examples  Mean Vector
C1       100        (2, 8)
C2       100        (2, 7.5)
C3       100        (2.5, 8)
C4       1000       (5, 5)
C5       100        (6, 3)
C6       300        (8, 3)

After merging:
Cluster  #Examples  Mean Vector
CN1      300        (2.16, 7.83)
C4       1000       (5, 5)
C5       100        (6, 3)
C6       300        (8, 3)

6 clusters to 4 clusters
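The merged mean is simply the population-weighted average of the three means; a short Python check of the arithmetic above (values taken from the table):

```python
import numpy as np

populations = np.array([100, 100, 100])                       # C1, C2, C3
means       = np.array([[2.0, 8.0], [2.0, 7.5], [2.5, 8.0]])  # their mean vectors

merged_population = populations.sum()                          # 300
merged_mean = (populations[:, None] * means).sum(axis=0) / merged_population
print(merged_population, merged_mean)                          # 300 [2.1667 7.8333]
```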
26. ISODATA Algorithms – Extension of K-Means Algorithm
Splitting of Clusters: split a cluster if its principal eigenvalue exceeds 7 units.

Cluster  #Examples  Mean Vector   Variance Vector  Principal Eigenvector v1ᵀ  Principal Eigenvalue λ1
CN1      300        (2.16, 7.83)
C4       1000       (5, 5)        (7, 7)           (0.707, 0.707)             10
C5       100        (6, 3)        (0.25, 0.25)
C6       300        (8, 3)        (1, 1)           (1, 0)

λ1 of C4 exceeds the threshold, so C4 is split into two new clusters C4a and C4b:
Mean of C4a = (5,5) - √λ1 × v1ᵀ = (5,5) - 3.16 × (0.707, 0.707) = (2.76, 2.76)
Mean of C4b = (5,5) + √λ1 × v1ᵀ = (5,5) + 3.16 × (0.707, 0.707) = (7.23, 7.23)
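The split places the two new means one principal standard deviation (√λ1 ≈ 3.16) either side of the old mean along the principal eigenvector; a short Python check of the numbers above:

```python
import numpy as np

mean = np.array([5.0, 5.0])        # mean vector of C4
v1   = np.array([0.707, 0.707])    # principal eigenvector of C4
lam1 = 10.0                        # principal eigenvalue of C4

offset = np.sqrt(lam1) * v1        # one standard deviation along the principal axis
print(mean - offset)               # mean of C4a, approximately (2.76, 2.76)
print(mean + offset)               # mean of C4b, approximately (7.23, 7.23)
```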
27. ISODATA Algorithms – Extension of K-Means Algorithm
Splitting of Clusters: split a cluster if its principal eigenvalue exceeds 7 units.

Before splitting:
Cluster  Mean Vector
CN1      (2.16, 7.83)
C4       (5, 5)
C5       (6, 3)
C6       (8, 3)

After splitting:
Cluster  Mean Vector
CN1      (2.16, 7.83)
C4a      (2.76, 2.76)
C4b      (7.23, 7.23)
C5       (6, 3)
C6       (8, 3)

4 clusters to 5 clusters
28. ISODATA Algorithms – Extension of K-Means Algorithm
K-Means Algorithm – fixed number of clusters
ISODATA Algorithm – the number of clusters varies from iteration to iteration