SlideShare a Scribd company logo
Segmenting Images through Representation as a Content-Rich Network
Author: Yanira Corvera, Advisor: Dr. Laura Smith
Department of Mathematics, California State University Fullerton
Abstract
Image segmentation is the process consisting of partitioning digital images into multiple
sections with the goal of detecting objects within the image. In this work, we represent an
image as a graph, where each pixel is a node in the graph and an edge is present between
two nodes if they are close spatially within the image. In particular, we treat the image
as a content-rich graph, where each node is associated with a vector of different features
of the image, such as RGB and HSV color representations and textures. We consider
graph cut and clustering methods to partition the nodes, resulting in a segmented image.
We use the information theoretic approach, the Content Map Equation, which minimizes
the description length of the network by compressing the graph into distinct modules
of nodes, giving the partitioning of the image. We compare this method to existing
graph-cut methods, such as normalized cut, on a variety of images. Implementation
of the Content Map Equation on a simple black and white image provides promising
results which encourage further investigation on more complex images.
Introduction
• We want to partition a given image.
• An image is represented as a network with pixels as the nodes.
• Each node has an associated feature vector, e.g. RGB.
• A dictionary vector is created from the feature values.
• The image is segmented by using graph-cut methods
Methods
• Weights of edges between nodes are calculated by:
wij = e
−||F(i)−F(j)||
σF
feature
distance
∗ e
−||X(i)−X(j)||
σX
spatial
distance
,
if ||X(i) − X(j)|| < r, and wij = 0 otherwise [1].
• For an RGB feature vector, we define 8 words per color band
and create dictionary vectors of length 24, e.g.
Fv = [0, 65, 198] → {da
j} = [10000000|00001000|00000001].
• The 8 words are determined by the CDF of the color band,
giving each word an equal probability of 1/8 of occuring.
Figure 1: The CDF of an image’s red color band that is utilized
to define words for the dictionary vector.
Segmented Image Results of Toy Image with Noise
(a) Noisy Image with
= 0
(b) Content Map Equation Results (c) K-Means
Clustering Results
(d) Noisy Image with
= 0.00006
(e) Content Map Equation Results (f) K-Means
Clustering Results
(g) Noisy Image with
= 0.00008
(h) Content Map Equation Results (i) K-Means
Clustering Results
(j) Noisy Image with
= 0.0001
(k) Content Map Equation Results (l) K-Means
Clustering Results
Figure 2: Partitions of Noisy Toy Images with the Implementation of the Content Map Equation (CME) and K-Means Clustering Method
on K eigenvectors. These are the resulting segmented noisy toy images where we defined 8 words for the CME and used constraints F = 120 and R = 24
with σF = 72 and σX = 98. The partitions resulting from the K-Means Clustering method on K eigenvectors required the constraints F = 176, R = 10, and
K = 3 with σF = 5 and σX = 4.
Graph Cut Methods
We use the Content Map Equation (CME) [2] to partition
the graph. This is done by using a bottom-up approach [3] to
find a local minimum of
LC(M) = ( q H(Q))
contribution of
movement
between
modules
+


m
i=1
pi
H(Pi
)


contribution of
movement
within
modules
+


m
i=1
pi
H(Xi
)


contribution
of node
dictionary
vectors
.
• H(Q) is the entropy of the movement between modules.
• H(Pi
) is the entropy of movement within a module.
• H(Xi
) is the entropy of movement between dictionary words.
• These terms are weighted by the frequencies of
• exiting a module, q
• exiting or remaining in the module, m
i=1 pi
• and remaining in the module, m
i=1 pi .
• The goal is to decrease the description length of a network by
clustering nodes based on distance and dictionary similarity.
For comparison, we use a K-Means Clustering method:
• We find the k eigenvectors associated with the k smallest
eigenvalues of the symmetric normalized Laplacian matrix [1],
Ls = D−1
2(D − W)D−1
2,
where W is the weighted adjacency matrix and D is a
diagonal matrix with Dii = j Wij, the sum of the edge
weights adjacent to node i.
• The eigenvectors then create points in Rk
, which are
partitioned into k clusters by applying the k−means
clustering algorithm.
• The resulting partitioning gives k clusters of nodes. k is
predetermined.
Metrics for Comparison
• We use Normalized Mutual Information (NMI) and Purity
measures to compare the partitioning results to the ground
truth communities. Values of 1.0 are optimal.
• NMI is calculated by:
NMI =
I(U, V )
H(U)H(V )
,
where I(U, V ) is the mutual information between sets U and
V , H(U) is the entropy of a set U, and H(V ) is the entropy
of a set V .
• Purity of one cluster is calculated by:
P(Cj) =
1
|Cj|
max
k=1,...,c
|Cj,k|,
where Cj,k consists of the cluster Cj of an image that
originally belongs to the class set k.
Results
• Noise was added to a toy image. The epsilon values indicate
the amount of noise.
• The CME produces higher NMI values than the K-Means
Clustering method for = 0, 0.00003 − 0.00005, & 0.00007.
• CME has higher purity for = 0.00002 − 0.00005, & 0.00007.
• Clusters resulting from the CME model have a larger number
of nodes in each cluster belonging to a particular class than
the K-Means Clustering method.
• One advantage of CME model is the number of communities
does not have to be predetermined, but this may be a dis-
advantageous when partitioned into too many clusters.
Evaluation of Partitions with Groundtruth
(a) NMI Measure
(b) Purity Measure
Figure 3: NMI and Purity Measures. This gives a comparison of the
Content Map Equation partitions with those of the K- Means Clustering
method. The purity results are impressive for CME partitions since the
model distinguished the distinct objects of the image. However, its NMI is
low since the objects, for example the rectangle, are being clustered into
several distinct communities as opposed to the optimal full object.
Conclusion
• High purity values for the CME model indicate the method is
able to distinguish the different objects.
• With a low NMI, the CME model puts the different objects in
distinct clusters consisting of several smaller communities.
• Lower purity values for the K-Means Clustering method
suggest the method is not able to fully distinguish the objects.
Discussion/Future work
• To address the multiple communities for a single object, we
will extend the method by treating communities resulting
from the CME as new nodes that we want to merge.
• We would like to experiment with different edge weights and
dictionary definitions.
• We would like to examine the approach on more complex
images.
References
1 Shi, J., & Malik, J. IEEE Transactions on Pattern Analysis and Machine Intelligence.
Vol. 22. No. 8. p. 888–905. 2000.
2 Smith, L., Zhu, L., Lerman, K., & Percus, A. (2014). Partitioning networks with node
attributes by compressing information flow. Unpublished manuscript.
3 Rosvall, M., & Bergstrom, C. T.. Proceedings of the National Academy of Sciences. Vol.
105. No. 4. p. 1118–1123. Jan. 2008.
4 Danon, L., Diaz-Guileram, A., Duch, J. & Arenas, A. Journal of Statistical Mechanics:
Theory and Experiment. 2005.
5 Strehl, A, & Ghosh, J.. Journal of Machine Learning Research. Vol. 3. p. 583–617. 2002.
6 Huang, A.Similarity Measures for Text Document Clustering. Proceedings of the New
Zealand Computer Science Research Student Conference. 2008.
7 Chen,Y., Wang, J. Z., & Krovetz, R.. IEEE Transactions on Image Processing. Vol.14.
No. 8. p. 1187–1201. 2005.
8 Chen, M. Normalized Mutual Information Code. March 2012. Online. May 2014.
http://www.mathworks.com/matlabcentral/fileexchange/35625-information-theory-
toolbox/content/nmi.m.
9 Smith, L., Zhu, L., Lerman, K., & Percus, A.The Content Map Equation Code. June
2013.

More Related Content

What's hot

A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
Teady Matius
 
Improving search time for contentment based image retrieval via, LSH, MTRee, ...
Improving search time for contentment based image retrieval via, LSH, MTRee, ...Improving search time for contentment based image retrieval via, LSH, MTRee, ...
Improving search time for contentment based image retrieval via, LSH, MTRee, ...
IOSR Journals
 
OTSU Thresholding Method for Flower Image Segmentation
OTSU Thresholding Method for Flower Image SegmentationOTSU Thresholding Method for Flower Image Segmentation
OTSU Thresholding Method for Flower Image Segmentation
ijceronline
 
Investigation on the Pattern Synthesis of Subarray Weights for Low EMI Applic...
Investigation on the Pattern Synthesis of Subarray Weights for Low EMI Applic...Investigation on the Pattern Synthesis of Subarray Weights for Low EMI Applic...
Investigation on the Pattern Synthesis of Subarray Weights for Low EMI Applic...
IOSRJECE
 
Steganography Using Adaptive Pixel Value Differencing(APVD) of Gray Images Th...
Steganography Using Adaptive Pixel Value Differencing(APVD) of Gray Images Th...Steganography Using Adaptive Pixel Value Differencing(APVD) of Gray Images Th...
Steganography Using Adaptive Pixel Value Differencing(APVD) of Gray Images Th...
cscpconf
 
K means clustering
K means clusteringK means clustering
K means clustering
keshav goyal
 
Noise tolerant color image segmentation using support vector machine
Noise tolerant color image segmentation using support vector machineNoise tolerant color image segmentation using support vector machine
Noise tolerant color image segmentation using support vector machine
eSAT Publishing House
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance system
iaemedu
 
CLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATACLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATA
csandit
 
Effect of Similarity Measures for CBIR using Bins Approach
Effect of Similarity Measures for CBIR using Bins ApproachEffect of Similarity Measures for CBIR using Bins Approach
Effect of Similarity Measures for CBIR using Bins Approach
CSCJournals
 
Multi Resolution features of Content Based Image Retrieval
Multi Resolution features of Content Based Image RetrievalMulti Resolution features of Content Based Image Retrieval
Multi Resolution features of Content Based Image Retrieval
IDES Editor
 
Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...
Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...
Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...
CSCJournals
 
Behavior study of entropy in a digital image through an iterative algorithm
Behavior study of entropy in a digital image through an iterative algorithmBehavior study of entropy in a digital image through an iterative algorithm
Behavior study of entropy in a digital image through an iterative algorithm
ijscmcj
 
Developing Document Image Retrieval System
Developing Document Image Retrieval SystemDeveloping Document Image Retrieval System
Developing Document Image Retrieval System
Konstantinos Zagoris
 
Scene Text Detection on Images using Cellular Automata
Scene Text Detection on Images using Cellular AutomataScene Text Detection on Images using Cellular Automata
Scene Text Detection on Images using Cellular Automata
Konstantinos Zagoris
 
Multi graph encoder
Multi graph encoderMulti graph encoder
Multi graph encoder
Yajie Zhou
 
gilbert_iccv11_paper
gilbert_iccv11_papergilbert_iccv11_paper
gilbert_iccv11_paper
Andrew Gilbert
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
inventionjournals
 

What's hot (18)

A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
A CHINESE CHARACTER RECOGNITION METHOD BASED ON POPULATION MATRIX AND RELATIO...
 
Improving search time for contentment based image retrieval via, LSH, MTRee, ...
Improving search time for contentment based image retrieval via, LSH, MTRee, ...Improving search time for contentment based image retrieval via, LSH, MTRee, ...
Improving search time for contentment based image retrieval via, LSH, MTRee, ...
 
OTSU Thresholding Method for Flower Image Segmentation
OTSU Thresholding Method for Flower Image SegmentationOTSU Thresholding Method for Flower Image Segmentation
OTSU Thresholding Method for Flower Image Segmentation
 
Investigation on the Pattern Synthesis of Subarray Weights for Low EMI Applic...
Investigation on the Pattern Synthesis of Subarray Weights for Low EMI Applic...Investigation on the Pattern Synthesis of Subarray Weights for Low EMI Applic...
Investigation on the Pattern Synthesis of Subarray Weights for Low EMI Applic...
 
Steganography Using Adaptive Pixel Value Differencing(APVD) of Gray Images Th...
Steganography Using Adaptive Pixel Value Differencing(APVD) of Gray Images Th...Steganography Using Adaptive Pixel Value Differencing(APVD) of Gray Images Th...
Steganography Using Adaptive Pixel Value Differencing(APVD) of Gray Images Th...
 
K means clustering
K means clusteringK means clustering
K means clustering
 
Noise tolerant color image segmentation using support vector machine
Noise tolerant color image segmentation using support vector machineNoise tolerant color image segmentation using support vector machine
Noise tolerant color image segmentation using support vector machine
 
Tracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance systemTracking and counting human in visual surveillance system
Tracking and counting human in visual surveillance system
 
CLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATACLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATA
 
Effect of Similarity Measures for CBIR using Bins Approach
Effect of Similarity Measures for CBIR using Bins ApproachEffect of Similarity Measures for CBIR using Bins Approach
Effect of Similarity Measures for CBIR using Bins Approach
 
Multi Resolution features of Content Based Image Retrieval
Multi Resolution features of Content Based Image RetrievalMulti Resolution features of Content Based Image Retrieval
Multi Resolution features of Content Based Image Retrieval
 
Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...
Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...
Segmentation by Fusion of Self-Adaptive SFCM Cluster in Multi-Color Space Com...
 
Behavior study of entropy in a digital image through an iterative algorithm
Behavior study of entropy in a digital image through an iterative algorithmBehavior study of entropy in a digital image through an iterative algorithm
Behavior study of entropy in a digital image through an iterative algorithm
 
Developing Document Image Retrieval System
Developing Document Image Retrieval SystemDeveloping Document Image Retrieval System
Developing Document Image Retrieval System
 
Scene Text Detection on Images using Cellular Automata
Scene Text Detection on Images using Cellular AutomataScene Text Detection on Images using Cellular Automata
Scene Text Detection on Images using Cellular Automata
 
Multi graph encoder
Multi graph encoderMulti graph encoder
Multi graph encoder
 
gilbert_iccv11_paper
gilbert_iccv11_papergilbert_iccv11_paper
gilbert_iccv11_paper
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 

Similar to poster

PERFORMANCE EVALUATION OF DIFFERENT TECHNIQUES FOR TEXTURE CLASSIFICATION
PERFORMANCE EVALUATION OF DIFFERENT TECHNIQUES FOR TEXTURE CLASSIFICATION PERFORMANCE EVALUATION OF DIFFERENT TECHNIQUES FOR TEXTURE CLASSIFICATION
PERFORMANCE EVALUATION OF DIFFERENT TECHNIQUES FOR TEXTURE CLASSIFICATION
cscpconf
 
DOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGES
DOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGESDOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGES
DOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGES
cseij
 
Image segmentation for high resolution images
Image segmentation for high resolution imagesImage segmentation for high resolution images
Image segmentation for high resolution images
jeet patalia
 
[PPT]
[PPT][PPT]
[PPT]
butest
 
Week06 bme429-cbir
Week06 bme429-cbirWeek06 bme429-cbir
Week06 bme429-cbir
Ikram Moalla
 
4 image segmentation through clustering
4 image segmentation through clustering4 image segmentation through clustering
4 image segmentation through clustering
IAEME Publication
 
4 image segmentation through clustering
4 image segmentation through clustering4 image segmentation through clustering
4 image segmentation through clustering
prjpublications
 
Imagethresholding
ImagethresholdingImagethresholding
Imagethresholding
ananta200
 
IJET-V2I6P17
IJET-V2I6P17IJET-V2I6P17
On mesh
On meshOn mesh
A proposed accelerated image copy-move forgery detection-vcip2014
A proposed accelerated image copy-move forgery detection-vcip2014A proposed accelerated image copy-move forgery detection-vcip2014
A proposed accelerated image copy-move forgery detection-vcip2014
SondosFadl
 
A comprehensive survey of contemporary
A comprehensive survey of contemporaryA comprehensive survey of contemporary
A comprehensive survey of contemporary
prjpublications
 
Quantum persistent k cores for community detection
Quantum persistent k cores for community detectionQuantum persistent k cores for community detection
Quantum persistent k cores for community detection
Colleen Farrelly
 
PPT s12-machine vision-s2
PPT s12-machine vision-s2PPT s12-machine vision-s2
PPT s12-machine vision-s2
Binus Online Learning
 
I1803026164
I1803026164I1803026164
I1803026164
IOSR Journals
 
I010135760
I010135760I010135760
I010135760
IOSR Journals
 
Content Based Image Retrieval Using 2-D Discrete Wavelet Transform
Content Based Image Retrieval Using 2-D Discrete Wavelet TransformContent Based Image Retrieval Using 2-D Discrete Wavelet Transform
Content Based Image Retrieval Using 2-D Discrete Wavelet Transform
IOSR Journals
 
Multi Wavelet for Image Retrival Based On Using Texture and Color Querys
Multi Wavelet for Image Retrival Based On Using Texture and  Color QuerysMulti Wavelet for Image Retrival Based On Using Texture and  Color Querys
Multi Wavelet for Image Retrival Based On Using Texture and Color Querys
IOSR Journals
 
Fractal Image Compression By Range Block Classification
Fractal Image Compression By Range Block ClassificationFractal Image Compression By Range Block Classification
Fractal Image Compression By Range Block Classification
IRJET Journal
 
A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING
A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERINGA COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING
A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING
IJORCS
 

Similar to poster (20)

PERFORMANCE EVALUATION OF DIFFERENT TECHNIQUES FOR TEXTURE CLASSIFICATION
PERFORMANCE EVALUATION OF DIFFERENT TECHNIQUES FOR TEXTURE CLASSIFICATION PERFORMANCE EVALUATION OF DIFFERENT TECHNIQUES FOR TEXTURE CLASSIFICATION
PERFORMANCE EVALUATION OF DIFFERENT TECHNIQUES FOR TEXTURE CLASSIFICATION
 
DOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGES
DOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGESDOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGES
DOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGES
 
Image segmentation for high resolution images
Image segmentation for high resolution imagesImage segmentation for high resolution images
Image segmentation for high resolution images
 
[PPT]
[PPT][PPT]
[PPT]
 
Week06 bme429-cbir
Week06 bme429-cbirWeek06 bme429-cbir
Week06 bme429-cbir
 
4 image segmentation through clustering
4 image segmentation through clustering4 image segmentation through clustering
4 image segmentation through clustering
 
4 image segmentation through clustering
4 image segmentation through clustering4 image segmentation through clustering
4 image segmentation through clustering
 
Imagethresholding
ImagethresholdingImagethresholding
Imagethresholding
 
IJET-V2I6P17
IJET-V2I6P17IJET-V2I6P17
IJET-V2I6P17
 
On mesh
On meshOn mesh
On mesh
 
A proposed accelerated image copy-move forgery detection-vcip2014
A proposed accelerated image copy-move forgery detection-vcip2014A proposed accelerated image copy-move forgery detection-vcip2014
A proposed accelerated image copy-move forgery detection-vcip2014
 
A comprehensive survey of contemporary
A comprehensive survey of contemporaryA comprehensive survey of contemporary
A comprehensive survey of contemporary
 
Quantum persistent k cores for community detection
Quantum persistent k cores for community detectionQuantum persistent k cores for community detection
Quantum persistent k cores for community detection
 
PPT s12-machine vision-s2
PPT s12-machine vision-s2PPT s12-machine vision-s2
PPT s12-machine vision-s2
 
I1803026164
I1803026164I1803026164
I1803026164
 
I010135760
I010135760I010135760
I010135760
 
Content Based Image Retrieval Using 2-D Discrete Wavelet Transform
Content Based Image Retrieval Using 2-D Discrete Wavelet TransformContent Based Image Retrieval Using 2-D Discrete Wavelet Transform
Content Based Image Retrieval Using 2-D Discrete Wavelet Transform
 
Multi Wavelet for Image Retrival Based On Using Texture and Color Querys
Multi Wavelet for Image Retrival Based On Using Texture and  Color QuerysMulti Wavelet for Image Retrival Based On Using Texture and  Color Querys
Multi Wavelet for Image Retrival Based On Using Texture and Color Querys
 
Fractal Image Compression By Range Block Classification
Fractal Image Compression By Range Block ClassificationFractal Image Compression By Range Block Classification
Fractal Image Compression By Range Block Classification
 
A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING
A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERINGA COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING
A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING
 

poster

  • 1. Segmenting Images through Representation as a Content-Rich Network Author: Yanira Corvera, Advisor: Dr. Laura Smith Department of Mathematics, California State University Fullerton Abstract Image segmentation is the process consisting of partitioning digital images into multiple sections with the goal of detecting objects within the image. In this work, we represent an image as a graph, where each pixel is a node in the graph and an edge is present between two nodes if they are close spatially within the image. In particular, we treat the image as a content-rich graph, where each node is associated with a vector of different features of the image, such as RGB and HSV color representations and textures. We consider graph cut and clustering methods to partition the nodes, resulting in a segmented image. We use the information theoretic approach, the Content Map Equation, which minimizes the description length of the network by compressing the graph into distinct modules of nodes, giving the partitioning of the image. We compare this method to existing graph-cut methods, such as normalized cut, on a variety of images. Implementation of the Content Map Equation on a simple black and white image provides promising results which encourage further investigation on more complex images. Introduction • We want to partition a given image. • An image is represented as a network with pixels as the nodes. • Each node has an associated feature vector, e.g. RGB. • A dictionary vector is created from the feature values. • The image is segmented by using graph-cut methods Methods • Weights of edges between nodes are calculated by: wij = e −||F(i)−F(j)|| σF feature distance ∗ e −||X(i)−X(j)|| σX spatial distance , if ||X(i) − X(j)|| < r, and wij = 0 otherwise [1]. • For an RGB feature vector, we define 8 words per color band and create dictionary vectors of length 24, e.g. Fv = [0, 65, 198] → {da j} = [10000000|00001000|00000001]. • The 8 words are determined by the CDF of the color band, giving each word an equal probability of 1/8 of occuring. Figure 1: The CDF of an image’s red color band that is utilized to define words for the dictionary vector. Segmented Image Results of Toy Image with Noise (a) Noisy Image with = 0 (b) Content Map Equation Results (c) K-Means Clustering Results (d) Noisy Image with = 0.00006 (e) Content Map Equation Results (f) K-Means Clustering Results (g) Noisy Image with = 0.00008 (h) Content Map Equation Results (i) K-Means Clustering Results (j) Noisy Image with = 0.0001 (k) Content Map Equation Results (l) K-Means Clustering Results Figure 2: Partitions of Noisy Toy Images with the Implementation of the Content Map Equation (CME) and K-Means Clustering Method on K eigenvectors. These are the resulting segmented noisy toy images where we defined 8 words for the CME and used constraints F = 120 and R = 24 with σF = 72 and σX = 98. The partitions resulting from the K-Means Clustering method on K eigenvectors required the constraints F = 176, R = 10, and K = 3 with σF = 5 and σX = 4. Graph Cut Methods We use the Content Map Equation (CME) [2] to partition the graph. This is done by using a bottom-up approach [3] to find a local minimum of LC(M) = ( q H(Q)) contribution of movement between modules +   m i=1 pi H(Pi )   contribution of movement within modules +   m i=1 pi H(Xi )   contribution of node dictionary vectors . • H(Q) is the entropy of the movement between modules. • H(Pi ) is the entropy of movement within a module. • H(Xi ) is the entropy of movement between dictionary words. • These terms are weighted by the frequencies of • exiting a module, q • exiting or remaining in the module, m i=1 pi • and remaining in the module, m i=1 pi . • The goal is to decrease the description length of a network by clustering nodes based on distance and dictionary similarity. For comparison, we use a K-Means Clustering method: • We find the k eigenvectors associated with the k smallest eigenvalues of the symmetric normalized Laplacian matrix [1], Ls = D−1 2(D − W)D−1 2, where W is the weighted adjacency matrix and D is a diagonal matrix with Dii = j Wij, the sum of the edge weights adjacent to node i. • The eigenvectors then create points in Rk , which are partitioned into k clusters by applying the k−means clustering algorithm. • The resulting partitioning gives k clusters of nodes. k is predetermined. Metrics for Comparison • We use Normalized Mutual Information (NMI) and Purity measures to compare the partitioning results to the ground truth communities. Values of 1.0 are optimal. • NMI is calculated by: NMI = I(U, V ) H(U)H(V ) , where I(U, V ) is the mutual information between sets U and V , H(U) is the entropy of a set U, and H(V ) is the entropy of a set V . • Purity of one cluster is calculated by: P(Cj) = 1 |Cj| max k=1,...,c |Cj,k|, where Cj,k consists of the cluster Cj of an image that originally belongs to the class set k. Results • Noise was added to a toy image. The epsilon values indicate the amount of noise. • The CME produces higher NMI values than the K-Means Clustering method for = 0, 0.00003 − 0.00005, & 0.00007. • CME has higher purity for = 0.00002 − 0.00005, & 0.00007. • Clusters resulting from the CME model have a larger number of nodes in each cluster belonging to a particular class than the K-Means Clustering method. • One advantage of CME model is the number of communities does not have to be predetermined, but this may be a dis- advantageous when partitioned into too many clusters. Evaluation of Partitions with Groundtruth (a) NMI Measure (b) Purity Measure Figure 3: NMI and Purity Measures. This gives a comparison of the Content Map Equation partitions with those of the K- Means Clustering method. The purity results are impressive for CME partitions since the model distinguished the distinct objects of the image. However, its NMI is low since the objects, for example the rectangle, are being clustered into several distinct communities as opposed to the optimal full object. Conclusion • High purity values for the CME model indicate the method is able to distinguish the different objects. • With a low NMI, the CME model puts the different objects in distinct clusters consisting of several smaller communities. • Lower purity values for the K-Means Clustering method suggest the method is not able to fully distinguish the objects. Discussion/Future work • To address the multiple communities for a single object, we will extend the method by treating communities resulting from the CME as new nodes that we want to merge. • We would like to experiment with different edge weights and dictionary definitions. • We would like to examine the approach on more complex images. References 1 Shi, J., & Malik, J. IEEE Transactions on Pattern Analysis and Machine Intelligence. Vol. 22. No. 8. p. 888–905. 2000. 2 Smith, L., Zhu, L., Lerman, K., & Percus, A. (2014). Partitioning networks with node attributes by compressing information flow. Unpublished manuscript. 3 Rosvall, M., & Bergstrom, C. T.. Proceedings of the National Academy of Sciences. Vol. 105. No. 4. p. 1118–1123. Jan. 2008. 4 Danon, L., Diaz-Guileram, A., Duch, J. & Arenas, A. Journal of Statistical Mechanics: Theory and Experiment. 2005. 5 Strehl, A, & Ghosh, J.. Journal of Machine Learning Research. Vol. 3. p. 583–617. 2002. 6 Huang, A.Similarity Measures for Text Document Clustering. Proceedings of the New Zealand Computer Science Research Student Conference. 2008. 7 Chen,Y., Wang, J. Z., & Krovetz, R.. IEEE Transactions on Image Processing. Vol.14. No. 8. p. 1187–1201. 2005. 8 Chen, M. Normalized Mutual Information Code. March 2012. Online. May 2014. http://www.mathworks.com/matlabcentral/fileexchange/35625-information-theory- toolbox/content/nmi.m. 9 Smith, L., Zhu, L., Lerman, K., & Percus, A.The Content Map Equation Code. June 2013.