SlideShare a Scribd company logo
International Journal of Research in Computer Science
eISSN 2249-8265 Volume 2 Issue 1 (2011) pp. 29-31
© White Globe Publications
www.ijorcs.org


     A COMPARATIVE STUDY ON DISTANCE MEASURING
            APPROACHES FOR CLUSTERING
                                        Shraddha Pandit1, Suchita Gupta2
           *Assistant Professor,Gyan Ganga Institute of Information Technology and Management,Bhopal

  Abstract: Clustering plays a vital role in the various    can be measured and interpretation of the comparison;
areas of research like Data Mining, Image Retrieval,        And we conclude the report.
Bio-computing and many a lot. Distance measure
plays an important role in clustering data points.                 II. DISTANCE MEASURES AND ITS
Choosing the right distance measure for a given                              SIGNIFICANCE
dataset is a biggest challenge. In this paper, we study        A cluster is a collection of data objects that are
various distance measures and their effect on different     similar to objects within the same cluster and
clustering. This paper surveys existing distance            dissimilar to those in other clusters. Similarity between
measures for clustering and present a comparison            two objects is calculated using a distance measure
between them based on application domain, efficiency,       [6].Since clustering forms groups; it can be used as a
benefits and drawbacks. This comparison helps the           pre-processing step for methods like classifications.
researchers to take quick decision about which
distance measure to use for clustering. We conclude             Many distance measures have been proposed in
this work by identifying trends and challenges of           literature for data clustering. Most often, these
research and development towards clustering.                measures are metric functions; Manhattan distance,
Keywords: Clustering, Distance Measure, Clustering          Minkowski distance and Hamming distance. Jaccard
Algorithms                                                  index, Cosine Similarity and Dice Coefficient are also
                                                            popular distance measures. For non-numeric datasets,
                 I. INTRODUCTION                            special distance functions are proposed. For example,
                                                            edit distance is a well-known distance measure for text
   Clustering is an important data mining technique         attributes.
that has a wide range of applications in many areas
like biology, medicine, market research and image           In this section we briefly elaborate seven commonly
analysis etc.It is the process of partitioning a set of     used distance measures.
objects into different subsets such that the data in each   A. Euclidean Distance
subset are similar to each other. In Cluster analysis
Distance measure and clustering algorithm plays an             The Euclidean distance or Euclidean metric is the
important role [1].                                         ordinary distance between two points that one would
                                                            measure with a ruler. It is the straight line distance
   An important step in any clustering is to select a       between two points.
distance measure, which will determine how similarity
[1] of two elements is calculated. This will influence      In a plane with p1 at (x1, y1) and p2 at (x2, y2), it is
the shape of the clusters, as some elements may be          √((x1 - x2)² + (y1 - y2)²).
close to one another according to one distance and          In N dimensions, the Euclidean distance between two
farther away according to another.                          points p and q is √ (Σi=1N (pi-qi)²) where pi (or qi) is
    It is expected that distance between objects within a   the coordinate of p (or q) in dimension i.
cluster should be minimum and distance between              B. Manhattan Distance
objects within different clusters should be maximum.
In this paper we compare different distance measures.          The distance between two points measured along
Comparison of these distance measures show that             axes at right angles. In a plane with p1 at (x1, y1) and
different distance measures behave differently              p2 at (x2, y2), it is |x1 - x2| + |y1 - y2|.
depending on application domain. The rest of the paper         This is easily generalized to higher dimensions.
is organized as follows:                                    Manhattan distance is often used in integrated circuits
   In section II, we discuss distance measures and its      where wires only run parallel to the X or Y axis. It is
significance in nutshell; in section III, we present the    also known as rectilinear distance, Minkowski’s [7] [3]
comparison between these distances measures in              L1 distance, taxi cab metric, or city block distance.
TABLE I; In section IV, we describe how the accuracy


                                                                             www.ijorcs.org
30                                                                                   Shraddha Pandit, Suchita Gupta

C. Bit-Vector Distance                                     G. Dice Index
   An N x N matrix Mb is calculated. Each point has d         Dice's coefficient [11] (also known as the Dice
dimensions and Mb (Pi, Pj) is determined as d-bit          Coefficient) is a similarity measure related to the
vector. This vector is obtained as follows:                Jaccard index.
                                                           For sets X and Y of keywords used in information
   If the numerical value of the xth dimension of point

                                                                2|𝑋 ∩ 𝑌|
is greater than the numerical value of the xth             retrieval, the coefficient may be defined as:


                                                           𝑆=
                                                                |𝑋| + |𝑌|
dimension of point pj,y then the bit x of Mb(Pi ,Pj) is
set to 1 and bit x of Mb(Pj, Pi) is set to 0.All the bit
vectors in Mb are then converted to integers.
D. Hamming Distance                                        When taken as string similarity measure, the
                                                           coefficient may be calculated for two strings, x and y


                                                           𝑆=
                                                                  2𝑛 𝑡
   The Hamming distance between two strings of             using bigrams as follows:
equal length is the number of positions for which the
                                                                 𝑛 𝑥 +𝑛 𝑦
corresponding symbols are different.
   Let x, y A^n. We define the Hamming distance
between x and y, denoted dH(x, y), to be the number        Where nt is the number of character bigrams found in
of places where x and y are different.                     both strings, nx is the number of bigrams in string x
                                                           and ny is the number of bigrams in string y.
   The Hamming distance [1] [6] can be interpreted as
the number of bits which need to be changed                III. ACCURACY AND RESULT INTERPRETATION
(corrupted) to turn one string into other. Sometimes the
number of characters is used instead of the number of         In general, the larger the number of sub-clusters
bits. Hamming distance can be seen as Manhattan            produced by the clustering the more accurate the final
distance between bit vectors.                              result is. However, too many sub-clusters will slow
                                                           down the clustering. The above comparison table
E. Jaccard Index                                           compares 5 proximity measures. This comparison is
   The Jaccard index, also known as the Jaccard            based on 4 different criteria which are generally
similarity coefficient is a statistic used for comparing   required to decide upon distance measure and
the similarity and diversity of sample sets.               clustering algorithms.
   The Jaccard coefficient [11] measures similarity           All above comparisons are tested using standard
between sample sets, and is defined as the size of the     synthetic dataset generated by Syndeca [3] Software
intersection divided by the size of the union of the       and few of it is tested using open source clustering tool
sample sets:                                               CLUTO.
√(A,B) = ‫׀‬A∩B‫׀‬                                                               IV.CONCLUSION
    ‫׀‬AUB ‫׀‬                                                    This paper surveys existing proximity measures for
F.   Cosine Index                                          clustering and presents a comparison between them
                                                           based on application domain, efficiency, benefits and
   It is a measure of similarity between two vectors of    drawbacks. This comparison helps the researchers to
n dimensions by finding the angle between them, often      take quick decision about which distance measure to
used to compare documents in text mining. Given two        use for clustering. We ran our experiments on
vectors of attributes, A and B, the cosine similarity      synthetic datasets for its validation. Future work
[11], θ, is represented using a dot product and

               𝐴. 𝐵
                                                           involves running the experiments on larger and


𝜃 = 𝑎𝑟𝑐𝑐𝑜𝑠
magnitude as                                               different kinds of datasets and extending our study to

             ‖𝐴‖‖𝐵‖
                                                           other proximity measures and clustering algorithms.

                                                                             V. REFERENCES
   For text matching, the attribute vectors A and B are    [1] Ankita Vimal, Satyanarayana R Valluri, Kamalakar
usually the tf-idf vectors of the documents. Since the         Karlapalem , “An Experiment with Distance Measures
angle, θ, is in the range of [0, π], the resulting             for Clustering” , Technical Report: IIIT/TR/2008/132
similarity will yield the value of π as meaning exactly    [2] John W. Ratcliff and David E. Metzener, Pattern
opposite, π / 2 meaning independent, 0 meaning                 Matching: The Gestalt Approach, DR. DOBB’S
exactly the same, with in-between values indicating            JOURNAL, 1998, p. 46.
intermediate similarities or dissimilarities               [3] Martin Ester Hans-Peter Kriegel Jrg Sander and
                                                               Xiaowei Xu, A Density-Based Algorithm for
                                                               Discovering Clusters in Large Spatial Databases with
                                                               Noise, AAAI Press, 1996, pp. 226–231.


                                                                            www.ijorcs.org
A Comparative Study on Distance Measuring Approaches for Clustering                                                       31

                                   TABLE I: COMPARISON OF DISTANCE MEASURE

  Distance                                      Algorithms In                                               Application
                       Formula                                     Benefits           Drawbacks
  Measure                                       which it is Used                                            Area
                                                                                      Results are greatly
                                                                   Easy to                                  -Appl.
                       √ ((x1 - x2) ² + (y1 -   -Partitional                          influenced by
  Euclidean                                                        implement and                            Involving
                       y2) ²)                   Algorithms                            variables that have
                                                                   Test                                     Interval Data
                                                                                      the largest value.
                                                                                                            - In health
                                                -K Modes                                                    psychology
                                                                                                            analysis
                                                                                      Does not work well
                                                                                      for image data,
                                                - AutoClass
                                                                                      Document
                                                                                      Classification
                                                                                                            - DNA
                                                -ROCK
                                                                                                            Analysis
                                                                   Easily             Does not work well
                                                Partitional        generalized to     for image data and    In Integrated
  Manhattan            |x1 - x2| + |y1 - y2|.
                                                Algorithms         higher             Document              Circuits
                                                                   dimensions         Classification
                                                                   Handles both
                                                                   Continuous and     Does not work well
  Cosine                                        Ontology and                                                Text Mining
                                                                   categorical        for nominal data
                                                                   variables
                       Ө = arccos A • B
  Similarity                                    Graph based
                       ‫ ׀׀‬A ‫ ׀׀ ׀׀‬B‫׀׀‬
                                                                   Handles both
                       √(A,B) = ‫׀‬A∩B‫׀‬                              Continuous and     Does not work well    Document
  Jaccard Index                                 Neural Network
                       ‫׀‬AUB‫׀‬                                       categorical        for nominal data      classification
                                                                   variables


[4] Bar-Hilel, A., Hertz, T., Shental, N., & Weinshall, D.
    (2003). Learning distance functions using equivalence
[5] Fukunaga, K. (1990). Statistical pattern recognition.
    San Diego: Academic Press. 2nd edition.
[6] Rui Xu, Donald Wunsch “Survey of Clustering
    Algorithms” IEEE Transactions on Neural Networks ,
    VOL. 16, NO. 3, MAY 2005.
[7] http://en.wikipedia.org/wiki/Data clustering

[8] http://en.wikipedia.org/wiki/K-means

[9] http://en.wikipedia.org/wiki/ DBSCAN

[10] http://en.wikipedia.org/wiki/Jaccard index

[11] http://en.wikipedia.org/wiki/Dice coefficient




                                                                                    www.ijorcs.org

More Related Content

What's hot

Unsupervised learning clustering
Unsupervised learning clusteringUnsupervised learning clustering
Unsupervised learning clustering
Dr Nisha Arora
 
Data Clusterng
Data ClusterngData Clusterng
Data Clusterng
VIDYA NAND JHA
 
07 Tensor Visualization
07 Tensor Visualization07 Tensor Visualization
07 Tensor Visualization
Valerii Klymchuk
 
Optics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structureOptics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structure
Rajesh Piryani
 
A comprehensive survey of contemporary
A comprehensive survey of contemporaryA comprehensive survey of contemporary
A comprehensive survey of contemporaryprjpublications
 
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
ijcseit
 
Clustering: A Survey
Clustering: A SurveyClustering: A Survey
Clustering: A Survey
Raffaele Capaldo
 
Clustering
ClusteringClustering
Clustering
NLPseminar
 
Satellite image compression technique
Satellite image compression techniqueSatellite image compression technique
Satellite image compression technique
acijjournal
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...butest
 
Clique and sting
Clique and stingClique and sting
Clique and sting
Subramanyam Natarajan
 
3.1 clustering
3.1 clustering3.1 clustering
3.1 clustering
Krish_ver2
 
Data clustering
Data clustering Data clustering
Data clustering
GARIMA SHAKYA
 
Clustering, k-means clustering
Clustering, k-means clusteringClustering, k-means clustering
Clustering, k-means clustering
Megha Sharma
 
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
K-MEDOIDS CLUSTERING  USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...K-MEDOIDS CLUSTERING  USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
ijscmc
 
Clustered Compressive Sensingbased Image Denoising Using Bayesian Framework
Clustered Compressive Sensingbased Image Denoising Using Bayesian FrameworkClustered Compressive Sensingbased Image Denoising Using Bayesian Framework
Clustered Compressive Sensingbased Image Denoising Using Bayesian Framework
csandit
 
Web image annotation by diffusion maps manifold learning algorithm
Web image annotation by diffusion maps manifold learning algorithmWeb image annotation by diffusion maps manifold learning algorithm
Web image annotation by diffusion maps manifold learning algorithm
ijfcstjournal
 

What's hot (19)

Unsupervised learning clustering
Unsupervised learning clusteringUnsupervised learning clustering
Unsupervised learning clustering
 
Data Clusterng
Data ClusterngData Clusterng
Data Clusterng
 
07 Tensor Visualization
07 Tensor Visualization07 Tensor Visualization
07 Tensor Visualization
 
Optics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structureOptics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structure
 
A comprehensive survey of contemporary
A comprehensive survey of contemporaryA comprehensive survey of contemporary
A comprehensive survey of contemporary
 
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
 
Clustering: A Survey
Clustering: A SurveyClustering: A Survey
Clustering: A Survey
 
Clustering
ClusteringClustering
Clustering
 
Satellite image compression technique
Satellite image compression techniqueSatellite image compression technique
Satellite image compression technique
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
 
Clique and sting
Clique and stingClique and sting
Clique and sting
 
3.1 clustering
3.1 clustering3.1 clustering
3.1 clustering
 
Data clustering
Data clustering Data clustering
Data clustering
 
Clustering, k-means clustering
Clustering, k-means clusteringClustering, k-means clustering
Clustering, k-means clustering
 
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
K-MEDOIDS CLUSTERING  USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...K-MEDOIDS CLUSTERING  USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
 
ssc_icml13
ssc_icml13ssc_icml13
ssc_icml13
 
lcr
lcrlcr
lcr
 
Clustered Compressive Sensingbased Image Denoising Using Bayesian Framework
Clustered Compressive Sensingbased Image Denoising Using Bayesian FrameworkClustered Compressive Sensingbased Image Denoising Using Bayesian Framework
Clustered Compressive Sensingbased Image Denoising Using Bayesian Framework
 
Web image annotation by diffusion maps manifold learning algorithm
Web image annotation by diffusion maps manifold learning algorithmWeb image annotation by diffusion maps manifold learning algorithm
Web image annotation by diffusion maps manifold learning algorithm
 

Viewers also liked

Distance measure between two biological sequences
Distance  measure between  two biological  sequences Distance  measure between  two biological  sequences
Distance measure between two biological sequences
ShwetA Kumari
 
dot plot analysis
dot plot analysisdot plot analysis
dot plot analysis
ShwetA Kumari
 
Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformaticsavrilcoghlan
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
Kubuldinho
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignmentavrilcoghlan
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
Nikesh Narayanan
 

Viewers also liked (6)

Distance measure between two biological sequences
Distance  measure between  two biological  sequences Distance  measure between  two biological  sequences
Distance measure between two biological sequences
 
dot plot analysis
dot plot analysisdot plot analysis
dot plot analysis
 
Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformatics
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 

Similar to A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING

Combined cosine-linear regression model similarity with application to handwr...
Combined cosine-linear regression model similarity with application to handwr...Combined cosine-linear regression model similarity with application to handwr...
Combined cosine-linear regression model similarity with application to handwr...
IJECEIAES
 
Connectivity-Based Clustering for Mixed Discrete and Continuous Data
Connectivity-Based Clustering for Mixed Discrete and Continuous DataConnectivity-Based Clustering for Mixed Discrete and Continuous Data
Connectivity-Based Clustering for Mixed Discrete and Continuous Data
IJCI JOURNAL
 
Cluster Analysis: Measuring Similarity & Dissimilarity
Cluster Analysis: Measuring Similarity & DissimilarityCluster Analysis: Measuring Similarity & Dissimilarity
Cluster Analysis: Measuring Similarity & Dissimilarity
ShivarkarSandip
 
CLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATACLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATA
csandit
 
Clustering Algorithms for Data Stream
Clustering Algorithms for Data StreamClustering Algorithms for Data Stream
Clustering Algorithms for Data Stream
IRJET Journal
 
Speech recognition using vector quantization through modified k means lbg alg...
Speech recognition using vector quantization through modified k means lbg alg...Speech recognition using vector quantization through modified k means lbg alg...
Speech recognition using vector quantization through modified k means lbg alg...Alexander Decker
 
A general multiobjective clustering approach based on multiple distance measures
A general multiobjective clustering approach based on multiple distance measuresA general multiobjective clustering approach based on multiple distance measures
A general multiobjective clustering approach based on multiple distance measures
Mehran Mesbahzadeh
 
Improved probabilistic distance based locality preserving projections method ...
Improved probabilistic distance based locality preserving projections method ...Improved probabilistic distance based locality preserving projections method ...
Improved probabilistic distance based locality preserving projections method ...
IJECEIAES
 
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
ijscmcj
 
GraphSignalProcessingFinalPaper
GraphSignalProcessingFinalPaperGraphSignalProcessingFinalPaper
GraphSignalProcessingFinalPaperChiraz Nafouki
 
An improvised similarity measure for generalized fuzzy numbers
An improvised similarity measure for generalized fuzzy numbersAn improvised similarity measure for generalized fuzzy numbers
An improvised similarity measure for generalized fuzzy numbers
journalBEEI
 
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
ijcsit
 
DM UNIT_4 PPT for btech final year students
DM UNIT_4 PPT for btech final year studentsDM UNIT_4 PPT for btech final year students
DM UNIT_4 PPT for btech final year students
sriharipatilin
 
Clustering techniques final
Clustering techniques finalClustering techniques final
Clustering techniques final
Benard Maina
 
An approximate possibilistic
An approximate possibilisticAn approximate possibilistic
An approximate possibilistic
csandit
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
ijcseit
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
ijcseit
 

Similar to A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING (20)

1376846406 14447221
1376846406  144472211376846406  14447221
1376846406 14447221
 
Combined cosine-linear regression model similarity with application to handwr...
Combined cosine-linear regression model similarity with application to handwr...Combined cosine-linear regression model similarity with application to handwr...
Combined cosine-linear regression model similarity with application to handwr...
 
call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...
 
Connectivity-Based Clustering for Mixed Discrete and Continuous Data
Connectivity-Based Clustering for Mixed Discrete and Continuous DataConnectivity-Based Clustering for Mixed Discrete and Continuous Data
Connectivity-Based Clustering for Mixed Discrete and Continuous Data
 
Cluster Analysis: Measuring Similarity & Dissimilarity
Cluster Analysis: Measuring Similarity & DissimilarityCluster Analysis: Measuring Similarity & Dissimilarity
Cluster Analysis: Measuring Similarity & Dissimilarity
 
CLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATACLUSTERING HYPERSPECTRAL DATA
CLUSTERING HYPERSPECTRAL DATA
 
Clustering Algorithms for Data Stream
Clustering Algorithms for Data StreamClustering Algorithms for Data Stream
Clustering Algorithms for Data Stream
 
Speech recognition using vector quantization through modified k means lbg alg...
Speech recognition using vector quantization through modified k means lbg alg...Speech recognition using vector quantization through modified k means lbg alg...
Speech recognition using vector quantization through modified k means lbg alg...
 
A general multiobjective clustering approach based on multiple distance measures
A general multiobjective clustering approach based on multiple distance measuresA general multiobjective clustering approach based on multiple distance measures
A general multiobjective clustering approach based on multiple distance measures
 
[PPT]
[PPT][PPT]
[PPT]
 
Improved probabilistic distance based locality preserving projections method ...
Improved probabilistic distance based locality preserving projections method ...Improved probabilistic distance based locality preserving projections method ...
Improved probabilistic distance based locality preserving projections method ...
 
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
 
GraphSignalProcessingFinalPaper
GraphSignalProcessingFinalPaperGraphSignalProcessingFinalPaper
GraphSignalProcessingFinalPaper
 
An improvised similarity measure for generalized fuzzy numbers
An improvised similarity measure for generalized fuzzy numbersAn improvised similarity measure for generalized fuzzy numbers
An improvised similarity measure for generalized fuzzy numbers
 
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
 
DM UNIT_4 PPT for btech final year students
DM UNIT_4 PPT for btech final year studentsDM UNIT_4 PPT for btech final year students
DM UNIT_4 PPT for btech final year students
 
Clustering techniques final
Clustering techniques finalClustering techniques final
Clustering techniques final
 
An approximate possibilistic
An approximate possibilisticAn approximate possibilistic
An approximate possibilistic
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
 

More from IJORCS

Help the Genetic Algorithm to Minimize the Urban Traffic on Intersections
Help the Genetic Algorithm to Minimize the Urban Traffic on IntersectionsHelp the Genetic Algorithm to Minimize the Urban Traffic on Intersections
Help the Genetic Algorithm to Minimize the Urban Traffic on Intersections
IJORCS
 
Call for Papers - IJORCS, Volume 4 Issue 4
Call for Papers - IJORCS, Volume 4 Issue 4Call for Papers - IJORCS, Volume 4 Issue 4
Call for Papers - IJORCS, Volume 4 Issue 4
IJORCS
 
Real-Time Multiple License Plate Recognition System
Real-Time Multiple License Plate Recognition SystemReal-Time Multiple License Plate Recognition System
Real-Time Multiple License Plate Recognition System
IJORCS
 
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A RetrospectiveFPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
IJORCS
 
Using Virtualization Technique to Increase Security and Reduce Energy Consump...
Using Virtualization Technique to Increase Security and Reduce Energy Consump...Using Virtualization Technique to Increase Security and Reduce Energy Consump...
Using Virtualization Technique to Increase Security and Reduce Energy Consump...
IJORCS
 
Algebraic Fault Attack on the SHA-256 Compression Function
Algebraic Fault Attack on the SHA-256 Compression FunctionAlgebraic Fault Attack on the SHA-256 Compression Function
Algebraic Fault Attack on the SHA-256 Compression Function
IJORCS
 
Enhancement of DES Algorithm with Multi State Logic
Enhancement of DES Algorithm with Multi State LogicEnhancement of DES Algorithm with Multi State Logic
Enhancement of DES Algorithm with Multi State Logic
IJORCS
 
Hybrid Simulated Annealing and Nelder-Mead Algorithm for Solving Large-Scale ...
Hybrid Simulated Annealing and Nelder-Mead Algorithm for Solving Large-Scale ...Hybrid Simulated Annealing and Nelder-Mead Algorithm for Solving Large-Scale ...
Hybrid Simulated Annealing and Nelder-Mead Algorithm for Solving Large-Scale ...
IJORCS
 
CFP. IJORCS, Volume 4 - Issue2
CFP. IJORCS, Volume 4 - Issue2CFP. IJORCS, Volume 4 - Issue2
CFP. IJORCS, Volume 4 - Issue2
IJORCS
 
Call for Papers - IJORCS - Vol 4, Issue 1
Call for Papers - IJORCS - Vol 4, Issue 1Call for Papers - IJORCS - Vol 4, Issue 1
Call for Papers - IJORCS - Vol 4, Issue 1
IJORCS
 
Voice Recognition System using Template Matching
Voice Recognition System using Template MatchingVoice Recognition System using Template Matching
Voice Recognition System using Template Matching
IJORCS
 
Channel Aware Mac Protocol for Maximizing Throughput and Fairness
Channel Aware Mac Protocol for Maximizing Throughput and FairnessChannel Aware Mac Protocol for Maximizing Throughput and Fairness
Channel Aware Mac Protocol for Maximizing Throughput and Fairness
IJORCS
 
A Review and Analysis on Mobile Application Development Processes using Agile...
A Review and Analysis on Mobile Application Development Processes using Agile...A Review and Analysis on Mobile Application Development Processes using Agile...
A Review and Analysis on Mobile Application Development Processes using Agile...
IJORCS
 
Congestion Prediction and Adaptive Rate Adjustment Technique for Wireless Sen...
Congestion Prediction and Adaptive Rate Adjustment Technique for Wireless Sen...Congestion Prediction and Adaptive Rate Adjustment Technique for Wireless Sen...
Congestion Prediction and Adaptive Rate Adjustment Technique for Wireless Sen...
IJORCS
 
A Study of Routing Techniques in Intermittently Connected MANETs
A Study of Routing Techniques in Intermittently Connected MANETsA Study of Routing Techniques in Intermittently Connected MANETs
A Study of Routing Techniques in Intermittently Connected MANETs
IJORCS
 
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
IJORCS
 
An Adaptive Load Sharing Algorithm for Heterogeneous Distributed System
An Adaptive Load Sharing Algorithm for Heterogeneous Distributed SystemAn Adaptive Load Sharing Algorithm for Heterogeneous Distributed System
An Adaptive Load Sharing Algorithm for Heterogeneous Distributed System
IJORCS
 
The Design of Cognitive Social Simulation Framework using Statistical Methodo...
The Design of Cognitive Social Simulation Framework using Statistical Methodo...The Design of Cognitive Social Simulation Framework using Statistical Methodo...
The Design of Cognitive Social Simulation Framework using Statistical Methodo...
IJORCS
 
An Enhanced Framework for Improving Spatio-Temporal Queries for Global Positi...
An Enhanced Framework for Improving Spatio-Temporal Queries for Global Positi...An Enhanced Framework for Improving Spatio-Temporal Queries for Global Positi...
An Enhanced Framework for Improving Spatio-Temporal Queries for Global Positi...
IJORCS
 
A PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering AlgorithmA PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering Algorithm
IJORCS
 

More from IJORCS (20)

Help the Genetic Algorithm to Minimize the Urban Traffic on Intersections
Help the Genetic Algorithm to Minimize the Urban Traffic on IntersectionsHelp the Genetic Algorithm to Minimize the Urban Traffic on Intersections
Help the Genetic Algorithm to Minimize the Urban Traffic on Intersections
 
Call for Papers - IJORCS, Volume 4 Issue 4
Call for Papers - IJORCS, Volume 4 Issue 4Call for Papers - IJORCS, Volume 4 Issue 4
Call for Papers - IJORCS, Volume 4 Issue 4
 
Real-Time Multiple License Plate Recognition System
Real-Time Multiple License Plate Recognition SystemReal-Time Multiple License Plate Recognition System
Real-Time Multiple License Plate Recognition System
 
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A RetrospectiveFPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
 
Using Virtualization Technique to Increase Security and Reduce Energy Consump...
Using Virtualization Technique to Increase Security and Reduce Energy Consump...Using Virtualization Technique to Increase Security and Reduce Energy Consump...
Using Virtualization Technique to Increase Security and Reduce Energy Consump...
 
Algebraic Fault Attack on the SHA-256 Compression Function
Algebraic Fault Attack on the SHA-256 Compression FunctionAlgebraic Fault Attack on the SHA-256 Compression Function
Algebraic Fault Attack on the SHA-256 Compression Function
 
Enhancement of DES Algorithm with Multi State Logic
Enhancement of DES Algorithm with Multi State LogicEnhancement of DES Algorithm with Multi State Logic
Enhancement of DES Algorithm with Multi State Logic
 
Hybrid Simulated Annealing and Nelder-Mead Algorithm for Solving Large-Scale ...
Hybrid Simulated Annealing and Nelder-Mead Algorithm for Solving Large-Scale ...Hybrid Simulated Annealing and Nelder-Mead Algorithm for Solving Large-Scale ...
Hybrid Simulated Annealing and Nelder-Mead Algorithm for Solving Large-Scale ...
 
CFP. IJORCS, Volume 4 - Issue2
CFP. IJORCS, Volume 4 - Issue2CFP. IJORCS, Volume 4 - Issue2
CFP. IJORCS, Volume 4 - Issue2
 
Call for Papers - IJORCS - Vol 4, Issue 1
Call for Papers - IJORCS - Vol 4, Issue 1Call for Papers - IJORCS - Vol 4, Issue 1
Call for Papers - IJORCS - Vol 4, Issue 1
 
Voice Recognition System using Template Matching
Voice Recognition System using Template MatchingVoice Recognition System using Template Matching
Voice Recognition System using Template Matching
 
Channel Aware Mac Protocol for Maximizing Throughput and Fairness
Channel Aware Mac Protocol for Maximizing Throughput and FairnessChannel Aware Mac Protocol for Maximizing Throughput and Fairness
Channel Aware Mac Protocol for Maximizing Throughput and Fairness
 
A Review and Analysis on Mobile Application Development Processes using Agile...
A Review and Analysis on Mobile Application Development Processes using Agile...A Review and Analysis on Mobile Application Development Processes using Agile...
A Review and Analysis on Mobile Application Development Processes using Agile...
 
Congestion Prediction and Adaptive Rate Adjustment Technique for Wireless Sen...
Congestion Prediction and Adaptive Rate Adjustment Technique for Wireless Sen...Congestion Prediction and Adaptive Rate Adjustment Technique for Wireless Sen...
Congestion Prediction and Adaptive Rate Adjustment Technique for Wireless Sen...
 
A Study of Routing Techniques in Intermittently Connected MANETs
A Study of Routing Techniques in Intermittently Connected MANETsA Study of Routing Techniques in Intermittently Connected MANETs
A Study of Routing Techniques in Intermittently Connected MANETs
 
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
Improving the Efficiency of Spectral Subtraction Method by Combining it with ...
 
An Adaptive Load Sharing Algorithm for Heterogeneous Distributed System
An Adaptive Load Sharing Algorithm for Heterogeneous Distributed SystemAn Adaptive Load Sharing Algorithm for Heterogeneous Distributed System
An Adaptive Load Sharing Algorithm for Heterogeneous Distributed System
 
The Design of Cognitive Social Simulation Framework using Statistical Methodo...
The Design of Cognitive Social Simulation Framework using Statistical Methodo...The Design of Cognitive Social Simulation Framework using Statistical Methodo...
The Design of Cognitive Social Simulation Framework using Statistical Methodo...
 
An Enhanced Framework for Improving Spatio-Temporal Queries for Global Positi...
An Enhanced Framework for Improving Spatio-Temporal Queries for Global Positi...An Enhanced Framework for Improving Spatio-Temporal Queries for Global Positi...
An Enhanced Framework for Improving Spatio-Temporal Queries for Global Positi...
 
A PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering AlgorithmA PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering Algorithm
 

Recently uploaded

ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
Nguyen Thanh Tu Collection
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
EduSkills OECD
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
Celine George
 
How to Break the cycle of negative Thoughts
How to Break the cycle of negative ThoughtsHow to Break the cycle of negative Thoughts
How to Break the cycle of negative Thoughts
Col Mukteshwar Prasad
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
PedroFerreira53928
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 

Recently uploaded (20)

ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
How to Break the cycle of negative Thoughts
How to Break the cycle of negative ThoughtsHow to Break the cycle of negative Thoughts
How to Break the cycle of negative Thoughts
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 

A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING

  • 1. International Journal of Research in Computer Science eISSN 2249-8265 Volume 2 Issue 1 (2011) pp. 29-31 © White Globe Publications www.ijorcs.org A COMPARATIVE STUDY ON DISTANCE MEASURING APPROACHES FOR CLUSTERING Shraddha Pandit1, Suchita Gupta2 *Assistant Professor,Gyan Ganga Institute of Information Technology and Management,Bhopal Abstract: Clustering plays a vital role in the various can be measured and interpretation of the comparison; areas of research like Data Mining, Image Retrieval, And we conclude the report. Bio-computing and many a lot. Distance measure plays an important role in clustering data points. II. DISTANCE MEASURES AND ITS Choosing the right distance measure for a given SIGNIFICANCE dataset is a biggest challenge. In this paper, we study A cluster is a collection of data objects that are various distance measures and their effect on different similar to objects within the same cluster and clustering. This paper surveys existing distance dissimilar to those in other clusters. Similarity between measures for clustering and present a comparison two objects is calculated using a distance measure between them based on application domain, efficiency, [6].Since clustering forms groups; it can be used as a benefits and drawbacks. This comparison helps the pre-processing step for methods like classifications. researchers to take quick decision about which distance measure to use for clustering. We conclude Many distance measures have been proposed in this work by identifying trends and challenges of literature for data clustering. Most often, these research and development towards clustering. measures are metric functions; Manhattan distance, Keywords: Clustering, Distance Measure, Clustering Minkowski distance and Hamming distance. Jaccard Algorithms index, Cosine Similarity and Dice Coefficient are also popular distance measures. For non-numeric datasets, I. INTRODUCTION special distance functions are proposed. For example, edit distance is a well-known distance measure for text Clustering is an important data mining technique attributes. that has a wide range of applications in many areas like biology, medicine, market research and image In this section we briefly elaborate seven commonly analysis etc.It is the process of partitioning a set of used distance measures. objects into different subsets such that the data in each A. Euclidean Distance subset are similar to each other. In Cluster analysis Distance measure and clustering algorithm plays an The Euclidean distance or Euclidean metric is the important role [1]. ordinary distance between two points that one would measure with a ruler. It is the straight line distance An important step in any clustering is to select a between two points. distance measure, which will determine how similarity [1] of two elements is calculated. This will influence In a plane with p1 at (x1, y1) and p2 at (x2, y2), it is the shape of the clusters, as some elements may be √((x1 - x2)² + (y1 - y2)²). close to one another according to one distance and In N dimensions, the Euclidean distance between two farther away according to another. points p and q is √ (Σi=1N (pi-qi)²) where pi (or qi) is It is expected that distance between objects within a the coordinate of p (or q) in dimension i. cluster should be minimum and distance between B. Manhattan Distance objects within different clusters should be maximum. In this paper we compare different distance measures. The distance between two points measured along Comparison of these distance measures show that axes at right angles. In a plane with p1 at (x1, y1) and different distance measures behave differently p2 at (x2, y2), it is |x1 - x2| + |y1 - y2|. depending on application domain. The rest of the paper This is easily generalized to higher dimensions. is organized as follows: Manhattan distance is often used in integrated circuits In section II, we discuss distance measures and its where wires only run parallel to the X or Y axis. It is significance in nutshell; in section III, we present the also known as rectilinear distance, Minkowski’s [7] [3] comparison between these distances measures in L1 distance, taxi cab metric, or city block distance. TABLE I; In section IV, we describe how the accuracy www.ijorcs.org
  • 2. 30 Shraddha Pandit, Suchita Gupta C. Bit-Vector Distance G. Dice Index An N x N matrix Mb is calculated. Each point has d Dice's coefficient [11] (also known as the Dice dimensions and Mb (Pi, Pj) is determined as d-bit Coefficient) is a similarity measure related to the vector. This vector is obtained as follows: Jaccard index. For sets X and Y of keywords used in information If the numerical value of the xth dimension of point 2|𝑋 ∩ 𝑌| is greater than the numerical value of the xth retrieval, the coefficient may be defined as: 𝑆= |𝑋| + |𝑌| dimension of point pj,y then the bit x of Mb(Pi ,Pj) is set to 1 and bit x of Mb(Pj, Pi) is set to 0.All the bit vectors in Mb are then converted to integers. D. Hamming Distance When taken as string similarity measure, the coefficient may be calculated for two strings, x and y 𝑆= 2𝑛 𝑡 The Hamming distance between two strings of using bigrams as follows: equal length is the number of positions for which the 𝑛 𝑥 +𝑛 𝑦 corresponding symbols are different. Let x, y A^n. We define the Hamming distance between x and y, denoted dH(x, y), to be the number Where nt is the number of character bigrams found in of places where x and y are different. both strings, nx is the number of bigrams in string x and ny is the number of bigrams in string y. The Hamming distance [1] [6] can be interpreted as the number of bits which need to be changed III. ACCURACY AND RESULT INTERPRETATION (corrupted) to turn one string into other. Sometimes the number of characters is used instead of the number of In general, the larger the number of sub-clusters bits. Hamming distance can be seen as Manhattan produced by the clustering the more accurate the final distance between bit vectors. result is. However, too many sub-clusters will slow down the clustering. The above comparison table E. Jaccard Index compares 5 proximity measures. This comparison is The Jaccard index, also known as the Jaccard based on 4 different criteria which are generally similarity coefficient is a statistic used for comparing required to decide upon distance measure and the similarity and diversity of sample sets. clustering algorithms. The Jaccard coefficient [11] measures similarity All above comparisons are tested using standard between sample sets, and is defined as the size of the synthetic dataset generated by Syndeca [3] Software intersection divided by the size of the union of the and few of it is tested using open source clustering tool sample sets: CLUTO. √(A,B) = ‫׀‬A∩B‫׀‬ IV.CONCLUSION ‫׀‬AUB ‫׀‬ This paper surveys existing proximity measures for F. Cosine Index clustering and presents a comparison between them based on application domain, efficiency, benefits and It is a measure of similarity between two vectors of drawbacks. This comparison helps the researchers to n dimensions by finding the angle between them, often take quick decision about which distance measure to used to compare documents in text mining. Given two use for clustering. We ran our experiments on vectors of attributes, A and B, the cosine similarity synthetic datasets for its validation. Future work [11], θ, is represented using a dot product and 𝐴. 𝐵 involves running the experiments on larger and 𝜃 = 𝑎𝑟𝑐𝑐𝑜𝑠 magnitude as different kinds of datasets and extending our study to ‖𝐴‖‖𝐵‖ other proximity measures and clustering algorithms. V. REFERENCES For text matching, the attribute vectors A and B are [1] Ankita Vimal, Satyanarayana R Valluri, Kamalakar usually the tf-idf vectors of the documents. Since the Karlapalem , “An Experiment with Distance Measures angle, θ, is in the range of [0, π], the resulting for Clustering” , Technical Report: IIIT/TR/2008/132 similarity will yield the value of π as meaning exactly [2] John W. Ratcliff and David E. Metzener, Pattern opposite, π / 2 meaning independent, 0 meaning Matching: The Gestalt Approach, DR. DOBB’S exactly the same, with in-between values indicating JOURNAL, 1998, p. 46. intermediate similarities or dissimilarities [3] Martin Ester Hans-Peter Kriegel Jrg Sander and Xiaowei Xu, A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise, AAAI Press, 1996, pp. 226–231. www.ijorcs.org
  • 3. A Comparative Study on Distance Measuring Approaches for Clustering 31 TABLE I: COMPARISON OF DISTANCE MEASURE Distance Algorithms In Application Formula Benefits Drawbacks Measure which it is Used Area Results are greatly Easy to -Appl. √ ((x1 - x2) ² + (y1 - -Partitional influenced by Euclidean implement and Involving y2) ²) Algorithms variables that have Test Interval Data the largest value. - In health -K Modes psychology analysis Does not work well for image data, - AutoClass Document Classification - DNA -ROCK Analysis Easily Does not work well Partitional generalized to for image data and In Integrated Manhattan |x1 - x2| + |y1 - y2|. Algorithms higher Document Circuits dimensions Classification Handles both Continuous and Does not work well Cosine Ontology and Text Mining categorical for nominal data variables Ө = arccos A • B Similarity Graph based ‫ ׀׀‬A ‫ ׀׀ ׀׀‬B‫׀׀‬ Handles both √(A,B) = ‫׀‬A∩B‫׀‬ Continuous and Does not work well Document Jaccard Index Neural Network ‫׀‬AUB‫׀‬ categorical for nominal data classification variables [4] Bar-Hilel, A., Hertz, T., Shental, N., & Weinshall, D. (2003). Learning distance functions using equivalence [5] Fukunaga, K. (1990). Statistical pattern recognition. San Diego: Academic Press. 2nd edition. [6] Rui Xu, Donald Wunsch “Survey of Clustering Algorithms” IEEE Transactions on Neural Networks , VOL. 16, NO. 3, MAY 2005. [7] http://en.wikipedia.org/wiki/Data clustering [8] http://en.wikipedia.org/wiki/K-means [9] http://en.wikipedia.org/wiki/ DBSCAN [10] http://en.wikipedia.org/wiki/Jaccard index [11] http://en.wikipedia.org/wiki/Dice coefficient www.ijorcs.org