SlideShare a Scribd company logo
1 of 5
Download to read offline
Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications
                          (IJERA)              ISSN: 2248-9622        www.ijera.com
                                Vol. 2, Issue 4, July-August 2012, pp.144-148
                        Improved K-mean clustering with Mobile Agent

                                     Monali Patil, Vidya Chitre, Dipti Patil
                                    Department of Information Technology, K. C. COE, Thane
                                         Department of Computer Engineering, Bharati Vidyapidth, Navi Mumbai
                                    Department of Information Technology, PIIT, New Panvel
                                                       Mumbai University
                                                         Maharashtra
                                                             India



Abstract— The data stream model has recently                       A good clustering algorithm should be able to identity
attracted attention for its applicability to numerous types        clusters irrespective of their shapes. D-Stream, is a
of data, including telephone records, Web documents,               framework for clustering stream data using a density-based
and click streams. For analysis of such data, the ability          approach. The algorithm uses an online component which
to process the data in a single pass, or a small number of         maps each input data record into a grid and an offline
passes, while using little memory, is crucial. D-Stream            component which computes the grid density and clusters the
algorithm is an extended grid-based clustering algorithm           grids based on the density. The algorithm adopts a density
for different dimensional data streams. One of the                 decaying technique to
disadvantages of D-Stream algorithm is the large                   capture the dynamic changes of a data stream [4]. It exploits
number of grids, especially for high-dimensional data. So          the intricate relationships between the decay factor, data
we use the mobile agent to deal with the different                 density and cluster structure. An extended DBSCAN
dimensional data. This method can reduce the number of             algorithm was proposed by Cao F et.al called DenStream
the grids and the number of the update times. So using             algorithm in 2006 , which also used two-phase scheme the
mobile agent technique with data stream algorithm we               online component and the offline component. D-Stream
make the data stream clustering more simplified and                algorithm was an extended Grid-based method proposed by
flexible.                                                          Chen et.al in 2007, which also used two-phase scheme and
                                                                   emphasized the detection of the isolated points.
Keywords— D-Stream, Grid based clustering

I. INTRODUCTION                                                    II. THE CONCEPTS AND DEFINITIONS
          Since 1991 Weiser proposed pervasive computing ,                    Let A = (A1,A2, ..., Ad) be a bounded, all ordered
pervasive computing has been paid more attention to.               domain set, then we will denote a d-dimensional numerical
Pervasive application emphasizes on "real-time" reactions          space by S = A1 × A2 × ... × Ad, and one dimension of S be
because of the special requirement of the environment. With        represented Ad . Data stream is defined as a series of
the development of the distributed system and wireless             chronological order to reach set of data points, it will be
communication, streams of data are able to be obtained from        denoted by DS = (X1, X2 ,…, Xn, ...). It is assumed that the
high-bandwidth and data streams mining has become one of           data stream consists of a set of multi-dimensional records
the hot points in the pervasive application. Clustering can        X1,X2,…,Xn, …arriving at time stamps t1,t2, ... ,tn, ... Each
provide the real-time analyzing results in the pervasive           Xi is a multi-dimensional record containing d dimensions
environment. There are many mature methods can be                  which are denoted by Xi = ( xi1 , xi2 ,…, xid ). the Xi =
extended to cluster the data streams. Guha et.al gave an           (xi1, xi2, ..., xid) represents a data point (xi1 _ A1, xi2 _ A2,
extended k-means clustering method to deal with data               ..., xid _ Ad) [4].
streams in 2000. In 2003 Babcock et.al improved Guha's
method using the data structure called exponential                 A. DEFINITION: 1. THE GRID
histogram. O'Challagham et.al proposed STREAM                          The each dimension of the numerical space S is divided
algorithm, which improved k-means using LSEARCH and                evenly into ξ intervals, each interval is the dimension on a
they got the more qualitative results. CluStream algorithm         right-open interval of paragraph [li ,hi], length will be
was given by Aggarwal et.al, which extended the clustering         denoted by ln, where i is followed by numbered 1, 2, ..., ξ.
feature concept of BIRCH algorithm.                                The numerical space S is divided into ξd disjoint grid, each
    Clustering is the process of grouping the data into classes    grid g is a d-dimensional hypercube in the S , can be
or clusters so that objects within a cluster have high
similarity but are very dissimilar to objects in other clusters.




                                                                                                                   144 | P a g e
Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications
                           (IJERA)              ISSN: 2248-9622        www.ijera.com
                                 Vol. 2, Issue 4, July-August 2012, pp.144-148
expressed as ( v1 , v2 ,…, vd ) , where vi is the ith (1 ≤ i ≤ d)     Dmin(t2 +1,t3);
dimensional interval on the corresponding number in grid g.           (2) Dmin(t1 ,t1)=1/N;
                                                                      (3) lim t->∞ (t1 ,t )= N(1-λ)
                                                                      (4) Let t1<t2, then Dmin(t1 ,tc)= Dmin(t2 ,tc).
B. DEFINITION: 2 DATA DENSITY, ATTENUATION FACTOR

   The data arriving at time tc be denoted by p, its density        III.     D-STREAM ALGORITHM
D(p,t) is a change in the amount of time t, is defined as:                    D-Stream is a new framework for clustering real-
D(p,t) =λt-tc , where λ (0<λ<1) is a constant, called the           time stream data. There are two components: the online
attenuation factor.                                                 component and the offline component. The online
   Definition: 3 Grid density                                       component reads and maps each input data record into a
   Set up the grid g at time t contains the data items are p1,      grid, and the offline component computes the density of
p2,…,pN , their time stamp (referring to the arrival time),         each grid and clusters the grids using a density-based
respectively, t1, t2, …, tn , thus at time t, the density of grid   algorithm in every fixed interval of time (gap). Actually, in
is defined as the sum of the data density of all data items in      the offline component there is the general grid-based
the grid g at the time t , we will denote the density of grid by    clustering algorithm. In the online component the algorithm
   D(g,t) = ∑i=1 n λ t-ti                                           adopts a density decaying technique to capture the dynamic
                                                                    changes of a data stream and exploits the intricate
    Definition: 4 Dense grid                                        relationships between the decay factor, data density and
    In the t moment, if D(g,t)≥ c/N(1-λ)= Dm grid g , then          cluster structure.
    claimed the grid as a dense grid , where C is C ≥ 1,and is          Once the densities of grids are updated in the gap, the
a                                                                   clustering procedure is similar to the general grid-based
    constant (in this algorithm is set up C = 3).                   clustering. This algorithm can find arbitrary shape clusters
                                                                    by a systematic visit of the neighbors and it is not necessary
   Definition 5. Eigenvector of grid                                to specify the number of cluster like k-means algorithm.
   The vector is defined as a six-tuple ( Fx ,Qx , P, density       However, there is a higher requirement that if the dimension
,class , tg ), Where: wherein Fx and Qx each correspond to a        is high, the number of the grids must be very large.
vector of d entries. The definition of each of these entries is         For a data stream, at each time step, the online
as follows:                                                         component of D-Stream continuously reads a new data
   For each dimension, the sum of the squares of the data           record, place the multi-dimensional data into a
values is maintained in Qx. Thus, Qx contains d values. The         corresponding discretized density grid in the multi-
p-th entry of Qx is equal to                                        dimensional space, and update the characteristic vector of
   ∑ j€grid (xij p )2                                               the density grid (Lines 5-8 in figure 1). The offline
              g                                                     component dynamically adjusts the clusters every gap time
   For each dimension, the sum of the data values is                steps, where gap is an integer parameter. After the first gap,
maintained in Fx. Thus, Fx contains d values. The p-th entry        the algorithm generates the initial cluster (Lines 9-11). Then,
of Fx is equal to                                                   the algorithm periodically removes sporadic grids and
                      ∑ j€grid (xij p )                             regulates the clusters (Lines 12-15).
                                g
   Where P is a d-dimensional vector and the grid                   1.procedure D-Stream
coordinates, it will be denoted by the corresponding                2. tc = 0;
dimension's interval numbers in grid. Where density is              3. initialize an empty hash table grid list;
density values of the grid at time tg , class is the grid cluster   4. while data stream is active do
labeling belongs to cluster ,tg is a time grid data points          5. read record x = (x1, x2, · · · , xd);
recently.                                                           6. determine the density grid g that contains x;
                                                                    7. if(g not in grid list) insert g to grid list;
    Definition 6. Density threshold function                        8. update the characteristic vector of g;
                                                                    9. if tc == gap then
   Let tc be the current time, and tg be the time of the last       10. call initial clustering(grid list);
increased data in grid g. Then the density threshold function       11. end if
   is defined as:                                                   12. if tc mod gap == 0 then
   D min (tg, tc) = (1-λ tc-tg+1)/N(1-λ)                            13. detect and remove sporadic grids from grid list;
                                                                    14. call adjust clustering(grid list);
    where N is the total number of the grid space.                  15. end if
    Its property is as follows:                                     16. tc = tc + 1;
    (1) If t1≤t2≤t3 then Dmin(t1 ,t3)= Dmin(t1 ,t2) λt3-t2+         17. end while




                                                                                                                  145 | P a g e
Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications
                          (IJERA)              ISSN: 2248-9622        www.ijera.com
                                Vol. 2, Issue 4, July-August 2012, pp.144-148
18. end procedure.                                          IV.     MOBILE AGENT TECHNIQUES
                                                                              An agent is a program, which can act independently
   The algorithm adopts a density decaying technique to              and autonomously. A mobile agent is an agent, which can
capture the dynamic changes of a data stream. Exploiting the         move in various networks under their own control, migrating
intricate relationships between the decay factor, data density       from one host to another host and interacting with other
and cluster structure, the algorithm can efficiently and             agents and resources on each. When a mobile agent's task is
effectively generate and adjust the clusters in real time. We        done, it can return to its home or accept other arrangement.
assume that the input data has d dimensions, and each input          The structures of the mobile agent are different for the
data record is defined within the space,                             different system. However, there are two parts generally
S= S1*S2*……*Sd                                                       speaking, which are MA (Mobile Agent) and MAE (Mobile
where Si is the definition space for the ith dimension. In D-        Agent Environment). MAE realizes the migration of the
Stream, we partition the d−dimensional space S into density          mobile agents among the hosts using agent transfer protocol
grids. Suppose for each dimension, its space Si, i = 1, · · · , d    and distributes the executing environment and service
is divided into pi partitions as                                     interface to them [1]. It also controls the security,
                                                                     communication, basic service and so on. MA is above the
    Si= Si,1U Si,2 U…… U Si,pi,                                      MAE and it can move into another MAE. MA can
                                                                     communicate with other MA using agent communication
then the data space S is partitioned into N = πdi=1 Pi density       language.
grids. For a density grid g that is composed of
S1,j1 *S2,j2 ……….. * Sd,jd , ji = 1 ……….. pi, we denote              V. K-MEAN ALGORITHM
it as g = (j1,j2, …………. , jd).                                                I/p: D= {t1,t2,t3,…….,tn}          // set of
                                                                     elements.
A data record x = (x1, x2, · · · , xd) can be mapped to a             k                                 // no. of desired
density grid g(x) as follows:                                        clusters
g(x) = (j1,j2,….., jd) where xi € Si,ji                               o/p: K                   // set of clusters

For each data record x, we assign it a density coefficient                  1) Assign initial values for means m1,
which decreases with as x ages. In fact, if x arrives at time tc,               m2, m3, ……………, mk;
we define its time stamp T(x) = tc, and its density coefficient             2) Repeat
D(x, t) at time t is                                                                 a. Assign each item ti to the
                                                                                         cluster which has the closest
          D(x,t) = λt-T(x) = λt-tc ,                                                     mean;
                                                                                     b. Calculate new mean for each
where λ€ (0, 1) is a constant called the decay factor. Grid                              cluster;
Density for a grid g, at a given time t, let E(g, t) be the set of          Until convergence criteria is met;
data records that are map tog at or before time t, its density
D(g, t) is defined as the sum of the density coefficients of all     VI.   THE NEW FRAMEWORK BASED ON MOBILE
data records that mapped to g. Namely, the density of g at t
                                                                        AGENT
is:
                                                                               We propose a new framework based on the mobile
                                                                     agent to gather and cluster the data. The Fig.2 shows that the
D(g,t) = ∑ x€ E(g, t) D(x,t)
                                                                     structure of the framework based on mobile agent. There are
                                                                     three sorts of the mobile agents: the main agent, the
          The density of any grid is constantly changing.
                                                                     subagent and the result agent. The main agent is in the
However, we have found that it is unnecessary to update the
                                                                     online component and it gathers the data in every gap. It
density values of all data records and grids at every time step.
                                                                     distributes the each dimension data to the right subagent. If
Instead, it is possible to update the density of a grid only
                                                                     we have d-dimensional data, we need about d subagents.
when a new data record is mapped to that grid [3]. For each
                                                                     However, during clustering the data, if the data is not
grid, the time when it receives the last data record should be
                                                                     changed in a certain dimension, the main agent will
recorded so that the density of the grid can be updated
                                                                     distribute another dimension data to this subagent. This
according to the following result when a new data record
                                                                     method can save the space.
arrives at the grid.




                                                                                                                    146 | P a g e
Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications
                          (IJERA)              ISSN: 2248-9622        www.ijera.com
                                Vol. 2, Issue 4, July-August 2012, pp.144-148




                                                                                                                        
                                                                    Fig. 2 Statistical Analysis of k-mean and mobile agent
Fig. 1 The structure of the data streams clustering based on
mobile agent
                                                                     For statistical analysis we have consider different input
                                                                 files for k-mean and mobile agent algorithm. For k-mean
Every subagent contains a clustering algorithm model.
                                                                 algorithm input datasets are of only 2attributes, and for
When a subagent receives the data from the main agent, it
                                                                 mobile agent project we have consider dataset up to 28
detects whether there is change of the data. If there is no
                                                                 attributes and for handling dynamic input entry EXT folder
change in this dimension, it will send the message to the
                                                                 is made in which different files save datasets, and that files
result agent to use the last time result. Then send the
                                                                 are renamed according to time interval.
message to the main agent, the main agent will distribute
another dimension data to it. If there is change in this
dimension at this gap, the subagent creates new grids to
cluster the data. This method avoids creating large number
grids every time. All subagents execute clustering task
parallel.


     VII.     PROCESS OF THE CLUSTERING
                   ALGORITHM
          The initial clustering part is not changed and we
improve the clustering part and add the programming of the
mobile agent including the main agent, the subagent and the
result agent. The clustering algorithm is described as follow:
Step l: Main agent gets the data in a gap time.
Step2: Main agent distributes the data into the subagents.
Each subagent gets one dimension data. If the data is d-
dimension, we need d subagents.
Step3: The subagent detects the data. If there is no change
compared with the last gap, go to step5. Or else go to step4.
Step4: The subagent creates new grids for the new data.
Step5: The subagent sends the result to the result agent and
sends the message to the main agent to require the new task.        Fig. 3 Graphical results of k-mean and mobile agent
Step6: All the results of the subagent are collected in the
result agent. The result agent computes these results and gets
the final result. Go to step 1.

VIII.    RESULTS                                                 IX.      CONCLUSION
                                                                          The data stream model is relevant to new classes of
                                                                 applications involving massive data sets, such as web click
                                                                 stream analysis and multimedia data analysis. To handle
                                                                 such massive input data in this paper we suggested Density




                                                                                                              147 | P a g e
Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications
                             (IJERA)              ISSN: 2248-9622        www.ijera.com
                                   Vol. 2, Issue 4, July-August 2012, pp.144-148
stream algorithm with mobile agent technique and clustering
is carried out by k-mean. We improve D-Stream algorithm
using mobile agent in form of main agent, sub agent and
result agent, so computing techniques to cluster data in the
offline and online component become parallel and fast.


REFERENCES

[1]      Zhang Hongyan, Liu Xiyu, “ A Data Streams
         Clustering     Algorithm Using DNA Computing
         Techniques Based on Mobile Agent”, IEEE,
         2000, Science and Technology Project of Shandong
         Education Bureau.

[2]      Sudipto Guha, Nina Mishrat, Rajeev Motwani,
         Liadan O’Callaghan, “Clustering Data Streams”,
         Department of Computer Science, Stanford
         University, 2000.

[3]      `Yixin Chen, Li Tu, “Density-Based Clustering for
         Real-Time Stream Data”, Washington University in
         St. Louis St. Louis, USA.

[4]      WeiserMark The Computer for the Twenty-first
         Century [J]. Scientific American, 1991,265 (3): 94-
         100.

[5]      Luo Ke ,Wang Lin, ” Data Streams Clustering
         Algorithm Based on Grid and Particle Swarm
         Optimization”, 2009 International Forum on
         computer Science-Technology and Applications.




                                                                                                   148 | P a g e

More Related Content

What's hot

K means clustering
K means clusteringK means clustering
K means clusteringkeshav goyal
 
Finding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsFinding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsCSCJournals
 
TOWARDS MORE ACCURATE CLUSTERING METHOD BY USING DYNAMIC TIME WARPING
TOWARDS MORE ACCURATE CLUSTERING METHOD BY USING DYNAMIC TIME WARPINGTOWARDS MORE ACCURATE CLUSTERING METHOD BY USING DYNAMIC TIME WARPING
TOWARDS MORE ACCURATE CLUSTERING METHOD BY USING DYNAMIC TIME WARPINGijdkp
 
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
K-MEDOIDS CLUSTERING  USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...K-MEDOIDS CLUSTERING  USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...ijscmc
 
A Novel Algorithm for Design Tree Classification with PCA
A Novel Algorithm for Design Tree Classification with PCAA Novel Algorithm for Design Tree Classification with PCA
A Novel Algorithm for Design Tree Classification with PCAEditor Jacotech
 
A NOBEL HYBRID APPROACH FOR EDGE DETECTION
A NOBEL HYBRID APPROACH FOR EDGE  DETECTIONA NOBEL HYBRID APPROACH FOR EDGE  DETECTION
A NOBEL HYBRID APPROACH FOR EDGE DETECTIONijcses
 
Grid based method & model based clustering method
Grid based method & model based clustering methodGrid based method & model based clustering method
Grid based method & model based clustering methodrajshreemuthiah
 
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Tanjarul Islam Mishu
 
Parallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using openclParallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using opencleSAT Publishing House
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...butest
 
A frame work for clustering time evolving data
A frame work for clustering time evolving dataA frame work for clustering time evolving data
A frame work for clustering time evolving dataiaemedu
 
Power quality disturbances classification using complex wavelet phasor space ...
Power quality disturbances classification using complex wavelet phasor space ...Power quality disturbances classification using complex wavelet phasor space ...
Power quality disturbances classification using complex wavelet phasor space ...journalBEEI
 
E XTENDED F AST S EARCH C LUSTERING A LGORITHM : W IDELY D ENSITY C LUSTERS ,...
E XTENDED F AST S EARCH C LUSTERING A LGORITHM : W IDELY D ENSITY C LUSTERS ,...E XTENDED F AST S EARCH C LUSTERING A LGORITHM : W IDELY D ENSITY C LUSTERS ,...
E XTENDED F AST S EARCH C LUSTERING A LGORITHM : W IDELY D ENSITY C LUSTERS ,...csandit
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETScsandit
 

What's hot (16)

K means clustering
K means clusteringK means clustering
K means clustering
 
Finding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsFinding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster Results
 
TOWARDS MORE ACCURATE CLUSTERING METHOD BY USING DYNAMIC TIME WARPING
TOWARDS MORE ACCURATE CLUSTERING METHOD BY USING DYNAMIC TIME WARPINGTOWARDS MORE ACCURATE CLUSTERING METHOD BY USING DYNAMIC TIME WARPING
TOWARDS MORE ACCURATE CLUSTERING METHOD BY USING DYNAMIC TIME WARPING
 
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
K-MEDOIDS CLUSTERING  USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...K-MEDOIDS CLUSTERING  USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
 
A Novel Algorithm for Design Tree Classification with PCA
A Novel Algorithm for Design Tree Classification with PCAA Novel Algorithm for Design Tree Classification with PCA
A Novel Algorithm for Design Tree Classification with PCA
 
A NOBEL HYBRID APPROACH FOR EDGE DETECTION
A NOBEL HYBRID APPROACH FOR EDGE  DETECTIONA NOBEL HYBRID APPROACH FOR EDGE  DETECTION
A NOBEL HYBRID APPROACH FOR EDGE DETECTION
 
Grid based method & model based clustering method
Grid based method & model based clustering methodGrid based method & model based clustering method
Grid based method & model based clustering method
 
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
 
Parallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using openclParallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using opencl
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
 
Clique
Clique Clique
Clique
 
A frame work for clustering time evolving data
A frame work for clustering time evolving dataA frame work for clustering time evolving data
A frame work for clustering time evolving data
 
Journal_IJABME
Journal_IJABMEJournal_IJABME
Journal_IJABME
 
Power quality disturbances classification using complex wavelet phasor space ...
Power quality disturbances classification using complex wavelet phasor space ...Power quality disturbances classification using complex wavelet phasor space ...
Power quality disturbances classification using complex wavelet phasor space ...
 
E XTENDED F AST S EARCH C LUSTERING A LGORITHM : W IDELY D ENSITY C LUSTERS ,...
E XTENDED F AST S EARCH C LUSTERING A LGORITHM : W IDELY D ENSITY C LUSTERS ,...E XTENDED F AST S EARCH C LUSTERING A LGORITHM : W IDELY D ENSITY C LUSTERS ,...
E XTENDED F AST S EARCH C LUSTERING A LGORITHM : W IDELY D ENSITY C LUSTERS ,...
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
 

Viewers also liked

Licenza edilizia assentita 2009 vassallo antonietta c.e.s. n.12 2009[1]
Licenza edilizia assentita 2009 vassallo antonietta c.e.s. n.12 2009[1]Licenza edilizia assentita 2009 vassallo antonietta c.e.s. n.12 2009[1]
Licenza edilizia assentita 2009 vassallo antonietta c.e.s. n.12 2009[1]lasvolta
 
04b tcp ip-ip v06
04b tcp ip-ip v0604b tcp ip-ip v06
04b tcp ip-ip v06arifhusni92
 
Parque nacional do itatiaia 2
Parque nacional do itatiaia 2Parque nacional do itatiaia 2
Parque nacional do itatiaia 2GuiaDidatico
 
1ª division provincial benjamin sala 1ª v
1ª division provincial benjamin sala 1ª v1ª division provincial benjamin sala 1ª v
1ª division provincial benjamin sala 1ª vpepelumila
 
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"Jansäter Kommunikation AB
 
Projeto do Estatuto da Família
Projeto do Estatuto da FamíliaProjeto do Estatuto da Família
Projeto do Estatuto da FamíliaPaulo Veras
 
Nimble, CRM Social Media, nueva App para #iOS
Nimble, CRM Social Media, nueva App para #iOSNimble, CRM Social Media, nueva App para #iOS
Nimble, CRM Social Media, nueva App para #iOSSoluciona Facil
 
PEC 116/2011 - "PEC das Cadeiras Negras"
PEC 116/2011 - "PEC das Cadeiras Negras"PEC 116/2011 - "PEC das Cadeiras Negras"
PEC 116/2011 - "PEC das Cadeiras Negras"Paulo Veras
 

Viewers also liked (20)

Gf2411221128
Gf2411221128Gf2411221128
Gf2411221128
 
Fx2410731080
Fx2410731080Fx2410731080
Fx2410731080
 
Gp2411851191
Gp2411851191Gp2411851191
Gp2411851191
 
Oj2423402345
Oj2423402345Oj2423402345
Oj2423402345
 
Oo2423882391
Oo2423882391Oo2423882391
Oo2423882391
 
Fr2410361039
Fr2410361039Fr2410361039
Fr2410361039
 
Om2423682376
Om2423682376Om2423682376
Om2423682376
 
Fo2410191024
Fo2410191024Fo2410191024
Fo2410191024
 
Fa3110171022
Fa3110171022Fa3110171022
Fa3110171022
 
Cm31588593
Cm31588593Cm31588593
Cm31588593
 
Licenza edilizia assentita 2009 vassallo antonietta c.e.s. n.12 2009[1]
Licenza edilizia assentita 2009 vassallo antonietta c.e.s. n.12 2009[1]Licenza edilizia assentita 2009 vassallo antonietta c.e.s. n.12 2009[1]
Licenza edilizia assentita 2009 vassallo antonietta c.e.s. n.12 2009[1]
 
04b tcp ip-ip v06
04b tcp ip-ip v0604b tcp ip-ip v06
04b tcp ip-ip v06
 
Torrox
TorroxTorrox
Torrox
 
Parque nacional do itatiaia 2
Parque nacional do itatiaia 2Parque nacional do itatiaia 2
Parque nacional do itatiaia 2
 
1ª division provincial benjamin sala 1ª v
1ª division provincial benjamin sala 1ª v1ª division provincial benjamin sala 1ª v
1ª division provincial benjamin sala 1ª v
 
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
Davidshalls Höstfest 24/9 - fotoutställning "Vi är Davidshallare"
 
Amiizade
AmiizadeAmiizade
Amiizade
 
Projeto do Estatuto da Família
Projeto do Estatuto da FamíliaProjeto do Estatuto da Família
Projeto do Estatuto da Família
 
Nimble, CRM Social Media, nueva App para #iOS
Nimble, CRM Social Media, nueva App para #iOSNimble, CRM Social Media, nueva App para #iOS
Nimble, CRM Social Media, nueva App para #iOS
 
PEC 116/2011 - "PEC das Cadeiras Negras"
PEC 116/2011 - "PEC das Cadeiras Negras"PEC 116/2011 - "PEC das Cadeiras Negras"
PEC 116/2011 - "PEC das Cadeiras Negras"
 

Similar to T24144148

Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Editor IJARCET
 
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Editor IJARCET
 
Reduct generation for the incremental data using rough set theory
Reduct generation for the incremental data using rough set theoryReduct generation for the incremental data using rough set theory
Reduct generation for the incremental data using rough set theorycsandit
 
Parallel Implementation of K Means Clustering on CUDA
Parallel Implementation of K Means Clustering on CUDAParallel Implementation of K Means Clustering on CUDA
Parallel Implementation of K Means Clustering on CUDAprithan
 
Increasing electrical grid stability classification performance using ensemb...
Increasing electrical grid stability classification performance  using ensemb...Increasing electrical grid stability classification performance  using ensemb...
Increasing electrical grid stability classification performance using ensemb...IJECEIAES
 
Content addressable network(can)
Content addressable network(can)Content addressable network(can)
Content addressable network(can)Amit Dahal
 
Clustering Algorithms for Data Stream
Clustering Algorithms for Data StreamClustering Algorithms for Data Stream
Clustering Algorithms for Data StreamIRJET Journal
 
Compressive Data Gathering using NACS in Wireless Sensor Network
Compressive Data Gathering using NACS in Wireless Sensor NetworkCompressive Data Gathering using NACS in Wireless Sensor Network
Compressive Data Gathering using NACS in Wireless Sensor NetworkIRJET Journal
 
A novel technique for speech encryption based on k-means clustering and quant...
A novel technique for speech encryption based on k-means clustering and quant...A novel technique for speech encryption based on k-means clustering and quant...
A novel technique for speech encryption based on k-means clustering and quant...journalBEEI
 
Bivariatealgebraic integerencoded arai algorithm for
Bivariatealgebraic integerencoded arai algorithm forBivariatealgebraic integerencoded arai algorithm for
Bivariatealgebraic integerencoded arai algorithm foreSAT Publishing House
 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) ijceronline
 
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGFAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGIJNSA Journal
 
Graphical Visualization of MAC Traces for Wireless Ad-hoc Networks Simulated ...
Graphical Visualization of MAC Traces for Wireless Ad-hoc Networks Simulated ...Graphical Visualization of MAC Traces for Wireless Ad-hoc Networks Simulated ...
Graphical Visualization of MAC Traces for Wireless Ad-hoc Networks Simulated ...idescitation
 
A comprehensive survey of contemporary
A comprehensive survey of contemporaryA comprehensive survey of contemporary
A comprehensive survey of contemporaryprjpublications
 
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGFAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGIJNSA Journal
 
Parallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingParallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingIRJET Journal
 
Dijkstra Shortest Path Visualization
Dijkstra Shortest Path VisualizationDijkstra Shortest Path Visualization
Dijkstra Shortest Path VisualizationIRJET Journal
 

Similar to T24144148 (20)

Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147
 
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147
 
Ijebea14 272
Ijebea14 272Ijebea14 272
Ijebea14 272
 
Reduct generation for the incremental data using rough set theory
Reduct generation for the incremental data using rough set theoryReduct generation for the incremental data using rough set theory
Reduct generation for the incremental data using rough set theory
 
Parallel Implementation of K Means Clustering on CUDA
Parallel Implementation of K Means Clustering on CUDAParallel Implementation of K Means Clustering on CUDA
Parallel Implementation of K Means Clustering on CUDA
 
Increasing electrical grid stability classification performance using ensemb...
Increasing electrical grid stability classification performance  using ensemb...Increasing electrical grid stability classification performance  using ensemb...
Increasing electrical grid stability classification performance using ensemb...
 
Content addressable network(can)
Content addressable network(can)Content addressable network(can)
Content addressable network(can)
 
Clustering Algorithms for Data Stream
Clustering Algorithms for Data StreamClustering Algorithms for Data Stream
Clustering Algorithms for Data Stream
 
Compressive Data Gathering using NACS in Wireless Sensor Network
Compressive Data Gathering using NACS in Wireless Sensor NetworkCompressive Data Gathering using NACS in Wireless Sensor Network
Compressive Data Gathering using NACS in Wireless Sensor Network
 
A novel technique for speech encryption based on k-means clustering and quant...
A novel technique for speech encryption based on k-means clustering and quant...A novel technique for speech encryption based on k-means clustering and quant...
A novel technique for speech encryption based on k-means clustering and quant...
 
D031202018023
D031202018023D031202018023
D031202018023
 
Bivariatealgebraic integerencoded arai algorithm for
Bivariatealgebraic integerencoded arai algorithm forBivariatealgebraic integerencoded arai algorithm for
Bivariatealgebraic integerencoded arai algorithm for
 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGFAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
 
Graphical Visualization of MAC Traces for Wireless Ad-hoc Networks Simulated ...
Graphical Visualization of MAC Traces for Wireless Ad-hoc Networks Simulated ...Graphical Visualization of MAC Traces for Wireless Ad-hoc Networks Simulated ...
Graphical Visualization of MAC Traces for Wireless Ad-hoc Networks Simulated ...
 
A comprehensive survey of contemporary
A comprehensive survey of contemporaryA comprehensive survey of contemporary
A comprehensive survey of contemporary
 
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGFAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
 
ME Synopsis
ME SynopsisME Synopsis
ME Synopsis
 
Parallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingParallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive Indexing
 
Dijkstra Shortest Path Visualization
Dijkstra Shortest Path VisualizationDijkstra Shortest Path Visualization
Dijkstra Shortest Path Visualization
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

T24144148

  • 1. Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, July-August 2012, pp.144-148 Improved K-mean clustering with Mobile Agent Monali Patil, Vidya Chitre, Dipti Patil Department of Information Technology, K. C. COE, Thane Department of Computer Engineering, Bharati Vidyapidth, Navi Mumbai Department of Information Technology, PIIT, New Panvel Mumbai University Maharashtra India Abstract— The data stream model has recently A good clustering algorithm should be able to identity attracted attention for its applicability to numerous types clusters irrespective of their shapes. D-Stream, is a of data, including telephone records, Web documents, framework for clustering stream data using a density-based and click streams. For analysis of such data, the ability approach. The algorithm uses an online component which to process the data in a single pass, or a small number of maps each input data record into a grid and an offline passes, while using little memory, is crucial. D-Stream component which computes the grid density and clusters the algorithm is an extended grid-based clustering algorithm grids based on the density. The algorithm adopts a density for different dimensional data streams. One of the decaying technique to disadvantages of D-Stream algorithm is the large capture the dynamic changes of a data stream [4]. It exploits number of grids, especially for high-dimensional data. So the intricate relationships between the decay factor, data we use the mobile agent to deal with the different density and cluster structure. An extended DBSCAN dimensional data. This method can reduce the number of algorithm was proposed by Cao F et.al called DenStream the grids and the number of the update times. So using algorithm in 2006 , which also used two-phase scheme the mobile agent technique with data stream algorithm we online component and the offline component. D-Stream make the data stream clustering more simplified and algorithm was an extended Grid-based method proposed by flexible. Chen et.al in 2007, which also used two-phase scheme and emphasized the detection of the isolated points. Keywords— D-Stream, Grid based clustering I. INTRODUCTION II. THE CONCEPTS AND DEFINITIONS Since 1991 Weiser proposed pervasive computing , Let A = (A1,A2, ..., Ad) be a bounded, all ordered pervasive computing has been paid more attention to. domain set, then we will denote a d-dimensional numerical Pervasive application emphasizes on "real-time" reactions space by S = A1 × A2 × ... × Ad, and one dimension of S be because of the special requirement of the environment. With represented Ad . Data stream is defined as a series of the development of the distributed system and wireless chronological order to reach set of data points, it will be communication, streams of data are able to be obtained from denoted by DS = (X1, X2 ,…, Xn, ...). It is assumed that the high-bandwidth and data streams mining has become one of data stream consists of a set of multi-dimensional records the hot points in the pervasive application. Clustering can X1,X2,…,Xn, …arriving at time stamps t1,t2, ... ,tn, ... Each provide the real-time analyzing results in the pervasive Xi is a multi-dimensional record containing d dimensions environment. There are many mature methods can be which are denoted by Xi = ( xi1 , xi2 ,…, xid ). the Xi = extended to cluster the data streams. Guha et.al gave an (xi1, xi2, ..., xid) represents a data point (xi1 _ A1, xi2 _ A2, extended k-means clustering method to deal with data ..., xid _ Ad) [4]. streams in 2000. In 2003 Babcock et.al improved Guha's method using the data structure called exponential A. DEFINITION: 1. THE GRID histogram. O'Challagham et.al proposed STREAM The each dimension of the numerical space S is divided algorithm, which improved k-means using LSEARCH and evenly into ξ intervals, each interval is the dimension on a they got the more qualitative results. CluStream algorithm right-open interval of paragraph [li ,hi], length will be was given by Aggarwal et.al, which extended the clustering denoted by ln, where i is followed by numbered 1, 2, ..., ξ. feature concept of BIRCH algorithm. The numerical space S is divided into ξd disjoint grid, each Clustering is the process of grouping the data into classes grid g is a d-dimensional hypercube in the S , can be or clusters so that objects within a cluster have high similarity but are very dissimilar to objects in other clusters. 144 | P a g e
  • 2. Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, July-August 2012, pp.144-148 expressed as ( v1 , v2 ,…, vd ) , where vi is the ith (1 ≤ i ≤ d) Dmin(t2 +1,t3); dimensional interval on the corresponding number in grid g. (2) Dmin(t1 ,t1)=1/N; (3) lim t->∞ (t1 ,t )= N(1-λ) (4) Let t1<t2, then Dmin(t1 ,tc)= Dmin(t2 ,tc). B. DEFINITION: 2 DATA DENSITY, ATTENUATION FACTOR The data arriving at time tc be denoted by p, its density III. D-STREAM ALGORITHM D(p,t) is a change in the amount of time t, is defined as: D-Stream is a new framework for clustering real- D(p,t) =λt-tc , where λ (0<λ<1) is a constant, called the time stream data. There are two components: the online attenuation factor. component and the offline component. The online Definition: 3 Grid density component reads and maps each input data record into a Set up the grid g at time t contains the data items are p1, grid, and the offline component computes the density of p2,…,pN , their time stamp (referring to the arrival time), each grid and clusters the grids using a density-based respectively, t1, t2, …, tn , thus at time t, the density of grid algorithm in every fixed interval of time (gap). Actually, in is defined as the sum of the data density of all data items in the offline component there is the general grid-based the grid g at the time t , we will denote the density of grid by clustering algorithm. In the online component the algorithm D(g,t) = ∑i=1 n λ t-ti adopts a density decaying technique to capture the dynamic changes of a data stream and exploits the intricate Definition: 4 Dense grid relationships between the decay factor, data density and In the t moment, if D(g,t)≥ c/N(1-λ)= Dm grid g , then cluster structure. claimed the grid as a dense grid , where C is C ≥ 1,and is Once the densities of grids are updated in the gap, the a clustering procedure is similar to the general grid-based constant (in this algorithm is set up C = 3). clustering. This algorithm can find arbitrary shape clusters by a systematic visit of the neighbors and it is not necessary Definition 5. Eigenvector of grid to specify the number of cluster like k-means algorithm. The vector is defined as a six-tuple ( Fx ,Qx , P, density However, there is a higher requirement that if the dimension ,class , tg ), Where: wherein Fx and Qx each correspond to a is high, the number of the grids must be very large. vector of d entries. The definition of each of these entries is For a data stream, at each time step, the online as follows: component of D-Stream continuously reads a new data For each dimension, the sum of the squares of the data record, place the multi-dimensional data into a values is maintained in Qx. Thus, Qx contains d values. The corresponding discretized density grid in the multi- p-th entry of Qx is equal to dimensional space, and update the characteristic vector of ∑ j€grid (xij p )2 the density grid (Lines 5-8 in figure 1). The offline g component dynamically adjusts the clusters every gap time For each dimension, the sum of the data values is steps, where gap is an integer parameter. After the first gap, maintained in Fx. Thus, Fx contains d values. The p-th entry the algorithm generates the initial cluster (Lines 9-11). Then, of Fx is equal to the algorithm periodically removes sporadic grids and ∑ j€grid (xij p ) regulates the clusters (Lines 12-15). g Where P is a d-dimensional vector and the grid 1.procedure D-Stream coordinates, it will be denoted by the corresponding 2. tc = 0; dimension's interval numbers in grid. Where density is 3. initialize an empty hash table grid list; density values of the grid at time tg , class is the grid cluster 4. while data stream is active do labeling belongs to cluster ,tg is a time grid data points 5. read record x = (x1, x2, · · · , xd); recently. 6. determine the density grid g that contains x; 7. if(g not in grid list) insert g to grid list; Definition 6. Density threshold function 8. update the characteristic vector of g; 9. if tc == gap then Let tc be the current time, and tg be the time of the last 10. call initial clustering(grid list); increased data in grid g. Then the density threshold function 11. end if is defined as: 12. if tc mod gap == 0 then D min (tg, tc) = (1-λ tc-tg+1)/N(1-λ) 13. detect and remove sporadic grids from grid list; 14. call adjust clustering(grid list); where N is the total number of the grid space. 15. end if Its property is as follows: 16. tc = tc + 1; (1) If t1≤t2≤t3 then Dmin(t1 ,t3)= Dmin(t1 ,t2) λt3-t2+ 17. end while 145 | P a g e
  • 3. Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, July-August 2012, pp.144-148 18. end procedure. IV. MOBILE AGENT TECHNIQUES An agent is a program, which can act independently The algorithm adopts a density decaying technique to and autonomously. A mobile agent is an agent, which can capture the dynamic changes of a data stream. Exploiting the move in various networks under their own control, migrating intricate relationships between the decay factor, data density from one host to another host and interacting with other and cluster structure, the algorithm can efficiently and agents and resources on each. When a mobile agent's task is effectively generate and adjust the clusters in real time. We done, it can return to its home or accept other arrangement. assume that the input data has d dimensions, and each input The structures of the mobile agent are different for the data record is defined within the space, different system. However, there are two parts generally S= S1*S2*……*Sd speaking, which are MA (Mobile Agent) and MAE (Mobile where Si is the definition space for the ith dimension. In D- Agent Environment). MAE realizes the migration of the Stream, we partition the d−dimensional space S into density mobile agents among the hosts using agent transfer protocol grids. Suppose for each dimension, its space Si, i = 1, · · · , d and distributes the executing environment and service is divided into pi partitions as interface to them [1]. It also controls the security, communication, basic service and so on. MA is above the Si= Si,1U Si,2 U…… U Si,pi, MAE and it can move into another MAE. MA can communicate with other MA using agent communication then the data space S is partitioned into N = πdi=1 Pi density language. grids. For a density grid g that is composed of S1,j1 *S2,j2 ……….. * Sd,jd , ji = 1 ……….. pi, we denote V. K-MEAN ALGORITHM it as g = (j1,j2, …………. , jd). I/p: D= {t1,t2,t3,…….,tn} // set of elements. A data record x = (x1, x2, · · · , xd) can be mapped to a k // no. of desired density grid g(x) as follows: clusters g(x) = (j1,j2,….., jd) where xi € Si,ji o/p: K // set of clusters For each data record x, we assign it a density coefficient 1) Assign initial values for means m1, which decreases with as x ages. In fact, if x arrives at time tc, m2, m3, ……………, mk; we define its time stamp T(x) = tc, and its density coefficient 2) Repeat D(x, t) at time t is a. Assign each item ti to the cluster which has the closest D(x,t) = λt-T(x) = λt-tc , mean; b. Calculate new mean for each where λ€ (0, 1) is a constant called the decay factor. Grid cluster; Density for a grid g, at a given time t, let E(g, t) be the set of Until convergence criteria is met; data records that are map tog at or before time t, its density D(g, t) is defined as the sum of the density coefficients of all VI. THE NEW FRAMEWORK BASED ON MOBILE data records that mapped to g. Namely, the density of g at t AGENT is: We propose a new framework based on the mobile agent to gather and cluster the data. The Fig.2 shows that the D(g,t) = ∑ x€ E(g, t) D(x,t) structure of the framework based on mobile agent. There are three sorts of the mobile agents: the main agent, the The density of any grid is constantly changing. subagent and the result agent. The main agent is in the However, we have found that it is unnecessary to update the online component and it gathers the data in every gap. It density values of all data records and grids at every time step. distributes the each dimension data to the right subagent. If Instead, it is possible to update the density of a grid only we have d-dimensional data, we need about d subagents. when a new data record is mapped to that grid [3]. For each However, during clustering the data, if the data is not grid, the time when it receives the last data record should be changed in a certain dimension, the main agent will recorded so that the density of the grid can be updated distribute another dimension data to this subagent. This according to the following result when a new data record method can save the space. arrives at the grid. 146 | P a g e
  • 4. Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, July-August 2012, pp.144-148 Fig. 2 Statistical Analysis of k-mean and mobile agent Fig. 1 The structure of the data streams clustering based on mobile agent For statistical analysis we have consider different input files for k-mean and mobile agent algorithm. For k-mean Every subagent contains a clustering algorithm model. algorithm input datasets are of only 2attributes, and for When a subagent receives the data from the main agent, it mobile agent project we have consider dataset up to 28 detects whether there is change of the data. If there is no attributes and for handling dynamic input entry EXT folder change in this dimension, it will send the message to the is made in which different files save datasets, and that files result agent to use the last time result. Then send the are renamed according to time interval. message to the main agent, the main agent will distribute another dimension data to it. If there is change in this dimension at this gap, the subagent creates new grids to cluster the data. This method avoids creating large number grids every time. All subagents execute clustering task parallel. VII. PROCESS OF THE CLUSTERING ALGORITHM The initial clustering part is not changed and we improve the clustering part and add the programming of the mobile agent including the main agent, the subagent and the result agent. The clustering algorithm is described as follow: Step l: Main agent gets the data in a gap time. Step2: Main agent distributes the data into the subagents. Each subagent gets one dimension data. If the data is d- dimension, we need d subagents. Step3: The subagent detects the data. If there is no change compared with the last gap, go to step5. Or else go to step4. Step4: The subagent creates new grids for the new data. Step5: The subagent sends the result to the result agent and sends the message to the main agent to require the new task. Fig. 3 Graphical results of k-mean and mobile agent Step6: All the results of the subagent are collected in the result agent. The result agent computes these results and gets the final result. Go to step 1. VIII. RESULTS IX. CONCLUSION The data stream model is relevant to new classes of applications involving massive data sets, such as web click stream analysis and multimedia data analysis. To handle such massive input data in this paper we suggested Density 147 | P a g e
  • 5. Monali Patil, Vidya Chitre, Dipti Patil / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, July-August 2012, pp.144-148 stream algorithm with mobile agent technique and clustering is carried out by k-mean. We improve D-Stream algorithm using mobile agent in form of main agent, sub agent and result agent, so computing techniques to cluster data in the offline and online component become parallel and fast. REFERENCES [1] Zhang Hongyan, Liu Xiyu, “ A Data Streams Clustering Algorithm Using DNA Computing Techniques Based on Mobile Agent”, IEEE, 2000, Science and Technology Project of Shandong Education Bureau. [2] Sudipto Guha, Nina Mishrat, Rajeev Motwani, Liadan O’Callaghan, “Clustering Data Streams”, Department of Computer Science, Stanford University, 2000. [3] `Yixin Chen, Li Tu, “Density-Based Clustering for Real-Time Stream Data”, Washington University in St. Louis St. Louis, USA. [4] WeiserMark The Computer for the Twenty-first Century [J]. Scientific American, 1991,265 (3): 94- 100. [5] Luo Ke ,Wang Lin, ” Data Streams Clustering Algorithm Based on Grid and Particle Swarm Optimization”, 2009 International Forum on computer Science-Technology and Applications. 148 | P a g e