SlideShare a Scribd company logo
1 of 4
Download to read offline
International Journal of Innovative Research in Advanced Engineering (IJIRAE)
Volume 1 Issue 1 (March 2014)
ISSN: 2278-2311 IJIRAE | http://ijirae.com
© 2014, IJIRAE – All Rights Reserved Page - 1
Extended Fuzzy C-Means with Random Sampling
Techniques for Clustering Large Data
Dhanesh Kothari
Department of Information Technology
Sri Venkateswara College of Engineering
dhaneshkanak@gmail.com
S. Thavasi Narayanan
Department of Information Technology
Sri Venkateswara College of Engineering
thavasinarayanan@gmail.com
K. Kiruthika Devi
Department of Information Technology
Sri Venkateswara College of Engineering
kiruthika@svce.ac.in
Abstract – Big data are any data that you cannot load into your computer’s primary memory. Clustering is a primary
task in pattern recognition and data mining. We need algorithms that scale well with the data size. The former
implementation, literal Fuzzy C-Means is linear or serialized. FCM algorithm attempts to partition a finite collection
of n elements into collection of c fuzzy clusters. So, given a finite set of data, this algorithm returns a list of c cluster
centers. However it doesn't scale well and slows down with increase in the size of data and is thus impractical and
sometimes undesirable. In this paper, we propose an extended version of fuzzy c-means clustering algorithm by means
of various random sampling techniques to study which method scales well for large or very large data.
Keywords - Big data, very large data, fuzzy c-means (FCM) clustering, sampling, probability
I. INTRODUCTION
CLUSTERING [1] is a form of data analysis in which data are separated into groups such that data with similar
properties are in same group. Clustering is used in pattern recognition, compression, image processing[9] and many other
fields as mentioned in references [6] and [10]. IoT or Internet of Things, personal computing has produced an incredible
amount of data ranging from few terabytes to Exabyte of data. These big data are still unloadable in the modern day
computers. There are many approaches applied for clustering the big data. One technique is distributed clustering, where
the data is distributed to various systems and the clustering is done independently on each system. Then, the average
cluster centers are evaluated based on the cluster centers from all systems. This method is however quite expensive and
result data may not be accurate.
There are three main types of partitions: crisp, fuzzy and possibilistic. Crisp or hard partitioning group data into mutually
disjoint subsets of points and the partition element, uki = 1 if oi is labelled k, else 0. Fuzzy partitions are more flexible
than crisp portioning in that each object can belong to more than one cluster group. This is particularly useful in identify
the online behavior of person for advertisements. The set of all fuzzy c-partitions is
1
Each column of the fuzzy partition U must sum to 1, thus ensuring that every object has unit total membership in a
partition. Many algorithms have been proposed to cluster the big data but only few of them use fuzzy partitions. One
such is fuzzy c-means clustering. However, its literal schemes simply cluster the entire dataset. In contrast, extended
clustering [2] schemes apply clustering algorithm to subset (or representative) of this dataset and non-iteratively extend
the sample data result to the full dataset. One such algorithm is random sampled extended fuzzy c-means algorithm. In
this paper, we compare three different sampling techniques to compute fuzzy partitions of the vector data: simple random
sampling, systematic random sampling and multistage random sampling. Section II describes the simple random
sampling based extended fuzzy c-means, and Section III proposes systematic and multistage sampling for extending
fuzzy c-means algorithm. Section IV presents the results. Section V contains our conclusions and some ideas for future
enhancements.
II. SIMPLE RANDOM SAMPLED FUZZY C-MEANS ALGORITHM
The FCM algorithm is defined as the optimization of the squared-error distortion
(2)
Where U is (c x n) partition matrix, V={v1,v2,…., vc} is the set of c clusters centers in Rd , m>1 is the fuzzification
constant, and ||.||A is any inner product A-induced norm, i.e., ||x||A= . We use here Euclidean norm (A=I) in our
example. Algorithm 1 outlines the steps for the literal FCM algorithm. We initialize the algorithm by choosing c objects
randomly from the dataset as initial cluster centers. The most commonly used sampling technique to address big data is
simple random sampling [4].
International Journal of Innovative Research in Advanced Engineering (IJIRAE)
Volume 1 Issue 1 (March 2014)
ISSN: 2278-2311 IJIRAE | http://ijirae.com
© 2014, IJIRAE – All Rights Reserved Page - 2
In this method, each record in the full set has equal probability of being selected into the sample set, i.e. 1/N, where N is
the total number of records. The FCM algorithm then can be applied onto this sample data to get membership matrix and
cluster centers. We can then extend the V to complete dataset. This can be called as rseFCM. rseFCM is shown in
algorithm 2.
The step 3 is an important step. In this we extend the result of the FCM algorithm on sample dataset to the complete
dataset. It is a non-iterative step as it is run only once in the end of extended FCM algorithm.
Algorithm 1: LFCM
Input: X, c, m
Output: U, V
Initialize V
while do
(3)
(4)
Algorithm 2: rseFCM to approximately minimize Jm(U, V)
Input: X, c, m
Output: U, V
1 Sample ns objects from X without replacement, denoted Xs
2 Us, V = LFCM(Xs, c, m)
3 Extend the partition (Us, V) to X, , using Eq. (3), giving (U, V)
III. MULTISTAGE SAMPLING AND SYSTEMATIC RANDOM SAMPLING
The are other sampling methods[8]
other than commonly used simple random sampling method. Here, we
introduce multistage sampling and systematic random sampling techniques. ultistage sampling is a complex form of
cluster sampling. It involves dividing the population into groups or strata based on the best feature[3]
. With the multistage
sample, there is equal chance of selecting each unit from within a particular group. In some cases, several levels of group
selection may be applied before the final sample elements are reached. It should be noted that multistage and stratified
sampling are different from each other as in multistage, selected clusters are studied. The multistage sampling is given in
algorithm 3.
Algorithm 3: Multistage sampling based extended FCM algorithm
Input: X, c, m
Output: Xs, U, V
1 Organize the sampling process into stages where the unit of analysis is systematically grouped.
2 Select a sampling technique for each stage.
3 Systematically apply the sampling technique to each stage until the unit of analysis has been selected.
4 Execute rseFCM steps 2 and 3.
Systematic random sampling is a type of probability sampling technique. With the systematic random sample,
there is an equal chance of selecting each unit from within the population when creating the sample. Here, rather than
selecting referring to random number tables to select the cases (as in simple random sampling) in your sample, you select
the units directly from the sample frame. Systematic sampling algorithm is given in Algorithm 4.
International Journal of Innovative Research in Advanced Engineering (IJIRAE)
Volume 1 Issue 1 (March 2014)
ISSN: 2278-2311 IJIRAE | http://ijirae.com
© 2014, IJIRAE – All Rights Reserved Page - 3
Algorithm 4: Systematic Random Sampling based extended FCM algorithm
Input: X, c, m
Output: Xs, U, V
1 Calculate the sampling interval, k
where, N is the population size and n is the sample size
2 Select a random start between 1 and sampling interval
3 Repeatedly add sampling interval to select subsequent record.
4 Execute rseFCM steps 2 and 3.
IV. EXPERIMENTS
We performed one set of experiment. In the experiment we compare the performance of the big data FCM
algorithm on the real data or data with ground truth (Collected data) as well as on synthetic dataset. For FCM, we
initialize V by randomly choosing c objects as the initial cluster centers. We fix the value of ϵ = 10-3
and fuzzifier m= 1.7.
All code were written in JAVA eclipse environment.
A. Evaluation Criter a
We judge the performance of the VL FCM algorithm using run-time criteria. It is computed for 20 independent
runs. We present statistical comparisons of the algorithms’ performance over the 20 experiments.
Speed-up or Run-Time: This criterion represents an actual run-time comparison. When the LFCM is available, speedup is
defined as tfull/tsample, where these values are in milliseconds to compute the cluster centers V for data and the membership
matrix, U.
B. Performance on loadable data
We compared the performance of VL FCM algorithm on the following datasets[7]
. BIRCH-Sets (n=100000, c=
12, d=2): These data are composed of 100000 2-d vectors, with a visual preferred grouping into 15 clusters. See table I
for result.
Forest-Data (n=581012, c=7, d=54): These data are composed of cartographic variables that are obtained from
United States Geological Survey and United State Forest Service. There are 10 quantifiable variables, and 40 binary soil
types, and one cover type. See table II for result.
We take 10% as sample size and run-time results (seconds) of various sampling technique based FCM algorithm
is shown in Fig.1. It is clear from the results that systematic sampling shows far better scalability than simple random
sampling. Multistage sampling can be applied only on datasets with more than two variables. It has a run-time almost
near to simple random sampling.
C. Performance on unloadable data
For, our last experiment, we demonstrate the VL vector data FCM algorithm on an unloadable dataset. The
performance of the VL algorithms can be measured by how well they find the clusters and measuring the run-time in
milliseconds. The SUSY (Super Symmetry) data are composed of 5 million objects. In order to show how the VL
algorithms could be used to process this data in PC we tested at sample size 0.01 % and 0.001% of data. Table III shows
the results of this experiment. Unexpectedly, multistage sampling was faster than systematic sampling. Simple random
sampling was a very expensive method among the three sampling techniques tested on the data inputs.
Fig. 1 Performance bar chart of the various sampling techniques on Birch-Sets and Forest Data
International Journal of Innovative Research in Advanced Engineering (IJIRAE)
Volume 1 Issue 1 (March 2014)
ISSN: 2278-2311 IJIRAE | http://ijirae.com
© 2014, IJIRAE – All Rights Reserved Page - 4
TABLE I
Results of VL FCM Algorithm on Birch Data (Avg. Of 20 Runs)
10% sample size
Simple Random Sampling Systematic Random Sampling Literal FCM
Run-Time
(seconds)
No. of
Iterations
Run-Time
(seconds)
No. of
Iterations
Run-Time
(seconds)
No. of
Iterations
33.287 63 29.475 57 442.135 83
TABLE II
Results of VL FCM Algorithm on Forest Data (Avg. Of 20 Runs)
10% sample size
Simple Random Sampling Systematic Random Sampling Multistage Sampling Literal FCM
Run-Time
(seconds)
No. of
Iterations
Run-Time
(seconds)
No. of
Iterations
Run-Time
(seconds)
No. of
Iterations
Run-Time
(seconds)
No. of
Iterations
170.269 76 90.822 49 163.169 68 528.644 39
TABLE III
Results of VL Data Algorithm on SUSY Unloadable Data
Sample Size
0.001% 0.01%
Sampling Technique Run-time (Sec) No. Of Iterations Run-time (Sec) No. of Iterations
Multistage Sampling 37.050 22 43.526 21
Simple Random Sampling 460.043 43 2086.614 99
Systematic Random
Sampling
64.015 36 372.497 76
V. DISCUSSION AND CONCLUSIONS
As this paper shows, there are many ways to sample the big data for VL FCM algorithm. Systematic random sampling
had the best performance on large but loadable dataset. Multistage was better in case of very large or unloadable dataset.
Systematic and multistage show better scalability compared to traditionally used simple random sampling for extending
fuzzy c-means algorithm. In the future, we will continue to develop and investigate scalable solutions for VL fuzzy
clustering.
REFERENCES
[1] Timothy C. Havens, James C. Bezdek, Christopher Leckie, Lawrence O. Hall, and Marimuthu Palaniswami, "Fuzzy
c-Means Algorithms for Very Large Data," in IEEE Transactions On Fuzzy Systems, Vol. 20, No. 6, December
2012.
[2] Richard J. Hathway and James C. Bezdek, “Extending fuzzy and probabilistic clustering to very large data sets”,
Computational Statistics and Data Analysis, vol. 51, 2006
[3] V. Schwammle and O. Jensen, “A simple and fast method to determine the parameters for fuzzy c-means cluster
analysis,” Bioinformatics, vol. 26, no. 22, pp. 2841–2848, 2010.
[4] John F. Kolen and Tim Hutcheson, “Reducing the Time Complexity of the Fuzzy c-means Algorithm”, IEEE
Transactions On Fuzzy Systems, Vol. 10, No. 2, April 2002.
[5] James C. Bezdek, Robert L. Cannon, Jitendra V. Dave, “Efficient Implementation of the fuzzy c-Means Clustering
Algorithms”, Vol. PAMI-8, No. 2, March,1986.
[6] D. Dembele and P. Kastner, “Fuzzy c-means method for clustering microarray data,” Bioinformatics, vol. 19, pp.
973–980, 2003.
[7] http://archive.ics.uci.edu/ml/datasets/
[8] http://dissertation.laerd.com/sampling-strategy.php
[9] N. Pal and J. Bezdek, “Complexity reduction for “large image” processing,” IEEE Trans. Syst., Man, Cybern., vol.
32, no. 5, pp. 598–611, Oct.2002.
[10]J. Bezdek and R. Hathaway, “Convergence of alternating optmization,” Nueral, Parallel, Sci. Comput., vol. 11, no.
4, pp. 351–368, Dec. 2003.

More Related Content

What's hot

Dynamic clustering algorithm using fuzzy c means
Dynamic clustering algorithm using fuzzy c meansDynamic clustering algorithm using fuzzy c means
Dynamic clustering algorithm using fuzzy c meansWrishin Bhattacharya
 
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...ijcsit
 
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication SystemsA New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systemsrahulmonikasharma
 
MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...
MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...
MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...Daksh Raj Chopra
 
Efficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input DataEfficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input Dataymelka
 
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORMDESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORMsipij
 
Analysis of computational
Analysis of computationalAnalysis of computational
Analysis of computationalcsandit
 
An efficient hardware logarithm generator with modified quasi-symmetrical app...
An efficient hardware logarithm generator with modified quasi-symmetrical app...An efficient hardware logarithm generator with modified quasi-symmetrical app...
An efficient hardware logarithm generator with modified quasi-symmetrical app...IJECEIAES
 
A comparative study of three validities computation methods for multimodel ap...
A comparative study of three validities computation methods for multimodel ap...A comparative study of three validities computation methods for multimodel ap...
A comparative study of three validities computation methods for multimodel ap...IJECEIAES
 
Application of interpolation in CSE
Application of interpolation in CSEApplication of interpolation in CSE
Application of interpolation in CSEMd. Tanvir Hossain
 
An optimal general type-2 fuzzy controller for Urban Traffic Network
An optimal general type-2 fuzzy controller for Urban Traffic NetworkAn optimal general type-2 fuzzy controller for Urban Traffic Network
An optimal general type-2 fuzzy controller for Urban Traffic NetworkISA Interchange
 
A Novel Methodology to Implement Optimization Algorithms in Machine Learning
A Novel Methodology to Implement Optimization Algorithms in Machine LearningA Novel Methodology to Implement Optimization Algorithms in Machine Learning
A Novel Methodology to Implement Optimization Algorithms in Machine LearningVenkata Karthik Gullapalli
 
An Intelligent Skin Color Detection Method based on Fuzzy C-Means with Big Da...
An Intelligent Skin Color Detection Method based on Fuzzy C-Means with Big Da...An Intelligent Skin Color Detection Method based on Fuzzy C-Means with Big Da...
An Intelligent Skin Color Detection Method based on Fuzzy C-Means with Big Da...CrimsonpublishersTTEFT
 
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSOPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSZac Darcy
 
Extended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithmExtended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithmIJMIT JOURNAL
 

What's hot (18)

Dynamic clustering algorithm using fuzzy c means
Dynamic clustering algorithm using fuzzy c meansDynamic clustering algorithm using fuzzy c means
Dynamic clustering algorithm using fuzzy c means
 
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...
 
Self Organizing Maps
Self Organizing MapsSelf Organizing Maps
Self Organizing Maps
 
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication SystemsA New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
 
MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...
MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...
MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...
 
Efficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input DataEfficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input Data
 
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORMDESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
 
A045030105
A045030105A045030105
A045030105
 
Analysis of computational
Analysis of computationalAnalysis of computational
Analysis of computational
 
An efficient hardware logarithm generator with modified quasi-symmetrical app...
An efficient hardware logarithm generator with modified quasi-symmetrical app...An efficient hardware logarithm generator with modified quasi-symmetrical app...
An efficient hardware logarithm generator with modified quasi-symmetrical app...
 
A comparative study of three validities computation methods for multimodel ap...
A comparative study of three validities computation methods for multimodel ap...A comparative study of three validities computation methods for multimodel ap...
A comparative study of three validities computation methods for multimodel ap...
 
Application of interpolation in CSE
Application of interpolation in CSEApplication of interpolation in CSE
Application of interpolation in CSE
 
An optimal general type-2 fuzzy controller for Urban Traffic Network
An optimal general type-2 fuzzy controller for Urban Traffic NetworkAn optimal general type-2 fuzzy controller for Urban Traffic Network
An optimal general type-2 fuzzy controller for Urban Traffic Network
 
A Novel Methodology to Implement Optimization Algorithms in Machine Learning
A Novel Methodology to Implement Optimization Algorithms in Machine LearningA Novel Methodology to Implement Optimization Algorithms in Machine Learning
A Novel Methodology to Implement Optimization Algorithms in Machine Learning
 
An Intelligent Skin Color Detection Method based on Fuzzy C-Means with Big Da...
An Intelligent Skin Color Detection Method based on Fuzzy C-Means with Big Da...An Intelligent Skin Color Detection Method based on Fuzzy C-Means with Big Da...
An Intelligent Skin Color Detection Method based on Fuzzy C-Means with Big Da...
 
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSOPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
 
Planes de negocios paper
Planes de negocios paperPlanes de negocios paper
Planes de negocios paper
 
Extended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithmExtended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithm
 

Viewers also liked

Viewers also liked (10)

Kasem
KasemKasem
Kasem
 
LynnLogo_OPT1_REV
LynnLogo_OPT1_REVLynnLogo_OPT1_REV
LynnLogo_OPT1_REV
 
Ap1 проект
Ap1 проектAp1 проект
Ap1 проект
 
Pooooooooooooooooooooooooooooooooooo
PoooooooooooooooooooooooooooooooooooPooooooooooooooooooooooooooooooooooo
Pooooooooooooooooooooooooooooooooooo
 
Design of an Effective Method for Image Retrieval
Design of an Effective Method for Image RetrievalDesign of an Effective Method for Image Retrieval
Design of an Effective Method for Image Retrieval
 
Knowledge management b 47
Knowledge management b 47Knowledge management b 47
Knowledge management b 47
 
6666666666
66666666666666666666
6666666666
 
Как сделать город дружественным к пожилым людям: подходы Международной сети г...
Как сделать город дружественным к пожилым людям: подходы Международной сети г...Как сделать город дружественным к пожилым людям: подходы Международной сети г...
Как сделать город дружественным к пожилым людям: подходы Международной сети г...
 
Delay Sensitive Packet Scheduling Algorithm for MANETs by Cross Layer
Delay Sensitive Packet Scheduling Algorithm for MANETs by Cross LayerDelay Sensitive Packet Scheduling Algorithm for MANETs by Cross Layer
Delay Sensitive Packet Scheduling Algorithm for MANETs by Cross Layer
 
STUDY OF BEHAVIOUR OF GEO-POLYMER CONCRETE WITH RESPECT TO ITS MECHANICAL PRO...
STUDY OF BEHAVIOUR OF GEO-POLYMER CONCRETE WITH RESPECT TO ITS MECHANICAL PRO...STUDY OF BEHAVIOUR OF GEO-POLYMER CONCRETE WITH RESPECT TO ITS MECHANICAL PRO...
STUDY OF BEHAVIOUR OF GEO-POLYMER CONCRETE WITH RESPECT TO ITS MECHANICAL PRO...
 

Similar to Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data

Comparison Between Clustering Algorithms for Microarray Data Analysis
Comparison Between Clustering Algorithms for Microarray Data AnalysisComparison Between Clustering Algorithms for Microarray Data Analysis
Comparison Between Clustering Algorithms for Microarray Data AnalysisIOSR Journals
 
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...IRJET Journal
 
AMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLTAMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLTIRJET Journal
 
AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...
AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...
AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...IJNSA Journal
 
A F AULT D IAGNOSIS M ETHOD BASED ON S EMI - S UPERVISED F UZZY C-M EANS...
A F AULT  D IAGNOSIS  M ETHOD BASED ON  S EMI - S UPERVISED  F UZZY  C-M EANS...A F AULT  D IAGNOSIS  M ETHOD BASED ON  S EMI - S UPERVISED  F UZZY  C-M EANS...
A F AULT D IAGNOSIS M ETHOD BASED ON S EMI - S UPERVISED F UZZY C-M EANS...IJCI JOURNAL
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETScsandit
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGijcsit
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGijcsit
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGAIRCC Publishing Corporation
 
Failure prediction of e-banking application system using adaptive neuro fuzzy...
Failure prediction of e-banking application system using adaptive neuro fuzzy...Failure prediction of e-banking application system using adaptive neuro fuzzy...
Failure prediction of e-banking application system using adaptive neuro fuzzy...IJECEIAES
 
Applications of Artificial Neural Networks in Cancer Prediction
Applications of Artificial Neural Networks in Cancer PredictionApplications of Artificial Neural Networks in Cancer Prediction
Applications of Artificial Neural Networks in Cancer PredictionIRJET Journal
 
Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016ijcsbi
 
IRJET- An Effective Brain Tumor Segmentation using K-means Clustering
IRJET-  	  An Effective Brain Tumor Segmentation using K-means ClusteringIRJET-  	  An Effective Brain Tumor Segmentation using K-means Clustering
IRJET- An Effective Brain Tumor Segmentation using K-means ClusteringIRJET Journal
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsIJDKP
 
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...IRJET Journal
 
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIComprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIijtsrd
 
Fuzzy clustering1
Fuzzy clustering1Fuzzy clustering1
Fuzzy clustering1abc
 
Real-time PMU Data Recovery Application Based on Singular Value Decomposition
Real-time PMU Data Recovery Application Based on Singular Value DecompositionReal-time PMU Data Recovery Application Based on Singular Value Decomposition
Real-time PMU Data Recovery Application Based on Singular Value DecompositionPower System Operation
 

Similar to Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data (20)

Comparison Between Clustering Algorithms for Microarray Data Analysis
Comparison Between Clustering Algorithms for Microarray Data AnalysisComparison Between Clustering Algorithms for Microarray Data Analysis
Comparison Between Clustering Algorithms for Microarray Data Analysis
 
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
 
AMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLTAMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLT
 
AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...
AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...
AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...
 
A F AULT D IAGNOSIS M ETHOD BASED ON S EMI - S UPERVISED F UZZY C-M EANS...
A F AULT  D IAGNOSIS  M ETHOD BASED ON  S EMI - S UPERVISED  F UZZY  C-M EANS...A F AULT  D IAGNOSIS  M ETHOD BASED ON  S EMI - S UPERVISED  F UZZY  C-M EANS...
A F AULT D IAGNOSIS M ETHOD BASED ON S EMI - S UPERVISED F UZZY C-M EANS...
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
 
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MININGA SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
A SURVEY OF CLUSTERING ALGORITHMS IN ASSOCIATION RULES MINING
 
Failure prediction of e-banking application system using adaptive neuro fuzzy...
Failure prediction of e-banking application system using adaptive neuro fuzzy...Failure prediction of e-banking application system using adaptive neuro fuzzy...
Failure prediction of e-banking application system using adaptive neuro fuzzy...
 
Applications of Artificial Neural Networks in Cancer Prediction
Applications of Artificial Neural Networks in Cancer PredictionApplications of Artificial Neural Networks in Cancer Prediction
Applications of Artificial Neural Networks in Cancer Prediction
 
V01 i010401
V01 i010401V01 i010401
V01 i010401
 
Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016
 
IRJET- An Effective Brain Tumor Segmentation using K-means Clustering
IRJET-  	  An Effective Brain Tumor Segmentation using K-means ClusteringIRJET-  	  An Effective Brain Tumor Segmentation using K-means Clustering
IRJET- An Effective Brain Tumor Segmentation using K-means Clustering
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithms
 
Data clustering
Data clustering Data clustering
Data clustering
 
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
 
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIComprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
 
Fuzzy clustering1
Fuzzy clustering1Fuzzy clustering1
Fuzzy clustering1
 
Real-time PMU Data Recovery Application Based on Singular Value Decomposition
Real-time PMU Data Recovery Application Based on Singular Value DecompositionReal-time PMU Data Recovery Application Based on Singular Value Decomposition
Real-time PMU Data Recovery Application Based on Singular Value Decomposition
 

More from AM Publications

DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...AM Publications
 
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...AM Publications
 
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGNTHE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGNAM Publications
 
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...AM Publications
 
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...AM Publications
 
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISESANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISESAM Publications
 
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS AM Publications
 
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...AM Publications
 
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITIONHMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITIONAM Publications
 
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...AM Publications
 
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...AM Publications
 
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...AM Publications
 
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...AM Publications
 
OPTICAL CHARACTER RECOGNITION USING RBFNN
OPTICAL CHARACTER RECOGNITION USING RBFNNOPTICAL CHARACTER RECOGNITION USING RBFNN
OPTICAL CHARACTER RECOGNITION USING RBFNNAM Publications
 
DETECTION OF MOVING OBJECT
DETECTION OF MOVING OBJECTDETECTION OF MOVING OBJECT
DETECTION OF MOVING OBJECTAM Publications
 
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENTSIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENTAM Publications
 
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...AM Publications
 
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...AM Publications
 
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY AM Publications
 

More from AM Publications (20)

DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
DEVELOPMENT OF TODDLER FAMILY CADRE TRAINING BASED ON ANDROID APPLICATIONS IN...
 
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
TESTING OF COMPOSITE ON DROP-WEIGHT IMPACT TESTING AND DAMAGE IDENTIFICATION ...
 
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGNTHE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
THE USE OF FRACTAL GEOMETRY IN TILING MOTIF DESIGN
 
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
TWO-DIMENSIONAL INVERSION FINITE ELEMENT MODELING OF MAGNETOTELLURIC DATA: CA...
 
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
USING THE GENETIC ALGORITHM TO OPTIMIZE LASER WELDING PARAMETERS FOR MARTENSI...
 
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISESANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
ANALYSIS AND DESIGN E-MARKETPLACE FOR MICRO, SMALL AND MEDIUM ENTERPRISES
 
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
REMOTE SENSING AND GEOGRAPHIC INFORMATION SYSTEMS
 
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
EVALUATE THE STRAIN ENERGY ERROR FOR THE LASER WELD BY THE H-REFINEMENT OF TH...
 
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITIONHMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
HMM APPLICATION IN ISOLATED WORD SPEECH RECOGNITION
 
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
PEDESTRIAN DETECTION IN LOW RESOLUTION VIDEOS USING A MULTI-FRAME HOG-BASED D...
 
INTELLIGENT BLIND STICK
INTELLIGENT BLIND STICKINTELLIGENT BLIND STICK
INTELLIGENT BLIND STICK
 
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
EFFECT OF SILICON - RUBBER (SR) SHEETS AS AN ALTERNATIVE FILTER ON HIGH AND L...
 
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
UTILIZATION OF IMMUNIZATION SERVICES AMONG CHILDREN UNDER FIVE YEARS OF AGE I...
 
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
REPRESENTATION OF THE BLOCK DATA ENCRYPTION ALGORITHM IN AN ANALYTICAL FORM F...
 
OPTICAL CHARACTER RECOGNITION USING RBFNN
OPTICAL CHARACTER RECOGNITION USING RBFNNOPTICAL CHARACTER RECOGNITION USING RBFNN
OPTICAL CHARACTER RECOGNITION USING RBFNN
 
DETECTION OF MOVING OBJECT
DETECTION OF MOVING OBJECTDETECTION OF MOVING OBJECT
DETECTION OF MOVING OBJECT
 
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENTSIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
SIMULATION OF ATMOSPHERIC POLLUTANTS DISPERSION IN AN URBAN ENVIRONMENT
 
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
PREPARATION AND EVALUATION OF WOOL KERATIN BASED CHITOSAN NANOFIBERS FOR AIR ...
 
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
ANALYSIS ON LOAD BALANCING ALGORITHMS IMPLEMENTATION ON CLOUD COMPUTING ENVIR...
 
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
A MODEL BASED APPROACH FOR IMPLEMENTING WLAN SECURITY
 

Recently uploaded

Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network DevicesChandrakantDivate1
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdfKamal Acharya
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityMorshed Ahmed Rahath
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesMayuraD1
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationBhangaleSonal
 
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxNANDHAKUMARA10
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptAfnanAhmad53
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaOmar Fathy
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesChandrakantDivate1
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxpritamlangde
 
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...gragchanchal546
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...ppkakm
 
Introduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfIntroduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfsumitt6_25730773
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.Kamal Acharya
 

Recently uploaded (20)

Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptx
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .ppt
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
 
Introduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfIntroduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdf
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 

Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data

  • 1. International Journal of Innovative Research in Advanced Engineering (IJIRAE) Volume 1 Issue 1 (March 2014) ISSN: 2278-2311 IJIRAE | http://ijirae.com © 2014, IJIRAE – All Rights Reserved Page - 1 Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data Dhanesh Kothari Department of Information Technology Sri Venkateswara College of Engineering dhaneshkanak@gmail.com S. Thavasi Narayanan Department of Information Technology Sri Venkateswara College of Engineering thavasinarayanan@gmail.com K. Kiruthika Devi Department of Information Technology Sri Venkateswara College of Engineering kiruthika@svce.ac.in Abstract – Big data are any data that you cannot load into your computer’s primary memory. Clustering is a primary task in pattern recognition and data mining. We need algorithms that scale well with the data size. The former implementation, literal Fuzzy C-Means is linear or serialized. FCM algorithm attempts to partition a finite collection of n elements into collection of c fuzzy clusters. So, given a finite set of data, this algorithm returns a list of c cluster centers. However it doesn't scale well and slows down with increase in the size of data and is thus impractical and sometimes undesirable. In this paper, we propose an extended version of fuzzy c-means clustering algorithm by means of various random sampling techniques to study which method scales well for large or very large data. Keywords - Big data, very large data, fuzzy c-means (FCM) clustering, sampling, probability I. INTRODUCTION CLUSTERING [1] is a form of data analysis in which data are separated into groups such that data with similar properties are in same group. Clustering is used in pattern recognition, compression, image processing[9] and many other fields as mentioned in references [6] and [10]. IoT or Internet of Things, personal computing has produced an incredible amount of data ranging from few terabytes to Exabyte of data. These big data are still unloadable in the modern day computers. There are many approaches applied for clustering the big data. One technique is distributed clustering, where the data is distributed to various systems and the clustering is done independently on each system. Then, the average cluster centers are evaluated based on the cluster centers from all systems. This method is however quite expensive and result data may not be accurate. There are three main types of partitions: crisp, fuzzy and possibilistic. Crisp or hard partitioning group data into mutually disjoint subsets of points and the partition element, uki = 1 if oi is labelled k, else 0. Fuzzy partitions are more flexible than crisp portioning in that each object can belong to more than one cluster group. This is particularly useful in identify the online behavior of person for advertisements. The set of all fuzzy c-partitions is 1 Each column of the fuzzy partition U must sum to 1, thus ensuring that every object has unit total membership in a partition. Many algorithms have been proposed to cluster the big data but only few of them use fuzzy partitions. One such is fuzzy c-means clustering. However, its literal schemes simply cluster the entire dataset. In contrast, extended clustering [2] schemes apply clustering algorithm to subset (or representative) of this dataset and non-iteratively extend the sample data result to the full dataset. One such algorithm is random sampled extended fuzzy c-means algorithm. In this paper, we compare three different sampling techniques to compute fuzzy partitions of the vector data: simple random sampling, systematic random sampling and multistage random sampling. Section II describes the simple random sampling based extended fuzzy c-means, and Section III proposes systematic and multistage sampling for extending fuzzy c-means algorithm. Section IV presents the results. Section V contains our conclusions and some ideas for future enhancements. II. SIMPLE RANDOM SAMPLED FUZZY C-MEANS ALGORITHM The FCM algorithm is defined as the optimization of the squared-error distortion (2) Where U is (c x n) partition matrix, V={v1,v2,…., vc} is the set of c clusters centers in Rd , m>1 is the fuzzification constant, and ||.||A is any inner product A-induced norm, i.e., ||x||A= . We use here Euclidean norm (A=I) in our example. Algorithm 1 outlines the steps for the literal FCM algorithm. We initialize the algorithm by choosing c objects randomly from the dataset as initial cluster centers. The most commonly used sampling technique to address big data is simple random sampling [4].
  • 2. International Journal of Innovative Research in Advanced Engineering (IJIRAE) Volume 1 Issue 1 (March 2014) ISSN: 2278-2311 IJIRAE | http://ijirae.com © 2014, IJIRAE – All Rights Reserved Page - 2 In this method, each record in the full set has equal probability of being selected into the sample set, i.e. 1/N, where N is the total number of records. The FCM algorithm then can be applied onto this sample data to get membership matrix and cluster centers. We can then extend the V to complete dataset. This can be called as rseFCM. rseFCM is shown in algorithm 2. The step 3 is an important step. In this we extend the result of the FCM algorithm on sample dataset to the complete dataset. It is a non-iterative step as it is run only once in the end of extended FCM algorithm. Algorithm 1: LFCM Input: X, c, m Output: U, V Initialize V while do (3) (4) Algorithm 2: rseFCM to approximately minimize Jm(U, V) Input: X, c, m Output: U, V 1 Sample ns objects from X without replacement, denoted Xs 2 Us, V = LFCM(Xs, c, m) 3 Extend the partition (Us, V) to X, , using Eq. (3), giving (U, V) III. MULTISTAGE SAMPLING AND SYSTEMATIC RANDOM SAMPLING The are other sampling methods[8] other than commonly used simple random sampling method. Here, we introduce multistage sampling and systematic random sampling techniques. ultistage sampling is a complex form of cluster sampling. It involves dividing the population into groups or strata based on the best feature[3] . With the multistage sample, there is equal chance of selecting each unit from within a particular group. In some cases, several levels of group selection may be applied before the final sample elements are reached. It should be noted that multistage and stratified sampling are different from each other as in multistage, selected clusters are studied. The multistage sampling is given in algorithm 3. Algorithm 3: Multistage sampling based extended FCM algorithm Input: X, c, m Output: Xs, U, V 1 Organize the sampling process into stages where the unit of analysis is systematically grouped. 2 Select a sampling technique for each stage. 3 Systematically apply the sampling technique to each stage until the unit of analysis has been selected. 4 Execute rseFCM steps 2 and 3. Systematic random sampling is a type of probability sampling technique. With the systematic random sample, there is an equal chance of selecting each unit from within the population when creating the sample. Here, rather than selecting referring to random number tables to select the cases (as in simple random sampling) in your sample, you select the units directly from the sample frame. Systematic sampling algorithm is given in Algorithm 4.
  • 3. International Journal of Innovative Research in Advanced Engineering (IJIRAE) Volume 1 Issue 1 (March 2014) ISSN: 2278-2311 IJIRAE | http://ijirae.com © 2014, IJIRAE – All Rights Reserved Page - 3 Algorithm 4: Systematic Random Sampling based extended FCM algorithm Input: X, c, m Output: Xs, U, V 1 Calculate the sampling interval, k where, N is the population size and n is the sample size 2 Select a random start between 1 and sampling interval 3 Repeatedly add sampling interval to select subsequent record. 4 Execute rseFCM steps 2 and 3. IV. EXPERIMENTS We performed one set of experiment. In the experiment we compare the performance of the big data FCM algorithm on the real data or data with ground truth (Collected data) as well as on synthetic dataset. For FCM, we initialize V by randomly choosing c objects as the initial cluster centers. We fix the value of ϵ = 10-3 and fuzzifier m= 1.7. All code were written in JAVA eclipse environment. A. Evaluation Criter a We judge the performance of the VL FCM algorithm using run-time criteria. It is computed for 20 independent runs. We present statistical comparisons of the algorithms’ performance over the 20 experiments. Speed-up or Run-Time: This criterion represents an actual run-time comparison. When the LFCM is available, speedup is defined as tfull/tsample, where these values are in milliseconds to compute the cluster centers V for data and the membership matrix, U. B. Performance on loadable data We compared the performance of VL FCM algorithm on the following datasets[7] . BIRCH-Sets (n=100000, c= 12, d=2): These data are composed of 100000 2-d vectors, with a visual preferred grouping into 15 clusters. See table I for result. Forest-Data (n=581012, c=7, d=54): These data are composed of cartographic variables that are obtained from United States Geological Survey and United State Forest Service. There are 10 quantifiable variables, and 40 binary soil types, and one cover type. See table II for result. We take 10% as sample size and run-time results (seconds) of various sampling technique based FCM algorithm is shown in Fig.1. It is clear from the results that systematic sampling shows far better scalability than simple random sampling. Multistage sampling can be applied only on datasets with more than two variables. It has a run-time almost near to simple random sampling. C. Performance on unloadable data For, our last experiment, we demonstrate the VL vector data FCM algorithm on an unloadable dataset. The performance of the VL algorithms can be measured by how well they find the clusters and measuring the run-time in milliseconds. The SUSY (Super Symmetry) data are composed of 5 million objects. In order to show how the VL algorithms could be used to process this data in PC we tested at sample size 0.01 % and 0.001% of data. Table III shows the results of this experiment. Unexpectedly, multistage sampling was faster than systematic sampling. Simple random sampling was a very expensive method among the three sampling techniques tested on the data inputs. Fig. 1 Performance bar chart of the various sampling techniques on Birch-Sets and Forest Data
  • 4. International Journal of Innovative Research in Advanced Engineering (IJIRAE) Volume 1 Issue 1 (March 2014) ISSN: 2278-2311 IJIRAE | http://ijirae.com © 2014, IJIRAE – All Rights Reserved Page - 4 TABLE I Results of VL FCM Algorithm on Birch Data (Avg. Of 20 Runs) 10% sample size Simple Random Sampling Systematic Random Sampling Literal FCM Run-Time (seconds) No. of Iterations Run-Time (seconds) No. of Iterations Run-Time (seconds) No. of Iterations 33.287 63 29.475 57 442.135 83 TABLE II Results of VL FCM Algorithm on Forest Data (Avg. Of 20 Runs) 10% sample size Simple Random Sampling Systematic Random Sampling Multistage Sampling Literal FCM Run-Time (seconds) No. of Iterations Run-Time (seconds) No. of Iterations Run-Time (seconds) No. of Iterations Run-Time (seconds) No. of Iterations 170.269 76 90.822 49 163.169 68 528.644 39 TABLE III Results of VL Data Algorithm on SUSY Unloadable Data Sample Size 0.001% 0.01% Sampling Technique Run-time (Sec) No. Of Iterations Run-time (Sec) No. of Iterations Multistage Sampling 37.050 22 43.526 21 Simple Random Sampling 460.043 43 2086.614 99 Systematic Random Sampling 64.015 36 372.497 76 V. DISCUSSION AND CONCLUSIONS As this paper shows, there are many ways to sample the big data for VL FCM algorithm. Systematic random sampling had the best performance on large but loadable dataset. Multistage was better in case of very large or unloadable dataset. Systematic and multistage show better scalability compared to traditionally used simple random sampling for extending fuzzy c-means algorithm. In the future, we will continue to develop and investigate scalable solutions for VL fuzzy clustering. REFERENCES [1] Timothy C. Havens, James C. Bezdek, Christopher Leckie, Lawrence O. Hall, and Marimuthu Palaniswami, "Fuzzy c-Means Algorithms for Very Large Data," in IEEE Transactions On Fuzzy Systems, Vol. 20, No. 6, December 2012. [2] Richard J. Hathway and James C. Bezdek, “Extending fuzzy and probabilistic clustering to very large data sets”, Computational Statistics and Data Analysis, vol. 51, 2006 [3] V. Schwammle and O. Jensen, “A simple and fast method to determine the parameters for fuzzy c-means cluster analysis,” Bioinformatics, vol. 26, no. 22, pp. 2841–2848, 2010. [4] John F. Kolen and Tim Hutcheson, “Reducing the Time Complexity of the Fuzzy c-means Algorithm”, IEEE Transactions On Fuzzy Systems, Vol. 10, No. 2, April 2002. [5] James C. Bezdek, Robert L. Cannon, Jitendra V. Dave, “Efficient Implementation of the fuzzy c-Means Clustering Algorithms”, Vol. PAMI-8, No. 2, March,1986. [6] D. Dembele and P. Kastner, “Fuzzy c-means method for clustering microarray data,” Bioinformatics, vol. 19, pp. 973–980, 2003. [7] http://archive.ics.uci.edu/ml/datasets/ [8] http://dissertation.laerd.com/sampling-strategy.php [9] N. Pal and J. Bezdek, “Complexity reduction for “large image” processing,” IEEE Trans. Syst., Man, Cybern., vol. 32, no. 5, pp. 598–611, Oct.2002. [10]J. Bezdek and R. Hathaway, “Convergence of alternating optmization,” Nueral, Parallel, Sci. Comput., vol. 11, no. 4, pp. 351–368, Dec. 2003.