SlideShare a Scribd company logo
1 of 7
Download to read offline
TELKOMNIKA, Vol.16, No.3, June 2018, pp. 1351~1357
ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013
DOI: 10.12928/TELKOMNIKA.v16i3.8910  1351
Received September 18, 2017; Revised February 8, 2018; Accepted April 5, 2018
K Means Clustering and Meanshift Analysis for
Grouping the Data of Coal Term in Puslitbang tekMIRA
Rolly Maulana Awangga*
1
, Syafrial Fachri Pane
2
, Khaera Tunnisa
3
,
Iping Supriana Suwardi
4
1,2,3
Program in Informatics Engineering, Politeknik Pos Indonesia, Indonesia
4
School of Electrical Engineering and Informatics, Bandung Institute of Technology, Indonesia
*Corresponding author, e-mail: awangga@poltekpos.ac.id
Abstract
Indonesian government agencies under the Ministry of Energy and Mineral Resources have
problems in classifying data dictionary of coal. This research conduct grouping coal dictionary using K-
Means and MeanShift algorithm. K-means algorithm is used to get cluster value on character and word
criteria. The last iteration of Euclidian distance calculation data on k-means combine with Meanshift
algorithm. The meanshift calculates centroid by selecting different bandwidths. The result of grouping
using k-means and meanshift algorithm shows different centroid to find optimum bandwidth value. The
data dictionary of this research has sorted in alphabetically.
Keywords:coal dictionary,clustering, K-means,meanshift
Copyright © 2018 Universitas Ahmad Dahlan. All rights reserved.
1. Introduction
Data Mining is a process of extracting data or filtering data by utilising a collection of a
big data. The data through a series of operations to obtain valuable information from the data.
According to Daryl Pregibon, the composition of artificial statistics, artificial intelligence, and
database research called data mining [1]. Data mining techniques used to explore the data
using classification, prediction, grouping, outlier detection, association rules, sequence analysis,
time series analysis and text mining, and some new techniques such as social networking
analysis and sentiment analysis [2]. The center of research and development mineral and coal
technology (Puslitbang tekMIRA) is an Indonesia government institution under the Ministry of
Energy and Mineral Resources. Puslitbang tekMIRA has a dictionary of coal. This coal
dictionary has one thousand coal terms. It takes time between three to five minutes to find a
term inside a coal dictionary.
Based on these problems, data mining method is used to classify data dictionary. This
research conduct grouping data dictionary of a coal term using data mining method. K-Means
and Meanshift Algorithm were chosen in this research. K-Means algorithm was used to
categories students with skills such as cognitive, communication and relational[3], to evaluate
student achievement levels for course content [4], and to group data based on user information
created on SNS and recommend to users in the future [5]. The grouping data based on user
sentences by utilising regularity among data pursued by the user using K-Means Algorithm [6].
The genetic algorithm and K-Means are used to calculate clustered centroid with heterogeneous
populations that lead to better results than using random numbers [7]. The Meanshift algorithm
is used to accurately detect the location of target tracking [8] by using this method and then
make it easy the accuracy of the calculation of the tracking results [9]. The Meanshift algorithm
is also used to solve facial-detection and tracking-based systems [10], if an imbalance occurs, it
will affect the performance of [11].
In this research the algorithm grouping a coal term in a dictionary based on character
and word in a cluster. The cluster serve data dictionary of coal based on predetermined criteria.
The result data of clustering using K-Means and Meanshift algorithm is shown using the
matloplib plot. Matplotlib is a Python package for Plotting that produces quality production
graphs. Matplotlib is designed to be able to create simple and complex plots with multiple
commands [12].
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357
1352
2. Research Method
2.1. K-Means
K-Means Clustering is a grouping of data, where the data in K Means Clustering K is
the amount of data or the number of constants. Means is the average value of the data set as
Cluster [7]. So K-Means is a method to analyse data or called data mining method where this
data modelling method without using supervise or unsupervised method and K-Means is a
method to classify data by using partition system. The K-Means method solves large amounts of
data in groups, and the data has the same characteristics as other data. Also, the team feature
also has a feature [13]. Similitude is some matrix used for similarities between instances in a
cluster. K-Means is an algorithm used to generate k clusters from a collection of data sets in a
simple way [14]. Algorithm 1 is an explanation of k-means implementation. Algorithm 1: k-
means clustering: Randomly select k cluster centers c1, ... ck, Repeat, Set each data entity to
the closest cluster center ci, Change the cluster center with the average cluster i, until the
cluster center does not change[15]. K - Means algorithm formula d = distance, j = amount of
data, c = centroid, x = data, c = centroid 1 . The Euclidean distance formula is described in
Equation 2.
( ) √∑
√
D (i, j) is the distance of data between i and center cluster j. Xki is the data to i on
attribute data to k. Xkj is the center point to j at attribute to k.
Recalculate cluster center with current cluster membership. The cluster center is the
average of all data/objects in a particular cluster. If desired it can also use the median of the
cluster. So mean (mean) is not the only size that can use.Reassign each object using the new
cluster center. If the cluster center does not change again, then the clustering process is
complete. Alternatively, return to step number 3 until the center of the cluster does not change
anymore [16].
2.2. Meanshift
The average shift method is a method for determining the maximum function limit of
density with separate data from a function. The average shift use as a media grouping method,
in which each mode represents each group [17]-18]. The average shift method classifies data in
its search mode that directs and moves data to the point region along with the iteration of the
data environment built with the Gaussian Kernel [19]. Starting from one data point and iteratively
will improve the approximate mode [20]. The Gaussian kernel is a differentiated multivariate
kernel function used for actual calculations in assumptions [21]. Bandwidth is a free parameter
that shows the effect on the estimated density generated.
3. Results and Analysis
Grouping of data of coal dictionary using K-Means and Meanshift Algorithm. The
grouping results are displayed using matplotlib plot. The sample data taken are vowel data (A, I,
U, E, O) for the coal dictionary. Calculation data using K-Means algorithm. The result data of the
last iteration, i.e. data that has been stable and without changes.
3.1. Result K-Means
3.1.1. The letter A
The data of the last iteration calculation using euclidian distance k- means for letter A.
Clustering data using character and word criteria and use the centroid value of the highest and
lowest values on the criteria. In table 1 is a cluster value that has a fixed and unchanged cluster.
TELKOMNIKA ISSN: 1693-6930 
K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga)
1353
Table 1. Data cluster of the letter A
3.1.2. The letter I
The data of the last iteration calculation using euclidian distance k- means for letter I.
Clustering data using character and word criteria and use the centroid value of the highest and
lowest values on the criteria. In Table 2 is a cluster value that has a fixed and unchanged
cluster.
Table 2. Data cluster of the letter I
3.1.3. The letter U
The data of the last iteration calculation using euclidian distance k- means for letter U.
Clustering data using character and word criteria and use the centroid value of the highest and
lowest values on the criteria. In Table 3 is a cluster value that has a fixed and unchanged
cluster.
Table 3. Data cluster of the letter U
Character Word
0 6.297319
24.02082 17.96443
24.02082 17.96443
....... .......
29.01724 22.93403
23.02173 16.97265
3.1.4. The letter E
The data of the last iteration calculation using euclidian distance k- means for letter E.
Clustering data using character and word criteria and use the centroid value of the highest and
Character Word
19.04536 3.365126
12.0041 3.78689
7.033561 8.84361
17.00289 1.21311
....... .......
8.006149 7.78689
11.00447 4.78689
27.20259 11.60749
Character Word
56.4358 1.234823
62.39391 4.774678
37.21559 20.42897
43.18565 14.5126
....... .......
59.4138 1.777967
67.47592 9.838576
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357
1354
lowest values on the criteria. In Table 4 is a cluster value that has a fixed and unchanged
cluster.
Table 4. Data Cluster of the Letter E
3.1.5. The letter O
The data of the last iteration calculation using euclidian distance k- means for letter O.
Clustering data using character and word criteria and use the centroid value of the highest and
lowest values on the criteria. In Table 5 is a cluster value that has a fixed and unchanged
cluster.
Table 5. Data Cluster of the Letter O
Character Word
12.33333 6.342096
3.33333 2.687423
....... .......
2.33333 3.726783
14.66667 20.73377
3.2. Meanshift
3.2.1. The letter A
In Figure 1 is the plot result for the data of the letter A. The data used is the term data of
coal letter A. The value used by the data of the term A to get bandwidth one on the plot
meanshift is at point 6.76.
Figure 1. Plot Mean Shift letter A
Character Word
13.15295 1.009718
14.14214 1.973457
14.14214 1.973457
....... .......
7.071068 5.100934
10 2.625744
TELKOMNIKA ISSN: 1693-6930 
K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga)
1355
3.2.2. The letter I
In Figure 2 is the plot result for the data of the letter I. The data used is the term data of
coal letter I. The value used by the data of the term I to get bandwidth one on the plot meanshift
is at point 18.95.
Figure 2. Plot MeanShift letter I
3.2.3. The letter U
In Figure 3 is the plot result for the data of the letter U. The data used is the term data of
coal letter U. The value used by the data of the term I to get bandwidth one on the plot
meanshift is at point 5.75.
Figure 3. Plot MeanShift letter U
3.2.4. The letter E
In Figure 4 is the plot result for the data of the letter E. The data used is the term data
of coal letter E. The value used by the data of the term I to get bandwidth one on the plot
meanshift is at point 3.36.
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357
1356
Figure 4. Plot MeanShift letter E
3.2.5. The letter O
In Figure 5 is the plot result for the data of the letter O. The data used is the term data
of coal letter O. The value used by the data of the term I to get bandwidth one on the plot
meanshift is at point 7.
Figure 5. Plot MeanShift letter O
4. Conclusion
Grouping a coal dictionary using the k-means algorithm produces a cluster value on
character and word criteria. The last iteration of the Euclidian distance calculation results in
different cluster values in each alphabet. The centroid value of the k-means calculation is then
combined with the MeanShift algorithm. Centroid calculation results of the meanshift algorithm
result in different bandwidth. Letter A produces a bandwidth of 6.76, for the letter I produce
bandwidth 18.95, letter U produces bandwidth 5.75, and letter E produces bandwidth 3.36. The
grouping of coal dictionaries in this study facilitates for a group of terms in the letter.
Suggestions for future research, can classify the data dictionary of coal for the second, third and
subsequent letters, and combine other algorithms in grouping such as K-Nearest Neighbour
(KNN) algorithm.
TELKOMNIKA ISSN: 1693-6930 
K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga)
1357
References
[1] F Gorunescu,Data Mining: Concepts, models and techniques. Springer Science & Business Media,
2011; 12.
[2] MI Ramadhan etal. An analysis of natural disaster data by using k-means and k-medoids algorithm
of data mining techniques. in Quality in Research (QiR): International Symposium on Electrical and
Computer Engineering, 2017 15th International Conference on. IEEE. 2017: 221–225.
[3] J Yi, S Li, M Wu, HA Yeung, WW Fok, Y Wang, F Liu. Cloud-based educational big data application
of apriori algorithm and k-means clustering algorithm based on students’information. in Big Data and
Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on. IEEE. 2014: 151–158.
[4] L Guoli, W Tingting, Y Limei, L Yanping, G Jinqiao. The improved research on k-means clustering
algorithm in initial values. in Mechatronic Sciences, Electric Engineering and Computer (MEC),
Proceedings 2013 International Conference on. IEEE. 2013; 2124–2127.
[5] SH Jung, JC Kim,CB Sim.Prediction data processing scheme using an artificial neural network and
data clustering for big data. International Journal of Electrical and Computer Engineering (IJECE).
2016; 6(1): 330-336.
[6] A Bedboudi, C Bouras, MT Kimour. An heterogeneous population-based genetic algorithm for data
clustering.Indonesian Journal ofElectrical Engineering and Informatics (IJEEI).2017;5(3): 275–284.
[7] Q Shi, L Xu, Z Shi, Y Chen, Y Shao. Analysis and research of the campus network user’s behavior
based on k-means clustering algorithm. in Digital Manufacturing and Automation (ICDMA), 2013
Fourth International Conference on. IEEE, 2013: 196–201.
[8] Y Cheng,W Hongyu, W Xiaohong.Positioning method research for unmanned aerial vehicles based
on meanshift tracking algorithm. in Control and Decision Conference (CCDC), 2017 29th Chinese.
IEEE 2017: 989–994.
[9] RM Awangga, NS Fathonah, TI Hasanudin. Colenak: GPS tracking model for post-stroke
rehabilitation program using AES-CBC URL encryption and QR-Code. in 2017 2nd International
conferences on Information Technology,Information Systems and Electrical Engineering (ICITISEE).
IEEE. Nov 2017: 255–260. [Online]. Available: http://ieeexplore.ieee.org/document/8285506/
[10] A Salhi, Y Moresly, F Ghozzi, A Fakhfakh. Face detection and tracking system with block-matching,
meanshift and camshift algorithms and kalman filter. in Sciences and Techniques of Automatic
Control and Computer Engineering (STA), 2017 18th International Conference on. IEEE. 2017: 139–
145.
[11] IN Yulita, S Purwani, R Rosadi, RM Awangga. A quantization of deep belief networks for long short-
term memory in sleep stage detection. in Advanced Informatics, Concepts, Theory, and Applications
(ICAICTA), 2017 International Conference on. IEEE, 2017: 1-5.
[12] N Ari, M Ustazhanov. Matplotlib in python. in 2014 11th International Conference on Electronics,
Computer and Computation (ICECCO). IEEE, sep 2014. [Online]. Available:
https://doi.org/10.1109/icecco.2014.6997585
[13] J MacQueen et al. Some methods for classification and analysis of multivariate observations in
Proceedings ofthe fifth Berkeley symposium on mathematical statistics and probability. Oakland, CA,
USA. 1967; 1(14): 281–297.
[14] AK Jain. Data clustering:50 years beyond k-means.Pattern recognition letters.2010;31(8): 651-666.
[15] Z Gheid, Y Challal. Efficient and privacy-preserving k-means clustering for big data mining. in
Trustcom/BigDataSE/I SPA, IEEE. 2016: 791–798.
[16] K Obbie. Penerapan algoritma klasifikasi data mining id3 untuk menentukan penjurusan siswasman6
semarang. Skripsi, Fakultas Ilmu Komputer, 2014.
[17] Y Liu, SZ Li, W Wu, R Huang. Dynamics of a mean-shift-like algorithm and its applications on
clustering. Information Processing Letters. 2013; 113(1): 8–16.
[18] G Chen,Q Chen,D Zhang. Mean shift: A method for measurementmatrix ofcompressive sensing.in
Digital Home (ICDH), 2014 5th International Conference on. IEEE. 2014: 64–69.
[19] YA Ghassabeh.A sufficientcondition for the convergence of the mean shift algorithm with gaussian
kernel. Journal of Multivariate Analysis. 2015; 135: 1–10.
[20] K Fukunaga, L Hostetler. The estimation of the gradient of a density function, with applications in
pattern recognition. IEEE Transactions on information theory. 1975; 21(1): 32–40.
[21] C Yang, R Duraiswami, NA Gumerov, L Davis. Improved fast gauss transform and efficient kernel
density estimation. in null. IEEE. 2003: 464.

More Related Content

What's hot

Scalable and efficient cluster based framework for
Scalable and efficient cluster based framework forScalable and efficient cluster based framework for
Scalable and efficient cluster based framework foreSAT Publishing House
 
The step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load dataThe step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load dataTELKOMNIKA JOURNAL
 
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type Data
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type  Data MK-Prototypes: A Novel Algorithm for Clustering Mixed Type  Data
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type Data IJMER
 
Mine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means ClusteringMine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means Clusteringijcsity
 
A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce IJECEIAES
 
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...journalBEEI
 
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...cscpconf
 
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...IJECEIAES
 
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...Scientific Review
 
Textural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image ClassificationTextural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image ClassificationCSCJournals
 
Imagethresholding
ImagethresholdingImagethresholding
Imagethresholdingananta200
 
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...ijmnct
 
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...ijmnct
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Alexander Decker
 
The improved k means with particle swarm optimization
The improved k means with particle swarm optimizationThe improved k means with particle swarm optimization
The improved k means with particle swarm optimizationAlexander Decker
 

What's hot (20)

Scalable and efficient cluster based framework for
Scalable and efficient cluster based framework forScalable and efficient cluster based framework for
Scalable and efficient cluster based framework for
 
The step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load dataThe step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load data
 
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type Data
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type  Data MK-Prototypes: A Novel Algorithm for Clustering Mixed Type  Data
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type Data
 
Mine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means ClusteringMine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means Clustering
 
A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce
 
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
 
Cg33504508
Cg33504508Cg33504508
Cg33504508
 
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
 
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
 
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
 
Textural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image ClassificationTextural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image Classification
 
Imagethresholding
ImagethresholdingImagethresholding
Imagethresholding
 
Ip3514921495
Ip3514921495Ip3514921495
Ip3514921495
 
A046010107
A046010107A046010107
A046010107
 
MCCS
MCCSMCCS
MCCS
 
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
 
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
 
Big Data Clustering Model based on Fuzzy Gaussian
Big Data Clustering Model based on Fuzzy GaussianBig Data Clustering Model based on Fuzzy Gaussian
Big Data Clustering Model based on Fuzzy Gaussian
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)
 
The improved k means with particle swarm optimization
The improved k means with particle swarm optimizationThe improved k means with particle swarm optimization
The improved k means with particle swarm optimization
 

Similar to K Means Clustering and Meanshift Analysis for Grouping the Data of Coal Term in Puslitbang tekMIRA

An improvement in k mean clustering algorithm using better time and accuracy
An improvement in k mean clustering algorithm using better time and accuracyAn improvement in k mean clustering algorithm using better time and accuracy
An improvement in k mean clustering algorithm using better time and accuracyijpla
 
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...IJECEIAES
 
Max stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problemMax stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problemnooriasukmaningtyas
 
Premeditated Initial Points for K-Means Clustering
Premeditated Initial Points for K-Means ClusteringPremeditated Initial Points for K-Means Clustering
Premeditated Initial Points for K-Means ClusteringIJCSIS Research Publications
 
K-means Clustering Method for the Analysis of Log Data
K-means Clustering Method for the Analysis of Log DataK-means Clustering Method for the Analysis of Log Data
K-means Clustering Method for the Analysis of Log Dataidescitation
 
crime rate pridicition using k-means.pdf
crime rate pridicition using k-means.pdfcrime rate pridicition using k-means.pdf
crime rate pridicition using k-means.pdfsaiKrishnaReddy558028
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsIJDKP
 
Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2IAEME Publication
 
Increasing electrical grid stability classification performance using ensemb...
Increasing electrical grid stability classification performance  using ensemb...Increasing electrical grid stability classification performance  using ensemb...
Increasing electrical grid stability classification performance using ensemb...IJECEIAES
 
84cc04ff77007e457df6aa2b814d2346bf1b
84cc04ff77007e457df6aa2b814d2346bf1b84cc04ff77007e457df6aa2b814d2346bf1b
84cc04ff77007e457df6aa2b814d2346bf1bPRAWEEN KUMAR
 
A Preference Model on Adaptive Affinity Propagation
A Preference Model on Adaptive Affinity PropagationA Preference Model on Adaptive Affinity Propagation
A Preference Model on Adaptive Affinity PropagationIJECEIAES
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETScsandit
 
Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks IJECEIAES
 
Comparision of methods for combination of multiple classifiers that predict b...
Comparision of methods for combination of multiple classifiers that predict b...Comparision of methods for combination of multiple classifiers that predict b...
Comparision of methods for combination of multiple classifiers that predict b...IJERA Editor
 
Enhancement of student performance prediction using modified K-nearest neighbor
Enhancement of student performance prediction using modified K-nearest neighborEnhancement of student performance prediction using modified K-nearest neighbor
Enhancement of student performance prediction using modified K-nearest neighborTELKOMNIKA JOURNAL
 
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval IJECEIAES
 
A Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data MiningA Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data MiningNatasha Grant
 
Optimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering AlgorithmOptimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering AlgorithmIJERA Editor
 

Similar to K Means Clustering and Meanshift Analysis for Grouping the Data of Coal Term in Puslitbang tekMIRA (20)

K means report
K means reportK means report
K means report
 
An improvement in k mean clustering algorithm using better time and accuracy
An improvement in k mean clustering algorithm using better time and accuracyAn improvement in k mean clustering algorithm using better time and accuracy
An improvement in k mean clustering algorithm using better time and accuracy
 
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
 
Max stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problemMax stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problem
 
Premeditated Initial Points for K-Means Clustering
Premeditated Initial Points for K-Means ClusteringPremeditated Initial Points for K-Means Clustering
Premeditated Initial Points for K-Means Clustering
 
K-means Clustering Method for the Analysis of Log Data
K-means Clustering Method for the Analysis of Log DataK-means Clustering Method for the Analysis of Log Data
K-means Clustering Method for the Analysis of Log Data
 
crime rate pridicition using k-means.pdf
crime rate pridicition using k-means.pdfcrime rate pridicition using k-means.pdf
crime rate pridicition using k-means.pdf
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithms
 
Af4201214217
Af4201214217Af4201214217
Af4201214217
 
Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2
 
Increasing electrical grid stability classification performance using ensemb...
Increasing electrical grid stability classification performance  using ensemb...Increasing electrical grid stability classification performance  using ensemb...
Increasing electrical grid stability classification performance using ensemb...
 
84cc04ff77007e457df6aa2b814d2346bf1b
84cc04ff77007e457df6aa2b814d2346bf1b84cc04ff77007e457df6aa2b814d2346bf1b
84cc04ff77007e457df6aa2b814d2346bf1b
 
A Preference Model on Adaptive Affinity Propagation
A Preference Model on Adaptive Affinity PropagationA Preference Model on Adaptive Affinity Propagation
A Preference Model on Adaptive Affinity Propagation
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
 
Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks
 
Comparision of methods for combination of multiple classifiers that predict b...
Comparision of methods for combination of multiple classifiers that predict b...Comparision of methods for combination of multiple classifiers that predict b...
Comparision of methods for combination of multiple classifiers that predict b...
 
Enhancement of student performance prediction using modified K-nearest neighbor
Enhancement of student performance prediction using modified K-nearest neighborEnhancement of student performance prediction using modified K-nearest neighbor
Enhancement of student performance prediction using modified K-nearest neighbor
 
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
 
A Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data MiningA Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data Mining
 
Optimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering AlgorithmOptimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering Algorithm
 

More from TELKOMNIKA JOURNAL

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...TELKOMNIKA JOURNAL
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...TELKOMNIKA JOURNAL
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...TELKOMNIKA JOURNAL
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...TELKOMNIKA JOURNAL
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...TELKOMNIKA JOURNAL
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaTELKOMNIKA JOURNAL
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireTELKOMNIKA JOURNAL
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkTELKOMNIKA JOURNAL
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsTELKOMNIKA JOURNAL
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...TELKOMNIKA JOURNAL
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesTELKOMNIKA JOURNAL
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...TELKOMNIKA JOURNAL
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemTELKOMNIKA JOURNAL
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...TELKOMNIKA JOURNAL
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorTELKOMNIKA JOURNAL
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...TELKOMNIKA JOURNAL
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...TELKOMNIKA JOURNAL
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksTELKOMNIKA JOURNAL
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingTELKOMNIKA JOURNAL
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...TELKOMNIKA JOURNAL
 

More from TELKOMNIKA JOURNAL (20)

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antenna
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fire
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio network
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bands
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertainties
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensor
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networks
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imaging
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
 

Recently uploaded

University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLManishPatel169454
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01KreezheaRecto
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesPrabhanshu Chaturvedi
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 

Recently uploaded (20)

University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 

K Means Clustering and Meanshift Analysis for Grouping the Data of Coal Term in Puslitbang tekMIRA

  • 1. TELKOMNIKA, Vol.16, No.3, June 2018, pp. 1351~1357 ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013 DOI: 10.12928/TELKOMNIKA.v16i3.8910  1351 Received September 18, 2017; Revised February 8, 2018; Accepted April 5, 2018 K Means Clustering and Meanshift Analysis for Grouping the Data of Coal Term in Puslitbang tekMIRA Rolly Maulana Awangga* 1 , Syafrial Fachri Pane 2 , Khaera Tunnisa 3 , Iping Supriana Suwardi 4 1,2,3 Program in Informatics Engineering, Politeknik Pos Indonesia, Indonesia 4 School of Electrical Engineering and Informatics, Bandung Institute of Technology, Indonesia *Corresponding author, e-mail: awangga@poltekpos.ac.id Abstract Indonesian government agencies under the Ministry of Energy and Mineral Resources have problems in classifying data dictionary of coal. This research conduct grouping coal dictionary using K- Means and MeanShift algorithm. K-means algorithm is used to get cluster value on character and word criteria. The last iteration of Euclidian distance calculation data on k-means combine with Meanshift algorithm. The meanshift calculates centroid by selecting different bandwidths. The result of grouping using k-means and meanshift algorithm shows different centroid to find optimum bandwidth value. The data dictionary of this research has sorted in alphabetically. Keywords:coal dictionary,clustering, K-means,meanshift Copyright © 2018 Universitas Ahmad Dahlan. All rights reserved. 1. Introduction Data Mining is a process of extracting data or filtering data by utilising a collection of a big data. The data through a series of operations to obtain valuable information from the data. According to Daryl Pregibon, the composition of artificial statistics, artificial intelligence, and database research called data mining [1]. Data mining techniques used to explore the data using classification, prediction, grouping, outlier detection, association rules, sequence analysis, time series analysis and text mining, and some new techniques such as social networking analysis and sentiment analysis [2]. The center of research and development mineral and coal technology (Puslitbang tekMIRA) is an Indonesia government institution under the Ministry of Energy and Mineral Resources. Puslitbang tekMIRA has a dictionary of coal. This coal dictionary has one thousand coal terms. It takes time between three to five minutes to find a term inside a coal dictionary. Based on these problems, data mining method is used to classify data dictionary. This research conduct grouping data dictionary of a coal term using data mining method. K-Means and Meanshift Algorithm were chosen in this research. K-Means algorithm was used to categories students with skills such as cognitive, communication and relational[3], to evaluate student achievement levels for course content [4], and to group data based on user information created on SNS and recommend to users in the future [5]. The grouping data based on user sentences by utilising regularity among data pursued by the user using K-Means Algorithm [6]. The genetic algorithm and K-Means are used to calculate clustered centroid with heterogeneous populations that lead to better results than using random numbers [7]. The Meanshift algorithm is used to accurately detect the location of target tracking [8] by using this method and then make it easy the accuracy of the calculation of the tracking results [9]. The Meanshift algorithm is also used to solve facial-detection and tracking-based systems [10], if an imbalance occurs, it will affect the performance of [11]. In this research the algorithm grouping a coal term in a dictionary based on character and word in a cluster. The cluster serve data dictionary of coal based on predetermined criteria. The result data of clustering using K-Means and Meanshift algorithm is shown using the matloplib plot. Matplotlib is a Python package for Plotting that produces quality production graphs. Matplotlib is designed to be able to create simple and complex plots with multiple commands [12].
  • 2.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357 1352 2. Research Method 2.1. K-Means K-Means Clustering is a grouping of data, where the data in K Means Clustering K is the amount of data or the number of constants. Means is the average value of the data set as Cluster [7]. So K-Means is a method to analyse data or called data mining method where this data modelling method without using supervise or unsupervised method and K-Means is a method to classify data by using partition system. The K-Means method solves large amounts of data in groups, and the data has the same characteristics as other data. Also, the team feature also has a feature [13]. Similitude is some matrix used for similarities between instances in a cluster. K-Means is an algorithm used to generate k clusters from a collection of data sets in a simple way [14]. Algorithm 1 is an explanation of k-means implementation. Algorithm 1: k- means clustering: Randomly select k cluster centers c1, ... ck, Repeat, Set each data entity to the closest cluster center ci, Change the cluster center with the average cluster i, until the cluster center does not change[15]. K - Means algorithm formula d = distance, j = amount of data, c = centroid, x = data, c = centroid 1 . The Euclidean distance formula is described in Equation 2. ( ) √∑ √ D (i, j) is the distance of data between i and center cluster j. Xki is the data to i on attribute data to k. Xkj is the center point to j at attribute to k. Recalculate cluster center with current cluster membership. The cluster center is the average of all data/objects in a particular cluster. If desired it can also use the median of the cluster. So mean (mean) is not the only size that can use.Reassign each object using the new cluster center. If the cluster center does not change again, then the clustering process is complete. Alternatively, return to step number 3 until the center of the cluster does not change anymore [16]. 2.2. Meanshift The average shift method is a method for determining the maximum function limit of density with separate data from a function. The average shift use as a media grouping method, in which each mode represents each group [17]-18]. The average shift method classifies data in its search mode that directs and moves data to the point region along with the iteration of the data environment built with the Gaussian Kernel [19]. Starting from one data point and iteratively will improve the approximate mode [20]. The Gaussian kernel is a differentiated multivariate kernel function used for actual calculations in assumptions [21]. Bandwidth is a free parameter that shows the effect on the estimated density generated. 3. Results and Analysis Grouping of data of coal dictionary using K-Means and Meanshift Algorithm. The grouping results are displayed using matplotlib plot. The sample data taken are vowel data (A, I, U, E, O) for the coal dictionary. Calculation data using K-Means algorithm. The result data of the last iteration, i.e. data that has been stable and without changes. 3.1. Result K-Means 3.1.1. The letter A The data of the last iteration calculation using euclidian distance k- means for letter A. Clustering data using character and word criteria and use the centroid value of the highest and lowest values on the criteria. In table 1 is a cluster value that has a fixed and unchanged cluster.
  • 3. TELKOMNIKA ISSN: 1693-6930  K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga) 1353 Table 1. Data cluster of the letter A 3.1.2. The letter I The data of the last iteration calculation using euclidian distance k- means for letter I. Clustering data using character and word criteria and use the centroid value of the highest and lowest values on the criteria. In Table 2 is a cluster value that has a fixed and unchanged cluster. Table 2. Data cluster of the letter I 3.1.3. The letter U The data of the last iteration calculation using euclidian distance k- means for letter U. Clustering data using character and word criteria and use the centroid value of the highest and lowest values on the criteria. In Table 3 is a cluster value that has a fixed and unchanged cluster. Table 3. Data cluster of the letter U Character Word 0 6.297319 24.02082 17.96443 24.02082 17.96443 ....... ....... 29.01724 22.93403 23.02173 16.97265 3.1.4. The letter E The data of the last iteration calculation using euclidian distance k- means for letter E. Clustering data using character and word criteria and use the centroid value of the highest and Character Word 19.04536 3.365126 12.0041 3.78689 7.033561 8.84361 17.00289 1.21311 ....... ....... 8.006149 7.78689 11.00447 4.78689 27.20259 11.60749 Character Word 56.4358 1.234823 62.39391 4.774678 37.21559 20.42897 43.18565 14.5126 ....... ....... 59.4138 1.777967 67.47592 9.838576
  • 4.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357 1354 lowest values on the criteria. In Table 4 is a cluster value that has a fixed and unchanged cluster. Table 4. Data Cluster of the Letter E 3.1.5. The letter O The data of the last iteration calculation using euclidian distance k- means for letter O. Clustering data using character and word criteria and use the centroid value of the highest and lowest values on the criteria. In Table 5 is a cluster value that has a fixed and unchanged cluster. Table 5. Data Cluster of the Letter O Character Word 12.33333 6.342096 3.33333 2.687423 ....... ....... 2.33333 3.726783 14.66667 20.73377 3.2. Meanshift 3.2.1. The letter A In Figure 1 is the plot result for the data of the letter A. The data used is the term data of coal letter A. The value used by the data of the term A to get bandwidth one on the plot meanshift is at point 6.76. Figure 1. Plot Mean Shift letter A Character Word 13.15295 1.009718 14.14214 1.973457 14.14214 1.973457 ....... ....... 7.071068 5.100934 10 2.625744
  • 5. TELKOMNIKA ISSN: 1693-6930  K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga) 1355 3.2.2. The letter I In Figure 2 is the plot result for the data of the letter I. The data used is the term data of coal letter I. The value used by the data of the term I to get bandwidth one on the plot meanshift is at point 18.95. Figure 2. Plot MeanShift letter I 3.2.3. The letter U In Figure 3 is the plot result for the data of the letter U. The data used is the term data of coal letter U. The value used by the data of the term I to get bandwidth one on the plot meanshift is at point 5.75. Figure 3. Plot MeanShift letter U 3.2.4. The letter E In Figure 4 is the plot result for the data of the letter E. The data used is the term data of coal letter E. The value used by the data of the term I to get bandwidth one on the plot meanshift is at point 3.36.
  • 6.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357 1356 Figure 4. Plot MeanShift letter E 3.2.5. The letter O In Figure 5 is the plot result for the data of the letter O. The data used is the term data of coal letter O. The value used by the data of the term I to get bandwidth one on the plot meanshift is at point 7. Figure 5. Plot MeanShift letter O 4. Conclusion Grouping a coal dictionary using the k-means algorithm produces a cluster value on character and word criteria. The last iteration of the Euclidian distance calculation results in different cluster values in each alphabet. The centroid value of the k-means calculation is then combined with the MeanShift algorithm. Centroid calculation results of the meanshift algorithm result in different bandwidth. Letter A produces a bandwidth of 6.76, for the letter I produce bandwidth 18.95, letter U produces bandwidth 5.75, and letter E produces bandwidth 3.36. The grouping of coal dictionaries in this study facilitates for a group of terms in the letter. Suggestions for future research, can classify the data dictionary of coal for the second, third and subsequent letters, and combine other algorithms in grouping such as K-Nearest Neighbour (KNN) algorithm.
  • 7. TELKOMNIKA ISSN: 1693-6930  K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga) 1357 References [1] F Gorunescu,Data Mining: Concepts, models and techniques. Springer Science & Business Media, 2011; 12. [2] MI Ramadhan etal. An analysis of natural disaster data by using k-means and k-medoids algorithm of data mining techniques. in Quality in Research (QiR): International Symposium on Electrical and Computer Engineering, 2017 15th International Conference on. IEEE. 2017: 221–225. [3] J Yi, S Li, M Wu, HA Yeung, WW Fok, Y Wang, F Liu. Cloud-based educational big data application of apriori algorithm and k-means clustering algorithm based on students’information. in Big Data and Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on. IEEE. 2014: 151–158. [4] L Guoli, W Tingting, Y Limei, L Yanping, G Jinqiao. The improved research on k-means clustering algorithm in initial values. in Mechatronic Sciences, Electric Engineering and Computer (MEC), Proceedings 2013 International Conference on. IEEE. 2013; 2124–2127. [5] SH Jung, JC Kim,CB Sim.Prediction data processing scheme using an artificial neural network and data clustering for big data. International Journal of Electrical and Computer Engineering (IJECE). 2016; 6(1): 330-336. [6] A Bedboudi, C Bouras, MT Kimour. An heterogeneous population-based genetic algorithm for data clustering.Indonesian Journal ofElectrical Engineering and Informatics (IJEEI).2017;5(3): 275–284. [7] Q Shi, L Xu, Z Shi, Y Chen, Y Shao. Analysis and research of the campus network user’s behavior based on k-means clustering algorithm. in Digital Manufacturing and Automation (ICDMA), 2013 Fourth International Conference on. IEEE, 2013: 196–201. [8] Y Cheng,W Hongyu, W Xiaohong.Positioning method research for unmanned aerial vehicles based on meanshift tracking algorithm. in Control and Decision Conference (CCDC), 2017 29th Chinese. IEEE 2017: 989–994. [9] RM Awangga, NS Fathonah, TI Hasanudin. Colenak: GPS tracking model for post-stroke rehabilitation program using AES-CBC URL encryption and QR-Code. in 2017 2nd International conferences on Information Technology,Information Systems and Electrical Engineering (ICITISEE). IEEE. Nov 2017: 255–260. [Online]. Available: http://ieeexplore.ieee.org/document/8285506/ [10] A Salhi, Y Moresly, F Ghozzi, A Fakhfakh. Face detection and tracking system with block-matching, meanshift and camshift algorithms and kalman filter. in Sciences and Techniques of Automatic Control and Computer Engineering (STA), 2017 18th International Conference on. IEEE. 2017: 139– 145. [11] IN Yulita, S Purwani, R Rosadi, RM Awangga. A quantization of deep belief networks for long short- term memory in sleep stage detection. in Advanced Informatics, Concepts, Theory, and Applications (ICAICTA), 2017 International Conference on. IEEE, 2017: 1-5. [12] N Ari, M Ustazhanov. Matplotlib in python. in 2014 11th International Conference on Electronics, Computer and Computation (ICECCO). IEEE, sep 2014. [Online]. Available: https://doi.org/10.1109/icecco.2014.6997585 [13] J MacQueen et al. Some methods for classification and analysis of multivariate observations in Proceedings ofthe fifth Berkeley symposium on mathematical statistics and probability. Oakland, CA, USA. 1967; 1(14): 281–297. [14] AK Jain. Data clustering:50 years beyond k-means.Pattern recognition letters.2010;31(8): 651-666. [15] Z Gheid, Y Challal. Efficient and privacy-preserving k-means clustering for big data mining. in Trustcom/BigDataSE/I SPA, IEEE. 2016: 791–798. [16] K Obbie. Penerapan algoritma klasifikasi data mining id3 untuk menentukan penjurusan siswasman6 semarang. Skripsi, Fakultas Ilmu Komputer, 2014. [17] Y Liu, SZ Li, W Wu, R Huang. Dynamics of a mean-shift-like algorithm and its applications on clustering. Information Processing Letters. 2013; 113(1): 8–16. [18] G Chen,Q Chen,D Zhang. Mean shift: A method for measurementmatrix ofcompressive sensing.in Digital Home (ICDH), 2014 5th International Conference on. IEEE. 2014: 64–69. [19] YA Ghassabeh.A sufficientcondition for the convergence of the mean shift algorithm with gaussian kernel. Journal of Multivariate Analysis. 2015; 135: 1–10. [20] K Fukunaga, L Hostetler. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Transactions on information theory. 1975; 21(1): 32–40. [21] C Yang, R Duraiswami, NA Gumerov, L Davis. Improved fast gauss transform and efficient kernel density estimation. in null. IEEE. 2003: 464.