SlideShare a Scribd company logo
TELKOMNIKA, Vol.16, No.3, June 2018, pp. 1351~1357
ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013
DOI: 10.12928/TELKOMNIKA.v16i3.8910  1351
Received September 18, 2017; Revised February 8, 2018; Accepted April 5, 2018
K Means Clustering and Meanshift Analysis for
Grouping the Data of Coal Term in Puslitbang tekMIRA
Rolly Maulana Awangga*
1
, Syafrial Fachri Pane
2
, Khaera Tunnisa
3
,
Iping Supriana Suwardi
4
1,2,3
Program in Informatics Engineering, Politeknik Pos Indonesia, Indonesia
4
School of Electrical Engineering and Informatics, Bandung Institute of Technology, Indonesia
*Corresponding author, e-mail: awangga@poltekpos.ac.id
Abstract
Indonesian government agencies under the Ministry of Energy and Mineral Resources have
problems in classifying data dictionary of coal. This research conduct grouping coal dictionary using K-
Means and MeanShift algorithm. K-means algorithm is used to get cluster value on character and word
criteria. The last iteration of Euclidian distance calculation data on k-means combine with Meanshift
algorithm. The meanshift calculates centroid by selecting different bandwidths. The result of grouping
using k-means and meanshift algorithm shows different centroid to find optimum bandwidth value. The
data dictionary of this research has sorted in alphabetically.
Keywords:coal dictionary,clustering, K-means,meanshift
Copyright © 2018 Universitas Ahmad Dahlan. All rights reserved.
1. Introduction
Data Mining is a process of extracting data or filtering data by utilising a collection of a
big data. The data through a series of operations to obtain valuable information from the data.
According to Daryl Pregibon, the composition of artificial statistics, artificial intelligence, and
database research called data mining [1]. Data mining techniques used to explore the data
using classification, prediction, grouping, outlier detection, association rules, sequence analysis,
time series analysis and text mining, and some new techniques such as social networking
analysis and sentiment analysis [2]. The center of research and development mineral and coal
technology (Puslitbang tekMIRA) is an Indonesia government institution under the Ministry of
Energy and Mineral Resources. Puslitbang tekMIRA has a dictionary of coal. This coal
dictionary has one thousand coal terms. It takes time between three to five minutes to find a
term inside a coal dictionary.
Based on these problems, data mining method is used to classify data dictionary. This
research conduct grouping data dictionary of a coal term using data mining method. K-Means
and Meanshift Algorithm were chosen in this research. K-Means algorithm was used to
categories students with skills such as cognitive, communication and relational[3], to evaluate
student achievement levels for course content [4], and to group data based on user information
created on SNS and recommend to users in the future [5]. The grouping data based on user
sentences by utilising regularity among data pursued by the user using K-Means Algorithm [6].
The genetic algorithm and K-Means are used to calculate clustered centroid with heterogeneous
populations that lead to better results than using random numbers [7]. The Meanshift algorithm
is used to accurately detect the location of target tracking [8] by using this method and then
make it easy the accuracy of the calculation of the tracking results [9]. The Meanshift algorithm
is also used to solve facial-detection and tracking-based systems [10], if an imbalance occurs, it
will affect the performance of [11].
In this research the algorithm grouping a coal term in a dictionary based on character
and word in a cluster. The cluster serve data dictionary of coal based on predetermined criteria.
The result data of clustering using K-Means and Meanshift algorithm is shown using the
matloplib plot. Matplotlib is a Python package for Plotting that produces quality production
graphs. Matplotlib is designed to be able to create simple and complex plots with multiple
commands [12].
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357
1352
2. Research Method
2.1. K-Means
K-Means Clustering is a grouping of data, where the data in K Means Clustering K is
the amount of data or the number of constants. Means is the average value of the data set as
Cluster [7]. So K-Means is a method to analyse data or called data mining method where this
data modelling method without using supervise or unsupervised method and K-Means is a
method to classify data by using partition system. The K-Means method solves large amounts of
data in groups, and the data has the same characteristics as other data. Also, the team feature
also has a feature [13]. Similitude is some matrix used for similarities between instances in a
cluster. K-Means is an algorithm used to generate k clusters from a collection of data sets in a
simple way [14]. Algorithm 1 is an explanation of k-means implementation. Algorithm 1: k-
means clustering: Randomly select k cluster centers c1, ... ck, Repeat, Set each data entity to
the closest cluster center ci, Change the cluster center with the average cluster i, until the
cluster center does not change[15]. K - Means algorithm formula d = distance, j = amount of
data, c = centroid, x = data, c = centroid 1 . The Euclidean distance formula is described in
Equation 2.
( ) √∑
√
D (i, j) is the distance of data between i and center cluster j. Xki is the data to i on
attribute data to k. Xkj is the center point to j at attribute to k.
Recalculate cluster center with current cluster membership. The cluster center is the
average of all data/objects in a particular cluster. If desired it can also use the median of the
cluster. So mean (mean) is not the only size that can use.Reassign each object using the new
cluster center. If the cluster center does not change again, then the clustering process is
complete. Alternatively, return to step number 3 until the center of the cluster does not change
anymore [16].
2.2. Meanshift
The average shift method is a method for determining the maximum function limit of
density with separate data from a function. The average shift use as a media grouping method,
in which each mode represents each group [17]-18]. The average shift method classifies data in
its search mode that directs and moves data to the point region along with the iteration of the
data environment built with the Gaussian Kernel [19]. Starting from one data point and iteratively
will improve the approximate mode [20]. The Gaussian kernel is a differentiated multivariate
kernel function used for actual calculations in assumptions [21]. Bandwidth is a free parameter
that shows the effect on the estimated density generated.
3. Results and Analysis
Grouping of data of coal dictionary using K-Means and Meanshift Algorithm. The
grouping results are displayed using matplotlib plot. The sample data taken are vowel data (A, I,
U, E, O) for the coal dictionary. Calculation data using K-Means algorithm. The result data of the
last iteration, i.e. data that has been stable and without changes.
3.1. Result K-Means
3.1.1. The letter A
The data of the last iteration calculation using euclidian distance k- means for letter A.
Clustering data using character and word criteria and use the centroid value of the highest and
lowest values on the criteria. In table 1 is a cluster value that has a fixed and unchanged cluster.
TELKOMNIKA ISSN: 1693-6930 
K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga)
1353
Table 1. Data cluster of the letter A
3.1.2. The letter I
The data of the last iteration calculation using euclidian distance k- means for letter I.
Clustering data using character and word criteria and use the centroid value of the highest and
lowest values on the criteria. In Table 2 is a cluster value that has a fixed and unchanged
cluster.
Table 2. Data cluster of the letter I
3.1.3. The letter U
The data of the last iteration calculation using euclidian distance k- means for letter U.
Clustering data using character and word criteria and use the centroid value of the highest and
lowest values on the criteria. In Table 3 is a cluster value that has a fixed and unchanged
cluster.
Table 3. Data cluster of the letter U
Character Word
0 6.297319
24.02082 17.96443
24.02082 17.96443
....... .......
29.01724 22.93403
23.02173 16.97265
3.1.4. The letter E
The data of the last iteration calculation using euclidian distance k- means for letter E.
Clustering data using character and word criteria and use the centroid value of the highest and
Character Word
19.04536 3.365126
12.0041 3.78689
7.033561 8.84361
17.00289 1.21311
....... .......
8.006149 7.78689
11.00447 4.78689
27.20259 11.60749
Character Word
56.4358 1.234823
62.39391 4.774678
37.21559 20.42897
43.18565 14.5126
....... .......
59.4138 1.777967
67.47592 9.838576
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357
1354
lowest values on the criteria. In Table 4 is a cluster value that has a fixed and unchanged
cluster.
Table 4. Data Cluster of the Letter E
3.1.5. The letter O
The data of the last iteration calculation using euclidian distance k- means for letter O.
Clustering data using character and word criteria and use the centroid value of the highest and
lowest values on the criteria. In Table 5 is a cluster value that has a fixed and unchanged
cluster.
Table 5. Data Cluster of the Letter O
Character Word
12.33333 6.342096
3.33333 2.687423
....... .......
2.33333 3.726783
14.66667 20.73377
3.2. Meanshift
3.2.1. The letter A
In Figure 1 is the plot result for the data of the letter A. The data used is the term data of
coal letter A. The value used by the data of the term A to get bandwidth one on the plot
meanshift is at point 6.76.
Figure 1. Plot Mean Shift letter A
Character Word
13.15295 1.009718
14.14214 1.973457
14.14214 1.973457
....... .......
7.071068 5.100934
10 2.625744
TELKOMNIKA ISSN: 1693-6930 
K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga)
1355
3.2.2. The letter I
In Figure 2 is the plot result for the data of the letter I. The data used is the term data of
coal letter I. The value used by the data of the term I to get bandwidth one on the plot meanshift
is at point 18.95.
Figure 2. Plot MeanShift letter I
3.2.3. The letter U
In Figure 3 is the plot result for the data of the letter U. The data used is the term data of
coal letter U. The value used by the data of the term I to get bandwidth one on the plot
meanshift is at point 5.75.
Figure 3. Plot MeanShift letter U
3.2.4. The letter E
In Figure 4 is the plot result for the data of the letter E. The data used is the term data
of coal letter E. The value used by the data of the term I to get bandwidth one on the plot
meanshift is at point 3.36.
 ISSN: 1693-6930
TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357
1356
Figure 4. Plot MeanShift letter E
3.2.5. The letter O
In Figure 5 is the plot result for the data of the letter O. The data used is the term data
of coal letter O. The value used by the data of the term I to get bandwidth one on the plot
meanshift is at point 7.
Figure 5. Plot MeanShift letter O
4. Conclusion
Grouping a coal dictionary using the k-means algorithm produces a cluster value on
character and word criteria. The last iteration of the Euclidian distance calculation results in
different cluster values in each alphabet. The centroid value of the k-means calculation is then
combined with the MeanShift algorithm. Centroid calculation results of the meanshift algorithm
result in different bandwidth. Letter A produces a bandwidth of 6.76, for the letter I produce
bandwidth 18.95, letter U produces bandwidth 5.75, and letter E produces bandwidth 3.36. The
grouping of coal dictionaries in this study facilitates for a group of terms in the letter.
Suggestions for future research, can classify the data dictionary of coal for the second, third and
subsequent letters, and combine other algorithms in grouping such as K-Nearest Neighbour
(KNN) algorithm.
TELKOMNIKA ISSN: 1693-6930 
K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga)
1357
References
[1] F Gorunescu,Data Mining: Concepts, models and techniques. Springer Science & Business Media,
2011; 12.
[2] MI Ramadhan etal. An analysis of natural disaster data by using k-means and k-medoids algorithm
of data mining techniques. in Quality in Research (QiR): International Symposium on Electrical and
Computer Engineering, 2017 15th International Conference on. IEEE. 2017: 221–225.
[3] J Yi, S Li, M Wu, HA Yeung, WW Fok, Y Wang, F Liu. Cloud-based educational big data application
of apriori algorithm and k-means clustering algorithm based on students’information. in Big Data and
Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on. IEEE. 2014: 151–158.
[4] L Guoli, W Tingting, Y Limei, L Yanping, G Jinqiao. The improved research on k-means clustering
algorithm in initial values. in Mechatronic Sciences, Electric Engineering and Computer (MEC),
Proceedings 2013 International Conference on. IEEE. 2013; 2124–2127.
[5] SH Jung, JC Kim,CB Sim.Prediction data processing scheme using an artificial neural network and
data clustering for big data. International Journal of Electrical and Computer Engineering (IJECE).
2016; 6(1): 330-336.
[6] A Bedboudi, C Bouras, MT Kimour. An heterogeneous population-based genetic algorithm for data
clustering.Indonesian Journal ofElectrical Engineering and Informatics (IJEEI).2017;5(3): 275–284.
[7] Q Shi, L Xu, Z Shi, Y Chen, Y Shao. Analysis and research of the campus network user’s behavior
based on k-means clustering algorithm. in Digital Manufacturing and Automation (ICDMA), 2013
Fourth International Conference on. IEEE, 2013: 196–201.
[8] Y Cheng,W Hongyu, W Xiaohong.Positioning method research for unmanned aerial vehicles based
on meanshift tracking algorithm. in Control and Decision Conference (CCDC), 2017 29th Chinese.
IEEE 2017: 989–994.
[9] RM Awangga, NS Fathonah, TI Hasanudin. Colenak: GPS tracking model for post-stroke
rehabilitation program using AES-CBC URL encryption and QR-Code. in 2017 2nd International
conferences on Information Technology,Information Systems and Electrical Engineering (ICITISEE).
IEEE. Nov 2017: 255–260. [Online]. Available: http://ieeexplore.ieee.org/document/8285506/
[10] A Salhi, Y Moresly, F Ghozzi, A Fakhfakh. Face detection and tracking system with block-matching,
meanshift and camshift algorithms and kalman filter. in Sciences and Techniques of Automatic
Control and Computer Engineering (STA), 2017 18th International Conference on. IEEE. 2017: 139–
145.
[11] IN Yulita, S Purwani, R Rosadi, RM Awangga. A quantization of deep belief networks for long short-
term memory in sleep stage detection. in Advanced Informatics, Concepts, Theory, and Applications
(ICAICTA), 2017 International Conference on. IEEE, 2017: 1-5.
[12] N Ari, M Ustazhanov. Matplotlib in python. in 2014 11th International Conference on Electronics,
Computer and Computation (ICECCO). IEEE, sep 2014. [Online]. Available:
https://doi.org/10.1109/icecco.2014.6997585
[13] J MacQueen et al. Some methods for classification and analysis of multivariate observations in
Proceedings ofthe fifth Berkeley symposium on mathematical statistics and probability. Oakland, CA,
USA. 1967; 1(14): 281–297.
[14] AK Jain. Data clustering:50 years beyond k-means.Pattern recognition letters.2010;31(8): 651-666.
[15] Z Gheid, Y Challal. Efficient and privacy-preserving k-means clustering for big data mining. in
Trustcom/BigDataSE/I SPA, IEEE. 2016: 791–798.
[16] K Obbie. Penerapan algoritma klasifikasi data mining id3 untuk menentukan penjurusan siswasman6
semarang. Skripsi, Fakultas Ilmu Komputer, 2014.
[17] Y Liu, SZ Li, W Wu, R Huang. Dynamics of a mean-shift-like algorithm and its applications on
clustering. Information Processing Letters. 2013; 113(1): 8–16.
[18] G Chen,Q Chen,D Zhang. Mean shift: A method for measurementmatrix ofcompressive sensing.in
Digital Home (ICDH), 2014 5th International Conference on. IEEE. 2014: 64–69.
[19] YA Ghassabeh.A sufficientcondition for the convergence of the mean shift algorithm with gaussian
kernel. Journal of Multivariate Analysis. 2015; 135: 1–10.
[20] K Fukunaga, L Hostetler. The estimation of the gradient of a density function, with applications in
pattern recognition. IEEE Transactions on information theory. 1975; 21(1): 32–40.
[21] C Yang, R Duraiswami, NA Gumerov, L Davis. Improved fast gauss transform and efficient kernel
density estimation. in null. IEEE. 2003: 464.

More Related Content

What's hot

Scalable and efficient cluster based framework for
Scalable and efficient cluster based framework forScalable and efficient cluster based framework for
Scalable and efficient cluster based framework for
eSAT Publishing House
 
The step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load dataThe step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load data
TELKOMNIKA JOURNAL
 
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type Data
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type  Data MK-Prototypes: A Novel Algorithm for Clustering Mixed Type  Data
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type Data
IJMER
 
Mine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means ClusteringMine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means Clustering
ijcsity
 
A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce
IJECEIAES
 
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
journalBEEI
 
Cg33504508
Cg33504508Cg33504508
Cg33504508
IJERA Editor
 
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
cscpconf
 
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
IJECEIAES
 
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Scientific Review
 
Textural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image ClassificationTextural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image Classification
CSCJournals
 
Imagethresholding
ImagethresholdingImagethresholding
Imagethresholding
ananta200
 
Ip3514921495
Ip3514921495Ip3514921495
Ip3514921495
IJERA Editor
 
A046010107
A046010107A046010107
A046010107
IJERA Editor
 
MCCS
MCCSMCCS
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
ijmnct
 
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
ijmnct
 
Big Data Clustering Model based on Fuzzy Gaussian
Big Data Clustering Model based on Fuzzy GaussianBig Data Clustering Model based on Fuzzy Gaussian
Big Data Clustering Model based on Fuzzy Gaussian
IJCSIS Research Publications
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)
Alexander Decker
 
The improved k means with particle swarm optimization
The improved k means with particle swarm optimizationThe improved k means with particle swarm optimization
The improved k means with particle swarm optimization
Alexander Decker
 

What's hot (20)

Scalable and efficient cluster based framework for
Scalable and efficient cluster based framework forScalable and efficient cluster based framework for
Scalable and efficient cluster based framework for
 
The step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load dataThe step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load data
 
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type Data
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type  Data MK-Prototypes: A Novel Algorithm for Clustering Mixed Type  Data
MK-Prototypes: A Novel Algorithm for Clustering Mixed Type Data
 
Mine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means ClusteringMine Blood Donors Information through Improved K-Means Clustering
Mine Blood Donors Information through Improved K-Means Clustering
 
A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce
 
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...Improving K-NN Internet Traffic Classification Using Clustering and Principle...
Improving K-NN Internet Traffic Classification Using Clustering and Principle...
 
Cg33504508
Cg33504508Cg33504508
Cg33504508
 
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
 
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
System for Prediction of Non Stationary Time Series based on the Wavelet Radi...
 
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
 
Textural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image ClassificationTextural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image Classification
 
Imagethresholding
ImagethresholdingImagethresholding
Imagethresholding
 
Ip3514921495
Ip3514921495Ip3514921495
Ip3514921495
 
A046010107
A046010107A046010107
A046010107
 
MCCS
MCCSMCCS
MCCS
 
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
 
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
SOLVING OPTIMAL COMPONENTS ASSIGNMENT PROBLEM FOR A MULTISTATE NETWORK USING ...
 
Big Data Clustering Model based on Fuzzy Gaussian
Big Data Clustering Model based on Fuzzy GaussianBig Data Clustering Model based on Fuzzy Gaussian
Big Data Clustering Model based on Fuzzy Gaussian
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)
 
The improved k means with particle swarm optimization
The improved k means with particle swarm optimizationThe improved k means with particle swarm optimization
The improved k means with particle swarm optimization
 

Similar to K Means Clustering and Meanshift Analysis for Grouping the Data of Coal Term in Puslitbang tekMIRA

K means report
K means reportK means report
K means report
Gaurav Handa
 
An improvement in k mean clustering algorithm using better time and accuracy
An improvement in k mean clustering algorithm using better time and accuracyAn improvement in k mean clustering algorithm using better time and accuracy
An improvement in k mean clustering algorithm using better time and accuracy
ijpla
 
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
IJECEIAES
 
Max stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problemMax stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problem
nooriasukmaningtyas
 
Premeditated Initial Points for K-Means Clustering
Premeditated Initial Points for K-Means ClusteringPremeditated Initial Points for K-Means Clustering
Premeditated Initial Points for K-Means Clustering
IJCSIS Research Publications
 
K-means Clustering Method for the Analysis of Log Data
K-means Clustering Method for the Analysis of Log DataK-means Clustering Method for the Analysis of Log Data
K-means Clustering Method for the Analysis of Log Data
idescitation
 
crime rate pridicition using k-means.pdf
crime rate pridicition using k-means.pdfcrime rate pridicition using k-means.pdf
crime rate pridicition using k-means.pdf
saiKrishnaReddy558028
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithms
IJDKP
 
Af4201214217
Af4201214217Af4201214217
Af4201214217
IJERA Editor
 
Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2IAEME Publication
 
Increasing electrical grid stability classification performance using ensemb...
Increasing electrical grid stability classification performance  using ensemb...Increasing electrical grid stability classification performance  using ensemb...
Increasing electrical grid stability classification performance using ensemb...
IJECEIAES
 
84cc04ff77007e457df6aa2b814d2346bf1b
84cc04ff77007e457df6aa2b814d2346bf1b84cc04ff77007e457df6aa2b814d2346bf1b
84cc04ff77007e457df6aa2b814d2346bf1b
PRAWEEN KUMAR
 
A Preference Model on Adaptive Affinity Propagation
A Preference Model on Adaptive Affinity PropagationA Preference Model on Adaptive Affinity Propagation
A Preference Model on Adaptive Affinity Propagation
IJECEIAES
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
csandit
 
Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks
IJECEIAES
 
Comparision of methods for combination of multiple classifiers that predict b...
Comparision of methods for combination of multiple classifiers that predict b...Comparision of methods for combination of multiple classifiers that predict b...
Comparision of methods for combination of multiple classifiers that predict b...
IJERA Editor
 
Enhancement of student performance prediction using modified K-nearest neighbor
Enhancement of student performance prediction using modified K-nearest neighborEnhancement of student performance prediction using modified K-nearest neighbor
Enhancement of student performance prediction using modified K-nearest neighbor
TELKOMNIKA JOURNAL
 
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
IJECEIAES
 
A Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data MiningA Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data Mining
Natasha Grant
 
Optimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering AlgorithmOptimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering Algorithm
IJERA Editor
 

Similar to K Means Clustering and Meanshift Analysis for Grouping the Data of Coal Term in Puslitbang tekMIRA (20)

K means report
K means reportK means report
K means report
 
An improvement in k mean clustering algorithm using better time and accuracy
An improvement in k mean clustering algorithm using better time and accuracyAn improvement in k mean clustering algorithm using better time and accuracy
An improvement in k mean clustering algorithm using better time and accuracy
 
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
Parametric Comparison of K-means and Adaptive K-means Clustering Performance ...
 
Max stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problemMax stable set problem to found the initial centroids in clustering problem
Max stable set problem to found the initial centroids in clustering problem
 
Premeditated Initial Points for K-Means Clustering
Premeditated Initial Points for K-Means ClusteringPremeditated Initial Points for K-Means Clustering
Premeditated Initial Points for K-Means Clustering
 
K-means Clustering Method for the Analysis of Log Data
K-means Clustering Method for the Analysis of Log DataK-means Clustering Method for the Analysis of Log Data
K-means Clustering Method for the Analysis of Log Data
 
crime rate pridicition using k-means.pdf
crime rate pridicition using k-means.pdfcrime rate pridicition using k-means.pdf
crime rate pridicition using k-means.pdf
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithms
 
Af4201214217
Af4201214217Af4201214217
Af4201214217
 
Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2
 
Increasing electrical grid stability classification performance using ensemb...
Increasing electrical grid stability classification performance  using ensemb...Increasing electrical grid stability classification performance  using ensemb...
Increasing electrical grid stability classification performance using ensemb...
 
84cc04ff77007e457df6aa2b814d2346bf1b
84cc04ff77007e457df6aa2b814d2346bf1b84cc04ff77007e457df6aa2b814d2346bf1b
84cc04ff77007e457df6aa2b814d2346bf1b
 
A Preference Model on Adaptive Affinity Propagation
A Preference Model on Adaptive Affinity PropagationA Preference Model on Adaptive Affinity Propagation
A Preference Model on Adaptive Affinity Propagation
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
 
Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks Clustering in Aggregated User Profiles across Multiple Social Networks
Clustering in Aggregated User Profiles across Multiple Social Networks
 
Comparision of methods for combination of multiple classifiers that predict b...
Comparision of methods for combination of multiple classifiers that predict b...Comparision of methods for combination of multiple classifiers that predict b...
Comparision of methods for combination of multiple classifiers that predict b...
 
Enhancement of student performance prediction using modified K-nearest neighbor
Enhancement of student performance prediction using modified K-nearest neighborEnhancement of student performance prediction using modified K-nearest neighbor
Enhancement of student performance prediction using modified K-nearest neighbor
 
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval
 
A Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data MiningA Comparative Study Of Various Clustering Algorithms In Data Mining
A Comparative Study Of Various Clustering Algorithms In Data Mining
 
Optimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering AlgorithmOptimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering Algorithm
 

More from TELKOMNIKA JOURNAL

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...
TELKOMNIKA JOURNAL
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...
TELKOMNIKA JOURNAL
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...
TELKOMNIKA JOURNAL
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
TELKOMNIKA JOURNAL
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...
TELKOMNIKA JOURNAL
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antenna
TELKOMNIKA JOURNAL
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fire
TELKOMNIKA JOURNAL
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio network
TELKOMNIKA JOURNAL
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bands
TELKOMNIKA JOURNAL
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...
TELKOMNIKA JOURNAL
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertainties
TELKOMNIKA JOURNAL
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
TELKOMNIKA JOURNAL
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
TELKOMNIKA JOURNAL
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...
TELKOMNIKA JOURNAL
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensor
TELKOMNIKA JOURNAL
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
TELKOMNIKA JOURNAL
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...
TELKOMNIKA JOURNAL
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networks
TELKOMNIKA JOURNAL
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imaging
TELKOMNIKA JOURNAL
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
TELKOMNIKA JOURNAL
 

More from TELKOMNIKA JOURNAL (20)

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antenna
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fire
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio network
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bands
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertainties
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensor
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networks
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imaging
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
 

Recently uploaded

Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
ydteq
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
SamSarthak3
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
ViniHema
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 

Recently uploaded (20)

Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 

K Means Clustering and Meanshift Analysis for Grouping the Data of Coal Term in Puslitbang tekMIRA

  • 1. TELKOMNIKA, Vol.16, No.3, June 2018, pp. 1351~1357 ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013 DOI: 10.12928/TELKOMNIKA.v16i3.8910  1351 Received September 18, 2017; Revised February 8, 2018; Accepted April 5, 2018 K Means Clustering and Meanshift Analysis for Grouping the Data of Coal Term in Puslitbang tekMIRA Rolly Maulana Awangga* 1 , Syafrial Fachri Pane 2 , Khaera Tunnisa 3 , Iping Supriana Suwardi 4 1,2,3 Program in Informatics Engineering, Politeknik Pos Indonesia, Indonesia 4 School of Electrical Engineering and Informatics, Bandung Institute of Technology, Indonesia *Corresponding author, e-mail: awangga@poltekpos.ac.id Abstract Indonesian government agencies under the Ministry of Energy and Mineral Resources have problems in classifying data dictionary of coal. This research conduct grouping coal dictionary using K- Means and MeanShift algorithm. K-means algorithm is used to get cluster value on character and word criteria. The last iteration of Euclidian distance calculation data on k-means combine with Meanshift algorithm. The meanshift calculates centroid by selecting different bandwidths. The result of grouping using k-means and meanshift algorithm shows different centroid to find optimum bandwidth value. The data dictionary of this research has sorted in alphabetically. Keywords:coal dictionary,clustering, K-means,meanshift Copyright © 2018 Universitas Ahmad Dahlan. All rights reserved. 1. Introduction Data Mining is a process of extracting data or filtering data by utilising a collection of a big data. The data through a series of operations to obtain valuable information from the data. According to Daryl Pregibon, the composition of artificial statistics, artificial intelligence, and database research called data mining [1]. Data mining techniques used to explore the data using classification, prediction, grouping, outlier detection, association rules, sequence analysis, time series analysis and text mining, and some new techniques such as social networking analysis and sentiment analysis [2]. The center of research and development mineral and coal technology (Puslitbang tekMIRA) is an Indonesia government institution under the Ministry of Energy and Mineral Resources. Puslitbang tekMIRA has a dictionary of coal. This coal dictionary has one thousand coal terms. It takes time between three to five minutes to find a term inside a coal dictionary. Based on these problems, data mining method is used to classify data dictionary. This research conduct grouping data dictionary of a coal term using data mining method. K-Means and Meanshift Algorithm were chosen in this research. K-Means algorithm was used to categories students with skills such as cognitive, communication and relational[3], to evaluate student achievement levels for course content [4], and to group data based on user information created on SNS and recommend to users in the future [5]. The grouping data based on user sentences by utilising regularity among data pursued by the user using K-Means Algorithm [6]. The genetic algorithm and K-Means are used to calculate clustered centroid with heterogeneous populations that lead to better results than using random numbers [7]. The Meanshift algorithm is used to accurately detect the location of target tracking [8] by using this method and then make it easy the accuracy of the calculation of the tracking results [9]. The Meanshift algorithm is also used to solve facial-detection and tracking-based systems [10], if an imbalance occurs, it will affect the performance of [11]. In this research the algorithm grouping a coal term in a dictionary based on character and word in a cluster. The cluster serve data dictionary of coal based on predetermined criteria. The result data of clustering using K-Means and Meanshift algorithm is shown using the matloplib plot. Matplotlib is a Python package for Plotting that produces quality production graphs. Matplotlib is designed to be able to create simple and complex plots with multiple commands [12].
  • 2.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357 1352 2. Research Method 2.1. K-Means K-Means Clustering is a grouping of data, where the data in K Means Clustering K is the amount of data or the number of constants. Means is the average value of the data set as Cluster [7]. So K-Means is a method to analyse data or called data mining method where this data modelling method without using supervise or unsupervised method and K-Means is a method to classify data by using partition system. The K-Means method solves large amounts of data in groups, and the data has the same characteristics as other data. Also, the team feature also has a feature [13]. Similitude is some matrix used for similarities between instances in a cluster. K-Means is an algorithm used to generate k clusters from a collection of data sets in a simple way [14]. Algorithm 1 is an explanation of k-means implementation. Algorithm 1: k- means clustering: Randomly select k cluster centers c1, ... ck, Repeat, Set each data entity to the closest cluster center ci, Change the cluster center with the average cluster i, until the cluster center does not change[15]. K - Means algorithm formula d = distance, j = amount of data, c = centroid, x = data, c = centroid 1 . The Euclidean distance formula is described in Equation 2. ( ) √∑ √ D (i, j) is the distance of data between i and center cluster j. Xki is the data to i on attribute data to k. Xkj is the center point to j at attribute to k. Recalculate cluster center with current cluster membership. The cluster center is the average of all data/objects in a particular cluster. If desired it can also use the median of the cluster. So mean (mean) is not the only size that can use.Reassign each object using the new cluster center. If the cluster center does not change again, then the clustering process is complete. Alternatively, return to step number 3 until the center of the cluster does not change anymore [16]. 2.2. Meanshift The average shift method is a method for determining the maximum function limit of density with separate data from a function. The average shift use as a media grouping method, in which each mode represents each group [17]-18]. The average shift method classifies data in its search mode that directs and moves data to the point region along with the iteration of the data environment built with the Gaussian Kernel [19]. Starting from one data point and iteratively will improve the approximate mode [20]. The Gaussian kernel is a differentiated multivariate kernel function used for actual calculations in assumptions [21]. Bandwidth is a free parameter that shows the effect on the estimated density generated. 3. Results and Analysis Grouping of data of coal dictionary using K-Means and Meanshift Algorithm. The grouping results are displayed using matplotlib plot. The sample data taken are vowel data (A, I, U, E, O) for the coal dictionary. Calculation data using K-Means algorithm. The result data of the last iteration, i.e. data that has been stable and without changes. 3.1. Result K-Means 3.1.1. The letter A The data of the last iteration calculation using euclidian distance k- means for letter A. Clustering data using character and word criteria and use the centroid value of the highest and lowest values on the criteria. In table 1 is a cluster value that has a fixed and unchanged cluster.
  • 3. TELKOMNIKA ISSN: 1693-6930  K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga) 1353 Table 1. Data cluster of the letter A 3.1.2. The letter I The data of the last iteration calculation using euclidian distance k- means for letter I. Clustering data using character and word criteria and use the centroid value of the highest and lowest values on the criteria. In Table 2 is a cluster value that has a fixed and unchanged cluster. Table 2. Data cluster of the letter I 3.1.3. The letter U The data of the last iteration calculation using euclidian distance k- means for letter U. Clustering data using character and word criteria and use the centroid value of the highest and lowest values on the criteria. In Table 3 is a cluster value that has a fixed and unchanged cluster. Table 3. Data cluster of the letter U Character Word 0 6.297319 24.02082 17.96443 24.02082 17.96443 ....... ....... 29.01724 22.93403 23.02173 16.97265 3.1.4. The letter E The data of the last iteration calculation using euclidian distance k- means for letter E. Clustering data using character and word criteria and use the centroid value of the highest and Character Word 19.04536 3.365126 12.0041 3.78689 7.033561 8.84361 17.00289 1.21311 ....... ....... 8.006149 7.78689 11.00447 4.78689 27.20259 11.60749 Character Word 56.4358 1.234823 62.39391 4.774678 37.21559 20.42897 43.18565 14.5126 ....... ....... 59.4138 1.777967 67.47592 9.838576
  • 4.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357 1354 lowest values on the criteria. In Table 4 is a cluster value that has a fixed and unchanged cluster. Table 4. Data Cluster of the Letter E 3.1.5. The letter O The data of the last iteration calculation using euclidian distance k- means for letter O. Clustering data using character and word criteria and use the centroid value of the highest and lowest values on the criteria. In Table 5 is a cluster value that has a fixed and unchanged cluster. Table 5. Data Cluster of the Letter O Character Word 12.33333 6.342096 3.33333 2.687423 ....... ....... 2.33333 3.726783 14.66667 20.73377 3.2. Meanshift 3.2.1. The letter A In Figure 1 is the plot result for the data of the letter A. The data used is the term data of coal letter A. The value used by the data of the term A to get bandwidth one on the plot meanshift is at point 6.76. Figure 1. Plot Mean Shift letter A Character Word 13.15295 1.009718 14.14214 1.973457 14.14214 1.973457 ....... ....... 7.071068 5.100934 10 2.625744
  • 5. TELKOMNIKA ISSN: 1693-6930  K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga) 1355 3.2.2. The letter I In Figure 2 is the plot result for the data of the letter I. The data used is the term data of coal letter I. The value used by the data of the term I to get bandwidth one on the plot meanshift is at point 18.95. Figure 2. Plot MeanShift letter I 3.2.3. The letter U In Figure 3 is the plot result for the data of the letter U. The data used is the term data of coal letter U. The value used by the data of the term I to get bandwidth one on the plot meanshift is at point 5.75. Figure 3. Plot MeanShift letter U 3.2.4. The letter E In Figure 4 is the plot result for the data of the letter E. The data used is the term data of coal letter E. The value used by the data of the term I to get bandwidth one on the plot meanshift is at point 3.36.
  • 6.  ISSN: 1693-6930 TELKOMNIKA Vol. 16, No. 3, June 2018: 1351-1357 1356 Figure 4. Plot MeanShift letter E 3.2.5. The letter O In Figure 5 is the plot result for the data of the letter O. The data used is the term data of coal letter O. The value used by the data of the term I to get bandwidth one on the plot meanshift is at point 7. Figure 5. Plot MeanShift letter O 4. Conclusion Grouping a coal dictionary using the k-means algorithm produces a cluster value on character and word criteria. The last iteration of the Euclidian distance calculation results in different cluster values in each alphabet. The centroid value of the k-means calculation is then combined with the MeanShift algorithm. Centroid calculation results of the meanshift algorithm result in different bandwidth. Letter A produces a bandwidth of 6.76, for the letter I produce bandwidth 18.95, letter U produces bandwidth 5.75, and letter E produces bandwidth 3.36. The grouping of coal dictionaries in this study facilitates for a group of terms in the letter. Suggestions for future research, can classify the data dictionary of coal for the second, third and subsequent letters, and combine other algorithms in grouping such as K-Nearest Neighbour (KNN) algorithm.
  • 7. TELKOMNIKA ISSN: 1693-6930  K Means Clustering and Meanshift Analysis for Grouping… (Rolly Maulana Awangga) 1357 References [1] F Gorunescu,Data Mining: Concepts, models and techniques. Springer Science & Business Media, 2011; 12. [2] MI Ramadhan etal. An analysis of natural disaster data by using k-means and k-medoids algorithm of data mining techniques. in Quality in Research (QiR): International Symposium on Electrical and Computer Engineering, 2017 15th International Conference on. IEEE. 2017: 221–225. [3] J Yi, S Li, M Wu, HA Yeung, WW Fok, Y Wang, F Liu. Cloud-based educational big data application of apriori algorithm and k-means clustering algorithm based on students’information. in Big Data and Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on. IEEE. 2014: 151–158. [4] L Guoli, W Tingting, Y Limei, L Yanping, G Jinqiao. The improved research on k-means clustering algorithm in initial values. in Mechatronic Sciences, Electric Engineering and Computer (MEC), Proceedings 2013 International Conference on. IEEE. 2013; 2124–2127. [5] SH Jung, JC Kim,CB Sim.Prediction data processing scheme using an artificial neural network and data clustering for big data. International Journal of Electrical and Computer Engineering (IJECE). 2016; 6(1): 330-336. [6] A Bedboudi, C Bouras, MT Kimour. An heterogeneous population-based genetic algorithm for data clustering.Indonesian Journal ofElectrical Engineering and Informatics (IJEEI).2017;5(3): 275–284. [7] Q Shi, L Xu, Z Shi, Y Chen, Y Shao. Analysis and research of the campus network user’s behavior based on k-means clustering algorithm. in Digital Manufacturing and Automation (ICDMA), 2013 Fourth International Conference on. IEEE, 2013: 196–201. [8] Y Cheng,W Hongyu, W Xiaohong.Positioning method research for unmanned aerial vehicles based on meanshift tracking algorithm. in Control and Decision Conference (CCDC), 2017 29th Chinese. IEEE 2017: 989–994. [9] RM Awangga, NS Fathonah, TI Hasanudin. Colenak: GPS tracking model for post-stroke rehabilitation program using AES-CBC URL encryption and QR-Code. in 2017 2nd International conferences on Information Technology,Information Systems and Electrical Engineering (ICITISEE). IEEE. Nov 2017: 255–260. [Online]. Available: http://ieeexplore.ieee.org/document/8285506/ [10] A Salhi, Y Moresly, F Ghozzi, A Fakhfakh. Face detection and tracking system with block-matching, meanshift and camshift algorithms and kalman filter. in Sciences and Techniques of Automatic Control and Computer Engineering (STA), 2017 18th International Conference on. IEEE. 2017: 139– 145. [11] IN Yulita, S Purwani, R Rosadi, RM Awangga. A quantization of deep belief networks for long short- term memory in sleep stage detection. in Advanced Informatics, Concepts, Theory, and Applications (ICAICTA), 2017 International Conference on. IEEE, 2017: 1-5. [12] N Ari, M Ustazhanov. Matplotlib in python. in 2014 11th International Conference on Electronics, Computer and Computation (ICECCO). IEEE, sep 2014. [Online]. Available: https://doi.org/10.1109/icecco.2014.6997585 [13] J MacQueen et al. Some methods for classification and analysis of multivariate observations in Proceedings ofthe fifth Berkeley symposium on mathematical statistics and probability. Oakland, CA, USA. 1967; 1(14): 281–297. [14] AK Jain. Data clustering:50 years beyond k-means.Pattern recognition letters.2010;31(8): 651-666. [15] Z Gheid, Y Challal. Efficient and privacy-preserving k-means clustering for big data mining. in Trustcom/BigDataSE/I SPA, IEEE. 2016: 791–798. [16] K Obbie. Penerapan algoritma klasifikasi data mining id3 untuk menentukan penjurusan siswasman6 semarang. Skripsi, Fakultas Ilmu Komputer, 2014. [17] Y Liu, SZ Li, W Wu, R Huang. Dynamics of a mean-shift-like algorithm and its applications on clustering. Information Processing Letters. 2013; 113(1): 8–16. [18] G Chen,Q Chen,D Zhang. Mean shift: A method for measurementmatrix ofcompressive sensing.in Digital Home (ICDH), 2014 5th International Conference on. IEEE. 2014: 64–69. [19] YA Ghassabeh.A sufficientcondition for the convergence of the mean shift algorithm with gaussian kernel. Journal of Multivariate Analysis. 2015; 135: 1–10. [20] K Fukunaga, L Hostetler. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Transactions on information theory. 1975; 21(1): 32–40. [21] C Yang, R Duraiswami, NA Gumerov, L Davis. Improved fast gauss transform and efficient kernel density estimation. in null. IEEE. 2003: 464.