SlideShare a Scribd company logo
1 of 6
Download to read offline
Indonesian Journal of Electrical Engineering and Informatics (IJEEI)
Vol. 2, No. 3, September 2014, pp. 119~124
ISSN: 2089-3272  119
Received June 25, 2014; Revised August 6, 2014; Accepted September 1, 2014
KNN and ARL Based Imputation to Estimate Missing
Values
Thirumahal R, Deepali A Patil
Departement of Information Technology, Thadomal Shahani Engineering College (TSEC)
Email: deep.patil1987@gmail.com
Abstract
Missing data are the absence of data items for a subject; they hide some information that may be
important. In practice, missing data have been one major factor affecting data quality. Thus, Missing value
imputation is needed. Methods such as hierarchical clustering and K-means clustering are not robust to
missing data and may lose effectiveness even with a few missing values. Therefore, to improve the quality
of data method for missing value imputation is needed. In this paper KNN and ARL based Imputation are
introduced to impute missing values and accuracy of both the algorithms are measured by using
normalized root mean sqare error. The result shows that ARL is more accurate and robust method for
missing value estimation.
Keywords: Autoregressive model (AR), K-nearest neighbor, Missing value estimation, Time series
analysis.
1. Introduction
To get the required information from huge, incomplete, noisy and inconsistent set of
data it is necessary to use data preprocessing. Data cleaning phase of preprocessing is used
because the aim here is to impute (fill) the missing values using estimated ones. During filling of
the missing values three things are important, they are as follows: A. Estimated values without
bias, B. The relation between attributes should be maintained, and C. Minimize the cost.
In most cases, a data set’s attributes are not independent of each other. Thus, through
the identification of relationships among attributes, missing values can be determined. K-nearest
neighbor imputation [1] can find missing values in a particular column but if many missing
values are present then this method will give incorrect result. Also, the choice of distance
function must be properly selected to estimate the missing values. ARL [2] will perform when
there are many missing values at particular time points, and even when the experiments for time
points fail or are missing.
In this paper, Normalized root mean square error is calculated based on the
experimental results which are then used to compare both the algorithms. Comparision results
will show the accuracy of algorithm.
2. Earliear Work
Many existing, industrial, and research data sets contain missing values (MVs). There
are various reasons for their existence, such as manual data entry procedures, equipment
errors, and incorrect measurements. The presence of such imperfections usually requires a
preprocessing stage in which the data are prepared and cleaned. Missing Value Analysis
provides a slightly different set of descriptive tools for analyzing missing and includes a variety
of single imputation methods.
Simplest technique to impute the missing values that is mean imputation was metioned
in paper “dealing with missing software project data” by M.J. Shepperd and M.H. Cartwright [3].
Bu it distorts relationships between variables by “pulling” estimates of the correlation toward
zero.
Trevor Hastie, Robert Tibshirani, Gavin Sherlock, Michael Eisen, Patrick Brown, David
Botstein described Singular value decomposition method (SVD) in paper “Imputing Missing
Data for Gene Expression Arrays” [4] to obtain a set of mutually orthogonal values. But the
drawbacks of SVD are that it can only be performed on complete matrices and the SVD-based
 ISSN: 2089-3272
IJEEI Vol. 2, No. 3, September 2014 : 119 – 124
120
method shows sharp deterioration in performance when a non-optimal fraction of missing values
is used. Therefore, KNN Impute method is introduced which is less sensitive to the exact
parameters used (number of nearest neighbors).
The LLSimpute (Local Least Squares Imputation) algorithm [5] predicts the missing
value using the least squares formulation for the neighborhood column and the non-missing
entries. It works well but the time complexity is higher. Due to above disadvantages in [6] they
discussed an autoregressive-model-based missing value estimation method that takes into
account the dynamic property of temporal data and the local similarity structures in the data.
This method is especially effective for the situation where a particular time point contains many
missing values or where the entire time point is missing.
In this work, we describe and evaluate two methods of estimation for missing values in
Synthetic control dataset [2]. We compare our KNN and ARL based methods by calculating
error rate using NRMSE to show the accuracy.
3. Problem Statement
Given a record of all the time series data, first find the missing values which are
indicated by zeros from each column, estimate that values and finally impute those values which
are same as original values. To impute the missing values KNN Impute and ARL Impute
algorithms are used.
3.1 KNN Impute Algorithm:
KNNI [7] computes the k nearest neighbors and a value from them is imputed. For
nominal values, the most common value among all neighbors is taken, and for numerical
values, the average value is used. Therefore, a proximity measure between instances is needed
for it to be defined. The Euclidean distance is most commonly used in the literature.
The most frequent value among k-nearest neighbor and the mean among the k-nearest
neighbor can be predicted by k-nearest neighbor imputation. In this method the main factor is
Distance metrics. In 1NN imputation method we can replace the missing values with the nearest
neighbor. But if the value of K is greater than one then replace the missing values with the
mean or weighted average of K-nearest neighbors. By setting k-value between 10 and 20
brings the best results for KNN imputation.
Euclidean distance is calculated by following equation:
 

n
t
tytxyx eeVVdist
1
2
,,),( (1)
Weighted average of corresponding entries is calculated by following equation:
ij
k
i
iIJ eWG   1
(2)
Weighted value is given in following equation:


),(
1
*
i
i
ggdis
W (3)
Where,
  

k
i
iggdist
1
*
,/1 (4)
Example 2 (KNNI)
Nearest neighbor: 2
IJEEI ISSN: 2089-3272 
KNN and ARL Based Imputation to Estimate Missing Values (Deepali A Patil)
121
Inserted Array: Gx Gy
29.2171 32.8717
32.337 36.0253
32.8717 34.5249
0.0 34.1173
26.3693 27.6623
24.8923 25.7744
29.9423 30.9493
35.6805 35.2623
Gy: reference column
Predicted Missing gx: 27.8725
3.2 ARL Impute Algorithm
KNN impute, is not able to deal with the situation where a particular time point (column)
of the data is missing entirely. ARL Impute [1] and [6] is an effective method for the situation
where a particular time point contains many missing values or where the entire time point is
missing. ARL impute, consists of two major processes: in the first one, we assume no missing
data and estimate the AR coefficients, and in the second process, we assume that the AR
coefficients are known and we estimate the missing data.
Figure 1. An overview of ARL Impute
AR coefficients [1] can find by using least square approximation [1] or cubic spline interpolation
method. AR model can be described in equation as follows:
jjjj aYy  (5)
where, aj is the coefficient matrix., j is a sequence of independent identically distributed
normal random variable with mean zero. It means that any given value jy in the time series is
directly proportional to the pervious value jY plus some random error j. As the number of AR
parameters increase, jy becomes directly related to additional past values.
 ISSN: 2089-3272
IJEEI Vol. 2, No. 3, September 2014 : 119 – 124
122
(6)
In this paper AR coeffiecients are calcualed by using least sqare approximation principle [1].
Let us assume that (y1,y2,….ys) are the observed data and {x1, . . . , xm} are the missing data.
Estimation of missing data in matrix form is given by:
e=Az (7)
where z is a column vector that consists of the observed data y and the missing data x, and A is
a Toeplitz matrix whose column number is n and row number is n- p , which is written as:


















1100
010
001
1
1
aa
aa
aa
A
p
p
p




(8)
If we separate the observed data from missing data and split A in the block matrix, the equation
can be written as:
e=Bx+Cy (9)
Where B = [B1, B2 · · · ] and C = [C1, C2 · · · ] are block sub matrices of A corresponding to the
respective locations of observed data y and missing data x. Finally the missing data can be
calculated from B# (pseudo inverse of B).The corresponding equation is given by:
x= -B#
Cy (10)
Example 2 (ARLI):
t={1,2,3,4,5,6,7}
ft={ 34.4632, 31.2834, 33.7596, 27.7849,7.569, 29.2171, 32.337};
f={ 34.4632, 31.2834,0,0,0, 29.2171, 32.337};
Predicted missing values are:
28.97423999999749 at t=3.0
27.73036999999391 at t=4.0
27.746439999989708 at t=5.0
3.3 Performance measure:
Normalized RMS Error (NRMSE) is used to measure the performance of missing value
estimation method, it can be calculated as:
 
  
 
 
 

 m
i
n
j
m
i
n
j
jiY
jiYjiY
NRMSE
1 1
2
1
2
1
),(
),(),(
~
(11)
IJEEI ISSN: 2089-3272 
KNN and ARL Based Imputation to Estimate Missing Values (Deepali A Patil)
123
where, Y is the true value, Y
~
is the estimated value, and m and n are the total number of rows
and columns respectively.
4. Results and Discussion
Missing values are estimated for Synthetic control chart dataset [2] using K-nearest neighbor
and Auto Regressive (AR) Model. Dataset is shown in figure 2.
Figure 2. Synthetic contol chart Time series dataset
Performance of KNN Impute was assessed over different K values and different
percentage of missing values shown in figure 3. Result shows that when the numbers of K
values are in the range of 10-20 then, NRMSE error rate is lesser. That is, by setting k-value
between 10 and 20 brings the best results for KNN imputation shown in table 1.
Table 1. NRMSE result for KNN Impute over different K values.
K values KNN Impute
3 0.179
5 0.15876
10 0.1111
15 0.0889
22 0.1245
25 0.1489
Figure 3. Effect of number of nearest neighbors used for KNN-based estimation
Then Performance of KNN Impute and ARL Impute were measured over NRMSE and
% of missing values as shown in figure 4. The graph shows that the error rate of KNNimpute is
higher than ARL impute when the percentage of missing values are inceased in a particular
column. Comparision of NRMSE result is shown in Table 2.
 ISSN: 2089-3272
IJEEI Vol. 2, No. 3, September 2014 : 119 – 124
124
Table 2. NRMSE result for ARL and KNN Impute over % of missing values
% of missing values ARL Impute KNN Impute
10 0.08004 0.12
15 0.0912 0.28
20 0.17 0.33
Figure 4. NRMSE comparison pf ARL and KNN Impute algorithm
5. Conclusion And Future Work
We have developed two algorithms KNN impute and ARL impute to estimate and
impute the missing values. Both the algorithms give best results to estimate the missing values.
KNN Impute gives best results when the K value is in the range of 10-20. KNN Impute estimates
missing value from a particular column and ARL Impute estimates many missing values from a
particular column. When the performance of both algorithms is measured based on normalized
root mean square error (NRMSE) then ARL Impute gives best estimation, as its error rate is
less. Our future work aims at finding the missing values for more than two columns of missing
data using the probablilistic models like PPCA (Probabilistic principle component analysis).
References
[1] David Sheung Chi Fung. “Methods for the Estimation of Missing Values in Time Series”. a thesis
Submitted to the Faculty of Communications, Health and Science Edith Cowan University Perth,
Western Australia.
[2] UCI machine learning repository, http://archive.ics.uci.edu/ml, 2010.
[3] MJ Shepperd and MH Cartwright. “Dealing with Missing Software Project Data”. Empirical Software
Engineering Research Group School of Design, Engineering & Computing, Bournemouth University.
[4] Trevor Hastie, Robert Tibshirani, Gavin Sherlock, Michael Eisen, Patrick Brown, David Botstein.
“Imputing Missing Data for Gene Expression Arrays”.
[5] YTao, D Papadias, and X Lian. Reverse KNN Search in Arbitrary Dimensionality. Proc. 30th Int’l Conf.
Very Large Data Bases (VLDB ’04), 2004.
[6] Miew Keen Choong, Maurice Charbit, and Hong Yan. Autoregressive-Model-Based Missing Value
Estimation for DNA Microarray Time Series Data. IEEE Transactions on Information Technology In
Biomedicine. 2009; 13(1): 131-138.
[7] O Troyanskaya, M Cantor, G Sherlock, et al. "Missing value estimation methods for DNA
microarrays". Bioinformatics. 2001; 17: 520-525.
[8] PMT Broersen. “Finite sample criteria for autoregressive order selection”. IEEE Trans. Signal Process.
2000; 48(12): 3550–3558.
[9] MS Sehgal, L Gondal, LS Dooley. “Collateral Missing value imputation: a new robust missing value
estimation algorithm for microarray data”. Bioinformatics. 2005; 21: 2417-2423.
[10] H Kim, GH Golub, and H Park. “Missing value estimation for DNA microarray gene expression data:
local least squares imputation”. Bioinformatics. 2005; 21: 187-198.

More Related Content

What's hot

Optimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering AlgorithmOptimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering AlgorithmIJERA Editor
 
8.clustering algorithm.k means.em algorithm
8.clustering algorithm.k means.em algorithm8.clustering algorithm.k means.em algorithm
8.clustering algorithm.k means.em algorithmLaura Petrosanu
 
Dimension Reduction: What? Why? and How?
Dimension Reduction: What? Why? and How?Dimension Reduction: What? Why? and How?
Dimension Reduction: What? Why? and How?Kazi Toufiq Wadud
 
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...inventionjournals
 
Introduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisIntroduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisJaclyn Kokx
 
Estimating Reconstruction Error due to Jitter of Gaussian Markov Processes
Estimating Reconstruction Error due to Jitter of Gaussian Markov ProcessesEstimating Reconstruction Error due to Jitter of Gaussian Markov Processes
Estimating Reconstruction Error due to Jitter of Gaussian Markov ProcessesMudassir Javed
 
Iterative Determinant Method for Solving Eigenvalue Problems
Iterative Determinant Method for Solving Eigenvalue ProblemsIterative Determinant Method for Solving Eigenvalue Problems
Iterative Determinant Method for Solving Eigenvalue Problemsijceronline
 
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsData Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsDerek Kane
 
Integration techniques for two dimensional domains
Integration techniques for two dimensional domainsIntegration techniques for two dimensional domains
Integration techniques for two dimensional domainseSAT Publishing House
 
83662164 case-study-1
83662164 case-study-183662164 case-study-1
83662164 case-study-1homeworkping3
 
32 8950 surendar s comparison of surface roughness (edit ari)new
32 8950 surendar s   comparison of surface roughness (edit ari)new32 8950 surendar s   comparison of surface roughness (edit ari)new
32 8950 surendar s comparison of surface roughness (edit ari)newIAESIJEECS
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET Journal
 
House price prediction
House price predictionHouse price prediction
House price predictionSabahBegum
 

What's hot (17)

Optimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering AlgorithmOptimising Data Using K-Means Clustering Algorithm
Optimising Data Using K-Means Clustering Algorithm
 
8.clustering algorithm.k means.em algorithm
8.clustering algorithm.k means.em algorithm8.clustering algorithm.k means.em algorithm
8.clustering algorithm.k means.em algorithm
 
Principal Component Analysis
Principal Component AnalysisPrincipal Component Analysis
Principal Component Analysis
 
Dimension Reduction: What? Why? and How?
Dimension Reduction: What? Why? and How?Dimension Reduction: What? Why? and How?
Dimension Reduction: What? Why? and How?
 
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
 
Introduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisIntroduction to Linear Discriminant Analysis
Introduction to Linear Discriminant Analysis
 
Estimating Reconstruction Error due to Jitter of Gaussian Markov Processes
Estimating Reconstruction Error due to Jitter of Gaussian Markov ProcessesEstimating Reconstruction Error due to Jitter of Gaussian Markov Processes
Estimating Reconstruction Error due to Jitter of Gaussian Markov Processes
 
Iterative Determinant Method for Solving Eigenvalue Problems
Iterative Determinant Method for Solving Eigenvalue ProblemsIterative Determinant Method for Solving Eigenvalue Problems
Iterative Determinant Method for Solving Eigenvalue Problems
 
Pca analysis
Pca analysisPca analysis
Pca analysis
 
Ijetcas14 546
Ijetcas14 546Ijetcas14 546
Ijetcas14 546
 
Understandig PCA and LDA
Understandig PCA and LDAUnderstandig PCA and LDA
Understandig PCA and LDA
 
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsData Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
 
Integration techniques for two dimensional domains
Integration techniques for two dimensional domainsIntegration techniques for two dimensional domains
Integration techniques for two dimensional domains
 
83662164 case-study-1
83662164 case-study-183662164 case-study-1
83662164 case-study-1
 
32 8950 surendar s comparison of surface roughness (edit ari)new
32 8950 surendar s   comparison of surface roughness (edit ari)new32 8950 surendar s   comparison of surface roughness (edit ari)new
32 8950 surendar s comparison of surface roughness (edit ari)new
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
 
House price prediction
House price predictionHouse price prediction
House price prediction
 

Similar to KNN and ARL algorithms for estimating missing values in time series data

Design of optimized Interval Arithmetic Multiplier
Design of optimized Interval Arithmetic MultiplierDesign of optimized Interval Arithmetic Multiplier
Design of optimized Interval Arithmetic MultiplierVLSICS Design
 
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...ijscmcj
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Fault Tolerant Matrix Pencil Method for Direction of Arrival Estimation
Fault Tolerant Matrix Pencil Method for Direction of Arrival EstimationFault Tolerant Matrix Pencil Method for Direction of Arrival Estimation
Fault Tolerant Matrix Pencil Method for Direction of Arrival Estimationsipij
 
Finding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsFinding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsCSCJournals
 
A Thresholding Method to Estimate Quantities of Each Class
A Thresholding Method to Estimate Quantities of Each ClassA Thresholding Method to Estimate Quantities of Each Class
A Thresholding Method to Estimate Quantities of Each ClassWaqas Tariq
 
Fault detection based on novel fuzzy modelling
Fault detection based on novel fuzzy modelling Fault detection based on novel fuzzy modelling
Fault detection based on novel fuzzy modelling csijjournal
 
Enhancing Classification Accuracy of K-Nearest Neighbors Algorithm using Gain...
Enhancing Classification Accuracy of K-Nearest Neighbors Algorithm using Gain...Enhancing Classification Accuracy of K-Nearest Neighbors Algorithm using Gain...
Enhancing Classification Accuracy of K-Nearest Neighbors Algorithm using Gain...IRJET Journal
 
Estimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachEstimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachcsandit
 
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHcscpconf
 
Quality Prediction in Fingerprint Compression
Quality Prediction in Fingerprint CompressionQuality Prediction in Fingerprint Compression
Quality Prediction in Fingerprint CompressionIJTET Journal
 
The step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load dataThe step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load dataTELKOMNIKA JOURNAL
 
IMAGE REGISTRATION USING ADVANCED TOPOLOGY PRESERVING RELAXATION LABELING
IMAGE REGISTRATION USING ADVANCED TOPOLOGY PRESERVING RELAXATION LABELING IMAGE REGISTRATION USING ADVANCED TOPOLOGY PRESERVING RELAXATION LABELING
IMAGE REGISTRATION USING ADVANCED TOPOLOGY PRESERVING RELAXATION LABELING cscpconf
 
Data mining Part 1
Data mining Part 1Data mining Part 1
Data mining Part 1Gautam Kumar
 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET Journal
 
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...IJERA Editor
 
AMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLTAMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLTIRJET Journal
 
Investigation on Using Fractal Geometry for Classification of Partial Dischar...
Investigation on Using Fractal Geometry for Classification of Partial Dischar...Investigation on Using Fractal Geometry for Classification of Partial Dischar...
Investigation on Using Fractal Geometry for Classification of Partial Dischar...IOSR Journals
 
Multimode system condition monitoring using sparsity reconstruction for quali...
Multimode system condition monitoring using sparsity reconstruction for quali...Multimode system condition monitoring using sparsity reconstruction for quali...
Multimode system condition monitoring using sparsity reconstruction for quali...IJECEIAES
 

Similar to KNN and ARL algorithms for estimating missing values in time series data (20)

C054
C054C054
C054
 
Design of optimized Interval Arithmetic Multiplier
Design of optimized Interval Arithmetic MultiplierDesign of optimized Interval Arithmetic Multiplier
Design of optimized Interval Arithmetic Multiplier
 
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
Fault Tolerant Matrix Pencil Method for Direction of Arrival Estimation
Fault Tolerant Matrix Pencil Method for Direction of Arrival EstimationFault Tolerant Matrix Pencil Method for Direction of Arrival Estimation
Fault Tolerant Matrix Pencil Method for Direction of Arrival Estimation
 
Finding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster ResultsFinding Relationships between the Our-NIR Cluster Results
Finding Relationships between the Our-NIR Cluster Results
 
A Thresholding Method to Estimate Quantities of Each Class
A Thresholding Method to Estimate Quantities of Each ClassA Thresholding Method to Estimate Quantities of Each Class
A Thresholding Method to Estimate Quantities of Each Class
 
Fault detection based on novel fuzzy modelling
Fault detection based on novel fuzzy modelling Fault detection based on novel fuzzy modelling
Fault detection based on novel fuzzy modelling
 
Enhancing Classification Accuracy of K-Nearest Neighbors Algorithm using Gain...
Enhancing Classification Accuracy of K-Nearest Neighbors Algorithm using Gain...Enhancing Classification Accuracy of K-Nearest Neighbors Algorithm using Gain...
Enhancing Classification Accuracy of K-Nearest Neighbors Algorithm using Gain...
 
Estimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachEstimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approach
 
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
 
Quality Prediction in Fingerprint Compression
Quality Prediction in Fingerprint CompressionQuality Prediction in Fingerprint Compression
Quality Prediction in Fingerprint Compression
 
The step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load dataThe step construction of penalized spline in electrical power load data
The step construction of penalized spline in electrical power load data
 
IMAGE REGISTRATION USING ADVANCED TOPOLOGY PRESERVING RELAXATION LABELING
IMAGE REGISTRATION USING ADVANCED TOPOLOGY PRESERVING RELAXATION LABELING IMAGE REGISTRATION USING ADVANCED TOPOLOGY PRESERVING RELAXATION LABELING
IMAGE REGISTRATION USING ADVANCED TOPOLOGY PRESERVING RELAXATION LABELING
 
Data mining Part 1
Data mining Part 1Data mining Part 1
Data mining Part 1
 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence Chain
 
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
 
AMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLTAMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLT
 
Investigation on Using Fractal Geometry for Classification of Partial Dischar...
Investigation on Using Fractal Geometry for Classification of Partial Dischar...Investigation on Using Fractal Geometry for Classification of Partial Dischar...
Investigation on Using Fractal Geometry for Classification of Partial Dischar...
 
Multimode system condition monitoring using sparsity reconstruction for quali...
Multimode system condition monitoring using sparsity reconstruction for quali...Multimode system condition monitoring using sparsity reconstruction for quali...
Multimode system condition monitoring using sparsity reconstruction for quali...
 

More from ijeei-iaes

An Heterogeneous Population-Based Genetic Algorithm for Data Clustering
An Heterogeneous Population-Based Genetic Algorithm for Data ClusteringAn Heterogeneous Population-Based Genetic Algorithm for Data Clustering
An Heterogeneous Population-Based Genetic Algorithm for Data Clusteringijeei-iaes
 
Development of a Wireless Sensors Network for Greenhouse Monitoring and Control
Development of a Wireless Sensors Network for Greenhouse Monitoring and ControlDevelopment of a Wireless Sensors Network for Greenhouse Monitoring and Control
Development of a Wireless Sensors Network for Greenhouse Monitoring and Controlijeei-iaes
 
Analysis of Genetic Algorithm for Effective power Delivery and with Best Upsurge
Analysis of Genetic Algorithm for Effective power Delivery and with Best UpsurgeAnalysis of Genetic Algorithm for Effective power Delivery and with Best Upsurge
Analysis of Genetic Algorithm for Effective power Delivery and with Best Upsurgeijeei-iaes
 
Design for Postplacement Mousing based on GSM in Long-Distance
Design for Postplacement Mousing based on GSM in Long-DistanceDesign for Postplacement Mousing based on GSM in Long-Distance
Design for Postplacement Mousing based on GSM in Long-Distanceijeei-iaes
 
Investigation of TTMC-SVPWM Strategies for Diode Clamped and Cascaded H-bridg...
Investigation of TTMC-SVPWM Strategies for Diode Clamped and Cascaded H-bridg...Investigation of TTMC-SVPWM Strategies for Diode Clamped and Cascaded H-bridg...
Investigation of TTMC-SVPWM Strategies for Diode Clamped and Cascaded H-bridg...ijeei-iaes
 
Optimal Power Flow with Reactive Power Compensation for Cost And Loss Minimiz...
Optimal Power Flow with Reactive Power Compensation for Cost And Loss Minimiz...Optimal Power Flow with Reactive Power Compensation for Cost And Loss Minimiz...
Optimal Power Flow with Reactive Power Compensation for Cost And Loss Minimiz...ijeei-iaes
 
Mitigation of Power Quality Problems Using Custom Power Devices: A Review
Mitigation of Power Quality Problems Using Custom Power Devices: A ReviewMitigation of Power Quality Problems Using Custom Power Devices: A Review
Mitigation of Power Quality Problems Using Custom Power Devices: A Reviewijeei-iaes
 
Comparison of Dynamic Stability Response of A SMIB with PI and Fuzzy Controll...
Comparison of Dynamic Stability Response of A SMIB with PI and Fuzzy Controll...Comparison of Dynamic Stability Response of A SMIB with PI and Fuzzy Controll...
Comparison of Dynamic Stability Response of A SMIB with PI and Fuzzy Controll...ijeei-iaes
 
Embellished Particle Swarm Optimization Algorithm for Solving Reactive Power ...
Embellished Particle Swarm Optimization Algorithm for Solving Reactive Power ...Embellished Particle Swarm Optimization Algorithm for Solving Reactive Power ...
Embellished Particle Swarm Optimization Algorithm for Solving Reactive Power ...ijeei-iaes
 
Intelligent Management on the Home Consumers with Zero Energy Consumption
Intelligent Management on the Home Consumers with Zero Energy ConsumptionIntelligent Management on the Home Consumers with Zero Energy Consumption
Intelligent Management on the Home Consumers with Zero Energy Consumptionijeei-iaes
 
Analysing Transportation Data with Open Source Big Data Analytic Tools
Analysing Transportation Data with Open Source Big Data Analytic ToolsAnalysing Transportation Data with Open Source Big Data Analytic Tools
Analysing Transportation Data with Open Source Big Data Analytic Toolsijeei-iaes
 
A Pattern Classification Based approach for Blur Classification
A Pattern Classification Based approach for Blur ClassificationA Pattern Classification Based approach for Blur Classification
A Pattern Classification Based approach for Blur Classificationijeei-iaes
 
Computing Some Degree-Based Topological Indices of Graphene
Computing Some Degree-Based Topological Indices of GrapheneComputing Some Degree-Based Topological Indices of Graphene
Computing Some Degree-Based Topological Indices of Grapheneijeei-iaes
 
A Lyapunov Based Approach to Enchance Wind Turbine Stability
A Lyapunov Based Approach to Enchance Wind Turbine StabilityA Lyapunov Based Approach to Enchance Wind Turbine Stability
A Lyapunov Based Approach to Enchance Wind Turbine Stabilityijeei-iaes
 
Fuzzy Control of a Large Crane Structure
Fuzzy Control of a Large Crane StructureFuzzy Control of a Large Crane Structure
Fuzzy Control of a Large Crane Structureijeei-iaes
 
Site Diversity Technique Application on Rain Attenuation for Lagos
Site Diversity Technique Application on Rain Attenuation for LagosSite Diversity Technique Application on Rain Attenuation for Lagos
Site Diversity Technique Application on Rain Attenuation for Lagosijeei-iaes
 
Impact of Next Generation Cognitive Radio Network on the Wireless Green Eco s...
Impact of Next Generation Cognitive Radio Network on the Wireless Green Eco s...Impact of Next Generation Cognitive Radio Network on the Wireless Green Eco s...
Impact of Next Generation Cognitive Radio Network on the Wireless Green Eco s...ijeei-iaes
 
Music Recommendation System with User-based and Item-based Collaborative Filt...
Music Recommendation System with User-based and Item-based Collaborative Filt...Music Recommendation System with User-based and Item-based Collaborative Filt...
Music Recommendation System with User-based and Item-based Collaborative Filt...ijeei-iaes
 
A Real-Time Implementation of Moving Object Action Recognition System Based o...
A Real-Time Implementation of Moving Object Action Recognition System Based o...A Real-Time Implementation of Moving Object Action Recognition System Based o...
A Real-Time Implementation of Moving Object Action Recognition System Based o...ijeei-iaes
 
Wireless Sensor Network for Radiation Detection
Wireless Sensor Network for Radiation DetectionWireless Sensor Network for Radiation Detection
Wireless Sensor Network for Radiation Detectionijeei-iaes
 

More from ijeei-iaes (20)

An Heterogeneous Population-Based Genetic Algorithm for Data Clustering
An Heterogeneous Population-Based Genetic Algorithm for Data ClusteringAn Heterogeneous Population-Based Genetic Algorithm for Data Clustering
An Heterogeneous Population-Based Genetic Algorithm for Data Clustering
 
Development of a Wireless Sensors Network for Greenhouse Monitoring and Control
Development of a Wireless Sensors Network for Greenhouse Monitoring and ControlDevelopment of a Wireless Sensors Network for Greenhouse Monitoring and Control
Development of a Wireless Sensors Network for Greenhouse Monitoring and Control
 
Analysis of Genetic Algorithm for Effective power Delivery and with Best Upsurge
Analysis of Genetic Algorithm for Effective power Delivery and with Best UpsurgeAnalysis of Genetic Algorithm for Effective power Delivery and with Best Upsurge
Analysis of Genetic Algorithm for Effective power Delivery and with Best Upsurge
 
Design for Postplacement Mousing based on GSM in Long-Distance
Design for Postplacement Mousing based on GSM in Long-DistanceDesign for Postplacement Mousing based on GSM in Long-Distance
Design for Postplacement Mousing based on GSM in Long-Distance
 
Investigation of TTMC-SVPWM Strategies for Diode Clamped and Cascaded H-bridg...
Investigation of TTMC-SVPWM Strategies for Diode Clamped and Cascaded H-bridg...Investigation of TTMC-SVPWM Strategies for Diode Clamped and Cascaded H-bridg...
Investigation of TTMC-SVPWM Strategies for Diode Clamped and Cascaded H-bridg...
 
Optimal Power Flow with Reactive Power Compensation for Cost And Loss Minimiz...
Optimal Power Flow with Reactive Power Compensation for Cost And Loss Minimiz...Optimal Power Flow with Reactive Power Compensation for Cost And Loss Minimiz...
Optimal Power Flow with Reactive Power Compensation for Cost And Loss Minimiz...
 
Mitigation of Power Quality Problems Using Custom Power Devices: A Review
Mitigation of Power Quality Problems Using Custom Power Devices: A ReviewMitigation of Power Quality Problems Using Custom Power Devices: A Review
Mitigation of Power Quality Problems Using Custom Power Devices: A Review
 
Comparison of Dynamic Stability Response of A SMIB with PI and Fuzzy Controll...
Comparison of Dynamic Stability Response of A SMIB with PI and Fuzzy Controll...Comparison of Dynamic Stability Response of A SMIB with PI and Fuzzy Controll...
Comparison of Dynamic Stability Response of A SMIB with PI and Fuzzy Controll...
 
Embellished Particle Swarm Optimization Algorithm for Solving Reactive Power ...
Embellished Particle Swarm Optimization Algorithm for Solving Reactive Power ...Embellished Particle Swarm Optimization Algorithm for Solving Reactive Power ...
Embellished Particle Swarm Optimization Algorithm for Solving Reactive Power ...
 
Intelligent Management on the Home Consumers with Zero Energy Consumption
Intelligent Management on the Home Consumers with Zero Energy ConsumptionIntelligent Management on the Home Consumers with Zero Energy Consumption
Intelligent Management on the Home Consumers with Zero Energy Consumption
 
Analysing Transportation Data with Open Source Big Data Analytic Tools
Analysing Transportation Data with Open Source Big Data Analytic ToolsAnalysing Transportation Data with Open Source Big Data Analytic Tools
Analysing Transportation Data with Open Source Big Data Analytic Tools
 
A Pattern Classification Based approach for Blur Classification
A Pattern Classification Based approach for Blur ClassificationA Pattern Classification Based approach for Blur Classification
A Pattern Classification Based approach for Blur Classification
 
Computing Some Degree-Based Topological Indices of Graphene
Computing Some Degree-Based Topological Indices of GrapheneComputing Some Degree-Based Topological Indices of Graphene
Computing Some Degree-Based Topological Indices of Graphene
 
A Lyapunov Based Approach to Enchance Wind Turbine Stability
A Lyapunov Based Approach to Enchance Wind Turbine StabilityA Lyapunov Based Approach to Enchance Wind Turbine Stability
A Lyapunov Based Approach to Enchance Wind Turbine Stability
 
Fuzzy Control of a Large Crane Structure
Fuzzy Control of a Large Crane StructureFuzzy Control of a Large Crane Structure
Fuzzy Control of a Large Crane Structure
 
Site Diversity Technique Application on Rain Attenuation for Lagos
Site Diversity Technique Application on Rain Attenuation for LagosSite Diversity Technique Application on Rain Attenuation for Lagos
Site Diversity Technique Application on Rain Attenuation for Lagos
 
Impact of Next Generation Cognitive Radio Network on the Wireless Green Eco s...
Impact of Next Generation Cognitive Radio Network on the Wireless Green Eco s...Impact of Next Generation Cognitive Radio Network on the Wireless Green Eco s...
Impact of Next Generation Cognitive Radio Network on the Wireless Green Eco s...
 
Music Recommendation System with User-based and Item-based Collaborative Filt...
Music Recommendation System with User-based and Item-based Collaborative Filt...Music Recommendation System with User-based and Item-based Collaborative Filt...
Music Recommendation System with User-based and Item-based Collaborative Filt...
 
A Real-Time Implementation of Moving Object Action Recognition System Based o...
A Real-Time Implementation of Moving Object Action Recognition System Based o...A Real-Time Implementation of Moving Object Action Recognition System Based o...
A Real-Time Implementation of Moving Object Action Recognition System Based o...
 
Wireless Sensor Network for Radiation Detection
Wireless Sensor Network for Radiation DetectionWireless Sensor Network for Radiation Detection
Wireless Sensor Network for Radiation Detection
 

Recently uploaded

Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 

Recently uploaded (20)

Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 

KNN and ARL algorithms for estimating missing values in time series data

  • 1. Indonesian Journal of Electrical Engineering and Informatics (IJEEI) Vol. 2, No. 3, September 2014, pp. 119~124 ISSN: 2089-3272  119 Received June 25, 2014; Revised August 6, 2014; Accepted September 1, 2014 KNN and ARL Based Imputation to Estimate Missing Values Thirumahal R, Deepali A Patil Departement of Information Technology, Thadomal Shahani Engineering College (TSEC) Email: deep.patil1987@gmail.com Abstract Missing data are the absence of data items for a subject; they hide some information that may be important. In practice, missing data have been one major factor affecting data quality. Thus, Missing value imputation is needed. Methods such as hierarchical clustering and K-means clustering are not robust to missing data and may lose effectiveness even with a few missing values. Therefore, to improve the quality of data method for missing value imputation is needed. In this paper KNN and ARL based Imputation are introduced to impute missing values and accuracy of both the algorithms are measured by using normalized root mean sqare error. The result shows that ARL is more accurate and robust method for missing value estimation. Keywords: Autoregressive model (AR), K-nearest neighbor, Missing value estimation, Time series analysis. 1. Introduction To get the required information from huge, incomplete, noisy and inconsistent set of data it is necessary to use data preprocessing. Data cleaning phase of preprocessing is used because the aim here is to impute (fill) the missing values using estimated ones. During filling of the missing values three things are important, they are as follows: A. Estimated values without bias, B. The relation between attributes should be maintained, and C. Minimize the cost. In most cases, a data set’s attributes are not independent of each other. Thus, through the identification of relationships among attributes, missing values can be determined. K-nearest neighbor imputation [1] can find missing values in a particular column but if many missing values are present then this method will give incorrect result. Also, the choice of distance function must be properly selected to estimate the missing values. ARL [2] will perform when there are many missing values at particular time points, and even when the experiments for time points fail or are missing. In this paper, Normalized root mean square error is calculated based on the experimental results which are then used to compare both the algorithms. Comparision results will show the accuracy of algorithm. 2. Earliear Work Many existing, industrial, and research data sets contain missing values (MVs). There are various reasons for their existence, such as manual data entry procedures, equipment errors, and incorrect measurements. The presence of such imperfections usually requires a preprocessing stage in which the data are prepared and cleaned. Missing Value Analysis provides a slightly different set of descriptive tools for analyzing missing and includes a variety of single imputation methods. Simplest technique to impute the missing values that is mean imputation was metioned in paper “dealing with missing software project data” by M.J. Shepperd and M.H. Cartwright [3]. Bu it distorts relationships between variables by “pulling” estimates of the correlation toward zero. Trevor Hastie, Robert Tibshirani, Gavin Sherlock, Michael Eisen, Patrick Brown, David Botstein described Singular value decomposition method (SVD) in paper “Imputing Missing Data for Gene Expression Arrays” [4] to obtain a set of mutually orthogonal values. But the drawbacks of SVD are that it can only be performed on complete matrices and the SVD-based
  • 2.  ISSN: 2089-3272 IJEEI Vol. 2, No. 3, September 2014 : 119 – 124 120 method shows sharp deterioration in performance when a non-optimal fraction of missing values is used. Therefore, KNN Impute method is introduced which is less sensitive to the exact parameters used (number of nearest neighbors). The LLSimpute (Local Least Squares Imputation) algorithm [5] predicts the missing value using the least squares formulation for the neighborhood column and the non-missing entries. It works well but the time complexity is higher. Due to above disadvantages in [6] they discussed an autoregressive-model-based missing value estimation method that takes into account the dynamic property of temporal data and the local similarity structures in the data. This method is especially effective for the situation where a particular time point contains many missing values or where the entire time point is missing. In this work, we describe and evaluate two methods of estimation for missing values in Synthetic control dataset [2]. We compare our KNN and ARL based methods by calculating error rate using NRMSE to show the accuracy. 3. Problem Statement Given a record of all the time series data, first find the missing values which are indicated by zeros from each column, estimate that values and finally impute those values which are same as original values. To impute the missing values KNN Impute and ARL Impute algorithms are used. 3.1 KNN Impute Algorithm: KNNI [7] computes the k nearest neighbors and a value from them is imputed. For nominal values, the most common value among all neighbors is taken, and for numerical values, the average value is used. Therefore, a proximity measure between instances is needed for it to be defined. The Euclidean distance is most commonly used in the literature. The most frequent value among k-nearest neighbor and the mean among the k-nearest neighbor can be predicted by k-nearest neighbor imputation. In this method the main factor is Distance metrics. In 1NN imputation method we can replace the missing values with the nearest neighbor. But if the value of K is greater than one then replace the missing values with the mean or weighted average of K-nearest neighbors. By setting k-value between 10 and 20 brings the best results for KNN imputation. Euclidean distance is calculated by following equation:    n t tytxyx eeVVdist 1 2 ,,),( (1) Weighted average of corresponding entries is calculated by following equation: ij k i iIJ eWG   1 (2) Weighted value is given in following equation:   ),( 1 * i i ggdis W (3) Where,     k i iggdist 1 * ,/1 (4) Example 2 (KNNI) Nearest neighbor: 2
  • 3. IJEEI ISSN: 2089-3272  KNN and ARL Based Imputation to Estimate Missing Values (Deepali A Patil) 121 Inserted Array: Gx Gy 29.2171 32.8717 32.337 36.0253 32.8717 34.5249 0.0 34.1173 26.3693 27.6623 24.8923 25.7744 29.9423 30.9493 35.6805 35.2623 Gy: reference column Predicted Missing gx: 27.8725 3.2 ARL Impute Algorithm KNN impute, is not able to deal with the situation where a particular time point (column) of the data is missing entirely. ARL Impute [1] and [6] is an effective method for the situation where a particular time point contains many missing values or where the entire time point is missing. ARL impute, consists of two major processes: in the first one, we assume no missing data and estimate the AR coefficients, and in the second process, we assume that the AR coefficients are known and we estimate the missing data. Figure 1. An overview of ARL Impute AR coefficients [1] can find by using least square approximation [1] or cubic spline interpolation method. AR model can be described in equation as follows: jjjj aYy  (5) where, aj is the coefficient matrix., j is a sequence of independent identically distributed normal random variable with mean zero. It means that any given value jy in the time series is directly proportional to the pervious value jY plus some random error j. As the number of AR parameters increase, jy becomes directly related to additional past values.
  • 4.  ISSN: 2089-3272 IJEEI Vol. 2, No. 3, September 2014 : 119 – 124 122 (6) In this paper AR coeffiecients are calcualed by using least sqare approximation principle [1]. Let us assume that (y1,y2,….ys) are the observed data and {x1, . . . , xm} are the missing data. Estimation of missing data in matrix form is given by: e=Az (7) where z is a column vector that consists of the observed data y and the missing data x, and A is a Toeplitz matrix whose column number is n and row number is n- p , which is written as:                   1100 010 001 1 1 aa aa aa A p p p     (8) If we separate the observed data from missing data and split A in the block matrix, the equation can be written as: e=Bx+Cy (9) Where B = [B1, B2 · · · ] and C = [C1, C2 · · · ] are block sub matrices of A corresponding to the respective locations of observed data y and missing data x. Finally the missing data can be calculated from B# (pseudo inverse of B).The corresponding equation is given by: x= -B# Cy (10) Example 2 (ARLI): t={1,2,3,4,5,6,7} ft={ 34.4632, 31.2834, 33.7596, 27.7849,7.569, 29.2171, 32.337}; f={ 34.4632, 31.2834,0,0,0, 29.2171, 32.337}; Predicted missing values are: 28.97423999999749 at t=3.0 27.73036999999391 at t=4.0 27.746439999989708 at t=5.0 3.3 Performance measure: Normalized RMS Error (NRMSE) is used to measure the performance of missing value estimation method, it can be calculated as:              m i n j m i n j jiY jiYjiY NRMSE 1 1 2 1 2 1 ),( ),(),( ~ (11)
  • 5. IJEEI ISSN: 2089-3272  KNN and ARL Based Imputation to Estimate Missing Values (Deepali A Patil) 123 where, Y is the true value, Y ~ is the estimated value, and m and n are the total number of rows and columns respectively. 4. Results and Discussion Missing values are estimated for Synthetic control chart dataset [2] using K-nearest neighbor and Auto Regressive (AR) Model. Dataset is shown in figure 2. Figure 2. Synthetic contol chart Time series dataset Performance of KNN Impute was assessed over different K values and different percentage of missing values shown in figure 3. Result shows that when the numbers of K values are in the range of 10-20 then, NRMSE error rate is lesser. That is, by setting k-value between 10 and 20 brings the best results for KNN imputation shown in table 1. Table 1. NRMSE result for KNN Impute over different K values. K values KNN Impute 3 0.179 5 0.15876 10 0.1111 15 0.0889 22 0.1245 25 0.1489 Figure 3. Effect of number of nearest neighbors used for KNN-based estimation Then Performance of KNN Impute and ARL Impute were measured over NRMSE and % of missing values as shown in figure 4. The graph shows that the error rate of KNNimpute is higher than ARL impute when the percentage of missing values are inceased in a particular column. Comparision of NRMSE result is shown in Table 2.
  • 6.  ISSN: 2089-3272 IJEEI Vol. 2, No. 3, September 2014 : 119 – 124 124 Table 2. NRMSE result for ARL and KNN Impute over % of missing values % of missing values ARL Impute KNN Impute 10 0.08004 0.12 15 0.0912 0.28 20 0.17 0.33 Figure 4. NRMSE comparison pf ARL and KNN Impute algorithm 5. Conclusion And Future Work We have developed two algorithms KNN impute and ARL impute to estimate and impute the missing values. Both the algorithms give best results to estimate the missing values. KNN Impute gives best results when the K value is in the range of 10-20. KNN Impute estimates missing value from a particular column and ARL Impute estimates many missing values from a particular column. When the performance of both algorithms is measured based on normalized root mean square error (NRMSE) then ARL Impute gives best estimation, as its error rate is less. Our future work aims at finding the missing values for more than two columns of missing data using the probablilistic models like PPCA (Probabilistic principle component analysis). References [1] David Sheung Chi Fung. “Methods for the Estimation of Missing Values in Time Series”. a thesis Submitted to the Faculty of Communications, Health and Science Edith Cowan University Perth, Western Australia. [2] UCI machine learning repository, http://archive.ics.uci.edu/ml, 2010. [3] MJ Shepperd and MH Cartwright. “Dealing with Missing Software Project Data”. Empirical Software Engineering Research Group School of Design, Engineering & Computing, Bournemouth University. [4] Trevor Hastie, Robert Tibshirani, Gavin Sherlock, Michael Eisen, Patrick Brown, David Botstein. “Imputing Missing Data for Gene Expression Arrays”. [5] YTao, D Papadias, and X Lian. Reverse KNN Search in Arbitrary Dimensionality. Proc. 30th Int’l Conf. Very Large Data Bases (VLDB ’04), 2004. [6] Miew Keen Choong, Maurice Charbit, and Hong Yan. Autoregressive-Model-Based Missing Value Estimation for DNA Microarray Time Series Data. IEEE Transactions on Information Technology In Biomedicine. 2009; 13(1): 131-138. [7] O Troyanskaya, M Cantor, G Sherlock, et al. "Missing value estimation methods for DNA microarrays". Bioinformatics. 2001; 17: 520-525. [8] PMT Broersen. “Finite sample criteria for autoregressive order selection”. IEEE Trans. Signal Process. 2000; 48(12): 3550–3558. [9] MS Sehgal, L Gondal, LS Dooley. “Collateral Missing value imputation: a new robust missing value estimation algorithm for microarray data”. Bioinformatics. 2005; 21: 2417-2423. [10] H Kim, GH Golub, and H Park. “Missing value estimation for DNA microarray gene expression data: local least squares imputation”. Bioinformatics. 2005; 21: 187-198.