SlideShare a Scribd company logo
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
1 | P a g e
COMPARISION OF PERCENTAGE ERROR BY USING
IMPUTATION METHOD ON MID TERM EXAMINATION
DATA
V.B. Kamble
P.E.S. College of Engineering,
Aurangabad. (M.S.), India
S.N. Deshmukh
Dr. Babasaheb Ambedkar Marathwada University,
Aurangabad. (M.S.) India.
ABSTRACT
The issue of incomplete data exists across the entire field of data mining. In this paper,
Mean Imputation, Median Imputation and Standard Deviation Imputation are used to
deal with challenges of incomplete data on classification problems. By using different
imputation methods converts incomplete dataset in to the complete dataset. On
complete dataset by applying the suitable Imputation Method and comparing the
percentage error of Imputation Method and comparing the result
INTRODUCTION
Missing data are the absence of data items; they hide some information that may be
important. The presence of missing data is a general and challenging problem in the
data analysis field. Fortunately, missing data imputation techniques can be used to
improve the data quality. Missing data imputation techniques refer to any strategy that
fills in missing values of a dataset so that standard data analysis methods can be applied
to analyzed completed dataset [1].
Information quality is important to organization. People use information attribute as a
tool for accessing information quality. Information quality is measured based on users
as well as experts opinion on the information attributes. The commonly known
information attributes for information quality including accuracy, objectivity,
reputation, access, security, relevancy, value added, timeliness, completeness, amount
of data, and ease of understanding and consistent representation. These attribute can
also be applicable to data quality. Commonly, one can rarely find a data set that
contains complete entries [2].
1.1 Related Work- Methods for dealing with missing values can be classified into
three categories 1) Case Deletion, 2) Learning Without Handling of Missing Values,
and 3) Missing Value Imputation. The case deletion is to simply omit those cases with
missing values and only to use the remaining instances to finish the learning
assignments. The second approach is to learn without handling of missing data.
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
2 | P a g e
Missing data imputation methods advocates filling in missing values before a learning
application. Missing data imputation is a procedure that replaces the missing values
with some possible values [3].
1.2 Missing Data- Different methods have been applied in data mining to handle
missing values in database. Data with missing values could be ignored, or a global
constant could be used to fill the missing values, such as attribute mean, attribute mean
of the same class, or an algorithm could be applied to find the missing values. Missing
data imputation techniques means a strategy to fill the missing values of a dataset in
order to apply the standard methods which require completed data set for analysis.
These techniques retain data in incomplete cases, as well as impute values of correlated
variables.
Missing data imputations techniques are classified as ignorable missing data
imputations methods, which include single imputation methods and multiple imputation
methods, and non-ignorable missing data imputations methods which include likelihood
based methods and the non-likelihood based methods. Single imputation methods could
fill one value for each missing values and it is more commonly used at present than
multiple imputations which replace each missing value with several possible values and
better reflects sampling variability about actual value [4].
1.3 Patterns of Missing Values- There are a number of ways to know how missing
data arises. Little and Rubin introduced specific missing data terminology as a standard
framework to deal with missing data mechanisms and their effect on data analysis.
(A) Missing completely at Random (MCAR)- If the probability that a response is
missing is independent of both the observed data for that case and the unobserved
responses are simple a random sample from the observed data. An example of MCAR
missing data arises when investigators randomly assign research participants to
complete two-thirds of a survey instrument
(B) Missing at Random (MAR)- If the probability that a response is missing depends
on the observed data, but not on the unobserved data. This assumes the parameters of
the model for the data are distinct from the parameters of the missingness mechanism.
The missingness mechanism is ignorable. For example, in a reading comprehension test
at the beginning of a survey administration session, research participants with lower
reading comprehension scores may be less likely to complete the entire survey. The
missing data are due to some other external influence.
(C) Not-Missing at Random. (NMAR)- When respondents and non-respondents, with
the same values of some variables observed for both, differ systematically with respect
to values of the variable missing for the non-respondent. In other words, the pattern of
data missingness is non-random and it is not predictable from other variables in the
database. For example, a participant in a weight-loss study does not attend a weight-in
due to concerns about his/her weight loss; his/her data are missing due to non-ignorable
factors.
In practice it is usually difficult to meet the MCAR assumption. It is mentioned that
when making sampling distribution inferences about the parameter of the data, it is
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
3 | P a g e
appropriate to ignore the process that causes missing data if the missing data are
missing at random and the observed data are observed at random, but the inferences are
generally conditional on the observed pattern of missing data [5].
FOUR DIFFERENT METHODS TO DEAL WITH MISSING VALUES
A. Case Deletion (CD) - It is Also known as complete case analysis. It is available in
all statistical packages and is the default method in many programs. This method
consists of discarding all instances (cases) with missing values for at least one feature.
A variation of this method consists of determining the extent of missing data on each
instance and attribute, and deletes the instances and/or attributes with high levels of
missing data. Before deleting any attribute, it is necessary to evaluate its relevance to
the analysis. Unfortunately, relevant attributes should be kept even with a high degree
of missing values for other situations where the sample size is insufficient or some
structure exists in the missing data, CD has been shown to produce more biased
estimates than alternative methods. CD should be applied only in cases in which data
are missing completely at random.
B. Mean Imputation (MI). The method replaces the missing data for a given feature
(attribute) by mean of all the known values for that particular attribute.
Let us consider that the value of ‫ݔ‬௜௝ of the ݇௧௛ class, ‫ܥ‬௞, is missing then ‫ݔ‬௜௝ is
calculated as
‫ݔ‬௜௝
= ∑
௫
௡ೖ
௜௝
(1)
݅: ‫ݔ‬௜௝
€ck
Where nk represents the number of non-missing values in the ݆௧௛feature of the ݇௧௛ class.
However, According to Little and Rubin, the drawback of MI are
a) Sample size is overestimated
b) Variance is underestimated
c) Correlation is negatively biased
d) The distribution of new values is an incorrect representation of the population values
because the shape of the distribution is distorted by adding values equal to the mean.
Replacing all missing records with a single value will deflate the variance and
artificially inflate the significance of any statistical test based on it.
C. Median Imputation (MDI). This method uses median of all known values of the
feature or attribute in the class where the missing instance with missing value belongs.
Consider the value ‫ݔ‬௜௝ of the ݇௧௛ class, ‫ܥ‬௞, is missing. It will be calculated as
‫ݔ‬௜௝ୀ݉݁݀݅ܽ݊{݅: ‫ݔ‬௜௝€ܿ௞}{‫ݔ‬௜௝}} ( 2)
Instead of mean and median, mode also can be in imputation. Imputation method is
applied separately for many attribute. However, imputation does not consider co-
relation structure of the data [6].
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
4 | P a g e
D. Standard Deviation
The standard deviation spread data about the mean value. It is useful in comparing sets
of data which may have the same mean but a different range. The Standard Deviation is
given by the formula
S
୒ୀටଵ
୒
∑ (୶౟ష
ొ
౟సభ ௫ො)ଶ
(3)
Where{x1,x2,…,xn} are the observed values of the sample items and is the mean
value of these observations, while the denominator N stands for the size of the sample
[7].
EXPERIMENTAL ANALYSIS
We use three techniques for data retrieval
1) Imputing the missing values using attribute mean value.
2) Imputing the missing values using Median Imputation
3) Imputing the missing values using Standard Deviation
3.1 Imputing Missing Values Using Attribute Mean Value
This is one of the most frequently used methods. It consists of replacing the missing
data for a given feature (attribute) by the mean of all known values of that attribute in
the class where the instances with missing attribute belongs. In this method by
replacing the missing values by attribute mean and find out percentage accuracy with
original values by using the Mean Imputation Method.
3.2 Imputing Missing Values Using Median Imputation
In this method the missing values of instances are imputed. This method uses
median of all known values of the feature or attribute in the class where the missing
instance with missing value belongs.
Instead of mean and median, mode also can be in imputation. Imputation
method is applied separately for many attribute. However, imputation does not consider
co-relation structure of the data.
3.3 Imputing Missing Values Using Standard Deviation Imputation
The standard deviation spread data about the mean value. It is useful in comparing sets
of data which may have the same mean but a different range. The missing values can be
replaced by Standards Deviation value at respective attribute and then find out the
percentage accuracy with original value.
DATASET USED
In this paper dataset consists of records of class test marks of engineering college. In
dataset contains four attribute Roll No., Imputed value by Mean Imputation, Imputed
value by Median Imputation and Imputed value by Standard Deviation Imputation. In
dataset Roll No. are imaginary and generated for data analysis. Dataset contains some
missing values randomly distributed at particular record number by using imputation
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
5 | P a g e
techniques fill up the missing value so that dataset becomes complete dataset so that it
is easy to analyze the dataset. Dataset is shown in following table.
Table 1. Dataset with Mean imputation, Median Imputation, Standard Deviation
Imputation
Roll.
No.
Impute
d Value
by MI
Impute
d value
by MDI
Impute
d value
by SDI
121 8.45 8 4.14
140 11.16 12 4.53
165 8.45 8 4.14
200 8.45 8 4.14
230 10.56 11 4.34
300 11.16 12 4.53
534 11.16 12 4.53
1000 11.16 12 4.53
1429 8.45 8 4.14
1800 11.16 12 4.53
2187 9.75 10 4.34
2220 11.16 12 4.53
2247 10.56 11 4.34
2350 11.16 12 4.53
2557 10.56 11 4.14
3000 11.16 12 4.53
3160 10.56 11 4.34
3225 8.45 8 4.14
3295 10.56 11 4.34
3350 11.16 12 4.53
3467 10.56 11 4.34
3550 11.16 12 4.53
3679 11.16 12 4.53
3700 10.56 11 4.34
3773 11.16 12 4.53
3900 9.75 10 3.9
4113 11.16 12 4.53
4125 11.16 12 4.53
4132 10.56 11 3.9
4150 8.45 8 4.14
4175 10.56 11 4.34
4190 11.16 12 4.53
4202 8.45 8 4.14
4250 11.16 12 4.53
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
6 | P a g e
4352 11.16 12 4.53
4500 8.45 8 4.14
4600 8.45 8 4.14
4762 9.75 10 3.9
4850 10.56 11 4.34
4920 11.16 12 4.53
4984 8.45 8 4.14
4998 11.16 12 4.53
Percenta
ge Error
2.1 0.04 0.05
4.1 Comparison of Percentage Errors
Percentage Error calculates by observing the Experimental value and the Actual Value.
By taking the difference between Experimental Value and Actual Value Divided by
Actual Value and Multiplied by hundred so that we calculate the Percentage Error by
using the following formula.
Percentage Error= Experimental Value – Actual Value * 100
Actual Value
Table 2. Comparison of Percentage Error
Imputation Technique Percentage Error
Mean imputation 2.1
Median Imputation 0.04
Standard Deviation
Imputation
0.05
By comparing the percentage error of imputation method. The percentage error of Mean
Imputation is 2.1 %.The percentage error of Median Imputation is 0.04 and percentage
error of Standard Deviation Imputation is 0.05 so the Median Imputation method is
having lowest percentage of error as compare to Mean Imputation method and Standard
Deviation Imputation method. So the Median Imputation Method is more suitable as
compare to other method.
CONCLUSION AND FUTURE WORK
Missing values are regarded as serious problems in most of the information systems due
to unavailability of data and must be impute before the dataset is used. To handle these
missing values three techniques are used named as Mean imputation, Median
Imputation and Standard Deviation Imputation. Median Imputation method is having
lowest percentage of error as compare to Mean Imputation method and Standard
Deviation Imputation method. So the Median Imputation Method is more suitable as
compare to other method.
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
7 | P a g e
The proposed work handles missing values only for the numerical attributes. Further it
can be extended to handle a categorical attribute. Different classification algorithm can
be used for comparative analysis of missing data techniques. Missing data techniques
can also be implemented in mat lab.
REFERENCES
[1] Dinesh J. Prajapati, Jagruti H. Prajapati, and Handling Missing Values: Application
to University Data set. Issue 1, Vol. 1(August-2011), ISSN 2249-6149
[2] Shamsher Singh, Prof. Jagdish Prasad, Estimation of Missing Values in the Data
Mining and comparison of Imputation Methods. Mathematical Journal of
Interdisciplinary Sciences Vol. 1, Issue 1, March 2013, pp. 75–90
[3] Xiao Feng Zhu, Shichao Zhang, Senior Member, IEEE, Zhi Jin, Zili Zhang, and
Zhuoming Xu, Missing Value Estimation for Mixed-Attribute Data Sets. IEEE
Transactions on Knowledge And Data Engineering, Vol. 23, No. 1, January 2011.
[4] T.R.Sivapriya, V. Thavavel, A.R.Nadira Banu Kamal, Imputation and classification
of Missing Data Using Least Square Support Vector Machines- A New Approach in
Dementia Diagnosis. International Journal of Advanced Research in Artificial
Intelligence, Vol.1, No.4, 2012
[5] Yann-Yann Shieh, Imputation Methods on General Linear Mixed Models of
Longitudinal Studies, American Institutes for Research
[6] ] Edgar Acu˜Na1 And Caroline Rodriguez, The Treatment Of Missing Values And
Its Effect In The Classifier Accuracy Studies In Classification, Data Analysis, And
Knowledge Organization, 2004, Springer.Com
[7] MS. R. Malarvizhi, Dr. Antony Thanamani, Comparision of Imputation Techniques
after Classifying the Dataset Using Knn Classifier for the Imputation of Missing Data,
International Journal of Computational Engineering Research (IJCER online.com)
ISSN 2250-3005, Janaury-2013
[8] Anjana Sharma, Naina Mehta, Iti Sharma, Reasoning With Missing Values in Multi
Attribute Datasets. International Journal of Advanced Research in Computer Science
and Software Engineering, Volume 3, Issue5, May 2013 ISSN: 2277 128X

More Related Content

What's hot

Lecture 6 guidelines_and_assignment
Lecture 6 guidelines_and_assignmentLecture 6 guidelines_and_assignment
Lecture 6 guidelines_and_assignment
Daria Bogdanova
 
Pca analysis
Pca analysisPca analysis
Statistical Methods to Handle Missing Data
Statistical Methods to Handle Missing DataStatistical Methods to Handle Missing Data
Statistical Methods to Handle Missing Data
Tianfan Song
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and Statistics
T.S. Lim
 
Missing Data and Causes
Missing Data and CausesMissing Data and Causes
Missing Data and Causes
akanni azeez olamide
 
Analysis and Interpretation of Data
Analysis and Interpretation of DataAnalysis and Interpretation of Data
Analysis and Interpretation of Data
Multan Post Graduate College, Multan
 
Data analysis
Data analysisData analysis
Data analysis
Nursing Path
 
ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...
ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...
ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...
ijcsit
 
Data analysis and Interpretation
Data analysis and InterpretationData analysis and Interpretation
Data analysis and Interpretation
Mehul Gondaliya
 
Statistical Modeling in 3D: Explaining, Predicting, Describing
Statistical Modeling in 3D: Explaining, Predicting, DescribingStatistical Modeling in 3D: Explaining, Predicting, Describing
Statistical Modeling in 3D: Explaining, Predicting, Describing
Galit Shmueli
 
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
ijcsa
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation a
Rai University
 
Statistical Approaches to Missing Data
Statistical Approaches to Missing DataStatistical Approaches to Missing Data
Statistical Approaches to Missing Data
DataCards
 
Statistical tools in research 1
Statistical tools in research 1Statistical tools in research 1
Statistical tools in research 1
ashish7sattee
 
Eda sri
Eda sriEda sri
Recommender system
Recommender systemRecommender system
Recommender system
Bhumi Patel
 
Survey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesSurvey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction Techniques
IRJET Journal
 
Statistics as a discipline
Statistics as a disciplineStatistics as a discipline
Statistics as a discipline
RosalinaTPayumo
 
Statsci
StatsciStatsci

What's hot (19)

Lecture 6 guidelines_and_assignment
Lecture 6 guidelines_and_assignmentLecture 6 guidelines_and_assignment
Lecture 6 guidelines_and_assignment
 
Pca analysis
Pca analysisPca analysis
Pca analysis
 
Statistical Methods to Handle Missing Data
Statistical Methods to Handle Missing DataStatistical Methods to Handle Missing Data
Statistical Methods to Handle Missing Data
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and Statistics
 
Missing Data and Causes
Missing Data and CausesMissing Data and Causes
Missing Data and Causes
 
Analysis and Interpretation of Data
Analysis and Interpretation of DataAnalysis and Interpretation of Data
Analysis and Interpretation of Data
 
Data analysis
Data analysisData analysis
Data analysis
 
ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...
ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...
ON FEATURE SELECTION ALGORITHMS AND FEATURE SELECTION STABILITY MEASURES: A C...
 
Data analysis and Interpretation
Data analysis and InterpretationData analysis and Interpretation
Data analysis and Interpretation
 
Statistical Modeling in 3D: Explaining, Predicting, Describing
Statistical Modeling in 3D: Explaining, Predicting, DescribingStatistical Modeling in 3D: Explaining, Predicting, Describing
Statistical Modeling in 3D: Explaining, Predicting, Describing
 
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation a
 
Statistical Approaches to Missing Data
Statistical Approaches to Missing DataStatistical Approaches to Missing Data
Statistical Approaches to Missing Data
 
Statistical tools in research 1
Statistical tools in research 1Statistical tools in research 1
Statistical tools in research 1
 
Eda sri
Eda sriEda sri
Eda sri
 
Recommender system
Recommender systemRecommender system
Recommender system
 
Survey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesSurvey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction Techniques
 
Statistics as a discipline
Statistics as a disciplineStatistics as a discipline
Statistics as a discipline
 
Statsci
StatsciStatsci
Statsci
 

Similar to COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMINATION DATA

IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IJDKP
 
Twala2007.doc
Twala2007.docTwala2007.doc
Twala2007.doc
butest
 
Data Analytics on Solar Energy Using Hadoop
Data Analytics on Solar Energy Using HadoopData Analytics on Solar Energy Using Hadoop
Data Analytics on Solar Energy Using Hadoop
IJMERJOURNAL
 
Machine Learning Approaches and its Challenges
Machine Learning Approaches and its ChallengesMachine Learning Approaches and its Challenges
Machine Learning Approaches and its Challenges
ijcnes
 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence Chain
IRJET Journal
 
Ijetr021251
Ijetr021251Ijetr021251
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
IJMIT JOURNAL
 
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
IJMIT JOURNAL
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
nalini manogaran
 
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
IJDKP
 
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
IJDKP
 
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTIONINFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
IJDKP
 
Missing data exploration in air quality data set using R-package data visuali...
Missing data exploration in air quality data set using R-package data visuali...Missing data exploration in air quality data set using R-package data visuali...
Missing data exploration in air quality data set using R-package data visuali...
journalBEEI
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2
Gokulks007
 
K-NN Classifier Performs Better Than K-Means Clustering in Missing Value Imp...
K-NN Classifier Performs Better Than K-Means Clustering in  Missing Value Imp...K-NN Classifier Performs Better Than K-Means Clustering in  Missing Value Imp...
K-NN Classifier Performs Better Than K-Means Clustering in Missing Value Imp...
IOSR Journals
 
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its AnalysisIRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET Journal
 
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCEA SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
IJCI JOURNAL
 
Multiple imputation of missing data
Multiple imputation of missing dataMultiple imputation of missing data
Multiple imputation of missing data
Statistics Specialist
 
Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data Mining
IOSR Journals
 
IT-601 Lecture Notes-UNIT-2.pdf Data Analysis
IT-601 Lecture Notes-UNIT-2.pdf Data AnalysisIT-601 Lecture Notes-UNIT-2.pdf Data Analysis
IT-601 Lecture Notes-UNIT-2.pdf Data Analysis
Dr. Radhey Shyam
 

Similar to COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMINATION DATA (20)

IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
 
Twala2007.doc
Twala2007.docTwala2007.doc
Twala2007.doc
 
Data Analytics on Solar Energy Using Hadoop
Data Analytics on Solar Energy Using HadoopData Analytics on Solar Energy Using Hadoop
Data Analytics on Solar Energy Using Hadoop
 
Machine Learning Approaches and its Challenges
Machine Learning Approaches and its ChallengesMachine Learning Approaches and its Challenges
Machine Learning Approaches and its Challenges
 
IRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence ChainIRJET- Missing Data Imputation by Evidence Chain
IRJET- Missing Data Imputation by Evidence Chain
 
Ijetr021251
Ijetr021251Ijetr021251
Ijetr021251
 
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
 
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
 
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
 
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
TECHNICAL REVIEW: PERFORMANCE OF EXISTING IMPUTATION METHODS FOR MISSING DATA...
 
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTIONINFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
 
Missing data exploration in air quality data set using R-package data visuali...
Missing data exploration in air quality data set using R-package data visuali...Missing data exploration in air quality data set using R-package data visuali...
Missing data exploration in air quality data set using R-package data visuali...
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2
 
K-NN Classifier Performs Better Than K-Means Clustering in Missing Value Imp...
K-NN Classifier Performs Better Than K-Means Clustering in  Missing Value Imp...K-NN Classifier Performs Better Than K-Means Clustering in  Missing Value Imp...
K-NN Classifier Performs Better Than K-Means Clustering in Missing Value Imp...
 
IRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its AnalysisIRJET- Probability based Missing Value Imputation Method and its Analysis
IRJET- Probability based Missing Value Imputation Method and its Analysis
 
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCEA SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
 
Multiple imputation of missing data
Multiple imputation of missing dataMultiple imputation of missing data
Multiple imputation of missing data
 
Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data Mining
 
IT-601 Lecture Notes-UNIT-2.pdf Data Analysis
IT-601 Lecture Notes-UNIT-2.pdf Data AnalysisIT-601 Lecture Notes-UNIT-2.pdf Data Analysis
IT-601 Lecture Notes-UNIT-2.pdf Data Analysis
 

More from ijiert bestjournal

CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP
 CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP
CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP
ijiert bestjournal
 
A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...
A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...
A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...
ijiert bestjournal
 
COMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAF
COMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAFCOMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAF
COMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAF
ijiert bestjournal
 
POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE
 POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE
POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE
ijiert bestjournal
 
FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY
FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY
FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY
ijiert bestjournal
 
REVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAIN
REVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAINREVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAIN
REVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAIN
ijiert bestjournal
 
ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...
ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...
ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...
ijiert bestjournal
 
SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...
SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...
SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...
ijiert bestjournal
 
EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...
EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...
EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...
ijiert bestjournal
 
STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR
STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR
STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR
ijiert bestjournal
 
DESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFT
DESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFTDESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFT
DESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFT
ijiert bestjournal
 
ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...
ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...
ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...
ijiert bestjournal
 
HEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENT
HEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENTHEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENT
HEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENT
ijiert bestjournal
 
MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH
MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH
MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH
ijiert bestjournal
 
IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW
IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW
IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW
ijiert bestjournal
 
DESIGN OF WELDING FIXTURES AND POSITIONERS
DESIGN OF WELDING FIXTURES AND POSITIONERSDESIGN OF WELDING FIXTURES AND POSITIONERS
DESIGN OF WELDING FIXTURES AND POSITIONERS
ijiert bestjournal
 
ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...
ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...
ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...
ijiert bestjournal
 
REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...
REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...
REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...
ijiert bestjournal
 
PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...
PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...
PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...
ijiert bestjournal
 
MAGNETIC ABRASIVE FINISHING
MAGNETIC ABRASIVE FINISHINGMAGNETIC ABRASIVE FINISHING
MAGNETIC ABRASIVE FINISHING
ijiert bestjournal
 

More from ijiert bestjournal (20)

CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP
 CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP
CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP
 
A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...
A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...
A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...
 
COMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAF
COMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAFCOMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAF
COMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAF
 
POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE
 POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE
POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE
 
FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY
FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY
FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY
 
REVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAIN
REVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAINREVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAIN
REVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAIN
 
ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...
ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...
ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...
 
SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...
SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...
SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...
 
EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...
EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...
EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...
 
STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR
STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR
STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR
 
DESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFT
DESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFTDESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFT
DESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFT
 
ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...
ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...
ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...
 
HEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENT
HEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENTHEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENT
HEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENT
 
MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH
MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH
MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH
 
IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW
IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW
IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW
 
DESIGN OF WELDING FIXTURES AND POSITIONERS
DESIGN OF WELDING FIXTURES AND POSITIONERSDESIGN OF WELDING FIXTURES AND POSITIONERS
DESIGN OF WELDING FIXTURES AND POSITIONERS
 
ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...
ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...
ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...
 
REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...
REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...
REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...
 
PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...
PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...
PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...
 
MAGNETIC ABRASIVE FINISHING
MAGNETIC ABRASIVE FINISHINGMAGNETIC ABRASIVE FINISHING
MAGNETIC ABRASIVE FINISHING
 

Recently uploaded

Pressure Relief valve used in flow line to release the over pressure at our d...
Pressure Relief valve used in flow line to release the over pressure at our d...Pressure Relief valve used in flow line to release the over pressure at our d...
Pressure Relief valve used in flow line to release the over pressure at our d...
cannyengineerings
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
Kamal Acharya
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
Roger Rozario
 
P5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civilP5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civil
AnasAhmadNoor
 
Mechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineeringMechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineering
sachin chaurasia
 
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
Gino153088
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
upoux
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Sinan KOZAK
 
Supermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdfSupermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdf
Kamal Acharya
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
Addu25809
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
aryanpankaj78
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
Divyanshu
 
Generative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdfGenerative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdf
mahaffeycheryld
 
AI-Based Home Security System : Home security
AI-Based Home Security System : Home securityAI-Based Home Security System : Home security
AI-Based Home Security System : Home security
AIRCC Publishing Corporation
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
MadhavJungKarki
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
PriyankaKilaniya
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
PreethaV16
 
Introduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.pptIntroduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.ppt
Dwarkadas J Sanghvi College of Engineering
 
smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...
um7474492
 

Recently uploaded (20)

Pressure Relief valve used in flow line to release the over pressure at our d...
Pressure Relief valve used in flow line to release the over pressure at our d...Pressure Relief valve used in flow line to release the over pressure at our d...
Pressure Relief valve used in flow line to release the over pressure at our d...
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
 
P5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civilP5 Working Drawings.pdf floor plan, civil
P5 Working Drawings.pdf floor plan, civil
 
Mechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineeringMechatronics material . Mechanical engineering
Mechatronics material . Mechanical engineering
 
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
 
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
一比一原版(osu毕业证书)美国俄勒冈州立大学毕业证如何办理
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
 
Supermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdfSupermarket Management System Project Report.pdf
Supermarket Management System Project Report.pdf
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
 
Generative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdfGenerative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdf
 
AI-Based Home Security System : Home security
AI-Based Home Security System : Home securityAI-Based Home Security System : Home security
AI-Based Home Security System : Home security
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
 
Introduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.pptIntroduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.ppt
 
smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...
 

COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMINATION DATA

  • 1. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 1 | P a g e COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMINATION DATA V.B. Kamble P.E.S. College of Engineering, Aurangabad. (M.S.), India S.N. Deshmukh Dr. Babasaheb Ambedkar Marathwada University, Aurangabad. (M.S.) India. ABSTRACT The issue of incomplete data exists across the entire field of data mining. In this paper, Mean Imputation, Median Imputation and Standard Deviation Imputation are used to deal with challenges of incomplete data on classification problems. By using different imputation methods converts incomplete dataset in to the complete dataset. On complete dataset by applying the suitable Imputation Method and comparing the percentage error of Imputation Method and comparing the result INTRODUCTION Missing data are the absence of data items; they hide some information that may be important. The presence of missing data is a general and challenging problem in the data analysis field. Fortunately, missing data imputation techniques can be used to improve the data quality. Missing data imputation techniques refer to any strategy that fills in missing values of a dataset so that standard data analysis methods can be applied to analyzed completed dataset [1]. Information quality is important to organization. People use information attribute as a tool for accessing information quality. Information quality is measured based on users as well as experts opinion on the information attributes. The commonly known information attributes for information quality including accuracy, objectivity, reputation, access, security, relevancy, value added, timeliness, completeness, amount of data, and ease of understanding and consistent representation. These attribute can also be applicable to data quality. Commonly, one can rarely find a data set that contains complete entries [2]. 1.1 Related Work- Methods for dealing with missing values can be classified into three categories 1) Case Deletion, 2) Learning Without Handling of Missing Values, and 3) Missing Value Imputation. The case deletion is to simply omit those cases with missing values and only to use the remaining instances to finish the learning assignments. The second approach is to learn without handling of missing data.
  • 2. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 2 | P a g e Missing data imputation methods advocates filling in missing values before a learning application. Missing data imputation is a procedure that replaces the missing values with some possible values [3]. 1.2 Missing Data- Different methods have been applied in data mining to handle missing values in database. Data with missing values could be ignored, or a global constant could be used to fill the missing values, such as attribute mean, attribute mean of the same class, or an algorithm could be applied to find the missing values. Missing data imputation techniques means a strategy to fill the missing values of a dataset in order to apply the standard methods which require completed data set for analysis. These techniques retain data in incomplete cases, as well as impute values of correlated variables. Missing data imputations techniques are classified as ignorable missing data imputations methods, which include single imputation methods and multiple imputation methods, and non-ignorable missing data imputations methods which include likelihood based methods and the non-likelihood based methods. Single imputation methods could fill one value for each missing values and it is more commonly used at present than multiple imputations which replace each missing value with several possible values and better reflects sampling variability about actual value [4]. 1.3 Patterns of Missing Values- There are a number of ways to know how missing data arises. Little and Rubin introduced specific missing data terminology as a standard framework to deal with missing data mechanisms and their effect on data analysis. (A) Missing completely at Random (MCAR)- If the probability that a response is missing is independent of both the observed data for that case and the unobserved responses are simple a random sample from the observed data. An example of MCAR missing data arises when investigators randomly assign research participants to complete two-thirds of a survey instrument (B) Missing at Random (MAR)- If the probability that a response is missing depends on the observed data, but not on the unobserved data. This assumes the parameters of the model for the data are distinct from the parameters of the missingness mechanism. The missingness mechanism is ignorable. For example, in a reading comprehension test at the beginning of a survey administration session, research participants with lower reading comprehension scores may be less likely to complete the entire survey. The missing data are due to some other external influence. (C) Not-Missing at Random. (NMAR)- When respondents and non-respondents, with the same values of some variables observed for both, differ systematically with respect to values of the variable missing for the non-respondent. In other words, the pattern of data missingness is non-random and it is not predictable from other variables in the database. For example, a participant in a weight-loss study does not attend a weight-in due to concerns about his/her weight loss; his/her data are missing due to non-ignorable factors. In practice it is usually difficult to meet the MCAR assumption. It is mentioned that when making sampling distribution inferences about the parameter of the data, it is
  • 3. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 3 | P a g e appropriate to ignore the process that causes missing data if the missing data are missing at random and the observed data are observed at random, but the inferences are generally conditional on the observed pattern of missing data [5]. FOUR DIFFERENT METHODS TO DEAL WITH MISSING VALUES A. Case Deletion (CD) - It is Also known as complete case analysis. It is available in all statistical packages and is the default method in many programs. This method consists of discarding all instances (cases) with missing values for at least one feature. A variation of this method consists of determining the extent of missing data on each instance and attribute, and deletes the instances and/or attributes with high levels of missing data. Before deleting any attribute, it is necessary to evaluate its relevance to the analysis. Unfortunately, relevant attributes should be kept even with a high degree of missing values for other situations where the sample size is insufficient or some structure exists in the missing data, CD has been shown to produce more biased estimates than alternative methods. CD should be applied only in cases in which data are missing completely at random. B. Mean Imputation (MI). The method replaces the missing data for a given feature (attribute) by mean of all the known values for that particular attribute. Let us consider that the value of ‫ݔ‬௜௝ of the ݇௧௛ class, ‫ܥ‬௞, is missing then ‫ݔ‬௜௝ is calculated as ‫ݔ‬௜௝ = ∑ ௫ ௡ೖ ௜௝ (1) ݅: ‫ݔ‬௜௝ €ck Where nk represents the number of non-missing values in the ݆௧௛feature of the ݇௧௛ class. However, According to Little and Rubin, the drawback of MI are a) Sample size is overestimated b) Variance is underestimated c) Correlation is negatively biased d) The distribution of new values is an incorrect representation of the population values because the shape of the distribution is distorted by adding values equal to the mean. Replacing all missing records with a single value will deflate the variance and artificially inflate the significance of any statistical test based on it. C. Median Imputation (MDI). This method uses median of all known values of the feature or attribute in the class where the missing instance with missing value belongs. Consider the value ‫ݔ‬௜௝ of the ݇௧௛ class, ‫ܥ‬௞, is missing. It will be calculated as ‫ݔ‬௜௝ୀ݉݁݀݅ܽ݊{݅: ‫ݔ‬௜௝€ܿ௞}{‫ݔ‬௜௝}} ( 2) Instead of mean and median, mode also can be in imputation. Imputation method is applied separately for many attribute. However, imputation does not consider co- relation structure of the data [6].
  • 4. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 4 | P a g e D. Standard Deviation The standard deviation spread data about the mean value. It is useful in comparing sets of data which may have the same mean but a different range. The Standard Deviation is given by the formula S ୒ୀටଵ ୒ ∑ (୶౟ష ొ ౟సభ ௫ො)ଶ (3) Where{x1,x2,…,xn} are the observed values of the sample items and is the mean value of these observations, while the denominator N stands for the size of the sample [7]. EXPERIMENTAL ANALYSIS We use three techniques for data retrieval 1) Imputing the missing values using attribute mean value. 2) Imputing the missing values using Median Imputation 3) Imputing the missing values using Standard Deviation 3.1 Imputing Missing Values Using Attribute Mean Value This is one of the most frequently used methods. It consists of replacing the missing data for a given feature (attribute) by the mean of all known values of that attribute in the class where the instances with missing attribute belongs. In this method by replacing the missing values by attribute mean and find out percentage accuracy with original values by using the Mean Imputation Method. 3.2 Imputing Missing Values Using Median Imputation In this method the missing values of instances are imputed. This method uses median of all known values of the feature or attribute in the class where the missing instance with missing value belongs. Instead of mean and median, mode also can be in imputation. Imputation method is applied separately for many attribute. However, imputation does not consider co-relation structure of the data. 3.3 Imputing Missing Values Using Standard Deviation Imputation The standard deviation spread data about the mean value. It is useful in comparing sets of data which may have the same mean but a different range. The missing values can be replaced by Standards Deviation value at respective attribute and then find out the percentage accuracy with original value. DATASET USED In this paper dataset consists of records of class test marks of engineering college. In dataset contains four attribute Roll No., Imputed value by Mean Imputation, Imputed value by Median Imputation and Imputed value by Standard Deviation Imputation. In dataset Roll No. are imaginary and generated for data analysis. Dataset contains some missing values randomly distributed at particular record number by using imputation
  • 5. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 5 | P a g e techniques fill up the missing value so that dataset becomes complete dataset so that it is easy to analyze the dataset. Dataset is shown in following table. Table 1. Dataset with Mean imputation, Median Imputation, Standard Deviation Imputation Roll. No. Impute d Value by MI Impute d value by MDI Impute d value by SDI 121 8.45 8 4.14 140 11.16 12 4.53 165 8.45 8 4.14 200 8.45 8 4.14 230 10.56 11 4.34 300 11.16 12 4.53 534 11.16 12 4.53 1000 11.16 12 4.53 1429 8.45 8 4.14 1800 11.16 12 4.53 2187 9.75 10 4.34 2220 11.16 12 4.53 2247 10.56 11 4.34 2350 11.16 12 4.53 2557 10.56 11 4.14 3000 11.16 12 4.53 3160 10.56 11 4.34 3225 8.45 8 4.14 3295 10.56 11 4.34 3350 11.16 12 4.53 3467 10.56 11 4.34 3550 11.16 12 4.53 3679 11.16 12 4.53 3700 10.56 11 4.34 3773 11.16 12 4.53 3900 9.75 10 3.9 4113 11.16 12 4.53 4125 11.16 12 4.53 4132 10.56 11 3.9 4150 8.45 8 4.14 4175 10.56 11 4.34 4190 11.16 12 4.53 4202 8.45 8 4.14 4250 11.16 12 4.53
  • 6. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 6 | P a g e 4352 11.16 12 4.53 4500 8.45 8 4.14 4600 8.45 8 4.14 4762 9.75 10 3.9 4850 10.56 11 4.34 4920 11.16 12 4.53 4984 8.45 8 4.14 4998 11.16 12 4.53 Percenta ge Error 2.1 0.04 0.05 4.1 Comparison of Percentage Errors Percentage Error calculates by observing the Experimental value and the Actual Value. By taking the difference between Experimental Value and Actual Value Divided by Actual Value and Multiplied by hundred so that we calculate the Percentage Error by using the following formula. Percentage Error= Experimental Value – Actual Value * 100 Actual Value Table 2. Comparison of Percentage Error Imputation Technique Percentage Error Mean imputation 2.1 Median Imputation 0.04 Standard Deviation Imputation 0.05 By comparing the percentage error of imputation method. The percentage error of Mean Imputation is 2.1 %.The percentage error of Median Imputation is 0.04 and percentage error of Standard Deviation Imputation is 0.05 so the Median Imputation method is having lowest percentage of error as compare to Mean Imputation method and Standard Deviation Imputation method. So the Median Imputation Method is more suitable as compare to other method. CONCLUSION AND FUTURE WORK Missing values are regarded as serious problems in most of the information systems due to unavailability of data and must be impute before the dataset is used. To handle these missing values three techniques are used named as Mean imputation, Median Imputation and Standard Deviation Imputation. Median Imputation method is having lowest percentage of error as compare to Mean Imputation method and Standard Deviation Imputation method. So the Median Imputation Method is more suitable as compare to other method.
  • 7. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 7 | P a g e The proposed work handles missing values only for the numerical attributes. Further it can be extended to handle a categorical attribute. Different classification algorithm can be used for comparative analysis of missing data techniques. Missing data techniques can also be implemented in mat lab. REFERENCES [1] Dinesh J. Prajapati, Jagruti H. Prajapati, and Handling Missing Values: Application to University Data set. Issue 1, Vol. 1(August-2011), ISSN 2249-6149 [2] Shamsher Singh, Prof. Jagdish Prasad, Estimation of Missing Values in the Data Mining and comparison of Imputation Methods. Mathematical Journal of Interdisciplinary Sciences Vol. 1, Issue 1, March 2013, pp. 75–90 [3] Xiao Feng Zhu, Shichao Zhang, Senior Member, IEEE, Zhi Jin, Zili Zhang, and Zhuoming Xu, Missing Value Estimation for Mixed-Attribute Data Sets. IEEE Transactions on Knowledge And Data Engineering, Vol. 23, No. 1, January 2011. [4] T.R.Sivapriya, V. Thavavel, A.R.Nadira Banu Kamal, Imputation and classification of Missing Data Using Least Square Support Vector Machines- A New Approach in Dementia Diagnosis. International Journal of Advanced Research in Artificial Intelligence, Vol.1, No.4, 2012 [5] Yann-Yann Shieh, Imputation Methods on General Linear Mixed Models of Longitudinal Studies, American Institutes for Research [6] ] Edgar Acu˜Na1 And Caroline Rodriguez, The Treatment Of Missing Values And Its Effect In The Classifier Accuracy Studies In Classification, Data Analysis, And Knowledge Organization, 2004, Springer.Com [7] MS. R. Malarvizhi, Dr. Antony Thanamani, Comparision of Imputation Techniques after Classifying the Dataset Using Knn Classifier for the Imputation of Missing Data, International Journal of Computational Engineering Research (IJCER online.com) ISSN 2250-3005, Janaury-2013 [8] Anjana Sharma, Naina Mehta, Iti Sharma, Reasoning With Missing Values in Multi Attribute Datasets. International Journal of Advanced Research in Computer Science and Software Engineering, Volume 3, Issue5, May 2013 ISSN: 2277 128X