Data cleaning Basics for Managers
1. Data Cleaning
What every manager should learn in data analytics...
Lydia Gitonga
Project Management | Data for Social Good | Business Analytics
2. What is data cleaning?
Why data cleaning?
How?
When?
…………….
3. Data cleaning is the process of preparing
data for analysis. It involves identifying, and
then eliminating or modifying, information
that is incorrect, incomplete or potentially
misleading in your datasets.
What is Data Cleaning?
4. Data ready for use
[Figure: raw sample values under the column labels Address, Age and Date, shown with the formulas $S = \sigma = \sqrt{\frac{\sum (x - \bar{x})^{2}}{n}}$ and $\bar{x} = \frac{\sum x_i}{n}$.]
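The standard deviation formula on this slide can be worked through on a handful of numbers. The sketch below is illustrative only: the function name `population_std` and both lists of ages are made up for the example, and it shows how a single mistyped value can inflate the spread of a column.

```python
import math

def population_std(values):
    """Population standard deviation: sqrt(sum((x - mean)^2) / n)."""
    n = len(values)
    mean = sum(values) / n
    return math.sqrt(sum((x - mean) ** 2 for x in values) / n)

# Hypothetical ages, first as intended and then with one mistyped entry (890 for 89).
clean_ages = [88, 89, 90, 89]
dirty_ages = [88, 89, 90, 890]

clean_spread = population_std(clean_ages)
dirty_spread = population_std(dirty_ages)

print(round(clean_spread, 2))
print(round(dirty_spread, 2))
```

A spread that is far larger than expected for a measure such as age is often the first hint that a column needs cleaning.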
5. End Goal
• What is your goal in analyzing the
data? What is the project or business
goal?
Understanding your data
• What does your data contain?
• How was the data collected?
• How was the data recorded?
Data Cleaning Guiding Principles
7. The data must conform to a set of rules and
constraints that correspond to the real world.
Validity
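A validity check can be as simple as writing each real-world rule as a test. The sketch below is one possible way to do it, not a prescribed method: the field names `age` and `date`, the 0 to 120 age range, the YYYY-MM-DD date format and the sample records are all assumptions chosen for illustration.

```python
from datetime import datetime

def validity_errors(record):
    """Return the list of rule violations for one record (empty list means valid)."""
    errors = []
    age = record.get("age")
    # Assumed rule: age is a whole number between 0 and 120.
    if not isinstance(age, int) or not (0 <= age <= 120):
        errors.append("age must be a whole number between 0 and 120")
    # Assumed rule: date is a real calendar date written as YYYY-MM-DD.
    try:
        datetime.strptime(str(record.get("date")), "%Y-%m-%d")
    except ValueError:
        errors.append("date must be a real calendar date in YYYY-MM-DD form")
    return errors

# Hypothetical records: one valid, one with a negative age,
# one with an impossible age and an impossible date.
records = [
    {"age": 89, "date": "2021-03-14"},
    {"age": -1, "date": "2021-03-14"},
    {"age": 890, "date": "2021-02-30"},
]

for record in records:
    print(record, validity_errors(record))
```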
8. The data must be correct
and accurate. Comparing it
with third-party information
or with information you
already know to be true can
help verify your data.
Accuracy
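Comparing against a trusted source can be sketched as a lookup. In the example below, the `survey` records, the `registry` reference and the helper `mismatches` are hypothetical names invented for illustration; records with no entry in the reference are skipped rather than flagged.

```python
def mismatches(records, reference, key, field):
    """List (key, recorded value, trusted value) where a record disagrees with the reference."""
    flagged = []
    for record in records:
        trusted = reference.get(record[key])
        # Only compare when the reference actually has an entry for this key.
        if trusted is not None and record[field] != trusted:
            flagged.append((record[key], record[field], trusted))
    return flagged

# Hypothetical collected data and a hypothetical trusted registry of ages by id.
survey = [
    {"id": "A1", "age": 89},
    {"id": "A2", "age": 890},
    {"id": "A3", "age": 56},
]
registry = {"A1": 89, "A2": 89}

flagged = mismatches(survey, registry, "id", "age")
print(flagged)
```

Flagged rows are candidates for review, not automatic corrections, since the reference itself may be out of date.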
9. Is the data complete?
Are all important measures
available?
Completeness
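One way to answer "is the data complete?" is to count missing values per required field. The sketch below assumes that a missing value shows up as an absent key, `None` or an empty string; the field names and records are made up for the example.

```python
def missing_counts(records, required_fields):
    """Count, per required field, how many records have no usable value."""
    counts = {field: 0 for field in required_fields}
    for record in records:
        for field in required_fields:
            # Absent key, None and empty string all count as missing; 0 does not.
            if record.get(field) in (None, ""):
                counts[field] += 1
    return counts

# Hypothetical records with gaps in address and date.
records = [
    {"address": "12 Hill Rd", "age": 89, "date": "2021-03-14"},
    {"address": "", "age": 88, "date": None},
    {"age": 90, "date": "2021-03-15"},
]

counts = missing_counts(records, ["address", "age", "date"])
print(counts)
```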
10. The data should not contradict itself.
There should be a reasonable degree of
uniformity across the set of measures.
Consistency
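A consistency check looks for fields within the same record that contradict each other. The sketch below is a minimal illustration under assumed field names (`age`, `birth_year`, `survey_year`, `start_date`, `end_date`) and an assumed tolerance of one year between the stated age and the age implied by the birth year.

```python
from datetime import date

def contradictions(record):
    """Return the list of internal contradictions found in one record."""
    problems = []
    # An activity cannot end before it starts.
    if record["end_date"] < record["start_date"]:
        problems.append("end_date is before start_date")
    # Stated age should roughly match the age implied by the birth year.
    implied_age = record["survey_year"] - record["birth_year"]
    if abs(record["age"] - implied_age) > 1:
        problems.append("age does not match birth_year")
    return problems

# Hypothetical records: the first is consistent, the second contradicts itself twice.
consistent = {
    "age": 30, "birth_year": 1991, "survey_year": 2021,
    "start_date": date(2021, 3, 1), "end_date": date(2021, 3, 5),
}
inconsistent = {
    "age": 89, "birth_year": 1991, "survey_year": 2021,
    "start_date": date(2021, 3, 10), "end_date": date(2021, 3, 1),
}

print(contradictions(consistent))
print(contradictions(inconsistent))
```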