The document discusses normal data distribution and provides examples. It defines normal data as being symmetrically distributed about the mean value, with 99.7% of data within 3 standard deviations of the mean. It then generates a sample of 50 random numbers between 14-18 and calculates the mean and standard deviation. A histogram of the deviations from the mean (x-μ) is plotted, showing the normal distribution curve shape.
2. Distribution of Normal Data
A data is said to be normal data if it is
symmetricaly distributed about its mean value and
99.7% of data is withing three times of standar
deviation about both side of mean.
The distibution curve of normal data about mean
is like inverted bell.
3. Population
Population is collection of all possible data.
Number of elements in population are very large.
Size of population is always kept large to
eliminate biasing for selection of particular
element.
4. Population
For example, if you have 10 billion random
numbers between 0 and 1, then probability of
biased selection of 0.0001001 is very small,
practically zero.
Mean or variance of population either given or
not. If mean and variance is not given, we can
not compute it mathematically due to large size of
data.
5. Samples
Sample is data that is collected from the
population.
Size of population is small, hence there is
probability of biased selection. We can compute
mean and variable of the sample.
6. Samples
If sample size is >30, the mean and variance of
the sample is nearly equal to the mean and
sample of population.
Thus, if population is normal, then sample is also
normal.
7. Normal Distribution -
Mathematics
To prove the normal distribution of data, we shall
take a sample of 50 numbers randomly distributed
between 14 and 18 by computer.
16 15 17 18 18
15 14 14 15 18
15 18 17 18 15
15 15 14 16 14
17 18 15 18 16
14 14 16 14 15
18 17 18 17 16
16 15 15 17 15
14 17 14 18 15
16 18 14 17 15
Table 1
8. x x-μ x x-μ x x-μ x x-μ x x-μ
16 0.08 15 -0.92 17 1.08 18 2.08 18 2.08
15 -0.92 14 -1.92 14 -1.92 15 -0.92 18 2.08
15 -0.92 18 2.08 17 1.08 18 2.08 15 -0.92
15 -0.92 15 -0.92 14 -1.92 16 0.08 14 -1.92
17 1.08 18 2.08 15 -0.92 18 2.08 16 0.08
14 -1.92 14 -1.92 16 0.08 14 -1.92 15 -0.92
18 2.08 17 1.08 18 2.08 17 1.08 16 0.08
16 0.08 15 -0.92 15 -0.92 17 1.08 15 -0.92
14 -1.92 17 1.08 14 -1.92 18 2.08 15 -0.92
16 0.08 18 2.08 14 -1.92 17 1.08 15 -0.92
Mean (μ) of the data is 15.92 and standard
deviation (σ) is 1.468. In the following table, x and
x-μ are arranged.
Table 2
9. x-μ f
-1.92 10
-0.92 14
0.08 7
1.08 8
2.08 11
We see that there are unique x-μ values. The
distinct x-μ and their occurance are arranged in
two column table 3, as shown below.
x-μ are positive and negative values. If m is at y-
axis then x-μ lie both sides of the y-axis, i.e. mean
m.
Table 3
10. -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 2.5
0
2
4
6
8
10
12
14
16
Now we plot the data of table 3. It looks like as the
graph given below. We see that all x-μ are within
the ±3σ.
+σ
μ
-σ
11. The graph is not like inverted bell shape. This is
due to less data grouping and small data size.
Here, data is generated by computer. This leads,
homogeniety of the data. Real data is unbiased,
independent and practical, which produce
inverted bell shape graph.
You can take other data and apply the same
method to find the normal data distribution curves.
Further check whether all data is distributed about
mean within ±3σ range or not.
This plot proves deviation of data about mean
within ±3σ visually.