SlideShare a Scribd company logo
1 of 40
Download to read offline
World Bank & Government of The Netherlands funded
Training module # WQ - 47
Basic Statistics
New Delhi, September 2000
CSMRS Building, 4th Floor, Olof Palme Marg, Hauz Khas,
New Delhi – 11 00 16 India
Tel: 68 61 681 / 84 Fax: (+ 91 11) 68 61 685
E-Mail: dhvdelft@del2.vsnl.net.in
DHV Consultants BV & DELFT HYDRAULICS
with
HALCROW, TAHAL, CES, ORG & JPS
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 1
Table of contents
Page
1. Module context 2
2. Module profile 3
3. Session plan 4
4. Overhead/flipchart master 5
5. Evaluation sheets 21
6. Handout 23
7. Additional handout 29
8. Main text 32
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 2
1. Module context
This module introduces basic statistics that can be used by chemists to evaluate the
accuracy of their data. This is a ‘stand alone’ module and no previous training in other
modules is required.
While designing a training course, the relationship between this module and the others,
would be maintained by keeping them close together in the syllabus and place them in a
logical sequence. The actual selection of the topics and the depth of training would, of
course, depend on the training needs of the participants, i.e. their knowledge level and skills
performance upon the start of the course.
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 3
2. Module profile
Title : Basic Statistics
Target group : HIS function(s): Q2, Q3, Q5, Q6
Duration : one session of 90 min
Objectives : After the training the participants will be able to:
• understand difference between accuracy and precision
• calculate descriptors of frequency distributions.
Key concepts : • Frequency distribution
• Errors
Training methods : Lecture, exercises
Training tools
required
: Board, flipchart, OHS
Handouts : As provided in this module
Further reading
and references
: • ‘Analytical Chemistry’, Douglas A. Skoog and Donald M. West,
Saunders College Publishing, 1986 (Chapter 3)
• ‘Statistical Procedures for analysis of Environmenrtal monitoring
Data and Risk Assessment’, Edward A. Mc Bean and Frank A.
Rovers, Prentice Hall, 1998.
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 4
3. Session plan
No Activities Time Tools
1 Preparations
2 Introduction:
• Discuss the need for statistic as a tool for evaluation of data
10 min OHS
3 Frequency distributions
• construction of histograms
• descriptors of distributions
• normal distributions
25 min OHS
Blackboard
4 Precision and accuracy
• nature of errors
• measures of precision and accuracy
15 min OHS
Blackboard
5 Propagation of errors 10 min OHS
6 Exercise 20 min Handout
7 Wrap up 10 min
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 5
4. Overhead/flipchart master
OHS format guidelines
Type of text Style Setting
Headings: OHS-Title Arial 30-36, with bottom border line (not:
underline)
Text: OHS-lev1
OHS-lev2
Arial 24-26, maximum two levels
Case: Sentence case. Avoid full text in UPPERCASE.
Italics: Use occasionally and in a consistent way
Listings: OHS-lev1
OHS-lev1-Numbered
Big bullets.
Numbers for definite series of steps. Avoid
roman numbers and letters.
Colours: None, as these get lost in photocopying and
some colours do not reproduce at all.
Formulas/Equat
ions
OHS-Equation Use of a table will ease horizontal alignment
over more lines (columns)
Use equation editor for advanced formatting
only
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 6
Basic Statistics
• Observed data are samples of ‘population’
• Statistics is used to predict population characteristics
• This module will cover
- Frequency distribution
- Errors
- Propagation of errors
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 7
Frequency Distribution
• Probability of occurrence of a measurement
• Histograms
- measurements arranged in discrete classes
- number of observations in class gives probability of occurrence
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 8
Frequency Distribution
Class Value No. of Values in
the Class
Frequency
(1) (2) (3) (4)
51-52 51.8 1 0.0227
52-53 52.0, 52.8, 52.8 3 0.0682
53-54 53.1, 53.1, 53.3, 53.4, 53.5, 53.6, 53.6,
53.9
8 0.1818
54-55 54.0, 54.1, 54.3, 54.3, 54.4, 54.5, 54.5,
54.6, 54.7, 54.7, 54.9
11 0.2500
55-56 55.1, 55.1, 55.3, 55.3, 55.4, 55.4, 55.6,
55.7, 55.7, 55.8, 55.9, 55.9
12 0.2727
56-57 56.2, 56.3, 56.7, 56.8, 56.9, 56.9 6 0.1364
57-58 57.3, 57.5 2 0.0454
58-59 - 0
59-60 59.1 1 0.0227
Table 1. Results of 44 replicate analyses for hardness, mg/l as CaCO3
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 9
Frequency Distribution
Figure 1: Example of a frequency distribution diagram
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 10
n
x
x i∑
=
Distribution Characteristics (1)
• Central tendency
- Arithmetic mean
- Geometric mean
G = (x1 . x2 … xn)1/n
- Median
Middle value of distribution
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 11
( ) ( )1nxxs
2
i −∑ −=
Distribution Characteristics (2)
• Spread of data
- Tendency to cluster around the mean
- Standard deviation
- Large value indicates spread of data
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 12
Normal Distribution
• Frequency distributions for random observations
• Total number of observations is large
σ
( )
πσ
σµ
2
e
y
22
i 2x −−
=
( )∑ −= Nxx
2
i
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 13
Normal Distribution
Figure 2: Normal distribution curve and areas under curve for different
distance from the mean
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 14
Classes of Errors
• Indeterminate (random)
- results fluctuate randomly
- made up of small uncertainties
- tend to cancel, but also act in concert
• Determinate (system)
- method, equipment, personal
- unidirectional
- can be identified
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 15
Precision
• Reproducibility of results
• Concerns random errors
• Can be measured as
- deviation from the mean
- range
- standard deviation
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 16
Coefficient of variation
• Relative standard deviation
CV
• Dimensionless can be used to compare two or more sets of
data
x
s100
=
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 17
Coefficient of variation
Example: Compare variability of 3 sets of data
A B C
40.0 19.9 37.0
29.2 24.1 33.4
18.6 22.1 36.1
29.3 19.8 40.2
x 29.28 21.48 36.68
s 8.74 2.05 2.80
CV 0.30 0.10 0.07
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 18
Accuracy
• Total errors from expected value
- systematic and random
- when precision is high accuracy tends to systematic error
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 19
Propagation of Errors (1)
• Error is commonly expressed as standard deviation
- a (± sa)
• When two or more observations are added or subtracted the
error of the result:
2
b
2
at sss +=
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 20
Propagation of Errors (2)
• When the final result is product or quotient calculate total error
from relative errors
- For
2
b
2
a
r,t
b
s
a
s
s 





+





=
( )
( )
)s(c
sb
sa
t
b
a
±=
±
±
r,tt s.
b
a
s =
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 21
5. Evaluation sheets
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 22
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 23
6. Handout
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 24
Basic Statistics
• Observed data are samples of ‘population’
• Statistics is used to predict population characteristics
- Frequency distribution
- Errors
- Propagation
Frequency Distribution
• Probability of occurrence of a measurement
• Histograms
- measurements arranged in discrete classes
- number of observations in class gives probability of occurrence
Table 1. Results of 44 replicate analyses for hardness, mg/l as CaCO3
Class Value No. of Values in
the Class
Frequency
(1) (2) (3) (4)
51-52 51.8 1 0.0227
52-53 52.0, 52.8, 52.8 3 0.0682
53-54 53.1, 53.1, 53.3, 53.4, 53.5, 53.6,
53.6, 53.9
8 0.1818
54-55 54.0, 54.1, 54.3, 54.3, 54.4, 54.5,
54.5, 54.6, 54.7, 54.7, 54.9
11 0.2500
55-56 55.1, 55.1, 55.3, 55.3, 55.4, 55.4,
55.6, 55.7, 55.7, 55.8, 55.9, 55.9
12 0.2727
56-57 56.2, 56.3, 56.7, 56.8, 56.9, 56.9 6 0.1364
57-58 57.3, 57.5 2 0.0454
58-59 - 0
59-60 59.1 1 0.0227
Figure 1. Example of a frequency distribution diagram
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 25
n
x
x i∑
=
( ) ( )1nxxs
2
i −∑ −=
Distribution Characteristics (1)
• Central tendency
- Arithmetic mean
- Geometric mean
G = (x1 . x2 … xn)1/n
- Median
Middle value of distribution
• Spread of data
- Tendency to cluster around the mean
- Standard deviation
- Large value indicates spread of data
Normal Distribution
• Frequency distributions for random observations
• Total number of observations is large
Figure 2. Normal distribution curve and areas under curve for different distances
( )
πσ
=
σµ−−
2
e
y
22x
2
i
( )∑ −=σ Nxx
2
i
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 26
Classes of Errors
• Indeterminate (random)
- results fluctuate randomly
- made up of small uncertainties
- tend to cancel, but also act in concert
• Determinate (system)
- method, equipment, personal
- unidirectional
- can be identified
• Reproducability of results
• Concerns random errors
• Can be measured as
- deviation from the mean
- range
- standard deviation
Coefficient of variation
• Relative standard deviation
• Dimensionless can be used to compare two or more sets of data
Example: Compare variability of 3 sets of data
A B C
40.0 19.9 37.0
29.2 24.1 33.4
18.6 22.1 36.1
29.3 19.8 40.2
x 29.28 21.48 36.68
s 8.74 2.05 2.80
COV 0.30 0.10 0.07
Accuracy
• Total errors from expected value
- systematic and random
- when precision is high accuracy
- tends to systematic error
x
s100
COV =
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 27
Propagation of Errors (1)
• Error is commonly expressed as standard deviation
- a(± Sa)
• When two or more observations are added on the subtracted error of the result:
• When the final result is product or quotient calculate total error from relative
errors
- For
2
b
2
at sss +=
2
b
2
a
r,t
b
s
a
s
s 





+





=
( )
( )b
a
sb
sa
±
±
r,tt s.
b
a
s =
HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 28
Add copy of Main text in chapter 8, for all participants.
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 29
7. Additional handout
These handouts are distributed during delivery and contain test questions, answers to
questions, special worksheets, optional information, and other matters you would not like to
be seen in the regular handouts.
It is a good practice to pre-punch these additional handouts, so the participants can easily
insert them in the main handout folder.
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 30
Problem:
Estimate the average deviation and standard deviation of the lead concentration data given
below:
Lead cone
µg/L
37.0
34.9
28.4
27.3
31.8
28.3
26.9
19.9
31.8
32.6
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 31
Solution:
Lead cone
µg/L
x –x (x –x)2
37.0 7.1 50.41
34.9 5.0 25.0
28.4 1.5 2.25
27.3 2.6 6.76
31.8 1.9 3.61
28.3 1.6 2.56
26.9 3 9.0
19.9 10 100.0
31.8 1.9 3.61
32.6 2.7 7.29
Mean 29.9 ∑ = 37.3 ∑ = 210.49
Average deviation = 37.3 / 10 = 3.73
Standard deviation = 84.4949.210 =
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 32
8. Main text
Contents
1. Frequency distribution 1
2. Precision and Accuracy of Experimental Data 4
3. Propagation of errors 6
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 1
Basic Statistics
Every physical measurement is subject to a degree of uncertainty. A result that is not
particularly accurate may be of great use if the limits of the errors affecting it can be set with
a high degree of certainty. Statistical methods are used to evaluate the probable errors in
analytical measurements and interpretation of environmental data. This module discusses
some basic concepts of statistics, which a chemist should be conversant with in order to
evaluate the accuracy of the laboratory data and the estimation of the environmental
characteristics based on the results of limited samples.
1. Frequency distribution
Results of 44 replicate analyses for hardness of a sample of water are given in Table 1.
Table 1. Results of 44 replicate analyses for hardness, mg/l as CaCO3
Class Value No. of Values
in the Class
Frequency
(1) (2) (3) (4)
51-52 51.8 1 0.0227
52-53 52.0, 52.8, 52.8 3 0.0682
53-54 53.1, 53.1, 53.3, 53.4, 53.5,
53.6, 53.6, 53.9
8 0.1818
54-55 54.0, 54.1, 54.3, 54.3, 54.4,
54.5, 54.5, 54.6, 54.7, 54.7,
54.9
11 0.2500
55-56 55.1, 55.1, 55.3, 55.3, 55.4,
55.4, 55.6, 55.7, 55.7, 55.8,
55.9, 55.9
12 0.2727
56-57 56.2, 56.3, 56.7, 56.8, 56.9,
56.9
6 0.1364
57-58 57.3, 57.5 2 0.0454
58-59 - 0
59-60 59.1 1 0.0227
The data are classified in 9 classes, column (1) and arranged in each class according to
their magnitude, column (2). By convention an observation equal to the upper value of the
class is counted in the next higher class. For example the value 52.0 is not placed in class
51-52 but in 52-53. The number of values in each class are given in column (3). The ratio of
number of values in a class to the total number of observations is called frequency and is
given in column (4).
A plot of data of column (1) vs. column (4) is called frequency distribution diagram and is
shown in Figure 1. For clarity of presentation, the frequency value is multiplied by 100 and
shown as %.
Note that if the number of observations is increased and the class interval is reduced, the
histogram can be replaced by a continuous curve.
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 2
Figure 1: Example of a frequency distribution diagram
1.1 Central Tendency
Arithmetic mean: The arithmetic mean, x, of a set of data is calculated by adding all the
observed values of variable (results of analyses) and dividing it by the total number of
observations:
(1)
where x1, x2,........xn are the observed values and n is the total number of observations.
The arithmetic mean is the most common measure of the central tendency. The mean value
of the data of Table 1 was calculated as 54.9
Geometric mean: When there are a few very high values or very low, such as in the cases
of bacteriological analysis, the arithmetic mean is not necessarily representative of the
central tendency. In such cases the geometric mean, g, is used:
g = (2)
Median: The median is the middle value of the set of data. If the sample size n is an odd
number, one-half of the values exceed the median and one-half are less. When n is even, it
is average of the two middle terms. For the data of Table 1, it is 54.8, which is the average of
54.7 and 54.9, the 22nd
and the 23rd
terms, when the data are arranged in ascending order.
Mode: The mode is the most commonly occurring value. It is not widely employed as it
forms a poor basis for any further arithmetic calculations.
1.2 Standard deviation
The data of Table 1 show that 52% of observations lie in the range of 54 - 56 and 84% in the
range of 53-57. This tendency of observations to cluster (or not to cluster) around the mean
value, 54.9, is measured by standard deviation, s. Standard deviation is calculated as:
( ) nx............xxx n21 ++=
( ) n
1
n321 x................xxx ××
)1n/(})xx.......()xx()xx{(s 2
n
2
2
2
1 −−+−+−=
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 3
or (3)
A small value of s signifies that most of the observations are close to the mean value. A
large value indicates that the observed values are spread over a larger range. For example,
if the data of Table 1 has more observations in 52-53 and 57-58 classes and
correspondingly lesser number in the central classes 54-55 and 55-56, the frequency
distribution diagram will be flatter and spread wider at the base and the value of s will be
larger.
The standard deviation has the same units as the quantity measured. The standard
deviation for the data of Table 1 was calculated as 1.545 mg/L.
The square of the standard deviation is called the variance. If the random distribution of data
is due to r different reasons, the total variance is the sum of individual variance.
1.3 Normal Distribution
Most frequency distributions for random observations, when the total number of observations
is very large tending to be the same as the population, N, conform to normal distribution. The
distribution is give by the theoretical equation:
(4)
where y = frequency of observations µ = the mean, and σ = standard deviation. Note the
distinction made in the notations for number of observations, mean and standard deviation
for a sample of a limited set of data used earlier.
The standard deviation is given by:
(5)
For a normal distribution, 68.3 % of the total observations lie in the range µ + σ, 95.5% in the
range µ + 2σ and 99.7% in the range µ + 3σ. This is illustrated in Figure 2.
The number of observations between any two limits of the variable can be equated to the
area bounded by the frequency distribution curve, the ordinates at these limits and the x
axis.
When the data set is small, more often than not, sample mean, x will differ somewhat from
the population mean, µ. Any error in x causes a corresponding error in the standard
deviation calculated with Equation 5. There is a tendency for the standard deviation value
calculated to be small as the number of measurements becomes smaller. It can be shown
that this bias can be largely eliminated by substituting the degree of freedom as (n-1) in the
calculation, Equation (3), which defines the standard deviation for a limited set of
measurements.
( )
( )1n
nxx
s
2
i
2
i
−
∑ ∑−
=
( )
πσ
=
σµ−−
2
e
y
22
i 2x
( )
N
Nxx
2
i
2
i∑ ∑−
=σ
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 4
The rationale for using (n-1) is as follows. When µ is not known, we must extract two
quantities, namely, x and s from the set of data. The need to establish x from the data
removes one degree of freedom: that is if their signs are retained, the sum of individual
deviations must total zero; once (n-1) deviations have been established, the final deviation is
necessarily known. Thus only (n-1) deviations provide independent measures for the
standard deviation of the set.
2. Precision and Accuracy of Experimental Data
2.1 Classes of errors
The phenomena that are responsible for uncertainties in an analytical measurement can be
divided in two broad categories: determinate or system errors and indeterminate or random
errors. The total error is the sum of the two types of errors.
Determinate errors have assignable causes and are unidirectional. It may be possible to
eliminate some of these, for example, errors because of improper calibration of the
equipment, or presence of interfering substances in the sample. Errors inherent in the
method, such as addition of a reagent in excess of theoretical requirement to cause an
indicator to undergo colour change in a titration, may be difficult to eliminate.
Indeterminate errors are encountered whenever a measuring system is extended to its
maximum sensitivity. The results fluctuate in a random manner about a mean value. The
sources of these fluctuations can never be identified because they are made up of a myriad
of instrumental, personal and method uncertainties that are individually so small that they
can never be detected. For example, in he case of addition of an exact volume of reagent in
an analysis through a pipette the uncertainties could be the time allowed for draining the
pipette, the angle at which the pipette is held during delivery, temperature of the reagent,
visual judgement of water level with respect to the graduation mark, etc. What is observed in
the final result is then a summation of a very large number of minute unobservable
uncertainties. The cumulative effect is likewise variable. Ordinarily they tend to cancel one
other and thus exert a minimal effect. Occasionally, however, they act in concert to produce
a relatively large positive or negative error.
2.2 Precision
The term precision is employed to describe the reproducibility of results. It can be defined as
the agreement between the numerical values of two or more measurements that have been
made in an identical fashion.
Absolute methods for expressing precision: The absolute average deviation from the
mean is a common method for describing precision. The spread or range of a set of data is
also a measure of precision and is simply the numerical difference of the highest and the
lowest result. Standard deviation, described earlier, is a more significant measure of
precision.
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 5
Relative methods for expressing precision: It is frequently more informative to indicate
the relative precision. The relative parameters are dimensionless and therefore can be used
to compare two ore more sets of data. Thus, for example, the relative deviation of analysis 1,
in the above example, is (0.077 X 100)/24.313 = 0.32%.
Relative standard deviation or coefficient of variation is defined as:
(6)
2.3 Accuracy
The term accuracy is used to describe the total error of the observation, which is the sum of
the systematic and random errors. It denotes the nearness of the measurement to its
accepted value.
In Example 1, if the accepted value for the chloride concentration is 24.35 mg/L, the absolute
error of the mean is 24.31 – 24.35 = - 0.04 mg/L.
x
s100
CV =
Example 1
From the data of replicate chloride analysis of a water sample given below, calculate the
precision in terms of the average deviation from the mean:
Analysis Chloride, mg/L Abs. dev. from the Mean, mg/L
1 24.39 0.077
2 24.19 0.123
3 24.36 0.047
Total 72.94 0.247
Mean = 24.313 ave. dev. = 0.247/3 = 0.082
Example 2
Monitoring results at three sampling locations are listed below. Compare the sampling
records in terms of variability.
A B C
40.0 19.9 37.0
29.2 24.1 33.4
18.6 22.1 36.1
29.3 19.8 40.2
x 29.28 21.48 36.68
s 8.74 2.05 2.80
CV 0.30 0.10 0.07
In terms of the coefficient of variation the concentration at A varies the most, followed by
B and then C.
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 6
3. Propagation of errors
The indeterminate error or uncertainty is most commonly expresses as standard deviation.
When the final result is computed from two or more data, each of which has an
indeterminate error associated with it, the error of the result will depend on the arithmetic
computations. The following examples illustrate the procedure to be followed:
Example 3:
For the sum + 0.50 (± 0.02)
+ 4.10 (± 0.03)
- 1.97 (± 0.05)
---------------------
+ 2.63 (± ? ?)
where the numbers in parantheses are the absolute standard deviations. Statistically the
most probable standard deviation would be given by the square root of the sum of individual
variances:
= ± 0.06
Therefore the result could be reported as:
2.63 (± 0.06)
Example 4:
The case when products and quotients are involved is illustrated in the following
calculations:
In this case it is necessary to calculate the relative standard deviations:
( ) ( ) ( )222
05.003.002.0s ±+±+±=
( ) ( )
( )
?)(0104.0
04.097.1
0001.00050.002.010.4
±=
±
±×±
( ) 0049.0
10.4
02.0
S ra ±=
±
=
( ) 020.0
005.0
0001.0
S rb ±=
±
=
( ) 020.0
97.1
04.0
S rc ±=
±
=
( ) ( ) ( ) ( ) 029.002.002.00049.0S
222
ry ±=±+±+±=
HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 7
The absolute standard deviation of the result will beL
Sy = 0.0104 × (± 0.029) = ± (0.0003)
Therefore the uncertainty of the result can be written as:
0.0104 (± 0.0003)
For calculations that involve both sums and differences as well as products and quotients,
the uncertainties associated with the former are evaluated first and then the latter following
procedures illustrated in the two examples.

More Related Content

Similar to Download-manuals-surface water-software-47basicstatistics

Download-manuals-surface water-software-48appliedstatistics
 Download-manuals-surface water-software-48appliedstatistics Download-manuals-surface water-software-48appliedstatistics
Download-manuals-surface water-software-48appliedstatisticshydrologyproject001
 
Download-manuals-surface water-software-48appliedstatistics
 Download-manuals-surface water-software-48appliedstatistics Download-manuals-surface water-software-48appliedstatistics
Download-manuals-surface water-software-48appliedstatisticshydrologyproject0
 
Download-manuals-surface water-software-48appliedstatistics
 Download-manuals-surface water-software-48appliedstatistics Download-manuals-surface water-software-48appliedstatistics
Download-manuals-surface water-software-48appliedstatisticshydrologyproject0
 
Download-manuals-surface water-waterlevel-41howtoanalysedischargedata
 Download-manuals-surface water-waterlevel-41howtoanalysedischargedata Download-manuals-surface water-waterlevel-41howtoanalysedischargedata
Download-manuals-surface water-waterlevel-41howtoanalysedischargedatahydrologyproject001
 
Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
 Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec... Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...hydrologywebsite1
 
Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
 Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec... Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...hydrologyproject001
 
Download-manuals-surface water-waterlevel-36howtocarryoutsecondaryvalidation...
 Download-manuals-surface water-waterlevel-36howtocarryoutsecondaryvalidation... Download-manuals-surface water-waterlevel-36howtocarryoutsecondaryvalidation...
Download-manuals-surface water-waterlevel-36howtocarryoutsecondaryvalidation...hydrologyproject001
 
Engineering Data Analysis-ProfCharlton
Engineering Data  Analysis-ProfCharltonEngineering Data  Analysis-ProfCharlton
Engineering Data Analysis-ProfCharltonCharltonInao1
 
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisWorkshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisOlga Scrivner
 
Download-manuals-surface water-waterlevel-33howtoreportonstagedischargedata
 Download-manuals-surface water-waterlevel-33howtoreportonstagedischargedata Download-manuals-surface water-waterlevel-33howtoreportonstagedischargedata
Download-manuals-surface water-waterlevel-33howtoreportonstagedischargedatahydrologyproject001
 
Download-manuals-surface water-waterlevel-30howtovalidateratingcurve
 Download-manuals-surface water-waterlevel-30howtovalidateratingcurve Download-manuals-surface water-waterlevel-30howtovalidateratingcurve
Download-manuals-surface water-waterlevel-30howtovalidateratingcurvehydrologyproject001
 
master defense hyun-wong choi_2019_05_14_rev19
master defense hyun-wong choi_2019_05_14_rev19master defense hyun-wong choi_2019_05_14_rev19
master defense hyun-wong choi_2019_05_14_rev19Hyun Wong Choi
 
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworkshydrologyproject001
 
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworkshydrologywebsite1
 
Symbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine LearningSymbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine LearningSteffen Staab
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Dmitry Grapov
 
Lecture_2_Stats.pdf
Lecture_2_Stats.pdfLecture_2_Stats.pdf
Lecture_2_Stats.pdfpaijitk
 
how to build a Length of Stay model for a ProofOfConcept project
how to build a Length of Stay model for a ProofOfConcept projecthow to build a Length of Stay model for a ProofOfConcept project
how to build a Length of Stay model for a ProofOfConcept projectZenodia Charpy
 
Giachos_MSc_Thesis_v05
Giachos_MSc_Thesis_v05Giachos_MSc_Thesis_v05
Giachos_MSc_Thesis_v05Fanis Giachos
 

Similar to Download-manuals-surface water-software-47basicstatistics (20)

Download-manuals-surface water-software-48appliedstatistics
 Download-manuals-surface water-software-48appliedstatistics Download-manuals-surface water-software-48appliedstatistics
Download-manuals-surface water-software-48appliedstatistics
 
Download-manuals-surface water-software-48appliedstatistics
 Download-manuals-surface water-software-48appliedstatistics Download-manuals-surface water-software-48appliedstatistics
Download-manuals-surface water-software-48appliedstatistics
 
Download-manuals-surface water-software-48appliedstatistics
 Download-manuals-surface water-software-48appliedstatistics Download-manuals-surface water-software-48appliedstatistics
Download-manuals-surface water-software-48appliedstatistics
 
Download-manuals-surface water-waterlevel-41howtoanalysedischargedata
 Download-manuals-surface water-waterlevel-41howtoanalysedischargedata Download-manuals-surface water-waterlevel-41howtoanalysedischargedata
Download-manuals-surface water-waterlevel-41howtoanalysedischargedata
 
Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
 Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec... Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
 
Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
 Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec... Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
Download-manuals-water quality-wq-training-44howtocarryoutcorrelationandspec...
 
Download-manuals-surface water-waterlevel-36howtocarryoutsecondaryvalidation...
 Download-manuals-surface water-waterlevel-36howtocarryoutsecondaryvalidation... Download-manuals-surface water-waterlevel-36howtocarryoutsecondaryvalidation...
Download-manuals-surface water-waterlevel-36howtocarryoutsecondaryvalidation...
 
Engineering Data Analysis-ProfCharlton
Engineering Data  Analysis-ProfCharltonEngineering Data  Analysis-ProfCharlton
Engineering Data Analysis-ProfCharlton
 
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data AnalysisWorkshop nwav 47 - LVS - Tool for Quantitative Data Analysis
Workshop nwav 47 - LVS - Tool for Quantitative Data Analysis
 
Download-manuals-surface water-waterlevel-33howtoreportonstagedischargedata
 Download-manuals-surface water-waterlevel-33howtoreportonstagedischargedata Download-manuals-surface water-waterlevel-33howtoreportonstagedischargedata
Download-manuals-surface water-waterlevel-33howtoreportonstagedischargedata
 
Download-manuals-surface water-waterlevel-30howtovalidateratingcurve
 Download-manuals-surface water-waterlevel-30howtovalidateratingcurve Download-manuals-surface water-waterlevel-30howtovalidateratingcurve
Download-manuals-surface water-waterlevel-30howtovalidateratingcurve
 
Lecture-2 Applied ML .pptx
Lecture-2 Applied ML .pptxLecture-2 Applied ML .pptx
Lecture-2 Applied ML .pptx
 
master defense hyun-wong choi_2019_05_14_rev19
master defense hyun-wong choi_2019_05_14_rev19master defense hyun-wong choi_2019_05_14_rev19
master defense hyun-wong choi_2019_05_14_rev19
 
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 
Symbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine LearningSymbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine Learning
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
 
Lecture_2_Stats.pdf
Lecture_2_Stats.pdfLecture_2_Stats.pdf
Lecture_2_Stats.pdf
 
how to build a Length of Stay model for a ProofOfConcept project
how to build a Length of Stay model for a ProofOfConcept projecthow to build a Length of Stay model for a ProofOfConcept project
how to build a Length of Stay model for a ProofOfConcept project
 
Giachos_MSc_Thesis_v05
Giachos_MSc_Thesis_v05Giachos_MSc_Thesis_v05
Giachos_MSc_Thesis_v05
 

More from hydrologyproject0

More from hydrologyproject0 (20)

Specification purewater
Specification purewaterSpecification purewater
Specification purewater
 
Water purifier ion exchange resin column
Water purifier ion exchange resin columnWater purifier ion exchange resin column
Water purifier ion exchange resin column
 
Water purifier, double distillation unit
Water purifier, double distillation unitWater purifier, double distillation unit
Water purifier, double distillation unit
 
Water purifier, distillation unit (automatic)
Water purifier, distillation unit (automatic)Water purifier, distillation unit (automatic)
Water purifier, distillation unit (automatic)
 
Water geyser (heater)
Water geyser (heater)Water geyser (heater)
Water geyser (heater)
 
Water bath, general purpose
Water bath, general purposeWater bath, general purpose
Water bath, general purpose
 
Water bath, bacteriological
Water bath, bacteriologicalWater bath, bacteriological
Water bath, bacteriological
 
Wash bottle
Wash bottleWash bottle
Wash bottle
 
Volumetric flask
Volumetric flaskVolumetric flask
Volumetric flask
 
Visual accumulation tube
Visual accumulation tubeVisual accumulation tube
Visual accumulation tube
 
Vacuum pump
Vacuum pumpVacuum pump
Vacuum pump
 
Turbidity meter (nephelometer)
Turbidity meter (nephelometer)Turbidity meter (nephelometer)
Turbidity meter (nephelometer)
 
Tool kit
Tool kitTool kit
Tool kit
 
Toc analyser, microprocessor controlled
Toc analyser, microprocessor controlledToc analyser, microprocessor controlled
Toc analyser, microprocessor controlled
 
Tissue grinder
Tissue grinderTissue grinder
Tissue grinder
 
Thermometers, set of
Thermometers, set ofThermometers, set of
Thermometers, set of
 
Test tubes
Test tubesTest tubes
Test tubes
 
Test sieves with shaker
Test sieves with shakerTest sieves with shaker
Test sieves with shaker
 
Stirrer, magnetic (with hot plate)
Stirrer, magnetic (with hot plate)Stirrer, magnetic (with hot plate)
Stirrer, magnetic (with hot plate)
 
Sterliser, autoclave
Sterliser, autoclaveSterliser, autoclave
Sterliser, autoclave
 

Recently uploaded

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Recently uploaded (20)

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Download-manuals-surface water-software-47basicstatistics

  • 1. World Bank & Government of The Netherlands funded Training module # WQ - 47 Basic Statistics New Delhi, September 2000 CSMRS Building, 4th Floor, Olof Palme Marg, Hauz Khas, New Delhi – 11 00 16 India Tel: 68 61 681 / 84 Fax: (+ 91 11) 68 61 685 E-Mail: dhvdelft@del2.vsnl.net.in DHV Consultants BV & DELFT HYDRAULICS with HALCROW, TAHAL, CES, ORG & JPS
  • 2. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 1 Table of contents Page 1. Module context 2 2. Module profile 3 3. Session plan 4 4. Overhead/flipchart master 5 5. Evaluation sheets 21 6. Handout 23 7. Additional handout 29 8. Main text 32
  • 3. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 2 1. Module context This module introduces basic statistics that can be used by chemists to evaluate the accuracy of their data. This is a ‘stand alone’ module and no previous training in other modules is required. While designing a training course, the relationship between this module and the others, would be maintained by keeping them close together in the syllabus and place them in a logical sequence. The actual selection of the topics and the depth of training would, of course, depend on the training needs of the participants, i.e. their knowledge level and skills performance upon the start of the course.
  • 4. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 3 2. Module profile Title : Basic Statistics Target group : HIS function(s): Q2, Q3, Q5, Q6 Duration : one session of 90 min Objectives : After the training the participants will be able to: • understand difference between accuracy and precision • calculate descriptors of frequency distributions. Key concepts : • Frequency distribution • Errors Training methods : Lecture, exercises Training tools required : Board, flipchart, OHS Handouts : As provided in this module Further reading and references : • ‘Analytical Chemistry’, Douglas A. Skoog and Donald M. West, Saunders College Publishing, 1986 (Chapter 3) • ‘Statistical Procedures for analysis of Environmenrtal monitoring Data and Risk Assessment’, Edward A. Mc Bean and Frank A. Rovers, Prentice Hall, 1998.
  • 5. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 4 3. Session plan No Activities Time Tools 1 Preparations 2 Introduction: • Discuss the need for statistic as a tool for evaluation of data 10 min OHS 3 Frequency distributions • construction of histograms • descriptors of distributions • normal distributions 25 min OHS Blackboard 4 Precision and accuracy • nature of errors • measures of precision and accuracy 15 min OHS Blackboard 5 Propagation of errors 10 min OHS 6 Exercise 20 min Handout 7 Wrap up 10 min
  • 6. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 5 4. Overhead/flipchart master OHS format guidelines Type of text Style Setting Headings: OHS-Title Arial 30-36, with bottom border line (not: underline) Text: OHS-lev1 OHS-lev2 Arial 24-26, maximum two levels Case: Sentence case. Avoid full text in UPPERCASE. Italics: Use occasionally and in a consistent way Listings: OHS-lev1 OHS-lev1-Numbered Big bullets. Numbers for definite series of steps. Avoid roman numbers and letters. Colours: None, as these get lost in photocopying and some colours do not reproduce at all. Formulas/Equat ions OHS-Equation Use of a table will ease horizontal alignment over more lines (columns) Use equation editor for advanced formatting only
  • 7. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 6 Basic Statistics • Observed data are samples of ‘population’ • Statistics is used to predict population characteristics • This module will cover - Frequency distribution - Errors - Propagation of errors
  • 8. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 7 Frequency Distribution • Probability of occurrence of a measurement • Histograms - measurements arranged in discrete classes - number of observations in class gives probability of occurrence
  • 9. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 8 Frequency Distribution Class Value No. of Values in the Class Frequency (1) (2) (3) (4) 51-52 51.8 1 0.0227 52-53 52.0, 52.8, 52.8 3 0.0682 53-54 53.1, 53.1, 53.3, 53.4, 53.5, 53.6, 53.6, 53.9 8 0.1818 54-55 54.0, 54.1, 54.3, 54.3, 54.4, 54.5, 54.5, 54.6, 54.7, 54.7, 54.9 11 0.2500 55-56 55.1, 55.1, 55.3, 55.3, 55.4, 55.4, 55.6, 55.7, 55.7, 55.8, 55.9, 55.9 12 0.2727 56-57 56.2, 56.3, 56.7, 56.8, 56.9, 56.9 6 0.1364 57-58 57.3, 57.5 2 0.0454 58-59 - 0 59-60 59.1 1 0.0227 Table 1. Results of 44 replicate analyses for hardness, mg/l as CaCO3
  • 10. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 9 Frequency Distribution Figure 1: Example of a frequency distribution diagram
  • 11. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 10 n x x i∑ = Distribution Characteristics (1) • Central tendency - Arithmetic mean - Geometric mean G = (x1 . x2 … xn)1/n - Median Middle value of distribution
  • 12. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 11 ( ) ( )1nxxs 2 i −∑ −= Distribution Characteristics (2) • Spread of data - Tendency to cluster around the mean - Standard deviation - Large value indicates spread of data
  • 13. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 12 Normal Distribution • Frequency distributions for random observations • Total number of observations is large σ ( ) πσ σµ 2 e y 22 i 2x −− = ( )∑ −= Nxx 2 i
  • 14. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 13 Normal Distribution Figure 2: Normal distribution curve and areas under curve for different distance from the mean
  • 15. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 14 Classes of Errors • Indeterminate (random) - results fluctuate randomly - made up of small uncertainties - tend to cancel, but also act in concert • Determinate (system) - method, equipment, personal - unidirectional - can be identified
  • 16. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 15 Precision • Reproducibility of results • Concerns random errors • Can be measured as - deviation from the mean - range - standard deviation
  • 17. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 16 Coefficient of variation • Relative standard deviation CV • Dimensionless can be used to compare two or more sets of data x s100 =
  • 18. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 17 Coefficient of variation Example: Compare variability of 3 sets of data A B C 40.0 19.9 37.0 29.2 24.1 33.4 18.6 22.1 36.1 29.3 19.8 40.2 x 29.28 21.48 36.68 s 8.74 2.05 2.80 CV 0.30 0.10 0.07
  • 19. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 18 Accuracy • Total errors from expected value - systematic and random - when precision is high accuracy tends to systematic error
  • 20. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 19 Propagation of Errors (1) • Error is commonly expressed as standard deviation - a (± sa) • When two or more observations are added or subtracted the error of the result: 2 b 2 at sss +=
  • 21. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 20 Propagation of Errors (2) • When the final result is product or quotient calculate total error from relative errors - For 2 b 2 a r,t b s a s s       +      = ( ) ( ) )s(c sb sa t b a ±= ± ± r,tt s. b a s =
  • 22. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 21 5. Evaluation sheets
  • 23. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 22
  • 24. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 23 6. Handout
  • 25. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 24 Basic Statistics • Observed data are samples of ‘population’ • Statistics is used to predict population characteristics - Frequency distribution - Errors - Propagation Frequency Distribution • Probability of occurrence of a measurement • Histograms - measurements arranged in discrete classes - number of observations in class gives probability of occurrence Table 1. Results of 44 replicate analyses for hardness, mg/l as CaCO3 Class Value No. of Values in the Class Frequency (1) (2) (3) (4) 51-52 51.8 1 0.0227 52-53 52.0, 52.8, 52.8 3 0.0682 53-54 53.1, 53.1, 53.3, 53.4, 53.5, 53.6, 53.6, 53.9 8 0.1818 54-55 54.0, 54.1, 54.3, 54.3, 54.4, 54.5, 54.5, 54.6, 54.7, 54.7, 54.9 11 0.2500 55-56 55.1, 55.1, 55.3, 55.3, 55.4, 55.4, 55.6, 55.7, 55.7, 55.8, 55.9, 55.9 12 0.2727 56-57 56.2, 56.3, 56.7, 56.8, 56.9, 56.9 6 0.1364 57-58 57.3, 57.5 2 0.0454 58-59 - 0 59-60 59.1 1 0.0227 Figure 1. Example of a frequency distribution diagram
  • 26. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 25 n x x i∑ = ( ) ( )1nxxs 2 i −∑ −= Distribution Characteristics (1) • Central tendency - Arithmetic mean - Geometric mean G = (x1 . x2 … xn)1/n - Median Middle value of distribution • Spread of data - Tendency to cluster around the mean - Standard deviation - Large value indicates spread of data Normal Distribution • Frequency distributions for random observations • Total number of observations is large Figure 2. Normal distribution curve and areas under curve for different distances ( ) πσ = σµ−− 2 e y 22x 2 i ( )∑ −=σ Nxx 2 i
  • 27. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 26 Classes of Errors • Indeterminate (random) - results fluctuate randomly - made up of small uncertainties - tend to cancel, but also act in concert • Determinate (system) - method, equipment, personal - unidirectional - can be identified • Reproducability of results • Concerns random errors • Can be measured as - deviation from the mean - range - standard deviation Coefficient of variation • Relative standard deviation • Dimensionless can be used to compare two or more sets of data Example: Compare variability of 3 sets of data A B C 40.0 19.9 37.0 29.2 24.1 33.4 18.6 22.1 36.1 29.3 19.8 40.2 x 29.28 21.48 36.68 s 8.74 2.05 2.80 COV 0.30 0.10 0.07 Accuracy • Total errors from expected value - systematic and random - when precision is high accuracy - tends to systematic error x s100 COV =
  • 28. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 27 Propagation of Errors (1) • Error is commonly expressed as standard deviation - a(± Sa) • When two or more observations are added on the subtracted error of the result: • When the final result is product or quotient calculate total error from relative errors - For 2 b 2 at sss += 2 b 2 a r,t b s a s s       +      = ( ) ( )b a sb sa ± ± r,tt s. b a s =
  • 29. HP Training Module File: “ 47 Basic Statistics.doc” Version September 2000 Page 28 Add copy of Main text in chapter 8, for all participants.
  • 30. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 29 7. Additional handout These handouts are distributed during delivery and contain test questions, answers to questions, special worksheets, optional information, and other matters you would not like to be seen in the regular handouts. It is a good practice to pre-punch these additional handouts, so the participants can easily insert them in the main handout folder.
  • 31. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 30 Problem: Estimate the average deviation and standard deviation of the lead concentration data given below: Lead cone µg/L 37.0 34.9 28.4 27.3 31.8 28.3 26.9 19.9 31.8 32.6
  • 32. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 31 Solution: Lead cone µg/L x –x (x –x)2 37.0 7.1 50.41 34.9 5.0 25.0 28.4 1.5 2.25 27.3 2.6 6.76 31.8 1.9 3.61 28.3 1.6 2.56 26.9 3 9.0 19.9 10 100.0 31.8 1.9 3.61 32.6 2.7 7.29 Mean 29.9 ∑ = 37.3 ∑ = 210.49 Average deviation = 37.3 / 10 = 3.73 Standard deviation = 84.4949.210 =
  • 33. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 32 8. Main text Contents 1. Frequency distribution 1 2. Precision and Accuracy of Experimental Data 4 3. Propagation of errors 6
  • 34. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 1 Basic Statistics Every physical measurement is subject to a degree of uncertainty. A result that is not particularly accurate may be of great use if the limits of the errors affecting it can be set with a high degree of certainty. Statistical methods are used to evaluate the probable errors in analytical measurements and interpretation of environmental data. This module discusses some basic concepts of statistics, which a chemist should be conversant with in order to evaluate the accuracy of the laboratory data and the estimation of the environmental characteristics based on the results of limited samples. 1. Frequency distribution Results of 44 replicate analyses for hardness of a sample of water are given in Table 1. Table 1. Results of 44 replicate analyses for hardness, mg/l as CaCO3 Class Value No. of Values in the Class Frequency (1) (2) (3) (4) 51-52 51.8 1 0.0227 52-53 52.0, 52.8, 52.8 3 0.0682 53-54 53.1, 53.1, 53.3, 53.4, 53.5, 53.6, 53.6, 53.9 8 0.1818 54-55 54.0, 54.1, 54.3, 54.3, 54.4, 54.5, 54.5, 54.6, 54.7, 54.7, 54.9 11 0.2500 55-56 55.1, 55.1, 55.3, 55.3, 55.4, 55.4, 55.6, 55.7, 55.7, 55.8, 55.9, 55.9 12 0.2727 56-57 56.2, 56.3, 56.7, 56.8, 56.9, 56.9 6 0.1364 57-58 57.3, 57.5 2 0.0454 58-59 - 0 59-60 59.1 1 0.0227 The data are classified in 9 classes, column (1) and arranged in each class according to their magnitude, column (2). By convention an observation equal to the upper value of the class is counted in the next higher class. For example the value 52.0 is not placed in class 51-52 but in 52-53. The number of values in each class are given in column (3). The ratio of number of values in a class to the total number of observations is called frequency and is given in column (4). A plot of data of column (1) vs. column (4) is called frequency distribution diagram and is shown in Figure 1. For clarity of presentation, the frequency value is multiplied by 100 and shown as %. Note that if the number of observations is increased and the class interval is reduced, the histogram can be replaced by a continuous curve.
  • 35. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 2 Figure 1: Example of a frequency distribution diagram 1.1 Central Tendency Arithmetic mean: The arithmetic mean, x, of a set of data is calculated by adding all the observed values of variable (results of analyses) and dividing it by the total number of observations: (1) where x1, x2,........xn are the observed values and n is the total number of observations. The arithmetic mean is the most common measure of the central tendency. The mean value of the data of Table 1 was calculated as 54.9 Geometric mean: When there are a few very high values or very low, such as in the cases of bacteriological analysis, the arithmetic mean is not necessarily representative of the central tendency. In such cases the geometric mean, g, is used: g = (2) Median: The median is the middle value of the set of data. If the sample size n is an odd number, one-half of the values exceed the median and one-half are less. When n is even, it is average of the two middle terms. For the data of Table 1, it is 54.8, which is the average of 54.7 and 54.9, the 22nd and the 23rd terms, when the data are arranged in ascending order. Mode: The mode is the most commonly occurring value. It is not widely employed as it forms a poor basis for any further arithmetic calculations. 1.2 Standard deviation The data of Table 1 show that 52% of observations lie in the range of 54 - 56 and 84% in the range of 53-57. This tendency of observations to cluster (or not to cluster) around the mean value, 54.9, is measured by standard deviation, s. Standard deviation is calculated as: ( ) nx............xxx n21 ++= ( ) n 1 n321 x................xxx ×× )1n/(})xx.......()xx()xx{(s 2 n 2 2 2 1 −−+−+−=
  • 36. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 3 or (3) A small value of s signifies that most of the observations are close to the mean value. A large value indicates that the observed values are spread over a larger range. For example, if the data of Table 1 has more observations in 52-53 and 57-58 classes and correspondingly lesser number in the central classes 54-55 and 55-56, the frequency distribution diagram will be flatter and spread wider at the base and the value of s will be larger. The standard deviation has the same units as the quantity measured. The standard deviation for the data of Table 1 was calculated as 1.545 mg/L. The square of the standard deviation is called the variance. If the random distribution of data is due to r different reasons, the total variance is the sum of individual variance. 1.3 Normal Distribution Most frequency distributions for random observations, when the total number of observations is very large tending to be the same as the population, N, conform to normal distribution. The distribution is give by the theoretical equation: (4) where y = frequency of observations µ = the mean, and σ = standard deviation. Note the distinction made in the notations for number of observations, mean and standard deviation for a sample of a limited set of data used earlier. The standard deviation is given by: (5) For a normal distribution, 68.3 % of the total observations lie in the range µ + σ, 95.5% in the range µ + 2σ and 99.7% in the range µ + 3σ. This is illustrated in Figure 2. The number of observations between any two limits of the variable can be equated to the area bounded by the frequency distribution curve, the ordinates at these limits and the x axis. When the data set is small, more often than not, sample mean, x will differ somewhat from the population mean, µ. Any error in x causes a corresponding error in the standard deviation calculated with Equation 5. There is a tendency for the standard deviation value calculated to be small as the number of measurements becomes smaller. It can be shown that this bias can be largely eliminated by substituting the degree of freedom as (n-1) in the calculation, Equation (3), which defines the standard deviation for a limited set of measurements. ( ) ( )1n nxx s 2 i 2 i − ∑ ∑− = ( ) πσ = σµ−− 2 e y 22 i 2x ( ) N Nxx 2 i 2 i∑ ∑− =σ
  • 37. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 4 The rationale for using (n-1) is as follows. When µ is not known, we must extract two quantities, namely, x and s from the set of data. The need to establish x from the data removes one degree of freedom: that is if their signs are retained, the sum of individual deviations must total zero; once (n-1) deviations have been established, the final deviation is necessarily known. Thus only (n-1) deviations provide independent measures for the standard deviation of the set. 2. Precision and Accuracy of Experimental Data 2.1 Classes of errors The phenomena that are responsible for uncertainties in an analytical measurement can be divided in two broad categories: determinate or system errors and indeterminate or random errors. The total error is the sum of the two types of errors. Determinate errors have assignable causes and are unidirectional. It may be possible to eliminate some of these, for example, errors because of improper calibration of the equipment, or presence of interfering substances in the sample. Errors inherent in the method, such as addition of a reagent in excess of theoretical requirement to cause an indicator to undergo colour change in a titration, may be difficult to eliminate. Indeterminate errors are encountered whenever a measuring system is extended to its maximum sensitivity. The results fluctuate in a random manner about a mean value. The sources of these fluctuations can never be identified because they are made up of a myriad of instrumental, personal and method uncertainties that are individually so small that they can never be detected. For example, in he case of addition of an exact volume of reagent in an analysis through a pipette the uncertainties could be the time allowed for draining the pipette, the angle at which the pipette is held during delivery, temperature of the reagent, visual judgement of water level with respect to the graduation mark, etc. What is observed in the final result is then a summation of a very large number of minute unobservable uncertainties. The cumulative effect is likewise variable. Ordinarily they tend to cancel one other and thus exert a minimal effect. Occasionally, however, they act in concert to produce a relatively large positive or negative error. 2.2 Precision The term precision is employed to describe the reproducibility of results. It can be defined as the agreement between the numerical values of two or more measurements that have been made in an identical fashion. Absolute methods for expressing precision: The absolute average deviation from the mean is a common method for describing precision. The spread or range of a set of data is also a measure of precision and is simply the numerical difference of the highest and the lowest result. Standard deviation, described earlier, is a more significant measure of precision.
  • 38. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 5 Relative methods for expressing precision: It is frequently more informative to indicate the relative precision. The relative parameters are dimensionless and therefore can be used to compare two ore more sets of data. Thus, for example, the relative deviation of analysis 1, in the above example, is (0.077 X 100)/24.313 = 0.32%. Relative standard deviation or coefficient of variation is defined as: (6) 2.3 Accuracy The term accuracy is used to describe the total error of the observation, which is the sum of the systematic and random errors. It denotes the nearness of the measurement to its accepted value. In Example 1, if the accepted value for the chloride concentration is 24.35 mg/L, the absolute error of the mean is 24.31 – 24.35 = - 0.04 mg/L. x s100 CV = Example 1 From the data of replicate chloride analysis of a water sample given below, calculate the precision in terms of the average deviation from the mean: Analysis Chloride, mg/L Abs. dev. from the Mean, mg/L 1 24.39 0.077 2 24.19 0.123 3 24.36 0.047 Total 72.94 0.247 Mean = 24.313 ave. dev. = 0.247/3 = 0.082 Example 2 Monitoring results at three sampling locations are listed below. Compare the sampling records in terms of variability. A B C 40.0 19.9 37.0 29.2 24.1 33.4 18.6 22.1 36.1 29.3 19.8 40.2 x 29.28 21.48 36.68 s 8.74 2.05 2.80 CV 0.30 0.10 0.07 In terms of the coefficient of variation the concentration at A varies the most, followed by B and then C.
  • 39. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 6 3. Propagation of errors The indeterminate error or uncertainty is most commonly expresses as standard deviation. When the final result is computed from two or more data, each of which has an indeterminate error associated with it, the error of the result will depend on the arithmetic computations. The following examples illustrate the procedure to be followed: Example 3: For the sum + 0.50 (± 0.02) + 4.10 (± 0.03) - 1.97 (± 0.05) --------------------- + 2.63 (± ? ?) where the numbers in parantheses are the absolute standard deviations. Statistically the most probable standard deviation would be given by the square root of the sum of individual variances: = ± 0.06 Therefore the result could be reported as: 2.63 (± 0.06) Example 4: The case when products and quotients are involved is illustrated in the following calculations: In this case it is necessary to calculate the relative standard deviations: ( ) ( ) ( )222 05.003.002.0s ±+±+±= ( ) ( ) ( ) ?)(0104.0 04.097.1 0001.00050.002.010.4 ±= ± ±×± ( ) 0049.0 10.4 02.0 S ra ±= ± = ( ) 020.0 005.0 0001.0 S rb ±= ± = ( ) 020.0 97.1 04.0 S rc ±= ± = ( ) ( ) ( ) ( ) 029.002.002.00049.0S 222 ry ±=±+±+±=
  • 40. HP Training Module File: “ 47 Basic Statistics.doc” Version: September 2000 Page 7 The absolute standard deviation of the result will beL Sy = 0.0104 × (± 0.029) = ± (0.0003) Therefore the uncertainty of the result can be written as: 0.0104 (± 0.0003) For calculations that involve both sums and differences as well as products and quotients, the uncertainties associated with the former are evaluated first and then the latter following procedures illustrated in the two examples.