SlideShare a Scribd company logo
1 of 43
DATA SUMMERISATION
Dr Vincent Yusuph
Ecohas Kibaha 2023
OBJECTIVES
At the end of this session you should be able to:
 Explain data summarization
 Explain the characteristics, uses, advantages, and
disadvantages of each measure of location.
 Calculate mode, mean and median
 Compute and interpret variance, and the standard
deviation
 Identify the position of the arithmetic mean, median, and mode
for both a symmetrical and a skewed distribution.
 Explain the characteristics, uses, advantages, and disadvantages
of this measure of dispersion
4.3
Data summarisation
 Measures of Central Location
 Mean, Median, Mode
 Measures of Variability/spread
 Range, Standard Deviation, Variance, Coefficient
of Variation
 Measures of Relative Standing
 Percentiles, Quartiles
MEASURES OF CENTRAL
TENDENCY/LOCATION
 Often we need to summarise frequency
distributions in a few numbers for ease
of reporting or comparison
 Recall: with qualitative data, useful
summary statistics include ratio,
proportion, rate
Measures of central tendency/location
 The statistical methods used to measure
central tendency include the following
1. Mean
2. Median
3. Mode
MEAN
 Refers to arithmetic mean
 It is obtained by adding the individual observations divided
by the total number of observations.
 Advantages – it is easy to calculate. most useful of all the
averages.
 Disadvantages – influenced by abnormal values.
 Examples: In this case it will be (8 + 16 + 15 + 17 + 18 + 20
+ 25)/7 which comes to 17
Characteristics of the Mean
It is calculated by
summing the values
and dividing by the
number of values.
It requires the interval scale.
All values are used.
It is unique.
The sum of the deviations from the mean is 0.
The Arithmetic Mean
is the most widely used
measure of location and
shows the central value of
the data.
The major characteristics of the mean are:
Average
Joe
3- 7
Population Mean
N
X



where
µ is the population mean
N is the total number of observations.
X is a particular value.
 indicates the operation of adding.
For ungrouped data, the
Population Mean is the
sum of all the population
values divided by the total
number of population
values:
3- 8
Example 1
500
,
48
4
000
,
73
...
000
,
56






N
X

Find the mean mileage for the cars.
A Parameter is a measurable characteristic of a
population.
The Musenge
family owns
four cars.
The following
is the current
mileage on
each of the
four cars.
56,000
23,000
42,000
73,000
3- 9
Example 2
4
.
15
5
77
5
0
.
15
...
0
.
14







n
X
X
A statistic is a measurable characteristic of a sampl
A sample of
five
executives
received the
following
bonus last
year ($000):
14.0,
15.0,
17.0,
16.0,
15.0
3- 10
Statistics is a pattern language
Population Sample
Size N n
Mean
Variance
Standard
Deviation
Properties of the Arithmetic Mean
Every set of interval-level and ratio-level data has a
mean.
All the values are included in computing the mean.
A set of data has a unique mean.
The mean is affected by unusually large or small
data values.
The arithmetic mean is the only measure of location
where the sum of the deviations of each value from
the mean is zero.
Properties of the Arithmetic Mean
3- 12
MEDIAN
 When all the observation are arranged either in ascending
order or descending order, the middle observation is known
as median.
 In case of even number the average of the two middle values
is taken.
 Median is better indicator of central value as it is not affected
by the extreme values
 Example : The median of 4, 1, and 7 is 4 because when the
numbers are put in order (1 , 4, 7) , the number 4 is in the
middle.
The Median
There are as many
values above the
median as below it in
the data array.
For an even set of values, the median will be the
arithmetic average of the two middle numbers and is
found at the (n+1)/2 ranked observation.
The Median is the
midpoint of the values
after they have been
ordered from the smallest
to the largest.
3- 14
The ages for a sample of five BSc.HLS.III
students are: 21, 25, 19, 20, 22.
Arranging the data
in ascending order
gives:
19, 20, 21, 22, 25.
Thus the median is
21.
The median (continued)
3- 15
Example 5
Arranging the data in
ascending order gives:
73, 75, 76, 80
Thus the median is 75.5.
The heights of four basketball players, in inches,
are: 76, 73, 80, 75.
The median is found
at the (n+1)/2 =
(4+1)/2 =2.5th data
point.
3- 16
Properties of the Median
There is a unique median for each data set.
It is not affected by extremely large or small
values and is therefore a valuable measure of
location when such values occur.
It can be computed for ratio-level, interval-
level, and ordinal-level data.
It can be computed for an open-ended
frequency distribution if the median does not
lie in an open-ended class.
Properties of the Median
3- 17
MODE
 Most frequently occurring observation in a data is called
mode
 Not often used in medical statistics.
 EXAMPLE
 Number of decayed teeth in 10 children
 2,2,4,1,3,0,10,2,3,8
 Mean = 34 / 10 = 3.4
 Median = (0,1,2,2,2,3,3,4,8,10) = 2+3 /2
= 2.5
 Mode = 2 ( 3 Times)
Symmetric distribution: A distribution having the
same shape on either side of the center
Skewed distribution: One whose shapes on either
side of the center differ; a nonsymmetrical distribution.
Can be positively or negatively skewed, or bimodal
The Relative Positions of the Mean, Median, and Mode
3- 19
RELATIONSHIP BETWEEN
MEAN, MEDIAN, MODE
The Relative Positions of the Mean, Median, and Mode:
Symmetric Distribution
Zero skewness Mean
=Median
=Mode
Mode
Median
Mean
3- 21
The Relative Positions of the Mean, Median, and Mode:
Right Skewed Distribution
 Positively skewed: Mean and median are to the right of the
mode.
Mean>Median>Mode
Mode
Median
Mean
3- 22
Negatively Skewed: Mean and Median are to the left of the Mode.
Mean<Median<Mode
The Relative Positions of the Mean, Median, and
Mode: Left Skewed Distribution
Mode
Mean
Median
3- 23
CHOICE OF APPROPRIATE
MEASURE
 For symmetric distributions, mean is
preferred to median or mode:
 utilises all values
 mathematical niceties
 For asymmetric distributions, mean not
suitable:
 mean is sensitive to extreme values
 median more preferred since it is not
affected by extreme values
• Measures of central location fail to tell the
whole story about the distribution; that is,
how much are the observations spread out
around the mean value?
Measures of spread…
Measures of Variability…
For example, two sets of class
grades are shown. The mean
(=50) is the same in each case…
But, the red class has greater
variability than the blue class.
Dispersion
refers to the
spread or
variability in
the data.
Measures of dispersion include the following: range,
mean deviation, variance, and standard
deviation.
Range = Largest value – Smallest
value
Measures of Dispersion
0
5
10
15
20
25
30
0 2 4 6 8 10 12
3- 27
The following represents the current year’s Return
on Equity of the 25 companies in an investor’s
portfolio.
-8.1 3.2 5.9 8.1 12.3
-5.1 4.1 6.3 9.2 13.3
-3.1 4.6 7.9 9.5 14.0
-1.4 4.8 7.9 9.7 15.0
1.2 5.7 8.0 10.3 22.1
Example 9
Highest value: 22.1 Lowest value: -8.1
Range = Highest value – lowest value
= 22.1-(-8.1)
= 30.2
3- 28
Range…
 Its major advantage is the ease with which it can be
computed.
 Its major shortcoming is its failure to provide
information on the dispersion of the observations
between the two end points.
 Hence we need a measure of variability that
incorporates all the data and not just two
observations. Hence…
Variance: the
arithmetic mean
of the squared
deviations from
the mean.
Standard deviation: The
square root of the variance.
Variance and standard Deviation
3- 30
Not influenced by extreme values.
The units are awkward, the square of the
original units.
All values are used in the calculation.
The major characteristics of the
Population Variance are:
Population Variance
3- 31
Population Variance formula:
 (X - )2
N

=
X is the value of an observation in the
population
m is the arithmetic mean of the population
N is the number of observations in the
population

Population Standard Deviation formula:
2

Variance and standard deviation
3- 32
(-8.1-6.62)2 + (-5.1-6.62)2 + ... + (22.1-6.62)2
25




= 42.227
= 6.498
In Example 9, the variance and standard deviation are:
 (X - )2
N

=
Example 9 continued
3- 33
Sample variance (s2)
s2 =
(X - X)2
n-1
Sample standard deviation (s)
2
s
s 
Sample variance and standard deviation
3- 34
40
.
7
5
37




n
X
X
     
30
.
5
1
5
2
.
21
1
5
4
.
7
6
...
4
.
7
7
1
2
2
2
2













n
X
X
s
Example 11
The hourly wages earned by a sample of five students
are:
$7, $5, $11, $8, $6.
Find the sample variance and standard deviation.
30
.
2
30
.
5
2


 s
s
3- 35
Empirical Rule: For any symmetrical, bell-
shaped distribution:
About 68% of the observations will lie within 1s
the mean
About 95% of the observations will lie within 2s
of the mean
Virtually all the observations will be within 3s of
the mean
Interpretation and Uses of the
Standard Deviation
3- 36
4.37
The Empirical Rule…
 Approximately 68% of all observations fall
 within one standard deviation of the mean.

 Approximately 95% of all observations fall
 within two standard deviations of the mean.
 Approximately 99.7% of all observations fall
 within three standard deviations of the mean.
Bell-Shaped Curve showing the relationship between and .
 
3  1  1  3
68%
95%
99.7%
Interpretation and Uses of the Standard Deviation
3- 38
Interpreting the standard deviation
 The greater the variation in the data the
greater the standard deviation
 If all the values are the same the standard
deviation is zero
 For a symmetrical distribution almost all the
data will be contained within three standard
deviations
Coefficient of Variation…
 The coefficient of variation of a set of observations
is the standard deviation of the observations divided
by their mean,
 that is:
 Population coefficient of variation = CV =
 Sample coefficient of variation = cv =
4.41
Coefficient of Variation…
 This coefficient provides a
 proportionate measure of variation, e.g.
 A standard deviation of 10 may be perceived
as large when the mean value is 100, but only
moderately large when the mean value is 500.
4.42
Measures of Variability…
 If data are symmetric, with no serious outliers,
use range and standard deviation.
 If comparing variation across two data sets,
use coefficient of variation.
 The measures of variability introduced in this
section can be used only for interval data.
RECAP QUESTIONS

More Related Content

Similar to 5.DATA SUMMERISATION.ppt

Similar to 5.DATA SUMMERISATION.ppt (20)

Empirics of standard deviation
Empirics of standard deviationEmpirics of standard deviation
Empirics of standard deviation
 
Newbold_chap03.ppt
Newbold_chap03.pptNewbold_chap03.ppt
Newbold_chap03.ppt
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptxLESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Stat11t chapter3
Stat11t chapter3Stat11t chapter3
Stat11t chapter3
 
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhhpolar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
 
Ppt central tendency measures
Ppt central tendency measuresPpt central tendency measures
Ppt central tendency measures
 
Topic 8a Basic Statistics
Topic 8a Basic StatisticsTopic 8a Basic Statistics
Topic 8a Basic Statistics
 
Statistics
StatisticsStatistics
Statistics
 
Chapter 11 Psrm
Chapter 11 PsrmChapter 11 Psrm
Chapter 11 Psrm
 
Statistics.pdf
Statistics.pdfStatistics.pdf
Statistics.pdf
 
03 ch ken black solution
03 ch ken black solution03 ch ken black solution
03 ch ken black solution
 
Statistics 3, 4
Statistics 3, 4Statistics 3, 4
Statistics 3, 4
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
 
UNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.pptUNIT III -Measures of Central Tendency 2.ppt
UNIT III -Measures of Central Tendency 2.ppt
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
 

More from chusematelephone

SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptxSHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptxchusematelephone
 
Confidentiality in Medical Practice.ppt
Confidentiality in Medical Practice.pptConfidentiality in Medical Practice.ppt
Confidentiality in Medical Practice.pptchusematelephone
 
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptxSESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptxchusematelephone
 
immunopathology-1-130219045942-phpapp01.pdf
immunopathology-1-130219045942-phpapp01.pdfimmunopathology-1-130219045942-phpapp01.pdf
immunopathology-1-130219045942-phpapp01.pdfchusematelephone
 
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptxSESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptxchusematelephone
 
SESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxSESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxchusematelephone
 
SESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxSESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxchusematelephone
 
GRAM POSITIVE BACTERIA.pptx
GRAM POSITIVE BACTERIA.pptxGRAM POSITIVE BACTERIA.pptx
GRAM POSITIVE BACTERIA.pptxchusematelephone
 
SESSION 3 - Computer Software-1.pptx
SESSION 3 - Computer Software-1.pptxSESSION 3 - Computer Software-1.pptx
SESSION 3 - Computer Software-1.pptxchusematelephone
 
LEVEL OF MEASUREMENTS_2.ppt
LEVEL OF MEASUREMENTS_2.pptLEVEL OF MEASUREMENTS_2.ppt
LEVEL OF MEASUREMENTS_2.pptchusematelephone
 

More from chusematelephone (16)

SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptxSHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
SHOULDER DYSTOCIA AND UTERINE ATONY (0).pptx
 
Confidentiality in Medical Practice.ppt
Confidentiality in Medical Practice.pptConfidentiality in Medical Practice.ppt
Confidentiality in Medical Practice.ppt
 
CONTRACEPTIVES.ppt
CONTRACEPTIVES.pptCONTRACEPTIVES.ppt
CONTRACEPTIVES.ppt
 
Partogram2.ppt
Partogram2.pptPartogram2.ppt
Partogram2.ppt
 
5.Meningitis (2).ppt
5.Meningitis (2).ppt5.Meningitis (2).ppt
5.Meningitis (2).ppt
 
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptxSESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
SESSION 8.3; NUTITIONAL REQUIRMENTS FOR ADULTS.pptx
 
immunopathology-1-130219045942-phpapp01.pdf
immunopathology-1-130219045942-phpapp01.pdfimmunopathology-1-130219045942-phpapp01.pdf
immunopathology-1-130219045942-phpapp01.pdf
 
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptxSESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
SESSION 8.2; NUTRITIONAL REQUIREMENTS TO CHILDREN.pptx
 
SESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxSESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptx
 
SESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptxSESSION 7; NUTRITIONAL DISORDERS.pptx
SESSION 7; NUTRITIONAL DISORDERS.pptx
 
GRAM STAINING- GLM.pptx
GRAM STAINING- GLM.pptxGRAM STAINING- GLM.pptx
GRAM STAINING- GLM.pptx
 
GRAM POSITIVE BACTERIA.pptx
GRAM POSITIVE BACTERIA.pptxGRAM POSITIVE BACTERIA.pptx
GRAM POSITIVE BACTERIA.pptx
 
SESSION 3 - Computer Software-1.pptx
SESSION 3 - Computer Software-1.pptxSESSION 3 - Computer Software-1.pptx
SESSION 3 - Computer Software-1.pptx
 
Session 2.pptx
Session 2.pptxSession 2.pptx
Session 2.pptx
 
LEVEL OF MEASUREMENTS_2.ppt
LEVEL OF MEASUREMENTS_2.pptLEVEL OF MEASUREMENTS_2.ppt
LEVEL OF MEASUREMENTS_2.ppt
 
SESSION 7.ppt
SESSION 7.pptSESSION 7.ppt
SESSION 7.ppt
 

Recently uploaded

VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...Garima Khatri
 
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiCall Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiNehru place Escorts
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...CALL GIRLS
 
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Nehru place Escorts
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...narwatsonia7
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore EscortsCall Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escortsvidya singh
 
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls ServiceKesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Servicemakika9823
 
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaCall Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaPooja Gupta
 
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...Nehru place Escorts
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipurparulsinha
 
Aspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliAspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliRewAs ALI
 
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...narwatsonia7
 
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service BangaloreCall Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalorenarwatsonia7
 
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.MiadAlsulami
 
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Miss joya
 
Low Rate Call Girls Ambattur Anika 8250192130 Independent Escort Service Amba...
Low Rate Call Girls Ambattur Anika 8250192130 Independent Escort Service Amba...Low Rate Call Girls Ambattur Anika 8250192130 Independent Escort Service Amba...
Low Rate Call Girls Ambattur Anika 8250192130 Independent Escort Service Amba...narwatsonia7
 
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000aliya bhat
 

Recently uploaded (20)

VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
 
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiCall Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
 
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
 
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore EscortsCall Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
 
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls ServiceKesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
Kesar Bagh Call Girl Price 9548273370 , Lucknow Call Girls Service
 
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaCall Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
 
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
 
Aspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliAspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas Ali
 
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
 
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service BangaloreCall Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
 
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
 
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
 
Low Rate Call Girls Ambattur Anika 8250192130 Independent Escort Service Amba...
Low Rate Call Girls Ambattur Anika 8250192130 Independent Escort Service Amba...Low Rate Call Girls Ambattur Anika 8250192130 Independent Escort Service Amba...
Low Rate Call Girls Ambattur Anika 8250192130 Independent Escort Service Amba...
 
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
 

5.DATA SUMMERISATION.ppt

  • 1. DATA SUMMERISATION Dr Vincent Yusuph Ecohas Kibaha 2023
  • 2. OBJECTIVES At the end of this session you should be able to:  Explain data summarization  Explain the characteristics, uses, advantages, and disadvantages of each measure of location.  Calculate mode, mean and median  Compute and interpret variance, and the standard deviation  Identify the position of the arithmetic mean, median, and mode for both a symmetrical and a skewed distribution.  Explain the characteristics, uses, advantages, and disadvantages of this measure of dispersion
  • 3. 4.3 Data summarisation  Measures of Central Location  Mean, Median, Mode  Measures of Variability/spread  Range, Standard Deviation, Variance, Coefficient of Variation  Measures of Relative Standing  Percentiles, Quartiles
  • 4. MEASURES OF CENTRAL TENDENCY/LOCATION  Often we need to summarise frequency distributions in a few numbers for ease of reporting or comparison  Recall: with qualitative data, useful summary statistics include ratio, proportion, rate
  • 5. Measures of central tendency/location  The statistical methods used to measure central tendency include the following 1. Mean 2. Median 3. Mode
  • 6. MEAN  Refers to arithmetic mean  It is obtained by adding the individual observations divided by the total number of observations.  Advantages – it is easy to calculate. most useful of all the averages.  Disadvantages – influenced by abnormal values.  Examples: In this case it will be (8 + 16 + 15 + 17 + 18 + 20 + 25)/7 which comes to 17
  • 7. Characteristics of the Mean It is calculated by summing the values and dividing by the number of values. It requires the interval scale. All values are used. It is unique. The sum of the deviations from the mean is 0. The Arithmetic Mean is the most widely used measure of location and shows the central value of the data. The major characteristics of the mean are: Average Joe 3- 7
  • 8. Population Mean N X    where µ is the population mean N is the total number of observations. X is a particular value.  indicates the operation of adding. For ungrouped data, the Population Mean is the sum of all the population values divided by the total number of population values: 3- 8
  • 9. Example 1 500 , 48 4 000 , 73 ... 000 , 56       N X  Find the mean mileage for the cars. A Parameter is a measurable characteristic of a population. The Musenge family owns four cars. The following is the current mileage on each of the four cars. 56,000 23,000 42,000 73,000 3- 9
  • 10. Example 2 4 . 15 5 77 5 0 . 15 ... 0 . 14        n X X A statistic is a measurable characteristic of a sampl A sample of five executives received the following bonus last year ($000): 14.0, 15.0, 17.0, 16.0, 15.0 3- 10
  • 11. Statistics is a pattern language Population Sample Size N n Mean Variance Standard Deviation
  • 12. Properties of the Arithmetic Mean Every set of interval-level and ratio-level data has a mean. All the values are included in computing the mean. A set of data has a unique mean. The mean is affected by unusually large or small data values. The arithmetic mean is the only measure of location where the sum of the deviations of each value from the mean is zero. Properties of the Arithmetic Mean 3- 12
  • 13. MEDIAN  When all the observation are arranged either in ascending order or descending order, the middle observation is known as median.  In case of even number the average of the two middle values is taken.  Median is better indicator of central value as it is not affected by the extreme values  Example : The median of 4, 1, and 7 is 4 because when the numbers are put in order (1 , 4, 7) , the number 4 is in the middle.
  • 14. The Median There are as many values above the median as below it in the data array. For an even set of values, the median will be the arithmetic average of the two middle numbers and is found at the (n+1)/2 ranked observation. The Median is the midpoint of the values after they have been ordered from the smallest to the largest. 3- 14
  • 15. The ages for a sample of five BSc.HLS.III students are: 21, 25, 19, 20, 22. Arranging the data in ascending order gives: 19, 20, 21, 22, 25. Thus the median is 21. The median (continued) 3- 15
  • 16. Example 5 Arranging the data in ascending order gives: 73, 75, 76, 80 Thus the median is 75.5. The heights of four basketball players, in inches, are: 76, 73, 80, 75. The median is found at the (n+1)/2 = (4+1)/2 =2.5th data point. 3- 16
  • 17. Properties of the Median There is a unique median for each data set. It is not affected by extremely large or small values and is therefore a valuable measure of location when such values occur. It can be computed for ratio-level, interval- level, and ordinal-level data. It can be computed for an open-ended frequency distribution if the median does not lie in an open-ended class. Properties of the Median 3- 17
  • 18. MODE  Most frequently occurring observation in a data is called mode  Not often used in medical statistics.  EXAMPLE  Number of decayed teeth in 10 children  2,2,4,1,3,0,10,2,3,8  Mean = 34 / 10 = 3.4  Median = (0,1,2,2,2,3,3,4,8,10) = 2+3 /2 = 2.5  Mode = 2 ( 3 Times)
  • 19. Symmetric distribution: A distribution having the same shape on either side of the center Skewed distribution: One whose shapes on either side of the center differ; a nonsymmetrical distribution. Can be positively or negatively skewed, or bimodal The Relative Positions of the Mean, Median, and Mode 3- 19
  • 21. The Relative Positions of the Mean, Median, and Mode: Symmetric Distribution Zero skewness Mean =Median =Mode Mode Median Mean 3- 21
  • 22. The Relative Positions of the Mean, Median, and Mode: Right Skewed Distribution  Positively skewed: Mean and median are to the right of the mode. Mean>Median>Mode Mode Median Mean 3- 22
  • 23. Negatively Skewed: Mean and Median are to the left of the Mode. Mean<Median<Mode The Relative Positions of the Mean, Median, and Mode: Left Skewed Distribution Mode Mean Median 3- 23
  • 24. CHOICE OF APPROPRIATE MEASURE  For symmetric distributions, mean is preferred to median or mode:  utilises all values  mathematical niceties  For asymmetric distributions, mean not suitable:  mean is sensitive to extreme values  median more preferred since it is not affected by extreme values
  • 25. • Measures of central location fail to tell the whole story about the distribution; that is, how much are the observations spread out around the mean value? Measures of spread…
  • 26. Measures of Variability… For example, two sets of class grades are shown. The mean (=50) is the same in each case… But, the red class has greater variability than the blue class.
  • 27. Dispersion refers to the spread or variability in the data. Measures of dispersion include the following: range, mean deviation, variance, and standard deviation. Range = Largest value – Smallest value Measures of Dispersion 0 5 10 15 20 25 30 0 2 4 6 8 10 12 3- 27
  • 28. The following represents the current year’s Return on Equity of the 25 companies in an investor’s portfolio. -8.1 3.2 5.9 8.1 12.3 -5.1 4.1 6.3 9.2 13.3 -3.1 4.6 7.9 9.5 14.0 -1.4 4.8 7.9 9.7 15.0 1.2 5.7 8.0 10.3 22.1 Example 9 Highest value: 22.1 Lowest value: -8.1 Range = Highest value – lowest value = 22.1-(-8.1) = 30.2 3- 28
  • 29. Range…  Its major advantage is the ease with which it can be computed.  Its major shortcoming is its failure to provide information on the dispersion of the observations between the two end points.  Hence we need a measure of variability that incorporates all the data and not just two observations. Hence…
  • 30. Variance: the arithmetic mean of the squared deviations from the mean. Standard deviation: The square root of the variance. Variance and standard Deviation 3- 30
  • 31. Not influenced by extreme values. The units are awkward, the square of the original units. All values are used in the calculation. The major characteristics of the Population Variance are: Population Variance 3- 31
  • 32. Population Variance formula:  (X - )2 N  = X is the value of an observation in the population m is the arithmetic mean of the population N is the number of observations in the population  Population Standard Deviation formula: 2  Variance and standard deviation 3- 32
  • 33. (-8.1-6.62)2 + (-5.1-6.62)2 + ... + (22.1-6.62)2 25     = 42.227 = 6.498 In Example 9, the variance and standard deviation are:  (X - )2 N  = Example 9 continued 3- 33
  • 34. Sample variance (s2) s2 = (X - X)2 n-1 Sample standard deviation (s) 2 s s  Sample variance and standard deviation 3- 34
  • 35. 40 . 7 5 37     n X X       30 . 5 1 5 2 . 21 1 5 4 . 7 6 ... 4 . 7 7 1 2 2 2 2              n X X s Example 11 The hourly wages earned by a sample of five students are: $7, $5, $11, $8, $6. Find the sample variance and standard deviation. 30 . 2 30 . 5 2    s s 3- 35
  • 36. Empirical Rule: For any symmetrical, bell- shaped distribution: About 68% of the observations will lie within 1s the mean About 95% of the observations will lie within 2s of the mean Virtually all the observations will be within 3s of the mean Interpretation and Uses of the Standard Deviation 3- 36
  • 37. 4.37 The Empirical Rule…  Approximately 68% of all observations fall  within one standard deviation of the mean.   Approximately 95% of all observations fall  within two standard deviations of the mean.  Approximately 99.7% of all observations fall  within three standard deviations of the mean.
  • 38. Bell-Shaped Curve showing the relationship between and .   3  1  1  3 68% 95% 99.7% Interpretation and Uses of the Standard Deviation 3- 38
  • 39. Interpreting the standard deviation  The greater the variation in the data the greater the standard deviation  If all the values are the same the standard deviation is zero  For a symmetrical distribution almost all the data will be contained within three standard deviations
  • 40. Coefficient of Variation…  The coefficient of variation of a set of observations is the standard deviation of the observations divided by their mean,  that is:  Population coefficient of variation = CV =  Sample coefficient of variation = cv =
  • 41. 4.41 Coefficient of Variation…  This coefficient provides a  proportionate measure of variation, e.g.  A standard deviation of 10 may be perceived as large when the mean value is 100, but only moderately large when the mean value is 500.
  • 42. 4.42 Measures of Variability…  If data are symmetric, with no serious outliers, use range and standard deviation.  If comparing variation across two data sets, use coefficient of variation.  The measures of variability introduced in this section can be used only for interval data.