SUMMARIZING
DATA
Dr Lipilekha Patnaik
Professor, Community Medicine
Institute of Medical Sciences & SUM Hospital
Siksha ‘O’ Anusandhan deemed to be University
Bhubaneswar, Odisha, India
Email: drlipilekha@yahoo.co.in
Session Objectives
•Measures of central tendency – Mean, Median, Mode
•Measures of dispersion – Range, Standard deviation, Standard error
Descriptive Measures for continuous data
•Central tendency measures – They are
computed to give a “center” around
which the measurements in the data
are distributed.
•Variation or variability measures –
They describe data spread or how far
away the measurements are from the
center.
Statistics related to continuous variables
• Mean
• Median
• Mode
• Range
• Standard Deviation
• Standard Error
Measures of Central Tendency
Central tendency measures
•Mean – The average value
Affected by extreme values
•Median – The middle value
Not affected by extremes
•Mode – Most frequently occurring
observation, there may be
more than one mode.
Mean
•Average
•Arithmetic mean: $\bar{x} = \dfrac{\text{sum of individual values}}{\text{number of observations}} = \dfrac{\sum x}{n}$
Exercise
• The diastolic blood pressure of 10 individuals was
83, 75, 81, 79, 71, 95, 75, 77, 84, 90.
• Arithmetic mean = (83+75+81+79+71+95+75+77+84+90) / 10
= 810 / 10
= 81
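The arithmetic in this exercise can be checked with a few lines of Python. This is only a sketch: the variable names (`dbp`, `mean_by_formula`, `mean_by_library`) are chosen here for illustration and do not come from the slides.

```python
import statistics

# Diastolic blood pressure of the 10 individuals in the exercise
dbp = [83, 75, 81, 79, 71, 95, 75, 77, 84, 90]

# Arithmetic mean = sum of individual values / number of observations
mean_by_formula = sum(dbp) / len(dbp)

# The standard library should agree with the hand calculation
mean_by_library = statistics.mean(dbp)

print(sum(dbp), mean_by_formula, mean_by_library)
```

Both routes should reproduce the sum of 810 and the mean of 81 shown in the exercise.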
Median
§ The data are first arranged in an ascending or
descending order of magnitude
§ Middle observation is located, which is called
median.
§If the number of values is odd,
Median = middle value
§If the number of values is even,
Median = average of the two middle values
Median divides the data into two equal parts
with 50% of the observations above the median
and 50% below it.
Exercise
• Exercise 1: odd number (11) of observations
• Unsorted: 11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10
• Sorted in ascending order: 8, 9, 10, 10, 11, 11, 12, 12, 12, 13, 15
• Median = 6th (middle) value = 11
• Exercise 2: even number (12) of observations
• Unsorted: 11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10, 12
• Arranged in ascending order: 8, 9, 10, 10, 11, 11, 12, 12, 12, 12, 13, 15
• Median = (11 + 12) / 2 = 11.5
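The odd/even rule for the median can be sketched as a small function. The function name `median` and the list names are illustrative assumptions, and the first list is taken as it appears in the sorted row of the exercise (three 12s, eleven values).

```python
import statistics

def median(values):
    """Sort the data, then take the middle value (odd n)
    or the average of the two middle values (even n)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_set = [11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10]        # 11 observations
even_set = [11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10, 12]   # 12 observations

median_odd = median(odd_set)
median_even = median(even_set)

print(median_odd, median_even)
```

The standard library function `statistics.median` applies the same rule and can be used instead of the hand-written version.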
Mode
•Most frequent observation.
•The value that appears most frequently in the data set.
Example: 11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10
Mode = 12
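A short sketch of finding the mode in Python. `statistics.multimode` returns every most-frequent value, which matters because a data set may have more than one mode. The second list (`two_modes`) is a made-up illustration, not data from the slides.

```python
import statistics

values = [11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10]

# multimode returns a list of all values tied for the highest frequency
modes = statistics.multimode(values)

# Hypothetical data with two equally frequent values
two_modes = statistics.multimode([1, 1, 2, 2, 3])

print(modes, two_modes)
```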
Exercise
Number of seizures/month:
3, 3, 1, 2, 4, 7, 9
•Mean? 4.1
•Median? 3
•Mode? 3
[Figure: plot of the number of seizures for observations 1 to 7]
What’s wrong with a mean?
•Mean is sensitive to outliers (values far from the middle of the distribution)
–Provides a falsely high or low measure of central tendency when outliers exist.
–In such cases (look at your data), use the median as the preferred measure of central tendency.
Number of seizures/month: 100, 2, 3, 3, 4, 7, 1
•Mean? 17.14
•Median? 3
•Mode? 3
[Figure: plot of the number of seizures for observations 1 to 7, with the outlier (100) marked]
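The effect of the outlier on the mean, and the stability of the median, can be reproduced with the two seizure data sets from the slides. This is a sketch; the names `without_outlier` and `with_outlier` are illustrative.

```python
import statistics

without_outlier = [3, 3, 1, 2, 4, 7, 9]
with_outlier = [100, 2, 3, 3, 4, 7, 1]

summary = {}
for label, data in [("without", without_outlier), ("with", with_outlier)]:
    summary[label] = {
        "mean": round(statistics.mean(data), 2),   # pulled towards the outlier
        "median": statistics.median(data),         # unchanged by the outlier
        "mode": statistics.mode(data),
    }

print(summary)
```

The mean moves from about 4.1 to about 17.1 when the value 100 is present, while the median and mode stay at 3.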
Measures of Dispersion
Measures of dispersion
• “Dispersion” (also called variability, scatter, spread)
•Measure how spread out a set of data is.
•Dispersion is the scatteredness of the data series
around its average.
Measures of dispersion
•Range
•Standard deviation
•Variance
•Interquartile range
Range
•The difference between the values of
the two extreme items of a series.
•i.e Difference between the maximum
& minimum value in a set of
observations.
Exercise
•For example, from the following record of diastolic
blood pressure of 10 individuals -
93, 75, 81, 79, 71, 90, 75, 95, 77, 94.
• Highest value = 95
• Lowest value = 71.
•The range is expressed as 95 - 71 = 24
or as 71 to 95.
Characteristics of Range
•Simplest and most crude measure of
dispersion.
• Affected by the extreme values.
•Gives an idea of the variability very quickly.
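The range exercise can be sketched in Python as follows. The fifth value is read here as 71 so that the list matches the stated lowest value; that reading is an assumption, as are the variable names.

```python
# Diastolic blood pressure record from the range exercise
dbp = [93, 75, 81, 79, 71, 90, 75, 95, 77, 94]

lowest, highest = min(dbp), max(dbp)

# Range = maximum value - minimum value
data_range = highest - lowest

print(f"Range = {highest} - {lowest} = {data_range} (or {lowest} to {highest})")
```

Because only the two extreme values enter the calculation, a single unusual reading changes the range directly.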
Standard deviation
•Tells us how far individual values deviate from
the mean in the sample.
•Provides an index of variability.
Characteristics of Standard Deviation
• Very satisfactory and most widely used measure of
dispersion.
•If the SD is small, there is a high probability of getting a
value close to the mean.
• If it is large, values tend to lie farther away from the mean.
• It is less affected by fluctuations of sampling.
How to determine a SD
1. Calculate the mean
2. Calculate the difference between each value
and the mean
3. Square each of the differences and sum them
4. Divide the sum by one less than the number of
observations (if n < 30) or by the number of observations
(if n > 30).
5. Take the square root of the result.
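The steps above can be sketched as a short function. The function name `standard_deviation` and the `cutoff` parameter are illustrative assumptions; the cutoff of 30 follows the rule stated in the steps, and the data are the diastolic blood pressure values from the mean exercise.

```python
import math
import statistics

def standard_deviation(values, cutoff=30):
    n = len(values)
    mean = sum(values) / n                               # step 1
    differences = [x - mean for x in values]             # step 2
    sum_of_squares = sum(d ** 2 for d in differences)    # step 3
    divisor = n - 1 if n < cutoff else n                 # step 4 (rule from the slide)
    return math.sqrt(sum_of_squares / divisor)           # square root gives the SD

dbp = [83, 75, 81, 79, 71, 95, 75, 77, 84, 90]
sd = standard_deviation(dbp)

print(round(sd, 2))
```

For a sample of this size the result should agree with `statistics.stdev`, which always divides by n - 1.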
Standard deviation
• The diastolic blood pressure of 10 individuals was as follows: 83, 75, 81, 79, 71, 95, 75, 77, 84, 90.

$x$     $x - \bar{x}$     $(x - \bar{x})^2$
83        2                 4
75       -6                36
81        0                 0
79       -2                 4
71      -10               100
95       14               196
75       -6                36
77       -4                16
84        3                 9
90        9                81

$\sum x = 810$, $\sum (x - \bar{x})^2 = 482$
n = 10
Mean = 810 / 10 = 81
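The table stops at the sum of squared differences. Assuming the n - 1 divisor that the calculation steps give for samples smaller than 30 (here n = 10), the remaining arithmetic would run as follows; the symbol $s$ for the sample SD is a notation chosen here, not taken from the slides.

```latex
s^2 = \frac{\sum (x - \bar{x})^2}{n - 1} = \frac{482}{9} \approx 53.56
\qquad
s = \sqrt{53.56} \approx 7.32
```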
Uses of the standard deviation
•The standard deviation enables us to determine,
with a great deal of accuracy, where the values
of a frequency distribution are located in relation
to the mean.
Standard Deviation (SD) – for ‘Normal distribution’
[Figure: normal distribution curve; horizontal axis: birth weight (kg); vertical axis: N]
Mean birth weight = 3.5 kg
Std Dev. = 1.0 kg
Mean ± 1 SD: 3.5 ± 1 kg = 2.5 – 4.5 kg, covering 68% of values
Mean ± 2 SD: 3.5 ± 2 kg = 1.5 – 5.5 kg, covering 95% of values
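The 68% and 95% figures can be checked against a normal distribution with the slide's mean and SD. This is a sketch that assumes birth weight is exactly normal; the helper `share_between` is a name introduced here for illustration.

```python
from statistics import NormalDist

# Normal distribution with the slide's mean (3.5 kg) and SD (1.0 kg)
birth_weight = NormalDist(mu=3.5, sigma=1.0)

def share_between(low, high):
    """Proportion of the distribution lying between low and high."""
    return birth_weight.cdf(high) - birth_weight.cdf(low)

within_1_sd = share_between(2.5, 4.5)   # mean ± 1 SD
within_2_sd = share_between(1.5, 5.5)   # mean ± 2 SD

print(round(within_1_sd, 3), round(within_2_sd, 3))
```

The computed shares are slightly above the rounded 68% and 95% quoted on the slide, which are the usual rule-of-thumb values.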
Variance
• Variance = (SD)² = $\dfrac{\sum (x - \bar{x})^2}{n - 1}$
• Indicates the degree of variability among the
observations for a given variable.
Percentiles
• The p-th percentile is a number such that at most p%
of the measurements are below it and at most
(100 – p)% of the data are above it.
• Ex – if in a certain data set the 85th percentile is
520, it means that 15% of the measurements in
the data are above 520 and 85% of the
measurements are below 520.
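The percentile definition can be illustrated with the nearest-rank method, which is one of several conventions in use and is an assumption here rather than the method of the slides. The function name `percentile_nearest_rank` is illustrative, and the data are the diastolic blood pressure values from the mean exercise.

```python
import math

def percentile_nearest_rank(values, p):
    """Return the smallest data value whose rank covers at least p% of the data."""
    ordered = sorted(values)
    rank = max(1, math.ceil(p / 100 * len(ordered)))
    return ordered[rank - 1]

dbp = [83, 75, 81, 79, 71, 95, 75, 77, 84, 90]
p85 = percentile_nearest_rank(dbp, 85)

# Check the definition: at most 85% below, at most 15% above
share_below = sum(x < p85 for x in dbp) / len(dbp)
share_above = sum(x > p85 for x in dbp) / len(dbp)

print(p85, share_below, share_above)
```

Statistical packages often interpolate between neighbouring values instead, so their percentiles can differ slightly on small data sets.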
Percentiles - for non-normally distributed data
[Figure: distribution of diastolic BP (horizontal axis, 50 to 120) against N, divided into four parts of 25% each at the 25th, 50th and 75th percentiles]
The p-th percentile is the value below which p% of the data fall.
The 50th percentile is the MEDIAN.
The 25th to the 75th percentile is the INTERQUARTILE RANGE (IQR).
INTERQUARTILE RANGE
[Figure: ordered data divided into four parts of 25% each by Q1, Q2 and Q3]
The “interquartile range” is from Q1 to Q3.
Interquartile range = Q3 – Q1
To calculate it, just subtract quartile 1
from quartile 3.
Example: 5, 8, 4, 4, 6, 3, 8.
• First put the list of numbers in order.
• Then cut the list into 4 equal parts.
• The quartiles are the cuts.
3, 4, 4, 5, 6, 8, 8
Q1 (lower quartile) = 4
Q2 (middle quartile, median) = 5
Q3 (upper quartile) = 8
Interquartile range is Q3 – Q1 = 8 – 4 = 4
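The quartile example can be reproduced with the standard library. This sketch uses `statistics.quantiles` with its "exclusive" method, which happens to give the same cut points as the example; other quartile conventions can give different values on small data sets, so the choice of method is an assumption.

```python
import statistics

data = [5, 8, 4, 4, 6, 3, 8]

# n=4 asks for the three cut points that split the data into quarters
q1, q2, q3 = statistics.quantiles(data, n=4, method="exclusive")

# Interquartile range = Q3 - Q1
iqr = q3 - q1

print(q1, q2, q3, iqr)
```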
Standard Error
•If we take a random sample (n) from the population,
and similar samples over and over again, we will find
that every sample will have a different mean ($\bar{x}$).
•If we make a frequency distribution of all the sample
means drawn from the same population, we will find
that the distribution of the means is nearly a normal
distribution and the mean of the sample means is
practically the same as the population mean (μ).
•This is a very important observation: the sample means are distributed normally about the population mean (μ).
•The standard deviation of the means is a measure of the sampling error and is given by the formula σ/√n, which is called the standard error or the standard error of the mean.
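The idea of repeated sampling can be illustrated with a small simulation. Everything here is a hypothetical setup: a normal population with mean 150 and SD 30 (the figures used in the deck's confidence interval example), a sample size of 25, and 2000 repeated samples. The seed is fixed only so that the run is repeatable.

```python
import math
import random
import statistics

rng = random.Random(0)          # fixed seed for a repeatable run
POP_MEAN, POP_SD = 150, 30      # assumed population mean and SD
SAMPLE_SIZE = 25                # hypothetical sample size n
N_SAMPLES = 2000                # hypothetical number of repeated samples

# Draw many samples and keep the mean of each one
sample_means = [
    statistics.mean([rng.gauss(POP_MEAN, POP_SD) for _ in range(SAMPLE_SIZE)])
    for _ in range(N_SAMPLES)
]

mean_of_means = statistics.mean(sample_means)
sd_of_means = statistics.stdev(sample_means)
theoretical_se = POP_SD / math.sqrt(SAMPLE_SIZE)   # sigma / sqrt(n)

print(round(mean_of_means, 2), round(sd_of_means, 2), theoretical_se)
```

In a run like this, the mean of the sample means should sit close to the population mean, and the SD of the sample means should sit close to σ/√n.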
95% confidence interval
•Approximately 2 standard errors above and below
the estimate
•The range within which 95% of estimates from
multiple samples would be expected to lie
•Regarded as the range within which the “true
population” value probably lies (with 95% certainty)
95% confidence interval of the mean
The SEM is used to describe a 95% confidence interval for an observed
mean. (95% CI = Mean ± 2 SEM)
This confidence interval narrows with larger sample size,
since SE = $\dfrac{SD}{\sqrt{n}}$.
95% CI of the mean
Mean = 150
S.D. = 30
If based on 4 values,
95% CI is mean ± 2 SE
150 ± 2 x 30/√4
150 ± 2 x 15
120 – 180
If based on 100 values,
95% CI is mean ± 2 SE
150 ± 2 x 30/√100
150 ± 2 x 3
144 – 156
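The two intervals above follow from one small calculation, sketched here. The function name `ci95` is illustrative, and the multiplier of 2 is the slide's approximation (the exact normal multiplier is about 1.96).

```python
import math

def ci95(mean, sd, n):
    """Approximate 95% CI as mean ± 2 × SE, with SE = SD / sqrt(n)."""
    se = sd / math.sqrt(n)
    return mean - 2 * se, mean + 2 * se

ci_4 = ci95(150, 30, 4)       # based on 4 values
ci_100 = ci95(150, 30, 100)   # based on 100 values

print(ci_4, ci_100)
```

With the same mean and SD, moving from 4 to 100 observations shrinks the standard error from 15 to 3, which is why the interval narrows.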
Interpreting Estimates with Confidence
Intervals
•If the study were repeated many times with the
given sample size, about 95% of the confidence
intervals so constructed would contain the true
population mean.
Categorical data
• For categorical data
Compare groups
Use proportions
Example
• In a prevalence study of hypertension (HTN), we found
that

               Hypertension    No hypertension
Non-smokers    10 (10%)        90
Smokers        26 (26%)        74

• The table shows that the proportion with
HTN was higher among smokers. The question
that arises is whether HTN was really higher
among smokers or the difference was merely due
to chance.
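The proportions in the table can be computed directly from the counts. This sketch only summarises the data; it does not answer the question of chance raised on the slide, which needs a significance test. The dictionary layout and the names `table`, `prevalence` and `difference` are illustrative.

```python
# Counts from the hypertension prevalence table
table = {
    "Non-smokers": {"htn": 10, "no_htn": 90},
    "Smokers": {"htn": 26, "no_htn": 74},
}

def prevalence(row):
    """Proportion with hypertension = cases / group total."""
    return row["htn"] / (row["htn"] + row["no_htn"])

prev = {group: prevalence(row) for group, row in table.items()}
difference = prev["Smokers"] - prev["Non-smokers"]

for group, p in prev.items():
    print(f"{group}: {p:.0%}")
print(f"Difference: {difference:.0%}")
```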
Take – home messages:
§Look at your data
§For continuous data, summarize with mean (for
central tendency) and SD (for dispersion) only
for normal bell – shaped distributions
(otherwise, use median and percentiles)
§Interpret mean with confidence interval while
inferring to population
§For categorical data, use proportions.

Summarizing data

  • 1. SUMMARIZING DATA • Dr Lipilekha Patnaik, Professor, Community Medicine, Institute of Medical Sciences & SUM Hospital, Siksha ‘O’ Anusandhan deemed to be University, Bhubaneswar, Odisha, India • Email: drlipilekha@yahoo.co.in
  • 2. Session Objectives • Measures of central tendency – mean, median, mode • Measures of dispersion – range, standard deviation, standard error
  • 3. Descriptive Measures for continuous data • Central tendency measures – They are computed to give a “center” around which the measurements in the data are distributed. • Variation or variability measures – They describe data spread, or how far away the measurements are from the center.
  • 4. Statistics related to continuous variables • Mean • Median • Mode • Range • Standard Deviation • Standard Error
  • 5. Measures of Central Tendency
  • 6. Central tendency measures • Mean – The average value; affected by extreme values • Median – The middle value; not affected by extremes • Mode – The most frequently occurring observation; there may be more than one mode.
  • 7. Mean • Average • Arithmetic mean $\bar{x} = \dfrac{\text{sum of individual values}}{\text{number of observations}} = \dfrac{\sum x}{n}$
  • 8. Exercise • The diastolic blood pressure of 10 individuals was 83, 75, 81, 79, 71, 95, 75, 77, 84, 90. • Arithmetic mean = (83 + 75 + 81 + 79 + 71 + 95 + 75 + 77 + 84 + 90) / 10 = 810 / 10 = 81
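The arithmetic in this exercise can be checked with a short Python sketch. The variable names are illustrative and do not come from the slides.

```python
from statistics import mean

# Diastolic blood pressure readings of the 10 individuals in the exercise
diastolic_bp = [83, 75, 81, 79, 71, 95, 75, 77, 84, 90]

total = sum(diastolic_bp)        # sum of individual values
n = len(diastolic_bp)            # number of observations
arithmetic_mean = total / n      # sum divided by n

print(total, n, arithmetic_mean)

# The standard library gives the same value
print(mean(diastolic_bp))
```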
  • 9. Median • The data are first arranged in ascending or descending order of magnitude. • The middle observation is then located; this is the median. • If the number of values is odd, median = middle value. • If the number of values is even, median = average of the two middle values.
  • 10. Median divides the data into two equal parts, with 50% of the observations above the median and 50% below it. [Figure: the same data shown unsorted and sorted in ascending order]
  • 11. Exercise • Exercise 1: odd number (11) of observations • 11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10 • Arranged in ascending order: 8, 9, 10, 10, 11, 11, 12, 12, 12, 13, 15 • Median = 11 (the 6th value) • Exercise 2: even number (12) of observations • 11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10, 12 • Arranged in ascending order: 8, 9, 10, 10, 11, 11, 12, 12, 12, 12, 13, 15 • Median = (11 + 12) / 2 = 11.5
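The odd/even rule for the median can be sketched in Python as follows. The function name `middle_value` is a hypothetical helper, and the input lists are taken to match the sorted lists shown in the exercise.

```python
def middle_value(values):
    """Median by the rule above: sort the data, then take the middle value
    (odd n) or the average of the two middle values (even n)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_data = [11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10]        # 11 observations
even_data = [11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10, 12]   # 12 observations

print(middle_value(odd_data))
print(middle_value(even_data))
```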
  • 12. Mode • Most frequent observation. • The value that appears most frequently in the data set.
  • 13. Exercise • 11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10 • Mode = 12
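A minimal Python sketch of finding the mode, using `statistics.multimode` so that more than one mode can be reported. The second list, `two_modes`, is made-up illustrative data and is not from the slides.

```python
from statistics import multimode

# Data from the exercise above
values = [11, 13, 15, 12, 10, 9, 12, 8, 12, 11, 10]
modes = multimode(values)          # list of the most frequent value(s)
print(modes)

# Hypothetical data with two equally frequent values
two_modes = [1, 1, 2, 2, 3]
print(multimode(two_modes))
```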
  • 14. Number of seizures/month: 3, 3, 1, 2, 4, 7, 9 • Mean? 4.1 • Median? 3 • Mode? 3 [Chart: number of seizures for each of the 7 observations]
  • 15. What’s wrong with a mean? • The mean is sensitive to outliers (values far from the middle of the distribution). – It gives a falsely high or low measure of central tendency when outliers exist. – In such cases (look at your data), use the median as the preferred measure of central tendency.
  • 16. Number of seizures/month: 100, 2, 3, 3, 4, 7, 1 • Mean? 17.14 • Median? 3 • Mode? 3 [Chart: the same 7 observations, with the value 100 marked as an outlier]
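The two seizure data sets can be compared side by side in Python to see how a single outlier moves the mean while the median stays put. Variable names here are illustrative.

```python
from statistics import mean, median

seizures = [3, 3, 1, 2, 4, 7, 9]                 # no outlier
seizures_with_outlier = [100, 2, 3, 3, 4, 7, 1]  # one extreme value (100)

for label, data in [("without outlier", seizures),
                    ("with outlier", seizures_with_outlier)]:
    # The mean shifts markedly with the outlier; the median does not
    print(label, round(mean(data), 2), median(data))
```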
  • 18. Measures of dispersion • “Dispersion” (also called variability, scatter, spread) • Measures how spread out a set of data is. • Dispersion is the scatter of the data series around its average.
  • 19. Measures of dispersion • Range • Standard deviation • Variance • Interquartile range
  • 20. Range • The difference between the values of the two extreme items of a series, i.e. the difference between the maximum and minimum values in a set of observations.
  • 21. Exercise • For example, from the following record of diastolic blood pressure of 10 individuals: 93, 75, 81, 79, 71, 90, 75, 95, 77, 94. • Highest value = 95 • Lowest value = 71 • Range = 95 – 71 = 24, also expressed as 71 to 95.
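A short Python sketch of the range calculation. It assumes the fifth reading is 71, which is what the stated lowest value of 71 implies; the variable names are illustrative.

```python
# Diastolic blood pressure of 10 individuals
# (assumption: the fifth reading is 71, matching the stated lowest value)
readings = [93, 75, 81, 79, 71, 90, 75, 95, 77, 94]

highest = max(readings)
lowest = min(readings)
value_range = highest - lowest   # difference between the two extremes

print(f"Range = {highest} - {lowest} = {value_range} ({lowest} to {highest})")
```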
  • 22. Characteristics of Range • Simplest and crudest measure of dispersion. • Affected by extreme values. • Gives an idea of the variability very quickly.
  • 23. Standard deviation • Tells us how far individual values deviate from the mean in the sample. • Provides an index of variability.
  • 24. Characteristics of Standard Deviation • Very satisfactory and the most widely used measure of dispersion. • If the SD is small, there is a high probability of getting a value close to the mean. • If it is large, values tend to lie farther from the mean. • It is less affected by fluctuations of sampling.
  • 25. How to determine a SD 1. Calculate the mean. 2. Calculate the difference between each value and the mean. 3. Square each of the differences and sum them. 4. Divide the sum by one less than the number of observations (if n < 30) or by the number of observations (if n > 30). 5. Take the square root of the result; step 4 gives the variance, and the SD is its square root.
  • 27. Standard deviation • The diastolic blood pressure of 10 individuals was as follows: 83, 75, 81, 79, 71, 95, 75, 77, 84, 90.
        $x$     $x - \bar{x}$     $(x - \bar{x})^2$
        83        2                 4
        75       -6                36
        81        0                 0
        79       -2                 4
        71      -10               100
        95       14               196
        75       -6                36
        77       -4                16
        84        3                 9
        90        9                81
    • $\sum x = 810$, $n = 10$, mean $\bar{x} = 810 / 10 = 81$ • $\sum (x - \bar{x})^2 = 482$
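The worked table can be reproduced step by step in Python. This is a sketch that divides by n – 1 because n = 10 is below 30, following the rule given for determining an SD; the variable names are illustrative.

```python
import math
from statistics import stdev

values = [83, 75, 81, 79, 71, 95, 75, 77, 84, 90]

n = len(values)
mean_value = sum(values) / n                     # step 1: mean
deviations = [x - mean_value for x in values]    # step 2: x minus mean
sum_of_squares = sum(d ** 2 for d in deviations) # step 3: square and sum
variance = sum_of_squares / (n - 1)              # step 4: divide by n - 1
sd = math.sqrt(variance)                         # step 5: square root

print(sum_of_squares, round(variance, 2), round(sd, 2))

# statistics.stdev also uses the n - 1 denominator
print(round(stdev(values), 2))
```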
  • 28. Uses of the standard deviation • The standard deviation enables us to determine, with a great deal of accuracy, where the values of a frequency distribution are located in relation to the mean.
  • 29. Standard Deviation (SD) – for ‘Normal distribution’ • Mean birth weight = 3.5 kg, SD = 1.0 kg • Mean ± 1 SD = 3.5 ± 1 kg = 2.5 – 4.5 kg, which covers about 68% of values • Mean ± 2 SD = 3.5 ± 2 kg = 1.5 – 5.5 kg, which covers about 95% of values [Figure: normal curve of birth weight (kg) with 1.5, 2.5, 3.5, 4.5 and 5.5 marked]
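The 68% and 95% figures can be checked against a normal distribution with the slide's mean and SD. This sketch uses `statistics.NormalDist` (Python 3.8+); the helper `share_within` is a hypothetical name introduced for illustration.

```python
from statistics import NormalDist

# Birth weight modelled as normal with mean 3.5 kg and SD 1.0 kg
birth_weight = NormalDist(mu=3.5, sigma=1.0)

def share_within(k):
    """Proportion of values lying within k SDs of the mean."""
    low = birth_weight.mean - k * birth_weight.stdev
    high = birth_weight.mean + k * birth_weight.stdev
    return birth_weight.cdf(high) - birth_weight.cdf(low)

print(round(share_within(1), 3))   # mean ± 1 SD: 2.5 to 4.5 kg
print(round(share_within(2), 3))   # mean ± 2 SD: 1.5 to 5.5 kg
```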
  • 30. Variance • Variance $= (\text{SD})^2 = \dfrac{\sum (x - \bar{x})^2}{n - 1}$ • Indicates the degree of variability among the observations for a given variable.
  • 31. Percentiles • The p-th percentile is a number such that at most p% of the measurements are below it and at most (100 – p)% of the measurements are above it. • Example: if the 85th percentile of a data set is 520, then 85% of the measurements are below 520 and 15% are above 520.
  • 32. Percentiles - for non-normally distributed data • A percentile gives the % of data that fall below a specific value. • The 50th percentile is the MEDIAN. • The 25th to the 75th percentile is the INTERQUARTILE RANGE (IQR). [Figure: distribution of diastolic BP (50 to 120) split into four 25% segments by the 25th, 50th and 75th percentiles]
  • 33. INTERQUARTILE RANGE • The “interquartile range” is from Q1 to Q3. • Interquartile range = Q3 – Q1 [Figure: data divided into four 25% segments by Q1, Q2 and Q3]
  • 34. To calculate it, just subtract quartile 1 from quartile 3. Example: 5, 8, 4, 4, 6, 3, 8. • First put the list of numbers in order: 3, 4, 4, 5, 6, 8, 8. • Then cut the list into 4 equal parts. • The quartiles are the cuts. • Q1 (lower quartile) = 4 • Q2 (middle quartile, median) = 5 • Q3 (upper quartile) = 8 • Interquartile range = Q3 – Q1 = 8 – 4 = 4
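The quartile cuts in this example can be sketched in Python. The approach below is one common convention, assumed here for illustration: Q2 is the median, and Q1 and Q3 are the medians of the lower and upper halves, leaving out the middle value when n is odd. Other quartile conventions exist and can give slightly different values on other data. The function name `quartiles` is hypothetical.

```python
from statistics import median

def quartiles(values):
    """Return (Q1, Q2, Q3) using medians of the lower and upper halves."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # Skip the middle value when the number of observations is odd
    upper = ordered[half + 1:] if n % 2 else ordered[half:]
    return median(lower), median(ordered), median(upper)

q1, q2, q3 = quartiles([5, 8, 4, 4, 6, 3, 8])
iqr = q3 - q1

print(q1, q2, q3, iqr)
```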
  • 35. Standard Error • If we take a random sample of size n from the population, and take similar samples over and over again, we will find that every sample has a different mean ($\bar{x}$). • If we make a frequency distribution of all the sample means drawn from the same population, we will find that the distribution of the means is nearly a normal distribution and that the mean of the sample means is practically the same as the population mean (μ).
  • 37. 95% confidence interval • Approximately 2 standard errors above and below the estimate • The range within which 95% of estimates from multiple samples would be expected to lie • Regarded as the range within which the “true population” value probably lies (with 95% certainty)
  • 38. 95% confidence interval of the mean • The SEM is used to describe a 95% confidence interval for an observed mean (95% CI = mean ± 2 SEM). • This confidence interval narrows with larger sample size, since $\text{SE} = \dfrac{\text{SD}}{\sqrt{n}}$.
  • 39. 95% CI of the mean • Mean = 150, S.D. = 30 • If based on 4 values, 95% CI is mean ± 2 SE = $150 \pm 2 \times 30/\sqrt{4}$ = 150 ± 2 × 15, i.e. 120 – 180 • If based on 100 values, 95% CI is mean ± 2 SE = $150 \pm 2 \times 30/\sqrt{100}$ = 150 ± 2 × 3, i.e. 144 – 156
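The effect of sample size on the interval can be reproduced with a small Python sketch. It uses the slide's approximation of mean ± 2 SE with SE = SD / √n; the function name `ci95` is a hypothetical helper.

```python
import math

def ci95(mean_value, sd, n):
    """Approximate 95% CI as mean ± 2 SE, where SE = SD / sqrt(n)."""
    se = sd / math.sqrt(n)
    return mean_value - 2 * se, mean_value + 2 * se

# Same mean and SD, different sample sizes
print(ci95(150, 30, 4))     # wider interval with 4 values
print(ci95(150, 30, 100))   # narrower interval with 100 values
```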
  • 40. Interpreting Estimates with Confidence Intervals • We can be confident that 95% of all sample means based on the given sample size will fall within the range of the CI.
  • 41. Categorical data • For categorical data: compare groups, use proportions.
  • 42. Example • In a prevalence study of hypertension, we found that:
                      Hypertension   No hypertension
        Non-smokers   10 (10%)       90
        Smokers       26 (26%)       74
    • It is visible from the table that the proportion with HTN was higher among smokers. The question that arises is whether HTN was really higher among smokers or whether the difference was merely due to chance.
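The percentages in the table follow directly from the counts, as this Python sketch shows. The dictionary layout and the helper `htn_proportion` are illustrative choices, not part of the slides; the sketch only computes the proportions and does not test whether the difference is due to chance.

```python
# Counts from the prevalence table: hypertension vs no hypertension
counts = {
    "Non-smokers": {"htn": 10, "no_htn": 90},
    "Smokers": {"htn": 26, "no_htn": 74},
}

def htn_proportion(group):
    """Proportion with hypertension = cases / group total."""
    total = group["htn"] + group["no_htn"]
    return group["htn"] / total

proportions = {name: htn_proportion(group) for name, group in counts.items()}

for name, p in proportions.items():
    print(f"{name}: {p:.0%}")
```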
  • 43. Take-home messages: • Look at your data. • For continuous data, summarize with the mean (for central tendency) and SD (for dispersion) only for normal, bell-shaped distributions; otherwise, use the median and percentiles. • Interpret the mean with its confidence interval when inferring to the population. • For categorical data, use proportions.