SlideShare a Scribd company logo
1 of 28
© 2019 AHIMA
ahima.orgahima.org
Calculating and Reporting
Healthcare Statistics, Sixth Edition
Chapter 10:
Descriptive Statistics in
Healthcare
© 2019 AHIMA
ahima.org
Learning Objectives
• Explain how and why percentiles are used
• Compute the percentile from an ungrouped distribution
• Prepare statistics measuring central tendency such as mean,
median or mode
• Prepare statistics measuring variation such as range,
variance, standard deviation, and correlation
2
© 2019 AHIMA
ahima.org
Descriptive Statistics
• Descriptive statistics
• Used to explain data in ways that are manageable and
easily understood
© 2019 AHIMA
ahima.org
Descriptive Statistics (continued)
• Variable
• A variable is a characteristic that can have different
values
• Examples
• A person may be HIV negative or positive
• The variable is HIV status
• The values are negative and positive
• Third-party payers include numerous insurance companies
• The variable is “third-party payer”
• The values are the names of the organizations paying for
services
© 2019 AHIMA
ahima.org
Frequency Distribution
• Shows the values that a variable can take and the
number of observations associated with each value
• A variable is a characteristic or property that may
take on different values
© 2019 AHIMA
ahima.org
Rank
• Denotes a value’s position in a group relative to
other values organized in order of magnitude
• E.g., a value of 50 means that the value is 50th from the
beginning (or end) of a series
• The position of the observation is more important
than the number associated with it
© 2019 AHIMA
ahima.org
Quartile
• Data arranged in four equal parts
• First quartile includes the first 25% of the data
• Second quartile includes the first 25% of the data and
second 25% of the data
• Third quartile extends to 75% of the data
© 2019 AHIMA
ahima.org
Decile
• Data divided into ten equal parts
• First decile correspond to the 10th percentile
• Second decile corresponds to the 20th percentile and so
on
© 2019 AHIMA
ahima.org
Percentile
• Separate the scores into 100 equal parts
• A score at the 54th percentile means that the score is
greater than or equal to 54 percent of the scores in the
group
• Called a percentile rank
© 2019 AHIMA
ahima.org
Measures of Central Tendency
• In summarizing data, it is often useful to have a
single typical number that is representative of the
entire collection of data or specific population
• Such numbers are customarily referred to as
measures of central tendency
• Three measures of central tendency are frequently
used:
• Mean
• Median
• Mode
© 2019 AHIMA
ahima.org
Mean
• The arithmetic average
• It is common to use the term “average” to designate
mean
• To obtain the mean, add all the values in a frequency
distribution and then divide the total by the number of
values in the distribution
• For example, seven hospital inpatients have the following
lengths of stay: 2, 3, 4, 3, 5, 1, and 3
• The frequency distribution in order is 1, 2, 3, 3, 3, 4, and 5
• To construct a frequency distribution, all the values that the
LOS can take are listed, and the number of times a discharged
patient had that particular LOS is entered
© 2019 AHIMA
ahima.org
Mean (continued)
• To determine the mean, we sum all the values in the
frequency distribution and divide by the frequency
• The total is 21
• We arrived at this by adding 2 + 3 + 4 + 3 + 5 + 1 + 3
• To arrive at the mean, divide 21 by the number of
values (or frequency distribution) which in our case is 7
• The mean (or average) equals three days
• Formula
• Total sum of all the values /Number of the values involved = X
or
• Σ scores/N
© 2019 AHIMA
ahima.org
Mean (continued)
• The most common measure of central tendency
• Advantage
• Easy to compute
• Disadvantage
• It is sensitive to extreme values, called outliers that may
distort its representation of the typical value of a set of
numbers
© 2019 AHIMA
ahima.org
Median
• The midpoint (center) of the distribution of values
• Point above and below which 50 percent of the values
lie
• Describes the middle of the data, literally
• The median value is obtained by arranging the
numerical observations in ascending or descending
order and then determining the value in the middle
of the array
• May be the middle observation (if there is an odd
number of values)
• May be a point halfway between the two middle values
(if there is an even number of values)
© 2019 AHIMA
ahima.org
Median (continued)
• To arrive at the median in an even-numbered
distribution, add the two middle values together
and divide by two
• The advantage of using the median as a measure of
central tendency is that it is unaffected by extreme
values
© 2019 AHIMA
ahima.org
Mode
• The value that occurs with highest frequency
• It is the most typical
• Simplest of the measures of central tendency because
it does not require any calculations
• In the case of a small number of values, each value
likely may occur only once and there will be no mode
• The mode is rarely used as a sole descriptive measure
of central tendency because it may not be unique
• There may be two or more modes
• These are called bimodal or multimodal distributions
© 2019 AHIMA
ahima.org
Choice of a Measure of
Central Tendency
• Depends on the number of values and the nature of
their distribution
• Sometimes the mean, median, and mode are identical
• The mean is preferable because it includes information
from all observations
• If the series of values contains a few that are unusually
high or low, the median may represent the series better
than the mean
• The mode is often used in samples where the most
typical value is preferred
• The mode does not have to be numerical
© 2019 AHIMA
ahima.org
Measures of Variation
• Variability
• We also want to consider the spread of the distribution,
which tells us how widely the observations are spread
out around the measure of central tendency
• The mean gives a measure of central tendency of a list
of numbers but tells nothing about the spread of the
numbers in the list
• Variability refers to the difference between each score
and every other score
© 2019 AHIMA
ahima.org
Measures of Variation (continued)
• Range
• The simplest measure of spread
• The range is the difference between the largest and
smallest values in a frequency distribution
• The easiest to compute
• It is the simplest, order-based measure of spread, but it
is far from optimal as a measure of variability for two
reasons
• First, as the sample size increases, the range also tends to
increase
• Second, it is obviously affected by extreme values which are
very different from other values in the data
© 2019 AHIMA
ahima.org
Variance
• A frequency distribution is the average of the
standard deviations from the mean
• The variance of a sample is symbolized by s2
• The variance of a distribution is larger when the
observations are widely spread
© 2019 AHIMA
ahima.org
Variance (continued)
• To calculate the variance, first determine the mean
• Then, the squared deviations of the mean are calculated by subtracting
the mean of the frequency distribution from each value in the
distribution. The difference between the two values is squared (X –
Xbar)2
• The squared differences are summed and divided by N – 1
• S2 = variance
•  = sum
• X = value of a measure or observation
• Xbar = mean
• N = number of values or observations
• The term N – 1 is used in the denominator instead of N to adjust for the
fact that the mean of the sample is used as an estimate of the mean of
the underlying population
© 2019 AHIMA
ahima.org
Standard Deviation (SD)
• Square root of the variance
• Because SD is the square root of the variance, can
be more easily interpreted as a measure of
variation
• When SD is small, less dispersion around the mean
• When SD is large, greater dispersion around the
mean
© 2019 AHIMA
ahima.org
Standard Deviation (continued)
• To understand this concept, it can help to learn
about what mathematicians call normal
distribution of data:
• A normal distribution of data means that most of the
values in a set of data are close to the “average,” while
relatively few values tend to one extreme or the other
• The standard deviation is a statistic that tells you how
closely all the observations are clustered around the
mean in a set of data
• When the examples are closely gathered, and the bell-
shaped curve is steep, the standard deviation is small
• When the examples are spread apart, and the bell curve
is relatively flat, the standard deviation will be larger
© 2019 AHIMA
ahima.org
Standard Deviation (continued)
• Normal distribution means that if the variable on every
person in the population were measured, the frequency
distribution would display a normal pattern, with most
of the measurements near the center of the frequency
• It also would be possible to accurately and summarily
describe the population, with respect to variable, by
calculating the mean, variance, and SD of the values
• In a normal distribution, one SD in both directions from
the mean contains 68.3 percent of all values
• Two SDs in both directions from the mean contain 95.5
percent of all values
• Three SDs in both directions from the mean contain 99.7
percent of all observations
© 2019 AHIMA
ahima.org
Skew
• Not all distributions are symmetrical or have the
usual bell-shaped curve
• Some curves are skewed
• Their numbers do not fall in the middle, but rather on
one end of the curve
• Skewness is the horizontal stretching of a frequency
distribution to one side or the other so that one tail is
longer than the other
• The direction of skewness is on the side of the long tail
• That is, if the longer tail is on the right then the curve is skewed
to the right
• If the longer tail is on the left, then the curve is skewed to the
left
© 2019 AHIMA
ahima.org
Correlation
• Measures the extent of a linear relationship
between two variables and can be described as
strong, moderate, or week and positive or negative
• A positive relationship between two variables is
direct
• Negative relationships are inverse
© 2019 AHIMA
ahima.org
Correlation (continued)
• Correlation does not imply causation
• Just because two variables are highly correlated does
not mean that one causes the other
© 2019 AHIMA
ahima.org
Correlation (continued)
• The value for correlation is always between -1 and
+1
• A correlation of 0 means there is no relationship
between the variables
• -1 implies a perfect negative (inverse) relationship
• +1 implies a perfect positive (direct) relationship

More Related Content

What's hot

Univariate & bivariate analysis
Univariate & bivariate analysisUnivariate & bivariate analysis
Univariate & bivariate analysis
sristi1992
 
Biostatistics CH Lecture Pack
Biostatistics CH Lecture PackBiostatistics CH Lecture Pack
Biostatistics CH Lecture Pack
Shaun Cochrane
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminology
aswhite
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Aiden Yeh
 

What's hot (17)

Assigment 1
Assigment 1Assigment 1
Assigment 1
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
 
Univariate & bivariate analysis
Univariate & bivariate analysisUnivariate & bivariate analysis
Univariate & bivariate analysis
 
Biostatistics CH Lecture Pack
Biostatistics CH Lecture PackBiostatistics CH Lecture Pack
Biostatistics CH Lecture Pack
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Stat11t chapter2
Stat11t chapter2Stat11t chapter2
Stat11t chapter2
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminology
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
STATISTICAL PROCEDURES (Discriptive Statistics).pptx
STATISTICAL PROCEDURES (Discriptive Statistics).pptxSTATISTICAL PROCEDURES (Discriptive Statistics).pptx
STATISTICAL PROCEDURES (Discriptive Statistics).pptx
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Tools and Techniques - Statistics: descriptive statistics
Tools and Techniques - Statistics: descriptive statistics Tools and Techniques - Statistics: descriptive statistics
Tools and Techniques - Statistics: descriptive statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Introduction to statistics...ppt rahul
Introduction to statistics...ppt rahulIntroduction to statistics...ppt rahul
Introduction to statistics...ppt rahul
 

Similar to HI 224 Chapter 10

Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
zerihunnana
 
2_Lecture 2_Confidence_Interval_3.pdf
2_Lecture 2_Confidence_Interval_3.pdf2_Lecture 2_Confidence_Interval_3.pdf
2_Lecture 2_Confidence_Interval_3.pdf
CHANSreyya1
 

Similar to HI 224 Chapter 10 (20)

Statistics four
Statistics fourStatistics four
Statistics four
 
chapter3.ppt
chapter3.pptchapter3.ppt
chapter3.ppt
 
UNIT III -Central Tendency.ppt
UNIT III -Central Tendency.pptUNIT III -Central Tendency.ppt
UNIT III -Central Tendency.ppt
 
Basics of Statistical Analysis
Basics of Statistical AnalysisBasics of Statistical Analysis
Basics of Statistical Analysis
 
Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
 
chapter4.ppt
chapter4.pptchapter4.ppt
chapter4.ppt
 
chapter4.ppt
chapter4.pptchapter4.ppt
chapter4.ppt
 
Variability
VariabilityVariability
Variability
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Measure of Central Tendency
Measure of Central Tendency Measure of Central Tendency
Measure of Central Tendency
 
Central tendency
 Central tendency Central tendency
Central tendency
 
Biostatistics.pptx
Biostatistics.pptxBiostatistics.pptx
Biostatistics.pptx
 
Business statistics
Business statisticsBusiness statistics
Business statistics
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Descriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptxDescriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptx
 
fundamentals of data science and analytics on descriptive analysis.pptx
fundamentals of data science and analytics on descriptive analysis.pptxfundamentals of data science and analytics on descriptive analysis.pptx
fundamentals of data science and analytics on descriptive analysis.pptx
 
MEASURE-OF-VARIABILITY- for students. Ppt
MEASURE-OF-VARIABILITY- for students. PptMEASURE-OF-VARIABILITY- for students. Ppt
MEASURE-OF-VARIABILITY- for students. Ppt
 
HI 224 Chapter 11
HI 224 Chapter 11HI 224 Chapter 11
HI 224 Chapter 11
 
Chapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index NumbersChapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index Numbers
 
2_Lecture 2_Confidence_Interval_3.pdf
2_Lecture 2_Confidence_Interval_3.pdf2_Lecture 2_Confidence_Interval_3.pdf
2_Lecture 2_Confidence_Interval_3.pdf
 

More from BealCollegeOnline (20)

BA650 Week 3 Chapter 3 "Why Change? contemporary drivers and pressures
BA650 Week 3 Chapter 3 "Why Change? contemporary drivers and pressuresBA650 Week 3 Chapter 3 "Why Change? contemporary drivers and pressures
BA650 Week 3 Chapter 3 "Why Change? contemporary drivers and pressures
 
BIO420 Chapter 25
BIO420 Chapter 25BIO420 Chapter 25
BIO420 Chapter 25
 
BIO420 Chapter 24
BIO420 Chapter 24BIO420 Chapter 24
BIO420 Chapter 24
 
BIO420 Chapter 23
BIO420 Chapter 23BIO420 Chapter 23
BIO420 Chapter 23
 
BIO420 Chapter 20
BIO420 Chapter 20BIO420 Chapter 20
BIO420 Chapter 20
 
BIO420 Chapter 18
BIO420 Chapter 18BIO420 Chapter 18
BIO420 Chapter 18
 
BIO420 Chapter 17
BIO420 Chapter 17BIO420 Chapter 17
BIO420 Chapter 17
 
BIO420 Chapter 16
BIO420 Chapter 16BIO420 Chapter 16
BIO420 Chapter 16
 
BIO420 Chapter 13
BIO420 Chapter 13BIO420 Chapter 13
BIO420 Chapter 13
 
BIO420 Chapter 12
BIO420 Chapter 12BIO420 Chapter 12
BIO420 Chapter 12
 
BIO420 Chapter 09
BIO420 Chapter 09BIO420 Chapter 09
BIO420 Chapter 09
 
BIO420 Chapter 08
BIO420 Chapter 08BIO420 Chapter 08
BIO420 Chapter 08
 
BIO420 Chapter 06
BIO420 Chapter 06BIO420 Chapter 06
BIO420 Chapter 06
 
BIO420 Chapter 05
BIO420 Chapter 05BIO420 Chapter 05
BIO420 Chapter 05
 
BIO420 Chapter 04
BIO420 Chapter 04BIO420 Chapter 04
BIO420 Chapter 04
 
BIO420 Chapter 03
BIO420 Chapter 03BIO420 Chapter 03
BIO420 Chapter 03
 
BIO420 Chapter 01
BIO420 Chapter 01BIO420 Chapter 01
BIO420 Chapter 01
 
BA350 Katz esb 6e_chap018_ppt
BA350 Katz esb 6e_chap018_pptBA350 Katz esb 6e_chap018_ppt
BA350 Katz esb 6e_chap018_ppt
 
BA350 Katz esb 6e_chap017_ppt
BA350 Katz esb 6e_chap017_pptBA350 Katz esb 6e_chap017_ppt
BA350 Katz esb 6e_chap017_ppt
 
BA350 Katz esb 6e_chap016_ppt
BA350 Katz esb 6e_chap016_pptBA350 Katz esb 6e_chap016_ppt
BA350 Katz esb 6e_chap016_ppt
 

Recently uploaded

QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonQUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
httgc7rh9c
 
SPLICE Working Group: Reusable Code Examples
SPLICE Working Group:Reusable Code ExamplesSPLICE Working Group:Reusable Code Examples
SPLICE Working Group: Reusable Code Examples
Peter Brusilovsky
 

Recently uploaded (20)

Ernest Hemingway's For Whom the Bell Tolls
Ernest Hemingway's For Whom the Bell TollsErnest Hemingway's For Whom the Bell Tolls
Ernest Hemingway's For Whom the Bell Tolls
 
diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....
 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learning
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx
 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Introduction to TechSoup’s Digital Marketing Services and Use Cases
Introduction to TechSoup’s Digital Marketing  Services and Use CasesIntroduction to TechSoup’s Digital Marketing  Services and Use Cases
Introduction to TechSoup’s Digital Marketing Services and Use Cases
 
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lessonQUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
QUATER-1-PE-HEALTH-LC2- this is just a sample of unpacked lesson
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Including Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdfIncluding Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdf
 
SPLICE Working Group: Reusable Code Examples
SPLICE Working Group:Reusable Code ExamplesSPLICE Working Group:Reusable Code Examples
SPLICE Working Group: Reusable Code Examples
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
 
Andreas Schleicher presents at the launch of What does child empowerment mean...
Andreas Schleicher presents at the launch of What does child empowerment mean...Andreas Schleicher presents at the launch of What does child empowerment mean...
Andreas Schleicher presents at the launch of What does child empowerment mean...
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111
 
How to Add a Tool Tip to a Field in Odoo 17
How to Add a Tool Tip to a Field in Odoo 17How to Add a Tool Tip to a Field in Odoo 17
How to Add a Tool Tip to a Field in Odoo 17
 

HI 224 Chapter 10

  • 1. © 2019 AHIMA ahima.orgahima.org Calculating and Reporting Healthcare Statistics, Sixth Edition Chapter 10: Descriptive Statistics in Healthcare
  • 2. © 2019 AHIMA ahima.org Learning Objectives • Explain how and why percentiles are used • Compute the percentile from an ungrouped distribution • Prepare statistics measuring central tendency such as mean, median or mode • Prepare statistics measuring variation such as range, variance, standard deviation, and correlation 2
  • 3. © 2019 AHIMA ahima.org Descriptive Statistics • Descriptive statistics • Used to explain data in ways that are manageable and easily understood
  • 4. © 2019 AHIMA ahima.org Descriptive Statistics (continued) • Variable • A variable is a characteristic that can have different values • Examples • A person may be HIV negative or positive • The variable is HIV status • The values are negative and positive • Third-party payers include numerous insurance companies • The variable is “third-party payer” • The values are the names of the organizations paying for services
  • 5. © 2019 AHIMA ahima.org Frequency Distribution • Shows the values that a variable can take and the number of observations associated with each value • A variable is a characteristic or property that may take on different values
  • 6. © 2019 AHIMA ahima.org Rank • Denotes a value’s position in a group relative to other values organized in order of magnitude • E.g., a value of 50 means that the value is 50th from the beginning (or end) of a series • The position of the observation is more important than the number associated with it
  • 7. © 2019 AHIMA ahima.org Quartile • Data arranged in four equal parts • First quartile includes the first 25% of the data • Second quartile includes the first 25% of the data and second 25% of the data • Third quartile extends to 75% of the data
  • 8. © 2019 AHIMA ahima.org Decile • Data divided into ten equal parts • First decile correspond to the 10th percentile • Second decile corresponds to the 20th percentile and so on
  • 9. © 2019 AHIMA ahima.org Percentile • Separate the scores into 100 equal parts • A score at the 54th percentile means that the score is greater than or equal to 54 percent of the scores in the group • Called a percentile rank
  • 10. © 2019 AHIMA ahima.org Measures of Central Tendency • In summarizing data, it is often useful to have a single typical number that is representative of the entire collection of data or specific population • Such numbers are customarily referred to as measures of central tendency • Three measures of central tendency are frequently used: • Mean • Median • Mode
  • 11. © 2019 AHIMA ahima.org Mean • The arithmetic average • It is common to use the term “average” to designate mean • To obtain the mean, add all the values in a frequency distribution and then divide the total by the number of values in the distribution • For example, seven hospital inpatients have the following lengths of stay: 2, 3, 4, 3, 5, 1, and 3 • The frequency distribution in order is 1, 2, 3, 3, 3, 4, and 5 • To construct a frequency distribution, all the values that the LOS can take are listed, and the number of times a discharged patient had that particular LOS is entered
  • 12. © 2019 AHIMA ahima.org Mean (continued) • To determine the mean, we sum all the values in the frequency distribution and divide by the frequency • The total is 21 • We arrived at this by adding 2 + 3 + 4 + 3 + 5 + 1 + 3 • To arrive at the mean, divide 21 by the number of values (or frequency distribution) which in our case is 7 • The mean (or average) equals three days • Formula • Total sum of all the values /Number of the values involved = X or • Σ scores/N
  • 13. © 2019 AHIMA ahima.org Mean (continued) • The most common measure of central tendency • Advantage • Easy to compute • Disadvantage • It is sensitive to extreme values, called outliers that may distort its representation of the typical value of a set of numbers
  • 14. © 2019 AHIMA ahima.org Median • The midpoint (center) of the distribution of values • Point above and below which 50 percent of the values lie • Describes the middle of the data, literally • The median value is obtained by arranging the numerical observations in ascending or descending order and then determining the value in the middle of the array • May be the middle observation (if there is an odd number of values) • May be a point halfway between the two middle values (if there is an even number of values)
  • 15. © 2019 AHIMA ahima.org Median (continued) • To arrive at the median in an even-numbered distribution, add the two middle values together and divide by two • The advantage of using the median as a measure of central tendency is that it is unaffected by extreme values
  • 16. © 2019 AHIMA ahima.org Mode • The value that occurs with highest frequency • It is the most typical • Simplest of the measures of central tendency because it does not require any calculations • In the case of a small number of values, each value likely may occur only once and there will be no mode • The mode is rarely used as a sole descriptive measure of central tendency because it may not be unique • There may be two or more modes • These are called bimodal or multimodal distributions
  • 17. © 2019 AHIMA ahima.org Choice of a Measure of Central Tendency • Depends on the number of values and the nature of their distribution • Sometimes the mean, median, and mode are identical • The mean is preferable because it includes information from all observations • If the series of values contains a few that are unusually high or low, the median may represent the series better than the mean • The mode is often used in samples where the most typical value is preferred • The mode does not have to be numerical
  • 18. © 2019 AHIMA ahima.org Measures of Variation • Variability • We also want to consider the spread of the distribution, which tells us how widely the observations are spread out around the measure of central tendency • The mean gives a measure of central tendency of a list of numbers but tells nothing about the spread of the numbers in the list • Variability refers to the difference between each score and every other score
  • 19. © 2019 AHIMA ahima.org Measures of Variation (continued) • Range • The simplest measure of spread • The range is the difference between the largest and smallest values in a frequency distribution • The easiest to compute • It is the simplest, order-based measure of spread, but it is far from optimal as a measure of variability for two reasons • First, as the sample size increases, the range also tends to increase • Second, it is obviously affected by extreme values which are very different from other values in the data
  • 20. © 2019 AHIMA ahima.org Variance • A frequency distribution is the average of the standard deviations from the mean • The variance of a sample is symbolized by s2 • The variance of a distribution is larger when the observations are widely spread
  • 21. © 2019 AHIMA ahima.org Variance (continued) • To calculate the variance, first determine the mean • Then, the squared deviations of the mean are calculated by subtracting the mean of the frequency distribution from each value in the distribution. The difference between the two values is squared (X – Xbar)2 • The squared differences are summed and divided by N – 1 • S2 = variance •  = sum • X = value of a measure or observation • Xbar = mean • N = number of values or observations • The term N – 1 is used in the denominator instead of N to adjust for the fact that the mean of the sample is used as an estimate of the mean of the underlying population
  • 22. © 2019 AHIMA ahima.org Standard Deviation (SD) • Square root of the variance • Because SD is the square root of the variance, can be more easily interpreted as a measure of variation • When SD is small, less dispersion around the mean • When SD is large, greater dispersion around the mean
  • 23. © 2019 AHIMA ahima.org Standard Deviation (continued) • To understand this concept, it can help to learn about what mathematicians call normal distribution of data: • A normal distribution of data means that most of the values in a set of data are close to the “average,” while relatively few values tend to one extreme or the other • The standard deviation is a statistic that tells you how closely all the observations are clustered around the mean in a set of data • When the examples are closely gathered, and the bell- shaped curve is steep, the standard deviation is small • When the examples are spread apart, and the bell curve is relatively flat, the standard deviation will be larger
  • 24. © 2019 AHIMA ahima.org Standard Deviation (continued) • Normal distribution means that if the variable on every person in the population were measured, the frequency distribution would display a normal pattern, with most of the measurements near the center of the frequency • It also would be possible to accurately and summarily describe the population, with respect to variable, by calculating the mean, variance, and SD of the values • In a normal distribution, one SD in both directions from the mean contains 68.3 percent of all values • Two SDs in both directions from the mean contain 95.5 percent of all values • Three SDs in both directions from the mean contain 99.7 percent of all observations
  • 25. © 2019 AHIMA ahima.org Skew • Not all distributions are symmetrical or have the usual bell-shaped curve • Some curves are skewed • Their numbers do not fall in the middle, but rather on one end of the curve • Skewness is the horizontal stretching of a frequency distribution to one side or the other so that one tail is longer than the other • The direction of skewness is on the side of the long tail • That is, if the longer tail is on the right then the curve is skewed to the right • If the longer tail is on the left, then the curve is skewed to the left
  • 26. © 2019 AHIMA ahima.org Correlation • Measures the extent of a linear relationship between two variables and can be described as strong, moderate, or week and positive or negative • A positive relationship between two variables is direct • Negative relationships are inverse
  • 27. © 2019 AHIMA ahima.org Correlation (continued) • Correlation does not imply causation • Just because two variables are highly correlated does not mean that one causes the other
  • 28. © 2019 AHIMA ahima.org Correlation (continued) • The value for correlation is always between -1 and +1 • A correlation of 0 means there is no relationship between the variables • -1 implies a perfect negative (inverse) relationship • +1 implies a perfect positive (direct) relationship