SlideShare a Scribd company logo
Descriptive Statistics
by Indramani Tripathi
Measures of Central Tendency
And Dispersion
Measures of Central Tendency
 1. Mode = can be used for any kind of data
but only measure of central tendency for
nominal or qualitative data.
 Formula: value that occurs most often or the
category or interval with highest frequency.
 Note: Omit Formula 3.1 Variation Ratio in
Healey and Prus 2nd
Cdn.
Example for Nominal Variables:

 Religion frequency cf proportion % Cum%
 Catholic 17 17 .41 41 41
 Protestant 4 21 .10 10 51
 Jewish 2 23 .05 5 56
 Muslim 1 24 .02 2 58
 Other 9 33 .22 9 80
 None 8 41 .20 20 100

 Total 41 1.00 100%
 Central Tendency: MODE = largest category = Catholic
Central Tendency (cont.)
 2. Median = exact centre or middle of
ordered data. The 50th percentile.
 Formula:
 Array data.
 When sample size is even, median falls
halfway between two middle numbers.
 To calculate: find (n/2) and (n/2)+1, and
divide the total by 2 to find the exact median.
 When sample size is odd, median is exact
middle (n+1) /2
Example for Raw Data:
 Suppose you have the following set of test
scores:
 66, 89, 41, 98, 76, 77, 68, 60, 60, 67, 69, 66,
98, 52, 74, 66, 89, 95, 66, 69
 1. Array (put in order) your data:
 98 98 95 89 89 77 76 74 69 69
68 67 66 66 66 66 60 60 52 41
N = 20 (N is even)
To calculate:
- find middle numbers(n/2)+(n/2 )+1
- add together the two middle numbers
- divide the total by 2
 First middle number: (20/2) = the 10th
number
 2nd
middle number: (20/2)+1 = the 11th
number
 Look at data:
the middle numbers are 69 and 68
 The median would be (69+68)/2 = 68.5
Median for Aggregate (grouped) Data
 This formula is shown in Healey 1st
Cdn
Edition but NOT in 2/3 Cdn
 We will NOT COVER this one!
Properties of median:
 - for numerical data at interval or ordinal level
 -"balance point“
 -not affected by outliers
 -median is appropriate when distribution is
highly skewed.
3. Mean for Raw Data
 The mean is the sum of measurements /
number of subjects
 Formula: (X-bar) = ΣXi / N
 Data (from above):
66, 89, 41, 98, 76, 77, 68, 60, 60, 67, 69, 66,
98, 52, 74, 66, 89, 95, 66, 69
Example for Mean
 Formula: = ΣXi / N
= 1446 / 20
= 72.3
The mean for these test scores is 72.30
Mean for Aggregate (Grouped) Data
(Note: not in text but covered in class)
 To calculate the mean for grouped data, you
need a frequency table that includes a
column for the midpoints, for the product of
the frequencies times the midpoints (fm).
Formula: = Σ (fm)
N
Frequency table:
Score f m* (fm)
41-50 1 45.5 45.5
51-60 3 55.5 166.5
61-70 8 65.5 524
71-80 3 75.5 226.5
81-90 2 85.5 171
91-100 3 95.5 286.5
N = 20 Σ (fm) = 1420
* Find midpoints first
Calculating Mean for Grouped Data:
Formula: = Σ (fm)
N
= 1420 / 20
= 71
The mean for the grouped data is 71.
Properties of the Mean:
- only for numerical data at interval level
- "balance point“
- can be affected by outliers = skewed distribution
- tail becomes elongated and the mean is pulled in
direction of outlier.
Example…
no outlier:
$30000, 30000, 35000, 25000, 30000 then mean = $30000
but if outlier is present, then:
$130000, 30000, 35000, 25000, 30000 then mean = $50000
(the mean is pulled up or down in the direction of the outlier)
NOTE:
 When distribution is symmetric,
mean = median = mode
 For skewed, mean will lie in direction of skew.
 i.e. skewed to right (tail pulled to right)
mean > median (positive skew)
 skewed to left (tail pulled to left)
median > mean (negative skew)
Measures of Dispersion
 Describe how variable the data are.
 i.e. how spread out around the mean
 Also called measures of variation or
variability
Variability for Non-numerical Data
(Nominal or Ordinal Level Data)
 Measures of variability for non-numerical
nominal or ordinal) data are rarely used
 We will not be covering these in class
 Omit Formula 4.1 IQV in Healey and Prus
1st
Canadian Edition
 Omit Formula 3.1 Variation Ratio in Healey
and Prus 2/3 Canadian Edition
2. Range (for numerical data)
Range = difference between largest and
smallest observations
i.e. if data are $130000, 35000, 30000, 30000,
30000, 30000, 25000, 25000
then range = 130000 - 25000 = $105000
Interquartile Range (Q):
- This is the difference between the 75th and the 25th
percentiles (the middle 50%)
- Gives better idea than range of what the middle of
the distribution looks like.
Formula: Q = Q3 - Q1 (where Q3 = N x .75,
and Q1 = N x .25)
Using above data: Q = Q3 - Q1 = (6th
– 2nd
case)
= $30000-25000 =$5000
The interquartile range (Q) is $5000.
3. Variance and Standard Deviation:
 For raw data at the interval/ratio level.
 Most common measure of variation.
 The numerator in the formula is known as
the sum of squares, and the denominator is
either the population size N or the sample
size n-1
 The variance is denoted by S2
and the
standard deviation, which is the square root
of the variance, by S
Definitional Formula for Variance and
Standard Deviation:
 Variance: s2
= Σ (xi - )2
/ N
 Standard Deviation:
s =
 (the standard deviation is the square root of
the variance; the variance is simply the
standard deviation squared)
Example for S and S2
:
 Data: 66, 89, 41, 98, 76, 77, 68, 60, 60, 67,
69, 66, 98, 52, 74, 66, 89, 95, 66, 69
1. Find ∑ Xi
2
: Square each Xi and find total.
2. Find (∑ Xi)2
: Find total of all Xi and square.
3. Substitute above and N into formula for S.
4. For S2
, simply square S.
S = 14.75 S2
= 217.71
A working formula for the standard
deviation:
Note: the definitional formula for standard deviation is
not practical for use with data when N>10.
The working formula, which is much easier to do on
your calculator, should be used instead.
Both formulae give exactly the same result. Try it!
2
2
X
N
X
S i
−=
∑
Properties of S:
 always greater than or equal to 0
 the greater the variation about mean,
the greater S is
 n-1 corrects for bias when using sample data. S
tends to underestimate the real population standard
deviation when based on sample data so to correct
for this, we use n-1. The larger the sample size, the
smaller difference this correction makes. When
calculating the standard deviation for the whole
population, use N in the denominator.
NOTE:
 σ, N and Mu (µ) denote population
parameters
 s, n, x-bar ( ) denote sample statistics
Remember the Rounding Rules!
 Always use as many decimal places as your
calculator can handle.
 Round your final answer to 2 decimal places,
rounding to nearest number.
 Engineers Rule: When last digit is exactly 5
(followed by 0’s), round the digit before the
last digit to nearest EVEN number.
Homework Questions
 Healey and Prus 1e:
 #3.1, #3.5, #3.11 and 4.9, #4.15
 Healey and Prus 2/3e
 #3.1, #3.5, #3.11 (compute s for 8 nations also), #3.15
 SPSS:
 Read the SPSS sections for Ch. 3 and 4 in 1st
Cdn. Edition
and for Ch. 4 in 2/3 Cdn. Edition
 Try some of the SPSS exercises for practice

More Related Content

What's hot

Measures Of Central Tendencies
Measures Of Central TendenciesMeasures Of Central Tendencies
Measures Of Central Tendencies
Adams City High School
 
Relative frequency distribution
Relative frequency distributionRelative frequency distribution
Relative frequency distribution
Nadeem Uddin
 
Introduction to Descriptive Statistics
Introduction to Descriptive StatisticsIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics
Sanju Rusara Seneviratne
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
Neny Isharyanti
 
Basic Statistics
Basic  StatisticsBasic  Statistics
Basic Statistics
Chie Pegollo
 
Binary Logistic Regression
Binary Logistic RegressionBinary Logistic Regression
Binary Logistic Regression
Seth Anandaram Jaipuria College
 
Histogram
HistogramHistogram
Histogram
Samiul Ehsan
 
Statistics
StatisticsStatistics
Statistics
itutor
 
Basics of SPSS, Part 2
Basics of SPSS, Part 2Basics of SPSS, Part 2
Basics of SPSS, Part 2
Christine Pereira Ask Brunel
 
Measures of Dispersion - Thiyagu
Measures of Dispersion - ThiyaguMeasures of Dispersion - Thiyagu
Measures of Dispersion - Thiyagu
Thiyagu K
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
Southern Range, Berhampur, Odisha
 
Moment Generating Functions
Moment Generating FunctionsMoment Generating Functions
Moment Generating Functions
DataminingTools Inc
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
Andi Koentary
 
Introduction To SPSS
Introduction To SPSSIntroduction To SPSS
Introduction To SPSS
ThankGod Damion Okpe
 
Bbs10 ppt ch03
Bbs10 ppt ch03Bbs10 ppt ch03
Bbs10 ppt ch03
Anwar Afridi
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Aiden Yeh
 
Measures of Variation
Measures of Variation Measures of Variation
Measures of Variation
Long Beach City College
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Teachers Mitraa
 
MEASURESOF CENTRAL TENDENCY
MEASURESOF CENTRAL TENDENCYMEASURESOF CENTRAL TENDENCY
MEASURESOF CENTRAL TENDENCY
Richelle Saberon
 
Meaning and Importance of Statistics
Meaning and Importance of StatisticsMeaning and Importance of Statistics
Meaning and Importance of Statistics
Flipped Channel
 

What's hot (20)

Measures Of Central Tendencies
Measures Of Central TendenciesMeasures Of Central Tendencies
Measures Of Central Tendencies
 
Relative frequency distribution
Relative frequency distributionRelative frequency distribution
Relative frequency distribution
 
Introduction to Descriptive Statistics
Introduction to Descriptive StatisticsIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Basic Statistics
Basic  StatisticsBasic  Statistics
Basic Statistics
 
Binary Logistic Regression
Binary Logistic RegressionBinary Logistic Regression
Binary Logistic Regression
 
Histogram
HistogramHistogram
Histogram
 
Statistics
StatisticsStatistics
Statistics
 
Basics of SPSS, Part 2
Basics of SPSS, Part 2Basics of SPSS, Part 2
Basics of SPSS, Part 2
 
Measures of Dispersion - Thiyagu
Measures of Dispersion - ThiyaguMeasures of Dispersion - Thiyagu
Measures of Dispersion - Thiyagu
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
 
Moment Generating Functions
Moment Generating FunctionsMoment Generating Functions
Moment Generating Functions
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Introduction To SPSS
Introduction To SPSSIntroduction To SPSS
Introduction To SPSS
 
Bbs10 ppt ch03
Bbs10 ppt ch03Bbs10 ppt ch03
Bbs10 ppt ch03
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Measures of Variation
Measures of Variation Measures of Variation
Measures of Variation
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
MEASURESOF CENTRAL TENDENCY
MEASURESOF CENTRAL TENDENCYMEASURESOF CENTRAL TENDENCY
MEASURESOF CENTRAL TENDENCY
 
Meaning and Importance of Statistics
Meaning and Importance of StatisticsMeaning and Importance of Statistics
Meaning and Importance of Statistics
 

Similar to Descriptive statistics

QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
Numerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsNumerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec doms
Babasab Patil
 
Empirics of standard deviation
Empirics of standard deviationEmpirics of standard deviation
Empirics of standard deviation
Adebanji Ayeni
 
Measures of Central Tendency, Variability and Shapes
Measures of Central Tendency, Variability and ShapesMeasures of Central Tendency, Variability and Shapes
Measures of Central Tendency, Variability and Shapes
ScholarsPoint1
 
Measures of dispersion or variation
Measures of dispersion or variationMeasures of dispersion or variation
Measures of dispersion or variation
Raj Teotia
 
Statistics
StatisticsStatistics
Statistics
dineshmeena53
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
Vanmala Buchke
 
S1 pn
S1 pnS1 pn
Bio statistics
Bio statisticsBio statistics
Bio statistics
Nc Das
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Burak Mızrak
 
Central tendency _dispersion
Central tendency _dispersionCentral tendency _dispersion
Central tendency _dispersion
Kirti Gupta
 
Overview of Advance Marketing Research
Overview of Advance Marketing ResearchOverview of Advance Marketing Research
Overview of Advance Marketing Research
Enamul Islam
 
Describing quantitative data with numbers
Describing quantitative data with numbersDescribing quantitative data with numbers
Describing quantitative data with numbers
Ulster BOCES
 
variance
variance variance
variance
ShahwarKhan16
 
Application of Statistical and mathematical equations in Chemistry Part 2
Application of Statistical and mathematical equations in Chemistry Part 2Application of Statistical and mathematical equations in Chemistry Part 2
Application of Statistical and mathematical equations in Chemistry Part 2
Awad Albalwi
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
kreshajay
 
Biostatistics Survey Project on Menstrual cup v/s Sanitary Pads
Biostatistics Survey Project on Menstrual cup v/s Sanitary PadsBiostatistics Survey Project on Menstrual cup v/s Sanitary Pads
Biostatistics Survey Project on Menstrual cup v/s Sanitary Pads
Cheshta Rawat
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
Anusuya123
 
Statistics in Research
Statistics in ResearchStatistics in Research
Statistics in Research
Kent Kawashima
 

Similar to Descriptive statistics (20)

QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
Numerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsNumerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec doms
 
Empirics of standard deviation
Empirics of standard deviationEmpirics of standard deviation
Empirics of standard deviation
 
Measures of Central Tendency, Variability and Shapes
Measures of Central Tendency, Variability and ShapesMeasures of Central Tendency, Variability and Shapes
Measures of Central Tendency, Variability and Shapes
 
Measures of dispersion or variation
Measures of dispersion or variationMeasures of dispersion or variation
Measures of dispersion or variation
 
Statistics
StatisticsStatistics
Statistics
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
 
S1 pn
S1 pnS1 pn
S1 pn
 
Bio statistics
Bio statisticsBio statistics
Bio statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Central tendency _dispersion
Central tendency _dispersionCentral tendency _dispersion
Central tendency _dispersion
 
Overview of Advance Marketing Research
Overview of Advance Marketing ResearchOverview of Advance Marketing Research
Overview of Advance Marketing Research
 
Describing quantitative data with numbers
Describing quantitative data with numbersDescribing quantitative data with numbers
Describing quantitative data with numbers
 
variance
variance variance
variance
 
Application of Statistical and mathematical equations in Chemistry Part 2
Application of Statistical and mathematical equations in Chemistry Part 2Application of Statistical and mathematical equations in Chemistry Part 2
Application of Statistical and mathematical equations in Chemistry Part 2
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Biostatistics Survey Project on Menstrual cup v/s Sanitary Pads
Biostatistics Survey Project on Menstrual cup v/s Sanitary PadsBiostatistics Survey Project on Menstrual cup v/s Sanitary Pads
Biostatistics Survey Project on Menstrual cup v/s Sanitary Pads
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
 
Statistics in Research
Statistics in ResearchStatistics in Research
Statistics in Research
 

Recently uploaded

Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
Celine George
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
Jean Carlos Nunes Paixão
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
ak6969907
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
adhitya5119
 
Smart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICTSmart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICT
simonomuemu
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
National Information Standards Organization (NISO)
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
taiba qazi
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
IreneSebastianRueco1
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
chanes7
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
Dr. Mulla Adam Ali
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
AyyanKhan40
 

Recently uploaded (20)

Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
 
Smart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICTSmart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICT
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
 

Descriptive statistics

  • 1. Descriptive Statistics by Indramani Tripathi Measures of Central Tendency And Dispersion
  • 2. Measures of Central Tendency  1. Mode = can be used for any kind of data but only measure of central tendency for nominal or qualitative data.  Formula: value that occurs most often or the category or interval with highest frequency.  Note: Omit Formula 3.1 Variation Ratio in Healey and Prus 2nd Cdn.
  • 3. Example for Nominal Variables:   Religion frequency cf proportion % Cum%  Catholic 17 17 .41 41 41  Protestant 4 21 .10 10 51  Jewish 2 23 .05 5 56  Muslim 1 24 .02 2 58  Other 9 33 .22 9 80  None 8 41 .20 20 100   Total 41 1.00 100%  Central Tendency: MODE = largest category = Catholic
  • 4. Central Tendency (cont.)  2. Median = exact centre or middle of ordered data. The 50th percentile.  Formula:  Array data.  When sample size is even, median falls halfway between two middle numbers.  To calculate: find (n/2) and (n/2)+1, and divide the total by 2 to find the exact median.  When sample size is odd, median is exact middle (n+1) /2
  • 5. Example for Raw Data:  Suppose you have the following set of test scores:  66, 89, 41, 98, 76, 77, 68, 60, 60, 67, 69, 66, 98, 52, 74, 66, 89, 95, 66, 69  1. Array (put in order) your data:  98 98 95 89 89 77 76 74 69 69 68 67 66 66 66 66 60 60 52 41 N = 20 (N is even)
  • 6. To calculate: - find middle numbers(n/2)+(n/2 )+1 - add together the two middle numbers - divide the total by 2  First middle number: (20/2) = the 10th number  2nd middle number: (20/2)+1 = the 11th number  Look at data: the middle numbers are 69 and 68  The median would be (69+68)/2 = 68.5
  • 7. Median for Aggregate (grouped) Data  This formula is shown in Healey 1st Cdn Edition but NOT in 2/3 Cdn  We will NOT COVER this one!
  • 8. Properties of median:  - for numerical data at interval or ordinal level  -"balance point“  -not affected by outliers  -median is appropriate when distribution is highly skewed.
  • 9. 3. Mean for Raw Data  The mean is the sum of measurements / number of subjects  Formula: (X-bar) = ΣXi / N  Data (from above): 66, 89, 41, 98, 76, 77, 68, 60, 60, 67, 69, 66, 98, 52, 74, 66, 89, 95, 66, 69
  • 10. Example for Mean  Formula: = ΣXi / N = 1446 / 20 = 72.3 The mean for these test scores is 72.30
  • 11. Mean for Aggregate (Grouped) Data (Note: not in text but covered in class)  To calculate the mean for grouped data, you need a frequency table that includes a column for the midpoints, for the product of the frequencies times the midpoints (fm). Formula: = Σ (fm) N
  • 12. Frequency table: Score f m* (fm) 41-50 1 45.5 45.5 51-60 3 55.5 166.5 61-70 8 65.5 524 71-80 3 75.5 226.5 81-90 2 85.5 171 91-100 3 95.5 286.5 N = 20 Σ (fm) = 1420 * Find midpoints first
  • 13. Calculating Mean for Grouped Data: Formula: = Σ (fm) N = 1420 / 20 = 71 The mean for the grouped data is 71.
  • 14. Properties of the Mean: - only for numerical data at interval level - "balance point“ - can be affected by outliers = skewed distribution - tail becomes elongated and the mean is pulled in direction of outlier. Example… no outlier: $30000, 30000, 35000, 25000, 30000 then mean = $30000 but if outlier is present, then: $130000, 30000, 35000, 25000, 30000 then mean = $50000 (the mean is pulled up or down in the direction of the outlier)
  • 15. NOTE:  When distribution is symmetric, mean = median = mode  For skewed, mean will lie in direction of skew.  i.e. skewed to right (tail pulled to right) mean > median (positive skew)  skewed to left (tail pulled to left) median > mean (negative skew)
  • 16. Measures of Dispersion  Describe how variable the data are.  i.e. how spread out around the mean  Also called measures of variation or variability
  • 17. Variability for Non-numerical Data (Nominal or Ordinal Level Data)  Measures of variability for non-numerical nominal or ordinal) data are rarely used  We will not be covering these in class  Omit Formula 4.1 IQV in Healey and Prus 1st Canadian Edition  Omit Formula 3.1 Variation Ratio in Healey and Prus 2/3 Canadian Edition
  • 18. 2. Range (for numerical data) Range = difference between largest and smallest observations i.e. if data are $130000, 35000, 30000, 30000, 30000, 30000, 25000, 25000 then range = 130000 - 25000 = $105000
  • 19. Interquartile Range (Q): - This is the difference between the 75th and the 25th percentiles (the middle 50%) - Gives better idea than range of what the middle of the distribution looks like. Formula: Q = Q3 - Q1 (where Q3 = N x .75, and Q1 = N x .25) Using above data: Q = Q3 - Q1 = (6th – 2nd case) = $30000-25000 =$5000 The interquartile range (Q) is $5000.
  • 20. 3. Variance and Standard Deviation:  For raw data at the interval/ratio level.  Most common measure of variation.  The numerator in the formula is known as the sum of squares, and the denominator is either the population size N or the sample size n-1  The variance is denoted by S2 and the standard deviation, which is the square root of the variance, by S
  • 21. Definitional Formula for Variance and Standard Deviation:  Variance: s2 = Σ (xi - )2 / N  Standard Deviation: s =  (the standard deviation is the square root of the variance; the variance is simply the standard deviation squared)
  • 22. Example for S and S2 :  Data: 66, 89, 41, 98, 76, 77, 68, 60, 60, 67, 69, 66, 98, 52, 74, 66, 89, 95, 66, 69 1. Find ∑ Xi 2 : Square each Xi and find total. 2. Find (∑ Xi)2 : Find total of all Xi and square. 3. Substitute above and N into formula for S. 4. For S2 , simply square S. S = 14.75 S2 = 217.71
  • 23. A working formula for the standard deviation: Note: the definitional formula for standard deviation is not practical for use with data when N>10. The working formula, which is much easier to do on your calculator, should be used instead. Both formulae give exactly the same result. Try it! 2 2 X N X S i −= ∑
  • 24. Properties of S:  always greater than or equal to 0  the greater the variation about mean, the greater S is  n-1 corrects for bias when using sample data. S tends to underestimate the real population standard deviation when based on sample data so to correct for this, we use n-1. The larger the sample size, the smaller difference this correction makes. When calculating the standard deviation for the whole population, use N in the denominator.
  • 25. NOTE:  σ, N and Mu (µ) denote population parameters  s, n, x-bar ( ) denote sample statistics
  • 26. Remember the Rounding Rules!  Always use as many decimal places as your calculator can handle.  Round your final answer to 2 decimal places, rounding to nearest number.  Engineers Rule: When last digit is exactly 5 (followed by 0’s), round the digit before the last digit to nearest EVEN number.
  • 27. Homework Questions  Healey and Prus 1e:  #3.1, #3.5, #3.11 and 4.9, #4.15  Healey and Prus 2/3e  #3.1, #3.5, #3.11 (compute s for 8 nations also), #3.15  SPSS:  Read the SPSS sections for Ch. 3 and 4 in 1st Cdn. Edition and for Ch. 4 in 2/3 Cdn. Edition  Try some of the SPSS exercises for practice