SlideShare a Scribd company logo
CHAPTER 3:
Statistical Description of Data
to accompany
Introduction to Business Statistics
fourth edition, by Ronald M. Weiers
Modified from a Presentation by Priscilla Chaffe-Stengel
Donald N. Stengel
© 2002 The Wadsworth Group
Introduction
• Covers numerical measures used as
descriptive statistics
• Box plots (a.k.a. box-and-whisker plots)
are introduced (separate vignette)
• Not all topics in the text will be covered
in this vignette
Chapter 3 - Learning Objectives
• Describe data using measures of central
tendency and dispersion:
– for a set of individual data values, and
– for a set of grouped data.
• Use the computer to visually represent
data.
• Use the coefficient of correlation to
measure association between two
quantitative variables.
© 2002 The Wadsworth Group
Shape – Center - Spread
• When we gather data, we want to uncover the
“information” in it. One easy way to do that is to
think of: “Shape –Center- Spread”
• Shape – What is the shape of the histogram?
• Center – What is the mean or median?
• Spread – What is the range or standard
deviation?
• Chapter 2 was the graphical approach
• Chapter 3 uses numerical measures
Chapter 3 - Key Terms
• Measures of
Central
Tendency,
The Center
• Mean
– µ, population; , sample
• Weighted Mean
• Median
• Mode
(Note comparison of mean,
median, and mode)
x
© 2002 The Wadsworth Group
Chapter 3 - Key Terms
• Measures of
Dispersion,
The Spread
• Range
• Variance
(Note the computational difference
between s2 and s2.)
• Standard deviation
• Interquartile range
© 2002 The Wadsworth Group
Chapter 3 - Key Terms
• Measures of
Relative
Position
• Quantiles
– Quartiles
– Percentiles
Chapter 3 - Key Terms
• Measures of
Association
• Coefficient of correlation, r
– Direction of the relationship:
direct (r > 0) or inverse (r < 0)
– Strength of the relationship:
When r is close to 1 or –1, the linear
relationship between x and y is
strong. When r is close to 0, the linear
relationship between x and y is weak.
When r = 0, there is no linear
relationship between x and y.
• Coefficient of determination, r2
– The percent of total variation in y
that is explained by variation in x.
© 2002 The Wadsworth Group
The Center: Mean
• Mean
– Arithmetic average = (sum all values)/# of values
» Population: µ = (Sxi)/N
» Sample: = (Sxi)/n
Be sure you know how to get the value easily
from your calculator and computer softwares.
Problem: Calculate the average number of truck shipments
from the United States to five Canadian cities for the
following data given in thousands of bags:
Montreal, 64.0; Ottawa, 15.0; Toronto, 285.0;
Vancouver, 228.0; Winnipeg, 45.0 (Ans: 127.4)
x
© 2002 The Wadsworth Group
The Center: Weighted Mean
• When what you have is grouped data,
compute the mean using µ = (Swixi)/Swi
Problem: Calculate the average profit from truck shipments,
United States to Canada, for the following data given in
thousands of bags and profits per thousand bags:
Montreal 64.0 Ottawa 15.0 Toronto 285.0
$15.00 $13.50 $15.50
Vancouver 228.0 Winnipeg 45.0
$12.00 $14.00
(Ans: $14.04 per thous. bags)
© 2002 The Wadsworth Group
The Center: Median
• To find the median:
1. Put the data in an array.
2A. If the data set has an ODD number of numbers, the median
is the middle value.
2B. If the data set has an EVEN number of numbers, the
median is the AVERAGE of the middle two values.
(Note that the median of an even set of data values is not
necessarily a member of the set of values.)
• The median is particularly useful if there are
outliers in the data set, which otherwise tend to
sway the value of an arithmetic mean.
© 2002 The Wadsworth Group
The Center: Mode
• The mode is the most frequent value.
• While there is just one value for the
mean and one value for the median,
there may be more than one value for
the mode of a data set.
• The mode tends to be less frequently
used than the mean or the median.
© 2002 The Wadsworth Group
Shape: The “shape” of the data is
called its “distribution”?
• If mean = median = mode, the shape of the
distribution is symmetric.
• If mode < median < mean, the shape of the
distribution trails to the right, is positively skewed.
• If mean < median < mode, the shape of the
distribution trails to the left, is negatively skewed.
• Distributions of various “shapes” have different
properties and names such as the “normal”
distribution, which is also known as the “bell
curve” (among mathematicians it is called the
Gaussian Distribution).
The Spread: Range
• The range is the distance between the
smallest and the largest data value in the
set.
• Range = largest value – smallest value
• Sometimes range is reported as an
interval, anchored between the smallest
and largest data value, rather than the
actual width of that interval.
© 2002 The Wadsworth Group
The Spread: Variance
• Variance is one of the most frequently used
measures of spread,
– for population,
– for sample,
• The right side of each equation is often used
as a computational shortcut.
s2 
S(x
i
–)2
N

S(x
i
)2 – N2
N
s2 
S(x
i
– x)2
n–1

S(x
i
)2 –nx2
n–1
© 2002 The Wadsworth Group
The Spread: Standard Deviation
• Since variance is given in squared units,
we often find uses for the standard
deviation, which is the square root of
variance:
– for a population,
– for a sample,
Be sure you know how to get the values easily
from your calculator and computer softwares.
s  s2
s s2
© 2002 The Wadsworth Group
Relative Position - Quartiles
• One of the most frequently used quantiles is the
quartile.
• Quartiles divide the values of a data set into four
subsets of equal size, each comprising 25% of the
observations.
• To find the first, second, and third quartiles:
– 1. Arrange the N data values into an array.
– 2. First quartile, Q1 = data value at position (N + 1)/4
– 3. Second quartile, Q2 = data value at position 2(N + 1)/4
– 4. Third quartile, Q3 = data value at position 3(N + 1)/4
© 2002 The Wadsworth Group

More Related Content

Similar to wk1a-basicstats (2).ppt

3.2 measures of variation
3.2 measures of variation3.2 measures of variation
3.2 measures of variationleblance
 
Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematics
hktripathy
 
Measures of central tendancy easy to under this stats topic
Measures of central tendancy easy to under this stats topicMeasures of central tendancy easy to under this stats topic
Measures of central tendancy easy to under this stats topic
Nishant Taralkar
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mining
hktripathy
 
Measure of Central Tendency
Measure of Central TendencyMeasure of Central Tendency
Measure of Central Tendency
Mysore University Library
 
measures of central tendency in statistics which is essential for business ma...
measures of central tendency in statistics which is essential for business ma...measures of central tendency in statistics which is essential for business ma...
measures of central tendency in statistics which is essential for business ma...
SoujanyaLk1
 
More about data science post.pdf
More about data science post.pdfMore about data science post.pdf
More about data science post.pdf
SheetalDandge
 
Stat11t chapter3
Stat11t chapter3Stat11t chapter3
Stat11t chapter3
raylenepotter
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
jeyanthisivakumar
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
Shiwani Agrawal
 
BMS.ppt
BMS.pptBMS.ppt
Analyzing quantitative data
Analyzing quantitative dataAnalyzing quantitative data
Analyzing quantitative dataBing Villamor
 
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptx
Shashank Mishra
 
CABT Math 8 measures of central tendency and dispersion
CABT Math 8   measures of central tendency and dispersionCABT Math 8   measures of central tendency and dispersion
CABT Math 8 measures of central tendency and dispersionGilbert Joseph Abueg
 
Data Representations
Data RepresentationsData Representations
Data Representationsbujols
 
T7 data analysis
T7 data analysisT7 data analysis
T7 data analysis
kompellark
 
measures of central tendency.pptx
measures of central tendency.pptxmeasures of central tendency.pptx
measures of central tendency.pptx
SabaIrfan11
 
Describing quantitative data with numbers
Describing quantitative data with numbersDescribing quantitative data with numbers
Describing quantitative data with numbersUlster BOCES
 
1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf
thaersyam
 
Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015pdiddyboy2
 

Similar to wk1a-basicstats (2).ppt (20)

3.2 measures of variation
3.2 measures of variation3.2 measures of variation
3.2 measures of variation
 
Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematics
 
Measures of central tendancy easy to under this stats topic
Measures of central tendancy easy to under this stats topicMeasures of central tendancy easy to under this stats topic
Measures of central tendancy easy to under this stats topic
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mining
 
Measure of Central Tendency
Measure of Central TendencyMeasure of Central Tendency
Measure of Central Tendency
 
measures of central tendency in statistics which is essential for business ma...
measures of central tendency in statistics which is essential for business ma...measures of central tendency in statistics which is essential for business ma...
measures of central tendency in statistics which is essential for business ma...
 
More about data science post.pdf
More about data science post.pdfMore about data science post.pdf
More about data science post.pdf
 
Stat11t chapter3
Stat11t chapter3Stat11t chapter3
Stat11t chapter3
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
BMS.ppt
BMS.pptBMS.ppt
BMS.ppt
 
Analyzing quantitative data
Analyzing quantitative dataAnalyzing quantitative data
Analyzing quantitative data
 
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptx
 
CABT Math 8 measures of central tendency and dispersion
CABT Math 8   measures of central tendency and dispersionCABT Math 8   measures of central tendency and dispersion
CABT Math 8 measures of central tendency and dispersion
 
Data Representations
Data RepresentationsData Representations
Data Representations
 
T7 data analysis
T7 data analysisT7 data analysis
T7 data analysis
 
measures of central tendency.pptx
measures of central tendency.pptxmeasures of central tendency.pptx
measures of central tendency.pptx
 
Describing quantitative data with numbers
Describing quantitative data with numbersDescribing quantitative data with numbers
Describing quantitative data with numbers
 
1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf
 
Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015Pm m23 & pmnm06 week 3 lectures 2015
Pm m23 & pmnm06 week 3 lectures 2015
 

More from ssuser0be977

IBM list NM portal not updategbbbbbnnmhhh
IBM list NM portal not updategbbbbbnnmhhhIBM list NM portal not updategbbbbbnnmhhh
IBM list NM portal not updategbbbbbnnmhhh
ssuser0be977
 
lect23_optimization.ppt
lect23_optimization.pptlect23_optimization.ppt
lect23_optimization.ppt
ssuser0be977
 
CS540-2-lecture11 - Copy.ppt
CS540-2-lecture11 - Copy.pptCS540-2-lecture11 - Copy.ppt
CS540-2-lecture11 - Copy.ppt
ssuser0be977
 
wk1a-basicstats.ppt
wk1a-basicstats.pptwk1a-basicstats.ppt
wk1a-basicstats.ppt
ssuser0be977
 
dynamicList.ppt
dynamicList.pptdynamicList.ppt
dynamicList.ppt
ssuser0be977
 
lect08.ppt
lect08.pptlect08.ppt
lect08.ppt
ssuser0be977
 
11CS10033.pptx
11CS10033.pptx11CS10033.pptx
11CS10033.pptx
ssuser0be977
 

More from ssuser0be977 (8)

IBM list NM portal not updategbbbbbnnmhhh
IBM list NM portal not updategbbbbbnnmhhhIBM list NM portal not updategbbbbbnnmhhh
IBM list NM portal not updategbbbbbnnmhhh
 
lect23_optimization.ppt
lect23_optimization.pptlect23_optimization.ppt
lect23_optimization.ppt
 
CS540-2-lecture11 - Copy.ppt
CS540-2-lecture11 - Copy.pptCS540-2-lecture11 - Copy.ppt
CS540-2-lecture11 - Copy.ppt
 
wk1a-basicstats.ppt
wk1a-basicstats.pptwk1a-basicstats.ppt
wk1a-basicstats.ppt
 
dynamicList.ppt
dynamicList.pptdynamicList.ppt
dynamicList.ppt
 
lect08.ppt
lect08.pptlect08.ppt
lect08.ppt
 
11CS10033.pptx
11CS10033.pptx11CS10033.pptx
11CS10033.pptx
 
dsa1.ppt
dsa1.pptdsa1.ppt
dsa1.ppt
 

Recently uploaded

DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
WENKENLI1
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
Jayaprasanna4
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
Robbie Edward Sayers
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
ankuprajapati0525
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 

Recently uploaded (20)

DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
 
ethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.pptethical hacking-mobile hacking methods.ppt
ethical hacking-mobile hacking methods.ppt
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
 

wk1a-basicstats (2).ppt

  • 1. CHAPTER 3: Statistical Description of Data to accompany Introduction to Business Statistics fourth edition, by Ronald M. Weiers Modified from a Presentation by Priscilla Chaffe-Stengel Donald N. Stengel © 2002 The Wadsworth Group
  • 2. Introduction • Covers numerical measures used as descriptive statistics • Box plots (a.k.a. box-and-whisker plots) are introduced (separate vignette) • Not all topics in the text will be covered in this vignette
  • 3. Chapter 3 - Learning Objectives • Describe data using measures of central tendency and dispersion: – for a set of individual data values, and – for a set of grouped data. • Use the computer to visually represent data. • Use the coefficient of correlation to measure association between two quantitative variables. © 2002 The Wadsworth Group
  • 4. Shape – Center - Spread • When we gather data, we want to uncover the “information” in it. One easy way to do that is to think of: “Shape –Center- Spread” • Shape – What is the shape of the histogram? • Center – What is the mean or median? • Spread – What is the range or standard deviation? • Chapter 2 was the graphical approach • Chapter 3 uses numerical measures
  • 5. Chapter 3 - Key Terms • Measures of Central Tendency, The Center • Mean – µ, population; , sample • Weighted Mean • Median • Mode (Note comparison of mean, median, and mode) x © 2002 The Wadsworth Group
  • 6. Chapter 3 - Key Terms • Measures of Dispersion, The Spread • Range • Variance (Note the computational difference between s2 and s2.) • Standard deviation • Interquartile range © 2002 The Wadsworth Group
  • 7. Chapter 3 - Key Terms • Measures of Relative Position • Quantiles – Quartiles – Percentiles
  • 8. Chapter 3 - Key Terms • Measures of Association • Coefficient of correlation, r – Direction of the relationship: direct (r > 0) or inverse (r < 0) – Strength of the relationship: When r is close to 1 or –1, the linear relationship between x and y is strong. When r is close to 0, the linear relationship between x and y is weak. When r = 0, there is no linear relationship between x and y. • Coefficient of determination, r2 – The percent of total variation in y that is explained by variation in x. © 2002 The Wadsworth Group
  • 9. The Center: Mean • Mean – Arithmetic average = (sum all values)/# of values » Population: µ = (Sxi)/N » Sample: = (Sxi)/n Be sure you know how to get the value easily from your calculator and computer softwares. Problem: Calculate the average number of truck shipments from the United States to five Canadian cities for the following data given in thousands of bags: Montreal, 64.0; Ottawa, 15.0; Toronto, 285.0; Vancouver, 228.0; Winnipeg, 45.0 (Ans: 127.4) x © 2002 The Wadsworth Group
  • 10. The Center: Weighted Mean • When what you have is grouped data, compute the mean using µ = (Swixi)/Swi Problem: Calculate the average profit from truck shipments, United States to Canada, for the following data given in thousands of bags and profits per thousand bags: Montreal 64.0 Ottawa 15.0 Toronto 285.0 $15.00 $13.50 $15.50 Vancouver 228.0 Winnipeg 45.0 $12.00 $14.00 (Ans: $14.04 per thous. bags) © 2002 The Wadsworth Group
  • 11. The Center: Median • To find the median: 1. Put the data in an array. 2A. If the data set has an ODD number of numbers, the median is the middle value. 2B. If the data set has an EVEN number of numbers, the median is the AVERAGE of the middle two values. (Note that the median of an even set of data values is not necessarily a member of the set of values.) • The median is particularly useful if there are outliers in the data set, which otherwise tend to sway the value of an arithmetic mean. © 2002 The Wadsworth Group
  • 12. The Center: Mode • The mode is the most frequent value. • While there is just one value for the mean and one value for the median, there may be more than one value for the mode of a data set. • The mode tends to be less frequently used than the mean or the median. © 2002 The Wadsworth Group
  • 13. Shape: The “shape” of the data is called its “distribution”? • If mean = median = mode, the shape of the distribution is symmetric. • If mode < median < mean, the shape of the distribution trails to the right, is positively skewed. • If mean < median < mode, the shape of the distribution trails to the left, is negatively skewed. • Distributions of various “shapes” have different properties and names such as the “normal” distribution, which is also known as the “bell curve” (among mathematicians it is called the Gaussian Distribution).
  • 14. The Spread: Range • The range is the distance between the smallest and the largest data value in the set. • Range = largest value – smallest value • Sometimes range is reported as an interval, anchored between the smallest and largest data value, rather than the actual width of that interval. © 2002 The Wadsworth Group
  • 15. The Spread: Variance • Variance is one of the most frequently used measures of spread, – for population, – for sample, • The right side of each equation is often used as a computational shortcut. s2  S(x i –)2 N  S(x i )2 – N2 N s2  S(x i – x)2 n–1  S(x i )2 –nx2 n–1 © 2002 The Wadsworth Group
  • 16. The Spread: Standard Deviation • Since variance is given in squared units, we often find uses for the standard deviation, which is the square root of variance: – for a population, – for a sample, Be sure you know how to get the values easily from your calculator and computer softwares. s  s2 s s2 © 2002 The Wadsworth Group
  • 17. Relative Position - Quartiles • One of the most frequently used quantiles is the quartile. • Quartiles divide the values of a data set into four subsets of equal size, each comprising 25% of the observations. • To find the first, second, and third quartiles: – 1. Arrange the N data values into an array. – 2. First quartile, Q1 = data value at position (N + 1)/4 – 3. Second quartile, Q2 = data value at position 2(N + 1)/4 – 4. Third quartile, Q3 = data value at position 3(N + 1)/4 © 2002 The Wadsworth Group