2. What are we going to learn? Measures of Center
Mean, Median, Mode Range Standard Deviation
Slide *
Arithmetic Mean
(Mean)
You hear people sometimes say average
the measure of center obtained by adding the values and
dividing the total by the number of values
Definition
N
Slide *
Notation
denotes the sum of a set of values (sigma)
x is the variable usually used to represent the individual
data values.
n represents the number of values in a sample (because it is
lower case)
N represents the number of values in a population (because it
is upper case)
3. Slide *
Notation
µ is pronounced ‘mew’ and denotes the mean of all values
in a population
is pronounced ‘x-bar’ and denotes the mean of a set of sample
values
Symbol for the sample mean
Symbol for the population mean
This equation reads the mean equals the sum of each score
divided by n
x =
n
x
N
µ =
Slide *
Let’s Practice Some Mean Exercises
Get out a calculator
Determine the mean for each of the data sets:
5. The table shows daily sales in the first 2 weeks of July in
Windsor Mill at the sandwhich shop.
DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
1345July 1485
Slide *
Let’s Try this Exercise
What is the mean sales for the data?
56.7
59
60
63.5
DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
6. 1345July 1485
Slide *
Answer
What is the mean sales for the data?
56.7
59
60
63.5
DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
1345July 1485
7. Slide *
Definitions
Mode
the value that occurs most frequently
Mode is not always unique
A data set may be:
Bimodal (two modes)
Multimodal (many modes)
No Mode
Mode is the only measure of central tendency that can be used
with nominal data
8. Slide *
a. 5.40 1.10 0.42 0.73 0.48 1.10
b. 27 27 27 55 55 55 88 88 99
c. 1 2 3 6 7 8 9 10
Mode - Examples
Mode is 1.10
Bimodal - 27 & 55
No Mode
Slide *
Let’s Practice Finding the Mode
Find the mode in the following data sets:
17, 19, 19, 4, 19, 26, 3, 21, and 19
3, 4, 4, 5, 2, 1, 3, 0, 0, 3
0, 0, 2, 1, 1 5, 6, 3, 7, 0, 3, 1
Answers: a) 19; b) 3 c) 0 and 1
Slide *
Definitions
Median
the middle score is not affected by an extreme value
often denoted by x (pronounced ‘x-tilde’)
9. ~
Slide *
Finding the Median Steps
Step 1 – Place the data in an ordered array (from lowest to
highest)
Step 2 – Determine N
Step 3a - If the number of scores (N) is odd, the median is the
number located in the exact middle of the list.
Step 3b - If the number of scores (N) is even, the median is
found by computing the mean of the two middle numbers.
Slide *
10. 5.40 1.10 0.42 0.73 0.48 1.10 0.66
0.42 0.48 0.66 0.73 1.10 1.10 5.40
(in order - odd number of values)
exact middle MEDIAN is 0.73
5.40 1.10 0.42 0.73 0.48 1.10
0.42 0.48 0.73 1.10 1.10 5.40
0.73 + 1.10
2
(in order - even number of values – no exact middle
shared by two numbers)
MEDIAN is 0.915
Slide *
Let’s Practice Finding the Median
Find the median for the following data set:
A) 92, 2, 90, 86, 96B) 12.5, 10.5, 24.5, 26, 15, 21C) 250, 240,
188, 198, 240, 230, 210, 225
Answers: A) 90; B) 18; C) 227.5
Slide *
11. Let’s Try this Exercise
The table shows daily sales in the first 2 weeks of July in
Windsor Mill at the sandwhich shop.
DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
1345July 1485
Slide *
Let’s Try this Exercise
What is the median for the data?
50
55
58
60DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
12. 1345July 1485
Slide *
Answer
What is the median for the data?
50
55
58
60DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
1345July 1485
17. Windsor Mill at the sandwhich shop.
DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
1345July 1485
Slide *
Let’s Try this Exercise
What is the five-number summary for the data?
a) 26, 46, 55, 86, 96
b) 25, 45, 55, 85, 96
c) 26, 46, 56, 85, 96
d) 25, 46, 55, 86, 96
DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
1345July 1485
18. Slide *
Answer
What is the five-number summary for the data?
a) 26, 46, 55, 86, 96
b) 25, 45, 55, 85, 96
c) 26, 46, 56, 85, 96
d) 25, 46, 55, 86, 96
DateSalesJuly 146July 245July 396July 426July 585July
645July 765July 884July 964July 1044July 1185July 1225July
1345July 1485
19. Slide *
Midrange
the value midway between the maximum and minimum
values in the original data set
Definition
Midrange =
maximum value + minimum value
2
20. Slide *
Carry one more decimal place than is present in the original set
of values.
Round-off Rule for
Measures of Center
Slide *
Best Measure of Center
Slide *
Skewness
Slide *
Recap
In this section we have discussed: Types of measures of center
Mean
Median
Mode Best measures of center Skewness
25. Slide *
Definition
The standard deviation of a set of sample values is a measure of
variation of values about the mean.
Slide *
Sample Standard
Deviation Formula
Let’s read this formula aloud. S which stands for standard
deviation is equal to the square root of the sum of each score
minus the mean squared divided by n-1.
- x)2
n - 1
s =
Slide *
26. Standard Deviation -
Important Properties The standard deviation is a measure of
variation of all values from the mean. The value of the
standard deviation s is usually positive. The value of the
standard deviation s can increase dramatically with the
inclusion of one or more outliers (extreme scores). The units
of the standard deviation s are the same as
the units of the original data values.
Slide *
Population Standard
Deviation
2
- µ)
N
This formula is similar to the previous formula, but instead, the
population mean and population size are used.
27. Slide *
Definition The variance of a set of values is a measure of
variation equal to the square of the standard deviation.
Sample variance: Square of the sample standard deviation s
Population variance: Square of the population
standard deviation
Slide *
Variance - Notation
standard deviation squared
s
2
2
}
Notation
Sample variance
Population variance
Slide *
Round-off Rule
for Measures of Variation
28. Carry one more decimal place than is present in the original
set of data.
Round only the final answer, not values in the middle of a
calculation.
Slide *
Let me now show you how to computer s (standard deviation)I
do it very simply…with a chartLet’s consider that we want to
find the standard deviation for the following data set: 5, 4, 2, 3,
6x (each score)x – mean (x-mean) squared
Slide *
Let me now show you how to computer s (standard
deviation)Let’s consider that we want to find the standard
deviation for the following data set: 5, 4, 2, 3, 6Step 1 – Place
each score in the chart and sum.Step 2 – Find the mean.Step 3 –
Identify N
29. Total this rowX (each score)x – mean (x-mean) squared
Slide *
Let me now show you how to computer s (standard
deviation)Let’s consider that we want to find the standard
deviation for the following data set: 5, 4, 2, 3, 6Step 1 – Place
each score in the chart and sum.
Total this rowX (each score)x – mean (x-mean) squared5423620
30. Slide *
Let me now show you how to computer s (standard
deviation)Let’s consider that we want to find the standard
deviation for the following data set: 5, 4, 2, 3, 6Step 2 – Find
the mean
Total this row
20/5 = 4
Mean = 4X (each score)x – mean (x-mean) squared5423620
Slide *
Let me now show you how to computer s (standard
deviation)Let’s consider that we want to find the standard
deviation for the following data set: 5, 4, 2, 3, 6Step 3 –
Identify N
Total this row
Mean =4
N = 5X (each score)x – mean (x-mean) squared5423620
31. Slide *
Let me now show you how to computer s (standard
deviation)Let’s consider that we want to find the standard
deviation for the following data set: 5, 4, 2, 3, 6Step 4 – Now
subtract each score from the mean
Total this row
Mean =4
N = 5X (each score)x – mean (x-mean) squared55-4 = 144-4=
022-4= -233-4= -166-4= 220
32. Slide *
Let me now show you how to computer s (standard
deviation)Let’s consider that we want to find the standard
deviation for the following data set: 5, 4, 2, 3, 6Step 5 – Total
this column. The total should almost always be 0.
Total this row
Mean =4
N = 5X (each score)x – mean (x-mean) squared51402-23-
162200
Slide *
Let me now show you how to computer s (standard
deviation)Let’s consider that we want to find the standard
deviation for the following data set: 5, 4, 2, 3, 6Step 6 – Square
each score.
Total this row
Mean =4
33. N = 5
Squared means the number times itself. On your calculator is
an x squared button.
Do not put negative numbers in your calculator. The last
column will always be positive.X (each score)x – mean (x-
mean) squared511 x 1 = 1400 x 0 = 02-2-2 x -2 = 43-1-1 x -1 =
1622 x 2 = 42000 x 0 = 0
Slide *
Let me now show you how to computer s (standard
deviation)Let’s consider that we want to find the standard
deviation for the following data set: 5, 4, 2, 3, 6Step 7 – Total
this column.
Total this row
Mean =4
N = 5
Squared means the number times itself. On your calculator is
an
This total becomes the numerator in the formula.X (each score)x
– mean (x-mean) squared5114002-243-1162420010
35. s =
Slide *
Sample Standard
Deviation Formula
10
10
Watch how the formula works itself out…
5 - 1
s =
4
s =
36. Slide *
Sample Standard
Deviation Formula
10
2.5
Watch how the formula works itself out…
4
s =
s =
Slide *
Sample Standard
Deviation Formula
2.5
Compute the square root of 2.5
37. s = 1.58
Voila!
s =
Slide *
Sample Standard Deviation FormulaI know that was a lot to take
in at once; however, let’s try one now by yourself. Just take
your time. Standard deviation takes practice to get perfect, but
with a little practice you will get it.
Take your time and check your work. DON’T RUSH!
Slide *
Sample Standard Deviation FormulaFind s for the following
data sets:A) 4, 2, 3B) 9, 6, 7, 2C) 2, 6, 1D) 498, 500, 503, 498,
501
Answers: a) 1; b) 2.94; c) 2.646; d) 2.12
Do not move forward unless you have the answers.
I mean it!
Slide *
Definition
Empirical (68-95-99.7) Rule
38. For data sets having a distribution that is approximately bell
shaped, the following properties apply: About 68% of all values
fall within 1 standard deviation of the mean. About 95% of
all values fall within 2 standard deviations of the mean.
About 99.7% of all values fall within 3 standard deviations of
the mean.
Slide *
The Empirical Rule
Slide *
The Empirical Rule
Slide *
The Empirical Rule
Slide *
39. What does standard deviation mean?Standard deviation is a
measure which talks about how the scores scatter around the
mean.If s is low, the scores, in general, are closer to the mean.If
s is high, the scores, in general, are farther from the mean.
Think of yourself as the mean and you are holding jellybeans in
your clinched fist. You open your hand and drop them at your
feet.Some jellybeans will be right by your feet and a few may
scatter a few feet away. S would be low. In general, the
jellybeans are staying close to you.
Think of yourself again as the mean and you are holding
jellybeans in your clinched fist. This time, you open your hand
and toss the jellybeans in front of you. S will be high. In
general, some jellybeans are near you, but more of them are
away from you.
Slide *
What else does standard deviation mean?There are 3 steps on
the standard deviation: 68%, 95%, and 99%
How many steps? Three
And what are they? 68% 95% 99%
Slide *
Here is standard deviation at workImagine the first statistics
exam. The mean is 80 and the standard deviation for our class
is 5.
Slide *
The Empirical Rule
80
40. mean
Slide *
The Empirical Rule
80 + 5 = 85
s = 5
85
75
80 - 5 = 75
Finding: 68% of the class scored between 75 and 85 on the first
STAT exam.
Slide *
The Empirical Rule
75 - 5 = 70
85 + 5 = 90
70
90
Finding: 95% of the class scored between 70 and 90 on the first
STAT exam.
Slide *
41. 70 - 5 = 65
90 + 5 = 95
65
95
Finding: 99% of the class scored between 65 and 95 on the first
STAT exam.
Slide *
Definition
An outlier is a value that is located very far away from
almost all of the other values.
Slide *
Important Principles
An outlier can have a dramatic effect on the mean. An outlier
can have a dramatic effect on the standard deviation. An
outlier can have a dramatic effect on the scale of the histogram
so that the true nature of the distribution is totally
obscured.
Slide *
48. 27,050
16765
29950
21545
$23,785
26,785
30050
23785
27950
A
25050
29955
B
27550
$27,594
26557
A
27650
25550
A – $9/hr @ 40 hrs week @ 50 weeks a year
B – $11/hr @ 35 hrs wee @ 52 weeks a year
Is any department receiving on average $2k or more on the high
end of the standard deviation using standard deviation at 95%?
If so, which department and be sure to explain your answer.
Problem 2
Find the five number summary for the data for all five
departments. Consider all five departments one department.
Problem 3
Find the Q3 and median for Departments 1 and 3 individually.
49. Which department has a higher Q3? Which department has a
higher median?
Problem 4
Find the Q1 and median for Departments 2 and 4 individually.
Which department has a higher Q1? Which department has a
higher median?