Slide - 1
Business Analytics
Third Edition
Chapter 7
Statistical
Inference
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 2
Statistical Inference
• Statistical inference focuses on drawing conclusions
about populations from samples.
– Statistical inference includes estimation of population
parameters and hypothesis testing, which involves
drawing conclusions about the value of the parameters
of one or more populations.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 3
Hypothesis Testing
• Hypothesis testing involves drawing inferences about two
contrasting propositions (each called a hypothesis) relating to the
value of one or more population parameters.
– 0
:
H Null hypothesis: describes an existing theory
– 1
:
H Alternative hypothesis: the complement of 0
H
• Using sample data, we either:
– reject 0
H and conclude the sample data provides sufficient
evidence to support 1
,
H or
– fail to reject 0
H and conclude the sample data does not support
1
.
H
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 4
Example 7.1: A Legal Analogy for
Hypothesis Testing
• In the U.S. legal system, a defendant is innocent until
proven guilty.
– 0
:
H Innocent
– 1
:
H Guilty
• If evidence (sample data) strongly indicates the
defendant is guilty, then we reject 0
.
H
• Note that we have not proven guilt or innocence!
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 5
Hypothesis Testing Procedure
Steps in conducting a hypothesis test:
1. Identify the population parameter and formulate the
hypotheses to test.
2. Select a level of significance (the risk of drawing an
incorrect conclusion).
3. Determine the decision rule on which to base a
conclusion.
4. Collect data and calculate a test statistic.
5. Apply the decision rule and draw a conclusion.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 6
One-Sample Hypothesis Tests
• Three types of one sample tests:
1. 0
: parameter constant

H
1
: parameter constant
H 
2. 0
: parameter constant
H 
1
: parameter constant
H 
3. 0
: parameter constant
H 
1
: parameter constant
H 
• It is not correct to formulate a null hypothesis using
, ,or .
  
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 7
Determining the Proper Form of
Hypotheses
• Hypothesis testing always assumes that 0
H is true and uses
sample data to determine whether 1
H is more likely to be true.
– Statistically, we cannot “prove” that 0
H is true; we can only
fail to reject it.
• Rejecting the null hypothesis provides strong evidence (in a
statistical sense) that the null hypothesis is not true and that the
alternative hypothesis is true.
• Therefore, what we wish to provide evidence for statistically
should be identified as the alternative hypothesis.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 8
Example 7.2: Formulating a
One-Sample Test of Hypothesis
• CadSoft receives calls for technical support. In the past, the average
response time has been at least 25 minutes. It believes the average
response time can be reduced to less than 25 minutes.
– If the new information system makes a difference, then, data
should be able to confirm that the mean response time is
less than 25 minutes; this defines the alternative hypothesis, 1
.
H
0
: mean response time 25
H 
1
: mean response time 25
H 
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 9
Understanding Potential Errors in
Hypothesis Testing
• Hypothesis testing can result in one of four different
outcomes:
1. 0
H is true and the test correctly fails to reject 0
H
2. 0
H is false and the test correctly rejects 0
H
3. 0
H is true and the test incorrectly rejects 0
H
(called I
Type error)
4. 0
H is false and the test incorrectly fails to reject 0
H
(called II
Type error)
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 10
Terminology
• The probability of making a I
Type error
0 0
( ( |
level of significance) rejecting is true)
P H H

 
– The value of 1 
 is called the confidence coefficient
0 0
( | ),
not rejecting is true
P H H

– The value of  can be controlled. Common values are 0.01, 0.05, or
0.10.
• The probability of making a II
Type error 0 0
( | )
not rejecting is false
P H H

 
– The value of 1 
 is called the power of the test
0 0
( | ).
rejecting is false
P H H

– The value of  cannot be specified in advance and depends on the
value of the (unknown) population parameter.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 11
Example 7.3: How Beta Depends on
the True Population Mean
• In the CadSoft example:
0
: 25
mean response time
H 
1
: 25
mean response time
H 
– If the true mean is 15, then the sample mean will most likely be
less than 25, leading us to reject 0
.
H
– If the true mean is 24, then the sample mean may or may not be
less than 25, and we would have a higher chance of failing to
reject 0
.
H
• The further away the true mean is from the hypothesized value, the
smaller the value of .

• Generally, as  decreases,  increases.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 12
Improving the Power of the Test
• We would like the power of the test to be high (equivalently, we
would like the probability of a II
Type error to be low) to allow
us to make a valid conclusion.
• The power of the test is sensitive to the sample size; small
sample sizes generally result in a low value of 1 .


• The power of the test can be increased by taking larger
samples, which enable us to detect small differences between
the sample statistics and population parameters with more
accuracy.
• If you choose a small level of significance, you should try to
compensate by having a large sample size.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 13
Selecting the Test Statistic
• The decision to reject or fail to reject a null hypothesis is based
on computing a test statistic from the sample data.
• The test statistic used depends on the type of hypothesis test.
– Test statistics for one-sample hypothesis tests for means:
Type of Test Test Statistic
One-sample test for mean, Sigma known
z = start fractionx bar minus musub0over start fractionsigmaover radical n endfractionend fraction.Equation 7.1.
One-sample test for mean, Sigma unknown
t= startfraction x bar minus mu sub0 over startfractions over radical nendfractionendfraction. Equation7.2.
 0
(7.1)
/
x
z
n




 0
(7.2)
/
x
t
s n



Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 14
Example 7.4: Computing the Test
Statistic
• In the CadSoft example, sample data for 44 customers
revealed a mean response time of 21.91 minutes and a
sample standard deviation of 19.49 minutes.
0
21.91 25 3.09
1.05
2.938
/ 19.49 / 44
x
t
s n

  
   
1.05
t  indicates that the sample mean of 21.91 is 1.05
standard errors below the hypothesized mean of 25 minutes.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 15
Drawing a Conclusion
• The conclusion to reject or fail to reject 0
H is based on
comparing the value of the test statistic to a “critical value” from
the sampling distribution of the test statistic when the null
hypothesis is true and the chosen level of significance, .

– The sampling distribution of the test statistic is usually the
normal distribution, t-distribution, or some other well-known
distribution.
• The critical value divides the sampling distribution into two parts,
a rejection region and a non-rejection region. If the test statistic
falls into the rejection region, we reject the null hypothesis;
otherwise, we fail to reject it.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 16
Rejection Regions
0
:parameter constant
H  0
:parameter constant
H  0
:parameter constant
H 
1
:parameter constant
H  1
:parameter constant
H  1
:parameter constant
H 
For a one-tailed test, if 1
H is stated as <, the rejection region is in the
lower tail; if 1
H is stated as >, the rejection region is in the upper tail
(just think of the inequality as an arrow pointing to the proper tail direction).
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 17
Example 7.5: Finding the Critical Value
and Drawing a Conclusion
• In the CadSoft example, use 0.05.
 
– 0
: 25
mean response time
H 
– 1
: 25
mean response time
H 
– 44; 1 43
n df n
   
– 1.05
t 
– Critical value , 1
2
T.INV(1 , 1) T.INV(0.95,43) 1.68
n
t n
 

     
– For a lower-tailed test, use 1.68

– 1.05
t  does not fall in the rejection region.
– Fail to reject 0
H
Even though the sample mean of 21.91 is well below 25, we
have too much sampling error to conclude the that the true
population mean is less than 25 minutes.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 18
Two-Tailed Hypothesis Test for the Mean
• For a two-tailed test using the t-distribution, use the
Excel function T.INV(1 / 2, 1) T.INV.2T( , 1
n n
 
  
or )
to find the critical value, /2, 1
.
n
t 
• Be careful when using T.INV.2T! Use  and not / 2

in this function.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 19
Example 7.6: Conducting a Two-Tailed
Hypothesis Test for the Mean
• Excel file Vacation Survey
• Test whether the average age of respondents is equal to 35.
– 0 : 35
mean age
H 
– 1 : 35
mean age
H 
• n = 34; sample mean = 38.676; sample standard deviation = 7.857.
• Test statistic:
• Critical value T.INV.2T(.05, 33) 2.0345
 
• Reject 0.
H
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 20
p-Values
• A p-value (observed significance level) is the probability
of obtaining a test statistic value equal to or more extreme
than that obtained from the sample data when the null
hypothesis is true.
• An alternative approach to Step 3 of a hypothesis test uses
the p-value rather than the critical value:
Reject 0
H if the p-value 

Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 21
Finding p-Values When Sigma is Known
• For a lower one-tailed test, the p-value is the probability to
the left of the test statistic z in the z-distribution, and is
found using the Excel function:
NORM.S.DIST( , true).
z
• For an upper one-tailed test, the p-value is the probability
to the right of the test statistic z, and is found using the
Excel function:
1 NORM.S.DIST( , true)
z

• For a two-tailed test, the p-value is found using the Excel
function:
2*(1 NORM.S.DIST(ABS( ), true)
z

Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 22
Finding p-Values When Sigma is
Unknown
• For a lower one-tailed test, the p-value is the probability to
the left of the test statistic t in the t-distribution, and is
found using the Excel function:
= T.DIST( , 1, true).
t n 
• For an upper one-tailed test, the p-value is the probability
to the right of the test statistic t, and is found using the
Excel function:
1 T.DIST( , 1, true).
t n
 
• For a two-tailed test, the p-value is found using the Excel
function:
T.DIST.2T(ABS( ), 1).
t n 
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 23
Example 7.7: Using p-Values
• In the CadSoft example, the p-value is the left tail area of
the observed test statistic, 1.05.
t 
– p-value TDIST( 1.05, 43,TRUE) 0.15
  
– Do not reject 0
H because the
• For the Vacation Survey two-tailed hypothesis test in
Example 7.6, the p-value for this test is
– p-value T.DIST.2T(2.73,33) 0.010
 
– Reject 0
H because 0.010 < 0.05
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 24
One-Sample Tests for Proportions
• Test statistic:
 
0
0 0
ˆ
7.3
(1 ) /
p
z
n

 



• 0
 is the hypothesized value and p̂ is the sample
proportion
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 25
Example 7.8: One-Sample Test for the
Proportion (1 of 2)
• CadSoft sampled 44 customers and asked them to rate
the overall quality of a software package. Sample data
revealed that 35 respondents (a proportion of
33
0.795)
44

thought the software was very good or excellent. In the
past, this proportion has averaged about 75%. Is there
sufficient evidence to conclude that this satisfaction
measure has significantly exceeded 75% using a
significance level of 0.05?
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 26
Example 7.8: One-Sample Test for the
Proportion (2 of 2)
• Hypotheses:
– 0
: 0.75
H  
– 1
: 0.75
H  
• Test statistic:
0.795 0.75
0.69
0.75(1 0.75) / 44
z

 

• Critical value NORM.S.INV(0.95) 1.645
 
• p-value 1 NORM.S.DIST(0.69,true) 0.25
  
• Do not reject 0
.
H
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 27
Confidence Intervals and Hypothesis
Tests - Two-Tailed Tests
• Suppose we construct a 95% confidence interval for the
mean. If we wish to test the hypotheses
0 0
:
H  

1 0
:
H  

at a 5% level of significance, we simply check whether the
hypothesized value 0
 falls within the confidence interval.
If it does not, then we reject 0
;
H if it does, then we cannot
reject 0
.
H
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 28
Confidence Intervals and Hypothesis
Tests - One-Tailed Tests
• For a lower-tailed test, if the confidence interval falls
entirely below the hypothesized value, we reject the null
hypothesis.
• For an upper-tailed test, if the confidence interval falls
entirely above the hypothesized value, we also reject the
null hypothesis.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 29
Excel Templates
• The Excel file One Sample Hypothesis Tests provides
template worksheets for conducting hypothesis tests for
means and proportions. One example:
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 30
Two-Sample Hypothesis Tests (1 of 3)
• Lower-tailed test
– 0 0
: (1) (2)
population parameter population parameter
H D
 
– 1 0
: (1) (2)
population parameter population parameter
H D
 
• This test seeks evidence that the difference between
population parameter (1) and population parameter (2) is
less than some value, 0
.
D
• When 0
0,
D  the test simply seeks to conclude whether
population parameter (1) is smaller than population
parameter (2).
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 31
Two-Sample Hypothesis Tests (2 of 3)
• Upper-tailed test
– 0 0
: (1) (2)
population parameter population parameter
H D
 
– 1 0
: (1) (2)
population parameter population parameter
H D
 
• This test seeks evidence that the difference between
population parameter (1) and population parameter (2)
is greater than some value, 0
.
D
• When 0
0,
D  the test simply seeks to conclude whether
population parameter (1) is larger than population
parameter (2).
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 32
Two-Sample Hypothesis Tests (3 of 3)
• Two-tailed test
– 0 0
: (1) (2)
population parameter population parameter
H D
 
– 1 0
: (1) (2)
population parameter population parameter
H D
 
• This test seeks evidence that the difference between the
population parameters is equal to 0
.
D
• When 0
0,
D  we are seeking evidence that population
parameter (1) differs from population parameter (2).
– In most applications, 0
0,
D  and we are simply seeking
to compare the population parameters.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 33
Excel Analysis Toolpak Procedures for
Two-Sample Hypothesis Tests
Type of Test Excel Procedure
Two-sample test for means, Sigma squared. known Excel z-test: Two-sample for means
Two-sample test for means, Sigma squared.
unknown, assumed unequal
Excel t-test: Two-sample assuming
unequal variances
Two-sample test for means, Sigma squared.
unknown, assumed equal
Excel t-test: Two-sample assuming
equal variances
Paired two-sample test for means Excel t-test: Paired two-sample test for
means
Two-sample test for equality of
variances
Excel F-test Two-sample for variances
2

2

2

Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 34
Two-Sample Tests for Difference in
Means
• Forms of the hypothesis test:
0 1 2
: { , , }0
or
H  
  
 
1 1 2
: { , , }0 7.4
or
H  
   
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 35
Example 7.9: Comparing Supplier
Performance
• Purchase Orders database
• Determine if the mean lead time for Alum Sheeting  
1

is greater than the mean lead time for Durrable Products
 
2
.

0 1 2
: 0
H  
 
1 1 2
: 0
H  
 
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 36
Selecting the Proper Excel Procedure
• Population variances are known:
– z-Test: Two-Sample for Means
• Population variances are unknown and assumed unequal:
– t-Test: Two-Sample Assuming Unequal Variances
• Population variances are unknown but assumed equal:
– t-Test: Two-Sample Assuming Equal Variances
• These tools calculate the test statistic, the p-value for both
a one-tail and two-tail test, and the critical values for one-
tail and two-tail tests.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 37
Interpreting Excel Output
• If the test statistic is negative, the one-tailed p-value is
the correct p-value for a lower-tail test; however, for an
upper-tail test, you must subtract this number from 1.0 to
get the correct p-value.
• If the test statistic is nonnegative (positive or zero), then
the p-value in the output is the correct p-value for an
upper-tail test; but for a lower-tail test, you must subtract
this number from 1.0 to get the correct p-value.
• For a lower-tail test, you must change the sign of the
one-tailed critical value.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 38
Example 7.10: Testing the Hypotheses for
Supplier Lead-Time Performance (1 of 2)
• t-Test: Two-Sample Assuming Unequal Variances
– Variable 1 Range: Alum Sheeting data
– Variable 2 Range: Durrable Products data
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 39
Example 7.10: Testing the Hypotheses for
Supplier Lead-Time Performance (2 of 2)
• Results
– Rule 2: If the test statistic is nonnegative
(positive or zero), then the p-value in the output
is the correct p-value for an upper-tail test.
t = 3.83
Critical value = 1.81
p-value = 0.00166
Reject 0
.
H
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 40
Two-Sample Test for Means with Paired
Samples
• In many situations, data from two samples are naturally
paired or matched.
• When paired samples are used, a paired t-test is more
accurate than assuming that the data come from
independent populations.
• Hypotheses ( D
 is the mean difference between the
paired samples):
0
: { , , }0
or
D
H   
1
: { , , }0
or
D
H    
• Excel Data Analysis tool: t-Test: Paired Two-Sample for
Means
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 41
Example 7.11: Using the Paired
Two-Sample Test for Means (1 of 2)
• Excel file Pile Foundation
– Test for a difference in the means of the estimated and
actual pile lengths (two-tailed test).
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 42
Example 7.11: Using the Paired
Two-Sample Test for Means (2 of 2)
• Results:
• t = 10.91

• t is smaller than the
lower critical value
• p-value 0

• Reject the null
hypothesis.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 43
Two-Sample Test for Equality of
Variances
• Test for equality of variances between two samples using a
new type of test, the F-test.
– To use this test, we must assume that both samples are
drawn from normal populations.
• Hypotheses: 2 2
0 1 2
: 0
H  
 
 
2 2
1 1 2
: 0 7.5
H  
 
• F-test statistic:
 
2
1
2
2
7.6
s
F
s

• Excel tool: F-test for Equality of Variances
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 44
F-Distribution
• The F-distribution has two degrees of freedom, one associated with
the numerator of the F-statistic, 1
1,
n  and one associated with the
denominator of the F-statistic, 2
1.
n 
• Table 4, Appendix A provides only upper-tail critical values, and the
distribution is not symmetric.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 45
Conducting the F-Test
• Although the hypothesis test is really a two-tailed test, we will simplify it as an
upper-tailed, one-tailed test to make it easy to use tables of the F-distribution
and interpret the results of the Excel tool.
– We do this by ensuring that when we compute F, we take the ratio of the
larger sample variance to the smaller sample variance.
• Find the critical value , 1, 2
2
df df
F of the F-distribution, and then we reject the
null hypothesis if the F-test statistic exceeds the critical value.
• Note that we are using
2

to find the critical value, not .

This is because we are using only the upper tail information on which to base
our conclusion.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 46
Example 7.12: Applying the F-Test for
Equality of Variances
• Determine whether the variance of lead times is the same
for Alum Sheeting and Durrable Products in the Purchase
Orders data.
– The variance of the lead times for Alum Sheeting is
larger than the variance for Durable Products, so this is
assigned to Variable 1.
F = 3.47
Critical value = 3.607
P-value = 0.029
Reject 0
H
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 47
Analysis of Variance (ANOVA)
• Used to compare the means of two or more population
groups.
0 1 2
: m
H   
  

1
:
H at least one mean is different from the others
• ANOVA derives its name from the fact that we are
analyzing variances in the data.
• ANOVA measures variation between groups relative to
variation within groups.
• Each of the population groups is assumed to come from a
normally distributed population.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 48
Example 7.13: Difference in Insurance
Survey Data
• Determine whether
any significant
differences exist in
satisfaction among
individuals with
different levels of
education.
Blank
College Graduate Graduate Degree Some College
Blank
5 3 4
Blank
3 4 1
Blank
5 5 4
Blank
3 5 2
Blank
3 5 3
Blank
3 4 4
Blank
3 5 4
Blank
4 5
Blank
Blank
2
Blank Blank
Average 3.444 4.500 3.143
Count 9 8 7
• The variable of interest is called a factor. In this example, the factor is the
educational level, and we have three categorical levels of this factor, college
graduate, graduate degree, and some college.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 49
Example 7.14: Applying the Excel
ANOVA Tool (1 of 2)
• Data Analysis tool: ANOVA: Single Factor
– The input range of the data must be in contiguous
columns
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 50
Example 7.14: Applying the Excel
ANOVA Tool (2 of 2)
• Results
– F = 3.92
–
– crit
F F

– p-value = 0.0356
– Reject 0
.
H
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 51
Assumptions of ANOVA
• The m groups or factor levels being studied represent
populations whose outcome measures
1. are randomly and independently obtained,
2. are normally distributed, and
3. have equal variances.
• If these assumptions are violated, then the level of
significance and the power of the test can be affected.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 52
Chi-Square Test for Independence
• Test for independence of two categorical variables.
– 0
:
H two categorical variables are independent
– 1
:
H two categorical variables are dependent
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 53
Example 7.15: Independence and
Marketing Strategy
• Energy Drink Survey data. A key marketing question is whether the
proportion of males who prefer a particular brand is no different from the
proportion of females.
– If gender and brand preference are indeed independent, we would
expect that about the same proportion of the sample of female students
would also prefer brand 1.
– If they are not independent, then advertising should be targeted
differently to males and females, whereas if they are independent, it
would not matter.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 54
Chi-Square Test Calculations (1 of 3)
• Step 1: Using a cross-tabulation of the data, compute the
expected frequency if the two variables are independent.
  
 
expected frequency in row grand total row grand total column
= 7.7
and column total number of observations
i j
i j
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 55
Example 7.16: Computing Expected
Frequencies
  
 
expected frequency in row grand total row grand total column
= 7.7
and column total number of observations
i j
i j
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 56
Chi-Square Test Calculations (2 of 3)
• Step 2: Compute a test statistic, called a chi-square
statistic, which is the sum of the squares of the
differences between observed frequency, 0
,
f and
expected frequency, ,
e
f divided by the expected
frequency in each cell:
 
2
2 0
( )
7.8
e
e
f f
f



Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 57
Chi-Square Distribution
• The sampling distribution of
2
 is a special distribution
called the chi-square distribution.
– The chi-square distribution is characterized by
degrees of freedom.
– Table 3 in Appendix A
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 58
Chi-Square Test Calculations (3 of 3)
• Step 3: Compare the chi-square statistic for the level of significance 
to the critical value from a chi-square distribution with   
1 1
r c
 
degrees of freedom, where r and c are the number of rows and
columns in the cross-tabulation table, respectively.
– The Excel function CHISQ.INV.RT( )
probability,deg_freedom
returns the value of
2
 that has a right-tail area equal to
probability for a specified degree of freedom.
– By setting probability equal to the level of significance, we can
obtain the critical value for the hypothesis test.
– The Excel function CHISQ.TEST( )
actual_range,expected_range
computes the p-value for the chi-square test.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 59
Example 7.17: Conducting the
Chi-Square Test
• Test statistic = 6.49
• . (2 1)(3 1) 2
d f    
• Critical value =
CHISQ.INV.RT(0.05,2) 5.99146

• p-value =
• Reject 0
H
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 60
Cautions in Using the Chi-Square Test
• The chi-square test assumes adequate expected cell
frequencies.
– A rule of thumb is that there be no more than 20% of
cells with expected frequencies smaller than 5, and no
expected frequencies of zero.
• Consider aggregating some of the rows or columns in a
logical fashion to enforce this assumption.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 61
Example 7.18: Violations of Chi-Square
Assumptions (1 of 2)
• A survey of 100 students at a university queried their
beverage preferences at a local coffee shop.
Of the 16 cells, five, or over 30%, have frequencies smaller
than 5. Four of them are in the Cappuccino, Latte, and
Mocha columns; these can be aggregated into one column
called Hot Specialty beverages.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 62
Example 7.18: Violations of Chi-Square
Assumptions (2 of 2)
Now only 2 of 12 cells have an expected frequency less than
5; this now meets the assumptions of the chi-square test.
Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 63
Copyright
This work is protected by United States copyright laws and is
provided solely for the use of instructors in teaching their
courses and assessing student learning. Dissemination or sale of
any part of this work (including on the World Wide Web) will
destroy the integrity of the work and is not permitted. The work
and materials from it should never be made available to students
except by instructors using the accompanying text in their
classes. All recipients of this work are expected to abide by these
restrictions and to honor the intended pedagogical purposes and
the needs of other instructors who rely on these materials.

Evans_Analytics3e_ppt_07_Accessible.pptx

  • 1.
    Slide - 1 BusinessAnalytics Third Edition Chapter 7 Statistical Inference Copyright © 2020, 2016, 2013 Pearson Education, Inc. All Rights Reserved
  • 2.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 2 Statistical Inference • Statistical inference focuses on drawing conclusions about populations from samples. – Statistical inference includes estimation of population parameters and hypothesis testing, which involves drawing conclusions about the value of the parameters of one or more populations.
  • 3.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 3 Hypothesis Testing • Hypothesis testing involves drawing inferences about two contrasting propositions (each called a hypothesis) relating to the value of one or more population parameters. – 0 : H Null hypothesis: describes an existing theory – 1 : H Alternative hypothesis: the complement of 0 H • Using sample data, we either: – reject 0 H and conclude the sample data provides sufficient evidence to support 1 , H or – fail to reject 0 H and conclude the sample data does not support 1 . H
  • 4.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 4 Example 7.1: A Legal Analogy for Hypothesis Testing • In the U.S. legal system, a defendant is innocent until proven guilty. – 0 : H Innocent – 1 : H Guilty • If evidence (sample data) strongly indicates the defendant is guilty, then we reject 0 . H • Note that we have not proven guilt or innocence!
  • 5.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 5 Hypothesis Testing Procedure Steps in conducting a hypothesis test: 1. Identify the population parameter and formulate the hypotheses to test. 2. Select a level of significance (the risk of drawing an incorrect conclusion). 3. Determine the decision rule on which to base a conclusion. 4. Collect data and calculate a test statistic. 5. Apply the decision rule and draw a conclusion.
  • 6.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 6 One-Sample Hypothesis Tests • Three types of one sample tests: 1. 0 : parameter constant  H 1 : parameter constant H  2. 0 : parameter constant H  1 : parameter constant H  3. 0 : parameter constant H  1 : parameter constant H  • It is not correct to formulate a null hypothesis using , ,or .   
  • 7.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 7 Determining the Proper Form of Hypotheses • Hypothesis testing always assumes that 0 H is true and uses sample data to determine whether 1 H is more likely to be true. – Statistically, we cannot “prove” that 0 H is true; we can only fail to reject it. • Rejecting the null hypothesis provides strong evidence (in a statistical sense) that the null hypothesis is not true and that the alternative hypothesis is true. • Therefore, what we wish to provide evidence for statistically should be identified as the alternative hypothesis.
  • 8.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 8 Example 7.2: Formulating a One-Sample Test of Hypothesis • CadSoft receives calls for technical support. In the past, the average response time has been at least 25 minutes. It believes the average response time can be reduced to less than 25 minutes. – If the new information system makes a difference, then, data should be able to confirm that the mean response time is less than 25 minutes; this defines the alternative hypothesis, 1 . H 0 : mean response time 25 H  1 : mean response time 25 H 
  • 9.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 9 Understanding Potential Errors in Hypothesis Testing • Hypothesis testing can result in one of four different outcomes: 1. 0 H is true and the test correctly fails to reject 0 H 2. 0 H is false and the test correctly rejects 0 H 3. 0 H is true and the test incorrectly rejects 0 H (called I Type error) 4. 0 H is false and the test incorrectly fails to reject 0 H (called II Type error)
  • 10.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 10 Terminology • The probability of making a I Type error 0 0 ( ( | level of significance) rejecting is true) P H H    – The value of 1   is called the confidence coefficient 0 0 ( | ), not rejecting is true P H H  – The value of  can be controlled. Common values are 0.01, 0.05, or 0.10. • The probability of making a II Type error 0 0 ( | ) not rejecting is false P H H    – The value of 1   is called the power of the test 0 0 ( | ). rejecting is false P H H  – The value of  cannot be specified in advance and depends on the value of the (unknown) population parameter.
  • 11.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 11 Example 7.3: How Beta Depends on the True Population Mean • In the CadSoft example: 0 : 25 mean response time H  1 : 25 mean response time H  – If the true mean is 15, then the sample mean will most likely be less than 25, leading us to reject 0 . H – If the true mean is 24, then the sample mean may or may not be less than 25, and we would have a higher chance of failing to reject 0 . H • The further away the true mean is from the hypothesized value, the smaller the value of .  • Generally, as  decreases,  increases.
  • 12.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 12 Improving the Power of the Test • We would like the power of the test to be high (equivalently, we would like the probability of a II Type error to be low) to allow us to make a valid conclusion. • The power of the test is sensitive to the sample size; small sample sizes generally result in a low value of 1 .   • The power of the test can be increased by taking larger samples, which enable us to detect small differences between the sample statistics and population parameters with more accuracy. • If you choose a small level of significance, you should try to compensate by having a large sample size.
  • 13.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 13 Selecting the Test Statistic • The decision to reject or fail to reject a null hypothesis is based on computing a test statistic from the sample data. • The test statistic used depends on the type of hypothesis test. – Test statistics for one-sample hypothesis tests for means: Type of Test Test Statistic One-sample test for mean, Sigma known z = start fractionx bar minus musub0over start fractionsigmaover radical n endfractionend fraction.Equation 7.1. One-sample test for mean, Sigma unknown t= startfraction x bar minus mu sub0 over startfractions over radical nendfractionendfraction. Equation7.2.  0 (7.1) / x z n      0 (7.2) / x t s n   
  • 14.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 14 Example 7.4: Computing the Test Statistic • In the CadSoft example, sample data for 44 customers revealed a mean response time of 21.91 minutes and a sample standard deviation of 19.49 minutes. 0 21.91 25 3.09 1.05 2.938 / 19.49 / 44 x t s n         1.05 t  indicates that the sample mean of 21.91 is 1.05 standard errors below the hypothesized mean of 25 minutes.
  • 15.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 15 Drawing a Conclusion • The conclusion to reject or fail to reject 0 H is based on comparing the value of the test statistic to a “critical value” from the sampling distribution of the test statistic when the null hypothesis is true and the chosen level of significance, .  – The sampling distribution of the test statistic is usually the normal distribution, t-distribution, or some other well-known distribution. • The critical value divides the sampling distribution into two parts, a rejection region and a non-rejection region. If the test statistic falls into the rejection region, we reject the null hypothesis; otherwise, we fail to reject it.
  • 16.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 16 Rejection Regions 0 :parameter constant H  0 :parameter constant H  0 :parameter constant H  1 :parameter constant H  1 :parameter constant H  1 :parameter constant H  For a one-tailed test, if 1 H is stated as <, the rejection region is in the lower tail; if 1 H is stated as >, the rejection region is in the upper tail (just think of the inequality as an arrow pointing to the proper tail direction).
  • 17.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 17 Example 7.5: Finding the Critical Value and Drawing a Conclusion • In the CadSoft example, use 0.05.   – 0 : 25 mean response time H  – 1 : 25 mean response time H  – 44; 1 43 n df n     – 1.05 t  – Critical value , 1 2 T.INV(1 , 1) T.INV(0.95,43) 1.68 n t n          – For a lower-tailed test, use 1.68  – 1.05 t  does not fall in the rejection region. – Fail to reject 0 H Even though the sample mean of 21.91 is well below 25, we have too much sampling error to conclude the that the true population mean is less than 25 minutes.
  • 18.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 18 Two-Tailed Hypothesis Test for the Mean • For a two-tailed test using the t-distribution, use the Excel function T.INV(1 / 2, 1) T.INV.2T( , 1 n n      or ) to find the critical value, /2, 1 . n t  • Be careful when using T.INV.2T! Use  and not / 2  in this function.
  • 19.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 19 Example 7.6: Conducting a Two-Tailed Hypothesis Test for the Mean • Excel file Vacation Survey • Test whether the average age of respondents is equal to 35. – 0 : 35 mean age H  – 1 : 35 mean age H  • n = 34; sample mean = 38.676; sample standard deviation = 7.857. • Test statistic: • Critical value T.INV.2T(.05, 33) 2.0345   • Reject 0. H
  • 20.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 20 p-Values • A p-value (observed significance level) is the probability of obtaining a test statistic value equal to or more extreme than that obtained from the sample data when the null hypothesis is true. • An alternative approach to Step 3 of a hypothesis test uses the p-value rather than the critical value: Reject 0 H if the p-value  
  • 21.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 21 Finding p-Values When Sigma is Known • For a lower one-tailed test, the p-value is the probability to the left of the test statistic z in the z-distribution, and is found using the Excel function: NORM.S.DIST( , true). z • For an upper one-tailed test, the p-value is the probability to the right of the test statistic z, and is found using the Excel function: 1 NORM.S.DIST( , true) z  • For a two-tailed test, the p-value is found using the Excel function: 2*(1 NORM.S.DIST(ABS( ), true) z 
  • 22.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 22 Finding p-Values When Sigma is Unknown • For a lower one-tailed test, the p-value is the probability to the left of the test statistic t in the t-distribution, and is found using the Excel function: = T.DIST( , 1, true). t n  • For an upper one-tailed test, the p-value is the probability to the right of the test statistic t, and is found using the Excel function: 1 T.DIST( , 1, true). t n   • For a two-tailed test, the p-value is found using the Excel function: T.DIST.2T(ABS( ), 1). t n 
  • 23.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 23 Example 7.7: Using p-Values • In the CadSoft example, the p-value is the left tail area of the observed test statistic, 1.05. t  – p-value TDIST( 1.05, 43,TRUE) 0.15    – Do not reject 0 H because the • For the Vacation Survey two-tailed hypothesis test in Example 7.6, the p-value for this test is – p-value T.DIST.2T(2.73,33) 0.010   – Reject 0 H because 0.010 < 0.05
  • 24.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 24 One-Sample Tests for Proportions • Test statistic:   0 0 0 ˆ 7.3 (1 ) / p z n       • 0  is the hypothesized value and p̂ is the sample proportion
  • 25.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 25 Example 7.8: One-Sample Test for the Proportion (1 of 2) • CadSoft sampled 44 customers and asked them to rate the overall quality of a software package. Sample data revealed that 35 respondents (a proportion of 33 0.795) 44  thought the software was very good or excellent. In the past, this proportion has averaged about 75%. Is there sufficient evidence to conclude that this satisfaction measure has significantly exceeded 75% using a significance level of 0.05?
  • 26.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 26 Example 7.8: One-Sample Test for the Proportion (2 of 2) • Hypotheses: – 0 : 0.75 H   – 1 : 0.75 H   • Test statistic: 0.795 0.75 0.69 0.75(1 0.75) / 44 z     • Critical value NORM.S.INV(0.95) 1.645   • p-value 1 NORM.S.DIST(0.69,true) 0.25    • Do not reject 0 . H
  • 27.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 27 Confidence Intervals and Hypothesis Tests - Two-Tailed Tests • Suppose we construct a 95% confidence interval for the mean. If we wish to test the hypotheses 0 0 : H    1 0 : H    at a 5% level of significance, we simply check whether the hypothesized value 0  falls within the confidence interval. If it does not, then we reject 0 ; H if it does, then we cannot reject 0 . H
  • 28.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 28 Confidence Intervals and Hypothesis Tests - One-Tailed Tests • For a lower-tailed test, if the confidence interval falls entirely below the hypothesized value, we reject the null hypothesis. • For an upper-tailed test, if the confidence interval falls entirely above the hypothesized value, we also reject the null hypothesis.
  • 29.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 29 Excel Templates • The Excel file One Sample Hypothesis Tests provides template worksheets for conducting hypothesis tests for means and proportions. One example:
  • 30.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 30 Two-Sample Hypothesis Tests (1 of 3) • Lower-tailed test – 0 0 : (1) (2) population parameter population parameter H D   – 1 0 : (1) (2) population parameter population parameter H D   • This test seeks evidence that the difference between population parameter (1) and population parameter (2) is less than some value, 0 . D • When 0 0, D  the test simply seeks to conclude whether population parameter (1) is smaller than population parameter (2).
  • 31.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 31 Two-Sample Hypothesis Tests (2 of 3) • Upper-tailed test – 0 0 : (1) (2) population parameter population parameter H D   – 1 0 : (1) (2) population parameter population parameter H D   • This test seeks evidence that the difference between population parameter (1) and population parameter (2) is greater than some value, 0 . D • When 0 0, D  the test simply seeks to conclude whether population parameter (1) is larger than population parameter (2).
  • 32.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 32 Two-Sample Hypothesis Tests (3 of 3) • Two-tailed test – 0 0 : (1) (2) population parameter population parameter H D   – 1 0 : (1) (2) population parameter population parameter H D   • This test seeks evidence that the difference between the population parameters is equal to 0 . D • When 0 0, D  we are seeking evidence that population parameter (1) differs from population parameter (2). – In most applications, 0 0, D  and we are simply seeking to compare the population parameters.
  • 33.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 33 Excel Analysis Toolpak Procedures for Two-Sample Hypothesis Tests Type of Test Excel Procedure Two-sample test for means, Sigma squared. known Excel z-test: Two-sample for means Two-sample test for means, Sigma squared. unknown, assumed unequal Excel t-test: Two-sample assuming unequal variances Two-sample test for means, Sigma squared. unknown, assumed equal Excel t-test: Two-sample assuming equal variances Paired two-sample test for means Excel t-test: Paired two-sample test for means Two-sample test for equality of variances Excel F-test Two-sample for variances 2  2  2 
  • 34.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 34 Two-Sample Tests for Difference in Means • Forms of the hypothesis test: 0 1 2 : { , , }0 or H        1 1 2 : { , , }0 7.4 or H      
  • 35.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 35 Example 7.9: Comparing Supplier Performance • Purchase Orders database • Determine if the mean lead time for Alum Sheeting   1  is greater than the mean lead time for Durrable Products   2 .  0 1 2 : 0 H     1 1 2 : 0 H    
  • 36.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 36 Selecting the Proper Excel Procedure • Population variances are known: – z-Test: Two-Sample for Means • Population variances are unknown and assumed unequal: – t-Test: Two-Sample Assuming Unequal Variances • Population variances are unknown but assumed equal: – t-Test: Two-Sample Assuming Equal Variances • These tools calculate the test statistic, the p-value for both a one-tail and two-tail test, and the critical values for one- tail and two-tail tests.
  • 37.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 37 Interpreting Excel Output • If the test statistic is negative, the one-tailed p-value is the correct p-value for a lower-tail test; however, for an upper-tail test, you must subtract this number from 1.0 to get the correct p-value. • If the test statistic is nonnegative (positive or zero), then the p-value in the output is the correct p-value for an upper-tail test; but for a lower-tail test, you must subtract this number from 1.0 to get the correct p-value. • For a lower-tail test, you must change the sign of the one-tailed critical value.
  • 38.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 38 Example 7.10: Testing the Hypotheses for Supplier Lead-Time Performance (1 of 2) • t-Test: Two-Sample Assuming Unequal Variances – Variable 1 Range: Alum Sheeting data – Variable 2 Range: Durrable Products data
  • 39.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 39 Example 7.10: Testing the Hypotheses for Supplier Lead-Time Performance (2 of 2) • Results – Rule 2: If the test statistic is nonnegative (positive or zero), then the p-value in the output is the correct p-value for an upper-tail test. t = 3.83 Critical value = 1.81 p-value = 0.00166 Reject 0 . H
  • 40.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 40 Two-Sample Test for Means with Paired Samples • In many situations, data from two samples are naturally paired or matched. • When paired samples are used, a paired t-test is more accurate than assuming that the data come from independent populations. • Hypotheses ( D  is the mean difference between the paired samples): 0 : { , , }0 or D H    1 : { , , }0 or D H     • Excel Data Analysis tool: t-Test: Paired Two-Sample for Means
  • 41.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 41 Example 7.11: Using the Paired Two-Sample Test for Means (1 of 2) • Excel file Pile Foundation – Test for a difference in the means of the estimated and actual pile lengths (two-tailed test).
  • 42.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 42 Example 7.11: Using the Paired Two-Sample Test for Means (2 of 2) • Results: • t = 10.91  • t is smaller than the lower critical value • p-value 0  • Reject the null hypothesis.
  • 43.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 43 Two-Sample Test for Equality of Variances • Test for equality of variances between two samples using a new type of test, the F-test. – To use this test, we must assume that both samples are drawn from normal populations. • Hypotheses: 2 2 0 1 2 : 0 H       2 2 1 1 2 : 0 7.5 H     • F-test statistic:   2 1 2 2 7.6 s F s  • Excel tool: F-test for Equality of Variances
  • 44.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 44 F-Distribution • The F-distribution has two degrees of freedom, one associated with the numerator of the F-statistic, 1 1, n  and one associated with the denominator of the F-statistic, 2 1. n  • Table 4, Appendix A provides only upper-tail critical values, and the distribution is not symmetric.
  • 45.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 45 Conducting the F-Test • Although the hypothesis test is really a two-tailed test, we will simplify it as an upper-tailed, one-tailed test to make it easy to use tables of the F-distribution and interpret the results of the Excel tool. – We do this by ensuring that when we compute F, we take the ratio of the larger sample variance to the smaller sample variance. • Find the critical value , 1, 2 2 df df F of the F-distribution, and then we reject the null hypothesis if the F-test statistic exceeds the critical value. • Note that we are using 2  to find the critical value, not .  This is because we are using only the upper tail information on which to base our conclusion.
  • 46.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 46 Example 7.12: Applying the F-Test for Equality of Variances • Determine whether the variance of lead times is the same for Alum Sheeting and Durrable Products in the Purchase Orders data. – The variance of the lead times for Alum Sheeting is larger than the variance for Durable Products, so this is assigned to Variable 1. F = 3.47 Critical value = 3.607 P-value = 0.029 Reject 0 H
  • 47.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 47 Analysis of Variance (ANOVA) • Used to compare the means of two or more population groups. 0 1 2 : m H        1 : H at least one mean is different from the others • ANOVA derives its name from the fact that we are analyzing variances in the data. • ANOVA measures variation between groups relative to variation within groups. • Each of the population groups is assumed to come from a normally distributed population.
  • 48.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 48 Example 7.13: Difference in Insurance Survey Data • Determine whether any significant differences exist in satisfaction among individuals with different levels of education. Blank College Graduate Graduate Degree Some College Blank 5 3 4 Blank 3 4 1 Blank 5 5 4 Blank 3 5 2 Blank 3 5 3 Blank 3 4 4 Blank 3 5 4 Blank 4 5 Blank Blank 2 Blank Blank Average 3.444 4.500 3.143 Count 9 8 7 • The variable of interest is called a factor. In this example, the factor is the educational level, and we have three categorical levels of this factor, college graduate, graduate degree, and some college.
  • 49.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 49 Example 7.14: Applying the Excel ANOVA Tool (1 of 2) • Data Analysis tool: ANOVA: Single Factor – The input range of the data must be in contiguous columns
  • 50.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 50 Example 7.14: Applying the Excel ANOVA Tool (2 of 2) • Results – F = 3.92 – – crit F F  – p-value = 0.0356 – Reject 0 . H
  • 51.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 51 Assumptions of ANOVA • The m groups or factor levels being studied represent populations whose outcome measures 1. are randomly and independently obtained, 2. are normally distributed, and 3. have equal variances. • If these assumptions are violated, then the level of significance and the power of the test can be affected.
  • 52.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 52 Chi-Square Test for Independence • Test for independence of two categorical variables. – 0 : H two categorical variables are independent – 1 : H two categorical variables are dependent
  • 53.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 53 Example 7.15: Independence and Marketing Strategy • Energy Drink Survey data. A key marketing question is whether the proportion of males who prefer a particular brand is no different from the proportion of females. – If gender and brand preference are indeed independent, we would expect that about the same proportion of the sample of female students would also prefer brand 1. – If they are not independent, then advertising should be targeted differently to males and females, whereas if they are independent, it would not matter.
  • 54.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 54 Chi-Square Test Calculations (1 of 3) • Step 1: Using a cross-tabulation of the data, compute the expected frequency if the two variables are independent.      expected frequency in row grand total row grand total column = 7.7 and column total number of observations i j i j
  • 55.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 55 Example 7.16: Computing Expected Frequencies      expected frequency in row grand total row grand total column = 7.7 and column total number of observations i j i j
  • 56.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 56 Chi-Square Test Calculations (2 of 3) • Step 2: Compute a test statistic, called a chi-square statistic, which is the sum of the squares of the differences between observed frequency, 0 , f and expected frequency, , e f divided by the expected frequency in each cell:   2 2 0 ( ) 7.8 e e f f f   
  • 57.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 57 Chi-Square Distribution • The sampling distribution of 2  is a special distribution called the chi-square distribution. – The chi-square distribution is characterized by degrees of freedom. – Table 3 in Appendix A
  • 58.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 58 Chi-Square Test Calculations (3 of 3) • Step 3: Compare the chi-square statistic for the level of significance  to the critical value from a chi-square distribution with    1 1 r c   degrees of freedom, where r and c are the number of rows and columns in the cross-tabulation table, respectively. – The Excel function CHISQ.INV.RT( ) probability,deg_freedom returns the value of 2  that has a right-tail area equal to probability for a specified degree of freedom. – By setting probability equal to the level of significance, we can obtain the critical value for the hypothesis test. – The Excel function CHISQ.TEST( ) actual_range,expected_range computes the p-value for the chi-square test.
  • 59.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 59 Example 7.17: Conducting the Chi-Square Test • Test statistic = 6.49 • . (2 1)(3 1) 2 d f     • Critical value = CHISQ.INV.RT(0.05,2) 5.99146  • p-value = • Reject 0 H
  • 60.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 60 Cautions in Using the Chi-Square Test • The chi-square test assumes adequate expected cell frequencies. – A rule of thumb is that there be no more than 20% of cells with expected frequencies smaller than 5, and no expected frequencies of zero. • Consider aggregating some of the rows or columns in a logical fashion to enforce this assumption.
  • 61.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 61 Example 7.18: Violations of Chi-Square Assumptions (1 of 2) • A survey of 100 students at a university queried their beverage preferences at a local coffee shop. Of the 16 cells, five, or over 30%, have frequencies smaller than 5. Four of them are in the Cappuccino, Latte, and Mocha columns; these can be aggregated into one column called Hot Specialty beverages.
  • 62.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 62 Example 7.18: Violations of Chi-Square Assumptions (2 of 2) Now only 2 of 12 cells have an expected frequency less than 5; this now meets the assumptions of the chi-square test.
  • 63.
    Copyright © 2020,2016, 2013 Pearson Education, Inc. All Rights Reserved Slide - 63 Copyright This work is protected by United States copyright laws and is provided solely for the use of instructors in teaching their courses and assessing student learning. Dissemination or sale of any part of this work (including on the World Wide Web) will destroy the integrity of the work and is not permitted. The work and materials from it should never be made available to students except by instructors using the accompanying text in their classes. All recipients of this work are expected to abide by these restrictions and to honor the intended pedagogical purposes and the needs of other instructors who rely on these materials.

Editor's Notes

  • #1 If this PowerPoint presentation contains mathematical equations, you may need to check that your computer has the following installed: 1) MathType Plugin 2) Math Player (free versions available) 3) NVDA Reader (free versions available)
  • #8 The table 2 columns and 7 rows. The column headings from left to right are, customer and time in minutes. The row entries are as follows. Row 1. 1, 20. Row 2. 2, 12. Row 3. 3, 15. Row 4. 4, 11. Row 5. 5, 22. Row 6. 6, 6. Row 7. 7, 39.
  • #16 A vertical line is drawn below the normal curve on the left. This line is labeled, critical value. The region below the curve, to the left of this line is shaded and labeled, rejection region, alpha. Figure 2. A vertical line is drawn below the normal curve on the right side. This line is labeled, critical value. The region below the curve, to the right of this line is shaded and labeled, rejection region alpha. A normal distribution curve titled, two tailed test, is displayed below. The rejection region is on both the sides of the distribution curve. A vertical is drawn below the curve on the left side and is labeled, lower critical value. Another vertical is drawn below the curve on the right side and it is labeled, upper critical value. The region to this left and right side of the vertical is shaded and labeled, start fraction alpha over 2 end fraction. This represents the rejection region.
  • #19 The rejection region is on both the sides of the distribution curve. The lower critical value is at negative 2.0345. The region below the curve to the left of the lower value is shaded. The upper critical value is at 2.0345. The region below the curve to the right of the upper critical value is shaded. A vertical line is drawn to the curve at 2.73.
  • #29 The text on the spreadsheet reads, one sample test for the mean, population standard deviation unknown. The data are as follows. Sample size, 44. Sample mean, 21.91. sample standard deviation, 19.49. hypothesize value, 25. Level of significance. T statistic, negative 1.5017. one tailed t value, 1.6811. two tailed t value, + and negative, 2.0167. The hypothesis test results are as follows. The results of the lower one tailed test. Critical t value, negative 1.681070703, p value, 0.149416269. Conclusion, do not reject null hypothesis. The results of the upper one tailed test are as follows. Critical t value, 1.618070703. p value, 0.850583731. Conclusion, do not reject null hypothesis. Two tailed test, critical t value, + and negative, 2.016692199, p value, 0.298832538, conclusion, do not reject null hypothesis.
  • #35 The table has 11 columns and 7 rows. The column headings from left to right are as follows. Supplier, order number, item number, item description, item cost, quantity, cost per order, A per P terms in months, order date, arrival date, lead time. The row entries are as follows. Row 1. Hulkey Fasteners, A u g 11001, 1122, Airframe fasteners, $4.25, 19,500, $82,875.00, 30, August 5 2011, August 13 2011, 8. Row 2. Alum Sheeting, A u g 11002, 1243, Airframe fasteners, $4.25, 10,000, $42,500.00, 30, August 8 2011, August 14 2011, 6. Row 3. Fast Tie Aerospace, A u g 11003, 5462, Shielded cable per feet, $1.05, 23,000, $24,150.00, 30, August 10 2011, August 15 2011, 5. Row 4. Fast Tie Aerospace, A u g 11004, 5462, Shielded cable per feet, $1.05, 21,500, $22,575.00, 30, August 15 2011, August 22 2011 7. Row 5. Steel pin incorporated, A u g 11005, 5319, Shielded cable per feet, $1.10, 17,500, $19,250.00, 30, August 20 2011, August 31 2011, 11. Row 6. Fast Tie Aerospace, A u g 11006, 5462, Shielded cable per feet, $1.05, 22,500, $23,625.00, 30, August 20 2011, August 26 2011, 6. Row 7. Steel pin incorporated, A u g 11007, 4312, Bolt nut package, $3.75, 4,250, $15,937.50, 30, August 25 2011, August 01 2011, 7. The table has 2 columns and 9 columns. The column headings from left to right are as follows. Row labels and average of lead time. The row entries are as follows. Row 1. Alum sheeting, 7.00. Row 2. Durable products, 4.92. Row 3. Fast Tie Aerospace, 8.47. Row 4. Hulkey fasteners, 6.47. Row 5. Manley valve, 6.45. Row 6. Pylon Accessories, 8.00. Row 7. Space time Technologies, 15.25. Row 8. Steel pin incorporated, 10.20. Row 9. Grand total 8.41. The first three entries in the second column are encircled.
  • #38 The column headings from left to right are as follows. Supplier and Lead time. The row entries are as follows. Row 1. Alum sheeting, 6. Row 2. Alum sheeting, 9. Row 3. Alum sheeting, 7. Row 4. Alum sheeting, 7. Row 5. Alum sheeting, 5. Row 6. Alum sheeting, 7. Row 7. Alum sheeting, 9. Row 8. Alum sheeting, 6. Row 9. Durable products, 3. Row 10. Durable products, 4. Row 11. Durable products, 5. Row 12. Durable products, 6. Row 13. Durable products, 5. Row 14. Durable products, 5. Row 15. Durable products, 5. Row 16. Durable products, 6. Row 17. Durable products, 5. Row 18. Durable products, 5. Row 19. Durable products, 5. Row 20. Durable products, 5. Row 21. Durable products, 5. The dialog box has two panes. The input pane has entries against the fields as follows. Variable 1 range, $K $4 colon $K $11. Variable 2 range, $K $12 colon $K $24. Alpha, 0.05. The output options has three radio buttons. The radio button, new worksheet ply is selected. Okay button is highlighted.
  • #39 The table has 3 columns and 11 rows. The column headings from left to right are as follows. Parameter, Alum sheeting variable 1, Durable products variable 2. The row entries are as follows. Row 1. Mean, 7, 4.923076923. Row 2. Variance, 2, 0.576923077. Row 3. Observations, 8, 13. Row 4. Hypothesized mean difference, 0, blank. Row 5. d f, 10, blank. Row 6. t s t a t, 3.827958507, blank. Row 7. P of start expression upper T is less than or equal to lower t end expression one tail, 0.001664976, blank. Row 8. t critical one tail, 1.812461123, blank. Row 9. P of start expression upper T is less than or equal to lower t end expression two tail, 0.003329952, blank. Row 10. t critical two tail, 2.228138852, blank.
  • #41 The table has 3 columns and 16 rows. The column headings from left to right are as follows. Pile number, estimated pile length in feet and actual pile length in feet. The row entries are as follows. Row 1. 1, 10.58, 18.58. Row 2. 2, 10.58, 18.58. Row 3. 3, 10.58, 18.58. Row 4. 4, 10.58, 18.58. Row 5. 5, 10.58, 28.58. Row 6. 6, 10.58, 28.58. Row 7. 7, 10.58, 17.58. Row 8. 8, 10.58, 27.58. Row 9. 9, 10.58, 27.58. Row 10. 10, 10.58, 37.58. Row 11. 11, 10.58, 28.58. Row 12. 12, 5.83, 1.83. Row 13. 13, 5.83, 8.83. Row 14. 14, 5.83, 8.83. Row 15. 5.83, 8.83. Row 16. 16, 10.83, 16.83. The dialog box has two panes. The input pane has entries against the fields as follows. Variable 1 range, $B $4 colon $B $315. Variable 2 range, $C $4 colon $C $315. Alpha, 0.05. The output options has three radio buttons. The radio button, new worksheet ply is selected. Okay button is highlighted.
  • #42 The table has 3 columns and 11 rows. The column headings from left to right are as follows. Parameter, estimated, actual. The row entries are as follows. Row 1. Mean, 28.17755627, 34.55623794. Row 2. Variance, 255.8100385, 267.0113061. Row 3. Observations, 311, 311. Row 4. Pearson correlation, 0.79692836, blank. Row 5. Hypothesized mean difference, 0, blank. Row 6. d f, 310, blank. Row 7. t s t a t, negative 10.91225025, blank. Row 8. P of start expression upper T is less than or equal to lower t end expression one tail, 5.59435 E negative 24, blank. Row 9. t critical one tail, 1.649783823, blank. Row 10. P of start expression upper T is less than or equal to lower t end expression two tail, 1.11887 E negative 23, blank. Row 11. t critical two tail, 1.967645929, blank.
  • #44 Upper critical values of the F distribution for numerator degrees of freedom v sub 1 and denominator degrees of freedom v sub 2, 5% significance level has 11 columns and 10 rows. The columns and the rows represents v sub 1 and v sub 2 respectively. The column headings from left to right are as follows. V sub 2 or v sub 1, 1, 2, 3, 4, 5, 6, 7, 8, 9, and 10. The row entries are as follows. Row 1. 1, 161.448, 199.500, 215.707, 224.583, 230.162, 233.986, 236.768, 238.882, 240.543, 241.882. Row 2. 2, 18.513, 19.000, 19.164, 19.247, 19.296, 19.330, 19.353, 19.371, 19.385, 19.396. Row 3. 3, 10.128, 9.552, 9.277, 9.117, 9.013, 8.941, 8.887, 8.845, 8.812, 8.786. Row 4. 4, 7.709, 6.944, 6.591, 6.388, 6.256, 6.163, 6.094, 6.041, 5.999, 5.964. Row 5. 5, 6.608, 5.786, 5.409, 5.192, 5.050, 4.950, 4.876, 4.818, 4.772, 4.735. Row 6. 6, 5.987, 5.143, 4.757, 4.534, 4.387, 4.284, 4.207, 4.147, 4.099, 4.060. Row 7. 7, 5.591, 4.737, 4.347, 4.120, 3.972, 3.866, 3.787, 3.726, 3.677, 3.637. Row 8. 8, 5.318, 4.459, 4.066, 3.838, 3.687, 3.581, 3.500, 3.438, 3.388, 3.347. Row 9. 9, 5.117, 4.256, 3.863, 3.633, 3.482, 3.374, 3.293, 3.230, 3.179, 3.137. Row 10. 10, 4.965, 4.103, 3.708, 3.478, 3.326, 3.217, 3.135, 3.072, 3.020, 2.978.
  • #46 The table has 7 rows and 3 columns. The columns have the following headings from left to right. Calculation, Alum Sheeting Variable 1, Durable Products Variable 2. The row entries are as follows. Row 1. Mean, 7, 4.923076923. Row 2. Variance, 2, 0.576923077. Row 3. Observations, 8, 13. Row 4. d f, 7, 12. Row 5. F, 3.466666667, Blank. Row 6. P of left parenthesis upper f lesser than or equal to lower f right parenthesis one tail, 0.028595441, Blank. Row 7. F Critical one tail, 3.606514842, Blank.
  • #49 The dialog box has two sections that are input and output options. Under the input section, a text box corresponding to Input range reads, blank. Below the text box, grouped by section has two radio buttons, columns and rows, in which the columns radio button is selected. An unticked checkbox corresponds to labels in first row. A text box alpha reads, 0.05. The output section has three radio buttons, output range, new worksheet ply, and new workbook, in which the new workbook ply is selected. The ok command button is selected.
  • #50 The first table, titled summary, has 3 rows and 5 columns. The columns have the following headings from left to right. Groups, Count, Sum, Average, Variance. The row entries are as follows. Row 1. College graduate, 9, 31, 3.444444444, 1.027777778. Row 2. Graduate degree, 8, 36, 4.5, 0.571428571. Row 3. Some college, 7, 22, 3.142857143, 1.476190476. The second table, titled anova, has 3 rows and 7 columns. The columns have the following headings from left to right. Source of Variation, S , d f, M S, F, P value, F critical. The row entries are as follows. Row 1. Between Groups, 7.878968254, 2, 3.939484127, 3.924651732, 0.035635398, 3.466800112. Row 2. Within Groups, 21.07936508, 21, 1.003779289, blank, blank, blank. Row 3. Total, 28.95833333, 23, blank, blank, blank, blank.
  • #53 The first table has three columns and seven rows. The column headers from left to right are as follows. Respondent, gender, brand preference. The row entries are as follows. Row 1. 1, male, brand 3. Row 2. 2, female, brand 3. Row 3. 3, male, brand 3. Row 4. 4, male, brand 1. Row 5. 5, male, brand 1. Row 6. 6, female, brand 2. Row 7. 7, male, brand 2. The second table has 3 rows and 5 columns. The columns have the following headings from left to right. Count of Respondent Row Labels, Column Labels Brand 1, Brand 2, Brand 3, Grand Total. The row entries are as follows. Row 1. Female, 9, 6, 22, 37. Row 2. Male, 25, 17, 21, 63. Row 3. Grand Total, 34, 23, 43, 100.
  • #55 The first table has 3 rows and 5 columns. The columns have the following headings from left to right. Count of Respondent Row Labels, Column Labels Brand 1, Brand 2, Brand 3, Grand Total. The row entries are as follows. Row 1. Female, 9, 6, 22, 37. Row 2. Male, 25, 17, 21, 63. Row 3. Grand Total, 34, 23, 43, 100. The second table has 3 rows and 5 columns. The columns have the following headings from left to right. Expected frequency, Brand 1, Brand 2, Brand 3, Grand Total. The row entries are as follows. Row 1. Female, 12.58, 8.51, 15.91, 37. Row 2. Male, 21.42, 14.49, 27.09, 63. Row 3. Grand Total, 34, 23, 43, 100. The value 12.58 under column brand 1, row female is labeled, Expected frequency of female and brand 1 = start fraction 37 times 34 over 100 end fraction.
  • #57 The columns have the following headings from left to right. Degrees of freedom, 0.995, 0.990, 0.975, 0.950, 0.900, 0.750, 0.250, 0.100, 0.050, 0.025, 0.010, 0.005. The row entries are as follows. Row 1. 1, blank, blank, 0.001, 0.004, 0.016, 0.102, 1.323, 2.706, 3.841, 5.024, 6.635, 7.879. Row 2. 2, 0.010, 0.020, 0.051, 0.103, 0.211, 0.575, 2.773, 4.605, 5.991, 7.378, 9.210, 10.597. Row 3. 3, 0.072, 0.115, 0.216, 0.352, 0.584, 1.213, 4.108, 6.251, 7.815, 9.348, 11.345, 12.838. Row 4. 4, 0.207, 0.297, 0.484, 0.711, 1.064, 1.923, 5.385, 7.779, 9.488, 11.143, 13.277, 14.860. Row 5. 5, 0.412, 0.554, 0.831, 1.145, 1.610, 2.675, 6.626, 9.236, 11.071, 12.833, 15.086, 16.750. Row 6. 6, 0.676, 0.872, 1.237, 1.635, 2.204, 3.455, 7.841, 10.645, 12.592, 14.449, 16.812, 18.548. Row 7. 7, 0.989, 1.239, 1.690, 2.167, 2.833, 4.255, 9.037, 12.017, 14.067, 16.013, 18.475, 20.278. Row 8. 8, 1.344, 1.646, 2.180, 2.733, 3.490, 5.071, 10.219, 13.362, 15.507, 17.535, 20.090, 21.955. Row 9. 9, 1.735, 2.088, 2.700, 3.325, 4.168, 5.899, 11.389, 14.684, 16.919, 19.023, 21.666, 23.589. Row 10. 10, 2.158, 2.558, 3.247, 3.940, 4.865, 6.737, 12.549, 15.987, 18.307, 20.483, 23.209, 25.188.
  • #59 The first table has 3 rows and 5 columns. The columns have the following headings from left to right. Count of Respondent Row Labels, Column Labels Brand 1, Brand 2, Brand 3, Grand Total. The row entries are as follows. Row 1. Female, 9, 6, 22, 37. Row 2. Male, 25, 17, 21, 63. Row 3. Grand Total, 34, 23, 43, 100. The second table has 3 rows and 5 columns. The columns have the following headings from left to right. Expected frequency, Brand 1, Brand 2, Brand 3, Grand Total. The row entries are as follows. Row 1. Female, 12.58, 8.51, 15.91, 37. Row 2. Male, 21.42, 14.49, 27.09, 63. Row 3. Grand Total, 34, 23, 43, 100. The third table has 3 rows and 5 columns. The columns have the following headings from left to right. Chi square statistic, Brand 1, Brand 2, Brand 3, Grand Total. The row entries are as follows. Row 1. Female, 1.02, 0.74, 2.33, 4.09. Row 2. Male, 0.60, 0.43, 1.37, 2.40. Row 3. Grand Total, 1.62, 1.18, 3.70, 6.49. The value 6.49 is labeled, test statistic. Chi-square critical value is 5.99146455 and p value is 0.03892134.
  • #61 Gender, Brewed coffee, Iced coffee, Espresso, Cappuccino, Latte, Mocha, Iced blended, Tea, Total. The row entries are as follows. Row 1. Female, 6, 10, 2, 4, 7, 9, 10, 8, 56. Row 2. Male 16, 5, 8, 2, 3, 2, 7, 1, 44. Row 3. Total, 22, 15, 10, 6, 10, 11, 17, 9, 100. The first two rows are boxed. Gender, Brewed coffee, Iced coffee, Espresso, Cappuccino, Latte, Mocha, Iced blended, Tea, Total. The row entries are as follows. Row 1. Female, 12.32, 8.4, 5.6, 3.36, 5.6, 6.16, 9.52, 5.04, 56. Row 2. Male 9.68, 6.6, 4.4, 2.64, 4.4, 4.84, 7.48, 3.96, 44. Row 3. Total, 22, 15, 10, 6, 10, 11, 17, 9, 100. The first two rows in the table are boxed.
  • #62 Gender, Brewed coffee, Iced coffee, Espresso, Hot Specialty, Iced blended, Tea, Total. The row entries are as follows. Row 1. Female, 6, 10, 2, 20, 10, 8, 56. Row 2. Male 16, 5, 8, 7, 7, 1, 44. Row 3. Total, 22, 15, 10, 27, 17, 9, 100. The first two rows in the table are boxed. The columns have the following headings from left to right. Gender, Brewed coffee, Iced coffee, Espresso, Hot Specialty, Iced blended, Tea, Total. The row entries are as follows. Row 1. Female, 12.32, 8.4, 5.6, 15.12, 9.52, 5.04, 56. Row 2. Male 9.68, 6.6, 4.4, 11.88, 7.48, 3.96, 44. Row 3. Total, 22, 15, 10, 27, 17, 9, 100. The first two rows in the table are boxed.