SlideShare a Scribd company logo
1 of 67
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Statistics for
Business and Economics
7th Edition
Chapter 14
Analysis of Categorical Data
Ch. 14-1
Chapter Goals
After completing this chapter, you should be able to:
 Use the chi-square goodness-of-fit test to determine
whether data fits specified probabilities
 Perform tests for the Poisson and Normal distributions
 Set up a contingency analysis table and perform a chi-
square test of association
 Use the sign test for paired or matched samples
 Recognize when and how to use the Wilcoxon signed
rank test for paired or matched samples
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-2
Chapter Goals
After completing this chapter, you should be able to:
 Use a sign test for a single population median
 Apply a normal approximation for the Wilcoxon signed
rank test
 Know when and how to perform a Mann-Whitney U-test
 Explain Spearman rank correlation and perform a test
for association
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-3
(continued)
Nonparametric Statistics
 Nonparametric Statistics
 Fewer restrictive assumptions about data
levels and underlying probability distributions
 Population distributions may be skewed
 The level of data measurement may only be
ordinal or nominal
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-4
 Does sample data conform to a hypothesized
distribution?
 Examples:
 Do sample results conform to specified expected
probabilities?
 Are technical support calls equal across all days of
the week? (i.e., do calls follow a uniform
distribution?)
 Do measurements from a production process
follow a normal distribution?
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Goodness-of-Fit Tests
Ch. 14-5
14.1
 Are technical support calls equal across all days of the
week? (i.e., do calls follow a uniform distribution?)
 Sample data for 10 days per day of week:
Sum of calls for this day:
Monday 290
Tuesday 250
Wednesday 238
Thursday 257
Friday 265
Saturday 230
Sunday 192
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Chi-Square Goodness-of-Fit Test
(continued)
 = 1722
Ch. 14-6
 If calls are uniformly distributed, the 1722 calls
would be expected to be equally divided across
the 7 days:
 Chi-Square Goodness-of-Fit Test: test to see if
the sample results are consistent with the
expected results
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Logic of Goodness-of-Fit Test
uniformifdaypercallsexpected246
7
1722

Ch. 14-7
Observed vs. Expected
Frequencies
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Observed
Oi
Expected
Ei
Monday
Tuesday
Wednesday
Thursday
Friday
Saturday
Sunday
290
250
238
257
265
230
192
246
246
246
246
246
246
246
TOTAL 1722 1722
Ch. 14-8
Chi-Square Test Statistic
 The test statistic is
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
1)Kd.f.(where
E
)E(OK
1i i
2
ii2


 

where:
K = number of categories
Oi = observed frequency for category i
Ei = expected frequency for category i
H0: The distribution of calls is uniform
over days of the week
H1: The distribution of calls is not uniform
Ch. 14-9
The Rejection Region
 Reject H0 if
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall



K
1i i
2
ii2
E
)E(O

H0: The distribution of calls is uniform
over days of the week
H1: The distribution of calls is not uniform
2
α
2

0

2

Reject H0Do not
reject H0
(with k – 1 degrees
of freedom) 2
Ch. 14-10
Chi-Square Test Statistic
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
23.05
246
246)(192
...
246
246)(250
246
246)(290 222
2







H0: The distribution of calls is uniform
over days of the week
H1: The distribution of calls is not uniform
0
 = .05
Reject H0Do not
reject H0
2
k – 1 = 6 (7 days of the week) so
use 6 degrees of freedom:
2
.05 = 12.5916
2
.05 = 12.5916
Conclusion:
2 = 23.05 > 2
 = 12.5916 so
reject H0 and conclude that the
distribution is not uniform
Ch. 14-11
Goodness-of-Fit Tests,
Population Parameters Unknown
Idea:
 Test whether data follow a specified distribution
(such as binomial, Poisson, or normal) . . .
 . . . without assuming the parameters of the
distribution are known
 Use sample data to estimate the unknown
population parameters
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-12
14.2
Goodness-of-Fit Tests,
Population Parameters Unknown
 Suppose that a null hypothesis specifies category
probabilities that depend on the estimation (from the
data) of m unknown population parameters
 The appropriate goodness-of-fit test is the same as in
the previously section . . .
 . . . except that the number of degrees of freedom for
the chi-square random variable is
 Where K is the number of categories
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall



K
1i i
2
ii2
E
)E(O

1)m(KFreedomofDegrees 
(continued)
Ch. 14-13
Test of Normality
 The assumption that data follow a normal
distribution is common in statistics
 Normality was assessed in prior chapters (for
example, with Normal probability plots in
Chapter 5)
 Here, a chi-square test is developed
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-14
14.3
Test of Normality
 Two population parameters can be estimated using
sample data:
 For a normal distribution,
Skewness = 0
Kurtosis = 3
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
(continued)
3
n
1i
3
i
ns
)x(x
Skewness



4
n
1i
4
i
ns
)x(x
Kurtosis



Ch. 14-15
Jarque-Bera
Test for Normality
 Consider the null hypothesis that the population
distribution is normal
 The Jarque-Bera Test for Normality is based on the closeness the
sample skewness to 0 and the sample kurtosis to 3
 The test statistic is
 as the number of sample observations becomes very large, this
statistic has a chi-square distribution with 2 degrees of freedom
 The null hypothesis is rejected for large values of the test statistic
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall





 

24
3)(Kurtosis
6
(Skewness)
nJB
22
Ch. 14-16
Jarque-Bera
Test for Normality
 The chi-square approximation is close only for very
large sample sizes
 If the sample size is not very large, the Bowman-
Shelton test statistic is compared to significance points
from text Table 14.9
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
(continued)
Sample
size N
10%
point
5% point Sample
size N
10%
point
5% point
20
30
40
50
75
100
125
150
2.13
2.49
2.70
2.90
3.09
3.14
3.31
3.43
3.26
3.71
3.99
4.26
4.27
4.29
4.34
4.39
200
250
300
400
500
800
∞
3.48
3.54
3.68
3.76
3.91
4.32
4.61
4.43
4.61
4.60
4.74
4.82
5.46
5.99
Ch. 14-17
Example: Jarque-Bera
Test for Normality
 The average daily temperature has been recorded for
200 randomly selected days, with sample skewness
0.232 and kurtosis 3.319
 Test the null hypothesis that the true distribution is
normal
 From Table 14.9 the 10% critical value for n = 200 is
3.48, so there is not sufficient evidence to reject that the
population is normal
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
2.642
24
3)(3.319
6
(0.232)
200
24
3)(Kurtosis
6
(Skewness)
nJB
2222





 





 

Ch. 14-18
Contingency Tables
Contingency Tables
 Used to classify sample observations according
to a pair of attributes
 Also called a cross-classification or cross-
tabulation table
 Assume r categories for attribute A and c
categories for attribute B
 Then there are (r x c) possible cross-classifications
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-19
14.3
r x c Contingency Table
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Attribute B
Attribute A 1 2 . . . C Totals
1
2
.
.
.
r
Totals
O11
O21
.
.
.
Or1
C1
O12
O22
.
.
.
Or2
C2
…
…
…
…
…
…
…
O1c
O2c
.
.
.
Orc
Cc
R1
R2
.
.
.
Rr
n
Ch. 14-20
Test for Association
 Consider n observations tabulated in an r x c
contingency table
 Denote by Oij the number of observations in
the cell that is in the ith row and the jth column
 The null hypothesis is
 The appropriate test is a chi-square test with
(r-1)(c-1) degrees of freedom
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
populationtheinattributestwothebetween
existsnassociatioNo:H0
Ch. 14-21
Test for Association
 Let Ri and Cj be the row and column totals
 The expected number of observations in cell row i and
column j, given that H0 is true, is
 A test of association at a significance level  is based
on the chi-square distribution and the following decision
rule
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
2
1),1)c(r
r
1i
c
1j ij
2
ijij2
0
E
)E(O
ifHReject αχχ 
 


 
n
CR
E
ji
ij 
(continued)
Ch. 14-22
Contingency Table Example
H0: There is no association between
hand preference and gender
H1: Hand preference is not independent of gender
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Left-Handed vs. Gender
 Dominant Hand: Left vs. Right
 Gender: Male vs. Female
Ch. 14-23
Contingency Table Example
Sample results organized in a contingency table:
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Gender
Hand Preference
Left Right
Female 12 108 120
Male 24 156 180
36 264 300
120 Females, 12
were left handed
180 Males, 24 were
left handed
sample size = n = 300:
(continued)
Ch. 14-24
Logic of the Test
 If H0 is true, then the proportion of left-handed females
should be the same as the proportion of left-handed
males
 The two proportions above should be the same as the
proportion of left-handed people overall
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
H0: There is no association between
hand preference and gender
H1: Hand preference is not independent of gender
Ch. 14-25
Finding Expected Frequencies
Overall:
P(Left Handed)
= 36/300 = .12
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
120 Females, 12
were left handed
180 Males, 24 were
left handed
If no association, then
P(Left Handed | Female) = P(Left Handed | Male) = .12
So we would expect 12% of the 120 females and 12% of the 180
males to be left handed…
i.e., we would expect (120)(.12) = 14.4 females to be left handed
(180)(.12) = 21.6 males to be left handed
Ch. 14-26
Expected Cell Frequencies
 Expected cell frequencies:
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
sizesampleTotal
total)Columntotal)(jRow(i
n
CR
E
thth
ji
ij 
14.4
300
(120)(36)
E11 
Example:
(continued)
Ch. 14-27
Observed vs. Expected
Frequencies
Observed frequencies vs. expected frequencies:
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Gender
Hand Preference
Left Right
Female
Observed = 12
Expected = 14.4
Observed = 108
Expected = 105.6
120
Male
Observed = 24
Expected = 21.6
Observed = 156
Expected = 158.4
180
36 264 300
Ch. 14-28
The Chi-Square Test Statistic
 where:
Oij = observed frequency in cell (i, j)
Eij = expected frequency in cell (i, j)
r = number of rows
c = number of columns
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
 


r
1i
c
1j ij
2
ijij2
E
)E(O

The Chi-square test statistic is:
)1c)(1r(.f.dwith 
Ch. 14-29
Observed vs. Expected
Frequencies
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Gender
Hand Preference
Left Right
Female
Observed = 12
Expected = 14.4
Observed = 108
Expected = 105.6
120
Male
Observed = 24
Expected = 21.6
Observed = 156
Expected = 158.4
180
36 264 300
6848.0
4.158
)4.158156(
6.21
)6.2124(
6.105
)6.105108(
4.14
)4.1412( 2222
2









Ch. 14-30
Contingency Analysis
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
2
2
.05 = 3.841
Reject H0
 = 0.05
Decision Rule:
If 2 > 3.841, reject H0,
otherwise, do not reject H0
1(1)(1)1)-1)(c-(rd.f.with6848.02

Do not reject H0
Here, 2 = 0.6848
< 3.841, so we
do not reject H0
and conclude that
gender and hand
preference are not
associated
Ch. 14-31
Nonparametric Tests for
Paired or Matched Samples
 A sign test for paired or matched samples:
 Calculate the differences of the paired observations
 Discard the differences equal to 0, leaving n
observations
 Record the sign of the difference as + or –
 For a symmetric distribution, the signs are
random and + and – are equally likely
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-32
14.4
Sign Test
 Define + to be a “success” and let P = the true
proportion of +’s in the population
 The sign test is used for the hypothesis test
 The test-statistic S for the sign test is
S = the number of pairs with a positive difference
 S has a binomial distribution with P = 0.5 and
n = the number of nonzero differences
(continued)
0.5P:H0 
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-33
Determining the p-value
 The p-value for a Sign Test is found using the binomial
distribution with n = number of nonzero differences, S =
number of positive differences, and P = 0.5
 For an upper-tail test, H1: P > 0.5, p-value = P(x  S)
 For a lower-tail test, H1: P < 0.5, p-value = P(x  S)
 For a two-tail test, H1: P  0.5, 2(p-value)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-34
Sign Test Example
 Ten consumers in a focus group have rated the
attractiveness of two package designs for a new product
Consumer Rating Difference Sign of Difference
Package 1 Package 2 Rating 1 – 2
1
2
3
4
5
6
7
8
9
10
5
4
4
6
3
5
7
5
6
7
8
8
4
5
9
9
6
9
3
9
-3
-4
0
+1
-6
-4
-1
-4
+3
-2
–
–
0
+
–
–
–
–
+
–
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-35
Sign Test Example
 Test the hypothesis that there is no overall package preference
using  = 0.10
The proportion of consumers who prefer
package 1 is the same as the proportion
preferring package 2
A majority prefer package 2
 The test-statistic S for the sign test is
S = the number of pairs with a positive difference
= 2
 S has a binomial distribution with P = 0.5 and n = 9 (there was
one zero difference)
(continued)
0.5P:H0 
0.5P:H1 
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-36
Sign Test Example
 The p-value for this sign test is found using the binomial
distribution with n = 9, S = 2, and P = 0.5:
 For a lower-tail test,
p-value = P(x  2|n=9, P=0.5)
= 0.090
Since 0.090 <  = 0.10 we reject the null hypothesis
and conclude that consumers prefer package 2
(continued)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-37
Wilcoxon Signed Rank Test for
Paired or Matched Samples
 Uses matched pairs of random observations
 Still based on ranks
 Incorporates information about the magnitude
of the differences
 Tests the hypothesis that the distribution of
differences is centered at zero
 The population of paired differences is
assumed to be symmetric
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-38
Wilcoxon Signed Rank Test for
Paired or Matched Samples
Conducting the test:
 Discard pairs for which the difference is 0
 Rank the remaining n absolute differences in ascending order
(ties are assigned the average of their ranks)
 Find the sums of the positive ranks and the negative ranks
 The smaller of these sums is the Wilcoxon Signed Rank Statistic T:
T = min(T+ , T- )
Where T+ = the sum of the positive ranks
T- = the sum of the negative ranks
n = the number of nonzero differences
 The null hypothesis is rejected if T is less than or equal to the value in
Appendix Table 10
(continued)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-39
Signed Rank Test Example
T+ = 3 T– = 42
Consumer Rating Difference
Package 1 Package 2 Diff (rank) Rank (+) Rank (–)
1
2
3
4
5
6
7
8
9
10
5
4
4
6
3
5
7
5
6
7
8
8
4
5
9
9
6
9
3
9
-3 (5)
-4 (7 tie)
0 (-)
+1 (2)
-6 (9)
-4 (7 tie)
-1 (3)
-4 (7 tie)
+3 (1)
-2 (4)
2
1
5
7
9
7
3
7
4
 Ten consumers in a focus group have
rated the attractiveness of two package
designs for a new product
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-40
Signed Rank Test Example
Test the hypothesis that the distribution of paired
differences is centered at zero, using  = 0.10
Conducting the test:
 The smaller of T+ and T- is the Wilcoxon Signed Rank Statistic T:
T = min(T+ , T- ) = 3
 Use Appendix Table 10 with n = 9 to find the critical value:
The null hypothesis is rejected if T ≤ 4
 Since T = 3 < 4, we reject the null hypothesis
(continued)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-41
Normal Approximation
to the Sign Test
 If the number n of nonzero sample observations is large,
then the sign test is based on the normal approximation to
the binomial with mean and standard deviation
 The test statistic is
 Where S* is the test-statistic corrected for continuity:
 For a two-tail test, S* = S + 0.5, if S < μ or S* = S – 0.5, if S > μ
 For upper-tail test, S* = S – 0.5
 For lower-tail test, S* = S + 0.5
n0.50.25nP)nP(1σ
0.5nnPμ


n0.5
0.5n*S
σ
μ*S
Z




Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-42
Normal Approximation to the
Wilcoxon Signed Rank Test
A normal approximation can be used when
 Paired samples are observed
 The sample size is large (n > 20)
 The hypothesis test is that the population
distribution of differences is centered at zero
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-43
Wilcoxon Matched Pairs Test
for Large Samples
 The mean and standard deviation for
Wilcoxon T :
4
1)n(n
μE(T) T


24
1)1)(2n(n)(n
σVar(T) 2
T


where n is the number of paired values
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-44
Wilcoxon Matched Pairs Test
for Large Samples
 Normal approximation for the Wilcoxon T Statistic:
(continued)
24
1)1)(2nn(n
4
1)n(n
T
σ
μT
z
T
T






 If the alternative hypothesis is one-sided, reject the null
hypothesis if
 If the alternative hypothesis is two-sided, reject the null
hypothesis if
α
T
T
z
σ
μT


α/2
T
T
z
σ
μT


Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-45
Sign Test for
Single Population Median
 The sign test can be used to test that a single
population median is equal to a specified value
 For small samples, use the binomial
distribution
 For large samples, use the normal
approximation
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-46
Nonparametric Tests for
Independent Random Samples
Used to compare two samples from two populations
Assumptions:
 The two samples are independent and random
 The value measured is a continuous variable
 The two distributions are identical except for a possible
difference in the central location
 The sample size from each population is at least 10
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-47
14.5
Mann-Whitney U-Test
 Consider two samples
 Pool the two samples (combine into a singe list) but
keep track of which sample each value came from
 rank the values in the combined list in ascending
order
 For ties, assign each the average rank of the tied values
 sum the resulting rankings separately for each
sample
 If the sum of rankings from one sample differs enough
from the sum of rankings from the other sample, we
conclude there is a difference in the population
medians
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-48
Mann-Whitney U Statistic
 Consider n1 observations from the first population and
n2 observations from the second
 Let R1 denote the sum of the ranks of the observations
from the first population
 The Mann-Whitney U statistic is
1
11
21 R
2
1)(nn
nnU 


Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-49
Mann-Whitney U Statistic
 The null hypothesis is that the medians of the two
population distributions are the same
 The Mann-Whitney U statistic has mean and variance
 Then for large sample sizes (both at least 10), the
distribution of the random variable
is approximated by the normal distribution
(continued)
2
nn
μE(U) 21
U 
12
1)n(nnn
σVar(U) 21212
U


U
U
σ
μU
z


Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-50
Decision Rules for
Mann-Whitney Test
The decision rule for the null hypothesis that the two
populations have the same medians:
 For a one-sided upper-tailed alternative hypothesis:
 For a one-sided lower-tailed hypothesis:
 For a two-sided alternative hypothesis:
α
U
U
0 z
σ
μU
zifHReject 


α
U
U
0 z
σ
μU
zifHReject 


α/2
U
U
0α/2
U
U
0 z
σ
μU
zifHRejectorz
σ
μU
zifHReject 




Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-51
Mann-Whitney U-Test Example
Claim: Median class size for Math is larger
than the median class size for English
A random sample of 10 Math and 10 English
classes is selected (samples do not have to
be of equal size)
Rank the combined values and then
determine rankings by original sample
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-52
Mann-Whitney U-Test Example
 Suppose the results are:
Class size (Math, M) Class size (English, E)
23
45
34
78
34
66
62
95
81
99
30
47
18
34
44
61
54
28
40
96
(continued)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-53
Mann-Whitney U-Test Example
Size Rank
18 1
23 2
28 3
30 4
34 6
34 6
34 6
40 8
44 9
45 10
Size Rank
47 11
54 12
61 13
62 14
66 15
78 16
81 17
95 18
96 19
99 20
Ranking for combined samples
tied
(continued)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-54
Mann-Whitney U-Test Example
 Rank by
original
sample:
Class size
(Math, M)
Rank
Class size
(English, E)
Rank
23
45
34
78
34
66
62
95
81
99
2
10
6
16
6
15
14
18
17
20
30
47
18
34
44
61
54
28
40
96
4
11
1
6
9
13
12
3
8
19
 = 124  = 86
(continued)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-55
Mann-Whitney U-Test Example
H0: MedianM ≤ MedianE
(Math median is not greater
than English median)
HA: MedianM > MedianE
(Math median is larger)
Claim: Median class size for
Math is larger than the
median class size for English
31124
2
(10)(11)
(10)(10)R
2
1)(nn
nnU 1
11
21 

 
(continued)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-56
Mann-Whitney U-Test Example
 The decision rule for this one-sided upper-tailed alternative
hypothesis:
 For  = 0.05, -z = -1.645
 The calculated z value is not in the rejection region, so we
conclude that there is not sufficient evidence of difference in class
size medians
(continued)
1.436
12
1)100(10)(10)(1
2
(10)(10)
31
12
1)n(nnn
2
nn
U
σ
μU
z
2121
21
U
U









H0: MedianM ≤ MedianE
HA: MedianM > MedianE
α
U
U
0 z
σ
μU
zifHReject 


Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-57
Wilcoxon Rank Sum Test
 Similar to Mann-Whitney U test
 Results will be the same for both tests
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-58
Wilcoxon Rank Sum Test
 n1 observations from the first population
 n2 observations from the second population
 Pool the samples and rank the observations in
ascending order
 Let T denote the sum of the ranks of the
observations from the first population
 (T in the Wilcoxon Rank Sum Test is the same
as R1 in the Mann-Whitney U Test)
(continued)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-59
Wilcoxon Rank Sum Test
 The Wilcoxon Rank Sum Statistic, T, has mean
 And variance
 Then, for large samples (n1  10 and n2  10) the
distribution of the random variable
is approximated by the normal distribution
(continued)
2
1)n(nn
μE(T) 211
T


12
1)n(nnn
σVar(T) 21212
T


T
T
σ
μT
Z


Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-60
Wilcoxon Rank Sum Example
 We wish to test
 Use  = 0.05
 Suppose two samples are obtained:
 n1 = 40 , n2 = 50
 When rankings are completed, the sum of ranks
for sample 1 is R1 = 1475 = T
 When rankings are completed, the sum of ranks
for sample 2 is R2 = 2620
H0: Median1  Median2
H1: Median1 < Median2
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-61
Wilcoxon Rank Sum Example
 Using the normal approximation:
(continued)
2.80
12
1)500(40)(50)(4
2
1)50(40)(40
1475
12
1)n(nnn
2
1)n(nn
T
σ
μT
z
2121
211
T
T











Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-62
Wilcoxon Rank Sum Example
Since z = -2.80 < -1.645, we reject H0 and conclude
that median 1 is less than median 2 at the 0.05 level
of significance
645.1z 
Reject H0
 = .05
Do not reject H0
0
(continued)
2.80
σ
μT
z
T
T



H0: Median1  Median2
H1: Median1 < Median2
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-63
Spearman Rank Correlation
 Consider a random sample (x1 , y1), . . .,(xn, yn) of n pairs of
observations
 Rank xi and yi each in ascending order
 Calculate the sample correlation of these ranks
 The resulting coefficient is called Spearman’s Rank Correlation
Coefficient.
 If there are no tied ranks, an equivalent formula for computing this
coefficient is
where the di are the differences of the ranked pairs
1)n(n
d6
1r 2
n
1i
2
i
S



Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-64
14.6
Spearman Rank Correlation
 Consider the null hypothesis
H0: no association in the population
 To test against the alternative of positive association,
the decision rule is
 To test against the alternative of negative association,
the decision rule is
 To test against the two-sided alternative of some
association, the decision rule is
(continued)
αS,S0 rrifHReject 
αS,S0 rrifHReject 
/2S,S/2S,S0 rrorrrifHReject αα 
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-65
Chapter Summary
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
 Used the chi-square goodness-of-fit test to determine
whether sample data match specified probabilities
 Conducted goodness-of-fit tests when a population
parameter was unknown
 Tested for normality using the Jarque-Bera test
 Used contingency tables to perform a chi-square test for
association
 Compared observed cell frequencies to expected cell
frequencies
Ch. 14-66
Chapter Summary
 Used the sign test for paired or matched samples, and
the normal approximation for the sign test
 Developed and applied the Wilcoxon signed rank test,
and the large sample normal approximation
 Developed and applied the Mann-Whitney U-test for
two population medians
 Used the Wilcoxon rank-sum test
 Examined Spearman rank correlation for tests of
association
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-67
(continued)

More Related Content

What's hot

Hypothesis testing: A single sample test
Hypothesis testing: A single sample testHypothesis testing: A single sample test
Hypothesis testing: A single sample testUmme Salma Tuli
 
law of large number and central limit theorem
 law of large number and central limit theorem law of large number and central limit theorem
law of large number and central limit theoremlovemucheca
 
Statistical tests for categorical data
Statistical tests for categorical dataStatistical tests for categorical data
Statistical tests for categorical dataRizwan S A
 
Chapter 4 part1-Probability Model
Chapter 4 part1-Probability ModelChapter 4 part1-Probability Model
Chapter 4 part1-Probability Modelnszakir
 
Bbs11 ppt ch08
Bbs11 ppt ch08Bbs11 ppt ch08
Bbs11 ppt ch08Tuul Tuul
 
Inferential vs descriptive tutorial of when to use - Copyright Updated
Inferential vs descriptive tutorial of when to use - Copyright UpdatedInferential vs descriptive tutorial of when to use - Copyright Updated
Inferential vs descriptive tutorial of when to use - Copyright UpdatedKen Plummer
 
MANN WHITNEY U TEST.pptx
MANN WHITNEY U TEST.pptxMANN WHITNEY U TEST.pptx
MANN WHITNEY U TEST.pptxRichelleDordas1
 
Categorical data analysis
Categorical data analysisCategorical data analysis
Categorical data analysisSumit Das
 
Advanced Biostatistics - Simplified
Advanced Biostatistics - SimplifiedAdvanced Biostatistics - Simplified
Advanced Biostatistics - SimplifiedMohammed Alhefzi
 
1. Introduction to Survival analysis
1. Introduction to Survival analysis1. Introduction to Survival analysis
1. Introduction to Survival analysisGhada Abu sheasha
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statisticsaan786
 
Basic survival analysis
Basic survival analysisBasic survival analysis
Basic survival analysisMike LaValley
 
SURVIVAL ANALYSIS.ppt
SURVIVAL ANALYSIS.pptSURVIVAL ANALYSIS.ppt
SURVIVAL ANALYSIS.pptmbang ernest
 
UNIT VII EPIDEMIOLOGICAL METHODS PDF.pdf
UNIT VII EPIDEMIOLOGICAL METHODS PDF.pdfUNIT VII EPIDEMIOLOGICAL METHODS PDF.pdf
UNIT VII EPIDEMIOLOGICAL METHODS PDF.pdfBurhan Khan
 

What's hot (20)

Hypothesis testing: A single sample test
Hypothesis testing: A single sample testHypothesis testing: A single sample test
Hypothesis testing: A single sample test
 
law of large number and central limit theorem
 law of large number and central limit theorem law of large number and central limit theorem
law of large number and central limit theorem
 
Statistical tests for categorical data
Statistical tests for categorical dataStatistical tests for categorical data
Statistical tests for categorical data
 
Survival analysis
Survival analysis  Survival analysis
Survival analysis
 
Goodness of-fit
Goodness of-fit  Goodness of-fit
Goodness of-fit
 
Chapter 4 part1-Probability Model
Chapter 4 part1-Probability ModelChapter 4 part1-Probability Model
Chapter 4 part1-Probability Model
 
Sample size
Sample sizeSample size
Sample size
 
Bbs11 ppt ch08
Bbs11 ppt ch08Bbs11 ppt ch08
Bbs11 ppt ch08
 
Inferential vs descriptive tutorial of when to use - Copyright Updated
Inferential vs descriptive tutorial of when to use - Copyright UpdatedInferential vs descriptive tutorial of when to use - Copyright Updated
Inferential vs descriptive tutorial of when to use - Copyright Updated
 
MANN WHITNEY U TEST.pptx
MANN WHITNEY U TEST.pptxMANN WHITNEY U TEST.pptx
MANN WHITNEY U TEST.pptx
 
Categorical data analysis
Categorical data analysisCategorical data analysis
Categorical data analysis
 
Sampling Distribution
Sampling DistributionSampling Distribution
Sampling Distribution
 
Advanced Biostatistics - Simplified
Advanced Biostatistics - SimplifiedAdvanced Biostatistics - Simplified
Advanced Biostatistics - Simplified
 
1. Introduction to Survival analysis
1. Introduction to Survival analysis1. Introduction to Survival analysis
1. Introduction to Survival analysis
 
Sign Test
Sign TestSign Test
Sign Test
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
 
Basic survival analysis
Basic survival analysisBasic survival analysis
Basic survival analysis
 
SURVIVAL ANALYSIS.ppt
SURVIVAL ANALYSIS.pptSURVIVAL ANALYSIS.ppt
SURVIVAL ANALYSIS.ppt
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
UNIT VII EPIDEMIOLOGICAL METHODS PDF.pdf
UNIT VII EPIDEMIOLOGICAL METHODS PDF.pdfUNIT VII EPIDEMIOLOGICAL METHODS PDF.pdf
UNIT VII EPIDEMIOLOGICAL METHODS PDF.pdf
 

Similar to Chap14 analysis of categorical data

Chap06 sampling and sampling distributions
Chap06 sampling and sampling distributionsChap06 sampling and sampling distributions
Chap06 sampling and sampling distributionsJudianto Nugroho
 
Newbold_chap16.ppt
Newbold_chap16.pptNewbold_chap16.ppt
Newbold_chap16.pptcfisicaster
 
Newbold_chap10.ppt
Newbold_chap10.pptNewbold_chap10.ppt
Newbold_chap10.pptcfisicaster
 
Exam III Tian Xia
Exam III Tian XiaExam III Tian Xia
Exam III Tian XiaTIAN XIA
 
Chap01 describing data; graphical
Chap01 describing data;  graphicalChap01 describing data;  graphical
Chap01 describing data; graphicalJudianto Nugroho
 
Newbold_chap15.ppt
Newbold_chap15.pptNewbold_chap15.ppt
Newbold_chap15.pptcfisicaster
 
Levine smume7 ch09 modified
Levine smume7 ch09 modifiedLevine smume7 ch09 modified
Levine smume7 ch09 modifiedmgbardossy
 
MPhil clinical psy Non-parametric statistics.pptx
MPhil clinical psy Non-parametric statistics.pptxMPhil clinical psy Non-parametric statistics.pptx
MPhil clinical psy Non-parametric statistics.pptxrodrickrajamanickam
 
Nonparametric statistics ppt @ bec doms
Nonparametric statistics ppt @ bec domsNonparametric statistics ppt @ bec doms
Nonparametric statistics ppt @ bec domsBabasab Patil
 
Nonparametric statistics ppt @ bec doms
Nonparametric statistics ppt @ bec domsNonparametric statistics ppt @ bec doms
Nonparametric statistics ppt @ bec domsBabasab Patil
 
Day 11 t test for independent samples
Day 11 t test for independent samplesDay 11 t test for independent samples
Day 11 t test for independent samplesElih Sutisna Yanto
 

Similar to Chap14 analysis of categorical data (20)

Chap06 sampling and sampling distributions
Chap06 sampling and sampling distributionsChap06 sampling and sampling distributions
Chap06 sampling and sampling distributions
 
Chap09 hypothesis testing
Chap09 hypothesis testingChap09 hypothesis testing
Chap09 hypothesis testing
 
ch14.ppt
ch14.pptch14.ppt
ch14.ppt
 
Newbold_chap16.ppt
Newbold_chap16.pptNewbold_chap16.ppt
Newbold_chap16.ppt
 
Newbold_chap10.ppt
Newbold_chap10.pptNewbold_chap10.ppt
Newbold_chap10.ppt
 
Chi square analysis
Chi square analysisChi square analysis
Chi square analysis
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Exam III Tian Xia
Exam III Tian XiaExam III Tian Xia
Exam III Tian Xia
 
sigir2018tutorial
sigir2018tutorialsigir2018tutorial
sigir2018tutorial
 
Chap01 describing data; graphical
Chap01 describing data;  graphicalChap01 describing data;  graphical
Chap01 describing data; graphical
 
Newbold_chap15.ppt
Newbold_chap15.pptNewbold_chap15.ppt
Newbold_chap15.ppt
 
Day 3 SPSS
Day 3 SPSSDay 3 SPSS
Day 3 SPSS
 
Levine smume7 ch09 modified
Levine smume7 ch09 modifiedLevine smume7 ch09 modified
Levine smume7 ch09 modified
 
MPhil clinical psy Non-parametric statistics.pptx
MPhil clinical psy Non-parametric statistics.pptxMPhil clinical psy Non-parametric statistics.pptx
MPhil clinical psy Non-parametric statistics.pptx
 
Nonparametric statistics ppt @ bec doms
Nonparametric statistics ppt @ bec domsNonparametric statistics ppt @ bec doms
Nonparametric statistics ppt @ bec doms
 
Nonparametric statistics ppt @ bec doms
Nonparametric statistics ppt @ bec domsNonparametric statistics ppt @ bec doms
Nonparametric statistics ppt @ bec doms
 
Himani
HimaniHimani
Himani
 
Himani sharma
Himani sharmaHimani sharma
Himani sharma
 
Research Procedure
Research ProcedureResearch Procedure
Research Procedure
 
Day 11 t test for independent samples
Day 11 t test for independent samplesDay 11 t test for independent samples
Day 11 t test for independent samples
 

More from Judianto Nugroho (20)

Chap14 en-id
Chap14 en-idChap14 en-id
Chap14 en-id
 
Chap19 en-id
Chap19 en-idChap19 en-id
Chap19 en-id
 
Chap18 en-id
Chap18 en-idChap18 en-id
Chap18 en-id
 
Chap16 en-id
Chap16 en-idChap16 en-id
Chap16 en-id
 
Chap15 en-id
Chap15 en-idChap15 en-id
Chap15 en-id
 
Chap17 en-id
Chap17 en-idChap17 en-id
Chap17 en-id
 
Chap13 en-id
Chap13 en-idChap13 en-id
Chap13 en-id
 
Chap12 en-id
Chap12 en-idChap12 en-id
Chap12 en-id
 
Chap11 en-id
Chap11 en-idChap11 en-id
Chap11 en-id
 
Chap10 en-id
Chap10 en-idChap10 en-id
Chap10 en-id
 
Chap09 en-id
Chap09 en-idChap09 en-id
Chap09 en-id
 
Chap08 en-id
Chap08 en-idChap08 en-id
Chap08 en-id
 
Chap05 en-id
Chap05 en-idChap05 en-id
Chap05 en-id
 
Chap07 en-id
Chap07 en-idChap07 en-id
Chap07 en-id
 
Chap06 en-id
Chap06 en-idChap06 en-id
Chap06 en-id
 
Chap04 en-id
Chap04 en-idChap04 en-id
Chap04 en-id
 
Chap03 en-id
Chap03 en-idChap03 en-id
Chap03 en-id
 
Chap02 en-id
Chap02 en-idChap02 en-id
Chap02 en-id
 
Chap01 en-id
Chap01 en-idChap01 en-id
Chap01 en-id
 
Spss session 1 and 2
Spss session 1 and 2Spss session 1 and 2
Spss session 1 and 2
 

Recently uploaded

ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 

Recently uploaded (20)

ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 

Chap14 analysis of categorical data

  • 1. Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7th Edition Chapter 14 Analysis of Categorical Data Ch. 14-1
  • 2. Chapter Goals After completing this chapter, you should be able to:  Use the chi-square goodness-of-fit test to determine whether data fits specified probabilities  Perform tests for the Poisson and Normal distributions  Set up a contingency analysis table and perform a chi- square test of association  Use the sign test for paired or matched samples  Recognize when and how to use the Wilcoxon signed rank test for paired or matched samples Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-2
  • 3. Chapter Goals After completing this chapter, you should be able to:  Use a sign test for a single population median  Apply a normal approximation for the Wilcoxon signed rank test  Know when and how to perform a Mann-Whitney U-test  Explain Spearman rank correlation and perform a test for association Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-3 (continued)
  • 4. Nonparametric Statistics  Nonparametric Statistics  Fewer restrictive assumptions about data levels and underlying probability distributions  Population distributions may be skewed  The level of data measurement may only be ordinal or nominal Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-4
  • 5.  Does sample data conform to a hypothesized distribution?  Examples:  Do sample results conform to specified expected probabilities?  Are technical support calls equal across all days of the week? (i.e., do calls follow a uniform distribution?)  Do measurements from a production process follow a normal distribution? Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Goodness-of-Fit Tests Ch. 14-5 14.1
  • 6.  Are technical support calls equal across all days of the week? (i.e., do calls follow a uniform distribution?)  Sample data for 10 days per day of week: Sum of calls for this day: Monday 290 Tuesday 250 Wednesday 238 Thursday 257 Friday 265 Saturday 230 Sunday 192 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Chi-Square Goodness-of-Fit Test (continued)  = 1722 Ch. 14-6
  • 7.  If calls are uniformly distributed, the 1722 calls would be expected to be equally divided across the 7 days:  Chi-Square Goodness-of-Fit Test: test to see if the sample results are consistent with the expected results Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Logic of Goodness-of-Fit Test uniformifdaypercallsexpected246 7 1722  Ch. 14-7
  • 8. Observed vs. Expected Frequencies Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Observed Oi Expected Ei Monday Tuesday Wednesday Thursday Friday Saturday Sunday 290 250 238 257 265 230 192 246 246 246 246 246 246 246 TOTAL 1722 1722 Ch. 14-8
  • 9. Chi-Square Test Statistic  The test statistic is Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall 1)Kd.f.(where E )E(OK 1i i 2 ii2      where: K = number of categories Oi = observed frequency for category i Ei = expected frequency for category i H0: The distribution of calls is uniform over days of the week H1: The distribution of calls is not uniform Ch. 14-9
  • 10. The Rejection Region  Reject H0 if Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall    K 1i i 2 ii2 E )E(O  H0: The distribution of calls is uniform over days of the week H1: The distribution of calls is not uniform 2 α 2  0  2  Reject H0Do not reject H0 (with k – 1 degrees of freedom) 2 Ch. 14-10
  • 11. Chi-Square Test Statistic Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall 23.05 246 246)(192 ... 246 246)(250 246 246)(290 222 2        H0: The distribution of calls is uniform over days of the week H1: The distribution of calls is not uniform 0  = .05 Reject H0Do not reject H0 2 k – 1 = 6 (7 days of the week) so use 6 degrees of freedom: 2 .05 = 12.5916 2 .05 = 12.5916 Conclusion: 2 = 23.05 > 2  = 12.5916 so reject H0 and conclude that the distribution is not uniform Ch. 14-11
  • 12. Goodness-of-Fit Tests, Population Parameters Unknown Idea:  Test whether data follow a specified distribution (such as binomial, Poisson, or normal) . . .  . . . without assuming the parameters of the distribution are known  Use sample data to estimate the unknown population parameters Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-12 14.2
  • 13. Goodness-of-Fit Tests, Population Parameters Unknown  Suppose that a null hypothesis specifies category probabilities that depend on the estimation (from the data) of m unknown population parameters  The appropriate goodness-of-fit test is the same as in the previously section . . .  . . . except that the number of degrees of freedom for the chi-square random variable is  Where K is the number of categories Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall    K 1i i 2 ii2 E )E(O  1)m(KFreedomofDegrees  (continued) Ch. 14-13
  • 14. Test of Normality  The assumption that data follow a normal distribution is common in statistics  Normality was assessed in prior chapters (for example, with Normal probability plots in Chapter 5)  Here, a chi-square test is developed Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-14 14.3
  • 15. Test of Normality  Two population parameters can be estimated using sample data:  For a normal distribution, Skewness = 0 Kurtosis = 3 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall (continued) 3 n 1i 3 i ns )x(x Skewness    4 n 1i 4 i ns )x(x Kurtosis    Ch. 14-15
  • 16. Jarque-Bera Test for Normality  Consider the null hypothesis that the population distribution is normal  The Jarque-Bera Test for Normality is based on the closeness the sample skewness to 0 and the sample kurtosis to 3  The test statistic is  as the number of sample observations becomes very large, this statistic has a chi-square distribution with 2 degrees of freedom  The null hypothesis is rejected for large values of the test statistic Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall         24 3)(Kurtosis 6 (Skewness) nJB 22 Ch. 14-16
  • 17. Jarque-Bera Test for Normality  The chi-square approximation is close only for very large sample sizes  If the sample size is not very large, the Bowman- Shelton test statistic is compared to significance points from text Table 14.9 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall (continued) Sample size N 10% point 5% point Sample size N 10% point 5% point 20 30 40 50 75 100 125 150 2.13 2.49 2.70 2.90 3.09 3.14 3.31 3.43 3.26 3.71 3.99 4.26 4.27 4.29 4.34 4.39 200 250 300 400 500 800 ∞ 3.48 3.54 3.68 3.76 3.91 4.32 4.61 4.43 4.61 4.60 4.74 4.82 5.46 5.99 Ch. 14-17
  • 18. Example: Jarque-Bera Test for Normality  The average daily temperature has been recorded for 200 randomly selected days, with sample skewness 0.232 and kurtosis 3.319  Test the null hypothesis that the true distribution is normal  From Table 14.9 the 10% critical value for n = 200 is 3.48, so there is not sufficient evidence to reject that the population is normal Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall 2.642 24 3)(3.319 6 (0.232) 200 24 3)(Kurtosis 6 (Skewness) nJB 2222                Ch. 14-18
  • 19. Contingency Tables Contingency Tables  Used to classify sample observations according to a pair of attributes  Also called a cross-classification or cross- tabulation table  Assume r categories for attribute A and c categories for attribute B  Then there are (r x c) possible cross-classifications Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-19 14.3
  • 20. r x c Contingency Table Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Attribute B Attribute A 1 2 . . . C Totals 1 2 . . . r Totals O11 O21 . . . Or1 C1 O12 O22 . . . Or2 C2 … … … … … … … O1c O2c . . . Orc Cc R1 R2 . . . Rr n Ch. 14-20
  • 21. Test for Association  Consider n observations tabulated in an r x c contingency table  Denote by Oij the number of observations in the cell that is in the ith row and the jth column  The null hypothesis is  The appropriate test is a chi-square test with (r-1)(c-1) degrees of freedom Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall populationtheinattributestwothebetween existsnassociatioNo:H0 Ch. 14-21
  • 22. Test for Association  Let Ri and Cj be the row and column totals  The expected number of observations in cell row i and column j, given that H0 is true, is  A test of association at a significance level  is based on the chi-square distribution and the following decision rule Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall 2 1),1)c(r r 1i c 1j ij 2 ijij2 0 E )E(O ifHReject αχχ        n CR E ji ij  (continued) Ch. 14-22
  • 23. Contingency Table Example H0: There is no association between hand preference and gender H1: Hand preference is not independent of gender Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Left-Handed vs. Gender  Dominant Hand: Left vs. Right  Gender: Male vs. Female Ch. 14-23
  • 24. Contingency Table Example Sample results organized in a contingency table: Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Gender Hand Preference Left Right Female 12 108 120 Male 24 156 180 36 264 300 120 Females, 12 were left handed 180 Males, 24 were left handed sample size = n = 300: (continued) Ch. 14-24
  • 25. Logic of the Test  If H0 is true, then the proportion of left-handed females should be the same as the proportion of left-handed males  The two proportions above should be the same as the proportion of left-handed people overall Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall H0: There is no association between hand preference and gender H1: Hand preference is not independent of gender Ch. 14-25
  • 26. Finding Expected Frequencies Overall: P(Left Handed) = 36/300 = .12 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall 120 Females, 12 were left handed 180 Males, 24 were left handed If no association, then P(Left Handed | Female) = P(Left Handed | Male) = .12 So we would expect 12% of the 120 females and 12% of the 180 males to be left handed… i.e., we would expect (120)(.12) = 14.4 females to be left handed (180)(.12) = 21.6 males to be left handed Ch. 14-26
  • 27. Expected Cell Frequencies  Expected cell frequencies: Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall sizesampleTotal total)Columntotal)(jRow(i n CR E thth ji ij  14.4 300 (120)(36) E11  Example: (continued) Ch. 14-27
  • 28. Observed vs. Expected Frequencies Observed frequencies vs. expected frequencies: Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Gender Hand Preference Left Right Female Observed = 12 Expected = 14.4 Observed = 108 Expected = 105.6 120 Male Observed = 24 Expected = 21.6 Observed = 156 Expected = 158.4 180 36 264 300 Ch. 14-28
  • 29. The Chi-Square Test Statistic  where: Oij = observed frequency in cell (i, j) Eij = expected frequency in cell (i, j) r = number of rows c = number of columns Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall     r 1i c 1j ij 2 ijij2 E )E(O  The Chi-square test statistic is: )1c)(1r(.f.dwith  Ch. 14-29
  • 30. Observed vs. Expected Frequencies Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Gender Hand Preference Left Right Female Observed = 12 Expected = 14.4 Observed = 108 Expected = 105.6 120 Male Observed = 24 Expected = 21.6 Observed = 156 Expected = 158.4 180 36 264 300 6848.0 4.158 )4.158156( 6.21 )6.2124( 6.105 )6.105108( 4.14 )4.1412( 2222 2          Ch. 14-30
  • 31. Contingency Analysis Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall 2 2 .05 = 3.841 Reject H0  = 0.05 Decision Rule: If 2 > 3.841, reject H0, otherwise, do not reject H0 1(1)(1)1)-1)(c-(rd.f.with6848.02  Do not reject H0 Here, 2 = 0.6848 < 3.841, so we do not reject H0 and conclude that gender and hand preference are not associated Ch. 14-31
  • 32. Nonparametric Tests for Paired or Matched Samples  A sign test for paired or matched samples:  Calculate the differences of the paired observations  Discard the differences equal to 0, leaving n observations  Record the sign of the difference as + or –  For a symmetric distribution, the signs are random and + and – are equally likely Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-32 14.4
  • 33. Sign Test  Define + to be a “success” and let P = the true proportion of +’s in the population  The sign test is used for the hypothesis test  The test-statistic S for the sign test is S = the number of pairs with a positive difference  S has a binomial distribution with P = 0.5 and n = the number of nonzero differences (continued) 0.5P:H0  Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-33
  • 34. Determining the p-value  The p-value for a Sign Test is found using the binomial distribution with n = number of nonzero differences, S = number of positive differences, and P = 0.5  For an upper-tail test, H1: P > 0.5, p-value = P(x  S)  For a lower-tail test, H1: P < 0.5, p-value = P(x  S)  For a two-tail test, H1: P  0.5, 2(p-value) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-34
  • 35. Sign Test Example  Ten consumers in a focus group have rated the attractiveness of two package designs for a new product Consumer Rating Difference Sign of Difference Package 1 Package 2 Rating 1 – 2 1 2 3 4 5 6 7 8 9 10 5 4 4 6 3 5 7 5 6 7 8 8 4 5 9 9 6 9 3 9 -3 -4 0 +1 -6 -4 -1 -4 +3 -2 – – 0 + – – – – + – Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-35
  • 36. Sign Test Example  Test the hypothesis that there is no overall package preference using  = 0.10 The proportion of consumers who prefer package 1 is the same as the proportion preferring package 2 A majority prefer package 2  The test-statistic S for the sign test is S = the number of pairs with a positive difference = 2  S has a binomial distribution with P = 0.5 and n = 9 (there was one zero difference) (continued) 0.5P:H0  0.5P:H1  Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-36
  • 37. Sign Test Example  The p-value for this sign test is found using the binomial distribution with n = 9, S = 2, and P = 0.5:  For a lower-tail test, p-value = P(x  2|n=9, P=0.5) = 0.090 Since 0.090 <  = 0.10 we reject the null hypothesis and conclude that consumers prefer package 2 (continued) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-37
  • 38. Wilcoxon Signed Rank Test for Paired or Matched Samples  Uses matched pairs of random observations  Still based on ranks  Incorporates information about the magnitude of the differences  Tests the hypothesis that the distribution of differences is centered at zero  The population of paired differences is assumed to be symmetric Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-38
  • 39. Wilcoxon Signed Rank Test for Paired or Matched Samples Conducting the test:  Discard pairs for which the difference is 0  Rank the remaining n absolute differences in ascending order (ties are assigned the average of their ranks)  Find the sums of the positive ranks and the negative ranks  The smaller of these sums is the Wilcoxon Signed Rank Statistic T: T = min(T+ , T- ) Where T+ = the sum of the positive ranks T- = the sum of the negative ranks n = the number of nonzero differences  The null hypothesis is rejected if T is less than or equal to the value in Appendix Table 10 (continued) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-39
  • 40. Signed Rank Test Example T+ = 3 T– = 42 Consumer Rating Difference Package 1 Package 2 Diff (rank) Rank (+) Rank (–) 1 2 3 4 5 6 7 8 9 10 5 4 4 6 3 5 7 5 6 7 8 8 4 5 9 9 6 9 3 9 -3 (5) -4 (7 tie) 0 (-) +1 (2) -6 (9) -4 (7 tie) -1 (3) -4 (7 tie) +3 (1) -2 (4) 2 1 5 7 9 7 3 7 4  Ten consumers in a focus group have rated the attractiveness of two package designs for a new product Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-40
  • 41. Signed Rank Test Example Test the hypothesis that the distribution of paired differences is centered at zero, using  = 0.10 Conducting the test:  The smaller of T+ and T- is the Wilcoxon Signed Rank Statistic T: T = min(T+ , T- ) = 3  Use Appendix Table 10 with n = 9 to find the critical value: The null hypothesis is rejected if T ≤ 4  Since T = 3 < 4, we reject the null hypothesis (continued) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-41
  • 42. Normal Approximation to the Sign Test  If the number n of nonzero sample observations is large, then the sign test is based on the normal approximation to the binomial with mean and standard deviation  The test statistic is  Where S* is the test-statistic corrected for continuity:  For a two-tail test, S* = S + 0.5, if S < μ or S* = S – 0.5, if S > μ  For upper-tail test, S* = S – 0.5  For lower-tail test, S* = S + 0.5 n0.50.25nP)nP(1σ 0.5nnPμ   n0.5 0.5n*S σ μ*S Z     Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-42
  • 43. Normal Approximation to the Wilcoxon Signed Rank Test A normal approximation can be used when  Paired samples are observed  The sample size is large (n > 20)  The hypothesis test is that the population distribution of differences is centered at zero Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-43
  • 44. Wilcoxon Matched Pairs Test for Large Samples  The mean and standard deviation for Wilcoxon T : 4 1)n(n μE(T) T   24 1)1)(2n(n)(n σVar(T) 2 T   where n is the number of paired values Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-44
  • 45. Wilcoxon Matched Pairs Test for Large Samples  Normal approximation for the Wilcoxon T Statistic: (continued) 24 1)1)(2nn(n 4 1)n(n T σ μT z T T        If the alternative hypothesis is one-sided, reject the null hypothesis if  If the alternative hypothesis is two-sided, reject the null hypothesis if α T T z σ μT   α/2 T T z σ μT   Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-45
  • 46. Sign Test for Single Population Median  The sign test can be used to test that a single population median is equal to a specified value  For small samples, use the binomial distribution  For large samples, use the normal approximation Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-46
  • 47. Nonparametric Tests for Independent Random Samples Used to compare two samples from two populations Assumptions:  The two samples are independent and random  The value measured is a continuous variable  The two distributions are identical except for a possible difference in the central location  The sample size from each population is at least 10 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-47 14.5
  • 48. Mann-Whitney U-Test  Consider two samples  Pool the two samples (combine into a singe list) but keep track of which sample each value came from  rank the values in the combined list in ascending order  For ties, assign each the average rank of the tied values  sum the resulting rankings separately for each sample  If the sum of rankings from one sample differs enough from the sum of rankings from the other sample, we conclude there is a difference in the population medians Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-48
  • 49. Mann-Whitney U Statistic  Consider n1 observations from the first population and n2 observations from the second  Let R1 denote the sum of the ranks of the observations from the first population  The Mann-Whitney U statistic is 1 11 21 R 2 1)(nn nnU    Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-49
  • 50. Mann-Whitney U Statistic  The null hypothesis is that the medians of the two population distributions are the same  The Mann-Whitney U statistic has mean and variance  Then for large sample sizes (both at least 10), the distribution of the random variable is approximated by the normal distribution (continued) 2 nn μE(U) 21 U  12 1)n(nnn σVar(U) 21212 U   U U σ μU z   Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-50
  • 51. Decision Rules for Mann-Whitney Test The decision rule for the null hypothesis that the two populations have the same medians:  For a one-sided upper-tailed alternative hypothesis:  For a one-sided lower-tailed hypothesis:  For a two-sided alternative hypothesis: α U U 0 z σ μU zifHReject    α U U 0 z σ μU zifHReject    α/2 U U 0α/2 U U 0 z σ μU zifHRejectorz σ μU zifHReject      Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-51
  • 52. Mann-Whitney U-Test Example Claim: Median class size for Math is larger than the median class size for English A random sample of 10 Math and 10 English classes is selected (samples do not have to be of equal size) Rank the combined values and then determine rankings by original sample Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-52
  • 53. Mann-Whitney U-Test Example  Suppose the results are: Class size (Math, M) Class size (English, E) 23 45 34 78 34 66 62 95 81 99 30 47 18 34 44 61 54 28 40 96 (continued) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-53
  • 54. Mann-Whitney U-Test Example Size Rank 18 1 23 2 28 3 30 4 34 6 34 6 34 6 40 8 44 9 45 10 Size Rank 47 11 54 12 61 13 62 14 66 15 78 16 81 17 95 18 96 19 99 20 Ranking for combined samples tied (continued) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-54
  • 55. Mann-Whitney U-Test Example  Rank by original sample: Class size (Math, M) Rank Class size (English, E) Rank 23 45 34 78 34 66 62 95 81 99 2 10 6 16 6 15 14 18 17 20 30 47 18 34 44 61 54 28 40 96 4 11 1 6 9 13 12 3 8 19  = 124  = 86 (continued) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-55
  • 56. Mann-Whitney U-Test Example H0: MedianM ≤ MedianE (Math median is not greater than English median) HA: MedianM > MedianE (Math median is larger) Claim: Median class size for Math is larger than the median class size for English 31124 2 (10)(11) (10)(10)R 2 1)(nn nnU 1 11 21     (continued) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-56
  • 57. Mann-Whitney U-Test Example  The decision rule for this one-sided upper-tailed alternative hypothesis:  For  = 0.05, -z = -1.645  The calculated z value is not in the rejection region, so we conclude that there is not sufficient evidence of difference in class size medians (continued) 1.436 12 1)100(10)(10)(1 2 (10)(10) 31 12 1)n(nnn 2 nn U σ μU z 2121 21 U U          H0: MedianM ≤ MedianE HA: MedianM > MedianE α U U 0 z σ μU zifHReject    Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-57
  • 58. Wilcoxon Rank Sum Test  Similar to Mann-Whitney U test  Results will be the same for both tests Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-58
  • 59. Wilcoxon Rank Sum Test  n1 observations from the first population  n2 observations from the second population  Pool the samples and rank the observations in ascending order  Let T denote the sum of the ranks of the observations from the first population  (T in the Wilcoxon Rank Sum Test is the same as R1 in the Mann-Whitney U Test) (continued) Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-59
  • 60. Wilcoxon Rank Sum Test  The Wilcoxon Rank Sum Statistic, T, has mean  And variance  Then, for large samples (n1  10 and n2  10) the distribution of the random variable is approximated by the normal distribution (continued) 2 1)n(nn μE(T) 211 T   12 1)n(nnn σVar(T) 21212 T   T T σ μT Z   Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-60
  • 61. Wilcoxon Rank Sum Example  We wish to test  Use  = 0.05  Suppose two samples are obtained:  n1 = 40 , n2 = 50  When rankings are completed, the sum of ranks for sample 1 is R1 = 1475 = T  When rankings are completed, the sum of ranks for sample 2 is R2 = 2620 H0: Median1  Median2 H1: Median1 < Median2 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-61
  • 62. Wilcoxon Rank Sum Example  Using the normal approximation: (continued) 2.80 12 1)500(40)(50)(4 2 1)50(40)(40 1475 12 1)n(nnn 2 1)n(nn T σ μT z 2121 211 T T            Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-62
  • 63. Wilcoxon Rank Sum Example Since z = -2.80 < -1.645, we reject H0 and conclude that median 1 is less than median 2 at the 0.05 level of significance 645.1z  Reject H0  = .05 Do not reject H0 0 (continued) 2.80 σ μT z T T    H0: Median1  Median2 H1: Median1 < Median2 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-63
  • 64. Spearman Rank Correlation  Consider a random sample (x1 , y1), . . .,(xn, yn) of n pairs of observations  Rank xi and yi each in ascending order  Calculate the sample correlation of these ranks  The resulting coefficient is called Spearman’s Rank Correlation Coefficient.  If there are no tied ranks, an equivalent formula for computing this coefficient is where the di are the differences of the ranked pairs 1)n(n d6 1r 2 n 1i 2 i S    Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-64 14.6
  • 65. Spearman Rank Correlation  Consider the null hypothesis H0: no association in the population  To test against the alternative of positive association, the decision rule is  To test against the alternative of negative association, the decision rule is  To test against the two-sided alternative of some association, the decision rule is (continued) αS,S0 rrifHReject  αS,S0 rrifHReject  /2S,S/2S,S0 rrorrrifHReject αα  Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-65
  • 66. Chapter Summary Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall  Used the chi-square goodness-of-fit test to determine whether sample data match specified probabilities  Conducted goodness-of-fit tests when a population parameter was unknown  Tested for normality using the Jarque-Bera test  Used contingency tables to perform a chi-square test for association  Compared observed cell frequencies to expected cell frequencies Ch. 14-66
  • 67. Chapter Summary  Used the sign test for paired or matched samples, and the normal approximation for the sign test  Developed and applied the Wilcoxon signed rank test, and the large sample normal approximation  Developed and applied the Mann-Whitney U-test for two population medians  Used the Wilcoxon rank-sum test  Examined Spearman rank correlation for tests of association Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 14-67 (continued)