Lecture 7 Hypothesis Testing Two SamplePresentation Transcript
Testing Hypotheses: Two-Sample Tests Chapter 9 Statistics for Management Levin and Rubin
Hypotheses based on one group
A two-sample case involves comparing a parameter between two distinct groups, such as males versus females.
A two-sample hypothesis test might be to analyze the difference in the mean cholesterol level between men and women. Or, to determine if the percentage of job success is different between men and women.
Testing Hypothesis about Difference between Two Population Means (Two Sample Tests of Hypothesis)
Null Hypothesis: H 0 : (µ 1 - µ 2 )=D or µ 1 = µ 2
Alternate Hypothesis H 1 : (µ 1 - µ 2 )>D
(µ 1 - µ 2 )<D
[µ 1 > µ 2 µ 1 < µ 2 ] ..one-tailed test
Alternate Hypothesis H 1 : (µ 1 - µ 2 ) ≠D
[µ 1 ≠ µ 2 ] .. two-tailed test
(replace s when σ not available)
Rejection Region z > z α or z < -z α ( one tailed test)
z > z α/2 or z < -z α/2 ( two tailed test)
Mary Jo Fitzpatric is the Vice President for Nursing Services at St. Luke’s Memorial Hospital. Recently she noticed in the job posting for nurses that those that are unionised seem to offer higher wages. She decided to investigate and gathered the following sample information. Would it be reasonable for her to conclude that there is significant difference in earning between union and non-union nurses? Use the .01 significance level. 45 $1.90 $19.80 Nonunion 40 $2.25 $20.75 Union Sample Size Sample Standard Deviation Mean Wage Group
A manpower-development statistician is asked to determine whether the hourly wages of semiskilled workers are the same in two cities. The results of the survey are presented in the following table:
Hourly Wage Rate
This is a two-tailed test. The hypothesis is stated below. The significance level is 0.05 (given)
H 0 : μ 1 = μ 2 versus H 1 : μ 1 ≠ μ 2
Since this is a test of the means and the degrees of freedom (n 1 + n 2 - 2) is in excess of 30, a z test is appropriate. The critical values are ± 1.96 (from a z table).
Step 4: Sketch the distribution, locate the critical values and the test statistic. Step 5: Decide! Since the test statistic values lies within the rejection region then there is sufficient statistical evidence based on this sample to reject H 0 . The test for a difference between parameters does not have to be zero, it can be non-zero. For example: H 0 : μ 1 - μ 2 ≤ 0.10 versus H 1 : μ 1 - μ 2 > 0.10
Two research laboratories have independently produced drugs that provide relief to arthritis sufferers. The first drug was tested on a group of 90 arthritis sufferers and produced an average of 8.5 hours of relief, and a sample standard deviation of 1.8 hours. The second drug was tested on 80 arthritis sufferers, producing an average of 7.9 hours of relief, and a sample standard deviation of 2.1 hours. At the 0.05 level of significance, does the second drug provide a significantly shorter period of relief?
Notwithstanding the Equal Pay Act of 1963, in 1993 it still appeared that men earned more than women in similar jobs. A random sample of 38 male machinetool operators found a mean hourly wage of $11.38, and the sample standard deviation was $1.84. A random sample of 45 female machine-tool operators found their mean wage to be $8.42, and the sample standard deviation was $1.31. On the basis of these samples, is it reasonable to conclude (at a = 0.01) that the male operators are earning over $2.00 more per hour than the female operators?
Testing Hypothesis about Difference between Two Population Proportion
Null Hypothesis: H 0 : (p 1 - p 2 )=D or p 1 = p 2
Alternate Hypothesis H 1 : (p 1 - p 2 )>D (p 1 - p 2 )<D
[p 1 > p 2 p 1 < p 2 ] ..one-tailed test
Alternate Hypothesis H 1 : (p 1 - p 2 ) ≠D
[p 1 ≠ p 2 ] .. two-tailed test
3. Test Statistics:
Rejection Region z > z α or z < -z α ( one tailed test) z > z α/2 or z < -z α/2 ( two tailed test)
Testing Hypothesis about Difference between Two Population Proportion: Pooled
According to a report by the American Cancer Society, more men than women smoke and twice as many smokers die prematurely than nonsmokers. In random samples of 200 males and 200 females, 62 of the males and 54 of the females were smokers. Is there sufficient evidence to conclude that the proportion of male smokers higher from the proportion of female smokers when α = .01?
A financial analyst wants to compare the turnover rates, in percent, for shares of oil related stocks versus other stocks. She selected 32 oil-related stocks and 49 other stocks. The mean turnover of oil related stocks is 31.4 percent and the standard deviation 5.1 percent. For the other stocks, the mean rate was computed to be 34.9 percent and the standard deviation 6.7 percent. Is there a significant difference in the turnover rates of the two types of stock?
A coal-fired power plant is considering two different systems for pollution abatement. The first system has reduced the emission of pollutants to acceptable levels 68 percent of the time, as determined from 200 air samples. The second, more expensive system has reduced the emission of pollutants to acceptable levels 76 percent of the time, as determined from 250 air samples. If the expensive system is significantly more effective than the inexpensive system in reducing pollutants to acceptable levels, then the management of the power plant will install the expensive system. Which system will be installed if management uses a significance level of 0.02 in making its decision?
A group of clinical physicians is performing tests on patients to determine the effectiveness of a new antihypertensive drug. Patients with high blood pressure were randomly chosen and then randomly assigned to either the control group (which received a well-established antihypertensive) or the treatment group (which received the new drug). The doctors noted the percentage of patients whose blood pressure was reduced to a normal level within 1 year. At the 0.01 level of significance, test appropriate hypotheses to determine whether the new drug is significantly more effective than the older drug in reducing high blood pressure.
150 0.36 Control 120 0.45 Treatment Number of Patients Proportion That Improved Group
Test for difference between Means: small sample size
For small samples sizes, we must estimate a 'pooled' estimate (a.k.a. a weighted average) of the variances for the two groups. This estimate is:
and then, the estimated standard error is:
Tests for Difference Between Two Means: Small Sample Size
A company wishes to test when the sensitivity achieved by a new program is significantly higher than achieved under the legacy program. The following information is available from test results.
15 19 84 P.M.O. 12 15 92 Proposed Sample Size Standard Deviation Mean
Solution Step 1: This is a one-tailed test. The hypothesis is stated below. The significance level is 0.05 (given) H 0 : μ 1 ≤ μ 2 versus H 1 : μ 1 > μ 2 Step 2: Since this is a test of the means and neither n 1 or n 2 is in excess of 30, a t test is appropriate. The critical value is 1.708 (from a t table with 25 degrees of freedom). Step 3:
Step 4: Sketch the distribution, locate the critical values and the test statistic. Step 5: Decide! Since the test statistic values lies within the retention region then there is no sufficient statistical evidence based on this sample to reject H 0 .
A credit-insurance organization has developed a new high-tech method of training new sales personnel. The company sampled 16 employees who were trained the original way and found average daily sales to be $688 and the sample standard deviation was $32.63. They also sampled 11 employees who were trained using the new method and found average daily sales to be $706 and the sample standard deviation was $24.84. At a = 0.05, can the company conclude that average daily sales have increased under the new plan?
A large stock-brokerage firm wants to determine how successful its new account executives have been at recruiting clients. After completing their training, new account execs spend several weeks calling prospective clients, trying to get the prospects to open accounts with the firm. The following data give the numbers of new accounts opened in their first 2 weeks by 10 randomly chosen female account execs and by 8 randomly chosen male account execs. At a = 0.05, does it appear that the women are more effective at generating new accounts than the men are?
Paired Difference Test: Dependent Samples
Jeff Richardson, the receiving clerk for a chemical-products distributor, is faced with the continuing problem of broken glassware, including testtubes, petri dishes, and flasks. Jeff has determined some additional shipping precautions that can be undertaken to prevent breakage, and he has asked the Purchasing Director to inform the suppliers of the new measures. Data for 8 suppliers are given below in terms of average number of broken items per shipment. Do the data indicate, at a = 0.05, that the new measures have lowered the average number of broken items?
Aquarius Health Club has been advertising a rigorous program for body conditioning. The club claims that after 1 month in the program, the average participant should be able to do eight more push-ups in 2 minutes than he or she could do at the start. Does the random sample of 10 program participants given below support the club's claim? Use the 0.025 level of significance.
Another way to look at hypothesis testing (more common)
In typical hypothesis testing, the analyst does not go through the procedures manually. Instead, they use a computer and a statistical analysis software package (such as SAS, SPSS, Minitab, etc.)
Instead of drawing the distribution and determining the location of the test statistic value (within the rejection or retention region), the computer calculates the area under the normal curve in excess of the test statistic value. This area is called the probability value or p-value
So, you do not have to determine the location of the test statistic to make your decision on H 0 . You can simply look at the p-value and decide.
Saves time since all computer statistics packages compute the p-value, and
Not all people share your personal level of risk assessment. That is, you might use an alpha value of 0.05 but someone else might use a higher/lower value. You might reject H 0 using α = 0.05 but someone else might retain H 0 at a different level of risk.