Hypothesis Testing

3,937 views

Published on

Published in: Education, Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
3,937
On SlideShare
0
From Embeds
0
Number of Embeds
97
Actions
Shares
0
Downloads
201
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Hypothesis Testing

  1. 1. Normal Distribution and Hypothesis Testing STR1K
  2. 3. Characteristics <ul><li>Bell-shaped , depends on standard deviation </li></ul><ul><li>Continuous distribution </li></ul><ul><li>Unimodal </li></ul><ul><li>Symmetric about the vertical axis through the mean μ </li></ul><ul><li>Approaches the horizontal axis asymptotically </li></ul><ul><li>Total area under the curve and above the horizontal is 1 </li></ul>
  3. 4. Characteristics <ul><li>Approximately 68% of observations fall within 1 σ from the mean </li></ul><ul><li>Approximately 95% of observations fall within 2 σ from the mean </li></ul><ul><li>Approximately 99.7% of observations fall within 3 σ from the mean </li></ul>
  4. 5. 68.27% 95.45 % 99.73%
  5. 6. Standard Normal Distribution <ul><li>Special type of normal distribution where μ =0 </li></ul><ul><li>Used to avoid integral calculus to find the area under the curve </li></ul><ul><li>Standardizes raw data </li></ul><ul><li>Dimensionless Z-score </li></ul>Z = X - μ 0 σ
  6. 7. <ul><li>Example 1 </li></ul><ul><li>Given the normal distribution with μ = 49 and σ = 8, find the probability that X assumes a value: </li></ul><ul><li>Less than 45 </li></ul><ul><li>More than 50 </li></ul>
  7. 8. Example 2 The achievement sores for a college entrance examination are normally distributed with the mean 75 and standard deviation equal to 10. What fraction of the scores would one expect to lie between 70 and 90.
  8. 9. Sampling Distribution <ul><li>Distribution of all possible sample statistics </li></ul>Population All Possible Samples Sample Means 1, 2, 3, 4 1, 2, 3 2.00 1, 2, 4 2.33 3, 4, 1 2.67 2, 3, 4 3.00 μ = 2.5; σ = 1.18 n = 3 μ xbar = 2.5
  9. 10. Central Limit Theorem <ul><li>Given a distribution with a mean μ and variance σ² , the sampling distribution of the mean approaches a normal distribution  with a mean (μ) and a variance σ²/N as N, the sample size, increases.  </li></ul>
  10. 11. Characteristics <ul><li>The mean of the population and the mean of the sampling distribution of means will always have the same value .  </li></ul><ul><li>The sampling distribution of the mean will be normal  regardless of the shape of the population distribution. </li></ul>
  11. 12. N(70, 16)
  12. 13. N(70,1)
  13. 14. N(70,.25)
  14. 15. Characteristics <ul><li>As the sample size increases , the distribution of the sample average becomes less and less variable. </li></ul><ul><li>Hence the sample average X bar approaches the value of the population mean  μ .  </li></ul>
  15. 16. Example 3 An electrical firm manufactures light bulbs that have a length of life normally distributed with mean and standard deviation equal to 500 and 50 hours respectively. Find the probability that a random sample of 15 bulbs will have an average life ofless than 475 hours.
  16. 17. HYPOTHESIS TESTING <ul><li>Normal Distribution and Hypothesis Testing </li></ul>
  17. 18. Hypothesis Testing <ul><li>A hypothesis is a conjecture or assertion about a parameter </li></ul><ul><li>Null v. Alternative hypothesis </li></ul><ul><ul><li>Proof by contradiction </li></ul></ul><ul><ul><li>Null hypothesis is the hypothesis being tested </li></ul></ul><ul><ul><li>Alternative hypothesis is the operational statement of the experiment that is believed to be true </li></ul></ul>
  18. 19. One-tailed test <ul><ul><li>Alternative hypothesis specifies a one-directional difference for parameter </li></ul></ul><ul><ul><li>H 0 : μ = 10 v. H a : μ < 10 </li></ul></ul><ul><ul><li>H 0 : μ = 10 v. H a : μ > 10 </li></ul></ul><ul><ul><li>H 0 : μ 1 - μ 2 = 0 v. H a : μ 1 - μ 2 > 0 </li></ul></ul><ul><ul><li>H 0 : μ 1 - μ 2 = 0 v. H a : μ 1 - μ 2 < 0 </li></ul></ul>
  19. 20. Two-tailed test <ul><ul><li>Alternative hypothesis does not specify a directional difference for the parameter of interest </li></ul></ul><ul><ul><li>H 0 : μ = 10 v. H a : μ ≠ 10 </li></ul></ul><ul><ul><li>H 0 : μ 1 - μ 2 = 0 v. H a : μ 1 - μ 2 ≠ 0 </li></ul></ul>
  20. 21. Critical Region <ul><li>Also known as the “rejection region” </li></ul><ul><li>Critical region contains values of the test statistic for which the null hypothesis will be rejected </li></ul><ul><li>Acceptance and rejection regions are separated by the critical value, Z . </li></ul>
  21. 22. Type I error <ul><li>Error made by rejecting the null hypothesis when it is true . </li></ul><ul><li>False positive </li></ul><ul><li>Denoted by the level of significance, α </li></ul><ul><li>Level of significance suggests the highest probability of committing a type I error </li></ul>
  22. 23. Type II error <ul><li>Error made by not rejecting (accepting) the null hypothesis when it is false . </li></ul><ul><li>False negative </li></ul><ul><li>Probability denoted by β </li></ul>
  23. 25. Notes on errors <ul><li>Type I ( α ) and type II errors ( β ) are related . A decrease in the probability of one, increases the probability in the other. </li></ul><ul><li>As α increases , the size of the critical region also increases </li></ul><ul><li>Consequently, if H 0 is rejected at a low α , H 0 will also be rejected at a higher α . </li></ul>
  24. 27. Testing a Hypothesis on the Population Mean Z = X - μ 0 σ /√n t = X - μ 0 S /√n υ = n - 1 H 0 Test Statistic H a Critical Region σ known μ = μ 0 μ < μ 0 μ > μ 0 μ ≠ μ 0 z < -z α z > z α |z| > z α /2 σ unknown μ = μ 0 μ < μ 0 μ > μ 0 μ ≠ μ 0 t < -t α t > t α |t| > t α /2
  25. 28. critical value test statistic Reject H 0
  26. 29. critical value test statistic Do not reject H 0
  27. 30. Example 4 It is claimed that an automobile is driven on the average of less than 25,000 km per year. To test this claim, a random sample of 100 automobile owners are asked to keep a record of the kilometers they travel. Would you agree with this claim if the random sample showed an average of 23,500 km and a standard deviation of 3,900 km? Use 0.01 level of significance.

×