Upcoming SlideShare
×

# 2 5 standard deviation

5,471 views

Published on

Published in: Technology
2 Likes
Statistics
Notes
• Full Name
Comment goes here.

Are you sure you want to Yes No
• Be the first to comment

Views
Total views
5,471
On SlideShare
0
From Embeds
0
Number of Embeds
48
Actions
Shares
0
66
0
Likes
2
Embeds 0
No embeds

No notes for slide

### 2 5 standard deviation

1. 1. Do Now, Calculate these values <ul><li>Customer waiting times, in seconds, for two banks </li></ul><ul><li>Mean = </li></ul><ul><li>Median = </li></ul><ul><li>Mode = </li></ul><ul><li>Midrange = </li></ul>Providence Jefferson 10 7.7 9.3 7.7 8.5 7.7 7.7 7.4 7.7 7.3 6.7 7.1 6.2 6.8 5.8 6.7 5.4 6.6 4.2 6.5
2. 2. Measures of Variation <ul><li>We need to measure the variation in such a way that </li></ul><ul><ul><li>Data in which values that are close together should have a small measurement </li></ul></ul><ul><ul><li>Values that are spread apart should have a high measurement </li></ul></ul><ul><li>Range </li></ul><ul><li>Standard deviation </li></ul><ul><li>Variance </li></ul>
3. 3. Range <ul><li>maximum - minimum </li></ul><ul><li>Jefferson: </li></ul><ul><li>Providence: </li></ul>
4. 4. Standard Deviation <ul><li>“Measurement of variation of values about their mean” </li></ul><ul><li>Standard deviation based on a sample : </li></ul><ul><ul><li>Abbreviated s </li></ul></ul><ul><li>Same units as the original values </li></ul><ul><li>Sample variance = s 2 </li></ul>
5. 5. Population Standard Deviation <ul><li>Standard deviation of the entire population: </li></ul><ul><ul><li>Abbreviated σ (lower-case sigma) </li></ul></ul><ul><li>Same units as the original values </li></ul><ul><li>Population variance = σ 2 </li></ul>
6. 6. Standard Deviation <ul><li>Interpretation </li></ul><ul><ul><li>“Average distance from the mean” </li></ul></ul><ul><ul><li>About 2/3 of the values should fall within 1 standard deviation of the mean </li></ul></ul><ul><li>Can be estimated with the Range rule of thumb : </li></ul>
7. 7. Calculating a Standard Deviation <ul><li>Find the mean </li></ul><ul><li>For each value </li></ul><ul><ul><li>Subtract the mean from the value </li></ul></ul><ul><ul><li>Square the difference </li></ul></ul><ul><li>Add the squares from step 2b. </li></ul><ul><li>Divide that sum buy one less than number of samples </li></ul><ul><li>Find the square root of the quotient </li></ul>
8. 8. Live Example: Jefferson Valley 7.7 7.7 7.7 6.5 7.4 7.3 7.1 6.8 6.7 6.6
9. 9. Your turn: Providence 6.2 5.8 5.4 4.2 6.7 7.7 10 9.3 8.5 7.7
10. 10. Rounding <ul><li>Round to one more decimal place that the original data. </li></ul><ul><ul><li>E.g., the mean of 14 15 9 16 and 12 would be 13.2 </li></ul></ul><ul><ul><li>E.g., the mean of 1.04, 2.17, and 3.79 would be 2.333 </li></ul></ul><ul><ul><li>Except for ‘well-know’ decimals </li></ul></ul><ul><ul><ul><li>0.25, 0.75, 0.5 </li></ul></ul></ul><ul><li>Round the standard deviation to one more place that the mean </li></ul>
11. 11. Homework <ul><li>Find the standard deviation of Super Bowl Points </li></ul><ul><li>Use the range rule of thumb to estimate the standard deviation of motor vehicle deaths and Sunspot number </li></ul>
12. 12. Uses Standard Deviation <ul><li>Interpretation </li></ul><ul><ul><li>“Average distance from the mean” </li></ul></ul><ul><ul><li>About 2/3 of the values should fall within 1 standard deviation of the mean </li></ul></ul><ul><li>Uses </li></ul><ul><ul><li>Identify outliers and determining what is “unusual” </li></ul></ul><ul><ul><li>Frequency Tables </li></ul></ul><ul><ul><li>Curving </li></ul></ul>
13. 13. Estimating Frequencies <ul><li>In a large sample of normally distributed data a specific percent of values should fall into frequency classes whose boundaries are based on the mean and standard deviation </li></ul>mean mean + s mean + 2s mean + 3s mean – 3s mean – 2s mean – s 34% 34% 17% 17% 2.4% 2.4% 0.1% 0.1% 68% 95% 99.7%
14. 14. Distribution Spread of Bank Data 0 (mean) +1s +2s + 3s -3s -2s -1s
15. 15. Outliers and Unusual Values <ul><li>5% is considered the boundary for being unusual </li></ul><ul><li>Values are considered outliers or unusual if they are beyond 2 standard deviations from the mean. </li></ul>mean mean + 2s mean – 2s 2.5% 2.5% 95%
16. 16. Outliers and Unusual Values <ul><li>Outliers and/or unusual values are usually investigated </li></ul><ul><ul><li>Different category </li></ul></ul><ul><ul><li>Bad data </li></ul></ul><ul><ul><li>“Bad day” </li></ul></ul><ul><li>Outliers and/or unusual values might be trimmed from the list </li></ul>
17. 17. Frequency Table <ul><li>We can use the mean and standard deviation as the boundary of our frequency classes </li></ul><ul><li>If the data is normally distributed </li></ul><ul><ul><li>68% of our samples should be within one standard deviation of the mean </li></ul></ul><ul><ul><li>95% should be within two standard deviations </li></ul></ul><ul><ul><li>99.7 should be within three standard deviations </li></ul></ul>
18. 18. Building a Frequency Table <ul><li>Create a table of six classes </li></ul><ul><li>Find the mean and standard deviation </li></ul><ul><li>Use the mean as the lower class fourth class </li></ul><ul><li>Add the standard deviation to the mean to get the LCL for the fifth classes, add it again for the LCL for the sixth the mean. </li></ul><ul><li>Subtract the standard deviation from the mean to get the LCL for the third class, subtract it again to get the LCL for the second class. The first class has no LCL </li></ul><ul><li>Fill in the UCLs; the sixth class has no UCL </li></ul><ul><li>Fill in the frequencies </li></ul>
19. 19. IQ Scores <ul><li>The mean and standard deviation for IQ scores is 100 and 15, respectively. </li></ul><ul><li>Create a frequency table for the following IQ scores: 100 111 122 95 83 95 109 93 102 110 86 98 120 108 117 94 101 105 104 72 </li></ul>
20. 20. IQ Frequency Table Frequency UCB LCB
21. 21. Using Mean and Std Dev to curve a tests <ul><li>25 55 59 59 63 71 71 74 80 80 80 83 84 84 87 88 95 95 100 100 </li></ul><ul><li>X-bar is 77, s is 18 </li></ul><ul><li>Use s divided by 3 for grade boundaries </li></ul>C+ B B+ A F D D+ C A+
22. 22. Your Turn <ul><li>Define distribution spread and a frequency table using the following values: 50 140 104 75 55 22 75 39 90 45 110 62 90 35 200 90 </li></ul><ul><li>Are there any “unusual values? </li></ul><ul><li>What is the “theoretical” percent of values that fall within 1 sigma? </li></ul><ul><li>What is the actual percent of values that fall within 1 sigma? </li></ul>
23. 23. IQ Frequency Table Frequency UCB LCB
24. 24. Homework <ul><li>The mean adult male height is 69 inches; the standard deviation is 3.5 </li></ul><ul><li>The mean adult female height is 64 inched; the standard deviation is 3 </li></ul><ul><li>On the next slide use these values to create a frequency table of your opinion poll data. </li></ul><ul><li>Identify outliers/unusual values </li></ul>
25. 25. Frequency Table: Male Frequency UCB LCB
26. 26. Frequency Table: Female Frequency UCB LCB