Frequency Tables - Statistics


Published on

  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Frequency Tables - Statistics

  1. 1. Frequency Tables <ul><li>Ms. Carter </li></ul><ul><li>Leaving Certificate </li></ul>
  2. 2. Tally Marks <ul><li>Tally marks are a way to count things in groups of five. </li></ul><ul><li>You draw four vertical lines and on the fifth you draw a diagonal line across the previous four. </li></ul>
  3. 3. Example: Counting Apples
  4. 4. Tally Marks
  5. 5. You tally now!
  6. 6. Frequency Table
  7. 7. What type of jobs use frequency tables?
  8. 8. EXAMPLE
  9. 9. The marks awarded for an assignment set for a Year 8 class of 20 students were as follows:      6     7     5     7     7     8     7     6     9     7      4     10   6     8     8     9     5     6     4     8 Present this information in a frequency table. Step 1 Step 2 Step 3 Now you try! Page 17 Question 1!
  10. 10. Solutions to Exercise 1.3 Result 3 Heads 2 Heads 1 Head 0 Heads Tally Frequency 3 11 9 2
  11. 12. Stem-and-Leaf Diagrams and Stemplots A S & L diagram represents data by seperating each value into two parts: the stem (usually the leftmost digit) and the leaf. S & L diagrams represent data in a similiar way to bar charts. In stem-and-leaf plots, numeric data is shown by using the actual numerals. Stem-and-leaf plots are especially useful when you have a lot of data that has a wide range.
  12. 13. To make a Stemplot, follow these steps: The following stem-and-leaf plot shows the record of wins for the Eastern Conference NBA teams:
  13. 14. Back-to-Back S & L This S & L Diagram compares two data sets.
  14. 15. Histograms H. break the range of values of a variable into classes and display only the count or per cent of the observations that fall into each class. H. are used to represent CONTINUOUS NUMERICAL DATA!
  15. 16. The following frequency table shows the times, in minutes, spent by a group of woman in a boutique. Draw a histogram of the distribution. Time 0-10 10-20 20-30 30-40 40-50 Numbe r 1 4 8 7 9
  16. 17. Distribution of Data
  17. 18. Symmetric Distribution If the values smaller and larger than its midpoint are mirror images of each other
  18. 19. Negatively Skewered Distribution Also known as a skewered left distribution.
  19. 20. Positively Skewered Distribution Also known as a skewered right distribution.
  20. 21. What are Variables? Variables are things that we measure, control, or manipulate in research. They differ in many respects, most notably in the role they are given in our research and in the type of measures that can be applied to them. Variables are things that we measure, control, or manipulate in research. They differ in many respects, most notably in the role they are given in our research and in the type of measures that can be applied to them.
  21. 22. Looking for Links <ul><li>Correlations and Associations </li></ul>
  22. 23. National Institutes of Health (NIH) <ul><li>Sedentary activities (like Tv watching) are associated with an increase in obesity and an increase in the risk of diabetes in women. </li></ul><ul><li>Anger expression may be inversely related to the risk of heart attack and stroke. (Those who express anger may have a decreased risk). </li></ul><ul><li>Light-t-moderate drinking reduces the risk of heart disease in men. </li></ul>
  23. 24. News Reporters love to tell stories about the latest links! Such as.. Does having her first baby later in life cause a woman to live longer? (New York Times) Do we believe this or much of anything anymore?
  24. 25. ‘ Count Cricket Chirps to Gauge Temperature’ ( Garden Gate ) ) What you have to do! 1. find a cricket 2. count the number of times it chirps in 15 seconds 3. add 40 You’ve just predicted the temp. in degrees Fahrenfeit!
  25. 27. Table 18-1 Cricket Chirps and Temperature Data (Excerpt) No. of Chirps in 15 sec Temperature (in degrees Fahrenheit) 18 57 20 60 21 64 23 65 27 68 30 71 34 74 39 77
  26. 28. To make a Stemplot, follow these steps: The following stem-and-leaf plot shows the record of wins for the Eastern Conference NBA teams:
  27. 29. Bivariate Data
  28. 30. <ul><li>Two variables </li></ul><ul><li>Tied or paired together </li></ul><ul><li>Two - dimensional data </li></ul><ul><li>Bivariate Data </li></ul><ul><li>Deals with causes or relationships </li></ul><ul><li>The major purpose of bivariate analysis is to determine whether relationships exist. </li></ul>Each observation is composed of..
  29. 31. Lets see another example!
  30. 32. A Press Release by Ohio State University Medical Center The headline says that... “ aspirin can prevent polyps in colon cancer patients”
  31. 33. Raw Data for this Study <ul><ul><li>ID NO. 22292 GROUP=ASPIRIN DEVELOPED POLYPS=NO </li></ul></ul><ul><ul><li>(635 LINES) </li></ul></ul>Table 18-2 Summary of Aspirin v’s Polyps Study Results * total sample size = 635 (approx were half randomly assigned to each person) Group % Developing Polyps* Aspirin 17 Non-aspirin 27
  32. 34. Scatter Plots <ul><li>Bivariate Numerical Data </li></ul><ul><li>Two Dimensions </li></ul><ul><li>Horizontal dimension (x-axis) </li></ul><ul><li>Vertical dimension (y-axis) </li></ul>
  33. 35. Scatter Plot of cricket chirps versus outdoor temperature.
  34. 36. Interpreting a Scatterplot <ul><li>you do this by looking for trends in the data as you go from left to right. </li></ul>
  35. 37. <ul><li>Positive linear relationship </li></ul><ul><li>Proportional relationship </li></ul><ul><li>As x increases (moves right one unit), y increases (moves up) a certain amount. </li></ul>
  36. 38. <ul><li>Negative linear relationship </li></ul><ul><li>Inverse relationship </li></ul><ul><li>As x increases, y decreases (moves down) a certain amount. </li></ul>
  37. 39. <ul><li>If the data don’t seem to resemble any kind of line (even a vague one) this means that no linear relationship exists. </li></ul>
  38. 40. Positive Linear Relationship as the cricket chirps increase so does the temperature aswell.
  39. 41. Q. 3 Page 6 AQA GSCE Age of Car Value of Car (£)
  40. 42. Q. 4
  41. 43. Q 5. Point A B C D E F G H I J Days abs ent 1 2 3 4 5 6 7 8 9 10 No. of peop p 30 50 36 8 36 28 16 58 34 42
  42. 44. Q. 6
  43. 45. Q. 7
  44. 46. Quantifying the Relationship <ul><li>Quantify or measure the extent and nature of the relationship. </li></ul>
  45. 47. We have already seen how to measure the direction of a linear relationship BUT you will also have to decide on the STRENGTH of the relationsbip!! Introduce the...
  46. 48. Correlation Coefficient <ul><li>Measures the strength and direction of the linear relationship between x and y (or the vertical and horizontal dimension). </li></ul>
  47. 49. Calculating the C.C. <ul><li>It is represented by the letter r </li></ul><ul><li>It has a value between - 1 and 1 </li></ul><ul><li>You only have to be able to calculate it using your calculator-luckily for you! </li></ul>
  48. 50. <ul><li>If r is close to 1, then there is a strong positive correlation between two sets of data. </li></ul><ul><li>If r is close to -1, we say there is a strong negative correlation between the two sets. </li></ul><ul><li>If r is close to 0, then there is no correlation between the two sets. </li></ul><ul><li>Most statisticians like to see correlations above = 0.6 or below - 0.6. </li></ul>
  49. 51. Types of Correlation
  50. 52. It is important you state the Direction and the Strength of a Correlation Correlation Coefficient = 0.99 Correlation coefficient = 0.5
  51. 53. A positive correlation means that high values of one variable are associated with high values of a second variable . The relationship between height and weight, between IQ scores and achievement test scores, and between self-concept and grades are examples of positive correlation.
  52. 54. Correlation Coefficient = - 0.99 Correlation Coefficient = - 0.5
  53. 55. A negative correlation or relationship means that high values of one variable are associated with low values of a second variable. Examples of negative correlations include those between exercise and heart failure, between successful test performance and feelings of incompetence, and between absence from school and school achievement.
  54. 56. No CORRELATION Correlation Coefficient = -.16
  55. 57. 8.2 Scatter Plots Aus. Book Q. 3
  56. 58. Q. 4
  57. 59. Q. 7
  58. 60. Using your calculator to calculate the C.C!
  59. 61. Correlation Coefficient Before doing this on the calculator, the class should do a scatter graph using the data in the table. Discuss the relationship between the data (i.e. gms of fat v calories). Project Maths Development Team © 2008 Total Fat (g) Total Calories Hamburger 9 260 Cheeseburger 13 320 Quarter Pounder 21 420 Quarter Pounder with Cheese 30 530 Big Mac 31 560 Special 31 550 Special with Bacon 34 590 Crispy Chicken 25 500 Fish Fillet 28 560 Grilled Chicken 20 440 Grilled Chicken Light 5 300
  60. 63. SHARP EL-W531 Mode 1 1 2 (x,y) 5 DATA 12 (x,y) 24 DATA 21 (x,y) 24 DATA 15 (x,y) 25 DATA RCL r x y 2 2 12 21 21 21 15 5 5 24 40 40 40 25
  61. 64. Correlation Coefficient by Calculator This will show the table on the right on the screen. When all the data items are inputted, press the following: This will give a correlatrion coefficient of 0.9746 Compare this answer with your interpretation of the scatter graph. Using the Casio fx-83ES Project Maths Development Team © 2008 Now input all the data items into the X and Y columns. Press after each data item.
  62. 65. Scatter Plot of cricket chirps versus outdoor temperature.
  63. 66. Correlation of 0.98!
  64. 67. Correlation versus Causation
  65. 68. <ul><li>The amount of fuel burned by a car depends on the size of its engine, since bigger engines burn more petrol. We say there is a CASUAL RELATIONSHIP between the amount of petrol used and the size of the cars engine. </li></ul>
  66. 69. <ul><li>If two variables are found to be either associated or correlated, that doesn’t necessarily mean that a cause-and-effect relationship exists between the two variables. </li></ul><ul><li>If we find a statistical relationship between two variables, then we cannot always conclude that one of the variables is the cause of the other, i.e. correlation does not always imply causality. </li></ul>
  67. 70. <ul><li>During 1980 and 2000 there was a large increase in sales of calculators and computers! </li></ul><ul><li>There was a strong positive correlation between the sales of computers and the sales of calculators! </li></ul>For Example.. Did the increase of sales of calculators cause an increase in the sale of computers??
  68. 71. NO!!!! Production Costs Decreased Cost of Production was a third variable causing the other two to increase. We call this third variable a LURKING VARIABLE.
  69. 72. Linear Regression <ul><li>Line of Best Fit </li></ul>
  70. 73. <ul><li>After you’ve found a relationship between two variables </li></ul><ul><li>and you have some way of quantifying this relationship, you can create a model that allows you to use one variabe to predict another. </li></ul>
  71. 74. <ul><li>1. Draw a Scatter Plot. </li></ul><ul><li>2. If graph suggests a linear relationship.. </li></ul><ul><li>3. Calculate Correlation Coefficient. </li></ul><ul><li>4. Find the equation of the Line that best fits the data. </li></ul><ul><ul><li>- We draw this by eye, and then find its equation. </li></ul></ul>
  72. 75. Because you have a strong correlation be it positive or negative you know that x is correlated with y. If you know the slope and the y-intercept of that line, then you can plug in a value for x and predict the average value for y. In other words, you can predict y from x. You should never do a regression analysis unless you’ve already found a strong correlation (either pos. or neg.) between the two variables!
  73. 76. Drawing by Eye <ul><li>Draw the line so there are roughly equal points above and below the line. </li></ul>
  74. 77. Drawing by Eye Two <ul><li>Draw a vertical line that splits the points up into two equal sized groups.  If there are an odd number of points (for instance 5), just split the groups slightly unevenly (3 in one, 2 in the other). </li></ul><ul><li>Find the middle of each group in the horizontal direction. </li></ul><ul><li>Find the middle of each group in the vertical direction. </li></ul>
  75. 78. <ul><li>Draw a cross or marker at the midpoint of each of the two groups.  The midpoint is the location found in steps 2 and 3. </li></ul><ul><li>Draw a line between these two midpoints. </li></ul>
  76. 79. Step 1
  77. 80. Step 2
  78. 81. Step 3
  79. 82. Step 4
  80. 83. Step 5
  81. 84. Now Calculate Line! i.e. what is the formula for the line? i.e. what is the formula for the line?
  82. 85. <ul><li>Equation: y = mx + c </li></ul><ul><li>M = slope y2-y2/x2-x1 where (x1,y1) and (x2,y2) are points on the line of best fit. </li></ul><ul><li>Substitute the m and one point into y-y1=m(x-x1). </li></ul>
  83. 86. Q. 3 AQA GSCE
  84. 87. Q. 5
  85. 88. Q. 6
  86. 89. Q. 9 Active Maths 0.98
  87. 90. Q. 10 0.90
  88. 91. Exercise 1.7 QUESTION 4 (ii)0.96 (iii) Strong Positive Correlation (v) y = 4x + 6 (vi) € 18, 000
  89. 93. Q. 5 <ul><li>(ii) 1.00 </li></ul><ul><li>(iv) Perfect positive correlation </li></ul><ul><li>(v) y = 11/7 (x-51/7) </li></ul>
  90. 95. Q. 6 <ul><li>(ii) 0.99 </li></ul><ul><li>(iii)Near - perfect positive correlation </li></ul><ul><li>(v) y = 2.5 x + 14 </li></ul><ul><li>(vi) 39 % </li></ul>
  91. 96. Let’s Sum up! <ul><li>Types of Sampling </li></ul><ul><li>Populations and Samples </li></ul><ul><li>Types of Sampling </li></ul><ul><li>Bias in Sampling </li></ul><ul><li>Reliability of Data </li></ul><ul><li>Collecting Data </li></ul>
  92. 97. <ul><li>Frequency Tables </li></ul><ul><li>Stem-and-Leaf Diagram </li></ul><ul><li>Back-toBack S & L </li></ul><ul><li>Histograms </li></ul><ul><li>Distribution of Data </li></ul>
  93. 98. <ul><li>Scatter Graph </li></ul><ul><ul><li>Correlation </li></ul></ul><ul><ul><li>Correlation Coefficent </li></ul></ul><ul><ul><li>Causality </li></ul></ul><ul><ul><li>Linear Regression </li></ul></ul>