SlideShare a Scribd company logo
1 of 23
Chapter 15


  Association Between Variables
  Measured at the Interval-Ratio
  Level
Chapter Outline
   Interpreting the Correlation
    Coefficient: r 2
   The Correlation Matrix
   Testing Pearson’s r for Significance
   Interpreting Statistics: The Correlates
    of Crime
Scattergrams
   Scattergrams have two dimensions:
       The X (independent) variable is arrayed
        along the horizontal axis.
       The Y (dependent) variable is arrayed
        along the vertical axis.
Scattergrams
   Each dot on a scattergram is a case.
   The dot is placed at the intersection
    of the case’s scores on X and Y.
Scattergra ms
   Shows the relationship between %
    College Educated (X) and Voter Turnout
    (Y) on election day for the 50 states.
                                Turnout By % College
       73


       68


       63


       58


       53


       48


       43
            15   17   19   21    23      25       27   29   31   33   35
                                      % College
Scattergrams
   Horizontal X axis - % of population of a
    state with a college education.
      Scores range from 15.3% to 34.6%
       and increase from left to right.
                                 Turnout By % College
        73


        68


        63


        58


        53


        48


        43
             15   17   19   21    23      25       27   29   31   33   35
                                       % College
Scattergrams
   Vertical (Y) axis is voter turnout.
     Scores range from 44.1% to 70.4% and
      increase from bottom to top
                                Turnout By % College
       73


       68


       63


       58


       53


       48


       43
            15   17   19   21    23      25       27   29   31   33   35
                                      % College
Scattergrams: Regression Line
   A single straight line that comes as close as
    possible to all data points.
   Indicates strength and direction of the
    relationship.
                                 Turnout By % College
        73


        68


        63


        58


        53


        48


        43
             15   17   19   21    23      25       27   29   31   33   35
                                       % College
Scattergrams:
Strength of Regression Line
   The greater the extent to which dots are clustered
    around the regression line, the stronger the
    relationship.
   This relationship is weak to moderate in strength.

                                       Turnout By % College
              73


              68


              63


              58


              53


              48


              43
                   15   17   19   21    23      25       27   29   31   33   35
                                             % College
Scattergrams:
Direction of Regression Line
   Positive: regression line rises left to right.
   Negative: regression line falls left to right.
   This a positive relationship: As % college
    educated increases, turnout increases.
                                       Turnout By % College
              73


              68


              63


              58


              53


              48


              43
                   15   17   19   21    23      25       27   29   31   33   35
                                             % College
Scattergrams
   Inspection of the scattergram should
    always be the first step in assessing the
    correlation between two I-R variables
                                  Turnout By % College
         73


         68


         63


         58


         53


         48


         43
              15   17   19   21    23      25       27   29   31   33   35
                                        % College
The Regression Line: Formula
   This formula defines the regression line:
       Y = a + bX
       Where:
          Y = score on the dependent variable
          a = the Y intercept or the point where the
           regression line crosses the Y axis.
          b = the slope of the regression line or the
           amount of change produced in Y by a unit
           change in X
          X = score on the independent variable
Regression Analysis
   Before using the formula for the regression line, a
    and b must be calculated.
   Compute b first, using Formula 15.3 (we won’t do
    any calculation for this chapter)
Regression Analysis
   The Y intercept (a) is computed from
    Formula 15.4:
Regression Analysis
   For the relationship between % college
    educated and turnout:
       b (slope) = .42
       a (Y intercept)= 50.03
   Regression formula: Y = 50.03 + .42 X
   A slope of .42 means that turnout increases
    by .42 (less than half a percent) for every
    unit increase of 1 in % college educated.
   The Y intercept means that the regression
    line crosses the Y axis at Y = 50.03.
Predicting Y
   What turnout would be expected in a state
    where only 10% of the population was
    college educated?
   What turnout would be expected in a state
    where 70% of the population was college
    educated?
   This is a positive relationship so the value
    for Y increases as X increases:
       For X =10, Y = 50.3 +.42(10) = 54.5
       For X =70, Y = 50.3 + .42(70) = 79.7
Pearson correlation coefficient
   But of course, this is just an estimate of
    turnout based on % college educated, and
    many other factors also affect voter
    turnout.
   How much of the variation in voter turnout
    depends on % college educated? The
    relevant statististic is the coefficient of
    determination (r squared), but first we
    need to learn about Pearson’s correlation
    coefficient (r).
Pearson’s r
   Pearson’s r is a measure of association for I-R
    variables.
   It varies from -1.0 to +1.0
   Relationship may be positive (as X increases, Y
    increases) or negative (as X increases, Y decreases)
   For the relationship between % college educated and
    turnout, r =.32.
   The relationship is positive: as level of education
    increases, turnout increases.
   How strong is the relationship? For that we use R
    squared, but first, let’s look at the calculation process
Example of Computation
   The computation and interpretation of a, b,
    and Pearson’s r will be illustrated using
    Problem 15.1.
   The variables are:
       Voter turnout (Y)
       Average years of school (X)
   The sample is 5 cities.
       This is only to simplify computations, 5 is much
        too small a sample for serious research.
Example of Computation
                       The scores on each
 City    X     Y
                        variable are
  A     11.9   55
                        displayed in table
                        format:
  B     12.1   60          Y = Turnout
                           X = Years of
  C     12.7   65           Education

  D     12.8   68

  E     13.0   70
Example of Computation
                        Y2
                                           Sums are
  X      Y      X2
                                 XY         needed to
                                            compute b, a,
 11.9   55    141.61   3025    654.5        and Pearson’s
                                            r.
 12.1   60    146.41   3600     726

 12.7   65    161.29   4225    825.5

 12.8   68    163.84   4624    870.4

 13.0   70     169     4900     910

 62.5   318   782.15   20374   3986.4
Interpreting Pearson’s r
   An r of 0.98 indicates an extremely strong
    relationship between average years of
    education and voter turnout for these five
    cities.
   The coefficient of determination is r2 = .96.
    Knowing education level improves our
    prediction of voter turnout by 96%. This is
    a PRE measure (like lambda and gamma)
   We could also say that education explains
    96% of the variation in voter turnout.
Interpreting Pearson’s r
   Our first example provides a more
    realistic value for r.
       The r between turnout and % college
        educated for the 50 states was:
           r = .32
           This is a weak to moderate, positive
            relationship.
   The value of r2 is .10.
    Percent college educated explains
    10% of the variation in turnout.

More Related Content

What's hot

Math 1314 College Algebra Arithmetic Review
Math 1314 College Algebra Arithmetic ReviewMath 1314 College Algebra Arithmetic Review
Math 1314 College Algebra Arithmetic ReviewCCoffmanHowardCollege
 
Unit 4 lesson 2
Unit 4 lesson 2Unit 4 lesson 2
Unit 4 lesson 2VMRoberts
 
Simple linear regression (final)
Simple linear regression (final)Simple linear regression (final)
Simple linear regression (final)Harsh Upadhyay
 
regression and correlation
regression and correlationregression and correlation
regression and correlationPriya Sharma
 
Ajeesh e resource book
Ajeesh e resource bookAjeesh e resource book
Ajeesh e resource bookViji Vs
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and RegressionShubham Mehta
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysisRajat Sharma
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionAjendra7846
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionAzmi Mohd Tamil
 
Fractions Xiii
Fractions XiiiFractions Xiii
Fractions Xiiihiratufail
 
Simple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-StepSimple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-StepDan Wellisch
 
Mutiple linear regression project
Mutiple linear regression projectMutiple linear regression project
Mutiple linear regression projectJAPAN SHAH
 

What's hot (20)

Glm
GlmGlm
Glm
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Math 1314 College Algebra Arithmetic Review
Math 1314 College Algebra Arithmetic ReviewMath 1314 College Algebra Arithmetic Review
Math 1314 College Algebra Arithmetic Review
 
Scatter diagrams
Scatter diagramsScatter diagrams
Scatter diagrams
 
Unit 4 lesson 2
Unit 4 lesson 2Unit 4 lesson 2
Unit 4 lesson 2
 
Chapter 12
Chapter 12Chapter 12
Chapter 12
 
Simple linear regression (final)
Simple linear regression (final)Simple linear regression (final)
Simple linear regression (final)
 
regression and correlation
regression and correlationregression and correlation
regression and correlation
 
Ajeesh e resource book
Ajeesh e resource bookAjeesh e resource book
Ajeesh e resource book
 
Unit1 part2
Unit1 part2Unit1 part2
Unit1 part2
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Measure of mode in biostatic
Measure of mode in biostaticMeasure of mode in biostatic
Measure of mode in biostatic
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear Regression
 
Fractions Xiii
Fractions XiiiFractions Xiii
Fractions Xiii
 
Simple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-StepSimple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-Step
 
Mutiple linear regression project
Mutiple linear regression projectMutiple linear regression project
Mutiple linear regression project
 
The Suitcase Case
The Suitcase CaseThe Suitcase Case
The Suitcase Case
 

Viewers also liked

PDU 207 Basic Statistics: Variability
PDU 207 Basic Statistics: VariabilityPDU 207 Basic Statistics: Variability
PDU 207 Basic Statistics: VariabilityAgatha N. Ardhiati
 
150721 FINAL APA 2015 Poster (Scott, Najafi, & Hakim-Larson)
150721 FINAL APA 2015 Poster  (Scott, Najafi, & Hakim-Larson)150721 FINAL APA 2015 Poster  (Scott, Najafi, & Hakim-Larson)
150721 FINAL APA 2015 Poster (Scott, Najafi, & Hakim-Larson)Shawna Scott
 
Introduction to statistics 3
Introduction to statistics 3Introduction to statistics 3
Introduction to statistics 3David Balfour
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsmercy rani
 
Data analysis 1
Data analysis 1Data analysis 1
Data analysis 1Bùi Trâm
 
Measures of correlation (pearson's r correlation coefficient and spearman rho)
Measures of correlation (pearson's r correlation coefficient and spearman rho)Measures of correlation (pearson's r correlation coefficient and spearman rho)
Measures of correlation (pearson's r correlation coefficient and spearman rho)Jyl Matz
 
Why we run cronbach’s alpha
Why we run cronbach’s alphaWhy we run cronbach’s alpha
Why we run cronbach’s alphaAiden Yeh
 
Measures of Variability
Measures of VariabilityMeasures of Variability
Measures of Variabilityjasondroesch
 
Chapter 09
Chapter 09Chapter 09
Chapter 09bmcfad01
 
Anova and T-Test
Anova and T-TestAnova and T-Test
Anova and T-TestAD Sarwar
 
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...nszakir
 
Choosing the best measure of central tendency
Choosing the best measure of central tendencyChoosing the best measure of central tendency
Choosing the best measure of central tendencybujols
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionMohit Asija
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpointkellula
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.pptNursing Path
 

Viewers also liked (20)

reliability
reliabilityreliability
reliability
 
PDU 207 Basic Statistics: Variability
PDU 207 Basic Statistics: VariabilityPDU 207 Basic Statistics: Variability
PDU 207 Basic Statistics: Variability
 
150721 FINAL APA 2015 Poster (Scott, Najafi, & Hakim-Larson)
150721 FINAL APA 2015 Poster  (Scott, Najafi, & Hakim-Larson)150721 FINAL APA 2015 Poster  (Scott, Najafi, & Hakim-Larson)
150721 FINAL APA 2015 Poster (Scott, Najafi, & Hakim-Larson)
 
Introduction to statistics 3
Introduction to statistics 3Introduction to statistics 3
Introduction to statistics 3
 
Measuring relationships
Measuring relationshipsMeasuring relationships
Measuring relationships
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Data analysis 1
Data analysis 1Data analysis 1
Data analysis 1
 
Measures of correlation (pearson's r correlation coefficient and spearman rho)
Measures of correlation (pearson's r correlation coefficient and spearman rho)Measures of correlation (pearson's r correlation coefficient and spearman rho)
Measures of correlation (pearson's r correlation coefficient and spearman rho)
 
Why we run cronbach’s alpha
Why we run cronbach’s alphaWhy we run cronbach’s alpha
Why we run cronbach’s alpha
 
Measures of Variability
Measures of VariabilityMeasures of Variability
Measures of Variability
 
Chapter 09
Chapter 09Chapter 09
Chapter 09
 
Anova and T-Test
Anova and T-TestAnova and T-Test
Anova and T-Test
 
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
 
Choosing the best measure of central tendency
Choosing the best measure of central tendencyChoosing the best measure of central tendency
Choosing the best measure of central tendency
 
Pearson Correlation
Pearson CorrelationPearson Correlation
Pearson Correlation
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
P value
P valueP value
P value
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpoint
 
Chi square test
Chi square test Chi square test
Chi square test
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.ppt
 

Similar to Healey ch15b

Correlation and Regression ppt
Correlation and Regression pptCorrelation and Regression ppt
Correlation and Regression pptSantosh Bhaskar
 
Correlation AnalysisCorrelation AnalysisCorrelation meas.docx
Correlation AnalysisCorrelation AnalysisCorrelation meas.docxCorrelation AnalysisCorrelation AnalysisCorrelation meas.docx
Correlation AnalysisCorrelation AnalysisCorrelation meas.docxfaithxdunce63732
 
Topic 5 Covariance & Correlation.pptx
Topic 5  Covariance & Correlation.pptxTopic 5  Covariance & Correlation.pptx
Topic 5 Covariance & Correlation.pptxCallplanetsDeveloper
 
Topic 5 Covariance & Correlation.pptx
Topic 5  Covariance & Correlation.pptxTopic 5  Covariance & Correlation.pptx
Topic 5 Covariance & Correlation.pptxCallplanetsDeveloper
 
Lecture 13 Regression & Correlation.ppt
Lecture 13 Regression & Correlation.pptLecture 13 Regression & Correlation.ppt
Lecture 13 Regression & Correlation.pptshakirRahman10
 
Module 2_ Regression Models..pptx
Module 2_ Regression Models..pptxModule 2_ Regression Models..pptx
Module 2_ Regression Models..pptxnikshaikh786
 
Correlation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxCorrelation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxUnfold1
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regressionUnsa Shakir
 
Lesson 27 using statistical techniques in analyzing data
Lesson 27 using statistical techniques in analyzing dataLesson 27 using statistical techniques in analyzing data
Lesson 27 using statistical techniques in analyzing datamjlobetos
 
1Bivariate RegressionStraight Lines¾ Simple way to.docx
1Bivariate RegressionStraight Lines¾ Simple way to.docx1Bivariate RegressionStraight Lines¾ Simple way to.docx
1Bivariate RegressionStraight Lines¾ Simple way to.docxaulasnilda
 
CORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resourceCORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resourceSharon517605
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionAntony Raj
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxkrunal soni
 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxAnusuya123
 

Similar to Healey ch15b (20)

Correlation and Regression ppt
Correlation and Regression pptCorrelation and Regression ppt
Correlation and Regression ppt
 
Correlation AnalysisCorrelation AnalysisCorrelation meas.docx
Correlation AnalysisCorrelation AnalysisCorrelation meas.docxCorrelation AnalysisCorrelation AnalysisCorrelation meas.docx
Correlation AnalysisCorrelation AnalysisCorrelation meas.docx
 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
 
Topic 5 Covariance & Correlation.pptx
Topic 5  Covariance & Correlation.pptxTopic 5  Covariance & Correlation.pptx
Topic 5 Covariance & Correlation.pptx
 
Topic 5 Covariance & Correlation.pptx
Topic 5  Covariance & Correlation.pptxTopic 5  Covariance & Correlation.pptx
Topic 5 Covariance & Correlation.pptx
 
Lecture 13 Regression & Correlation.ppt
Lecture 13 Regression & Correlation.pptLecture 13 Regression & Correlation.ppt
Lecture 13 Regression & Correlation.ppt
 
Module 2_ Regression Models..pptx
Module 2_ Regression Models..pptxModule 2_ Regression Models..pptx
Module 2_ Regression Models..pptx
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxCorrelation and Regression Analysis.pptx
Correlation and Regression Analysis.pptx
 
Correlation
CorrelationCorrelation
Correlation
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regression
 
Correlation Analysis
Correlation AnalysisCorrelation Analysis
Correlation Analysis
 
regression
regressionregression
regression
 
chapter17.ppt
chapter17.pptchapter17.ppt
chapter17.ppt
 
Lesson 27 using statistical techniques in analyzing data
Lesson 27 using statistical techniques in analyzing dataLesson 27 using statistical techniques in analyzing data
Lesson 27 using statistical techniques in analyzing data
 
1Bivariate RegressionStraight Lines¾ Simple way to.docx
1Bivariate RegressionStraight Lines¾ Simple way to.docx1Bivariate RegressionStraight Lines¾ Simple way to.docx
1Bivariate RegressionStraight Lines¾ Simple way to.docx
 
CORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resourceCORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resource
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptx
 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptx
 

Healey ch15b

  • 1. Chapter 15 Association Between Variables Measured at the Interval-Ratio Level
  • 2. Chapter Outline  Interpreting the Correlation Coefficient: r 2  The Correlation Matrix  Testing Pearson’s r for Significance  Interpreting Statistics: The Correlates of Crime
  • 3. Scattergrams  Scattergrams have two dimensions:  The X (independent) variable is arrayed along the horizontal axis.  The Y (dependent) variable is arrayed along the vertical axis.
  • 4. Scattergrams  Each dot on a scattergram is a case.  The dot is placed at the intersection of the case’s scores on X and Y.
  • 5. Scattergra ms  Shows the relationship between % College Educated (X) and Voter Turnout (Y) on election day for the 50 states. Turnout By % College 73 68 63 58 53 48 43 15 17 19 21 23 25 27 29 31 33 35 % College
  • 6. Scattergrams  Horizontal X axis - % of population of a state with a college education.  Scores range from 15.3% to 34.6% and increase from left to right. Turnout By % College 73 68 63 58 53 48 43 15 17 19 21 23 25 27 29 31 33 35 % College
  • 7. Scattergrams  Vertical (Y) axis is voter turnout.  Scores range from 44.1% to 70.4% and increase from bottom to top Turnout By % College 73 68 63 58 53 48 43 15 17 19 21 23 25 27 29 31 33 35 % College
  • 8. Scattergrams: Regression Line  A single straight line that comes as close as possible to all data points.  Indicates strength and direction of the relationship. Turnout By % College 73 68 63 58 53 48 43 15 17 19 21 23 25 27 29 31 33 35 % College
  • 9. Scattergrams: Strength of Regression Line  The greater the extent to which dots are clustered around the regression line, the stronger the relationship.  This relationship is weak to moderate in strength. Turnout By % College 73 68 63 58 53 48 43 15 17 19 21 23 25 27 29 31 33 35 % College
  • 10. Scattergrams: Direction of Regression Line  Positive: regression line rises left to right.  Negative: regression line falls left to right.  This a positive relationship: As % college educated increases, turnout increases. Turnout By % College 73 68 63 58 53 48 43 15 17 19 21 23 25 27 29 31 33 35 % College
  • 11. Scattergrams  Inspection of the scattergram should always be the first step in assessing the correlation between two I-R variables Turnout By % College 73 68 63 58 53 48 43 15 17 19 21 23 25 27 29 31 33 35 % College
  • 12. The Regression Line: Formula  This formula defines the regression line:  Y = a + bX  Where:  Y = score on the dependent variable  a = the Y intercept or the point where the regression line crosses the Y axis.  b = the slope of the regression line or the amount of change produced in Y by a unit change in X  X = score on the independent variable
  • 13. Regression Analysis  Before using the formula for the regression line, a and b must be calculated.  Compute b first, using Formula 15.3 (we won’t do any calculation for this chapter)
  • 14. Regression Analysis  The Y intercept (a) is computed from Formula 15.4:
  • 15. Regression Analysis  For the relationship between % college educated and turnout:  b (slope) = .42  a (Y intercept)= 50.03  Regression formula: Y = 50.03 + .42 X  A slope of .42 means that turnout increases by .42 (less than half a percent) for every unit increase of 1 in % college educated.  The Y intercept means that the regression line crosses the Y axis at Y = 50.03.
  • 16. Predicting Y  What turnout would be expected in a state where only 10% of the population was college educated?  What turnout would be expected in a state where 70% of the population was college educated?  This is a positive relationship so the value for Y increases as X increases:  For X =10, Y = 50.3 +.42(10) = 54.5  For X =70, Y = 50.3 + .42(70) = 79.7
  • 17. Pearson correlation coefficient  But of course, this is just an estimate of turnout based on % college educated, and many other factors also affect voter turnout.  How much of the variation in voter turnout depends on % college educated? The relevant statististic is the coefficient of determination (r squared), but first we need to learn about Pearson’s correlation coefficient (r).
  • 18. Pearson’s r  Pearson’s r is a measure of association for I-R variables.  It varies from -1.0 to +1.0  Relationship may be positive (as X increases, Y increases) or negative (as X increases, Y decreases)  For the relationship between % college educated and turnout, r =.32.  The relationship is positive: as level of education increases, turnout increases.  How strong is the relationship? For that we use R squared, but first, let’s look at the calculation process
  • 19. Example of Computation  The computation and interpretation of a, b, and Pearson’s r will be illustrated using Problem 15.1.  The variables are:  Voter turnout (Y)  Average years of school (X)  The sample is 5 cities.  This is only to simplify computations, 5 is much too small a sample for serious research.
  • 20. Example of Computation  The scores on each City X Y variable are A 11.9 55 displayed in table format: B 12.1 60  Y = Turnout  X = Years of C 12.7 65 Education D 12.8 68 E 13.0 70
  • 21. Example of Computation Y2  Sums are X Y X2 XY needed to compute b, a, 11.9 55 141.61 3025 654.5 and Pearson’s r. 12.1 60 146.41 3600 726 12.7 65 161.29 4225 825.5 12.8 68 163.84 4624 870.4 13.0 70 169 4900 910 62.5 318 782.15 20374 3986.4
  • 22. Interpreting Pearson’s r  An r of 0.98 indicates an extremely strong relationship between average years of education and voter turnout for these five cities.  The coefficient of determination is r2 = .96. Knowing education level improves our prediction of voter turnout by 96%. This is a PRE measure (like lambda and gamma)  We could also say that education explains 96% of the variation in voter turnout.
  • 23. Interpreting Pearson’s r  Our first example provides a more realistic value for r.  The r between turnout and % college educated for the 50 states was:  r = .32  This is a weak to moderate, positive relationship.  The value of r2 is .10. Percent college educated explains 10% of the variation in turnout.