This chapter discusses various measures of association that can be used to study relationships between two or more variables. It introduces correlation analysis and regression analysis, which are used to measure the strength and direction of relationships between continuous variables. Nonparametric measures are also presented as alternatives when key assumptions of parametric techniques cannot be met. Examples are provided to illustrate concepts like correlation coefficients, lines of best fit, and testing goodness of fit. Key terms related to correlation, regression, and other measures of association are defined.
Learning Objectives
Understand . . .
• How correlation analysis may be applied to study relationships between two or more variables
• The uses, requirements, and interpretation of the product moment correlation coefficient
• How predictions are made with regression analysis using the method of least squares to minimize errors in drawing a line of best fit
Learning Objectives
Understand . . .
• How to test regression models for linearity and whether the equation is effective in fitting the data
• Nonparametric measures of association and the alternatives they offer when key assumptions and requirements for parametric techniques cannot be met
Invalid Assumptions
“The invalid assumption that correlation implies cause is probably among the two or three most serious and common errors of human reasoning.”
Stephen Jay Gould
paleontologist and science writer
Measures of Association: Interval/Ratio Data
Pearson correlation coefficient: for continuous, linearly related variables
Correlation ratio (eta): for nonlinear data, or for relating a main effect to a continuous dependent variable
Biserial: one continuous and one dichotomous variable with an underlying normal distribution
Partial correlation: three variables; relating two with the third’s effect taken out
Multiple correlation: three variables; relating one variable with two others
Bivariate linear regression: predicting one variable from another’s scores
Measures of Association: Ordinal Data
Gamma: based on concordant-discordant pairs; proportional reduction in error (PRE) interpretation
Kendall’s tau b: P-Q based; adjustment for tied ranks
Kendall’s tau c: P-Q based; adjustment for table dimensions
Somers’s d: P-Q based; asymmetrical extension of gamma
Spearman’s rho: product moment correlation for ranked data
Measures of Association: Nominal Data
Phi: chi-square based; for 2 x 2 tables
Cramer’s V: chi-square based; adjustment when one table dimension is greater than 2
Contingency coefficient C: chi-square based; flexible data and distribution assumptions
Lambda: PRE-based interpretation
Goodman & Kruskal’s tau: PRE-based, with emphasis on table marginals
Uncertainty coefficient: useful for multidimensional tables
Kappa: agreement measure
Researchers Search for Insights
Burke, one of the world’s leading research companies, claims researchers add the most value to a project when they look beyond the raw numbers to the shades of gray… what the data really mean.
Pearson’s Product Moment Correlation r
Is there a relationship between X and Y?
What is the magnitude of the relationship?
What is the direction of the relationship?
Interpretation of Correlations
• X causes Y
• Y causes X
• X and Y are activated by one or more other variables
• X and Y influence each other reciprocally
Testing Goodness of Fit
• Y is completely unrelated to X and no systematic pattern is evident
• There are constant values of Y for every value of X
• The data are related but represented by a nonlinear function
This chapter explains the use of several techniques including correlation analysis and regression analysis.
See the text’s Instructor’s Manual (downloadable from the text website) for ideas for using this research-generated statistic.
Exhibit 18-1 in the text presents a list of commonly used measures of association. These measures of association will be discussed further in this chapter. This slide shows those measures of association relevant for interval and ratio scaled data. The measures used for ordinal and nominal data are presented on the following slides.
Exhibit 18-1
This slide presents the commonly used measures of association for ordinal data.
This final component of Exhibit 18-1 presents the measures of association used for nominal data.
The Pearson correlation coefficient varies over a range of +1 to -1. The designation r symbolizes the coefficient’s estimate of linear association based on sampling data. The coefficient ρ (rho) represents the population correlation.
Correlation coefficients reveal the magnitude and direction of relationships. The magnitude is the degree to which variables move in unison or opposition. The closer the coefficient is to 1 (regardless of sign) the stronger the relationship. The sign indicates the direction of the relationship. A positive sign indicates a direct relationship and a negative sign indicates an inverse relationship.
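The magnitude-and-direction arithmetic can be sketched in a few lines of Python. This is an illustrative sketch, not code from the text, and the data are made up:

```python
import math

def pearson_r(x, y):
    """Pearson product moment correlation for two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

x = [1, 2, 3, 4, 5]
# A perfectly linear direct relationship gives r = +1; an inverse one, r = -1.
print(round(pearson_r(x, [2 * v + 1 for v in x]), 10))   # 1.0
print(round(pearson_r(x, [10 - 3 * v for v in x]), 10))  # -1.0
```

The sign comes from the covariance term in the numerator; the denominator only scales the result into the -1 to +1 range.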
Exhibit 18-2
Scatterplots are essential for understanding the relationships between variables. They provide a means for visual inspection of data that a listing of values for two variables cannot. Both the direction and shape of a relationship are conveyed in a plot.
Exhibit 18-2 contains a series of scatterplots that depict some relationships across the range r. The three plots on the left side of the figure have their points sloping from the upper left to the lower right of each x-y plot. They represent different magnitudes of negative relationships. On the right side of the figure, the three plots have opposite directional patterns and show positive relationships.
When stronger relationships are apparent, the points cluster close to an imaginary line passing through the data. For a weak relationship or no relationship (r=0), data points are randomly scattered with little or no directional pattern.
Exhibit 18-4
The need for data visualization is illustrated with four small data sets possessing identical summary statistics but displaying different patterns.
Exhibit 18-3 contains the data and the exhibit shown in the slide plots them.
In Plot 1, the variables are positively related.
In Plot 2, the data are curvilinear and r is an inappropriate measure of their relationship.
In Plot 3, an influential point that changed the coefficient is shown.
In Plot 4, the values of x are constant.
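The four plots described behave like Anscombe’s classic quartet, the standard demonstration of identical summary statistics hiding radically different patterns. Whether or not Exhibit 18-3 uses these exact numbers, the quartet makes the point:

```python
from statistics import mean

def pearson_r(x, y):
    """Pearson r from deviation sums of squares and cross-products."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

# Anscombe's quartet: set 2 is curvilinear, set 3 has an influential point,
# set 4 holds x constant except for one observation.
x = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
y1 = [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]
y2 = [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]
y3 = [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]
x4 = [8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8]
y4 = [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]

# All four pairs share (to two decimals) the same means and r of about 0.816,
# yet only the first is well described by a straight line.
rs = [pearson_r(x, y1), pearson_r(x, y2), pearson_r(x, y3), pearson_r(x4, y4)]
```

Computing r without plotting would make these four very different data sets look interchangeable, which is exactly why visualization comes first.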
Correlations are based on linear relationships and bivariate normal distributions.
Exhibit 18-7
The amount of common variance in X and Y may be summarized by r2, the coefficient of determination.
As shown in the exhibit in the slide, the overlap between the two variables is the proportion of their common or shared variance. The area of overlap represents the percentage of the total relationship accounted for by one variable or the other. In the slide, 86% of the variance in X is explained by Y and vice versa.
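If the slide’s 86% figure comes from a correlation of roughly r = 0.93 (an assumption; the exhibit’s exact r is not restated here), the arithmetic is simply:

```python
r = 0.93                     # assumed correlation between X and Y
r_squared = r ** 2           # coefficient of determination
print(round(r_squared, 2))   # 0.86, i.e., 86% shared variance
```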
A correlation coefficient of any magnitude or sign, regardless of its statistical significance, does not imply causation. Correlation provides no evidence of cause and effect. There are several alternate explanations that may be provided for correlation results and these are listed in the slide. Note that causation may actually exist, but the correlation only shows that a relationship exists. Therefore, x may cause y or y may cause x. However, it may also be that z causes x and y or that y and x interact.
Exhibit 18-8. Artifact correlations occur when distinct groups of data combine to give the impression of a single group. They should be guarded against because the data appear to come from one group when, in fact, distinct groups are present.
The left panel shows data from two business sectors. If all the data points for x and y variables are aggregated and a correlation is computed for a single group, a positive correlation results. Separate calculations for each sector reveal no relationship.
In the right panel, the companies in the financial sector score high on assets and low in sales but they are all banks. When the data for banks are removed, the correlation is nearly perfect. When banks are kept in, the overall relationship drops considerably.
In many relationships, other factors combine to make the coefficient’s meaning misleading. For example, there is a relationship between the results of presidential elections and the results of the Washington Redskins’ game before the election. This does not indicate a meaningful relationship.
Exhibit 18-9
Relationships also serve as a basis for estimation and prediction. Simple and multiple predictions are made with a technique called regression analysis.
When we take the observed values of X to estimate or predict corresponding Y values, the process is called simple prediction.
When more than one X variable is used, the outcome is a function of multiple predictors.
The similarities and differences of correlation and regression are summarized in Exhibit 18-9. Their relatedness suggests that beneath many correlation problems is a regression analysis that could provide further insight about the relationship of Y with X.
Exhibit 18-10
A straight line is the simplest way to model the relationship between two continuous variables. The bivariate linear regression may be expressed as
Yi = β0 + β1Xi
where the value of the dependent variable Yi is a linear function of the corresponding value of the independent variable Xi in the ith observation.
The slope and the Y intercept are known as regression coefficients. The slope, β1, is the change in Y for a 1-unit change in X. It is sometimes called the “rise over run.” This is defined by the formula
β1 = ΔY / ΔX
the ratio of the change in the rise of the line relative to the run, or travel, along the X axis. Exhibit 18-10 shows a few of the many possible slopes.
The intercept is the value for the linear function when it crosses the Y axis; it is the estimate of Y when X = 0. A formula for the intercept based on the mean scores of the X and Y variables is
β0 = Ȳ - β1X̄
In this example, X represents the average growing-season temperature in degrees Celsius and Y the price of a 12-bottle case in French francs (about 6.56 French francs = 1 euro).
The plotted data in Exhibit 18-11 is shown on the next slide.
Exhibit 18-11
The plot shows a linear relationship between the pairs of points and a perfect positive correlation, ryx = 1.0. The slope of the line is calculated:
β1 = (Yi - Yj) / (Xi - Xj) = (4000 - 3000) / (20 - 16) = 250
where (Xi, Yi) is the data point (20, 4000) and (Xj, Yj) is the point (16, 3000). The intercept is -1,000, the value of Y at the point where X = 0. This area is off the graph and appears in an insert on the figure.
Substituting into the formula, we have the simple regression equation
Ŷ = -1000 + 250X
We can now predict that a warm growing season with a 25.5°C average temperature would bring a case price of -1,000 + 250(25.5) = 5,375 French francs.
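The arithmetic behind this prediction, using only the two data points quoted in the notes, can be checked in a few lines:

```python
# The two (temperature, case price) points given in the notes.
x1, y1 = 16, 3000                    # (Xj, Yj)
x2, y2 = 20, 4000                    # (Xi, Yi)
slope = (y2 - y1) / (x2 - x1)        # rise over run: 1000 / 4
intercept = y1 - slope * x1          # the line passes through (16, 3000)
prediction = intercept + slope * 25.5
print(slope, intercept, prediction)  # 250.0 -1000.0 5375.0
```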
Exhibit 18-12
It is more likely that we will collect data where the values of Y vary for each X value. Considering Exhibit 18-12, we should expect a distribution of price values for the temperature X = 16, another for X = 20, and another for each value of X. The means of these Y distributions will also vary in some systematic way with X. These variations lead us to construct a probabilistic model that also uses a linear function. This function is written
Yi = β0 + β1Xi + εi
As shown in the exhibit, the actual values of Y may be found above or below the regression line represented by the mean value of Y (β0 + β1Xi) for a particular value of X. These deviations, εi, are the errors in fitting the line and are often called the error term.
Exhibit 18-14
Exhibit 18-14 contains a new data set for the wine price example. Our prediction of Y from X must now account for the fact that the X and Y pairs do not fall neatly along the line.
Exhibit 18-14, shown in the slide, suggests two alternatives based on visual inspection.
The method of least squares allows us to find a regression line, or line of best fit, which will keep these errors to a minimum. It uses the criterion of minimizing the total squared errors of estimate.
When we predict values of Y for each Xi, the difference between the actual Yi and the predicted Y is the error. This error is squared and then summed. The line of best fit is the one that minimizes the total squared errors of prediction.
minimize SSE = Σ(Yi - Ŷi)²
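A minimal sketch of the closed-form least-squares solution; the temperature/price pairs below are hypothetical stand-ins, not the Exhibit 18-14 data:

```python
def least_squares(x, y):
    """Closed-form OLS coefficients: intercept b0 and slope b1."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    b1 = (sum((a - mx) * (b - my) for a, b in zip(x, y))
          / sum((a - mx) ** 2 for a in x))
    b0 = my - b1 * mx
    return b0, b1

# Hypothetical data: the line Y = -1000 + 250X plus symmetric errors,
# so the fit recovers those coefficients exactly.
x = [14, 16, 18, 20, 22]
y = [2550, 2950, 3500, 3950, 4550]
b0, b1 = least_squares(x, y)
print(b1, b0)  # 250.0 -1000.0
```

No other line through these points produces a smaller sum of squared errors; that is the least-squares criterion.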
Regression coefficients β0 and β1 are used to find the least-squares solution. They are computed as follows:
Exhibit 18-15
Substituting data from Exhibit 18-14 into both formulas, we get
Use formula from page 524
Before drawing the regression line, we select two values of X to compute. Using values 13 and 24 for Xi, the points are
Use formula from page 000
Exhibit 18-16
A residual is what remains after the line is fit. When standardized, residuals are comparable to Z scores with a mean of 0 and a standard deviation of 1. In this plot, the standardized residuals should fall between -2 and +2, be randomly distributed about zero, and show no discernible pattern. When all these conditions hold, the model has been applied appropriately.
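A sketch of those residual checks, using a hypothetical fitted line and data (not the chapter’s):

```python
from statistics import pstdev

# Hypothetical data and their least-squares line Y-hat = -1000 + 250X.
x = [14, 16, 18, 20, 22]
y = [2550, 2950, 3500, 3950, 4550]
b0, b1 = -1000.0, 250.0
residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]
z = [e / pstdev(residuals) for e in residuals]  # standardized residuals
# A well-behaved fit: every value inside +/-2 and centered on zero.
print([round(v, 2) for v in z])
```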
Exhibit 18-17
Prediction and confidence bands are bow-tie-shaped confidence intervals around the regression line. Predictions farther from the mean of X have larger bandwidths. If we wanted to predict the price of a case of investment-grade red wine for a growing season that averages 21 degrees Celsius, our prediction would be
Use formula from page 552
This is a point prediction of Y and, for greater precision, should be surrounded by an interval estimate. As with other confidence estimates, we establish the degree of confidence desired and substitute into the formula
Use formula from page 552
With the regression line plotted, next we evaluate the goodness of fit. Goodness of fit is a measure of how well the regression model is able to predict Y. The most important test in bivariate linear regression is whether the slope, β1, is equal to zero. Zero slopes result from the conditions listed in the slide.
To test whether the slope is equal to zero, we use a two-tailed test. The test follows the t distribution for n-2 degrees of freedom.
In this case, we reject the null hypothesis because the calculated t is greater than any t value for 8 degrees of freedom and an alpha of .01.
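The slope test can be sketched as follows; the data and fit are hypothetical, so the resulting t and degrees of freedom differ from the chapter’s example (which used 8 df):

```python
import math

# Hypothetical data (not Exhibit 18-14) and their least-squares fit.
x = [14, 16, 18, 20, 22]
y = [2550, 2950, 3500, 3950, 4550]
b0, b1 = -1000.0, 250.0
n = len(x)
mx = sum(x) / n
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
s = math.sqrt(sse / (n - 2))               # standard error of estimate
se_b1 = s / math.sqrt(sum((xi - mx) ** 2 for xi in x))
t = b1 / se_b1                             # tests H0: slope = 0
F = t ** 2                                 # in bivariate regression, F = t^2
print(round(t, 2))  # 27.39
```

Here t far exceeds the two-tailed critical value for n - 2 = 3 degrees of freedom at alpha = .01 (about 5.84), so the zero-slope hypothesis would be rejected.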
Exhibit 18-18
Computer printouts generally contain an analysis of variance (ANOVA) table with an F test of the regression model.
In bivariate regression, t and F tests produce the same results since t² = F.
In multiple regression, the F test has an overall role for the model and each of the independent variables is evaluated in a separate t-test.
For regression, the ANOVA comprises explained deviations and unexplained deviations. Together, they constitute total deviation. This is shown graphically in the exhibit in the slide. These sources of deviation are squared for all observations and summed across the data points.
Exhibit 18-19
In Exhibit 18-19, we develop this concept sequentially, concluding with the F test of the regression model for the wine data. Based on the results presented in that table, we find evidence of a linear relationship. The null hypothesis is rejected.
Exhibit 18-20
Nominal measures are used to assess the strength of relationships in cross-classification tables. They are often used with chi-square.
Exhibit 18-20 reports a 2 x 2 table showing the test of an advertising campaign involving 66 people. The variables are success of the campaign and whether direct mail was used. In this example, the observed significance level is less than the testing level (.05) and the null hypothesis is rejected.
A correction to chi-square is provided. The exhibit also provides an approximate significance of the coefficient based on the chi-square distribution. This is a test of the null hypothesis that no relationship exists between the variables of direct mail and campaign success.
Phi is used with chi-square and is a measure of association for nominal, nonparametric variables. Phi ranges from 0 to +1. It is calculated as
φ = sqrt(χ² / n)
Cramer’s V is also used with chi-square and is a measure of association for nominal variables with tables larger than 2 x 2. It is calculated as
V = sqrt(χ² / (n × min(rows - 1, columns - 1)))
The contingency coefficient C is not comparable to other measures and has a different upper limit for various table sizes. C is calculated as
C = sqrt(χ² / (χ² + n))
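The three chi-square-based coefficients can be computed together from one table. The 2 x 2 counts below are hypothetical (chosen only to total the slide’s n = 66), not the Exhibit 18-20 frequencies:

```python
import math

def chi_square(table):
    """Pearson chi-square for a two-way frequency table (list of rows)."""
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(col) for col in zip(*table)]
    return sum((table[i][j] - row_tot[i] * col_tot[j] / n) ** 2
               / (row_tot[i] * col_tot[j] / n)
               for i in range(len(table)) for j in range(len(col_tot)))

# Hypothetical direct mail (rows) vs. campaign success (columns), n = 66.
table = [[22, 11],
         [11, 22]]
n = 66
chi2 = chi_square(table)
phi = math.sqrt(chi2 / n)
v = math.sqrt(chi2 / (n * min(len(table) - 1, len(table[0]) - 1)))
c = math.sqrt(chi2 / (chi2 + n))
print(round(phi, 3), round(v, 3), round(c, 3))  # 0.333 0.333 0.316
```

For a 2 x 2 table min(rows - 1, columns - 1) = 1, so Cramer’s V reduces to phi; C gives a slightly smaller value with a table-size-dependent ceiling.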
Exhibit 18-21
Proportional reduction in error (PRE) statistics are the second type used with contingency tables. Lambda and tau are the examples discussed in the text. Lambda is a measure of how well the frequencies of one nominal variable predict the frequencies of another variable.
Exhibit 18-21 shows results from an opinion survey with a sample of 400 (see highlighted portion of exhibit) shareholders in publicly traded advertising firms. If asked to predict the opinions of an individual in the sample, we would achieve the best prediction by choosing the mode. However, by doing so, we would be wrong 180 out of 400 times. Lambda shows us that we improve the prediction by 39% if we include information about a respondent’s occupational status. The formula for lambda is
λ = (errors made without the predictor - errors made with the predictor) / errors made without the predictor
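A sketch of the lambda computation; the counts below are hypothetical but chosen to be consistent with the slide’s figures (n = 400, 180 baseline errors, roughly 39% improvement), not the actual Exhibit 18-21 frequencies:

```python
# Hypothetical occupational class x opinion counts.
table = {"class A": {"favor": 165, "oppose": 55},
         "class B": {"favor": 55, "oppose": 125}}

n = sum(sum(row.values()) for row in table.values())       # 400
col_tot = {}
for row in table.values():
    for k, v in row.items():
        col_tot[k] = col_tot.get(k, 0) + v
# Predicting the overall mode ("favor", 220) misses 180 of 400 cases.
errors_without = n - max(col_tot.values())
# Predicting the modal opinion within each class misses fewer cases.
errors_with = sum(sum(row.values()) - max(row.values())
                  for row in table.values())
lam = (errors_without - errors_with) / errors_without
print(round(lam, 2))  # 0.39
```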
This figure is also highlighted. Goodman and Kruskal’s tau uses table marginals to reduce prediction errors. In predicting opinion in the example without knowledge of occupational class, we would expect a 50.5% correct classification and a 49.5% probability of error. When additional knowledge of occupational class is used, correct classification of the opinion variable improves to 62.7%, with a 37.3% probability of error. Tau is computed as
τ = (P(error without predictor) - P(error with predictor)) / P(error without predictor) = (.495 - .373) / .495 ≈ .246
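The PRE arithmetic for tau, using the error probabilities quoted above:

```python
p_error_without = 0.495   # chance of misclassifying opinion with no predictor
p_error_with = 0.373      # chance of misclassifying given occupational class
tau = (p_error_without - p_error_with) / p_error_without
print(round(tau, 3))      # 0.246, about a 25% reduction in error
```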
Exhibit 18-22
When data require ordinal measures, we may use Gamma, Kendall’s tau b and c, Somers’s d, and Spearman’s rho.
All but Spearman’s rank-order correlation are based on the concept of concordant and discordant pairs, which are discussed further on the following slide.
Exhibit 18-22 presents data for 70 employees of KeyDesign who have been evaluated for coronary risk. The management levels are ranked as are the fitness assessments by the physicians. The information in the exhibit has been arranged so that the number of concordant and discordant pairs of individual observations may be calculated. When a subject that ranks higher on one variable also ranks higher on the other, the pairs of observations are said to be concordant. If a higher ranking on one is accompanied by a lower ranking on the other variable, the pairs of observations are discordant. Exhibit 18-23 summarizes the procedure for calculating the summary terms needed.
Gamma is a statistic that compares the number of concordant pairs (P) and discordant pairs (Q) and standardizes the outcome by dividing by their total: γ = (P - Q) / (P + Q).
Kendall’s tau b is a refinement of gamma for ordinal data that considers “tied” pairs for square tables.
Kendall’s tau c is a refinement of gamma for ordinal data that considers “tied” pairs for any-size table.
Somers’s d is a measure of association for ordinal data that compensates for “tied” ranks and adjusts for direction of the independent variable.
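The concordant/discordant counting that underlies gamma and the tau statistics can be sketched directly; the ranked pairs below are hypothetical, not the KeyDesign data:

```python
def count_pairs(pairs):
    """Concordant (P) and discordant (Q) pair counts for (x_rank, y_rank)."""
    P = Q = 0
    for i in range(len(pairs)):
        for j in range(i + 1, len(pairs)):
            prod = (pairs[i][0] - pairs[j][0]) * (pairs[i][1] - pairs[j][1])
            if prod > 0:
                P += 1       # ranked the same way on both variables
            elif prod < 0:
                Q += 1       # ranked opposite ways (ties count toward neither)
    return P, Q

# Hypothetical management-level / fitness ranks for five employees.
data = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 5)]
P, Q = count_pairs(data)
gamma = (P - Q) / (P + Q)
print(P, Q, gamma)  # 8 2 0.6
```

Kendall’s tau b and c and Somers’s d start from the same P and Q but adjust the denominator for ties, table size, or the direction of the independent variable.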
Exhibit 18-23
Exhibit 18-24
Spearman’s rho correlates ranks between two ordinal variables. Spearman’s rho has many advantages. When data are transformed by logs or squaring, rho remains unaffected. Outliers or extreme scores are not a threat. It is easy to compute.
In the example in the exhibit in the slide, KDL, a media firm, is recruiting account executive trainees. Assume the field has been narrowed to 10 applicants for final evaluation. They go through a battery of tests and interviews. The results are evaluated by an industrial psychologist who then ranks the 10 candidates. The executives produce a composite ranking based on the interviews. Now we need to determine how well these two sets of rankings agree. Using the equation
rho = 1 - (6 Σ d²) / (n(n² - 1))
where d is the difference between each pair of ranks, we can see that there is a strong relationship between the rankings, and we reject the null.
The relationship between the panel’s and the psychologist’s rankings is moderately high, suggesting agreement between the two measures. The null hypothesis of no relationship between the measures is rejected at the .05 level with n - 2 degrees of freedom.
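A sketch of the rank-difference computation; the two rankings below are hypothetical, not the Exhibit 18-24 data:

```python
def spearman_rho(rank_x, rank_y):
    """Spearman's rho from two lists of ranks with no ties."""
    n = len(rank_x)
    d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))

# Hypothetical psychologist vs. executive-panel rankings of 10 finalists.
psych = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
panel = [2, 1, 3, 5, 4, 7, 6, 8, 10, 9]
rho = spearman_rho(psych, panel)
print(round(rho, 3))  # 0.952
```

Because only rank differences enter the formula, any monotonic transformation of the underlying scores leaves rho unchanged.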