SlideShare a Scribd company logo
1 of 38
Chapter 18Chapter 18
Measures ofMeasures of
AssociationAssociation
McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All Rights Reserved.
18-2
Learning ObjectivesLearning Objectives
Understand . . .
• How correlation analysis may be applied to
study relationships between two or more
variables
• The uses, requirements, and interpretation of
the product moment correlation coefficient.
• How predictions are made with regression
analysis using the method of least squares to
minimize errors in drawing a line of best fit.
18-3
Learning ObjectivesLearning Objectives
Understand . . .
• How to test regression models for linearity and
whether the equation is effective in fitting the
data.
• Nonparametric measures of association and
the alternatives they offer when key
assumptions and requirements for parametric
techniques cannot be met.
18-4
Invalid AssumptionsInvalid Assumptions
“The invalid assumption that correlation
implies cause is probably among the two
or three most serious and common errors
of human reasoning.”
Stephen Jay Gould
paleontologist and science writer
18-5
PulsePoint:PulsePoint:
Research RevelationResearch Revelation
25
The percent of students using a
credit card for college costs due
to convenience.
18-6
Measures of Association:Measures of Association:
Interval/Ratio DataInterval/Ratio Data
Pearson correlation coefficient
For continuous linearly related
variables
Correlation ratio (eta)
For nonlinear data or relating a main
effect to a continuous dependent
variable
Biserial
One continuous and one dichotomous
variable with an underlying normal
distribution
Partial correlation
Three variables; relating two with the
third’s effect taken out
Multiple correlation
Three variables; relating one variable
with two others
Bivariate linear regression
Predicting one variable from another’s
scores
18-7
Measures of Association:Measures of Association:
Ordinal DataOrdinal Data
Gamma
Based on concordant-discordant
pairs; proportional reduction in
error (PRE) interpretation
Kendall’s tau b
P-Q based; adjustment for tied
ranks
Kendall’s tau c
P-Q based; adjustment for table
dimensions
Somers’s d
P-Q based; asymmetrical
extension of gamma
Spearman’s rho
Product moment correlation for
ranked data
18-8
Measures of Association:Measures of Association:
Nominal DataNominal Data
Phi Chi-square based for 2*2 tables
Cramer’s V
CS based; adjustment when one table
dimension >2
Contingency coefficient C
CS based; flexible data and distribution
assumptions
Lambda PRE based interpretation
Goodman & Kruskal’s tau
PRE based with table marginals
emphasis
Uncertainty coefficient Useful for multidimensional tables
Kappa Agreement measure
18-9
Researchers Search for InsightsResearchers Search for Insights
Burke, one of the
world’s leading
research
companies, claims
researchers add the
most value to a
project when they
look beyond the raw
numbers to the
shades of gray…
what the data really
mean.
18-10
Pearson’s Product MomentPearson’s Product Moment
CorrelationCorrelation rr
Is there a relationship between X and Y?
What is the magnitude of the relationship?
What is the direction of the relationship?
18-11
Connections andConnections and
DisconnectionsDisconnections
“To truly understand consumers’ motives
and actions, you must determine
relationships between what they think and
feel and what they actually do.”
David Singleton, vp of insights
Zyman Marketing Group
18-12
Scatterplots of RelationshipsScatterplots of Relationships
18-13
ScatterplotsScatterplots
18-14
Diagram of Common VarianceDiagram of Common Variance
18-15
Interpretation of CorrelationsInterpretation of Correlations
X causes YX causes Y
Y causes XY causes X
X and Y are activated by
one or more other variables
X and Y are activated by
one or more other variables
X and Y influence each
other reciprocally
X and Y influence each
other reciprocally
18-16
Artifact CorrelationsArtifact Correlations
18-17
Interpretation of CoefficientsInterpretation of Coefficients
A coefficient is not remarkable simply
because it is statistically significant!
It must be practically meaningful.
18-18
Comparison of Bivariate LinearComparison of Bivariate Linear
Correlation and RegressionCorrelation and Regression
18-19
Examples of Different SlopesExamples of Different Slopes
18-20
Concept ApplicationConcept Application
X
Average Temperature (Celsius)
Y
Price per Case
(FF)
12 2,000
16 3,000
20 4,000
24 5,000
Mean =18 Mean = 3,500
18-21
Plot of Wine Price by AveragePlot of Wine Price by Average
TemperatureTemperature
18-22
Distribution of Y forDistribution of Y for
Observation of XObservation of X
18-23
Wine Price Study ExampleWine Price Study Example
18-24
Least Squares Line: Wine PriceLeast Squares Line: Wine Price
StudyStudy
18-25
Plot of Standardized ResidualsPlot of Standardized Residuals
18-26
Prediction andPrediction and
Confidence BandsConfidence Bands
18-27
Testing Goodness of FitTesting Goodness of Fit
Y is completely unrelated to X
and no systematic pattern is evident
There are constant values of
Y for every value of X
The data are related but
represented by a nonlinear function
18-28
Components of VariationComponents of Variation
18-29
FF Ratio in RegressionRatio in Regression
18-30
Coefficient of Determination:Coefficient of Determination: rr22
Total proportion of variance in
Y explained by X
Desired r2
: 80% or more
18-31
Chi-Square Based MeasuresChi-Square Based Measures
18-32
Proportional Reduction ofProportional Reduction of
Error MeasuresError Measures
18-33
Statistical AlternativesStatistical Alternatives
for Ordinal Measuresfor Ordinal Measures
18-34
Calculation of Concordant (Calculation of Concordant (PP),),
Discordant (Discordant (QQ), Tied (), Tied (Tx,TyTx,Ty), and), and
Total Paired Observations:Total Paired Observations:
KeyDesign ExampleKeyDesign Example
18-35
KDL Data for Spearman’s RhoKDL Data for Spearman’s Rho
_______ _____ Rank By_____ _____
_____
Applicant Panel x Psychologist y d d2
1
2
3
4
5
6
7
8
9
10
3.5
10.0
6.5
2.0
1.0
9.0
3.5
6.5
8.0
5.0
6.0
5.0
8.0
1.5
3.0
7.0
1.5
9.0
10.0
4.0
-2.5
5.0
-1.5
.05
-2
2.0
2.0
-2.5
-2
1.0
6.25
25.00
2.52
0.25
4.00
4.00
4.00
6.25
4.00
_1.00_
57.00 .
18-36
Key TermsKey Terms
• Artifact correlations
• Bivariate correlation
analysis
• Bivariate normal
distribution
• Chi-square-based
measures
• Contingency coefficient C
• Cramer’s V
• Phi
• Coefficient of determination
(r2)
• Concordant
• Correlation matrix
• Discordant
• Error term
• Goodness of fit
• lambda
18-37
Key TermsKey Terms
• Linearity
• Method of least squares
• Ordinal measures
• Gamma
• Somers’s d
• Spearman’s rho
• tau b
• tau c
• Pearson correlation
coefficient
• Prediction and confidence
bands
• Proportional reduction in
error (PRE)
• Regression analysis
• Regression coefficients
18-38
Key TermsKey Terms
• Intercept
• Slope
• Residual
• Scatterplot
• Simple prediction
• tau

More Related Content

What's hot

Regression & correlation coefficient
Regression & correlation coefficientRegression & correlation coefficient
Regression & correlation coefficientMuhamamdZiaSamad
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionAbdelaziz Tayoun
 
Power Analysis and Sample Size Determination
Power Analysis and Sample Size DeterminationPower Analysis and Sample Size Determination
Power Analysis and Sample Size DeterminationAjay Dhamija
 
Spearman Rank Correlation - Thiyagu
Spearman Rank Correlation - ThiyaguSpearman Rank Correlation - Thiyagu
Spearman Rank Correlation - ThiyaguThiyagu K
 
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and IndependenceThe Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and Independencejasondroesch
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionMOHIT PANCHAL
 
F test and ANOVA
F test and ANOVAF test and ANOVA
F test and ANOVAParag Shah
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionAjendra7846
 
Regression Analysis - Thiyagu
Regression Analysis - ThiyaguRegression Analysis - Thiyagu
Regression Analysis - ThiyaguThiyagu K
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpointkellula
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis pptAnil Mishra
 
Inferential statistics (2)
Inferential statistics (2)Inferential statistics (2)
Inferential statistics (2)rajnulada
 
Mpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationMpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationVasant Kothari
 
Basics of Structural Equation Modeling
Basics of Structural Equation ModelingBasics of Structural Equation Modeling
Basics of Structural Equation Modelingsmackinnon
 
What is a kendall's tau?
What is a kendall's tau?What is a kendall's tau?
What is a kendall's tau?Ken Plummer
 

What's hot (20)

Sampling & Sampling Distribtutions
Sampling & Sampling DistribtutionsSampling & Sampling Distribtutions
Sampling & Sampling Distribtutions
 
Correlation
CorrelationCorrelation
Correlation
 
Regression & correlation coefficient
Regression & correlation coefficientRegression & correlation coefficient
Regression & correlation coefficient
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Power Analysis and Sample Size Determination
Power Analysis and Sample Size DeterminationPower Analysis and Sample Size Determination
Power Analysis and Sample Size Determination
 
Spearman Rank Correlation - Thiyagu
Spearman Rank Correlation - ThiyaguSpearman Rank Correlation - Thiyagu
Spearman Rank Correlation - Thiyagu
 
Intro to probability
Intro to probabilityIntro to probability
Intro to probability
 
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and IndependenceThe Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
F test and ANOVA
F test and ANOVAF test and ANOVA
F test and ANOVA
 
Path analysis
Path analysisPath analysis
Path analysis
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Regression Analysis - Thiyagu
Regression Analysis - ThiyaguRegression Analysis - Thiyagu
Regression Analysis - Thiyagu
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpoint
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis ppt
 
Inferential statistics (2)
Inferential statistics (2)Inferential statistics (2)
Inferential statistics (2)
 
Mpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationMpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlation
 
Spearman Rank
Spearman RankSpearman Rank
Spearman Rank
 
Basics of Structural Equation Modeling
Basics of Structural Equation ModelingBasics of Structural Equation Modeling
Basics of Structural Equation Modeling
 
What is a kendall's tau?
What is a kendall's tau?What is a kendall's tau?
What is a kendall's tau?
 

Similar to Measures of Association and Correlation Coefficients

09.2 credit scoring
09.2   credit scoring09.2   credit scoring
09.2 credit scoringcrmbasel
 
correlation_and_covariance
correlation_and_covariancecorrelation_and_covariance
correlation_and_covarianceEkta Doger
 
Business Research Methods Chap018
Business Research Methods Chap018Business Research Methods Chap018
Business Research Methods Chap018Mazhar Masood
 
Chapter 18 sensitivity analysis
Chapter 18   sensitivity analysisChapter 18   sensitivity analysis
Chapter 18 sensitivity analysisBich Lien Pham
 
Chapter 18 sensitivity analysis
Chapter 18   sensitivity analysisChapter 18   sensitivity analysis
Chapter 18 sensitivity analysisBich Lien Pham
 
Demand estimation and forecasting
Demand estimation and forecastingDemand estimation and forecasting
Demand estimation and forecastingshivraj negi
 
Question 1 Independent random samples taken on two university .docx
Question 1 Independent random samples taken on two university .docxQuestion 1 Independent random samples taken on two university .docx
Question 1 Independent random samples taken on two university .docxIRESH3
 
Managerial Economics (Chapter 5 - Demand Estimation)
 Managerial Economics (Chapter 5 - Demand Estimation) Managerial Economics (Chapter 5 - Demand Estimation)
Managerial Economics (Chapter 5 - Demand Estimation)Nurul Shareena Misran
 
Big Data Revolution: Increasing Transparency to Risk and Valuation
Big Data Revolution: Increasing Transparency to Risk and ValuationBig Data Revolution: Increasing Transparency to Risk and Valuation
Big Data Revolution: Increasing Transparency to Risk and ValuationHouseCanary
 
Threshold and Multivariate Analysis
Threshold and Multivariate AnalysisThreshold and Multivariate Analysis
Threshold and Multivariate AnalysisAlbertRaja6
 
UsingAHP_02
UsingAHP_02UsingAHP_02
UsingAHP_02pbaxter
 
Credit card fraud detection using machine learning Algorithms
Credit card fraud detection using machine learning AlgorithmsCredit card fraud detection using machine learning Algorithms
Credit card fraud detection using machine learning Algorithmsankit panigrahy
 
Click Model-Based Information Retrieval Metrics
Click Model-Based Information Retrieval MetricsClick Model-Based Information Retrieval Metrics
Click Model-Based Information Retrieval MetricsAleksandr Chuklin
 
Cost Risk Analysis (CRA) by Pedram Daneshmand 19-Jan-2011
Cost Risk Analysis (CRA) by Pedram Daneshmand 19-Jan-2011Cost Risk Analysis (CRA) by Pedram Daneshmand 19-Jan-2011
Cost Risk Analysis (CRA) by Pedram Daneshmand 19-Jan-2011Pedram Danesh-Mand
 
Quantitative Techniques in Management - Objective Assignment
Quantitative Techniques in Management - Objective AssignmentQuantitative Techniques in Management - Objective Assignment
Quantitative Techniques in Management - Objective AssignmentRohit Sharma
 
REPORTby Assignment 1 Asssignment 1Submission dat e 30.docx
REPORTby Assignment 1 Asssignment 1Submission dat e  30.docxREPORTby Assignment 1 Asssignment 1Submission dat e  30.docx
REPORTby Assignment 1 Asssignment 1Submission dat e 30.docxsodhi3
 

Similar to Measures of Association and Correlation Coefficients (20)

chapter19[1].ppt
chapter19[1].pptchapter19[1].ppt
chapter19[1].ppt
 
09.2 credit scoring
09.2   credit scoring09.2   credit scoring
09.2 credit scoring
 
correlation_and_covariance
correlation_and_covariancecorrelation_and_covariance
correlation_and_covariance
 
Business Research Methods Chap018
Business Research Methods Chap018Business Research Methods Chap018
Business Research Methods Chap018
 
Chapter 18 sensitivity analysis
Chapter 18   sensitivity analysisChapter 18   sensitivity analysis
Chapter 18 sensitivity analysis
 
Chapter 18 sensitivity analysis
Chapter 18   sensitivity analysisChapter 18   sensitivity analysis
Chapter 18 sensitivity analysis
 
MyRBQM Academy | Webinar Fraud and Sloppiness Detection in Clinical Trials [P...
MyRBQM Academy | Webinar Fraud and Sloppiness Detection in Clinical Trials [P...MyRBQM Academy | Webinar Fraud and Sloppiness Detection in Clinical Trials [P...
MyRBQM Academy | Webinar Fraud and Sloppiness Detection in Clinical Trials [P...
 
Demand estimation and forecasting
Demand estimation and forecastingDemand estimation and forecasting
Demand estimation and forecasting
 
MyRBQM Academy | Webinar Fraud and Sloppiness Detection in Clinical Trials [P...
MyRBQM Academy | Webinar Fraud and Sloppiness Detection in Clinical Trials [P...MyRBQM Academy | Webinar Fraud and Sloppiness Detection in Clinical Trials [P...
MyRBQM Academy | Webinar Fraud and Sloppiness Detection in Clinical Trials [P...
 
Question 1 Independent random samples taken on two university .docx
Question 1 Independent random samples taken on two university .docxQuestion 1 Independent random samples taken on two university .docx
Question 1 Independent random samples taken on two university .docx
 
Managerial Economics (Chapter 5 - Demand Estimation)
 Managerial Economics (Chapter 5 - Demand Estimation) Managerial Economics (Chapter 5 - Demand Estimation)
Managerial Economics (Chapter 5 - Demand Estimation)
 
Big Data Revolution: Increasing Transparency to Risk and Valuation
Big Data Revolution: Increasing Transparency to Risk and ValuationBig Data Revolution: Increasing Transparency to Risk and Valuation
Big Data Revolution: Increasing Transparency to Risk and Valuation
 
Threshold and Multivariate Analysis
Threshold and Multivariate AnalysisThreshold and Multivariate Analysis
Threshold and Multivariate Analysis
 
UsingAHP_02
UsingAHP_02UsingAHP_02
UsingAHP_02
 
Credit card fraud detection using machine learning Algorithms
Credit card fraud detection using machine learning AlgorithmsCredit card fraud detection using machine learning Algorithms
Credit card fraud detection using machine learning Algorithms
 
Click Model-Based Information Retrieval Metrics
Click Model-Based Information Retrieval MetricsClick Model-Based Information Retrieval Metrics
Click Model-Based Information Retrieval Metrics
 
DataMining_CA2-4
DataMining_CA2-4DataMining_CA2-4
DataMining_CA2-4
 
Cost Risk Analysis (CRA) by Pedram Daneshmand 19-Jan-2011
Cost Risk Analysis (CRA) by Pedram Daneshmand 19-Jan-2011Cost Risk Analysis (CRA) by Pedram Daneshmand 19-Jan-2011
Cost Risk Analysis (CRA) by Pedram Daneshmand 19-Jan-2011
 
Quantitative Techniques in Management - Objective Assignment
Quantitative Techniques in Management - Objective AssignmentQuantitative Techniques in Management - Objective Assignment
Quantitative Techniques in Management - Objective Assignment
 
REPORTby Assignment 1 Asssignment 1Submission dat e 30.docx
REPORTby Assignment 1 Asssignment 1Submission dat e  30.docxREPORTby Assignment 1 Asssignment 1Submission dat e  30.docx
REPORTby Assignment 1 Asssignment 1Submission dat e 30.docx
 

More from Dhamo daran

Projectriskmanagement pmbok5
Projectriskmanagement pmbok5Projectriskmanagement pmbok5
Projectriskmanagement pmbok5Dhamo daran
 
13 project control & closing management
13 project control & closing management13 project control & closing management
13 project control & closing managementDhamo daran
 
12 projectprocurementmanagement
12 projectprocurementmanagement12 projectprocurementmanagement
12 projectprocurementmanagementDhamo daran
 
10 projectcommunicationmanagement
10 projectcommunicationmanagement10 projectcommunicationmanagement
10 projectcommunicationmanagementDhamo daran
 
09 projecthumanresourcemanagement
09 projecthumanresourcemanagement09 projecthumanresourcemanagement
09 projecthumanresourcemanagementDhamo daran
 
08 projectqualitymanagement
08 projectqualitymanagement08 projectqualitymanagement
08 projectqualitymanagementDhamo daran
 
07 projectcostmanagement
07 projectcostmanagement07 projectcostmanagement
07 projectcostmanagementDhamo daran
 
06 projecttimemanagement
06 projecttimemanagement06 projecttimemanagement
06 projecttimemanagementDhamo daran
 
04 projectintegrationmanagement
04 projectintegrationmanagement04 projectintegrationmanagement
04 projectintegrationmanagementDhamo daran
 
04 projectintegrationmanagement
04 projectintegrationmanagement04 projectintegrationmanagement
04 projectintegrationmanagementDhamo daran
 
01 introductiontoframework
01 introductiontoframework01 introductiontoframework
01 introductiontoframeworkDhamo daran
 

More from Dhamo daran (20)

Projectriskmanagement pmbok5
Projectriskmanagement pmbok5Projectriskmanagement pmbok5
Projectriskmanagement pmbok5
 
13 project control & closing management
13 project control & closing management13 project control & closing management
13 project control & closing management
 
12 projectprocurementmanagement
12 projectprocurementmanagement12 projectprocurementmanagement
12 projectprocurementmanagement
 
10 projectcommunicationmanagement
10 projectcommunicationmanagement10 projectcommunicationmanagement
10 projectcommunicationmanagement
 
09 projecthumanresourcemanagement
09 projecthumanresourcemanagement09 projecthumanresourcemanagement
09 projecthumanresourcemanagement
 
08 projectqualitymanagement
08 projectqualitymanagement08 projectqualitymanagement
08 projectqualitymanagement
 
07 projectcostmanagement
07 projectcostmanagement07 projectcostmanagement
07 projectcostmanagement
 
06 projecttimemanagement
06 projecttimemanagement06 projecttimemanagement
06 projecttimemanagement
 
04 projectintegrationmanagement
04 projectintegrationmanagement04 projectintegrationmanagement
04 projectintegrationmanagement
 
04 projectintegrationmanagement
04 projectintegrationmanagement04 projectintegrationmanagement
04 projectintegrationmanagement
 
01 introductiontoframework
01 introductiontoframework01 introductiontoframework
01 introductiontoframework
 
Chap021
Chap021Chap021
Chap021
 
Chap020
Chap020Chap020
Chap020
 
Chap019
Chap019Chap019
Chap019
 
Chap017
Chap017Chap017
Chap017
 
Chap016
Chap016Chap016
Chap016
 
Chap015
Chap015Chap015
Chap015
 
Chap014
Chap014Chap014
Chap014
 
Chap013
Chap013Chap013
Chap013
 
Chap012
Chap012Chap012
Chap012
 

Recently uploaded

Fifteenth Finance Commission Presentation
Fifteenth Finance Commission PresentationFifteenth Finance Commission Presentation
Fifteenth Finance Commission Presentationmintusiprd
 
Reflecting, turning experience into insight
Reflecting, turning experience into insightReflecting, turning experience into insight
Reflecting, turning experience into insightWayne Abrahams
 
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证jdkhjh
 
self respect is very important in this crual word where everyone in just thin...
self respect is very important in this crual word where everyone in just thin...self respect is very important in this crual word where everyone in just thin...
self respect is very important in this crual word where everyone in just thin...afaqsaeed463
 
Unlocking Productivity and Personal Growth through the Importance-Urgency Matrix
Unlocking Productivity and Personal Growth through the Importance-Urgency MatrixUnlocking Productivity and Personal Growth through the Importance-Urgency Matrix
Unlocking Productivity and Personal Growth through the Importance-Urgency MatrixCIToolkit
 
Introduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-EngineeringIntroduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-Engineeringthomas851723
 
Simplifying Complexity: How the Four-Field Matrix Reshapes Thinking
Simplifying Complexity: How the Four-Field Matrix Reshapes ThinkingSimplifying Complexity: How the Four-Field Matrix Reshapes Thinking
Simplifying Complexity: How the Four-Field Matrix Reshapes ThinkingCIToolkit
 
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Nehwal
 
LPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations ReviewLPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations Reviewthomas851723
 
LPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business SectorLPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business Sectorthomas851723
 
Board Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch PresentationBoard Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch Presentationcraig524401
 
Measuring True Process Yield using Robust Yield Metrics
Measuring True Process Yield using Robust Yield MetricsMeasuring True Process Yield using Robust Yield Metrics
Measuring True Process Yield using Robust Yield MetricsCIToolkit
 
VIP Kolkata Call Girl Rajarhat 👉 8250192130 Available With Room
VIP Kolkata Call Girl Rajarhat 👉 8250192130  Available With RoomVIP Kolkata Call Girl Rajarhat 👉 8250192130  Available With Room
VIP Kolkata Call Girl Rajarhat 👉 8250192130 Available With Roomdivyansh0kumar0
 
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)jennyeacort
 
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...AgileNetwork
 
Farmer Representative Organization in Lucknow | Rashtriya Kisan Manch
Farmer Representative Organization in Lucknow | Rashtriya Kisan ManchFarmer Representative Organization in Lucknow | Rashtriya Kisan Manch
Farmer Representative Organization in Lucknow | Rashtriya Kisan ManchRashtriya Kisan Manch
 

Recently uploaded (17)

Fifteenth Finance Commission Presentation
Fifteenth Finance Commission PresentationFifteenth Finance Commission Presentation
Fifteenth Finance Commission Presentation
 
Reflecting, turning experience into insight
Reflecting, turning experience into insightReflecting, turning experience into insight
Reflecting, turning experience into insight
 
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
原版1:1复刻密西西比大学毕业证Mississippi毕业证留信学历认证
 
self respect is very important in this crual word where everyone in just thin...
self respect is very important in this crual word where everyone in just thin...self respect is very important in this crual word where everyone in just thin...
self respect is very important in this crual word where everyone in just thin...
 
Unlocking Productivity and Personal Growth through the Importance-Urgency Matrix
Unlocking Productivity and Personal Growth through the Importance-Urgency MatrixUnlocking Productivity and Personal Growth through the Importance-Urgency Matrix
Unlocking Productivity and Personal Growth through the Importance-Urgency Matrix
 
Introduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-EngineeringIntroduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-Engineering
 
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Servicesauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
 
Simplifying Complexity: How the Four-Field Matrix Reshapes Thinking
Simplifying Complexity: How the Four-Field Matrix Reshapes ThinkingSimplifying Complexity: How the Four-Field Matrix Reshapes Thinking
Simplifying Complexity: How the Four-Field Matrix Reshapes Thinking
 
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
 
LPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations ReviewLPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations Review
 
LPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business SectorLPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business Sector
 
Board Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch PresentationBoard Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch Presentation
 
Measuring True Process Yield using Robust Yield Metrics
Measuring True Process Yield using Robust Yield MetricsMeasuring True Process Yield using Robust Yield Metrics
Measuring True Process Yield using Robust Yield Metrics
 
VIP Kolkata Call Girl Rajarhat 👉 8250192130 Available With Room
VIP Kolkata Call Girl Rajarhat 👉 8250192130  Available With RoomVIP Kolkata Call Girl Rajarhat 👉 8250192130  Available With Room
VIP Kolkata Call Girl Rajarhat 👉 8250192130 Available With Room
 
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
Call Us🔝⇛+91-97111🔝47426 Call In girls Munirka (DELHI)
 
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
 
Farmer Representative Organization in Lucknow | Rashtriya Kisan Manch
Farmer Representative Organization in Lucknow | Rashtriya Kisan ManchFarmer Representative Organization in Lucknow | Rashtriya Kisan Manch
Farmer Representative Organization in Lucknow | Rashtriya Kisan Manch
 

Measures of Association and Correlation Coefficients

Editor's Notes

  1. This chapter explains the use of several techniques including correlation analysis and regression analysis.
  2. See the text Instructors Manual (downloadable from the text website) for ideas for using this research-generated statistic.
  3. Exhibit 18-1 in the text presents a list of commonly used measures of association. These measures of association will be discussed further in this chapter. This slide shows those measures of association relevant for interval and ratio scaled data. The measures used for ordinal and nominal data are presented on the following slides.
  4. Exhibit 18-1 This slide presents the commonly used measures of association for ordinal data.
  5. This final component of Exhibit 18-1 presents the measures of association used for nominal data.
  6. The Pearson correlation coefficient varies over a range of +1 to -1. The designation r symbolizes the coefficient’s estimate of linear association based on sampling data. The coefficient p represents the population correlation. Correlation coefficients reveal the magnitude and direction of relationships. The magnitude is the degree to which variables move in unison or opposition. The closer the coefficient is to 1 (regardless of sign) the stronger the relationship. The sign indicates the direction of the relationship. A positive sign indicates a direct relationship and a negative sign indicates an inverse relationship.
  7. Exhibit 18-2 Scatterplots are essential for understanding the relationships between variables. They provide a means for visual inspection of data that a list for two variables cannot. Both the direction and shape of a relationship are conveyed in a plot. Exhibit 18-2 contains a series of scatterplots that depict some relationships across the range r. The three plots on the left side of the figure have their points sloping from the upper left to the lower right of each x-y plot. They represent different magnitudes of negative relationships. On the right side of the figure, the three plots have opposite directional patterns and show positive relationships. When stronger relationships are apparent, the points cluster close to an imaginary line passing through the data. For a weak relationship or no relationship (r=0), data points are randomly scattered with little or no directional pattern.
  8. Exhibit 18-4 The need for data visualization is illustrated with four small data sets possessing identical summary statistics but displaying different patterns. Exhibit 18-3 contains the data and the exhibit shown in the slide plots them. In Plot 1, the variables are positively related. In Plot 2, the data are curvilinear and r is an inappropriate measure of their relationship. In Plot 3, an influential point that changed the coefficient is shown. In Plot 4, the values of x are constant. Correlations are based on linear relationships and bivariate normal distributions.
  9. Exhibit 18-7 The amount of common variance in X and Y may be summarized by r2, the coefficient of determination. As shown in the exhibit in the slide, the overlap between the two variables is the proportion of their common or shared variance. The area of overlap represents the percentage of the total relationship accounted for by one variable or the other. In the slide, 86% of the variance in X is explained by Y and vice versa.
  10. A correlation coefficient of any magnitude or sign, regardless of its statistical significance, does not imply causation. Correlation provides no evidence of cause and effect. There are several alternate explanations that may be provided for correlation results and these are listed in the slide. Note that causation may actually exist, but the correlation only shows that a relationship exists. Therefore, x may cause y or y may cause x. However, it may also be that z causes x and y or that y and x interact.
  11. Exhibit 18-8 Exhibit 18-8 Artifact correlations occur where distinct groups combine to give the impression of one. Artifact correlations should be avoided because they appear as one group when, in fact, there are distinct groups present. The left panel shows data from two business sectors. If all the data points for x and y variables are aggregated and a correlation is computed for a single group, a positive correlation results. Separate calculations for each sector reveal no relationship. In the right panel, the companies in the financial sector score high on assets and low in sales but they are all banks. When the data for banks are removed, the correlation is nearly perfect. When banks are kept in, the overall relationship drops considerably.
  12. In many relationships, other factors combine to make the coefficient’s meaning misleading. For example, there is a relationship between the results of presidential elections and the results of the Washington Redskins’ game before the election. This does not indicate a meaningful relationship.
  13. Exhibit 18-9 Relationships also serve as a basis for estimation and prediction. Simple and multiple predictions are made with a technique called regression analysis. When we take the observed values of X to estimate or predict corresponding Y values, the process is called simple prediction. When more than one X variable is used, the outcome is a function of multiple predictors. The similarities and differences of correlation and regression are summarized in Exhibit 18-9. Their relatedness suggests that beneath many correlation problems is a regression analysis that could provide further insight about the relationship of Y with X.
  14. Exhibit 18-10 A straight line is fundamentally the best way to model the relationship between two continuous variables. The bivariate linear regression may be expressed as Use formula from page 545 Where the value of the dependent variable Y is a linear function of the corresponding value of the independent variable Xi in the ith observation. The slope and the Y intercept are known as regression coefficients. The slope, 1, is the change in Y for a 1-unit change in X. It is sometimes called the “rise over run.” This is defined by the formula Use formula from page 545 This is the ratio of change in the rise of the line relative to the run or travel along the X axis. Exhibit 18-10 shows a few of the many possible slopes. The intercept is the value for the linear function when it crosses the Y axis; it is the estimate of Y when X = 0. A formula for the intercept based on the mean scores of the X and Y variables is Use formula from page 545
  15. In this example, X represents the average growing-season temperature in degrees Celsius and Y the price of a 12-bottle case in French francs (6.8 French francs = 1 euro). The plotted data in Exhibit 18-11 is shown on the next slide.
  16. Exhibit 18-11 The plot shows a linear relationship between the pairs of points and a perfect positive correlation, ryx = 1.0. The slope of the line is calculated: Use formula from page 546 Where the XiYi values are the data points (20,4000) and XjYj are points (16, 3000). The intercept is -1,000, the point at which X = 0 in this plot. This area is off the graph and appears in an insert on the figure. Substituting into the formula, we have the simple regression equation We can now predict that a warm growing season with 25.5C temperature would bring a case price of 5,375 French francs.
  17. Exhibit 18-12 It is more likely that we will collect data where the values of Y vary for each X value. Considering Exhibit 18-12, we should expect a distribution of price values for the temperature X = 16, another for X = 20, and another for each value of X. The means of these Y distributions will also vary in some systematic way with X. These variations lead us to construct a probabilistic model that also uses a linear function. This function is written Use formula from page 547 As shown in the exhibit, the actual values of Y may be found above or below the regression line represented by the mean value of Y (o + 1Xi) for a particular value of X. These deviations are the error in fitting the line and are often called the error term.
  18. Exhibit 18-14 Exhibit 18-14 contains a new data set for the wine price example. Our prediction of Y from X must now account for the fact that the X and Y pairs do not fall neatly along the line. Exhibit 18-14, shown in the slide, suggests two alternatives based on visual inspection. The method of least squares allows us to find a regression line, or line of best fit, which will keep these errors to a minimum. It uses the criterion of minimizing the total squared errors of estimate. When we predict values of Y for each Xi, the difference between the actual Yi and the predicted Y is the error. This error is squared and then summed. The line of best fit is the one that minimizes the total squared errors of prediction. Use formula from page 549 Regression coefficients 0 and 1 are used to find the least-squares solution. They are computed as follows:
  19. Exhibit 18-15 Substituting data from Exhibit 18-14 into both formulas, we get Use formula from page 524 Before drawing the regression line, we select two values of X to compute. Using values 13 and 24 for Xi, the points are Use formula from page 000
  20. Exhibit 18-16 A residual is what remains after the line is fit. When standardized, residuals are comparable to Z scores with a mean of 0 and a standard deviation of 1. In this plot, the standardized residuals should fall between 2 and -2, be randomly distributed about zero, and show no discernible pattern. All these conditions say the model is applied appropriately.
  21. Exhibit 18-17 Prediction and confidence bands are bow-tie shaped confidence interval around a predictor. Predictors farther from the mean have larger bandwidths. If we wanted to predict the price of a case of investment-grade red wine for a growing season that averages 21 degrees Celsius, our prediction would be Use formula from page 552 This is a point prediction Y and should be corrected for greater precision. As with other confidence estimates, we establish the degree of confidence desired and substitute into the formula Use formula from page 552
  22. With the regression line plotted, next we evaluate the goodness of fit. Goodness of fit is a measure of how well the regression model is able to predict Y. The most important test in bivariate linear regression is whether the slope, 1 , is equal to zero. Zero slopes result from the conditions listed in the slide. To test whether the slope is equal to zero, we use a two-tailed test. The test follows the t distribution for n-2 degrees of freedom. In this case, we reject the null hypothesis because the calculated t is greater than any t value for 8 degrees of freedom and an alpha of .01.
  23. Exhibit 18-18 Computer printouts generally contain an analysis of variance (ANOVA) table with an F test of the regression model. In bivariate regression, t and F tests produce the same results since t2 is equal to F. In multiple regression, the F test has an overall role for the model and each of the independent variables is evaluated in a separate t-test. For regression, the ANOVA comprises explained deviations and unexplained deviations. Together, they constitute total deviation. This is shown graphically in the exhibit in the slide. These sources of deviation are squared for all observations and summed across the data points.
  24. Exhibit 18-19 In Exhibit 18-19, we develop this concept sequentially, concluding with the F test of the regression model for the wine data. Based on the results presented in that table, we find evidence of a linear relationship. The null hypothesis is rejected.
  25. Exhibit 18-20 Nominal measures are used to assess the strength of relationships in cross-classification tables. They are often used with chi-square. Exhibit 18-20 reports a 2 x 2 table showing the test of an advertising campaign involving 66 people The variables are success of the campaign and whether direct mail was used. In this example, the observed significance level is less than the testing level (.05) and the null hypothesis is rejected. A correction to chi-square is provided. The exhibit also provides an approximate significance of the coefficient based on the chi-square distribution. This is a test of the null hypothesis that no relationship exists between the variables of direct mail and campaign success. Phi is used with chi-square and is a measure of association for nominal, nonparametric variables. Phi ranges from 0 to +1. It is calculated as Cramer’s V is also used with chi-square and is a measure of association for nominal variables with larger than 2 x 2 tables. It is calculated as The contingency coefficient C is a not comparable to other measures and has a different upper limit for various table sizes. C is calculated as
  26. Exhibit 18-21Proportional reduction in error (PRE) statistics are the second type used with contingency tables. Lambda and tau are the examples discussed in the text. Lambda is a measure of how well the frequencies of one nominal variable predict the frequencies of another variable. Exhibit 18-21 shows result from an opinion survey with a sample of 400 (see highlighted portion of exhibit) shareholders in publicly traded advertising firms. If asked to predict the opinions of an individual in the sample, we would achieve the best prediction by choosing the mode. However, by doing so, we would be wrong 180 out of 400 times. Lambda shows us that we improve the prediction by 39% if we include information about a respondent’s occupational status. The formula for Lambda is This figure is also highlighted. Goodman and Kruskal’s tau uses table marginals to reduce prediction errors. In predicting opinion in the example without knowledge of occupational class, we would expect a 50.5% correct classification and a 49.5% probability of error. When additional knowledge of occupational class is used, information for correct classification of the opinion variable is improved to 62.7% with a 37.3% probability of error. Tau is computed as
  27. Exhibit 18-22 When data require ordinal measures, we may use Gamma, Kendall’s tau b and c, Somers’s d, and Spearman’s rho. All but Spearman’s rank-order correlation are based on the concept of concordant and discordant pairs, which are discussed further on the following slide. Exhibit 18-22 presents data for 70 employees of KeyDesign who have been evaluated for coronary risk. The management levels are ranked as are the fitness assessments by the physicians. The information in the exhibit has been arranged so that the number of concordant and discordant pairs of individual observations may be calculated. When a subject that ranks higher on one variable also ranks higher on the other, the pairs of observations are said to be concordant. If a higher ranking on one is accompanied by a lower ranking on the other variable, the pairs of observations are discordant. Exhibit 18-23 summarizes the procedure for calculating the summary terms needed. Gamma is a statistic that compares concordant and discordant pairs and then standardizes the outcome by maximizing the value of the denominator. Kendall’s tau b is a refinement of gamma for ordinal data that considers “tied” pairs for square tables. Kendall’s tau c is a refinement of gamma for ordinal data that considers “tied” pairs for any-size table. Somers’s d is a measure of association for ordinal data that compensates for “tied” ranks and adjusts for direction of the independent variable.
  28. Exhibit 18-23Exhibit 18-23
  29. Exhibit 18-24 Spearman’s rho correlates ranks been two ordinal variables. Spearman’s rho has many advantages. When data are transformed by logs or squaring, rho remains unaffected. Outliers or extreme scores are not a threat. It is easy to compute. In the example in the exhibit in the slide KDL, a media firm, is recruiting account executive trainees. Assume the field has been narrowed to 10 applicants for final evaluation. They go through a battery of tests and interviews. The results are evaluated by an industrial psychologist who then ranks the 10 candidates. The executives produce a composite ranking based on the interviews. Now we need to determine how well these two sets of rankings agree. Using the following equation, we can see that there is a strong relationship between the rankings and reject the null. The relationship between the panel’s and the psychologist’s rankings is moderately high, suggesting agreement between the two measures. The test of the null hypothesis that there is no relationship between the measures is rejected at the .05 level with n-2 degrees of freedom.