SlideShare a Scribd company logo
INTRODUCTION TO
STATISTICS & PROBABILITY
Chapter 2:
Looking at Data–Relationships (Part 2)
Dr. Nahid Sultana
1
2
Chapter 2:
Looking at Data–Relationships
2.1: Scatterplots
2.2: Correlation
2.3: Least-Squares Regression
2.5: Data Analysis for Two-Way Tables
Objectives
 The correlation coefficient “r”
 r does not distinguish between x and y
 r has no units of measurement
 r ranges from -1 to +1
 Influential points
2.2: Correlation
3
The correlation coefficient "r"
 The correlation coefficient is a measure of the direction and
strength of a linear relationship.
 It is calculated using the mean and the standard deviation of both
the x and y variables.
 Correlation can only be used to describe quantitative variables.
Categorical variables don’t have means and standard deviations.
4
The correlation coefficient “r“ (Cont…)
Time to swim: = 35, sx = 0.7
Pulse rate: = 140, sy = 9.5
x
y
5
r =
1
n −1
xi − x
sx






i=1
n
∑
yi − y
sy






 Suppose that we have data
on variables x and y for n
individuals.
 The means and standard
deviations of the two variables
are and for the x-values,
and and for y-values.
 The correlation r between x
and y
x
y
“r” does not distinguish x & y
The correlation coefficient, r,
treats x and y symmetrically.
"Time to swim" is the explanatory variable here, and belongs on
the x axis. However, in either plot r is the same (r=-0.75).
r = -0.75 r = -0.75
r =
1
n −1
xi − x
sx






i=1
n
∑
yi − y
sy






6
Changing the units of variables does
not change the correlation coefficient
"r“.
"r" has no unit r = -0.75
r = -0.75
7
standardized
value of x
(unit less)
standardized
value of y
(unit less)
"r" ranges from -1 to +1
Properties of Correlation
 r is always a no. between –1 and 1.
 r > 0 indicates a positive association.
r < 0 indicates a negative association.
 Values of r near 0 indicate a very
weak linear relationship.
 The strength of the linear relationship
increases as r moves away from 0
toward –1 or 1.
 The extreme values r = –1 and r = 1
occur only in the case of a perfect
linear relationship.
8
9
“r” increases as variation decreases
When variability in
one or both variables
decreases, the
correlation coefficient
gets stronger
( closer to +1 or -1).
Correlation only describes linear
relationships
10
No matter how strong the association,
r does not describe curved relationships.
11
Influential points
Correlations are calculated using
means and standard deviations,
and thus are NOT resistant to
outliers.
Just moving one point away from
the general trend here decreases
the correlation from -0.91 to -
0.75
12
12
Influential points (Cont…)

More Related Content

What's hot

Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
mejikpg
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis ppt
Anil Mishra
 
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Neeraj Bhandari
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and RegressionShubham Mehta
 
Correlation & Regression
Correlation & RegressionCorrelation & Regression
Correlation & RegressionGrant Heller
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
jasondroesch
 
Presentation on regression analysis
Presentation on regression analysisPresentation on regression analysis
Presentation on regression analysis
Sujeet Singh
 
Correlation & regression uwsb (3)
Correlation & regression   uwsb (3)Correlation & regression   uwsb (3)
Correlation & regression uwsb (3)Arnab Roy Chowdhury
 
Chapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares RegressionChapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares Regression
nszakir
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
Rajat Sharma
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
Ram Kumar Shah "Struggler"
 
Karl pearson's correlation
Karl pearson's correlationKarl pearson's correlation
Karl pearson's correlation
fairoos1
 
Correlation 2
Correlation 2Correlation 2
Correlation 2
KanishkJaiswal6
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
ASAD ALI
 
Karl pearson's coefficient of correlation
Karl pearson's coefficient of correlationKarl pearson's coefficient of correlation
Karl pearson's coefficient of correlation
teenathankachen1993
 

What's hot (17)

Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis ppt
 
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Correlation & Regression
Correlation & RegressionCorrelation & Regression
Correlation & Regression
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Presentation on regression analysis
Presentation on regression analysisPresentation on regression analysis
Presentation on regression analysis
 
Correlation & regression uwsb (3)
Correlation & regression   uwsb (3)Correlation & regression   uwsb (3)
Correlation & regression uwsb (3)
 
Regression
RegressionRegression
Regression
 
Chapter 14 Part I
Chapter 14 Part IChapter 14 Part I
Chapter 14 Part I
 
Chapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares RegressionChapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares Regression
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Karl pearson's correlation
Karl pearson's correlationKarl pearson's correlation
Karl pearson's correlation
 
Correlation 2
Correlation 2Correlation 2
Correlation 2
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Karl pearson's coefficient of correlation
Karl pearson's coefficient of correlationKarl pearson's coefficient of correlation
Karl pearson's coefficient of correlation
 

Similar to Chapter 2 part2-Correlation

correlation.ppt
correlation.pptcorrelation.ppt
correlation.ppt
NayanPatil59
 
correlationppt-111222215110-phpapp02.pdf
correlationppt-111222215110-phpapp02.pdfcorrelationppt-111222215110-phpapp02.pdf
correlationppt-111222215110-phpapp02.pdf
KrishnaVamsiMuthinen
 
12943625.ppt
12943625.ppt12943625.ppt
12943625.ppt
MokayceLimited
 
CORRELATION ( srm1) - Copy.pptx
CORRELATION ( srm1) - Copy.pptxCORRELATION ( srm1) - Copy.pptx
CORRELATION ( srm1) - Copy.pptx
VaishnaviElumalai
 
Correlation analysis notes
Correlation analysis notesCorrelation analysis notes
Correlation analysis notes
Japheth Muthama
 
A correlation analysis.ppt 2018
A correlation analysis.ppt 2018A correlation analysis.ppt 2018
A correlation analysis.ppt 2018
DrRavindraKumarSaini
 
Co re
Co reCo re
CORRELATION AND REGRESSION.pptx
CORRELATION AND REGRESSION.pptxCORRELATION AND REGRESSION.pptx
CORRELATION AND REGRESSION.pptx
Rohit77460
 
Regression and Co-Relation
Regression and Co-RelationRegression and Co-Relation
Regression and Co-Relation
nuwan udugampala
 
Correlation 3rd
Correlation 3rdCorrelation 3rd
Correlation 3rd
Forensic Pathology
 
Statistics
Statistics Statistics
Statistics
KafiPati
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
Mohit Asija
 
Quantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA ProgramQuantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA Program
Mohamed Farouk, CFA, CFTe I
 
Exploring bivariate data
Exploring bivariate dataExploring bivariate data
Exploring bivariate dataUlster BOCES
 
Pearson product moment correlation
Pearson product moment correlationPearson product moment correlation
Pearson product moment correlationSharlaine Ruth
 
Correlation
CorrelationCorrelation
Correlation
Anjali Awasthi
 
Correlation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptxCorrelation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptx
HamdiMichaelCC
 
correlation.final.ppt (1).pptx
correlation.final.ppt (1).pptxcorrelation.final.ppt (1).pptx
correlation.final.ppt (1).pptx
ChieWoo1
 

Similar to Chapter 2 part2-Correlation (20)

correlation.ppt
correlation.pptcorrelation.ppt
correlation.ppt
 
Correlation
CorrelationCorrelation
Correlation
 
correlationppt-111222215110-phpapp02.pdf
correlationppt-111222215110-phpapp02.pdfcorrelationppt-111222215110-phpapp02.pdf
correlationppt-111222215110-phpapp02.pdf
 
12943625.ppt
12943625.ppt12943625.ppt
12943625.ppt
 
CORRELATION ( srm1) - Copy.pptx
CORRELATION ( srm1) - Copy.pptxCORRELATION ( srm1) - Copy.pptx
CORRELATION ( srm1) - Copy.pptx
 
Correlation analysis notes
Correlation analysis notesCorrelation analysis notes
Correlation analysis notes
 
A correlation analysis.ppt 2018
A correlation analysis.ppt 2018A correlation analysis.ppt 2018
A correlation analysis.ppt 2018
 
Co re
Co reCo re
Co re
 
CORRELATION AND REGRESSION.pptx
CORRELATION AND REGRESSION.pptxCORRELATION AND REGRESSION.pptx
CORRELATION AND REGRESSION.pptx
 
Regression and Co-Relation
Regression and Co-RelationRegression and Co-Relation
Regression and Co-Relation
 
Correlation 3rd
Correlation 3rdCorrelation 3rd
Correlation 3rd
 
Statistics
Statistics Statistics
Statistics
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Quantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA ProgramQuantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA Program
 
Exploring bivariate data
Exploring bivariate dataExploring bivariate data
Exploring bivariate data
 
Pearson product moment correlation
Pearson product moment correlationPearson product moment correlation
Pearson product moment correlation
 
S2 pb
S2 pbS2 pb
S2 pb
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptxCorrelation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptx
 
correlation.final.ppt (1).pptx
correlation.final.ppt (1).pptxcorrelation.final.ppt (1).pptx
correlation.final.ppt (1).pptx
 

More from nszakir

Chapter-4: More on Direct Proof and Proof by Contrapositive
Chapter-4: More on Direct Proof and Proof by ContrapositiveChapter-4: More on Direct Proof and Proof by Contrapositive
Chapter-4: More on Direct Proof and Proof by Contrapositive
nszakir
 
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVEChapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
nszakir
 
Chapter 2: Relations
Chapter 2: RelationsChapter 2: Relations
Chapter 2: Relations
nszakir
 
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
nszakir
 
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
nszakir
 
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
nszakir
 
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
nszakir
 
Chapter 5 part1- The Sampling Distribution of a Sample Mean
Chapter 5 part1- The Sampling Distribution of a Sample MeanChapter 5 part1- The Sampling Distribution of a Sample Mean
Chapter 5 part1- The Sampling Distribution of a Sample Mean
nszakir
 
Chapter 4 part4- General Probability Rules
Chapter 4 part4- General Probability RulesChapter 4 part4- General Probability Rules
Chapter 4 part4- General Probability Rules
nszakir
 
Chapter 4 part3- Means and Variances of Random Variables
Chapter 4 part3- Means and Variances of Random VariablesChapter 4 part3- Means and Variances of Random Variables
Chapter 4 part3- Means and Variances of Random Variables
nszakir
 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variables
nszakir
 
Chapter 4 part1-Probability Model
Chapter 4 part1-Probability ModelChapter 4 part1-Probability Model
Chapter 4 part1-Probability Model
nszakir
 
Chapter 3 part3-Toward Statistical Inference
Chapter 3 part3-Toward Statistical InferenceChapter 3 part3-Toward Statistical Inference
Chapter 3 part3-Toward Statistical Inference
nszakir
 
Chapter 3 part2- Sampling Design
Chapter 3 part2- Sampling DesignChapter 3 part2- Sampling Design
Chapter 3 part2- Sampling Designnszakir
 
Chapter 3 part1-Design of Experiments
Chapter 3 part1-Design of ExperimentsChapter 3 part1-Design of Experiments
Chapter 3 part1-Design of Experiments
nszakir
 
Chapter 2 part1-Scatterplots
Chapter 2 part1-ScatterplotsChapter 2 part1-Scatterplots
Chapter 2 part1-Scatterplots
nszakir
 
Density Curves and Normal Distributions
Density Curves and Normal DistributionsDensity Curves and Normal Distributions
Density Curves and Normal Distributions
nszakir
 
Describing Distributions with Numbers
Describing Distributions with NumbersDescribing Distributions with Numbers
Describing Distributions with Numbers
nszakir
 
Displaying Distributions with Graphs
Displaying Distributions with GraphsDisplaying Distributions with Graphs
Displaying Distributions with Graphs
nszakir
 

More from nszakir (19)

Chapter-4: More on Direct Proof and Proof by Contrapositive
Chapter-4: More on Direct Proof and Proof by ContrapositiveChapter-4: More on Direct Proof and Proof by Contrapositive
Chapter-4: More on Direct Proof and Proof by Contrapositive
 
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVEChapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
 
Chapter 2: Relations
Chapter 2: RelationsChapter 2: Relations
Chapter 2: Relations
 
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
 
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
 
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
 
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
 
Chapter 5 part1- The Sampling Distribution of a Sample Mean
Chapter 5 part1- The Sampling Distribution of a Sample MeanChapter 5 part1- The Sampling Distribution of a Sample Mean
Chapter 5 part1- The Sampling Distribution of a Sample Mean
 
Chapter 4 part4- General Probability Rules
Chapter 4 part4- General Probability RulesChapter 4 part4- General Probability Rules
Chapter 4 part4- General Probability Rules
 
Chapter 4 part3- Means and Variances of Random Variables
Chapter 4 part3- Means and Variances of Random VariablesChapter 4 part3- Means and Variances of Random Variables
Chapter 4 part3- Means and Variances of Random Variables
 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variables
 
Chapter 4 part1-Probability Model
Chapter 4 part1-Probability ModelChapter 4 part1-Probability Model
Chapter 4 part1-Probability Model
 
Chapter 3 part3-Toward Statistical Inference
Chapter 3 part3-Toward Statistical InferenceChapter 3 part3-Toward Statistical Inference
Chapter 3 part3-Toward Statistical Inference
 
Chapter 3 part2- Sampling Design
Chapter 3 part2- Sampling DesignChapter 3 part2- Sampling Design
Chapter 3 part2- Sampling Design
 
Chapter 3 part1-Design of Experiments
Chapter 3 part1-Design of ExperimentsChapter 3 part1-Design of Experiments
Chapter 3 part1-Design of Experiments
 
Chapter 2 part1-Scatterplots
Chapter 2 part1-ScatterplotsChapter 2 part1-Scatterplots
Chapter 2 part1-Scatterplots
 
Density Curves and Normal Distributions
Density Curves and Normal DistributionsDensity Curves and Normal Distributions
Density Curves and Normal Distributions
 
Describing Distributions with Numbers
Describing Distributions with NumbersDescribing Distributions with Numbers
Describing Distributions with Numbers
 
Displaying Distributions with Graphs
Displaying Distributions with GraphsDisplaying Distributions with Graphs
Displaying Distributions with Graphs
 

Chapter 2 part2-Correlation

  • 1. INTRODUCTION TO STATISTICS & PROBABILITY Chapter 2: Looking at Data–Relationships (Part 2) Dr. Nahid Sultana 1
  • 2. 2 Chapter 2: Looking at Data–Relationships 2.1: Scatterplots 2.2: Correlation 2.3: Least-Squares Regression 2.5: Data Analysis for Two-Way Tables
  • 3. Objectives  The correlation coefficient “r”  r does not distinguish between x and y  r has no units of measurement  r ranges from -1 to +1  Influential points 2.2: Correlation 3
  • 4. The correlation coefficient "r"  The correlation coefficient is a measure of the direction and strength of a linear relationship.  It is calculated using the mean and the standard deviation of both the x and y variables.  Correlation can only be used to describe quantitative variables. Categorical variables don’t have means and standard deviations. 4
  • 5. The correlation coefficient “r“ (Cont…) Time to swim: = 35, sx = 0.7 Pulse rate: = 140, sy = 9.5 x y 5 r = 1 n −1 xi − x sx       i=1 n ∑ yi − y sy        Suppose that we have data on variables x and y for n individuals.  The means and standard deviations of the two variables are and for the x-values, and and for y-values.  The correlation r between x and y x y
  • 6. “r” does not distinguish x & y The correlation coefficient, r, treats x and y symmetrically. "Time to swim" is the explanatory variable here, and belongs on the x axis. However, in either plot r is the same (r=-0.75). r = -0.75 r = -0.75 r = 1 n −1 xi − x sx       i=1 n ∑ yi − y sy       6
  • 7. Changing the units of variables does not change the correlation coefficient "r“. "r" has no unit r = -0.75 r = -0.75 7 standardized value of x (unit less) standardized value of y (unit less)
  • 8. "r" ranges from -1 to +1 Properties of Correlation  r is always a no. between –1 and 1.  r > 0 indicates a positive association. r < 0 indicates a negative association.  Values of r near 0 indicate a very weak linear relationship.  The strength of the linear relationship increases as r moves away from 0 toward –1 or 1.  The extreme values r = –1 and r = 1 occur only in the case of a perfect linear relationship. 8
  • 9. 9 “r” increases as variation decreases When variability in one or both variables decreases, the correlation coefficient gets stronger ( closer to +1 or -1).
  • 10. Correlation only describes linear relationships 10 No matter how strong the association, r does not describe curved relationships.
  • 11. 11 Influential points Correlations are calculated using means and standard deviations, and thus are NOT resistant to outliers. Just moving one point away from the general trend here decreases the correlation from -0.91 to - 0.75