SlideShare a Scribd company logo
1 of 61
Download to read offline
Statistics One
Lecture 5
Correlation
1
Three segments
•  Overview
•  Calculation of r
•  Assumptions

2
Lecture 5 ~ Segment 1
Correlation: Overview

3
Correlation: Overview
•  Important concepts & topics
–  What is a correlation?
–  What are they used for?
–  Scatterplots
–  CAUTION!
–  Types of correlations
4
Correlation: Overview
•  Correlation
–  A statistical procedure used to measure and
describe the relationship between two variables
–  Correlations can range between +1 and -1
•  +1 is a perfect positive correlation
•  0 is no correlation (independence)
•  -1 is a perfect negative correlation
5
Correlation: Overview
•  When two variables, let’s call them X and Y,
are correlated, then one variable can be used
to predict the other variable
–  More precisely, a person’s score on X can be
used to predict his or her score on Y

6
Correlation: Overview
•  Example:
–  Working memory capacity is strongly correlated
with intelligence, or IQ, in healthy young adults
–  So if we know a person’s IQ then we can predict
how they will do on a test of working memory

7
Correlation: Overview

8
Correlation: Overview
•  CAUTION!
–  Correlation does not imply causation

9
Correlation: Overview
•  CAUTION!
–  The magnitude of a correlation depends upon
many factors, including:
•  Sampling (random and representative?)

10
Correlation: Overview
•  CAUTION!
–  The magnitude of a correlation is also
influenced by:
•  Measurement of X & Y (See Lecture 6)
•  Several other assumptions (See Segment 3)

11
Correlation: Overview
•  For now, consider just one assumption:
–  Random and representative sampling

–  There is a strong correlation between IQ and
working memory among all healthy young
adults.
•  What is the correlation between IQ and working
memory among college graduates?

12
Correlation: Overview

13
Correlation: Overview
•  CAUTION!
•  Finally & perhaps most important:
–  The correlation coefficient is a sample statistic,
just like the mean
•  It may not be representative of ALL individuals
–  For example, in school I scored very high on Math and
Science but below average on Language and History
14
Correlation: Overview

15
Correlation: Overview
•  Note: there are several types of correlation
coefficients, for different variable types
–  Pearson product-moment correlation
coefficient (r)
•  When both variables, X & Y, are continuous

–  Point bi-serial correlation
•  When 1 variable is continuous and 1 is dichotomous
16
Correlation: Overview
•  Note: there are several types of correlation
coefficients
–  Phi coefficient
•  When both variables are dichotomous

–  Spearman rank correlation
•  When both variables are ordinal (ranked data)

17
Segment summary
•  Important concepts/topics
–  What is a correlation?
–  What are they used for?
–  Scatterplots
–  CAUTION!
–  Types of correlations
18
END SEGMENT

19
Lecture 5 ~ Segment 2
Calculation of r

20
Calculation of r
•  Important topics
–  r
•  Pearson product-moment correlation coefficient
–  Raw score formula
–  Z-score formula

–  Sum of cross products (SP) & Covariance

21
Calculation of r
•  r = the degree to which X and Y vary together,
relative to the degree to which X and Y vary
independently
•  r = (Covariance of X & Y) / (Variance of X & Y)

22
Calculation of r
•  Two ways to calculate r
–  Raw score formula
–  Z-score formula

23
Calculation of r
•  Let’s quickly review calculations from
Lecture 4 on summary statistics
•  Variance = SD2 = MS = (SS/N)

24
Linsanity!

25
Jeremy Lin (10 games)
Points	
  per	
  game	
  

(X-­‐M)	
  

(X-­‐M)2	
  

28	
  

5.3	
  

28.09	
  

26	
  

3.3	
  

10.89	
  

10	
  

-­‐12.7	
  

161.29	
  

27	
  

4.3	
  

18.49	
  

20	
  

-­‐2.7	
  

7.29	
  

38	
  

15.3	
  

234.09	
  

23	
  

0.3	
  

0.09	
  

28	
  

5.3	
  

28.09	
  

25	
  

2.3	
  

5.29	
  

2	
  

-­‐20.7	
  

428.49	
  
M	
  =	
  922.1/10	
  =	
  92.21	
  

M	
  =	
  227/10	
  =	
  22.7	
  

M	
  =	
  0/10	
  =	
  0	
  

26
Results
•  M = Mean = 22.7
2 = Variance = MS = SS/N = 92.21
•  SD
•  SD = Standard Deviation = 9.6

27
Just one new concept!
•  SP = Sum of cross Products

28
Just one new concept!
•  Review: To calculate SS
–  For each row, calculate the deviation score
•  (X – Mx)

–  Square the deviation scores
•  (X - Mx)2

–  Sum the squared deviation scores
•  SSx = Σ[(X – Mx)2] = Σ[(X – Mx) x (X – Mx)]
29
Just one new concept!
•  To calculate SP
–  For each row, calculate the deviation score on X
•  (X - Mx)

–  For each row, calculate the deviation score on Y
•  (Y – My)

30
Just one new concept!
•  To calculate SP
–  Then, for each row, multiply the deviation score
on X by the deviation score on Y
•  (X – Mx) x (Y – My)

–  Then, sum the “cross products”
•  SP = Σ[(X – Mx) x (Y – My)]

31
Calculation of r
Raw score formula:	

	

r = SPxy / SQRT(SSx x SSy)	

	


32
Calculation of r
SPxy = Σ[(X - Mx) x (Y - My)]	

	

2 = Σ[(X - M ) x (X - M )]	

SSx = Σ(X - Mx)
x
x
	

SSy = Σ(Y - My)2 = Σ[(Y - My) x (Y - My)]	


	

	

	

	


33
Formulae to calculate r
r = SPxy / SQRT (SSx x SSy)	

	

r = Σ[(X - Mx) x (Y - My)] / 	

2 x Σ(Y - M )2)	

SQRT (Σ(X - Mx)
y
	

	

	

	

	


34
Formulae to calculate r
Z-score formula:	

	

r = Σ(Zx x Zy) / N	

	


35
Formulae to calculate r
Zx = (X - Mx) / SDx	

Zy = (Y - My) / SDy	

	

2 / N)	

SDx = SQRT (Σ(X - Mx)
SDy = SQRT (Σ(Y - My)2 / N)	

	

	

	


36
Formulae to calculate r
Proof of equivalence:	

	

Zx = (X - Mx) / SQRT (Σ(X - Mx)2 / N)	

	

Zy = (Y - My) / SQRT (Σ(Y - My)2 / N)	

	

	

	

37
Formulae to calculate r
r = Σ { [(X - Mx) / SQRT (Σ(X - Mx)2 / N)] x	

[(Y - My) / SQRT (Σ(Y - My)2 / N)] } / N	


	

	

	

	

	

	

	


38
Formulae to calculate r
r = Σ { [(X - Mx) / SQRT (Σ(X - Mx)2 / N)] x	

[(Y - My) / SQRT (Σ(Y - My)2 / N)] } / N	


	

r = Σ [(X - Mx) x (Y - My)] / 	

SQRT ( Σ(X - Mx)2 x Σ(Y - My)2 )	

	

r = SPxy / SQRT (SSx x SSy) ß The raw score formula!	

	


	


39
Variance and covariance
•  Variance = MS = SS / N
•  Covariance = COV = SP / N
•  Correlation is standardized COV
–  Standardized so the value is in the range -1 to 1

40
Note on the denominators
•  Correlation for descriptive statistics
–  Divide by N

•  Correlation for inferential statistics
–  Divide by N – 1

41
Segment summary
•  Important topics
–  r
•  Pearson product-moment correlation coefficient
–  Raw score formula
–  Z-score formula

–  Sum of cross Products (SP) & Covariance

42
END SEGMENT

43
Lecture 5 ~ Segment 3
Assumptions

44
Assumptions
•  Assumptions when interpreting r
–  Normal distributions for X and Y
–  Linear relationship between X and Y
–  Homoscedasticity

45
Assumptions
•  Assumptions when interpreting r
–  Reliability of X and Y
–  Validity of X and Y
–  Random and representative sampling

46
Assumptions
•  Assumptions when interpreting r
–  Normal distributions for X and Y
•  How to detect violations?
–  Plot histograms and examine summary statistics

47
Assumptions
•  Assumptions when interpreting r
–  Linear relationship between X and Y
•  How to detect violation?
–  Examine scatterplots (see following examples)

48
Assumptions
•  Assumptions when interpreting r
–  Homoscedasticity
•  How to detect violation?
–  Examine scatterplots (see following examples)

49
Homoscedasticity
•  In a scatterplot the vertical distance between a dot
and the regression line reflects the amount of
prediction error (known as the “residual”)

50
Homoscedasticity
•  Homoscedasticity means that the distances
(the residuals) are not related to the variable
plotted on the X axis (they are not a function
of X)
•  This is best illustrated with scatterplots
51
Anscombe’s quartet
•  In 1973, statistician Dr. Frank Anscombe
developed a classic example to illustrate
several of the assumptions underlying
correlation and regression

52
Anscombe’s quartet

53
Anscombe’s quartet

54
Anscombe’s quartet

55
Anscombe’s quartet

56
Anscombe’s quartet

57
Segment summary
•  Assumptions when interpreting r
–  Normal distributions for X and Y
–  Linear relationship between X and Y
–  Homoscedasticity

58
Segment summary
•  Assumptions when interpreting r
–  Reliability of X and Y
–  Validity of X and Y
–  Random and representative sampling

59
END SEGMENT

60
END LECTURE 5

61

More Related Content

What's hot

Summary statistics (1)
Summary statistics (1)Summary statistics (1)
Summary statistics (1)Godwin Okley
 
5.5 parallel perp lines
5.5 parallel perp lines5.5 parallel perp lines
5.5 parallel perp linescageke
 
Lecture slides stats1.13.l15.air
Lecture slides stats1.13.l15.airLecture slides stats1.13.l15.air
Lecture slides stats1.13.l15.airatutor_te
 
Arithmetic vs Geometric Series and Sequence
Arithmetic vs Geometric Series  and SequenceArithmetic vs Geometric Series  and Sequence
Arithmetic vs Geometric Series and Sequencechsunny
 
Geometry unit 3.8
Geometry unit 3.8Geometry unit 3.8
Geometry unit 3.8Mark Ryder
 
Chapter 1 arithmetic & geometric sequence
Chapter 1 arithmetic & geometric sequenceChapter 1 arithmetic & geometric sequence
Chapter 1 arithmetic & geometric sequenceNajla Nizam
 
Chapter 5 Slopes of Parallel and Perpendicular Lines
Chapter 5 Slopes of Parallel and Perpendicular LinesChapter 5 Slopes of Parallel and Perpendicular Lines
Chapter 5 Slopes of Parallel and Perpendicular LinesIinternational Program School
 
Geometry unit 3.7
Geometry unit 3.7Geometry unit 3.7
Geometry unit 3.7Mark Ryder
 
Finding Slope Given A Graph And Two Points
Finding Slope Given A Graph And Two PointsFinding Slope Given A Graph And Two Points
Finding Slope Given A Graph And Two PointsGillian Guiang
 
Parallel And Perpendicular Lines
Parallel And Perpendicular LinesParallel And Perpendicular Lines
Parallel And Perpendicular Linesguestd1dc2e
 
13 1 arithmetic and geometric sequences
13 1 arithmetic and geometric sequences13 1 arithmetic and geometric sequences
13 1 arithmetic and geometric sequenceshisema01
 
Parallel and Perpendicular Slopes lines
Parallel and Perpendicular Slopes linesParallel and Perpendicular Slopes lines
Parallel and Perpendicular Slopes linesswartzje
 
Correlational research design
Correlational research designCorrelational research design
Correlational research designRox Wale Equias
 
Linear, Quadratic and Cubic sequences
Linear, Quadratic and Cubic sequencesLinear, Quadratic and Cubic sequences
Linear, Quadratic and Cubic sequencesSmart Exam Resources
 
Chapter 2 part2-Correlation
Chapter 2 part2-CorrelationChapter 2 part2-Correlation
Chapter 2 part2-Correlationnszakir
 
Finding the slope of a line
Finding the slope of a lineFinding the slope of a line
Finding the slope of a lineAhmed Nar
 

What's hot (19)

คาบ 2
คาบ 2คาบ 2
คาบ 2
 
Summary statistics (1)
Summary statistics (1)Summary statistics (1)
Summary statistics (1)
 
5.5 parallel perp lines
5.5 parallel perp lines5.5 parallel perp lines
5.5 parallel perp lines
 
Lecture slides stats1.13.l15.air
Lecture slides stats1.13.l15.airLecture slides stats1.13.l15.air
Lecture slides stats1.13.l15.air
 
Arithmetic vs Geometric Series and Sequence
Arithmetic vs Geometric Series  and SequenceArithmetic vs Geometric Series  and Sequence
Arithmetic vs Geometric Series and Sequence
 
Geometry unit 3.8
Geometry unit 3.8Geometry unit 3.8
Geometry unit 3.8
 
Chapter 1 arithmetic & geometric sequence
Chapter 1 arithmetic & geometric sequenceChapter 1 arithmetic & geometric sequence
Chapter 1 arithmetic & geometric sequence
 
Chapter 5 Slopes of Parallel and Perpendicular Lines
Chapter 5 Slopes of Parallel and Perpendicular LinesChapter 5 Slopes of Parallel and Perpendicular Lines
Chapter 5 Slopes of Parallel and Perpendicular Lines
 
Geometry unit 3.7
Geometry unit 3.7Geometry unit 3.7
Geometry unit 3.7
 
Finding Slope Given A Graph And Two Points
Finding Slope Given A Graph And Two PointsFinding Slope Given A Graph And Two Points
Finding Slope Given A Graph And Two Points
 
Parallel And Perpendicular Lines
Parallel And Perpendicular LinesParallel And Perpendicular Lines
Parallel And Perpendicular Lines
 
13 1 arithmetic and geometric sequences
13 1 arithmetic and geometric sequences13 1 arithmetic and geometric sequences
13 1 arithmetic and geometric sequences
 
Statistics
StatisticsStatistics
Statistics
 
Parallel and Perpendicular Slopes lines
Parallel and Perpendicular Slopes linesParallel and Perpendicular Slopes lines
Parallel and Perpendicular Slopes lines
 
Correlational research design
Correlational research designCorrelational research design
Correlational research design
 
Linear, Quadratic and Cubic sequences
Linear, Quadratic and Cubic sequencesLinear, Quadratic and Cubic sequences
Linear, Quadratic and Cubic sequences
 
Chapter 2 part2-Correlation
Chapter 2 part2-CorrelationChapter 2 part2-Correlation
Chapter 2 part2-Correlation
 
Finding the slope of a line
Finding the slope of a lineFinding the slope of a line
Finding the slope of a line
 
1237
12371237
1237
 

Viewers also liked

Viewers also liked (19)

Bayeasian inference
Bayeasian inferenceBayeasian inference
Bayeasian inference
 
Decision theory
Decision theoryDecision theory
Decision theory
 
Decision theory
Decision theoryDecision theory
Decision theory
 
Decision theory
Decision theoryDecision theory
Decision theory
 
Decision Theory
Decision TheoryDecision Theory
Decision Theory
 
Probability Concepts Applications
Probability Concepts  ApplicationsProbability Concepts  Applications
Probability Concepts Applications
 
DECISION THEORY WITH EXAMPLE
DECISION THEORY WITH EXAMPLEDECISION THEORY WITH EXAMPLE
DECISION THEORY WITH EXAMPLE
 
Ppt on decision theory
Ppt on decision theoryPpt on decision theory
Ppt on decision theory
 
Correlation
CorrelationCorrelation
Correlation
 
Decision theory
Decision theoryDecision theory
Decision theory
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear Regression
 
Spss
SpssSpss
Spss
 
Basic Probability
Basic Probability Basic Probability
Basic Probability
 
Simple linear regressionn and Correlation
Simple linear regressionn and CorrelationSimple linear regressionn and Correlation
Simple linear regressionn and Correlation
 
Bayesian Belief Networks for dummies
Bayesian Belief Networks for dummiesBayesian Belief Networks for dummies
Bayesian Belief Networks for dummies
 
Probability concept and Probability distribution
Probability concept and Probability distributionProbability concept and Probability distribution
Probability concept and Probability distribution
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis ppt
 
Correlation ppt...
Correlation ppt...Correlation ppt...
Correlation ppt...
 
PROBABILITY
PROBABILITYPROBABILITY
PROBABILITY
 

Similar to Lecture slides stats1.13.l05.air

Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regressionjasondroesch
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxkrunal soni
 
Lecture slides stats1.13.l07.air
Lecture slides stats1.13.l07.airLecture slides stats1.13.l07.air
Lecture slides stats1.13.l07.airatutor_te
 
Regression and Co-Relation
Regression and Co-RelationRegression and Co-Relation
Regression and Co-Relationnuwan udugampala
 
3010l8.pdf
3010l8.pdf3010l8.pdf
3010l8.pdfdawitg2
 
Stat 1163 -correlation and regression
Stat 1163 -correlation and regressionStat 1163 -correlation and regression
Stat 1163 -correlation and regressionKhulna University
 
Introduction to statistical concepts
Introduction to statistical conceptsIntroduction to statistical concepts
Introduction to statistical conceptsAya Christeen
 
Jujie and saima introduction of statistical concept
Jujie and saima introduction of statistical conceptJujie and saima introduction of statistical concept
Jujie and saima introduction of statistical conceptJUJIE ATILANO
 
슬로우캠퍼스: scikit-learn & 머신러닝 (강박사)
슬로우캠퍼스:  scikit-learn & 머신러닝 (강박사)슬로우캠퍼스:  scikit-learn & 머신러닝 (강박사)
슬로우캠퍼스: scikit-learn & 머신러닝 (강박사)마이캠퍼스
 
EXERCISE 24 UNDERSTANDING PEARSONS r, EFFECT SIZE, AND PERCEN.docx
EXERCISE 24 UNDERSTANDING PEARSONS r, EFFECT SIZE, AND PERCEN.docxEXERCISE 24 UNDERSTANDING PEARSONS r, EFFECT SIZE, AND PERCEN.docx
EXERCISE 24 UNDERSTANDING PEARSONS r, EFFECT SIZE, AND PERCEN.docxSANSKAR20
 
Lect w8 w9_correlation_regression
Lect w8 w9_correlation_regressionLect w8 w9_correlation_regression
Lect w8 w9_correlation_regressionRione Drevale
 
Regression analysis
Regression analysisRegression analysis
Regression analysisSrikant001p
 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxAnusuya123
 
Unit 4_3 Correlation Regression.pptx
Unit 4_3 Correlation Regression.pptxUnit 4_3 Correlation Regression.pptx
Unit 4_3 Correlation Regression.pptxAppasamiG
 
Unit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfUnit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfRavinandan A P
 
Chi square test evidence based dentistry
Chi square test evidence based dentistryChi square test evidence based dentistry
Chi square test evidence based dentistryPiyushJain163909
 
20151120221133 how to analyze survey research data
20151120221133 how to analyze survey research data20151120221133 how to analyze survey research data
20151120221133 how to analyze survey research dataRohaniza Rejab
 

Similar to Lecture slides stats1.13.l05.air (20)

Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptx
 
Correlations
CorrelationsCorrelations
Correlations
 
Lecture slides stats1.13.l07.air
Lecture slides stats1.13.l07.airLecture slides stats1.13.l07.air
Lecture slides stats1.13.l07.air
 
Regression and Co-Relation
Regression and Co-RelationRegression and Co-Relation
Regression and Co-Relation
 
3010l8.pdf
3010l8.pdf3010l8.pdf
3010l8.pdf
 
Stat 1163 -correlation and regression
Stat 1163 -correlation and regressionStat 1163 -correlation and regression
Stat 1163 -correlation and regression
 
Introduction to statistical concepts
Introduction to statistical conceptsIntroduction to statistical concepts
Introduction to statistical concepts
 
Jujie and saima introduction of statistical concept
Jujie and saima introduction of statistical conceptJujie and saima introduction of statistical concept
Jujie and saima introduction of statistical concept
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
슬로우캠퍼스: scikit-learn & 머신러닝 (강박사)
슬로우캠퍼스:  scikit-learn & 머신러닝 (강박사)슬로우캠퍼스:  scikit-learn & 머신러닝 (강박사)
슬로우캠퍼스: scikit-learn & 머신러닝 (강박사)
 
Research Methodology-Chapter 14
Research Methodology-Chapter 14Research Methodology-Chapter 14
Research Methodology-Chapter 14
 
EXERCISE 24 UNDERSTANDING PEARSONS r, EFFECT SIZE, AND PERCEN.docx
EXERCISE 24 UNDERSTANDING PEARSONS r, EFFECT SIZE, AND PERCEN.docxEXERCISE 24 UNDERSTANDING PEARSONS r, EFFECT SIZE, AND PERCEN.docx
EXERCISE 24 UNDERSTANDING PEARSONS r, EFFECT SIZE, AND PERCEN.docx
 
Lect w8 w9_correlation_regression
Lect w8 w9_correlation_regressionLect w8 w9_correlation_regression
Lect w8 w9_correlation_regression
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptx
 
Unit 4_3 Correlation Regression.pptx
Unit 4_3 Correlation Regression.pptxUnit 4_3 Correlation Regression.pptx
Unit 4_3 Correlation Regression.pptx
 
Unit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfUnit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdf
 
Chi square test evidence based dentistry
Chi square test evidence based dentistryChi square test evidence based dentistry
Chi square test evidence based dentistry
 
20151120221133 how to analyze survey research data
20151120221133 how to analyze survey research data20151120221133 how to analyze survey research data
20151120221133 how to analyze survey research data
 

More from atutor_te

Lecture slides stats1.13.l24.air
Lecture slides stats1.13.l24.airLecture slides stats1.13.l24.air
Lecture slides stats1.13.l24.airatutor_te
 
Lecture slides stats1.13.l22.air
Lecture slides stats1.13.l22.airLecture slides stats1.13.l22.air
Lecture slides stats1.13.l22.airatutor_te
 
Lecture slides stats1.13.l21.air
Lecture slides stats1.13.l21.airLecture slides stats1.13.l21.air
Lecture slides stats1.13.l21.airatutor_te
 
Lecture slides stats1.13.l20.air
Lecture slides stats1.13.l20.airLecture slides stats1.13.l20.air
Lecture slides stats1.13.l20.airatutor_te
 
Lecture slides stats1.13.l19.air
Lecture slides stats1.13.l19.airLecture slides stats1.13.l19.air
Lecture slides stats1.13.l19.airatutor_te
 
Lecture slides stats1.13.l18.air
Lecture slides stats1.13.l18.airLecture slides stats1.13.l18.air
Lecture slides stats1.13.l18.airatutor_te
 
Lecture slides stats1.13.l17.air
Lecture slides stats1.13.l17.airLecture slides stats1.13.l17.air
Lecture slides stats1.13.l17.airatutor_te
 
Lecture slides stats1.13.l16.air
Lecture slides stats1.13.l16.airLecture slides stats1.13.l16.air
Lecture slides stats1.13.l16.airatutor_te
 
Lecture slides stats1.13.l14.air
Lecture slides stats1.13.l14.airLecture slides stats1.13.l14.air
Lecture slides stats1.13.l14.airatutor_te
 
Lecture slides stats1.13.l13.air
Lecture slides stats1.13.l13.airLecture slides stats1.13.l13.air
Lecture slides stats1.13.l13.airatutor_te
 
Lecture slides stats1.13.l12.air
Lecture slides stats1.13.l12.airLecture slides stats1.13.l12.air
Lecture slides stats1.13.l12.airatutor_te
 
Lecture slides stats1.13.l11.air
Lecture slides stats1.13.l11.airLecture slides stats1.13.l11.air
Lecture slides stats1.13.l11.airatutor_te
 
Lecture slides stats1.13.l10.air
Lecture slides stats1.13.l10.airLecture slides stats1.13.l10.air
Lecture slides stats1.13.l10.airatutor_te
 
Lecture slides stats1.13.l09.air
Lecture slides stats1.13.l09.airLecture slides stats1.13.l09.air
Lecture slides stats1.13.l09.airatutor_te
 
Lecture slides stats1.13.l08.air
Lecture slides stats1.13.l08.airLecture slides stats1.13.l08.air
Lecture slides stats1.13.l08.airatutor_te
 
Lecture slides stats1.13.l06.air
Lecture slides stats1.13.l06.airLecture slides stats1.13.l06.air
Lecture slides stats1.13.l06.airatutor_te
 
Lecture slides stats1.13.l04.air
Lecture slides stats1.13.l04.airLecture slides stats1.13.l04.air
Lecture slides stats1.13.l04.airatutor_te
 
Lecture slides stats1.13.l23.air
Lecture slides stats1.13.l23.airLecture slides stats1.13.l23.air
Lecture slides stats1.13.l23.airatutor_te
 
Lecture slides stats1.13.l03.air
Lecture slides stats1.13.l03.airLecture slides stats1.13.l03.air
Lecture slides stats1.13.l03.airatutor_te
 
Lecture slides stats1.13.l02.air
Lecture slides stats1.13.l02.airLecture slides stats1.13.l02.air
Lecture slides stats1.13.l02.airatutor_te
 

More from atutor_te (20)

Lecture slides stats1.13.l24.air
Lecture slides stats1.13.l24.airLecture slides stats1.13.l24.air
Lecture slides stats1.13.l24.air
 
Lecture slides stats1.13.l22.air
Lecture slides stats1.13.l22.airLecture slides stats1.13.l22.air
Lecture slides stats1.13.l22.air
 
Lecture slides stats1.13.l21.air
Lecture slides stats1.13.l21.airLecture slides stats1.13.l21.air
Lecture slides stats1.13.l21.air
 
Lecture slides stats1.13.l20.air
Lecture slides stats1.13.l20.airLecture slides stats1.13.l20.air
Lecture slides stats1.13.l20.air
 
Lecture slides stats1.13.l19.air
Lecture slides stats1.13.l19.airLecture slides stats1.13.l19.air
Lecture slides stats1.13.l19.air
 
Lecture slides stats1.13.l18.air
Lecture slides stats1.13.l18.airLecture slides stats1.13.l18.air
Lecture slides stats1.13.l18.air
 
Lecture slides stats1.13.l17.air
Lecture slides stats1.13.l17.airLecture slides stats1.13.l17.air
Lecture slides stats1.13.l17.air
 
Lecture slides stats1.13.l16.air
Lecture slides stats1.13.l16.airLecture slides stats1.13.l16.air
Lecture slides stats1.13.l16.air
 
Lecture slides stats1.13.l14.air
Lecture slides stats1.13.l14.airLecture slides stats1.13.l14.air
Lecture slides stats1.13.l14.air
 
Lecture slides stats1.13.l13.air
Lecture slides stats1.13.l13.airLecture slides stats1.13.l13.air
Lecture slides stats1.13.l13.air
 
Lecture slides stats1.13.l12.air
Lecture slides stats1.13.l12.airLecture slides stats1.13.l12.air
Lecture slides stats1.13.l12.air
 
Lecture slides stats1.13.l11.air
Lecture slides stats1.13.l11.airLecture slides stats1.13.l11.air
Lecture slides stats1.13.l11.air
 
Lecture slides stats1.13.l10.air
Lecture slides stats1.13.l10.airLecture slides stats1.13.l10.air
Lecture slides stats1.13.l10.air
 
Lecture slides stats1.13.l09.air
Lecture slides stats1.13.l09.airLecture slides stats1.13.l09.air
Lecture slides stats1.13.l09.air
 
Lecture slides stats1.13.l08.air
Lecture slides stats1.13.l08.airLecture slides stats1.13.l08.air
Lecture slides stats1.13.l08.air
 
Lecture slides stats1.13.l06.air
Lecture slides stats1.13.l06.airLecture slides stats1.13.l06.air
Lecture slides stats1.13.l06.air
 
Lecture slides stats1.13.l04.air
Lecture slides stats1.13.l04.airLecture slides stats1.13.l04.air
Lecture slides stats1.13.l04.air
 
Lecture slides stats1.13.l23.air
Lecture slides stats1.13.l23.airLecture slides stats1.13.l23.air
Lecture slides stats1.13.l23.air
 
Lecture slides stats1.13.l03.air
Lecture slides stats1.13.l03.airLecture slides stats1.13.l03.air
Lecture slides stats1.13.l03.air
 
Lecture slides stats1.13.l02.air
Lecture slides stats1.13.l02.airLecture slides stats1.13.l02.air
Lecture slides stats1.13.l02.air
 

Recently uploaded

EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 

Recently uploaded (20)

EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 

Lecture slides stats1.13.l05.air

  • 2. Three segments •  Overview •  Calculation of r •  Assumptions 2
  • 3. Lecture 5 ~ Segment 1 Correlation: Overview 3
  • 4. Correlation: Overview •  Important concepts & topics –  What is a correlation? –  What are they used for? –  Scatterplots –  CAUTION! –  Types of correlations 4
  • 5. Correlation: Overview •  Correlation –  A statistical procedure used to measure and describe the relationship between two variables –  Correlations can range between +1 and -1 •  +1 is a perfect positive correlation •  0 is no correlation (independence) •  -1 is a perfect negative correlation 5
  • 6. Correlation: Overview •  When two variables, let’s call them X and Y, are correlated, then one variable can be used to predict the other variable –  More precisely, a person’s score on X can be used to predict his or her score on Y 6
  • 7. Correlation: Overview •  Example: –  Working memory capacity is strongly correlated with intelligence, or IQ, in healthy young adults –  So if we know a person’s IQ then we can predict how they will do on a test of working memory 7
  • 9. Correlation: Overview •  CAUTION! –  Correlation does not imply causation 9
  • 10. Correlation: Overview •  CAUTION! –  The magnitude of a correlation depends upon many factors, including: •  Sampling (random and representative?) 10
  • 11. Correlation: Overview •  CAUTION! –  The magnitude of a correlation is also influenced by: •  Measurement of X & Y (See Lecture 6) •  Several other assumptions (See Segment 3) 11
  • 12. Correlation: Overview •  For now, consider just one assumption: –  Random and representative sampling –  There is a strong correlation between IQ and working memory among all healthy young adults. •  What is the correlation between IQ and working memory among college graduates? 12
  • 14. Correlation: Overview •  CAUTION! •  Finally & perhaps most important: –  The correlation coefficient is a sample statistic, just like the mean •  It may not be representative of ALL individuals –  For example, in school I scored very high on Math and Science but below average on Language and History 14
  • 16. Correlation: Overview •  Note: there are several types of correlation coefficients, for different variable types –  Pearson product-moment correlation coefficient (r) •  When both variables, X & Y, are continuous –  Point bi-serial correlation •  When 1 variable is continuous and 1 is dichotomous 16
  • 17. Correlation: Overview •  Note: there are several types of correlation coefficients –  Phi coefficient •  When both variables are dichotomous –  Spearman rank correlation •  When both variables are ordinal (ranked data) 17
  • 18. Segment summary •  Important concepts/topics –  What is a correlation? –  What are they used for? –  Scatterplots –  CAUTION! –  Types of correlations 18
  • 20. Lecture 5 ~ Segment 2 Calculation of r 20
  • 21. Calculation of r •  Important topics –  r •  Pearson product-moment correlation coefficient –  Raw score formula –  Z-score formula –  Sum of cross products (SP) & Covariance 21
  • 22. Calculation of r •  r = the degree to which X and Y vary together, relative to the degree to which X and Y vary independently •  r = (Covariance of X & Y) / (Variance of X & Y) 22
  • 23. Calculation of r •  Two ways to calculate r –  Raw score formula –  Z-score formula 23
  • 24. Calculation of r •  Let’s quickly review calculations from Lecture 4 on summary statistics •  Variance = SD2 = MS = (SS/N) 24
  • 26. Jeremy Lin (10 games) Points  per  game   (X-­‐M)   (X-­‐M)2   28   5.3   28.09   26   3.3   10.89   10   -­‐12.7   161.29   27   4.3   18.49   20   -­‐2.7   7.29   38   15.3   234.09   23   0.3   0.09   28   5.3   28.09   25   2.3   5.29   2   -­‐20.7   428.49   M  =  922.1/10  =  92.21   M  =  227/10  =  22.7   M  =  0/10  =  0   26
  • 27. Results •  M = Mean = 22.7 2 = Variance = MS = SS/N = 92.21 •  SD •  SD = Standard Deviation = 9.6 27
  • 28. Just one new concept! •  SP = Sum of cross Products 28
  • 29. Just one new concept! •  Review: To calculate SS –  For each row, calculate the deviation score •  (X – Mx) –  Square the deviation scores •  (X - Mx)2 –  Sum the squared deviation scores •  SSx = Σ[(X – Mx)2] = Σ[(X – Mx) x (X – Mx)] 29
  • 30. Just one new concept! •  To calculate SP –  For each row, calculate the deviation score on X •  (X - Mx) –  For each row, calculate the deviation score on Y •  (Y – My) 30
  • 31. Just one new concept! •  To calculate SP –  Then, for each row, multiply the deviation score on X by the deviation score on Y •  (X – Mx) x (Y – My) –  Then, sum the “cross products” •  SP = Σ[(X – Mx) x (Y – My)] 31
  • 32. Calculation of r Raw score formula: r = SPxy / SQRT(SSx x SSy) 32
  • 33. Calculation of r SPxy = Σ[(X - Mx) x (Y - My)] 2 = Σ[(X - M ) x (X - M )] SSx = Σ(X - Mx) x x SSy = Σ(Y - My)2 = Σ[(Y - My) x (Y - My)] 33
  • 34. Formulae to calculate r r = SPxy / SQRT (SSx x SSy) r = Σ[(X - Mx) x (Y - My)] / 2 x Σ(Y - M )2) SQRT (Σ(X - Mx) y 34
  • 35. Formulae to calculate r Z-score formula: r = Σ(Zx x Zy) / N 35
  • 36. Formulae to calculate r Zx = (X - Mx) / SDx Zy = (Y - My) / SDy 2 / N) SDx = SQRT (Σ(X - Mx) SDy = SQRT (Σ(Y - My)2 / N) 36
  • 37. Formulae to calculate r Proof of equivalence: Zx = (X - Mx) / SQRT (Σ(X - Mx)2 / N) Zy = (Y - My) / SQRT (Σ(Y - My)2 / N) 37
  • 38. Formulae to calculate r r = Σ { [(X - Mx) / SQRT (Σ(X - Mx)2 / N)] x [(Y - My) / SQRT (Σ(Y - My)2 / N)] } / N 38
  • 39. Formulae to calculate r r = Σ { [(X - Mx) / SQRT (Σ(X - Mx)2 / N)] x [(Y - My) / SQRT (Σ(Y - My)2 / N)] } / N r = Σ [(X - Mx) x (Y - My)] / SQRT ( Σ(X - Mx)2 x Σ(Y - My)2 ) r = SPxy / SQRT (SSx x SSy) ß The raw score formula! 39
  • 40. Variance and covariance •  Variance = MS = SS / N •  Covariance = COV = SP / N •  Correlation is standardized COV –  Standardized so the value is in the range -1 to 1 40
  • 41. Note on the denominators •  Correlation for descriptive statistics –  Divide by N •  Correlation for inferential statistics –  Divide by N – 1 41
  • 42. Segment summary •  Important topics –  r •  Pearson product-moment correlation coefficient –  Raw score formula –  Z-score formula –  Sum of cross Products (SP) & Covariance 42
  • 44. Lecture 5 ~ Segment 3 Assumptions 44
  • 45. Assumptions •  Assumptions when interpreting r –  Normal distributions for X and Y –  Linear relationship between X and Y –  Homoscedasticity 45
  • 46. Assumptions •  Assumptions when interpreting r –  Reliability of X and Y –  Validity of X and Y –  Random and representative sampling 46
  • 47. Assumptions •  Assumptions when interpreting r –  Normal distributions for X and Y •  How to detect violations? –  Plot histograms and examine summary statistics 47
  • 48. Assumptions •  Assumptions when interpreting r –  Linear relationship between X and Y •  How to detect violation? –  Examine scatterplots (see following examples) 48
  • 49. Assumptions •  Assumptions when interpreting r –  Homoscedasticity •  How to detect violation? –  Examine scatterplots (see following examples) 49
  • 50. Homoscedasticity •  In a scatterplot the vertical distance between a dot and the regression line reflects the amount of prediction error (known as the “residual”) 50
  • 51. Homoscedasticity •  Homoscedasticity means that the distances (the residuals) are not related to the variable plotted on the X axis (they are not a function of X) •  This is best illustrated with scatterplots 51
  • 52. Anscombe’s quartet •  In 1973, statistician Dr. Frank Anscombe developed a classic example to illustrate several of the assumptions underlying correlation and regression 52
  • 58. Segment summary •  Assumptions when interpreting r –  Normal distributions for X and Y –  Linear relationship between X and Y –  Homoscedasticity 58
  • 59. Segment summary •  Assumptions when interpreting r –  Reliability of X and Y –  Validity of X and Y –  Random and representative sampling 59