SlideShare a Scribd company logo
1 of 28
Download to read offline
Introduction to Data Science
Dr. Irfan Yousuf
Department of Computer Science (New Campus)
UET, Lahore
(Lecture # 5; September 21, 2022)
Outline
•Linear Regression Analysis
Relations Between Variables
•Functional Relation
•Statistical Relation
Functional Relation
•A functional relation between two variables is a perfect
relation where the value of the dependent variable is
uniquely determined by the value of the independent
variable.
•It is expressed by a mathematical formula.
Functional Relation
Statistical Relation
•A statistical relation between two variables is a relation
where the value of the dependent variable is NOT
uniquely determined when the level of the independent
variable is specified.
•If the values of a variable Y increase or decrease when the
values of a variable X change, there is a statistical
relationship between Variable Y and Variable X.
•It is not an exact relation.
• Examples:
• The relation between age and income
• The relation between income and expenditures
Statistical Relation
Scatter Plot
•A scatter plot (aka scatter chart, scatter graph) uses dots to
represent values for two different numeric variables.
•The position of each dot on the horizontal and vertical axis
indicates values for an individual data point.
•Scatter plots are used to observe relationships between
variables.
Scatter Plot
Regression Analysis
•Regression analysis provides a method of estimating an
average relation (often linear) between two or more variables.
•Regressor: The variable that forms the basis of estimation or
prediction (aka predictor variable or independent variable).
•Regressand: The variable whose value depend on the
independent variable is called a regressand (aka response
variable, predictand variable or dependent variable).
Regression Models
•Regression models describe the relationship between
variables by fitting a line to the observed data.
•Linear regression models use a straight line, while logistic
and nonlinear regression models use a curved line.
• Regression allows you to estimate how a dependent variable
changes as the independent variable(s) change.
Simple Linear Regression
Simple Linear Regression
- Best-fit Line
- Least Squares Method or Least Squares Regression
Simple Linear Regression
- Best-fit Line
- Least Squares Method or Least Squares Regression
Simple Linear Regression: Example
x y
14.2 215
16.4 325
11.9 185
15.2 332
18.5 406
22.1 522
19.4 412
25.1 614
23.4 544
18.1 421
22.6 445
17.2 408 b = 30.08
a = -159.4
Simple Linear Regression: Example
Residuals
- The residuals are the differences between the observed and
predicted values.
- It measures the distance from the regression line (predicted
value) and the actual observed value. In other words, it
helps us to measure error, or how well our regression line
“fits” our data.
Outliers
R-squared (Goodness of Fit)
- After fitting a linear regression model, you need to
determine how well the model fits the data.
- R-squared is a goodness-of-fit measure for linear
regression models.
- This statistic indicates the percentage of the variance in
the dependent variable that the independent variables
explain collectively (or that is predictable from the
independent variable).
- R-squared measures the strength of the relationship
between your model and the dependent variable on a
convenient 0 – 100% scale.
- It is also called Coefficient of Determination.
R-squared (Goodness of Fit)
x y Predicted (Y)
14.2 215 267.78
16.4 325 333.97
11.9 185 198.58
15.2 332 297.87
18.5 406 397.16
22.1 522 505.47
19.4 412 424.24
25.1 614 595.74
23.4 544 544.59
18.1 421 385.12
22.6 445 520.52
17.2 408 358.04
Y= 30.08X – 159.4
R2 = 0.9168
yi = Actual Value
y-hat = Predicted Value
y-bar = Actual Mean
Correlation
- The direction and strength of pairwise relationships
between two or more numeric variables.
- -1: Perfect negative correlation. The variables tend to
move in opposite directions (i.e., when one variable
increases, the other variable decreases).
- 0: No correlation. The variables do not have a relationship
with each other.
- 1: Perfect positive correlation. The variables tend to
move in the same direction (i.e., when one variable
increases, the other variable also increases).
Correlation
Correlation
r = 0.9575
How to Interpret Correlation?
- Strength: The greater the absolute value of the correlation
coefficient, the stronger the relationship.
- Direction: The sign of the correlation coefficient
represents the direction of the relationship.
How to Interpret Correlation?
- The correlation, denoted by r, measures the amount of
linear association between two variables.
- r is always between -1 and 1 inclusive.
- The R-squared value, denoted by R2, is the square of the
correlation. It measures the proportion of variation in
the dependent variable that can be attributed to the
independent variable.
- The R-squared value R2 is always between 0 and 1
inclusive.
Correlation vs. Regression
- Correlation is a statistical measure that quantifies the
direction and strength of the relationship between two
numeric variables.
- Regression is a statistical technique that predicts the value
of the dependent variable Y based on the known value of
the independent variable X through an equation.
Drawbacks of Linear Regression
- Linear regression only looks at linear relationships between
dependent and independent variables.
- Linear regression looks at a relationship between the mean
of the dependent variable and the independent variables.
- Linear regression is sensitive to outliers
Summary
•Regression Analysis

More Related Content

Similar to IDS.pdf

Correlation and Regression.pptx
Correlation and Regression.pptxCorrelation and Regression.pptx
Correlation and Regression.pptxJayaprakash985685
 
Stat 1163 -correlation and regression
Stat 1163 -correlation and regressionStat 1163 -correlation and regression
Stat 1163 -correlation and regressionKhulna University
 
Correlation.pptx
Correlation.pptxCorrelation.pptx
Correlation.pptxIloveBepis
 
Dependance Technique, Regression & Correlation
Dependance Technique, Regression & Correlation Dependance Technique, Regression & Correlation
Dependance Technique, Regression & Correlation Qasim Raza
 
Correlation analysis notes
Correlation analysis notesCorrelation analysis notes
Correlation analysis notesJapheth Muthama
 
Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Lauren Crosby
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionAntony Raj
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionAntony Raj
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysisAwais Salman
 
Regression analysis in R
Regression analysis in RRegression analysis in R
Regression analysis in RAlichy Sowmya
 
how to select the appropriate method for our study of interest
how to select the appropriate method for our study of interest how to select the appropriate method for our study of interest
how to select the appropriate method for our study of interest NurFathihaTahiatSeeu
 
Linear regression analysis
Linear regression analysisLinear regression analysis
Linear regression analysisNimrita Koul
 

Similar to IDS.pdf (20)

Correlation and Regression.pptx
Correlation and Regression.pptxCorrelation and Regression.pptx
Correlation and Regression.pptx
 
Stat 1163 -correlation and regression
Stat 1163 -correlation and regressionStat 1163 -correlation and regression
Stat 1163 -correlation and regression
 
Correlation.pptx
Correlation.pptxCorrelation.pptx
Correlation.pptx
 
Research Methodology-Chapter 14
Research Methodology-Chapter 14Research Methodology-Chapter 14
Research Methodology-Chapter 14
 
Dependance Technique, Regression & Correlation
Dependance Technique, Regression & Correlation Dependance Technique, Regression & Correlation
Dependance Technique, Regression & Correlation
 
Correlation analysis notes
Correlation analysis notesCorrelation analysis notes
Correlation analysis notes
 
Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011
 
Statistics ppt
Statistics pptStatistics ppt
Statistics ppt
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
2-20-04.ppt
2-20-04.ppt2-20-04.ppt
2-20-04.ppt
 
Quantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA ProgramQuantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA Program
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation
CorrelationCorrelation
Correlation
 
Measure of Association
Measure of AssociationMeasure of Association
Measure of Association
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Regression analysis in R
Regression analysis in RRegression analysis in R
Regression analysis in R
 
RMBS - CORRELATION.pptx
RMBS - CORRELATION.pptxRMBS - CORRELATION.pptx
RMBS - CORRELATION.pptx
 
how to select the appropriate method for our study of interest
how to select the appropriate method for our study of interest how to select the appropriate method for our study of interest
how to select the appropriate method for our study of interest
 
Linear regression analysis
Linear regression analysisLinear regression analysis
Linear regression analysis
 

Recently uploaded

Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 

Recently uploaded (20)

Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 

IDS.pdf

  • 1. Introduction to Data Science Dr. Irfan Yousuf Department of Computer Science (New Campus) UET, Lahore (Lecture # 5; September 21, 2022)
  • 3. Relations Between Variables •Functional Relation •Statistical Relation
  • 4. Functional Relation •A functional relation between two variables is a perfect relation where the value of the dependent variable is uniquely determined by the value of the independent variable. •It is expressed by a mathematical formula.
  • 6. Statistical Relation •A statistical relation between two variables is a relation where the value of the dependent variable is NOT uniquely determined when the level of the independent variable is specified. •If the values of a variable Y increase or decrease when the values of a variable X change, there is a statistical relationship between Variable Y and Variable X. •It is not an exact relation. • Examples: • The relation between age and income • The relation between income and expenditures
  • 8. Scatter Plot •A scatter plot (aka scatter chart, scatter graph) uses dots to represent values for two different numeric variables. •The position of each dot on the horizontal and vertical axis indicates values for an individual data point. •Scatter plots are used to observe relationships between variables.
  • 10. Regression Analysis •Regression analysis provides a method of estimating an average relation (often linear) between two or more variables. •Regressor: The variable that forms the basis of estimation or prediction (aka predictor variable or independent variable). •Regressand: The variable whose value depend on the independent variable is called a regressand (aka response variable, predictand variable or dependent variable).
  • 11. Regression Models •Regression models describe the relationship between variables by fitting a line to the observed data. •Linear regression models use a straight line, while logistic and nonlinear regression models use a curved line. • Regression allows you to estimate how a dependent variable changes as the independent variable(s) change.
  • 13. Simple Linear Regression - Best-fit Line - Least Squares Method or Least Squares Regression
  • 14. Simple Linear Regression - Best-fit Line - Least Squares Method or Least Squares Regression
  • 15. Simple Linear Regression: Example x y 14.2 215 16.4 325 11.9 185 15.2 332 18.5 406 22.1 522 19.4 412 25.1 614 23.4 544 18.1 421 22.6 445 17.2 408 b = 30.08 a = -159.4
  • 17. Residuals - The residuals are the differences between the observed and predicted values. - It measures the distance from the regression line (predicted value) and the actual observed value. In other words, it helps us to measure error, or how well our regression line “fits” our data.
  • 19. R-squared (Goodness of Fit) - After fitting a linear regression model, you need to determine how well the model fits the data. - R-squared is a goodness-of-fit measure for linear regression models. - This statistic indicates the percentage of the variance in the dependent variable that the independent variables explain collectively (or that is predictable from the independent variable). - R-squared measures the strength of the relationship between your model and the dependent variable on a convenient 0 – 100% scale. - It is also called Coefficient of Determination.
  • 20. R-squared (Goodness of Fit) x y Predicted (Y) 14.2 215 267.78 16.4 325 333.97 11.9 185 198.58 15.2 332 297.87 18.5 406 397.16 22.1 522 505.47 19.4 412 424.24 25.1 614 595.74 23.4 544 544.59 18.1 421 385.12 22.6 445 520.52 17.2 408 358.04 Y= 30.08X – 159.4 R2 = 0.9168 yi = Actual Value y-hat = Predicted Value y-bar = Actual Mean
  • 21. Correlation - The direction and strength of pairwise relationships between two or more numeric variables. - -1: Perfect negative correlation. The variables tend to move in opposite directions (i.e., when one variable increases, the other variable decreases). - 0: No correlation. The variables do not have a relationship with each other. - 1: Perfect positive correlation. The variables tend to move in the same direction (i.e., when one variable increases, the other variable also increases).
  • 24. How to Interpret Correlation? - Strength: The greater the absolute value of the correlation coefficient, the stronger the relationship. - Direction: The sign of the correlation coefficient represents the direction of the relationship.
  • 25. How to Interpret Correlation? - The correlation, denoted by r, measures the amount of linear association between two variables. - r is always between -1 and 1 inclusive. - The R-squared value, denoted by R2, is the square of the correlation. It measures the proportion of variation in the dependent variable that can be attributed to the independent variable. - The R-squared value R2 is always between 0 and 1 inclusive.
  • 26. Correlation vs. Regression - Correlation is a statistical measure that quantifies the direction and strength of the relationship between two numeric variables. - Regression is a statistical technique that predicts the value of the dependent variable Y based on the known value of the independent variable X through an equation.
  • 27. Drawbacks of Linear Regression - Linear regression only looks at linear relationships between dependent and independent variables. - Linear regression looks at a relationship between the mean of the dependent variable and the independent variables. - Linear regression is sensitive to outliers