SlideShare a Scribd company logo
Linear RegressionTheory
https://in.linkedin.com/in/sauravmukherjee
What &Why
1
What is Regression?
Formulation of a functional relationship between a set of Independent or
Explanatory variables (X’s) with a Dependent or Response variable (Y).
Y = f(X)
Why Regression?
Knowledge of Y is crucial for decision making.
• Will he/she buy or not?
• Shall I offer him/her the loan or not?
• ………
X is available at the time of decision making and is related to Y, thus making
it possible to have a prediction of Y.
2
Types of Regression
Y
Continuous
E.g., SalesVolume, Claim
Amount, % of sales growth
etc.
Binary (0/1)
E.g., Buy/No-Buy, Survive/Not-
Survive,Win/Loss etc
Ordinary Least Square
(OLS) Regression
Logistic Regression
• Regression analysis is used to:
• Predict the value of a
dependent variable based on
the value of at least one
independent variable
• Explain the impact of changes
in an independent variable on
the dependent variable
• Dependent variable: the
variable we wish to explain,
usually denoted by Y.
• Independent variable: the
variable used to explain the
dependent variable. Usually
denoted by X.
3
Intro to RegressionAnalysis
4
Regression Example
Predict the fitness of a
person based on one or
more parameters.
5
Regression Example
• Only one independent
variable, x
• Relationship between x
and y is described by a
linear function
• Changes in y are
assumed to be caused
by changes in x
6
Simple Linear Regression Model
7
Assumptions for Simple Linear Regression
E(ε) = 0
8
Assumptions for Multiple Regression
9
Assumptions for Multiple Regression
݅
2
ߝ
ଶ
10
i]j0,)ε[E(ε ji ≠=
Assumptions for Multiple Regression
11
Equations for Regression
12
Simple Linear Regression Model
13
Beta Zero
14
Beta One
1 unit
15
ErrorTerm /Residual
16
Regression Line Equation
17
The Simple Linear Regression Model
18
The Multiple Linear Regression Model
19
Model for Multiple Regression
20
Positive Linear Relationship
Negative Linear Relationship
No Relationship
Relationship NOT Linear
Types of Regression Relationships
21
Unknown
Relationship
Population Random Sample
Y Xi i i= + +β β ε0 1
☺ ☺
☺
☺
☺
☺
☺
Population & Sample Regression Models
22
PredictedValue
ofY for Xi
Intercept = β0
Random Error for this x value
Y
X
uXββY 10 ++=
xi
Slope = β1
ui
Individual
person's marks
Population Linear Regression
23
Linear component
Population y
intercept
Population Slope
Coefficient
Random
Error term, or
residual
Dependent
Variable
Independent
Variable
Random Error
component
uXββY 10 ++=
But can we actually get this equation?
If yes what all information we will need?
Population Regression Function
24
PredictedValue
ofY for Xi
Intercept = β0
Random Error for this x value
Y
Xxi
Slope = β1
exbby 10 ++=
ei
ObservedValue
of y for xi
Sample Regression Function
25
exbby 10i ++=
Estimate of the
regression intercept
Estimate of the
regression slope
Independent
variable
Error term
Notice the similarity with the Population Regression Function
Can we do something of the error term?
Sample Regression Function
• Represents the influence of all the variable which
we have not accounted for in the equation
• It represents the difference between the actual y
values as compared the predicted y values from the
Sample Regression Line
• Wouldn't it be good if we were able to reduce this
error term?
• By the way - what are we trying to achieve by
Sample Regression?
26
The ErrorTerm (Residual)
27
HowWell A Model Fits the Data
28
Comparing the Regression Model to a Baseline Model
29
Comparing the Regression Model to a Baseline Model
• The sum of the residuals from the least squares regression line is
zero.
• The sum of the squared residuals is a minimum.
Minimize( )
• The simple regression line always passes through the mean of
the y variable and the mean of the x variable
• The least squares coefficients are unbiased estimates of β0 and
β1
30
0)ˆ( =−∑ yy
2
)ˆ( yy∑ −
OLS Regression Properties
• Parameter Instability - This happens in situations where
correlations change over a period of time.This is very
common in financial markets where economic, tax,
regulatory, and political factors change frequently.
• Public knowledge of a specific regression relation may
cause a large number of people to react in a similar fashion
towards the variables, negating its future usefulness.
• If any of the regression assumptions are violated,
predicted dependent variables and hypothesis tests will not
hold valid.
31
Limitations of RegressionAnalysis
• In simple linear regression, the dependent variable was assumed to be
dependent on only one variable (independent variable)
• In General Multiple Linear Regression model, the dependent variable derives its
value from two or more than two variable.
• General Multiple Linear Regression model take the following form:
where:
Yi = ith observation of dependent variableY
Xki = ith observation of kth independent variable X
b0 = intercept term
bk = slope coefficient of kth independent variable
εi = error term of ith observation
n = number of observations
k = total number of independent variables
32
ikikiii XbXbXbbY ε+++++= .........22110
General Multiple Linear Regression Model
• As we calculated the intercept and the slope coefficient in case of
simple linear regression by minimizing the sum of squared errors,
similarly we estimate the intercept and slope coefficient in multiple
linear regression.
• Sum of Squared Errors is minimized and the slope coefficient is
estimated.
• The resultant estimated equation becomes:
• Now the error in the ith observation can be written as:
33
∑=
n
i
i
1
2
ε
kikiii XbXbXbbY
∧∧∧∧∧
++++= .........22110






++++−=−=
∧∧∧∧∧
kikiiiiii XbXbXbbYYY .........22110ε
Estimated Regression Equation
34
Assumptions of Multiple Regression Model
• There exists a linear relationship between the dependent and
independent variables.
• The expected value of the error term, conditional on the
independent variables is zero.
• The error terms are homoskedastic, i.e. the variance of the
error terms is constant for all the observations.
• The expected value of the product of error terms is always
zero, which implies that the error terms are uncorrelated with
each other.
• The error term is normally distributed.
• The independent variables doesn't have any linear
relationships between each other.
Thank you!

More Related Content

What's hot

Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
pankaj8108
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
DrZahid Khan
 
Lasso and ridge regression
Lasso and ridge regressionLasso and ridge regression
Lasso and ridge regression
SreerajVA
 
Logistic Regression Analysis
Logistic Regression AnalysisLogistic Regression Analysis
Logistic Regression Analysis
COSTARCH Analytical Consulting (P) Ltd.
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
RekhaChoudhary24
 
An Overview of Simple Linear Regression
An Overview of Simple Linear RegressionAn Overview of Simple Linear Regression
An Overview of Simple Linear Regression
Georgian Court University
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
YashwantGahlot1
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionsaba khan
 
Basics of Regression analysis
 Basics of Regression analysis Basics of Regression analysis
Basics of Regression analysis
Mahak Vijayvargiya
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Shiela Vinarao
 
Simple Linier Regression
Simple Linier RegressionSimple Linier Regression
Simple Linier Regressiondessybudiyanti
 
Regression
RegressionRegression
Regression
LavanyaK75
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
DrZahid Khan
 
Bias and variance trade off
Bias and variance trade offBias and variance trade off
Bias and variance trade off
VARUN KUMAR
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
James Neill
 
Bernoullis Random Variables And Binomial Distribution
Bernoullis Random Variables And Binomial DistributionBernoullis Random Variables And Binomial Distribution
Bernoullis Random Variables And Binomial Distribution
mathscontent
 
Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)
MikeBlyth
 
Simple linear regression (final)
Simple linear regression (final)Simple linear regression (final)
Simple linear regression (final)Harsh Upadhyay
 

What's hot (20)

Regression
RegressionRegression
Regression
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Lasso and ridge regression
Lasso and ridge regressionLasso and ridge regression
Lasso and ridge regression
 
Logistic Regression Analysis
Logistic Regression AnalysisLogistic Regression Analysis
Logistic Regression Analysis
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
An Overview of Simple Linear Regression
An Overview of Simple Linear RegressionAn Overview of Simple Linear Regression
An Overview of Simple Linear Regression
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Basics of Regression analysis
 Basics of Regression analysis Basics of Regression analysis
Basics of Regression analysis
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Simple Linier Regression
Simple Linier RegressionSimple Linier Regression
Simple Linier Regression
 
Regression
RegressionRegression
Regression
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Bias and variance trade off
Bias and variance trade offBias and variance trade off
Bias and variance trade off
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
 
Bernoullis Random Variables And Binomial Distribution
Bernoullis Random Variables And Binomial DistributionBernoullis Random Variables And Binomial Distribution
Bernoullis Random Variables And Binomial Distribution
 
Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Simple linear regression (final)
Simple linear regression (final)Simple linear regression (final)
Simple linear regression (final)
 

Similar to Linear regression theory

Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis pptElkana Rorio
 
Linear Regression and Logistic Regression in ML
Linear Regression and Logistic Regression in MLLinear Regression and Logistic Regression in ML
Linear Regression and Logistic Regression in ML
Kumud Arora
 
Introduction to Limited Dependent variable
Introduction to Limited Dependent variableIntroduction to Limited Dependent variable
Introduction to Limited Dependent variable
Ashok Dsouza
 
Regression analysis
Regression analysisRegression analysis
Regression analysissaba khan
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
EkoGaniarto
 
REGRESSION METasdfghjklmjhgftrHODS1.pptx
REGRESSION METasdfghjklmjhgftrHODS1.pptxREGRESSION METasdfghjklmjhgftrHODS1.pptx
REGRESSION METasdfghjklmjhgftrHODS1.pptx
cajativ595
 
MModule 1 ppt.pptx
MModule 1 ppt.pptxMModule 1 ppt.pptx
MModule 1 ppt.pptx
jyotikumarijyotshna
 
Chapter 15Multiple Regression and Model BuildingCo
Chapter 15Multiple Regression and Model BuildingCoChapter 15Multiple Regression and Model BuildingCo
Chapter 15Multiple Regression and Model BuildingCo
EstelaJeffery653
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
TanyaWadhwani4
 
Regression Analysis.pptx
Regression Analysis.pptxRegression Analysis.pptx
Regression Analysis.pptx
ShivankAggatwal
 
Regression (Linear Regression and Logistic Regression) by Akanksha Bali
Regression (Linear Regression and Logistic Regression) by Akanksha BaliRegression (Linear Regression and Logistic Regression) by Akanksha Bali
Regression (Linear Regression and Logistic Regression) by Akanksha Bali
Akanksha Bali
 
STATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELSSTATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELS
Aneesa K Ayoob
 
Data Analysison Regression
Data Analysison RegressionData Analysison Regression
Data Analysison Regressionjamuga gitulho
 
Detail Study of the concept of Regression model.pptx
Detail Study of the concept of  Regression model.pptxDetail Study of the concept of  Regression model.pptx
Detail Study of the concept of Regression model.pptx
truptikulkarni2066
 
Ders 2 ols .ppt
Ders 2 ols .pptDers 2 ols .ppt
Ders 2 ols .ppt
Ergin Akalpler
 
Regression
RegressionRegression
Regression analysis in Research Methodology.pptx
Regression analysis in Research Methodology.pptxRegression analysis in Research Methodology.pptx
Regression analysis in Research Methodology.pptx
Balamurugan M
 
Group 5 - Regression Analysis.pdf
Group 5 - Regression Analysis.pdfGroup 5 - Regression Analysis.pdf
Group 5 - Regression Analysis.pdf
fahlevet40
 

Similar to Linear regression theory (20)

Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
 
Linear Regression and Logistic Regression in ML
Linear Regression and Logistic Regression in MLLinear Regression and Logistic Regression in ML
Linear Regression and Logistic Regression in ML
 
SPSS
SPSSSPSS
SPSS
 
Introduction to Limited Dependent variable
Introduction to Limited Dependent variableIntroduction to Limited Dependent variable
Introduction to Limited Dependent variable
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
REGRESSION METasdfghjklmjhgftrHODS1.pptx
REGRESSION METasdfghjklmjhgftrHODS1.pptxREGRESSION METasdfghjklmjhgftrHODS1.pptx
REGRESSION METasdfghjklmjhgftrHODS1.pptx
 
MModule 1 ppt.pptx
MModule 1 ppt.pptxMModule 1 ppt.pptx
MModule 1 ppt.pptx
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
Chapter 15Multiple Regression and Model BuildingCo
Chapter 15Multiple Regression and Model BuildingCoChapter 15Multiple Regression and Model BuildingCo
Chapter 15Multiple Regression and Model BuildingCo
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
Regression Analysis.pptx
Regression Analysis.pptxRegression Analysis.pptx
Regression Analysis.pptx
 
Regression (Linear Regression and Logistic Regression) by Akanksha Bali
Regression (Linear Regression and Logistic Regression) by Akanksha BaliRegression (Linear Regression and Logistic Regression) by Akanksha Bali
Regression (Linear Regression and Logistic Regression) by Akanksha Bali
 
STATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELSSTATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELS
 
Data Analysison Regression
Data Analysison RegressionData Analysison Regression
Data Analysison Regression
 
Detail Study of the concept of Regression model.pptx
Detail Study of the concept of  Regression model.pptxDetail Study of the concept of  Regression model.pptx
Detail Study of the concept of Regression model.pptx
 
Ders 2 ols .ppt
Ders 2 ols .pptDers 2 ols .ppt
Ders 2 ols .ppt
 
Regression
RegressionRegression
Regression
 
Regression analysis in Research Methodology.pptx
Regression analysis in Research Methodology.pptxRegression analysis in Research Methodology.pptx
Regression analysis in Research Methodology.pptx
 
Group 5 - Regression Analysis.pdf
Group 5 - Regression Analysis.pdfGroup 5 - Regression Analysis.pdf
Group 5 - Regression Analysis.pdf
 

More from Saurav Mukherjee

Complex C-declarations & typedef
Complex C-declarations & typedefComplex C-declarations & typedef
Complex C-declarations & typedef
Saurav Mukherjee
 
Presentation Skills
Presentation SkillsPresentation Skills
Presentation Skills
Saurav Mukherjee
 
Enterprise Agile Adoption
Enterprise Agile AdoptionEnterprise Agile Adoption
Enterprise Agile Adoption
Saurav Mukherjee
 
Enterprise Data Management - Data Lake - A Perspective
Enterprise Data Management - Data Lake - A PerspectiveEnterprise Data Management - Data Lake - A Perspective
Enterprise Data Management - Data Lake - A Perspective
Saurav Mukherjee
 
Tire Pressure Monitoring System (TPMS) - An Introduction
Tire Pressure Monitoring System (TPMS) - An IntroductionTire Pressure Monitoring System (TPMS) - An Introduction
Tire Pressure Monitoring System (TPMS) - An Introduction
Saurav Mukherjee
 
Competitive positioning and routes to market for a high-technology innovation...
Competitive positioning and routes to market for a high-technology innovation...Competitive positioning and routes to market for a high-technology innovation...
Competitive positioning and routes to market for a high-technology innovation...
Saurav Mukherjee
 

More from Saurav Mukherjee (6)

Complex C-declarations & typedef
Complex C-declarations & typedefComplex C-declarations & typedef
Complex C-declarations & typedef
 
Presentation Skills
Presentation SkillsPresentation Skills
Presentation Skills
 
Enterprise Agile Adoption
Enterprise Agile AdoptionEnterprise Agile Adoption
Enterprise Agile Adoption
 
Enterprise Data Management - Data Lake - A Perspective
Enterprise Data Management - Data Lake - A PerspectiveEnterprise Data Management - Data Lake - A Perspective
Enterprise Data Management - Data Lake - A Perspective
 
Tire Pressure Monitoring System (TPMS) - An Introduction
Tire Pressure Monitoring System (TPMS) - An IntroductionTire Pressure Monitoring System (TPMS) - An Introduction
Tire Pressure Monitoring System (TPMS) - An Introduction
 
Competitive positioning and routes to market for a high-technology innovation...
Competitive positioning and routes to market for a high-technology innovation...Competitive positioning and routes to market for a high-technology innovation...
Competitive positioning and routes to market for a high-technology innovation...
 

Recently uploaded

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 

Recently uploaded (20)

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 

Linear regression theory

  • 2. What &Why 1 What is Regression? Formulation of a functional relationship between a set of Independent or Explanatory variables (X’s) with a Dependent or Response variable (Y). Y = f(X) Why Regression? Knowledge of Y is crucial for decision making. • Will he/she buy or not? • Shall I offer him/her the loan or not? • ……… X is available at the time of decision making and is related to Y, thus making it possible to have a prediction of Y.
  • 3. 2 Types of Regression Y Continuous E.g., SalesVolume, Claim Amount, % of sales growth etc. Binary (0/1) E.g., Buy/No-Buy, Survive/Not- Survive,Win/Loss etc Ordinary Least Square (OLS) Regression Logistic Regression
  • 4. • Regression analysis is used to: • Predict the value of a dependent variable based on the value of at least one independent variable • Explain the impact of changes in an independent variable on the dependent variable • Dependent variable: the variable we wish to explain, usually denoted by Y. • Independent variable: the variable used to explain the dependent variable. Usually denoted by X. 3 Intro to RegressionAnalysis
  • 5. 4 Regression Example Predict the fitness of a person based on one or more parameters.
  • 7. • Only one independent variable, x • Relationship between x and y is described by a linear function • Changes in y are assumed to be caused by changes in x 6 Simple Linear Regression Model
  • 8. 7 Assumptions for Simple Linear Regression E(ε) = 0
  • 10. 9 Assumptions for Multiple Regression ݅ 2 ߝ ଶ
  • 11. 10 i]j0,)ε[E(ε ji ≠= Assumptions for Multiple Regression
  • 18. 17 The Simple Linear Regression Model
  • 19. 18 The Multiple Linear Regression Model
  • 20. 19 Model for Multiple Regression
  • 21. 20 Positive Linear Relationship Negative Linear Relationship No Relationship Relationship NOT Linear Types of Regression Relationships
  • 22. 21 Unknown Relationship Population Random Sample Y Xi i i= + +β β ε0 1 ☺ ☺ ☺ ☺ ☺ ☺ ☺ Population & Sample Regression Models
  • 23. 22 PredictedValue ofY for Xi Intercept = β0 Random Error for this x value Y X uXββY 10 ++= xi Slope = β1 ui Individual person's marks Population Linear Regression
  • 24. 23 Linear component Population y intercept Population Slope Coefficient Random Error term, or residual Dependent Variable Independent Variable Random Error component uXββY 10 ++= But can we actually get this equation? If yes what all information we will need? Population Regression Function
  • 25. 24 PredictedValue ofY for Xi Intercept = β0 Random Error for this x value Y Xxi Slope = β1 exbby 10 ++= ei ObservedValue of y for xi Sample Regression Function
  • 26. 25 exbby 10i ++= Estimate of the regression intercept Estimate of the regression slope Independent variable Error term Notice the similarity with the Population Regression Function Can we do something of the error term? Sample Regression Function
  • 27. • Represents the influence of all the variable which we have not accounted for in the equation • It represents the difference between the actual y values as compared the predicted y values from the Sample Regression Line • Wouldn't it be good if we were able to reduce this error term? • By the way - what are we trying to achieve by Sample Regression? 26 The ErrorTerm (Residual)
  • 28. 27 HowWell A Model Fits the Data
  • 29. 28 Comparing the Regression Model to a Baseline Model
  • 30. 29 Comparing the Regression Model to a Baseline Model
  • 31. • The sum of the residuals from the least squares regression line is zero. • The sum of the squared residuals is a minimum. Minimize( ) • The simple regression line always passes through the mean of the y variable and the mean of the x variable • The least squares coefficients are unbiased estimates of β0 and β1 30 0)ˆ( =−∑ yy 2 )ˆ( yy∑ − OLS Regression Properties
  • 32. • Parameter Instability - This happens in situations where correlations change over a period of time.This is very common in financial markets where economic, tax, regulatory, and political factors change frequently. • Public knowledge of a specific regression relation may cause a large number of people to react in a similar fashion towards the variables, negating its future usefulness. • If any of the regression assumptions are violated, predicted dependent variables and hypothesis tests will not hold valid. 31 Limitations of RegressionAnalysis
  • 33. • In simple linear regression, the dependent variable was assumed to be dependent on only one variable (independent variable) • In General Multiple Linear Regression model, the dependent variable derives its value from two or more than two variable. • General Multiple Linear Regression model take the following form: where: Yi = ith observation of dependent variableY Xki = ith observation of kth independent variable X b0 = intercept term bk = slope coefficient of kth independent variable εi = error term of ith observation n = number of observations k = total number of independent variables 32 ikikiii XbXbXbbY ε+++++= .........22110 General Multiple Linear Regression Model
  • 34. • As we calculated the intercept and the slope coefficient in case of simple linear regression by minimizing the sum of squared errors, similarly we estimate the intercept and slope coefficient in multiple linear regression. • Sum of Squared Errors is minimized and the slope coefficient is estimated. • The resultant estimated equation becomes: • Now the error in the ith observation can be written as: 33 ∑= n i i 1 2 ε kikiii XbXbXbbY ∧∧∧∧∧ ++++= .........22110       ++++−=−= ∧∧∧∧∧ kikiiiiii XbXbXbbYYY .........22110ε Estimated Regression Equation
  • 35. 34 Assumptions of Multiple Regression Model • There exists a linear relationship between the dependent and independent variables. • The expected value of the error term, conditional on the independent variables is zero. • The error terms are homoskedastic, i.e. the variance of the error terms is constant for all the observations. • The expected value of the product of error terms is always zero, which implies that the error terms are uncorrelated with each other. • The error term is normally distributed. • The independent variables doesn't have any linear relationships between each other.