SlideShare a Scribd company logo
1 of 30
Download to read offline
SIMPLE LINEAR
REGRESSION
Avjinder Singh Kaler and Kristi Mai
 In the first part of this section we find the equation of the straight line
that best fits the paired sample data. That equation algebraically
describes the relationship between two variables.
 The best-fitting straight line is called a regression line and its equation is
called the regression equation.
Regression Equation – given a collection of paired sample data, the regression
equation that algebraically describes the relationship between the two
variables π‘₯ and 𝑦 is 𝑦 = 𝑏0 + 𝑏1 π‘₯
β€’ The regression equation attempts to describe a relationship between two
variables
β€’ Inherently, the equation algebraically describes how the values of one variable
are somehow associated with the values of the other variable
Regression line – the graph of the regression equation
β€’ Also known as the β€œline of best fit” or the β€œleast square line”
β€’ The regression line fits the sample points best
Notice the 𝑦 in the sample regression equation!
This implies that we are predicting something!
We are predicting values for 𝑦 based upon true and observed values
of π‘₯.
1. The sample of paired data is a simple random sample of quantitative data
2. The pairs of data π‘₯, 𝑦 have a bivariate normal distribution, meaning the
following:
β€’ Visual examination of the scatter plot(s) confirms that the sample points follow
an approximately straight line(s)
β€’ Because results can be strongly affected by the presence of outliers, any outliers
should be removed if they are known to be errors (Note: Use caution when
removing data points)
Slope:
β€’ 𝑏1 = π‘Ÿ βˆ—
𝑆 𝑦
𝑆 π‘₯
where 𝑠 𝑦 and 𝑠 π‘₯ are the standard deviations of
𝑦 values and π‘₯ values
Intercept:
β€’ 𝑏0 = 𝑦 βˆ’ 𝑏1 βˆ— π‘₯
Software will typically be utilized to calculate these
coefficients
We would like to use the explanatory variable, x, shoe print length, to
predict the response variable, y, height.
The data are listed below:
Requirement Check:
1. The data are assumed to be a simple random sample.
2. The scatterplot showed a roughly straight-line pattern.
3. There are no outliers.
The use of technology is recommended for finding the equation of a
regression line.
Using StatCrunch, we obtain the following result:
So then, the regression equation can be expressed as:
Λ† 125 1.73y xο€½ 
Graph the regression equation on a scatterplot: Λ† 125 1.73y xο€½ 
1. Use the regression equation for predictions ONLY if the graph of the regression
line on the scatter plot confirms that the line fits the points reasonably well.
2. Use the equation for predictions ONLY if the data used for prediction does not
go much beyond the scope of the available sample data.
3. Use the equation for prediction ONLY if π‘Ÿ indicates that there is a significant
linear correlation indicated between the two variables, π‘₯ and 𝑦
Notice: If the regression equation does not appear to be useful for predictions,
the best predicted value of a 𝑦 variable is its point estimate [i.e. the sample
mean of the 𝑦 variable would be the best predicted value for that variable]
Marginal change – in working with two variables related by a regression
equation, the marginal change in a variable is the amount that the variable
changes when the other variable changes by exactly one unit
β€’ The slope, 𝑏1, in the regression equation is the marginal change in 𝑦 when π‘₯
changes by one unit
For the 40 pairs of shoe print lengths and heights, the regression equation was:
The slope of 3.22 tells us that if we increase shoe print length by 1 cm, the
predicted height of a person increases by 3.22 cm.
Λ† 80.9 3.22y xο€½ 
Scatter Plot – plot of paired π‘₯, 𝑦 quantitative data with a dot representing
each pair of points
β€’ Helpful in determining whether there is a relationship between two variables
Time Series Graph – quantitative data collected over time and plotted
accordingly with the horizontal axis representing some measure of time
β€’ Outlier – in a scatter plot, an outlier is a point lying far away from the other
data points
β€’ Influential point – a point that strongly affects the graph of the regression line
For the 40 pairs of shoe prints and heights, observe what happens if we include
this additional data point:
x = 35 cm and y = 25 cm
The additional point is an influential point because the graph of the regression
line because the graph of the regression line did change considerably.
The additional point is also an outlier because it is far from the other points.
Residual – for a pair of sample π‘₯ and 𝑦 values, the difference between the
observed sample value of 𝑦 (a true value observed) and the y-value that is
predicted by using the regression equation 𝑦 is the residual
β€’ π‘…π‘’π‘ π‘–π‘‘π‘’π‘Žπ‘™ = π‘‚π‘π‘ π‘’π‘Ÿπ‘£π‘’π‘‘ βˆ’ π‘ƒπ‘Ÿπ‘’π‘‘π‘–π‘π‘‘π‘’π‘‘ = 𝑦 βˆ’ 𝑦
β€’ A residual represents a type of inherent prediction error
β€’ The regression equation does not, typically, pass through all the observed data
values that we have
The Least Squares Property – a straight line satisfies this property if the sum of the
squares of the residuals is the smallest sum possible
Residual Plot – a scatter plot of the π‘₯, 𝑦 values after each of the y-values has
been replaced by the residual value, 𝑦 βˆ’ 𝑦.
β€’ That is, a residual plot is a graph of the points π‘₯, 𝑦 βˆ’ 𝑦 .
When analyzing a residual plot, look for a pattern in the way the points
are configured, and use these criteria:
The residual plot should not have any obvious patterns (not even a
straight line pattern). This confirms that the scatterplot of the sample
data is a straight-line pattern.
The residual plot should not become thicker (or thinner) when viewed
from left to right. This confirms the requirement that for different fixed
values of x, the distributions of the corresponding y values all have the
same standard deviation.
The shoe print and height data are used to generate the following
residual plot:
The residual plot becomes thicker, which suggests that the requirement of
equal standard deviations is violated.
The following slides show three residual plots.
Let’s observe what is good or bad about the individual regression
models.
Regression model is a good model:
Distinct pattern: sample data may not follow a straight-line pattern.
Residual plot becoming thicker: equal standard deviations violated.
1. Construct a scatterplot and verify that the pattern of the points is
approximately a straight-line pattern without outliers. (If there are
outliers, consider their effects by comparing results that include the
outliers to results that exclude the outliers.)
2. Construct a residual plot and verify that there is no pattern (other
than a straight-line pattern) and also verify that the residual plot
does not become thicker (or thinner).
3. Use a histogram and/or normal quantile plot to confirm that the
values of the residuals have a distribution that is approximately
normal.
4. Consider any effects of a pattern over time.

More Related Content

What's hot

Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regressionalok tiwari
Β 
Regression analysis.
Regression analysis.Regression analysis.
Regression analysis.sonia gupta
Β 
Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentationCarlo Magno
Β 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regressionJames Neill
Β 
Regression analysis
Regression analysisRegression analysis
Regression analysissaba khan
Β 
Simple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-StepSimple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-StepDan Wellisch
Β 
Regression
RegressionRegression
RegressionLavanyaK75
Β 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: EstimationParag Shah
Β 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regressionUnsa Shakir
Β 
Correlation and Regression Analysis using SPSS and Microsoft Excel
Correlation and Regression Analysis using SPSS and Microsoft ExcelCorrelation and Regression Analysis using SPSS and Microsoft Excel
Correlation and Regression Analysis using SPSS and Microsoft ExcelSetia Pramana
Β 
Correlation & Regression Analysis using SPSS
Correlation & Regression Analysis  using SPSSCorrelation & Regression Analysis  using SPSS
Correlation & Regression Analysis using SPSSParag Shah
Β 
Statistical Estimation
Statistical Estimation Statistical Estimation
Statistical Estimation Remyagharishs
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression AnalysisSalim Azad
Β 
Mpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationMpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationVasant Kothari
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression AnalysisShiela Vinarao
Β 

What's hot (20)

Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
Β 
Regression analysis.
Regression analysis.Regression analysis.
Regression analysis.
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Β 
Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentation
Β 
Regression
RegressionRegression
Regression
Β 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
Β 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Β 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Β 
Regression
RegressionRegression
Regression
Β 
Simple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-StepSimple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-Step
Β 
Regression
RegressionRegression
Regression
Β 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: Estimation
Β 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regression
Β 
Correlation Analysis
Correlation AnalysisCorrelation Analysis
Correlation Analysis
Β 
Correlation and Regression Analysis using SPSS and Microsoft Excel
Correlation and Regression Analysis using SPSS and Microsoft ExcelCorrelation and Regression Analysis using SPSS and Microsoft Excel
Correlation and Regression Analysis using SPSS and Microsoft Excel
Β 
Correlation & Regression Analysis using SPSS
Correlation & Regression Analysis  using SPSSCorrelation & Regression Analysis  using SPSS
Correlation & Regression Analysis using SPSS
Β 
Statistical Estimation
Statistical Estimation Statistical Estimation
Statistical Estimation
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Β 
Mpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationMpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlation
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Β 

Similar to Simple linear regression

Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxAnusuya123
Β 
Simple Linear Regression.pptx
Simple Linear Regression.pptxSimple Linear Regression.pptx
Simple Linear Regression.pptxAbdalrahmanTahaJaya
Β 
regression.pptx
regression.pptxregression.pptx
regression.pptxRashi Agarwal
Β 
FSE 200AdkinsPage 1 of 10Simple Linear Regression Corr.docx
FSE 200AdkinsPage 1 of 10Simple Linear Regression Corr.docxFSE 200AdkinsPage 1 of 10Simple Linear Regression Corr.docx
FSE 200AdkinsPage 1 of 10Simple Linear Regression Corr.docxbudbarber38650
Β 
Exploring bivariate data
Exploring bivariate dataExploring bivariate data
Exploring bivariate dataUlster BOCES
Β 
Linear Regression
Linear RegressionLinear Regression
Linear RegressionSourajitMaity1
Β 
Chapter05
Chapter05Chapter05
Chapter05rwmiller
Β 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regressionpankaj8108
Β 

Similar to Simple linear regression (20)

Correlation in Statistics
Correlation in StatisticsCorrelation in Statistics
Correlation in Statistics
Β 
rugs koco.pptx
rugs koco.pptxrugs koco.pptx
rugs koco.pptx
Β 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptx
Β 
Simple egression.pptx
Simple egression.pptxSimple egression.pptx
Simple egression.pptx
Β 
Simple Linear Regression.pptx
Simple Linear Regression.pptxSimple Linear Regression.pptx
Simple Linear Regression.pptx
Β 
regression.pptx
regression.pptxregression.pptx
regression.pptx
Β 
FSE 200AdkinsPage 1 of 10Simple Linear Regression Corr.docx
FSE 200AdkinsPage 1 of 10Simple Linear Regression Corr.docxFSE 200AdkinsPage 1 of 10Simple Linear Regression Corr.docx
FSE 200AdkinsPage 1 of 10Simple Linear Regression Corr.docx
Β 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
Β 
Exploring bivariate data
Exploring bivariate dataExploring bivariate data
Exploring bivariate data
Β 
Linear Regression
Linear RegressionLinear Regression
Linear Regression
Β 
Curve fitting
Curve fittingCurve fitting
Curve fitting
Β 
Curve fitting
Curve fittingCurve fitting
Curve fitting
Β 
Chapter05
Chapter05Chapter05
Chapter05
Β 
R nonlinear least square
R   nonlinear least squareR   nonlinear least square
R nonlinear least square
Β 
Linear Regression.pptx
Linear Regression.pptxLinear Regression.pptx
Linear Regression.pptx
Β 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
Β 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inference
Β 
IDS.pdf
IDS.pdfIDS.pdf
IDS.pdf
Β 
Linear relations
   Linear relations   Linear relations
Linear relations
Β 
Regression
RegressionRegression
Regression
Β 

More from Avjinder (Avi) Kaler

Unleashing Real-World Simulations: A Python Tutorial by Avjinder Kaler
Unleashing Real-World Simulations: A Python Tutorial by Avjinder KalerUnleashing Real-World Simulations: A Python Tutorial by Avjinder Kaler
Unleashing Real-World Simulations: A Python Tutorial by Avjinder KalerAvjinder (Avi) Kaler
Β 
Tutorial for Deep Learning Project with Keras
Tutorial for Deep Learning Project  with KerasTutorial for Deep Learning Project  with Keras
Tutorial for Deep Learning Project with KerasAvjinder (Avi) Kaler
Β 
Tutorial for DBSCAN Clustering in Machine Learning
Tutorial for DBSCAN Clustering in Machine LearningTutorial for DBSCAN Clustering in Machine Learning
Tutorial for DBSCAN Clustering in Machine LearningAvjinder (Avi) Kaler
Β 
Python Code for Classification Supervised Machine Learning.pdf
Python Code for Classification Supervised Machine Learning.pdfPython Code for Classification Supervised Machine Learning.pdf
Python Code for Classification Supervised Machine Learning.pdfAvjinder (Avi) Kaler
Β 
Sql tutorial for select, where, order by, null, insert functions
Sql tutorial for select, where, order by, null, insert functionsSql tutorial for select, where, order by, null, insert functions
Sql tutorial for select, where, order by, null, insert functionsAvjinder (Avi) Kaler
Β 
Association mapping identifies loci for canopy coverage in diverse soybean ge...
Association mapping identifies loci for canopy coverage in diverse soybean ge...Association mapping identifies loci for canopy coverage in diverse soybean ge...
Association mapping identifies loci for canopy coverage in diverse soybean ge...Avjinder (Avi) Kaler
Β 
Genome-Wide Association Mapping of Carbon Isotope and Oxygen Isotope Ratios i...
Genome-Wide Association Mapping of Carbon Isotope and Oxygen Isotope Ratios i...Genome-Wide Association Mapping of Carbon Isotope and Oxygen Isotope Ratios i...
Genome-Wide Association Mapping of Carbon Isotope and Oxygen Isotope Ratios i...Avjinder (Avi) Kaler
Β 
Genome-wide association mapping of canopy wilting in diverse soybean genotypes
Genome-wide association mapping of canopy wilting in diverse soybean genotypesGenome-wide association mapping of canopy wilting in diverse soybean genotypes
Genome-wide association mapping of canopy wilting in diverse soybean genotypesAvjinder (Avi) Kaler
Β 
Tutorial for Estimating Broad and Narrow Sense Heritability using R
Tutorial for Estimating Broad and Narrow Sense Heritability using RTutorial for Estimating Broad and Narrow Sense Heritability using R
Tutorial for Estimating Broad and Narrow Sense Heritability using RAvjinder (Avi) Kaler
Β 
Tutorial for Circular and Rectangular Manhattan plots
Tutorial for Circular and Rectangular Manhattan plotsTutorial for Circular and Rectangular Manhattan plots
Tutorial for Circular and Rectangular Manhattan plotsAvjinder (Avi) Kaler
Β 
Genomic Selection with Bayesian Generalized Linear Regression model using R
Genomic Selection with Bayesian Generalized Linear Regression model using RGenomic Selection with Bayesian Generalized Linear Regression model using R
Genomic Selection with Bayesian Generalized Linear Regression model using RAvjinder (Avi) Kaler
Β 
Genome wide association mapping
Genome wide association mappingGenome wide association mapping
Genome wide association mappingAvjinder (Avi) Kaler
Β 
Nutrient availability response to sulfur amendment in histosols having variab...
Nutrient availability response to sulfur amendment in histosols having variab...Nutrient availability response to sulfur amendment in histosols having variab...
Nutrient availability response to sulfur amendment in histosols having variab...Avjinder (Avi) Kaler
Β 
Sugarcane yield and plant nutrient response to sulfur amended everglades hist...
Sugarcane yield and plant nutrient response to sulfur amended everglades hist...Sugarcane yield and plant nutrient response to sulfur amended everglades hist...
Sugarcane yield and plant nutrient response to sulfur amended everglades hist...Avjinder (Avi) Kaler
Β 
R code descriptive statistics of phenotypic data by Avjinder Kaler
R code descriptive statistics of phenotypic data by Avjinder KalerR code descriptive statistics of phenotypic data by Avjinder Kaler
R code descriptive statistics of phenotypic data by Avjinder KalerAvjinder (Avi) Kaler
Β 
Seed rate calculation for experiment
Seed rate calculation for experimentSeed rate calculation for experiment
Seed rate calculation for experimentAvjinder (Avi) Kaler
Β 

More from Avjinder (Avi) Kaler (20)

Unleashing Real-World Simulations: A Python Tutorial by Avjinder Kaler
Unleashing Real-World Simulations: A Python Tutorial by Avjinder KalerUnleashing Real-World Simulations: A Python Tutorial by Avjinder Kaler
Unleashing Real-World Simulations: A Python Tutorial by Avjinder Kaler
Β 
Tutorial for Deep Learning Project with Keras
Tutorial for Deep Learning Project  with KerasTutorial for Deep Learning Project  with Keras
Tutorial for Deep Learning Project with Keras
Β 
Tutorial for DBSCAN Clustering in Machine Learning
Tutorial for DBSCAN Clustering in Machine LearningTutorial for DBSCAN Clustering in Machine Learning
Tutorial for DBSCAN Clustering in Machine Learning
Β 
Python Code for Classification Supervised Machine Learning.pdf
Python Code for Classification Supervised Machine Learning.pdfPython Code for Classification Supervised Machine Learning.pdf
Python Code for Classification Supervised Machine Learning.pdf
Β 
Sql tutorial for select, where, order by, null, insert functions
Sql tutorial for select, where, order by, null, insert functionsSql tutorial for select, where, order by, null, insert functions
Sql tutorial for select, where, order by, null, insert functions
Β 
Kaler et al 2018 euphytica
Kaler et al 2018 euphyticaKaler et al 2018 euphytica
Kaler et al 2018 euphytica
Β 
Association mapping identifies loci for canopy coverage in diverse soybean ge...
Association mapping identifies loci for canopy coverage in diverse soybean ge...Association mapping identifies loci for canopy coverage in diverse soybean ge...
Association mapping identifies loci for canopy coverage in diverse soybean ge...
Β 
Genome-Wide Association Mapping of Carbon Isotope and Oxygen Isotope Ratios i...
Genome-Wide Association Mapping of Carbon Isotope and Oxygen Isotope Ratios i...Genome-Wide Association Mapping of Carbon Isotope and Oxygen Isotope Ratios i...
Genome-Wide Association Mapping of Carbon Isotope and Oxygen Isotope Ratios i...
Β 
Genome-wide association mapping of canopy wilting in diverse soybean genotypes
Genome-wide association mapping of canopy wilting in diverse soybean genotypesGenome-wide association mapping of canopy wilting in diverse soybean genotypes
Genome-wide association mapping of canopy wilting in diverse soybean genotypes
Β 
Tutorial for Estimating Broad and Narrow Sense Heritability using R
Tutorial for Estimating Broad and Narrow Sense Heritability using RTutorial for Estimating Broad and Narrow Sense Heritability using R
Tutorial for Estimating Broad and Narrow Sense Heritability using R
Β 
Tutorial for Circular and Rectangular Manhattan plots
Tutorial for Circular and Rectangular Manhattan plotsTutorial for Circular and Rectangular Manhattan plots
Tutorial for Circular and Rectangular Manhattan plots
Β 
Genomic Selection with Bayesian Generalized Linear Regression model using R
Genomic Selection with Bayesian Generalized Linear Regression model using RGenomic Selection with Bayesian Generalized Linear Regression model using R
Genomic Selection with Bayesian Generalized Linear Regression model using R
Β 
Genome wide association mapping
Genome wide association mappingGenome wide association mapping
Genome wide association mapping
Β 
Nutrient availability response to sulfur amendment in histosols having variab...
Nutrient availability response to sulfur amendment in histosols having variab...Nutrient availability response to sulfur amendment in histosols having variab...
Nutrient availability response to sulfur amendment in histosols having variab...
Β 
Sugarcane yield and plant nutrient response to sulfur amended everglades hist...
Sugarcane yield and plant nutrient response to sulfur amended everglades hist...Sugarcane yield and plant nutrient response to sulfur amended everglades hist...
Sugarcane yield and plant nutrient response to sulfur amended everglades hist...
Β 
R code descriptive statistics of phenotypic data by Avjinder Kaler
R code descriptive statistics of phenotypic data by Avjinder KalerR code descriptive statistics of phenotypic data by Avjinder Kaler
R code descriptive statistics of phenotypic data by Avjinder Kaler
Β 
Population genetics
Population geneticsPopulation genetics
Population genetics
Β 
Quantitative genetics
Quantitative geneticsQuantitative genetics
Quantitative genetics
Β 
Abiotic stresses in plant
Abiotic stresses in plantAbiotic stresses in plant
Abiotic stresses in plant
Β 
Seed rate calculation for experiment
Seed rate calculation for experimentSeed rate calculation for experiment
Seed rate calculation for experiment
Β 

Recently uploaded

Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
Β 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
Β 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
Β 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
Β 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
Β 
18-04-UA_REPORT_MEDIALITERAΠ‘Y_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAΠ‘Y_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAΠ‘Y_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAΠ‘Y_INDEX-DM_23-1-final-eng.pdfssuser54595a
Β 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
Β 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
Β 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
Β 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
Β 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxabhijeetpadhi001
Β 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
Β 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
Β 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
Β 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
Β 
call girls in Kamla Market (DELHI) πŸ” >ΰΌ’9953330565πŸ” genuine Escort Service πŸ”βœ”οΈβœ”οΈ
call girls in Kamla Market (DELHI) πŸ” >ΰΌ’9953330565πŸ” genuine Escort Service πŸ”βœ”οΈβœ”οΈcall girls in Kamla Market (DELHI) πŸ” >ΰΌ’9953330565πŸ” genuine Escort Service πŸ”βœ”οΈβœ”οΈ
call girls in Kamla Market (DELHI) πŸ” >ΰΌ’9953330565πŸ” genuine Escort Service πŸ”βœ”οΈβœ”οΈ9953056974 Low Rate Call Girls In Saket, Delhi NCR
Β 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
Β 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
Β 

Recently uploaded (20)

Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
Β 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
Β 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
Β 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
Β 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
Β 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
Β 
18-04-UA_REPORT_MEDIALITERAΠ‘Y_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAΠ‘Y_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAΠ‘Y_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAΠ‘Y_INDEX-DM_23-1-final-eng.pdf
Β 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
Β 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
Β 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Β 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
Β 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptx
Β 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
Β 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
Β 
Model Call Girl in Bikash Puri Delhi reach out to us at πŸ”9953056974πŸ”
Model Call Girl in Bikash Puri  Delhi reach out to us at πŸ”9953056974πŸ”Model Call Girl in Bikash Puri  Delhi reach out to us at πŸ”9953056974πŸ”
Model Call Girl in Bikash Puri Delhi reach out to us at πŸ”9953056974πŸ”
Β 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
Β 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
Β 
call girls in Kamla Market (DELHI) πŸ” >ΰΌ’9953330565πŸ” genuine Escort Service πŸ”βœ”οΈβœ”οΈ
call girls in Kamla Market (DELHI) πŸ” >ΰΌ’9953330565πŸ” genuine Escort Service πŸ”βœ”οΈβœ”οΈcall girls in Kamla Market (DELHI) πŸ” >ΰΌ’9953330565πŸ” genuine Escort Service πŸ”βœ”οΈβœ”οΈ
call girls in Kamla Market (DELHI) πŸ” >ΰΌ’9953330565πŸ” genuine Escort Service πŸ”βœ”οΈβœ”οΈ
Β 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
Β 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
Β 

Simple linear regression

  • 2.  In the first part of this section we find the equation of the straight line that best fits the paired sample data. That equation algebraically describes the relationship between two variables.  The best-fitting straight line is called a regression line and its equation is called the regression equation.
  • 3. Regression Equation – given a collection of paired sample data, the regression equation that algebraically describes the relationship between the two variables π‘₯ and 𝑦 is 𝑦 = 𝑏0 + 𝑏1 π‘₯ β€’ The regression equation attempts to describe a relationship between two variables β€’ Inherently, the equation algebraically describes how the values of one variable are somehow associated with the values of the other variable Regression line – the graph of the regression equation β€’ Also known as the β€œline of best fit” or the β€œleast square line” β€’ The regression line fits the sample points best
  • 4. Notice the 𝑦 in the sample regression equation! This implies that we are predicting something! We are predicting values for 𝑦 based upon true and observed values of π‘₯.
  • 5. 1. The sample of paired data is a simple random sample of quantitative data 2. The pairs of data π‘₯, 𝑦 have a bivariate normal distribution, meaning the following: β€’ Visual examination of the scatter plot(s) confirms that the sample points follow an approximately straight line(s) β€’ Because results can be strongly affected by the presence of outliers, any outliers should be removed if they are known to be errors (Note: Use caution when removing data points)
  • 6. Slope: β€’ 𝑏1 = π‘Ÿ βˆ— 𝑆 𝑦 𝑆 π‘₯ where 𝑠 𝑦 and 𝑠 π‘₯ are the standard deviations of 𝑦 values and π‘₯ values Intercept: β€’ 𝑏0 = 𝑦 βˆ’ 𝑏1 βˆ— π‘₯ Software will typically be utilized to calculate these coefficients
  • 7. We would like to use the explanatory variable, x, shoe print length, to predict the response variable, y, height. The data are listed below:
  • 8. Requirement Check: 1. The data are assumed to be a simple random sample. 2. The scatterplot showed a roughly straight-line pattern. 3. There are no outliers. The use of technology is recommended for finding the equation of a regression line.
  • 9. Using StatCrunch, we obtain the following result: So then, the regression equation can be expressed as: Λ† 125 1.73y xο€½ 
  • 10. Graph the regression equation on a scatterplot: Λ† 125 1.73y xο€½ 
  • 11. 1. Use the regression equation for predictions ONLY if the graph of the regression line on the scatter plot confirms that the line fits the points reasonably well. 2. Use the equation for predictions ONLY if the data used for prediction does not go much beyond the scope of the available sample data. 3. Use the equation for prediction ONLY if π‘Ÿ indicates that there is a significant linear correlation indicated between the two variables, π‘₯ and 𝑦 Notice: If the regression equation does not appear to be useful for predictions, the best predicted value of a 𝑦 variable is its point estimate [i.e. the sample mean of the 𝑦 variable would be the best predicted value for that variable]
  • 12.
  • 13. Marginal change – in working with two variables related by a regression equation, the marginal change in a variable is the amount that the variable changes when the other variable changes by exactly one unit β€’ The slope, 𝑏1, in the regression equation is the marginal change in 𝑦 when π‘₯ changes by one unit
  • 14. For the 40 pairs of shoe print lengths and heights, the regression equation was: The slope of 3.22 tells us that if we increase shoe print length by 1 cm, the predicted height of a person increases by 3.22 cm. Λ† 80.9 3.22y xο€½ 
  • 15. Scatter Plot – plot of paired π‘₯, 𝑦 quantitative data with a dot representing each pair of points β€’ Helpful in determining whether there is a relationship between two variables
  • 16. Time Series Graph – quantitative data collected over time and plotted accordingly with the horizontal axis representing some measure of time
  • 17. β€’ Outlier – in a scatter plot, an outlier is a point lying far away from the other data points β€’ Influential point – a point that strongly affects the graph of the regression line
  • 18. For the 40 pairs of shoe prints and heights, observe what happens if we include this additional data point: x = 35 cm and y = 25 cm
  • 19. The additional point is an influential point because the graph of the regression line because the graph of the regression line did change considerably. The additional point is also an outlier because it is far from the other points.
  • 20. Residual – for a pair of sample π‘₯ and 𝑦 values, the difference between the observed sample value of 𝑦 (a true value observed) and the y-value that is predicted by using the regression equation 𝑦 is the residual β€’ π‘…π‘’π‘ π‘–π‘‘π‘’π‘Žπ‘™ = π‘‚π‘π‘ π‘’π‘Ÿπ‘£π‘’π‘‘ βˆ’ π‘ƒπ‘Ÿπ‘’π‘‘π‘–π‘π‘‘π‘’π‘‘ = 𝑦 βˆ’ 𝑦 β€’ A residual represents a type of inherent prediction error β€’ The regression equation does not, typically, pass through all the observed data values that we have The Least Squares Property – a straight line satisfies this property if the sum of the squares of the residuals is the smallest sum possible
  • 21.
  • 22. Residual Plot – a scatter plot of the π‘₯, 𝑦 values after each of the y-values has been replaced by the residual value, 𝑦 βˆ’ 𝑦. β€’ That is, a residual plot is a graph of the points π‘₯, 𝑦 βˆ’ 𝑦 .
  • 23. When analyzing a residual plot, look for a pattern in the way the points are configured, and use these criteria: The residual plot should not have any obvious patterns (not even a straight line pattern). This confirms that the scatterplot of the sample data is a straight-line pattern. The residual plot should not become thicker (or thinner) when viewed from left to right. This confirms the requirement that for different fixed values of x, the distributions of the corresponding y values all have the same standard deviation.
  • 24. The shoe print and height data are used to generate the following residual plot: The residual plot becomes thicker, which suggests that the requirement of equal standard deviations is violated.
  • 25. The following slides show three residual plots. Let’s observe what is good or bad about the individual regression models.
  • 26. Regression model is a good model:
  • 27. Distinct pattern: sample data may not follow a straight-line pattern.
  • 28. Residual plot becoming thicker: equal standard deviations violated.
  • 29. 1. Construct a scatterplot and verify that the pattern of the points is approximately a straight-line pattern without outliers. (If there are outliers, consider their effects by comparing results that include the outliers to results that exclude the outliers.) 2. Construct a residual plot and verify that there is no pattern (other than a straight-line pattern) and also verify that the residual plot does not become thicker (or thinner).
  • 30. 3. Use a histogram and/or normal quantile plot to confirm that the values of the residuals have a distribution that is approximately normal. 4. Consider any effects of a pattern over time.