1. When the bivariate data are displayed on the ๐ฅ๐ฆ coordinate plane by using a scatter plot, each data point
represents two variables. One of these two variables is the INDEPENDENT VARIABLE and the other one is
DEPENDENT VARIABLE.
THE BEST-FIT LINE
A scatter plot displays a group of data points that may appear to be following a straight line pointing either
upward to the right or to the left. The straight line that best illustrates the trend or direction that the data
points seem to follow is called the BEST-FIT LINE or LINE OF BEST FIT. This line of best fit can be estimated
from two data points on the scatter plot.
In finding the equation of the line of best fit, you can use the equation of the regression line or simply
regression equation formula.
The equation of the regression line is ๐
ฬ = ๐ + ๐๐, where:
๐ = ๐ โ ๐๐๐๐๐๐๐๐๐ ๐๐ ๐๐๐ ๐๐๐๐๐๐๐๐๐๐ ๐๐๐๐
๐ = ๐๐๐๐๐ ๐๐ ๐๐๐ ๐๐๐๐๐๐๐๐๐๐ ๐๐๐๐
The y-intercept can be computed using the following formula:
๐ =
(โ ๐)(โ ๐๐
) โ (โ ๐)(โ ๐๐)
๐(โ ๐๐) โ (โ ๐)๐
The x-intercept can be computed using the following formula:
๐ =
๐(โ ๐๐) โ (โ ๐)(โ ๐)
๐(โ ๐๐) โ (โ ๐)๐
EXAMPLE 1: Consider the following data
a) Find the equation of the regression line using the table below.
X 1 2 3 4 5 6 7
y 4 3 8 6 12 10 8
SOLUTION:
๐ ๐ ๐๐ ๐๐
1 4 4 1
2 3 6 4
3 8 24 9
4 6 24 16
5 12 60 25
6 10 60 36
7 8 56 49
โ ๐ฅ = 28 โ ๐ฆ = 51 โ ๐ฅ๐ฆ = 234 โ ๐ฅ2
= 140
2. The y-intercept can be computed using the following formula:
๐ =
(โ ๐)(โ ๐๐
) โ (โ ๐)(โ ๐๐)
๐(โ ๐๐) โ (โ ๐)๐
๐ =
(๐๐)(๐๐๐) โ (๐๐)(๐๐๐)
๐(๐๐๐) โ (๐๐)๐
=
๐, ๐๐๐ โ ๐, ๐๐๐
๐๐๐ โ ๐๐๐
=
๐๐๐
๐๐๐
= ๐
The x-intercept can be computed using the following formula:
๐ =
๐(โ ๐๐) โ (โ ๐)(โ ๐)
๐(โ ๐๐) โ (โ ๐)๐
๐ =
๐(๐๐๐) โ (๐๐)(๐๐)
๐(๐๐๐) โ (๐๐)๐
=
๐, ๐๐๐ โ ๐, ๐๐๐
๐๐๐ โ ๐๐๐
=
๐๐๐
๐๐๐
= ๐. ๐๐๐๐๐ ๐๐ ๐. ๐๐๐
The equation of the regression line is
๐
ฬ = ๐ + ๐๐
๐
ฬ = ๐ + ๐. ๐๐๐๐
EXAMPLE 2: The grades of 7 students in the first and second grading periods are shown below.
a) Find the equation of the regression line.
b) Estimate the grade in the second grading period of a student who received a grade of 88 in
the first grading period.
x 80 78 76 82 84 85 75
y 84 79 75 86 84 77 78
SOLUTION:
๐ ๐ ๐๐ ๐๐
80 84 6,720 6,400
78 79 6,162 6,084
76 75 5,700 5,776
82 86 7,052 6,724
84 84 7,056 7,056
85 77 6,545 7,225
75 78 5,850 5,625
โ ๐ = ๐๐๐ โ ๐ = ๐๐๐ โ ๐๐ = ๐๐, ๐๐๐ โ ๐๐
= ๐๐, ๐๐๐
The y-intercept can be computed using the following formula:
๐ =
(โ ๐)(โ ๐๐
) โ (โ ๐)(โ ๐๐)
๐(โ ๐๐) โ (โ ๐)๐
3. ๐ =
(๐๐๐)(๐๐, ๐๐๐) โ (๐๐๐)(๐๐, ๐๐๐)
๐(๐๐, ๐๐๐) โ (๐๐๐)๐
=
๐๐, ๐๐๐, ๐๐๐ โ ๐๐, ๐๐๐, ๐๐๐
๐๐๐, ๐๐๐ โ ๐๐๐, ๐๐๐
=
๐๐, ๐๐๐
๐๐๐
= ๐๐. ๐๐๐
The x-intercept can be computed using the following formula:
๐ =
๐(โ ๐๐) โ (โ ๐)(โ ๐)
๐(โ ๐๐) โ (โ ๐)๐
๐ =
๐(๐๐, ๐๐๐) โ (๐๐๐)(๐๐๐)
๐(๐๐, ๐๐๐) โ (๐๐๐)๐
=
๐๐๐, ๐๐๐ โ ๐๐๐, ๐๐๐
๐๐๐, ๐๐๐ โ ๐๐๐, ๐๐๐
=
๐๐๐
๐๐๐
= ๐. ๐
The equation of the regression line is
๐
ฬ = ๐ + ๐๐
๐
ฬ = ๐๐. ๐๐๐ + ๐. ๐๐
Estimate the grade in the second grading period of a student who received a grade of 88 in the first grading
period:
๐
ฬ = ๐๐. ๐๐๐ + ๐. ๐๐
๐
ฬ = ๐๐. ๐๐๐ + ๐. ๐(๐๐)
๐
ฬ = ๐๐. ๐๐๐ + ๐๐
๐
ฬ = ๐๐. ๐๐๐ ๐๐ ๐๐. ๐๐
A student whose grade is 88 in the first grading can expect a grade of 84.43 in the second grading.