1) Gradient descent is an algorithm used to find the best fit linear regression line for a dataset by minimizing the cost function. 2) The cost function measures the error between predicted and actual values, using mean squared error. 3) Gradient descent works by iteratively calculating the partial derivatives of the cost function with respect to the slope and intercept and adjusting those values in the anti-gradient direction with each iteration until the cost is minimized.