Example: Linear regression (housing prices)
Overfitting: If we have too many features, the learned hypothesis may fit the training set very well ($J(\theta) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 \approx 0$), but fail to generalize to new examples (predict prices on new examples).
[Figure: three Price vs. Size plots for polynomial fits of degree d = 1 (underfits), d = 2 (fits well), and d = 4 (overfits).]
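To reproduce the three panels, here is a minimal Octave/MATLAB sketch (the data points are invented for illustration). With five training points, the degree-4 polynomial passes through every point exactly, which is precisely the overfit case:

% Invented (size, price) training points for illustration only.
sizes  = [1.0; 1.5; 2.0; 2.5; 3.0];
prices = [2.0; 2.6; 2.9; 3.2; 3.3];
xs = linspace(min(sizes), max(sizes), 100);
degrees = [1, 2, 4];
for k = 1:3
  p = polyfit(sizes, prices, degrees(k));   % least-squares fit of degree d
  subplot(1, 3, k);
  plot(sizes, prices, 'rx', xs, polyval(p, xs), '-');
  title(sprintf('d = %d', degrees(k)));
end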
Addressing overfitting:
[Figure: Price vs. Size with a highly wiggly, overfit curve.]
$x_1$ = size of house
$x_2$ = no. of bedrooms
$x_3$ = no. of floors
$x_4$ = age of house
$x_5$ = average income in neighborhood
$x_6$ = kitchen size
Overfitting is most likely to occur whenever we have a large no. of features and a
small no. of training examples.
Addressing overfitting:
Options:
1. Reduce number of features.
― Manually select which features to keep.
― Model selection algorithm (later in course).
2. Regularization.
― Keep all the features, but reduce magnitude/values of parameters $\theta_j$.
― Works well when we have a lot of features, each of which contributes a bit to predicting $y$.
Regularization.
Small values for parameters $\theta_0, \theta_1, \ldots, \theta_n$
― “Simpler” hypothesis
― Less prone to overfitting
Housing:
― Features: $x_1, x_2, \ldots, x_{100}$
― Parameters: $\theta_0, \theta_1, \ldots, \theta_{100}$
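To make “small parameter values give a simpler hypothesis” concrete, consider a worked example in the spirit of the lecture (the penalty weights of 1000 are illustrative). Heavily penalizing two parameters of a quartic,

$$\min_\theta \; \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + 1000\,\theta_3^2 + 1000\,\theta_4^2,$$

forces $\theta_3 \approx 0$ and $\theta_4 \approx 0$ at the minimum, so $h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2 + \theta_3 x^3 + \theta_4 x^4$ behaves essentially like a quadratic: a simpler, smoother hypothesis.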
Regularization.
$$J(\theta) = \frac{1}{2m}\left[\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + \lambda\sum_{j=1}^{n}\theta_j^2\right]$$

[Figure: Price vs. Size of house; the regularized fit is a smooth curve rather than a wiggly, overfit one.]

$\lambda$ is the regularization parameter. It controls the tradeoff between minimizing the squared error term (fitting the training data well) and minimizing the parameters $\theta_j$ (keeping the hypothesis simple).
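As a concrete reading of this cost, here is a minimal Octave/MATLAB sketch (my own illustration, not from the slides). It assumes X is the m×(n+1) design matrix with a leading column of ones, theta is the vector [θ_0; …; θ_n], and, by the usual convention, θ_0 (theta(1) in 1-based indexing) is not regularized:

function J = regularizedCost(theta, X, y, lambda)
  m = length(y);
  err = X * theta - y;   % h_theta(x^(i)) - y^(i) for all examples at once
  J = (1 / (2 * m)) * (sum(err .^ 2) + lambda * sum(theta(2:end) .^ 2));
end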
In regularized linear regression, we choose $\theta$ to minimize

$$J(\theta) = \frac{1}{2m}\left[\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + \lambda\sum_{j=1}^{n}\theta_j^2\right]$$

What if $\lambda$ is set to an extremely large value (perhaps too large for our problem, say $\lambda = 10^{10}$)?
- Algorithm works fine; setting $\lambda$ to be very large can’t hurt it.
- Algorithm fails to eliminate overfitting.
- Algorithm results in underfitting (fails to fit even the training data well).
- Gradient descent will fail to converge.
With $\lambda$ that large, the penalty term dominates the cost, driving $\theta_1, \theta_2, \ldots, \theta_n \approx 0$ and leaving $h_\theta(x) \approx \theta_0$. So the correct answer above is that the algorithm results in underfitting.

[Figure: Price vs. Size of house; the fit is a flat horizontal line at $\theta_0$, which underfits the data.]
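To see the underfitting numerically, here is a small sketch with invented data; it uses the regularized normal equation, one standard closed form for regularized linear regression (not taken from these slides):

% Invented (size, price) data with quadratic features.
sizes  = [1.0; 1.5; 2.0; 2.5; 3.0];
prices = [2.0; 2.6; 2.9; 3.2; 3.3];
X = [ones(5, 1), sizes, sizes .^ 2];   % design matrix: [1, x, x^2]
L = eye(3);  L(1, 1) = 0;              % do not penalize theta_0
for lambda = [0, 1e10]
  theta = (X' * X + lambda * L) \ (X' * prices);
  fprintf('lambda = %g: theta = [%g %g %g]\n', lambda, theta);
end
% With lambda = 1e10, theta(2) and theta(3) are ~0 and theta(1) ~ mean(prices),
% so the fit collapses to the flat line h(x) = theta_0.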
Advanced optimization

function [jVal, gradient] = costFunction(theta)
  jVal = [code to compute J(θ)];
  gradient(1) = [code to compute ∂J(θ)/∂θ_0];
  gradient(2) = [code to compute ∂J(θ)/∂θ_1];
  gradient(3) = [code to compute ∂J(θ)/∂θ_2];
  ...
  gradient(n+1) = [code to compute ∂J(θ)/∂θ_n];
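For context, this costFunction is then handed to one of Octave/MATLAB's built-in advanced optimizers such as fminunc. A minimal sketch of the call (assuming n, the number of features, is already defined):

options = optimset('GradObj', 'on', 'MaxIter', 100);  % we supply the gradient; cap iterations
initialTheta = zeros(n + 1, 1);                       % theta_0 through theta_n
[optTheta, functionVal, exitFlag] = fminunc(@costFunction, initialTheta, options);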
Editor's Notes
High variance: a high-order polynomial can represent a large variety of possible hypotheses, and the training dataset is not large enough to constrain it to a good one.
Reducing the no. of features results in a loss of information.
Regularization results in a smoother, less complicated hypothesis.
Model selection includes an automatic choice of the regularization parameter.