1. CS771: Intro to ML
Gradient descent algorithm
• Gradient descent is an optimization algorithm used to
minimize a function.
• The function to be minimized is called the objective
function.
• In machine learning, the objective function is also termed the cost
function or loss function.
• A common loss function is the mean squared difference between actual
values and predictions.
2. Gradient descent algorithm
• Gradient descent is an optimization algorithm used to
minimize some function by iteratively moving in the
direction of steepest descent.
• In machine learning, we use gradient descent to
update the parameters of our model. In linear regression,
for example, the parameters are the coefficients.
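As a minimal sketch of this idea, the following fits a linear regression model by gradient descent on a mean-squared-error loss. The toy data, learning rate, and iteration count are illustrative assumptions, not values from the slides.

```python
import numpy as np

# Toy data: y = 2x + 1 plus a little noise (illustrative, not from the slides)
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=50)
y = 2.0 * X + 1.0 + 0.1 * rng.normal(size=50)

w, b = 0.0, 0.0   # the model's parameters (coefficients)
lr = 0.1          # learning rate

for _ in range(500):
    y_hat = w * X + b
    # Gradients of the mean squared error with respect to w and b
    dw = 2.0 * np.mean((y_hat - y) * X)
    db = 2.0 * np.mean(y_hat - y)
    # Step in the direction of steepest descent (the negative gradient)
    w -= lr * dw
    b -= lr * db

print(w, b)  # should end up close to the true values 2.0 and 1.0
```

Each iteration nudges the coefficients opposite to the gradient of the loss, which is exactly the "iteratively moving in the direction of steepest descent" described above.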
5. Learning rate
• The size of these steps is called the learning rate.
• With a high learning rate we cover more ground each step, but we
risk overshooting the lowest point, since the slope of the hill is
constantly changing.
• A low learning rate is more precise, but progress is slow and the
gradient must be recomputed at every step, so it can take a very long
time to reach the bottom.
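This trade-off can be seen on a toy objective f(w) = w², whose gradient is 2w. The three step sizes below are illustrative choices, not values from the slides.

```python
# Minimize f(w) = w**2 (gradient 2w), starting from w = 1.0.
def descend(lr, steps=20, w=1.0):
    for _ in range(steps):
        w -= lr * 2.0 * w   # one gradient-descent step
    return w

print(descend(0.01))  # low rate: still far from the minimum at 0 after 20 steps
print(descend(0.4))   # moderate rate: converges to (almost) 0 quickly
print(descend(1.1))   # high rate: each step overshoots and |w| grows (diverges)
```

With a too-large step the update overshoots the minimum and lands on the other side, farther away than where it started, so the iterates diverge instead of converging.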
10. Local & Global Minima, Maxima

[Figure: a function 𝑓(𝑥) plotted against 𝑥, showing the global maximum, several local maxima, several local minima, and the global minimum.]
11. The tangent is perfectly horizontal at the local minima and maxima.
13. Derivatives

How the derivative itself changes tells us about the function's optima.
The second derivative 𝑓′′(𝑥) can provide this information.

First-derivative test, at a point 𝑥 where 𝑓′(𝑥) = 0:
• 𝑓′ > 0 just before 𝑥 and 𝑓′ < 0 just after 𝑥: 𝑥 is a maximum
• 𝑓′ < 0 just before 𝑥 and 𝑓′ > 0 just after 𝑥: 𝑥 is a minimum
• 𝑓′ = 0 just before and just after 𝑥: 𝑥 may be a saddle

Second-derivative test:
• 𝑓′(𝑥) = 0 and 𝑓′′(𝑥) < 0: 𝑥 is a maximum
• 𝑓′(𝑥) = 0 and 𝑓′′(𝑥) > 0: 𝑥 is a minimum
• 𝑓′(𝑥) = 0 and 𝑓′′(𝑥) = 0: 𝑥 may be a saddle; may need higher derivatives
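The second-derivative test can be checked numerically. The function f(x) = x⁴ − 2x² used here is an illustrative example (not from the slides) with stationary points at x = −1, 0, and 1, and its second derivative is estimated by a central difference.

```python
# Classify the stationary points of f(x) = x**4 - 2*x**2 (at x = -1, 0, 1)
# using a numerical estimate of the second derivative.
def f(x):
    return x**4 - 2 * x**2

def d2f(x, h=1e-4):
    # Central-difference approximation of f''(x)
    return (f(x + h) - 2 * f(x) + f(x - h)) / h**2

for x in (-1.0, 0.0, 1.0):
    kind = "minimum" if d2f(x) > 0 else "maximum" if d2f(x) < 0 else "inconclusive"
    print(x, kind)   # f'' = 12x**2 - 4: minima at x = -1 and 1, maximum at x = 0
```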
15. Saddle Points
• Points where the derivative is zero but which are neither minima nor maxima.
• The second or a higher derivative may help identify whether a stationary point is a saddle.
• A saddle is a point of inflection where the derivative is also zero.
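A standard illustrative example (not from the slides) is f(x) = x³, which has a saddle at x = 0: the derivative is zero there, the second derivative is zero as well, and the first derivative does not change sign on either side.

```python
# f(x) = x**3: stationary at x = 0, but neither a minimum nor a maximum.
def fprime(x):
    return 3 * x**2      # derivative of x**3

def fsecond(x):
    return 6 * x         # second derivative of x**3

print(fprime(0.0))                           # 0.0: a stationary point
print(fsecond(0.0))                          # 0.0: second-derivative test inconclusive
print(fprime(-0.1) > 0, fprime(0.1) > 0)     # derivative positive on both sides: a saddle
```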
16. Gradient Descent: An Illustration
[Figure: the loss 𝐿(𝒘) plotted against 𝒘, with gradient descent iterates 𝒘(0), 𝒘(1), 𝒘(2), 𝒘(3) reaching the optimum 𝒘∗ in one case and getting stuck at a local minimum in the other.]

• Where the gradient is negative (𝛿𝐿/𝛿𝑤 < 0), we move in the positive direction.
• Where the gradient is positive, we move in the negative direction.
• The learning rate is very important.
• Good initialization is very important.
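The effect of initialization can be sketched on a toy loss with two minima; the function f(w) = (w² − 1)² + 0.3w and the step size below are illustrative assumptions, not taken from the slides. It has a global minimum near w ≈ −1 and a shallower local minimum near w ≈ +1.

```python
# Gradient descent on f(w) = (w**2 - 1)**2 + 0.3*w from two starting points.
def grad(w):
    return 4 * w * (w**2 - 1) + 0.3   # derivative of the toy loss

def run(w, lr=0.01, steps=2000):
    for _ in range(steps):
        w -= lr * grad(w)
    return w

print(run(-0.5))  # converges near the global minimum (w close to -1)
print(run(+0.5))  # gets stuck near the local minimum (w close to +1)
```

Both runs use the same learning rate; only the starting point differs, which is why a good initialization matters when the loss has several minima.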