Ali Madani
https://www.linkedin.com/in/amlearning/
Introduction to optimization for deep learning
Gradient descent
Obtaining the parameters by stepping against the direction of maximum variation of the cost (its gradient):

θ = θ − η · ∇θ J(θ)

with cost function J(θ) and learning rate η.
Changing the parameters to minimize the cost.
The gradient involves a summation over all data points, so every update is:
● slow
● intractable for large datasets
(see the sketch below)
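To make the update rule concrete, here is a minimal NumPy sketch of full-batch gradient descent on a linear-regression mean-squared-error cost; the model, toy data, and hyperparameters (lr, n_steps) are illustrative assumptions, not from the slides.

```python
import numpy as np

def batch_gradient_descent(X, y, lr=0.1, n_steps=100):
    """Full-batch gradient descent: theta = theta - lr * grad J(theta).

    The gradient is summed over ALL data points on every step,
    which is exactly what makes each update slow on large datasets.
    """
    theta = np.zeros(X.shape[1])
    for _ in range(n_steps):
        # Gradient of J(theta) = (1/2m) ||X @ theta - y||^2 over the full dataset
        grad = X.T @ (X @ theta - y) / len(y)
        theta -= lr * grad  # step against the gradient
    return theta

# Toy usage on assumed synthetic data
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=200)
print(batch_gradient_descent(X, y))  # approximately [1.0, -2.0, 0.5]
```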
Stochastic gradient descent
The same rule, but the gradient is computed from a single training example (x^(i), y^(i)) at a time:

θ = θ − η · ∇θ J(θ; x^(i), y^(i))

with cost function J and learning rate η.
Issue solved: a parameter update is made for each training example.
● The objective function fluctuates
○ The jumps may reach a better local minimum faster
(see the sketch below)
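A minimal sketch of the stochastic variant, under the same assumed linear-regression setup as above; the only change is that each update uses one shuffled example, which is why the objective fluctuates between steps.

```python
import numpy as np

def stochastic_gradient_descent(X, y, lr=0.01, n_epochs=10):
    """SGD: theta = theta - lr * grad J(theta; x_i, y_i).

    One cheap, noisy update per training example; the fluctuations
    can jump the parameters toward a better local minimum faster.
    """
    theta = np.zeros(X.shape[1])
    rng = np.random.default_rng(0)
    for _ in range(n_epochs):
        for i in rng.permutation(len(y)):  # reshuffle every epoch
            # Gradient from the single example (x_i, y_i)
            grad = (X[i] @ theta - y[i]) * X[i]
            theta -= lr * grad
    return theta
```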
Comparison of gradient descent and stochastic gradient descent
Mini-batch gradient descent
Let's take the middle ground between the two:

θ = θ − η · ∇θ J(θ; x^(i:i+n), y^(i:i+n))

with cost function J and learning rate η.
Update over mini-batches of n training examples (see the sketch below).
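A minimal sketch of the mini-batch variant under the same assumed setup; the batch size n = 32 is an illustrative choice, not from the slides.

```python
import numpy as np

def minibatch_gradient_descent(X, y, lr=0.05, n=32, n_epochs=20):
    """Mini-batch GD: theta = theta - lr * grad J(theta; x_(i:i+n), y_(i:i+n)).

    The middle ground: averaging the gradient over n examples keeps
    updates cheap while damping the fluctuations of pure SGD.
    """
    theta = np.zeros(X.shape[1])
    rng = np.random.default_rng(0)
    for _ in range(n_epochs):
        idx = rng.permutation(len(y))  # reshuffle every epoch
        for start in range(0, len(y), n):
            batch = idx[start:start + n]
            Xb, yb = X[batch], y[batch]
            # Average gradient over the current mini-batch
            grad = Xb.T @ (Xb @ theta - yb) / len(batch)
            theta -= lr * grad
    return theta
```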
Please share what you learned with others
