Supporting slides for Hidden Layers MeetUp (Deep Learning Study Group) - January 31st, 2017
The presentation covers common difficulties encountered when building a Deep Learning model (architecture design, back-propagation, vanishing gradients, etc.)
1. The Art Of Backpropagation
and other Bedtime Deep Learning Stories
Jennifer Prendki, @WalmartLabs
2. Why this talk?
• Deep Learning can solve many problems
• Deep Learning is trendy
• Deep Learning is applied in many different industries
Everybody is using it, or wants to use it
• But many people are using Deep Learning as a black-box
• There is no consistent theory regarding architecture building
3. Context: Neural Nets, Forward & Backward Feeds
• Back to the basics: what are Artificial Neural Nets?
The combination of:
• a training method
• an optimization method
A 2-phase cycle:
• propagation
• weight update
4. Deep Learning Glossary
• Input: the first layer (what is fed to the algorithm, the initial data columns)
• Output: what we want to compute (can be more than one value)
• Hidden layers: the neurons for the intermediate steps
• Forward propagation: propagation of a training pattern's input through the neural network
in order to generate the network's output value(s)
• Backward propagation: propagation of the output activations back through the neural
net, using the training pattern's target, in order to generate the deltas
• Deltas: the differences between the targeted and actual output values of all
output and hidden neurons
• Weight update: the process of multiplying the output delta and input activation
to compute the gradient of the weight.
• Learning rate: the ratio of the weight's gradient that is subtracted from the weight (in symbols below)
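In symbols (my notation, not the deck's), the weight update just described is:

```latex
\Delta w_{ij} = -\eta \, \delta_j \, a_i, \qquad w_{ij} \leftarrow w_{ij} + \Delta w_{ij}
```

where η is the learning rate, δ_j the delta of neuron j, and a_i the input activation feeding weight w_ij.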
5. Backpropagation Algorithm
• Propagation
Forward propagation of a training pattern's input through the neural network in order to
generate the network's output value(s).
Backward propagation of the output activations back through the neural
network using the training pattern's target in order to generate the deltas.
• Weight update
The weight's output delta and input activation are multiplied to find the gradient of the
weight.
A ratio of that gradient (the learning rate) is subtracted from the weight, as sketched in the code below.
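To make the two phases concrete, here is a minimal sketch (mine, not from the deck) of one propagation/weight-update cycle for a tiny network with one hidden layer, sigmoid activations, and a squared-error loss; all variable names are illustrative:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Toy network: 2 inputs -> 3 hidden neurons -> 1 output
rng = np.random.default_rng(0)
W1 = rng.normal(scale=0.5, size=(2, 3))  # input -> hidden weights
W2 = rng.normal(scale=0.5, size=(3, 1))  # hidden -> output weights

x = np.array([[0.5, -1.2]])  # one training pattern (1 x 2)
t = np.array([[1.0]])        # its target output
lr = 0.1                     # learning rate

# --- Propagation phase ---
# Forward pass: generate the network's output value(s)
h = sigmoid(x @ W1)          # hidden activations (1 x 3)
y = sigmoid(h @ W2)          # output activation  (1 x 1)

# Backward pass: generate the deltas
delta_out = (y - t) * y * (1 - y)             # output delta
delta_hid = (delta_out @ W2.T) * h * (1 - h)  # hidden deltas

# --- Weight update phase ---
# gradient = input activation (transposed) times output delta
W2 -= lr * (h.T @ delta_out)
W1 -= lr * (x.T @ delta_hid)
```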
6. Backpropagation Algorithm
Backpropagation can be explained
through the "Shoe Lace" analogy:
- Too little tension =
- not enough constraining, too loose
(unsatisfactory model)
- Too much tension =
- too much constraint (overtraining)
- taking too much time (slow process)
- higher likelihood of breaking
(non-convergence)
- Pulling more on one lace than the other =
- discomfort (bias)
7. Learning Rate
• Learning rate definition:
• Ratio of the weight's gradient that is subtracted from the weight
• Learning rate = Trade-Off
• Large ratios => fast training, but risk of overshooting the minimum
• Small ratios => more accurate training, but slower convergence
• Question: How do you choose the learning rate?
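To illustrate the trade-off, here is a sketch of my own (assuming a simple quadratic loss E(w) = w², not an example from the deck):

```python
def gradient_descent(lr, steps=20, w0=5.0):
    """Minimize E(w) = w**2 (gradient dE/dw = 2*w) with a fixed learning rate."""
    w = w0
    for _ in range(steps):
        w -= lr * 2 * w  # w <- w - lr * dE/dw
    return w

for lr in (0.01, 0.1, 0.9, 1.1):
    print(f"lr={lr}: w after 20 steps = {gradient_descent(lr):.4f}")
# lr=0.01: slow but steady progress toward the minimum at w = 0
# lr=0.1 : fast, accurate convergence
# lr=0.9 : oscillates around 0 but still shrinks
# lr=1.1 : every step overshoots further -- training diverges
```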
8. Activation Function
• Backpropagation &
Supervised Learning
• Backpropagation is used in a
supervised context
• Backpropagation requires
the activation function to
be differentiable (see the sigmoid example below)
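As an example (mine, not from the deck), the sigmoid is a classic differentiable activation, and its derivative is exactly what the backward pass multiplies by:

```python
import numpy as np

def sigmoid(z):
    """Sigmoid activation: smooth, hence differentiable everywhere."""
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_prime(z):
    """Its derivative, s'(z) = s(z) * (1 - s(z)), reused by the backward pass."""
    s = sigmoid(z)
    return s * (1 - s)
```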
9. Vanishing Gradient
• What is a vanishing gradient?
The case where the gradients flowing back through the network shrink toward 0,
so the weights of the early layers barely get updated
Lessons:
- The starting point for the weights matters
(the network can fall into a non-optimal minimum)
- Large architectures make the gradients harder to
control
- Hidden layers that no longer learn are expensive
memory-wise (and useless), as illustrated below
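A minimal sketch (mine, assuming a chain of sigmoid layers) of why the gradient vanishes: each layer multiplies the backpropagated signal by the local derivative s'(z) ≤ 0.25, so ten layers shrink it by at least a factor of 0.25¹⁰:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

grad = 1.0  # gradient magnitude at the output layer
for layer in range(10, 0, -1):
    s = sigmoid(0.0)       # pre-activation 0 gives the *largest* slope, 0.25
    grad *= s * (1 - s)    # chain rule: multiply by the local derivative
    print(f"layer {layer}: gradient magnitude <= {grad:.2e}")
# After 10 layers: 0.25**10 ≈ 9.5e-07 -- almost no signal is left to
# update the earliest weights, no matter how wrong they are.
```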
10. (image-only slide)
11. Let's Recap: What Is Hard/Tricky with DL?
• What decisions need to be made to build a DL model?
• ARCHITECTURE: overall architecture (RNN, etc.), number of layers, number of neurons,
number of inputs, number of outputs
• MODEL: learning rate, loss function, activation function, starting weights
• DATA: amount of data
• Conclusion
• Architecture building is sketchy and empirical
• Experimentation takes time and memory
Editor's Notes
LiveSlide Site: http://www.emergentmind.com/neural-network