INTRODUCTION TO DEEP LEARNING
Big Guns
Google Trends
Real?
Look Around!
FR - Image Captioning API
Deep Learning can describe images well
For Visually Impaired People
Self Driving Cars - Detection Mechanisms
MIT 6.S094 - Self-Driving
So Many Applications
1. How FB clusters images - OpenFace
2. Google Neural Machine Translation - Seq2Seq
3. Google Assistant / Siri - Seq2Seq, Attention Mechanisms
4. Self-Driving Cars - Detection, Deep Reinforcement Learning
5. Signal Processing - Discriminative and separable feature extractors
Yes… Even On Wall Street!
Algorithmic Trading With Bots
THE GAP BETWEEN ACADEMIA AND
INDUSTRY IS REALLY, REALLY SMALL
Where To Start?
● This is different from traditional Machine Learning
● SVM, Decision Trees, Random Forest, Naive Bayes, Regression
Where to use traditional ML?
● Good for problems with linear classification boundaries
Not Lucky Enough…
● Non-Linear Boundaries
● We need algorithms with more non-linearity
Problems Getting Complex With Big Data
1. Data is the new oil
2. Economy is data driven
3. So many patterns
4. So many connections
Mining the Big Data
Again, more NON-LINEARITY
The Best Fit is a Deep Neural Architecture
AI vs ML vs DL
● In a nutshell
Machine Learning Basics - Very Brief
1. Supervised Learning - Neural Nets, SVM, Naive Bayes, Random Forest
2. Unsupervised Learning - K-Means Clustering
3. Reinforcement Learning - Model-Free Learning, MDPs, Q-Learning
4. Semi-Supervised Learning - GANs (new)
Supervised vs Unsupervised
Reinforcement Learning
Semi-Supervised Learning (GAN)
Again, Why Deep Learning?
❏ Traditional ML works with a more rule-based structure
❏ Not deep enough to extract complex patterns
❏ Selected features must be carefully fed in: PCA, handcrafted features (Haar)
Haar Features
SIFT Features
Deep Learning Allows: End-to-End Training
Where to Start…
★ Neural Architecture
Single Layer Neural Net
Architecture - Basics
● Classification Algorithm
● Trainable Weight Set
● Labeled Data
Let’s start with a node
Perceptron
Nonlinearity applied inside a node
What Kind Of Nonlinearity?
Nonlinearity to mine complex boundaries
Basic Idea:
These units are there to squash the information, which means they have a
working range.
Ex: Activation range of the sigmoid (a) and tanh (b) neurons
Note the near-linear region in the middle of each curve
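A minimal NumPy sketch of a single node (the input and weight values below are made-up examples): the node takes a dot product of inputs and weights, adds a bias, and squashes the result with a nonlinearity so the output stays inside the activation's working range.

    import numpy as np

    def sigmoid(z):
        # squashes any real input into the range (0, 1)
        return 1.0 / (1.0 + np.exp(-z))

    def node(x, w, b):
        # dot product + bias, then the nonlinearity applied inside the node
        return sigmoid(np.dot(w, x) + b)

    x = np.array([0.5, -1.2, 3.0])   # example inputs (assumed values)
    w = np.array([0.4, 0.6, -0.1])   # trainable weights
    b = 0.1                          # trainable bias

    print(node(x, w, b))                  # always between 0 and 1
    print(np.tanh(np.dot(w, x) + b))      # tanh squashes into (-1, 1) instead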
Understanding the Neural Model
● Training Phase: Supervised Learning | Labeled data
● Inference / Testing Phase: Checking the model on unseen data
Architecture
Forward Pass
• Calculating the loss
Backward Pass
• Backpropagation Algorithm
• Distributing Gradients
Optimizing
• Reducing the loss
• Updating the weight matrix
Updating weights - SGD
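To make the forward pass / backward pass / update cycle concrete, here is a small NumPy sketch of training a single sigmoid node with gradient descent. The toy data, learning rate, and step count are assumed for illustration, and real SGD would use mini-batches rather than the full batch used here.

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 3))            # toy labeled data (made up)
    y = (X @ np.array([1.0, -2.0, 0.5]) > 0).astype(float).reshape(-1, 1)

    W = rng.normal(scale=0.1, size=(3, 1))   # trainable weight set
    b = np.zeros((1,))
    lr = 0.5                                  # assumed learning rate

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    for step in range(200):
        # Forward pass: predictions and loss (mean squared error here)
        p = sigmoid(X @ W + b)
        loss = np.mean((p - y) ** 2)

        # Backward pass: the chain rule distributes gradients to W and b
        dp = 2 * (p - y) / len(X)             # dLoss/dp
        dz = dp * p * (1 - p)                 # through the sigmoid
        dW = X.T @ dz                         # dLoss/dW
        db = dz.sum(axis=0)                   # dLoss/db

        # Optimizing: the gradient-descent update reduces the loss
        W -= lr * dW
        b -= lr * db

    print(f"final loss: {loss:.4f}")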
YOU CAN’T TRAIN
IF THERE ARE NO GRADIENTS
Deep Neural Networks
This went deeper!
How?
With the help of two superheroes!
Deep Neural Networks
What makes this special?
Hierarchical Feature Representation
Why is this feature hierarchy so effective?
Then what happened ?
Deep Neural Nets became harder and harder to train!
DEEP NETS!
Y U NO EASY?
Number Of Parameters?
These nets are huge!
Many layers, many nodes, many parameters
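For a sense of scale (the layer sizes below are an assumed example, not from the slides): even a small fully connected net already has hundreds of thousands of trainable parameters.

    # parameter count for an assumed MLP: 784 -> 512 -> 512 -> 10
    layers = [784, 512, 512, 10]
    total = sum(n_in * n_out + n_out            # weights plus biases per layer
                for n_in, n_out in zip(layers, layers[1:]))
    print(total)  # 669706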
BUT DON’T KNOW
HOW MANY HIDDEN
LAYERS / NODES TO
USE?
WHEN YOU WANT TO
BUILD A NEURAL NET
Overfitting
The Algorithm Fails To Generalize
❏ When predicting, the algorithm should be logical: it should take decisions
based on the patterns it has learned.
❏ Otherwise it will eventually fail on unseen data
❏ As networks become bigger and bigger, they run into the overfitting issue
more and more
Common Regularization wasn’t enough
● Adding some kind of penalty when tweaking parameters
Common Regularization Methods
Not Enough!
Left - L2 Regularization | Right - L1 Regularization
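A minimal sketch of what "adding a penalty" means in code (the penalty strength lam is an assumed hyperparameter): the L2 version punishes large weights, while the L1 version pushes many weights toward exactly zero.

    import numpy as np

    def loss_with_l2(pred, target, W, lam=1e-3):
        # data loss plus an L2 penalty on the weights
        return np.mean((pred - target) ** 2) + lam * np.sum(W ** 2)

    def loss_with_l1(pred, target, W, lam=1e-3):
        # L1 penalty instead: encourages sparse weights
        return np.mean((pred - target) ** 2) + lam * np.sum(np.abs(W))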
Then new methods came along
1. Dropout
2. Modified SGD for optimization - Adam, RMSprop, Adagrad
3. Different architectures like CNN
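A short PyTorch sketch showing two of these methods together (the layer sizes and hyperparameters are assumed examples): dropout between layers, and Adam in place of plain SGD.

    import torch
    import torch.nn as nn

    model = nn.Sequential(
        nn.Linear(784, 256),
        nn.ReLU(),
        nn.Dropout(p=0.5),   # randomly zeroes half the activations during training
        nn.Linear(256, 10),
    )
    # Adam adapts the step size per parameter instead of using one global rate
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)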
Killing the training process
Neuron in a neural network
Sigmoid activation function
Batch Normalization Helped DL a LOT!
A neural net should have more unsaturated neurons.
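A sketch of where batch normalization sits in a network (layer sizes assumed): normalizing each layer's inputs keeps activations inside the non-saturated range of the nonlinearity.

    import torch.nn as nn

    block = nn.Sequential(
        nn.Linear(256, 256),
        nn.BatchNorm1d(256),   # zero mean, unit variance per feature, per batch
        nn.ReLU(),
    )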
● Backpropagation is basically applying the chain rule
● In order to make that happen, we need to calculate local gradients for each node
and connection
Network with ReLU activation function
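A quick NumPy illustration of why this matters for the chain rule (the probe values of z are arbitrary): a saturated sigmoid has a near-zero local gradient, so repeated multiplication through many layers kills the signal, while ReLU passes a gradient of exactly 1 for any active unit.

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    z = np.array([-10.0, 0.0, 10.0])   # arbitrary probe inputs

    s = sigmoid(z)
    print(s * (1 - s))             # [~0.00005, 0.25, ~0.00005]: vanishes when saturated

    print((z > 0).astype(float))   # [0., 0., 1.]: ReLU's local gradient for active units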
Feature Engineering
Canny edge detection filter
Can we design a set of features for machines?
No way!
We may design some high level features!
But our machines deal with PIXELS!
(In other domains like NLP also)
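As an example of such a hand-engineered feature, here is an OpenCV sketch of the Canny edge detector mentioned above (the file name and the two hysteresis thresholds are assumed values):

    import cv2

    img = cv2.imread("photo.jpg", cv2.IMREAD_GRAYSCALE)   # assumed input image
    edges = cv2.Canny(img, 100, 200)                      # lower/upper thresholds
    cv2.imwrite("edges.jpg", edges)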
What if we let the machine extract its own features!
Deep Learning is all about End-to-End Training
Automated Feature Extraction
Visualization of the self-learned filters of a CNN. Each layer learns different features.
Some Cool Applications
Self driving cars
Generative Chatbots
Detectors
VQA networks
Introduction to deep learning workshop

Editor's Notes

  • #2 Topics: 1. Very short intro to what ML is 2. Difference between ML and DL 3. Why DL is so effective 4. Modern-day applications
  • #7 Some applications - making DL more familiar.
  • #8 In FR we use this
  • #12 Applications and their papers or algorithms
  • #20 If we can mimic the brain it will be a good solution
  • #22 Give a little lead-in; the coming slides have the information
  • #26 Here just give a hint of why deep learning is needed
  • #27 From here onwards I will give some theoretical stuff in o
  • #28 Explaining the neural architecture
  • #30 Simple introduction to the smallest unit of a neural network. How the dot product and non-linearity are applied
  • #32 Make sure they get the idea that these functions can mine complex non-linear boundaries
  • #33 Then I should speak about the training phase
  • #34 Forward pass, loss function, reducing the loss, tweaking the weights. FINAL LAYER
  • #35 Speak generally about the importance of gradients and the backward pass
  • #36 Very brief idea - this is a 3-dimensional loss function, but in reality it is MULTI-DIMENSIONAL
  • #37 What is happening during training. We don't use naive optimization
  • #38 Somehow you should calculate the gradients
  • #41 At that time these networks outperformed everything, so they needed to develop them deeper…
  • #43 Show them how the number of nodes, parameters, and weights has increased. Same process
  • #44 The hierarchical feature representation and classifying. NOT JUST FOR IMAGES!
  • #45 Bottom-to-top approach. Why does this structure matter so much?
  • #46 Will explain briefly how the brain likes to see things. They used the cortex of a cat
  • #47 Why people had to move away from deep neural nets
  • #48 Give them the idea of how much the number of nodes and parameters has increased
  • #49 Give them the idea that these nets are complex and have many things to calculate. I will talk about a bit of the problems
  • #50 Very brief
  • #55 Mathematical complexity
  • #57 Most of the neurons have died
  • #58 Deep Learning, NOT Feature Engineering
  • #59 Feature engineering - these are human-engineered features. Then people tried to make this automated. This is the starting point of the CNN
  • #60 This is like deciding how to make a filter
  • #61 We don't work with the pixels. We use some high-level features
  • #62 New way of thinking
  • #63 Some features don't even make sense to us