SlideShare a Scribd company logo
Animesh Prasad
Muthu Kumar Chandrasekaran 

Basics of Deep Learning
Workshop on Educational and Social Science Technologies (WESST 2017, 19—21 July)
Machine Learning Primer
2
Machine Learning Primer
3
4
Machine Learning Primer
5
http://playground.tensorflow.org
Machine Learning, AI
• Four key ingredients for ML towards AI
1. Lots & lots of data
2. Very flexible models
3.
 
Enough computing power
4.  Powerful priors that can defeat the curse of
dimensionality 

6
Bypassing the curse of dimensionality
We need to build compositionality into our ML models
Just as human languages exploit compositionality to give representations and
meanings to complex ideas
Exploiting compositionality gives an exponential gain in representational power
(1) Distributed representations / embeddings: feature learning
(2) Deep architecture: multiple levels of feature learning
Additional prior: compositionality is useful to describe the
world around us efficiently


7
Each feature can be discovered without the need for
seeing the exponentially large number of
configurations of the other features
• Consider a network whose hidden units discover the following features:
• Person wears glasses
• Person is female

• Person is a child

• Etc.
If each of n feature requires O(k) parameters, need O(nk) examples Non-
parametric methods would require O(nd
) examples


8


9
Hidden Units Discover Semantically Meaningful
Concepts 

• Network trained to recognize places, not objects


10
Why does it work?
•
 
It only works because we are making some assumptions about 

the data generating distribution 

•
 
Worse-case distributions still require exponential data 

•
 
But the world has structure and we can get an exponential gain by
exploiting some of it 



11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
Recurrent Neural Networks
(RNN)
Recurrent Neural Networks
• Selectively summarize an input sequence in a fixed-size state vector via a
recursive update
26
Recurrent Neural Networks
• Can produce an output at each time step: unfolding the graph tells us how to back-prop
through time.
27
Recurrent Neural Networks
• The RNN gradient is a product of Jacobian matrices, each associated with a step in the
forward computation.
28
29
RNN Tricks
30
Gated Recurrent Units & LSTM
Create a path where gradients can
flow for longer with self-loop
•  Corresponds to an eigenvalue of
Jacobian slightly less than 1
•  LSTM is heavily used
• GRU light-weight version
31
In Practice
32
33
34
35
36
37
Saddle Points During Training
38
Piecewise Linear Nonlinearity
Absolute value rectification works better than tanh
in lower layers of convnet.
Using a rectifier non- linearity (ReLU) instead of
tanh of softplus allows for the first time to train
very deep supervised networks without the need
for unsupervised pre-training; was biologically
motivated
Rectifiers one of the crucial ingredients in
ImageNet breakthrough
39
Dropout Regularizer: Super-
Efficient Bagging
•  Dropouts trick: during training
multiply neuron output by random bit
(p=0.5), during test by 0.5
•  Used in deep supervised networks
•  Similar to denoising auto-encoder,
but corrupting every layer
•  Works better with some non-
linearities (rectifiers, maxout)
40
Batch Normalization
•  Regulaizes & helps to train
41
Early Stopping
•  Beautiful FREE LUNCH (no need to launch many different
training runs for each value of hyper-parameter for #iterations)
•  Monitor validation error during training (after visiting # of
training examples = a multiple of validation set size)
•  Keep track of parameters with best validation error and report
them at the end
•  If error does not improve enough (with some patience), stop. 

42
Sampling Hyperparameters
•  Common approach: manual +
grid search
•  Grid search over
hyperparameters: simple &
wasteful
•  Random search: simple &
efficient 

• Independently sample each HP,

• More convenient: ok to early-
stop, continue further, etc. 

43
Speech
44
Natural Language
Representations
45
46
Analogical Representations for
Free Text
• Semantic relations appear
as linear relationships in the
space of learned
representations
• King – Queen ≈ Man –
Woman
• Paris – France + Italy ≈
Rome
47
•  Intermediate representation of meaning = ‘universal representation’
•  Encoder: from word sequence to sentence representation
•  Decoder: from representation to word sequence distribution
Encoder-Decoder Framework
48
Attention Mechanism for Deep
Learning
•  Consider an input (or intermediate) sequence or image
•  Consider an upper level representation, which can choose « where to
look », by assigning a weight or probability to each input position, as
produced by an MLP, applied at each position
49
Image-to-Text: Caption Generation
with Attention
50
Image-to-Text: Caption Generation
with Attention
51
52
53
54
Unsupervised Representation
Learning
55
Autoencoders
•  RBM 

•  (Denoising) Autoencoder
56
57
58
Insights
•  The big payoff of deep learning is to allow learning
higher levels of abstraction 

•  Higher-level abstractions disentangle the factors of
variation, which allows much easier generalization and
transfer 

59
Hands On
60
Hands On
• Login to NUS SoC VPN with your NUSNET Account
• ssh wesst@kpe.d1.comp.nus.edu.sg
Password: kerasuser
• copy folder demo to a new folder rename demo_<yourname>
61
CPU or GPU?
62
Frameworks
63
64
65
66
67
68
69
70
71
72
73
74
75
19 July 2017 76
References:
NIPS DL Tutorial 2015
CS231n: Convolutional Neural
Networks for Visual Recognition

More Related Content

What's hot

Neural networks
Neural networksNeural networks
Neural networks
Learnbay Datascience
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
Amr Rashed
 
Digit recognition
Digit recognitionDigit recognition
Digit recognition
btandale
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
nainabhatt2
 
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
COMPEGENCE
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learning
RADO7900
 
Neutral Network
Neutral NetworkNeutral Network
Neutral Network
Divyansh Sawant
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
Amr Rashed
 
Neural networks of artificial intelligence
Neural networks of artificial  intelligenceNeural networks of artificial  intelligence
Neural networks of artificial intelligence
alldesign
 
Soft Computing-173101
Soft Computing-173101Soft Computing-173101
Soft Computing-173101AMIT KUMAR
 
Neural networks.ppt
Neural networks.pptNeural networks.ppt
Neural networks.ppt
SrinivashR3
 
Soft computing (ANN and Fuzzy Logic) : Dr. Purnima Pandit
Soft computing (ANN and Fuzzy Logic)  : Dr. Purnima PanditSoft computing (ANN and Fuzzy Logic)  : Dr. Purnima Pandit
Soft computing (ANN and Fuzzy Logic) : Dr. Purnima Pandit
Purnima Pandit
 
Neural Networks Ver1
Neural  Networks  Ver1Neural  Networks  Ver1
Neural Networks Ver1
ncct
 
Neural Netwrok
Neural NetwrokNeural Netwrok
Neural Netwrok
Rabin BK
 
Handwritten Digit Recognition
Handwritten Digit RecognitionHandwritten Digit Recognition
Handwritten Digit Recognition
ijtsrd
 
neural networks
 neural networks neural networks
neural networks
joshiblog
 
Soft Computing
Soft ComputingSoft Computing
Soft Computing
MANISH T I
 
Emotion detection using cnn.pptx
Emotion detection using cnn.pptxEmotion detection using cnn.pptx
Emotion detection using cnn.pptx
RADO7900
 
Artificial neural network for machine learning
Artificial neural network for machine learningArtificial neural network for machine learning
Artificial neural network for machine learning
grinu
 

What's hot (20)

Neural networks
Neural networksNeural networks
Neural networks
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Digit recognition
Digit recognitionDigit recognition
Digit recognition
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learning
 
Neutral Network
Neutral NetworkNeutral Network
Neutral Network
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
 
Neural networks of artificial intelligence
Neural networks of artificial  intelligenceNeural networks of artificial  intelligence
Neural networks of artificial intelligence
 
Soft Computing-173101
Soft Computing-173101Soft Computing-173101
Soft Computing-173101
 
Neural networks.ppt
Neural networks.pptNeural networks.ppt
Neural networks.ppt
 
Soft computing (ANN and Fuzzy Logic) : Dr. Purnima Pandit
Soft computing (ANN and Fuzzy Logic)  : Dr. Purnima PanditSoft computing (ANN and Fuzzy Logic)  : Dr. Purnima Pandit
Soft computing (ANN and Fuzzy Logic) : Dr. Purnima Pandit
 
Neural Networks Ver1
Neural  Networks  Ver1Neural  Networks  Ver1
Neural Networks Ver1
 
Neural Netwrok
Neural NetwrokNeural Netwrok
Neural Netwrok
 
Handwritten Digit Recognition
Handwritten Digit RecognitionHandwritten Digit Recognition
Handwritten Digit Recognition
 
neural networks
 neural networks neural networks
neural networks
 
Soft Computing
Soft ComputingSoft Computing
Soft Computing
 
Emotion detection using cnn.pptx
Emotion detection using cnn.pptxEmotion detection using cnn.pptx
Emotion detection using cnn.pptx
 
test
testtest
test
 
Artificial neural network for machine learning
Artificial neural network for machine learningArtificial neural network for machine learning
Artificial neural network for machine learning
 

Viewers also liked

Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience APIXiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
NUS Institute of Applied Learning Sciences and Educational Technology
 
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a ...
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a  ...Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a  ...
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a ...
NUS Institute of Applied Learning Sciences and Educational Technology
 
NUS's Digital Learning Journey in Enhancing Teaching and Learning
NUS's Digital Learning Journey in Enhancing Teaching and LearningNUS's Digital Learning Journey in Enhancing Teaching and Learning
NUS's Digital Learning Journey in Enhancing Teaching and Learning
NUS Institute of Applied Learning Sciences and Educational Technology
 
Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
 Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat... Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
NUS Institute of Applied Learning Sciences and Educational Technology
 
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
NUS Institute of Applied Learning Sciences and Educational Technology
 
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
NUS Institute of Applied Learning Sciences and Educational Technology
 
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning EnvironmentRavi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
NUS Institute of Applied Learning Sciences and Educational Technology
 
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
NUS Institute of Applied Learning Sciences and Educational Technology
 
Tuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
Tuan Q. Phan - WESST - Getting Started on the Computational Social SciencesTuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
Tuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
NUS Institute of Applied Learning Sciences and Educational Technology
 
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
NUS Institute of Applied Learning Sciences and Educational Technology
 
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
NUS Institute of Applied Learning Sciences and Educational Technology
 
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
NUS Institute of Applied Learning Sciences and Educational Technology
 
Opening for WESST 2017
Opening for WESST 2017Opening for WESST 2017
Bibhas Chakraborty - WESST - Dynamically Personalized Interventions
Bibhas Chakraborty - WESST - Dynamically Personalized InterventionsBibhas Chakraborty - WESST - Dynamically Personalized Interventions
Bibhas Chakraborty - WESST - Dynamically Personalized Interventions
NUS Institute of Applied Learning Sciences and Educational Technology
 
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
NUS Institute of Applied Learning Sciences and Educational Technology
 
Prasanta Bhattacharya - WESST - Social Networks and Causal Inference
Prasanta Bhattacharya - WESST - Social Networks and Causal InferencePrasanta Bhattacharya - WESST - Social Networks and Causal Inference
Prasanta Bhattacharya - WESST - Social Networks and Causal Inference
NUS Institute of Applied Learning Sciences and Educational Technology
 
Jo Fang - WESST - Applied Research in Workforce Development
Jo Fang - WESST - Applied Research in Workforce DevelopmentJo Fang - WESST - Applied Research in Workforce Development
Jo Fang - WESST - Applied Research in Workforce Development
NUS Institute of Applied Learning Sciences and Educational Technology
 
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
NUS Institute of Applied Learning Sciences and Educational Technology
 

Viewers also liked (18)

Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience APIXiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
 
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a ...
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a  ...Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a  ...
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a ...
 
NUS's Digital Learning Journey in Enhancing Teaching and Learning
NUS's Digital Learning Journey in Enhancing Teaching and LearningNUS's Digital Learning Journey in Enhancing Teaching and Learning
NUS's Digital Learning Journey in Enhancing Teaching and Learning
 
Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
 Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat... Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
 
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
 
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
 
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning EnvironmentRavi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
 
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
 
Tuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
Tuan Q. Phan - WESST - Getting Started on the Computational Social SciencesTuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
Tuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
 
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
 
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
 
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
 
Opening for WESST 2017
Opening for WESST 2017Opening for WESST 2017
Opening for WESST 2017
 
Bibhas Chakraborty - WESST - Dynamically Personalized Interventions
Bibhas Chakraborty - WESST - Dynamically Personalized InterventionsBibhas Chakraborty - WESST - Dynamically Personalized Interventions
Bibhas Chakraborty - WESST - Dynamically Personalized Interventions
 
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
 
Prasanta Bhattacharya - WESST - Social Networks and Causal Inference
Prasanta Bhattacharya - WESST - Social Networks and Causal InferencePrasanta Bhattacharya - WESST - Social Networks and Causal Inference
Prasanta Bhattacharya - WESST - Social Networks and Causal Inference
 
Jo Fang - WESST - Applied Research in Workforce Development
Jo Fang - WESST - Applied Research in Workforce DevelopmentJo Fang - WESST - Applied Research in Workforce Development
Jo Fang - WESST - Applied Research in Workforce Development
 
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
 

Similar to Animesh Prasad and Muthu Kumar Chandrasekaran - WESST - Basics of Deep Learning

Machine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning SystemsMachine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning Systems
Clarence Chio
 
Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)
Gaurav Mittal
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
Felipe Prado
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
Abhishek Bhandwaldar
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Impetus Technologies
 
Deep learning
Deep learningDeep learning
Deep learning
Ratnakar Pandey
 
AINL 2016: Filchenkov
AINL 2016: FilchenkovAINL 2016: Filchenkov
AINL 2016: Filchenkov
Lidia Pivovarova
 
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
gabrielesisinna
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspective
Anirban Santara
 
Natural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A SurveyNatural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A Survey
Rimzim Thube
 
Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling
Yu Huang
 
Advancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine LearningAdvancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine Learning
HPCC Systems
 
ML Module 3 Non Linear Learning.pptx
ML Module 3 Non Linear Learning.pptxML Module 3 Non Linear Learning.pptx
ML Module 3 Non Linear Learning.pptx
DebabrataPain1
 
deepnet-lourentzou.ppt
deepnet-lourentzou.pptdeepnet-lourentzou.ppt
deepnet-lourentzou.ppt
yang947066
 
Model Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningModel Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep Learning
Pramit Choudhary
 
DSRLab seminar Introduction to deep learning
DSRLab seminar   Introduction to deep learningDSRLab seminar   Introduction to deep learning
DSRLab seminar Introduction to deep learning
Poo Kuan Hoong
 
Neural Networks and Deep Learning Basics
Neural Networks and Deep Learning BasicsNeural Networks and Deep Learning Basics
Neural Networks and Deep Learning Basics
Jon Lederman
 
Recent progress on distributing deep learning
Recent progress on distributing deep learningRecent progress on distributing deep learning
Recent progress on distributing deep learning
Viet-Trung TRAN
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRU
ananth
 
Lecture on Deep Learning
Lecture on Deep LearningLecture on Deep Learning
Lecture on Deep Learning
Yasas Senarath
 

Similar to Animesh Prasad and Muthu Kumar Chandrasekaran - WESST - Basics of Deep Learning (20)

Machine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning SystemsMachine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning Systems
 
Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
 
Deep learning
Deep learningDeep learning
Deep learning
 
AINL 2016: Filchenkov
AINL 2016: FilchenkovAINL 2016: Filchenkov
AINL 2016: Filchenkov
 
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspective
 
Natural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A SurveyNatural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A Survey
 
Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling
 
Advancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine LearningAdvancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine Learning
 
ML Module 3 Non Linear Learning.pptx
ML Module 3 Non Linear Learning.pptxML Module 3 Non Linear Learning.pptx
ML Module 3 Non Linear Learning.pptx
 
deepnet-lourentzou.ppt
deepnet-lourentzou.pptdeepnet-lourentzou.ppt
deepnet-lourentzou.ppt
 
Model Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningModel Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep Learning
 
DSRLab seminar Introduction to deep learning
DSRLab seminar   Introduction to deep learningDSRLab seminar   Introduction to deep learning
DSRLab seminar Introduction to deep learning
 
Neural Networks and Deep Learning Basics
Neural Networks and Deep Learning BasicsNeural Networks and Deep Learning Basics
Neural Networks and Deep Learning Basics
 
Recent progress on distributing deep learning
Recent progress on distributing deep learningRecent progress on distributing deep learning
Recent progress on distributing deep learning
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRU
 
Lecture on Deep Learning
Lecture on Deep LearningLecture on Deep Learning
Lecture on Deep Learning
 

Recently uploaded

Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
rosedainty
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
Nguyen Thanh Tu Collection
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
PedroFerreira53928
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 

Recently uploaded (20)

Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 

Animesh Prasad and Muthu Kumar Chandrasekaran - WESST - Basics of Deep Learning

  • 1. Animesh Prasad Muthu Kumar Chandrasekaran 
 Basics of Deep Learning Workshop on Educational and Social Science Technologies (WESST 2017, 19—21 July)
  • 6. Machine Learning, AI • Four key ingredients for ML towards AI 1. Lots & lots of data 2. Very flexible models 3.   Enough computing power 4.  Powerful priors that can defeat the curse of dimensionality 
 6
  • 7. Bypassing the curse of dimensionality We need to build compositionality into our ML models Just as human languages exploit compositionality to give representations and meanings to complex ideas Exploiting compositionality gives an exponential gain in representational power (1) Distributed representations / embeddings: feature learning (2) Deep architecture: multiple levels of feature learning Additional prior: compositionality is useful to describe the world around us efficiently 
 7
  • 8. Each feature can be discovered without the need for seeing the exponentially large number of configurations of the other features • Consider a network whose hidden units discover the following features: • Person wears glasses • Person is female
 • Person is a child
 • Etc. If each of n feature requires O(k) parameters, need O(nk) examples Non- parametric methods would require O(nd ) examples 
 8
  • 10. Hidden Units Discover Semantically Meaningful Concepts 
 • Network trained to recognize places, not objects 
 10
  • 11. Why does it work? •   It only works because we are making some assumptions about 
 the data generating distribution 
 •   Worse-case distributions still require exponential data 
 •   But the world has structure and we can get an exponential gain by exploiting some of it 
 
 11
  • 12. 12
  • 13. 13
  • 14. 14
  • 15. 15
  • 16. 16
  • 17. 17
  • 18. 18
  • 19. 19
  • 20. 20
  • 21. 21
  • 22. 22
  • 23. 23
  • 24. 24
  • 26. Recurrent Neural Networks • Selectively summarize an input sequence in a fixed-size state vector via a recursive update 26
  • 27. Recurrent Neural Networks • Can produce an output at each time step: unfolding the graph tells us how to back-prop through time. 27
  • 28. Recurrent Neural Networks • The RNN gradient is a product of Jacobian matrices, each associated with a step in the forward computation. 28
  • 30. 30 Gated Recurrent Units & LSTM Create a path where gradients can flow for longer with self-loop •  Corresponds to an eigenvalue of Jacobian slightly less than 1 •  LSTM is heavily used • GRU light-weight version
  • 32. 32
  • 33. 33
  • 34. 34
  • 35. 35
  • 36. 36
  • 38. 38 Piecewise Linear Nonlinearity Absolute value rectification works better than tanh in lower layers of convnet. Using a rectifier non- linearity (ReLU) instead of tanh of softplus allows for the first time to train very deep supervised networks without the need for unsupervised pre-training; was biologically motivated Rectifiers one of the crucial ingredients in ImageNet breakthrough
  • 39. 39 Dropout Regularizer: Super- Efficient Bagging •  Dropouts trick: during training multiply neuron output by random bit (p=0.5), during test by 0.5 •  Used in deep supervised networks •  Similar to denoising auto-encoder, but corrupting every layer •  Works better with some non- linearities (rectifiers, maxout)
  • 41. 41 Early Stopping •  Beautiful FREE LUNCH (no need to launch many different training runs for each value of hyper-parameter for #iterations) •  Monitor validation error during training (after visiting # of training examples = a multiple of validation set size) •  Keep track of parameters with best validation error and report them at the end •  If error does not improve enough (with some patience), stop. 

  • 42. 42 Sampling Hyperparameters •  Common approach: manual + grid search •  Grid search over hyperparameters: simple & wasteful •  Random search: simple & efficient 
 • Independently sample each HP,
 • More convenient: ok to early- stop, continue further, etc. 

  • 45. 45
  • 46. 46 Analogical Representations for Free Text • Semantic relations appear as linear relationships in the space of learned representations • King – Queen ≈ Man – Woman • Paris – France + Italy ≈ Rome
  • 47. 47 •  Intermediate representation of meaning = ‘universal representation’ •  Encoder: from word sequence to sentence representation •  Decoder: from representation to word sequence distribution Encoder-Decoder Framework
  • 48. 48 Attention Mechanism for Deep Learning •  Consider an input (or intermediate) sequence or image •  Consider an upper level representation, which can choose « where to look », by assigning a weight or probability to each input position, as produced by an MLP, applied at each position
  • 51. 51
  • 52. 52
  • 53. 53
  • 55. 55 Autoencoders •  RBM 
 •  (Denoising) Autoencoder
  • 56. 56
  • 57. 57
  • 58. 58 Insights •  The big payoff of deep learning is to allow learning higher levels of abstraction 
 •  Higher-level abstractions disentangle the factors of variation, which allows much easier generalization and transfer 

  • 60. 60 Hands On • Login to NUS SoC VPN with your NUSNET Account • ssh wesst@kpe.d1.comp.nus.edu.sg Password: kerasuser • copy folder demo to a new folder rename demo_<yourname>
  • 63. 63
  • 64. 64
  • 65. 65
  • 66. 66
  • 67. 67
  • 68. 68
  • 69. 69
  • 70. 70
  • 71. 71
  • 72. 72
  • 73. 73
  • 74. 74
  • 75. 75
  • 76. 19 July 2017 76 References: NIPS DL Tutorial 2015 CS231n: Convolutional Neural Networks for Visual Recognition