SlideShare a Scribd company logo
Introduction to Deep Learning
M S Ram
Dept. of Computer Science & Engg.
Indian Institute of Technology Kanpur
Reading of Chap. 1 from “Learning Deep Architectures for AI”; Yoshua Bengio; FTML Vol. 2, No. 1 (2009) 1–127
1
Date: 12 Nov, 2015
A Motivational Task: Percepts  Concepts
• Create algorithms
• that can understand scenes and describe
them in natural language
• that can infer semantic concepts to allow
machines to interact with humans using these
concepts
• Requires creating a series of abstractions
• Image (Pixel Intensities)  Objects in Image  Object
Interactions  Scene Description
• Deep learning aims to automatically learn these
abstractions with little supervision
Courtesy: Yoshua Bengio, Learning Deep Architectures for AI
2
Deep Visual-Semantic Alignments for Generating
Image Descriptions (Karpathy, Fei-Fei; CVPR 2015)
"boy is doing backflip
on wakeboard."
“two young girls are
playing with lego toy.”
"man in black shirt is
playing guitar."
"construction worker in
orange safety vest is
working on road."
http://cs.stanford.edu/people/karpathy/deepimagesent/
3
Challenge in Modelling Complex Behaviour
• Too many concepts to learn
• Too many object categories
• Too many ways of interaction between objects categories
• Behaviour is a highly varying function underlying factors
• f: L  V
• L: latent factors of variation
• low dimensional latent factor space
• V: visible behaviour
• high dimensional observable space
• f: highly non-linear function
4
Example: Learning the Configuration Space of a Robotic Arm
5
C-Space Discovery using Isomap
6
How do We Train Deep Architectures?
• Inspiration from mammal brain
• Multiple Layers of “neurons” (Rumelhart et al 1986)
• Train each layer to compose the representations of the previous layer
to learn a higher level abstraction
• Ex: Pixels  Edges  Contours  Object parts  Object categories
• Local Features  Global Features
• Train the layers one-by-one (Hinton et al 2006)
• Greedy strategy
7
Multilayer Perceptron with Back-propagation
First deep learning model (Rumelhart, Hinton, Williams 1986)
input vector
hidden
layers
outputs
Back-propagate
error signal to
get derivatives
for learning
Compare outputs with
correct answer to get
error signal
Source: Hinton’s 2009 tutorial on Deep Belief Networks 8
Drawbacks of Back-propagation based Deep Neural
Networks
• They are discriminative models
• Get all the information from the labels
• And the labels don’t give so much of information
• Need a substantial amount of labeled data
• Gradient descent with random initialization leads to poor local
minima
Hand-written digit recognition
• Classification of MNIST hand-written digits
• 10 digit classes
• Input image: 28x28 gray scale
• 784 dimensional input
A Deeper Look at the Problem
• One hidden layer with 500 neurons
=> 784 * 500 + 500 * 10
≈ 0.4 million weights
• Fitting a model that best explains the training data is an
optimization problem in a 0.4 million dimensional space
• It’s almost impossible for Gradient descent with random
initialization to arrive at the global optimum
A Solution – Deep Belief Networks
(Hinton et al. 2006)
Pre-trained
N/W Weights
Fast unsupervised
pre-training
Good
Solution
Slow Fine-tuning
(Using Back-propagation)
Very slow Back-propagation
(Often gets stuck at poor local minima)
Random
Initial position
Very high-dimensional parameter space
A Solution – Deep Belief Networks
(Hinton et al. 2006)
• Before applying back-propagation, pre-train the network as a
series of generative models
• Use the weights of the pre-trained network as the initial point
for the traditional back-propagation
• This leads to quicker convergence to a good solution
• Pre-training is fast; fine-tuning can be slow
Quick Check: MLP vs DBN on MNIST
• MLP (1 Hidden Layer)
• 1 hour: 2.18%
• 14 hours: 1.65%
• DBN
• 1 hour: 1.65%
• 14 hours: 1.10%
• 21 hours: 0.97%
Intel QuadCore 2.83GHz, 4GB RAM
MLP: Python :: DBN: Matlab
Intermediate Representations in Brain
• Disentanglement of factors of variation
underlying the data
• Distributed Representations
• Activation of each neuron is a function of
multiple features of the previous layer
• Feature combinations of different neurons
are not necessarily mutually exclusive
• Sparse Representations
• Only 1-4% neurons are active at a time
15
Localized Representation
Distributed Representation
Local vs. Distributed in Input Space
• Local Methods
• Assume smoothness prior
• g(x) = f(g(x1), g(x2), …, g(xk))
• {x1, x2, …, xk} are neighbours of x
• Require a metric space
• A notion of distance or similarity in the input space
• Fail when the target function is highly varying
• Examples
• Nearest Neighbour methods
• Kernel methods with a Gaussian kernel
• Distributed Methods
• No assumption of smoothness  No need for a notion of similarity
• Ex: Neural networks
16
Multi-task Learning
17
Source: https://en.wikipedia.org/wiki/Multi-task_learning
Desiderata for Learning AI
• Ability to learn complex, highly-varying functions
• Ability to learn multiple levels of abstraction with little human input
• Ability to learn from a very large set of examples
• Training time linear in the number of examples
• Ability to learn from mostly unlabeled data
• Unsupervised and semi-supervised
• Multi-task learning
• Sharing of representations across tasks
• Fast predictions
18
References
 Primary
 Yoshua Bengio, Learning Deep Architectures for AI, Foundations and Trends in Machine
Learning Vol. 2, No. 1 (2009) 1–127
 Hinton, G. E., Osindero, S. and Teh, Y. A fast learning algorithm for deep belief nets. Neural
Computation 18 (2006), pp 1527-1554
 Rumelhart, David E., Geoffrey E. Hinton, and R. J. Williams. Learning Internal
Representations by Error Propagation. David E. Rumelhart, James L. McClelland, and the
PDP research group. (editors), Parallel distributed processing: Explorations in the
microstructure of cognition, Volume 1: Foundations. MIT Press, 1986.
 Secondary
 Hinton, G. E., Learning Multiple Layers of Representation, Trends in Cognitive Sciences, Vol.
11, (2007) pp 428-434.
 Hinton G.E., Tutorial on Deep Belief Networks, Machine Learning Summer School,
Cambridge, 2009
 Andrej Karpathy, Li Fei-Fei. Deep Visual-Semantic Alignments for Generating Image
Descriptions. CVPR 2015.

More Related Content

What's hot

DSRLab seminar Introduction to deep learning
DSRLab seminar   Introduction to deep learningDSRLab seminar   Introduction to deep learning
DSRLab seminar Introduction to deep learning
Poo Kuan Hoong
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
Amr Rashed
 
Basics of Deep learning
Basics of Deep learningBasics of Deep learning
Basics of Deep learning
Ramesh Kumar
 
Deep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with styleDeep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with style
Roelof Pieters
 
Deep Learning for Artificial Intelligence (AI)
Deep Learning for Artificial Intelligence (AI)Deep Learning for Artificial Intelligence (AI)
Deep Learning for Artificial Intelligence (AI)
Er. Shiva K. Shrestha
 
HML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep LearningHML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep Learning
Yan Xu
 
Neural Networks-1
Neural Networks-1Neural Networks-1
Neural Networks-1
Sai Kumar Dwivedi
 
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A PrimerMDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
Poo Kuan Hoong
 
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra MalikDeep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
The Hive
 
101: Convolutional Neural Networks
101: Convolutional Neural Networks 101: Convolutional Neural Networks
101: Convolutional Neural Networks
Mad Scientists
 
Introduction to Deep learning
Introduction to Deep learningIntroduction to Deep learning
Introduction to Deep learning
leopauly
 
Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...
Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...
Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...
Andrew Gardner
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
Jun Wang
 
Deep learning - A Visual Introduction
Deep learning - A Visual IntroductionDeep learning - A Visual Introduction
Deep learning - A Visual Introduction
Lukas Masuch
 
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher ManningDeep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
BigDataCloud
 
Tutorial on Deep Learning
Tutorial on Deep LearningTutorial on Deep Learning
Tutorial on Deep Learning
inside-BigData.com
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
VishnuRajuV
 
Intro To Convolutional Neural Networks
Intro To Convolutional Neural NetworksIntro To Convolutional Neural Networks
Intro To Convolutional Neural Networks
Mark Scully
 

What's hot (20)

DSRLab seminar Introduction to deep learning
DSRLab seminar   Introduction to deep learningDSRLab seminar   Introduction to deep learning
DSRLab seminar Introduction to deep learning
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
 
Basics of Deep learning
Basics of Deep learningBasics of Deep learning
Basics of Deep learning
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
 
Deep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with styleDeep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with style
 
Deep Learning for Artificial Intelligence (AI)
Deep Learning for Artificial Intelligence (AI)Deep Learning for Artificial Intelligence (AI)
Deep Learning for Artificial Intelligence (AI)
 
HML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep LearningHML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep Learning
 
Neural Networks-1
Neural Networks-1Neural Networks-1
Neural Networks-1
 
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A PrimerMDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
 
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra MalikDeep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
 
101: Convolutional Neural Networks
101: Convolutional Neural Networks 101: Convolutional Neural Networks
101: Convolutional Neural Networks
 
Introduction to Deep learning
Introduction to Deep learningIntroduction to Deep learning
Introduction to Deep learning
 
Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...
Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...
Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
 
Deep learning - A Visual Introduction
Deep learning - A Visual IntroductionDeep learning - A Visual Introduction
Deep learning - A Visual Introduction
 
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher ManningDeep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
 
Tutorial on Deep Learning
Tutorial on Deep LearningTutorial on Deep Learning
Tutorial on Deep Learning
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
 
Ai overview
Ai overviewAi overview
Ai overview
 
Intro To Convolutional Neural Networks
Intro To Convolutional Neural NetworksIntro To Convolutional Neural Networks
Intro To Convolutional Neural Networks
 

Similar to Deep learning 1

Evolution of Deep Learning and new advancements
Evolution of Deep Learning and new advancementsEvolution of Deep Learning and new advancements
Evolution of Deep Learning and new advancements
Chitta Ranjan
 
PhD Defense
PhD DefensePhD Defense
PhD Defense
Taehoon Lee
 
Training machine learning deep learning 2017
Training machine learning deep learning 2017Training machine learning deep learning 2017
Training machine learning deep learning 2017
Iwan Sofana
 
MLIP - Chapter 3 - Introduction to deep learning
MLIP - Chapter 3 - Introduction to deep learningMLIP - Chapter 3 - Introduction to deep learning
MLIP - Chapter 3 - Introduction to deep learning
Charles Deledalle
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
Abhishek Bhandwaldar
 
AINL 2016: Filchenkov
AINL 2016: FilchenkovAINL 2016: Filchenkov
AINL 2016: Filchenkov
Lidia Pivovarova
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
Felipe Prado
 
Rise of AI through DL
Rise of AI through DLRise of AI through DL
Rise of AI through DL
Rehan Guha
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
Amr Rashed
 
Deep learning tutorial 9/2019
Deep learning tutorial 9/2019Deep learning tutorial 9/2019
Deep learning tutorial 9/2019
Amr Rashed
 
Neural Networks and Deep Learning Basics
Neural Networks and Deep Learning BasicsNeural Networks and Deep Learning Basics
Neural Networks and Deep Learning Basics
Jon Lederman
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Impetus Technologies
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introduction
Adwait Bhave
 
Tsinghua invited talk_zhou_xing_v2r0
Tsinghua invited talk_zhou_xing_v2r0Tsinghua invited talk_zhou_xing_v2r0
Tsinghua invited talk_zhou_xing_v2r0
Joe Xing
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introduction
giangbui0816
 
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamScene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
WithTheBest
 
Semantic, Cognitive and Perceptual Computing -Deep learning
Semantic, Cognitive and Perceptual Computing -Deep learning Semantic, Cognitive and Perceptual Computing -Deep learning
Semantic, Cognitive and Perceptual Computing -Deep learning
Artificial Intelligence Institute at UofSC
 
Machine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning SystemsMachine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning Systems
Clarence Chio
 
introduction to deeplearning
introduction to deeplearningintroduction to deeplearning
introduction to deeplearning
Eyad Alshami
 

Similar to Deep learning 1 (20)

Evolution of Deep Learning and new advancements
Evolution of Deep Learning and new advancementsEvolution of Deep Learning and new advancements
Evolution of Deep Learning and new advancements
 
PhD Defense
PhD DefensePhD Defense
PhD Defense
 
Training machine learning deep learning 2017
Training machine learning deep learning 2017Training machine learning deep learning 2017
Training machine learning deep learning 2017
 
MLIP - Chapter 3 - Introduction to deep learning
MLIP - Chapter 3 - Introduction to deep learningMLIP - Chapter 3 - Introduction to deep learning
MLIP - Chapter 3 - Introduction to deep learning
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Deep learning
Deep learningDeep learning
Deep learning
 
AINL 2016: Filchenkov
AINL 2016: FilchenkovAINL 2016: Filchenkov
AINL 2016: Filchenkov
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
 
Rise of AI through DL
Rise of AI through DLRise of AI through DL
Rise of AI through DL
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Deep learning tutorial 9/2019
Deep learning tutorial 9/2019Deep learning tutorial 9/2019
Deep learning tutorial 9/2019
 
Neural Networks and Deep Learning Basics
Neural Networks and Deep Learning BasicsNeural Networks and Deep Learning Basics
Neural Networks and Deep Learning Basics
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introduction
 
Tsinghua invited talk_zhou_xing_v2r0
Tsinghua invited talk_zhou_xing_v2r0Tsinghua invited talk_zhou_xing_v2r0
Tsinghua invited talk_zhou_xing_v2r0
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introduction
 
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamScene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
 
Semantic, Cognitive and Perceptual Computing -Deep learning
Semantic, Cognitive and Perceptual Computing -Deep learning Semantic, Cognitive and Perceptual Computing -Deep learning
Semantic, Cognitive and Perceptual Computing -Deep learning
 
Machine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning SystemsMachine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning Systems
 
introduction to deeplearning
introduction to deeplearningintroduction to deeplearning
introduction to deeplearning
 

Recently uploaded

一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 

Recently uploaded (20)

一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 

Deep learning 1

  • 1. Introduction to Deep Learning M S Ram Dept. of Computer Science & Engg. Indian Institute of Technology Kanpur Reading of Chap. 1 from “Learning Deep Architectures for AI”; Yoshua Bengio; FTML Vol. 2, No. 1 (2009) 1–127 1 Date: 12 Nov, 2015
  • 2. A Motivational Task: Percepts  Concepts • Create algorithms • that can understand scenes and describe them in natural language • that can infer semantic concepts to allow machines to interact with humans using these concepts • Requires creating a series of abstractions • Image (Pixel Intensities)  Objects in Image  Object Interactions  Scene Description • Deep learning aims to automatically learn these abstractions with little supervision Courtesy: Yoshua Bengio, Learning Deep Architectures for AI 2
  • 3. Deep Visual-Semantic Alignments for Generating Image Descriptions (Karpathy, Fei-Fei; CVPR 2015) "boy is doing backflip on wakeboard." “two young girls are playing with lego toy.” "man in black shirt is playing guitar." "construction worker in orange safety vest is working on road." http://cs.stanford.edu/people/karpathy/deepimagesent/ 3
  • 4. Challenge in Modelling Complex Behaviour • Too many concepts to learn • Too many object categories • Too many ways of interaction between objects categories • Behaviour is a highly varying function underlying factors • f: L  V • L: latent factors of variation • low dimensional latent factor space • V: visible behaviour • high dimensional observable space • f: highly non-linear function 4
  • 5. Example: Learning the Configuration Space of a Robotic Arm 5
  • 7. How do We Train Deep Architectures? • Inspiration from mammal brain • Multiple Layers of “neurons” (Rumelhart et al 1986) • Train each layer to compose the representations of the previous layer to learn a higher level abstraction • Ex: Pixels  Edges  Contours  Object parts  Object categories • Local Features  Global Features • Train the layers one-by-one (Hinton et al 2006) • Greedy strategy 7
  • 8. Multilayer Perceptron with Back-propagation First deep learning model (Rumelhart, Hinton, Williams 1986) input vector hidden layers outputs Back-propagate error signal to get derivatives for learning Compare outputs with correct answer to get error signal Source: Hinton’s 2009 tutorial on Deep Belief Networks 8
  • 9. Drawbacks of Back-propagation based Deep Neural Networks • They are discriminative models • Get all the information from the labels • And the labels don’t give so much of information • Need a substantial amount of labeled data • Gradient descent with random initialization leads to poor local minima
  • 10. Hand-written digit recognition • Classification of MNIST hand-written digits • 10 digit classes • Input image: 28x28 gray scale • 784 dimensional input
  • 11. A Deeper Look at the Problem • One hidden layer with 500 neurons => 784 * 500 + 500 * 10 ≈ 0.4 million weights • Fitting a model that best explains the training data is an optimization problem in a 0.4 million dimensional space • It’s almost impossible for Gradient descent with random initialization to arrive at the global optimum
  • 12. A Solution – Deep Belief Networks (Hinton et al. 2006) Pre-trained N/W Weights Fast unsupervised pre-training Good Solution Slow Fine-tuning (Using Back-propagation) Very slow Back-propagation (Often gets stuck at poor local minima) Random Initial position Very high-dimensional parameter space
  • 13. A Solution – Deep Belief Networks (Hinton et al. 2006) • Before applying back-propagation, pre-train the network as a series of generative models • Use the weights of the pre-trained network as the initial point for the traditional back-propagation • This leads to quicker convergence to a good solution • Pre-training is fast; fine-tuning can be slow
  • 14. Quick Check: MLP vs DBN on MNIST • MLP (1 Hidden Layer) • 1 hour: 2.18% • 14 hours: 1.65% • DBN • 1 hour: 1.65% • 14 hours: 1.10% • 21 hours: 0.97% Intel QuadCore 2.83GHz, 4GB RAM MLP: Python :: DBN: Matlab
  • 15. Intermediate Representations in Brain • Disentanglement of factors of variation underlying the data • Distributed Representations • Activation of each neuron is a function of multiple features of the previous layer • Feature combinations of different neurons are not necessarily mutually exclusive • Sparse Representations • Only 1-4% neurons are active at a time 15 Localized Representation Distributed Representation
  • 16. Local vs. Distributed in Input Space • Local Methods • Assume smoothness prior • g(x) = f(g(x1), g(x2), …, g(xk)) • {x1, x2, …, xk} are neighbours of x • Require a metric space • A notion of distance or similarity in the input space • Fail when the target function is highly varying • Examples • Nearest Neighbour methods • Kernel methods with a Gaussian kernel • Distributed Methods • No assumption of smoothness  No need for a notion of similarity • Ex: Neural networks 16
  • 18. Desiderata for Learning AI • Ability to learn complex, highly-varying functions • Ability to learn multiple levels of abstraction with little human input • Ability to learn from a very large set of examples • Training time linear in the number of examples • Ability to learn from mostly unlabeled data • Unsupervised and semi-supervised • Multi-task learning • Sharing of representations across tasks • Fast predictions 18
  • 19. References  Primary  Yoshua Bengio, Learning Deep Architectures for AI, Foundations and Trends in Machine Learning Vol. 2, No. 1 (2009) 1–127  Hinton, G. E., Osindero, S. and Teh, Y. A fast learning algorithm for deep belief nets. Neural Computation 18 (2006), pp 1527-1554  Rumelhart, David E., Geoffrey E. Hinton, and R. J. Williams. Learning Internal Representations by Error Propagation. David E. Rumelhart, James L. McClelland, and the PDP research group. (editors), Parallel distributed processing: Explorations in the microstructure of cognition, Volume 1: Foundations. MIT Press, 1986.  Secondary  Hinton, G. E., Learning Multiple Layers of Representation, Trends in Cognitive Sciences, Vol. 11, (2007) pp 428-434.  Hinton G.E., Tutorial on Deep Belief Networks, Machine Learning Summer School, Cambridge, 2009  Andrej Karpathy, Li Fei-Fei. Deep Visual-Semantic Alignments for Generating Image Descriptions. CVPR 2015.