SlideShare a Scribd company logo
1 of 25
Deep Learning
What’s the buzz all about?
Dr. Debdoot Sheet
Assistant Professor
Department of Electrical Engineering
Indian Institute of Technology Kharagpur
www.facweb.iitkgp.ernet.in/~debdoot/
5 Aug. 2014 2Deep Learning - What's the buzz all about? / Debdoot Sheet
Learning?
A computer program is said to learn from
experience E with respect to some class of
tasks T and performance measure P, if its
performance at tasks in T, as measured by
P, improves with experience E
-Tom Mitchell
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 3
Demystifying Learning
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 4
Man 1 Man 2 Man 3Man 4
Great Wall logo
Great Wall tower
Kim Jung
WangDebdoot
Experience (E)
Performance(P)
Debdoot, Kim, Jung and Wang are standing near the
Great Wall logo and the Great Wall tower is behind them.
How was it Learning?
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 5
Man 1 Man 2 Man 3Man 4
Great Wall logo
Great Wall tower
Kim Jung
WangDebdoot
Salient Segments
Objectify
Detect
humans
Recognize
inanimate
Describe Scene
Debdoot, Kim, Jung and Wang are standing near the
Great Wall logo and the Great Wall tower is behind them.
Recognize
humans
Challenges
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 6
Salient Segments
Objectify
Detect
humans
Recognize
inanimate
Describe Scene
Recognize
humans
Salient Segments
Objectify
Detect
humans
Recognize
inanimate
Describe Scene
Recognize
humans
Salient Segments
Objectify
Detect
humans
Recognize
inanimate
Describe Scene
Recognize
humans
Challenges
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 7
Salient Segments
Objectify
Recognize
inanimate
Describe Scene
Recognize
humans
LBP
Wavelets
HoG
Body part
recognition
Human
appearance
Chroma
clustering
Posture
realign Silhouette
matching
Recognize
human
Detect
humans
Dilemma only in Machine Vision?
• Speech Recognition and Signal Processing
– Dahl, et al,(2012). Context dependent pre-trained deep neural networks
for large vocabulary speech recognition. IEEE Trans. Audio, Speech, and
Language Processing, 20(1), 33–42.
• Handwriting Recognition
– Bengio, et al (2006). Greedy layer-wise training of deep networks. In
NIPS’2006.
• Natural Language Processing
– Hinton, (1986). Learning distributed representations of concepts. In
Proc. 8th Conf. Cog. Sc. Society, pp. 1–12.
• Hierarchical and Transfer Learning
– Goodfellow, et al. (2011). Spike-and slab sparse coding for unsupervised
feature discovery. In NIPS’2011 Workshop on Challenges in Learning
Hierarchical Models.
• Medical Imaging
– Karri and Sheet, et al (2014). Deep learnt random forests for
segmentation of retinal layers in optical coherence tomography images.
In ISBI’2014.
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 8
How to tackle this dilemma?
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 9
Great Wall
behind
Great Wall
logo beside
Debdoot, Kim,
Jung, Wang
Multilayer Perceptron (MLP)
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 10
Hiddenlayers
Hiddenlayers
Hiddenlayers
MLP Learning, troubles thereof
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 11
P
T1
T2
MLP Learning troubles, why so?
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 12
P
T1
T2
LBP
Wavelets
HoG
Body part
recognition
Human
appearance
MLP Learning troubles, why so?
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 13
P
T1
T2
Chroma
clustering
Posture
realign Silhouette
matching
Recognize
human
MLP Learning troubles, why so?
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 14
P
T1
T2
?
Deep Learning, origin and growth
• Around 1950 – NN age
– Neural Nets (McCulloch and Pitts,
1943)
– Unsupervised Learn. (Hebb, 1949)
– Supervised Learn. (Rosenblatt, 1958)
– Associative Memory (Palm, 1980;
Hopfield, 1982)
• 1960
– Discovery of visual sensory cells that
respond to Edges (Hubel and Wiesel,
1962)
– Feed Forward Multi Layer Perceptron
(FF-MLP) (Ivakhnenko, 1968)
• 1980 – Neocognition
– Convolution + WeightReplication +
Subsampling (Fukushima, 1980)
– Max Pooling
– Back-propagation (Werbos, 1981;
LeCunn, 1985, 1988)
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 15
Deep Learning, origin and growth
• 1980-2000 – Search for simple,
low-complexity, problem-solvers
– Recurrent Neural Network (RNN)
(Hochreiter and Schmidhuber, 1996)
– Local learning Feed forward NN
(Dayan and Hinton, 1996)
– Advanced gradient descent
(Schaback and Werner, 1992)
– Sequential Network Construction
(Honavar and Uhr, 1988)
– Unsupervised Pre-training (Ritter
and Kohonen, 1989)
– Auto-Encoder (Hinton et al., 1989)
– Back Propagating Convolutional
Neural Networks (LeCun et al., 1989,
1990a, 1998)
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 16
Deep Learning, origin and growth
• 2000 – Era of Deep Learning
– NIPS 2003 Feature Selection
Challenge (Neal and Zhang, 2006)
– MNIST digit recognition (LeCun et
al., 1989)
– Deep Belief Network (DBN) /
Restricted Boltzmann Machines
(Hinton et al., 2006)
– Auto Encoders (Bengio, 2009)
• 2006
– GPU based CNN (Chellapilla et al.,
2006)
• 2009
– GPU DBN (Raina et al., 2009)
• 2011
– Max-Pooling CNN on the GPU
(Ciresan et al., 2011)
• 2012
– Image Net (Krizhevsky et al., 2012)
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 17
Science fiction or reality
Synthetic facial expression Machine that can dream
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 18
Susskind, Joshua M., et al. "Generating facial
expressions with deep belief nets." Affective
Computing, Emotion Modelling, Synthesis and
Recognition (2008): 421-440.
Deep Learning, where
• Facebook
• Baidu – IDL
• Cortica
• Cortexica
• Google
– DNNresearch
– Deep Mind
• Microsoft
– DNN
– Adam project
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 19
COMPUTATIONAL
MEDICAL IMAGING
my experiences with
Deep Learning (or not) for
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 20
    xx |,,|11 nnff  
  x,,,| 1 ng  
(Not so Deep) Transfer Learning
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 21
D. Sheet, et al, Iterative self-
organizing atherosclerotic tissue
labeling in intravascular ultrasound
images and comparison with virtual
histology, IEEE TBME, 59(11), 2012
D. Sheet, et al, Hunting for necrosis
in the shadows of intravascular
ultrasound, CMIG, 38(2), 2014
D. Sheet, et al, Joint learning of
ultrasonic backscattering statistical
physics and signal confidence primal for
characterizing atherosclerotic plaques
using intravascular ultrasound, Med.
Image Anal,18(1), 2014
Training set
of IVUS
signals
Ground
truth
Random forest
learning
Test IVUS
signal
Labeled
tissue
Multi-scale
Speckle stats.
Multi-scale
Speckle stats.
Local Minima Juggling, experience
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 22
P
T1
T2
LBP
Wavelets
HoG
Body part
recognition
Human
appearance
Chroma
clustering
Posture
realign Silhouette
matching
Recognize
human
Transfer vs. Full Deep Architecture
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 23
Multi-scale
modeling of
OCT speckles
Training
image
set
Ground
truth
Random forest
learning
Multi-scale
modeling of
OCT speckles
Test image
Labeled
tissue
Stacked Auto-
Encoders,
Logistic
Regression
Random
Forest
Training
image
set
Ground
truth
Convolution
masks
http://www.facweb.iitkgp.ernet.in/~debdoot/current.html
Endnote (almost there)
“It’s like in quantum physics at the beginning of the
20th century” Trishul Chilimbi (MSR, DNN, Adam)
“The experimentalists and practitioners were ahead
of the theoreticians. They couldn’t explain the
results. We appear to be at a similar stage with
DNNs. We’re realizing the power and the
capabilities, but we still don’t understand the
fundamentals of exactly how they work.”
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 24
Take home message
• Challenges
– Architectures
• Neural Nets vs. Others
– Implementation
• CPU vs. GPU vs. Cloud
– GPU (VLSI) architectures
• Hierarchical Temporal
Memory
• Potential Causal Connection
• Toolboxes
– Theano (Python/SciPy)
– Pylearn2
• More information
– www.deeplearning.net
– Schmidhuber (2014). Deep
Learning in Neural
Networks: An Overview
(arXiv:1404.7828)
– Bengio (2009). Learning
Deep Architectures for AI.
– Deng and Yu (2013). Deep
Learning: Methods and
Applications.
• Conferences
– Int. Conf. Learning
Representations (ICLR)
5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 25

More Related Content

What's hot

automated perimeters
automated perimetersautomated perimeters
automated perimeters
gueste49b03
 
International Journal of Image Processing (IJIP) Volume (3) Issue (6)
International Journal of Image Processing (IJIP) Volume (3) Issue (6)International Journal of Image Processing (IJIP) Volume (3) Issue (6)
International Journal of Image Processing (IJIP) Volume (3) Issue (6)
CSCJournals
 

What's hot (7)

automated perimeters
automated perimetersautomated perimeters
automated perimeters
 
Variational formulation of unsupervised deep learning for ultrasound image ar...
Variational formulation of unsupervised deep learning for ultrasound image ar...Variational formulation of unsupervised deep learning for ultrasound image ar...
Variational formulation of unsupervised deep learning for ultrasound image ar...
 
A BRIEF OVERVIEW ON DIFFERENT ANIMAL DETECTION METHODS
A BRIEF OVERVIEW ON DIFFERENT ANIMAL DETECTION METHODSA BRIEF OVERVIEW ON DIFFERENT ANIMAL DETECTION METHODS
A BRIEF OVERVIEW ON DIFFERENT ANIMAL DETECTION METHODS
 
International Journal of Image Processing (IJIP) Volume (3) Issue (6)
International Journal of Image Processing (IJIP) Volume (3) Issue (6)International Journal of Image Processing (IJIP) Volume (3) Issue (6)
International Journal of Image Processing (IJIP) Volume (3) Issue (6)
 
Advanced Retinal Imaging
Advanced Retinal ImagingAdvanced Retinal Imaging
Advanced Retinal Imaging
 
2017 07 03_meetup_d
2017 07 03_meetup_d2017 07 03_meetup_d
2017 07 03_meetup_d
 
Bayesian Nonparametric Motor-skill Representations for Efficient Learning of ...
Bayesian Nonparametric Motor-skill Representations for Efficient Learning of ...Bayesian Nonparametric Motor-skill Representations for Efficient Learning of ...
Bayesian Nonparametric Motor-skill Representations for Efficient Learning of ...
 

Similar to Deep Learning - What's the buzz all about

Case study on deep learning
Case study on deep learningCase study on deep learning
Case study on deep learning
HarshitBarde
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
VishnuRajuV
 
From Silver Bullets to First Principles: Effectively Leveraging Technology in...
From Silver Bullets to First Principles: Effectively Leveraging Technology in...From Silver Bullets to First Principles: Effectively Leveraging Technology in...
From Silver Bullets to First Principles: Effectively Leveraging Technology in...
Peter Doolittle
 
Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...
Ana Luísa Pinho
 
Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...
Ana Luísa Pinho
 
Lecture 08 larry zitnick - undestanding and describing scenes
Lecture 08   larry zitnick - undestanding and describing scenesLecture 08   larry zitnick - undestanding and describing scenes
Lecture 08 larry zitnick - undestanding and describing scenes
mustafa sarac
 
34th.余凯.机器学习进展及语音图像中的应用
34th.余凯.机器学习进展及语音图像中的应用34th.余凯.机器学习进展及语音图像中的应用
34th.余凯.机器学习进展及语音图像中的应用
komunling
 
Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...
Ana Luísa Pinho
 
Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...
Ana Luísa Pinho
 

Similar to Deep Learning - What's the buzz all about (20)

Rise of AI through DL
Rise of AI through DLRise of AI through DL
Rise of AI through DL
 
Case study on deep learning
Case study on deep learningCase study on deep learning
Case study on deep learning
 
Brains@Bay Meetup: Open-ended Skill Acquisition in Humans and Machines: An Ev...
Brains@Bay Meetup: Open-ended Skill Acquisition in Humans and Machines: An Ev...Brains@Bay Meetup: Open-ended Skill Acquisition in Humans and Machines: An Ev...
Brains@Bay Meetup: Open-ended Skill Acquisition in Humans and Machines: An Ev...
 
Dli milano rl_parton_sep
Dli milano rl_parton_sepDli milano rl_parton_sep
Dli milano rl_parton_sep
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
 
Materi_01_VK_2223_3.pdf
Materi_01_VK_2223_3.pdfMateri_01_VK_2223_3.pdf
Materi_01_VK_2223_3.pdf
 
From Silver Bullets to First Principles: Effectively Leveraging Technology in...
From Silver Bullets to First Principles: Effectively Leveraging Technology in...From Silver Bullets to First Principles: Effectively Leveraging Technology in...
From Silver Bullets to First Principles: Effectively Leveraging Technology in...
 
final-day1-july2.pptx
final-day1-july2.pptxfinal-day1-july2.pptx
final-day1-july2.pptx
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...
 
Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...
 
Introduction to Visual Analysis
Introduction to Visual AnalysisIntroduction to Visual Analysis
Introduction to Visual Analysis
 
Lecture 08 larry zitnick - undestanding and describing scenes
Lecture 08   larry zitnick - undestanding and describing scenesLecture 08   larry zitnick - undestanding and describing scenes
Lecture 08 larry zitnick - undestanding and describing scenes
 
AI&BigData Lab. Артем Чернодуб "Распознавание изображений методом Lazy Deep ...
AI&BigData Lab. Артем Чернодуб  "Распознавание изображений методом Lazy Deep ...AI&BigData Lab. Артем Чернодуб  "Распознавание изображений методом Lazy Deep ...
AI&BigData Lab. Артем Чернодуб "Распознавание изображений методом Lazy Deep ...
 
Iciap 2
Iciap 2Iciap 2
Iciap 2
 
34th.余凯.机器学习进展及语音图像中的应用
34th.余凯.机器学习进展及语音图像中的应用34th.余凯.机器学习进展及语音图像中的应用
34th.余凯.机器学习进展及语音图像中的应用
 
Research in Deep Learning: A Perspective from NSF
Research in Deep Learning: A Perspective from NSFResearch in Deep Learning: A Perspective from NSF
Research in Deep Learning: A Perspective from NSF
 
Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...
 
Growing-up With AI
Growing-up With AIGrowing-up With AI
Growing-up With AI
 
Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...Individual functional atlasing of the human brain with multitask fMRI data: l...
Individual functional atlasing of the human brain with multitask fMRI data: l...
 

Recently uploaded

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Recently uploaded (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Deep Learning - What's the buzz all about

  • 1. Deep Learning What’s the buzz all about? Dr. Debdoot Sheet Assistant Professor Department of Electrical Engineering Indian Institute of Technology Kharagpur www.facweb.iitkgp.ernet.in/~debdoot/
  • 2. 5 Aug. 2014 2Deep Learning - What's the buzz all about? / Debdoot Sheet
  • 3. Learning? A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E -Tom Mitchell 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 3
  • 4. Demystifying Learning 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 4 Man 1 Man 2 Man 3Man 4 Great Wall logo Great Wall tower Kim Jung WangDebdoot Experience (E) Performance(P) Debdoot, Kim, Jung and Wang are standing near the Great Wall logo and the Great Wall tower is behind them.
  • 5. How was it Learning? 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 5 Man 1 Man 2 Man 3Man 4 Great Wall logo Great Wall tower Kim Jung WangDebdoot Salient Segments Objectify Detect humans Recognize inanimate Describe Scene Debdoot, Kim, Jung and Wang are standing near the Great Wall logo and the Great Wall tower is behind them. Recognize humans
  • 6. Challenges 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 6 Salient Segments Objectify Detect humans Recognize inanimate Describe Scene Recognize humans Salient Segments Objectify Detect humans Recognize inanimate Describe Scene Recognize humans Salient Segments Objectify Detect humans Recognize inanimate Describe Scene Recognize humans
  • 7. Challenges 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 7 Salient Segments Objectify Recognize inanimate Describe Scene Recognize humans LBP Wavelets HoG Body part recognition Human appearance Chroma clustering Posture realign Silhouette matching Recognize human Detect humans
  • 8. Dilemma only in Machine Vision? • Speech Recognition and Signal Processing – Dahl, et al,(2012). Context dependent pre-trained deep neural networks for large vocabulary speech recognition. IEEE Trans. Audio, Speech, and Language Processing, 20(1), 33–42. • Handwriting Recognition – Bengio, et al (2006). Greedy layer-wise training of deep networks. In NIPS’2006. • Natural Language Processing – Hinton, (1986). Learning distributed representations of concepts. In Proc. 8th Conf. Cog. Sc. Society, pp. 1–12. • Hierarchical and Transfer Learning – Goodfellow, et al. (2011). Spike-and slab sparse coding for unsupervised feature discovery. In NIPS’2011 Workshop on Challenges in Learning Hierarchical Models. • Medical Imaging – Karri and Sheet, et al (2014). Deep learnt random forests for segmentation of retinal layers in optical coherence tomography images. In ISBI’2014. 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 8
  • 9. How to tackle this dilemma? 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 9 Great Wall behind Great Wall logo beside Debdoot, Kim, Jung, Wang
  • 10. Multilayer Perceptron (MLP) 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 10 Hiddenlayers Hiddenlayers Hiddenlayers
  • 11. MLP Learning, troubles thereof 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 11 P T1 T2
  • 12. MLP Learning troubles, why so? 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 12 P T1 T2 LBP Wavelets HoG Body part recognition Human appearance
  • 13. MLP Learning troubles, why so? 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 13 P T1 T2 Chroma clustering Posture realign Silhouette matching Recognize human
  • 14. MLP Learning troubles, why so? 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 14 P T1 T2 ?
  • 15. Deep Learning, origin and growth • Around 1950 – NN age – Neural Nets (McCulloch and Pitts, 1943) – Unsupervised Learn. (Hebb, 1949) – Supervised Learn. (Rosenblatt, 1958) – Associative Memory (Palm, 1980; Hopfield, 1982) • 1960 – Discovery of visual sensory cells that respond to Edges (Hubel and Wiesel, 1962) – Feed Forward Multi Layer Perceptron (FF-MLP) (Ivakhnenko, 1968) • 1980 – Neocognition – Convolution + WeightReplication + Subsampling (Fukushima, 1980) – Max Pooling – Back-propagation (Werbos, 1981; LeCunn, 1985, 1988) 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 15
  • 16. Deep Learning, origin and growth • 1980-2000 – Search for simple, low-complexity, problem-solvers – Recurrent Neural Network (RNN) (Hochreiter and Schmidhuber, 1996) – Local learning Feed forward NN (Dayan and Hinton, 1996) – Advanced gradient descent (Schaback and Werner, 1992) – Sequential Network Construction (Honavar and Uhr, 1988) – Unsupervised Pre-training (Ritter and Kohonen, 1989) – Auto-Encoder (Hinton et al., 1989) – Back Propagating Convolutional Neural Networks (LeCun et al., 1989, 1990a, 1998) 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 16
  • 17. Deep Learning, origin and growth • 2000 – Era of Deep Learning – NIPS 2003 Feature Selection Challenge (Neal and Zhang, 2006) – MNIST digit recognition (LeCun et al., 1989) – Deep Belief Network (DBN) / Restricted Boltzmann Machines (Hinton et al., 2006) – Auto Encoders (Bengio, 2009) • 2006 – GPU based CNN (Chellapilla et al., 2006) • 2009 – GPU DBN (Raina et al., 2009) • 2011 – Max-Pooling CNN on the GPU (Ciresan et al., 2011) • 2012 – Image Net (Krizhevsky et al., 2012) 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 17
  • 18. Science fiction or reality Synthetic facial expression Machine that can dream 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 18 Susskind, Joshua M., et al. "Generating facial expressions with deep belief nets." Affective Computing, Emotion Modelling, Synthesis and Recognition (2008): 421-440.
  • 19. Deep Learning, where • Facebook • Baidu – IDL • Cortica • Cortexica • Google – DNNresearch – Deep Mind • Microsoft – DNN – Adam project 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 19
  • 20. COMPUTATIONAL MEDICAL IMAGING my experiences with Deep Learning (or not) for 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 20     xx |,,|11 nnff     x,,,| 1 ng  
  • 21. (Not so Deep) Transfer Learning 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 21 D. Sheet, et al, Iterative self- organizing atherosclerotic tissue labeling in intravascular ultrasound images and comparison with virtual histology, IEEE TBME, 59(11), 2012 D. Sheet, et al, Hunting for necrosis in the shadows of intravascular ultrasound, CMIG, 38(2), 2014 D. Sheet, et al, Joint learning of ultrasonic backscattering statistical physics and signal confidence primal for characterizing atherosclerotic plaques using intravascular ultrasound, Med. Image Anal,18(1), 2014 Training set of IVUS signals Ground truth Random forest learning Test IVUS signal Labeled tissue Multi-scale Speckle stats. Multi-scale Speckle stats.
  • 22. Local Minima Juggling, experience 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 22 P T1 T2 LBP Wavelets HoG Body part recognition Human appearance Chroma clustering Posture realign Silhouette matching Recognize human
  • 23. Transfer vs. Full Deep Architecture 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 23 Multi-scale modeling of OCT speckles Training image set Ground truth Random forest learning Multi-scale modeling of OCT speckles Test image Labeled tissue Stacked Auto- Encoders, Logistic Regression Random Forest Training image set Ground truth Convolution masks http://www.facweb.iitkgp.ernet.in/~debdoot/current.html
  • 24. Endnote (almost there) “It’s like in quantum physics at the beginning of the 20th century” Trishul Chilimbi (MSR, DNN, Adam) “The experimentalists and practitioners were ahead of the theoreticians. They couldn’t explain the results. We appear to be at a similar stage with DNNs. We’re realizing the power and the capabilities, but we still don’t understand the fundamentals of exactly how they work.” 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 24
  • 25. Take home message • Challenges – Architectures • Neural Nets vs. Others – Implementation • CPU vs. GPU vs. Cloud – GPU (VLSI) architectures • Hierarchical Temporal Memory • Potential Causal Connection • Toolboxes – Theano (Python/SciPy) – Pylearn2 • More information – www.deeplearning.net – Schmidhuber (2014). Deep Learning in Neural Networks: An Overview (arXiv:1404.7828) – Bengio (2009). Learning Deep Architectures for AI. – Deng and Yu (2013). Deep Learning: Methods and Applications. • Conferences – Int. Conf. Learning Representations (ICLR) 5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet 25