Deep Learning - What's the buzz all about

Deep Learning
What’s the buzz all about?
Dr. Debdoot Sheet
Assistant Professor
Department of Electrical Engineering
Indian Institute of Technology Kharagpur
www.facweb.iitkgp.ernet.in/~debdoot/

5 Aug. 2014 Deep Learning - What's the buzz all about? / Debdoot Sheet

Learning?
A computer program is said to learn from
experience E with respect to some class of
tasks T and performance measure P, if its
performance at tasks in T, as measured by
P, improves with experience E
-Tom Mitchell
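Mitchell's definition can be made concrete with a toy learner. The sketch below is illustrative only: the task (deciding whether a reading is "high"), the hidden rule, the readings and the function names are all assumptions, not from the talk. It shows performance P (accuracy) improving as experience E (labelled examples) grows on a fixed task T.

```python
# Task T: decide whether a reading is "high" (hidden rule: reading >= 30).
# Experience E: the labelled readings seen so far.
# Performance P: accuracy on a fixed set of test readings.

TRUE_THRESHOLD = 30  # hypothetical hidden rule, used only to label data


def label(reading):
    return reading >= TRUE_THRESHOLD


def learn_threshold(experience):
    """Put the threshold midway between the largest 'low' and smallest 'high' reading."""
    lows = [x for x, y in experience if not y]
    highs = [x for x, y in experience if y]
    return (max(lows) + min(highs)) / 2


def accuracy(threshold, test_readings):
    correct = sum((x >= threshold) == label(x) for x in test_readings)
    return correct / len(test_readings)


readings = [0, 100, 10, 80, 20, 60, 25, 40, 28, 33]  # made-up training readings
experience = [(x, label(x)) for x in readings]
test_readings = list(range(100))

accuracies = []
for n in range(2, len(experience) + 1, 2):
    threshold = learn_threshold(experience[:n])
    accuracies.append(accuracy(threshold, test_readings))
    print(f"E = {n:2d} examples -> P = {accuracies[-1]:.2f}")
```

Each extra pair of examples tightens the learned threshold, so the printed accuracy rises with the amount of experience.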

Demystifying Learning
[Figure: photo with detections labelled Man 1 to Man 4 (Debdoot, Kim, Jung, Wang), Great Wall logo and Great Wall tower]
Experience (E)
Performance (P)
Debdoot, Kim, Jung and Wang are standing near the
Great Wall logo and the Great Wall tower is behind them.

How was it Learning?
[Figure: photo with detections labelled Man 1 to Man 4 (Debdoot, Kim, Jung, Wang), Great Wall logo and Great Wall tower]
Salient Segments
Objectify
Detect humans
Recognize humans
Recognize inanimate
Describe Scene
Debdoot, Kim, Jung and Wang are standing near the
Great Wall logo and the Great Wall tower is behind them.

Challenges
[Figure: the pipeline below appears three times on the slide]
Salient Segments
Objectify
Detect humans
Recognize humans
Recognize inanimate
Describe Scene

Challenges
Salient Segments
Objectify
Detect humans: LBP, Wavelets, HoG, Body part recognition, Human appearance
Recognize human: Chroma clustering, Posture realign, Silhouette matching
Recognize humans
Recognize inanimate
Describe Scene

Dilemma only in Machine Vision?
• Speech Recognition and Signal Processing
– Dahl et al. (2012). Context-dependent pre-trained deep neural networks
for large vocabulary speech recognition. IEEE Trans. Audio, Speech, and
Language Processing, 20(1), 33–42.
• Handwriting Recognition
– Bengio et al. (2006). Greedy layer-wise training of deep networks. In
NIPS’2006.
• Natural Language Processing
– Hinton (1986). Learning distributed representations of concepts. In
Proc. 8th Conf. Cog. Sci. Society, pp. 1–12.
• Hierarchical and Transfer Learning
– Goodfellow et al. (2011). Spike-and-slab sparse coding for unsupervised
feature discovery. In NIPS’2011 Workshop on Challenges in Learning
Hierarchical Models.
• Medical Imaging
– Karri, Sheet, et al. (2014). Deep learnt random forests for
segmentation of retinal layers in optical coherence tomography images.
In ISBI’2014.

How to tackle this dilemma?
Great Wall (behind)
Great Wall logo (beside)
Debdoot, Kim, Jung, Wang

Multilayer Perceptron (MLP)
[Figure: multilayer perceptron diagram; label: Hidden layers]
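A forward pass through a multilayer perceptron can be sketched in a few lines. This is a minimal illustration under assumptions that are not from the slide: the layer sizes, the sigmoid activation, the random initialisation and all function names are made up for the example.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def layer_forward(inputs, weights, biases):
    """One fully connected layer: sigmoid(W x + b), one output per weight row."""
    return [sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]


def init_layer(n_in, n_out, rng):
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def mlp_forward(inputs, layers):
    """Feed the input through every layer in turn."""
    activation = inputs
    for weights, biases in layers:
        activation = layer_forward(activation, weights, biases)
    return activation


rng = random.Random(0)
layer_sizes = [4, 5, 5, 5, 2]  # input, three hidden layers, output (illustrative sizes)
layers = [init_layer(n_in, n_out, rng)
          for n_in, n_out in zip(layer_sizes, layer_sizes[1:])]
output = mlp_forward([0.2, -0.1, 0.7, 0.5], layers)
print(output)
```

Only the forward computation is shown; training the weights is a separate step.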

MLP Learning, troubles thereof
[Figure: plot with axes P, T1 and T2]
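One common source of trouble when training by gradient descent is that the end point depends on where the search starts. The sketch below assumes a made-up error surface over two parameters (named t1 and t2 after the axis labels) with two basins; the surface, learning rate and start points are illustrative and not taken from the talk.

```python
def loss(t1, t2):
    """Hypothetical non-convex error surface with one basin near t1 = -1 and one near t1 = +1."""
    return (t1 ** 2 - 1) ** 2 + 0.3 * t1 + t2 ** 2


def gradient(t1, t2):
    """Partial derivatives of loss with respect to t1 and t2."""
    return 4 * t1 * (t1 ** 2 - 1) + 0.3, 2 * t2


def descend(t1, t2, learning_rate=0.05, steps=200):
    """Plain gradient descent from the given starting point."""
    for _ in range(steps):
        g1, g2 = gradient(t1, t2)
        t1 -= learning_rate * g1
        t2 -= learning_rate * g2
    return t1, t2


end_right = descend(1.5, 1.0)   # starts in the right-hand basin
end_left = descend(-1.5, 1.0)   # starts in the left-hand basin

print("start t1=+1.5 ->", end_right, "loss", loss(*end_right))
print("start t1=-1.5 ->", end_left, "loss", loss(*end_left))
```

Both runs converge, but the run started on the right settles in the shallower basin and ends with a higher error than the run started on the left.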

MLP Learning troubles, why so?
[Figure: plot with axes P, T1 and T2]
LBP, Wavelets, HoG, Body part recognition, Human appearance

[Figure: plot with axes P, T1 and T2]
Chroma clustering, Posture realign, Silhouette matching, Recognize human

[Figure: plot with axes P, T1 and T2]
?

Deep Learning, origin and growth
• Around 1950 – NN age
– Neural Nets (McCulloch and Pitts,
1943)
– Unsupervised Learn. (Hebb, 1949)
– Supervised Learn. (Rosenblatt, 1958)
– Associative Memory (Palm, 1980;
Hopfield, 1982)
• 1960
– Discovery of visual sensory cells that
respond to Edges (Hubel and Wiesel,
1962)
– Feed Forward Multi Layer Perceptron
(FF-MLP) (Ivakhnenko, 1968)
• 1980 – Neocognitron
– Convolution + WeightReplication +
Subsampling (Fukushima, 1980)
– Max Pooling
– Back-propagation (Werbos, 1981;
LeCun, 1985, 1988)

• 1980-2000 – Search for simple,
low-complexity, problem-solvers
– Recurrent Neural Network (RNN)
(Hochreiter and Schmidhuber, 1996)
– Local learning Feed forward NN
(Dayan and Hinton, 1996)
– Advanced gradient descent
(Schaback and Werner, 1992)
– Sequential Network Construction
(Honavar and Uhr, 1988)
– Unsupervised Pre-training (Ritter
and Kohonen, 1989)
– Auto-Encoder (Hinton et al., 1989)
– Back Propagating Convolutional
Neural Networks (LeCun et al., 1989,
1990a, 1998)

• 2000 – Era of Deep Learning
– NIPS 2003 Feature Selection
Challenge (Neal and Zhang, 2006)
– MNIST digit recognition (LeCun et
al., 1989)
– Deep Belief Network (DBN) /
Restricted Boltzmann Machines
(Hinton et al., 2006)
– Auto Encoders (Bengio, 2009)
• 2006
– GPU based CNN (Chellapilla et al.,
2006)
• 2009
– GPU DBN (Raina et al., 2009)
• 2011
– Max-Pooling CNN on the GPU
(Ciresan et al., 2011)
• 2012
– ImageNet (Krizhevsky et al., 2012)
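The convolution, weight replication, subsampling and max pooling entries in the timeline name operations that are easy to show on a toy signal. The sketch below is a one-dimensional illustration only; the signal, the kernel and the function names are assumptions, and it is not meant to reproduce any of the cited architectures.

```python
def convolve_valid(signal, kernel):
    """Slide one shared kernel over the signal (weight replication).

    As is usual in convolutional networks, the kernel is not flipped,
    so strictly speaking this is a cross-correlation.
    """
    k = len(kernel)
    return [sum(kernel[j] * signal[i + j] for j in range(k))
            for i in range(len(signal) - k + 1)]


def max_pool(values, size):
    """Subsample by keeping the maximum of each non-overlapping window."""
    return [max(values[i:i + size])
            for i in range(0, len(values) - size + 1, size)]


signal = [0, 0, 1, 1, 1, 0, 0, 1, 0, 0]
edge_kernel = [-1, 1]  # responds to a rising edge
feature_map = convolve_valid(signal, edge_kernel)
pooled = max_pool(feature_map, 3)

print(feature_map)  # [0, 1, 0, 0, -1, 0, 1, -1, 0]
print(pooled)       # [1, 0, 1]
```

The same two weights are reused at every position, and pooling keeps only whether an edge occurred somewhere in each window.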

Science fiction or reality
Synthetic facial expression
Machine that can dream
Susskind, Joshua M., et al. "Generating facial
expressions with deep belief nets." Affective
Computing, Emotion Modelling, Synthesis and
Recognition (2008): 421-440.

Deep Learning, where
• Facebook
• Baidu – IDL
• Cortica
• Cortexica
• Google
– DNNresearch
– Deep Mind
• Microsoft
– DNN
– Adam project

my experiences with
Deep Learning (or not) for
COMPUTATIONAL MEDICAL IMAGING
[Equations garbled in extraction: functions f_1, ..., f_n and g, each conditioned on x]

(Not so Deep) Transfer Learning
D. Sheet et al., Iterative self-organizing atherosclerotic tissue labeling in intravascular ultrasound images and comparison with virtual histology, IEEE TBME, 59(11), 2012.
D. Sheet et al., Hunting for necrosis in the shadows of intravascular ultrasound, CMIG, 38(2), 2014.
D. Sheet et al., Joint learning of ultrasonic backscattering statistical physics and signal confidence primal for characterizing atherosclerotic plaques using intravascular ultrasound, Med. Image Anal., 18(1), 2014.
Training: Training set of IVUS signals + Ground truth -> Multi-scale speckle stats. -> Random forest learning
Testing: Test IVUS signal -> Multi-scale speckle stats. -> Labeled tissue
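The idea of describing each sample by statistics computed at several neighbourhood sizes can be sketched as follows. This is a loose illustration and not the method of the cited papers: plain local mean and variance stand in for the speckle statistics, the signal values are made up, and the function names are hypothetical. The resulting feature rows are the kind of input a classifier such as a random forest would be trained on.

```python
def window_stats(signal, center, half_width):
    """Mean and variance of the samples within half_width of center (clipped at the ends)."""
    lo = max(0, center - half_width)
    hi = min(len(signal), center + half_width + 1)
    window = signal[lo:hi]
    mean = sum(window) / len(window)
    variance = sum((v - mean) ** 2 for v in window) / len(window)
    return mean, variance


def multiscale_features(signal, center, half_widths=(1, 2, 4)):
    """Concatenate the window statistics computed at each scale."""
    features = []
    for half_width in half_widths:
        features.extend(window_stats(signal, center, half_width))
    return features


signal = [0.1, 0.2, 0.1, 0.9, 1.1, 0.8, 1.0, 0.2, 0.1, 0.1]  # made-up samples
feature_table = [multiscale_features(signal, i) for i in range(len(signal))]

for i, row in enumerate(feature_table):
    print(i, [round(v, 3) for v in row])
```

Each row holds a mean and a variance per scale, so three scales give six features per sample.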

Local Minima Juggling, experience
[Figure: plot with axes P, T1 and T2]
LBP, Wavelets, HoG, Body part recognition, Human appearance
Chroma clustering, Posture realign, Silhouette matching, Recognize human

Transfer vs. Full Deep Architecture
[Figure: two block diagrams; labels listed per diagram]
Training image set; Ground truth; Multi-scale modeling of OCT speckles; Random forest learning; Test image; Multi-scale modeling of OCT speckles; Labeled tissue
Training image set; Ground truth; Convolution masks; Stacked Auto-Encoders, Logistic Regression; Random Forest
http://www.facweb.iitkgp.ernet.in/~debdoot/current.html

Endnote (almost there)
“It’s like in quantum physics at the beginning of the
20th century” Trishul Chilimbi (MSR, DNN, Adam)
“The experimentalists and practitioners were ahead
of the theoreticians. They couldn’t explain the
results. We appear to be at a similar stage with
DNNs. We’re realizing the power and the
capabilities, but we still don’t understand the
fundamentals of exactly how they work.”

Take home message
• Challenges
– Architectures
• Neural Nets vs. Others
– Implementation
• CPU vs. GPU vs. Cloud
– GPU (VLSI) architectures
• Hierarchical Temporal
Memory
• Potential Causal Connection
• Toolboxes
– Theano (Python/SciPy)
– Pylearn2
• More information
– www.deeplearning.net
– Schmidhuber (2014). Deep
Learning in Neural
Networks: An Overview
(arXiv:1404.7828)
– Bengio (2009). Learning
Deep Architectures for AI.
– Deng and Yu (2013). Deep
Learning: Methods and
Applications.
• Conferences
– Int. Conf. Learning
Representations (ICLR)
