Slide 3
Visual Attention in Human Perception
Visual attention in human perception is based on the dynamics between recognition and selection.
(Image source: John Henderson and Taylor Hayes, UC Davis)
Slide 4
Timeline:
Visual Attention in Human Perception (since the 1990s)
Visual Attention in Image Captioning (since 2014)
Slides 6-8
Visual Attention in Image Captioning
Encoder: CNN. Convolutional features entangle spatial with semantic information.
Decoder: RNN. Visual attention makes it possible to recover the spatial relations of the current outputs.
[Figure: CNN encoder and RNN decoder connected by visual attention]
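To make the mechanism concrete, here is a minimal NumPy sketch of one soft visual-attention step in the spirit of Show, Attend and Tell (Xu et al., 2015). The additive scoring function and the weight names (W_f, W_h, v) are illustrative assumptions, not the deck's exact parameterization.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def visual_attention(features, hidden, W_f, W_h, v):
    """One soft visual-attention step: score every spatial location of
    the CNN feature map against the current RNN decoder state, then
    return the attention-weighted context vector."""
    # features: (L, D) -- L spatial locations, D channels
    # hidden:   (H,)   -- current decoder hidden state
    scores = np.tanh(features @ W_f + hidden @ W_h) @ v  # (L,) additive scores
    alpha = softmax(scores)        # attention weights over image locations
    context = alpha @ features     # (D,) spatially focused feature vector
    return context, alpha

# Toy shapes: a 7x7 feature map with 512 channels, hidden size 256.
rng = np.random.default_rng(0)
L, D, H, A = 49, 512, 256, 128
context, alpha = visual_attention(
    rng.normal(size=(L, D)), rng.normal(size=H),
    rng.normal(size=(D, A)), rng.normal(size=(H, A)), rng.normal(size=A))
print(context.shape, round(alpha.sum(), 6))  # (512,) 1.0
```

The attention weights alpha are what gets visualized as a heat map over the image while each caption word is generated.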
Slide 9
Timeline:
Visual Attention in Human Perception (since the 1990s)
Visual Attention in Image Captioning (since 2014)
Global Attention in Machine Translation (since 2015)
Slides 11-13
Global Attention in Machine Translation
Encoder: RNN. The sequence of hidden states encodes the context-building stack.
Decoder: RNN. Global attention makes it possible to recover the context of the current outputs.
[Figure: RNN encoder and RNN decoder with global attention over the encoder states]
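As a sketch of what "recovering the context" means here: a global attention step scores the current decoder state against every encoder hidden state and mixes the whole source sequence into one context vector. The dot-product score used below is one common choice (Luong-style) and is an assumption, not necessarily the variant the slide depicts.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def global_attention(encoder_states, decoder_state):
    """Global attention: compare the current decoder state with every
    encoder hidden state and mix the whole source sequence into one
    context vector for the current output."""
    # encoder_states: (T, H) -- one hidden state per source token
    # decoder_state:  (H,)   -- current target-side hidden state
    scores = encoder_states @ decoder_state  # (T,) dot-product scores
    alpha = softmax(scores)                  # weights over source positions
    context = alpha @ encoder_states         # (H,) context vector
    return context, alpha

# Toy usage: 10 source tokens, hidden size 64.
rng = np.random.default_rng(0)
context, alpha = global_attention(rng.normal(size=(10, 64)),
                                  rng.normal(size=64))
print(context.shape, alpha.argmax())  # (64,) index of the attended token
```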
Slide 14
Timeline:
Visual Attention in Human Perception (since the 1990s)
Visual Attention in Image Captioning (since 2014)
Global Attention in Machine Translation (since 2015)
Multi-Step Attention in Natural Language Processing (since 2016)
Slides 16-17
Multi-Step Attention in Natural Language Processing
Encoder: CNN. The sequence of convolutional features encodes the context-building stack.
Decoder: CNN. Multi-step attention makes it possible to recover the current context at each deconvolutional step.
[Figure: convolutional encoder and decoder with multi-step attention]
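A rough NumPy sketch of the idea, assuming dot-product scoring: in convolutional sequence-to-sequence models (in the spirit of Gehring et al.), every decoder layer runs its own attention pass over the encoder outputs, so the context is recomputed at each step of the stack rather than once for the whole decoder.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def multi_step_attention(encoder_states, decoder_layer_states):
    """Multi-step attention: each decoder layer attends to the encoder
    outputs separately, yielding one context vector per layer instead
    of a single attention pass for the whole decoder."""
    # encoder_states:       (T, H)
    # decoder_layer_states: list of (H,) states, one per decoder layer
    contexts = []
    for h in decoder_layer_states:
        alpha = softmax(encoder_states @ h)   # this layer's own weights
        contexts.append(alpha @ encoder_states)
    return np.stack(contexts)                 # (num_layers, H)

# Toy usage: 10 source positions, hidden size 64, 3 decoder layers.
rng = np.random.default_rng(0)
ctx = multi_step_attention(rng.normal(size=(10, 64)),
                           [rng.normal(size=64) for _ in range(3)])
print(ctx.shape)  # (3, 64)
```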
Slide 18
Timeline:
Visual Attention in Human Perception (since the 1990s)
Visual Attention in Image Captioning (since 2014)
Global Attention in Machine Translation (since 2015)
Multi-Step Attention in Natural Language Processing (since 2016)
Self-Attention in Natural Language Processing (since 2017)
Slides 19-22
Self-Attention in Natural Language Processing
Encoder: stacked ANN. The sequence of hidden features captures the encoder's context-building stack.
Decoder: stacked ANN. Multi-step attention unrolls the context stack to the decoder.
Self-attention aggregates context dependencies within the inputs.
[Figure: stacked encoder and decoder architecture with self-attention]
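The self-attention step itself can be sketched as scaled dot-product attention (Vaswani et al., 2017): queries, keys and values are all projections of the same input sequence, so every position aggregates context from every other position. The projection matrices and toy shapes below are illustrative assumptions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def self_attention(X, W_q, W_k, W_v):
    """Scaled dot-product self-attention: queries, keys and values are
    all projections of the same inputs, so every position aggregates
    context from every other position in the sequence."""
    Q, K, V = X @ W_q, X @ W_k, X @ W_v       # (T, d) each
    scores = Q @ K.T / np.sqrt(K.shape[-1])   # (T, T) pairwise scores
    alpha = softmax(scores)                   # each row sums to 1
    return alpha @ V                          # (T, d) contextual features

# Toy usage: 8 tokens with 32-dimensional embeddings, head size 16.
rng = np.random.default_rng(0)
X = rng.normal(size=(8, 32))
out = self_attention(X, *(rng.normal(size=(32, 16)) for _ in range(3)))
print(out.shape)  # (8, 16)
```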
Slides 24-26
Summary
#1 Human perception is based on the dynamics between selection and recognition.
#2 Attention mechanisms imitate this behaviour by using intermediate states that entangle context with semantic information.
#3 Incorporating context provides dynamic, context-specific features and therefore improves model performance.
#4 Finally, attention mechanisms can also help us understand the decisions of deep networks.