Deep learning tutorial

제프리 힌튼
캐나다 토론토 대학
교수
얀 레쿤
뉴욕대학교 교수
앤드류
응
스탠포드
대학교 교수
요슈아
벤지오
캐나다 몬트리올
대학교 교수

2.7
-8.6
0.002
f(x)
1.4
-2.5
-0.06
x = -0.06×2.7 + 2.5×8.6 + 1.4×0.002 = 21.34

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Initialise with random weights

Present a training pattern
Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7
1.9

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Feed it through to get output
1.4
2.7 0.8
1.9

Compare with target output
Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7 0.8
0
1.9 error 0.8

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Adjust weights based on error
1.4
2.7 0.8
0
1.9 error 0.8

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
6.4
2.8
1.7
Present a training pattern

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Feed it through to get output
6.4
2.8 0.9
1.7

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Compare with target output
6.4
2.8 0.9
1
1.7 error -0.1

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
6.4
2.8 0.9
1
1.7 error -0.1

And so on ….
Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
6.4
2.8 0.9
1
1.7 error -0.1
Repeat this thousands, maybe millions of times – each time
taking a random training instance, and making slight
weight adjustments
Algorithms for weight adjustment are designed to make
changes that will reduce the error

Present a training instance / adjust the weights

1
63
…
1 5 10 15 20 25 …
strong +ve weight
low/zero weight

…
1 5 10 15 20 25 …
strong +ve weight
low/zero weight
1
63
it will send strong signal for a horizontal
line in the top row, ignoring everywhere else

…
1 5 10 15 20 25 …
strong +ve weight
low/zero weight
1
63

…
1 5 10 15 20 25 …
strong +ve weight
low/zero weight
1
63
Strong signal for a dark area in the top left
corner

etc …detect lines in
Specific positions
Higher level detetors
( horizontal line,
“RHS vertical lune”
“upper loop”, etc…
etc …

etc …detect lines in
Specific positions
Higher level detetors
( horizontal line,
“RHS vertical lune”
“upper loop”, etc…
etc …
What does this unit detect?

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7 0.8
0
1.9 error 0.8
𝑐𝑜𝑠𝑡 = 0.8 − 0 2

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7 0.8
0
1.9 error 0.8
Adjust weights based on how much they contributed to the error

Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7 0.8
0
1.9 error 0.8
Adjust weights based on how much they
contributed to the error
𝜕𝐶
𝜕𝑤

What’s good can we have when we use logistic sigmoid function as our activation function?

input
features
Abstracted features
More abstracted features
output

강연 Source:http://lstm.iupr.com/

강연
Peepholes are variations to the original LSTM

강연
(1)
(2)
(3)
(4)
(5) (6)
In this case we have one cell in this memory block, thus C=1
For peepholes
(1)
(2)
(3)
(4)
(5)
(6)

강연
(1)
(2)
(3)
(4)
(5)
(6)

Deep learning tutorial

Recommended

Recommended

More Related Content

Similar to Deep learning tutorial

Similar to Deep learning tutorial (20)

Recently uploaded

Recently uploaded (20)

Deep learning tutorial