This slides gives an introduction to the deep learning including cnn and rnn and lstm. To lead the readers to the deep learning field, it has started from the very basics of aritifical neural networks and then proceed to the convolutional neural networks, and then recurrent neural networks and lstm.
I tried to my best to give credits to those who made the many slides I borrowed...But sometimes I couldn't remeber their exact source I borrowed...I am very sorry for this..and please let me know if you know their original source that I couldn't find....
26. Present a training pattern
Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7
1.9
27. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Feed it through to get output
1.4
2.7 0.8
1.9
28. Compare with target output
Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7 0.8
0
1.9 error 0.8
29. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Adjust weights based on error
1.4
2.7 0.8
0
1.9 error 0.8
30. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
6.4
2.8
1.7
Present a training pattern
31. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Feed it through to get output
6.4
2.8 0.9
1.7
32. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Compare with target output
6.4
2.8 0.9
1
1.7 error -0.1
33. Adjust weights based on error
Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
6.4
2.8 0.9
1
1.7 error -0.1
34. And so on ….
Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
6.4
2.8 0.9
1
1.7 error -0.1
Repeat this thousands, maybe millions of times – each time
taking a random training instance, and making slight
weight adjustments
Algorithms for weight adjustment are designed to make
changes that will reduce the error
40. …
1 5 10 15 20 25 …
strong +ve weight
low/zero weight
1
63
it will send strong signal for a horizontal
line in the top row, ignoring everywhere else
44. etc …detect lines in
Specific positions
Higher level detetors
( horizontal line,
“RHS vertical lune”
“upper loop”, etc…
etc …
What does this unit detect?
45.
46.
47. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Adjust weights based on error
1.4
2.7 0.8
0
1.9 error 0.8
48. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
Adjust weights based on error
1.4
2.7 0.8
0
1.9 error 0.8
𝑐𝑜𝑠𝑡 = 0.8 − 0 2
49. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7 0.8
0
1.9 error 0.8
Adjust weights based on how much they contributed to the error
52. Training data
Fields class
1.4 2.7 1.9 0
3.8 3.4 3.2 0
6.4 2.8 1.7 1
4.1 0.1 0.2 0
etc …
1.4
2.7 0.8
0
1.9 error 0.8
Adjust weights based on how much they
contributed to the error
𝜕𝐶
𝜕𝑤
53.
54.
55.
56.
57. What’s good can we have when we use logistic sigmoid function as our activation function?