4. Interaction Lab., Seoul National University of Science and Technology
■What is RNN?
Recurrent Neural Network
• Sequence data
• 𝑡 : Time
Intro
[Figure: an RNN cell with Input, Hidden, and Output layers]
■Recurrent architecture
■Activation function
Hyperbolic tangent : ℎ𝑡 = tanh(𝑥𝑡𝑊𝑥 + ℎ𝑡−1𝑊ℎ + 𝑏)
• 𝑥𝑡 : Input
• 𝑊𝑥 : Input weight
• 𝑏 : Bias
• ℎ𝑡−1 : Previous output
• 𝑊ℎ : Previous output weight
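As a minimal sketch of this update in NumPy (the toy sizes and random values below are assumptions, not from the slides):

```python
import numpy as np

# One RNN hidden-state update: h_t = tanh(x_t W_x + h_{t-1} W_h + b).
# Toy sizes assumed: 3 input features, 4 hidden units.
rng = np.random.default_rng(0)
x_t = rng.standard_normal(3)         # current input x_t
h_prev = np.zeros(4)                 # previous output h_{t-1}
W_x = rng.standard_normal((3, 4))    # input weight W_x
W_h = rng.standard_normal((4, 4))    # previous-output weight W_h
b = np.zeros(4)                      # bias b

h_t = np.tanh(x_t @ W_x + h_prev @ W_h + b)
```

Because tanh squashes its input, every entry of ℎ𝑡 stays in (−1, 1), which keeps the recurrent state bounded.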
■Feed forward propagation
Calculate and store variables sequentially from the input layer to the output layer of the NN
■Backpropagation
A method for computing the gradients of a NN's parameters
Training method
■Feed forward propagation of RNN
Deep Neural Network
• 𝑈 = 𝑋𝑊 + 𝐵
• 𝑌 = 𝑓(𝑈)
RNN
• 𝑈(𝑡) = 𝑋(𝑡)𝑊 + 𝑌(𝑡−1)𝑉 + 𝐵
• 𝑌(𝑡) = 𝑓(𝑈(𝑡))
■Feed forward propagation of RNN
[Figure: at time 𝑡, Input(𝑡) × Weight (matrix product) and the previous step's output × Weight (matrix product) are summed with Bias, passed through the activation function, and sent to the next layer and the next time step]
■Feed forward propagation of RNN
𝑈(𝑡) = 𝑥𝑡𝑊𝑥ℎ + ℎ𝑡−1𝑊ℎℎ + 𝑏ℎ
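The per-step formula above can be sketched as a loop over time; the sequence length, layer sizes, and weight scales below are assumptions:

```python
import numpy as np

# Feed-forward propagation of an RNN over a short sequence:
# U(t) = x_t W_xh + h_{t-1} W_hh + b_h, then h_t = tanh(U(t)).
rng = np.random.default_rng(1)
T, n_in, n_h = 5, 3, 4
xs = rng.standard_normal((T, n_in))
W_xh = rng.standard_normal((n_in, n_h)) * 0.1
W_hh = rng.standard_normal((n_h, n_h)) * 0.1
b_h = np.zeros(n_h)

h = np.zeros(n_h)                        # initial hidden state
hs = []
for t in range(T):
    u_t = xs[t] @ W_xh + h @ W_hh + b_h  # U(t)
    h = np.tanh(u_t)                     # h_t, fed to the next time step
    hs.append(h)
hs = np.stack(hs)                        # all hidden states, shape (T, n_h)
```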
■Backpropagation of RNN
We have to update the parameters 𝑊𝑥ℎ, 𝑊ℎℎ, and 𝑏ℎ
• 𝑑ℎ𝑡−1 : the gradient passed back to the previous time step
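A hand-written backward pass accumulating the gradients for 𝑊𝑥ℎ, 𝑊ℎℎ, and 𝑏ℎ might look like this; it is a sketch that assumes the tanh forward pass above and a toy loss L = sum of all hidden outputs:

```python
import numpy as np

# Toy sizes and weight scales are assumptions.
rng = np.random.default_rng(2)
T, n_in, n_h = 4, 3, 2
xs = rng.standard_normal((T, n_in))
W_xh = rng.standard_normal((n_in, n_h)) * 0.1
W_hh = rng.standard_normal((n_h, n_h)) * 0.1
b_h = np.zeros(n_h)

# Forward pass, storing every hidden state for the backward pass.
hs = [np.zeros(n_h)]                     # hs[t + 1] holds h_t
for t in range(T):
    hs.append(np.tanh(xs[t] @ W_xh + hs[-1] @ W_hh + b_h))

# Backward pass: the gradient at h_t combines the loss term at t and
# the gradient flowing back from step t + 1 (dh_{t-1} in the slide).
dW_xh = np.zeros_like(W_xh)
dW_hh = np.zeros_like(W_hh)
db_h = np.zeros_like(b_h)
dh_next = np.zeros(n_h)
for t in reversed(range(T)):
    dh = np.ones(n_h) + dh_next          # dL/dh_t for L = sum of outputs
    du = dh * (1.0 - hs[t + 1] ** 2)     # tanh'(u) = 1 - tanh(u)^2
    dW_xh += np.outer(xs[t], du)
    dW_hh += np.outer(hs[t], du)
    db_h += du
    dh_next = du @ W_hh.T                # dh_{t-1}, passed to step t - 1
```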
■BPTT (Backpropagation Through Time)
As the time span of the sequence increases, the computing resources consumed by BPTT also increase
As the time span increases, the backpropagated gradient becomes unstable (it can vanish or explode)
■Truncated BPTT
Data must still be fed in order, because the forward hidden state is carried across segments
Cut only the backpropagation connections into segments of an appropriate length
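A sketch of this truncation (the chunk length of 4 is an assumption): the hidden state crosses chunk boundaries in the forward direction, but the backward pass would run within each chunk only:

```python
import numpy as np

# Toy sizes are assumptions.
rng = np.random.default_rng(3)
T, n_in, n_h, trunc = 12, 3, 2, 4
xs = rng.standard_normal((T, n_in))
W_xh = rng.standard_normal((n_in, n_h)) * 0.1
W_hh = rng.standard_normal((n_h, n_h)) * 0.1
b_h = np.zeros(n_h)

h = np.zeros(n_h)
n_chunks = 0
for start in range(0, T, trunc):
    chunk = xs[start:start + trunc]
    hs = [h]                             # state from the previous chunk is kept...
    for x_t in chunk:
        hs.append(np.tanh(x_t @ W_xh + hs[-1] @ W_hh + b_h))
    h = hs[-1]
    # ...but a backward pass here would cover this chunk only, treating
    # hs[0] as a constant so no gradient flows into earlier chunks.
    n_chunks += 1
```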
■Truncated BPTT using mini-batch
Mini-batch size : 2
1,000 time steps : split into two streams of 500 steps each
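The 500 / 500 split above can be sketched as reshaping the sequence into two parallel streams, each read in order chunk by chunk (the chunk length of 10 is an assumption):

```python
import numpy as np

# A 1,000-step sequence arranged for truncated BPTT with mini-batch 2:
# stream 0 covers steps 0-499, stream 1 covers steps 500-999, and the
# two streams are advanced in parallel.
seq = np.arange(1000)
batch, chunk_len = 2, 10
streams = seq.reshape(batch, -1)          # shape (2, 500)

first_chunk = streams[:, :chunk_len]      # what the network sees first
```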
■Binary addition
5 = 1 × 2² + 0 × 2¹ + 1 × 2⁰ ∶ 101
36 = 1 × 2⁵ + 0 × 2⁴ + 0 × 2³ + 1 × 2² + 0 × 2¹ + 0 × 2⁰ ∶ 100100
Input : two randomly selected binary numbers
Label : sum of two numbers
Link
Code practice
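One way to generate such a sample (a sketch; the 8-bit width and least-significant-bit-first ordering are assumptions commonly used for this exercise):

```python
import numpy as np

# One binary-addition training sample: inputs are two random numbers,
# the label is their sum, all encoded as bit sequences.
rng = np.random.default_rng(4)
n_bits = 8
a = int(rng.integers(0, 2 ** (n_bits - 1)))   # keep a + b within n_bits
b = int(rng.integers(0, 2 ** (n_bits - 1)))

def to_bits(n, width):
    # Least-significant bit first: 5 -> [1, 0, 1, 0, ...]
    return np.array([(n >> i) & 1 for i in range(width)])

x = np.stack([to_bits(a, n_bits), to_bits(b, n_bits)], axis=1)  # (n_bits, 2)
y = to_bits(a + b, n_bits)                                      # label
```

Feeding the digits least-significant bit first lets the network propagate the carry in the same direction it reads the sequence, like column-wise addition by hand.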
■Disadvantages of RNN
Gradient vanishing and gradient exploding
• Gradient vanishing → LSTM and GRU
• Gradient exploding → Gradient clipping
Conclusion
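Gradient clipping can be sketched as rescaling all gradients by their global norm (the threshold of 1.0 is an assumption):

```python
import numpy as np

def clip_by_global_norm(grads, max_norm=1.0):
    # Combined L2 norm over every parameter's gradient.
    total = np.sqrt(sum(np.sum(g ** 2) for g in grads))
    if total > max_norm:
        # Rescale all gradients uniformly so the global norm equals max_norm.
        grads = [g * (max_norm / total) for g in grads]
    return grads

grads = [np.full(4, 3.0), np.full(2, -4.0)]   # global norm = sqrt(36 + 32)
clipped = clip_by_global_norm(grads, max_norm=1.0)
```

Scaling by the global norm keeps the direction of the update unchanged; only its magnitude is capped, which tames exploding gradients during BPTT.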