Deep Learning in NLP
微光國際資訊有限公司 2018/06/09
About this Section
• Preprocessing data
• Work with NLTK + word2vec
• Introduce Recurrent Neural Networks (RNN)
• Introduce Long Short-Term Memory Networks (LSTM)
Preprocessing Data (Classic)
Preprocessing Data (NN)
Difference between Classic and NN
Classic vs. NN
This is called Feature Engineering
Bag-of-words
N-gram
Example sentence: Bob went to the market to buy some flowers.
Bag-of-words vector: [1, 1, 2, 1, 1, 1, 1, 1, 0, 0, 0]
Character bigrams: ["Bo", "ob", "b ", " w", "we", "en", ..., "me", "e ", " f", "fl", "lo", "ow", "we", "er", "rs"]
The drawbacks:
1. It consumes too many resources.
2. It requires domain experts, and their experience is hard to replicate.
Load Data
Import NLTK & use tokenize
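The original slides showed code screenshots here; a minimal sketch of loading a text file and tokenizing it with NLTK might look like this (the file name corpus.txt is a placeholder, not from the slides):

import nltk
from nltk.tokenize import sent_tokenize, word_tokenize

nltk.download("punkt")   # tokenizer models required by NLTK

# "corpus.txt" is a placeholder file name.
with open("corpus.txt", encoding="utf-8") as f:
    raw_text = f.read()

sentences = sent_tokenize(raw_text)                         # split into sentences
tokenized = [word_tokenize(s.lower()) for s in sentences]   # word tokens per sentence
print(tokenized[:2])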
Use word2vec and store it
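A hedged sketch of training a word2vec model with gensim on those tokenized sentences and storing it on disk; the model file name is a placeholder:

from gensim.models import Word2Vec

# `tokenized` is the list of token lists produced by the NLTK step above.
model = Word2Vec(tokenized, min_count=1)   # library defaults for vector size/window
model.save("word2vec.model")               # placeholder file name

# Reload it later:
model = Word2Vec.load("word2vec.model")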
word2vec - distance
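For the distance slide, gensim's word vectors expose a cosine-based distance between two words; the word pair here is only an example and assumes both words occur in the corpus:

# Cosine distance (1 - cosine similarity) between two in-vocabulary words.
print(model.wv.distance("man", "woman"))
print(model.wv.similarity("man", "woman"))   # the complementary similarity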
word2vec - similar
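For the similarity slide, most_similar returns the words whose vectors are closest to a query word; "king" is an assumed example query:

# Nearest neighbours of a query word by cosine similarity.
print(model.wv.most_similar("king", topn=5))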
word2vec - calculate distance
target = father + woman - man
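A sketch of the same arithmetic with gensim: most_similar with positive/negative word lists computes vec(father) + vec(woman) - vec(man) and returns the nearest words (with a good corpus, "mother" should rank near the top):

# target = father + woman - man
result = model.wv.most_similar(positive=["father", "woman"],
                               negative=["man"], topn=3)
print(result)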
An image we want you to remember
Before starting with NN, let's recall a few things
Recall three things:
• Activation Function
• Loss Function
• Optimizer
Activation Function
Loss Function
Optimizer
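These three choices all appear when a network is defined and compiled; the following Keras-style sketch is illustrative only (layer sizes, activation, loss, and optimizer are assumed choices, not taken from the slides):

from tensorflow import keras

model = keras.Sequential([
    keras.layers.Dense(64, activation="relu", input_shape=(100,)),  # activation function
    keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam",               # optimizer
              loss="binary_crossentropy",     # loss function
              metrics=["accuracy"])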
About this Section
• Preprocessing data
• Work with NLTK + word2vec
• Introduce Recurrent Neural Networks (RNN)
• Introduce Long Short-Term Memory Networks (LSTM)
The problem with feedforward NN
Given these known relationships
Text prediction: James has a cat and it likes to drink ___ .
Graphical representation
The problem with feedforward NN
Now change these relationships
Text prediction: James has a cat and it likes to drink ___ .
Graphical representation
Begin with RNN
Given these known relationships
We can derive Y2
We can derive Y4
Graphical representation
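To make "derive Y2 and Y4" concrete, here is a minimal NumPy sketch of one vanilla RNN step; the dimensions and random weights are illustrative assumptions:

import numpy as np

rng = np.random.default_rng(0)
W_xh = rng.normal(size=(16, 10))   # input -> hidden (toy sizes)
W_hh = rng.normal(size=(16, 16))   # hidden -> hidden (the recurrent part)
W_hy = rng.normal(size=(5, 16))    # hidden -> output

def rnn_step(x_t, h_prev):
    # The hidden state h carries information from all previous steps.
    h_t = np.tanh(W_xh @ x_t + W_hh @ h_prev)
    y_t = W_hy @ h_t               # output at this time step
    return h_t, y_t

h = np.zeros(16)
for x_t in rng.normal(size=(4, 10)):   # a toy 4-step input sequence
    h, y = rnn_step(x_t, h)            # Y2 depends on X1 and X2 through h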
Application in RNN
Application in RNN (Img)
Application in RNN (Table)
A problem with backpropagation
We want to find the relationship between the error and the weights
After expanding
Recall the previous video on YouTube
The problem with backpropagation in an RNN
We want to find the same relationship
But after expanding, we find that...
h appears recursively
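Written out (a standard BPTT-style expansion, sketched in the usual notation), the gradient of the error at step t with respect to the recurrent weights unrolls into a sum over earlier steps:

\frac{\partial E_t}{\partial W}
  = \sum_{k=1}^{t}
    \frac{\partial E_t}{\partial y_t}
    \frac{\partial y_t}{\partial h_t}
    \left( \prod_{j=k+1}^{t} \frac{\partial h_j}{\partial h_{j-1}} \right)
    \frac{\partial h_k}{\partial W}

The product of Jacobians \partial h_j / \partial h_{j-1} is the recursive part; multiplying many factors smaller or larger than 1 is what makes the gradient vanish or explode.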
Limit the number of recursive steps
Backpropagation Through Time (BPTT)
Effective, but it can make the gradients vanish or explode
Another problem: long-term dependencies
John lives in France. (five thousand words omitted…..)
(a few more words omitted.....) John speaks ___.
The gap is so long that the memory fades away
And so LSTM was born
About this Section
• Preprocessing data
• Work with NLTK + word2vec
• Introduce Recurrent Neural Networks (RNN)
• Introduce Long Short-Term Memory Networks (LSTM)
How does LSTM differ from RNN?
A standard RNN unit vs. the smallest LSTM unit
How does LSTM differ from RNN?
An LSTM can decide which memories to forget and which to keep
Application in LSTM
What is LSTM?
An LSTM has five values, four actions, and three gates
• Cell state: This is the internal cell state (that is, memory) of an LSTM cell
• Hidden state: This is the external hidden state used to calculate predictions
• Input gate: This determines how much of the current input is read into the cell state
• Forget gate: This determines how much of the previous cell state is sent into the current cell state
• Output gate: This determines how much of the cell state is output into the hidden state
(The cell state represents past experience; the hidden state represents the prediction.)
• Discard old information (that is, forget something…)
• Decide which new information to store in the cell state
• Update the old cell state Ct-1 into the new cell state Ct
• Output information
How does LSTM work?
• Input gate: This determines how much of the current input is read into the cell state (0~1)
• Forget gate: This determines how much of the previous cell state is sent into the current cell state (0~1)
• Output gate: This determines how much of the cell state is output into the hidden state (0~1)
A classic article on LSTM
Material from "Understanding LSTM Networks"
Discard old information (that is, forget something…)
• Forget gate: This determines how much of the previous cell state is sent into the current cell state (0~1)
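In the notation of "Understanding LSTM Networks", the forget gate is a sigmoid over the previous hidden state and the current input:

f_t = \sigma(W_f \cdot [h_{t-1}, x_t] + b_f)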
Decide which new information to store in the cell state
• Input gate: This determines how much of the current input is read into the cell state (0~1)
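Following the same article, the input gate and the candidate cell values are:

i_t = \sigma(W_i \cdot [h_{t-1}, x_t] + b_i)
\tilde{C}_t = \tanh(W_C \cdot [h_{t-1}, x_t] + b_C)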
Update the old cell state Ct-1 into the new cell state Ct
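The update combines the two gates elementwise: keep part of the old memory, add part of the new candidate.

C_t = f_t * C_{t-1} + i_t * \tilde{C}_t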
Output information
• Output gate: This determines how much of the cell state is output into the hidden state (0~1)
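Finally, the output gate filters a squashed copy of the cell state to produce the new hidden state:

o_t = \sigma(W_o \cdot [h_{t-1}, x_t] + b_o)
h_t = o_t * \tanh(C_t)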
Practice it yourself
Use these functions to build an LSTM cell (a starting-point sketch follows the gate list below)
Input Gate
Output Gate
Forget Gate
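As a starting point for the exercise, here is a minimal NumPy sketch of one LSTM step built from the three gates above; the dimensions and initialization are illustrative assumptions:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    # W and b hold one weight matrix / bias per gate ("f", "i", "o") plus the candidate ("c").
    z = np.concatenate([h_prev, x_t])       # [h_{t-1}, x_t]
    f_t = sigmoid(W["f"] @ z + b["f"])      # forget gate
    i_t = sigmoid(W["i"] @ z + b["i"])      # input gate
    c_hat = np.tanh(W["c"] @ z + b["c"])    # candidate cell state
    c_t = f_t * c_prev + i_t * c_hat        # update the cell state
    o_t = sigmoid(W["o"] @ z + b["o"])      # output gate
    h_t = o_t * np.tanh(c_t)                # new hidden state
    return h_t, c_t

# Toy dimensions: 8-dim input, 16-dim hidden/cell state.
rng = np.random.default_rng(0)
W = {k: rng.normal(scale=0.1, size=(16, 24)) for k in "fico"}
b = {k: np.zeros(16) for k in "fico"}
h, c = np.zeros(16), np.zeros(16)
h, c = lstm_step(rng.normal(size=8), h, c, W, b)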
