Submit Search
Upload
Deep Multi-Task Learning with Shared Memory
•
2 likes
•
1,623 views
M
marujirou
Follow
EMNLP reading group @TMU
Read less
Read more
Technology
Report
Share
Report
Share
1 of 14
Download now
Download to read offline
Recommended
Exploring Strategies for Training Deep Neural Networks paper review
Exploring Strategies for Training Deep Neural Networks paper review
Vimukthi Wickramasinghe
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
IOSR Journals
Long Zhou - 2017 - Neural System Combination for Machine Transaltion
Long Zhou - 2017 - Neural System Combination for Machine Transaltion
Association for Computational Linguistics
Neural netorksmatching
Neural netorksmatching
Masa Kato
18
18
IMPULSE_TECHNOLOGY
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Eff...
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Eff...
taeseon ryu
Multi-task Learning for Dense Prediction Tasks in Computer Vision
Multi-task Learning for Dense Prediction Tasks in Computer Vision
Arun Talkad
[딥논읽] Meta-Transfer Learning for Zero-Shot Super-Resolution paper review
[딥논읽] Meta-Transfer Learning for Zero-Shot Super-Resolution paper review
taeseon ryu
Recommended
Exploring Strategies for Training Deep Neural Networks paper review
Exploring Strategies for Training Deep Neural Networks paper review
Vimukthi Wickramasinghe
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
IOSR Journals
Long Zhou - 2017 - Neural System Combination for Machine Transaltion
Long Zhou - 2017 - Neural System Combination for Machine Transaltion
Association for Computational Linguistics
Neural netorksmatching
Neural netorksmatching
Masa Kato
18
18
IMPULSE_TECHNOLOGY
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Eff...
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Eff...
taeseon ryu
Multi-task Learning for Dense Prediction Tasks in Computer Vision
Multi-task Learning for Dense Prediction Tasks in Computer Vision
Arun Talkad
[딥논읽] Meta-Transfer Learning for Zero-Shot Super-Resolution paper review
[딥논읽] Meta-Transfer Learning for Zero-Shot Super-Resolution paper review
taeseon ryu
On the cross domain reusability of neural modules for general video game playing
On the cross domain reusability of neural modules for general video game playing
Alexander Braylan
Attention Is All You Need
Attention Is All You Need
SEMINARGROOT
Introduction to CNN
Introduction to CNN
Shuai Zhang
Mlp mixer image_process_210613 deeplearning paper review!
Mlp mixer image_process_210613 deeplearning paper review!
taeseon ryu
Deep Belief Networks
Deep Belief Networks
Hasan H Topcu
Restricting the Flow: Information Bottlenecks for Attribution
Restricting the Flow: Information Bottlenecks for Attribution
taeseon ryu
Data Applied:Forecast
Data Applied:Forecast
DataminingTools Inc
N ns 1
N ns 1
Thy Selaroth
Reading group nfm - 20170312
Reading group nfm - 20170312
Shuai Zhang
Digit recognition using mnist database
Digit recognition using mnist database
btandale
Introduction to Machine learning & Neural Networks
Introduction to Machine learning & Neural Networks
Ankur Nair
doc1.docx
doc1.docx
Ramyadevi62
Convolutional neural network
Convolutional neural network
Itachi SK
Attention
Attention
SEMINARGROOT
Learning weighted lower linear envelope potentials in binary markov random fi...
Learning weighted lower linear envelope potentials in binary markov random fi...
jpstudcorner
Classification by backpropacation
Classification by backpropacation
Siva Priya
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
Shubhangi Tandon
Learning to compare: relation network for few shot learning
Learning to compare: relation network for few shot learning
Simon John
Icacci2017 poster template
Icacci2017 poster template
vinaykumar R
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
PyData
怖くない誤差逆伝播法 Chainerを添えて
怖くない誤差逆伝播法 Chainerを添えて
marujirou
ニューラル日英翻訳における出力文の態制御
ニューラル日英翻訳における出力文の態制御
Hayahide Yamagishi
More Related Content
What's hot
On the cross domain reusability of neural modules for general video game playing
On the cross domain reusability of neural modules for general video game playing
Alexander Braylan
Attention Is All You Need
Attention Is All You Need
SEMINARGROOT
Introduction to CNN
Introduction to CNN
Shuai Zhang
Mlp mixer image_process_210613 deeplearning paper review!
Mlp mixer image_process_210613 deeplearning paper review!
taeseon ryu
Deep Belief Networks
Deep Belief Networks
Hasan H Topcu
Restricting the Flow: Information Bottlenecks for Attribution
Restricting the Flow: Information Bottlenecks for Attribution
taeseon ryu
Data Applied:Forecast
Data Applied:Forecast
DataminingTools Inc
N ns 1
N ns 1
Thy Selaroth
Reading group nfm - 20170312
Reading group nfm - 20170312
Shuai Zhang
Digit recognition using mnist database
Digit recognition using mnist database
btandale
Introduction to Machine learning & Neural Networks
Introduction to Machine learning & Neural Networks
Ankur Nair
doc1.docx
doc1.docx
Ramyadevi62
Convolutional neural network
Convolutional neural network
Itachi SK
Attention
Attention
SEMINARGROOT
Learning weighted lower linear envelope potentials in binary markov random fi...
Learning weighted lower linear envelope potentials in binary markov random fi...
jpstudcorner
Classification by backpropacation
Classification by backpropacation
Siva Priya
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
Shubhangi Tandon
Learning to compare: relation network for few shot learning
Learning to compare: relation network for few shot learning
Simon John
Icacci2017 poster template
Icacci2017 poster template
vinaykumar R
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
PyData
What's hot
(20)
On the cross domain reusability of neural modules for general video game playing
On the cross domain reusability of neural modules for general video game playing
Attention Is All You Need
Attention Is All You Need
Introduction to CNN
Introduction to CNN
Mlp mixer image_process_210613 deeplearning paper review!
Mlp mixer image_process_210613 deeplearning paper review!
Deep Belief Networks
Deep Belief Networks
Restricting the Flow: Information Bottlenecks for Attribution
Restricting the Flow: Information Bottlenecks for Attribution
Data Applied:Forecast
Data Applied:Forecast
N ns 1
N ns 1
Reading group nfm - 20170312
Reading group nfm - 20170312
Digit recognition using mnist database
Digit recognition using mnist database
Introduction to Machine learning & Neural Networks
Introduction to Machine learning & Neural Networks
doc1.docx
doc1.docx
Convolutional neural network
Convolutional neural network
Attention
Attention
Learning weighted lower linear envelope potentials in binary markov random fi...
Learning weighted lower linear envelope potentials in binary markov random fi...
Classification by backpropacation
Classification by backpropacation
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
Learning to compare: relation network for few shot learning
Learning to compare: relation network for few shot learning
Icacci2017 poster template
Icacci2017 poster template
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
Viewers also liked
怖くない誤差逆伝播法 Chainerを添えて
怖くない誤差逆伝播法 Chainerを添えて
marujirou
ニューラル日英翻訳における出力文の態制御
ニューラル日英翻訳における出力文の態制御
Hayahide Yamagishi
GoogLeNet Insights
GoogLeNet Insights
Auro Tripathy
ディープボルツマンマシン入門〜後半〜
ディープボルツマンマシン入門〜後半〜
sakaizawa
tmu_science_cafe02
tmu_science_cafe02
Tomoyuki Kajiwara
ディープボルツマンマシン入門
ディープボルツマンマシン入門
Saya Katafuchi
ECS for Amazon Deep Learning and Amazon Machine Learning
ECS for Amazon Deep Learning and Amazon Machine Learning
Amanda Mackay (she/her)
[ACL2016] Achieving Open Vocabulary Neural Machine Translation with Hybrid Wo...
[ACL2016] Achieving Open Vocabulary Neural Machine Translation with Hybrid Wo...
Hayahide Yamagishi
[EMNLP2016読み会] Memory-enhanced Decoder for Neural Machine Translation
[EMNLP2016読み会] Memory-enhanced Decoder for Neural Machine Translation
Hayahide Yamagishi
Chainerチュートリアル -v1.5向け- ViEW2015
Chainerチュートリアル -v1.5向け- ViEW2015
Ryosuke Okuta
Viewers also liked
(10)
怖くない誤差逆伝播法 Chainerを添えて
怖くない誤差逆伝播法 Chainerを添えて
ニューラル日英翻訳における出力文の態制御
ニューラル日英翻訳における出力文の態制御
GoogLeNet Insights
GoogLeNet Insights
ディープボルツマンマシン入門〜後半〜
ディープボルツマンマシン入門〜後半〜
tmu_science_cafe02
tmu_science_cafe02
ディープボルツマンマシン入門
ディープボルツマンマシン入門
ECS for Amazon Deep Learning and Amazon Machine Learning
ECS for Amazon Deep Learning and Amazon Machine Learning
[ACL2016] Achieving Open Vocabulary Neural Machine Translation with Hybrid Wo...
[ACL2016] Achieving Open Vocabulary Neural Machine Translation with Hybrid Wo...
[EMNLP2016読み会] Memory-enhanced Decoder for Neural Machine Translation
[EMNLP2016読み会] Memory-enhanced Decoder for Neural Machine Translation
Chainerチュートリアル -v1.5向け- ViEW2015
Chainerチュートリアル -v1.5向け- ViEW2015
Similar to Deep Multi-Task Learning with Shared Memory
Multi tasking learning
Multi tasking learning
ShreyusPuthiyapurail
Local Applications of Large Language Models based on RAG.pptx
Local Applications of Large Language Models based on RAG.pptx
lwz614595250
Transfer Learning in NLP: A Survey
Transfer Learning in NLP: A Survey
NUPUR YADAV
DLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep Learning
Brodmann17
Machine learning in science and industry — day 4
Machine learning in science and industry — day 4
arogozhnikov
Knowledge distillation deeplab
Knowledge distillation deeplab
Frozen Paradise
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Sharmila Sathish
Nn kb
Nn kb
Kushal Arora
ODSC East: Effective Transfer Learning for NLP
ODSC East: Effective Transfer Learning for NLP
indico data
Performance Comparison between Pytorch and Mindspore
Performance Comparison between Pytorch and Mindspore
ijdms
Neural Network in Knowledge Bases
Neural Network in Knowledge Bases
Kushal Arora
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
ijnlc
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
kevig
Nips 2017 in a nutshell
Nips 2017 in a nutshell
LULU CHENG
Multi Task Learning and Meta Learning
Multi Task Learning and Meta Learning
Srilalitha Veerubhotla
Life-long / Incremental Learning (DLAI D6L1 2017 UPC Deep Learning for Artifi...
Life-long / Incremental Learning (DLAI D6L1 2017 UPC Deep Learning for Artifi...
Universitat Politècnica de Catalunya
Lifelong / Incremental Deep Learning - Ramon Morros - UPC Barcelona 2018
Lifelong / Incremental Deep Learning - Ramon Morros - UPC Barcelona 2018
Universitat Politècnica de Catalunya
Lifelong Learning for Dynamically Expandable Networks
Lifelong Learning for Dynamically Expandable Networks
NAVER Engineering
Implementing a neural network potential for exascale molecular dynamics
Implementing a neural network potential for exascale molecular dynamics
PFHub PFHub
DL for sentence classification project Write-up
DL for sentence classification project Write-up
Hoàng Triều Trịnh
Similar to Deep Multi-Task Learning with Shared Memory
(20)
Multi tasking learning
Multi tasking learning
Local Applications of Large Language Models based on RAG.pptx
Local Applications of Large Language Models based on RAG.pptx
Transfer Learning in NLP: A Survey
Transfer Learning in NLP: A Survey
DLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep Learning
Machine learning in science and industry — day 4
Machine learning in science and industry — day 4
Knowledge distillation deeplab
Knowledge distillation deeplab
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Nn kb
Nn kb
ODSC East: Effective Transfer Learning for NLP
ODSC East: Effective Transfer Learning for NLP
Performance Comparison between Pytorch and Mindspore
Performance Comparison between Pytorch and Mindspore
Neural Network in Knowledge Bases
Neural Network in Knowledge Bases
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Nips 2017 in a nutshell
Nips 2017 in a nutshell
Multi Task Learning and Meta Learning
Multi Task Learning and Meta Learning
Life-long / Incremental Learning (DLAI D6L1 2017 UPC Deep Learning for Artifi...
Life-long / Incremental Learning (DLAI D6L1 2017 UPC Deep Learning for Artifi...
Lifelong / Incremental Deep Learning - Ramon Morros - UPC Barcelona 2018
Lifelong / Incremental Deep Learning - Ramon Morros - UPC Barcelona 2018
Lifelong Learning for Dynamically Expandable Networks
Lifelong Learning for Dynamically Expandable Networks
Implementing a neural network potential for exascale molecular dynamics
Implementing a neural network potential for exascale molecular dynamics
DL for sentence classification project Write-up
DL for sentence classification project Write-up
More from marujirou
Cross-Lingual Sentiment Analysis using modified BRAE
Cross-Lingual Sentiment Analysis using modified BRAE
marujirou
Learning Tag Embeddings and Tag-specific Composition Functions in Recursive N...
Learning Tag Embeddings and Tag-specific Composition Functions in Recursive N...
marujirou
2015 08 survey
2015 08 survey
marujirou
Representation Learning Using Multi-Task Deep Neural Networks for Semantic Cl...
Representation Learning Using Multi-Task Deep Neural Networks for Semantic Cl...
marujirou
Combining Distant and Partial Supervision for Relation Extraction (Angeli et ...
Combining Distant and Partial Supervision for Relation Extraction (Angeli et ...
marujirou
Semantic Compositionality through Recursive Matrix-Vector Spaces (Socher et al.)
Semantic Compositionality through Recursive Matrix-Vector Spaces (Socher et al.)
marujirou
Relation Classification via Convolutional Deep Neural Network (Zeng et al.)
Relation Classification via Convolutional Deep Neural Network (Zeng et al.)
marujirou
DL勉強会 01ディープボルツマンマシン
DL勉強会 01ディープボルツマンマシン
marujirou
More from marujirou
(8)
Cross-Lingual Sentiment Analysis using modified BRAE
Cross-Lingual Sentiment Analysis using modified BRAE
Learning Tag Embeddings and Tag-specific Composition Functions in Recursive N...
Learning Tag Embeddings and Tag-specific Composition Functions in Recursive N...
2015 08 survey
2015 08 survey
Representation Learning Using Multi-Task Deep Neural Networks for Semantic Cl...
Representation Learning Using Multi-Task Deep Neural Networks for Semantic Cl...
Combining Distant and Partial Supervision for Relation Extraction (Angeli et ...
Combining Distant and Partial Supervision for Relation Extraction (Angeli et ...
Semantic Compositionality through Recursive Matrix-Vector Spaces (Socher et al.)
Semantic Compositionality through Recursive Matrix-Vector Spaces (Socher et al.)
Relation Classification via Convolutional Deep Neural Network (Zeng et al.)
Relation Classification via Convolutional Deep Neural Network (Zeng et al.)
DL勉強会 01ディープボルツマンマシン
DL勉強会 01ディープボルツマンマシン
Recently uploaded
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
carlostorres15106
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
Mark Billinghurst
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Patryk Bandurski
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
Memoori
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
Maria Levchenko
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
2toLead Limited
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
BookNet Canada
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
Scott Keck-Warren
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
HampshireHUG
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Puma Security, LLC
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
Softradix Technologies
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Radu Cotescu
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
Neo4j
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
naman860154
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
Ridwan Fadjar
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Allon Mureinik
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
HostedbyConfluent
Recently uploaded
(20)
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Deep Multi-Task Learning with Shared Memory
1.
Deep Multi-Task Learning with Shared Memory Pengfei Liu, Xipeng Qiu, Xuanjing
Huang EMNLP2016 reading group presenter: ryosuke miyazaki
2.
Abstract Due to the large number of parameters neural models need a large-scale corpus. → unsupervised pre-training is effective Multi-task learning also improve the final performance. This paper propose LSTM with external memory for multi-task learning.
3.
Model: ME-LSTM Key vector, Erase vector, Add vector
4.
Model: ME-LSTM Reading operation K segment, M dimensions per one segment ,
5.
Model: ME-LSTM Deep Fusion strategy
6.
Model: ME-LSTM Writing operation
7.
Two architectures ARC-1 ARC-2
8.
Training Task-specific output layer Linear combination of cost function λm is the weights for each task m
9.
Experiment: text classification
10.
Result: Movie
11.
Result: Product
12.
Analysis: Visualize deep fusion gate Sentiment score Dimensions of deep fusion gate gt Activate → black
13.
Analysis: Visualize deep fusion gate
14.
Conclusion ・ This paper propose two deep architectures for multi-task learning. ・ They design an external memory to store the knowledge by related tasks. ・
Deep fusion strategy enabling the model to give shared information.
Download now