SlideShare a Scribd company logo
1 of 30
Download to read offline
Meta-Learning with Memory
Augmented Neural Networks
Master Seminar II: Data Analytics
Presented By:
Sakshi Singh (305238)
Seminar II: Data Analytics
OUTLINE
• Introduction
• Related work
• Methodology
• Model
• Experiments and Results
• Conclusion
• Future Work
• References
Seminar II: Data Analytics
2Meta-Learning with Memory-Augmented Neural NetworksSakshi Singh (Universität Hildesheim)
Paper Introduction
• Title: Meta-Learning with Memory-Augmented Neural Networks
• Authors: Adam Santoro (Google DeepMind), Sergey Bartunov (Google DeepMind,
Natural Research University Higher School of Economics), Mathew Botvinick
(Google DeepMind), Daan Wierstra (Google DeepMind) and Timothy Lillicrap
(Google DeepMind)
• Publish date: June 19th, 2016
• Published by: ICML'16: Proceedings of the 33rd International Conference on
International Conference on Machine Learning
Seminar II: Data Analytics
3Meta-Learning with Memory-Augmented Neural NetworksSakshi Singh (Universität Hildesheim)
Introduction
• Traditional gradient-based networks: require lot of data to learn by
iteratively relearning the parameters
• Meta-learning: learning to learn
• Memory Augmented Neural Networks (MAML):
• Refers to a class of external memory equipped networks
• rapidly integrates new data to make predictions after few samples
Meta-Learning with Memory-Augmented Neural Networks 4Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Why MANN?
• It can store information in a form that is:
• Stable (accessed whenever needed)
• Element-wise addressable (only relevant piece of info accessed)
• The number of parameters are not tied up the memory.
• Approach in this paper:
• Slowly learn representation of raw data
• Rapidly bind never-seen-before data to external memory
Meta-Learning with Memory-Augmented Neural Networks 5Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Related work
• Hochreiter, Sepp, Younger, A Steven, and Conwell, Peter R. “Learning
to learn using gradient descent in Artificial Neural Networks” ICANN
2001, pp. 87–94. Springer (2001)
• Graves, Alex, Wayne, Greg, and Danihelka, Ivo. “Neural Turing
Machines”. arXiv preprint arXiv:1410.5401 (2014)
• Lake, Brenden M, Salakhutdinov, Ruslan, and Tenenbaum, Joshua B.
“Human-level concept learning through probabilistic program
induction. Science”, 350(6266):13321338 (2015)
Meta-Learning with Memory-Augmented Neural Networks 6
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Methodology
• Learning is done in episodes
• Example:
• Human’s lifespan = 1 episode
• Evolution = meta-learning
• Learning during episode = very important
• Episodes contain similar problems but are different
Meta-Learning with Memory-Augmented Neural Networks 7Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Episode Setup
1. Pick out different characters (Omniglot) and assign random labels
3 4 1 2
Meta-Learning with Memory-Augmented Neural Networks 8Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Episode Setup
2. Shuffle in any random sequence from dataset-to-dataset
Meta-Learning with Memory-Augmented Neural Networks 9Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Episode Setup
3. Pass one by one to neural network and it has to predict correct labels
Meta-Learning with Memory-Augmented Neural Networks 10Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Episode Setup
Meta-Learning with Memory-Augmented Neural Networks 11Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
• Dataset: D = {dt} = {(xt, yt)}
• yt is presented in offset manner:
• Regression: real valued number xt
• Classification: class label for xt
• Input sequence:
• (x1, null), (x2, y1), . . . , (xT , yT−1)
• yt is both output and input
Episode Setup
Meta-Learning with Memory-Augmented Neural Networks 12Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Feed-forward neural networks (no feedback) Solution 1: Recurrent Neural Networks
Episode Setup
Solution 2: Augmented memory
• RNN: controller
• External memory: RAM
• Controller network
• send data to external memory (write)
• query external memory to retrieve data
(read)
Meta-Learning with Memory-Augmented Neural Networks 13Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Model: NTM
Meta-Learning with Memory-Augmented Neural Networks 14Sakshi Singh (Universität Hildesheim)
Seminar II: Data Analytics
Neural Turing Machines
• Content-based approach
• Controller interact with external
memory using read-write heads
• Capable of
• long-term storage via slow weight updates
• short-term storage via external memory
Content-based approach: NTM
• Controller produces the key to store and retrieve data
• read-write vector
• Memory rt is retrieved using
Meta-Learning with Memory-Augmented Neural Networks 15
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
LRUA
• Least Recently Used Access
• Pure content-based memory module
• Two options:
• Least used location: preserving information recently encoded
• Last used location: updating the memory with new information
Meta-Learning with Memory-Augmented Neural Networks 16
Seminar II: Data Analytics
LRUA
• Usage weight:
• Least used weight:
• Write weight:
• Writing to memory:
Meta-Learning with Memory-Augmented Neural Networks 17
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Experiments
• Datasets:
• Classification: Omniglot
• Regression: Gaussian Process (GP)
• Data augmented by translating & rotating images by 90◦, 180 ◦, 270 ◦
• Training classes: 1200
• Testing classes: 423
Meta-Learning with Memory-Augmented Neural Networks 18
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Omniglot Classification (1)
• Training:
• 100,000 episodes on 5 random classes
• Testing:
• on never-before-seen classes
• use one-hot vector as class labels
Meta-Learning with Memory-Augmented Neural Networks 19
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
MANN Vs. Human
• Performance of MANN surpassed
human performance
• Educated guessing in MANN: if a
sample produced key which was a
poor match in memory, then
probability of correct guessing
increases on first instance.
Meta-Learning with Memory-Augmented Neural Networks 20
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Omniglot Classification (2)
• Class representation approach: so that no. of classes per episode can
increase arbitrarily
• Characters from each label uniformly sampled from {‘a’, ’b’, ’c’, ’d’, ’e’}
• Produces random strings
• String length= 25 (with 5 values=1, rest=0)
• Total 3125 possible labels
Meta-Learning with Memory-Augmented Neural Networks 21
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Meta-Learning with Memory-Augmented Neural Networks 22
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
LSTM MANN
One-hot
Vector
representation
Class
representation
5 random
classes/episode
(length 50)
15 random
classes/episode
(length 100)
Experiment: Different Algorithms
Meta-Learning with Memory-Augmented Neural Networks 23
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Experiment: Persistent Memory
• Better strategy: wipe external
memory episode-to-episode
(a) → 5-classes per episode length
50: learning very slow and no
spike in accuracy
(b) → 10-classes per episode
length 75: accuracy
comparable
Meta-Learning with Memory-Augmented Neural Networks 24
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Experiment: Curriculum Training
• Done to scale classification capabilities
• Initially 15-classes per episode
• After every 10,000 episodes increase no. of
classes per episode by 1
• Result: network maintained high accuracy
• Tested on 50-classes per episode, increasing
by 100 (max)
• Result: network with decaying performance
Meta-Learning with Memory-Augmented Neural Networks 25
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Regression
• Generated functions using GP with fixed
hyper-parameters
• Each episode:
• presentation of x value(1D, 2D or 3D)
• Time offset function values f(xt-1)
• Binding x values to their function values
• GP can performs complex queries on all
data points
Meta-Learning with Memory-Augmented Neural Networks 26
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Conclusion
• Central contribution: demonstrated the ability of MANN to do meta-
learning
• Gradual, incremental learning encodes background knowledge and
external memory binds information to new tasks
• Content-based memory access approach
• MANN outperformed all the existing methods
• External memory yields better results
Meta-Learning with Memory-Augmented Neural Networks 27
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Future Work
• Meta-learning can discover optimal memory addressing procedures
• Training on wider range of tasks
• MANN performance in meta-learning tasks requiring active learning
• Exploring requirements of robust performance
• Accessing maximum capacity of the network
Meta-Learning with Memory-Augmented Neural Networks 28
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
References
1. Adam Santoro, Sergey Bartunov, Mathew Botvinick, Daan Wierstra and Timothy
Lillicrap, Meta-Learning with Memory-Augmented Neural Networks (2016)
2. Graves, Alex, Wayne, Greg, and Danihelka, Ivo. “Neural Turing Machines”. arXiv
preprint arXiv:1410.5401 (2014)
3. Meta learning with memory augmented neural network and NPI,
https://www.youtube.com/watch?v=s3iV4SK3CuU, accessed on 19.05.2020
4. Meta-Learning and One-Shot Learning,
https://www.youtube.com/watch?v=KUWywwvQv8E, accessed on 19.05.2020
Meta-Learning with Memory-Augmented Neural Networks 29
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)
Meta-Learning with Memory-Augmented Neural Networks 30
THANK YOU!
Questions?
Seminar II: Data Analytics
Sakshi Singh (Universität Hildesheim)

More Related Content

What's hot

Deep Learning: Chapter 11 Practical Methodology
Deep Learning: Chapter 11 Practical MethodologyDeep Learning: Chapter 11 Practical Methodology
Deep Learning: Chapter 11 Practical MethodologyJason Tsai
 
Learning to learn unlearned feature for segmentation
Learning to learn unlearned feature for segmentationLearning to learn unlearned feature for segmentation
Learning to learn unlearned feature for segmentationNAVER Engineering
 
Distance oracle - Truy vấn nhanh khoảng cách giữa hai điểm bất kỳ trên đồ thị
Distance oracle - Truy vấn nhanh khoảng cách giữa hai điểm bất kỳ trên đồ thịDistance oracle - Truy vấn nhanh khoảng cách giữa hai điểm bất kỳ trên đồ thị
Distance oracle - Truy vấn nhanh khoảng cách giữa hai điểm bất kỳ trên đồ thịHong Ong
 
Deep Learning without Annotations - Xavier Giro - UPC Barcelona 2018
Deep Learning without Annotations - Xavier Giro - UPC Barcelona 2018Deep Learning without Annotations - Xavier Giro - UPC Barcelona 2018
Deep Learning without Annotations - Xavier Giro - UPC Barcelona 2018Universitat Politècnica de Catalunya
 
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Universitat Politècnica de Catalunya
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Alexandros Karatzoglou
 
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017MLconf
 
Modellbildung, Berechnung und Simulation in Forschung und Lehre
Modellbildung, Berechnung und Simulation in Forschung und LehreModellbildung, Berechnung und Simulation in Forschung und Lehre
Modellbildung, Berechnung und Simulation in Forschung und LehreJoachim Schlosser
 
Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...
Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...
Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...inside-BigData.com
 
Artificial Intelligence, Machine Learning and Deep Learning
Artificial Intelligence, Machine Learning and Deep LearningArtificial Intelligence, Machine Learning and Deep Learning
Artificial Intelligence, Machine Learning and Deep LearningSujit Pal
 
DLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep LearningDLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep LearningBrodmann17
 

What's hot (11)

Deep Learning: Chapter 11 Practical Methodology
Deep Learning: Chapter 11 Practical MethodologyDeep Learning: Chapter 11 Practical Methodology
Deep Learning: Chapter 11 Practical Methodology
 
Learning to learn unlearned feature for segmentation
Learning to learn unlearned feature for segmentationLearning to learn unlearned feature for segmentation
Learning to learn unlearned feature for segmentation
 
Distance oracle - Truy vấn nhanh khoảng cách giữa hai điểm bất kỳ trên đồ thị
Distance oracle - Truy vấn nhanh khoảng cách giữa hai điểm bất kỳ trên đồ thịDistance oracle - Truy vấn nhanh khoảng cách giữa hai điểm bất kỳ trên đồ thị
Distance oracle - Truy vấn nhanh khoảng cách giữa hai điểm bất kỳ trên đồ thị
 
Deep Learning without Annotations - Xavier Giro - UPC Barcelona 2018
Deep Learning without Annotations - Xavier Giro - UPC Barcelona 2018Deep Learning without Annotations - Xavier Giro - UPC Barcelona 2018
Deep Learning without Annotations - Xavier Giro - UPC Barcelona 2018
 
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
 
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
 
Modellbildung, Berechnung und Simulation in Forschung und Lehre
Modellbildung, Berechnung und Simulation in Forschung und LehreModellbildung, Berechnung und Simulation in Forschung und Lehre
Modellbildung, Berechnung und Simulation in Forschung und Lehre
 
Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...
Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...
Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...
 
Artificial Intelligence, Machine Learning and Deep Learning
Artificial Intelligence, Machine Learning and Deep LearningArtificial Intelligence, Machine Learning and Deep Learning
Artificial Intelligence, Machine Learning and Deep Learning
 
DLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep LearningDLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep Learning
 

Similar to Meta-Learning with Memory Augmented Neural Networks

HML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep LearningHML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep LearningYan Xu
 
Lecture_1_Intro.pdf
Lecture_1_Intro.pdfLecture_1_Intro.pdf
Lecture_1_Intro.pdfpaijitk
 
Introduction to Recurrent Neural Network
Introduction to Recurrent Neural NetworkIntroduction to Recurrent Neural Network
Introduction to Recurrent Neural NetworkYan Xu
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017SERC at Carleton College
 
Current clustering techniques
Current clustering techniquesCurrent clustering techniques
Current clustering techniquesPoonam Kshirsagar
 
Deep learning health care
Deep learning health care  Deep learning health care
Deep learning health care Meenakshi Sood
 
Cse 8th sem syllabus
Cse 8th sem syllabusCse 8th sem syllabus
Cse 8th sem syllabusAkshatha Nair
 
H2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User GroupH2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User GroupSri Ambati
 
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...Databricks
 
How Can Machine Learning Help Your Research Forward?
How Can Machine Learning Help Your Research Forward?How Can Machine Learning Help Your Research Forward?
How Can Machine Learning Help Your Research Forward?Wouter Deconinck
 
Introduction to Neural Network
Introduction to Neural NetworkIntroduction to Neural Network
Introduction to Neural NetworkYan Xu
 
MIS 542 Syllabus 08.doc
MIS 542 Syllabus 08.docMIS 542 Syllabus 08.doc
MIS 542 Syllabus 08.docbutest
 
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...Ben Blaiszik
 
A review on Exploiting experts’ knowledge for structure learning of bayesian ...
A review on Exploiting experts’ knowledge for structure learning of bayesian ...A review on Exploiting experts’ knowledge for structure learning of bayesian ...
A review on Exploiting experts’ knowledge for structure learning of bayesian ...Reza Sadeghi
 
Image anomaly detection with generative adversarial networks
Image anomaly detection with generative adversarial networksImage anomaly detection with generative adversarial networks
Image anomaly detection with generative adversarial networksSakshiSingh480
 

Similar to Meta-Learning with Memory Augmented Neural Networks (20)

HML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep LearningHML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep Learning
 
Lecture_1_Intro.pdf
Lecture_1_Intro.pdfLecture_1_Intro.pdf
Lecture_1_Intro.pdf
 
PLAY Minecraft!
PLAY Minecraft!PLAY Minecraft!
PLAY Minecraft!
 
5th sem
5th sem5th sem
5th sem
 
5th sem
5th sem5th sem
5th sem
 
Introduction to Recurrent Neural Network
Introduction to Recurrent Neural NetworkIntroduction to Recurrent Neural Network
Introduction to Recurrent Neural Network
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017
 
Current clustering techniques
Current clustering techniquesCurrent clustering techniques
Current clustering techniques
 
Deep learning health care
Deep learning health care  Deep learning health care
Deep learning health care
 
Icml2017 overview
Icml2017 overviewIcml2017 overview
Icml2017 overview
 
Cse 8th sem syllabus
Cse 8th sem syllabusCse 8th sem syllabus
Cse 8th sem syllabus
 
H2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User GroupH2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User Group
 
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
An Online Spark Pipeline: Semi-Supervised Learning and Automatic Retraining w...
 
How Can Machine Learning Help Your Research Forward?
How Can Machine Learning Help Your Research Forward?How Can Machine Learning Help Your Research Forward?
How Can Machine Learning Help Your Research Forward?
 
Introduction to Neural Network
Introduction to Neural NetworkIntroduction to Neural Network
Introduction to Neural Network
 
MIS 542 Syllabus 08.doc
MIS 542 Syllabus 08.docMIS 542 Syllabus 08.doc
MIS 542 Syllabus 08.doc
 
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
 
A review on Exploiting experts’ knowledge for structure learning of bayesian ...
A review on Exploiting experts’ knowledge for structure learning of bayesian ...A review on Exploiting experts’ knowledge for structure learning of bayesian ...
A review on Exploiting experts’ knowledge for structure learning of bayesian ...
 
Image anomaly detection with generative adversarial networks
Image anomaly detection with generative adversarial networksImage anomaly detection with generative adversarial networks
Image anomaly detection with generative adversarial networks
 
lecture1.pptx
lecture1.pptxlecture1.pptx
lecture1.pptx
 

Recently uploaded

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computationsit20ad004
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 

Recently uploaded (20)

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computation
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 

Meta-Learning with Memory Augmented Neural Networks

  • 1. Meta-Learning with Memory Augmented Neural Networks Master Seminar II: Data Analytics Presented By: Sakshi Singh (305238) Seminar II: Data Analytics
  • 2. OUTLINE • Introduction • Related work • Methodology • Model • Experiments and Results • Conclusion • Future Work • References Seminar II: Data Analytics 2Meta-Learning with Memory-Augmented Neural NetworksSakshi Singh (Universität Hildesheim)
  • 3. Paper Introduction • Title: Meta-Learning with Memory-Augmented Neural Networks • Authors: Adam Santoro (Google DeepMind), Sergey Bartunov (Google DeepMind, Natural Research University Higher School of Economics), Mathew Botvinick (Google DeepMind), Daan Wierstra (Google DeepMind) and Timothy Lillicrap (Google DeepMind) • Publish date: June 19th, 2016 • Published by: ICML'16: Proceedings of the 33rd International Conference on International Conference on Machine Learning Seminar II: Data Analytics 3Meta-Learning with Memory-Augmented Neural NetworksSakshi Singh (Universität Hildesheim)
  • 4. Introduction • Traditional gradient-based networks: require lot of data to learn by iteratively relearning the parameters • Meta-learning: learning to learn • Memory Augmented Neural Networks (MAML): • Refers to a class of external memory equipped networks • rapidly integrates new data to make predictions after few samples Meta-Learning with Memory-Augmented Neural Networks 4Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics
  • 5. Why MANN? • It can store information in a form that is: • Stable (accessed whenever needed) • Element-wise addressable (only relevant piece of info accessed) • The number of parameters are not tied up the memory. • Approach in this paper: • Slowly learn representation of raw data • Rapidly bind never-seen-before data to external memory Meta-Learning with Memory-Augmented Neural Networks 5Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics
  • 6. Related work • Hochreiter, Sepp, Younger, A Steven, and Conwell, Peter R. “Learning to learn using gradient descent in Artificial Neural Networks” ICANN 2001, pp. 87–94. Springer (2001) • Graves, Alex, Wayne, Greg, and Danihelka, Ivo. “Neural Turing Machines”. arXiv preprint arXiv:1410.5401 (2014) • Lake, Brenden M, Salakhutdinov, Ruslan, and Tenenbaum, Joshua B. “Human-level concept learning through probabilistic program induction. Science”, 350(6266):13321338 (2015) Meta-Learning with Memory-Augmented Neural Networks 6 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 7. Methodology • Learning is done in episodes • Example: • Human’s lifespan = 1 episode • Evolution = meta-learning • Learning during episode = very important • Episodes contain similar problems but are different Meta-Learning with Memory-Augmented Neural Networks 7Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics
  • 8. Episode Setup 1. Pick out different characters (Omniglot) and assign random labels 3 4 1 2 Meta-Learning with Memory-Augmented Neural Networks 8Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics
  • 9. Episode Setup 2. Shuffle in any random sequence from dataset-to-dataset Meta-Learning with Memory-Augmented Neural Networks 9Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics
  • 10. Episode Setup 3. Pass one by one to neural network and it has to predict correct labels Meta-Learning with Memory-Augmented Neural Networks 10Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics
  • 11. Episode Setup Meta-Learning with Memory-Augmented Neural Networks 11Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics • Dataset: D = {dt} = {(xt, yt)} • yt is presented in offset manner: • Regression: real valued number xt • Classification: class label for xt • Input sequence: • (x1, null), (x2, y1), . . . , (xT , yT−1) • yt is both output and input
  • 12. Episode Setup Meta-Learning with Memory-Augmented Neural Networks 12Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics Feed-forward neural networks (no feedback) Solution 1: Recurrent Neural Networks
  • 13. Episode Setup Solution 2: Augmented memory • RNN: controller • External memory: RAM • Controller network • send data to external memory (write) • query external memory to retrieve data (read) Meta-Learning with Memory-Augmented Neural Networks 13Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics
  • 14. Model: NTM Meta-Learning with Memory-Augmented Neural Networks 14Sakshi Singh (Universität Hildesheim) Seminar II: Data Analytics Neural Turing Machines • Content-based approach • Controller interact with external memory using read-write heads • Capable of • long-term storage via slow weight updates • short-term storage via external memory
  • 15. Content-based approach: NTM • Controller produces the key to store and retrieve data • read-write vector • Memory rt is retrieved using Meta-Learning with Memory-Augmented Neural Networks 15 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 16. LRUA • Least Recently Used Access • Pure content-based memory module • Two options: • Least used location: preserving information recently encoded • Last used location: updating the memory with new information Meta-Learning with Memory-Augmented Neural Networks 16 Seminar II: Data Analytics
  • 17. LRUA • Usage weight: • Least used weight: • Write weight: • Writing to memory: Meta-Learning with Memory-Augmented Neural Networks 17 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 18. Experiments • Datasets: • Classification: Omniglot • Regression: Gaussian Process (GP) • Data augmented by translating & rotating images by 90◦, 180 ◦, 270 ◦ • Training classes: 1200 • Testing classes: 423 Meta-Learning with Memory-Augmented Neural Networks 18 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 19. Omniglot Classification (1) • Training: • 100,000 episodes on 5 random classes • Testing: • on never-before-seen classes • use one-hot vector as class labels Meta-Learning with Memory-Augmented Neural Networks 19 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 20. MANN Vs. Human • Performance of MANN surpassed human performance • Educated guessing in MANN: if a sample produced key which was a poor match in memory, then probability of correct guessing increases on first instance. Meta-Learning with Memory-Augmented Neural Networks 20 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 21. Omniglot Classification (2) • Class representation approach: so that no. of classes per episode can increase arbitrarily • Characters from each label uniformly sampled from {‘a’, ’b’, ’c’, ’d’, ’e’} • Produces random strings • String length= 25 (with 5 values=1, rest=0) • Total 3125 possible labels Meta-Learning with Memory-Augmented Neural Networks 21 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 22. Meta-Learning with Memory-Augmented Neural Networks 22 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim) LSTM MANN One-hot Vector representation Class representation 5 random classes/episode (length 50) 15 random classes/episode (length 100)
  • 23. Experiment: Different Algorithms Meta-Learning with Memory-Augmented Neural Networks 23 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 24. Experiment: Persistent Memory • Better strategy: wipe external memory episode-to-episode (a) → 5-classes per episode length 50: learning very slow and no spike in accuracy (b) → 10-classes per episode length 75: accuracy comparable Meta-Learning with Memory-Augmented Neural Networks 24 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 25. Experiment: Curriculum Training • Done to scale classification capabilities • Initially 15-classes per episode • After every 10,000 episodes increase no. of classes per episode by 1 • Result: network maintained high accuracy • Tested on 50-classes per episode, increasing by 100 (max) • Result: network with decaying performance Meta-Learning with Memory-Augmented Neural Networks 25 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 26. Regression • Generated functions using GP with fixed hyper-parameters • Each episode: • presentation of x value(1D, 2D or 3D) • Time offset function values f(xt-1) • Binding x values to their function values • GP can performs complex queries on all data points Meta-Learning with Memory-Augmented Neural Networks 26 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 27. Conclusion • Central contribution: demonstrated the ability of MANN to do meta- learning • Gradual, incremental learning encodes background knowledge and external memory binds information to new tasks • Content-based memory access approach • MANN outperformed all the existing methods • External memory yields better results Meta-Learning with Memory-Augmented Neural Networks 27 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 28. Future Work • Meta-learning can discover optimal memory addressing procedures • Training on wider range of tasks • MANN performance in meta-learning tasks requiring active learning • Exploring requirements of robust performance • Accessing maximum capacity of the network Meta-Learning with Memory-Augmented Neural Networks 28 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 29. References 1. Adam Santoro, Sergey Bartunov, Mathew Botvinick, Daan Wierstra and Timothy Lillicrap, Meta-Learning with Memory-Augmented Neural Networks (2016) 2. Graves, Alex, Wayne, Greg, and Danihelka, Ivo. “Neural Turing Machines”. arXiv preprint arXiv:1410.5401 (2014) 3. Meta learning with memory augmented neural network and NPI, https://www.youtube.com/watch?v=s3iV4SK3CuU, accessed on 19.05.2020 4. Meta-Learning and One-Shot Learning, https://www.youtube.com/watch?v=KUWywwvQv8E, accessed on 19.05.2020 Meta-Learning with Memory-Augmented Neural Networks 29 Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)
  • 30. Meta-Learning with Memory-Augmented Neural Networks 30 THANK YOU! Questions? Seminar II: Data Analytics Sakshi Singh (Universität Hildesheim)