PR-153: SNAIL: A Simple Neural Attentive Meta-Learner

- Title: SNAIL: A Simple Neural Attentive Meta-Learner
- Paper: https://arxiv.org/abs/1707.03141
- YouTube: https://youtu.be/zGrwpa5-_0Y
- Taekmin Kim, http://github.com/tantara

A Simple Neural AttentIve Meta-Learner
PR-153
Mar 31, 2019
Taekmin Kim
1

Machine Learning vs. Human
● Machine Learning
○ Try to learn data points
■ Supervised/Unsupervised Learning
■ Reinforcement Learning
● Human
○ Fast adaptation with prior knowledge
■ Few-shot learning
■ Generalization across tasks
3

Meta-Learner?
● Related Work
○ Learning to Reinforcement Learn
○ RL^2
○ MAML
○ Auto-Meta
● Example: multi-armed bandit problem
https://blog.floydhub.com/meta-rl/
4

Meta-RL
● Goal: Generalization across tasks
● Notations
○ T: task distribution, e.g., driving or multi-armed bandit problems
○ T_i: a specific task drawn from T, e.g., driving a Sonata or driving a Porsche
○ x_t: state at time step t
○ a_t: action at time step t
5
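
A minimal sketch of the notation above, assuming T is a distribution over Bernoulli bandit tasks. Each sampled T_i is a list of arm reward probabilities, and the policy chooses a_t from the history of (action, reward) pairs. A bandit carries no state beyond that history, so x_t is omitted. The names `sample_bandit_task`, `run_episode`, and `random_policy` are hypothetical and do not come from the paper.

```python
import random


def sample_bandit_task(num_arms, rng):
    """Draw one task T_i from T: a Bernoulli bandit whose per-arm reward
    probabilities are sampled uniformly from [0, 1] (an assumed distribution)."""
    return [rng.random() for _ in range(num_arms)]


def run_episode(arm_probs, policy, num_steps, rng):
    """Roll out a policy on one task; the policy sees the full history."""
    history = []  # list of (action, reward) pairs
    total = 0.0
    for _ in range(num_steps):
        action = policy(history, len(arm_probs), rng)
        reward = 1.0 if rng.random() < arm_probs[action] else 0.0
        history.append((action, reward))
        total += reward
    return total, history


def random_policy(history, num_arms, rng):
    """Placeholder policy; a meta-learner would condition on the history."""
    return rng.randrange(num_arms)


rng = random.Random(0)
tasks = [sample_bandit_task(5, rng) for _ in range(3)]  # three tasks T_i ~ T
results = [run_episode(task, random_policy, 10, rng) for task in tasks]
```

Generalization across tasks means the same policy is evaluated on freshly sampled tasks, so anything it learns has to come from the history within each episode.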

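RNN-based Meta-RL (Agent)
● Sequence-to-sequence problem
○ The agent refers to its past experience
● Drawbacks:
○ Temporally linear dependency: information from an early step reaches the current step only by passing through every intermediate hidden state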
https://blog.floydhub.com/meta-rl/
6

Motivation
● Temporal (Causal) Convolution
○ The output at step t depends only on the current and previous steps
● Soft Attention
○ The output is a weighted sum over past steps
https://www.slideshare.net/ThomasHjeldeThoresen/temporal-convolutional-networks-dethroning-rnns-for-sequence-modelling
https://medium.com/syncedreview/memory-attention-sequences-8522f531dd43
7

Temporal Convolution
https://www.slideshare.net/ThomasHjeldeThoresen/temporal-convolutional-networks-dethroning-rnns-for-sequence-modelling
[Figures: vanilla 1D convolution vs. temporal convolution (TC); vanilla 1D TCs vs. (exponentially) dilated 1D TCs]
8
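
The difference between vanilla and dilated temporal convolutions can be sketched in plain Python. This is an illustrative single-channel version with made-up weights, not the layer used in the paper; `causal_conv1d` and `receptive_field` are hypothetical helper names.

```python
def causal_conv1d(x, kernel, dilation=1):
    """Causal 1D convolution: y[t] = sum_k kernel[k] * x[t - k * dilation].

    Positions before the start of the sequence are treated as zeros, so
    y[t] never depends on x[t + 1], x[t + 2], ...
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for k, w in enumerate(kernel):
            idx = t - k * dilation
            if idx >= 0:
                acc += w * x[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilations):
    """Number of input steps that one output step can see after stacking."""
    return 1 + (kernel_size - 1) * sum(dilations)


x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
y = x
for d in (1, 2, 4):  # exponentially growing dilation
    y = causal_conv1d(y, [1.0, 1.0], dilation=d)

# With all-ones kernels, three layers cover 8 steps: y[7] sums every input.
```

Stacking three size-2 kernels without dilation would cover only 4 steps; doubling the dilation at each layer makes the receptive field grow exponentially with depth.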

Attention Is All You Need (2017)
https://mchromiak.github.io/articles/2017/Sep/12/Transformer-Attention-is-all-you-need/#.XJ6U6-szZ0c
https://medium.com/@hyponymous/paper-summary-attention-is-all-you-need-22c2c7a5e06
Q: Hidden State of Decoder
K: Hidden State of Encoder
V: Hidden State of Encoder, combined using the (normalized) attention weights
9
PR-049: https://www.youtube.com/watch?v=6zGgVIlStXs
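
A minimal sketch of scaled dot-product attention with a causal mask, written in plain Python for readability. It assumes queries, keys, and values are given as lists of equal-length vectors; the learned projections that would produce them are left out, and `causal_attention` is a hypothetical name.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def causal_attention(queries, keys, values):
    """For each step t, attend only to steps 0..t (causal mask)."""
    d_k = len(keys[0])
    outputs = []
    for t, q in enumerate(queries):
        # Similarity between the query at step t and every key up to t.
        scores = [
            sum(qi * ki for qi, ki in zip(q, keys[j])) / math.sqrt(d_k)
            for j in range(t + 1)
        ]
        weights = softmax(scores)  # normalized attention weights
        # Output is the weighted sum of the visible values.
        out = [
            sum(w * values[j][c] for j, w in enumerate(weights))
            for c in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs


seq = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = causal_attention(seq, seq, seq)
```

The normalized weights come from the softmax over query-key scores; the values are what those weights average. At step 0 only one position is visible, so the output equals the first value vector.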

Simple Neural AttentIve Learner
Building Blocks
● DenseBlock
● TCBlock
● AttentionBlock
11
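
A rough sketch of how a DenseBlock and a TCBlock could fit together, based on a reading of the paper's pseudocode. Everything here is simplified and assumed: each dense block adds a single output channel, the weights are fixed constants rather than learned, the kernel size is 2, and the dilation schedule 2^i for i = 1..ceil(log2 T) should be checked against the paper. The function names are hypothetical.

```python
import math


def causal_conv(seq, weights, dilation):
    """Kernel-size-2 causal conv with one output channel.

    seq: list of T feature vectors; weights: (w_prev, w_curr), one weight
    per input channel. y[t] = w_curr . x[t] + w_prev . x[t - dilation].
    """
    w_prev, w_curr = weights
    out = []
    for t, x in enumerate(seq):
        acc = sum(w * v for w, v in zip(w_curr, x))
        if t - dilation >= 0:
            acc += sum(w * v for w, v in zip(w_prev, seq[t - dilation]))
        out.append(acc)
    return out


def dense_block(seq, dilation, make_weights):
    """Gated activation of two causal convs, concatenated onto the input."""
    channels = len(seq[0])
    xf = causal_conv(seq, make_weights(channels), dilation)
    xg = causal_conv(seq, make_weights(channels), dilation)
    gated = [math.tanh(f) / (1.0 + math.exp(-g)) for f, g in zip(xf, xg)]
    return [x + [a] for x, a in zip(seq, gated)]


def tc_block(seq, make_weights):
    """Stack dense blocks with exponentially increasing dilation."""
    num_layers = math.ceil(math.log2(len(seq)))
    for i in range(1, num_layers + 1):
        seq = dense_block(seq, 2 ** i, make_weights)
    return seq


def constant_weights(channels):
    """Stand-in for learned parameters (assumption for illustration)."""
    return [0.1] * channels, [0.1] * channels


inputs = [[float(t), 1.0] for t in range(8)]  # T = 8 steps, 2 channels
features = tc_block(inputs, constant_weights)
```

Because each block concatenates rather than replaces, the original input channels remain available to every later layer, and the channel count grows by one per dense block in this simplified version.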

Attention Block
Query: Hidden State of Decoder
Key: Hidden State of Encoder
Value: Hidden State of Encoder, combined using the (normalized) attention weights
14

Experiments
● Supervised Learning
○ Few-Shot Learning (Image Classification)
■ n-way DATASET: n classes per task
■ m-shot: m labeled examples per class
● Reinforcement Learning
○ Multi-Armed Bandits
○ Tabular MDPs
○ Continuous Control
○ Visual Navigation
17
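
The n-way, m-shot setup can be sketched as an episode sampler: pick n classes, take m labeled examples of each as the support set, then pick one unseen example from one of those classes as the query. This is a toy illustration with string examples; `sample_episode` and the dataset layout are assumptions, not the paper's data pipeline.

```python
import random


def sample_episode(dataset, n_way, m_shot, rng):
    """dataset maps class name -> list of examples.

    Returns (support, query): support is a shuffled list of
    (example, episode_label) pairs, query is one such pair whose
    example does not appear in the support set. Each chosen class
    needs more than m_shot examples.
    """
    classes = rng.sample(sorted(dataset), n_way)
    support = []
    for episode_label, cls in enumerate(classes):
        for example in rng.sample(dataset[cls], m_shot):
            support.append((example, episode_label))
    rng.shuffle(support)

    query_label = rng.randrange(n_way)
    seen = {example for example, _ in support}
    candidates = [e for e in dataset[classes[query_label]] if e not in seen]
    query = (rng.choice(candidates), query_label)
    return support, query


# Toy dataset: 6 classes with 5 examples each, e.g. "a0" .. "a4".
toy_dataset = {c: [f"{c}{i}" for i in range(5)] for c in "abcdef"}
rng = random.Random(0)
support, query = sample_episode(toy_dataset, n_way=3, m_shot=2, rng=rng)
```

Labels are reassigned per episode (0 to n-1), so the model cannot memorize a fixed class-to-label mapping and has to read it from the support set.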

Results: Few-shot Learning
MAML
Omniglot
18

MAML: Optimization-based Meta-RL
20
https://arxiv.org/abs/1703.03400
PR-094: MAML https://www.youtube.com/watch?v=fxJXXKZb-ik
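
To make "optimization-based" concrete, here is a toy MAML-style update on a scalar parameter. It assumes each task has the loss L_i(theta) = (theta - c_i)^2 so that gradients can be written by hand; the task centers, learning rates, and function names are made up for illustration and are unrelated to the paper's experiments.

```python
def task_loss_grad(theta, center):
    """Gradient of the toy task loss L_i(theta) = (theta - center)^2."""
    return 2.0 * (theta - center)


def maml_step(theta, centers, inner_lr, outer_lr):
    """One meta-update with a single inner gradient step per task."""
    meta_grad = 0.0
    for c in centers:
        # Inner loop: adapt the shared initialization to task i.
        adapted = theta - inner_lr * task_loss_grad(theta, c)
        # Outer gradient via the chain rule:
        # dL_i(adapted)/dtheta = L_i'(adapted) * d(adapted)/dtheta,
        # and d(adapted)/dtheta = 1 - 2 * inner_lr for this quadratic loss.
        meta_grad += task_loss_grad(adapted, c) * (1.0 - 2.0 * inner_lr)
    return theta - outer_lr * meta_grad / len(centers)


centers = [-1.0, 0.0, 4.0]  # one optimum per task
theta = 0.0
for _ in range(200):
    theta = maml_step(theta, centers, inner_lr=0.1, outer_lr=0.1)
```

For this symmetric toy loss the learned initialization settles at the mean of the task optima. The contrast with SNAIL is that adaptation here happens through explicit gradient steps rather than through a forward pass over the experience sequence.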

Results: Tabular MDPs
23
TRPO: So Close, Yet So Far (Woongwon Lee)
https://www.slideshare.net/WoongwonLee/trpo-87165690

Summary
26
● SNAIL
○ Temporal Convolution
○ Soft Attention
● Meta-RL is promising
● Related Work
○ Learning to Reinforcement Learn
○ RL^2
○ MAML
○ Auto-Meta
● Materials
○ Meta-RL(Chelsea Finn): http://rail.eecs.berkeley.edu/deeprlcourse/static/slides/lec-20.pdf
