"Attention Is All You Need" (Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, https://bit.ly/2y7yAD2 presented by Maroua Maachou (Veepee)
5. Problems with the usual Seq2Seq model
→ Difficulty capturing long-term dependencies
→ The need for a contextual embedding of each element
of the sequence
→ Models that are slow and difficult to train (see the sketch below)
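To make these problems concrete, here is a minimal PyTorch sketch (written for this transcript, not code from the presentation; the GRU encoder and all sizes are illustrative assumptions): a plain Seq2Seq encoder must read tokens strictly one step at a time, and everything it learns about the sequence is squeezed into a single fixed-size vector.

import torch
import torch.nn as nn

vocab_size, embed_dim, hidden_dim = 1000, 64, 128  # illustrative sizes

embed = nn.Embedding(vocab_size, embed_dim)
encoder = nn.GRU(embed_dim, hidden_dim, batch_first=True)

# A long input sequence: batch of 1, 500 tokens.
tokens = torch.randint(0, vocab_size, (1, 500))

# The GRU consumes the 500 steps sequentially, so training cannot be
# parallelized over time: this is the "slow and difficult to train" problem.
_, h_final = encoder(embed(tokens))

# Everything the decoder sees about the 500 tokens is this single
# 128-dimensional vector, so distant dependencies are easily lost and
# individual tokens get no context-dependent representation of their own.
print(h_final.shape)  # torch.Size([1, 1, 128])

Attention, and the Transformer in particular, addresses all three points by letting every position attend directly to every other position instead of routing everything through this one vector.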
17. THANKS
Does anyone have any questions?
marouamaachou@gmail.com
18. RESOURCES
■ The Paper: https://arxiv.org/pdf/1706.03762.pdf
■ The Illustrated Transformer: http://jalammar.github.io/illustrated-transformer/
■ The Annotated Transformer: https://nlp.seas.harvard.edu/2018/04/03/attention.html
■ Stanford NLP course: https://www.youtube.com/watch?v=XXtpJxZBa2c&list=PLoROMvodv4rOhcuXMZkNm7j3fVwBBY42z&index=8
■ Attention and Augmented Recurrent Neural Networks: https://distill.pub/2016/augmented-rnns/
■ Other Attention Mechanisms: https://lilianweng.github.io/lil-log/2018/06/24/attention-attention.html#a-family-of-attention-mechanisms
■ My Implementation: https://github.com/marouamaachou