I'll share some slides I prepared for a workshop on NLP with deep learning.
Maybe you could consider using some of the figures.
Or I'd appreciate some feedback.
2. Supervised Learning
Black-box Model
• Approximation of a real-world black box
• Supervision by errors between predictions and labels
[Figure: the model's predictions are compared with the labels (correct/incorrect), and the errors are fed back as supervision]
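To make the supervision loop concrete, here is a minimal sketch (PyTorch assumed; the linear model and toy data are illustrative, not from the slides): the error between predictions and labels drives the parameter updates.

```python
import torch
import torch.nn as nn

# A stand-in "black box" approximation: a linear model with 4 inputs, 2 classes.
model = nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

x = torch.randn(8, 4)               # a batch of inputs
labels = torch.randint(0, 2, (8,))  # the correct labels

optimizer.zero_grad()
loss = loss_fn(model(x), labels)    # error between predictions and labels
loss.backward()                     # the error becomes the supervision signal
optimizer.step()                    # the model moves toward the correct answers
```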
4. Deep Learning: Models as Neural Networks
• NLP: vectors → vectors (e.g., positive/negative)
• Image processing: a tensor → vectors/tensors (e.g., cat, dog, horse, goat…)
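A shape-level sketch of the two mappings above (PyTorch assumed; layer sizes and class counts are made up for illustration):

```python
import torch
import torch.nn as nn

# NLP: vectors in, vectors out (here, 2 sentiment scores: positive/negative).
text_model = nn.Linear(300, 2)                    # 300-dim word vector → 2 scores
word_vec = torch.randn(1, 300)
print(text_model(word_vec).shape)                 # torch.Size([1, 2])

# Image processing: a tensor in, a vector out (here, 4 scores: cat/dog/horse/goat).
image_model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 4))
image = torch.randn(1, 3, 32, 32)                 # a 3-channel image tensor
print(image_model(image).shape)                   # torch.Size([1, 4])
```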
5. Deep Learning: Various Training Examples
• Sentiment analysis: vectors → vectors, supervised by the correct label (positive/negative)
• Translation: vectors → vectors, supervised by the correct translation
• ChatGPT: vectors → vectors, supervised by giving rankings (No. 1, No. 2, No. 3) to outputs
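A quick way to see the first two input→output mappings in action is Hugging Face's pipeline API (transformers assumed installed; models download on first use). Note this shows inference with already-trained models, not the training itself:

```python
from transformers import pipeline

# Sentiment analysis: text in, a label (positive/negative) out.
sentiment = pipeline("sentiment-analysis")
print(sentiment("I am a cat person")[0])          # e.g. {'label': 'POSITIVE', 'score': ...}

# Translation: text in, translated text out.
translate = pipeline("translation_en_to_de", model="t5-small")
print(translate("I am a cat person")[0]["translation_text"])
```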
6. Representations of Tokens in Deep Learning
One-hot encoding (each token, e.g., "a", "am", …, "!", gets one of N slots: No. 1, No. 2, …, No. N)
• Sparse
• High-dimensional (N words)
• Hardcoded

Word embedding (each token maps to a dense vector with entries No. 1, …, No. D)
• Dense
• Lower-dimensional (D-dimensional)
• Learned from data
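A minimal sketch of the contrast (PyTorch assumed; the three-token vocabulary is a toy stand-in for the slide's example):

```python
import torch
import torch.nn as nn

vocab = {"a": 0, "am": 1, "!": 2}   # toy vocabulary: token → slot number
N = len(vocab)                      # vocabulary size
D = 2                               # embedding dimension, with D < N in practice

# One-hot encoding: sparse, N-dimensional, hardcoded.
one_hot = torch.eye(N)[vocab["am"]]               # tensor([0., 1., 0.])

# Word embedding: dense, D-dimensional, learned from data (trainable parameters).
embedding = nn.Embedding(num_embeddings=N, embedding_dim=D)
dense = embedding(torch.tensor(vocab["am"]))      # a 2-dim trainable vector
print(one_hot, dense)
```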
7. Representations of Texts in Deep Learning
[Figure: the tokens of "I am a cat person" are looked up in a vocabulary ("am", "person", "banana", "love", "a", "cat", "dog", "!", "I"), one-hot encoded, mapped to word embeddings, and passed through a neural network to produce a text embedding]
These embeddings are obtained through training.
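A minimal sketch of the word-embedding → text-embedding step (PyTorch assumed; mean pooling stands in for the slide's neural network, which in practice would be an RNN or Transformer):

```python
import torch
import torch.nn as nn

vocab = {"I": 0, "am": 1, "a": 2, "cat": 3, "person": 4}   # toy vocabulary
embedding = nn.Embedding(len(vocab), 4)                    # word embeddings, D = 4

ids = torch.tensor([vocab[t] for t in "I am a cat person".split()])
word_vecs = embedding(ids)            # (5, 4): one vector per token
text_vec = word_vecs.mean(dim=0)      # (4,): a single text embedding via mean pooling
print(text_vec)
```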
8. Self-Supervised Learning: BERT and GPT
• BERT: filling in a blank
• GPT: predicting the next token
Word/text embeddings can be trained without labels.
[Figure: BERT fills the blank in "I am a ___ person" to get "I am a cat person"; GPT predicts one token at a time: "I" → "I am" → "I am a" → "I am a cat"]
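Both objectives can be tried directly with pre-trained models (Hugging Face transformers assumed; the model names are common defaults, not from the slides):

```python
from transformers import pipeline

# BERT-style self-supervision: fill in a blank (masked language modeling).
fill = pipeline("fill-mask", model="bert-base-uncased")
print(fill("I am a [MASK] person.")[0]["token_str"])

# GPT-style self-supervision: predict the next token(s).
generate = pipeline("text-generation", model="gpt2")
print(generate("I am a", max_new_tokens=2)[0]["generated_text"])
```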
9. Transfer Learning, Intuitively
Conventional machine learning
• Processing "coffee beans" from scratch for every task
• Needs a relatively large amount of data

Transfer learning
• Using preprocessed "instant coffee" for various tasks
• Performance can be adjusted with extra "coffee beans"
10. Transfer Learning: Using Coffee (Embeddings)
[Figure: an input image, or a tensor (usually 3-channel), goes into a pre-trained CNN (ResNet, ViT, etc.) and comes out as a converted tensor or a vector, used for image classification, object detection, image captioning, etc.]
[Figure: an input text, or a sequence of vectors, goes into a pre-trained NLP model (ELMo, BERT, GPT, etc.) and comes out as a sequence of vectors or a vector, used for sentiment analysis, token classification, summarization, etc.]
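A minimal sketch of brewing the "instant coffee" on the vision side (torchvision assumed; the input is a dummy tensor standing in for a real image):

```python
import torch
from torchvision import models

# Use a pre-trained ResNet as a frozen feature extractor.
resnet = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
resnet.fc = torch.nn.Identity()          # drop the classification head, keep the embedding
resnet.eval()

image = torch.randn(1, 3, 224, 224)      # a dummy 3-channel input tensor
with torch.no_grad():
    embedding = resnet(image)            # (1, 512) vector, reusable across downstream tasks
print(embedding.shape)
```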
11. Visualizing Embeddings with Dimension Reduction
[Figure: input images or texts go through a pre-trained model (CNN: ResNet, ViT, etc.; NLP: ELMo, BERT, GPT, etc.) to produce embeddings (hundreds or thousands of dimensions), which dimension reduction (t-SNE, UMAP) maps to a 2D visualization]
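A minimal sketch of the reduction step (scikit-learn's t-SNE assumed; the embeddings are random stand-ins for real model outputs):

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE   # umap-learn's UMAP offers a similar fit_transform API

embeddings = np.random.rand(200, 768)            # e.g., 768-dim BERT-like vectors
points_2d = TSNE(n_components=2).fit_transform(embeddings)

plt.scatter(points_2d[:, 0], points_2d[:, 1], s=5)
plt.title("2D visualization of embeddings")
plt.show()
```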
In the figures, a vector represents a word or a label, and a sequence of vectors represents a sentence.
A matrix represents a 1-channel image, and a tensor represents a multi-channel (RGB) image.
The Transformer is a small component within these networks.
ResNet is trained for image classification on the ImageNet dataset.
Classical discrete topic modeling is used in the Job Skill App. There, we filtered the tokens down to only the necessary ones.
LDA: Latent Dirichlet allocation
NMF: Non-negative matrix factorization
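A minimal sketch of both classical methods (scikit-learn assumed; the documents are toy examples):

```python
from sklearn.decomposition import NMF, LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

docs = ["python for data analysis", "coffee beans and roasting",
        "machine learning with python", "brewing instant coffee"]   # toy corpus

counts = CountVectorizer(stop_words="english").fit_transform(docs)  # token counts, stop words removed

lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(counts)
nmf = NMF(n_components=2, random_state=0).fit(counts)
print(lda.components_.shape, nmf.components_.shape)                 # (topics, vocabulary)
```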
SBERT, UMAP, HDBSCAN, etc.; the choice among them is up to your design. A sketch of the pipeline follows below.
Dimensionality reduction is needed to tackle the curse of dimensionality and to obtain better clustering.
Stop words are filtered out in the tokenization step for better topic representations.
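A minimal sketch of that embedding → reduction → clustering pipeline (sentence-transformers, umap-learn, and hdbscan assumed installed; the documents and hyperparameters are illustrative):

```python
from sentence_transformers import SentenceTransformer
import umap
import hdbscan

docs = ["python developer", "java engineer", "barista", "coffee roaster"]   # toy job skills

embeddings = SentenceTransformer("all-MiniLM-L6-v2").encode(docs)   # SBERT text embeddings
reduced = umap.UMAP(n_neighbors=2, n_components=2).fit_transform(embeddings)  # reduce dimensionality
labels = hdbscan.HDBSCAN(min_cluster_size=2).fit_predict(reduced)   # density-based topic clusters
print(labels)
```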