3. Basic Concepts of the BERT Language Model
● Today, most state-of-the-art text models use Transformers to learn how to represent text.
● Ease of use: adding a single output layer on top of the pretrained architecture is enough to reach state-of-the-art accuracy on several NLP tasks.
● 2 categories of NLP tasks:
○ Sentence-level (holistic)
○ Token-level
● Masked Language Modeling (MLM), BERT's pre-training objective (a sketch follows this list)
● 2 stages of BERT model training: pre-training and fine-tuning
● Performed very well on GLUE, SQuAD, and SWAG (natural language understanding benchmarks)
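The following is a minimal sketch of the masked-language-modeling idea, assuming the Hugging Face transformers library and the public bert-base-uncased checkpoint (both are assumptions, not part of the original slides):

from transformers import pipeline

# BERT is pre-trained to recover tokens hidden behind the [MASK] placeholder.
fill_mask = pipeline("fill-mask", model="bert-base-uncased")

# The model ranks vocabulary items for the masked position.
for prediction in fill_mask("The capital of France is [MASK]."):
    print(prediction["token_str"], round(prediction["score"], 3))

During pre-training, roughly 15% of input tokens are masked this way, and the model learns to predict them from both left and right context.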
4. BERT Fine-Tuning Methodology for Text Tasks
2 groups of methodologies:
● The use of pretrained models (transfer learning)
● Multi-task learning
When adapting BERT to specific text processing tasks, a dedicated fine-tuning methodology is required. 3 types of techniques:
1. Further pre-training
2. Fine-tuning strategies (see the sketch below)
3. Multi-task fine-tuning
“Catastrophic forgetting”: overly aggressive fine-tuning can erase the knowledge acquired during pre-training, which is why low learning rates are typically used.
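A minimal fine-tuning sketch, assuming Hugging Face transformers and PyTorch; the texts, labels, and learning rate below are illustrative placeholders:

import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # one extra output layer on top of BERT
)

# A low learning rate helps avoid catastrophic forgetting, i.e. overwriting
# the knowledge acquired during pre-training.
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

batch = tokenizer(
    ["great movie", "terrible movie"],  # hypothetical training texts
    padding=True, truncation=True, max_length=512, return_tensors="pt",
)
labels = torch.tensor([1, 0])

model.train()
loss = model(**batch, labels=labels).loss  # cross-entropy computed internally
loss.backward()
optimizer.step()
optimizer.zero_grad()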
Basic BERT model:
● An encoder with 12 Transformer blocks, 12 attention heads, and a token representation dimension of 768.
● Accepts an input of up to 512 tokens and outputs its vector representation.
● Special [SEP] and [CLS] tokens.
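These numbers map directly onto a model configuration; a sketch assuming the Hugging Face transformers library:

from transformers import BertConfig, BertTokenizer

config = BertConfig(
    num_hidden_layers=12,         # 12 Transformer encoder blocks
    num_attention_heads=12,       # 12 self-attention heads per block
    hidden_size=768,              # dimension of each token representation
    max_position_embeddings=512,  # inputs of up to 512 tokens
)

# The tokenizer wraps every input with the special [CLS] and [SEP] tokens.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
ids = tokenizer("BERT encodes text.")["input_ids"]
print(tokenizer.decode(ids))  # [CLS] bert encodes text. [SEP]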
5. Improving the Subject-Specific Classification of Texts Using BERT
Traditional text embedding models represent each token as a single static embedding.
Problems: ambiguity and subject specificity, since one fixed vector per word can reflect neither context nor domain.
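To illustrate the ambiguity problem, the sketch below (assuming Hugging Face transformers and PyTorch) extracts BERT's contextual vector for the word "bank" in two sentences; a static embedding would give both occurrences the same vector:

import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

def embed_word(sentence: str, word: str) -> torch.Tensor:
    """Return BERT's contextual vector for the first occurrence of `word`."""
    enc = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**enc).last_hidden_state[0]  # shape: (seq_len, 768)
    word_id = tokenizer.convert_tokens_to_ids(word)
    position = (enc["input_ids"][0] == word_id).nonzero()[0].item()
    return hidden[position]

river = embed_word("He sat on the bank of the river.", "bank")
money = embed_word("She deposited cash at the bank.", "bank")
# The similarity is well below 1.0: BERT encodes the two senses differently.
print(torch.cosine_similarity(river, money, dim=0).item())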
A general universal text model, pre-trained on a large corpus of general-purpose texts.
Research is ongoing; its potential as a universal text model has not yet been fully realized.