Introduction to Natural
Language Processing
Pham Quang Nhat Minh
FPT Technology Research Institute (FTRI)
minhpqn@fpt.edu.vn
IBM Watson won the Jeopardy! game
Lecture Outline
● What is Natural Language Processing?
● Why is NLP hard?
● Brief history of NLP
● Fundamental tasks in NLP
● Some applications of NLP
What is Natural Language
Processing?
● A field of computer science, artificial
intelligence, and computational linguistics
● To get computers to perform useful tasks
involving human languages
− Human-Machine communication
− Improving human-human communication
● E.g., machine translation
− Extracting information from texts
Why is NLP interesting?
● Language is involved in many human activities
− Reading, writing, speaking, listening
● Voice can be used as a user interface in many
applications
− Remote controls, virtual assistants like Siri, ...
● NLP is used to extract insights from massive
amounts of textual data
− E.g., hypotheses from medical and health reports
● NLP has many applications
● NLP is hard!
Why is NLP hard?
● Highly ambiguous
● The sentence "I made her duck" has several
readings (example from Jurafsky and Martin)
− I cooked waterfowl for her.
− I cooked waterfowl belonging to her.
− I created the (plaster?) duck she owns.
− I caused her to quickly lower her head or body.
− I waved my magic wand and turned her into
undifferentiated waterfowl.
Why is NLP hard?
I shot an elephant in my pajamas.
Why is NLP hard?
● Natural languages are highly ambiguous at all levels
− Lexical (word’s meaning)
− Syntactic
− Semantic
− Discourse
● Natural languages are fuzzy
● Natural languages involve reasoning about the world
− E.g., it is unlikely that an elephant wears pajamas
Brief history of NLP
● Foundational Insights: 1940s and 1950s
− Two foundational paradigms
● Automaton
● Probabilistic/Information-Theoretic models
● The two camps: 1957-1970
− Symbolic paradigm: the work of Chomsky and others on
formal language theory and generative syntax (1950s to
mid-1960s)
− Stochastic paradigm
● In departments of statistics
Brief history of NLP
● Four paradigms: 1970-1983, explosion in
research in speech and language processing
− Stochastic paradigm
− Logic-based paradigm
− Natural language understanding
− Discourse modeling paradigm
● Empiricism and Finite State Models Redux:
1983-1993
Brief history of NLP
● The Field Comes Together: 1994-1999
− Probabilistic and data-driven models had become
quite standard
● The Rise of Machine Learning: 2000-present
− Large amounts of spoken and textual data became
available
− Widespread availability of high-performance
computing systems
Fundamental Tasks in NLP
● Word Segmentation
● Part-of-speech (POS) tagging
● Syntactic Analysis
● Semantic Analysis
Word Segmentation
● In some languages, words are not separated by spaces,
or a single word may span several space-separated syllables
− 毎年うちの研究室の学生が1-2名国語研でアルバイトさせ
てもらっているので、今日は新しくアルバイトする B4 学
生の紹介である。
− Nhật Bản luôn là thị trường thương mại quan trọng của
Việt Nam (Nhật_Bản luôn là thị_trường thương_mại
quan_trọng của Việt_Nam)
● In such languages, word segmentation is the first
step of NLP systems.
Word Segmentation
● A possible solution is maximum matching
− Start at the beginning of the string, then choose the
longest word in the dictionary that matches the input at the
current position; advance past it and repeat.
− Nhật_Bản luôn là thị trường thương mại quan trọng của Việt
Nam
● Nhật_Bản is a word in the dictionary, but “Nhật Bản luôn” is not
● Problems:
− Maximum matching cannot handle unknown words
− Dependencies between words in the same sentence are not
exploited
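The maximum matching procedure above can be sketched in a few lines of Python. This is only an illustration: the dictionary is a toy set invented for this example, the input is assumed to be already split into syllables on whitespace, and the function name max_match and the max_len limit are hypothetical choices.

```python
def max_match(syllables, dictionary, max_len=4):
    """Greedy left-to-right longest-match segmentation over syllables."""
    words = []
    i = 0
    while i < len(syllables):
        # Try the longest candidate first and shrink until the dictionary matches.
        for n in range(min(max_len, len(syllables) - i), 0, -1):
            candidate = " ".join(syllables[i:i + n])
            # A single syllable is always accepted, so unknown words
            # fall back to one-syllable tokens instead of failing.
            if n == 1 or candidate in dictionary:
                words.append(candidate.replace(" ", "_"))
                i += n
                break
    return words

# Toy dictionary (an assumption for illustration, not a real lexicon).
TOY_DICT = {"Nhật Bản", "thị trường", "thương mại", "quan trọng", "Việt Nam"}
sentence = "Nhật Bản luôn là thị trường thương mại quan trọng của Việt Nam"
segmented = max_match(sentence.split(), TOY_DICT)
print(" ".join(segmented))
```

The fallback to single syllables is where the first problem listed above shows up: a multi-syllable word missing from the dictionary is silently split into pieces.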
Word Segmentation
● Most successful word segmentation tools are
based on machine-learning techniques.
● Word segmentation tools achieve high
accuracy
− vn.vitk (https://github.com/phuonglh/vn.vitk)
obtained 97% accuracy on test data
POS Tagging
● Each word in a sentence can be assigned to a
class such as verb, adjective, or noun
● POS tagging is the process of assigning each word in a
sentence its part of speech, based on:
− Its definition
− Its context in the sentence
● The/DT grand/JJ jury/NN commented/VBD on/IN
a/DT number/NN of/IN other/JJ topics/NNS ./.
Sequence Labeling
● Many NLP problems can be viewed as
sequence labeling
● Each token in a sequence is assigned a label.
● Labels of tokens are dependent on the labels
of other tokens in the sequence, particularly
their neighbors.
John/NNP saw/VBD the/DT saw/NN and/CC decided/VBD to/TO
take/VB it/PRP to/IN the/DT table/NN .
Sequence Labeling as Classification
● Classify each token independently
● Use information about the surrounding tokens
(a sliding window) as features.
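As a rough sketch of the sliding-window idea, the function below builds a feature dictionary for one token from the token itself and its neighbors. The feature names, the window size, and the boundary symbols <s> and </s> are assumptions made for this example; the resulting dictionaries could be passed to any classifier that accepts sparse features.

```python
def window_features(tokens, i, size=1):
    """Features for token i: the token itself plus neighbors within `size`."""
    feats = {
        "word": tokens[i].lower(),
        "is_capitalized": tokens[i][0].isupper(),
    }
    for offset in range(-size, size + 1):
        if offset == 0:
            continue
        j = i + offset
        # Positions outside the sentence are padded with boundary symbols.
        if j < 0:
            neighbor = "<s>"
        elif j >= len(tokens):
            neighbor = "</s>"
        else:
            neighbor = tokens[j].lower()
        feats[f"word[{offset:+d}]"] = neighbor
    return feats

tokens = "John saw the saw and decided to take it to the table .".split()
# The two occurrences of "saw" get different features
# because their neighbors differ.
print(window_features(tokens, 1))
print(window_features(tokens, 3))
```

Because each token is classified independently, nothing forces the predicted tags to be consistent with each other; that limitation motivates the sequence models on the next slide.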
[Figure: a classifier reads "John saw the saw and decided to take it to the table." one token at a time and outputs a tag, e.g. NNP for "John"]
Probabilistic Sequence Models
• Model the probabilities of (token sequence,
tag sequence) pairs, estimated from an annotated data set.
• Exploit dependency between tokens
• Typical sequence models
• Hidden Markov Models (HMMs)
• Conditional Random Fields (CRF)
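For an HMM, the most probable tag sequence is usually found with the Viterbi algorithm. The sketch below is a minimal version for a first-order HMM; all probabilities are invented toy numbers (in practice they would be estimated from an annotated corpus), and the names viterbi, TAGS, START, TRANS and EMIT are hypothetical.

```python
def viterbi(tokens, tags, start_p, trans_p, emit_p):
    """Return the most probable tag sequence under a first-order HMM."""
    # best[t][tag] = (probability of the best path ending in tag at t, backpointer)
    best = [{tag: (start_p[tag] * emit_p[tag].get(tokens[0], 0.0), None)
             for tag in tags}]
    for t in range(1, len(tokens)):
        column = {}
        for tag in tags:
            # Best previous tag for reaching `tag` at position t.
            prev, score = max(
                ((p, best[t - 1][p][0] * trans_p[p][tag]) for p in tags),
                key=lambda pair: pair[1],
            )
            column[tag] = (score * emit_p[tag].get(tokens[t], 0.0), prev)
        best.append(column)
    # Follow the backpointers from the best final tag.
    last = max(tags, key=lambda tag: best[-1][tag][0])
    path = [last]
    for t in range(len(tokens) - 1, 0, -1):
        path.append(best[t][path[-1]][1])
    return path[::-1]

# Toy parameters (assumptions for illustration only).
TAGS = ["NNP", "VBD", "DT", "NN"]
START = {"NNP": 0.5, "VBD": 0.1, "DT": 0.3, "NN": 0.1}
TRANS = {
    "NNP": {"NNP": 0.1, "VBD": 0.7, "DT": 0.1, "NN": 0.1},
    "VBD": {"NNP": 0.1, "VBD": 0.1, "DT": 0.6, "NN": 0.2},
    "DT": {"NNP": 0.1, "VBD": 0.05, "DT": 0.05, "NN": 0.8},
    "NN": {"NNP": 0.1, "VBD": 0.5, "DT": 0.2, "NN": 0.2},
}
EMIT = {
    "NNP": {"John": 1.0},
    "VBD": {"saw": 1.0},
    "DT": {"the": 1.0},
    "NN": {"saw": 1.0},
}
print(viterbi("John saw the saw".split(), TAGS, START, TRANS, EMIT))
# ['NNP', 'VBD', 'DT', 'NN']
```

The word "saw" is emitted by both VBD and NN here, so only the transition probabilities decide which reading wins at each position. A real implementation would typically work with log probabilities to avoid underflow on long sentences.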
Fundamental Tasks in NLP
● Word Segmentation
● Part-of-speech (POS) tagging
● Syntactic Analysis
● Semantic Analysis
Syntax Analysis
● The task of recognizing a sentence and assigning a
syntactic structure to it.
My dog also likes eating sausage.
(ROOT
(S
(NP (PRP$ My) (NN dog))
(ADVP (RB also))
(VP (VBZ likes)
(S
(VP (VBG eating)
(NP (NN sausage)))))
(. .)))
[Figure: the same parse drawn as a tree]
Syntax analysis
● An important task in NLP with many
applications
− Intermediate stage of representation for semantic
analysis
− Plays an important role in applications like question
answering and information extraction
− E.g., What books were written by British women
authors before 1800?
Syntax analysis
● A challenging task in NLP
− Ambiguity problem: one sentence may have many
possible parse trees
● Vietnamese language processing (VNLP) still
lacks accurate syntactic parsers (to my
knowledge)
− Accuracy is about 78 to 84%
Approaches to Syntax analysis
● Top-down parsing
● Bottom-up parsing
● Dynamic programming methods
− CYK algorithm
− Earley algorithm
− Chart parsing
● Probabilistic Context-Free Grammars (PCFG)
− Assign probabilities to derivations
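To make the dynamic programming idea concrete, here is a small CYK sketch that counts parse trees instead of just recognizing the sentence. The grammar is a toy one in Chomsky normal form written for this example (it is not taken from the slides), and the name cyk_count is hypothetical. Applied to the earlier example "I shot an elephant in my pajamas", it finds two trees, one for each attachment of the prepositional phrase.

```python
from collections import defaultdict

# Toy grammar in Chomsky normal form (an assumption for illustration).
BINARY_RULES = [
    ("S", "NP", "VP"),
    ("VP", "V", "NP"),
    ("VP", "VP", "PP"),
    ("NP", "Det", "N"),
    ("NP", "NP", "PP"),
    ("PP", "P", "NP"),
]
LEXICON = {
    "I": ["NP"], "shot": ["V"], "an": ["Det"], "my": ["Det"],
    "elephant": ["N"], "pajamas": ["N"], "in": ["P"],
}

def cyk_count(tokens, start="S"):
    """Count the parse trees rooted in `start` that span all tokens."""
    n = len(tokens)
    # chart[(i, j)][A] = number of ways nonterminal A derives tokens[i:j]
    chart = defaultdict(lambda: defaultdict(int))
    for i, word in enumerate(tokens):
        for symbol in LEXICON.get(word, []):
            chart[(i, i + 1)][symbol] += 1
    # Fill longer spans from shorter ones, trying every split point k.
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):
                for parent, left, right in BINARY_RULES:
                    chart[(i, j)][parent] += chart[(i, k)][left] * chart[(k, j)][right]
    return chart[(0, n)][start]

print(cyk_count("I shot an elephant in my pajamas".split()))  # 2
```

A PCFG parser has the same chart structure; instead of adding counts it would keep, for each span and nonterminal, the probability of the best derivation.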
Fundamental Tasks in NLP
● Word Segmentation
● Part-of-speech (POS) tagging
● Syntactic Analysis
● Semantic Analysis
Semantic Analysis
● Two levels
● Lexical semantics
− Representing meaning of words
− Word sense disambiguation (e.g., the word "bank")
• Compositional semantics
− How words combine to form a larger meaning.
Meaning representations
• First-order predicate calculus
• E.g., Maharani serves vegetarian food.
=> Serves(Maharani, VegetarianFood)
• E.g., I only have five dollars and I don’t have a lot of
time.
=> Have(Speaker, FiveDollars) ∧ ¬Have(Speaker, LotOfTime)
Syntax-driven semantic analysis
Lecture Outline
● What is Natural Language Processing?
● Why is NLP hard?
● Brief history of NLP
● Fundamental tasks in NLP
● Some NLP applications
Some applications
● Information Retrieval
● Information Extraction
● Question Answering
● Text Summarization
● Machine Translation
Information Retrieval
● Query: “list of good sushi restaurants in kyoto?”
[Figure: Architecture of an ad hoc IR system. The document collection is indexed; the query goes through query processing; search (vector space model or probabilistic) over the index returns ranked documents.]
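The vector space variant of the search step can be sketched as TF-IDF vectors ranked by cosine similarity. This is a minimal illustration under simplifying assumptions: whitespace tokenization, raw term frequency, an unsmoothed log IDF, and an invented three-document collection. The function names are hypothetical.

```python
import math
from collections import Counter

def tokenize(text):
    return text.lower().split()

def build_index(docs):
    """Return one TF-IDF vector per document plus the IDF table."""
    tokenized = [tokenize(d) for d in docs]
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for toks in tokenized for term in set(toks))
    idf = {term: math.log(len(docs) / count) for term, count in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in Counter(toks).items()}
               for toks in tokenized]
    return vectors, idf

def cosine(u, v):
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = (math.sqrt(sum(w * w for w in u.values()))
            * math.sqrt(sum(w * w for w in v.values())))
    return dot / norm if norm else 0.0

def search(query, docs):
    """Rank documents by cosine similarity to the query, best first."""
    vectors, idf = build_index(docs)
    q = {t: tf * idf.get(t, 0.0) for t, tf in Counter(tokenize(query)).items()}
    scores = [(cosine(q, vec), doc) for vec, doc in zip(vectors, docs)]
    ranked = sorted(scores, key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in ranked if score > 0]

# Invented toy collection.
DOCS = [
    "good sushi restaurants in kyoto",
    "ramen shops near tokyo station",
    "temples and gardens in kyoto",
]
for doc in search("sushi restaurants in kyoto", DOCS):
    print(doc)
```

For simplicity the index is rebuilt on every call; in the architecture shown in the figure, indexing happens once, ahead of query time.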
Information Extraction
● To extract from unstructured text the information
that is pre-specified or pre-defined in templates
− Fill a number of slots/attributes
● Example: use the template [PERSON, go,
LOCATION, TIME] to extract information about
where an individual goes.
− “President Obama went to Hanoi yesterday.”
− [PERSON = “President Obama”, go, LOCATION =
“Hanoi”, TIME = “yesterday”]
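A very rough way to fill this template is a hand-written pattern. The regular expression below is a hypothetical example written for this one sentence shape; practical systems rely on named entity recognition and learned extractors rather than a single regex.

```python
import re

# Hypothetical hand-written pattern for the [PERSON, go, LOCATION, TIME] template.
PATTERN = re.compile(
    r"^(?P<PERSON>.+?)\s+(?:goes|went|will go)\s+to\s+"
    r"(?P<LOCATION>.+?)"
    r"(?:\s+(?P<TIME>yesterday|today|tomorrow))?\.?$"
)

def extract_go_event(sentence):
    """Fill the template slots, or return None if the sentence does not match."""
    match = PATTERN.match(sentence.strip())
    if match is None:
        return None
    # Drop optional slots that were not found in the sentence.
    return {slot: value for slot, value in match.groupdict().items()
            if value is not None}

print(extract_go_event("President Obama went to Hanoi yesterday."))
# {'PERSON': 'President Obama', 'LOCATION': 'Hanoi', 'TIME': 'yesterday'}
```

The pattern breaks as soon as the wording changes (for example "traveled to" or a time expression at the start of the sentence), which is why slot filling is normally treated as a learning problem.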
Question Answering
● A system that automatically returns answers to a
user’s question by retrieving information from a
collection of documents.
● Differences from an information retrieval system:
− A QA system’s goal is to return an exact answer instead of
documents related to the user’s question.
● Q: Who invented the Internet? A: Robert E. Kahn and Vint
Cerf.
− A QA system requires more sophisticated semantic
analysis.
Question Answering
● Factoid question answering:
− Who/What/Where/When
− Answers are often phrases.
● Non-factoid question answering:
− Definition question
− How/Why
− Answers may span multiple sentences (paragraph)
Figure credit: Dr. Ngo Xuan Bach (http://tinyurl.com/jk2dv33)
Text Summarization
● Text summarization is the process of distilling the
most important information from a text to produce
an abridged version for a particular task or user.
● Useful in the era of information explosion
● Categories of text summarization:
− Single-document/Multi-document summarization
− Extractive/Abstractive summarization
− Query-focused text summarization
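As an illustration of the extractive category, the sketch below scores each sentence by the average corpus frequency of its words and keeps the top-scoring ones. It is a simple frequency baseline written for this example, not the method of any particular tool; the stopword list, the sentence splitter, and the sample text are all assumptions.

```python
import re
from collections import Counter

# Small ad hoc stopword list (an assumption for illustration).
STOPWORDS = {"the", "a", "an", "of", "to", "in", "and", "is", "it",
             "that", "was", "for", "on"}

def content_words(text):
    return [w for w in re.findall(r"[a-z']+", text.lower())
            if w not in STOPWORDS]

def summarize(text, num_sentences=1):
    """Select the sentences whose words are most frequent in the whole text."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text)
                 if s.strip()]
    freq = Counter(content_words(text))

    def score(sentence):
        words = content_words(sentence)
        # Average frequency, so long sentences are not favored automatically.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    top = sorted(sentences, key=score, reverse=True)[:num_sentences]
    # Present the selected sentences in their original order.
    return [s for s in sentences if s in top]

TEXT = ("The bus was tested in China. The bus runs above traffic. "
        "Investors funded the bus through online loans.")
print(summarize(TEXT, num_sentences=1))
```

Abstractive summarization, by contrast, generates new sentences and cannot be reduced to sentence selection like this.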
Example of text summarisation
• https://www.bloomberg.com/view/articles/2016-08-23/china-s-super-bus-exposes-dark-side-of-p2p-lending
• It looked like the future: a wide, elevated Chinese bus that would speed
atop tracks straddling the road while multiple lanes of traffic flowed below.
And the future looked surprisingly near. In early August, a prototype of the
Transit Elevated Bus -- or TEB -- was tested in northern China.
• Demand for such loans has exploded in recent years, growing in volume
from $4.3 billion in 2013 to $71 billion in 2015. The appeal is twofold.
First, China's big state-owned banks have traditionally focused their
attention on other companies in the state sector, at the expense of
consumers and small businesses.
• Meanwhile, cash-rich Chinese are anxious to find yields higher than the
anemic rates paid by China's state banks, which typically fall below 3
percent. China's dodgy stock markets aren't a terribly appealing alternative,
while the attractiveness of Chinese real estate varies by region.
Output of Skype’s summarization chatbot
Machine Translation
● The use of computers to automate some or all of
the process of translating from one language to
another.
● Fully automatic machine translation is one of
the most challenging and active topics in NLP.
● Recent advances in deep learning have driven the
rise of neural machine translation.
Example (Google Translate)
It looked like the future: a wide, elevated Chinese
bus that would speed atop tracks straddling the
road while multiple lanes of traffic flowed below.
And the future looked surprisingly near.
Nó trông giống như tương lai: một rộng, xe buýt cao
Trung Quốc sẽ tăng tốc trên đường ray trải dài
đường trong khi nhiều tuyến đường giao thông chảy
bên dưới. Và tương lai có vẻ ngạc nhiên gần.
Approaches in Machine Translation
• Rule-based methods
• Transfer-based MT
• Interlingual MT
• Dictionary-based MT
• Statistical MT
• Example-based MT
• Hybrid MT
Bernard Vauquois' pyramid showing comparative depths of
intermediary representation, interlingual machine translation at the
peak, followed by transfer-based, then direct translation.
How to learn NLP?
• Build background knowledge in:
• Probability and statistics
• Basic math (linear algebra, calculus)
• Machine learning
• Programming
• Read a textbook or take an online NLP course:
• Speech and Language Processing, by Daniel Jurafsky and
James H. Martin
• YouTube playlist (Dan Jurafsky & Chris Manning: Natural
Language Processing): http://tinyurl.com/lb57fxf
How to learn NLP?
• Practice with programming exercises:
• 100 NLP drill exercises: https://github.com/minhpqn/nlp_100_drill_exercises
• NLP Programming Tutorial, by Graham Neubig: http://www.phontron.com/teaching.php
• Compete in Kaggle data science challenges
(kaggle.com)
Try some NLP applications
• Try Stanford CoreNLP and Stanford Parser
demo
• http://nlp.stanford.edu:8080/corenlp
• http://nlp.stanford.edu:8080/parser
• Solve SAT-style math questions
• http://euclid.allenai.org
References
1. Jurafsky, Daniel and Martin, James H. Speech and Language Processing.
2. An Introduction to Natural Language Processing series (http://tinyurl.com/hdg58wx)
• Section 1 (http://tinyurl.com/ztkwb2b)
• Section 2: Some Brief History (http://tinyurl.com/j48or27)
• Section 3: Fundamental Tasks in NLP (http://tinyurl.com/zk7dgzv)
• Section 4: Some Applications (http://tinyurl.com/jk2dv33)
Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"
Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"
 
2010 INTERSPEECH
2010 INTERSPEECH 2010 INTERSPEECH
2010 INTERSPEECH
 

Introduction to natural language processing

  • 1. Introduction to Natural Language Processing Pham Quang Nhat Minh FPT Technology Research Institute (FTRI) minhpqn@fpt.edu.vn
  • 2. IBM Watson won the Jeopardy! game
  • 3. Lecture Outline ● What is Natural Language Processing? ● Why is NLP hard? ● Brief history of NLP ● Fundamental tasks in NLP ● Some applications of NLP
  • 4. What is Natural Language Processing? ● A field of computer science, artificial intelligence, and computational linguistics ● Its goal is to get computers to perform useful tasks involving human languages − Human-machine communication − Improving human-human communication ● E.g., machine translation − Extracting information from texts
  • 5. Why is NLP interesting? ● Language is involved in many human activities − Reading, writing, speaking, listening ● Voice can be used as a user interface in many applications − Remote controls, virtual assistants like Siri, ... ● NLP is used to acquire insights from massive amounts of textual data − E.g., hypotheses from medical and health reports ● NLP has many applications ● NLP is hard!
  • 6. Why is NLP hard? ● Highly ambiguous ● The sentence "I made her duck" may have several meanings (from the Jurafsky and Martin book) − I cooked waterfowl for her. − I cooked waterfowl belonging to her. − I created the (plaster?) duck she owns. − I caused her to quickly lower her head or body. − I waved my magic wand and turned her into undifferentiated waterfowl.
  • 7. Why is NLP hard? ● "I shot an elephant in my pajamas."
  • 8. Why is NLP hard? ● Natural languages are highly ambiguous at all levels − Lexical (word meaning) − Syntactic − Semantic − Discourse ● Natural languages are fuzzy ● Natural languages involve reasoning about the world − E.g., it is unlikely that an elephant wears pajamas
  • 9. Brief history of NLP ● Foundational Insights: 1940s and 1950s − Two foundational paradigms ● Automaton ● Probabilistic/Information-Theoretic models ● The two camps: 1957-1970 − Symbolic paradigm: the work of Chomsky and others on formal language theory and generative syntax (1950s to mid-1960s) − Stochastic paradigm ● In departments of statistics
  • 10. Brief history of NLP ● Four paradigms: 1970-1983, an explosion of research in speech and language processing − Stochastic paradigm − Logic-based paradigm − Natural language understanding − Discourse modeling paradigm ● Empiricism and Finite State Models Redux: 1983-1993
  • 11. Brief history of NLP ● The Field Comes Together: 1994-1999 − Probabilistic and data-driven models had become quite standard ● The Rise of Machine Learning: 2000-present − Large amounts of spoken and textual data became available − Widespread availability of high-performance computing systems
  • 12. Fundamental Tasks in NLP ● Word Segmentation ● Part-of-speech (POS) tagging ● Syntactic Analysis ● Semantic Analysis
  • 13. Word Segmentation ● In some languages, words are not separated by spaces, or a single word may consist of several space-separated syllables − 毎年うちの研究室の学生が1-2名国語研でアルバイトさせてもらっているので、今日は新しくアルバイトするB4学生の紹介である。 − Nhật Bản luôn là thị trường thương mại quan trọng của Việt Nam (Nhật_Bản luôn là thị_trường thương_mại quan_trọng của Việt_Nam) ● In such languages, word segmentation is the first step of an NLP system.
  • 14. Word Segmentation ● A possible solution is maximum matching − Start at the beginning of the string, then choose the longest word in the dictionary that matches the input at the current position, and continue from the end of that word. − Nhật_Bản luôn là thị trường thương mại quan trọng của Việt Nam ● “Nhật Bản” is a word in the dictionary, but “Nhật Bản luôn” is not ● Problems: − Maximum matching cannot handle unknown words − Dependencies between words in the same sentence are not exploited
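The maximum matching procedure described on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the implementation of any particular tool: the function name `max_match`, the `max_len` window, and the five-entry toy dictionary are all assumptions made for the example. Single syllables are always accepted as a fallback, which is exactly where the unknown-word weakness shows up.

```python
import unicodedata


def max_match(syllables, dictionary, max_len=4):
    """Greedy left-to-right longest-match segmentation over syllables."""
    # Normalize to NFC so visually identical Vietnamese strings compare equal.
    lexicon = {unicodedata.normalize("NFC", w).lower() for w in dictionary}
    syllables = [unicodedata.normalize("NFC", s) for s in syllables]
    words = []
    i = 0
    while i < len(syllables):
        # Try the longest candidate first, then shrink the window.
        for n in range(min(max_len, len(syllables) - i), 0, -1):
            candidate = " ".join(syllables[i:i + n])
            # A single syllable is always accepted, so unknown words
            # are emitted syllable by syllable instead of stalling.
            if n == 1 or candidate.lower() in lexicon:
                words.append("_".join(syllables[i:i + n]))
                i += n
                break
    return words


# Toy dictionary (an assumption for illustration, not a real lexicon).
TOY_DICT = {"Nhật Bản", "thị trường", "thương mại", "quan trọng", "Việt Nam"}

sentence = "Nhật Bản luôn là thị trường thương mại quan trọng của Việt Nam"
segmented = max_match(sentence.split(), TOY_DICT)
print(" ".join(segmented))
```

With this toy dictionary the sentence is split into eight words, five of them two-syllable compounds joined by an underscore. A syllable pair missing from the dictionary would simply be left as two separate words.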
  • 15. Word Segmentation ● The most successful word segmentation tools are based on machine-learning techniques. ● Word segmentation tools achieve high accuracy − vn.vitk (https://github.com/phuonglh/vn.vitk) obtained 97% accuracy on test data
  • 16. POS Tagging ● Each word in a sentence can be classified into classes such as verbs, adjectives, nouns, etc. ● POS tagging is the process of assigning each word in a sentence a particular part of speech, based on: − Its definition − Its context in the sentence ● The/DT grand/JJ jury/NN commented/VBD on/IN a/DT number/NN of/IN other/JJ topics/NNS ./.
  • 17. Sequence Labeling ● Many NLP problems can be viewed as sequence labeling ● Each token in a sequence is assigned a label ● The label of a token depends on the labels of other tokens in the sequence, particularly its neighbors ● Example: John/NNP saw/VBD the/DT saw/NN and/CC decided/VBD to/TO take/VB it/PRP to/IN the/DT table/NN
  • 18. Sequence Labeling as Classification ● Classify each token independently ● Use information about the surrounding tokens as features (sliding window) ● Example: in "John saw the saw and decided to take it to the table.", the classifier assigns NNP to "John"
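To make the sliding-window idea concrete, the sketch below builds a feature dictionary for one token from the token itself and its neighbours. The feature names, the window size of 2, and the `<PAD>` marker are assumptions chosen for illustration; a real tagger would feed such dictionaries into a classifier of its choice.

```python
def window_features(tokens, i, size=2):
    """Features for token i drawn from itself and up to `size` neighbours per side."""
    feats = {
        "word": tokens[i].lower(),
        "is_capitalized": tokens[i][0].isupper(),
        "suffix2": tokens[i][-2:].lower(),
    }
    for offset in range(-size, size + 1):
        if offset == 0:
            continue
        j = i + offset
        # Positions that fall outside the sentence get a padding symbol.
        inside = 0 <= j < len(tokens)
        feats[f"word[{offset:+d}]"] = tokens[j].lower() if inside else "<PAD>"
    return feats


tokens = "John saw the saw and decided to take it to the table .".split()
verb_saw = window_features(tokens, 1)  # "saw" used as a verb
noun_saw = window_features(tokens, 3)  # "saw" used as a noun
print(verb_saw["word[-1]"], noun_saw["word[-1]"])
```

The two occurrences of "saw" share the same `word` feature but differ in their context features ("john" versus "the" to the left), which is the only signal an independent classifier has for telling the verb from the noun.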
  • 19. Probabilistic Sequence Models • Model the probabilities of (token sequence, tag sequence) pairs from an annotated data set • Exploit dependencies between tokens • Typical sequence models • Hidden Markov Models (HMMs) • Conditional Random Fields (CRFs)
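As a rough illustration of how an HMM tagger picks a tag sequence, here is a small Viterbi decoder in Python. All probabilities below are made-up toy values rather than estimates from an annotated corpus, and the `floor` used for unseen events is an assumption standing in for proper smoothing.

```python
import math


def viterbi(words, tags, start_p, trans_p, emit_p, floor=1e-6):
    """Most probable tag sequence under a first-order HMM, computed in log space.

    Assumes `words` is non-empty.
    """
    def lp(p):
        # Floor zero probabilities so the logarithm is always defined.
        return math.log(max(p, floor))

    # best[t][tag] = (log probability of the best path ending in tag, previous tag)
    best = [{
        tag: (lp(start_p.get(tag, 0)) + lp(emit_p[tag].get(words[0], 0)), None)
        for tag in tags
    }]
    for word in words[1:]:
        column = {}
        for tag in tags:
            prev = max(tags, key=lambda q: best[-1][q][0] + lp(trans_p[q].get(tag, 0)))
            score = (best[-1][prev][0] + lp(trans_p[prev].get(tag, 0))
                     + lp(emit_p[tag].get(word, 0)))
            column[tag] = (score, prev)
        best.append(column)

    # Backtrack from the best final tag.
    tag = max(tags, key=lambda q: best[-1][q][0])
    path = [tag]
    for column in reversed(best[1:]):
        tag = column[tag][1]
        path.append(tag)
    return path[::-1]


# Toy model parameters (hand-set for illustration only).
TAGS = ["NNP", "VBD", "DT", "NN"]
START = {"NNP": 0.6, "DT": 0.3, "NN": 0.05, "VBD": 0.05}
TRANS = {
    "NNP": {"VBD": 0.7, "NN": 0.1, "DT": 0.1, "NNP": 0.1},
    "VBD": {"DT": 0.6, "NNP": 0.2, "NN": 0.1, "VBD": 0.1},
    "DT": {"NN": 0.9, "NNP": 0.05, "VBD": 0.025, "DT": 0.025},
    "NN": {"VBD": 0.5, "NN": 0.2, "DT": 0.2, "NNP": 0.1},
}
EMIT = {
    "NNP": {"john": 1.0},
    "VBD": {"saw": 1.0},
    "DT": {"the": 1.0},
    "NN": {"saw": 0.5, "table": 0.5},
}

tagged = viterbi("john saw the saw".split(), TAGS, START, TRANS, EMIT)
print(tagged)
```

Unlike the independent classifier, the decoder scores whole tag sequences: the second "saw" is tagged NN because the transition DT to NN is far more likely than DT to VBD in this toy model.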
  • 20. Fundamental Tasks in NLP ● Word Segmentation ● Part-of-speech (POS) tagging ● Syntactic Analysis ● Semantic Analysis
  • 21. Syntax Analysis ● The task of recognizing a sentence and assigning a syntactic structure to it ● Example: My dog also likes eating sausage. (ROOT (S (NP (PRP$ My) (NN dog)) (ADVP (RB also)) (VP (VBZ likes) (S (VP (VBG eating) (NP (NN sausage))))) (. .)))
  • 22. Syntax analysis ● An important task in NLP with many applications − Provides an intermediate stage of representation for semantic analysis − Plays an important role in applications like question answering and information extraction − E.g., What books were written by British women authors before 1800?
  • 23. Syntax analysis ● A challenging task in NLP − Ambiguity problem: one sentence may have many possible parse trees ● Vietnamese language processing (VNLP) still lacks accurate syntactic parsers (to my knowledge) − Accuracy of about 78-84%
  • 24. Approaches to Syntax analysis ● Top-down parsing ● Bottom-up parsing ● Dynamic programming methods − CYK algorithm − Earley algorithm − Chart parsing ● Probabilistic Context-Free Grammars (PCFG) − Assign probabilities to derivations
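Of the dynamic programming methods listed on this slide, CYK is the easiest to sketch. The recognizer below assumes a grammar in Chomsky normal form and only answers whether a sentence is derivable; it does not build trees or use probabilities. The four-word grammar is a toy assumption written for this example.

```python
from itertools import product


def cyk_recognize(words, lexical, binary, start="S"):
    """Return True if `words` can be derived from `start` by a CNF grammar.

    lexical: dict mapping a word to the set of non-terminals that produce it
    binary:  dict mapping (B, C) to the set of A such that A -> B C is a rule
    """
    n = len(words)
    if n == 0:
        return False
    # table[i][j] holds the non-terminals that derive words[i..j] inclusive.
    table = [[set() for _ in range(n)] for _ in range(n)]
    for i, word in enumerate(words):
        table[i][i] = set(lexical.get(word, ()))
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span - 1
            # Try every split point between i and j.
            for k in range(i, j):
                for left, right in product(table[i][k], table[k + 1][j]):
                    table[i][j] |= binary.get((left, right), set())
    return start in table[0][n - 1]


# Toy grammar in Chomsky normal form (an assumption for illustration).
LEXICAL = {
    "my": {"PRP$"},
    "dog": {"NN"},
    "likes": {"VBZ"},
    "sausage": {"NN", "NP"},
}
BINARY = {
    ("NP", "VP"): {"S"},
    ("PRP$", "NN"): {"NP"},
    ("VBZ", "NP"): {"VP"},
}

accepted = cyk_recognize("my dog likes sausage".split(), LEXICAL, BINARY)
rejected = cyk_recognize("dog my likes sausage".split(), LEXICAL, BINARY)
print(accepted, rejected)
```

A probabilistic (PCFG) variant would store the best score per non-terminal in each cell instead of a plain set, which is how the most likely parse is selected among ambiguous analyses.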
  • 25. Fundamental Tasks in NLP ● Word Segmentation ● Part-of-speech (POS) tagging ● Syntactic Analysis ● Semantic Analysis
  • 26. Semantic Analysis ● Two levels ● Lexical semantics − Representing the meaning of words − Word sense disambiguation (e.g., the word "bank") ● Compositional semantics − How words combine to form larger meanings
  • 27. Meaning representations • First-order predicate calculus • E.g., Maharani serves vegetarian food. => Serves(Maharani, VegetarianFood) • E.g., I only have five dollars and I don’t have a lot of time. => Have(Speaker, FiveDollars) ∧ ¬Have(Speaker, LotOfTime)
  • 29. Lecture Outline ● What is Natural Language Processing? ● Why is NLP hard? ● Brief history of NLP ● Fundamental tasks in NLP ● Some NLP applications
  • 30. Some applications ● Information Retrieval ● Information Extraction ● Question Answering ● Text Summarization ● Machine Translation
  • 31. Information Retrieval ● Query: “list of good sushi restaurants in kyoto?”
  • 32. Architecture of an ad hoc IR system (figure): Document collection → Indexing; Query → Query processing → Search (vector space model or probabilistic) → Ranked documents
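The search step in this architecture can be sketched with a small vector space model: documents and the query become TF-IDF vectors, and documents are ranked by cosine similarity to the query. The smoothed IDF formula, the whitespace tokenizer, and the three toy documents are assumptions made for this example, and the index is rebuilt on every call for brevity.

```python
import math
from collections import Counter


def tfidf_index(docs):
    """Build TF-IDF vectors for the documents and return them with the IDF table."""
    tokenized = [doc.lower().split() for doc in docs]
    df = Counter(term for toks in tokenized for term in set(toks))
    n = len(docs)
    # Smoothed inverse document frequency (one common variant).
    idf = {term: math.log((1 + n) / (1 + count)) + 1 for term, count in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in Counter(toks).items()} for toks in tokenized]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm = (math.sqrt(sum(w * w for w in u.values()))
            * math.sqrt(sum(w * w for w in v.values())))
    return dot / norm if norm else 0.0


def rank(query, docs):
    """Return document indices ordered from most to least similar to the query."""
    vectors, idf = tfidf_index(docs)
    # Query terms never seen in the collection get weight 0.
    q = {t: tf * idf.get(t, 0.0) for t, tf in Counter(query.lower().split()).items()}
    scored = [(cosine(q, vec), i) for i, vec in enumerate(vectors)]
    return [i for _, i in sorted(scored, key=lambda pair: (-pair[0], pair[1]))]


# Toy document collection (an assumption for illustration).
DOCS = [
    "best ramen shops in tokyo",
    "good sushi restaurants in kyoto",
    "kyoto temples and gardens",
]
ranking = rank("list of good sushi restaurants in kyoto", DOCS)
print(ranking)
```

The second document shares the most weighted terms with the query, so it is ranked first. A production system would separate indexing from search and use an inverted index rather than scoring every document.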
  • 33. Information Extraction ● To extract, from unstructured text, information that is pre-specified in templates − Fill a number of slots/attributes ● Example: use the template [PERSON, go, LOCATION, TIME] to extract information about where an individual goes. − “President Obama went to Hanoi yesterday.” − [PERSON = “President Obama”, go, LOCATION = “Hanoi”, TIME = “yesterday”]
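For the [PERSON, go, LOCATION, TIME] template on this slide, slot filling can be illustrated with a single hand-written regular expression. This is only a toy sketch: the pattern, the short list of time expressions, and the function name `fill_template` are assumptions, and real information extraction systems typically rely on named-entity recognition and learned models rather than one regex.

```python
import re

# Capitalized word sequences stand in for PERSON and LOCATION mentions.
NAME = r"[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*"
PATTERN = re.compile(
    rf"(?P<PERSON>{NAME})\s+went\s+to\s+"
    rf"(?P<LOCATION>{NAME})\s+"
    r"(?P<TIME>yesterday|today|last\s+\w+)"
)


def fill_template(sentence):
    """Return the filled slots as a dict, or None if the pattern does not match."""
    match = PATTERN.search(sentence)
    return match.groupdict() if match else None


slots = fill_template("President Obama went to Hanoi yesterday.")
print(slots)
```

Sentences that do not follow the "X went to Y TIME" shape return `None`, which shows how brittle hand-written patterns are compared with the variety of ways the same event can be expressed.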
  • 34. Question Answering ● A system that automatically returns answers to a user’s question by retrieving information from a collection of documents. ● Differences from an information retrieval system: − A QA system’s goal is to return an exact answer instead of documents related to the user’s question. ● Q: Who invented the Internet? A: Robert E. Kahn and Vint Cerf. − A QA system requires more complicated semantic analysis.
  • 35. Question Answering ● Factoid question answering: − Who/What/Where/When − Answers are often phrases. ● Non-factoid question answering: − Definition questions − How/Why − Answers may span multiple sentences (a paragraph)
  • 36. Figure credit: Dr. Ngo Xuan Bach, http://tinyurl.com/jk2dv33
  • 37. Text Summarization ● Text summarization is the process of distilling the most important information from a text to produce an abridged version for a particular task or user. ● Useful in the era of information explosion ● Categories of text summarization: − Single-document/Multi-document summarization − Extractive/Abstractive summarization − Query-focused text summarization
  • 38. Example of text summarization (output by Skype’s Summarization chatbot) • https://www.bloomberg.com/view/articles/2016-08-23/china-s-super-bus-exposes-dark-side-of-p2p-lending • It looked like the future: a wide, elevated Chinese bus that would speed atop tracks straddling the road while multiple lanes of traffic flowed below. And the future looked surprisingly near. In early August, a prototype of the Transit Elevated Bus -- or TEB -- was tested in northern China. • Demand for such loans has exploded in recent years, growing in volume from $4.3 billion in 2013 to $71 billion in 2015. The appeal is twofold. First, China's big state-owned banks have traditionally focused their attention on other companies in the state sector, at the expense of consumers and small businesses. • Meanwhile, cash-rich Chinese are anxious to find yields higher than the anemic rates paid by China's state banks, which typically fall below 3 percent. China's dodgy stock markets aren't a terribly appealing alternative, while the attractiveness of Chinese real estate varies by region.
  • 39. Machine Translation ● The use of computers to automate some or all of the process of translating from one language to another. ● Fully automatic machine translation is one of the most challenging and most active topics in NLP. ● Recent advances in deep learning have driven the trend toward Neural Machine Translation.
  • 40. Example (Google Translate) ● Input: It looked like the future: a wide, elevated Chinese bus that would speed atop tracks straddling the road while multiple lanes of traffic flowed below. And the future looked surprisingly near. ● Output: Nó trông giống như tương lai: một rộng, xe buýt cao Trung Quốc sẽ tăng tốc trên đường ray trải dài đường trong khi nhiều tuyến đường giao thông chảy bên dưới. Và tương lai có vẻ ngạc nhiên gần.
  • 41. Approaches in Machine Translation • Rule-based methods • Transfer-based MT • Interlingual MT • Dictionary-based MT • Statistical MT • Example-based MT • Hybrid MT
  • 42. Bernard Vauquois' pyramid showing comparative depths of intermediary representation, interlingual machine translation at the peak, followed by transfer-based, then direct translation.
  • 43. How to learn NLP? • Have background knowledge of: • Probability and statistics • Basic math (linear algebra, calculus) • Machine learning • Programming • Read a textbook or attend online NLP courses: • Speech and Language Processing, by Jurafsky, Daniel and Martin, James H. • YouTube playlist (Dan Jurafsky & Chris Manning: Natural Language Processing): http://tinyurl.com/lb57fxf
  • 44. How to learn NLP? • Practice with programming exercises: • 100 NLP drill exercises: https://github.com/minhpqn/nlp_100_drill_exercises • NLP Programming Tutorial, by Graham Neubig: http://www.phontron.com/teaching.php • Compete in Kaggle data science challenges (kaggle.com)
  • 45. Try some NLP applications • Try Stanford CoreNLP and Stanford Parser demo • http://nlp.stanford.edu:8080/corenlp • http://nlp.stanford.edu:8080/parser • Solve SAT-style math questions • http://euclid.allenai.org
  • 46. References 1. Speech and Language Processing, by Jurafsky, Daniel and Martin, James H. 2. An Introduction to Natural Language Processing series (http://tinyurl.com/hdg58wx)
  • 47. References • An Introduction to Natural Language Processing - Section 1 (http://tinyurl.com/ztkwb2b) • An Introduction to Natural Language Processing - Section 2: Some Brief History (http://tinyurl.com/j48or27) • An Introduction to Natural Language Processing - Section 3: Fundamental Tasks in NLP (http://tinyurl.com/zk7dgzv) • An Introduction to Natural Language Processing - Section 4: Some Applications (http://tinyurl.com/jk2dv33) • An Introduction to Natural Language Processing (http://tinyurl.com/hdg58wx)