Part of Speech Tagging
Perspectivising NLP: Areas of AI and
their inter-dependencies
[Figure: interdependent areas of AI: Search, Logic, Knowledge Representation, Machine Learning, Planning, Expert Systems, NLP, Vision, Robotics]
Two pictures
[Figure: NLP, Vision, Speech; Statistics and Probability + Knowledge Based]
[Figure: NLP Trinity. Problem: Morph Analysis, Part of Speech Tagging, Parsing, Semantics. Language: Hindi, Marathi, English, French. Algorithm: HMM, MEMM, CRF.]
What it is 
POS tagging is the process of assigning
each word in a sentence a suitable
tag from a given set of tags.
The set of tags is called the tag-set.
Standard tag-set: Penn Treebank (for
English).
Definition 
Tagging is the assignment of a
single part-of-speech tag to each word
(and punctuation marker) in a corpus.
“_“ The_DT guys_NNS that_WDT
make_VBP traditional_JJ hardware_NN
are_VBP really_RB being_VBG
obsoleted_VBN by_IN
microprocessor-based_JJ machines_NNS ,_, ”_” said_VBD
Mr._NNP Benton_NNP ._.
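The word_TAG notation in the example above is easy to read back into (word, tag) pairs. The following is a minimal sketch, assuming tokens are separated by whitespace and the tag follows the last underscore; `parse_tagged` is a hypothetical helper name, not something from the slides.

```python
def parse_tagged(text):
    """Split whitespace-separated 'word_TAG' tokens into (word, tag) pairs."""
    pairs = []
    for token in text.split():
        # Split on the LAST underscore so a word that itself contains '_' stays whole.
        word, tag = token.rsplit("_", 1)
        pairs.append((word, tag))
    return pairs


# A fragment of the tagged sentence from the slide.
sample = "The_DT guys_NNS that_WDT make_VBP traditional_JJ hardware_NN ._."
pairs = parse_tagged(sample)
print(pairs)
```

Splitting from the right keeps punctuation tokens such as `._.` intact, since the word and the tag are both a single period.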
POS Tags 
NN – Noun; e.g. Dog_NN
VM – Main Verb; e.g. Run_VM
VAUX – Auxiliary Verb; e.g. Is_VAUX
JJ – Adjective; e.g. Red_JJ
PRP – Pronoun; e.g. You_PRP
NNP – Proper Noun; e.g. John_NNP
etc.
POS Tag Ambiguity 
In English: I bank1 on the bank2 on the
river bank3 for my transactions.
Bank1 is a verb; the other two banks are
nouns.
In Hindi:
”Khaanaa”: can be a noun (food) or a
verb (to eat)
For Hindi 
Rama achhaa gaata hai. (hai is VAUX : 
Auxiliary verb); Ram sings well 
Rama achha ladakaa hai. (hai is VCOP : 
Copula verb); Ram is a good boy
Process 
List all possible tags for each word in the
sentence.
Choose the best tag sequence.
Example 
”People jump high”. 
People : Noun/Verb 
jump : Noun/Verb 
high : Noun/Verb/Adjective 
We can start with probabilities.
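The first step of the process, listing every possible tag per word, already fixes the search space. The sketch below enumerates the candidate tag sequences for the example sentence; the dictionary simply copies the candidate tags listed above, and the variable names are illustrative only.

```python
from itertools import product

# Candidate tags per word, copied from the example.
candidates = {
    "People": ["Noun", "Verb"],
    "jump": ["Noun", "Verb"],
    "high": ["Noun", "Verb", "Adjective"],
}

sentence = ["People", "jump", "high"]

# Step 1: list all possible tags for each word.
options = [candidates[word] for word in sentence]

# Step 2 (search space only): every combination is a candidate tag sequence.
sequences = list(product(*options))

print(len(sequences))  # 2 * 2 * 3 = 12
for seq in sequences:
    print(" ".join(f"{word}_{tag}" for word, tag in zip(sentence, seq)))
```

Even this three-word sentence has 12 candidate sequences, which is why a probability model is needed to pick the best one.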
Importance of POS tagging 
Ack: presentation by Claire 
Gardent on POS tagging by NLTK
What is Part of Speech (POS) 
Words can be divided into classes that
behave similarly.
Traditionally eight parts of speech in
English: noun, verb, pronoun,
preposition, adverb, conjunction,
adjective and article
More recently larger sets have been
used: e.g. Penn Treebank (45 tags),
Susanne (353 tags).
Why POS 
POS tells us a lot about a word (and the
words near it).
E.g., adjectives are often followed by nouns,
personal pronouns are often followed by verbs,
possessive pronouns by nouns
Pronunciation depends on POS, e.g.
object (stress on the first syllable for NN, on the second
syllable for VM), content, discount
First step in many NLP applications
Categories of POS 
Open and closed classes 
Closed classes have a fixed membership of
words: determiners, pronouns, prepositions
Closed class words are usually function
words: frequently occurring,
grammatically important, often short
(e.g. of, it, the, in)
Open classes: nouns, verbs, adjectives
and adverbs (new words can be added)
Open Class (1/2) 
Nouns: 
Proper nouns (Scotland, BBC), 
common nouns 
count nouns (goat, glass) 
mass nouns (snow, pacifism) 
Verbs: 
actions and processes (run, hope) 
also auxiliary verbs (is, are, am, will, can)
Open Class (2/2) 
Adjectives:
properties and qualities (age, colour,
value)
Adverbs:
modify verbs, or verb phrases, or other
adverbs: Unfortunately John walked home
extremely slowly yesterday
Sentential adverb: unfortunately
Manner adverb: extremely, slowly
Time adverb: yesterday
Closed class 
Prepositions: on, under, over, to, with, 
by 
Determiners: the, a, an, some 
Pronouns: she, you, I, who 
Conjunctions: and, but, or, as, when, if 
Auxiliary verbs: can, may, are
Penn tagset (1/2)
Penn tagset (2/2)
Indian Language Tagset:
Noun
Indian Language Tagset: 
Pronoun
Indian Language Tagset: 
Quantifier
Indian Language Tagset: 
Demonstrative 
3    Demonstrative  DM   DM      Vaha, jo, yaha
3.1  Deictic        DMD  DM DMD  Vaha, yaha
3.2  Relative       DMR  DM DMR  jo, jis
3.3  Wh-word        DMQ  DM DMQ  kis, kaun
     Indefinite     DMI  DM DMI  KoI, kis
Indian Language Tagset: 
Verb, Adjective, Adverb
Indian Language Tagset: 
Postposition, conjunction
Indian Language Tagset: 
Particle
Indian Language Tagset: 
Residuals
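Bigram Assumption
Best tag sequence
$$
\begin{aligned}
T^* &= \arg\max_T P(T \mid W) \\
    &= \arg\max_T P(T)\,P(W \mid T) \quad \text{(by Bayes' theorem)}
\end{aligned}
$$
$$
\begin{aligned}
P(T) &= P(t_0 = \text{\^{}},\; t_1 t_2 \ldots t_n,\; t_{n+1} = .) \\
     &= P(t_0)\,P(t_1 \mid t_0)\,P(t_2 \mid t_1 t_0)\,P(t_3 \mid t_2 t_1 t_0) \cdots P(t_n \mid t_{n-1} t_{n-2} \ldots t_0)\,P(t_{n+1} \mid t_n t_{n-1} \ldots t_0) \\
     &= P(t_0)\,P(t_1 \mid t_0)\,P(t_2 \mid t_1) \cdots P(t_n \mid t_{n-1})\,P(t_{n+1} \mid t_n) \\
     &= \prod_{i=0}^{n+1} P(t_i \mid t_{i-1}) \quad \text{(Bigram Assumption)}
\end{aligned}
$$
Here the i = 0 factor P(t_0 | t_{-1}) stands for P(t_0).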
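Lexical Probability Assumption
$$
\begin{aligned}
P(W \mid T) &= P(w_0 \mid t_0 \ldots t_{n+1})\,P(w_1 \mid w_0, t_0 \ldots t_{n+1})\,P(w_2 \mid w_1 w_0, t_0 \ldots t_{n+1}) \cdots \\
            &\quad\; P(w_n \mid w_0 \ldots w_{n-1}, t_0 \ldots t_{n+1})\,P(w_{n+1} \mid w_0 \ldots w_n, t_0 \ldots t_{n+1})
\end{aligned}
$$
Assumption: A word is determined completely by
its tag. This is inspired by speech recognition.
$$
\begin{aligned}
P(W \mid T) &= P(w_0 \mid t_0)\,P(w_1 \mid t_1) \cdots P(w_{n+1} \mid t_{n+1}) \\
            &= \prod_{i=0}^{n+1} P(w_i \mid t_i) \\
            &= \prod_{i=1}^{n+1} P(w_i \mid t_i) \quad \text{(Lexical Probability Assumption)}
\end{aligned}
$$
The i = 0 factor drops out because w_0 and t_0 are both the start marker ^, so P(w_0 | t_0) = 1.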
Generative Model 
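^_^ People_N Jump_V High_R ._.
[Figure: lattice of candidate tags N, V, A between ^ and ., annotated with "Lexical Probabilities" and "Bigram Probabilities"]
This model is called a generative model.
Here the tags are the states and the words are the observations generated from them.
This is similar to an HMM.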
Bigram probabilities 
N V A 
N 0.2 0.7 0.1 
V 0.6 0.2 0.2 
A 0.5 0.2 0.3
Lexical Probability 
     People   jump        high
N    10^-5    0.4x10^-3   10^-7
V    10^-7    10^-2       10^-7
A    0        0           10^-1
values in cell are P(col-heading/row-heading)
Calculation from actual data
Corpus
^ Ram got many NLP books. He found them
all very interesting.
POS tagged
^ N V A N N . N V N A R A .
Recording numbers 
^ N V A R . 
^ 0 2 0 0 0 0 
N 0 1 2 1 0 1 
V 0 1 0 1 0 0 
A 0 1 0 0 1 1 
R 0 0 0 1 0 0 
. 1 0 0 0 0 0
Probabilities 
^ N V A R . 
^ 0 1 0 0 0 0 
N 0 1/5 2/5 1/5 0 1/5 
V 0 1/2 0 1/2 0 0 
A 0 1/3 0 0 1/3 1/3 
R 0 0 0 1 0 0 
. 1 0 0 0 0 0
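The counts and probabilities above can be reproduced mechanically from the tagged corpus. This is a minimal sketch using relative-frequency estimates. It assumes the second sentence also begins with ^, which is what the counts ^ to N = 2 and . to ^ = 1 in the tables imply. The names `bigram_counts` and `bigram_prob` are illustrative.

```python
from collections import Counter, defaultdict
from fractions import Fraction

# Tag sequence of the two-sentence corpus. Assumption: the second sentence
# also starts with ^ (implied by the ^->N and .->^ counts in the table).
tags = "^ N V A N N . ^ N V N A R A .".split()

# Count each (previous tag, next tag) pair.
bigram_counts = Counter(zip(tags, tags[1:]))

# Total number of outgoing transitions for each previous tag.
totals = defaultdict(int)
for (prev, _nxt), count in bigram_counts.items():
    totals[prev] += count


def bigram_prob(prev, nxt):
    """Relative-frequency estimate of P(nxt | prev)."""
    if totals[prev] == 0:
        return Fraction(0)
    return Fraction(bigram_counts[(prev, nxt)], totals[prev])


# Print the probability table: rows are the previous tag, columns the next tag.
order = ["^", "N", "V", "A", "R", "."]
print("   ", order)
for prev in order:
    print(prev, [str(bigram_prob(prev, nxt)) for nxt in order])
```

Using `Fraction` keeps the entries exact, so they can be compared directly with values such as 2/5 and 1/3 in the table.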
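To find
$$T^* = \arg\max_T P(T)\,P(W \mid T)$$
$$P(T)\,P(W \mid T) = \prod_{i=1}^{n+1} P(t_i \mid t_{i-1})\,P(w_i \mid t_i)$$
(taking P(t_0 = ^) = 1, so the product starts at i = 1)
P(t_i | t_{i-1}): Bigram probability
P(w_i | t_i): Lexical probability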
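Lexical probabilities in this deck are as small as 10^-7, so the product over a long sentence can underflow in floating point. A common workaround, offered here as a sketch and not taken from the slides, is to maximise the sum of logarithms. It uses the bigram form P(t_i | t_{i-1}) from the Bigram Assumption slide. Because the logarithm is monotonic, the best sequence is unchanged; a zero probability maps to minus infinity and rules the sequence out.

```latex
\begin{aligned}
T^* &= \arg\max_T \prod_{i=1}^{n+1} P(t_i \mid t_{i-1})\, P(w_i \mid t_i) \\
    &= \arg\max_T \sum_{i=1}^{n+1} \bigl[ \log P(t_i \mid t_{i-1}) + \log P(w_i \mid t_i) \bigr]
\end{aligned}
```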
Bigram probabilities 
N V A R 
N 0.15 0.7 0.05 0.1 
V 0.6 0.2 0.1 0.1 
A 0.5 0.2 0.3 0 
R 0.1 0.3 0.5 0.1
Lexical Probability 
     People   jump        high
N    10^-5    0.4x10^-3   10^-7
V    10^-7    10^-2       10^-7
A    0        0           10^-1
R    0        0           0
values in cell are P(col-heading/row-heading)
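With both tables in hand, the best tag sequence for "People jump high" can be found by scoring every candidate. The sketch below uses exhaustive enumeration, which is only practical for short sentences and is not an efficient decoder. Two assumptions are made: the bigram table rows are the previous tag, and the lexical values are placed as best recoverable from the slide layout. The transitions from ^ and to . are left out because these tables do not list them.

```python
from itertools import product

# Bigram probabilities P(next | previous); each row is the previous tag.
bigram = {
    "N": {"N": 0.15, "V": 0.7, "A": 0.05, "R": 0.1},
    "V": {"N": 0.6, "V": 0.2, "A": 0.1, "R": 0.1},
    "A": {"N": 0.5, "V": 0.2, "A": 0.3, "R": 0.0},
    "R": {"N": 0.1, "V": 0.3, "A": 0.5, "R": 0.1},
}

# Lexical probabilities P(word | tag); cell placement is an assumption.
lexical = {
    "N": {"People": 1e-5, "jump": 0.4e-3, "high": 1e-7},
    "V": {"People": 1e-7, "jump": 1e-2, "high": 1e-7},
    "A": {"People": 0.0, "jump": 0.0, "high": 1e-1},
    "R": {"People": 0.0, "jump": 0.0, "high": 0.0},
}


def score(words, tags):
    """P(T) * P(W|T), omitting the ^ and . transitions the tables do not give."""
    p = lexical[tags[0]][words[0]]
    for i in range(1, len(words)):
        p *= bigram[tags[i - 1]][tags[i]] * lexical[tags[i]][words[i]]
    return p


words = ["People", "jump", "high"]

# Try every tag sequence of the right length and keep the highest-scoring one.
best = max(product(bigram, repeat=len(words)), key=lambda t: score(words, t))
print(best, score(words, best))
```

Under these numbers the sequence N V A scores highest, mainly because of the large P(high | A) and P(V | N) entries.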

Natural Language processing Parts of speech tagging, its classes, and how to process it
