SlideShare a Scribd company logo
Guided By,
Mrs. Gauri M. Dhopavkar
Presented By,
Ritikesh Bhaskarwar Vimal Shah
Ashwin Borkar Shashil Pohankar
Department of ComputerTechnology
YESHWANTRAO CHAVAN COLLEGE OF ENGINEERING,
Nagpur
(An Autonomous Institution Affiliated to RashtrasantTukadoji Maharaj Nagpur University)
Natural language processing
 Natural language processing (NLP) is a
field of computer science, artificial
intelligence, and linguistics concerned
with the interactions between computers
and human (natural) languages.
 Natural Language Processing (NLP) is the
computerized approach to analysing text
that is based on both a set of theories and
a set of technologies
POS Tagging :
 Part-of-Speech (POS) tagging is the
process of assigning a part-of-speech like
noun, verb, pronoun or other lexical class
marker to each word in a sentence.
 After POS tags are identified, the next
step is chunking, which involves dividing
sentences into non-overlapping non-
recursive phrases.
ते फू ल खूप
सुगंधी
आहे
Marathi POS
Tagger
ते-unidentified
फू ल-noun
खूप-adjective
सुगंधी-
adjective
आहे-verb
THE POSTAGGING EXAMPLE
Need of Marathi POS Tagging :
 Lack of significant tools for Indian
languages
 Dependence of other NLP activities on
POS tagging
 Failure of existing techniques on Indian
Languages
Overview of
POS tagging
Methods for POSTagging
1.Rule Based 2.Stochastic
 The rule based POS tagging
models apply a set of hand
written rules and use
contextual information to
assign POS tags to words.
 A stochastic approach
includes frequency,
probability or statistics. The
simplest stochastic
approach finds out the most
frequently used tag for a
specific word in the
annotated training data and
uses this information to tag
that word in the
unannotated text.
Methods for POSTagging
(cntd.)
3. Hiden Markov Model 4. Maximum Entropy Model
 The HMM model trains on
annotated corpora to find
out the transition and
emission probabilities
 The Maximum Entropy
Model (MEM) is based on
the principle of Maximum
Entropy, which states that
when choosing between a
number of different
probabilistic models for a
set of data, the most valid
model is the one which
makes fewest arbitrary
assumptions about the
nature of the data
Architecture and Design :
 Marathi sentence is taken as input , then
the tokens are created followed by
tagging and finding ambiguity.
TOKENIZING TAGGING FINDING
AMBIGUOUS
WORDS
FINDING
PROBABILITY
ASSIGN TAGS
ACCORDING TO
PROBABILITY
VIEW THE
RESULT
INPUT
Detail of Identified Module :
 Tokenizer :This module is used to get the
tokens of the input sentence.Also, calls
the other modules when required.
 Tagging :These modules is used for
assigning certain tags to tokens and also
search for ambiguous words and also find
their types and assign some special
symbols to them.
Details of identified modules (cntd.)
 Root word : This module is used for
finding the root word of each token
finding it from the Marathi wordnet.
 Probability : This module calculates the
probability and accordingly assigns the tag,
according to the higher probability of
word.
• Showing the results :This module shows
the result.The words are shown with
tags.
Experimentation and Results :
1.
• 1000: If first bit is 1, then we assign a tag as a noun to
the particular word.
• 1100: In this case, the word can be used as both
unidentified.
2.
• 0100: If second bit is 1, then we assign a tag as an
adjective to the particular word.
• 0110: In this case, the word can be used as other
words.
3.
• 0010: If third bit is 1, then we assign a tag as an adverb
to the particular word.
• 0001: If fourth bit is 1, then we assign a tag as a verb
to the particular word.
Advantages :
 A POS tagger can be seen as a first-step
towards tightening the integration
between speech recognition and natural
language processing.
 A POS tagger in the language model aids
in the identification of boundary tones and
speech repairs, redefining the speech
recognition problem.
Advantages (cntd.):
 A typical NLP system consists of
tokenization, sentence delimitation, part-of-
speech (POS) tagging, phrase chunking,
parsing, and concept mapping. As one of the
initial steps, POS tagging determines the part
of speech for each token in a sentence.
 Managers, educators, Trainers, Sales people
are able to accurately assess the needs of a
group, improves questioning techniques thus
improving their skills to achieve more
consistent results.
Limitations :
 User Cannot enter more than one sentence
i.e. cannot enter paragraph.
 It is not able to detect and report the gender
of the word i.e. Morphological analysis in
not done.
 When ambiguity is encountered it is
searched for the POS of the ambiguous word
if it contains less or no word with the correct
POS and there are more number of words for
other POS then it shows incorrect POS for
the ambiguous word.
Applications :
 Information Retrieval
 Speech synthesis
 Word Sense Disambiguation (WSD)
 Machine Translation (MT)
-Text to Text
-Speech to Speech
Snapshots
Conclusion and Future Scope :
 The POS tagger described here is very
simple and efficient for automatic tagging,
but the morphological complexity of the
Marathi make it hard.The performance of
the current system is good and result
achieved by this method are excellent. In
future we wish to improve the accuracy
our system by adding more tagged
sentence in our training corpus.

More Related Content

What's hot

D3 dhanalakshmi
D3 dhanalakshmiD3 dhanalakshmi
D3 dhanalakshmi
Jasline Presilda
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processing
Minh Pham
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Basha Chand
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
Yuriy Guts
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Saurav Aryal
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games Research
Jose Zagal
 
Nlp presentation
Nlp presentationNlp presentation
Nlp presentation
Surya Sg
 
Natural Language Processing glossary for Coders
Natural Language Processing glossary for CodersNatural Language Processing glossary for Coders
Natural Language Processing glossary for Coders
Aravind Mohanoor
 
A Review on a web based Punjabi t o English Machine Transliteration System
A Review on a web based Punjabi t o English Machine Transliteration SystemA Review on a web based Punjabi t o English Machine Transliteration System
A Review on a web based Punjabi t o English Machine Transliteration System
Editor IJCATR
 
Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introduction
ananth
 
NLP
NLPNLP
Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)
VenkateshMurugadas
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language Processing
Michel Bruley
 
Ijartes v1-i1-002
Ijartes v1-i1-002Ijartes v1-i1-002
Ijartes v1-i1-002
IJARTES
 
NLPinAAC
NLPinAACNLPinAAC
NLPinAAC
Divya Sugumar
 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
TELKOMNIKA JOURNAL
 
NLP Project Presentation
NLP Project PresentationNLP Project Presentation
NLP Project Presentation
Aryak Sengupta
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Aanchal Chaurasia
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
Kuppusamy P
 
Technical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with PythonTechnical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with Python
Michelle Purnama
 

What's hot (20)

D3 dhanalakshmi
D3 dhanalakshmiD3 dhanalakshmi
D3 dhanalakshmi
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games Research
 
Nlp presentation
Nlp presentationNlp presentation
Nlp presentation
 
Natural Language Processing glossary for Coders
Natural Language Processing glossary for CodersNatural Language Processing glossary for Coders
Natural Language Processing glossary for Coders
 
A Review on a web based Punjabi t o English Machine Transliteration System
A Review on a web based Punjabi t o English Machine Transliteration SystemA Review on a web based Punjabi t o English Machine Transliteration System
A Review on a web based Punjabi t o English Machine Transliteration System
 
Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introduction
 
NLP
NLPNLP
NLP
 
Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language Processing
 
Ijartes v1-i1-002
Ijartes v1-i1-002Ijartes v1-i1-002
Ijartes v1-i1-002
 
NLPinAAC
NLPinAACNLPinAAC
NLPinAAC
 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
 
NLP Project Presentation
NLP Project PresentationNLP Project Presentation
NLP Project Presentation
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Technical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with PythonTechnical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with Python
 

Viewers also liked

Power point arsenal
Power point arsenalPower point arsenal
Power point arsenal478551412
 
Google cloud
Google cloudGoogle cloud
Google cloud
478551412
 
Google cloud
Google cloudGoogle cloud
Google cloud
478551412
 
Music video plans
Music video plansMusic video plans
Music video plans
kieranhyde
 
Planning digipak and advert
Planning digipak and advertPlanning digipak and advert
Planning digipak and advert
kieranhyde
 
Q analysis
Q analysisQ analysis
Q analysis
kieranhyde
 

Viewers also liked (6)

Power point arsenal
Power point arsenalPower point arsenal
Power point arsenal
 
Google cloud
Google cloudGoogle cloud
Google cloud
 
Google cloud
Google cloudGoogle cloud
Google cloud
 
Music video plans
Music video plansMusic video plans
Music video plans
 
Planning digipak and advert
Planning digipak and advertPlanning digipak and advert
Planning digipak and advert
 
Q analysis
Q analysisQ analysis
Q analysis
 

Similar to Presentation1

PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHODPART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
ijait
 
Natural Language Processing .pdf
Natural Language Processing .pdfNatural Language Processing .pdf
Natural Language Processing .pdf
Anime196637
 
Top 10 Must-Know NLP Techniques for Data Scientists
Top 10 Must-Know NLP Techniques for Data ScientistsTop 10 Must-Know NLP Techniques for Data Scientists
Top 10 Must-Know NLP Techniques for Data Scientists
Xavor Corporation - Redefining Health Technology
 
Natural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewNatural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overview
Benjaminlapid1
 
Detecting paraphrases in the Marathi language
Detecting paraphrases in the Marathi languageDetecting paraphrases in the Marathi language
Detecting paraphrases in the Marathi language
BOHRInternationalJou1
 
NLP Techniques for Sentiment Anaysis.docx
NLP Techniques for Sentiment Anaysis.docxNLP Techniques for Sentiment Anaysis.docx
NLP Techniques for Sentiment Anaysis.docx
KevinSims18
 
HMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDIHMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDI
cscpconf
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
Journal For Research
 
IRJET -Survey on Named Entity Recognition using Syntactic Parsing for Hindi L...
IRJET -Survey on Named Entity Recognition using Syntactic Parsing for Hindi L...IRJET -Survey on Named Entity Recognition using Syntactic Parsing for Hindi L...
IRJET -Survey on Named Entity Recognition using Syntactic Parsing for Hindi L...
IRJET Journal
 
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNINGDETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
csandit
 
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNINGDETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
cscpconf
 
Natural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdfNatural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdf
Moar Digital 360
 
NLP Deep Learning with Tensorflow
NLP Deep Learning with TensorflowNLP Deep Learning with Tensorflow
NLP Deep Learning with Tensorflow
seungwoo kim
 
NLP.pptx
NLP.pptxNLP.pptx
NLP.pptx
Rahul Borate
 
Language Modeling.docx
Language Modeling.docxLanguage Modeling.docx
Language Modeling.docx
AnuradhaRaheja1
 
Cl35491494
Cl35491494Cl35491494
Cl35491494
IJERA Editor
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
socarem879
 
Detection of slang words in e data using semi supervised learning
Detection of slang words in e data using semi supervised learningDetection of slang words in e data using semi supervised learning
Detection of slang words in e data using semi supervised learning
ijaia
 
AI UNIT 3 - SRCAS JOC.pptx enjoy this ppt
AI UNIT 3 - SRCAS JOC.pptx enjoy this pptAI UNIT 3 - SRCAS JOC.pptx enjoy this ppt
AI UNIT 3 - SRCAS JOC.pptx enjoy this ppt
pavankalyanadroittec
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
Shashank Shisodia
 

Similar to Presentation1 (20)

PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHODPART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
 
Natural Language Processing .pdf
Natural Language Processing .pdfNatural Language Processing .pdf
Natural Language Processing .pdf
 
Top 10 Must-Know NLP Techniques for Data Scientists
Top 10 Must-Know NLP Techniques for Data ScientistsTop 10 Must-Know NLP Techniques for Data Scientists
Top 10 Must-Know NLP Techniques for Data Scientists
 
Natural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewNatural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overview
 
Detecting paraphrases in the Marathi language
Detecting paraphrases in the Marathi languageDetecting paraphrases in the Marathi language
Detecting paraphrases in the Marathi language
 
NLP Techniques for Sentiment Anaysis.docx
NLP Techniques for Sentiment Anaysis.docxNLP Techniques for Sentiment Anaysis.docx
NLP Techniques for Sentiment Anaysis.docx
 
HMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDIHMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDI
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
 
IRJET -Survey on Named Entity Recognition using Syntactic Parsing for Hindi L...
IRJET -Survey on Named Entity Recognition using Syntactic Parsing for Hindi L...IRJET -Survey on Named Entity Recognition using Syntactic Parsing for Hindi L...
IRJET -Survey on Named Entity Recognition using Syntactic Parsing for Hindi L...
 
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNINGDETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
 
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNINGDETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
DETECTION OF JARGON WORDS IN A TEXT USING SEMI-SUPERVISED LEARNING
 
Natural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdfNatural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdf
 
NLP Deep Learning with Tensorflow
NLP Deep Learning with TensorflowNLP Deep Learning with Tensorflow
NLP Deep Learning with Tensorflow
 
NLP.pptx
NLP.pptxNLP.pptx
NLP.pptx
 
Language Modeling.docx
Language Modeling.docxLanguage Modeling.docx
Language Modeling.docx
 
Cl35491494
Cl35491494Cl35491494
Cl35491494
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 
Detection of slang words in e data using semi supervised learning
Detection of slang words in e data using semi supervised learningDetection of slang words in e data using semi supervised learning
Detection of slang words in e data using semi supervised learning
 
AI UNIT 3 - SRCAS JOC.pptx enjoy this ppt
AI UNIT 3 - SRCAS JOC.pptx enjoy this pptAI UNIT 3 - SRCAS JOC.pptx enjoy this ppt
AI UNIT 3 - SRCAS JOC.pptx enjoy this ppt
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
 

Recently uploaded

Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
maazsz111
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
Shinana2
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
marufrahmanstratejm
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 

Recently uploaded (20)

Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 

Presentation1

  • 1. Guided By, Mrs. Gauri M. Dhopavkar Presented By, Ritikesh Bhaskarwar Vimal Shah Ashwin Borkar Shashil Pohankar
  • 2. Department of ComputerTechnology YESHWANTRAO CHAVAN COLLEGE OF ENGINEERING, Nagpur (An Autonomous Institution Affiliated to RashtrasantTukadoji Maharaj Nagpur University)
  • 3. Natural language processing  Natural language processing (NLP) is a field of computer science, artificial intelligence, and linguistics concerned with the interactions between computers and human (natural) languages.  Natural Language Processing (NLP) is the computerized approach to analysing text that is based on both a set of theories and a set of technologies
  • 4.
  • 5. POS Tagging :  Part-of-Speech (POS) tagging is the process of assigning a part-of-speech like noun, verb, pronoun or other lexical class marker to each word in a sentence.  After POS tags are identified, the next step is chunking, which involves dividing sentences into non-overlapping non- recursive phrases.
  • 6. ते फू ल खूप सुगंधी आहे Marathi POS Tagger ते-unidentified फू ल-noun खूप-adjective सुगंधी- adjective आहे-verb THE POSTAGGING EXAMPLE
  • 7. Need of Marathi POS Tagging :  Lack of significant tools for Indian languages  Dependence of other NLP activities on POS tagging  Failure of existing techniques on Indian Languages
  • 9. Methods for POSTagging 1.Rule Based 2.Stochastic  The rule based POS tagging models apply a set of hand written rules and use contextual information to assign POS tags to words.  A stochastic approach includes frequency, probability or statistics. The simplest stochastic approach finds out the most frequently used tag for a specific word in the annotated training data and uses this information to tag that word in the unannotated text.
  • 10. Methods for POSTagging (cntd.) 3. Hiden Markov Model 4. Maximum Entropy Model  The HMM model trains on annotated corpora to find out the transition and emission probabilities  The Maximum Entropy Model (MEM) is based on the principle of Maximum Entropy, which states that when choosing between a number of different probabilistic models for a set of data, the most valid model is the one which makes fewest arbitrary assumptions about the nature of the data
  • 11. Architecture and Design :  Marathi sentence is taken as input , then the tokens are created followed by tagging and finding ambiguity. TOKENIZING TAGGING FINDING AMBIGUOUS WORDS FINDING PROBABILITY ASSIGN TAGS ACCORDING TO PROBABILITY VIEW THE RESULT INPUT
  • 12. Detail of Identified Module :  Tokenizer :This module is used to get the tokens of the input sentence.Also, calls the other modules when required.  Tagging :These modules is used for assigning certain tags to tokens and also search for ambiguous words and also find their types and assign some special symbols to them.
  • 13. Details of identified modules (cntd.)  Root word : This module is used for finding the root word of each token finding it from the Marathi wordnet.  Probability : This module calculates the probability and accordingly assigns the tag, according to the higher probability of word. • Showing the results :This module shows the result.The words are shown with tags.
  • 14. Experimentation and Results : 1. • 1000: If first bit is 1, then we assign a tag as a noun to the particular word. • 1100: In this case, the word can be used as both unidentified. 2. • 0100: If second bit is 1, then we assign a tag as an adjective to the particular word. • 0110: In this case, the word can be used as other words. 3. • 0010: If third bit is 1, then we assign a tag as an adverb to the particular word. • 0001: If fourth bit is 1, then we assign a tag as a verb to the particular word.
  • 15. Advantages :  A POS tagger can be seen as a first-step towards tightening the integration between speech recognition and natural language processing.  A POS tagger in the language model aids in the identification of boundary tones and speech repairs, redefining the speech recognition problem.
  • 16. Advantages (cntd.):  A typical NLP system consists of tokenization, sentence delimitation, part-of- speech (POS) tagging, phrase chunking, parsing, and concept mapping. As one of the initial steps, POS tagging determines the part of speech for each token in a sentence.  Managers, educators, Trainers, Sales people are able to accurately assess the needs of a group, improves questioning techniques thus improving their skills to achieve more consistent results.
  • 17. Limitations :  User Cannot enter more than one sentence i.e. cannot enter paragraph.  It is not able to detect and report the gender of the word i.e. Morphological analysis in not done.  When ambiguity is encountered it is searched for the POS of the ambiguous word if it contains less or no word with the correct POS and there are more number of words for other POS then it shows incorrect POS for the ambiguous word.
  • 18. Applications :  Information Retrieval  Speech synthesis  Word Sense Disambiguation (WSD)  Machine Translation (MT) -Text to Text -Speech to Speech
  • 20.
  • 21.
  • 22.
  • 23. Conclusion and Future Scope :  The POS tagger described here is very simple and efficient for automatic tagging, but the morphological complexity of the Marathi make it hard.The performance of the current system is good and result achieved by this method are excellent. In future we wish to improve the accuracy our system by adding more tagged sentence in our training corpus.