Module 13
Natural Language Processing
        Version 2 CSE IIT, Kharagpur
13.1 Instructional Objective
•   The students should understand the necessity of natural language processing in
    building an intelligent system
•   Students should understand the difference between natural and formal language and
    the difficulty in processing the former
•   Students should understand the ambiguities that arise in natural language processing
•   Students should understand the kinds of language information required, such as
        o Phonology
        o Morphology
        o Syntax
        o Semantics
        o Discourse
        o World knowledge
•   Students should understand the steps involved in natural language understanding and
    generation
•   The student should be familiar with basic language processing operations like
        o Morphological analysis
        o Parts-of-Speech tagging
        o Lexical processing
        o Semantic processing
        o Knowledge representation

At the end of this lesson the student should be able to do the following:
    • Design the processing steps required for an NLP task
    • Implement the processing techniques.




Lesson 40
Issues in NLP
13.1 Natural Language Processing
Natural Language Processing (NLP) is the process of computer analysis of input provided
in a human language (natural language), and conversion of this input into a useful form of
representation.

The field of NLP is primarily concerned with getting computers to perform useful and
interesting tasks with human languages. The field of NLP is secondarily concerned with
helping us come to a better understanding of human language.

   •   The input/output of an NLP system can be:
           – written text
           – speech
   •   We will mostly be concerned with written text (not speech).
   •   To process written text, we need:
           – lexical, syntactic, semantic knowledge about the language
           – discourse information, real world knowledge
   •   To process spoken language, we need everything required to process written text,
       and must also meet the challenges of speech recognition and speech synthesis.

There are two components of NLP.

   •   Natural Language Understanding
          – Mapping the given input in the natural language into a useful
             representation.
          – Different levels of analysis are required:
             morphological analysis,
             syntactic analysis,
             semantic analysis,
             discourse analysis, …
   •   Natural Language Generation
          – Producing output in the natural language from some internal
             representation.
          – Different levels of synthesis are required:
             deep planning (what to say),
             syntactic generation
   •   NL Understanding is much harder than NL Generation, but both are hard.

The difficulty in NL understanding arises from the following facts:

   •   Natural language is extremely rich in form and structure, and very ambiguous.
          – How to represent meaning,
          – Which structures map to which meaning structures.
   •   One input can mean many different things. Ambiguity can be at different levels.

          –    Lexical (word level) ambiguity -- different meanings of words
          –    Syntactic ambiguity -- different ways to parse the sentence
          –    Interpreting partial information -- how to interpret pronouns
          –    Contextual information -- context of the sentence may affect the meaning
               of that sentence.
   •   Many inputs can mean the same thing.
   •   Interaction among components of the input is not clear.

The following kinds of language-related information are useful in NLP:

   •   Phonology – concerns how words are related to the sounds that realize them.

   •   Morphology – concerns how words are constructed from more basic meaning
       units called morphemes. A morpheme is the primitive unit of meaning in a
       language.

   •   Syntax – concerns how words can be put together to form correct sentences and
       determines what structural role each word plays in the sentence and what phrases
       are subparts of other phrases.

   •   Semantics – concerns what words mean and how these meanings combine in
       sentences to form sentence meaning. Semantics is the study of
       context-independent meaning.

   •   Pragmatics – concerns how sentences are used in different situations and how
       use affects the interpretation of the sentence.

   •   Discourse – concerns how the immediately preceding sentences affect the
       interpretation of the next sentence. For example, interpreting pronouns and
       interpreting the temporal aspects of the information.

   •   World Knowledge – includes general knowledge about the world, as well as what
       each language user must know about the other’s beliefs and goals.


13.1.1 Ambiguity

I made her duck.

   •   How many different interpretations does this sentence have?
   •   What are the reasons for the ambiguity?
   •   The categories of linguistic knowledge can be thought of as
       ambiguity-resolving components.
   •   How can each ambiguous piece be resolved?
   •   Does speech input make the sentence even more ambiguous?
           – Yes: word boundaries must also be decided.
   •   Some interpretations of "I made her duck":
           1. I cooked duck for her.
           2. I cooked duck belonging to her.
           3. I created a toy duck which she owns.
           4. I caused her to quickly lower her head or body.
           5. I used magic and turned her into a duck.
   •   duck – morphologically and syntactically ambiguous:
               noun or verb.
   •   her – syntactically ambiguous: dative or possessive.
   •   make – semantically ambiguous: cook or create.
   •   make – syntactically ambiguous:
           – Transitive – takes a direct object (interpretation 2).
           – Ditransitive – takes two objects (interpretation 5).
           – Takes a direct object and a verb (interpretation 4).

Models and algorithms are introduced to resolve ambiguities at different levels:

   •   part-of-speech tagging -- deciding whether duck is a verb or a noun.
   •   word-sense disambiguation -- deciding whether make means create or cook.
   •   lexical disambiguation -- part-of-speech tagging and word-sense
       disambiguation are two important kinds of lexical disambiguation.
   •   syntactic disambiguation -- deciding whether her and duck belong to the same
       phrase is an example of syntactic disambiguation, and can be addressed by
       probabilistic parsing.
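
Part-of-speech tagging can be illustrated with a small hidden Markov model decoded by the Viterbi algorithm. The sketch below is only an illustration: the tag set (PRON, VERB, POSS, NOUN) and every probability in START, TRANS and EMIT are invented for this example, not estimated from data or taken from this lesson.

```python
# Toy HMM: all tags and probabilities below are assumptions made up for illustration.
TAGS = ["PRON", "VERB", "POSS", "NOUN"]
START = {"PRON": 0.6, "VERB": 0.1, "POSS": 0.1, "NOUN": 0.2}
TRANS = {
    "PRON": {"VERB": 0.7, "NOUN": 0.1, "PRON": 0.1, "POSS": 0.1},
    "VERB": {"PRON": 0.3, "POSS": 0.3, "NOUN": 0.3, "VERB": 0.1},
    "POSS": {"NOUN": 0.9, "VERB": 0.04, "PRON": 0.03, "POSS": 0.03},
    "NOUN": {"VERB": 0.5, "NOUN": 0.2, "PRON": 0.15, "POSS": 0.15},
}
EMIT = {
    "PRON": {"I": 0.5, "her": 0.5},
    "VERB": {"made": 0.7, "duck": 0.3},
    "POSS": {"her": 1.0},
    "NOUN": {"duck": 1.0},
}


def viterbi(words, tags=TAGS, start=START, trans=TRANS, emit=EMIT):
    """Return the most probable tag sequence for `words` under the toy HMM."""
    # best[i][t] = (probability of the best path ending in tag t at word i, previous tag)
    best = [{t: (start.get(t, 0.0) * emit[t].get(words[0], 0.0), None) for t in tags}]
    for i in range(1, len(words)):
        column = {}
        for t in tags:
            # Pick the best predecessor tag for t.
            prev, score = max(
                ((p, best[i - 1][p][0] * trans[p].get(t, 0.0)) for p in tags),
                key=lambda pair: pair[1],
            )
            column[t] = (score * emit[t].get(words[i], 0.0), prev)
        best.append(column)
    # Follow the back-pointers from the best final tag.
    path = [max(tags, key=lambda t: best[-1][t][0])]
    for i in range(len(words) - 1, 0, -1):
        path.append(best[i][path[-1]][1])
    return list(reversed(path))


print(viterbi("I made her duck".split()))  # duck tagged as a noun
print(viterbi("I duck".split()))           # duck tagged as a verb
```

With these made-up numbers the tagger prefers the noun reading of duck after her, and the verb reading directly after I, which shows how the surrounding tags decide the category of an ambiguous word.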

13.1.2 Models to represent Linguistic Knowledge

   •   We will use certain formalisms (models) to represent the required linguistic
       knowledge.
   •   State Machines -- FSAs, FSTs, HMMs, ATNs, RTNs
   •   Formal Rule Systems -- Context Free Grammars, Unification Grammars,
       Probabilistic CFGs.
   •   Logic-based Formalisms -- first-order predicate logic, some higher-order logic.
   •   Models of Uncertainty -- Bayesian probability theory.
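
As a concrete instance of the simplest formalism in this list, a deterministic finite-state automaton (FSA) can be written as a transition table plus a small recognizer. The toy language used here (strings of the form b, two or more a's, then !) and the names SHEEP and accepts are illustrative assumptions, not part of this lesson.

```python
def accepts(transitions, start, finals, symbols):
    """Run a deterministic FSA over `symbols`; True if it ends in a final state."""
    state = start
    for symbol in symbols:
        state = transitions.get((state, symbol))
        if state is None:  # no transition defined: reject
            return False
    return state in finals


# Hypothetical toy automaton: accepts "baa!", "baaa!", "baaaa!", ...
SHEEP = {
    (0, "b"): 1,
    (1, "a"): 2,
    (2, "a"): 3,
    (3, "a"): 3,  # loop: any number of additional a's
    (3, "!"): 4,
}

for text in ["baa!", "baaaa!", "ba!", "baa"]:
    print(text, accepts(SHEEP, start=0, finals={4}, symbols=text))
```

The same table-driven idea extends to finite-state transducers by attaching an output symbol to each transition.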

13.1.3 Algorithms to Manipulate Linguistic Knowledge

   •   We will use algorithms to manipulate the models of linguistic knowledge to
       produce the desired behavior.
   •   Most of the algorithms we will study are transducers and parsers.
           – These algorithms construct some structure based on their input.
   •   Since the language is ambiguous at all levels,
       these algorithms are never simple processes.
   •   Most of the algorithms that will be used fall into the following categories:
           – state space search
           – dynamic programming


13.2 Natural Language Understanding
The steps in natural language understanding are as follows:

           Words

Morphological Analysis

           Morphologically analyzed words (another step: POS tagging)

Syntactic Analysis

           Syntactic Structure

Semantic Analysis

           Context-independent meaning representation

Discourse Processing

            Final meaning representation
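
The sequence of steps above can be sketched as a chain of functions, each consuming the output of the previous one. Only the morphological stage does real (toy) work here, using an invented stem list and suffix list; the syntactic, semantic and discourse stages are labelled placeholders that merely wrap their input, and all function names are assumptions made for illustration.

```python
STEMS = {"duck", "cook", "walk"}  # hypothetical toy lexicon
SUFFIXES = ("ing", "ed", "s")     # hypothetical suffix list


def morphological_analysis(words):
    """Split each word into (stem, suffix) when the stem is in the toy lexicon."""
    analyzed = []
    for word in words:
        for suffix in SUFFIXES:
            stem = word[: -len(suffix)]
            if word.endswith(suffix) and stem in STEMS:
                analyzed.append((stem, suffix))
                break
        else:
            analyzed.append((word, ""))  # no known stem + suffix split
    return analyzed


def syntactic_analysis(analyzed_words):
    # Placeholder: a real parser would build a syntactic structure here.
    return ("SYNTACTIC-STRUCTURE", analyzed_words)


def semantic_analysis(structure):
    # Placeholder for building a context-independent meaning representation.
    return ("CONTEXT-INDEPENDENT-MEANING", structure)


def discourse_processing(meaning):
    # Placeholder for resolving pronouns and other context-dependent content.
    return ("FINAL-MEANING", meaning)


STAGES = [
    ("Morphological Analysis", morphological_analysis),
    ("Syntactic Analysis", syntactic_analysis),
    ("Semantic Analysis", semantic_analysis),
    ("Discourse Processing", discourse_processing),
]


def understand(words):
    """Run every stage in order and return a list of (stage name, output) pairs."""
    trace = []
    data = words
    for name, stage in STAGES:
        data = stage(data)
        trace.append((name, data))
    return trace


for name, output in understand(["she", "cooked", "ducks"]):
    print(name, "->", output)
```

Keeping the stages as separate functions mirrors the diagram: each intermediate representation can be inspected on its own, and a stage can be replaced without touching the others.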



