SlideShare a Scribd company logo
1 of 93
ο‚— Overview
ο‚— Basic knowledge
ο‚— Demonstration
LOGO

USER : Men are all alike.
ELIZA : In what way?
USER : They’re always bugging us about something or other.
ELIZA : Can you think of a specific example?
USER : Well, my boyfriend made me come here.
ELIZA : Your boyfriend made you come here ?
USER : He says I’m depressed much of the time.
ELIZA : I am sorry to hear you are depressed.
USER : It’s true, I am unhappy.
ELIZA : Do you think coming here will help you not to be
unhappy?
USER : I need some help; that much seems certain.
ELIZA : What would it mean to you if you got some help?
USER : Perhaps I could learn to get along with my mother.
ELIZA : Tell me more about your family.
USER : My mother takes care of me.
ELIZA : Who else in your family takes care of you?
USER : My father.
ELIZA : Your father ?
USER : You are like my father in some ways.
ELIZA : What resemblance do you see?
οƒ˜ A sub-field of Artificial Intelligent, since 1960s …


οƒ˜ Concerned with the interactions between computers and
  human languages with one ultimate goal : Computers can
  β€œunderstand” human



οƒ˜ Many applications in real world
οƒ˜ Natural language unit?
   οƒ˜ Natural language understanding
   οƒ˜ Natural language generation


οƒ˜ Data?
   οƒ˜ Speech processing
   οƒ˜ Text processing


Natural language text understanding!
ο‚— Task of generating natural language from a machine
  representation
ο‚— May be viewed as the opposite of natural language
  understanding .

ο‚— Applications:
  ο‚— Jokes generation
  ο‚— Textual summaries of databases
  ο‚— Enhancing accessibility
ο‚— An advanced subtopic of NLP deals with reading
  comprehension
ο‚— More complex than NLG
ο‚— Many commercial interest in this field
  ο‚— News-gathering
  ο‚— Data-Mining
  ο‚— Voice-Activation
  ο‚— Large-scale content analysis
ο‚— Logic is too clear, the lost of flexibility cause
  difficulties in NLP

ο‚— Examples :
  ο‚— Time flies like an arrow
  Can be understood in 7 ways !!!

  ο‚— I never said she stole my money !
     οƒ˜ Someone else said it, but I didn't.
ο‚— Logic is too clear, the lost of flexibility become
  difficulties in NLP

ο‚— Examples :
  ο‚— Time flies like an arrow
  Can be understood in 7 ways !!!

  ο‚— I never said she stole my money !
     ο‚— I simply didn't ever say it
ο‚— Logic is too clear, the lost of flexibility become
  difficulties in NLP

ο‚— Examples :
  ο‚— Time flies like an arrow
  Can be understood in 7 ways !!!

  ο‚— I never said she stole my money !
     ο‚— I might have implied it in some way, but I never explicitly said it
ο‚— Logic is too clear, the lost of flexibility become
  difficulties in NLP

ο‚— Examples :
  ο‚— Time flies like an arrow
  Can be understood in 7 ways !!!

  ο‚— I never said she stole my money !
     ο‚— I said someone took it; I didn't say it was she
ο‚— Logic is too clear, the lost of flexibility become
  difficulties in NLP

ο‚— Examples:
  ο‚— Time flies like an arrow
  Can be understood in 7 ways !!!

  ο‚— I never said she stole my money !
     ο‚— I just said she probably borrowed it
ο‚— Logic is too clear, the lost of flexibility become
  difficulties in NLP

ο‚— Examples :
  ο‚— Time flies like an arrow
  Can be understood in 7 ways !!!

  ο‚— I never said she stole my money !
     ο‚— I said she stole someone else's money
ο‚— Logic is too clear, the lost of flexibility become
  difficulties in NLP

ο‚— Examples :
  ο‚— Time flies like an arrow
  Can be understood in 7 ways !!!

  ο‚— I never said she stole my money !
     ο‚— I said she stole something, but not my money
ο‚— Words combination and division
ο‚— Stress placing on words
ο‚— The properties of subjects
  ο‚— We gave the monkeys the bananas because they were
    hungry
  ο‚— We gave the monkeys the bananas because they were
    over-ripe
ο‚— Specifying which word an adjective applies to
  ο‚— A pretty little girls' school
ο‚— Involves reasoning about the world
ο‚— Embedded a social system of people interacting
  ο‚— persuading, insulting and amusing them
  ο‚— changing over time
ο‚— Homonymous
ο‚— Automatic Summarization
ο‚— Information Extraction
ο‚— Grammar Testing
ο‚— ePi Group:
  ο‚— Automatic Vietnamese processing system
  ο‚— www.baomoi.com
     ο‚— Collecting news from all Vietnamese e-newspapers

ο‚— EVTrans – Softex Co Ltd.
ο‚— Cyclop
ο‚— VnKim
ο‚— Morphological analysis :
   Individual words are analyzed into their
     components
ο‚— Syntactic analysis
   Linear sequence of words are transformed
      into structures that show how the words
      relate to each other
ο‚— Semantic analysis
   ο‚— A transformation is made from the input
     text to an internal representation that
     reflects the meaning
ο‚— Pragmatic analysis
   ο‚— To reinterpret what was said to what was
     actually meant
ο‚— Discourse analysis
   ο‚— Resolving references between sentences
Morphology

Syntax

Semantic

Pragmatic

Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
ο‚— Morphemes: smallest meaningful unit
 spoken units of language.
  ο‚— Stem: book, cat, car, …
  ο‚— Affixes : un-, -s, -es, ..               Morphology

  ο‚— Clitic: β€˜ve, β€˜m                          Syntax

                                             Semantic
ο‚— Morphological parsing: parsing a word
                                             Pragmatic
 into stem and affixes and identifying the
                                             Discourse
 parts and their relationships
ο‚— Word Classes
  ο‚— Parts of speech: noun, verb, adjectives,
    etc.
                                               Morphology
  ο‚— Word class dictates how a word combines
    with morphemes to form new words           Syntax

                                               Semantic
ο‚— Examples                                     Pragmatic
  ο‚— Books: book + s
                                               Discourse
  ο‚— Unladylike = un + lady + like
ο‚— Vietnamese?
  ο‚— Δ‚n = Δƒn
                                  Morphology
  ο‚— Uα»‘ng = uα»‘ng
  ο‚— Xe = xe                       Syntax

                                  Semantic

ο‚— No β€˜Xes’ in Vietnamese!         Pragmatic
ο‚— Problems are text tokenizing.   Discourse
ο‚— Why parse words?

                                          Morphology
  ο‚— To identify a word’s part-of-speech
  ο‚— To identify a word’s stem (IR)        Syntax

                                          Semantic

… then?                                   Pragmatic
  ο‚— Spell- checking
                                          Discourse
  ο‚— To predict next words
  ο‚— To predict the word’s accent
ο‚— Ambiguity
  ο‚— I want her to go to the cinema with me
                                             Morphology
  To - infinitive?                           Syntax

  To - preposition?                          Semantic

                                             Pragmatic
  ο‚— Con ngα»±a Δ‘Γ‘ Δ‘Γ‘ con ngα»±a Δ‘Γ‘.
                                             Discourse



    Δ‘Γ‘ = Δ‘Γ‘?
ο‚— How to implement?
  ο‚— Regular expression
  ο‚— Finite State Transducers (FST)
  ο‚— Finite State Accepter (FSA)      Morphology

                                     Syntax
  *.exe                              Semantic
  ir??man
                                     Pragmatic
  b[0-9]+ *(Mb|[Mm]egabytes?)b
                                     Discourse
ο‚— Relate terms:
  ο‚— Stem, stemming   Morphology
  ο‚— Part of speech
                     Syntax
  ο‚— N-gram
                     Semantic

                     Pragmatic

                     Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
Morphology

SYNTAX   Syntax

         Semantic

         Pragmatic

         Discourse
ο‚— Linear sequence of words are transformed into
  structures that show how the words relate to
  each other.
                                                    Morphology
ο‚— Determine grammatical structure.
                                                    Syntax

                                                    Semantic

                                                    Pragmatic

ο‚— I am a boy = [Subject] [Verb] [Cardinal] [Noun]   Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
ο‚— Syntax
  ο‚— Actual structure of a sentence
                                        Morphology

                                        Syntax
ο‚— Grammar
                                        Semantic
  ο‚— The rule set used in the analysis
                                        Pragmatic

                                        Discourse
ο‚— A grammar define syntactically legal sentences
   ο‚— I ate an apple     (syntactic legal)
   ο‚— I ate apple        (not syntactic legal)
   ο‚— I ate a building   (syntactic legal, but?)    Morphology

                                                   Syntax

   doesn’t mean that it’s meaningful!              Semantic

                                                   Pragmatic

                                                   Discourse
ο‚— Ambiguities




                Morphology

                Syntax

                Semantic

                Pragmatic

                Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
Morphology

           Syntax

SEMANTIC   Semantic

           Pragmatic

           Discourse
ο‚— What could this mean…
  ο‚— Representations of linguistic inputs that capture
    the meanings of those inputs


ο‚— For us it means                                       Morphology
  ο‚— Representations that permit or   facilitate         Syntax
    semantic processing
  ο‚— Permit us to reason   about their truth             Semantic
    (relationship to some world)
                                                        Pragmatic
  ο‚— Permit us to answer questions based on their
    content                                             Discourse
  ο‚— Permit us to perform   inference (answer
    questions and determine the truth of things we
    don’t actually know)
Morphology

Syntax

Semantic

Pragmatic

Discourse
ο‚— Requirements


  ο‚— Verifiability
  ο‚— Ambiguity
                     Morphology
  ο‚— Canonical Form
  ο‚— Inference        Syntax

  ο‚— Expressiveness
                     Semantic

                     Pragmatic

                     Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
ο‚— Pragmatics: concerns how sentences are
 used in different situations and how use
                                              Morphology
 affects the interpretation of the sentence
                                              Syntax

                                              Semantic

ο‚— Discourse: concerns how the                 Pragmatic
 immediately preceding sentences affect
                                              Discourse
 the interpretation of the next sentence
Morphology

                                           Syntax
ο‚— β€˜He’, β€˜it’, β€˜his’ can be inferred from
                                           Semantic
  previous sentence
                                           Pragmatic


ο‚— It’s   discourse                         Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
Morphology

Syntax

Semantic

Pragmatic

Discourse
ο‚— Wordnet
ο‚— Mindnet
ο‚— Stanford Tagger
ο‚— Stanford Parser
ο‚— ……..
ο‚— Machine translation
ο‚— Search engine
ο‚— Information extraction
ο‚— Chat bot
ο‚— Can we use previously translated text to learn how to
 translate new texts?
  ο‚— Yes! But, it’s not so easy
  ο‚— Two paradigms, statistical MT, and EBMT
ο‚— Requirements:
  ο‚— Aligned large parallel corpus of translated sentences
  ο‚— {S source  S target }
  ο‚— Bilingual dictionary for intra-S alignment
  ο‚— Generalization patterns (names, numbers, dates…)
ο‚— Simplest: Translation Memory
  ο‚— If S new= S source in corpus, output aligned S target


ο‚— Compositional EBMT
  ο‚— If fragment of Snew matches fragment of Ss, output
    corresponding fragment of aligned St
  ο‚— Prefer maximal-length fragments
  ο‚— Maximize grammatical compositionality
     ο‚— Via a target language grammar
     ο‚— Or, via an N-gram statistical language model
ο‚— Requires an Interlingua - language-neutral Knowledge
  Representation (KR)
ο‚— Philosophical debate: Is there an interlingua?
  ο‚— FOL is not totally language neutral (predicates,
    functions, expressed in a language)
  ο‚— Other near-interlinguas (Conceptual Dependency)
ο‚— Requires a fully-disambiguating parser
  ο‚— Domain model of legal objects, actions, relations
ο‚— Requires a NL generator (KR -> text)
ο‚— Applicable only to well-defined technical domains
ο‚— Produces high-quality MT in those domains
ο‚— Intelingua-based MT
ο‚— Rule-based MT
ο‚— Each approach has its own strength


  ο‚— Rapidly adaptable: statistical, example-based
  ο‚— Good grammar: rule-based (grammar)
  ο‚— High precision in narrow domain: Intelingua
ο‚— Google
ο‚— Yahoo
ο‚— Alta-vista
ο‚— Answer.com
ο‚— Spider - a browser-like program that downloads web pages.
ο‚— Crawler – a program that automatically follows all of the
    links on each web page.
ο‚—   Indexer - a program that analyzes web pages downloaded
    by the spider and the crawler.
ο‚—   Database– storage for downloaded and processed pages.
ο‚—   Results engine – extracts search results from the database.
ο‚—    Web server – a server that is responsible for interaction
    between the user and other search engine components.
ο‚—   Spider - a browser-like program that downloads web pages.
ο‚—   Crawler – a program that automatically follows all of the
    links on each web page.
ο‚—   Indexer - a program that analyzes web pages downloaded
    by the spider and the crawler.
ο‚—   Database– storage for downloaded and processed pages.
ο‚—   Results engine – extracts search results from the database.
ο‚—    Web server – a server that is responsible for interaction
    between the user and other search engine components.
ο‚— Idea is to β€˜extract’ particular types of information from
  arbitrary text or transcribed speech

ο‚— Examples:
  ο‚— Names entities: people, places, organization
  ο‚— Telephone numbers
  ο‚— Dates
ο‚— Many uses:
  ο‚— Question answering systems, fisting of news or mail…
  ο‚— Job ads, financial information, terrorist attacks
ο‚— Often use a set of simple templates or frames with slots
 to be filled in from input text. Ignore everything else.
  ο‚— Husni’s number is 966-3-860-2624.
  ο‚— The inventor of the First plane was Abbas ibnu Fernas
  ο‚— The British King died in March of 1932.
ο‚— Named Entity recognition (NE)
  ο‚— Finds and classifies names, places etc.
ο‚— Co-reference Resolution (CO)
  ο‚— Identifies identity relations between entities in texts.
ο‚— Template Element construction (TE)
  ο‚— Adds descriptive information to NE results (using CO).
ο‚— Template Relation construction (TR)
  ο‚— Finds relations between TE entities. Scenario
ο‚— Template production (ST)
  ο‚— Fits TE and TR results into specified event scenarios.
ο‚— AIML = Artificial Intelligent Mark-up Language
ο‚— Alice
ο‚— A.L.I.C.E. (Artificial Linguistic Internet Computer
 Entity)
  ο‚— an award-winning free natural language artificial
    intelligence chat robot.


ο‚— Ruled-base
ο‚— Human-like answer without complicated β€œbrain”
ο‚— Multi-language
ο‚— NLP’s course , Husni Al-Muhtaseb
ο‚— Lexical descriptions for Vietnamese language
  processing .
ο‚— en.wikipedia.org
ο‚— www.xulyngonngu.com
Natural language processing 2

More Related Content

What's hot

Group presentation lexical semantics
Group presentation lexical semanticsGroup presentation lexical semantics
Group presentation lexical semanticsblessedkkr
Β 
Langacker's cognitive grammar
Langacker's cognitive grammarLangacker's cognitive grammar
Langacker's cognitive grammarJOy Verzosa
Β 
MORPHOLOGICAL SEGMENTATION WITH LSTM NEURAL NETWORKS FOR TIGRINYA
MORPHOLOGICAL SEGMENTATION WITH LSTM NEURAL NETWORKS FOR TIGRINYAMORPHOLOGICAL SEGMENTATION WITH LSTM NEURAL NETWORKS FOR TIGRINYA
MORPHOLOGICAL SEGMENTATION WITH LSTM NEURAL NETWORKS FOR TIGRINYAijnlc
Β 
Minimalist program
Minimalist programMinimalist program
Minimalist programAmusan Kayode
Β 
Translation
TranslationTranslation
Translationmjkay
Β 
Feature Structure Unification Syntactic Parser 2.0
Feature Structure Unification Syntactic Parser 2.0Feature Structure Unification Syntactic Parser 2.0
Feature Structure Unification Syntactic Parser 2.0rcaneba
Β 
5a use of annotated corpus
5a use of annotated corpus5a use of annotated corpus
5a use of annotated corpusThennarasuSakkan
Β 
Semantics
SemanticsSemantics
SemanticsAef Tony
Β 
A Constructive Mathematics approach for NL formal grammars
A Constructive Mathematics approach for NL formal grammarsA Constructive Mathematics approach for NL formal grammars
A Constructive Mathematics approach for NL formal grammarsFederico Gobbo
Β 
Unit 1 Semantics
Unit 1 SemanticsUnit 1 Semantics
Unit 1 Semanticsmjgvalcarce
Β 
Natural language-processing
Natural language-processingNatural language-processing
Natural language-processingHareem Naz
Β 
Constructive Hybrid Logics
Constructive Hybrid LogicsConstructive Hybrid Logics
Constructive Hybrid LogicsValeria de Paiva
Β 
Constructive Description Logics 2006
Constructive Description Logics 2006Constructive Description Logics 2006
Constructive Description Logics 2006Valeria de Paiva
Β 
Narrative
NarrativeNarrative
Narrativeirbaz khan
Β 
Prosodic Morphology
Prosodic Morphology Prosodic Morphology
Prosodic Morphology Maroua Harrif
Β 
Text : Definition, Elaboration and Examples
Text : Definition, Elaboration and ExamplesText : Definition, Elaboration and Examples
Text : Definition, Elaboration and ExamplesAlaahussein81
Β 
Minimalist program
Minimalist programMinimalist program
Minimalist programRabbiaAzam
Β 

What's hot (19)

Group presentation lexical semantics
Group presentation lexical semanticsGroup presentation lexical semantics
Group presentation lexical semantics
Β 
Langacker's cognitive grammar
Langacker's cognitive grammarLangacker's cognitive grammar
Langacker's cognitive grammar
Β 
MORPHOLOGICAL SEGMENTATION WITH LSTM NEURAL NETWORKS FOR TIGRINYA
MORPHOLOGICAL SEGMENTATION WITH LSTM NEURAL NETWORKS FOR TIGRINYAMORPHOLOGICAL SEGMENTATION WITH LSTM NEURAL NETWORKS FOR TIGRINYA
MORPHOLOGICAL SEGMENTATION WITH LSTM NEURAL NETWORKS FOR TIGRINYA
Β 
Minimalist program
Minimalist programMinimalist program
Minimalist program
Β 
ACTIVIDAD 7
ACTIVIDAD 7ACTIVIDAD 7
ACTIVIDAD 7
Β 
Translation
TranslationTranslation
Translation
Β 
Presentation1
Presentation1Presentation1
Presentation1
Β 
Feature Structure Unification Syntactic Parser 2.0
Feature Structure Unification Syntactic Parser 2.0Feature Structure Unification Syntactic Parser 2.0
Feature Structure Unification Syntactic Parser 2.0
Β 
5a use of annotated corpus
5a use of annotated corpus5a use of annotated corpus
5a use of annotated corpus
Β 
Semantics
SemanticsSemantics
Semantics
Β 
A Constructive Mathematics approach for NL formal grammars
A Constructive Mathematics approach for NL formal grammarsA Constructive Mathematics approach for NL formal grammars
A Constructive Mathematics approach for NL formal grammars
Β 
Unit 1 Semantics
Unit 1 SemanticsUnit 1 Semantics
Unit 1 Semantics
Β 
Natural language-processing
Natural language-processingNatural language-processing
Natural language-processing
Β 
Constructive Hybrid Logics
Constructive Hybrid LogicsConstructive Hybrid Logics
Constructive Hybrid Logics
Β 
Constructive Description Logics 2006
Constructive Description Logics 2006Constructive Description Logics 2006
Constructive Description Logics 2006
Β 
Narrative
NarrativeNarrative
Narrative
Β 
Prosodic Morphology
Prosodic Morphology Prosodic Morphology
Prosodic Morphology
Β 
Text : Definition, Elaboration and Examples
Text : Definition, Elaboration and ExamplesText : Definition, Elaboration and Examples
Text : Definition, Elaboration and Examples
Β 
Minimalist program
Minimalist programMinimalist program
Minimalist program
Β 

Viewers also liked

NLP and its applications
NLP and its applicationsNLP and its applications
NLP and its applicationsUtphala P
Β 
Natural Language Processing: Definition and Application
Natural Language Processing: Definition and ApplicationNatural Language Processing: Definition and Application
Natural Language Processing: Definition and ApplicationStephen Shellman
Β 
Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translationHrishikesh Nair
Β 
Jeeves -natural language interface application
Jeeves -natural language interface applicationJeeves -natural language interface application
Jeeves -natural language interface applicationKaran Harsh Wardhan
Β 
Startupfest 2015: HARPER REED (Modest, Inc.) - Lightning Keynote
Startupfest 2015: HARPER REED (Modest, Inc.) - Lightning KeynoteStartupfest 2015: HARPER REED (Modest, Inc.) - Lightning Keynote
Startupfest 2015: HARPER REED (Modest, Inc.) - Lightning KeynoteStartupfest
Β 
Statistical machine translation in a few slides
Statistical machine translation in a few slidesStatistical machine translation in a few slides
Statistical machine translation in a few slidesForcada Mikel
Β 
Natural language procesing in R
Natural language procesing in RNatural language procesing in R
Natural language procesing in ROlabanji Shonibare
Β 
Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approachvini89
Β 
Gordana Panajotović - NLP Master
Gordana Panajotović - NLP MasterGordana Panajotović - NLP Master
Gordana Panajotović - NLP MasterNLP Centar Beograd
Β 
Text Mining Infrastructure in R
Text Mining Infrastructure in RText Mining Infrastructure in R
Text Mining Infrastructure in RAshraf Uddin
Β 
Introduction to nlp 2014
Introduction to nlp 2014Introduction to nlp 2014
Introduction to nlp 2014Grant Hamel
Β 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translationRushdi Shams
Β 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlBen Healey
Β 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introductionRobert Lujo
Β 
Practical Natural Language Processing
Practical Natural Language ProcessingPractical Natural Language Processing
Practical Natural Language ProcessingJaganadh Gopinadhan
Β 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processingrohitnayak
Β 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rVivian S. Zhang
Β 
Natural language processing
Natural language processingNatural language processing
Natural language processingYogendra Tamang
Β 
Natural Language Processing in R (rNLP)
Natural Language Processing in R (rNLP)Natural Language Processing in R (rNLP)
Natural Language Processing in R (rNLP)fridolin.wild
Β 

Viewers also liked (20)

NLP and its applications
NLP and its applicationsNLP and its applications
NLP and its applications
Β 
Natural Language Processing: Definition and Application
Natural Language Processing: Definition and ApplicationNatural Language Processing: Definition and Application
Natural Language Processing: Definition and Application
Β 
Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translation
Β 
Jeeves -natural language interface application
Jeeves -natural language interface applicationJeeves -natural language interface application
Jeeves -natural language interface application
Β 
Startupfest 2015: HARPER REED (Modest, Inc.) - Lightning Keynote
Startupfest 2015: HARPER REED (Modest, Inc.) - Lightning KeynoteStartupfest 2015: HARPER REED (Modest, Inc.) - Lightning Keynote
Startupfest 2015: HARPER REED (Modest, Inc.) - Lightning Keynote
Β 
Statistical machine translation in a few slides
Statistical machine translation in a few slidesStatistical machine translation in a few slides
Statistical machine translation in a few slides
Β 
Natural language procesing in R
Natural language procesing in RNatural language procesing in R
Natural language procesing in R
Β 
Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approach
Β 
Intro to nlp
Intro to nlpIntro to nlp
Intro to nlp
Β 
Gordana Panajotović - NLP Master
Gordana Panajotović - NLP MasterGordana Panajotović - NLP Master
Gordana Panajotović - NLP Master
Β 
Text Mining Infrastructure in R
Text Mining Infrastructure in RText Mining Infrastructure in R
Text Mining Infrastructure in R
Β 
Introduction to nlp 2014
Introduction to nlp 2014Introduction to nlp 2014
Introduction to nlp 2014
Β 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
Β 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco Control
Β 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introduction
Β 
Practical Natural Language Processing
Practical Natural Language ProcessingPractical Natural Language Processing
Practical Natural Language Processing
Β 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
Β 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with r
Β 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Β 
Natural Language Processing in R (rNLP)
Natural Language Processing in R (rNLP)Natural Language Processing in R (rNLP)
Natural Language Processing in R (rNLP)
Β 

Similar to Natural language processing 2

Semantics
SemanticsSemantics
SemanticsgandesAM
Β 
Mental grammar
Mental grammarMental grammar
Mental grammarRona Andres
Β 
Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaningfawzia
Β 
Structural grammar iii
Structural grammar iiiStructural grammar iii
Structural grammar iiiflakcute
Β 
Syntactic Features in Mother Tongue.pptx
Syntactic Features in Mother Tongue.pptxSyntactic Features in Mother Tongue.pptx
Syntactic Features in Mother Tongue.pptxJamelMirafuentes
Β 
Understanding ASL Grammatical Features and Discourse Mapping
Understanding ASL Grammatical Features and Discourse MappingUnderstanding ASL Grammatical Features and Discourse Mapping
Understanding ASL Grammatical Features and Discourse MappingDoug Stringham
Β 
Language in cognitive psychology
Language in cognitive psychologyLanguage in cognitive psychology
Language in cognitive psychologyAli Bahrani
Β 
Nlp Sentemental analysis of Tweetr And CaseStudy
Nlp Sentemental analysis of Tweetr And CaseStudyNlp Sentemental analysis of Tweetr And CaseStudy
Nlp Sentemental analysis of Tweetr And CaseStudyRaza Azeem
Β 
05 linguistic theory meets lexicography
05 linguistic theory meets lexicography05 linguistic theory meets lexicography
05 linguistic theory meets lexicographyDuygu Aşıklar
Β 
Syntax
SyntaxSyntax
Syntaxamirasoul
Β 
Setswana Tokenisation and Computational Verb Morphology: Facing the Challenge...
Setswana Tokenisation and Computational Verb Morphology: Facing the Challenge...Setswana Tokenisation and Computational Verb Morphology: Facing the Challenge...
Setswana Tokenisation and Computational Verb Morphology: Facing the Challenge...Guy De Pauw
Β 
Grammar Presentation
Grammar PresentationGrammar Presentation
Grammar Presentationtickingmindpd
Β 
Assignment on morphology
Assignment on morphologyAssignment on morphology
Assignment on morphologyLinda Midy
Β 
What English Do University Students Really Need
What English Do University Students Really NeedWhat English Do University Students Really Need
What English Do University Students Really NeedHala Nur
Β 

Similar to Natural language processing 2 (20)

NLP
NLPNLP
NLP
Β 
Semantics
SemanticsSemantics
Semantics
Β 
Mental grammar
Mental grammarMental grammar
Mental grammar
Β 
Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaning
Β 
Structural grammar iii
Structural grammar iiiStructural grammar iii
Structural grammar iii
Β 
Syntactic Features in Mother Tongue.pptx
Syntactic Features in Mother Tongue.pptxSyntactic Features in Mother Tongue.pptx
Syntactic Features in Mother Tongue.pptx
Β 
Understanding ASL Grammatical Features and Discourse Mapping
Understanding ASL Grammatical Features and Discourse MappingUnderstanding ASL Grammatical Features and Discourse Mapping
Understanding ASL Grammatical Features and Discourse Mapping
Β 
Language in cognitive psychology
Language in cognitive psychologyLanguage in cognitive psychology
Language in cognitive psychology
Β 
Nlp ambiguity presentation
Nlp ambiguity presentationNlp ambiguity presentation
Nlp ambiguity presentation
Β 
Nlp
NlpNlp
Nlp
Β 
Nlp Sentemental analysis of Tweetr And CaseStudy
Nlp Sentemental analysis of Tweetr And CaseStudyNlp Sentemental analysis of Tweetr And CaseStudy
Nlp Sentemental analysis of Tweetr And CaseStudy
Β 
Semantics
SemanticsSemantics
Semantics
Β 
05 linguistic theory meets lexicography
05 linguistic theory meets lexicography05 linguistic theory meets lexicography
05 linguistic theory meets lexicography
Β 
Syntax
SyntaxSyntax
Syntax
Β 
Setswana Tokenisation and Computational Verb Morphology: Facing the Challenge...
Setswana Tokenisation and Computational Verb Morphology: Facing the Challenge...Setswana Tokenisation and Computational Verb Morphology: Facing the Challenge...
Setswana Tokenisation and Computational Verb Morphology: Facing the Challenge...
Β 
Grammar Presentation
Grammar PresentationGrammar Presentation
Grammar Presentation
Β 
Assignment on morphology
Assignment on morphologyAssignment on morphology
Assignment on morphology
Β 
What English Do University Students Really Need
What English Do University Students Really NeedWhat English Do University Students Really Need
What English Do University Students Really Need
Β 
Grammar 4
Grammar 4Grammar 4
Grammar 4
Β 
Grammar 4
Grammar 4Grammar 4
Grammar 4
Β 

Recently uploaded

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
Β 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
Β 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
Β 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
Β 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
Β 
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | DelhiFULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhisoniya singh
Β 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
Β 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
Β 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
Β 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
Β 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
Β 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
Β 
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
Β 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
Β 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
Β 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
Β 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
Β 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
Β 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
Β 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
Β 

Recently uploaded (20)

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
Β 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Β 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Β 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
Β 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Β 
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | DelhiFULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
Β 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
Β 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Β 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
Β 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Β 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
Β 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
Β 
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 βœ“Call Girls In Kalyan ( Mumbai ) secure service
Β 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
Β 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
Β 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
Β 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Β 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
Β 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
Β 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Β 

Natural language processing 2

  • 1.
  • 2. ο‚— Overview ο‚— Basic knowledge ο‚— Demonstration
  • 3. LOGO USER : Men are all alike. ELIZA : In what way? USER : They’re always bugging us about something or other. ELIZA : Can you think of a specific example? USER : Well, my boyfriend made me come here. ELIZA : Your boyfriend made you come here ? USER : He says I’m depressed much of the time. ELIZA : I am sorry to hear you are depressed. USER : It’s true, I am unhappy. ELIZA : Do you think coming here will help you not to be unhappy? USER : I need some help; that much seems certain. ELIZA : What would it mean to you if you got some help? USER : Perhaps I could learn to get along with my mother. ELIZA : Tell me more about your family. USER : My mother takes care of me. ELIZA : Who else in your family takes care of you? USER : My father. ELIZA : Your father ? USER : You are like my father in some ways. ELIZA : What resemblance do you see?
  • 4. οƒ˜ A sub-field of Artificial Intelligent, since 1960s … οƒ˜ Concerned with the interactions between computers and human languages with one ultimate goal : Computers can β€œunderstand” human οƒ˜ Many applications in real world
  • 5. οƒ˜ Natural language unit? οƒ˜ Natural language understanding οƒ˜ Natural language generation οƒ˜ Data? οƒ˜ Speech processing οƒ˜ Text processing Natural language text understanding!
  • 6. ο‚— Task of generating natural language from a machine representation ο‚— May be viewed as the opposite of natural language understanding . ο‚— Applications: ο‚— Jokes generation ο‚— Textual summaries of databases ο‚— Enhancing accessibility
  • 7. ο‚— An advanced subtopic of NLP deals with reading comprehension ο‚— More complex than NLG ο‚— Many commercial interest in this field ο‚— News-gathering ο‚— Data-Mining ο‚— Voice-Activation ο‚— Large-scale content analysis
  • 8. ο‚— Logic is too clear, the lost of flexibility cause difficulties in NLP ο‚— Examples : ο‚— Time flies like an arrow Can be understood in 7 ways !!! ο‚— I never said she stole my money ! οƒ˜ Someone else said it, but I didn't.
  • 9. ο‚— Logic is too clear, the lost of flexibility become difficulties in NLP ο‚— Examples : ο‚— Time flies like an arrow Can be understood in 7 ways !!! ο‚— I never said she stole my money ! ο‚— I simply didn't ever say it
  • 10. ο‚— Logic is too clear, the lost of flexibility become difficulties in NLP ο‚— Examples : ο‚— Time flies like an arrow Can be understood in 7 ways !!! ο‚— I never said she stole my money ! ο‚— I might have implied it in some way, but I never explicitly said it
  • 11. ο‚— Logic is too clear, the lost of flexibility become difficulties in NLP ο‚— Examples : ο‚— Time flies like an arrow Can be understood in 7 ways !!! ο‚— I never said she stole my money ! ο‚— I said someone took it; I didn't say it was she
  • 12. ο‚— Logic is too clear, the lost of flexibility become difficulties in NLP ο‚— Examples: ο‚— Time flies like an arrow Can be understood in 7 ways !!! ο‚— I never said she stole my money ! ο‚— I just said she probably borrowed it
  • 13. ο‚— Logic is too clear, the lost of flexibility become difficulties in NLP ο‚— Examples : ο‚— Time flies like an arrow Can be understood in 7 ways !!! ο‚— I never said she stole my money ! ο‚— I said she stole someone else's money
  • 14. ο‚— Logic is too clear, the lost of flexibility become difficulties in NLP ο‚— Examples : ο‚— Time flies like an arrow Can be understood in 7 ways !!! ο‚— I never said she stole my money ! ο‚— I said she stole something, but not my money
  • 15. ο‚— Words combination and division ο‚— Stress placing on words ο‚— The properties of subjects ο‚— We gave the monkeys the bananas because they were hungry ο‚— We gave the monkeys the bananas because they were over-ripe ο‚— Specifying which word an adjective applies to ο‚— A pretty little girls' school
  • 16. ο‚— Involves reasoning about the world ο‚— Embedded a social system of people interacting ο‚— persuading, insulting and amusing them ο‚— changing over time ο‚— Homonymous
  • 17.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28. ο‚— ePi Group: ο‚— Automatic Vietnamese processing system ο‚— www.baomoi.com ο‚— Collecting news from all Vietnamese e-newspapers ο‚— EVTrans – Softex Co Ltd. ο‚— Cyclop ο‚— VnKim
  • 29.
  • 30.
  • 31.
  • 32.
  • 33. ο‚— Morphological analysis : Individual words are analyzed into their components ο‚— Syntactic analysis Linear sequence of words are transformed into structures that show how the words relate to each other ο‚— Semantic analysis ο‚— A transformation is made from the input text to an internal representation that reflects the meaning ο‚— Pragmatic analysis ο‚— To reinterpret what was said to what was actually meant ο‚— Discourse analysis ο‚— Resolving references between sentences
  • 36. ο‚— Morphemes: smallest meaningful unit spoken units of language. ο‚— Stem: book, cat, car, … ο‚— Affixes : un-, -s, -es, .. Morphology ο‚— Clitic: β€˜ve, β€˜m Syntax Semantic ο‚— Morphological parsing: parsing a word Pragmatic into stem and affixes and identifying the Discourse parts and their relationships
  • 37. ο‚— Word Classes ο‚— Parts of speech: noun, verb, adjectives, etc. Morphology ο‚— Word class dictates how a word combines with morphemes to form new words Syntax Semantic ο‚— Examples Pragmatic ο‚— Books: book + s Discourse ο‚— Unladylike = un + lady + like
  • 38. ο‚— Vietnamese? ο‚— Δ‚n = Δƒn Morphology ο‚— Uα»‘ng = uα»‘ng ο‚— Xe = xe Syntax Semantic ο‚— No β€˜Xes’ in Vietnamese! Pragmatic ο‚— Problems are text tokenizing. Discourse
  • 39. ο‚— Why parse words? Morphology ο‚— To identify a word’s part-of-speech ο‚— To identify a word’s stem (IR) Syntax Semantic … then? Pragmatic ο‚— Spell- checking Discourse ο‚— To predict next words ο‚— To predict the word’s accent
  • 40. ο‚— Ambiguity ο‚— I want her to go to the cinema with me Morphology To - infinitive? Syntax To - preposition? Semantic Pragmatic ο‚— Con ngα»±a Δ‘Γ‘ Δ‘Γ‘ con ngα»±a Δ‘Γ‘. Discourse Δ‘Γ‘ = Δ‘Γ‘?
  • 41. ο‚— How to implement? ο‚— Regular expression ο‚— Finite State Transducers (FST) ο‚— Finite State Accepter (FSA) Morphology Syntax *.exe Semantic ir??man Pragmatic b[0-9]+ *(Mb|[Mm]egabytes?)b Discourse
  • 42.
  • 43. ο‚— Relate terms: ο‚— Stem, stemming Morphology ο‚— Part of speech Syntax ο‚— N-gram Semantic Pragmatic Discourse
  • 45. Morphology SYNTAX Syntax Semantic Pragmatic Discourse
  • 46. ο‚— Linear sequence of words are transformed into structures that show how the words relate to each other. Morphology ο‚— Determine grammatical structure. Syntax Semantic Pragmatic ο‚— I am a boy = [Subject] [Verb] [Cardinal] [Noun] Discourse
  • 48. ο‚— Syntax ο‚— Actual structure of a sentence Morphology Syntax ο‚— Grammar Semantic ο‚— The rule set used in the analysis Pragmatic Discourse
  • 49. ο‚— A grammar define syntactically legal sentences ο‚— I ate an apple (syntactic legal) ο‚— I ate apple (not syntactic legal) ο‚— I ate a building (syntactic legal, but?) Morphology Syntax doesn’t mean that it’s meaningful! Semantic Pragmatic Discourse
  • 50. ο‚— Ambiguities Morphology Syntax Semantic Pragmatic Discourse
  • 52. Morphology Syntax SEMANTIC Semantic Pragmatic Discourse
  • 53. ο‚— What could this mean… ο‚— Representations of linguistic inputs that capture the meanings of those inputs ο‚— For us it means Morphology ο‚— Representations that permit or facilitate Syntax semantic processing ο‚— Permit us to reason about their truth Semantic (relationship to some world) Pragmatic ο‚— Permit us to answer questions based on their content Discourse ο‚— Permit us to perform inference (answer questions and determine the truth of things we don’t actually know)
  • 55. ο‚— Requirements ο‚— Verifiability ο‚— Ambiguity Morphology ο‚— Canonical Form ο‚— Inference Syntax ο‚— Expressiveness Semantic Pragmatic Discourse
  • 57. ο‚— Pragmatics: concerns how sentences are used in different situations and how use Morphology affects the interpretation of the sentence Syntax Semantic ο‚— Discourse: concerns how the Pragmatic immediately preceding sentences affect Discourse the interpretation of the next sentence
  • 58. Morphology Syntax ο‚— β€˜He’, β€˜it’, β€˜his’ can be inferred from Semantic previous sentence Pragmatic ο‚— It’s discourse Discourse
  • 64. ο‚— Wordnet ο‚— Mindnet ο‚— Stanford Tagger ο‚— Stanford Parser ο‚— ……..
  • 65. ο‚— Machine translation ο‚— Search engine ο‚— Information extraction ο‚— Chat bot
  • 66.
  • 67.
  • 68.
  • 69. ο‚— Can we use previously translated text to learn how to translate new texts? ο‚— Yes! But, it’s not so easy ο‚— Two paradigms, statistical MT, and EBMT ο‚— Requirements: ο‚— Aligned large parallel corpus of translated sentences ο‚— {S source  S target } ο‚— Bilingual dictionary for intra-S alignment ο‚— Generalization patterns (names, numbers, dates…)
  • 70. ο‚— Simplest: Translation Memory ο‚— If S new= S source in corpus, output aligned S target ο‚— Compositional EBMT ο‚— If fragment of Snew matches fragment of Ss, output corresponding fragment of aligned St ο‚— Prefer maximal-length fragments ο‚— Maximize grammatical compositionality ο‚— Via a target language grammar ο‚— Or, via an N-gram statistical language model
  • 71. ο‚— Requires an Interlingua - language-neutral Knowledge Representation (KR) ο‚— Philosophical debate: Is there an interlingua? ο‚— FOL is not totally language neutral (predicates, functions, expressed in a language) ο‚— Other near-interlinguas (Conceptual Dependency) ο‚— Requires a fully-disambiguating parser ο‚— Domain model of legal objects, actions, relations ο‚— Requires a NL generator (KR -> text) ο‚— Applicable only to well-defined technical domains ο‚— Produces high-quality MT in those domains
  • 73. ο‚— Each approach has its own strength ο‚— Rapidly adaptable: statistical, example-based ο‚— Good grammar: rule-based (grammar) ο‚— High precision in narrow domain: Intelingua
  • 74. ο‚— Google ο‚— Yahoo ο‚— Alta-vista ο‚— Answer.com
  • 75. ο‚— Spider - a browser-like program that downloads web pages. ο‚— Crawler – a program that automatically follows all of the links on each web page. ο‚— Indexer - a program that analyzes web pages downloaded by the spider and the crawler. ο‚— Database– storage for downloaded and processed pages. ο‚— Results engine – extracts search results from the database. ο‚— Web server – a server that is responsible for interaction between the user and other search engine components.
  • 76. ο‚— Spider - a browser-like program that downloads web pages. ο‚— Crawler – a program that automatically follows all of the links on each web page. ο‚— Indexer - a program that analyzes web pages downloaded by the spider and the crawler. ο‚— Database– storage for downloaded and processed pages. ο‚— Results engine – extracts search results from the database. ο‚— Web server – a server that is responsible for interaction between the user and other search engine components.
  • 77.
  • 78.
  • 79.
  • 80. ο‚— Idea is to β€˜extract’ particular types of information from arbitrary text or transcribed speech ο‚— Examples: ο‚— Names entities: people, places, organization ο‚— Telephone numbers ο‚— Dates ο‚— Many uses: ο‚— Question answering systems, fisting of news or mail… ο‚— Job ads, financial information, terrorist attacks
  • 81. ο‚— Often use a set of simple templates or frames with slots to be filled in from input text. Ignore everything else. ο‚— Husni’s number is 966-3-860-2624. ο‚— The inventor of the First plane was Abbas ibnu Fernas ο‚— The British King died in March of 1932.
  • 82. ο‚— Named Entity recognition (NE) ο‚— Finds and classifies names, places etc. ο‚— Co-reference Resolution (CO) ο‚— Identifies identity relations between entities in texts. ο‚— Template Element construction (TE) ο‚— Adds descriptive information to NE results (using CO). ο‚— Template Relation construction (TR) ο‚— Finds relations between TE entities. Scenario ο‚— Template production (ST) ο‚— Fits TE and TR results into specified event scenarios.
  • 83.
  • 84.
  • 85.
  • 86.
  • 87.
  • 88.
  • 89. ο‚— AIML = Artificial Intelligent Mark-up Language ο‚— Alice
  • 90. ο‚— A.L.I.C.E. (Artificial Linguistic Internet Computer Entity) ο‚— an award-winning free natural language artificial intelligence chat robot. ο‚— Ruled-base ο‚— Human-like answer without complicated β€œbrain” ο‚— Multi-language
  • 91.
  • 92. ο‚— NLP’s course , Husni Al-Muhtaseb ο‚— Lexical descriptions for Vietnamese language processing . ο‚— en.wikipedia.org ο‚— www.xulyngonngu.com