SlideShare a Scribd company logo
1 of 19
Syntax: Structural Descriptions of Sentences
Why Study Syntax?
Syntax provides
• systematic rules for forming new sentences in a language.
• can be used to verify if a sentence is legitimate in a language.
• a step closer to the “meaning” of a sentence.
– Who did what to whom semantics
Applications
• Improving precision in search applications
– Yankees beat red sox
– Red sox beat yankees
• Paraphrasing
– John loves Mary = Mary is loved by John
• Information Extraction
– Fill in a form by extracting information from a document.
Structure of Words
What are words?
• Orthographic tokens separated by white space.
In some languages the distinction between words and
sentences is less clear.
• Chinese, Japanese: no white space between words
– nowhitespace  no white space/no whites pace/now hit esp ace
• Turkish: words could represent a complete “sentence”
– Eg: uygarlastiramadiklarimizdanmissinizcasina
Morphology: the structure of words
• Basic elements: morphemes
• Morphological Rules: how to combine morphemes.
Syntax: the structure of sentences
• Rules for ordering words in a sentence
• Elementary units: Phrasal and Clauses
Morphology and Syntax
Interplay between syntax and morphology
• How much information does a language allow to be packed in a
word, and how easy is it to unpack.
• More information  less rigid syntax  more free word order
• Hindi: “John likes Mary” – all six orders are possible, due to rich
morphological information.
– John-nom Mary-acc likes
English expresses relations between words through word order.
Morphologically rich languages have freer word order.
• However, some parts have rigid word order.
– Noun groups in Hindi: “one yellow book”
Outline
Constituency
• How does this notion arise?
• Type of constituents
• Representation: Tree Structure
Formal device: Context Free Grammars
• Derived tree and derivation tree
• Grammar Equivalence
– Strong and weak generative capacity
– Chomsky Normal Form
• Other Formal Frameworks (Tree-Adjoining Grammar)
Other topics in syntax
• Dependency
• Spoken language syntax
• Structural Priming
Constituency
Words are grouped into part-of-speech groups
• Similar morphological inflections
• Allows us to create new word forms (“blog”, “xerox”)
• Nouns, Verbs, Determiners, Adjectives etc…
Certain sequences of words in a sentence are grouped as
constituents
• Distributionally similar behavior
• cohesive units (move around in a sentence as a unit)
– In the morning I take a walk
– I take a walk in the morning
• Substrings are typed “Clause”, “Noun Phrase”, “Verb Phrase”
“Preposition Phrase” etc.
Constituency – contd.
Examples of constituents:
• Noun phrase:
– the dog, two big light blue vans
• Preposition phrase:
– in the box, under the bridge
• Clause:
– the dog bit the man, John thought the dog bit the man
The type of a constituent is derived from the “head word” of
the constituent.
Constituent Structure
Decomposition of a sentence into its constituents.
Attaching constituents to each other to reflect relations among words:
Emergence of Tree Structure
• John saw the man with the telescope
• (S (NP John) saw (NP (NP the man) (PP with (NP the telescope))))
• (S (NP John) saw (NP the man) (PP with (NP the telescope))))
Select a sentence from a newspaper text and provide its constituent
structure.
Evidence of another constituent – verb phrase (“VP”)
• Substring involving a verb move around and can be referred to as a unit.
– VP-fronting (and quickly clean the carpet he did! )
– VP-ellipsis (He cleaned the carpets quickly, and so did she )
– Can have adjuncts before and after VP, but not in VP (He often eats beans, *he
eats often beans )
Relations among Words
Types of relations between words
• Arguments: subject, object, indirect object, prepositional object
• Adjuncts: temporal, locative, causal, manner, …
• Function Words
Subcategorization: List of arguments of a word (verb)
• with features about realization (POS, perhaps case, verb form etc)
For English, the argument order: Subject-Object-IndirectObj
Example:
• like: NP-NP (“John likes Mary”), NP-VP(to-inf) (John likes to watch movies)
• think: NP-S (“John thought Mary was going to the party”)
• put: NP-NP-PP
Adjuncts are optional (typically modifiers of an action)
• John put the book on the table at 3pm yesterday
There are words with “demands” and words that fill the “demands”.
• Demands are typed (NP, VP, PP, S)
English Syntax: A Sample
Sentence types:
• Declarative (John closed the door)
• Imperative (close the door!!)
• Yes-No-Question (can you close the door?)
• Wh-question (who closed the door? What did John close?)
Clause types:
• Infinitival (to read a book)
• Gerundive (reading of a book)
• Relative Clause (that has a green cover)
English Syntax: A Sample – contd.
Noun Phrase:
• Before the head noun:
– Pre-determiner Determiner Post-determiner (Adjective|Noun) Noun
• After the head noun (Modifiers)
– Preposition phrases
– Relative Clauses (the book that has only one sentence)
– Gerundive (the flight arriving after 10pm)
Auxiliary Verbs
• Modal (could, might, will, should…) < perfect (have) < progressive (be) <
passive (be)
• “might have been destroyed”
Large wide-coverage grammars have been developed/under
development
• XTAG (www.cis.upenn.edu/~xtag), HPSG, LFG
Two Representations of Syntactic Structure
Phrase structure: illustrates the constituents and its type.
Dependency structure: Relations between words without
intervening structure.
reads
boy book
the a
boy
the
reads
book
a
DetP
NP NP
DetP
S
Adv
slowly
slowly
adj
arg0
arg1
fw
fw
Context Free-Grammars
String Rewriting Systems
• Transform one string to another (until termination)
G=(V,T,P,S)
where V: vocabulary of non-terminals
T: vocabulary of terminals
S: start symbol
P: set of productions of the form
a  b where a  V and b  (V U T)*
Derivation: Rewrite a non-terminal with the production of the grammar until
no non-terminals exist in the string.
• Start with “S”
Sample Context-Free Grammar, derivation and derived structure.
Two Representations
String rewriting system: we derive a string (=derived structure)
But derivation history represented by phrase-structure tree
(=derivation structure)!
Grammar Equivalence
• Can have different grammars that generate same set of strings (weak
equivalence)
• Can have different grammars that have same set of derivation trees (strong
equivalence)
• Strong equivalence implies weak equivalence
CFG Normal Forms:
• Chomsky Normal Form (a  b g)
• Griebarch Normal Form (a  w b)
• Convert a grammar into CNF and GNF
Penn Treebank (PTB)
Syntactically annotated corpus (phrase structure)
Contains 1 miilion words of Wall Street Journal sentences marked
up with syntactic structure.
• Can be converted into a dependency Treebank.
– need for head percolation tables
• Completely flat structure in NP
– brown bag lunch, pink-and-yellow child seat
• Represents a particular linguistic theory
PropBank
• PTB with some grammatical relations made explicit
Unification
Mechanism needed to pass and check constraints.
Constraints, syntactic and semantic:
• Subject-verb agreement
– S  NP VP
– the boy reads / the boys read / * the boys reads
• Subject/Auxiliary inversion: (Yes-no-question)
– S  AuxVerb NP VP
– Do you have flights / * does you have flights
• Selectional restrictions:
– An apple reads a book
Need a mechanism to encode these constraints
• Refine the non-terminal set to encode these constraints.
• S  3sgAux 3sgNP VP ; 3sgAux  does | has …
• S  Non3sgAux Non3sgNP VP; Non3sgAux  do | have | can
• We need to split the NP rule into the 3sgNP and Non3sgNP.
• Size of the grammar grows;
• can we factor these constraints out of the structure of the rules?
Unification – contd.
Attribute value matrix:
boy : Number
Person
sg
3
Cat N
read : Number
Person
pl
3
Cat V
Subj agr
NP.number = VP.subj.agr.number
NP.person = VP.subj.agr.person
S  NP VP
reads: Number sg
Cat V
Subj agr
VP  V
VP.number = V.subj.agr.number
VP.person = V.subj.agr.person
Percolate Constraints Check Constraints
The boy reads / * the boys reads / the boys read
boys : Number
Person
pl
3
Cat N
Number sg
Person 1|2
Structural Priming
Structure of preceding sentences helps/hinders the reading times of
subsequent sentences.
• Dative alternation
– The woman gave her car to the church
– The woman gave the church her car
• One of these forms is primed depending on what the prime was
– V NP NP  gave the church her car
– V NP PP  gave her car to the church
Spoken Language Syntax
Not as “clean”, rampant disfluency.
• edits (restarts, repairs)
• Filled pauses
• Ungrammaticality
Sentence  utterance.
“Clean up” the utterance first before understanding it.

More Related Content

Similar to Syntax.ppt

history-of-gfs.ppt
history-of-gfs.ppthistory-of-gfs.ppt
history-of-gfs.pptcamUyanguren
 
Emspresentation11 140918070933-phpapp02
Emspresentation11 140918070933-phpapp02Emspresentation11 140918070933-phpapp02
Emspresentation11 140918070933-phpapp02Abdullah Al-Asmari
 
Deep structure and surface structure
Deep structure and surface structureDeep structure and surface structure
Deep structure and surface structureAsif Ali Raza
 
Grammar and its types
Grammar and its typesGrammar and its types
Grammar and its typesAsif Ali Raza
 
Structural ambiguity
Structural ambiguityStructural ambiguity
Structural ambiguityAsif Ali Raza
 
Phrase structure rules
Phrase structure rulesPhrase structure rules
Phrase structure rulesAsif Ali Raza
 
Introduction to linguistics syntax
Introduction to linguistics syntaxIntroduction to linguistics syntax
Introduction to linguistics syntaxAYOUBDRAOUI
 
grammaticality, deep & surface structure, and ambiguity
grammaticality, deep & surface structure, and ambiguitygrammaticality, deep & surface structure, and ambiguity
grammaticality, deep & surface structure, and ambiguityDedew Deviarini
 
Morphological Analysis
Morphological AnalysisMorphological Analysis
Morphological AnalysisAkshat Pandey
 
Introduction to syntax
Introduction to syntaxIntroduction to syntax
Introduction to syntaxFarjana Ela
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AISATHYANARAYANAKB
 
Natural language processing
Natural language processingNatural language processing
Natural language processingBasha Chand
 

Similar to Syntax.ppt (20)

Syntax
SyntaxSyntax
Syntax
 
history-of-gfs.ppt
history-of-gfs.ppthistory-of-gfs.ppt
history-of-gfs.ppt
 
Emspresentation11 140918070933-phpapp02
Emspresentation11 140918070933-phpapp02Emspresentation11 140918070933-phpapp02
Emspresentation11 140918070933-phpapp02
 
Deep structure and surface structure
Deep structure and surface structureDeep structure and surface structure
Deep structure and surface structure
 
Tree diagram
Tree diagramTree diagram
Tree diagram
 
Recursion
RecursionRecursion
Recursion
 
Grammar and its types
Grammar and its typesGrammar and its types
Grammar and its types
 
Complement phrase
Complement phraseComplement phrase
Complement phrase
 
Structural ambiguity
Structural ambiguityStructural ambiguity
Structural ambiguity
 
Movement rules
Movement rulesMovement rules
Movement rules
 
Phrase structure rules
Phrase structure rulesPhrase structure rules
Phrase structure rules
 
Introduction to linguistics syntax
Introduction to linguistics syntaxIntroduction to linguistics syntax
Introduction to linguistics syntax
 
Lecture 2009-09-22
Lecture 2009-09-22Lecture 2009-09-22
Lecture 2009-09-22
 
grammaticality, deep & surface structure, and ambiguity
grammaticality, deep & surface structure, and ambiguitygrammaticality, deep & surface structure, and ambiguity
grammaticality, deep & surface structure, and ambiguity
 
Syntax
SyntaxSyntax
Syntax
 
Syntax course
Syntax courseSyntax course
Syntax course
 
Morphological Analysis
Morphological AnalysisMorphological Analysis
Morphological Analysis
 
Introduction to syntax
Introduction to syntaxIntroduction to syntax
Introduction to syntax
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AI
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 

Recently uploaded

Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxAnaBeatriceAblay2
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 

Recently uploaded (20)

TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 

Syntax.ppt

  • 2. Why Study Syntax? Syntax provides • systematic rules for forming new sentences in a language. • can be used to verify if a sentence is legitimate in a language. • a step closer to the “meaning” of a sentence. – Who did what to whom semantics Applications • Improving precision in search applications – Yankees beat red sox – Red sox beat yankees • Paraphrasing – John loves Mary = Mary is loved by John • Information Extraction – Fill in a form by extracting information from a document.
  • 3. Structure of Words What are words? • Orthographic tokens separated by white space. In some languages the distinction between words and sentences is less clear. • Chinese, Japanese: no white space between words – nowhitespace  no white space/no whites pace/now hit esp ace • Turkish: words could represent a complete “sentence” – Eg: uygarlastiramadiklarimizdanmissinizcasina Morphology: the structure of words • Basic elements: morphemes • Morphological Rules: how to combine morphemes. Syntax: the structure of sentences • Rules for ordering words in a sentence • Elementary units: Phrasal and Clauses
  • 4. Morphology and Syntax Interplay between syntax and morphology • How much information does a language allow to be packed in a word, and how easy is it to unpack. • More information  less rigid syntax  more free word order • Hindi: “John likes Mary” – all six orders are possible, due to rich morphological information. – John-nom Mary-acc likes English expresses relations between words through word order. Morphologically rich languages have freer word order. • However, some parts have rigid word order. – Noun groups in Hindi: “one yellow book”
  • 5. Outline Constituency • How does this notion arise? • Type of constituents • Representation: Tree Structure Formal device: Context Free Grammars • Derived tree and derivation tree • Grammar Equivalence – Strong and weak generative capacity – Chomsky Normal Form • Other Formal Frameworks (Tree-Adjoining Grammar) Other topics in syntax • Dependency • Spoken language syntax • Structural Priming
  • 6. Constituency Words are grouped into part-of-speech groups • Similar morphological inflections • Allows us to create new word forms (“blog”, “xerox”) • Nouns, Verbs, Determiners, Adjectives etc… Certain sequences of words in a sentence are grouped as constituents • Distributionally similar behavior • cohesive units (move around in a sentence as a unit) – In the morning I take a walk – I take a walk in the morning • Substrings are typed “Clause”, “Noun Phrase”, “Verb Phrase” “Preposition Phrase” etc.
  • 7. Constituency – contd. Examples of constituents: • Noun phrase: – the dog, two big light blue vans • Preposition phrase: – in the box, under the bridge • Clause: – the dog bit the man, John thought the dog bit the man The type of a constituent is derived from the “head word” of the constituent.
  • 8. Constituent Structure Decomposition of a sentence into its constituents. Attaching constituents to each other to reflect relations among words: Emergence of Tree Structure • John saw the man with the telescope • (S (NP John) saw (NP (NP the man) (PP with (NP the telescope)))) • (S (NP John) saw (NP the man) (PP with (NP the telescope)))) Select a sentence from a newspaper text and provide its constituent structure. Evidence of another constituent – verb phrase (“VP”) • Substring involving a verb move around and can be referred to as a unit. – VP-fronting (and quickly clean the carpet he did! ) – VP-ellipsis (He cleaned the carpets quickly, and so did she ) – Can have adjuncts before and after VP, but not in VP (He often eats beans, *he eats often beans )
  • 9. Relations among Words Types of relations between words • Arguments: subject, object, indirect object, prepositional object • Adjuncts: temporal, locative, causal, manner, … • Function Words Subcategorization: List of arguments of a word (verb) • with features about realization (POS, perhaps case, verb form etc) For English, the argument order: Subject-Object-IndirectObj Example: • like: NP-NP (“John likes Mary”), NP-VP(to-inf) (John likes to watch movies) • think: NP-S (“John thought Mary was going to the party”) • put: NP-NP-PP Adjuncts are optional (typically modifiers of an action) • John put the book on the table at 3pm yesterday There are words with “demands” and words that fill the “demands”. • Demands are typed (NP, VP, PP, S)
  • 10. English Syntax: A Sample Sentence types: • Declarative (John closed the door) • Imperative (close the door!!) • Yes-No-Question (can you close the door?) • Wh-question (who closed the door? What did John close?) Clause types: • Infinitival (to read a book) • Gerundive (reading of a book) • Relative Clause (that has a green cover)
  • 11. English Syntax: A Sample – contd. Noun Phrase: • Before the head noun: – Pre-determiner Determiner Post-determiner (Adjective|Noun) Noun • After the head noun (Modifiers) – Preposition phrases – Relative Clauses (the book that has only one sentence) – Gerundive (the flight arriving after 10pm) Auxiliary Verbs • Modal (could, might, will, should…) < perfect (have) < progressive (be) < passive (be) • “might have been destroyed” Large wide-coverage grammars have been developed/under development • XTAG (www.cis.upenn.edu/~xtag), HPSG, LFG
  • 12. Two Representations of Syntactic Structure Phrase structure: illustrates the constituents and its type. Dependency structure: Relations between words without intervening structure. reads boy book the a boy the reads book a DetP NP NP DetP S Adv slowly slowly adj arg0 arg1 fw fw
  • 13. Context Free-Grammars String Rewriting Systems • Transform one string to another (until termination) G=(V,T,P,S) where V: vocabulary of non-terminals T: vocabulary of terminals S: start symbol P: set of productions of the form a  b where a  V and b  (V U T)* Derivation: Rewrite a non-terminal with the production of the grammar until no non-terminals exist in the string. • Start with “S” Sample Context-Free Grammar, derivation and derived structure.
  • 14. Two Representations String rewriting system: we derive a string (=derived structure) But derivation history represented by phrase-structure tree (=derivation structure)! Grammar Equivalence • Can have different grammars that generate same set of strings (weak equivalence) • Can have different grammars that have same set of derivation trees (strong equivalence) • Strong equivalence implies weak equivalence CFG Normal Forms: • Chomsky Normal Form (a  b g) • Griebarch Normal Form (a  w b) • Convert a grammar into CNF and GNF
  • 15. Penn Treebank (PTB) Syntactically annotated corpus (phrase structure) Contains 1 miilion words of Wall Street Journal sentences marked up with syntactic structure. • Can be converted into a dependency Treebank. – need for head percolation tables • Completely flat structure in NP – brown bag lunch, pink-and-yellow child seat • Represents a particular linguistic theory PropBank • PTB with some grammatical relations made explicit
  • 16. Unification Mechanism needed to pass and check constraints. Constraints, syntactic and semantic: • Subject-verb agreement – S  NP VP – the boy reads / the boys read / * the boys reads • Subject/Auxiliary inversion: (Yes-no-question) – S  AuxVerb NP VP – Do you have flights / * does you have flights • Selectional restrictions: – An apple reads a book Need a mechanism to encode these constraints • Refine the non-terminal set to encode these constraints. • S  3sgAux 3sgNP VP ; 3sgAux  does | has … • S  Non3sgAux Non3sgNP VP; Non3sgAux  do | have | can • We need to split the NP rule into the 3sgNP and Non3sgNP. • Size of the grammar grows; • can we factor these constraints out of the structure of the rules?
  • 17. Unification – contd. Attribute value matrix: boy : Number Person sg 3 Cat N read : Number Person pl 3 Cat V Subj agr NP.number = VP.subj.agr.number NP.person = VP.subj.agr.person S  NP VP reads: Number sg Cat V Subj agr VP  V VP.number = V.subj.agr.number VP.person = V.subj.agr.person Percolate Constraints Check Constraints The boy reads / * the boys reads / the boys read boys : Number Person pl 3 Cat N Number sg Person 1|2
  • 18. Structural Priming Structure of preceding sentences helps/hinders the reading times of subsequent sentences. • Dative alternation – The woman gave her car to the church – The woman gave the church her car • One of these forms is primed depending on what the prime was – V NP NP  gave the church her car – V NP PP  gave her car to the church
  • 19. Spoken Language Syntax Not as “clean”, rampant disfluency. • edits (restarts, repairs) • Filled pauses • Ungrammaticality Sentence  utterance. “Clean up” the utterance first before understanding it.