SlideShare a Scribd company logo
1 of 37
Lexical and semantic
selection
Options for grammar
engineers and what
they might mean
linguistically
Outline and
acknowledgements
1. Selection in constraint-based approaches
i. types of selection and overview of methods used in
LKB/ERG
ii. denotation
2. The collocation problem
i. collocation in general
ii. corpus data on magnitude adjectives
iii. possible accounts
3. Conclusions
 Acknowledgements: LinGO/DELPH-IN, especially
Dan Flickinger, also Generative Lexicon 2005
1(i): Types of grammatical
selection
 syntactic: e.g., preposition among selects for
an NP (like other prepositions)
 lexical: e.g., spend selects for PP headed by
on
 Kim spent the money on a car
 semantic: e.g., temporal at selects for times
of day (and meals)
 at 3am
 at three thirty five and ten seconds precisely
Lexical selection
 lexical selection requires method of
specifying a lexeme
 in the ERG, this is via the PRED value
spend (e.g., spend the money on Kim)
spend_v2 := v_np_prep_trans_le &
[ STEM < "spend" >,
SYNSEM [ LKEYS [ --OCOMPKEY _on_p_rel
KEYREL.PRED "_spend_v_rel" ]]].
Lexical selection
 ERG relies on convention that different lexemes have
different relations
 `lexical’ selection is actually semantic. cf Wechsler
 no true synonyms assumption, or assume that grammar
makes distinctions that are more fine-grained than real-
world denotation justifies.
 near-synonymy would have to be recorded elsewhere: ERG
does (some) morphology, syntax and compositional
semantics
 alternatives?
 orthography: but ambiguity or non-monotonic semantics
 lexical identifier: requires new feature
 PFORM: requires features, values
Semantic selection
 Requires a method of specifying a
semantically-defined phrase
 In ERG, done by specifying a higher
node in the hierarchy of relations:
at_temp := p_temp_le &
[ STEM < "at" >,
SYNSEM [ LKEYS [ --COMPKEY hour_or_time_rel,
KEYREL.PRED _at_p_temp_rel ]]].
Hierarchy of relations
Semantic selection
 Semantic selection allows for indefinitely large set of
alternative phrases
 compositionally constructed time expressions
 productive with respect to new words, but exceptions
allowable
• approach wouldn’t be falsified if e.g., *at tiffin
 ERG lexical selection is a special case of ERG
semantic selection!
 could assume featural encoding of semantic
properties (alternatively or in addition to hierarchy)
 TFS semantic selection is relatively limited practically
(see later)
 also idiom mechanism in ERG
1(ii): Denotation, grammar
engineering perspective
 Denotation is truth-conditional, logically formalisable (in
principle), refers to `real world’ (extension)
 Not necessarily decomposable
 Naive physics, biology, etc
 Must interface with non-linguistic components
 Minimising lexical complexity in broad-coverage grammars is
practically necessary
 Plausible input to generator:
 reasonable to expect real world constraints to be obeyed (except in
context)
• the goat read the book
 Potential disambiguation is not a sufficient condition for lexical
encoding
 The vet treated the rabbit and the guinea pig with dietary Vitamin
C deficiency
Denotation, continued
 Assume linkage to domain, richer knowledge
representation language available
 TFS language for syntax etc, not intended for
general inference
 Talmy example: the baguette lay across the
road
 across - Figure’s length > Ground’s width
 identifying F and G and location for comparison in
grammar?
 coding average length of all nouns?
 allowing for massive baguettes and tiny roads?
But ...
 Trend in KR is towards description logics rather than richer
languages.
 Need to think about the denotation to justify grammaticization
(or otherwise)
 if temporal in/on/at have same denotation, selectional account is
required for different distribution
 unreasonable to expect lexical choice for in/on/at in input to
generator
 Linguistic criteria: denotation versus grammaticization?
 effect found cross-linguistically?
 predictable on basis of world knowledge?
 closed class vs open class
 Practical considerations about interfacing go along with
linguistic criteria
 non-linguists expect some information about word meaning!
 allow generalisation over e.g., in/on/at in generator input, while
keeping possibility of distinction
2(i) Collocation: assumptions
 Significant co-occurrences of words in
syntactically interesting relationships
 `syntactically interesting’: for examples in this
talk, attributive adjectives and the nouns they
immediately precede
 `significant’: statistically significant (but on what
assumptions about baseline?)
 Compositional, no idiosyncratic syntax etc (as
opposed to multiword expression)
 About language rather than the real world
Collocation versus denotation
 Whether an unusually frequent word pair is a
collocation or not depends on assumptions about
denotation: fix denotation to investigate collocation
 Empirically: investigations using WordNet synsets
(Pearce, 2001)
 Anti-collocation: words that might be expected to go
together and tend not to
 e.g., flawless behaviour (Cruse, 1986): big rain (unless
explained by denotation)
 e.g., buy house is predictable on basis of denotation,
shake fist is not
2(ii): Distribution of
`magnitude’ adjectives
 some very frequent adjectives have magnitude-
related meanings (e.g., heavy, high, big, large)
 basic meaning with simple concrete entities
 extended meaning with abstract nouns, non-concrete
physical entities (high taxation, heavy rain)
 extended uses more common than basic
 not all magnitude adjectives – e.g. tall
 nouns tend to occur with a limited subset of these
extended adjectives
 some apparent semantic groupings of nouns which
go with particular adjectives, but not easily specified
Some adjective-noun
frequencies in the BNC
number proportion quality problem part winds rain
large 1790 404 0 10 533 0 0
high 92 501 799 0 3 90 0
big 11 1 0 79 79 3 1
heavy 0 0 1 0 1 2 198
Grammaticality judgments
number proportion quality problem part winds rain
large * ? * *
high * ? *
big ? *
heavy ? * * *
More examples
impor
tance
success majority number proport
ion
quality role problem part winds support rain
great 310 360 382 172 9 11 3 44 71 0 22 0
large 1 1 112 1790 404 0 13 10 533 0 1 0
high 8 0 0 92 501 799 1 0 3 90 2 0
major 62 60 0 0 7 0 272 356 408 1 8 0
big 0 40 5 11 1 0 3 79 79 3 1 1
strong 0 0 2 0 0 1 8 0 3 132 147 0
heavy 0 0 1 0 0 1 0 0 1 2 4 198
Judgments
impor
tance
success majority number
proporti
on
quality role problem part winds support rain
great ? *
large ? ? * ? * *
high * ? ? * ? *
major ? ? ?
big ? ?
strong ? ? * * * * ?
heavy ? * ? * * * *
Distribution
 Investigated the distribution of heavy, high, big,
large, strong, great, major with the most common
co-occurring nouns in the BNC
 Nouns tend to occur with up to three of these
adjectives with high frequency and low or zero
frequency with the rest
 My intuitive grammaticality judgments correlate but
allow for some unseen combinations and disallow a
few observed but very infrequent ones
 big, major and great are grammatical with many
nouns (but not frequent with most), strong and
heavy are ungrammatical with most nouns, high and
large intermediate
heavy: groupings?
magnitude: dew, rainstorm, downpour, rain, rainfall,
snowfall, fall, snow, shower: frost, spindrift: clouds,
mist, fog: flow, flooding, bleeding, period, traffic:
demands, reliance, workload, responsibility, emphasis,
dependence: irony, sarcasm, criticism: infestation,
soiling: loss, price, cost, expenditure, taxation, fine,
penalty, damages, investment: punishment, sentence:
fire, bombardment, casualties, defeat, fighting:
burden, load, weight, pressure: crop: advertising: use,
drinking:
magnitude of verb: drinker, smoker:
magnitude related? odour, perfume, scent, smell,
whiff: lunch: sea, surf, swell:
high: groupings?
magnitude: esteem, status, regard, reputation,
standing, calibre, value, priority; grade, quality, level;
proportion, degree, incidence, frequency, number,
prevalence, percentage; volume, speed, voltage,
pressure, concentration, density, performance,
temperature, energy, resolution, dose, wind; risk, cost,
price, rate, inflation, tax, taxation, mortality, turnover,
wage, income, productivity, unemployment, demand
magnitude of verb: earner
heavy and high
 50 nouns in BNC with the extended
magnitude use of heavy with frequency
10 or more
 160 such nouns with high
 Only 9 such nouns with both adjectives:
price, pressure, investment, demand,
rainfall, cost, costs, concentration,
taxation
2(iii): Possible empirical
accounts of distribution
1. Difference in denotation between
`extended’ uses of adjectives
2. Grammaticized selectional
restrictions/preferences
3. Lexical selection
• stipulate Magn function with nouns (Meaning-
Text Theory)
4. Semi-productivity / collocation
• plus semantic back-off
1 - Denotation account of
distribution
 Denotation of adjective simply prevents it being
possible with the noun. Implies that heavy and high
have different denotations
heavy’(x) => MF(x) > norm(MF,type(x),c) & precipitation(x) or
cost(x) or flow(x) or consumption(x)...
(where rain(x) -> precipitation(x) and so on)
 But: messy disjunction or multiple senses, open-
ended, unlikely to be tractable.
 e.g., heavy shower only for rain sense, not bathroom sense
 Not falsifiable, but no motivation other than
distribution.
 Dictionary definitions can be seen as doing this
(informally), but none account for observed
distribution.
 Input to generator?
2 - Selectional restrictions and
distribution
 Assume the adjectives have the same denotation
 Distribution via features in the lexicon
 e.g., literal high selects for [ANIMATE false ]
 cf., approach used in the ERG for in/on/at in temporal
expressions
 grammaticized, so doesn’t need to be determined by
denotation (though assume consistency)
 could utilise qualia structure
 Problem: can’t find a reasonable set of cross-cutting
features!
 Stipulative approach possible, but unattractive.
3 - Lexical selection
 MTT approach
 noun specifies its Magn adjective
 in Mel’čuk and Polguère (1987), Magn is a
function, but could modify to make it a set, or
vary meanings
 could also make adjective specify set of
nouns, though not directly in LKB logic
 stipulative: if we’re going to do this, why not
use a corpus directly?
4- Collocational account of
distribution
 all the adjectives share a denotation corresponding
to magnitude, distribution differences due to
collocation, soft rather than hard constraints
 linguistically:
 adjective-noun combination is semi-productive
 denotation and syntax allow heavy esteem etc, but speakers
are sensitive to frequencies, prefer more frequent phrases
with same meaning
 cf morphology and sense extension: Briscoe and Copestake
(1999). Blocking (but weaker than with morphology)
 anti-collocations as reflection of semi-productivity
Collocational account of
distribution
 computationally,
 fits with some current practice:
• filter adjective-noun realisations according to n-
grams (statistical generation – e.g., Langkilde
and Knight, recent experiments with ERG)
• use of co-occurrences in WSD
 back-off techniques
 requires an approach to clustering
semantic spaces
 acquired from corpora
 generally, collect vectors of words which co-occur with
the target
 best known is LSA: often used in psycholinguistics
 more sophisticated models incorporate syntactic
relationships
 currently sexy, but severe limitations!
dog bark house cat
dog - 1 0 0
bark 1 - 0 0
Back-off and analogy
 back-off: decision for infrequent noun with no corpus
evidence for specific magnitude adjective
 should be partly based on productivity of adjective:
number of nouns it occurs with
 default to big
 back-off also sensitive to word clusters
 e.g., heavy spindrift because spindrift is semantically similar
to snow
 semantic space models: i.e., group according to distribution
with other words
 hence, adjective has some correlation with semantics of the
noun
Metaphor
 Different metaphors for different nouns (cf., Lakoff et
al)
 `high’ nouns measured with an upright scale: e.g.,
temperature: temperature is rising
 `heavy’ nouns metaphorically like burden: e.g., workload:
her workload is weighing on her
 Doesn’t lead to an empirical account of distribution,
since we can’t predict classes. Assumption of literal
denotation followed by coercion is implausible.
 But: extended metaphor idea is consistent with idea
that clusters for backoff are based on semantic space
Collocation and linguistic
theory
 Collocation plus semantic space clusters may account for some
of the `messy’ bits, at least for some speakers.
 in/on transport: in the car, on the bus
 Talmy: presence of walkway, `ragged lower end of hierarchy’
 but trains without walkway, caravans with walkway?
 in/on choice perhaps collocational, not real exception to language-
independent schema elements
 Potential to simplify linguistic theories considerably.
 Success of ngrams, LSA models of priming.
 Practically testable: assume same denotation of heavy/high or
in/on, see if we can account for distribution in corpus.
 Alternative for temporal in/on/at?
 Experiments with machine learning temporal in/on/at (Mei Lin,
MPhil thesis, 2004): very successful at predicting distribution, but
used lots of Treebank-derived features.
Summary
 Selection in ERG
 Other aspects of ERG selection not described here:
multiword expressions and idioms
 Collocational models as adjunct to TFS
encoding
 Role of denotation is crucial
 Practical considerations about grammar
usability
Final remarks
 Grammar usability:
 A good broad-coverage grammar should have an
account of denotation of closed-class words at
least, but probably not within TFS encoding.
 Can we use semantic web languages for non-
domain-specific encoding?
 Collocational techniques require much further
investigation
 Can semantic space models be related to
denotation (e.g., somehow excluding
collocational component)?
Idioms
Idiom entry:
stand+guard := v_nbar_idiom &
[ SYNSEM.LOCAL.CONT.RELS
<! [ PRED "_stand_v_i_rel" ],
[ PRED "_guard_n_i_rel" ] !> ].
Idiomatic lexical entries:
guard_n1_i := n_intr_nospr_le &
[ STEM < "guard" >,
SYNSEM [ LKEYS.KEYREL.PRED "_guard_n_i_rel“ ]].
stand_v1_i := v_np_non_trans_idiom_le &
[ STEM < "stand" >,
SYNSEM [ LKEYS.KEYREL.PRED "_stand_v_i_rel”]].
Idioms in ERG/LKB
 Account based on Wasow et al (1982), Nunberg et al
(1994).
 Idiom entry specifies a set of coindexed MRS
relations (coindexation specified by idiom type, e.g.,
v_nbar_idiom)
 Relations may correspond to idiomatic lexical entries
(but may be literal uses: e.g., cat out of the bag –
literal out of the).
 Idiom is recognised if some phrase matches the
idiom entry.
 Allows for modification: e.g., stand watchful guard
Messy examples
 among: requires group or plural or ?
 among the family (BNC)
 among the chaos (BNC)
 between: requires plural denoting two
objects, but not group (?)
 fudge sandwiched between sponge (BNC)
 between each tendon (BNC)
 ? the actor threw a dart between the couple
 * the actor threw a dart between the audience
(even if only two people in the audience)

More Related Content

Similar to scl.ppt

An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsTye Rausch
 
Chapter 2 Text Operation.pdf
Chapter 2 Text Operation.pdfChapter 2 Text Operation.pdf
Chapter 2 Text Operation.pdfHabtamu100
 
Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityBeyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityIIIT Hyderabad
 
causativeppt.ppt
causativeppt.pptcausativeppt.ppt
causativeppt.pptAjiNug3
 
What can a corpus tell us about grammar
What can a corpus tell us about grammarWhat can a corpus tell us about grammar
What can a corpus tell us about grammarSami Khalil
 
Evolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche KuchaEvolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche KuchaRichard Littauer
 
Corpus linguistics and multi-word units
Corpus linguistics and multi-word unitsCorpus linguistics and multi-word units
Corpus linguistics and multi-word unitsPascual Pérez-Paredes
 
Chapter 1 about semantics
Chapter 1 about semanticsChapter 1 about semantics
Chapter 1 about semanticsAmena dheif
 
Computer assisted text and corpus analysis
Computer assisted text and corpus analysisComputer assisted text and corpus analysis
Computer assisted text and corpus analysisRubyaShaheen
 
DDH 2021-03-03: Text Processing and Searching in the Medical Domain
DDH 2021-03-03: Text Processing and Searching in the Medical DomainDDH 2021-03-03: Text Processing and Searching in the Medical Domain
DDH 2021-03-03: Text Processing and Searching in the Medical DomainLuukBoulogne
 
Pragmatics_and_Presupposition.pdf
Pragmatics_and_Presupposition.pdfPragmatics_and_Presupposition.pdf
Pragmatics_and_Presupposition.pdfKassahun16
 

Similar to scl.ppt (20)

An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical Semantics
 
Chapter 2 Text Operation.pdf
Chapter 2 Text Operation.pdfChapter 2 Text Operation.pdf
Chapter 2 Text Operation.pdf
 
samplepaper
samplepapersamplepaper
samplepaper
 
Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityBeyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
 
Collocation.ppt
Collocation.pptCollocation.ppt
Collocation.ppt
 
causativeppt.ppt
causativeppt.pptcausativeppt.ppt
causativeppt.ppt
 
Nlp ambiguity presentation
Nlp ambiguity presentationNlp ambiguity presentation
Nlp ambiguity presentation
 
What can a corpus tell us about grammar
What can a corpus tell us about grammarWhat can a corpus tell us about grammar
What can a corpus tell us about grammar
 
Syntax
SyntaxSyntax
Syntax
 
Evolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche KuchaEvolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche Kucha
 
Grammar 4
Grammar 4Grammar 4
Grammar 4
 
Grammar 4
Grammar 4Grammar 4
Grammar 4
 
Measuring Similarity Between Contexts and Concepts
Measuring Similarity Between Contexts and ConceptsMeasuring Similarity Between Contexts and Concepts
Measuring Similarity Between Contexts and Concepts
 
Corpus linguistics and multi-word units
Corpus linguistics and multi-word unitsCorpus linguistics and multi-word units
Corpus linguistics and multi-word units
 
Chapter 1 about semantics
Chapter 1 about semanticsChapter 1 about semantics
Chapter 1 about semantics
 
Pragmatics
PragmaticsPragmatics
Pragmatics
 
Pragmatics slide 1
Pragmatics slide 1Pragmatics slide 1
Pragmatics slide 1
 
Computer assisted text and corpus analysis
Computer assisted text and corpus analysisComputer assisted text and corpus analysis
Computer assisted text and corpus analysis
 
DDH 2021-03-03: Text Processing and Searching in the Medical Domain
DDH 2021-03-03: Text Processing and Searching in the Medical DomainDDH 2021-03-03: Text Processing and Searching in the Medical Domain
DDH 2021-03-03: Text Processing and Searching in the Medical Domain
 
Pragmatics_and_Presupposition.pdf
Pragmatics_and_Presupposition.pdfPragmatics_and_Presupposition.pdf
Pragmatics_and_Presupposition.pdf
 

More from MariYam371004

SOCIAL_IDENTITY_pptx.pptx
SOCIAL_IDENTITY_pptx.pptxSOCIAL_IDENTITY_pptx.pptx
SOCIAL_IDENTITY_pptx.pptxMariYam371004
 
srategies-for-teaching-language-skills.ppt
srategies-for-teaching-language-skills.pptsrategies-for-teaching-language-skills.ppt
srategies-for-teaching-language-skills.pptMariYam371004
 
EMLex-Diachronic-lexicography-and-lexicology-1.ppt
EMLex-Diachronic-lexicography-and-lexicology-1.pptEMLex-Diachronic-lexicography-and-lexicology-1.ppt
EMLex-Diachronic-lexicography-and-lexicology-1.pptMariYam371004
 
dokumen.tips_what-is-apa-apa-format-is-a-standard-set-of-conventionsrules-for...
dokumen.tips_what-is-apa-apa-format-is-a-standard-set-of-conventionsrules-for...dokumen.tips_what-is-apa-apa-format-is-a-standard-set-of-conventionsrules-for...
dokumen.tips_what-is-apa-apa-format-is-a-standard-set-of-conventionsrules-for...MariYam371004
 
Sociolinguistics 2.pptx
Sociolinguistics 2.pptxSociolinguistics 2.pptx
Sociolinguistics 2.pptxMariYam371004
 
MixedLanguageProcessingTutorialEMNLP2019.pptx
MixedLanguageProcessingTutorialEMNLP2019.pptxMixedLanguageProcessingTutorialEMNLP2019.pptx
MixedLanguageProcessingTutorialEMNLP2019.pptxMariYam371004
 
Word-Formation-Presentation.pptx
Word-Formation-Presentation.pptxWord-Formation-Presentation.pptx
Word-Formation-Presentation.pptxMariYam371004
 
lingua-franca-151019083140-lva1-app6892.pdf
lingua-franca-151019083140-lva1-app6892.pdflingua-franca-151019083140-lva1-app6892.pdf
lingua-franca-151019083140-lva1-app6892.pdfMariYam371004
 
Phonetic-Chart-BE.pdf
Phonetic-Chart-BE.pdfPhonetic-Chart-BE.pdf
Phonetic-Chart-BE.pdfMariYam371004
 
lecturer-website-result-final.pdf
lecturer-website-result-final.pdflecturer-website-result-final.pdf
lecturer-website-result-final.pdfMariYam371004
 
5-Listening-Skills-2.ppt
5-Listening-Skills-2.ppt5-Listening-Skills-2.ppt
5-Listening-Skills-2.pptMariYam371004
 
The Conditionals.pdf
The Conditionals.pdfThe Conditionals.pdf
The Conditionals.pdfMariYam371004
 

More from MariYam371004 (19)

SOCIAL_IDENTITY_pptx.pptx
SOCIAL_IDENTITY_pptx.pptxSOCIAL_IDENTITY_pptx.pptx
SOCIAL_IDENTITY_pptx.pptx
 
srategies-for-teaching-language-skills.ppt
srategies-for-teaching-language-skills.pptsrategies-for-teaching-language-skills.ppt
srategies-for-teaching-language-skills.ppt
 
EMLex-Diachronic-lexicography-and-lexicology-1.ppt
EMLex-Diachronic-lexicography-and-lexicology-1.pptEMLex-Diachronic-lexicography-and-lexicology-1.ppt
EMLex-Diachronic-lexicography-and-lexicology-1.ppt
 
dokumen.tips_what-is-apa-apa-format-is-a-standard-set-of-conventionsrules-for...
dokumen.tips_what-is-apa-apa-format-is-a-standard-set-of-conventionsrules-for...dokumen.tips_what-is-apa-apa-format-is-a-standard-set-of-conventionsrules-for...
dokumen.tips_what-is-apa-apa-format-is-a-standard-set-of-conventionsrules-for...
 
DESCRIPTIVE.pptx
DESCRIPTIVE.pptxDESCRIPTIVE.pptx
DESCRIPTIVE.pptx
 
Sociolinguistics 2.pptx
Sociolinguistics 2.pptxSociolinguistics 2.pptx
Sociolinguistics 2.pptx
 
MixedLanguageProcessingTutorialEMNLP2019.pptx
MixedLanguageProcessingTutorialEMNLP2019.pptxMixedLanguageProcessingTutorialEMNLP2019.pptx
MixedLanguageProcessingTutorialEMNLP2019.pptx
 
Morphology.2.ppt
Morphology.2.pptMorphology.2.ppt
Morphology.2.ppt
 
chapter_5_3.pptx
chapter_5_3.pptxchapter_5_3.pptx
chapter_5_3.pptx
 
Word-Formation-Presentation.pptx
Word-Formation-Presentation.pptxWord-Formation-Presentation.pptx
Word-Formation-Presentation.pptx
 
lingua-franca-151019083140-lva1-app6892.pdf
lingua-franca-151019083140-lva1-app6892.pdflingua-franca-151019083140-lva1-app6892.pdf
lingua-franca-151019083140-lva1-app6892.pdf
 
Phonetic-Chart-BE.pdf
Phonetic-Chart-BE.pdfPhonetic-Chart-BE.pdf
Phonetic-Chart-BE.pdf
 
verbrevf3.ppt
verbrevf3.pptverbrevf3.ppt
verbrevf3.ppt
 
9Identity.ppt
9Identity.ppt9Identity.ppt
9Identity.ppt
 
Assimilation.pdf
Assimilation.pdfAssimilation.pdf
Assimilation.pdf
 
meaning.ppt.pdf
meaning.ppt.pdfmeaning.ppt.pdf
meaning.ppt.pdf
 
lecturer-website-result-final.pdf
lecturer-website-result-final.pdflecturer-website-result-final.pdf
lecturer-website-result-final.pdf
 
5-Listening-Skills-2.ppt
5-Listening-Skills-2.ppt5-Listening-Skills-2.ppt
5-Listening-Skills-2.ppt
 
The Conditionals.pdf
The Conditionals.pdfThe Conditionals.pdf
The Conditionals.pdf
 

Recently uploaded

VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...Call Girls in Nagpur High Profile
 
SD_The MATATAG Curriculum Training Design.pptx
SD_The MATATAG Curriculum Training Design.pptxSD_The MATATAG Curriculum Training Design.pptx
SD_The MATATAG Curriculum Training Design.pptxjanettecruzeiro1
 
PODSCAPE - Brochure 2023_ prefab homes in Bangalore India
PODSCAPE - Brochure 2023_ prefab homes in Bangalore IndiaPODSCAPE - Brochure 2023_ prefab homes in Bangalore India
PODSCAPE - Brochure 2023_ prefab homes in Bangalore IndiaYathish29
 
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
CALL ON ➥8923113531 🔝Call Girls Aminabad Lucknow best Night Fun service
CALL ON ➥8923113531 🔝Call Girls Aminabad Lucknow best Night Fun serviceCALL ON ➥8923113531 🔝Call Girls Aminabad Lucknow best Night Fun service
CALL ON ➥8923113531 🔝Call Girls Aminabad Lucknow best Night Fun serviceanilsa9823
 
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service AmravatiVIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
VIP Russian Call Girls in Saharanpur Deepika 8250192130 Independent Escort Se...
VIP Russian Call Girls in Saharanpur Deepika 8250192130 Independent Escort Se...VIP Russian Call Girls in Saharanpur Deepika 8250192130 Independent Escort Se...
VIP Russian Call Girls in Saharanpur Deepika 8250192130 Independent Escort Se...Suhani Kapoor
 
WAEC Carpentry and Joinery Past Questions
WAEC Carpentry and Joinery Past QuestionsWAEC Carpentry and Joinery Past Questions
WAEC Carpentry and Joinery Past QuestionsCharles Obaleagbon
 
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk GurgaonCheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk GurgaonDelhi Call girls
 
VIP Call Girls Service Kukatpally Hyderabad Call +91-8250192130
VIP Call Girls Service Kukatpally Hyderabad Call +91-8250192130VIP Call Girls Service Kukatpally Hyderabad Call +91-8250192130
VIP Call Girls Service Kukatpally Hyderabad Call +91-8250192130Suhani Kapoor
 
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service 🧵
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service  🧵CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service  🧵
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service 🧵anilsa9823
 
Fashion trends before and after covid.pptx
Fashion trends before and after covid.pptxFashion trends before and after covid.pptx
Fashion trends before and after covid.pptxVanshNarang19
 
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdfThe_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdfAmirYakdi
 
Kindergarten Assessment Questions Via LessonUp
Kindergarten Assessment Questions Via LessonUpKindergarten Assessment Questions Via LessonUp
Kindergarten Assessment Questions Via LessonUpmainac1
 
VIP Russian Call Girls in Gorakhpur Deepika 8250192130 Independent Escort Ser...
VIP Russian Call Girls in Gorakhpur Deepika 8250192130 Independent Escort Ser...VIP Russian Call Girls in Gorakhpur Deepika 8250192130 Independent Escort Ser...
VIP Russian Call Girls in Gorakhpur Deepika 8250192130 Independent Escort Ser...Suhani Kapoor
 
Cosumer Willingness to Pay for Sustainable Bricks
Cosumer Willingness to Pay for Sustainable BricksCosumer Willingness to Pay for Sustainable Bricks
Cosumer Willingness to Pay for Sustainable Bricksabhishekparmar618
 
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130Suhani Kapoor
 
The history of music videos a level presentation
The history of music videos a level presentationThe history of music videos a level presentation
The history of music videos a level presentationamedia6
 
Design Portfolio - 2024 - William Vickery
Design Portfolio - 2024 - William VickeryDesign Portfolio - 2024 - William Vickery
Design Portfolio - 2024 - William VickeryWilliamVickery6
 
Revit Understanding Reference Planes and Reference lines in Revit for Family ...
Revit Understanding Reference Planes and Reference lines in Revit for Family ...Revit Understanding Reference Planes and Reference lines in Revit for Family ...
Revit Understanding Reference Planes and Reference lines in Revit for Family ...Narsimha murthy
 

Recently uploaded (20)

VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
 
SD_The MATATAG Curriculum Training Design.pptx
SD_The MATATAG Curriculum Training Design.pptxSD_The MATATAG Curriculum Training Design.pptx
SD_The MATATAG Curriculum Training Design.pptx
 
PODSCAPE - Brochure 2023_ prefab homes in Bangalore India
PODSCAPE - Brochure 2023_ prefab homes in Bangalore IndiaPODSCAPE - Brochure 2023_ prefab homes in Bangalore India
PODSCAPE - Brochure 2023_ prefab homes in Bangalore India
 
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Okhla Delhi 💯Call Us 🔝8264348440🔝
 
CALL ON ➥8923113531 🔝Call Girls Aminabad Lucknow best Night Fun service
CALL ON ➥8923113531 🔝Call Girls Aminabad Lucknow best Night Fun serviceCALL ON ➥8923113531 🔝Call Girls Aminabad Lucknow best Night Fun service
CALL ON ➥8923113531 🔝Call Girls Aminabad Lucknow best Night Fun service
 
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service AmravatiVIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
VIP Call Girl Amravati Aashi 8250192130 Independent Escort Service Amravati
 
VIP Russian Call Girls in Saharanpur Deepika 8250192130 Independent Escort Se...
VIP Russian Call Girls in Saharanpur Deepika 8250192130 Independent Escort Se...VIP Russian Call Girls in Saharanpur Deepika 8250192130 Independent Escort Se...
VIP Russian Call Girls in Saharanpur Deepika 8250192130 Independent Escort Se...
 
WAEC Carpentry and Joinery Past Questions
WAEC Carpentry and Joinery Past QuestionsWAEC Carpentry and Joinery Past Questions
WAEC Carpentry and Joinery Past Questions
 
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk GurgaonCheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
Cheap Rate ➥8448380779 ▻Call Girls In Iffco Chowk Gurgaon
 
VIP Call Girls Service Kukatpally Hyderabad Call +91-8250192130
VIP Call Girls Service Kukatpally Hyderabad Call +91-8250192130VIP Call Girls Service Kukatpally Hyderabad Call +91-8250192130
VIP Call Girls Service Kukatpally Hyderabad Call +91-8250192130
 
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service 🧵
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service  🧵CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service  🧵
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service 🧵
 
Fashion trends before and after covid.pptx
Fashion trends before and after covid.pptxFashion trends before and after covid.pptx
Fashion trends before and after covid.pptx
 
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdfThe_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
The_Canvas_of_Creative_Mastery_Newsletter_April_2024_Version.pdf
 
Kindergarten Assessment Questions Via LessonUp
Kindergarten Assessment Questions Via LessonUpKindergarten Assessment Questions Via LessonUp
Kindergarten Assessment Questions Via LessonUp
 
VIP Russian Call Girls in Gorakhpur Deepika 8250192130 Independent Escort Ser...
VIP Russian Call Girls in Gorakhpur Deepika 8250192130 Independent Escort Ser...VIP Russian Call Girls in Gorakhpur Deepika 8250192130 Independent Escort Ser...
VIP Russian Call Girls in Gorakhpur Deepika 8250192130 Independent Escort Ser...
 
Cosumer Willingness to Pay for Sustainable Bricks
Cosumer Willingness to Pay for Sustainable BricksCosumer Willingness to Pay for Sustainable Bricks
Cosumer Willingness to Pay for Sustainable Bricks
 
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
VIP Call Girls Service Bhagyanagar Hyderabad Call +91-8250192130
 
The history of music videos a level presentation
The history of music videos a level presentationThe history of music videos a level presentation
The history of music videos a level presentation
 
Design Portfolio - 2024 - William Vickery
Design Portfolio - 2024 - William VickeryDesign Portfolio - 2024 - William Vickery
Design Portfolio - 2024 - William Vickery
 
Revit Understanding Reference Planes and Reference lines in Revit for Family ...
Revit Understanding Reference Planes and Reference lines in Revit for Family ...Revit Understanding Reference Planes and Reference lines in Revit for Family ...
Revit Understanding Reference Planes and Reference lines in Revit for Family ...
 

scl.ppt

  • 1. Lexical and semantic selection Options for grammar engineers and what they might mean linguistically
  • 2. Outline and acknowledgements 1. Selection in constraint-based approaches i. types of selection and overview of methods used in LKB/ERG ii. denotation 2. The collocation problem i. collocation in general ii. corpus data on magnitude adjectives iii. possible accounts 3. Conclusions  Acknowledgements: LinGO/DELPH-IN, especially Dan Flickinger, also Generative Lexicon 2005
  • 3. 1(i): Types of grammatical selection  syntactic: e.g., preposition among selects for an NP (like other prepositions)  lexical: e.g., spend selects for PP headed by on  Kim spent the money on a car  semantic: e.g., temporal at selects for times of day (and meals)  at 3am  at three thirty five and ten seconds precisely
  • 4. Lexical selection  lexical selection requires method of specifying a lexeme  in the ERG, this is via the PRED value spend (e.g., spend the money on Kim) spend_v2 := v_np_prep_trans_le & [ STEM < "spend" >, SYNSEM [ LKEYS [ --OCOMPKEY _on_p_rel KEYREL.PRED "_spend_v_rel" ]]].
  • 5. Lexical selection  ERG relies on convention that different lexemes have different relations  `lexical’ selection is actually semantic. cf Wechsler  no true synonyms assumption, or assume that grammar makes distinctions that are more fine-grained than real- world denotation justifies.  near-synonymy would have to be recorded elsewhere: ERG does (some) morphology, syntax and compositional semantics  alternatives?  orthography: but ambiguity or non-monotonic semantics  lexical identifier: requires new feature  PFORM: requires features, values
  • 6. Semantic selection  Requires a method of specifying a semantically-defined phrase  In ERG, done by specifying a higher node in the hierarchy of relations: at_temp := p_temp_le & [ STEM < "at" >, SYNSEM [ LKEYS [ --COMPKEY hour_or_time_rel, KEYREL.PRED _at_p_temp_rel ]]].
  • 8. Semantic selection  Semantic selection allows for indefinitely large set of alternative phrases  compositionally constructed time expressions  productive with respect to new words, but exceptions allowable • approach wouldn’t be falsified if e.g., *at tiffin  ERG lexical selection is a special case of ERG semantic selection!  could assume featural encoding of semantic properties (alternatively or in addition to hierarchy)  TFS semantic selection is relatively limited practically (see later)  also idiom mechanism in ERG
  • 9. 1(ii): Denotation, grammar engineering perspective  Denotation is truth-conditional, logically formalisable (in principle), refers to `real world’ (extension)  Not necessarily decomposable  Naive physics, biology, etc  Must interface with non-linguistic components  Minimising lexical complexity in broad-coverage grammars is practically necessary  Plausible input to generator:  reasonable to expect real world constraints to be obeyed (except in context) • the goat read the book  Potential disambiguation is not a sufficient condition for lexical encoding  The vet treated the rabbit and the guinea pig with dietary Vitamin C deficiency
  • 10. Denotation, continued  Assume linkage to domain, richer knowledge representation language available  TFS language for syntax etc, not intended for general inference  Talmy example: the baguette lay across the road  across - Figure’s length > Ground’s width  identifying F and G and location for comparison in grammar?  coding average length of all nouns?  allowing for massive baguettes and tiny roads?
  • 11. But ...  Trend in KR is towards description logics rather than richer languages.  Need to think about the denotation to justify grammaticization (or otherwise)  if temporal in/on/at have same denotation, selectional account is required for different distribution  unreasonable to expect lexical choice for in/on/at in input to generator  Linguistic criteria: denotation versus grammaticization?  effect found cross-linguistically?  predictable on basis of world knowledge?  closed class vs open class  Practical considerations about interfacing go along with linguistic criteria  non-linguists expect some information about word meaning!  allow generalisation over e.g., in/on/at in generator input, while keeping possibility of distinction
  • 12. 2(i) Collocation: assumptions  Significant co-occurrences of words in syntactically interesting relationships  `syntactically interesting’: for examples in this talk, attributive adjectives and the nouns they immediately precede  `significant’: statistically significant (but on what assumptions about baseline?)  Compositional, no idiosyncratic syntax etc (as opposed to multiword expression)  About language rather than the real world
  • 13. Collocation versus denotation  Whether an unusually frequent word pair is a collocation or not depends on assumptions about denotation: fix denotation to investigate collocation  Empirically: investigations using WordNet synsets (Pearce, 2001)  Anti-collocation: words that might be expected to go together and tend not to  e.g., flawless behaviour (Cruse, 1986): big rain (unless explained by denotation)  e.g., buy house is predictable on basis of denotation, shake fist is not
  • 14. 2(ii): Distribution of `magnitude’ adjectives  some very frequent adjectives have magnitude- related meanings (e.g., heavy, high, big, large)  basic meaning with simple concrete entities  extended meaning with abstract nouns, non-concrete physical entities (high taxation, heavy rain)  extended uses more common than basic  not all magnitude adjectives – e.g. tall  nouns tend to occur with a limited subset of these extended adjectives  some apparent semantic groupings of nouns which go with particular adjectives, but not easily specified
  • 15. Some adjective-noun frequencies in the BNC number proportion quality problem part winds rain large 1790 404 0 10 533 0 0 high 92 501 799 0 3 90 0 big 11 1 0 79 79 3 1 heavy 0 0 1 0 1 2 198
  • 16. Grammaticality judgments number proportion quality problem part winds rain large * ? * * high * ? * big ? * heavy ? * * *
  • 17. More examples impor tance success majority number proport ion quality role problem part winds support rain great 310 360 382 172 9 11 3 44 71 0 22 0 large 1 1 112 1790 404 0 13 10 533 0 1 0 high 8 0 0 92 501 799 1 0 3 90 2 0 major 62 60 0 0 7 0 272 356 408 1 8 0 big 0 40 5 11 1 0 3 79 79 3 1 1 strong 0 0 2 0 0 1 8 0 3 132 147 0 heavy 0 0 1 0 0 1 0 0 1 2 4 198
  • 18. Judgments impor tance success majority number proporti on quality role problem part winds support rain great ? * large ? ? * ? * * high * ? ? * ? * major ? ? ? big ? ? strong ? ? * * * * ? heavy ? * ? * * * *
  • 19. Distribution  Investigated the distribution of heavy, high, big, large, strong, great, major with the most common co-occurring nouns in the BNC  Nouns tend to occur with up to three of these adjectives with high frequency and low or zero frequency with the rest  My intuitive grammaticality judgments correlate but allow for some unseen combinations and disallow a few observed but very infrequent ones  big, major and great are grammatical with many nouns (but not frequent with most), strong and heavy are ungrammatical with most nouns, high and large intermediate
  • 20. heavy: groupings? magnitude: dew, rainstorm, downpour, rain, rainfall, snowfall, fall, snow, shower: frost, spindrift: clouds, mist, fog: flow, flooding, bleeding, period, traffic: demands, reliance, workload, responsibility, emphasis, dependence: irony, sarcasm, criticism: infestation, soiling: loss, price, cost, expenditure, taxation, fine, penalty, damages, investment: punishment, sentence: fire, bombardment, casualties, defeat, fighting: burden, load, weight, pressure: crop: advertising: use, drinking: magnitude of verb: drinker, smoker: magnitude related? odour, perfume, scent, smell, whiff: lunch: sea, surf, swell:
  • 21. high: groupings? magnitude: esteem, status, regard, reputation, standing, calibre, value, priority; grade, quality, level; proportion, degree, incidence, frequency, number, prevalence, percentage; volume, speed, voltage, pressure, concentration, density, performance, temperature, energy, resolution, dose, wind; risk, cost, price, rate, inflation, tax, taxation, mortality, turnover, wage, income, productivity, unemployment, demand magnitude of verb: earner
  • 22. heavy and high  50 nouns in BNC with the extended magnitude use of heavy with frequency 10 or more  160 such nouns with high  Only 9 such nouns with both adjectives: price, pressure, investment, demand, rainfall, cost, costs, concentration, taxation
  • 23. 2(iii): Possible empirical accounts of distribution 1. Difference in denotation between `extended’ uses of adjectives 2. Grammaticized selectional restrictions/preferences 3. Lexical selection • stipulate Magn function with nouns (Meaning- Text Theory) 4. Semi-productivity / collocation • plus semantic back-off
  • 24. 1 - Denotation account of distribution  Denotation of adjective simply prevents it being possible with the noun. Implies that heavy and high have different denotations heavy’(x) => MF(x) > norm(MF,type(x),c) & precipitation(x) or cost(x) or flow(x) or consumption(x)... (where rain(x) -> precipitation(x) and so on)  But: messy disjunction or multiple senses, open- ended, unlikely to be tractable.  e.g., heavy shower only for rain sense, not bathroom sense  Not falsifiable, but no motivation other than distribution.  Dictionary definitions can be seen as doing this (informally), but none account for observed distribution.  Input to generator?
  • 25. 2 - Selectional restrictions and distribution  Assume the adjectives have the same denotation  Distribution via features in the lexicon  e.g., literal high selects for [ANIMATE false ]  cf., approach used in the ERG for in/on/at in temporal expressions  grammaticized, so doesn’t need to be determined by denotation (though assume consistency)  could utilise qualia structure  Problem: can’t find a reasonable set of cross-cutting features!  Stipulative approach possible, but unattractive.
  • 26. 3 - Lexical selection  MTT approach  noun specifies its Magn adjective  in Mel’čuk and Polguère (1987), Magn is a function, but could modify to make it a set, or vary meanings  could also make adjective specify set of nouns, though not directly in LKB logic  stipulative: if we’re going to do this, why not use a corpus directly?
  • 27. 4- Collocational account of distribution  all the adjectives share a denotation corresponding to magnitude, distribution differences due to collocation, soft rather than hard constraints  linguistically:  adjective-noun combination is semi-productive  denotation and syntax allow heavy esteem etc, but speakers are sensitive to frequencies, prefer more frequent phrases with same meaning  cf morphology and sense extension: Briscoe and Copestake (1999). Blocking (but weaker than with morphology)  anti-collocations as reflection of semi-productivity
  • 28. Collocational account of distribution  computationally,  fits with some current practice: • filter adjective-noun realisations according to n- grams (statistical generation – e.g., Langkilde and Knight, recent experiments with ERG) • use of co-occurrences in WSD  back-off techniques  requires an approach to clustering
  • 29. semantic spaces  acquired from corpora  generally, collect vectors of words which co-occur with the target  best known is LSA: often used in psycholinguistics  more sophisticated models incorporate syntactic relationships  currently sexy, but severe limitations! dog bark house cat dog - 1 0 0 bark 1 - 0 0
  • 30. Back-off and analogy  back-off: decision for infrequent noun with no corpus evidence for specific magnitude adjective  should be partly based on productivity of adjective: number of nouns it occurs with  default to big  back-off also sensitive to word clusters  e.g., heavy spindrift because spindrift is semantically similar to snow  semantic space models: i.e., group according to distribution with other words  hence, adjective has some correlation with semantics of the noun
  • 31. Metaphor  Different metaphors for different nouns (cf., Lakoff et al)  `high’ nouns measured with an upright scale: e.g., temperature: temperature is rising  `heavy’ nouns metaphorically like burden: e.g., workload: her workload is weighing on her  Doesn’t lead to an empirical account of distribution, since we can’t predict classes. Assumption of literal denotation followed by coercion is implausible.  But: extended metaphor idea is consistent with idea that clusters for backoff are based on semantic space
  • 32. Collocation and linguistic theory  Collocation plus semantic space clusters may account for some of the `messy’ bits, at least for some speakers.  in/on transport: in the car, on the bus  Talmy: presence of walkway, `ragged lower end of hierarchy’  but trains without walkway, caravans with walkway?  in/on choice perhaps collocational, not real exception to language- independent schema elements  Potential to simplify linguistic theories considerably.  Success of ngrams, LSA models of priming.  Practically testable: assume same denotation of heavy/high or in/on, see if we can account for distribution in corpus.  Alternative for temporal in/on/at?  Experiments with machine learning temporal in/on/at (Mei Lin, MPhil thesis, 2004): very successful at predicting distribution, but used lots of Treebank-derived features.
  • 33. Summary  Selection in ERG  Other aspects of ERG selection not described here: multiword expressions and idioms  Collocational models as adjunct to TFS encoding  Role of denotation is crucial  Practical considerations about grammar usability
  • 34. Final remarks  Grammar usability:  A good broad-coverage grammar should have an account of denotation of closed-class words at least, but probably not within TFS encoding.  Can we use semantic web languages for non- domain-specific encoding?  Collocational techniques require much further investigation  Can semantic space models be related to denotation (e.g., somehow excluding collocational component)?
  • 35. Idioms Idiom entry: stand+guard := v_nbar_idiom & [ SYNSEM.LOCAL.CONT.RELS <! [ PRED "_stand_v_i_rel" ], [ PRED "_guard_n_i_rel" ] !> ]. Idiomatic lexical entries: guard_n1_i := n_intr_nospr_le & [ STEM < "guard" >, SYNSEM [ LKEYS.KEYREL.PRED "_guard_n_i_rel“ ]]. stand_v1_i := v_np_non_trans_idiom_le & [ STEM < "stand" >, SYNSEM [ LKEYS.KEYREL.PRED "_stand_v_i_rel”]].
  • 36. Idioms in ERG/LKB  Account based on Wasow et al (1982), Nunberg et al (1994).  Idiom entry specifies a set of coindexed MRS relations (coindexation specified by idiom type, e.g., v_nbar_idiom)  Relations may correspond to idiomatic lexical entries (but may be literal uses: e.g., cat out of the bag – literal out of the).  Idiom is recognised if some phrase matches the idiom entry.  Allows for modification: e.g., stand watchful guard
  • 37. Messy examples  among: requires group or plural or ?  among the family (BNC)  among the chaos (BNC)  between: requires plural denoting two objects, but not group (?)  fudge sandwiched between sponge (BNC)  between each tendon (BNC)  ? the actor threw a dart between the couple  * the actor threw a dart between the audience (even if only two people in the audience)