By: Govind Raj
IT/1001227464
Topics of Discussion
 What is WordNet?
 International Scenarios.
 Design fundamentals.
 Major Lexical Relations.
 Uses of WordNet.
 Wordnets in India.
What is WordNet?
 WordNet is a lexical database for the English language based on conceptual look-up.
 George A. Miller began the WordNet project in the mid-1980s.
 It organizes lexical information in terms of word meanings rather than word forms.
 WordNet can also be used as a thesaurus.
Miller on WordNet
 “In terms of coverage, WordNet’s goals differ little from those of a good standard college-level dictionary, and the semantics of WordNet is based on the notion of word sense that lexicographers have traditionally used in writing dictionaries.”
Wordnet: International Scenario
 A wordnet is a network of words linked by lexical and semantic relations.
 The first wordnet in the world was developed for English at Princeton over 15 years.
 EuroWordNet, a linked structure of European-language wordnets, was built in 1998 over 3 years.
 The wordnets for Hindi and Marathi being built at IIT Bombay are among the first Indian-language (IL) wordnets.
 All of these are proposed to be linked into IndoWordnet, which will eventually be linked to the English wordnet and EuroWordNet.
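Psycholinguistic Theory
 Can a canary sing? – a fairly fast response.
 Can a canary fly? – a somewhat slower response.
 Does a canary have skin? – a slow response.
The response slows as the property sits further up the hierarchy: singing is stored with Canary, flying with Bird, and having skin with Animal.
[Figure: concept hierarchy, most general to most specific: Animal, Bird, Canary]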
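The slower responses can be read as longer walks up the hierarchy. The following is a minimal sketch of that idea, assuming a toy network in which each concept stores only its own properties; the names `PARENT`, `PROPERTIES`, and `levels_to_property` are illustrative and do not come from WordNet itself.

```python
# Toy semantic network (illustrative assumption, not WordNet data):
# each concept stores only its own properties plus a link to its parent.
PARENT = {"canary": "bird", "bird": "animal", "animal": None}
PROPERTIES = {
    "canary": {"can sing"},
    "bird": {"can fly"},
    "animal": {"has skin"},
}


def levels_to_property(concept, prop):
    """Return how many parent links are followed before prop is found.

    Returns None if no ancestor of the concept stores the property.
    """
    steps = 0
    node = concept
    while node is not None:
        if prop in PROPERTIES[node]:
            return steps
        node = PARENT[node]
        steps += 1
    return None


if __name__ == "__main__":
    for prop in ("can sing", "can fly", "has skin"):
        print(prop, "->", levels_to_property("canary", prop), "level(s) up")
```

In this sketch, a larger step count stands in for a slower response; it does not model actual reaction times.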
Fundamental Design Questions
 Syntagmatic vs. paradigmatic?
Syntagmatic – words that appear together in a unit.
Paradigmatic – words that are linked in a lexical resource.
When we hear a word, many other words come to mind through association.
For cat:
animal, mammal – paradigmatic
mew, purr, furry – syntagmatic
Major Lexical Relations
Synonymy
Polysemy
Metonymy

Hyponymy/ Hypernymy
Meronymy/ Holonymy
Antonymy
Synonymy
 Different ways of expressing related concepts
 Examples
 cat, feline, Siamese cat
 Overlaps with basic and subordinate levels
 Synonyms are almost never truly substitutable
 Used in different contexts
 Have different implications
 This is a point of contention
Polysemy
 Most words have more than one sense
 Homonym: same sound and/or spelling, different meaning
 bank (river)
 bank (financial)
 Polysemy: different senses of the same word
 That dog has floppy ears.
 She has a good ear for jazz.
 bank (financial) has several related senses
 the building, the institution, the notion of where money is stored
Metonymy
 Use one aspect of something to stand for the whole
 The building stands for the institution of the bank.
 Library stands for a whole set of books.
 Most collective nouns fall under this.
Hyponymy
 IS-A relation
 Related to superordinate and subordinate level categories
 hyponym(robin, bird)
 hyponym(emu, bird)
 hyponym(bird, animal)
 hypernym(animal, bird)
 A is a hypernym of B if B is a type of A
 A is a hyponym of B if A is a type of B
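The two definitions above are inverses of each other, which a short sketch can make concrete. It assumes a toy table with one direct hypernym per word (real WordNet allows several) and follows IS-A links transitively; `HYPERNYM`, `is_hyponym`, and `is_hypernym` are hypothetical names used only for illustration.

```python
# Toy IS-A table built from the examples on this slide.
# Assumption: each word has at most one direct hypernym.
HYPERNYM = {"robin": "bird", "emu": "bird", "bird": "animal"}


def is_hyponym(a, b):
    """True if a is a type of b, directly or through a chain of IS-A links."""
    node = HYPERNYM.get(a)
    while node is not None:
        if node == b:
            return True
        node = HYPERNYM.get(node)
    return False


def is_hypernym(a, b):
    """True if b is a type of a; the inverse of is_hyponym."""
    return is_hyponym(b, a)


if __name__ == "__main__":
    print(is_hyponym("robin", "bird"))     # direct link
    print(is_hyponym("robin", "animal"))   # via bird
    print(is_hypernym("animal", "bird"))   # inverse direction
```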
Holonymy
 Part/whole relation
 meronym(beak, bird)
 meronym(bark, tree)
 holonym(tree, bark)
 Transitive conceptually but not lexically
 The knob is a part of the door.
 The door is a part of the house.
 The knob is a part of the house. (follows conceptually, but is not normally said)
 Holonyms are (approximately) the inverse of meronyms
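The inverse relationship and the conceptual transitivity of part/whole can be sketched as follows. This is a minimal illustration, assuming a hand-written list of (part, whole) pairs taken from the examples above; `MERONYM_PAIRS`, `parts_by_whole`, `holonyms_of`, and `is_part_of` are hypothetical names, not part of any WordNet API.

```python
from collections import defaultdict

# (part, whole) pairs from the slide examples: meronym(part, whole).
MERONYM_PAIRS = [
    ("beak", "bird"),
    ("bark", "tree"),
    ("knob", "door"),
    ("door", "house"),
]


def parts_by_whole(pairs):
    """Index the pairs by whole, so each whole maps to its direct parts."""
    index = defaultdict(set)
    for part, whole in pairs:
        index[whole].add(part)
    return index


def holonyms_of(part, pairs):
    """Direct holonyms of a part: the same pairs read in the other direction."""
    return sorted(whole for p, whole in pairs if p == part)


def is_part_of(part, whole, index):
    """Conceptual (transitive) part-of: search downward through parts of parts."""
    stack = [whole]
    seen = set()
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.add(current)
        for direct_part in index.get(current, ()):
            if direct_part == part:
                return True
            stack.append(direct_part)
    return False


if __name__ == "__main__":
    index = parts_by_whole(MERONYM_PAIRS)
    print(holonyms_of("bark", MERONYM_PAIRS))
    print(is_part_of("knob", "house", index))
```

The pair ("knob", "house") is never stored directly, which mirrors "not lexically"; it is only derived by chaining knob, door, and house.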
Antonymy
 Lexical opposites
 antonym(large, small)
 antonym(big, small)
 antonym(big, little)
 but not large, little
 Many antonymous relations can be reliably detected by looking for statistical correlations in large text collections.
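One simple way to look for such a correlation is to compare how often two words share a sentence with how often that would happen by chance. The sketch below is only an assumption about what such a statistic could look like, not the method used by any particular study; the six-sentence corpus is invented for illustration and `cooccurrence_ratio` is a hypothetical helper.

```python
def cooccurrence_ratio(sentences, w1, w2):
    """Observed count of sentences containing both words, divided by the
    count expected if the two words occurred independently.

    Values well above 1 suggest the words attract each other.
    """
    total = len(sentences)
    tokenized = [set(s.lower().split()) for s in sentences]
    count1 = sum(w1 in tokens for tokens in tokenized)
    count2 = sum(w2 in tokens for tokens in tokenized)
    both = sum(w1 in tokens and w2 in tokens for tokens in tokenized)
    if count1 == 0 or count2 == 0:
        return 0.0
    expected = count1 * count2 / total
    return both / expected


# Invented toy corpus; a real study would use a large text collection.
CORPUS = [
    "the big dog chased the little cat",
    "a large box and a small box",
    "big and small shapes",
    "the large house",
    "a little bird sang",
    "small steps matter",
]

if __name__ == "__main__":
    for pair in (("big", "little"), ("large", "small"), ("large", "little")):
        print(pair, cooccurrence_ratio(CORPUS, *pair))
```

On this toy corpus, "large" and "little" never share a sentence, matching the slide's "but not large, little"; with so few sentences the numbers themselves carry no statistical weight.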
WordNet Sub-Graph (English)
[Figure: sub-graph around the synset {house, home}]
Gloss: A place that serves as the living quarters of one or more families
Hypernymy: dwelling, abode
Meronymy: kitchen, backyard, veranda, bedroom, study, guestroom
Hyponymy: hermitage, cottage
Structure of Wordnet
Uses:
 Word sense disambiguation.
 Information retrieval.
 Automatic text classification.
 Automatic text summarization.
 Machine translation.
 Automatic crossword puzzle generation.
 Improving search engine results.
WordNet: Size
WordNet uses “synsets” – sets of synonymous terms

POS         Unique Strings   Synsets
Noun        114648           79689
Verb        11306            13508
Adjective   21436            18563
Adverb      4669             3664
Totals      152059           115424
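The difference between unique strings and synsets is easier to see with a small data structure. The following is a minimal sketch, assuming a hypothetical `Synset` record; the {house, home} gloss comes from the sub-graph slide, while the two "bank" glosses are written here only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Synset:
    """A set of synonymous terms sharing one meaning (hypothetical record)."""
    pos: str
    lemmas: tuple
    gloss: str


SYNSETS = [
    Synset("noun", ("house", "home"),
           "a place that serves as the living quarters of one or more families"),
    # Illustrative glosses, not quoted from WordNet:
    Synset("noun", ("bank",), "sloping land beside a river"),
    Synset("noun", ("bank",), "a financial institution"),
]


def senses(word):
    """All synsets containing the word; more than one means several senses."""
    return [s for s in SYNSETS if word in s.lemmas]


def synonyms(word):
    """Other lemmas that share at least one synset with the word."""
    return sorted({lemma for s in senses(word) for lemma in s.lemmas if lemma != word})


def unique_strings():
    """Distinct lemmas across all synsets, analogous to the table's first column."""
    return sorted({lemma for s in SYNSETS for lemma in s.lemmas})


if __name__ == "__main__":
    print(len(senses("bank")), "senses of bank")
    print(synonyms("house"))
    print(len(unique_strings()), "unique strings,", len(SYNSETS), "synsets")
```

A word with several senses appears in several synsets, and one synset can hold several words, so the two columns of the table count different things.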
Linked Wordnets in India
[Figure: linked wordnets]
 Bengali Wordnet
 Dravidian Language Wordnets
 Sanskrit Wordnet
 Punjabi Wordnet
 Hindi Wordnet
 North East Language Wordnet
 Konkani Wordnet
 Marathi Wordnet
 English Wordnet