SlideShare a Scribd company logo
Alexander Panchenko
Inducing Interpretable Word
Senses for WSD and Enrichment of
Lexical Resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 2/54
Overview
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54
Inducing word sense representations:
word sense embeddings via retrofitting
[Pelevina et al., 2016, Remus & Biemann, 2018];
inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a,
Ustalov et al., 2018]
inducing semantic classes [Panchenko et al., 2018]
Overview
Overview
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54
Inducing word sense representations:
word sense embeddings via retrofitting
[Pelevina et al., 2016, Remus & Biemann, 2018];
inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a,
Ustalov et al., 2018]
inducing semantic classes [Panchenko et al., 2018]
Making induced senses interpretable
[Panchenko et al., 2017b, Panchenko et al., 2017c]
Overview
Overview
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54
Inducing word sense representations:
word sense embeddings via retrofitting
[Pelevina et al., 2016, Remus & Biemann, 2018];
inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a,
Ustalov et al., 2018]
inducing semantic classes [Panchenko et al., 2018]
Making induced senses interpretable
[Panchenko et al., 2017b, Panchenko et al., 2017c]
Linking induced word senses to lexical
resources [Panchenko, 2016, Faralli et al., 2016,
Panchenko et al., 2017a, Biemann et al., 2018]
Overview
Overview
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 4/54
Inducing word sense representations
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 5/54
Inducing word sense representations
Word vs sense embeddings
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 6/54
Inducing word sense representations
Word vs sense embeddings
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 7/54
Inducing word sense representations
Related work
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 8/54
AutoExtend [Rothe & Schütze, 2015]
* image is reproduced from the original paper
Inducing word sense representations
Related work: knowledge-based
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54
Adagram [Bartunov et al., 2016]
Multiple vector representations θ for each word:
Inducing word sense representations
Related work: knowledge-free
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54
Adagram [Bartunov et al., 2016]
Multiple vector representations θ for each word:
p(Y, Z, β|X, α, θ) =
V∏
w=1
∞∏
k=1
p(βwk|α)
N∏
i=1
[p(zi|xi, β)
C∏
j=1
p(yij|zi, xi, θ)],
zi – a hidden variable: a sense index of word xi in context C;
α – a meta-parameter controlling number of senses.
Inducing word sense representations
Related work: knowledge-free
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54
Adagram [Bartunov et al., 2016]
Multiple vector representations θ for each word:
p(Y, Z, β|X, α, θ) =
V∏
w=1
∞∏
k=1
p(βwk|α)
N∏
i=1
[p(zi|xi, β)
C∏
j=1
p(yij|zi, xi, θ)],
zi – a hidden variable: a sense index of word xi in context C;
α – a meta-parameter controlling number of senses.
See also: [Neelakantan et al., 2014] and [Li and Jurafsky, 2015]
Inducing word sense representations
Related work: knowledge-free
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 10/54
Word sense induction (WSI) based on graph clustering:
[Lin, 1998]
[Pantel and Lin, 2002]
[Widdows and Dorow, 2002]
Chinese Whispers [Biemann, 2006]
[Hope and Keller, 2013]
Inducing word sense representations
Related work: word sense induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 11/54
* source of the image: http://ic.pics.livejournal.com/blagin_anton/33716210/2701748/2701748_800.jpg
Inducing word sense representations
Related work: Chinese Whispers#1
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 12/54
Inducing word sense representations
Related work: Chinese Whispers#2
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 13/54
Inducing word sense representations
Related work: Chinese Whispers#2
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 14/54
Inducing word sense representations
Related work: Chinese Whispers#2
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 15/54
RepL4NLP@ACL’16 [Pelevina et al., 2016], LREC’18 [Remus & Biemann, 2018]
Prior methods:
Induce inventory by clustering of word instances
Use existing sense inventories
Our method:
Input: word embeddings
Output: word sense embeddings
Word sense induction by clustering of word ego-networks
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 16/54
From word embeddings to sense embeddings
Calculate Word
Similarity Graph
Learning Word Vectors
Word Sense Induction
Text Corpus
Word Vectors
Word Similarity Graph
Pooling of Word Vectors
Sense Inventory
Sense Vectors
1 2
4 3
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 17/54
Word sense induction using ego-network clustering
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 18/54
Neighbours of Word and Sense Vectors
Vector Nearest Neighbors
table tray, bottom, diagram, bucket, brackets, stack, basket,
list, parenthesis, cup, saucer, pile, playfield, bracket,
pot, drop-down, cue, plate
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 18/54
Neighbours of Word and Sense Vectors
Vector Nearest Neighbors
table tray, bottom, diagram, bucket, brackets, stack, basket,
list, parenthesis, cup, saucer, pile, playfield, bracket,
pot, drop-down, cue, plate
table#0 leftmost#0, column#1, tableau#1, indent#1, bracket#3,
pointer#0, footer#1, cursor#1, diagram#0, grid#0
table#1 pile#1, stool#1, tray#0, basket#0, bowl#1, bucket#0,
box#0, cage#0, saucer#3, mirror#1, pan#1, lid#0
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 19/54
Word Sense Disambiguation
1 Context extraction: use context words around the target
word
2 Context filtering: based on context word’s relevance for
disambiguation
3 Sense choice in context: maximise similarity between a
context vector and a sense vector
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 20/54
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 21/54
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 22/54
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 23/54
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 24/54
Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]:
comparable to SOTA, incl. Adagram sense embeddings.
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 24/54
Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]:
comparable to SOTA, incl. Adagram sense embeddings.
Semantic relatedness, LREC’2018 [Remus & Biemann, 2018]:
autoextend
adagram
SGNS
.
glove
.
sympat
.
LSAbow
.
LSAhal
.
paragramSL
.
SimLex999 0.45 0.29 0.44 0.37 0.54 0.30 0.27 0.68
MEN 0.72 0.67 0.77 0.73 0.53 0.67 0.71 0.77
SimVerb 0.43 0.27 0.36 0.23 0.37 0.15 0.19 0.53
WordSim353 0.58 0.61 0.70 0.61 0.47 0.67 0.59 0.72
SimLex999-N 0.44 0.33 0.45 0.39 0.48 0.32 0.34 0.68
MEN-N 0.72 0.68 0.77 ____ 0.76 ____ 0.57 ____ 0.71 ____ 0.73 ____ 0.78 ____
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 25/54
Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]:
comparable to SOTA, incl. sense embeddings.
Semantic relatedness, LREC’2018 [Remus & Biemann, 2018]:
autoextend
adagram
SGNS
SGNS+senses
glove
glove+senses
sympat
sympat+senses
LSAbow
LSAbow+senses
LSAhal
LSAhal+senses
paragramSL
paragramSL+senses
SimLex999 0.45 0.29 0.44 0.46 0.37 0.41 0.54 0.55 0.30 0.39 0.27 0.38 0.68 0.64
MEN 0.72 0.67 0.77 0.78 0.73 0.77 0.53 0.68 0.67 0.70 0.71 0.74 0.77 0.80
SimVerb 0.43 0.27 0.36 0.39 0.23 0.30 0.37 0.45 0.15 0.22 0.19 0.28 0.53 0.53
WordSim353 0.58 0.61 0.70 0.69 0.61 0.65 0.47 0.62 0.67 0.66 0.59 0.63 0.72 0.73
SimLex999-N 0.44 0.33 0.45 0.50 0.39 0.47 0.48 0.55 0.32 0.46 0.34 0.44 0.68 0.66
MEN-N 0.72 0.68 0.77 0.79 0.76 0.80 0.57 0.74 0.71 0.73 0.73 0.76 0.78 0.81
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 26/54
Word and sense embeddings
of words iron and vitamin.
LREC’18 [Remus & Biemann, 2018]
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 27/54
ACL’17 [Ustalov et al., 2017b]
Examples of extracted synsets:
Size Synset
2 {decimal point, dot}
3 {gullet, throat, food pipe}
4 {microwave meal, ready meal, TV dinner, frozen dinner}
5 {objective case, accusative case, oblique case, object
case, accusative}
6 {radiotheater, dramatizedaudiobook, audiotheater, ra-
dio play, radio drama, audio play}
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 28/54
Outline of the ’Watset’ method:
Background Corpus
Synonymy Dictionary
Learning

Word
Embeddings
Graph
Construction

Synsets
Word
Similarities
Ambiguous
Weighted Graph
Local
Clustering:
Word Sense Induction
Global
Clustering:

Synset Induction
Sense Inventory
Disambiguation
of

Neighbors

Disambiguated
Weighted Graph
Local-Global
Fuzzy
Graph
Clustering
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 29/54
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 30/54
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 31/54
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 32/54
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
WordNet (English)
F−score
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
BabelNet (English)
F−score
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 32/54
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
WordNet (English)
F−score
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
BabelNet (English)
F−score
CW MCL MaxMax ECO CPM Watset
0.00
0.05
0.10
0.15
0.20
RuWordNet (Russian)
F−score
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
0.4
YARN (Russian)
F−score
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 33/54
Examples of semantic classes:
ID Sense Cluster Hypernyms
1 peach#1, banana#1, pineapple#0, berry#0,
blackberry#0, grapefruit#0, strawberry#0, blue-
berry#0, grape#0, melon#0, orange#0, pear#0,
plum#0, raspberry#0, watermelon#0, apple#0,
apricot#0, …
fruit#0, crop#0, ingredi-
ent#0, food#0, ·
2 C#4, Basic#2, Haskell#5, Flash#1, Java#1, Pas-
cal#0, Ruby#6, PHP#0, Ada#1, Oracle#3, Python#3,
Apache#3, Visual Basic#1, ASP#2, Delphi#2, SQL
Server#0, CSS#0, AJAX#0, the Java#0, …
programming language#3,
technology#0, language#0,
format#2, app#0
Inducing word sense representations
Induction of semantic classes
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 34/54
Text
Corpus
Representing
Senses

with
Ego
Networks
Semantic
Classes
Word
Sense
Induction

from
Text
Corpus
Sense
Graph

Construction
Clustering
of

Word
Senes
Labeling
Sense
Clusters

with
Hypernyms

Induced Word Senses Sense Ego-Networks Global Sense Graph
s
Noisy
Hypernyms
Cleansed
Hypernyms
Induction
of
Semantic
Classes
Global Sense Clusters
Inducing word sense representations
Induction of semantic classes
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 35/54
Filtering noisy hypernyms with semantic classes
LREC’18 [Panchenko et al., 2018]:
fruit#1
food#0

apple#2 mango#0 pear#0
Hypernyms,
Sense Cluster,
mangosteen#0
city#2
Removed
Wrong
Added
Missing
Inducing word sense representations
Induction of sense semantic classes
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 36/54
Filtering of a noisy hypernymy database with semantic classes.
LREC’18 [Panchenko et al., 2018]
Precision Recall F-score
Original Hypernyms (Seitner et al., 2016) 0.475 0.546 0.508
Semantic Classes (coarse-grained) 0.541 0.679 0.602
Inducing word sense representations
Induction of sense semantic classes
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 37/54
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 38/54
Knowledge-based sense representations are interpretable
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 39/54
Most knowledge-free sense representations are uninterpretable
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 40/54
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 41/54
Hypernymy prediction in context. EMNLP’17 [Panchenko et al., 2017b]
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 42/54
11.702 sentences, 863 words with avg.polysemy of 3.1.
WSD Model Accuracy
Inventory Features Hypers HyperHypers
Word Senses Random 0.257 0.610
Word Senses MFS 0.292 0.682
Word Senses Cluster Words 0.291 0.650
Word Senses Context Words 0.308 0.686
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 42/54
11.702 sentences, 863 words with avg.polysemy of 3.1.
WSD Model Accuracy
Inventory Features Hypers HyperHypers
Word Senses Random 0.257 0.610
Word Senses MFS 0.292 0.682
Word Senses Cluster Words 0.291 0.650
Word Senses Context Words 0.308 0.686
Super Senses Random 0.001 0.001
Super Senses MFS 0.001 0.001
Super Senses Cluster Words 0.174 0.365
Super Senses Context Words 0.086 0.188
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 43/54
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 44/54
Text Corpus
Linking
Induced
Senses

to

Senses
of
the
LR


Induce
a
Graph
of
Sem.
Related
Words
Enriched
Lexical Resource
Graph of Related Words
Word
Sense

Induction

Labeling
Senses

with
Hypernyms

Disambiguation

of
Neighbours

Typing
of
the
Unmapped

Induced
Senses



Word Sense Inventory Labeled Word Senses
PCZ
Construction
of
Proto-Conceptualization (PCZ)
Linking
Proto-Conceptualization
to
Lexical
Resource

Part. Linked Senses to the LR
Lexical Resource (LR):
WordNet, BabelNet, ...
Construction
of
sense

feature
representations

Graph of Related Senses
LREC’16 [Panchenko, 2016], ISWC’16 [Faralli et al., 2016],
SENSE@EACL’17 [Panchenko et al., 2017a],
NLE’18 [Biemann et al., 2018]
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 45/54
Word AdaGram BabelNet AdaGram BoW BabelNet BoW
python 2 bn:01713224n perl, php, java, smalltalk, ruby,
lua, tcl, scripting, javascript,
bindings, binding, programming,
coldfusion, actionscript, net, . . .
language, programming, python-
ista, python programming,
python3, python2, level, com-
puter, pythonistas, python3000,
python 1 bn:01157670n monty, circus, spamalot, python,
magoo, muppet, snoopy, fea-
turette, disney, tunes, tune, clas-
sic, shorts, short, apocalypse, . . .
monty, comedy, monty python,
british, monte, monte python,
troupe, pythonesque, foot, artist,
record, surreal, terry, . . .
python 3 bn:00046456n spectacled, unicornis, snake, gi-
ant, caiman, leopard, squirrel,
crocodile, horned, cat, mole, ele-
phant, opossum, pheasant, . . .
molurus, indian, boa, tigris,
tiger python, rock, tiger, indian
python, reptile, python molurus,
indian rock python, coluber, . . .
python 4 bn:01157670n circus, fly, flying, dusk, lizard,
moth, unicorn, puff, adder, vul-
ture, tyrannosaurus, zephyr, bad-
ger, . . .
monty, comedy, monty python,
british, monte, monte python,
troupe, pythonesque, foot, artist,
record, surreal, terry, . . .
python 1 bn:00473212n monty, circus, spamalot, python,
magoo, muppet, snoopy, fea-
turette, disney, tunes, tune, clas-
sic, shorts, short, apocalypse, . . .
pictures, monty, python monty
pictures, limited, company,
python pictures limited, king-
dom, picture, serve, director,
. . .
python 1 bn:03489893n monty, circus, spamalot, python,
magoo, muppet, snoopy, fea-
turette, disney, tunes, tune, clas-
sic, shorts, short, apocalypse, . . .
film, horror, movie, clabaugh,
richard, monster, century, direct,
snake, python movie, television,
giant, natural, language, for-tv,
. . .
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 46/54
Model Representation of the Sense ”disk (medium)”
WordNet memory, device, floppy, disk, hard, disk, disk, computer, science, computing, diskette, fixed, disk,
floppy, magnetic, disc, magnetic, disk, hard, disc, storage, device
WordNet + Linked recorder, disk, floppy, console, diskette, handset, desktop, iPhone, iPod, HDTV, kit, RAM, Discs, Blu-
ray, computer, GB, microchip, site, cartridge, printer, tv, VCR, Disc, player, LCD, software, component,
camcorder, cellphone, card, monitor, display, burner, Web, stereo, internet, model, iTunes, turntable,
chip, cable, camera, iphone, notebook, device, server, surface, wafer, page, drive, laptop, screen,
pc, television, hardware, YouTube, dvr, DVD, product, folder, VCR, radio, phone, circuitry, partition,
megabyte, peripheral, format, machine, tuner, website, merchandise, equipment, gb, discs, MP3,
hard-drive, piece, video, storage device, memory device, microphone, hd, EP, content, soundtrack,
webcam, system, blade, graphic, microprocessor, collection, document, programming, battery, key-
board, HD, handheld, CDs, reel, web, material, hard-disk, ep, chart, debut, configuration, recording,
album, broadcast, download, fixed disk, planet, pda, microfilm, iPod, videotape, text, cylinder, cpu,
canvas, label, sampler, workstation, electrode, magnetic disc, catheter, magnetic disk, Video, mo-
bile, cd, song, modem, mouse, tube, set, ipad, signal, substrate, vinyl, music, clip, pad, audio, com-
pilation, memory, message, reissue, ram, CD, subsystem, hdd, touchscreen, electronics, demo, shell,
sensor, file, shelf, processor, cassette, extra, mainframe, motherboard, floppy disk, lp, tape, version,
kilobyte, pacemaker, browser, Playstation, pager, module, cache, DVD, movie, Windows, cd-rom, e-
book, valve, directory, harddrive, smartphone, audiotape, technology, hard disk, show, computing,
computer science, Blu-Ray, blu-ray, HDD, HD-DVD, scanner, hard disc, gadget, booklet, copier, play-
back, TiVo, controller, filter, DVDs, gigabyte, paper, mp3, CPU, dvd-r, pipe, cd-r, playlist, slot, VHS, film,
videocassette, interface, adapter, database, manual, book, channel, changer, storage
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 47/54
Evaluation of linking accuracy:
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 48/54
Evaluation of enriched representations based on WSD:
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 49/54
Conclusion
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 50/54
Conclusion
Vectors + Graphs = ♡
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54
We can induce word senses, synsets and semantic classes in
a knowledge-free way using graph clustering and
distributional models.
Conclusion
Take home messages
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54
We can induce word senses, synsets and semantic classes in
a knowledge-free way using graph clustering and
distributional models.
We can make the induced word senses interpretable in a
knowledge-free way with hypernyms, images, definitions.
Conclusion
Take home messages
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54
We can induce word senses, synsets and semantic classes in
a knowledge-free way using graph clustering and
distributional models.
We can make the induced word senses interpretable in a
knowledge-free way with hypernyms, images, definitions.
We can link induced senses to lexical resources to
improve performance of WSD;
enrich lexical resources with emerging senses.
Conclusion
Take home messages
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 52/54
Participate in an ACL SIGSLAV sponsored shared task on
word sense induction and disambiguation for Russian!
More details: http://russe.nlpub.org/2018/wsi
Conclusion
An ongoing shared task on WSI&D
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 53/54
Thank you! Questions?
This research was supported by
Conclusion
Acknowledgments
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Evaluation on SemEval 2013 Task 13 WSI&D:
Model Jacc. Tau WNDCG F.NMI F.B-Cubed
AI-KU (add1000) 0.176 0.609 0.205 0.033 0.317
AI-KU 0.176 0.619 0.393 0.066 0.382
AI-KU (remove5-add1000) 0.228 0.654 0.330 0.040 0.463
Unimelb (5p) 0.198 0.623 0.374 0.056 0.475
Unimelb (50k) 0.198 0.633 0.384 0.060 0.494
UoS (#WN senses) 0.171 0.600 0.298 0.046 0.186
UoS (top-3) 0.220 0.637 0.370 0.044 0.451
La Sapienza (1) 0.131 0.544 0.332 – –
La Sapienza (2) 0.131 0.535 0.394 – –
AdaGram, α = 0.05, 100 dim 0.274 0.644 0.318 0.058 0.470
w2v 0.197 0.615 0.291 0.011 0.615
w2v (nouns) 0.179 0.626 0.304 0.011 0.623
JBT 0.205 0.624 0.291 0.017 0.598
JBT (nouns) 0.198 0.643 0.310 0.031 0.595
TWSI (nouns) 0.215 0.651 0.318 0.030 0.573
Conclusion
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Bartunov, S., Kondrashkin, D., Osokin, A., & Vetrov, D. (2016).
Breaking sticks and ambiguities with adaptive skip-gram.
In Artificial Intelligence and Statistics (pp. 130–138).
Biemann, C., Faralli, S., Panchenko, A., & Ponzetto, S. P. (2018).
A framework for enriching lexical semantic resources with
distributional semantics.
In Journal of Natural Language Engineering (pp. 56–64).:
Cambridge Press.
Faralli, S., Panchenko, A., Biemann, C., & Ponzetto, S. P. (2016).
Linked disambiguated distributional semantic networks.
In International Semantic Web Conference (pp. 56–64).:
Springer.
Panchenko, A. (2016).
Best of both worlds: Making word sense embeddings
interpretable.
In LREC.
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Panchenko, A., Faralli, S., Ponzetto, S. P., & Biemann, C.
(2017a).
Using linked disambiguated distributional networks for word
sense disambiguation.
In Proceedings of the 1st Workshop on Sense, Concept and
Entity Representations and their Applications (pp. 72–78).
Valencia, Spain: Association for Computational Linguistics.
Panchenko, A., Marten, F., Ruppert, E., Faralli, S., Ustalov, D.,
Ponzetto, S. P., & Biemann, C. (2017b).
Unsupervised, knowledge-free, and interpretable word sense
disambiguation.
In Proceedings of the 2017 Conference on Empirical Methods in
Natural Language Processing: System Demonstrations (pp.
91–96). Copenhagen, Denmark: Association for
Computational Linguistics.
Panchenko, A., Ruppert, E., Faralli, S., Ponzetto, S. P., &
Biemann, C. (2017c).
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Unsupervised does not mean uninterpretable: The case for
word sense induction and disambiguation.
In Proceedings of the 15th Conference of the European Chapter
of the Association for Computational Linguistics: Volume 1,
Long Papers (pp. 86–98). Valencia, Spain: Association for
Computational Linguistics.
Panchenko, A., Ustalov, D., Faralli, S., Ponzetto, S. P., &
Biemann, C. (2018).
Improving hypernymy extraction with distributional
semantic classes.
In Proceedings of the LREC 2018 Miyazaki, Japan: European
Language Resources Association.
Pelevina, M., Arefiev, N., Biemann, C., & Panchenko, A. (2016).
Making sense of word embeddings.
In Proceedings of the 1st Workshop on Representation
Learning for NLP (pp. 174–183). Berlin, Germany: Association
for Computational Linguistics.
Remus, S. & Biemann, C. (2018).
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Retrofittingword representations for unsupervised sense
aware word similarities.
In Proceedings of the LREC 2018 Miyazaki, Japan: European
Language Resources Association.
Rothe, S. & Schütze, H. (2015).
Autoextend: Extending word embeddings to embeddings for
synsets and lexemes.
In Proceedings of the 53rd Annual Meeting of the Association
for Computational Linguistics and the 7th International Joint
Conference on Natural Language Processing (Volume 1: Long
Papers) (pp. 1793–1803). Beijing, China: Association for
Computational Linguistics.
Ustalov, D., Chernoskutov, M., Biemann, C., & Panchenko, A.
(2017a).
Fighting with the sparsity of synonymy dictionaries for
automatic synset induction.
In International Conference on Analysis of Images, Social
Networks and Texts (pp. 94–105).: Springer.
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Ustalov, D., Panchenko, A., & Biemann, C. (2017b).
Watset: Automatic induction of synsets from a graph of
synonyms.
In Proceedings of the 55th Annual Meeting of the Association
for Computational Linguistics (Volume 1: Long Papers) (pp.
1579–1590). Vancouver, Canada: Association for
Computational Linguistics.
Ustalov, D., Teslenko, D., Panchenko, A., Chernoskutov, M., &
Biemann, C. (2018).
Word sense disambiguation based on automatically induced
synsets.
In LREC 2018, 11th International Conference on Language
Resources and Evaluation : 7-12 May 2018, Miyazaki (Japan)
(pp. tba). Paris: European Language Resources Association,
ELRA-ELDA.
Accepted for publication.

More Related Content

More from Alexander Panchenko

IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
Alexander Panchenko
 
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Alexander Panchenko
 
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
Alexander Panchenko
 
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Using Linked Disambiguated Distributional Networks for Word Sense DisambiguationUsing Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Alexander Panchenko
 
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Alexander Panchenko
 
Making Sense of Word Embeddings
Making Sense of Word EmbeddingsMaking Sense of Word Embeddings
Making Sense of Word Embeddings
Alexander Panchenko
 
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Alexander Panchenko
 
Getting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part IIGetting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part II
Alexander Panchenko
 
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
Alexander Panchenko
 
Text Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataText Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataAlexander Panchenko
 
Неологизмы в социальной сети Фейсбук
Неологизмы в социальной сети ФейсбукНеологизмы в социальной сети Фейсбук
Неологизмы в социальной сети ФейсбукAlexander Panchenko
 
Sentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking FacebookSentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking Facebook
Alexander Panchenko
 
Similarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation ExtractionSimilarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation Extraction
Alexander Panchenko
 
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Alexander Panchenko
 
Detecting Gender by Full Name: Experiments with the Russian Language
Detecting Gender by Full Name:  Experiments with the Russian LanguageDetecting Gender by Full Name:  Experiments with the Russian Language
Detecting Gender by Full Name: Experiments with the Russian Language
Alexander Panchenko
 
Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...
Alexander Panchenko
 
Semantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation ExtractionSemantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation ExtractionAlexander Panchenko
 

More from Alexander Panchenko (18)

IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
 
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
 
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
 
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Using Linked Disambiguated Distributional Networks for Word Sense DisambiguationUsing Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation
 
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
 
Making Sense of Word Embeddings
Making Sense of Word EmbeddingsMaking Sense of Word Embeddings
Making Sense of Word Embeddings
 
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
 
Getting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part IIGetting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part II
 
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
 
Text Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataText Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK Data
 
Неологизмы в социальной сети Фейсбук
Неологизмы в социальной сети ФейсбукНеологизмы в социальной сети Фейсбук
Неологизмы в социальной сети Фейсбук
 
Sentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking FacebookSentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking Facebook
 
Similarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation ExtractionSimilarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation Extraction
 
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
 
Detecting Gender by Full Name: Experiments with the Russian Language
Detecting Gender by Full Name:  Experiments with the Russian LanguageDetecting Gender by Full Name:  Experiments with the Russian Language
Detecting Gender by Full Name: Experiments with the Russian Language
 
Document
DocumentDocument
Document
 
Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...
 
Semantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation ExtractionSemantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation Extraction
 

Recently uploaded

role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
sanjana502982
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
ronaldlakony0
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
fafyfskhan251kmf
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 

Recently uploaded (20)

role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 

Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources

  • 1. Alexander Panchenko Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources
  • 2. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 2/54 Overview
  • 3. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54 Inducing word sense representations: word sense embeddings via retrofitting [Pelevina et al., 2016, Remus & Biemann, 2018]; inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a, Ustalov et al., 2018] inducing semantic classes [Panchenko et al., 2018] Overview Overview
  • 4. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54 Inducing word sense representations: word sense embeddings via retrofitting [Pelevina et al., 2016, Remus & Biemann, 2018]; inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a, Ustalov et al., 2018] inducing semantic classes [Panchenko et al., 2018] Making induced senses interpretable [Panchenko et al., 2017b, Panchenko et al., 2017c] Overview Overview
  • 5. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54 Inducing word sense representations: word sense embeddings via retrofitting [Pelevina et al., 2016, Remus & Biemann, 2018]; inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a, Ustalov et al., 2018] inducing semantic classes [Panchenko et al., 2018] Making induced senses interpretable [Panchenko et al., 2017b, Panchenko et al., 2017c] Linking induced word senses to lexical resources [Panchenko, 2016, Faralli et al., 2016, Panchenko et al., 2017a, Biemann et al., 2018] Overview Overview
  • 6. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 4/54 Inducing word sense representations
  • 7. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 5/54 Inducing word sense representations Word vs sense embeddings
  • 8. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 6/54 Inducing word sense representations Word vs sense embeddings
  • 9. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 7/54 Inducing word sense representations Related work
  • 10. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 8/54 AutoExtend [Rothe & Schütze, 2015] * image is reproduced from the original paper Inducing word sense representations Related work: knowledge-based
  • 11. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54 Adagram [Bartunov et al., 2016] Multiple vector representations θ for each word: Inducing word sense representations Related work: knowledge-free
  • 12. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54 Adagram [Bartunov et al., 2016] Multiple vector representations θ for each word: p(Y, Z, β|X, α, θ) = V∏ w=1 ∞∏ k=1 p(βwk|α) N∏ i=1 [p(zi|xi, β) C∏ j=1 p(yij|zi, xi, θ)], zi – a hidden variable: a sense index of word xi in context C; α – a meta-parameter controlling number of senses. Inducing word sense representations Related work: knowledge-free
  • 13. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54 Adagram [Bartunov et al., 2016] Multiple vector representations θ for each word: p(Y, Z, β|X, α, θ) = V∏ w=1 ∞∏ k=1 p(βwk|α) N∏ i=1 [p(zi|xi, β) C∏ j=1 p(yij|zi, xi, θ)], zi – a hidden variable: a sense index of word xi in context C; α – a meta-parameter controlling number of senses. See also: [Neelakantan et al., 2014] and [Li and Jurafsky, 2015] Inducing word sense representations Related work: knowledge-free
  • 14. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 10/54 Word sense induction (WSI) based on graph clustering: [Lin, 1998] [Pantel and Lin, 2002] [Widdows and Dorow, 2002] Chinese Whispers [Biemann, 2006] [Hope and Keller, 2013] Inducing word sense representations Related work: word sense induction
  • 15. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 11/54 * source of the image: http://ic.pics.livejournal.com/blagin_anton/33716210/2701748/2701748_800.jpg Inducing word sense representations Related work: Chinese Whispers#1
  • 16. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 12/54 Inducing word sense representations Related work: Chinese Whispers#2
  • 17. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 13/54 Inducing word sense representations Related work: Chinese Whispers#2
  • 18. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 14/54 Inducing word sense representations Related work: Chinese Whispers#2
  • 19. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 15/54 RepL4NLP@ACL’16 [Pelevina et al., 2016], LREC’18 [Remus & Biemann, 2018] Prior methods: Induce inventory by clustering of word instances Use existing sense inventories Our method: Input: word embeddings Output: word sense embeddings Word sense induction by clustering of word ego-networks Inducing word sense representations Sense embeddings using retrofitting
  • 20. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 16/54 From word embeddings to sense embeddings Calculate Word Similarity Graph Learning Word Vectors Word Sense Induction Text Corpus Word Vectors Word Similarity Graph Pooling of Word Vectors Sense Inventory Sense Vectors 1 2 4 3 Inducing word sense representations Sense embeddings using retrofitting
  • 21. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 17/54 Word sense induction using ego-network clustering Inducing word sense representations Sense embeddings using retrofitting
  • 22. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 18/54 Neighbours of Word and Sense Vectors Vector Nearest Neighbors table tray, bottom, diagram, bucket, brackets, stack, basket, list, parenthesis, cup, saucer, pile, playfield, bracket, pot, drop-down, cue, plate Inducing word sense representations Sense embeddings using retrofitting
  • 23. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 18/54 Neighbours of Word and Sense Vectors Vector Nearest Neighbors table tray, bottom, diagram, bucket, brackets, stack, basket, list, parenthesis, cup, saucer, pile, playfield, bracket, pot, drop-down, cue, plate table#0 leftmost#0, column#1, tableau#1, indent#1, bracket#3, pointer#0, footer#1, cursor#1, diagram#0, grid#0 table#1 pile#1, stool#1, tray#0, basket#0, bowl#1, bucket#0, box#0, cage#0, saucer#3, mirror#1, pan#1, lid#0 Inducing word sense representations Sense embeddings using retrofitting
  • 24. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 19/54 Word Sense Disambiguation 1 Context extraction: use context words around the target word 2 Context filtering: based on context word’s relevance for disambiguation 3 Sense choice in context: maximise similarity between a context vector and a sense vector Inducing word sense representations Sense embeddings using retrofitting
  • 25. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 20/54 Inducing word sense representations Sense embeddings using retrofitting
  • 26. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 21/54 Inducing word sense representations Sense embeddings using retrofitting
  • 27. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 22/54 Inducing word sense representations Sense embeddings using retrofitting
  • 28. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 23/54 Inducing word sense representations Sense embeddings using retrofitting
  • 29. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 24/54 Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]: comparable to SOTA, incl. Adagram sense embeddings. Inducing word sense representations Sense embeddings using retrofitting
  • 30. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 24/54 Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]: comparable to SOTA, incl. Adagram sense embeddings. Semantic relatedness, LREC’2018 [Remus & Biemann, 2018]: autoextend adagram SGNS . glove . sympat . LSAbow . LSAhal . paragramSL . SimLex999 0.45 0.29 0.44 0.37 0.54 0.30 0.27 0.68 MEN 0.72 0.67 0.77 0.73 0.53 0.67 0.71 0.77 SimVerb 0.43 0.27 0.36 0.23 0.37 0.15 0.19 0.53 WordSim353 0.58 0.61 0.70 0.61 0.47 0.67 0.59 0.72 SimLex999-N 0.44 0.33 0.45 0.39 0.48 0.32 0.34 0.68 MEN-N 0.72 0.68 0.77 ____ 0.76 ____ 0.57 ____ 0.71 ____ 0.73 ____ 0.78 ____ Inducing word sense representations Sense embeddings using retrofitting
  • 31. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 25/54 Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]: comparable to SOTA, incl. sense embeddings. Semantic relatedness, LREC’2018 [Remus & Biemann, 2018]: autoextend adagram SGNS SGNS+senses glove glove+senses sympat sympat+senses LSAbow LSAbow+senses LSAhal LSAhal+senses paragramSL paragramSL+senses SimLex999 0.45 0.29 0.44 0.46 0.37 0.41 0.54 0.55 0.30 0.39 0.27 0.38 0.68 0.64 MEN 0.72 0.67 0.77 0.78 0.73 0.77 0.53 0.68 0.67 0.70 0.71 0.74 0.77 0.80 SimVerb 0.43 0.27 0.36 0.39 0.23 0.30 0.37 0.45 0.15 0.22 0.19 0.28 0.53 0.53 WordSim353 0.58 0.61 0.70 0.69 0.61 0.65 0.47 0.62 0.67 0.66 0.59 0.63 0.72 0.73 SimLex999-N 0.44 0.33 0.45 0.50 0.39 0.47 0.48 0.55 0.32 0.46 0.34 0.44 0.68 0.66 MEN-N 0.72 0.68 0.77 0.79 0.76 0.80 0.57 0.74 0.71 0.73 0.73 0.76 0.78 0.81 Inducing word sense representations Sense embeddings using retrofitting
  • 32. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 26/54 Word and sense embeddings of words iron and vitamin. LREC’18 [Remus & Biemann, 2018] Inducing word sense representations Sense embeddings using retrofitting
  • 33. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 27/54 ACL’17 [Ustalov et al., 2017b] Examples of extracted synsets: Size Synset 2 {decimal point, dot} 3 {gullet, throat, food pipe} 4 {microwave meal, ready meal, TV dinner, frozen dinner} 5 {objective case, accusative case, oblique case, object case, accusative} 6 {radiotheater, dramatizedaudiobook, audiotheater, ra- dio play, radio drama, audio play} Inducing word sense representations Synset induction
  • 34. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 28/54 Outline of the ’Watset’ method: Background Corpus Synonymy Dictionary Learning Word Embeddings Graph Construction Synsets Word Similarities Ambiguous Weighted Graph Local Clustering: Word Sense Induction Global Clustering: Synset Induction Sense Inventory Disambiguation of Neighbors Disambiguated Weighted Graph Local-Global Fuzzy Graph Clustering Inducing word sense representations Synset induction
  • 35. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 29/54 Inducing word sense representations Synset induction
  • 36. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 30/54 Inducing word sense representations Synset induction
  • 37. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 31/54 Inducing word sense representations Synset induction
  • 38. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 32/54 CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 WordNet (English) F−score CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 BabelNet (English) F−score Inducing word sense representations Synset induction
  • 39. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 32/54 CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 WordNet (English) F−score CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 BabelNet (English) F−score CW MCL MaxMax ECO CPM Watset 0.00 0.05 0.10 0.15 0.20 RuWordNet (Russian) F−score CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 0.4 YARN (Russian) F−score Inducing word sense representations Synset induction
  • 40. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 33/54 Examples of semantic classes: ID Sense Cluster Hypernyms 1 peach#1, banana#1, pineapple#0, berry#0, blackberry#0, grapefruit#0, strawberry#0, blue- berry#0, grape#0, melon#0, orange#0, pear#0, plum#0, raspberry#0, watermelon#0, apple#0, apricot#0, … fruit#0, crop#0, ingredi- ent#0, food#0, · 2 C#4, Basic#2, Haskell#5, Flash#1, Java#1, Pas- cal#0, Ruby#6, PHP#0, Ada#1, Oracle#3, Python#3, Apache#3, Visual Basic#1, ASP#2, Delphi#2, SQL Server#0, CSS#0, AJAX#0, the Java#0, … programming language#3, technology#0, language#0, format#2, app#0 Inducing word sense representations Induction of semantic classes
  • 41. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 34/54 Text Corpus Representing Senses with Ego Networks Semantic Classes Word Sense Induction from Text Corpus Sense Graph Construction Clustering of Word Senes Labeling Sense Clusters with Hypernyms Induced Word Senses Sense Ego-Networks Global Sense Graph s Noisy Hypernyms Cleansed Hypernyms Induction of Semantic Classes Global Sense Clusters Inducing word sense representations Induction of semantic classes
  • 42. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 35/54 Filtering noisy hypernyms with semantic classes LREC’18 [Panchenko et al., 2018]: fruit#1 food#0 apple#2 mango#0 pear#0 Hypernyms, Sense Cluster, mangosteen#0 city#2 Removed Wrong Added Missing Inducing word sense representations Induction of sense semantic classes
  • 43. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 36/54 Filtering of a noisy hypernymy database with semantic classes. LREC’18 [Panchenko et al., 2018] Precision Recall F-score Original Hypernyms (Seitner et al., 2016) 0.475 0.546 0.508 Semantic Classes (coarse-grained) 0.541 0.679 0.602 Inducing word sense representations Induction of sense semantic classes
  • 44. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 37/54 Making induced senses interpretable
  • 45. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 38/54 Knowledge-based sense representations are interpretable Making induced senses interpretable Making induced senses interpretable
  • 46. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 39/54 Most knowledge-free sense representations are uninterpretable Making induced senses interpretable Making induced senses interpretable
  • 47. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 40/54 Making induced senses interpretable Making induced senses interpretable
  • 48. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 41/54 Hypernymy prediction in context. EMNLP’17 [Panchenko et al., 2017b] Making induced senses interpretable Making induced senses interpretable
  • 49. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 42/54 11.702 sentences, 863 words with avg.polysemy of 3.1. WSD Model Accuracy Inventory Features Hypers HyperHypers Word Senses Random 0.257 0.610 Word Senses MFS 0.292 0.682 Word Senses Cluster Words 0.291 0.650 Word Senses Context Words 0.308 0.686 Making induced senses interpretable Making induced senses interpretable
  • 50. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 42/54 11.702 sentences, 863 words with avg.polysemy of 3.1. WSD Model Accuracy Inventory Features Hypers HyperHypers Word Senses Random 0.257 0.610 Word Senses MFS 0.292 0.682 Word Senses Cluster Words 0.291 0.650 Word Senses Context Words 0.308 0.686 Super Senses Random 0.001 0.001 Super Senses MFS 0.001 0.001 Super Senses Cluster Words 0.174 0.365 Super Senses Context Words 0.086 0.188 Making induced senses interpretable Making induced senses interpretable
  • 51. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 43/54 Linking induced senses to resources
  • 52. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 44/54 Text Corpus Linking Induced Senses to Senses of the LR Induce a Graph of Sem. Related Words Enriched Lexical Resource Graph of Related Words Word Sense Induction Labeling Senses with Hypernyms Disambiguation of Neighbours Typing of the Unmapped Induced Senses Word Sense Inventory Labeled Word Senses PCZ Construction of Proto-Conceptualization (PCZ) Linking Proto-Conceptualization to Lexical Resource Part. Linked Senses to the LR Lexical Resource (LR): WordNet, BabelNet, ... Construction of sense feature representations Graph of Related Senses LREC’16 [Panchenko, 2016], ISWC’16 [Faralli et al., 2016], SENSE@EACL’17 [Panchenko et al., 2017a], NLE’18 [Biemann et al., 2018] Linking induced senses to resources Linking induced senses to resources
  • 53. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 45/54 Word AdaGram BabelNet AdaGram BoW BabelNet BoW python 2 bn:01713224n perl, php, java, smalltalk, ruby, lua, tcl, scripting, javascript, bindings, binding, programming, coldfusion, actionscript, net, . . . language, programming, python- ista, python programming, python3, python2, level, com- puter, pythonistas, python3000, python 1 bn:01157670n monty, circus, spamalot, python, magoo, muppet, snoopy, fea- turette, disney, tunes, tune, clas- sic, shorts, short, apocalypse, . . . monty, comedy, monty python, british, monte, monte python, troupe, pythonesque, foot, artist, record, surreal, terry, . . . python 3 bn:00046456n spectacled, unicornis, snake, gi- ant, caiman, leopard, squirrel, crocodile, horned, cat, mole, ele- phant, opossum, pheasant, . . . molurus, indian, boa, tigris, tiger python, rock, tiger, indian python, reptile, python molurus, indian rock python, coluber, . . . python 4 bn:01157670n circus, fly, flying, dusk, lizard, moth, unicorn, puff, adder, vul- ture, tyrannosaurus, zephyr, bad- ger, . . . monty, comedy, monty python, british, monte, monte python, troupe, pythonesque, foot, artist, record, surreal, terry, . . . python 1 bn:00473212n monty, circus, spamalot, python, magoo, muppet, snoopy, fea- turette, disney, tunes, tune, clas- sic, shorts, short, apocalypse, . . . pictures, monty, python monty pictures, limited, company, python pictures limited, king- dom, picture, serve, director, . . . python 1 bn:03489893n monty, circus, spamalot, python, magoo, muppet, snoopy, fea- turette, disney, tunes, tune, clas- sic, shorts, short, apocalypse, . . . film, horror, movie, clabaugh, richard, monster, century, direct, snake, python movie, television, giant, natural, language, for-tv, . . . Linking induced senses to resources Linking induced senses to resources
  • 54. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 46/54 Model Representation of the Sense ”disk (medium)” WordNet memory, device, floppy, disk, hard, disk, disk, computer, science, computing, diskette, fixed, disk, floppy, magnetic, disc, magnetic, disk, hard, disc, storage, device WordNet + Linked recorder, disk, floppy, console, diskette, handset, desktop, iPhone, iPod, HDTV, kit, RAM, Discs, Blu- ray, computer, GB, microchip, site, cartridge, printer, tv, VCR, Disc, player, LCD, software, component, camcorder, cellphone, card, monitor, display, burner, Web, stereo, internet, model, iTunes, turntable, chip, cable, camera, iphone, notebook, device, server, surface, wafer, page, drive, laptop, screen, pc, television, hardware, YouTube, dvr, DVD, product, folder, VCR, radio, phone, circuitry, partition, megabyte, peripheral, format, machine, tuner, website, merchandise, equipment, gb, discs, MP3, hard-drive, piece, video, storage device, memory device, microphone, hd, EP, content, soundtrack, webcam, system, blade, graphic, microprocessor, collection, document, programming, battery, key- board, HD, handheld, CDs, reel, web, material, hard-disk, ep, chart, debut, configuration, recording, album, broadcast, download, fixed disk, planet, pda, microfilm, iPod, videotape, text, cylinder, cpu, canvas, label, sampler, workstation, electrode, magnetic disc, catheter, magnetic disk, Video, mo- bile, cd, song, modem, mouse, tube, set, ipad, signal, substrate, vinyl, music, clip, pad, audio, com- pilation, memory, message, reissue, ram, CD, subsystem, hdd, touchscreen, electronics, demo, shell, sensor, file, shelf, processor, cassette, extra, mainframe, motherboard, floppy disk, lp, tape, version, kilobyte, pacemaker, browser, Playstation, pager, module, cache, DVD, movie, Windows, cd-rom, e- book, valve, directory, harddrive, smartphone, audiotape, technology, hard disk, show, computing, computer science, Blu-Ray, blu-ray, HDD, HD-DVD, scanner, hard disc, gadget, booklet, copier, play- back, TiVo, controller, filter, DVDs, gigabyte, paper, mp3, CPU, dvd-r, pipe, cd-r, playlist, slot, VHS, film, videocassette, interface, adapter, database, manual, book, channel, changer, storage Linking induced senses to resources Linking induced senses to resources
  • 55. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 47/54 Evaluation of linking accuracy: Linking induced senses to resources Linking induced senses to resources
  • 56. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 48/54 Evaluation of enriched representations based on WSD: Linking induced senses to resources Linking induced senses to resources
  • 57. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 49/54 Conclusion
  • 58. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 50/54 Conclusion Vectors + Graphs = ♡
  • 59. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54 We can induce word senses, synsets and semantic classes in a knowledge-free way using graph clustering and distributional models. Conclusion Take home messages
  • 60. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54 We can induce word senses, synsets and semantic classes in a knowledge-free way using graph clustering and distributional models. We can make the induced word senses interpretable in a knowledge-free way with hypernyms, images, definitions. Conclusion Take home messages
  • 61. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54 We can induce word senses, synsets and semantic classes in a knowledge-free way using graph clustering and distributional models. We can make the induced word senses interpretable in a knowledge-free way with hypernyms, images, definitions. We can link induced senses to lexical resources to improve performance of WSD; enrich lexical resources with emerging senses. Conclusion Take home messages
  • 62. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 52/54 Participate in an ACL SIGSLAV sponsored shared task on word sense induction and disambiguation for Russian! More details: http://russe.nlpub.org/2018/wsi Conclusion An ongoing shared task on WSI&D
  • 63. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 53/54 Thank you! Questions? This research was supported by Conclusion Acknowledgments
  • 64. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Evaluation on SemEval 2013 Task 13 WSI&D: Model Jacc. Tau WNDCG F.NMI F.B-Cubed AI-KU (add1000) 0.176 0.609 0.205 0.033 0.317 AI-KU 0.176 0.619 0.393 0.066 0.382 AI-KU (remove5-add1000) 0.228 0.654 0.330 0.040 0.463 Unimelb (5p) 0.198 0.623 0.374 0.056 0.475 Unimelb (50k) 0.198 0.633 0.384 0.060 0.494 UoS (#WN senses) 0.171 0.600 0.298 0.046 0.186 UoS (top-3) 0.220 0.637 0.370 0.044 0.451 La Sapienza (1) 0.131 0.544 0.332 – – La Sapienza (2) 0.131 0.535 0.394 – – AdaGram, α = 0.05, 100 dim 0.274 0.644 0.318 0.058 0.470 w2v 0.197 0.615 0.291 0.011 0.615 w2v (nouns) 0.179 0.626 0.304 0.011 0.623 JBT 0.205 0.624 0.291 0.017 0.598 JBT (nouns) 0.198 0.643 0.310 0.031 0.595 TWSI (nouns) 0.215 0.651 0.318 0.030 0.573 Conclusion Sense embeddings using retrofitting
  • 65. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Bartunov, S., Kondrashkin, D., Osokin, A., & Vetrov, D. (2016). Breaking sticks and ambiguities with adaptive skip-gram. In Artificial Intelligence and Statistics (pp. 130–138). Biemann, C., Faralli, S., Panchenko, A., & Ponzetto, S. P. (2018). A framework for enriching lexical semantic resources with distributional semantics. In Journal of Natural Language Engineering (pp. 56–64).: Cambridge Press. Faralli, S., Panchenko, A., Biemann, C., & Ponzetto, S. P. (2016). Linked disambiguated distributional semantic networks. In International Semantic Web Conference (pp. 56–64).: Springer. Panchenko, A. (2016). Best of both worlds: Making word sense embeddings interpretable. In LREC.
  • 66. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Panchenko, A., Faralli, S., Ponzetto, S. P., & Biemann, C. (2017a). Using linked disambiguated distributional networks for word sense disambiguation. In Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications (pp. 72–78). Valencia, Spain: Association for Computational Linguistics. Panchenko, A., Marten, F., Ruppert, E., Faralli, S., Ustalov, D., Ponzetto, S. P., & Biemann, C. (2017b). Unsupervised, knowledge-free, and interpretable word sense disambiguation. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (pp. 91–96). Copenhagen, Denmark: Association for Computational Linguistics. Panchenko, A., Ruppert, E., Faralli, S., Ponzetto, S. P., & Biemann, C. (2017c).
  • 67. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Unsupervised does not mean uninterpretable: The case for word sense induction and disambiguation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers (pp. 86–98). Valencia, Spain: Association for Computational Linguistics. Panchenko, A., Ustalov, D., Faralli, S., Ponzetto, S. P., & Biemann, C. (2018). Improving hypernymy extraction with distributional semantic classes. In Proceedings of the LREC 2018 Miyazaki, Japan: European Language Resources Association. Pelevina, M., Arefiev, N., Biemann, C., & Panchenko, A. (2016). Making sense of word embeddings. In Proceedings of the 1st Workshop on Representation Learning for NLP (pp. 174–183). Berlin, Germany: Association for Computational Linguistics. Remus, S. & Biemann, C. (2018).
  • 68. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Retrofittingword representations for unsupervised sense aware word similarities. In Proceedings of the LREC 2018 Miyazaki, Japan: European Language Resources Association. Rothe, S. & Schütze, H. (2015). Autoextend: Extending word embeddings to embeddings for synsets and lexemes. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (pp. 1793–1803). Beijing, China: Association for Computational Linguistics. Ustalov, D., Chernoskutov, M., Biemann, C., & Panchenko, A. (2017a). Fighting with the sparsity of synonymy dictionaries for automatic synset induction. In International Conference on Analysis of Images, Social Networks and Texts (pp. 94–105).: Springer.
  • 69. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Ustalov, D., Panchenko, A., & Biemann, C. (2017b). Watset: Automatic induction of synsets from a graph of synonyms. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 1579–1590). Vancouver, Canada: Association for Computational Linguistics. Ustalov, D., Teslenko, D., Panchenko, A., Chernoskutov, M., & Biemann, C. (2018). Word sense disambiguation based on automatically induced synsets. In LREC 2018, 11th International Conference on Language Resources and Evaluation : 7-12 May 2018, Miyazaki (Japan) (pp. tba). Paris: European Language Resources Association, ELRA-ELDA. Accepted for publication.