This document discusses methods for creating word embeddings from text using distributional semantics and dependency structures. It presents experiments comparing word embeddings generated from various dependency structures to traditional Word2Vec models. The embeddings are evaluated using WordNet and VerbNet lexical databases, as well as in a sentiment analysis task. The results show that embeddings incorporating dependency structures outperform topical models on these evaluations, and that certain verb classes and thematic roles yield better embeddings than others. The document concludes that structure-based embeddings capture semantic information better than topical models.
3. Word Embeddings
• From words to dense vectors
• Capturing semantics in a quantifiable vector
• Uses of these embeddings
• Current embedding methods
• Changing the game of natural language processing
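A minimal sketch of what "capturing semantics in a quantifiable vector" means: related words get nearby vectors, and nearness is usually measured with cosine similarity. The toy 3-dimensional vectors below are invented for illustration; real embeddings have hundreds of dimensions.

```python
from math import sqrt

def cosine(u, v):
    """Cosine similarity: 1.0 for identical directions, near 0.0 for unrelated ones."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm

# Toy 3-d embeddings (hypothetical values; real models use 100-300 dimensions).
emb = {
    "cat": [0.9, 0.8, 0.1],
    "dog": [0.8, 0.9, 0.2],
    "car": [0.1, 0.2, 0.9],
}

print(cosine(emb["cat"], emb["dog"]))  # high: related words
print(cosine(emb["cat"], emb["car"]))  # lower: unrelated words
```

With well-trained embeddings, this single number is what similarity benchmarks such as WordSim rank against human judgments.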
8. Dependency Structure Word Embeddings
• Levy & Goldberg, 2014.
• Used only each word's dependency head and dependents as context
• Evaluated on distinguishing similarity (functional) from
relatedness (topical)
• Evaluated on a dataset called WordSim, which consists
mostly of nouns
• Did not examine whether the embeddings produced a better
overall model, nor did they try varying the dependency structures
11. Experiments
• DEP1: the first order dependencies of W
• DEP1H: the first order dependencies of W and the dependency head of W
• DEP12H: the first and second order dependencies of W and the dependency head of W
• DEP1SIB1: the first order dependencies of W, the rightmost sibling of W and the
leftmost sibling of W
• DEP1ALLSIB: the first order dependencies of W, all siblings of W
• DEP1SRLH: the first order dependencies of W, the semantic head of W
• DEP1SRLARG: the first order dependencies of W, all the semantic arguments of W
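The variants above differ only in which context words are gathered for a target word W. The sketch below illustrates three of them over a toy parse. The representation (an array where `heads[i]` is the index of token i's head, -1 for the root) is an assumption for illustration only, and the real contexts would also carry dependency labels.

```python
def dep1(heads, w):
    """DEP1: first-order dependents of token w (tokens whose head is w)."""
    return [i for i, h in enumerate(heads) if h == w]

def dep1h(heads, w):
    """DEP1H: first-order dependents of w plus w's own head."""
    ctx = dep1(heads, w)
    if heads[w] != -1:            # -1 marks the root, which has no head
        ctx.append(heads[w])
    return ctx

def dep1allsib(heads, w):
    """DEP1ALLSIB: first-order dependents of w plus all siblings
    (other tokens that share w's head)."""
    sibs = [i for i, h in enumerate(heads) if h == heads[w] and i != w]
    return dep1(heads, w) + sibs

# "The dog chased the cat": heads[i] is the index of token i's head.
tokens = ["The", "dog", "chased", "the", "cat"]
heads  = [1, 2, -1, 4, 2]         # "chased" is the root

w = 2  # "chased"
print([tokens[i] for i in dep1(heads, w)])   # ['dog', 'cat']
```

The second-order and SRL-based variants extend the same idea: walk one more arc out, or follow semantic-role links instead of syntactic ones.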
21. VerbNet
• Finding the best and worst verb classes
• Finding patterns in the best and worst verb classes
• Thematic role labels
• Semantic restrictions
• Note*: for each verb class VerbNet labels both of the above
• Plotting patterns
Ex. of verb class Adopt-93:
• Members: assume, adopt, take
• Thematic Roles: agent, theme
• Semantic Restrictions: animate, organization
22. Top and Bottom Verb Classes by Average Rank Correlation
• Sorted all verb classes by the average rank correlation of
verbs in that class
• Some verb classes did better than others, but there were no outliers
• many of the top/bottom verb classes were the same for
Word2Vec embeddings and DEP1 embeddings
• What attributes cause certain verb classes to do consistently better?
                           Word2Vec             DEP1
Top Class                  consider-29.9        cooperate-73
Avg. Rank Corr. (Top)      0.1642               0.299
Bottom Class               light_emission-43.1  exhale-40.1.3
Avg. Rank Corr. (Bottom)   -0.0151              -0.1473
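The sorting described above can be sketched in a few lines. The per-verb correlation values below are invented placeholders, not the study's numbers; only the class names come from the slide.

```python
from statistics import mean

# Hypothetical per-verb rank correlations grouped by VerbNet class
# (illustrative values only; the real ones come from the evaluation).
class_scores = {
    "consider-29.9":       [0.21, 0.15, 0.13],
    "cooperate-73":        [0.30, 0.28],
    "light_emission-43.1": [-0.02, 0.00, -0.03],
}

# Sort classes by the mean correlation of their member verbs, best first.
ranked = sorted(class_scores, key=lambda c: mean(class_scores[c]), reverse=True)
print(ranked[0], ranked[-1])
```

Averaging within a class before sorting is what lets whole-class attributes (thematic roles, semantic restrictions) be compared across the ranking.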
25. Extrinsic Evaluation: Sentiment Analysis
• Task: categorize sentences according to their positive
or negative sentiment
• “I hate this movie” VS “This movie is ridiculously good”
• Using Kaggle Challenge data from Rotten Tomatoes Movie
Reviews
• The system is a convolutional neural network that is fed
the word vectors of the words in the sentence
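A full trained CNN is beyond a sketch, but its core operation, sliding a filter over adjacent word vectors and max-pooling into a sentence feature, fits in a few lines. The toy embeddings and the filter weights below are hand-picked for illustration; the actual system learns them from the review data.

```python
# Minimal sketch of convolution over word vectors (hand-picked weights,
# for illustration only; a real CNN learns these from training data).

def conv_max_pool(vectors, filt):
    """Slide a width-2 filter over adjacent word-vector pairs,
    then max-pool over positions into one sentence feature."""
    scores = []
    for i in range(len(vectors) - 1):
        window = vectors[i] + vectors[i + 1]           # concatenate the pair
        scores.append(sum(w * x for w, x in zip(filt, window)))
    return max(scores)

# Toy 2-d embeddings: dimension 0 ~ positivity, dimension 1 ~ negativity.
emb = {"good": [1.0, 0.0], "hate": [0.0, 1.0], "movie": [0.2, 0.2],
       "this": [0.1, 0.1], "i": [0.0, 0.0]}

filt = [1.0, -1.0, 1.0, -1.0]   # fires on positive words, suppressed by negative

def classify(sentence):
    feat = conv_max_pool([emb[w] for w in sentence.lower().split()], filt)
    return "positive" if feat > 0 else "negative"

print(classify("this movie good"))    # positive
print(classify("i hate this movie"))  # negative
```

Because the only inputs are the word vectors, better embeddings translate directly into better sentence features, which is what makes this a useful extrinsic test.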
28. Conclusions
• Structure-based embeddings are better than topic-based
embeddings
• Shown by the evaluations on WordNet, VerbNet and
sentiment analysis task case study
• Different POS capture semantic information from
different sentence structures
• Within verb embeddings: certain subclasses of verbs do
better than others
• Verbs with certain semantic restrictions far outperform other
verb classes