SlideShare a Scribd company logo
Distributional models
of word meaning
Alessandro Lenci
Annual Review of Linguistics, 4(1), 151–
171, 2018
Overview
• Explain what the distributional hypothesis is
• Briefly introduce major ways to generate
distributional representations of words
‣ I focus on the two most popular ways (Count/
Prediction) and add more materials to explain the
latter way
• Summarise the common challenges with
distributional representations of words
4
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
• Challenges
Distributional Hypothesis
• “Lexemes with similar linguistic contexts have
similar meanings” (Lenci, 2018: p. 152)
• One of the ways to give the definition of word
meaning
5
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
• Challenges
“I found a wonderful restaurant yesterday!”
“I found a fantastic restaurant yesterday!”
Looks like they have
a similar meaning
Target word
Context words
(latter)
Context words
(former)
Distributional Hypothesis
• Distributional hypothesis (DH) forms the theoretical
foundation of distributional semantics (aka vector
space semantics)
• Lenci (2008) pointed out two levels of DH:
‣ Weak DH: only assumes correlations between
semantics and word distributions
‣ Strong DH: also assumes DH is a cognitive
hypothesis
6
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
• Challenges
Distributed vs Distributional
• According to Ferrone and Zanzotto (2017),
distributed representations contain distributional
representations
• Distributed: ways to represent each word by a
vector with several dimensions instead of a symbolic
vector (e.g., one-hot vectors)
• Distributional: ways to represent each word by a
vector with several dimensions based on
distributional hypothesis
7
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
• Challenges
• The two major ways to
generate distributional
representations:
‣ Count models
‣ Prediction models
• The paper shows an
introduction of count
models (Sec 3.2)
8
Taxonomy of Methods:
The Method of Learning
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
‣ The Method of Learning
‣ The Type of Context
• Models
• Challenges
9
Taxonomy of Methods:
The Type of Context
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
‣ The Method of Learning
‣ The Type of Context
• Models
• Challenges
e.g., My example of
wonderful/fantastic.
Most popular
taking into account syntactically
dependent words only (e.g.,
predicate argument structure?)
Imagine
TFIDF
10
Count models
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
‣ Count models
‣ Prediction models
• Challenges
Co-occurrence matrix
Enhance significance to reflect
the importance of the contexts
For example, taking:
Count models
• Use some methods to obtain latent features among
columns in explicit vectors → implicit vectors
• One easy example is to apply dimensionality-
reduction techniques like singular value
decomposition or principal components analysis
• See Table 2 of Lenci (2018) for famous tools
‣ GloVe (Pennington+, 2014), which is based on
weighted least-squares regression, is the most
popular word representations among count models
11
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
‣ Count models
‣ Prediction models
• Challenges
Prediction models
• As known as word vectors, word embeddings, and
distributed embeddings
• Learn word representations using a neural network
model while the model is learning a language model
• The most famous tool of this category is word2vec
‣ The rest of the papers I will explain today are also
based on prediction models
12
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
‣ Count models
‣ Prediction models
• Challenges
Prediction models
• Language model
‣ A probability distribution over sequences of words
‣ This assigns a probability of given sequences
• Neural language model
‣ A language model using neural network
‣ Given sequences, it predicts the next word of the
sequences
13
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
‣ Count models
‣ Prediction models
• Challenges
14
Prediction models
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
‣ Count models
‣ Prediction models
• Challenges
have a nice
one
one
this has the same number
of units as total word types
and represents a probability
distribution of the next word
…
Output layer
Hidden layers
Embedding layer
Modified from 坪井+ (2017) 『深層学習による自然言語処理』
E
Win
Whid
Wout
These parts become word
vectors during training
An example of feed forward
neural network language model
15
Prediction models
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
‣ Count models
‣ Prediction models
• Challenges
have a nice
one
one …
Output layer
Embedding layer
Modified from 坪井+ (2017) 『深層学習による自然言語処理』
E
word2vec
No hidden layers
Just taking inner products
But with bunch of
optimisation techniques
Challenges
• Distributional semantic models tend to mix up
various types of semantic similarity/relatedness
‣ No distinction among hypernymy, antonymy,
meronymy, locative relations and topical relations
• How to represent larger linguistic units than word
16
• Overview
• Distributional Hypothesis
• Taxonomy of Methods
• Models
• Challenges

More Related Content

Similar to A Review of Distributional models of word meaning (Lenci, 2018)

Discourse analysis (Schmitt's book chapter 4)
Discourse analysis (Schmitt's book chapter 4)Discourse analysis (Schmitt's book chapter 4)
Discourse analysis (Schmitt's book chapter 4)Samira Rahmdel
 
Drifting distributions? Possibilities and risks of using distributional seman...
Drifting distributions? Possibilities and risks of using distributional seman...Drifting distributions? Possibilities and risks of using distributional seman...
Drifting distributions? Possibilities and risks of using distributional seman...
Antske Fokkens
 
Chapter Seven Research Questions and Hypotheses
Chapter Seven Research Questions and HypothesesChapter Seven Research Questions and Hypotheses
Chapter Seven Research Questions and Hypotheses
International advisers
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Toine Bogers
 
Engagement strategies for (dis-)alignment with readership in postgraduate li...
Engagement strategies for (dis-)alignment with readership in postgraduate li...Engagement strategies for (dis-)alignment with readership in postgraduate li...
Engagement strategies for (dis-)alignment with readership in postgraduate li...
Lok Ming Eric Cheung
 
gwc-lit-review-resentation-matt-w (1).ppt
gwc-lit-review-resentation-matt-w (1).pptgwc-lit-review-resentation-matt-w (1).ppt
gwc-lit-review-resentation-matt-w (1).ppt
SyedNadeemAbbas6
 
Assignment Keys Black The requirements of that .docx
Assignment Keys Black The requirements of that .docxAssignment Keys Black The requirements of that .docx
Assignment Keys Black The requirements of that .docx
howard4little59962
 
2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories
WarNik Chow
 
Experimenting with eXtreme Design (EKAW2010)
Experimenting with eXtreme Design (EKAW2010)Experimenting with eXtreme Design (EKAW2010)
Experimenting with eXtreme Design (EKAW2010)
evabl444
 
Running head SELF-EXPERIMENT ASSIGNMENT 1.docx
Running head SELF-EXPERIMENT ASSIGNMENT                        1.docxRunning head SELF-EXPERIMENT ASSIGNMENT                        1.docx
Running head SELF-EXPERIMENT ASSIGNMENT 1.docx
charisellington63520
 
gwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.pptgwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.ppt
YenL3
 
gwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.pptgwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.ppt
riau university
 
Review of Literature
Review of Literature Review of Literature
Review of Literature
ssuser9b14d6
 
gwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.pptgwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.ppt
DebbyUstari1
 
gwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.pptgwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.ppt
myjuni04
 
Discourse-Historical Approach.pptx
Discourse-Historical Approach.pptxDiscourse-Historical Approach.pptx
Discourse-Historical Approach.pptx
umarashaheen2
 
dissertation101_intervention_march2013.ppt
dissertation101_intervention_march2013.pptdissertation101_intervention_march2013.ppt
dissertation101_intervention_march2013.ppt
DavidFastoon1
 
dissertation101_intervention_march2013.ppt
dissertation101_intervention_march2013.pptdissertation101_intervention_march2013.ppt
dissertation101_intervention_march2013.ppt
Yasir Iqbal
 
Admission in india 2015
Admission in india 2015Admission in india 2015
Admission in india 2015
Edhole.com
 

Similar to A Review of Distributional models of word meaning (Lenci, 2018) (20)

Discourse analysis (Schmitt's book chapter 4)
Discourse analysis (Schmitt's book chapter 4)Discourse analysis (Schmitt's book chapter 4)
Discourse analysis (Schmitt's book chapter 4)
 
Discourse analysis
Discourse analysis Discourse analysis
Discourse analysis
 
Drifting distributions? Possibilities and risks of using distributional seman...
Drifting distributions? Possibilities and risks of using distributional seman...Drifting distributions? Possibilities and risks of using distributional seman...
Drifting distributions? Possibilities and risks of using distributional seman...
 
Chapter Seven Research Questions and Hypotheses
Chapter Seven Research Questions and HypothesesChapter Seven Research Questions and Hypotheses
Chapter Seven Research Questions and Hypotheses
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Engagement strategies for (dis-)alignment with readership in postgraduate li...
Engagement strategies for (dis-)alignment with readership in postgraduate li...Engagement strategies for (dis-)alignment with readership in postgraduate li...
Engagement strategies for (dis-)alignment with readership in postgraduate li...
 
gwc-lit-review-resentation-matt-w (1).ppt
gwc-lit-review-resentation-matt-w (1).pptgwc-lit-review-resentation-matt-w (1).ppt
gwc-lit-review-resentation-matt-w (1).ppt
 
Assignment Keys Black The requirements of that .docx
Assignment Keys Black The requirements of that .docxAssignment Keys Black The requirements of that .docx
Assignment Keys Black The requirements of that .docx
 
2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories
 
Experimenting with eXtreme Design (EKAW2010)
Experimenting with eXtreme Design (EKAW2010)Experimenting with eXtreme Design (EKAW2010)
Experimenting with eXtreme Design (EKAW2010)
 
Running head SELF-EXPERIMENT ASSIGNMENT 1.docx
Running head SELF-EXPERIMENT ASSIGNMENT                        1.docxRunning head SELF-EXPERIMENT ASSIGNMENT                        1.docx
Running head SELF-EXPERIMENT ASSIGNMENT 1.docx
 
gwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.pptgwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.ppt
 
gwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.pptgwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.ppt
 
Review of Literature
Review of Literature Review of Literature
Review of Literature
 
gwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.pptgwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.ppt
 
gwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.pptgwc-lit-review-resentation-matt-w.ppt
gwc-lit-review-resentation-matt-w.ppt
 
Discourse-Historical Approach.pptx
Discourse-Historical Approach.pptxDiscourse-Historical Approach.pptx
Discourse-Historical Approach.pptx
 
dissertation101_intervention_march2013.ppt
dissertation101_intervention_march2013.pptdissertation101_intervention_march2013.ppt
dissertation101_intervention_march2013.ppt
 
dissertation101_intervention_march2013.ppt
dissertation101_intervention_march2013.pptdissertation101_intervention_march2013.ppt
dissertation101_intervention_march2013.ppt
 
Admission in india 2015
Admission in india 2015Admission in india 2015
Admission in india 2015
 

Recently uploaded

Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
Fundamentals of Induction Motor Drives.pptx
Fundamentals of Induction Motor Drives.pptxFundamentals of Induction Motor Drives.pptx
Fundamentals of Induction Motor Drives.pptx
manasideore6
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Soumen Santra
 
Basic Industrial Engineering terms for apparel
Basic Industrial Engineering terms for apparelBasic Industrial Engineering terms for apparel
Basic Industrial Engineering terms for apparel
top1002
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
Kamal Acharya
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
gestioneergodomus
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
gdsczhcet
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
anoopmanoharan2
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
The Role of Electrical and Electronics Engineers in IOT Technology.pdf
The Role of Electrical and Electronics Engineers in IOT Technology.pdfThe Role of Electrical and Electronics Engineers in IOT Technology.pdf
The Role of Electrical and Electronics Engineers in IOT Technology.pdf
Nettur Technical Training Foundation
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
ssuser7dcef0
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
SamSarthak3
 

Recently uploaded (20)

Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
Fundamentals of Induction Motor Drives.pptx
Fundamentals of Induction Motor Drives.pptxFundamentals of Induction Motor Drives.pptx
Fundamentals of Induction Motor Drives.pptx
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
 
Basic Industrial Engineering terms for apparel
Basic Industrial Engineering terms for apparelBasic Industrial Engineering terms for apparel
Basic Industrial Engineering terms for apparel
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
Gen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdfGen AI Study Jams _ For the GDSC Leads in India.pdf
Gen AI Study Jams _ For the GDSC Leads in India.pdf
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
The Role of Electrical and Electronics Engineers in IOT Technology.pdf
The Role of Electrical and Electronics Engineers in IOT Technology.pdfThe Role of Electrical and Electronics Engineers in IOT Technology.pdf
The Role of Electrical and Electronics Engineers in IOT Technology.pdf
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 

A Review of Distributional models of word meaning (Lenci, 2018)

  • 1. Distributional models of word meaning Alessandro Lenci Annual Review of Linguistics, 4(1), 151– 171, 2018
  • 2. Overview • Explain what the distributional hypothesis is • Briefly introduce major ways to generate distributional representations of words ‣ I focus on the two most popular ways (Count/ Prediction) and add more materials to explain the latter way • Summarise the common challenges with distributional representations of words 4 • Overview • Distributional Hypothesis • Taxonomy of Methods • Models • Challenges
  • 3. Distributional Hypothesis • “Lexemes with similar linguistic contexts have similar meanings” (Lenci, 2018: p. 152) • One of the ways to give the definition of word meaning 5 • Overview • Distributional Hypothesis • Taxonomy of Methods • Models • Challenges “I found a wonderful restaurant yesterday!” “I found a fantastic restaurant yesterday!” Looks like they have a similar meaning Target word Context words (latter) Context words (former)
  • 4. Distributional Hypothesis • Distributional hypothesis (DH) forms the theoretical foundation of distributional semantics (aka vector space semantics) • Lenci (2008) pointed out two levels of DH: ‣ Weak DH: only assumes correlations between semantics and word distributions ‣ Strong DH: also assumes DH is a cognitive hypothesis 6 • Overview • Distributional Hypothesis • Taxonomy of Methods • Models • Challenges
  • 5. Distributed vs Distributional • According to Ferrone and Zanzotto (2017), distributed representations contain distributional representations • Distributed: ways to represent each word by a vector with several dimensions instead of a symbolic vector (e.g., one-hot vectors) • Distributional: ways to represent each word by a vector with several dimensions based on distributional hypothesis 7 • Overview • Distributional Hypothesis • Taxonomy of Methods • Models • Challenges
  • 6. • The two major ways to generate distributional representations: ‣ Count models ‣ Prediction models • The paper shows an introduction of count models (Sec 3.2) 8 Taxonomy of Methods: The Method of Learning • Overview • Distributional Hypothesis • Taxonomy of Methods ‣ The Method of Learning ‣ The Type of Context • Models • Challenges
  • 7. 9 Taxonomy of Methods: The Type of Context • Overview • Distributional Hypothesis • Taxonomy of Methods ‣ The Method of Learning ‣ The Type of Context • Models • Challenges e.g., My example of wonderful/fantastic. Most popular taking into account syntactically dependent words only (e.g., predicate argument structure?) Imagine TFIDF
  • 8. 10 Count models • Overview • Distributional Hypothesis • Taxonomy of Methods • Models ‣ Count models ‣ Prediction models • Challenges Co-occurrence matrix Enhance significance to reflect the importance of the contexts For example, taking:
  • 9. Count models • Use some methods to obtain latent features among columns in explicit vectors → implicit vectors • One easy example is to apply dimensionality- reduction techniques like singular value decomposition or principal components analysis • See Table 2 of Lenci (2018) for famous tools ‣ GloVe (Pennington+, 2014), which is based on weighted least-squares regression, is the most popular word representations among count models 11 • Overview • Distributional Hypothesis • Taxonomy of Methods • Models ‣ Count models ‣ Prediction models • Challenges
  • 10. Prediction models • As known as word vectors, word embeddings, and distributed embeddings • Learn word representations using a neural network model while the model is learning a language model • The most famous tool of this category is word2vec ‣ The rest of the papers I will explain today are also based on prediction models 12 • Overview • Distributional Hypothesis • Taxonomy of Methods • Models ‣ Count models ‣ Prediction models • Challenges
  • 11. Prediction models • Language model ‣ A probability distribution over sequences of words ‣ This assigns a probability of given sequences • Neural language model ‣ A language model using neural network ‣ Given sequences, it predicts the next word of the sequences 13 • Overview • Distributional Hypothesis • Taxonomy of Methods • Models ‣ Count models ‣ Prediction models • Challenges
  • 12. 14 Prediction models • Overview • Distributional Hypothesis • Taxonomy of Methods • Models ‣ Count models ‣ Prediction models • Challenges have a nice one one this has the same number of units as total word types and represents a probability distribution of the next word … Output layer Hidden layers Embedding layer Modified from 坪井+ (2017) 『深層学習による自然言語処理』 E Win Whid Wout These parts become word vectors during training An example of feed forward neural network language model
  • 13. 15 Prediction models • Overview • Distributional Hypothesis • Taxonomy of Methods • Models ‣ Count models ‣ Prediction models • Challenges have a nice one one … Output layer Embedding layer Modified from 坪井+ (2017) 『深層学習による自然言語処理』 E word2vec No hidden layers Just taking inner products But with bunch of optimisation techniques
  • 14. Challenges • Distributional semantic models tend to mix up various types of semantic similarity/relatedness ‣ No distinction among hypernymy, antonymy, meronymy, locative relations and topical relations • How to represent larger linguistic units than word 16 • Overview • Distributional Hypothesis • Taxonomy of Methods • Models • Challenges