SlideShare a Scribd company logo
Makoto P. Kato
(University of Tsukuba)
Wiradee Imrattanatrai (Kyoto University), Takehiro Yamamoto,
Hiroaki Ohshima (University of Hyogo), and Katsumi Tanaka (Kyoto University)
Context-guided Learning to Rank Entities
= 0.9(Context-guided Learning) + 0.1(Learning to Rank Entities)
Learn to rank entities with their numerical attributes
and a subset of ranked entities
Goal
Rank Country GDP $ Military Land area … Min. Temp.
1st Sweden 493 5 450 -50
2nd Canada 1,550 15 9,987 -40
3rd Switzerland 664 3 41 -30
4th Australia 1,225 21 7,692 -20
5th Norway 388 5 323 -10
Entity
Numerical attribute
Learn
Ranking of Popular Countries
𝐏𝐨𝐩𝐮𝐥𝐚𝐫𝐢𝐭𝐲 = +𝟎. 𝟓 𝐇𝐚𝐩𝐩𝐢𝐧𝐞𝐬𝐬 − 𝟎. 𝟑 #𝐒𝐮𝐢𝐜𝐢𝐝𝐞𝐬
2
Attractiveness of cities =
+ 0.035 (Avg. lifetime of women) - 0.032 (# Traffic accidents)
- 0.031 (Population/# Households)
Popularity of countries =
+ 0.058 (Happiness) - 0.057 (#refugees) - 0.045 (# Suicides)
Peacefulness of countries =
+ 0.170 (Grain harvest) + 0.166 (GDP grow rate) - 0.126 (# Suicides)
Usability of cameras =
- 0.240 (Weight) - 0.213 (Height) + 0.133 (Max. shutter speed)
Real Examples from Experiments 3
If the ranking of entities was learned,
we could realize the following applications
Motivation 4
safe countries
1.Iceland
2.New Zealand
3.Portugal
4.Austria
5.Denmark
Ranking entities
in a specified order
1.Iceland
2.New Zealand
3.Portugal
4.Austria
5.Denmark
Understanding rankings
Safety = +𝟎. 𝟓 Police budget − 𝟎. 𝟖 Crime rate
Safe country ranking 2020
Too many attributes for a small size of training data
(known as over-fitting)
Challenge
𝐏𝐨𝐩𝐮𝐥𝐚𝐫𝐢𝐭𝐲 = −𝟏. 𝟎 𝐌𝐢𝐧. 𝐓𝐞𝐦𝐩.
Rank Country GDP $ Military Land area … Min. Temp.
1st Sweden 493 5 450 -50
2nd Canada 1,550 15 9,987 -40
3rd Switzerland 664 3 41 -30
4th Australia 1,225 21 7,692 -20
5th Norway 388 5 323 -10
Entity
Numerical attribute
Learn
Ranking of Popular Countries
This should hold only
for these five countries
5
• Over-fitting
 The learned model is highly accurate for seen data (training data),
while it is not for unseen data (test data)
 In general, it happens when the number of features is large
compared to the number of training instances
• Is it a serious problem?
 If the number of attributes for an entity class is fixed,
only a solution is to increase the size of training data
Over-fitting? 6
Can you increase the number of entities?
e.g. the number of countries (max. ~ 200)
Sometimes yes, and sometimes no
Why not help rankers understand attributes by their context?
→ Context-guided Learning (CGL)
Observation 7
• It seems obvious for us that
𝐏𝐨𝐩𝐮𝐥𝐚𝐫𝐢𝐭𝐲 = −𝟏. 𝟎 𝐌𝐢𝐧. 𝐓𝐞𝐦𝐩.
is a wrong model for the popularity ranking
• Why?
 We know the meaning of the minimum temperature,
and that it is (probably) nothing to do with the country popularity
 We probably learned it by reading/listening to many sentences on
"popularity" and "minimum temperature"
Key Idea
1. Introduced the problem of learning to rank entities by
using attributes as features
 For ranking entities by various criteria and precisely understanding
ranking criteria
2. Proposed Context-guided Learning (CGL)
 A general ML method using contexts of labeling criteria and
features for preventing over-fitting
3. Conducted experiments with a wide variety of orders,
and demonstrated the effectiveness of CGL in the task
of learning to rank entities
Contributions 8
Learn the weights of a linear model by training instances,
as well as contexts of the labeling criteria and attributes
Context-guided Learning (CGL)
 Labeling criteria: language expression used to explain how labels are
given (e.g. popularity)
 Context of x: sentences on x
A large # suicides affects the popularity of countries.
# suicides may indicate low popularity of the country.
Contexts suggesting negative correlation
While the min. temp. is low, the country is popular.
The country is cold but popular.
Contexts suggesting no correlation
Estimated as
non-zero and negative
Estimated as zero
𝐏𝐨𝐩𝐮𝐥𝐚𝐫𝐢𝐭𝐲 = 𝑤1 𝐆𝐃𝐏 + 𝑤2 #𝐒𝐮𝐢𝐜𝐢𝐝𝐞𝐬 + 𝑤3(𝐌𝐢𝐧. 𝐓𝐞𝐦𝐩. )
A linear model for "popularity"
9
• Suppose we try to learn a linear model 𝑓 𝐱 = 𝐰T
𝐱 + 𝑏
• One of the weight values fitting the training data is
𝐰 = (𝟏, 𝟎) meaning that "warm countries are rich"
 (𝟎, 𝟏) is another candidate for 𝐰, but no evidence on which is better
Example: Learning without CGL 10
Rich
(𝑙)
Temp.
(𝑎1)
GDP
(𝑎2)
𝐱1 +1 14 9
𝐱2 +1 13 4
𝐱3 −1 3 1
Entities
Labeling
Criteria Attributes
Attributes and Labels of Entities Temp.
GDP
Decisionboundaryby𝐰
𝐱1
𝐱2
𝐱3 𝐰: Weights of a linear function
Learn
a linear
model
−1
+1
+1
𝐠 is a weight "roughly" estimated by the contexts
Expected that 𝐠 is somewhat close to the ideal weight
Example: Learning with CGL 1/2 11
… The average temp. of the lobster-
rich waters …
… The effect of rich air/fuel ratios
and temp. …
… Culturally-rich country has
moderate temp. …
Contexts of 𝑙
(usually derived from the Web corpus)
𝑐1
𝑐2
Temp.
GDP
𝐱1
𝐱2
𝐠
Predict… GDP is a key factor for richness.
…
… Rich countries have high GDP.
…
… Rich regions, where GDP was
above the EU-28 …
𝐱3
For"temp."For"GDP"
−1
+1
+1
CGL estimate 𝐰 by 𝐰 = 𝐠 + 𝐯
The difference 𝐯 is expected to be small
Evidences to support 𝐰 = (0, 1) meaning that "a high GDP indicates richness"
Example: Learning with CGL 2/2 12
Temp.
GDP
𝐱1
𝐱2
𝐱3
𝐠Predict
Rich
(𝑙)
Temp.
(𝑎1)
GDP
(𝑎2)
𝐱1 +1 14 9
𝐱2 +1 13 4
𝐱3 −1 3 1
Entities
Labeling
Criteria Attributes
Attributes and Labels of Entities
Decision boundary by 𝐰
𝐰
𝐯
−1
+1
+1
• Linear function 𝑓𝑘 to rank entities in order 𝑘
(we assume there are several orders to be learned)
𝑓𝑘 𝐱 𝑖 =
𝑗=1
𝑀
𝑤 𝑘,𝑗 x𝑖,𝑗
• Weight Model
𝑤 𝑘,𝑗 = 𝐮 𝑇
𝐜 𝑘,𝑗 + 𝑣 𝑘,𝑗
Formalization
𝒋-th attribute value of 𝒊-th entityWeight value
for 𝒋-th attribute
Weight vector
for context vectors
Context vector
for order 𝒌 and
𝒋-th attribute
Weight value that could not
be explained by only contexts
13
Any models such as TF-IDF, doc2vec, or
Sentence-BERT can be applied to the
contexts for generating context vectors
Context Model 14
… The average temp. of the lobster-
rich waters …
… The effect of rich air/fuel ratios
and temp. …
… Culturally-rich country has
moderate temp. …
Contexts of 𝑙
(usually derived from the Web corpus)
𝑐1,1
𝑐1,2
… GDP is a key factor for richness.
…
… Rich countries have high GDP.
…
… Rich regions, where GDP was
above the EU-28 …
For"temp."For"GDP"
𝐜1,1 = (1.2, 0, 0.1)
𝐜1,2 = (0, 2.2, 1.7)
𝐮 𝑇 𝐜1,1 = 0.1
𝐮 𝑇 𝐜1,2 = 2.9
If 𝐮 = (0, 0.5, 1)
𝐮 determines how to estimate
the weight based on the
context vector
• Find the solution of this optimization problem:
min
𝐮,𝐯 𝑘,𝜉 𝑘,𝑖
𝐮 2
+
𝑐
𝐾
𝑘=1
𝐾
𝐯 𝑘
2
+ 𝐶
𝑘=1
𝐾
𝑖=1
𝑁 𝑘
𝜉 𝑘,𝑖
 subject, for 𝑘 = 1, … , 𝐾 and 𝑖 = 1, … , to the constraints
𝑓𝑘 𝐱 𝑖
sup
− 𝑓𝑘 𝐱 𝑖
inf
≥ 1 − 𝜉 𝑘,𝑖
• Can be solved by SVM solvers with a special kernel
Learning of CGL
Regularization term
similar to SVM
Slack variables
similar to SVM
The rank of 𝐱 𝑖
sup
is higher than that of 𝐱 𝑖
inf
in
the training data. Similar to RankingSVM.
15
Cities Countries Cameras
# Entities 47 138 149
# Orders 64 40 54
# Attributes 137 83 16
Examples of Orders
Attractiveness,
Richness
Livability, Safety Portability, Usability
Examples of
Attributes
Population,
Crime rate
# Visitors,
# Suicides
Resolution,
Weight
Experiments
Experiments were conducted with
Used a half of ranked entities as training data, and
examined if the rest of the entities can be ranked correctly
16
• Baselines
 RankNet
 RankBoost
 Linear-Feature
(A linear feature-based model optimized by coordinate ascent)
 LambdaMART
 ListNet
• Proposed Methods
 CGL (TF-IDF)
• The TF-IDF weighting schema was used as a context model
 CGL (Distributed)
• Paragraph vector was used as a context model
Comparative Methods 17
Context-guided Learning (CGL) worked well (+16%) at every class of entities
No significant difference between the two context models
Experimental Results
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
City Country Camera Total
Accuracy
RankNet RankBoost LinearFeature LambdaMART
ListNet CGL (TF-IDF) CGL (Distributed)
18
Attractiveness of cities =
+ 0.035 (Avg. lifetime of women) - 0.032 (# Traffic accidents)
- 0.031 (Population/# Households)
Popularity of countries =
+ 0.058 (Happiness) - 0.057 (#refugees) - 0.045 (# Suicides)
Peacefulness of countries =
+ 0.170 (Grain harvest) + 0.166 (GDP grow rate) - 0.126 (# Suicides)
Usability of cameras =
- 0.240 (Weight) - 0.213 (Height) + 0.133 (Max. shutter speed)
Real Examples from Experiments 19
User Study
• Evaluated the learned model
by crowdsourcing
 “If you agree that there is a
correlation between <labeling
criterion> and <attribute>, please
assign a score +2. If you disagree,
please assign a score −2. If you
cannot agree or disagree, please
assign a score 0.”
• Compared CGL and Linear-
Feature
• CGL was slightly better
20
1. Introduced the problem of
learning to rank entities by
using attributes as features
2. Proposed Context-guided
Learning (CGL)
3. Conducted experiments with a
wide variety of orders, and
demonstrated the
effectiveness of CGL in the
task of learning to rank entities
Summary 21
Can take questions at
https://www.mpkato.net/

More Related Content

What's hot

Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)butest
 
[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程台灣資料科學年會
 
Data Mining: Concepts and Techniques — Chapter 2 —
Data Mining:  Concepts and Techniques — Chapter 2 —Data Mining:  Concepts and Techniques — Chapter 2 —
Data Mining: Concepts and Techniques — Chapter 2 —Salah Amean
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2Gokulks007
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
Machine learning introduction
Machine learning introductionMachine learning introduction
Machine learning introductionAnas Jamil
 
L5. Data Transformation and Feature Engineering
L5. Data Transformation and Feature EngineeringL5. Data Transformation and Feature Engineering
L5. Data Transformation and Feature EngineeringMachine Learning Valencia
 
Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering odsc
 
DATA MINING.doc
DATA MINING.docDATA MINING.doc
DATA MINING.docbutest
 
Collective Response Spike Prediction for Mutually Interacting Consumers
Collective Response Spike Prediction for Mutually Interacting ConsumersCollective Response Spike Prediction for Mutually Interacting Consumers
Collective Response Spike Prediction for Mutually Interacting ConsumersRikiya Takahashi
 
Computational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisComputational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisAboul Ella Hassanien
 
On the Dynamics of Machine Learning Algorithms and Behavioral Game Theory
On the Dynamics of Machine Learning Algorithms and Behavioral Game TheoryOn the Dynamics of Machine Learning Algorithms and Behavioral Game Theory
On the Dynamics of Machine Learning Algorithms and Behavioral Game TheoryRikiya Takahashi
 
i-Eclat: performance enhancement of Eclat via incremental approach in frequen...
i-Eclat: performance enhancement of Eclat via incremental approach in frequen...i-Eclat: performance enhancement of Eclat via incremental approach in frequen...
i-Eclat: performance enhancement of Eclat via incremental approach in frequen...TELKOMNIKA JOURNAL
 
Binary search query classifier
Binary search query classifierBinary search query classifier
Binary search query classifierEsteban Ribero
 
Interactive Information Retrieval inspired by Quantum Theory
Interactive Information Retrieval inspired by Quantum TheoryInteractive Information Retrieval inspired by Quantum Theory
Interactive Information Retrieval inspired by Quantum TheoryIngo Frommholz
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsLei Guo
 
Machine learning
Machine learningMachine learning
Machine learningeonx_32
 
Predicting Preference Reversals via Gaussian Process Uncertainty Aversion
Predicting Preference Reversals via Gaussian Process Uncertainty AversionPredicting Preference Reversals via Gaussian Process Uncertainty Aversion
Predicting Preference Reversals via Gaussian Process Uncertainty AversionRikiya Takahashi
 
VSSML16 L3. Clusters and Anomaly Detection
VSSML16 L3. Clusters and Anomaly DetectionVSSML16 L3. Clusters and Anomaly Detection
VSSML16 L3. Clusters and Anomaly DetectionBigML, Inc
 
IRJET-Debarred Objects Recognition by PFL Operator
IRJET-Debarred Objects Recognition by PFL OperatorIRJET-Debarred Objects Recognition by PFL Operator
IRJET-Debarred Objects Recognition by PFL OperatorIRJET Journal
 

What's hot (20)

Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
 
[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程
 
Data Mining: Concepts and Techniques — Chapter 2 —
Data Mining:  Concepts and Techniques — Chapter 2 —Data Mining:  Concepts and Techniques — Chapter 2 —
Data Mining: Concepts and Techniques — Chapter 2 —
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine learning introduction
Machine learning introductionMachine learning introduction
Machine learning introduction
 
L5. Data Transformation and Feature Engineering
L5. Data Transformation and Feature EngineeringL5. Data Transformation and Feature Engineering
L5. Data Transformation and Feature Engineering
 
Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering
 
DATA MINING.doc
DATA MINING.docDATA MINING.doc
DATA MINING.doc
 
Collective Response Spike Prediction for Mutually Interacting Consumers
Collective Response Spike Prediction for Mutually Interacting ConsumersCollective Response Spike Prediction for Mutually Interacting Consumers
Collective Response Spike Prediction for Mutually Interacting Consumers
 
Computational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisComputational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysis
 
On the Dynamics of Machine Learning Algorithms and Behavioral Game Theory
On the Dynamics of Machine Learning Algorithms and Behavioral Game TheoryOn the Dynamics of Machine Learning Algorithms and Behavioral Game Theory
On the Dynamics of Machine Learning Algorithms and Behavioral Game Theory
 
i-Eclat: performance enhancement of Eclat via incremental approach in frequen...
i-Eclat: performance enhancement of Eclat via incremental approach in frequen...i-Eclat: performance enhancement of Eclat via incremental approach in frequen...
i-Eclat: performance enhancement of Eclat via incremental approach in frequen...
 
Binary search query classifier
Binary search query classifierBinary search query classifier
Binary search query classifier
 
Interactive Information Retrieval inspired by Quantum Theory
Interactive Information Retrieval inspired by Quantum TheoryInteractive Information Retrieval inspired by Quantum Theory
Interactive Information Retrieval inspired by Quantum Theory
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
 
Machine learning
Machine learningMachine learning
Machine learning
 
Predicting Preference Reversals via Gaussian Process Uncertainty Aversion
Predicting Preference Reversals via Gaussian Process Uncertainty AversionPredicting Preference Reversals via Gaussian Process Uncertainty Aversion
Predicting Preference Reversals via Gaussian Process Uncertainty Aversion
 
VSSML16 L3. Clusters and Anomaly Detection
VSSML16 L3. Clusters and Anomaly DetectionVSSML16 L3. Clusters and Anomaly Detection
VSSML16 L3. Clusters and Anomaly Detection
 
IRJET-Debarred Objects Recognition by PFL Operator
IRJET-Debarred Objects Recognition by PFL OperatorIRJET-Debarred Objects Recognition by PFL Operator
IRJET-Debarred Objects Recognition by PFL Operator
 

Similar to Context-guided Learning to Rank Entities

.NET Fest 2017. Игорь Кочетов. Классификация результатов тестирования произво...
.NET Fest 2017. Игорь Кочетов. Классификация результатов тестирования произво....NET Fest 2017. Игорь Кочетов. Классификация результатов тестирования произво...
.NET Fest 2017. Игорь Кочетов. Классификация результатов тестирования произво...NETFest
 
Machine Learning and Deep Learning 4 dummies
Machine Learning and Deep Learning 4 dummies Machine Learning and Deep Learning 4 dummies
Machine Learning and Deep Learning 4 dummies Dori Waldman
 
Machine learning4dummies
Machine learning4dummiesMachine learning4dummies
Machine learning4dummiesMichael Winer
 
Practical deep learning for computer vision
Practical deep learning for computer visionPractical deep learning for computer vision
Practical deep learning for computer visionEran Shlomo
 
"Optimal Learning for Fun and Profit" by Scott Clark (Presented at The Yelp E...
"Optimal Learning for Fun and Profit" by Scott Clark (Presented at The Yelp E..."Optimal Learning for Fun and Profit" by Scott Clark (Presented at The Yelp E...
"Optimal Learning for Fun and Profit" by Scott Clark (Presented at The Yelp E...Yelp Engineering
 
week9_Machine_Learning.ppt
week9_Machine_Learning.pptweek9_Machine_Learning.ppt
week9_Machine_Learning.pptbutest
 
CS3114_09212011.ppt
CS3114_09212011.pptCS3114_09212011.ppt
CS3114_09212011.pptArumugam90
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningAI Summary
 
Lecture 3.1_ Logistic Regression.pptx
Lecture 3.1_ Logistic Regression.pptxLecture 3.1_ Logistic Regression.pptx
Lecture 3.1_ Logistic Regression.pptxajondaree
 
OPTIMIZATION AS A MODEL FOR FEW-SHOT LEARNING
 OPTIMIZATION AS A MODEL FOR FEW-SHOT LEARNING OPTIMIZATION AS A MODEL FOR FEW-SHOT LEARNING
OPTIMIZATION AS A MODEL FOR FEW-SHOT LEARNINGMLReview
 
Bill howe 6_machinelearning_1
Bill howe 6_machinelearning_1Bill howe 6_machinelearning_1
Bill howe 6_machinelearning_1Mahammad Valiyev
 
Introduction to Item Response Theory
Introduction to Item Response TheoryIntroduction to Item Response Theory
Introduction to Item Response TheoryNathan Thompson
 
林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning台灣資料科學年會
 
Data Science as a Career and Intro to R
Data Science as a Career and Intro to RData Science as a Career and Intro to R
Data Science as a Career and Intro to RAnshik Bansal
 
Categorical data stata cox 2004
Categorical data stata    cox 2004Categorical data stata    cox 2004
Categorical data stata cox 2004Lilian Carvalho
 
Genetic Algorithms.ppt
Genetic Algorithms.pptGenetic Algorithms.ppt
Genetic Algorithms.pptRohanBorgalli
 
IFTA2020 Kei Nakagawa
IFTA2020 Kei NakagawaIFTA2020 Kei Nakagawa
IFTA2020 Kei NakagawaKei Nakagawa
 

Similar to Context-guided Learning to Rank Entities (20)

.NET Fest 2017. Игорь Кочетов. Классификация результатов тестирования произво...
.NET Fest 2017. Игорь Кочетов. Классификация результатов тестирования произво....NET Fest 2017. Игорь Кочетов. Классификация результатов тестирования произво...
.NET Fest 2017. Игорь Кочетов. Классификация результатов тестирования произво...
 
Machine Learning and Deep Learning 4 dummies
Machine Learning and Deep Learning 4 dummies Machine Learning and Deep Learning 4 dummies
Machine Learning and Deep Learning 4 dummies
 
Machine learning4dummies
Machine learning4dummiesMachine learning4dummies
Machine learning4dummies
 
Practical deep learning for computer vision
Practical deep learning for computer visionPractical deep learning for computer vision
Practical deep learning for computer vision
 
"Optimal Learning for Fun and Profit" by Scott Clark (Presented at The Yelp E...
"Optimal Learning for Fun and Profit" by Scott Clark (Presented at The Yelp E..."Optimal Learning for Fun and Profit" by Scott Clark (Presented at The Yelp E...
"Optimal Learning for Fun and Profit" by Scott Clark (Presented at The Yelp E...
 
week9_Machine_Learning.ppt
week9_Machine_Learning.pptweek9_Machine_Learning.ppt
week9_Machine_Learning.ppt
 
Explainable AI
Explainable AIExplainable AI
Explainable AI
 
CS3114_09212011.ppt
CS3114_09212011.pptCS3114_09212011.ppt
CS3114_09212011.ppt
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Lecture 3.1_ Logistic Regression.pptx
Lecture 3.1_ Logistic Regression.pptxLecture 3.1_ Logistic Regression.pptx
Lecture 3.1_ Logistic Regression.pptx
 
OPTIMIZATION AS A MODEL FOR FEW-SHOT LEARNING
 OPTIMIZATION AS A MODEL FOR FEW-SHOT LEARNING OPTIMIZATION AS A MODEL FOR FEW-SHOT LEARNING
OPTIMIZATION AS A MODEL FOR FEW-SHOT LEARNING
 
Bill howe 6_machinelearning_1
Bill howe 6_machinelearning_1Bill howe 6_machinelearning_1
Bill howe 6_machinelearning_1
 
[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊
 
Introduction to Item Response Theory
Introduction to Item Response TheoryIntroduction to Item Response Theory
Introduction to Item Response Theory
 
林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning
 
Data Science as a Career and Intro to R
Data Science as a Career and Intro to RData Science as a Career and Intro to R
Data Science as a Career and Intro to R
 
Categorical data stata cox 2004
Categorical data stata    cox 2004Categorical data stata    cox 2004
Categorical data stata cox 2004
 
Explore ML day 1
Explore ML day 1Explore ML day 1
Explore ML day 1
 
Genetic Algorithms.ppt
Genetic Algorithms.pptGenetic Algorithms.ppt
Genetic Algorithms.ppt
 
IFTA2020 Kei Nakagawa
IFTA2020 Kei NakagawaIFTA2020 Kei Nakagawa
IFTA2020 Kei Nakagawa
 

More from kt.mako

情報検索とゼロショット学習
情報検索とゼロショット学習情報検索とゼロショット学習
情報検索とゼロショット学習kt.mako
 
情報アクセス技術のためのテストコレクション作成
情報アクセス技術のためのテストコレクション作成情報アクセス技術のためのテストコレクション作成
情報アクセス技術のためのテストコレクション作成kt.mako
 
筑波大学 図書館情報メディア系 知識獲得システム 研究紹介
筑波大学 図書館情報メディア系 知識獲得システム 研究紹介筑波大学 図書館情報メディア系 知識獲得システム 研究紹介
筑波大学 図書館情報メディア系 知識獲得システム 研究紹介kt.mako
 
DEIM2017 私が愛したSIGIR Paper [京都大学 加藤誠]
DEIM2017 私が愛したSIGIR Paper [京都大学 加藤誠]DEIM2017 私が愛したSIGIR Paper [京都大学 加藤誠]
DEIM2017 私が愛したSIGIR Paper [京都大学 加藤誠]kt.mako
 
検索評価ツールキットNTCIREVALを用いた様々な情報アクセス技術の評価方法
検索評価ツールキットNTCIREVALを用いた様々な情報アクセス技術の評価方法検索評価ツールキットNTCIREVALを用いた様々な情報アクセス技術の評価方法
検索評価ツールキットNTCIREVALを用いた様々な情報アクセス技術の評価方法kt.mako
 
NTCIR-12 MobileClick-2 Overview
NTCIR-12 MobileClick-2 OverviewNTCIR-12 MobileClick-2 Overview
NTCIR-12 MobileClick-2 Overviewkt.mako
 
情報検索のためのユーザモデル
情報検索のためのユーザモデル情報検索のためのユーザモデル
情報検索のためのユーザモデルkt.mako
 
MobileClick-2 キックオフイベント
MobileClick-2 キックオフイベントMobileClick-2 キックオフイベント
MobileClick-2 キックオフイベントkt.mako
 
MobileClick-2 Kickoff Event
MobileClick-2 Kickoff EventMobileClick-2 Kickoff Event
MobileClick-2 Kickoff Eventkt.mako
 

More from kt.mako (9)

情報検索とゼロショット学習
情報検索とゼロショット学習情報検索とゼロショット学習
情報検索とゼロショット学習
 
情報アクセス技術のためのテストコレクション作成
情報アクセス技術のためのテストコレクション作成情報アクセス技術のためのテストコレクション作成
情報アクセス技術のためのテストコレクション作成
 
筑波大学 図書館情報メディア系 知識獲得システム 研究紹介
筑波大学 図書館情報メディア系 知識獲得システム 研究紹介筑波大学 図書館情報メディア系 知識獲得システム 研究紹介
筑波大学 図書館情報メディア系 知識獲得システム 研究紹介
 
DEIM2017 私が愛したSIGIR Paper [京都大学 加藤誠]
DEIM2017 私が愛したSIGIR Paper [京都大学 加藤誠]DEIM2017 私が愛したSIGIR Paper [京都大学 加藤誠]
DEIM2017 私が愛したSIGIR Paper [京都大学 加藤誠]
 
検索評価ツールキットNTCIREVALを用いた様々な情報アクセス技術の評価方法
検索評価ツールキットNTCIREVALを用いた様々な情報アクセス技術の評価方法検索評価ツールキットNTCIREVALを用いた様々な情報アクセス技術の評価方法
検索評価ツールキットNTCIREVALを用いた様々な情報アクセス技術の評価方法
 
NTCIR-12 MobileClick-2 Overview
NTCIR-12 MobileClick-2 OverviewNTCIR-12 MobileClick-2 Overview
NTCIR-12 MobileClick-2 Overview
 
情報検索のためのユーザモデル
情報検索のためのユーザモデル情報検索のためのユーザモデル
情報検索のためのユーザモデル
 
MobileClick-2 キックオフイベント
MobileClick-2 キックオフイベントMobileClick-2 キックオフイベント
MobileClick-2 キックオフイベント
 
MobileClick-2 Kickoff Event
MobileClick-2 Kickoff EventMobileClick-2 Kickoff Event
MobileClick-2 Kickoff Event
 

Recently uploaded

De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...Product School
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupCatarinaPereira64715
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsPaul Groth
 
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»QADay
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesThousandEyes
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyJohn Staveley
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaRTTS
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxAbida Shariff
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutesconfluent
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...Product School
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxDavid Michel
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...Product School
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Product School
 

Recently uploaded (20)

De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 

Context-guided Learning to Rank Entities

  • 1. Makoto P. Kato (University of Tsukuba) Wiradee Imrattanatrai (Kyoto University), Takehiro Yamamoto, Hiroaki Ohshima (University of Hyogo), and Katsumi Tanaka (Kyoto University) Context-guided Learning to Rank Entities = 0.9(Context-guided Learning) + 0.1(Learning to Rank Entities)
  • 2. Learn to rank entities with their numerical attributes and a subset of ranked entities Goal Rank Country GDP $ Military Land area … Min. Temp. 1st Sweden 493 5 450 -50 2nd Canada 1,550 15 9,987 -40 3rd Switzerland 664 3 41 -30 4th Australia 1,225 21 7,692 -20 5th Norway 388 5 323 -10 Entity Numerical attribute Learn Ranking of Popular Countries 𝐏𝐨𝐩𝐮𝐥𝐚𝐫𝐢𝐭𝐲 = +𝟎. 𝟓 𝐇𝐚𝐩𝐩𝐢𝐧𝐞𝐬𝐬 − 𝟎. 𝟑 #𝐒𝐮𝐢𝐜𝐢𝐝𝐞𝐬 2
  • 3. Attractiveness of cities = + 0.035 (Avg. lifetime of women) - 0.032 (# Traffic accidents) - 0.031 (Population/# Households) Popularity of countries = + 0.058 (Happiness) - 0.057 (#refugees) - 0.045 (# Suicides) Peacefulness of countries = + 0.170 (Grain harvest) + 0.166 (GDP grow rate) - 0.126 (# Suicides) Usability of cameras = - 0.240 (Weight) - 0.213 (Height) + 0.133 (Max. shutter speed) Real Examples from Experiments 3
  • 4. If the ranking of entities was learned, we could realize the following applications Motivation 4 safe countries 1.Iceland 2.New Zealand 3.Portugal 4.Austria 5.Denmark Ranking entities in a specified order 1.Iceland 2.New Zealand 3.Portugal 4.Austria 5.Denmark Understanding rankings Safety = +𝟎. 𝟓 Police budget − 𝟎. 𝟖 Crime rate Safe country ranking 2020
  • 5. Too many attributes for a small size of training data (known as over-fitting) Challenge 𝐏𝐨𝐩𝐮𝐥𝐚𝐫𝐢𝐭𝐲 = −𝟏. 𝟎 𝐌𝐢𝐧. 𝐓𝐞𝐦𝐩. Rank Country GDP $ Military Land area … Min. Temp. 1st Sweden 493 5 450 -50 2nd Canada 1,550 15 9,987 -40 3rd Switzerland 664 3 41 -30 4th Australia 1,225 21 7,692 -20 5th Norway 388 5 323 -10 Entity Numerical attribute Learn Ranking of Popular Countries This should hold only for these five countries 5
  • 6. • Over-fitting  The learned model is highly accurate for seen data (training data), while it is not for unseen data (test data)  In general, it happens when the number of features is large compared to the number of training instances • Is it a serious problem?  If the number of attributes for an entity class is fixed, only a solution is to increase the size of training data Over-fitting? 6 Can you increase the number of entities? e.g. the number of countries (max. ~ 200) Sometimes yes, and sometimes no
  • 7. Why not help rankers understand attributes by their context? → Context-guided Learning (CGL) Observation 7 • It seems obvious for us that 𝐏𝐨𝐩𝐮𝐥𝐚𝐫𝐢𝐭𝐲 = −𝟏. 𝟎 𝐌𝐢𝐧. 𝐓𝐞𝐦𝐩. is a wrong model for the popularity ranking • Why?  We know the meaning of the minimum temperature, and that it is (probably) nothing to do with the country popularity  We probably learned it by reading/listening to many sentences on "popularity" and "minimum temperature" Key Idea
  • 8. 1. Introduced the problem of learning to rank entities by using attributes as features  For ranking entities by various criteria and precisely understanding ranking criteria 2. Proposed Context-guided Learning (CGL)  A general ML method using contexts of labeling criteria and features for preventing over-fitting 3. Conducted experiments with a wide variety of orders, and demonstrated the effectiveness of CGL in the task of learning to rank entities Contributions 8
  • 9. Learn the weights of a linear model by training instances, as well as contexts of the labeling criteria and attributes Context-guided Learning (CGL)  Labeling criteria: language expression used to explain how labels are given (e.g. popularity)  Context of x: sentences on x A large # suicides affects the popularity of countries. # suicides may indicate low popularity of the country. Contexts suggesting negative correlation While the min. temp. is low, the country is popular. The country is cold but popular. Contexts suggesting no correlation Estimated as non-zero and negative Estimated as zero 𝐏𝐨𝐩𝐮𝐥𝐚𝐫𝐢𝐭𝐲 = 𝑤1 𝐆𝐃𝐏 + 𝑤2 #𝐒𝐮𝐢𝐜𝐢𝐝𝐞𝐬 + 𝑤3(𝐌𝐢𝐧. 𝐓𝐞𝐦𝐩. ) A linear model for "popularity" 9
  • 10. • Suppose we try to learn a linear model 𝑓 𝐱 = 𝐰T 𝐱 + 𝑏 • One of the weight values fitting the training data is 𝐰 = (𝟏, 𝟎) meaning that "warm countries are rich"  (𝟎, 𝟏) is another candidate for 𝐰, but no evidence on which is better Example: Learning without CGL 10 Rich (𝑙) Temp. (𝑎1) GDP (𝑎2) 𝐱1 +1 14 9 𝐱2 +1 13 4 𝐱3 −1 3 1 Entities Labeling Criteria Attributes Attributes and Labels of Entities Temp. GDP Decisionboundaryby𝐰 𝐱1 𝐱2 𝐱3 𝐰: Weights of a linear function Learn a linear model −1 +1 +1
  • 11. 𝐠 is a weight "roughly" estimated by the contexts Expected that 𝐠 is somewhat close to the ideal weight Example: Learning with CGL 1/2 11 … The average temp. of the lobster- rich waters … … The effect of rich air/fuel ratios and temp. … … Culturally-rich country has moderate temp. … Contexts of 𝑙 (usually derived from the Web corpus) 𝑐1 𝑐2 Temp. GDP 𝐱1 𝐱2 𝐠 Predict… GDP is a key factor for richness. … … Rich countries have high GDP. … … Rich regions, where GDP was above the EU-28 … 𝐱3 For"temp."For"GDP" −1 +1 +1
  • 12. CGL estimate 𝐰 by 𝐰 = 𝐠 + 𝐯 The difference 𝐯 is expected to be small Evidences to support 𝐰 = (0, 1) meaning that "a high GDP indicates richness" Example: Learning with CGL 2/2 12 Temp. GDP 𝐱1 𝐱2 𝐱3 𝐠Predict Rich (𝑙) Temp. (𝑎1) GDP (𝑎2) 𝐱1 +1 14 9 𝐱2 +1 13 4 𝐱3 −1 3 1 Entities Labeling Criteria Attributes Attributes and Labels of Entities Decision boundary by 𝐰 𝐰 𝐯 −1 +1 +1
  • 13. • Linear function 𝑓𝑘 to rank entities in order 𝑘 (we assume there are several orders to be learned) 𝑓𝑘 𝐱 𝑖 = 𝑗=1 𝑀 𝑤 𝑘,𝑗 x𝑖,𝑗 • Weight Model 𝑤 𝑘,𝑗 = 𝐮 𝑇 𝐜 𝑘,𝑗 + 𝑣 𝑘,𝑗 Formalization 𝒋-th attribute value of 𝒊-th entityWeight value for 𝒋-th attribute Weight vector for context vectors Context vector for order 𝒌 and 𝒋-th attribute Weight value that could not be explained by only contexts 13
  • 14. Any models such as TF-IDF, doc2vec, or Sentence-BERT can be applied to the contexts for generating context vectors Context Model 14 … The average temp. of the lobster- rich waters … … The effect of rich air/fuel ratios and temp. … … Culturally-rich country has moderate temp. … Contexts of 𝑙 (usually derived from the Web corpus) 𝑐1,1 𝑐1,2 … GDP is a key factor for richness. … … Rich countries have high GDP. … … Rich regions, where GDP was above the EU-28 … For"temp."For"GDP" 𝐜1,1 = (1.2, 0, 0.1) 𝐜1,2 = (0, 2.2, 1.7) 𝐮 𝑇 𝐜1,1 = 0.1 𝐮 𝑇 𝐜1,2 = 2.9 If 𝐮 = (0, 0.5, 1) 𝐮 determines how to estimate the weight based on the context vector
  • 15. • Find the solution of this optimization problem: min 𝐮,𝐯 𝑘,𝜉 𝑘,𝑖 𝐮 2 + 𝑐 𝐾 𝑘=1 𝐾 𝐯 𝑘 2 + 𝐶 𝑘=1 𝐾 𝑖=1 𝑁 𝑘 𝜉 𝑘,𝑖  subject, for 𝑘 = 1, … , 𝐾 and 𝑖 = 1, … , to the constraints 𝑓𝑘 𝐱 𝑖 sup − 𝑓𝑘 𝐱 𝑖 inf ≥ 1 − 𝜉 𝑘,𝑖 • Can be solved by SVM solvers with a special kernel Learning of CGL Regularization term similar to SVM Slack variables similar to SVM The rank of 𝐱 𝑖 sup is higher than that of 𝐱 𝑖 inf in the training data. Similar to RankingSVM. 15
  • 16. Cities Countries Cameras # Entities 47 138 149 # Orders 64 40 54 # Attributes 137 83 16 Examples of Orders Attractiveness, Richness Livability, Safety Portability, Usability Examples of Attributes Population, Crime rate # Visitors, # Suicides Resolution, Weight Experiments Experiments were conducted with Used a half of ranked entities as training data, and examined if the rest of the entities can be ranked correctly 16
  • 17. • Baselines  RankNet  RankBoost  Linear-Feature (A linear feature-based model optimized by coordinate ascent)  LambdaMART  ListNet • Proposed Methods  CGL (TF-IDF) • The TF-IDF weighting schema was used as a context model  CGL (Distributed) • Paragraph vector was used as a context model Comparative Methods 17
  • 18. Context-guided Learning (CGL) worked well (+16%) at every class of entities No significant difference between the two context models Experimental Results 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 City Country Camera Total Accuracy RankNet RankBoost LinearFeature LambdaMART ListNet CGL (TF-IDF) CGL (Distributed) 18
  • 19. Attractiveness of cities = + 0.035 (Avg. lifetime of women) - 0.032 (# Traffic accidents) - 0.031 (Population/# Households) Popularity of countries = + 0.058 (Happiness) - 0.057 (#refugees) - 0.045 (# Suicides) Peacefulness of countries = + 0.170 (Grain harvest) + 0.166 (GDP grow rate) - 0.126 (# Suicides) Usability of cameras = - 0.240 (Weight) - 0.213 (Height) + 0.133 (Max. shutter speed) Real Examples from Experiments 19
  • 20. User Study • Evaluated the learned model by crowdsourcing  “If you agree that there is a correlation between <labeling criterion> and <attribute>, please assign a score +2. If you disagree, please assign a score −2. If you cannot agree or disagree, please assign a score 0.” • Compared CGL and Linear- Feature • CGL was slightly better 20
  • 21. 1. Introduced the problem of learning to rank entities by using attributes as features 2. Proposed Context-guided Learning (CGL) 3. Conducted experiments with a wide variety of orders, and demonstrated the effectiveness of CGL in the task of learning to rank entities Summary 21 Can take questions at https://www.mpkato.net/