SlideShare a Scribd company logo
1 of 40
Download to read offline
1/31
Parameterized Fielded Term Dependence Models
for Ad-hoc Entity Retrieval from Knowledge
Graph
Fedor Nikolaev 1,2 Alexander Kotov 1 Nikita Zhiltsov 2
1Textual Data Analytics Lab, Department of Computer Science, Wayne State
University
2Kazan Federal University
2/31
Knowledge Graphs
3/31
Entities
• Material objects or concepts in the
real world or fiction (e.g. people,
movies, conferences etc.)
• Are connected with other entities
by relations (e.g. hasGenre,
actedIn, isPCmemberOf etc.)
• Subject-Predicate-Object (SPO)
triple: subject=entity;
object=entity (or primitive data
value); predicate=relationship
between subject and object
• Many SPO triples → knowledge
graph
4/31
DBPedia entity page example
5/31
Entity Retrieval from Knowledge Graph(s)
• Graph KBs are perfectly suited for addressing the information
needs that aim at finding specific objects (entities) rather
than documents
• Given the user’s information need expressed as a keyword
query, retrieve a relevant set of objects from the knowledge
graph(s)
6/31
Typical ERWD tasks
• Entity Search
Queries refer to a particular entity.
• “Ben Franklin”
• “England football player highest paid”
• “Einstein Relativity theory”
• List Search
Complex queries with several relevant entities.
• “US presidents since 1960”
• “animals lay eggs mammals”
• Question Answering
Queries are questions in natural language.
• “Who is the architect of leaning tower of Pisa?”
• “For which label did Elvis record his first album?”
7/31
Entity document
An entity is represented as a structured (multi-fielded) document:
names
Conventional names of the entities, such as the name
of a person or the name of an organization
attributes
All entity properties, other than names
categories
Classes or groups, to which the entity has been
assigned
similar entity names
Names of the entities that are very similar or
identical to a given entity
related entity names
Names of the entities that are part of the same RDF
triple
8/31
Entity document example
Multi-fielded entity document for the entity Barack Obama.
Field Content
names barack obama barack hussein obama ii
attributes 44th current president united states
birth place honolulu hawaii
categories democratic party united states senator
nobel peace prize laureate christian
similar entity names barack obama jr barak hussein obama
barack h obama ii
related entity names spouse michelle obama illinois state
predecessor george walker bush
9/31
PRMS
P(Q|d) =
qi ∈Q j
PM(Ej|qi )PQL(qi |ej),
where
PM(Ej|w) =
PM(w|Ej)PM(Ej)
Ek ∈E PM(w|Ek)PM(Ek)
10/31
SDM, FDM
Ranks w.r.t. PΛ(D|Q) = i∈{T,U,O} λi fi (Q, D)
Potential function for unigrams is QL:
fT (qi , D) = log P(qi |θD) = log
tfqi ,D + µ
cfqi
|C|
|D| + µ
SDM only considers two-word sequences in queries, FDM considers
all two-word combinations.
11/31
Fielded SDM
FSDM incorporates document structure and term dependencies
into the one ranking model.
Potential function for unigrams in case of FSDM:
˜fT (qi , D) = log
j
wT
j P(qi |θj
D) = log
j
wT
j
tfqi ,Dj + µj
cf j
qi
|Cj |
|Dj| + µj
12/31
FSDM limitation
In FSDM field weights are the same for all query concepts of the
same type.
Example
capitals in Europe which were host cities of summer Olympic games
13/31
Parametric extension of FSDM
wT
qi ,j =
k
αU
j,kφk(qi , j)
13/31
Parametric extension of FSDM
wT
qi ,j =
k
αU
j,kφk(qi , j)
• φk(qi , j) is the the k-th feature value for unigram qi in field j.
13/31
Parametric extension of FSDM
wT
qi ,j =
k
αU
j,kφk(qi , j)
• φk(qi , j) is the the k-th feature value for unigram qi in field j.
• αU
j,k are feature weights that we learn.
13/31
Parametric extension of FSDM
wT
qi ,j =
k
αU
j,kφk(qi , j)
• φk(qi , j) is the the k-th feature value for unigram qi in field j.
• αU
j,k are feature weights that we learn.
j
wT
qi ,j = 1, wT
qi ,j ≥ 0, αU
j,k ≥ 0, 0 ≤ φk(qi , j) ≤ 1
13/31
Parametric extension of FSDM
wT
qi ,j =
k
αU
j,kφk(qi , j)
• φk(qi , j) is the the k-th feature value for unigram qi in field j.
• αU
j,k are feature weights that we learn.
j
wT
qi ,j = 1, wT
qi ,j ≥ 0, αU
j,k ≥ 0, 0 ≤ φk(qi , j) ≤ 1
PFFDM is the same, but uses full dependence model.
14/31
Features
Source Feature Description CT
Collection
statistics
FP(κ, j) Posterior probability P(Ej|w). UG BG
TS(κ, j) Top SDM score on j-th field
when κ is used as a query.
BG
14/31
Features
Source Feature Description CT
Collection
statistics
FP(κ, j) Posterior probability P(Ej|w). UG BG
TS(κ, j) Top SDM score on j-th field
when κ is used as a query.
BG
Stanford
POS
Tagger
NNP(κ) Is concept κ a proper noun? UG
NNS(κ) Is κ a plural non-proper noun? UG BG
JJS(κ) Is κ a superlative adjective? UG
Stanford
Parser
NPP(κ) Is κ part of a noun phrase? BG
NNO(κ) Is κ the only singular non-
proper noun in a noun phrase?
UG
INT Intercept feature (= 1). UG BG
15/31
Parameters of PFSDM
Both PFSDM and PFFDM have F ∗ U + F ∗ B + 3 free
parameters: ˆαU, ˆαB, ˆλ .
We perform direct optimization w.r.t. target metric (e.g. MAP)
using coordinate ascent.
16/31
Collections
1 DBPedia 3.7
• Structured version of on-line encyclopedia Wikipedia
• Provides the descriptions of over 3.5 million entities belonging
to 320 classes
2 BTC-2009
• Contains entities from multiple knowledge bases.
• Consists of 1.14 billion RDF triples.
17/31
Query sets
Balog and Neumayer. A Test Collection for Entity Search in
DBpedia, SIGIR’13.
Query set Amount Query types [Pound et al., 2010]
SemSearch ES 130 Entity
ListSearch 115 Type
INEX-LD 100 Entity, Type, Attribute, Relation
QALD-2 140 Entity, Type, Attribute, Relation
Only SemSearch ES judgments are available for BTC-2009, so for
BTC-2009 we used only this query set.
18/31
Real-valued features analysis
19/31
Real-valued features analysis
20/31
Real-valued features analysis
21/31
Real-valued features analysis
22/31
NLP-based features analysis
23/31
NLP-based features analysis
24/31
NLP-based features analysis
25/31
NLP-based features analysis
26/31
Feature effectiveness
0.225
0.230
0.235
0.240
UG:FP,NNP,NNS;BG:TS,NPP,NNS
UG:FP,NNS;BG:INT,TS
UG:FP,NNS;BG:INT,TS,NPP,NNS
UG:FP,NNS;BG:INT,FP,TS,NPP,NNS
UG:FP,NNP;BG:FP,TS,NPP,NNS
UG:FP,NNS;BG:INT
UG:FP,NNP,NNS;BG:INT,TS,NPP,NNS
UG:FP,NNP,NNS;BG:FP,TS,NPP,NNS
UG:FP,NNS;BG:FP,TS,NPP,NNS
UG:FP,NNP,TS;BG:INT,TS
UG:FP,NNP,TS;BG:TS,NPP,NNS
UG:FP,NNP;BG:INT,TS,NPP,NNS
UG:FP,NNS;BG:TS,NPP,NNS
UG:FP,NNP;BG:FP,TS,NPP
UG:FP,NNP,TS;BG:FP,TS,NPP,NNS
UG:FP,NNO;BG:INT,TS,NPP,NNS
UG:FP,NNP;BG:INT,FP,TS,NPP,NNS
UG:FP,NNP,NNS;BG:INT,TS
UG:FP,NNO;BG:FP,TS,NPP
UG:FP,NNP,JJS;BG:TS,NPP,NNS
UG:FP,NNP;BG:TS,NPP,NNS
UG:FP,NNP,JJS;BG:FP,TS,NPP,NNS
UG:FP,NNP,JJS;BG:INT,TS,NPP,NNS
UG:FP,NNP,JJS;BG:INT,FP,TS,NPP,NNS
UG:FP,NNO;BG:INT,TS
UG:FP,NNS;BG:FP,TS,NPP
UG:FP,NNO;BG:INT
UG:FP,NNO;BG:INT,FP,TS,NPP,NNS
UG:FP,NNO;BG:TS,NPP,NNS
UG:FP,NNP,TS;BG:INT,FP,TS,NPP,NNS
UG:FP,NNP,NNS;BG:FP,TS,NPP
UG:FP,NNP;BG:INT,TS
UG:FP,NNO;BG:FP,TS,NPP,NNS
UG:FP,NNP,TS;BG:INT
UG:FP,NNP,TS;BG:INT,TS,NPP,NNS
UG:FP,NNP,JJS;BG:INT
UG:FP,NNP,JJS;BG:INT,TS
UG:FP,NNP,NNS;BG:INT
UG:FP,NNP;BG:INT
UG:FP,NNP,JJS;BG:FP,TS,NPP
UG:FP,NNP,TS;BG:FP,TS,NPP
UG:FP,NNP,NNS;BG:INT,FP,TS,NPP,NNS
map
27/31
DBpedia results (using best features combination)
Query set Method MAP P@10 P@20 b-pref
SemSearch ES
PRMS 0.230 0.177 0.549 0.317
FSDM 0.386 0.286 0.737 0.476
PFSDM 0.394∗
0.286∗
0.757∗
0.494∗
†
FFDM 0.389∗
0.286∗
0.734∗
0.479∗
PFFDM 0.380∗
0.286∗
0.739∗
0.477∗
ListSearch
PRMS 0.111 0.154 0.355 0.176
FSDM 0.203 0.256 0.447 0.274
PFSDM 0.201∗
0.253∗
0.443∗
0.278∗
FFDM 0.226∗
†
0.282∗
†
0.499∗
†
0.313∗
†
PFFDM 0.228∗
†
0.286∗
†
0.487∗
0.302∗
†
INEX-LD
PRMS 0.064 0.145 0.409 0.216
FSDM 0.111 0.263 0.546 0.322
PFSDM 0.116∗
0.259∗
0.579∗
0.341∗
FFDM 0.122∗
†
0.273∗
0.560∗
0.345∗
†
PFFDM 0.121∗
†
0.274∗
0.556∗
0.343∗
QALD-2
PRMS 0.120 0.079 0.188 0.147
FSDM 0.195 0.136 0.283 0.229
PFSDM 0.218∗
†
0.140∗
0.308∗
0.253∗
†
FFDM 0.200∗
0.139∗
0.292∗
0.237∗
PFFDM 0.219∗
†
0.147∗
0.310∗
0.267∗
†
All queries
PRMS 0.136 0.136 0.370 0.214
FSDM 0.231 0.231 0.498 0.325
PFSDM 0.240∗
†
0.231∗
0.516∗
†
0.342∗
†
FFDM 0.241∗
†
0.240∗
†
0.515∗
†
0.342∗
†
PFFDM 0.244∗
†
0.244∗
†
0.518∗
†
0.347∗
†
27/31
DBpedia results (using best features combination)
Query set Method MAP P@10 P@20 b-pref
SemSearch ES
PRMS 0.230 0.177 0.549 0.317
FSDM 0.386 0.286 0.737 0.476
PFSDM 0.394∗
0.286∗
0.757∗
0.494∗
†
FFDM 0.389∗
0.286∗
0.734∗
0.479∗
PFFDM 0.380∗
0.286∗
0.739∗
0.477∗
ListSearch
PRMS 0.111 0.154 0.355 0.176
FSDM 0.203 0.256 0.447 0.274
PFSDM 0.201∗
0.253∗
0.443∗
0.278∗
FFDM 0.226∗
†
0.282∗
†
0.499∗
†
0.313∗
†
PFFDM 0.228∗
†
0.286∗
†
0.487∗
0.302∗
†
INEX-LD
PRMS 0.064 0.145 0.409 0.216
FSDM 0.111 0.263 0.546 0.322
PFSDM 0.116∗
0.259∗
0.579∗
0.341∗
FFDM 0.122∗
†
0.273∗
0.560∗
0.345∗
†
PFFDM 0.121∗
†
0.274∗
0.556∗
0.343∗
QALD-2
PRMS 0.120 0.079 0.188 0.147
FSDM 0.195 0.136 0.283 0.229
PFSDM 0.218∗
†
0.140∗
0.308∗
0.253∗
†
FFDM 0.200∗
0.139∗
0.292∗
0.237∗
PFFDM 0.219∗
†
0.147∗
0.310∗
0.267∗
†
All queries
PRMS 0.136 0.136 0.370 0.214
FSDM 0.231 0.231 0.498 0.325
PFSDM 0.240∗
†
0.231∗
0.516∗
†
0.342∗
†
FFDM 0.241∗
†
0.240∗
†
0.515∗
†
0.342∗
†
PFFDM 0.244∗
†
0.244∗
†
0.518∗
†
0.347∗
†
27/31
DBpedia results (using best features combination)
Query set Method MAP P@10 P@20 b-pref
SemSearch ES
PRMS 0.230 0.177 0.549 0.317
FSDM 0.386 0.286 0.737 0.476
PFSDM 0.394∗
0.286∗
0.757∗
0.494∗
†
FFDM 0.389∗
0.286∗
0.734∗
0.479∗
PFFDM 0.380∗
0.286∗
0.739∗
0.477∗
ListSearch
PRMS 0.111 0.154 0.355 0.176
FSDM 0.203 0.256 0.447 0.274
PFSDM 0.201∗
0.253∗
0.443∗
0.278∗
FFDM 0.226∗
†
0.282∗
†
0.499∗
†
0.313∗
†
PFFDM 0.228∗
†
0.286∗
†
0.487∗
0.302∗
†
INEX-LD
PRMS 0.064 0.145 0.409 0.216
FSDM 0.111 0.263 0.546 0.322
PFSDM 0.116∗
0.259∗
0.579∗
0.341∗
FFDM 0.122∗
†
0.273∗
0.560∗
0.345∗
†
PFFDM 0.121∗
†
0.274∗
0.556∗
0.343∗
QALD-2
PRMS 0.120 0.079 0.188 0.147
FSDM 0.195 0.136 0.283 0.229
PFSDM 0.218∗
†
0.140∗
0.308∗
0.253∗
†
FFDM 0.200∗
0.139∗
0.292∗
0.237∗
PFFDM 0.219∗
†
0.147∗
0.310∗
0.267∗
†
All queries
PRMS 0.136 0.136 0.370 0.214
FSDM 0.231 0.231 0.498 0.325
PFSDM 0.240∗
†
0.231∗
0.516∗
†
0.342∗
†
FFDM 0.241∗
†
0.240∗
†
0.515∗
†
0.342∗
†
PFFDM 0.244∗
†
0.244∗
†
0.518∗
†
0.347∗
†
27/31
DBpedia results (using best features combination)
Query set Method MAP P@10 P@20 b-pref
SemSearch ES
PRMS 0.230 0.177 0.549 0.317
FSDM 0.386 0.286 0.737 0.476
PFSDM 0.394∗
0.286∗
0.757∗
0.494∗
†
FFDM 0.389∗
0.286∗
0.734∗
0.479∗
PFFDM 0.380∗
0.286∗
0.739∗
0.477∗
ListSearch
PRMS 0.111 0.154 0.355 0.176
FSDM 0.203 0.256 0.447 0.274
PFSDM 0.201∗
0.253∗
0.443∗
0.278∗
FFDM 0.226∗
†
0.282∗
†
0.499∗
†
0.313∗
†
PFFDM 0.228∗
†
0.286∗
†
0.487∗
0.302∗
†
INEX-LD
PRMS 0.064 0.145 0.409 0.216
FSDM 0.111 0.263 0.546 0.322
PFSDM 0.116∗
0.259∗
0.579∗
0.341∗
FFDM 0.122∗
†
0.273∗
0.560∗
0.345∗
†
PFFDM 0.121∗
†
0.274∗
0.556∗
0.343∗
QALD-2
PRMS 0.120 0.079 0.188 0.147
FSDM 0.195 0.136 0.283 0.229
PFSDM 0.218∗
†
0.140∗
0.308∗
0.253∗
†
FFDM 0.200∗
0.139∗
0.292∗
0.237∗
PFFDM 0.219∗
†
0.147∗
0.310∗
0.267∗
†
All queries
PRMS 0.136 0.136 0.370 0.214
FSDM 0.231 0.231 0.498 0.325
PFSDM 0.240∗
†
0.231∗
0.516∗
†
0.342∗
†
FFDM 0.241∗
†
0.240∗
†
0.515∗
†
0.342∗
†
PFFDM 0.244∗
†
0.244∗
†
0.518∗
†
0.347∗
†
27/31
DBpedia results (using best features combination)
Query set Method MAP P@10 P@20 b-pref
SemSearch ES
PRMS 0.230 0.177 0.549 0.317
FSDM 0.386 0.286 0.737 0.476
PFSDM 0.394∗
0.286∗
0.757∗
0.494∗
†
FFDM 0.389∗
0.286∗
0.734∗
0.479∗
PFFDM 0.380∗
0.286∗
0.739∗
0.477∗
ListSearch
PRMS 0.111 0.154 0.355 0.176
FSDM 0.203 0.256 0.447 0.274
PFSDM 0.201∗
0.253∗
0.443∗
0.278∗
FFDM 0.226∗
†
0.282∗
†
0.499∗
†
0.313∗
†
PFFDM 0.228∗
†
0.286∗
†
0.487∗
0.302∗
†
INEX-LD
PRMS 0.064 0.145 0.409 0.216
FSDM 0.111 0.263 0.546 0.322
PFSDM 0.116∗
0.259∗
0.579∗
0.341∗
FFDM 0.122∗
†
0.273∗
0.560∗
0.345∗
†
PFFDM 0.121∗
†
0.274∗
0.556∗
0.343∗
QALD-2
PRMS 0.120 0.079 0.188 0.147
FSDM 0.195 0.136 0.283 0.229
PFSDM 0.218∗
†
0.140∗
0.308∗
0.253∗
†
FFDM 0.200∗
0.139∗
0.292∗
0.237∗
PFFDM 0.219∗
†
0.147∗
0.310∗
0.267∗
†
All queries
PRMS 0.136 0.136 0.370 0.214
FSDM 0.231 0.231 0.498 0.325
PFSDM 0.240∗
†
0.231∗
0.516∗
†
0.342∗
†
FFDM 0.241∗
†
0.240∗
†
0.515∗
†
0.342∗
†
PFFDM 0.244∗
†
0.244∗
†
0.518∗
†
0.347∗
†
28/31
BTC results
Method MAP P@10 P@20 b-pref
PRMS 0.098 0.198 0.545 0.269
FSDM 0.171 0.323 0.631 0.358
PFSDM 0.182∗
† 0.335∗ 0.657∗
† 0.371∗
FFDM 0.180∗
† 0.330∗
† 0.647∗ 0.373∗
†
PFFDM 0.187∗ 0.342∗
† 0.650∗ 0.377∗
0.0
0.3
0.6
0 50 100
(a) PFSDM-FSDM
-0.3
0.0
0.3
0.6
0 50 100
(b) PFFDM-FFDM
29/31
Conclusion
• Entity-centric keyword queries have an implicit structure, with
each element in that structure designating a particular aspect
in multi-fielded representation of relevant entities.
• We proposed two novel models for ad-hoc entity retrieval from
knowledge graph, which account for term dependencies and
perform feature-based projection of query concepts onto the
fields of entity documents.
• By demonstrating the possibility of inferring implicit structure
of keyword queries using linguistic attributes and simple field
statistics of query concepts, the proposed models constitute
an important step in the evolution of models for structured
document retrieval.
30/31
Feature work
We hypothesize that the proposed models can be effective in other
structured information retrieval scenarios, such as product and
social graph search, and leave verification of this hypothesis to
future work.
31/31
Questions?

More Related Content

What's hot

Adapt OWL as a Modular Ontology Language
Adapt OWL as a Modular Ontology LanguageAdapt OWL as a Modular Ontology Language
Adapt OWL as a Modular Ontology LanguageJie Bao
 
Divide and Conquer Semantic Web with Modular
Divide and Conquer Semantic Web with ModularDivide and Conquer Semantic Web with Modular
Divide and Conquer Semantic Web with ModularJie Bao
 
Representing and Reasoning with Modular Ontologies
Representing and Reasoning with Modular OntologiesRepresenting and Reasoning with Modular Ontologies
Representing and Reasoning with Modular OntologiesJie Bao
 
Pointing the Unknown Words
Pointing the Unknown WordsPointing the Unknown Words
Pointing the Unknown Wordshytae
 
3 problem-solving-
3 problem-solving-3 problem-solving-
3 problem-solving-Mhd Sb
 
MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech ...
MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech ...MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech ...
MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech ...multimediaeval
 
Lecture14 xing fei-fei
Lecture14 xing fei-feiLecture14 xing fei-fei
Lecture14 xing fei-feiTianlu Wang
 
Introduction to ggplot2
Introduction to ggplot2Introduction to ggplot2
Introduction to ggplot2maikroeder
 
Introduction to Data structure & Algorithms - Sethuonline.com | Sathyabama Un...
Introduction to Data structure & Algorithms - Sethuonline.com | Sathyabama Un...Introduction to Data structure & Algorithms - Sethuonline.com | Sathyabama Un...
Introduction to Data structure & Algorithms - Sethuonline.com | Sathyabama Un...sethuraman R
 
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015RIILP
 
Software tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsSoftware tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsbutest
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...IAEME Publication
 
Basic review on topic modeling
Basic review on  topic modelingBasic review on  topic modeling
Basic review on topic modelingHiroyuki Kuromiya
 
Like Alice in Wonderland: Unraveling Reasoning and Cognition Using Analogies ...
Like Alice in Wonderland: Unraveling Reasoning and Cognition Using Analogies ...Like Alice in Wonderland: Unraveling Reasoning and Cognition Using Analogies ...
Like Alice in Wonderland: Unraveling Reasoning and Cognition Using Analogies ...Facultad de Informática UCM
 
Topic modeling using big data analytics
Topic modeling using big data analyticsTopic modeling using big data analytics
Topic modeling using big data analyticsFarheen Nilofer
 

What's hot (20)

Adapt OWL as a Modular Ontology Language
Adapt OWL as a Modular Ontology LanguageAdapt OWL as a Modular Ontology Language
Adapt OWL as a Modular Ontology Language
 
Divide and Conquer Semantic Web with Modular
Divide and Conquer Semantic Web with ModularDivide and Conquer Semantic Web with Modular
Divide and Conquer Semantic Web with Modular
 
Representing and Reasoning with Modular Ontologies
Representing and Reasoning with Modular OntologiesRepresenting and Reasoning with Modular Ontologies
Representing and Reasoning with Modular Ontologies
 
Pointing the Unknown Words
Pointing the Unknown WordsPointing the Unknown Words
Pointing the Unknown Words
 
3 problem-solving-
3 problem-solving-3 problem-solving-
3 problem-solving-
 
5 csp
5 csp5 csp
5 csp
 
MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech ...
MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech ...MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech ...
MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech ...
 
record_linking
record_linkingrecord_linking
record_linking
 
Lecture14 xing fei-fei
Lecture14 xing fei-feiLecture14 xing fei-fei
Lecture14 xing fei-fei
 
Introduction to ggplot2
Introduction to ggplot2Introduction to ggplot2
Introduction to ggplot2
 
Introduction to Data structure & Algorithms - Sethuonline.com | Sathyabama Un...
Introduction to Data structure & Algorithms - Sethuonline.com | Sathyabama Un...Introduction to Data structure & Algorithms - Sethuonline.com | Sathyabama Un...
Introduction to Data structure & Algorithms - Sethuonline.com | Sathyabama Un...
 
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
 
Lecture20 xing
Lecture20 xingLecture20 xing
Lecture20 xing
 
Kaggle kenneth
Kaggle kennethKaggle kenneth
Kaggle kenneth
 
Software tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsSoftware tookits for machine learning and graphical models
Software tookits for machine learning and graphical models
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...
 
COMPUTER LABORATORY-4 LAB MANUAL BE COMPUTER ENGINEERING
COMPUTER LABORATORY-4 LAB MANUAL BE COMPUTER ENGINEERINGCOMPUTER LABORATORY-4 LAB MANUAL BE COMPUTER ENGINEERING
COMPUTER LABORATORY-4 LAB MANUAL BE COMPUTER ENGINEERING
 
Basic review on topic modeling
Basic review on  topic modelingBasic review on  topic modeling
Basic review on topic modeling
 
Like Alice in Wonderland: Unraveling Reasoning and Cognition Using Analogies ...
Like Alice in Wonderland: Unraveling Reasoning and Cognition Using Analogies ...Like Alice in Wonderland: Unraveling Reasoning and Cognition Using Analogies ...
Like Alice in Wonderland: Unraveling Reasoning and Cognition Using Analogies ...
 
Topic modeling using big data analytics
Topic modeling using big data analyticsTopic modeling using big data analytics
Topic modeling using big data analytics
 

Similar to Parameterized Fielded Term Dependence Models for Ad-hoc Entity Retrieval from Knowledge Graph

Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...FedorNikolaev
 
Exploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalExploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalFaegheh Hasibi
 
Lecture13 xing fei-fei
Lecture13 xing fei-feiLecture13 xing fei-fei
Lecture13 xing fei-feiTianlu Wang
 
Topic model an introduction
Topic model an introductionTopic model an introduction
Topic model an introductionYueshen Xu
 
Summary of SIGIR 2011 Papers
Summary of SIGIR 2011 PapersSummary of SIGIR 2011 Papers
Summary of SIGIR 2011 Paperschetanagavankar
 
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...Gota Morota
 
Query Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesQuery Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesJie Bao
 
Sparse Kernel Learning for Image Annotation
Sparse Kernel Learning for Image AnnotationSparse Kernel Learning for Image Annotation
Sparse Kernel Learning for Image AnnotationSean Moran
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligencevini89
 
Tutorial: Context In Recommender Systems
Tutorial: Context In Recommender SystemsTutorial: Context In Recommender Systems
Tutorial: Context In Recommender SystemsYONG ZHENG
 
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...National Institute of Informatics
 
Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)krisztianbalog
 
Introduction to Functional Data Analysis
Introduction to Functional Data AnalysisIntroduction to Functional Data Analysis
Introduction to Functional Data AnalysisRené Franck Essomba
 

Similar to Parameterized Fielded Term Dependence Models for Ad-hoc Entity Retrieval from Knowledge Graph (20)

Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
 
A Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF GraphsA Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF Graphs
 
Lecture12 xing
Lecture12 xingLecture12 xing
Lecture12 xing
 
Exploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalExploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity Retrieval
 
ppt0320defenseday
ppt0320defensedayppt0320defenseday
ppt0320defenseday
 
Sigir 2011 proceedings
Sigir 2011 proceedingsSigir 2011 proceedings
Sigir 2011 proceedings
 
Lecture13 xing fei-fei
Lecture13 xing fei-feiLecture13 xing fei-fei
Lecture13 xing fei-fei
 
Topic model an introduction
Topic model an introductionTopic model an introduction
Topic model an introduction
 
RLTopics_2021_Lect1.pdf
RLTopics_2021_Lect1.pdfRLTopics_2021_Lect1.pdf
RLTopics_2021_Lect1.pdf
 
Summary of SIGIR 2011 Papers
Summary of SIGIR 2011 PapersSummary of SIGIR 2011 Papers
Summary of SIGIR 2011 Papers
 
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
 
Query Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesQuery Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data Sources
 
inteSearch: An Intelligent Linked Data Information Access Framework
inteSearch: An Intelligent Linked Data Information Access FrameworkinteSearch: An Intelligent Linked Data Information Access Framework
inteSearch: An Intelligent Linked Data Information Access Framework
 
Sparse Kernel Learning for Image Annotation
Sparse Kernel Learning for Image AnnotationSparse Kernel Learning for Image Annotation
Sparse Kernel Learning for Image Annotation
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Tutorial: Context In Recommender Systems
Tutorial: Context In Recommender SystemsTutorial: Context In Recommender Systems
Tutorial: Context In Recommender Systems
 
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
 
Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)
 
Introduction to Functional Data Analysis
Introduction to Functional Data AnalysisIntroduction to Functional Data Analysis
Introduction to Functional Data Analysis
 
Entity-Relationship Queries over Wikipedia
Entity-Relationship Queries over WikipediaEntity-Relationship Queries over Wikipedia
Entity-Relationship Queries over Wikipedia
 

Recently uploaded

Fun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdfFun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdfhoangquan21999
 
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...kevin8smith
 
TEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfTEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfmarcuskenyatta275
 
MSCII_ FCT UNIT 5 TOXICOLOGY.pdf
MSCII_              FCT UNIT 5 TOXICOLOGY.pdfMSCII_              FCT UNIT 5 TOXICOLOGY.pdf
MSCII_ FCT UNIT 5 TOXICOLOGY.pdfSuchita Rawat
 
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...Ansari Aashif Raza Mohd Imtiyaz
 
PHOTOSYNTHETIC BACTERIA (OXYGENIC AND ANOXYGENIC)
PHOTOSYNTHETIC BACTERIA  (OXYGENIC AND ANOXYGENIC)PHOTOSYNTHETIC BACTERIA  (OXYGENIC AND ANOXYGENIC)
PHOTOSYNTHETIC BACTERIA (OXYGENIC AND ANOXYGENIC)kushbuR
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptxMuhammadRazzaq31
 
Classification of Kerogen, Perspective on palynofacies in depositional envi...
Classification of Kerogen,  Perspective on palynofacies in depositional  envi...Classification of Kerogen,  Perspective on palynofacies in depositional  envi...
Classification of Kerogen, Perspective on palynofacies in depositional envi...Sangram Sahoo
 
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed RahimoonVital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoonintarciacompanies
 
A Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert EinsteinA Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert Einsteinxgamestudios8
 
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptx
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptxPOST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptx
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptxArpitaMishra69
 
GBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationGBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationAreesha Ahmad
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisAreesha Ahmad
 
PARENTAL CARE IN FISHES.pptx for 5th sem
PARENTAL CARE IN FISHES.pptx for 5th semPARENTAL CARE IN FISHES.pptx for 5th sem
PARENTAL CARE IN FISHES.pptx for 5th semborkhotudu123
 
Taphonomy and Quality of the Fossil Record
Taphonomy and Quality of the  Fossil RecordTaphonomy and Quality of the  Fossil Record
Taphonomy and Quality of the Fossil RecordSangram Sahoo
 
Electricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 studentsElectricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 studentslevieagacer
 
Factor Causing low production and physiology of mamary Gland
Factor Causing low production and physiology of mamary GlandFactor Causing low production and physiology of mamary Gland
Factor Causing low production and physiology of mamary GlandRcvets
 
NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.syedmuneemqadri
 

Recently uploaded (20)

Fun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdfFun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdf
 
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
 
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
 
TEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfTEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdf
 
MSCII_ FCT UNIT 5 TOXICOLOGY.pdf
MSCII_              FCT UNIT 5 TOXICOLOGY.pdfMSCII_              FCT UNIT 5 TOXICOLOGY.pdf
MSCII_ FCT UNIT 5 TOXICOLOGY.pdf
 
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
 
PHOTOSYNTHETIC BACTERIA (OXYGENIC AND ANOXYGENIC)
PHOTOSYNTHETIC BACTERIA  (OXYGENIC AND ANOXYGENIC)PHOTOSYNTHETIC BACTERIA  (OXYGENIC AND ANOXYGENIC)
PHOTOSYNTHETIC BACTERIA (OXYGENIC AND ANOXYGENIC)
 
HIV AND INFULENZA VIRUS PPT HIV PPT INFULENZA VIRUS PPT
HIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPTHIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPT
HIV AND INFULENZA VIRUS PPT HIV PPT INFULENZA VIRUS PPT
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptx
 
Classification of Kerogen, Perspective on palynofacies in depositional envi...
Classification of Kerogen,  Perspective on palynofacies in depositional  envi...Classification of Kerogen,  Perspective on palynofacies in depositional  envi...
Classification of Kerogen, Perspective on palynofacies in depositional envi...
 
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed RahimoonVital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
 
A Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert EinsteinA Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert Einstein
 
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptx
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptxPOST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptx
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptx
 
GBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationGBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolation
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of Asepsis
 
PARENTAL CARE IN FISHES.pptx for 5th sem
PARENTAL CARE IN FISHES.pptx for 5th semPARENTAL CARE IN FISHES.pptx for 5th sem
PARENTAL CARE IN FISHES.pptx for 5th sem
 
Taphonomy and Quality of the Fossil Record
Taphonomy and Quality of the  Fossil RecordTaphonomy and Quality of the  Fossil Record
Taphonomy and Quality of the Fossil Record
 
Electricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 studentsElectricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 students
 
Factor Causing low production and physiology of mamary Gland
Factor Causing low production and physiology of mamary GlandFactor Causing low production and physiology of mamary Gland
Factor Causing low production and physiology of mamary Gland
 
NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.
 

Parameterized Fielded Term Dependence Models for Ad-hoc Entity Retrieval from Knowledge Graph

  • 1. 1/31 Parameterized Fielded Term Dependence Models for Ad-hoc Entity Retrieval from Knowledge Graph Fedor Nikolaev 1,2 Alexander Kotov 1 Nikita Zhiltsov 2 1Textual Data Analytics Lab, Department of Computer Science, Wayne State University 2Kazan Federal University
  • 3. 3/31 Entities • Material objects or concepts in the real world or fiction (e.g. people, movies, conferences etc.) • Are connected with other entities by relations (e.g. hasGenre, actedIn, isPCmemberOf etc.) • Subject-Predicate-Object (SPO) triple: subject=entity; object=entity (or primitive data value); predicate=relationship between subject and object • Many SPO triples → knowledge graph
  • 5. 5/31 Entity Retrieval from Knowledge Graph(s) • Graph KBs are perfectly suited for addressing the information needs that aim at finding specific objects (entities) rather than documents • Given the user’s information need expressed as a keyword query, retrieve a relevant set of objects from the knowledge graph(s)
  • 6. 6/31 Typical ERWD tasks • Entity Search Queries refer to a particular entity. • “Ben Franklin” • “England football player highest paid” • “Einstein Relativity theory” • List Search Complex queries with several relevant entities. • “US presidents since 1960” • “animals lay eggs mammals” • Question Answering Queries are questions in natural language. • “Who is the architect of leaning tower of Pisa?” • “For which label did Elvis record his first album?”
  • 7. 7/31 Entity document An entity is represented as a structured (multi-fielded) document: names Conventional names of the entities, such as the name of a person or the name of an organization attributes All entity properties, other than names categories Classes or groups, to which the entity has been assigned similar entity names Names of the entities that are very similar or identical to a given entity related entity names Names of the entities that are part of the same RDF triple
  • 8. 8/31 Entity document example Multi-fielded entity document for the entity Barack Obama. Field Content names barack obama barack hussein obama ii attributes 44th current president united states birth place honolulu hawaii categories democratic party united states senator nobel peace prize laureate christian similar entity names barack obama jr barak hussein obama barack h obama ii related entity names spouse michelle obama illinois state predecessor george walker bush
  • 9. 9/31 PRMS P(Q|d) = qi ∈Q j PM(Ej|qi )PQL(qi |ej), where PM(Ej|w) = PM(w|Ej)PM(Ej) Ek ∈E PM(w|Ek)PM(Ek)
  • 10. 10/31 SDM, FDM Ranks w.r.t. PΛ(D|Q) = i∈{T,U,O} λi fi (Q, D) Potential function for unigrams is QL: fT (qi , D) = log P(qi |θD) = log tfqi ,D + µ cfqi |C| |D| + µ SDM only considers two-word sequences in queries, FDM considers all two-word combinations.
  • 11. 11/31 Fielded SDM FSDM incorporates document structure and term dependencies into the one ranking model. Potential function for unigrams in case of FSDM: ˜fT (qi , D) = log j wT j P(qi |θj D) = log j wT j tfqi ,Dj + µj cf j qi |Cj | |Dj| + µj
  • 12. 12/31 FSDM limitation In FSDM field weights are the same for all query concepts of the same type. Example capitals in Europe which were host cities of summer Olympic games
  • 13. 13/31 Parametric extension of FSDM wT qi ,j = k αU j,kφk(qi , j)
  • 14. 13/31 Parametric extension of FSDM wT qi ,j = k αU j,kφk(qi , j) • φk(qi , j) is the the k-th feature value for unigram qi in field j.
  • 15. 13/31 Parametric extension of FSDM wT qi ,j = k αU j,kφk(qi , j) • φk(qi , j) is the the k-th feature value for unigram qi in field j. • αU j,k are feature weights that we learn.
  • 16. 13/31 Parametric extension of FSDM wT qi ,j = k αU j,kφk(qi , j) • φk(qi , j) is the the k-th feature value for unigram qi in field j. • αU j,k are feature weights that we learn. j wT qi ,j = 1, wT qi ,j ≥ 0, αU j,k ≥ 0, 0 ≤ φk(qi , j) ≤ 1
  • 17. 13/31 Parametric extension of FSDM wT qi ,j = k αU j,kφk(qi , j) • φk(qi , j) is the the k-th feature value for unigram qi in field j. • αU j,k are feature weights that we learn. j wT qi ,j = 1, wT qi ,j ≥ 0, αU j,k ≥ 0, 0 ≤ φk(qi , j) ≤ 1 PFFDM is the same, but uses full dependence model.
  • 18. 14/31 Features Source Feature Description CT Collection statistics FP(κ, j) Posterior probability P(Ej|w). UG BG TS(κ, j) Top SDM score on j-th field when κ is used as a query. BG
  • 19. 14/31 Features Source Feature Description CT Collection statistics FP(κ, j) Posterior probability P(Ej|w). UG BG TS(κ, j) Top SDM score on j-th field when κ is used as a query. BG Stanford POS Tagger NNP(κ) Is concept κ a proper noun? UG NNS(κ) Is κ a plural non-proper noun? UG BG JJS(κ) Is κ a superlative adjective? UG Stanford Parser NPP(κ) Is κ part of a noun phrase? BG NNO(κ) Is κ the only singular non- proper noun in a noun phrase? UG INT Intercept feature (= 1). UG BG
  • 20. 15/31 Parameters of PFSDM Both PFSDM and PFFDM have F ∗ U + F ∗ B + 3 free parameters: ˆαU, ˆαB, ˆλ . We perform direct optimization w.r.t. target metric (e.g. MAP) using coordinate ascent.
  • 21. 16/31 Collections 1 DBPedia 3.7 • Structured version of on-line encyclopedia Wikipedia • Provides the descriptions of over 3.5 million entities belonging to 320 classes 2 BTC-2009 • Contains entities from multiple knowledge bases. • Consists of 1.14 billion RDF triples.
  • 22. 17/31 Query sets Balog and Neumayer. A Test Collection for Entity Search in DBpedia, SIGIR’13. Query set Amount Query types [Pound et al., 2010] SemSearch ES 130 Entity ListSearch 115 Type INEX-LD 100 Entity, Type, Attribute, Relation QALD-2 140 Entity, Type, Attribute, Relation Only SemSearch ES judgments are available for BTC-2009, so for BTC-2009 we used only this query set.
  • 31. 26/31 Feature effectiveness 0.225 0.230 0.235 0.240 UG:FP,NNP,NNS;BG:TS,NPP,NNS UG:FP,NNS;BG:INT,TS UG:FP,NNS;BG:INT,TS,NPP,NNS UG:FP,NNS;BG:INT,FP,TS,NPP,NNS UG:FP,NNP;BG:FP,TS,NPP,NNS UG:FP,NNS;BG:INT UG:FP,NNP,NNS;BG:INT,TS,NPP,NNS UG:FP,NNP,NNS;BG:FP,TS,NPP,NNS UG:FP,NNS;BG:FP,TS,NPP,NNS UG:FP,NNP,TS;BG:INT,TS UG:FP,NNP,TS;BG:TS,NPP,NNS UG:FP,NNP;BG:INT,TS,NPP,NNS UG:FP,NNS;BG:TS,NPP,NNS UG:FP,NNP;BG:FP,TS,NPP UG:FP,NNP,TS;BG:FP,TS,NPP,NNS UG:FP,NNO;BG:INT,TS,NPP,NNS UG:FP,NNP;BG:INT,FP,TS,NPP,NNS UG:FP,NNP,NNS;BG:INT,TS UG:FP,NNO;BG:FP,TS,NPP UG:FP,NNP,JJS;BG:TS,NPP,NNS UG:FP,NNP;BG:TS,NPP,NNS UG:FP,NNP,JJS;BG:FP,TS,NPP,NNS UG:FP,NNP,JJS;BG:INT,TS,NPP,NNS UG:FP,NNP,JJS;BG:INT,FP,TS,NPP,NNS UG:FP,NNO;BG:INT,TS UG:FP,NNS;BG:FP,TS,NPP UG:FP,NNO;BG:INT UG:FP,NNO;BG:INT,FP,TS,NPP,NNS UG:FP,NNO;BG:TS,NPP,NNS UG:FP,NNP,TS;BG:INT,FP,TS,NPP,NNS UG:FP,NNP,NNS;BG:FP,TS,NPP UG:FP,NNP;BG:INT,TS UG:FP,NNO;BG:FP,TS,NPP,NNS UG:FP,NNP,TS;BG:INT UG:FP,NNP,TS;BG:INT,TS,NPP,NNS UG:FP,NNP,JJS;BG:INT UG:FP,NNP,JJS;BG:INT,TS UG:FP,NNP,NNS;BG:INT UG:FP,NNP;BG:INT UG:FP,NNP,JJS;BG:FP,TS,NPP UG:FP,NNP,TS;BG:FP,TS,NPP UG:FP,NNP,NNS;BG:INT,FP,TS,NPP,NNS map
  • 32. 27/31 DBpedia results (using best features combination) Query set Method MAP P@10 P@20 b-pref SemSearch ES PRMS 0.230 0.177 0.549 0.317 FSDM 0.386 0.286 0.737 0.476 PFSDM 0.394∗ 0.286∗ 0.757∗ 0.494∗ † FFDM 0.389∗ 0.286∗ 0.734∗ 0.479∗ PFFDM 0.380∗ 0.286∗ 0.739∗ 0.477∗ ListSearch PRMS 0.111 0.154 0.355 0.176 FSDM 0.203 0.256 0.447 0.274 PFSDM 0.201∗ 0.253∗ 0.443∗ 0.278∗ FFDM 0.226∗ † 0.282∗ † 0.499∗ † 0.313∗ † PFFDM 0.228∗ † 0.286∗ † 0.487∗ 0.302∗ † INEX-LD PRMS 0.064 0.145 0.409 0.216 FSDM 0.111 0.263 0.546 0.322 PFSDM 0.116∗ 0.259∗ 0.579∗ 0.341∗ FFDM 0.122∗ † 0.273∗ 0.560∗ 0.345∗ † PFFDM 0.121∗ † 0.274∗ 0.556∗ 0.343∗ QALD-2 PRMS 0.120 0.079 0.188 0.147 FSDM 0.195 0.136 0.283 0.229 PFSDM 0.218∗ † 0.140∗ 0.308∗ 0.253∗ † FFDM 0.200∗ 0.139∗ 0.292∗ 0.237∗ PFFDM 0.219∗ † 0.147∗ 0.310∗ 0.267∗ † All queries PRMS 0.136 0.136 0.370 0.214 FSDM 0.231 0.231 0.498 0.325 PFSDM 0.240∗ † 0.231∗ 0.516∗ † 0.342∗ † FFDM 0.241∗ † 0.240∗ † 0.515∗ † 0.342∗ † PFFDM 0.244∗ † 0.244∗ † 0.518∗ † 0.347∗ †
  • 33. 27/31 DBpedia results (using best features combination) Query set Method MAP P@10 P@20 b-pref SemSearch ES PRMS 0.230 0.177 0.549 0.317 FSDM 0.386 0.286 0.737 0.476 PFSDM 0.394∗ 0.286∗ 0.757∗ 0.494∗ † FFDM 0.389∗ 0.286∗ 0.734∗ 0.479∗ PFFDM 0.380∗ 0.286∗ 0.739∗ 0.477∗ ListSearch PRMS 0.111 0.154 0.355 0.176 FSDM 0.203 0.256 0.447 0.274 PFSDM 0.201∗ 0.253∗ 0.443∗ 0.278∗ FFDM 0.226∗ † 0.282∗ † 0.499∗ † 0.313∗ † PFFDM 0.228∗ † 0.286∗ † 0.487∗ 0.302∗ † INEX-LD PRMS 0.064 0.145 0.409 0.216 FSDM 0.111 0.263 0.546 0.322 PFSDM 0.116∗ 0.259∗ 0.579∗ 0.341∗ FFDM 0.122∗ † 0.273∗ 0.560∗ 0.345∗ † PFFDM 0.121∗ † 0.274∗ 0.556∗ 0.343∗ QALD-2 PRMS 0.120 0.079 0.188 0.147 FSDM 0.195 0.136 0.283 0.229 PFSDM 0.218∗ † 0.140∗ 0.308∗ 0.253∗ † FFDM 0.200∗ 0.139∗ 0.292∗ 0.237∗ PFFDM 0.219∗ † 0.147∗ 0.310∗ 0.267∗ † All queries PRMS 0.136 0.136 0.370 0.214 FSDM 0.231 0.231 0.498 0.325 PFSDM 0.240∗ † 0.231∗ 0.516∗ † 0.342∗ † FFDM 0.241∗ † 0.240∗ † 0.515∗ † 0.342∗ † PFFDM 0.244∗ † 0.244∗ † 0.518∗ † 0.347∗ †
  • 34. 27/31 DBpedia results (using best features combination) Query set Method MAP P@10 P@20 b-pref SemSearch ES PRMS 0.230 0.177 0.549 0.317 FSDM 0.386 0.286 0.737 0.476 PFSDM 0.394∗ 0.286∗ 0.757∗ 0.494∗ † FFDM 0.389∗ 0.286∗ 0.734∗ 0.479∗ PFFDM 0.380∗ 0.286∗ 0.739∗ 0.477∗ ListSearch PRMS 0.111 0.154 0.355 0.176 FSDM 0.203 0.256 0.447 0.274 PFSDM 0.201∗ 0.253∗ 0.443∗ 0.278∗ FFDM 0.226∗ † 0.282∗ † 0.499∗ † 0.313∗ † PFFDM 0.228∗ † 0.286∗ † 0.487∗ 0.302∗ † INEX-LD PRMS 0.064 0.145 0.409 0.216 FSDM 0.111 0.263 0.546 0.322 PFSDM 0.116∗ 0.259∗ 0.579∗ 0.341∗ FFDM 0.122∗ † 0.273∗ 0.560∗ 0.345∗ † PFFDM 0.121∗ † 0.274∗ 0.556∗ 0.343∗ QALD-2 PRMS 0.120 0.079 0.188 0.147 FSDM 0.195 0.136 0.283 0.229 PFSDM 0.218∗ † 0.140∗ 0.308∗ 0.253∗ † FFDM 0.200∗ 0.139∗ 0.292∗ 0.237∗ PFFDM 0.219∗ † 0.147∗ 0.310∗ 0.267∗ † All queries PRMS 0.136 0.136 0.370 0.214 FSDM 0.231 0.231 0.498 0.325 PFSDM 0.240∗ † 0.231∗ 0.516∗ † 0.342∗ † FFDM 0.241∗ † 0.240∗ † 0.515∗ † 0.342∗ † PFFDM 0.244∗ † 0.244∗ † 0.518∗ † 0.347∗ †
  • 35. 27/31 DBpedia results (using best features combination) Query set Method MAP P@10 P@20 b-pref SemSearch ES PRMS 0.230 0.177 0.549 0.317 FSDM 0.386 0.286 0.737 0.476 PFSDM 0.394∗ 0.286∗ 0.757∗ 0.494∗ † FFDM 0.389∗ 0.286∗ 0.734∗ 0.479∗ PFFDM 0.380∗ 0.286∗ 0.739∗ 0.477∗ ListSearch PRMS 0.111 0.154 0.355 0.176 FSDM 0.203 0.256 0.447 0.274 PFSDM 0.201∗ 0.253∗ 0.443∗ 0.278∗ FFDM 0.226∗ † 0.282∗ † 0.499∗ † 0.313∗ † PFFDM 0.228∗ † 0.286∗ † 0.487∗ 0.302∗ † INEX-LD PRMS 0.064 0.145 0.409 0.216 FSDM 0.111 0.263 0.546 0.322 PFSDM 0.116∗ 0.259∗ 0.579∗ 0.341∗ FFDM 0.122∗ † 0.273∗ 0.560∗ 0.345∗ † PFFDM 0.121∗ † 0.274∗ 0.556∗ 0.343∗ QALD-2 PRMS 0.120 0.079 0.188 0.147 FSDM 0.195 0.136 0.283 0.229 PFSDM 0.218∗ † 0.140∗ 0.308∗ 0.253∗ † FFDM 0.200∗ 0.139∗ 0.292∗ 0.237∗ PFFDM 0.219∗ † 0.147∗ 0.310∗ 0.267∗ † All queries PRMS 0.136 0.136 0.370 0.214 FSDM 0.231 0.231 0.498 0.325 PFSDM 0.240∗ † 0.231∗ 0.516∗ † 0.342∗ † FFDM 0.241∗ † 0.240∗ † 0.515∗ † 0.342∗ † PFFDM 0.244∗ † 0.244∗ † 0.518∗ † 0.347∗ †
  • 36. 27/31 DBpedia results (using best features combination) Query set Method MAP P@10 P@20 b-pref SemSearch ES PRMS 0.230 0.177 0.549 0.317 FSDM 0.386 0.286 0.737 0.476 PFSDM 0.394∗ 0.286∗ 0.757∗ 0.494∗ † FFDM 0.389∗ 0.286∗ 0.734∗ 0.479∗ PFFDM 0.380∗ 0.286∗ 0.739∗ 0.477∗ ListSearch PRMS 0.111 0.154 0.355 0.176 FSDM 0.203 0.256 0.447 0.274 PFSDM 0.201∗ 0.253∗ 0.443∗ 0.278∗ FFDM 0.226∗ † 0.282∗ † 0.499∗ † 0.313∗ † PFFDM 0.228∗ † 0.286∗ † 0.487∗ 0.302∗ † INEX-LD PRMS 0.064 0.145 0.409 0.216 FSDM 0.111 0.263 0.546 0.322 PFSDM 0.116∗ 0.259∗ 0.579∗ 0.341∗ FFDM 0.122∗ † 0.273∗ 0.560∗ 0.345∗ † PFFDM 0.121∗ † 0.274∗ 0.556∗ 0.343∗ QALD-2 PRMS 0.120 0.079 0.188 0.147 FSDM 0.195 0.136 0.283 0.229 PFSDM 0.218∗ † 0.140∗ 0.308∗ 0.253∗ † FFDM 0.200∗ 0.139∗ 0.292∗ 0.237∗ PFFDM 0.219∗ † 0.147∗ 0.310∗ 0.267∗ † All queries PRMS 0.136 0.136 0.370 0.214 FSDM 0.231 0.231 0.498 0.325 PFSDM 0.240∗ † 0.231∗ 0.516∗ † 0.342∗ † FFDM 0.241∗ † 0.240∗ † 0.515∗ † 0.342∗ † PFFDM 0.244∗ † 0.244∗ † 0.518∗ † 0.347∗ †
  • 37. 28/31 BTC results Method MAP P@10 P@20 b-pref PRMS 0.098 0.198 0.545 0.269 FSDM 0.171 0.323 0.631 0.358 PFSDM 0.182∗ † 0.335∗ 0.657∗ † 0.371∗ FFDM 0.180∗ † 0.330∗ † 0.647∗ 0.373∗ † PFFDM 0.187∗ 0.342∗ † 0.650∗ 0.377∗ 0.0 0.3 0.6 0 50 100 (a) PFSDM-FSDM -0.3 0.0 0.3 0.6 0 50 100 (b) PFFDM-FFDM
  • 38. 29/31 Conclusion • Entity-centric keyword queries have an implicit structure, with each element in that structure designating a particular aspect in multi-fielded representation of relevant entities. • We proposed two novel models for ad-hoc entity retrieval from knowledge graph, which account for term dependencies and perform feature-based projection of query concepts onto the fields of entity documents. • By demonstrating the possibility of inferring implicit structure of keyword queries using linguistic attributes and simple field statistics of query concepts, the proposed models constitute an important step in the evolution of models for structured document retrieval.
  • 39. 30/31 Feature work We hypothesize that the proposed models can be effective in other structured information retrieval scenarios, such as product and social graph search, and leave verification of this hypothesis to future work.