SlideShare a Scribd company logo
1 of 29
Download to read offline
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type-aware Entity Retrieval
Dar´ıo Garigliotti
University of Stavanger
June 10, 2016
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Outline:
1 Types and Entity Retrieval
2 Environment Dimensions
Type taxonomies
Type representations
Retrieval models
3 Type-aware Entity Retrieval
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Types and Entity Retrieval
Traditional Information Retrieval recently extended to an
Entity-oriented Search
It revolves around the satisfaction of more complex
information needs
Several entity elements from knowledge bases, naturally
appearing in queries
Countries where one can pay with the euro
Related entities (via a relation or predicate)
Types or categories or classes
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Types and Entity Retrieval
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Types and Entity Retrieval
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Types and Entity Retrieval
Why to think about types?
Entities are typed
Types are useful for retrieval, presentation,
summarization...
Related tasks, e.g.
Entity ranking (given a query and target categories)
List completion (given a query and entity examples, and?
types)
Query target type identification
Our focus is on emergent dimensions to explore
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Type taxonomies
There are different type taxonomies from various knowledge
bases
DBpedia Ontology
Freebase Types
Wikipedia Categories
YAGO Taxonomy
These vary a lot in terms of hierarchical structure and in how
entity-type assignments are recorded
Normalisation efforts are needed
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
DBpedia Ontology
A well-designed hierarchy
Created manually by
considering the most
frequently used infoboxes
in Wikipedia
Clean and consistent, but
with limited coverage
0
1
2
3
4
5
6
7
|Level 1| = 58 types
|Level 2| = 114 types
|Level 3| = 142 types
|Level 4| = 213 types
|Level 5| = 45 types
|Level 6| = 17 types
|Level 7| = 1 type
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
DBpedia Ontology
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Freebase Types
A two-layer categorization
system: types and
domains
Entities are only assigned
to types, having most of
them “same as” links to
DBpedia entities
0
1
2
|Level 1| = 92 types
|Level 2| = 1, 626 types
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Wikipedia Categories
It consists of textual
labels known as
categories
It’s not a well-defined
“is-a” hierarchy, but a
graph
Category assignments
are neither consistent
nor complete
It requires a major
normalisation strategy
0
1
2-10
11-24
25-
34
|Level 1| = 27 types
|Level 2 ∪ ... ∪ Level 10| =
121, 657 types
|Level 11 ∪ ... ∪ Level 24| =
410, 697 types
|Level 25 ∪ ... ∪ Level 34| =
14, 564 types
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
YAGO Taxonomy
A deep subsumption
hierarchy
Its classification schema is
constructed by taking leaf
categories from Wikipedia
categories and then using
WordNet synsets to
establish the hierarchy
0
1
2-5
6-10
11-
19
|Level 1| = 61 types
|Level 2 ∪ ... ∪ Level 5| =
80, 384 types
|Level 6 ∪ ... ∪ Level 10| =
461, 843 types
|Level 11 ∪ ... ∪ Level 19| =
26, 383 types
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Type representations
How to represent the hierarchical information?
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Type representations
How to represent the hierarchical information?
t3t3
t2t2
t5t5t4t4
t9t9t8t8
e
t6t6
t12t12
t7t7
…
t10t10 t11t11
t0t0
t1t1 …
Type(s) along path
to top
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Type representations
How to represent the hierarchical information?
t3t3
t2t2
t5t5t4t4
t9t9t8t8
e
t6t6
t12t12
t7t7
…
t10t10 t11t11
t0t0
t1t1 …
Type(s) along path
to top
t3t3
t2t2
t5t5t4t4
t9t9t8t8
e
t6t6
t12t12
t7t7
…
t10t10 t11t11
t0t0
t1t1 …
Top-level type(s)
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Type representations
How to represent the hierarchical information?
t3t3
t2t2
t5t5t4t4
t9t9t8t8
e
t6t6
t12t12
t7t7
…
t10t10 t11t11
t0t0
t1t1 …
Type(s) along path
to top
t3t3
t2t2
t5t5t4t4
t9t9t8t8
e
t6t6
t12t12
t7t7
…
t10t10 t11t11
t0t0
t1t1 …
Top-level type(s)
t3t3
t2t2
t5t5t4t4
t9t9t8t8
e
t6t6
t12t12
t7t7
…
t10t10 t11t11
t0t0
t1t1 …
Most specific type(s)
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Retrieval models
Retrieval task
defined in a
generative
probabilistic
framework
P(q | e)
query entity
Olympic games
target types
Rio de Janeiro
term-based
similarity
type-based
similarity
… …
entity types
Both query and entity considered in the term space as well as
in the type space
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Retrieval models
(Strict) Filtering model
P(q | e) = P(θT
q | θT
e ) · χ[types(q) ∩ types(e) = ∅]
Types(q)Types(q) Types(e)Types(e)
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Retrieval models
(Soft) Filtering model
P(q | e) = P(θT
q | θT
e ) · P(θT
q | θT
e )
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Type taxonomies
Type representations
Retrieval models
Retrieval models
Interpolation model
P(q | e) = (1 − λ) · P(θT
q | θT
e ) + λ · P(θT
q | θT
e )
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
What did we do?
Lessons learned
What are we doing?
What did we do?
We systematically identified and compared all combinations of
those dimensions
4 type taxonomies: DBpedia Ontology (3.9), Freebase
Types (2015-03-31), Wikipedia Categories (for DBpedia
3.9) and YAGO Taxonomy (3.0.2)
3 type representations: path-to-top, top-level, most
specific
3 models: strict and soft filtering, interpolation
Environment: from idealized to realistic
query types oracle
entities fully typed in all the taxonomies
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
What did we do?
Lessons learned
What are we doing?
What did we do? Results
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
What did we do?
Lessons learned
What are we doing?
Lessons learned
Summary of insights:
How to represent hierarchical entity type information?
(RQ1) Using the most specific types appears to be the
best way
What (kind of) type taxonomies to use? (RQ2) Wikipedia,
in combination with most specific types, performs the best
in both the idealized and the more realistic scenarios
What combination model to choose? (RQ3) The
interpolation model appears to be more robust
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
What did we do?
Lessons learned
What are we doing?
Further analysis: strict filtering vs interpolation models
Strict filtering treats
target types as a set
Interpolation operates
with a probability
distribution over types
When we drop from
oracle every type
assigned to less than 3
entities, interpolation
adapts quite better
DBpedia Freebase Wikipedia YAGO
Most-specific types
DBpedia Freebase Wikipedia YAGO
Most-specific types
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
What did we do?
Lessons learned
What are we doing?
Further analysis: query-level ranking details
E.g. performance for
(Interpolation, Most
specific level,
Wikipedia-3.9)
query = “Which books by
Kerouac were published
by Viking Press?”
Types: 90 (including
Viking Press books)
Types of the hurt relevant
entities: all contain
Viking Press books
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
What did we do?
Lessons learned
What are we doing?
Further analysis: query-level ranking details
E.g. performance for
(Interpolation, Most
specific level,
Wikipedia-3.9)
query = “Give me all
actors starring in Batman
Begins”
All 7 relevant entities are
improved
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
What did we do?
Lessons learned
What are we doing?
What are we doing?
Automatic query target type detection
Baselines
Entity-centric: first, to rank entities based on their relevance
to the query, then look at what types the top-k ranked
entities have
Type-centric: to build a direct term-based representation for
each type, by aggregating descriptions of entities of that type
Learning-to-rank with several features
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
What did we do?
Lessons learned
What are we doing?
What are we doing? Target type detection
Dar´ıo Garigliotti Type-aware Entity Retrieval
Types and Entity Retrieval
Environment Dimensions
Type-aware Entity Retrieval
Thanks!
Questions?
Dar´ıo Garigliotti Type-aware Entity Retrieval

More Related Content

What's hot

Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)krisztianbalog
 
Entity Search: The Last Decade and the Next
Entity Search: The Last Decade and the NextEntity Search: The Last Decade and the Next
Entity Search: The Last Decade and the Nextkrisztianbalog
 
Evaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented SearchEvaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented Searchkrisztianbalog
 
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)krisztianbalog
 
OWL-XML-Summer-School-09
OWL-XML-Summer-School-09OWL-XML-Summer-School-09
OWL-XML-Summer-School-09Duncan Hull
 
Lect6-An introduction to ontologies and ontology development
Lect6-An introduction to ontologies and ontology developmentLect6-An introduction to ontologies and ontology development
Lect6-An introduction to ontologies and ontology developmentAntonio Moreno
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologiesrobertstevens65
 

What's hot (8)

Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)
 
Entity Search: The Last Decade and the Next
Entity Search: The Last Decade and the NextEntity Search: The Last Decade and the Next
Entity Search: The Last Decade and the Next
 
Evaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented SearchEvaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented Search
 
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
 
OWL-XML-Summer-School-09
OWL-XML-Summer-School-09OWL-XML-Summer-School-09
OWL-XML-Summer-School-09
 
Lect6-An introduction to ontologies and ontology development
Lect6-An introduction to ontologies and ontology developmentLect6-An introduction to ontologies and ontology development
Lect6-An introduction to ontologies and ontology development
 
Entity Linking
Entity LinkingEntity Linking
Entity Linking
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologies
 

Similar to Type-Aware Entity Retrieval

On Type-Aware Entity Retrieval
On Type-Aware Entity RetrievalOn Type-Aware Entity Retrieval
On Type-Aware Entity RetrievalDarío Garigliotti
 
Type Information in Entity Retrieval
Type Information in Entity RetrievalType Information in Entity Retrieval
Type Information in Entity RetrievalDarío Garigliotti
 
Franz et al 2015 escjam 2015 logic resolution taxonomic variable
Franz et al 2015 escjam 2015 logic resolution taxonomic variableFranz et al 2015 escjam 2015 logic resolution taxonomic variable
Franz et al 2015 escjam 2015 logic resolution taxonomic variabletaxonbytes
 
Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Bianca Pereira
 
Global Collection Dashboard – Using data we have to uncover data we don’t
Global Collection Dashboard – Using data we have to uncover data we don’tGlobal Collection Dashboard – Using data we have to uncover data we don’t
Global Collection Dashboard – Using data we have to uncover data we don’tAxiell ALM
 
Penguins in-sweaters-or-serendipitous-entity-search-on-user-generated-content
Penguins in-sweaters-or-serendipitous-entity-search-on-user-generated-contentPenguins in-sweaters-or-serendipitous-entity-search-on-user-generated-content
Penguins in-sweaters-or-serendipitous-entity-search-on-user-generated-contentWenqiang Chen
 
Make your data great again - Ver 2
Make your data great again - Ver 2Make your data great again - Ver 2
Make your data great again - Ver 2Daniel JACOB
 
AI for information management: why and how
AI for information management: why and howAI for information management: why and how
AI for information management: why and howAnna Divoli
 
Digital Object Identifiers (DOIs) in the context of the International Treaty
Digital Object Identifiers (DOIs) in the context of the International TreatyDigital Object Identifiers (DOIs) in the context of the International Treaty
Digital Object Identifiers (DOIs) in the context of the International TreatyFAO
 
A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge DiscoveryA Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge DiscoveryIJERA Editor
 
PATTERN DETECTION WITH RARE ITEM-SET MINING
PATTERN DETECTION WITH RARE ITEM-SET MININGPATTERN DETECTION WITH RARE ITEM-SET MINING
PATTERN DETECTION WITH RARE ITEM-SET MININGijscai
 
Making working thesauri
Making working thesauriMaking working thesauri
Making working thesauriliddy
 
Lecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdfLecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdfJojo314349
 
Pharo: a reflective language A first systematic analysis of reflective APIs
Pharo: a reflective language A first systematic analysis of reflective APIsPharo: a reflective language A first systematic analysis of reflective APIs
Pharo: a reflective language A first systematic analysis of reflective APIsESUG
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisStuart Wrigley
 

Similar to Type-Aware Entity Retrieval (20)

On Type-Aware Entity Retrieval
On Type-Aware Entity RetrievalOn Type-Aware Entity Retrieval
On Type-Aware Entity Retrieval
 
Type Information in Entity Retrieval
Type Information in Entity RetrievalType Information in Entity Retrieval
Type Information in Entity Retrieval
 
Type-Aware Entity Retrieval
Type-Aware Entity RetrievalType-Aware Entity Retrieval
Type-Aware Entity Retrieval
 
Artificial Intelligence in Data Curation
Artificial Intelligence in Data CurationArtificial Intelligence in Data Curation
Artificial Intelligence in Data Curation
 
Type-Aware Entity Retrieval
Type-Aware Entity RetrievalType-Aware Entity Retrieval
Type-Aware Entity Retrieval
 
Ontology
OntologyOntology
Ontology
 
Recommender Systems and Linked Open Data
Recommender Systems and Linked Open DataRecommender Systems and Linked Open Data
Recommender Systems and Linked Open Data
 
Franz et al 2015 escjam 2015 logic resolution taxonomic variable
Franz et al 2015 escjam 2015 logic resolution taxonomic variableFranz et al 2015 escjam 2015 logic resolution taxonomic variable
Franz et al 2015 escjam 2015 logic resolution taxonomic variable
 
Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)
 
Global Collection Dashboard – Using data we have to uncover data we don’t
Global Collection Dashboard – Using data we have to uncover data we don’tGlobal Collection Dashboard – Using data we have to uncover data we don’t
Global Collection Dashboard – Using data we have to uncover data we don’t
 
Penguins in-sweaters-or-serendipitous-entity-search-on-user-generated-content
Penguins in-sweaters-or-serendipitous-entity-search-on-user-generated-contentPenguins in-sweaters-or-serendipitous-entity-search-on-user-generated-content
Penguins in-sweaters-or-serendipitous-entity-search-on-user-generated-content
 
Make your data great again - Ver 2
Make your data great again - Ver 2Make your data great again - Ver 2
Make your data great again - Ver 2
 
AI for information management: why and how
AI for information management: why and howAI for information management: why and how
AI for information management: why and how
 
Digital Object Identifiers (DOIs) in the context of the International Treaty
Digital Object Identifiers (DOIs) in the context of the International TreatyDigital Object Identifiers (DOIs) in the context of the International Treaty
Digital Object Identifiers (DOIs) in the context of the International Treaty
 
A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge DiscoveryA Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
 
PATTERN DETECTION WITH RARE ITEM-SET MINING
PATTERN DETECTION WITH RARE ITEM-SET MININGPATTERN DETECTION WITH RARE ITEM-SET MINING
PATTERN DETECTION WITH RARE ITEM-SET MINING
 
Making working thesauri
Making working thesauriMaking working thesauri
Making working thesauri
 
Lecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdfLecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdf
 
Pharo: a reflective language A first systematic analysis of reflective APIs
Pharo: a reflective language A first systematic analysis of reflective APIsPharo: a reflective language A first systematic analysis of reflective APIs
Pharo: a reflective language A first systematic analysis of reflective APIs
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log Analysis
 

More from Darío Garigliotti

Task-Based Support in Search Engines
Task-Based Support in Search EnginesTask-Based Support in Search Engines
Task-Based Support in Search EnginesDarío Garigliotti
 
About "Towards Better Text Understanding and Retrieval through Kernel Entity ...
About "Towards Better Text Understanding and Retrieval through Kernel Entity ...About "Towards Better Text Understanding and Retrieval through Kernel Entity ...
About "Towards Better Text Understanding and Retrieval through Kernel Entity ...Darío Garigliotti
 
A Semantic Search Approach to Task-Completion Engines
A Semantic Search Approach to Task-Completion EnginesA Semantic Search Approach to Task-Completion Engines
A Semantic Search Approach to Task-Completion EnginesDarío Garigliotti
 
A Semantic Search Approach to Task-Completion Engines
A Semantic Search Approach to Task-Completion EnginesA Semantic Search Approach to Task-Completion Engines
A Semantic Search Approach to Task-Completion EnginesDarío Garigliotti
 
A Knowledge Base of Entity-Oriented Search Intents
A Knowledge Base of Entity-Oriented Search IntentsA Knowledge Base of Entity-Oriented Search Intents
A Knowledge Base of Entity-Oriented Search IntentsDarío Garigliotti
 
Learning-to-Rank Target Types for Entity-Bearing Queries
Learning-to-Rank Target Types for Entity-Bearing QueriesLearning-to-Rank Target Types for Entity-Bearing Queries
Learning-to-Rank Target Types for Entity-Bearing QueriesDarío Garigliotti
 
If this is the answer, what was the question?
If this is the answer, what was the question?If this is the answer, what was the question?
If this is the answer, what was the question?Darío Garigliotti
 
Semi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationSemi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationDarío Garigliotti
 
Semi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationSemi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationDarío Garigliotti
 
Semi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationSemi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationDarío Garigliotti
 
FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...
FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...
FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...Darío Garigliotti
 
Machine Learning - Classification (ctd.)
Machine Learning - Classification (ctd.)Machine Learning - Classification (ctd.)
Machine Learning - Classification (ctd.)Darío Garigliotti
 
Machine Learning - Classification
Machine Learning - ClassificationMachine Learning - Classification
Machine Learning - ClassificationDarío Garigliotti
 
Data Mining - Introduction and Data
Data Mining - Introduction and DataData Mining - Introduction and Data
Data Mining - Introduction and DataDarío Garigliotti
 

More from Darío Garigliotti (20)

Task-Based Support in Search Engines
Task-Based Support in Search EnginesTask-Based Support in Search Engines
Task-Based Support in Search Engines
 
Task Recommendation
Task RecommendationTask Recommendation
Task Recommendation
 
About "Towards Better Text Understanding and Retrieval through Kernel Entity ...
About "Towards Better Text Understanding and Retrieval through Kernel Entity ...About "Towards Better Text Understanding and Retrieval through Kernel Entity ...
About "Towards Better Text Understanding and Retrieval through Kernel Entity ...
 
A Semantic Search Approach to Task-Completion Engines
A Semantic Search Approach to Task-Completion EnginesA Semantic Search Approach to Task-Completion Engines
A Semantic Search Approach to Task-Completion Engines
 
A Summary of ECIR'18
A Summary of ECIR'18A Summary of ECIR'18
A Summary of ECIR'18
 
A Semantic Search Approach to Task-Completion Engines
A Semantic Search Approach to Task-Completion EnginesA Semantic Search Approach to Task-Completion Engines
A Semantic Search Approach to Task-Completion Engines
 
A Knowledge Base of Entity-Oriented Search Intents
A Knowledge Base of Entity-Oriented Search IntentsA Knowledge Base of Entity-Oriented Search Intents
A Knowledge Base of Entity-Oriented Search Intents
 
Learning-to-Rank Target Types for Entity-Bearing Queries
Learning-to-Rank Target Types for Entity-Bearing QueriesLearning-to-Rank Target Types for Entity-Bearing Queries
Learning-to-Rank Target Types for Entity-Bearing Queries
 
Dive into Deep Learning
Dive into Deep LearningDive into Deep Learning
Dive into Deep Learning
 
If this is the answer, what was the question?
If this is the answer, what was the question?If this is the answer, what was the question?
If this is the answer, what was the question?
 
Semi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationSemi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense Disambiguation
 
Semi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationSemi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense Disambiguation
 
Semi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense DisambiguationSemi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense Disambiguation
 
FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...
FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...
FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...
 
Machine Learning - Clustering
Machine Learning - ClusteringMachine Learning - Clustering
Machine Learning - Clustering
 
Machine Learning - Classification (ctd.)
Machine Learning - Classification (ctd.)Machine Learning - Classification (ctd.)
Machine Learning - Classification (ctd.)
 
Machine Learning - Classification
Machine Learning - ClassificationMachine Learning - Classification
Machine Learning - Classification
 
Data Mining - Exploring Data
Data Mining - Exploring DataData Mining - Exploring Data
Data Mining - Exploring Data
 
Data Mining - Introduction and Data
Data Mining - Introduction and DataData Mining - Introduction and Data
Data Mining - Introduction and Data
 
Predicate Logic
Predicate LogicPredicate Logic
Predicate Logic
 

Recently uploaded

Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 

Recently uploaded (20)

Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 

Type-Aware Entity Retrieval

  • 1. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type-aware Entity Retrieval Dar´ıo Garigliotti University of Stavanger June 10, 2016 Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 2. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Outline: 1 Types and Entity Retrieval 2 Environment Dimensions Type taxonomies Type representations Retrieval models 3 Type-aware Entity Retrieval Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 3. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Types and Entity Retrieval Traditional Information Retrieval recently extended to an Entity-oriented Search It revolves around the satisfaction of more complex information needs Several entity elements from knowledge bases, naturally appearing in queries Countries where one can pay with the euro Related entities (via a relation or predicate) Types or categories or classes Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 4. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Types and Entity Retrieval Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 5. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Types and Entity Retrieval Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 6. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Types and Entity Retrieval Why to think about types? Entities are typed Types are useful for retrieval, presentation, summarization... Related tasks, e.g. Entity ranking (given a query and target categories) List completion (given a query and entity examples, and? types) Query target type identification Our focus is on emergent dimensions to explore Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 7. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Type taxonomies There are different type taxonomies from various knowledge bases DBpedia Ontology Freebase Types Wikipedia Categories YAGO Taxonomy These vary a lot in terms of hierarchical structure and in how entity-type assignments are recorded Normalisation efforts are needed Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 8. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models DBpedia Ontology A well-designed hierarchy Created manually by considering the most frequently used infoboxes in Wikipedia Clean and consistent, but with limited coverage 0 1 2 3 4 5 6 7 |Level 1| = 58 types |Level 2| = 114 types |Level 3| = 142 types |Level 4| = 213 types |Level 5| = 45 types |Level 6| = 17 types |Level 7| = 1 type Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 9. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models DBpedia Ontology Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 10. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Freebase Types A two-layer categorization system: types and domains Entities are only assigned to types, having most of them “same as” links to DBpedia entities 0 1 2 |Level 1| = 92 types |Level 2| = 1, 626 types Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 11. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Wikipedia Categories It consists of textual labels known as categories It’s not a well-defined “is-a” hierarchy, but a graph Category assignments are neither consistent nor complete It requires a major normalisation strategy 0 1 2-10 11-24 25- 34 |Level 1| = 27 types |Level 2 ∪ ... ∪ Level 10| = 121, 657 types |Level 11 ∪ ... ∪ Level 24| = 410, 697 types |Level 25 ∪ ... ∪ Level 34| = 14, 564 types Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 12. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models YAGO Taxonomy A deep subsumption hierarchy Its classification schema is constructed by taking leaf categories from Wikipedia categories and then using WordNet synsets to establish the hierarchy 0 1 2-5 6-10 11- 19 |Level 1| = 61 types |Level 2 ∪ ... ∪ Level 5| = 80, 384 types |Level 6 ∪ ... ∪ Level 10| = 461, 843 types |Level 11 ∪ ... ∪ Level 19| = 26, 383 types Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 13. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Type representations How to represent the hierarchical information? Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 14. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Type representations How to represent the hierarchical information? t3t3 t2t2 t5t5t4t4 t9t9t8t8 e t6t6 t12t12 t7t7 … t10t10 t11t11 t0t0 t1t1 … Type(s) along path to top Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 15. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Type representations How to represent the hierarchical information? t3t3 t2t2 t5t5t4t4 t9t9t8t8 e t6t6 t12t12 t7t7 … t10t10 t11t11 t0t0 t1t1 … Type(s) along path to top t3t3 t2t2 t5t5t4t4 t9t9t8t8 e t6t6 t12t12 t7t7 … t10t10 t11t11 t0t0 t1t1 … Top-level type(s) Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 16. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Type representations How to represent the hierarchical information? t3t3 t2t2 t5t5t4t4 t9t9t8t8 e t6t6 t12t12 t7t7 … t10t10 t11t11 t0t0 t1t1 … Type(s) along path to top t3t3 t2t2 t5t5t4t4 t9t9t8t8 e t6t6 t12t12 t7t7 … t10t10 t11t11 t0t0 t1t1 … Top-level type(s) t3t3 t2t2 t5t5t4t4 t9t9t8t8 e t6t6 t12t12 t7t7 … t10t10 t11t11 t0t0 t1t1 … Most specific type(s) Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 17. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Retrieval models Retrieval task defined in a generative probabilistic framework P(q | e) query entity Olympic games target types Rio de Janeiro term-based similarity type-based similarity … … entity types Both query and entity considered in the term space as well as in the type space Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 18. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Retrieval models (Strict) Filtering model P(q | e) = P(θT q | θT e ) · χ[types(q) ∩ types(e) = ∅] Types(q)Types(q) Types(e)Types(e) Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 19. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Retrieval models (Soft) Filtering model P(q | e) = P(θT q | θT e ) · P(θT q | θT e ) Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 20. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Type taxonomies Type representations Retrieval models Retrieval models Interpolation model P(q | e) = (1 − λ) · P(θT q | θT e ) + λ · P(θT q | θT e ) Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 21. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval What did we do? Lessons learned What are we doing? What did we do? We systematically identified and compared all combinations of those dimensions 4 type taxonomies: DBpedia Ontology (3.9), Freebase Types (2015-03-31), Wikipedia Categories (for DBpedia 3.9) and YAGO Taxonomy (3.0.2) 3 type representations: path-to-top, top-level, most specific 3 models: strict and soft filtering, interpolation Environment: from idealized to realistic query types oracle entities fully typed in all the taxonomies Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 22. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval What did we do? Lessons learned What are we doing? What did we do? Results Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 23. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval What did we do? Lessons learned What are we doing? Lessons learned Summary of insights: How to represent hierarchical entity type information? (RQ1) Using the most specific types appears to be the best way What (kind of) type taxonomies to use? (RQ2) Wikipedia, in combination with most specific types, performs the best in both the idealized and the more realistic scenarios What combination model to choose? (RQ3) The interpolation model appears to be more robust Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 24. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval What did we do? Lessons learned What are we doing? Further analysis: strict filtering vs interpolation models Strict filtering treats target types as a set Interpolation operates with a probability distribution over types When we drop from oracle every type assigned to less than 3 entities, interpolation adapts quite better DBpedia Freebase Wikipedia YAGO Most-specific types DBpedia Freebase Wikipedia YAGO Most-specific types Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 25. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval What did we do? Lessons learned What are we doing? Further analysis: query-level ranking details E.g. performance for (Interpolation, Most specific level, Wikipedia-3.9) query = “Which books by Kerouac were published by Viking Press?” Types: 90 (including Viking Press books) Types of the hurt relevant entities: all contain Viking Press books Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 26. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval What did we do? Lessons learned What are we doing? Further analysis: query-level ranking details E.g. performance for (Interpolation, Most specific level, Wikipedia-3.9) query = “Give me all actors starring in Batman Begins” All 7 relevant entities are improved Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 27. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval What did we do? Lessons learned What are we doing? What are we doing? Automatic query target type detection Baselines Entity-centric: first, to rank entities based on their relevance to the query, then look at what types the top-k ranked entities have Type-centric: to build a direct term-based representation for each type, by aggregating descriptions of entities of that type Learning-to-rank with several features Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 28. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval What did we do? Lessons learned What are we doing? What are we doing? Target type detection Dar´ıo Garigliotti Type-aware Entity Retrieval
  • 29. Types and Entity Retrieval Environment Dimensions Type-aware Entity Retrieval Thanks! Questions? Dar´ıo Garigliotti Type-aware Entity Retrieval