We study the problem of finding sentences that explain the relationship between a named entity and an ad-hoc query, which we refer to as entity support sentences. This is an important sub-problem of entity ranking which, to the best of our knowledge, has not been addressed before. In this paper we give the first formalization of the problem, describe how it can be evaluated, and present a full evaluation dataset. We propose several methods to rank these sentences, namely retrieval-based, entity-ranking-based, and position-based. We found that traditional bag-of-words models perform relatively well when there is a match between an entity and a query in a given sentence, but they fail to find a support sentence for a substantial portion of entities. This can be improved by incorporating small windows of context sentences and ranking them appropriately.
7. Entity Ranking
• Given a topic, find relevant entities
• Evaluated in TREC and INEX campaigns
• Most well-known: people and expert search
• Many other applications: dates, events,
locations, companies, ...
8. Support Sentences for Entities
• We introduce the task of explaining the
relationship between a query and an entity
• Applications for entity retrieval, expert
finding, object ranking, etc.
• We don’t focus on entity ranking, just on
the explanations
• Support sentence score: H_{q,e}(s) ≈ p(R | q, e, s) (sketched below)
• What makes a good sentence depends on how
general the entity and the query are, and on
their relationship
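
The slide's relation only fixes the interface: H_{q,e}(s) is any scoring function approximating p(R | q, e, s). A minimal sketch of that ranking interface follows; all function and parameter names are illustrative, not taken from the paper.

```python
from typing import Callable, List, Tuple

# (query, entity, sentence) -> score approximating p(R | q, e, s)
Scorer = Callable[[str, str, str], float]

def rank_support_sentences(query: str,
                           entity: str,
                           sentences: List[str],
                           score: Scorer,
                           k: int = 10) -> List[Tuple[str, float]]:
    """Return the top-k candidate sentences for (query, entity), best first."""
    scored = [(s, score(query, entity, s)) for s in sentences]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]
```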
11. Examples
• Query: Picasso and Peace
• Entity: 1944
“In 1944 Picasso joined the French Communist
Party, attended an international peace
conference in Poland, and in 1950 received the
Stalin Peace Prize from the Soviet government.”
12. Examples
• Query: Picasso and Peace
• Entity: Northern Spain
“Although it was not conceived by the
author as a representation of the disasters of
war, but the Nazi bombing of Guernica (a town
in Northern Spain), it is now considered an
iconic representation of the disasters of war.”
13. Context
• Top-k sentence re-ranking (we don’t issue
any subsequent queries)
• Vocabulary mismatch problem (support
sentences that do not contain any query
term)
• The supported entity must appear in the sentence
• Introduce small windows of context
sentences (sketched below)
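
A minimal sketch of the context-window idea, assuming documents arrive pre-split into sentences. Only sentences mentioning the entity become candidates, and the ±w neighbouring sentences supply extra terms when the query words are missing from the candidate itself (the vocabulary mismatch case). The window size w and all names are illustrative.

```python
from typing import List, Tuple

def candidates_with_context(sentences: List[str],
                            entity: str,
                            w: int = 1) -> List[Tuple[str, List[str]]]:
    """Pair each entity-bearing sentence with its +/- w neighbouring sentences."""
    candidates = []
    for i, sentence in enumerate(sentences):
        if entity.lower() in sentence.lower():   # the supported entity must be present
            lo, hi = max(0, i - w), min(len(sentences), i + w + 1)
            context = sentences[lo:i] + sentences[i + 1:hi]
            candidates.append((sentence, context))
    return candidates
```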
15. Features for Ranking
• Top-k sentences
• Augmented
• Entity-candidate set
• Using sentence scores (sketch below):
• Sentence score for [query, sentence]: BM25
• Sentence score for [query, sentence + context]: BM25F
• Position-based features
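
A hedged sketch of the two sentence-score features: plain BM25 over [query, sentence] and a simplified BM25F-style score that weights the sentence and its context window as two fields. Parameter values, field weights, and the omission of per-field length normalisation are all assumptions here, not the paper's settings.

```python
import math
from collections import Counter
from typing import List

def _idf(term: str, corpus: List[List[str]]) -> float:
    """Robertson-style IDF over a background collection of tokenised sentences."""
    df = sum(1 for doc in corpus if term in doc)
    return math.log((len(corpus) - df + 0.5) / (df + 0.5) + 1.0)

def bm25(query: List[str], sent: List[str], corpus: List[List[str]],
         k1: float = 1.2, b: float = 0.75) -> float:
    """BM25 score of a single sentence against the query terms."""
    avg_len = sum(len(d) for d in corpus) / len(corpus)
    tf = Counter(sent)
    score = 0.0
    for t in query:
        if tf[t] == 0:
            continue
        norm = tf[t] * (k1 + 1) / (tf[t] + k1 * (1 - b + b * len(sent) / avg_len))
        score += _idf(t, corpus) * norm
    return score

def bm25f_style(query: List[str], sent: List[str], context: List[str],
                corpus: List[List[str]],
                w_sent: float = 2.0, w_ctx: float = 1.0, k1: float = 1.2) -> float:
    """Simplified BM25F: term frequencies from the sentence and its context
    window are combined with per-field weights before saturation."""
    tf_sent, tf_ctx = Counter(sent), Counter(context)
    score = 0.0
    for t in query:
        tf = w_sent * tf_sent[t] + w_ctx * tf_ctx[t]
        if tf > 0:
            score += _idf(t, corpus) * tf * (k1 + 1) / (tf + k1)
    return score
```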
16. • Aggregation of entity scores (sketched below)
• sum, max, min, average, ...
• Options for the entity ranker score E(q,e)
• Frequency
• Rarity
• Combination
• KLD
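
How the aggregation step could look in code: the sum/max/min/average aggregators come from the slide, while the defaults and names are ours. The frequency, rarity, combination, and KLD variants of the entity ranker score E(q, e) are only named in the slides, so they are not spelled out here.

```python
from statistics import mean
from typing import Callable, Dict, List

# Aggregators named on the slide; defaults and function names are illustrative.
AGGREGATORS: Dict[str, Callable[[List[float]], float]] = {
    "sum": sum,
    "max": max,
    "min": min,
    "avg": mean,
}

def entity_score(sentence_scores: List[float], how: str = "sum") -> float:
    """Aggregate the scores of the top-k sentences in which the entity occurs
    into a single entity score E(q, e)."""
    if not sentence_scores:
        return 0.0
    return AGGREGATORS[how](sentence_scores)

# Example: entity_score([2.1, 0.7, 1.4], how="max") -> 2.1
```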
17. Evaluation Framework
• Semantically Annotated Snapshot of the
English Wikipedia (sentences +
annotations)
• 12 types from WSJ tag-set
• Judges produce a set of queries and remove
non-relevant entities
• Evaluate a set of sentences using a 4-grade
relevance scale (sketch below)
• 226 (entity, query) pairs with 45 unique queries
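
The transcript mentions the 4-grade judgment scale but not the exact metric; a graded metric such as NDCG is a natural fit and is sketched here purely for illustration.

```python
import math
from typing import List

def dcg(grades: List[int]) -> float:
    """Discounted cumulative gain with the usual 2^grade - 1 gain function."""
    return sum((2 ** g - 1) / math.log2(i + 2) for i, g in enumerate(grades))

def ndcg(grades_in_rank_order: List[int], k: int = 10) -> float:
    """NDCG@k for a ranked list of support sentences judged on a 0-3 scale."""
    ranked = grades_in_rank_order[:k]
    ideal = sorted(grades_in_rank_order, reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    return dcg(ranked) / ideal_dcg if ideal_dcg > 0 else 0.0
```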
21. Conclusions
• We introduced the task of finding support
sentences for entities (aka “entity
snippets”)
• We engineered several features based on
scores of sentences and entities
• We developed an evaluation dataset
• http://barcelona.research.yahoo.net/dokuwiki
• Evaluated the task and the role of context
sentences