Summarizing Entity Descriptions for
Effective and Efficient
Human-centered Entity Linking
Gong Cheng, Danyun Xu, Yuzhong Qu
Websoft Research Group
State Key Laboratory for Novel Software Technology
Nanjing University, China
Entity Linking (EL)
But with the release of the iPhone 6
and the 6 Plus phablet, Apple has finally
gone into big-screen territory, giving
Samsung a challenge in the category
that the company has been dominating
for some time now.
Text Knowledge Base
iPhone 6
- type: Smartphone
- ...
Samsung Electronics
- type: IT Company
- ...
Apple (Inc.)
- type: Company
- type: IT company
- product: iPhone 5
- ...
Apple (Fruit)
- type: Fruit
- genus: Malus
- ...
?
Candidate entities
Human-centered EL is needed
• for defining gold standard,
• for crowdsourced EL.
entity description:
set of property-value pairs (called features)
Entity descriptions are long.
Short, extractive summaries are
adequate for human-centered EL.
Apple (Inc.)
- type: Company
- product: iPhone 5
Apple (Corps)
- type: Company
- product: Let It Be
Apple (Fruit)
- type: Fruit
summary of k candidate entity descriptions: k subsets of features (subject to a length limit)
?… Apple
summarizing entity descriptions → combinatorial optimization
Optimization goal (1)
+characterizing power, -information overlap
• Characterizing power of a feature (ch)
ch(type: IT company) < ch(product: iPhone 5)
Apple (Inc.)
Samsung Electronics
$ch(f) = -\log \frac{\text{number of entities having } f}{\text{number of all entities}}$
Apple (Inc.)
- type: Company
- type: IT company
- product: iPhone 5
- ...
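A minimal sketch of the characterizing-power formula above, assuming features are stored as (property, value) pairs. The toy knowledge base, the function name `characterizing_power`, and the normalized spelling "IT company" are illustrative assumptions, not part of the original slides.

```python
import math

def characterizing_power(feature, descriptions):
    """ch(f) = -log(fraction of entities whose description contains f)."""
    having = sum(1 for feats in descriptions.values() if feature in feats)
    if having == 0:
        raise ValueError("feature does not occur in any description")
    return -math.log(having / len(descriptions))

# Hypothetical three-entity knowledge base mirroring the slide's example.
kb = {
    "Apple (Inc.)": {("type", "Company"), ("type", "IT company"),
                     ("product", "iPhone 5")},
    "Samsung Electronics": {("type", "IT company")},
    "Apple (Fruit)": {("type", "Fruit"), ("genus", "Malus")},
}

ch_it = characterizing_power(("type", "IT company"), kb)      # shared by 2 of 3
ch_iphone = characterizing_power(("product", "iPhone 5"), kb)  # held by 1 of 3
```

On this toy data the rarer feature scores higher, which matches ch(type: IT company) < ch(product: iPhone 5).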
Optimization goal (1)
+characterizing power, -information overlap
• Information overlap between features (ov)
a) logical inference
entailment → maximized ov
ov(type: IT company, type: Company) = MAX
b) string/numerical similarity
ov = max{similarity between properties, similarity between values}
ov(type: IT company, product: iPhone 5) = SMALL
Apple (Inc.)
- type: Company
- type: IT company
- product: iPhone 5
- ...
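One way the two overlap cases could be combined is sketched below. The slides do not name a string similarity measure, so `difflib.SequenceMatcher` is a stand-in; the `SUBCLASS_OF` table, the constant `MAX_OV`, and the function names are hypothetical, and the class hierarchy is assumed to be acyclic.

```python
from difflib import SequenceMatcher

MAX_OV = 1.0  # assumed value of "MAX": overlap when one feature entails the other

# Hypothetical subclass axioms (child class -> parent class), assumed acyclic.
SUBCLASS_OF = {"IT company": "Company"}

def entails(f, g):
    """True if feature f implies feature g (type subsumption only)."""
    (pf, vf), (pg, vg) = f, g
    if pf != "type" or pg != "type":
        return False
    while vf in SUBCLASS_OF:  # walk up the class hierarchy
        vf = SUBCLASS_OF[vf]
        if vf == vg:
            return True
    return False

def sim(a, b):
    """Stand-in string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def overlap(f, g):
    """ov: MAX under entailment, else max of property and value similarity."""
    if f == g or entails(f, g) or entails(g, f):
        return MAX_OV
    return max(sim(f[0], g[0]), sim(f[1], g[1]))

ov_entailed = overlap(("type", "IT company"), ("type", "Company"))
ov_unrelated = overlap(("type", "IT company"), ("product", "iPhone 5"))
```

Here `ov_entailed` takes the maximum value, while `ov_unrelated` stays small because neither the properties nor the values resemble each other.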
Optimization goal (1)
+characterizing power, -information overlap
• Formulated as k Quadratic Knapsack Problems (QKP)
weight of a feature: length
profit of a pair of features:
to maximize characterizing power
to minimize information overlap
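The slides do not give the exact profit function or the solver, so the following is only a greedy heuristic for a single quadratic knapsack instance under assumed profits: a feature's own profit is its characterizing power, and a pair's profit is the negated overlap. The numeric values and the name `greedy_qkp` are hypothetical.

```python
def greedy_qkp(items, weight, profit, capacity):
    """Greedy heuristic (not an exact solver) for a quadratic knapsack.

    profit(i, i) is the individual profit of item i; profit(i, j) is the
    pairwise profit gained when i and j are selected together.
    """
    chosen, used = [], 0
    remaining = set(items)
    while True:
        best, best_ratio = None, 0.0
        for i in sorted(remaining):
            if used + weight[i] > capacity:
                continue  # item would exceed the length limit
            gain = profit(i, i) + sum(profit(i, j) for j in chosen)
            ratio = gain / weight[i]
            if ratio > best_ratio:
                best, best_ratio = i, ratio
        if best is None:  # nothing fits or nothing adds positive profit
            return chosen
        chosen.append(best)
        used += weight[best]
        remaining.remove(best)

features = ["type: Company", "type: IT company", "product: iPhone 5"]
weight = {f: len(f) for f in features}  # weight of a feature: its length
ch = {"type: Company": 0.5, "type: IT company": 1.0, "product: iPhone 5": 2.0}
ov = {frozenset({"type: Company", "type: IT company"}): 1.0}

def profit(i, j):
    if i == j:
        return ch[i]                          # reward characterizing power
    return -ov.get(frozenset({i, j}), 0.1)    # penalize information overlap

summary = greedy_qkp(features, weight, profit, capacity=50)
```

With these assumed numbers, "type: Company" fits within the limit but is still left out, because its overlap with "type: IT company" makes its marginal profit negative.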
Optimization goal (2): +differentiating power
• Differentiating power of a pair of features (di)
a) string/numerical dissimilarity
di = dissimilarity between values * property’s value uniqueness
di(type: IT company, type: Fruit) = LARGE*SMALL = MEDIUM
(Single-valued properties are more useful.)
b) logical inference
entailment → minimized di
di(type: IT company, type: Company) = MIN
Apple (Inc.)
- type: Company
- type: IT company
- product: iPhone 5
- ...
Apple (Fruit)
- type: Fruit
- genus: Malus
- ...
Samsung Electronics
- type: IT Company
- ...
Optimization goal (2): +differentiating power
• Formulated as a Quadratic Multidimensional
Knapsack Problem (QMKP)
weight of a feature: length
profit of a pair of features: differentiating power
Optimization goal (3): +relevance to context
• Relevance of a feature to the context of entity mention
• cosine similarity in the class vector model (cs)
Vector(context) = {Smartphone, IT company}
Vector(type: Fruit) = {Fruit}
Vector(product: iPhone 5) = {Smartphone}
cs(context, product: iPhone 5) = HIGH
• class weighting: class frequency – inverse instance frequency (CF-IIF)
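A minimal sketch of cosine similarity over class vectors, using the three vectors from the example above. Binary weights of 1.0 are an assumption made for illustration; the slides weight classes by CF-IIF, whose exact definition is not given here.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse class vectors (dict: class -> weight)."""
    dot = sum(w * v.get(c, 0.0) for c, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

# Binary weights stand in for CF-IIF weights (assumption).
context = {"Smartphone": 1.0, "IT company": 1.0}
type_fruit = {"Fruit": 1.0}
product_iphone5 = {"Smartphone": 1.0}

cs_iphone5 = cosine(context, product_iphone5)
cs_fruit = cosine(context, type_fruit)
```

The feature product: iPhone 5 shares the class Smartphone with the context and scores well above type: Fruit, which shares no class with it.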
Optimization goal (3): +relevance to context
• Solved by k Maximal Marginal Relevance (MMR) frameworks
• Features are iteratively selected.
• In each iteration, candidate features are re-ranked by
• relevance to context
• dissimilarity to selected features
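The iterative selection above can be sketched with the standard MMR scoring rule, shown for one candidate entity. The trade-off parameter `lam`, the relevance and similarity values, and the name `mmr_select` are hypothetical; the slides do not state how the two criteria are weighted.

```python
def mmr_select(features, relevance, similarity, length, limit, lam=0.5):
    """Pick features one at a time by MMR score until the length limit is hit."""
    selected, used = [], 0
    candidates = list(features)
    while candidates:
        feasible = [f for f in candidates if used + length(f) <= limit]
        if not feasible:
            break

        def score(f):
            # Redundancy: highest similarity to any already selected feature.
            redundancy = max((similarity(f, s) for s in selected), default=0.0)
            return lam * relevance(f) - (1 - lam) * redundancy

        best = max(feasible, key=score)
        selected.append(best)
        used += length(best)
        candidates.remove(best)
    return selected

features = ["type: Company", "type: IT company", "product: iPhone 5"]
rel = {"type: Company": 0.3, "type: IT company": 0.6, "product: iPhone 5": 0.7}
similar_pairs = {frozenset({"type: Company", "type: IT company"}): 1.0}

picked = mmr_select(
    features,
    relevance=lambda f: rel[f],
    similarity=lambda f, g: similar_pairs.get(frozenset({f, g}), 0.0),
    length=len,
    limit=35,
)
```

Running this once per candidate entity gives the k MMR frameworks; on the toy values the most context-relevant feature is picked first and selection stops once no remaining feature fits the limit.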
Optimization goal (1+2+3)
• Formulated as a Quadratic Multidimensional
Knapsack Problem (QMKP)
Experiments: data sets
• Text corpora (with entity mentions linked to Wikipedia)
• AQUAINT
• IITB
• Knowledge base
• DBpedia
• Gold-standard links
• entity mentions → Wikipedia articles → DBpedia entities
Experiments: EL tasks
Apple (Inc.)
- type: Company
- product: iPhone 5
Apple (Corps)
- type: Company
- product: Let It Be
Apple (Fruit)
- type: Fruit
?
..., Apple has finally gone
into big-screen territory, …
1 target entity
• gold-standard
2 (very challenging) noise entities
• sharing a common name with the target entity,
obtained from Wikipedia’s disambiguation pages
Experiments: approaches
• Proposed approaches
• CHR: +characterizing power, -information overlap
• DFF: +differentiating power
• CNT: +relevance to context
• COMB: CHR+DFF+CNT
• Baseline approaches
• DESC: returns entire entity descriptions
• RELIN: a state-of-the-art entity summarization approach for
generic purposes
• average length of entity descriptions: 680 characters
• length limit for summaries: 100 characters (14.7%)
Experiments: extrinsic evaluation
• COMB is the only approach that achieved the following
statistically significant results on both data sets:
• accuracy (% of correct answers): COMB = DESC
• time: COMB < DESC (22-23% faster)
Experiments: intrinsic evaluation
• Statistically significant results on both data sets:
• human ratings: COMB > CHR > other approaches
Future work
• More extensive experiments
• to test with not-in-the-list
• Summaries for automatic EL
Questions?

Why Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdf
 
Carrer goals.pptx and their importance in real life
Carrer goals.pptx  and their importance in real lifeCarrer goals.pptx  and their importance in real life
Carrer goals.pptx and their importance in real life
 
Competition and Regulation in Professions and Occupations – ROBSON – June 202...
Competition and Regulation in Professions and Occupations – ROBSON – June 202...Competition and Regulation in Professions and Occupations – ROBSON – June 202...
Competition and Regulation in Professions and Occupations – ROBSON – June 202...
 
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussionPro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
 
2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf
 
The remarkable life of Sir Mokshagundam Visvesvaraya.pptx
The remarkable life of Sir Mokshagundam Visvesvaraya.pptxThe remarkable life of Sir Mokshagundam Visvesvaraya.pptx
The remarkable life of Sir Mokshagundam Visvesvaraya.pptx
 
Using-Presentation-Software-to-the-Fullf.pptx
Using-Presentation-Software-to-the-Fullf.pptxUsing-Presentation-Software-to-the-Fullf.pptx
Using-Presentation-Software-to-the-Fullf.pptx
 
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
 
ASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdfASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdf
 
Gregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics PresentationGregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics Presentation
 
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
 
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
 
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdfBRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
 
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussionArtificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
 
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie WellsCollapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
 

Summarizing Entity Descriptions for Effective and Efficient Human-centered Entity Linking

  • 1. Summarizing Entity Descriptions for Effective and Efficient Human-centered Entity Linking. Gong Cheng, Danyun Xu, Yuzhong Qu. Websoft Research Group, State Key Laboratory for Novel Software Technology, Nanjing University, China
  • 2. Entity Linking (EL). Text: "But with the release of the iPhone 6 and the 6 Plus phablet, Apple has finally gone into big-screen territory, giving Samsung a challenge in the category that the company has been dominating for some time now." Knowledge base: iPhone 6 (type: Smartphone; ...), Samsung Electronics (type: IT Company; ...). Candidate entities for "Apple": Apple (Inc.) (type: Company; type: IT company; product: iPhone 5; ...) or Apple (Fruit) (type: Fruit; genus: Malus; ...)?
  • 3. Human-centered EL is needed • for defining gold standard, • for crowdsourced EL. (Same text and knowledge base example as slide 2.)
  • 4. Entity description: a set of property-value pairs (called features), e.g. Apple (Inc.): type: Company; type: IT company; product: iPhone 5; ... (Same text and knowledge base example as slide 2.)
  • 7. Short, extractive summaries are adequate for human-centered EL. Candidates for "Apple": Apple (Inc.) (type: Company; product: iPhone 5), Apple (Corps) (type: Company; product: Let It Be), Apple (Fruit) (type: Fruit). Summary of k candidate entity descriptions: k subsets of features (subject to a length limit). Summarizing entity descriptions → combinatorial optimization.
  • 9. Optimization goal (1): +characterizing power, -information overlap • Characterizing power of a feature (ch): $ch(f) = -\log \frac{\text{number of entities having } f}{\text{number of all entities}}$. Example: ch(type: IT company) < ch(product: iPhone 5) (type: IT company is shared by Apple (Inc.) and Samsung Electronics). Apple (Inc.): type: Company; type: IT company; product: iPhone 5; ...
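A minimal sketch of the characterizing-power formula from slide 9. The toy knowledge base, the function name `characterizing_power`, and the choice of the natural logarithm are assumptions for illustration; the slides do not state the log base or the data representation.

```python
import math

# Toy knowledge base (illustrative data only): entity -> set of (property, value) features.
KB = {
    "Apple (Inc.)": {("type", "Company"), ("type", "IT company"), ("product", "iPhone 5")},
    "Samsung Electronics": {("type", "Company"), ("type", "IT company")},
    "Apple (Fruit)": {("type", "Fruit"), ("genus", "Malus")},
    "Pear": {("type", "Fruit"), ("genus", "Pyrus")},
}

def characterizing_power(feature, kb):
    """ch(f) = -log(number of entities having f / number of all entities)."""
    having = sum(1 for feats in kb.values() if feature in feats)
    if having == 0:
        raise ValueError("feature does not occur in the knowledge base")
    return -math.log(having / len(kb))

ch_type = characterizing_power(("type", "IT company"), KB)
ch_product = characterizing_power(("product", "iPhone 5"), KB)
print(ch_type < ch_product)  # the rarer feature characterizes the entity better
```

A feature held by fewer entities gets a higher score, which matches the slide's example ordering.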
  • 12. Optimization goal (1): +characterizing power, -information overlap • Information overlap between features (ov): a) logical inference: entailment → maximized ov, e.g. ov(type: IT company, type: Company) = MAX; b) string/numerical similarity: ov = max{similarity between properties, similarity between values}, e.g. ov(type: IT company, product: iPhone 5) = SMALL. Apple (Inc.): type: Company; type: IT company; product: iPhone 5; ...
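The two cases of information overlap on slide 12 can be sketched as follows. This is an illustration under assumptions: the subclass table `SUBCLASS_OF`, the scale bound `MAX_OV`, and the use of `difflib.SequenceMatcher` as the string similarity are not taken from the slides, which do not name a specific similarity measure.

```python
from difflib import SequenceMatcher

MAX_OV = 1.0  # assumed upper bound of the overlap scale

# Hypothetical subclass axioms used for the entailment check.
SUBCLASS_OF = {"IT company": {"Company"}}

def sim(a, b):
    """String similarity in [0, 1] (stand-in for the unspecified measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def entails(f, g):
    """True if feature f entails feature g (only type/subclass inference here)."""
    (p1, v1), (p2, v2) = f, g
    return p1 == p2 == "type" and (v1 == v2 or v2 in SUBCLASS_OF.get(v1, set()))

def overlap(f, g):
    # a) logical inference: entailment gives the maximum overlap
    if entails(f, g) or entails(g, f):
        return MAX_OV
    # b) otherwise, the larger of property similarity and value similarity
    return max(sim(f[0], g[0]), sim(f[1], g[1]))

ov_entailed = overlap(("type", "IT company"), ("type", "Company"))
ov_unrelated = overlap(("type", "IT company"), ("product", "iPhone 5"))
print(ov_entailed, ov_unrelated)
```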
  • 13. Optimization goal (1): +characterizing power, -information overlap • Formulated as k Quadratic Knapsack Problems (QKP), one per candidate entity. Weight of a feature: its length. Profit of a pair of features: set to maximize characterizing power and minimize information overlap.
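To make the QKP formulation concrete, here is a small sketch for a single candidate entity. The objective form (per-feature characterizing power plus a negative pairwise overlap term), the numeric scores, and the exhaustive solver are all assumptions for illustration; the slides do not give the exact profit definition or the solving method.

```python
from itertools import combinations

def solve_qkp_bruteforce(weights, unary, pairwise, capacity):
    """Maximize sum(unary[i]) + sum(pairwise[i][j] for chosen i < j)
    subject to sum(weights) <= capacity. Exponential time; toy sizes only."""
    n = len(weights)
    best_value, best_set = 0.0, ()
    for r in range(1, n + 1):
        for subset in combinations(range(n), r):
            if sum(weights[i] for i in subset) > capacity:
                continue  # violates the length limit
            value = sum(unary[i] for i in subset)
            value += sum(pairwise[i][j] for i, j in combinations(subset, 2))
            if value > best_value:
                best_value, best_set = value, subset
    return best_set, best_value

features = ["type: Company", "type: IT company", "product: iPhone 5"]
weights = [len(f) for f in features]   # weight of a feature = its length
ch = [0.3, 0.7, 1.4]                   # made-up characterizing power scores
neg_ov = [[0.0, -1.0, -0.1],           # made-up overlap penalties (negated ov)
          [-1.0, 0.0, -0.2],
          [-0.1, -0.2, 0.0]]

chosen, value = solve_qkp_bruteforce(weights, ch, neg_ov, capacity=35)
print([features[i] for i in chosen], round(value, 2))
```

With these made-up scores, the two `type` features are never picked together because their overlap penalty cancels their combined characterizing power.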
  • 16. Optimization goal (2): +differentiating power • Differentiating power of a pair of features (di): a) string/numerical dissimilarity: di = dissimilarity between values * property's value uniqueness, e.g. di(type: IT company, type: Fruit) = LARGE*SMALL = MEDIUM (single-valued properties are more useful); b) logical inference: entailment → minimized di, e.g. di(type: IT company, type: Company) = MIN. Entities: Apple (Inc.) (type: Company; type: IT company; product: iPhone 5; ...), Apple (Fruit) (type: Fruit; genus: Malus; ...), Samsung Electronics (type: IT Company; ...).
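A rough sketch of differentiating power as described on slide 16. Several parts are assumptions not stated in the slides: value uniqueness is modelled as the inverse of the average number of values per entity for a property, string dissimilarity uses `difflib.SequenceMatcher`, features with different properties get the minimum score, and `SUBCLASS_OF` is a hypothetical axiom table.

```python
from difflib import SequenceMatcher

MIN_DI = 0.0  # assumed lower bound of the scale
SUBCLASS_OF = {"IT company": {"Company"}}  # hypothetical subclass axioms

KB = {  # toy data only
    "Apple (Inc.)": {("type", "Company"), ("type", "IT company"), ("product", "iPhone 5")},
    "Apple (Fruit)": {("type", "Fruit"), ("genus", "Malus")},
    "Samsung Electronics": {("type", "IT company")},
}

def value_uniqueness(prop, kb):
    """Assumed definition: 1 / average number of values of prop per entity having it."""
    counts = [sum(1 for p, _ in feats if p == prop) for feats in kb.values()]
    counts = [c for c in counts if c > 0]
    return len(counts) / sum(counts) if counts else 0.0

def dissimilarity(v1, v2):
    return 1.0 - SequenceMatcher(None, v1.lower(), v2.lower()).ratio()

def differentiating_power(f, g, kb):
    (p1, v1), (p2, v2) = f, g
    if p1 != p2:
        return MIN_DI  # assumption: only same-property pairs are compared
    if p1 == "type" and (v1 == v2 or v2 in SUBCLASS_OF.get(v1, set())
                         or v1 in SUBCLASS_OF.get(v2, set())):
        return MIN_DI  # b) entailment gives the minimum
    # a) dissimilarity between values * property's value uniqueness
    return dissimilarity(v1, v2) * value_uniqueness(p1, kb)

di_fruit = differentiating_power(("type", "IT company"), ("type", "Fruit"), KB)
di_entailed = differentiating_power(("type", "IT company"), ("type", "Company"), KB)
print(di_fruit, di_entailed)
```

In this toy data `type` is multi-valued (uniqueness 0.75) while `genus` is single-valued (uniqueness 1.0), which mirrors the slide's remark that single-valued properties are more useful.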
  • 17. Optimization goal (2): +differentiating power • Formulated as a Quadratic Multidimensional Knapsack Problem (QMKP). Weight of a feature: its length. Profit of a pair of features: their differentiating power.
  • 18. Optimization goal (3): +relevance to context • Relevance of a feature to the context of the entity mention • cosine similarity in the class vector model (cs), e.g. Vector(context) = {Smartphone, IT company}, Vector(type: Fruit) = {Fruit}, Vector(product: iPhone 5) = {Smartphone}, so cs(context, product: iPhone 5) = HIGH • class weighting: class frequency – inverse instance frequency (CF-IIF). (Same text and knowledge base example as slide 2.)
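The class-vector example above can be worked through with a short sketch. It assumes every class has weight 1.0; the slides use CF-IIF weights, whose exact formula is not given here, so the numbers below are illustrative only.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse class vectors (dict: class -> weight)."""
    dot = sum(w * v.get(c, 0.0) for c, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

# Unit weights are an assumption standing in for CF-IIF weights.
context = {"Smartphone": 1.0, "IT company": 1.0}
type_fruit = {"Fruit": 1.0}
product_iphone5 = {"Smartphone": 1.0}

cs_iphone = cosine(context, product_iphone5)
cs_fruit = cosine(context, type_fruit)
print(cs_iphone > cs_fruit)
```

The feature product: iPhone 5 shares the class Smartphone with the context and scores well above type: Fruit, which shares none.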
  • 23. Optimization goal (3): +relevance to context • Solved by k Maximal Marginal Relevance (MMR) frameworks • Features are iteratively selected. • In each iteration, candidate features are re-ranked by • relevance to context • dissimilarity to selected features
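The iterative selection described on slide 23 resembles the standard MMR greedy loop, sketched below for one candidate entity. The trade-off parameter `lam`, the way the length limit is enforced, the relevance scores, and the toy similarity function are assumptions for illustration, not details from the slides.

```python
def mmr_select(candidates, relevance, similarity, length_limit, lam=0.7):
    """Greedy MMR: repeatedly pick the feature with the best trade-off between
    relevance to context and redundancy with already selected features."""
    selected, used = [], 0
    remaining = list(candidates)
    while remaining:
        fitting = [f for f in remaining if used + len(f) <= length_limit]
        if not fitting:
            break  # nothing else fits within the length limit

        def score(f):
            redundancy = max((similarity(f, s) for s in selected), default=0.0)
            return lam * relevance[f] - (1 - lam) * redundancy

        best = max(fitting, key=score)
        selected.append(best)
        used += len(best)
        remaining.remove(best)
    return selected

features = ["type: Company", "type: IT company", "product: iPhone 5"]
relevance = {"type: Company": 0.8, "type: IT company": 0.9, "product: iPhone 5": 0.6}

def toy_similarity(f, g):
    # Made-up similarity: features with the same property are fully redundant.
    return 1.0 if f.split(":")[0] == g.split(":")[0] else 0.0

summary = mmr_select(features, relevance, toy_similarity, length_limit=35)
print(summary)
```

In this toy run, type: Company has higher relevance than product: iPhone 5 but is skipped in the second iteration because it is redundant with the already selected type: IT company.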
  • 24. Optimization goal (1+2+3) • Formulated as a Quadratic Multidimensional Knapsack Problem (QMKP)
  • 25. Experiments: data sets • Text corpora (with entity mentions linked to Wikipedia) • AQUAINT • IITB • Knowledge base • DBpedia • Gold-standard links • entity mentions → Wikipedia articles → DBpedia entities
  • 26. Experiments: EL tasks. Mention: "..., Apple has finally gone into big-screen territory, …" Candidates: Apple (Inc.) (type: Company; product: iPhone 5), Apple (Corps) (type: Company; product: Let It Be), Apple (Fruit) (type: Fruit). • 1 target entity: gold-standard • 2 (very challenging) noise entities: sharing a common name with the target entity, obtained from Wikipedia’s disambiguation pages
  • 27. Experiments: approaches • Proposed approaches • CHR: +characterizing power, -information overlap • DFF: +differentiating power • CNT: +relevance to context • COMB: CHR+DFF+CNT • Baseline approaches • DESC: returns entire entity descriptions • RELIN: a state-of-the-art entity summarization approach for generic purposes • average length of entity descriptions: 680 characters • length limit for summaries: 100 characters (14.7%)
  • 28. Experiments: extrinsic evaluation • COMB is the only approach that achieved the following statistically significant results on both data sets: • accuracy (% of correct answers): COMB = DESC • time: COMB < DESC (22-23% faster)
  • 29. Experiments: intrinsic evaluation • Statistically significant results on both data sets: • human ratings: COMB > CHR > other approaches
  • 30. Future work • More extensive experiments • to test with not-in-the-list • Summaries for automatic EL