SlideShare a Scribd company logo

PhD Day: Entity Linking using Generic Linked Data Datasets

Presentation at 4th NLP PhD Day at National University of Ireland, Galway (DERI) at 23/04/2013

1 of 20
Download to read offline
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved.
Digital Enterprise Research Institute www.deri.ie
Entity Linking Using Generic Linked
Data Datasets
PhD Day – April/2013
Bianca Pereira
Digital Enterprise Research Institute www.deri.ie
Agenda
 Motivation
 Problem
 Related Work
 Research Questions
 Next Steps
 Challenges
2 of XYZ
Digital Enterprise Research Institute www.deri.ie
Motivation
 Biggest part of the content available on the web is
unstructured natural language text.
 How to structure natural language texts in order to be
easier to process them?
3 of XYZ
Digital Enterprise Research Institute www.deri.ie
Motivation
 There are three possible solutions to this problem:
 Extract knowledge from text according to a given structure
(ontology population from text).
 Extract knowledge from text without using a previous structure
(ontology learning from text).
 Link mention from text with entities from a structured knowledge
base (entity linking).
4 of XYZ
Digital Enterprise Research Institute www.deri.ie
Motivation
 Entity Linking..
 .. enables reusing knowledge already published on the web.
 .. can be used as the first step for ontology learning and
population algorithms.
5 of XYZ
Digital Enterprise Research Institute www.deri.ie
Motivation
 Many datasets have been used for Entity Linking:
 Relational datasets
 Wikipedia
 DBPedia, YAGO, MusicBrainz, Freebase, …
6 of XYZ

Recommended

Data science
Data scienceData science
Data science9diov
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceANOOP V S
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data ScienceSpotle.ai
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the WebRinke Hoekstra
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceSampath Kumar
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration James Hendler
 

More Related Content

What's hot

Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020Pistoia Alliance
 
Data Science presentation for elementary school students
Data Science presentation for elementary school studentsData Science presentation for elementary school students
Data Science presentation for elementary school studentsMelanie Manning, CFA
 
Introduction to Data Science - Week 4 - Tools and Technologies in Data Science
Introduction to Data Science - Week 4 - Tools and Technologies in Data ScienceIntroduction to Data Science - Week 4 - Tools and Technologies in Data Science
Introduction to Data Science - Week 4 - Tools and Technologies in Data ScienceFerdin Joe John Joseph PhD
 
An Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities DataAn Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities DataRinke Hoekstra
 
More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?Paul Groth
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph MaintenancePaul Groth
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Data Science London
 
Prov-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationProv-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationRinke Hoekstra
 
Data and Knowledge as Commodities
Data and Knowledge as CommoditiesData and Knowledge as Commodities
Data and Knowledge as CommoditiesMathieu d'Aquin
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceEdureka!
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chainPaul Groth
 
Data Science: Not Just For Big Data
Data Science: Not Just For Big DataData Science: Not Just For Big Data
Data Science: Not Just For Big DataRevolution Analytics
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI dayMohammed Barakat
 
Deep neural networks for matching online social networking profiles
Deep neural networks for matching online social networking profilesDeep neural networks for matching online social networking profiles
Deep neural networks for matching online social networking profilesTraian Rebedea
 
Session 04 communicating results
Session 04 communicating resultsSession 04 communicating results
Session 04 communicating resultsbodaceacat
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science James Hendler
 

What's hot (20)

Data science presentation
Data science presentationData science presentation
Data science presentation
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020
 
Data Science presentation for elementary school students
Data Science presentation for elementary school studentsData Science presentation for elementary school students
Data Science presentation for elementary school students
 
Introduction to Data Science - Week 4 - Tools and Technologies in Data Science
Introduction to Data Science - Week 4 - Tools and Technologies in Data ScienceIntroduction to Data Science - Week 4 - Tools and Technologies in Data Science
Introduction to Data Science - Week 4 - Tools and Technologies in Data Science
 
An Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities DataAn Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities Data
 
More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
 
Intro to Data Science Concepts
Intro to Data Science ConceptsIntro to Data Science Concepts
Intro to Data Science Concepts
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
 
Prov-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationProv-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance Visualization
 
Data and Knowledge as Commodities
Data and Knowledge as CommoditiesData and Knowledge as Commodities
Data and Knowledge as Commodities
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chain
 
Data Science: Not Just For Big Data
Data Science: Not Just For Big DataData Science: Not Just For Big Data
Data Science: Not Just For Big Data
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
Data science
Data science Data science
Data science
 
Deep neural networks for matching online social networking profiles
Deep neural networks for matching online social networking profilesDeep neural networks for matching online social networking profiles
Deep neural networks for matching online social networking profiles
 
Session 04 communicating results
Session 04 communicating resultsSession 04 communicating results
Session 04 communicating results
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 

Viewers also liked

Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...Bianca Pereira
 
Entity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationEntity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationFaegheh Hasibi
 
Exploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalExploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalFaegheh Hasibi
 
Being a PhD student: Experiences and Challenges
Being a PhD student: Experiences and ChallengesBeing a PhD student: Experiences and Challenges
Being a PhD student: Experiences and ChallengesFaegheh Hasibi
 
Camilo Thorne, Stefano Faralli and Heiner Stuckenschmidt | Entity Linking for...
Camilo Thorne, Stefano Faralli and Heiner Stuckenschmidt | Entity Linking for...Camilo Thorne, Stefano Faralli and Heiner Stuckenschmidt | Entity Linking for...
Camilo Thorne, Stefano Faralli and Heiner Stuckenschmidt | Entity Linking for...semanticsconference
 

Viewers also liked (8)

Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
 
Entity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationEntity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and Evaluation
 
Exploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalExploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity Retrieval
 
Being a PhD student: Experiences and Challenges
Being a PhD student: Experiences and ChallengesBeing a PhD student: Experiences and Challenges
Being a PhD student: Experiences and Challenges
 
Entity Linking
Entity LinkingEntity Linking
Entity Linking
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 
Camilo Thorne, Stefano Faralli and Heiner Stuckenschmidt | Entity Linking for...
Camilo Thorne, Stefano Faralli and Heiner Stuckenschmidt | Entity Linking for...Camilo Thorne, Stefano Faralli and Heiner Stuckenschmidt | Entity Linking for...
Camilo Thorne, Stefano Faralli and Heiner Stuckenschmidt | Entity Linking for...
 

Similar to PhD Day: Entity Linking using Generic Linked Data Datasets

Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Bianca Pereira
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataAndre Freitas
 
EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven
EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven
EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven EPR1
 
Toward a System Building Agenda for Data Integration(and Dat.docx
Toward a System Building Agenda for Data Integration(and Dat.docxToward a System Building Agenda for Data Integration(and Dat.docx
Toward a System Building Agenda for Data Integration(and Dat.docxjuliennehar
 
Querying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebEdward Curry
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
Application and Methods of Deep Learning in IoT
Application and Methods of Deep Learning in IoTApplication and Methods of Deep Learning in IoT
Application and Methods of Deep Learning in IoTIJAEMSJORNAL
 
A distributional structured semantic space for querying rdf graph data
A distributional structured semantic space for querying rdf graph dataA distributional structured semantic space for querying rdf graph data
A distributional structured semantic space for querying rdf graph dataAndre Freitas
 
From Linked Data to Semantic Applications
From Linked Data to Semantic ApplicationsFrom Linked Data to Semantic Applications
From Linked Data to Semantic ApplicationsAndre Freitas
 
Questions On The And Football
Questions On The And FootballQuestions On The And Football
Questions On The And FootballAmanda Gray
 
OWF14 - Plenary Session : Ori Pekelman, Founder, Constellation Matrix
OWF14 - Plenary Session : Ori Pekelman, Founder, Constellation MatrixOWF14 - Plenary Session : Ori Pekelman, Founder, Constellation Matrix
OWF14 - Plenary Session : Ori Pekelman, Founder, Constellation MatrixParis Open Source Summit
 
Learn Real World Machine Learning By Building Projects
Learn Real World Machine Learning By Building ProjectsLearn Real World Machine Learning By Building Projects
Learn Real World Machine Learning By Building ProjectsJohn Alex
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectbodaceacat
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSara-Jayne Terp
 
Text Analytics Market Insights: What's Working and What's Next
Text Analytics Market Insights: What's Working and What's NextText Analytics Market Insights: What's Working and What's Next
Text Analytics Market Insights: What's Working and What's NextSeth Grimes
 
Generating domain specific sentiment lexicons using the Web Directory
Generating domain specific sentiment lexicons using the Web Directory Generating domain specific sentiment lexicons using the Web Directory
Generating domain specific sentiment lexicons using the Web Directory acijjournal
 
Speech Conversion Using Neural Networks
Speech Conversion Using Neural NetworksSpeech Conversion Using Neural Networks
Speech Conversion Using Neural NetworksJessica Moore
 

Similar to PhD Day: Entity Linking using Generic Linked Data Datasets (20)

Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big data
 
EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven
EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven
EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven
 
Toward a System Building Agenda for Data Integration(and Dat.docx
Toward a System Building Agenda for Data Integration(and Dat.docxToward a System Building Agenda for Data Integration(and Dat.docx
Toward a System Building Agenda for Data Integration(and Dat.docx
 
Querying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data Web
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Application and Methods of Deep Learning in IoT
Application and Methods of Deep Learning in IoTApplication and Methods of Deep Learning in IoT
Application and Methods of Deep Learning in IoT
 
A distributional structured semantic space for querying rdf graph data
A distributional structured semantic space for querying rdf graph dataA distributional structured semantic space for querying rdf graph data
A distributional structured semantic space for querying rdf graph data
 
From Linked Data to Semantic Applications
From Linked Data to Semantic ApplicationsFrom Linked Data to Semantic Applications
From Linked Data to Semantic Applications
 
Questions On The And Football
Questions On The And FootballQuestions On The And Football
Questions On The And Football
 
OWF14 - Plenary Session : Ori Pekelman, Founder, Constellation Matrix
OWF14 - Plenary Session : Ori Pekelman, Founder, Constellation MatrixOWF14 - Plenary Session : Ori Pekelman, Founder, Constellation Matrix
OWF14 - Plenary Session : Ori Pekelman, Founder, Constellation Matrix
 
Learn Real World Machine Learning By Building Projects
Learn Real World Machine Learning By Building ProjectsLearn Real World Machine Learning By Building Projects
Learn Real World Machine Learning By Building Projects
 
MATLAB Essay
MATLAB EssayMATLAB Essay
MATLAB Essay
 
BrightTALK - Semantic AI
BrightTALK - Semantic AI BrightTALK - Semantic AI
BrightTALK - Semantic AI
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Text Analytics Market Insights: What's Working and What's Next
Text Analytics Market Insights: What's Working and What's NextText Analytics Market Insights: What's Working and What's Next
Text Analytics Market Insights: What's Working and What's Next
 
Generating domain specific sentiment lexicons using the Web Directory
Generating domain specific sentiment lexicons using the Web Directory Generating domain specific sentiment lexicons using the Web Directory
Generating domain specific sentiment lexicons using the Web Directory
 
Speech Conversion Using Neural Networks
Speech Conversion Using Neural NetworksSpeech Conversion Using Neural Networks
Speech Conversion Using Neural Networks
 

More from Bianca Pereira

Dealing with writer's block
Dealing with writer's blockDealing with writer's block
Dealing with writer's blockBianca Pereira
 
HCI Challenges in Crowd4Access Citizen Science project
HCI Challenges in Crowd4Access Citizen Science projectHCI Challenges in Crowd4Access Citizen Science project
HCI Challenges in Crowd4Access Citizen Science projectBianca Pereira
 
Taxonomy Extraction for Customer Service Knowledge Base Construction
Taxonomy Extraction for Customer Service Knowledge Base ConstructionTaxonomy Extraction for Customer Service Knowledge Base Construction
Taxonomy Extraction for Customer Service Knowledge Base ConstructionBianca Pereira
 
How to build your topic?
How to build your topic?How to build your topic?
How to build your topic?Bianca Pereira
 
Dealing with writer's block
Dealing with writer's blockDealing with writer's block
Dealing with writer's blockBianca Pereira
 
Smart Futures presentation at St. Raphael's College
Smart Futures presentation at St. Raphael's CollegeSmart Futures presentation at St. Raphael's College
Smart Futures presentation at St. Raphael's CollegeBianca Pereira
 
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...Bianca Pereira
 
Tutorial de Web Semântica - CompSem 2015
Tutorial de Web Semântica - CompSem 2015Tutorial de Web Semântica - CompSem 2015
Tutorial de Web Semântica - CompSem 2015Bianca Pereira
 
DBpedia as Gaeilge Chapter
DBpedia as Gaeilge ChapterDBpedia as Gaeilge Chapter
DBpedia as Gaeilge ChapterBianca Pereira
 
PhD Day: Adaptive Entity Linking
PhD Day: Adaptive Entity LinkingPhD Day: Adaptive Entity Linking
PhD Day: Adaptive Entity LinkingBianca Pereira
 
PhD Day: Entity Linking using Ontology Modularization
PhD Day: Entity Linking using Ontology ModularizationPhD Day: Entity Linking using Ontology Modularization
PhD Day: Entity Linking using Ontology ModularizationBianca Pereira
 
NUIG Research Showcase 2014
NUIG Research Showcase 2014NUIG Research Showcase 2014
NUIG Research Showcase 2014Bianca Pereira
 
AELA: An Adaptive Entity Linking Approach
AELA: An Adaptive Entity Linking ApproachAELA: An Adaptive Entity Linking Approach
AELA: An Adaptive Entity Linking ApproachBianca Pereira
 
How to Make Your Content Smarter
How to Make Your Content SmarterHow to Make Your Content Smarter
How to Make Your Content SmarterBianca Pereira
 
Reading Group 2014 (Insight NUIG)
Reading Group 2014 (Insight NUIG)Reading Group 2014 (Insight NUIG)
Reading Group 2014 (Insight NUIG)Bianca Pereira
 

More from Bianca Pereira (15)

Dealing with writer's block
Dealing with writer's blockDealing with writer's block
Dealing with writer's block
 
HCI Challenges in Crowd4Access Citizen Science project
HCI Challenges in Crowd4Access Citizen Science projectHCI Challenges in Crowd4Access Citizen Science project
HCI Challenges in Crowd4Access Citizen Science project
 
Taxonomy Extraction for Customer Service Knowledge Base Construction
Taxonomy Extraction for Customer Service Knowledge Base ConstructionTaxonomy Extraction for Customer Service Knowledge Base Construction
Taxonomy Extraction for Customer Service Knowledge Base Construction
 
How to build your topic?
How to build your topic?How to build your topic?
How to build your topic?
 
Dealing with writer's block
Dealing with writer's blockDealing with writer's block
Dealing with writer's block
 
Smart Futures presentation at St. Raphael's College
Smart Futures presentation at St. Raphael's CollegeSmart Futures presentation at St. Raphael's College
Smart Futures presentation at St. Raphael's College
 
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
 
Tutorial de Web Semântica - CompSem 2015
Tutorial de Web Semântica - CompSem 2015Tutorial de Web Semântica - CompSem 2015
Tutorial de Web Semântica - CompSem 2015
 
DBpedia as Gaeilge Chapter
DBpedia as Gaeilge ChapterDBpedia as Gaeilge Chapter
DBpedia as Gaeilge Chapter
 
PhD Day: Adaptive Entity Linking
PhD Day: Adaptive Entity LinkingPhD Day: Adaptive Entity Linking
PhD Day: Adaptive Entity Linking
 
PhD Day: Entity Linking using Ontology Modularization
PhD Day: Entity Linking using Ontology ModularizationPhD Day: Entity Linking using Ontology Modularization
PhD Day: Entity Linking using Ontology Modularization
 
NUIG Research Showcase 2014
NUIG Research Showcase 2014NUIG Research Showcase 2014
NUIG Research Showcase 2014
 
AELA: An Adaptive Entity Linking Approach
AELA: An Adaptive Entity Linking ApproachAELA: An Adaptive Entity Linking Approach
AELA: An Adaptive Entity Linking Approach
 
How to Make Your Content Smarter
How to Make Your Content SmarterHow to Make Your Content Smarter
How to Make Your Content Smarter
 
Reading Group 2014 (Insight NUIG)
Reading Group 2014 (Insight NUIG)Reading Group 2014 (Insight NUIG)
Reading Group 2014 (Insight NUIG)
 

Recently uploaded

Seagate HDD Firmware Repair Tool Datasheet 2024
Seagate HDD Firmware Repair Tool Datasheet 2024Seagate HDD Firmware Repair Tool Datasheet 2024
Seagate HDD Firmware Repair Tool Datasheet 2024Dolphin Data Lab
 
Practical SEO for WordPress Bloggers.pdf
Practical SEO for WordPress Bloggers.pdfPractical SEO for WordPress Bloggers.pdf
Practical SEO for WordPress Bloggers.pdfNile Flores
 
ConFoo 2024 - Sylius 2.0, top-notch eCommerce for customizable solution
ConFoo 2024 - Sylius 2.0, top-notch eCommerce for customizable solutionConFoo 2024 - Sylius 2.0, top-notch eCommerce for customizable solution
ConFoo 2024 - Sylius 2.0, top-notch eCommerce for customizable solutionŁukasz Chruściel
 
DNS-OARC 42: Is the DNS ready for IPv6? presentation by Geoff Huston
DNS-OARC 42: Is the DNS ready for IPv6? presentation by Geoff HustonDNS-OARC 42: Is the DNS ready for IPv6? presentation by Geoff Huston
DNS-OARC 42: Is the DNS ready for IPv6? presentation by Geoff HustonAPNIC
 
Information Technology Project to Create a Business
Information Technology Project to Create a BusinessInformation Technology Project to Create a Business
Information Technology Project to Create a Businessmbowl010
 
NANOG 90: 'BGP in 2023' presented by Geoff Huston
NANOG 90: 'BGP in 2023' presented by Geoff HustonNANOG 90: 'BGP in 2023' presented by Geoff Huston
NANOG 90: 'BGP in 2023' presented by Geoff HustonAPNIC
 
Biometrics Technology Intresting PPT
Biometrics Technology Intresting PPTBiometrics Technology Intresting PPT
Biometrics Technology Intresting PPTPraveenKumarThota7
 
ConFoo 2024 - Need for Speed: Removing speed bumps in API Projects
ConFoo 2024  - Need for Speed: Removing speed bumps in API ProjectsConFoo 2024  - Need for Speed: Removing speed bumps in API Projects
ConFoo 2024 - Need for Speed: Removing speed bumps in API ProjectsŁukasz Chruściel
 
Reactive programming with Spring Webflux.pptx
Reactive programming with Spring Webflux.pptxReactive programming with Spring Webflux.pptx
Reactive programming with Spring Webflux.pptxJoão Esperancinha
 
WAN-IFRA: World Press Trends Outlook 2023-2024
WAN-IFRA: World Press Trends Outlook 2023-2024WAN-IFRA: World Press Trends Outlook 2023-2024
WAN-IFRA: World Press Trends Outlook 2023-2024Damian Radcliffe
 

Recently uploaded (10)

Seagate HDD Firmware Repair Tool Datasheet 2024
Seagate HDD Firmware Repair Tool Datasheet 2024Seagate HDD Firmware Repair Tool Datasheet 2024
Seagate HDD Firmware Repair Tool Datasheet 2024
 
Practical SEO for WordPress Bloggers.pdf
Practical SEO for WordPress Bloggers.pdfPractical SEO for WordPress Bloggers.pdf
Practical SEO for WordPress Bloggers.pdf
 
ConFoo 2024 - Sylius 2.0, top-notch eCommerce for customizable solution
ConFoo 2024 - Sylius 2.0, top-notch eCommerce for customizable solutionConFoo 2024 - Sylius 2.0, top-notch eCommerce for customizable solution
ConFoo 2024 - Sylius 2.0, top-notch eCommerce for customizable solution
 
DNS-OARC 42: Is the DNS ready for IPv6? presentation by Geoff Huston
DNS-OARC 42: Is the DNS ready for IPv6? presentation by Geoff HustonDNS-OARC 42: Is the DNS ready for IPv6? presentation by Geoff Huston
DNS-OARC 42: Is the DNS ready for IPv6? presentation by Geoff Huston
 
Information Technology Project to Create a Business
Information Technology Project to Create a BusinessInformation Technology Project to Create a Business
Information Technology Project to Create a Business
 
NANOG 90: 'BGP in 2023' presented by Geoff Huston
NANOG 90: 'BGP in 2023' presented by Geoff HustonNANOG 90: 'BGP in 2023' presented by Geoff Huston
NANOG 90: 'BGP in 2023' presented by Geoff Huston
 
Biometrics Technology Intresting PPT
Biometrics Technology Intresting PPTBiometrics Technology Intresting PPT
Biometrics Technology Intresting PPT
 
ConFoo 2024 - Need for Speed: Removing speed bumps in API Projects
ConFoo 2024  - Need for Speed: Removing speed bumps in API ProjectsConFoo 2024  - Need for Speed: Removing speed bumps in API Projects
ConFoo 2024 - Need for Speed: Removing speed bumps in API Projects
 
Reactive programming with Spring Webflux.pptx
Reactive programming with Spring Webflux.pptxReactive programming with Spring Webflux.pptx
Reactive programming with Spring Webflux.pptx
 
WAN-IFRA: World Press Trends Outlook 2023-2024
WAN-IFRA: World Press Trends Outlook 2023-2024WAN-IFRA: World Press Trends Outlook 2023-2024
WAN-IFRA: World Press Trends Outlook 2023-2024
 

PhD Day: Entity Linking using Generic Linked Data Datasets

  • 1.  Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.ie Entity Linking Using Generic Linked Data Datasets PhD Day – April/2013 Bianca Pereira
  • 2. Digital Enterprise Research Institute www.deri.ie Agenda  Motivation  Problem  Related Work  Research Questions  Next Steps  Challenges 2 of XYZ
  • 3. Digital Enterprise Research Institute www.deri.ie Motivation  Biggest part of the content available on the web is unstructured natural language text.  How to structure natural language texts in order to be easier to process them? 3 of XYZ
  • 4. Digital Enterprise Research Institute www.deri.ie Motivation  There are three possible solutions to this problem:  Extract knowledge from text according to a given structure (ontology population from text).  Extract knowledge from text without using a previous structure (ontology learning from text).  Link mention from text with entities from a structured knowledge base (entity linking). 4 of XYZ
  • 5. Digital Enterprise Research Institute www.deri.ie Motivation  Entity Linking..  .. enables reusing knowledge already published on the web.  .. can be used as the first step for ontology learning and population algorithms. 5 of XYZ
  • 6. Digital Enterprise Research Institute www.deri.ie Motivation  Many datasets have been used for Entity Linking:  Relational datasets  Wikipedia  DBPedia, YAGO, MusicBrainz, Freebase, … 6 of XYZ
  • 7. Digital Enterprise Research Institute www.deri.ie Motivation  Linked Data datasets are promising because..  .. many of them are public.  .. they are already structured.  .. they are interlinked.  .. they are available under diverse ownership.  .. they provide knowledge in diverse domains.  .. the LOD cloud is growing. 7 of XYZ
  • 8. Digital Enterprise Research Institute www.deri.ie Motivation  There are already some Entity Linking solutions using Linked Data datasets. 8 of XYZ
  • 9. Digital Enterprise Research Institute www.deri.ie Problem  Current Entity Linking Approaches work only with a small fixed number of Linked Data datasets.  AIDA (YAGO)  Alchemy API (CIA Factbook, CrunchBase, Freebase, GeoNames, MusicBrainz, OpenCyc, UMBEL, US Census, YAGO)  DBPedia Spotlight (DBPedia)  Open Calais (Calais) 9 of XYZ
  • 10. Digital Enterprise Research Institute www.deri.ie Problem  Current tools work well with generic knowledge and public datasets. But what do we do if we want to..  .. link an enterprise text with a private dataset?  .. identify domain specific entities? 10 of XYZ
  • 11. Digital Enterprise Research Institute www.deri.ie Problem  AELA (Adaptive Entity Linking Approach) was developed to solve this problem.. 11 of XYZ
  • 12. Digital Enterprise Research Institute www.deri.ie Problem  What AELA does not solve is..  .. the recognition of generalized entities/topics (such as genes and diseases).  .. the recognition of individuals with the same name as their classes (such as ambulance, coffee machine and airplane). 12 of XYZ
  • 13. Digital Enterprise Research Institute www.deri.ie Related Work  Which topics are related to Entity Linking?  Entity Resolution, coreference resolution, merge-purge, data deduplication, object identification, mention matching, tuple matching, record linkage, entity disambiguation, anaphora resolution, instance identification, database hardening, entity identification, identity resolution, reference reconciliation, record matching, name matching, identity uncertainty, duplicate detection, entity matching, instance matching, entity consolidation, entity reconciliation, object consolidation, topic consolidation, reference disambiguation, instance fusion, data fusion. 13 of XYZ
  • 14. Digital Enterprise Research Institute www.deri.ie Related Work 14 of XYZ
  • 15. Digital Enterprise Research Institute www.deri.ie Research Questions  Which methods created in last 5 decades can be used to improve AELA results?  How can AELA adapt itself to a given domain?  What are the use cases in which AELA can be applied? Is it better than previous approaches?  May AELA be language independent? 15 of XYZ
  • 16. Digital Enterprise Research Institute www.deri.ie Research Questions  The most important question.. What is an entity!?  Object?  Concept?  Topic? 16 of XYZ
  • 17. Digital Enterprise Research Institute www.deri.ie Next Steps  Survey the methods used in related areas.  Evaluation of the methods within AELA architecture.  Develop a method to select a given Linked Data dataset given the domain from text.  Apply AELA to news domain.  Evaluate AELA using datasets in other languages. 17 of XYZ
  • 18. Digital Enterprise Research Institute www.deri.ie Next Steps  Define entity. 18 of XYZ
  • 19. Digital Enterprise Research Institute www.deri.ie Challenges  Many previous works  Big Data issues  Linked Data issues (standards and data quality)  Evaluation issues 19 of XYZ
  • 20. Digital Enterprise Research Institute www.deri.ie QUESTIONS? 20 of XYZ