SlideShare a Scribd company logo
1 of 21
Mapping Keywords to Linked Data Resources
for
Automatic Query Expansion
Isabelle Augenstein1
Anna Lisa Gentile1
Barry Norton2
Ziqi Zhang1
Fabio Ciravegna1
1
Department of Computer Science, University of Sheffield, UK
2
Ontotext, UK
{i.augenstein,a.l.gentile,z.zhang,f.ciravegna}@dcs.shef.ac.uk,
barry.norton@ontotext.com
May 26, 2013
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 1 / 21
Motivation
In order to consume Linked Data, end users need to be
familiar with RDF’s data model and query language SPARQL
have knowledge of datasets and their contents
Keyword search is a means to overcome these barriers
Map keywords to RDF resources
“film” → dbpedia-owl:Film
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 2 / 21
Motivation
Challenges for keyword search:
Spelling mistakes, e.g. “flim”
Lexical derivations, e.g. “films”
Synonyms, e.g. “movie”
How to address these challenges? → query expansion
What is query expansion?
Process of expanding a seed query with additional query terms
to improve recall
“film”, “flim”, “films”, “movie” → dbpedia-owl:Film
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 3 / 21
State of the art: How to find mappings
String similarity (Sindice [4])
“flim”, “films” → dbpedia-owl:Film
Domain-independent approach
Problem: Only does expansion for spelling mistakes and
lexically similar words
Dictionary-based methods (WordNet [3], Wikipedia [2])
“movie” → dbpedia-owl:Film
Approach finds synonyms
Problem: Dictionaries contain limited, domain-specific
vocabulary, WordNet has very few named entities
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 4 / 21
State of the art: How to rank resources
String similarity
Rank by decreasing String similarity (SearchWebDB [5])
Rank by decreasing tf-idf (Falcons Object Search [1])
Dictionary-based methods (WordNet ([3]), Wikipedia ([2]))
Rank by decreasing semantic similarity (ESA [2])
Combine frequency and confidence (PowerAqua [3])
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 5 / 21
Our approach
How to find mappings
In order to have a highly adaptable, domain-independent
approach, use knowledge contained within the dataset
Benefit from properties between resources in Linked Data,
which are potentially useful for finding semantically similar
keywords
How to rank resources
Follow intuition of state of the art methods
Combine tf-idf and confidence of our measure
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 6 / 21
Method: Overview
Scenario: User wants to find representative Linked Data
resources for keyword w
Example: “movie” → dbpedia-owl:Film
Dataset: Set of triples consisting of a ‘subject’, ‘predicate’ and
an ‘object’.
Current application: Find classes and properties
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 7 / 21
Method: Overview
Step 1: For each keyword w, learn an expanded set of
keywords Ew and a ranking of Ew to find semantically similar
words in the target vocabulary
Example: “movie” → “movie”’, “film”, “films”
Step 2: Identify concepts through labelling properties
“movie” → ∅
“film” → dbpedia-owl:Film
“films” → ∅
Step 3: Rank resulting concepts and return concept with
highest rank
“movie” → dbpedia-owl:Film
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 8 / 21
Method: Overview
Set of labelling properties to identify concepts
rdfs:label
foaf:name
dc:title
skos:prefLabel
skos:altLabel
fb:type.object.name
Table : Labelling properties
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 9 / 21
Method: Candidate Identification
Step 1: For each keyword w, learn an expanded set of keywords Ew
and a ranking of Ew
Step 1.1: Which properties are useful in expressing semantic
similarity?
Get resource r for keyword using labelling properties
“movie” → fb:Movie
Get all triples where r is a subject
fb:Movie wn:containsWordSense dbpedia-owl:show
fb:Movie dc:description ‘‘movie director’’
fb:Movie commontag:label ‘‘Film’’
For each of the objects of the triples, find the labels and
tokenise them
“movie director” → “movie”, “director”
Find resources for them in the target vocabulary
“show” → dbpedia-owl:show
“Film” → dbpedia-owl:Film
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 10 / 21
Method: Training
Step 1.2: How to learn a ranking for these properties?
Select a list of keywords, run step 1.1 for each of them
Manually chose the best resources among the candidates
“show”, “Film” → “Film”
Produce a precision measure for every property used to find
candidates
prec(p) = w∈
−−−−→
Wtrain
hits(p, w)
w∈
−−−−→
Wtrain
candidates(p, w)
Define treshold 0 ≤ θ ≤ 1, use prec to define an ordered
(ranked) subset of properties to encode semantic relatedness
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 11 / 21
Method: Test
Test set of keywords
−−−−→
Wtest
, apply algorithm to obtain
candidates
Only take subset of properties instead of all properties to find
candidates
Combine, as a numerical product, the precision of the property,
p, used to find that candidate and a tf-idf score for r
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 12 / 21
Evaluation: Gold standard and Metric
Used DBpedia ontology as the target vocabulary and Sindice
cache as dataset for computing expanded set of keywords Ew
Gold standard from Freitas et al. [2], contains 178 keywords of
which 134 have a representation in the DBpedia ontology
Corrected minor errors in gold-standard and re-evaluated
approach by Freitas et al. [2]
Actual figures don’t change significantly
Use Mean Reciprocal Rank (MRR) which measures the quality
of the ranking by calculating the inverse rank of the best result
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 13 / 21
Evaluation: Approach
Manually create training set consisting of 40 keywords for
supervised training phase
Result of training phase: 194 candidate properties
Use precision threshold of of 0.045 to cut off the candidate
property list. This resulted in 23 properties.
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 14 / 21
Evaluation: Result of Training
foaf:name 0.267 owl:sameAs 0.121
fb-common:topic 0.189 wn20schema:gloss 0.1
rdfs:label 0.187 opencyc:seeAlsoURI 0.093
dc:subject 0.182 rdfs:seeAlso 0.082
dc:title 0.169 dbpedia-owl:abstract 0.0774
sindice:label 0.168 rdfs:comment 0.0676
rdfs:suBClassOf 0.168 rdfs:range 0.0667
skos:prefLabel 0.157 rdfs:subClassOf 0.0656
fb-type:object 0.143 fb:documented object 0.0619
wn20schema:derivationallyRelated 0.143 dbpedia-owl:wikiPageWikiLink 0.0487
wn20schema:containsWordSense 0.138 dc:description 0.0471
commontag:label 0.133
Table : Top 23 properties used and their precision
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 15 / 21
Evaluation: Example results of Test
Query: [spacecraft] Query: [engine] Query: [factory]
dbpedia-owl:Spacecraft dbpedia-owl:engine dbpedia-owl:manufacturer
dbpedia-owl:spacecraft dbpedia-owl:gameEngine dbpedia-owl:plant
dbpedia-owl:satellite dbpedia-owl:Artwork dbpedia-owl:Canal
dbpedia-owl:missions dbpedia-owl:AutomobileEngine dbpedia-owl:engine
dbpedia-owl:launches dbpedia-owl:Locomotive dbpedia-owl:class
dbpedia-owl:closed dbpedia-owl:fuel dbpedia-owl:Album
dbpedia-owl:vehicle dbpedia-owl:added dbpedia-owl:product
dbpedia-owl:Rocket dbpedia-owl:Musical dbpedia-owl:assembly
Query: [bass] Query: [wife] Query: [honda]
dbpedia-owl:Fish dbpedia-owl:spouse dbpedia-owl:manufacturer
dbpedia-owl:Instrument dbpedia-owl:Criminal dbpedia-owl:discovered
dbpedia-owl:instrument dbpedia-owl:person dbpedia-owl:Asteroid
dbpedia-owl:voice dbpedia-owl:status dbpedia-owl:engine
dbpedia-owl:partner dbpedia-owl:education dbpedia-owl:season
dbpedia-owl:note dbpedia-owl:Language dbpedia-owl:vehicle
dbpedia-owl:Musical dbpedia-owl:sex dbpedia-owl:Automobile
dbpedia-owl:lowest dbpedia-owl:family dbpedia-owl:participant
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 16 / 21
Evaluation: Results of Test
Model MRR
ESA 0.6
LOD Keyword Expansion 0.77
Table : Mean reciprocal rank (MRR)
Model Recall
Strich match 0.45
String match + WordNet 0.52
ESA 0.87
LOD Keyword Expansion 0.90
Table : Percentage queries answered (Recall)
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 17 / 21
Conclusion
Method for automatic query expansion for Linked Data
resources based on using properties between resources within
the Linked Open Data cloud
“film”, “flim”, “films”, “movie” → dbpedia-owl:Film
Evaluation showed how useful these different properties are for
finding semantic similarities and thereby finding expanded
keywords
Improvement of 17% in MRR over state of the art
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 18 / 21
Future Work
Related work ([6]) shows that best results can be achieved by
following a multi-strategy approach (String similarity + WordNet
+ ESA), which we could also integrate our approach in
Adjust labelling properties (add the ones discovered in training)
Treat labelling properties seperately from other properties
Fine-tune tokenisation of literals
Perform in vivo evaluation of the task, e.g. in the context of
question answering
Reevaluate for instances
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 19 / 21
Bibliography I
Cheng, G., Ge, W., Qu, Y.: Falcons: searching and browsing
entities on the semantic web. In: Proceedings of the 17th
international conference on World Wide Web. pp. 1101–1102.
ACM (2008)
Freitas, A., Curry, E., Oliveira, J.G., O’Riain, S.: A distributional
structured semantic space for querying rdf graph data.
International Journal of Semantic Computing 5(04), 433–462
(2011)
Lopez, V., Nikolov, A., Fernandez, M., Sabou, M., Uren, V.,
Motta, E.: Merging and ranking answers in the semantic web:
The wisdom of crowds. The semantic web pp. 135–152 (2009)
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 20 / 21
Bibliography II
Oren, E., Delbru, R., Catasta, M., Cyganiak, R., Stenzhorn, H.,
Tummarello, G.: Sindice. com: a document-oriented lookup
index for open linked data. International Journal of Metadata,
Semantics and Ontologies 3(1), 37–52 (2008)
Tran, T., Wang, H., Haase, P.: Searchwebdb: Data web search
on a pay-as-you-go integration infrastructure. Tech. rep.,
Technical report, University of Karlsruhe (2008)
Walter, S., Unger, C., Cimiano, P., B¨ar, D.: Evaluation of a
layered approach to question answering over linked data. In:
The Semantic Web–ISWC 2012, pp. 362–374. Springer (2012)
Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 21 / 21

More Related Content

What's hot

Applications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignApplications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignAnubhav Jain
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalBhaskar Mitra
 
NLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextNLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextHortonworks
 
NLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaNLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaSpark Summit
 
Natural Language Processing on Non-Textual Data
Natural Language Processing on Non-Textual DataNatural Language Processing on Non-Textual Data
Natural Language Processing on Non-Textual Datagpano
 
Duet @ TREC 2019 Deep Learning Track
Duet @ TREC 2019 Deep Learning TrackDuet @ TREC 2019 Deep Learning Track
Duet @ TREC 2019 Deep Learning TrackBhaskar Mitra
 
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆台灣資料科學年會
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAcsandit
 
A Machine learning approach to classify a pair of sentence as duplicate or not.
A Machine learning approach to classify a pair of sentence as duplicate or not.A Machine learning approach to classify a pair of sentence as duplicate or not.
A Machine learning approach to classify a pair of sentence as duplicate or not.Pankaj Chandan Mohapatra
 
Test PDF
Test PDFTest PDF
Test PDFAlgnuD
 
Automatic Metadata Generation using Associative Networks
Automatic Metadata Generation using Associative NetworksAutomatic Metadata Generation using Associative Networks
Automatic Metadata Generation using Associative NetworksMarko Rodriguez
 
Best Keyword Cover Search
Best Keyword Cover SearchBest Keyword Cover Search
Best Keyword Cover Search1crore projects
 
A Model of the Scholarly Community
A Model of the Scholarly CommunityA Model of the Scholarly Community
A Model of the Scholarly CommunityMarko Rodriguez
 
Quora questions pair duplication analysis using semantic analysis
Quora questions pair duplication analysis using semantic analysisQuora questions pair duplication analysis using semantic analysis
Quora questions pair duplication analysis using semantic analysisAkshata Talankar
 
On Incentive-based Tagging
On Incentive-based TaggingOn Incentive-based Tagging
On Incentive-based TaggingFrancesco Rizzo
 
Machine Learning With R
Machine Learning With RMachine Learning With R
Machine Learning With RDavid Chiu
 
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Parang Saraf
 

What's hot (20)

Applications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignApplications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials Design
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information Retrieval
 
NLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextNLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-Text
 
NLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey StellaNLP Structured Data Investigation on Non-Text by Casey Stella
NLP Structured Data Investigation on Non-Text by Casey Stella
 
Natural Language Processing on Non-Textual Data
Natural Language Processing on Non-Textual DataNatural Language Processing on Non-Textual Data
Natural Language Processing on Non-Textual Data
 
Duet @ TREC 2019 Deep Learning Track
Duet @ TREC 2019 Deep Learning TrackDuet @ TREC 2019 Deep Learning Track
Duet @ TREC 2019 Deep Learning Track
 
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
 
DL'12 mastro at work
DL'12 mastro at workDL'12 mastro at work
DL'12 mastro at work
 
Working with text data
Working with text dataWorking with text data
Working with text data
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
 
A Machine learning approach to classify a pair of sentence as duplicate or not.
A Machine learning approach to classify a pair of sentence as duplicate or not.A Machine learning approach to classify a pair of sentence as duplicate or not.
A Machine learning approach to classify a pair of sentence as duplicate or not.
 
Test PDF
Test PDFTest PDF
Test PDF
 
Automatic Metadata Generation using Associative Networks
Automatic Metadata Generation using Associative NetworksAutomatic Metadata Generation using Associative Networks
Automatic Metadata Generation using Associative Networks
 
Best Keyword Cover Search
Best Keyword Cover SearchBest Keyword Cover Search
Best Keyword Cover Search
 
A Model of the Scholarly Community
A Model of the Scholarly CommunityA Model of the Scholarly Community
A Model of the Scholarly Community
 
Quora questions pair duplication analysis using semantic analysis
Quora questions pair duplication analysis using semantic analysisQuora questions pair duplication analysis using semantic analysis
Quora questions pair duplication analysis using semantic analysis
 
On Incentive-based Tagging
On Incentive-based TaggingOn Incentive-based Tagging
On Incentive-based Tagging
 
Machine Learning With R
Machine Learning With RMachine Learning With R
Machine Learning With R
 
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
 
BoTLRet: A Template-based Linked Data Information Retrieval
 BoTLRet: A Template-based Linked Data Information Retrieval BoTLRet: A Template-based Linked Data Information Retrieval
BoTLRet: A Template-based Linked Data Information Retrieval
 

Viewers also liked

Web Scale Information Extraction tutorial ecml2013
Web Scale Information Extraction tutorial ecml2013Web Scale Information Extraction tutorial ecml2013
Web Scale Information Extraction tutorial ecml2013Anna Lisa Gentile
 
Sensing 
Presence
(PreSense)
Ontology
–
 
User 
Modelling
 in 
the 
Semantic ...
Sensing 
Presence
(PreSense)
Ontology
–
 
User 
Modelling
 in 
the 
Semantic ...Sensing 
Presence
(PreSense)
Ontology
–
 
User 
Modelling
 in 
the 
Semantic ...
Sensing 
Presence
(PreSense)
Ontology
–
 
User 
Modelling
 in 
the 
Semantic ...Amparo Elizabeth Cano Basave
 
Exploring the similarity between Social Knowledge Sources and Twitter for Cro...
Exploring the similarity between Social Knowledge Sources and Twitter for Cro...Exploring the similarity between Social Knowledge Sources and Twitter for Cro...
Exploring the similarity between Social Knowledge Sources and Twitter for Cro...Andrea Varga
 
Volatile Classification of Point of Interests based on Social Activity Streams
Volatile Classification of Point of Interests based on Social Activity StreamsVolatile Classification of Point of Interests based on Social Activity Streams
Volatile Classification of Point of Interests based on Social Activity StreamsAmparo Elizabeth Cano Basave
 
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Tech...
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Tech...Infrastructure and Workflow for the Formal Evaluation of Semantic Search Tech...
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Tech...Stuart Wrigley
 
Methodology and Campaign Design for the Evaluation of Semantic Search Tools
Methodology and Campaign Design for the Evaluation of Semantic Search ToolsMethodology and Campaign Design for the Evaluation of Semantic Search Tools
Methodology and Campaign Design for the Evaluation of Semantic Search ToolsStuart Wrigley
 
Asterid: Linked Data Asterisms
Asterid: Linked Data AsterismsAsterid: Linked Data Asterisms
Asterid: Linked Data AsterismsGregoire Burel
 
Information Extraction with Linked Data
Information Extraction with Linked DataInformation Extraction with Linked Data
Information Extraction with Linked DataIsabelle Augenstein
 
Natural Language Processing for the Semantic Web
Natural Language Processing for the Semantic WebNatural Language Processing for the Semantic Web
Natural Language Processing for the Semantic WebIsabelle Augenstein
 
USFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
USFD at SemEval-2016 - Stance Detection on Twitter with AutoencodersUSFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
USFD at SemEval-2016 - Stance Detection on Twitter with AutoencodersIsabelle Augenstein
 
Weakly Supervised Machine Reading
Weakly Supervised Machine ReadingWeakly Supervised Machine Reading
Weakly Supervised Machine ReadingIsabelle Augenstein
 
Lodifier: Generating Linked Data from Unstructured Text
Lodifier: Generating Linked Data from Unstructured TextLodifier: Generating Linked Data from Unstructured Text
Lodifier: Generating Linked Data from Unstructured TextIsabelle Augenstein
 
Distant Supervision with Imitation Learning
Distant Supervision with Imitation LearningDistant Supervision with Imitation Learning
Distant Supervision with Imitation LearningIsabelle Augenstein
 
Seed Selection for Distantly Supervised Web-Based Relation Extraction
Seed Selection for Distantly Supervised Web-Based Relation ExtractionSeed Selection for Distantly Supervised Web-Based Relation Extraction
Seed Selection for Distantly Supervised Web-Based Relation ExtractionIsabelle Augenstein
 
Relation Extraction from the Web using Distant Supervision
Relation Extraction from the Web using Distant SupervisionRelation Extraction from the Web using Distant Supervision
Relation Extraction from the Web using Distant SupervisionIsabelle Augenstein
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Peter Mika
 
Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Andre Freitas
 

Viewers also liked (20)

Web Scale Information Extraction tutorial ecml2013
Web Scale Information Extraction tutorial ecml2013Web Scale Information Extraction tutorial ecml2013
Web Scale Information Extraction tutorial ecml2013
 
SEALS @ WWW2012
SEALS @ WWW2012SEALS @ WWW2012
SEALS @ WWW2012
 
Lodie overview
Lodie overviewLodie overview
Lodie overview
 
Sensing 
Presence
(PreSense)
Ontology
–
 
User 
Modelling
 in 
the 
Semantic ...
Sensing 
Presence
(PreSense)
Ontology
–
 
User 
Modelling
 in 
the 
Semantic ...Sensing 
Presence
(PreSense)
Ontology
–
 
User 
Modelling
 in 
the 
Semantic ...
Sensing 
Presence
(PreSense)
Ontology
–
 
User 
Modelling
 in 
the 
Semantic ...
 
Exploring the similarity between Social Knowledge Sources and Twitter for Cro...
Exploring the similarity between Social Knowledge Sources and Twitter for Cro...Exploring the similarity between Social Knowledge Sources and Twitter for Cro...
Exploring the similarity between Social Knowledge Sources and Twitter for Cro...
 
Volatile Classification of Point of Interests based on Social Activity Streams
Volatile Classification of Point of Interests based on Social Activity StreamsVolatile Classification of Point of Interests based on Social Activity Streams
Volatile Classification of Point of Interests based on Social Activity Streams
 
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Tech...
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Tech...Infrastructure and Workflow for the Formal Evaluation of Semantic Search Tech...
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Tech...
 
Methodology and Campaign Design for the Evaluation of Semantic Search Tools
Methodology and Campaign Design for the Evaluation of Semantic Search ToolsMethodology and Campaign Design for the Evaluation of Semantic Search Tools
Methodology and Campaign Design for the Evaluation of Semantic Search Tools
 
Asterid: Linked Data Asterisms
Asterid: Linked Data AsterismsAsterid: Linked Data Asterisms
Asterid: Linked Data Asterisms
 
Information Extraction with Linked Data
Information Extraction with Linked DataInformation Extraction with Linked Data
Information Extraction with Linked Data
 
Natural Language Processing for the Semantic Web
Natural Language Processing for the Semantic WebNatural Language Processing for the Semantic Web
Natural Language Processing for the Semantic Web
 
USFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
USFD at SemEval-2016 - Stance Detection on Twitter with AutoencodersUSFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
USFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
 
Weakly Supervised Machine Reading
Weakly Supervised Machine ReadingWeakly Supervised Machine Reading
Weakly Supervised Machine Reading
 
Lodifier: Generating Linked Data from Unstructured Text
Lodifier: Generating Linked Data from Unstructured TextLodifier: Generating Linked Data from Unstructured Text
Lodifier: Generating Linked Data from Unstructured Text
 
Distant Supervision with Imitation Learning
Distant Supervision with Imitation LearningDistant Supervision with Imitation Learning
Distant Supervision with Imitation Learning
 
Seed Selection for Distantly Supervised Web-Based Relation Extraction
Seed Selection for Distantly Supervised Web-Based Relation ExtractionSeed Selection for Distantly Supervised Web-Based Relation Extraction
Seed Selection for Distantly Supervised Web-Based Relation Extraction
 
Relation Extraction from the Web using Distant Supervision
Relation Extraction from the Web using Distant SupervisionRelation Extraction from the Web using Distant Supervision
Relation Extraction from the Web using Distant Supervision
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012
 
Human Neural Machine
Human Neural MachineHuman Neural Machine
Human Neural Machine
 
Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)
 

Similar to Mapping Keywords to

Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...Anubhav Jain
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021hala Skaf
 
Introduction
IntroductionIntroduction
Introductionbutest
 
Introduction
IntroductionIntroduction
Introductionbutest
 
Introduction
IntroductionIntroduction
Introductionbutest
 
Neo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExpNeo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExpAdrian Ziegler
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentMaribel Acosta Deibe
 
Download
DownloadDownload
Downloadbutest
 
Download
DownloadDownload
Downloadbutest
 
Topic detecton by clustering and text mining
Topic detecton by clustering and text miningTopic detecton by clustering and text mining
Topic detecton by clustering and text miningIRJET Journal
 
Tag And Tag Based Recommender
Tag And Tag Based RecommenderTag And Tag Based Recommender
Tag And Tag Based Recommendergu wendong
 
Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...butest
 
Big-Data Analytics for Media Management
Big-Data Analytics for Media ManagementBig-Data Analytics for Media Management
Big-Data Analytics for Media Managementtechkrish
 
How to build recommender system
How to build recommender systemHow to build recommender system
How to build recommender systemMitko Gurbanski
 
Session 1.5 supporting virtual integration of linked data with just-in-time...
Session 1.5   supporting virtual integration of linked data with just-in-time...Session 1.5   supporting virtual integration of linked data with just-in-time...
Session 1.5 supporting virtual integration of linked data with just-in-time...semanticsconference
 
G04124041046
G04124041046G04124041046
G04124041046IOSR-JEN
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentAmrapali Zaveri, PhD
 
Overview of Movie Recommendation System using Machine learning by R programmi...
Overview of Movie Recommendation System using Machine learning by R programmi...Overview of Movie Recommendation System using Machine learning by R programmi...
Overview of Movie Recommendation System using Machine learning by R programmi...IRJET Journal
 

Similar to Mapping Keywords to (20)

Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021
 
Introduction
IntroductionIntroduction
Introduction
 
Introduction
IntroductionIntroduction
Introduction
 
Introduction
IntroductionIntroduction
Introduction
 
Neo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExpNeo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExp
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
 
Download
DownloadDownload
Download
 
Download
DownloadDownload
Download
 
Topic detecton by clustering and text mining
Topic detecton by clustering and text miningTopic detecton by clustering and text mining
Topic detecton by clustering and text mining
 
Tag And Tag Based Recommender
Tag And Tag Based RecommenderTag And Tag Based Recommender
Tag And Tag Based Recommender
 
G1803054653
G1803054653G1803054653
G1803054653
 
Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...
 
Big-Data Analytics for Media Management
Big-Data Analytics for Media ManagementBig-Data Analytics for Media Management
Big-Data Analytics for Media Management
 
Property Alignment on Linked Open Data
Property Alignment on Linked Open DataProperty Alignment on Linked Open Data
Property Alignment on Linked Open Data
 
How to build recommender system
How to build recommender systemHow to build recommender system
How to build recommender system
 
Session 1.5 supporting virtual integration of linked data with just-in-time...
Session 1.5   supporting virtual integration of linked data with just-in-time...Session 1.5   supporting virtual integration of linked data with just-in-time...
Session 1.5 supporting virtual integration of linked data with just-in-time...
 
G04124041046
G04124041046G04124041046
G04124041046
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
 
Overview of Movie Recommendation System using Machine learning by R programmi...
Overview of Movie Recommendation System using Machine learning by R programmi...Overview of Movie Recommendation System using Machine learning by R programmi...
Overview of Movie Recommendation System using Machine learning by R programmi...
 

More from Isabelle Augenstein

Beyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific CommunicationBeyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific CommunicationIsabelle Augenstein
 
Automatically Detecting Scientific Misinformation
Automatically Detecting Scientific MisinformationAutomatically Detecting Scientific Misinformation
Automatically Detecting Scientific MisinformationIsabelle Augenstein
 
Accountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact CheckingAccountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact CheckingIsabelle Augenstein
 
Determining the Credibility of Science Communication
Determining the Credibility of Science CommunicationDetermining the Credibility of Science Communication
Determining the Credibility of Science CommunicationIsabelle Augenstein
 
Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)Isabelle Augenstein
 
Towards Explainable Fact Checking
Towards Explainable Fact CheckingTowards Explainable Fact Checking
Towards Explainable Fact CheckingIsabelle Augenstein
 
Tracking False Information Online
Tracking False Information OnlineTracking False Information Online
Tracking False Information OnlineIsabelle Augenstein
 
What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...Isabelle Augenstein
 
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Isabelle Augenstein
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondIsabelle Augenstein
 
Learning to read for automated fact checking
Learning to read for automated fact checkingLearning to read for automated fact checking
Learning to read for automated fact checkingIsabelle Augenstein
 
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...Isabelle Augenstein
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...Isabelle Augenstein
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...Isabelle Augenstein
 
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Isabelle Augenstein
 
Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Isabelle Augenstein
 

More from Isabelle Augenstein (17)

Beyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific CommunicationBeyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific Communication
 
Automatically Detecting Scientific Misinformation
Automatically Detecting Scientific MisinformationAutomatically Detecting Scientific Misinformation
Automatically Detecting Scientific Misinformation
 
Accountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact CheckingAccountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact Checking
 
Determining the Credibility of Science Communication
Determining the Credibility of Science CommunicationDetermining the Credibility of Science Communication
Determining the Credibility of Science Communication
 
Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)
 
Explainability for NLP
Explainability for NLPExplainability for NLP
Explainability for NLP
 
Towards Explainable Fact Checking
Towards Explainable Fact CheckingTowards Explainable Fact Checking
Towards Explainable Fact Checking
 
Tracking False Information Online
Tracking False Information OnlineTracking False Information Online
Tracking False Information Online
 
What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...
 
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
 
Learning to read for automated fact checking
Learning to read for automated fact checkingLearning to read for automated fact checking
Learning to read for automated fact checking
 
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
 
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
 
Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...
 

Recently uploaded

Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 

Recently uploaded (20)

Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 

Mapping Keywords to

  • 1. Mapping Keywords to Linked Data Resources for Automatic Query Expansion Isabelle Augenstein1 Anna Lisa Gentile1 Barry Norton2 Ziqi Zhang1 Fabio Ciravegna1 1 Department of Computer Science, University of Sheffield, UK 2 Ontotext, UK {i.augenstein,a.l.gentile,z.zhang,f.ciravegna}@dcs.shef.ac.uk, barry.norton@ontotext.com May 26, 2013 Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 1 / 21
  • 2. Motivation In order to consume Linked Data, end users need to be familiar with RDF’s data model and query language SPARQL have knowledge of datasets and their contents Keyword search is a means to overcome these barriers Map keywords to RDF resources “film” → dbpedia-owl:Film Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 2 / 21
  • 3. Motivation Challenges for keyword search: Spelling mistakes, e.g. “flim” Lexical derivations, e.g. “films” Synonyms, e.g. “movie” How to address these challenges? → query expansion What is query expansion? Process of expanding a seed query with additional query terms to improve recall “film”, “flim”, “films”, “movie” → dbpedia-owl:Film Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 3 / 21
  • 4. State of the art: How to find mappings String similarity (Sindice [4]) “flim”, “films” → dbpedia-owl:Film Domain-independent approach Problem: Only does expansion for spelling mistakes and lexically similar words Dictionary-based methods (WordNet [3], Wikipedia [2]) “movie” → dbpedia-owl:Film Approach finds synonyms Problem: Dictionaries contain limited, domain-specific vocabulary, WordNet has very few named entities Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 4 / 21
  • 5. State of the art: How to rank resources String similarity Rank by decreasing String similarity (SearchWebDB [5]) Rank by decreasing tf-idf (Falcons Object Search [1]) Dictionary-based methods (WordNet ([3]), Wikipedia ([2])) Rank by decreasing semantic similarity (ESA [2]) Combine frequency and confidence (PowerAqua [3]) Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 5 / 21
  • 6. Our approach How to find mappings In order to have a highly adaptable, domain-independent approach, use knowledge contained within the dataset Benefit from properties between resources in Linked Data, which are potentially useful for finding semantically similar keywords How to rank resources Follow intuition of state of the art methods Combine tf-idf and confidence of our measure Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 6 / 21
  • 7. Method: Overview Scenario: User wants to find representative Linked Data resources for keyword w Example: “movie” → dbpedia-owl:Film Dataset: Set of triples consisting of a ‘subject’, ‘predicate’ and an ‘object’. Current application: Find classes and properties Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 7 / 21
  • 8. Method: Overview Step 1: For each keyword w, learn an expanded set of keywords Ew and a ranking of Ew to find semantically similar words in the target vocabulary Example: “movie” → “movie”’, “film”, “films” Step 2: Identify concepts through labelling properties “movie” → ∅ “film” → dbpedia-owl:Film “films” → ∅ Step 3: Rank resulting concepts and return concept with highest rank “movie” → dbpedia-owl:Film Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 8 / 21
  • 9. Method: Overview Set of labelling properties to identify concepts rdfs:label foaf:name dc:title skos:prefLabel skos:altLabel fb:type.object.name Table : Labelling properties Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 9 / 21
  • 10. Method: Candidate Identification Step 1: For each keyword w, learn an expanded set of keywords Ew and a ranking of Ew Step 1.1: Which properties are useful in expressing semantic similarity? Get resource r for keyword using labelling properties “movie” → fb:Movie Get all triples where r is a subject fb:Movie wn:containsWordSense dbpedia-owl:show fb:Movie dc:description ‘‘movie director’’ fb:Movie commontag:label ‘‘Film’’ For each of the objects of the triples, find the labels and tokenise them “movie director” → “movie”, “director” Find resources for them in the target vocabulary “show” → dbpedia-owl:show “Film” → dbpedia-owl:Film Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 10 / 21
  • 11. Method: Training Step 1.2: How to learn a ranking for these properties? Select a list of keywords, run step 1.1 for each of them Manually chose the best resources among the candidates “show”, “Film” → “Film” Produce a precision measure for every property used to find candidates prec(p) = w∈ −−−−→ Wtrain hits(p, w) w∈ −−−−→ Wtrain candidates(p, w) Define treshold 0 ≤ θ ≤ 1, use prec to define an ordered (ranked) subset of properties to encode semantic relatedness Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 11 / 21
  • 12. Method: Test Test set of keywords −−−−→ Wtest , apply algorithm to obtain candidates Only take subset of properties instead of all properties to find candidates Combine, as a numerical product, the precision of the property, p, used to find that candidate and a tf-idf score for r Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 12 / 21
  • 13. Evaluation: Gold standard and Metric Used DBpedia ontology as the target vocabulary and Sindice cache as dataset for computing expanded set of keywords Ew Gold standard from Freitas et al. [2], contains 178 keywords of which 134 have a representation in the DBpedia ontology Corrected minor errors in gold-standard and re-evaluated approach by Freitas et al. [2] Actual figures don’t change significantly Use Mean Reciprocal Rank (MRR) which measures the quality of the ranking by calculating the inverse rank of the best result Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 13 / 21
  • 14. Evaluation: Approach Manually create training set consisting of 40 keywords for supervised training phase Result of training phase: 194 candidate properties Use precision threshold of of 0.045 to cut off the candidate property list. This resulted in 23 properties. Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 14 / 21
  • 15. Evaluation: Result of Training foaf:name 0.267 owl:sameAs 0.121 fb-common:topic 0.189 wn20schema:gloss 0.1 rdfs:label 0.187 opencyc:seeAlsoURI 0.093 dc:subject 0.182 rdfs:seeAlso 0.082 dc:title 0.169 dbpedia-owl:abstract 0.0774 sindice:label 0.168 rdfs:comment 0.0676 rdfs:suBClassOf 0.168 rdfs:range 0.0667 skos:prefLabel 0.157 rdfs:subClassOf 0.0656 fb-type:object 0.143 fb:documented object 0.0619 wn20schema:derivationallyRelated 0.143 dbpedia-owl:wikiPageWikiLink 0.0487 wn20schema:containsWordSense 0.138 dc:description 0.0471 commontag:label 0.133 Table : Top 23 properties used and their precision Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 15 / 21
  • 16. Evaluation: Example results of Test Query: [spacecraft] Query: [engine] Query: [factory] dbpedia-owl:Spacecraft dbpedia-owl:engine dbpedia-owl:manufacturer dbpedia-owl:spacecraft dbpedia-owl:gameEngine dbpedia-owl:plant dbpedia-owl:satellite dbpedia-owl:Artwork dbpedia-owl:Canal dbpedia-owl:missions dbpedia-owl:AutomobileEngine dbpedia-owl:engine dbpedia-owl:launches dbpedia-owl:Locomotive dbpedia-owl:class dbpedia-owl:closed dbpedia-owl:fuel dbpedia-owl:Album dbpedia-owl:vehicle dbpedia-owl:added dbpedia-owl:product dbpedia-owl:Rocket dbpedia-owl:Musical dbpedia-owl:assembly Query: [bass] Query: [wife] Query: [honda] dbpedia-owl:Fish dbpedia-owl:spouse dbpedia-owl:manufacturer dbpedia-owl:Instrument dbpedia-owl:Criminal dbpedia-owl:discovered dbpedia-owl:instrument dbpedia-owl:person dbpedia-owl:Asteroid dbpedia-owl:voice dbpedia-owl:status dbpedia-owl:engine dbpedia-owl:partner dbpedia-owl:education dbpedia-owl:season dbpedia-owl:note dbpedia-owl:Language dbpedia-owl:vehicle dbpedia-owl:Musical dbpedia-owl:sex dbpedia-owl:Automobile dbpedia-owl:lowest dbpedia-owl:family dbpedia-owl:participant Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 16 / 21
  • 17. Evaluation: Results of Test Model MRR ESA 0.6 LOD Keyword Expansion 0.77 Table : Mean reciprocal rank (MRR) Model Recall Strich match 0.45 String match + WordNet 0.52 ESA 0.87 LOD Keyword Expansion 0.90 Table : Percentage queries answered (Recall) Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 17 / 21
  • 18. Conclusion Method for automatic query expansion for Linked Data resources based on using properties between resources within the Linked Open Data cloud “film”, “flim”, “films”, “movie” → dbpedia-owl:Film Evaluation showed how useful these different properties are for finding semantic similarities and thereby finding expanded keywords Improvement of 17% in MRR over state of the art Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 18 / 21
  • 19. Future Work Related work ([6]) shows that best results can be achieved by following a multi-strategy approach (String similarity + WordNet + ESA), which we could also integrate our approach in Adjust labelling properties (add the ones discovered in training) Treat labelling properties seperately from other properties Fine-tune tokenisation of literals Perform in vivo evaluation of the task, e.g. in the context of question answering Reevaluate for instances Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 19 / 21
  • 20. Bibliography I Cheng, G., Ge, W., Qu, Y.: Falcons: searching and browsing entities on the semantic web. In: Proceedings of the 17th international conference on World Wide Web. pp. 1101–1102. ACM (2008) Freitas, A., Curry, E., Oliveira, J.G., O’Riain, S.: A distributional structured semantic space for querying rdf graph data. International Journal of Semantic Computing 5(04), 433–462 (2011) Lopez, V., Nikolov, A., Fernandez, M., Sabou, M., Uren, V., Motta, E.: Merging and ranking answers in the semantic web: The wisdom of crowds. The semantic web pp. 135–152 (2009) Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 20 / 21
  • 21. Bibliography II Oren, E., Delbru, R., Catasta, M., Cyganiak, R., Stenzhorn, H., Tummarello, G.: Sindice. com: a document-oriented lookup index for open linked data. International Journal of Metadata, Semantics and Ontologies 3(1), 37–52 (2008) Tran, T., Wang, H., Haase, P.: Searchwebdb: Data web search on a pay-as-you-go integration infrastructure. Tech. rep., Technical report, University of Karlsruhe (2008) Walter, S., Unger, C., Cimiano, P., B¨ar, D.: Evaluation of a layered approach to question answering over linked data. In: The Semantic Web–ISWC 2012, pp. 362–374. Springer (2012) Augenstein, Gentile, Norton, Zhang, Ciravegna Query Expansion May 26, 2013 21 / 21