SlideShare a Scribd company logo
1 of 27
Download to read offline
Using	
  SKOS	
  Vocabularies	
  for	
  
Improving	
  Web	
  Search	
  
Web	
  of	
  Linked	
  En>>es	
  Workshop	
  (WoLE	
  2013)	
  
WWW	
  2013,	
  Rio	
  de	
  Janeiro,	
  May	
  13th	
  2013	
  
	
  
Bernhard	
  Haslhofer	
  |	
  University	
  of	
  Vienna	
  
Flávio	
  Mar>ns	
  |	
  Universidade	
  Nova	
  de	
  Lisboa	
  
João	
  Magalhães	
  |	
  Universidade	
  Nova	
  de	
  Lisboa	
  
Overview	
  
•  A	
  brief	
  intro	
  to	
  SKOS	
  
•  SKOS-­‐based	
  term	
  expansion	
  
•  Lucene-­‐SKOS	
  
•  Evalua>on	
  
•  Conclusions	
  
2	
  
What	
  is	
  SKOS?	
  
•  A	
  language	
  for	
  describing	
  Web	
  vocabularies	
  
(taxonomies,	
  classifica>on	
  schemes,	
  thesauri)	
  
•  Builds	
  on	
  Linked	
  Data	
  principles	
  
– Concepts	
  have	
  URIs	
  	
  
– Concepts	
  are	
  interlinked	
  
– Vocabularies	
  are	
  expressed	
  in	
  RDF	
  
3	
  
Who	
  is	
  using	
  SKOS?	
  
h"p://www.w3.org/2001/sw/wiki/SKOS/Datasets	
  4	
  
SKOS	
  Example	
  -­‐	
  UKAT	
  
"Weapons"
skos:prefLabel
skos:Concept
ukat:859
"Military Equipment"
skos:prefLabel
skos:Concept
ukat:5060
skos:broaderskos:narrower
skos:broader
skos:narrower
"Ordnance"
skos:altLabel
"Armaments"skos:altLabel
"Arms"
skos:altLabel
ukat: http://www.ukat.org.uk/thesaurus/concept/
skos: http://www.w3.org/2004/02/skos/core# 5	
  
Overview	
  
•  A	
  brief	
  intro	
  to	
  SKOS	
  
•  SKOS-­‐based	
  term	
  expansion	
  
•  Lucene-­‐SKOS	
  
•  Evalua>on	
  
•  Conclusions	
  
6	
  
The	
  big	
  picture	
  
Retrieval Model
Query
Documents
Analysis
Analysis
Query
Representation
Document
Representation
Scoring Results
SKOS-­‐based	
  term	
  expansion	
  
7	
  
Label-­‐based	
  (Query)	
  Expansion	
  
•  Query:	
  roman	
  arms	
  
•  Document:	
  
Title	
   Spearhead	
  
Descrip>on	
   Roman	
  iron	
  spearhead.	
  The	
  spearhead	
  was	
  
abached	
  to	
  one	
  end	
  of	
  a	
  wooden	
  shac.	
  .	
  .	
  
Subject	
   Weapons	
  
8	
  
Label-­‐based	
  (Query)	
  Expansion	
  
roman arms
skos:prefLabel
weapons
skos:altLabel
armaments
skos:broader
ordnance
skos:broader
military equipment
Title	
   Spearhead	
  
Descrip>on	
   Roman	
  iron	
  spearhead.	
  The	
  spearhead	
  was	
  abached	
  
to	
  one	
  end	
  of	
  a	
  wooden	
  shac.	
  .	
  .	
  
Subject	
   Weapons	
  
matches	
  
9	
  
URI-­‐based	
  Expansion	
  
•  Query:	
  roman	
  arms	
  
•  Document:	
  
Title	
   Spearhead	
  
Descrip>on	
   Roman	
  iron	
  spearhead.	
  The	
  spearhead	
  was	
  
abached	
  to	
  one	
  end	
  of	
  a	
  wooden	
  shac.	
  .	
  .	
  
Subject	
   hbp://www.ukat.org.uk/thesaurus/concept/859	
  
10	
  
URI-­‐based	
  Expansion	
  
11	
  
URI-­‐based	
  Expansion	
  
matches	
  
Title	
   Spearhead	
  
Descrip>on	
   Roman	
  iron	
  spearhead.	
  The	
  spearhead	
  was	
  
abached	
  to	
  one	
  end	
  of	
  a	
  wooden	
  shac.	
  .	
  .	
  
Subject	
   hbp://www.ukat.org.uk/thesaurus/concept/859	
  
arms,	
  weapons,	
  armaments,	
  ...	
  
Query:	
  roman	
  arms	
  
12	
  
Scoring	
  
•  Apply	
  regular	
  text	
  retrieval	
  func>ons	
  
•  boostctype:	
  leverages	
  explicit	
  declara>on	
  of	
  
SKOS	
  expansion	
  types	
  in	
  term	
  representa>ons	
  
•  coordq,d:	
  ensures	
  that	
  a	
  document	
  with	
  more	
  
matching	
  terms	
  will	
  score	
  higher	
  
13	
  
Overview	
  
•  A	
  brief	
  intro	
  to	
  SKOS	
  
•  SKOS-­‐based	
  term	
  expansion	
  
•  Implementa>on:	
  Lucene-­‐SKOS	
  
•  Evalua>on	
  
•  Conclusions	
  
14	
  
h"ps://github.com/behas/lucene-­‐skos	
  
15	
  
Overview	
  
•  A	
  brief	
  intro	
  to	
  SKOS	
  
•  SKOS-­‐based	
  term	
  expansion	
  
•  Implementa>on:	
  Lucene-­‐SKOS	
  
•  Evalua>on	
  
•  Conclusions	
  
16	
  
Dataset	
  1	
  
•  OHSUMED:	
  
– 350K	
  Pubmed	
  metadata	
  records	
  from	
  270	
  journals	
  
– Title,	
  author,	
  abstract,	
  …	
  
– 3	
  level	
  relevance	
  judgments	
  for	
  informa>on	
  needs	
  
•  Medical	
  Subject	
  Headings	
  (MeSH)	
  in	
  SKOS	
  
– Maintained	
  by	
  US	
  Na>onal	
  Library	
  of	
  Medicine	
  
– Used	
  to	
  index	
  millions	
  of	
  ar>cles	
  in	
  PubMed	
  
	
  
17	
  
Dataset	
  2	
  
•  8,905	
  MODS	
  metadata	
  records	
  harvested	
  
from	
  the	
  Library	
  of	
  Congress	
  (LoC)	
  
•  10	
  queries	
  from	
  the	
  2009/2010	
  query	
  
collec>on	
  
•  Binary	
  relevance	
  judgments	
  for	
  queries	
  	
  
•  Library	
  of	
  Congress	
  Subject	
  Headings	
  (LCSH)	
  in	
  
SKOS	
  
18	
  
Method	
  
•  Focus	
  on	
  label-­‐based	
  expansion	
  at	
  query	
  >me	
  
•  Normalized	
  queries	
  and	
  SKOS	
  vocabularies	
  
•  Expanded	
  each	
  query	
  term	
  by	
  collec>ng	
  SKOS	
  
concepts	
  that	
  have	
  that	
  term	
  in	
  any	
  of	
  their	
  
labels	
  (prefLabel,	
  altLabel,	
  hiddenLabel)	
  
•  Applied	
  pre-­‐defined	
  boost-­‐weights	
  that	
  
maximized	
  query	
  performance	
  
19	
  
Two	
  Baselines	
  
•  No	
  term	
  expansion	
  (NoExp)	
  
– queries	
  are	
  executed	
  over	
  documents	
  without	
  
SKOS-­‐based	
  term	
  expansion.	
  
•  Pseudo	
  Relevance	
  Feedback	
  (PRF):	
  
– Perform	
  ini>al	
  search	
  with	
  original	
  query	
  
– Collect	
  terms	
  from	
  k	
  retrieved	
  documents	
  
– Resubmit	
  query	
  +	
  collected	
  terms	
  
20	
  
OHSUMED	
  query	
  expansion	
  example	
  
Query:	
  fibromyalgia fibrositis, diagnosis and treatment	
  
Expanded	
  Query:	
  
(fibromyalgia rheumatism muscular^0.5 diffuse myofascial pain syndrome^0.5
fibromyalgia fibromyositis syndrome^0.5 myofascial pain syndrome diffuse^0.5
fibromyositis fibromyalgia syndrome^0.5 fibrositis^0.5)	
  
(fibrositis rheumatism muscular^0.5 diffuse myofascial pain syndrome^0.5
fibromyalgia fibromyositis syndrome^0.5 myofascial pain syndrome diffuse^0.5
fibromyositis fibromyalgia syndrome^0.5 fibromyalgia^0.5)	
  
(diagnosis examinations diagnoses^0.5)	
  
(treatment disease management^0.5 therapy^0.5)	
  
21	
  
Results	
  OHSUMED/MESH	
  
LTC TF-IDF
P@1 P@3 P@10 nDCG@1 nDCG@3 nDCG@10 P
NoExp 0.333 0.302 0.260 0.407 0.376 0.356 0.3
PRF 0.322 0.282 0.302 0.379 0.360 0.393 0.3
SKOS 0.419 0.366 0.276 0.484 0.429 0.379 0.5
Table 1: Precision and nDCG results on
query set provided with each dataset (63 OHSUMED/MESH,
10 LoC/LCSH). We focused on two measures, precision at
rank n (P@n) for both dataset bundles and nDCG at rank n
(nDCG@n) for the OHSUMED/MESH bundle, which pro-
vides ordinal relevance judgments. Two retrieval models
were used for ranking documents: LTC TF-IDF and BM25.
4.1 Dataset Characteristics
only
outco
25 m
witho
apart
PRF
docum
BM25
nDCG@10 P@1 P@3 P@10 nDCG@1 nDCG@3 nDCG@10
0.356 0.381 0.344 0.265 0.450 0.414 0.374
0.393 0.377 0.317 0.275 0.443 0.397 0.369
0.379 0.500 0.366 0.282 0.548 0.435 0.397
CG results on the OHSUMED dataset.
22	
  
Ini>al	
  Results	
  LoC/LCSH	
  
.
f
s
h
.
m
-
e
e
-
e
d
e
.
-
-
f
PRF might lead to better recall i.e., return more relevant
documents, but hurt precison at top ranks. On the other
hand, with SKOS-expansion the terms used in expansion
are always from a SKOS vocabulary and not from the cor-
pus. Therefore, we are already only expanding using rele-
vant terms, since the terms’ existence in the vocabulary is a
good hint that the term is important.
LTC TF-IDF
P@1 P@3 P@10
NoExp 0.500 0.500 0.370
PRF 0.300 0.433 0.430
SKOS 0.600 0.533 0.370
Table 2: Precision with LTC TF-IDF results on the
Library of Congress dataset.
Table 2 shows the results we obtained from early experi-
ments with the Library of Congress Subject Headings. The
results are similar to the results with OHSUMED. The PRF
23	
  
Overview	
  
•  A	
  brief	
  intro	
  to	
  SKOS	
  
•  SKOS-­‐based	
  term	
  expansion	
  
•  Implementa>on:	
  Lucene-­‐SKOS	
  
•  Experiments	
  
•  Conclusions	
  
24	
  
Conclusions	
  
•  Our	
  experiments	
  indicated	
  gains	
  in	
  retrieval	
  
effec>veness	
  compared	
  to	
  no-­‐expansion	
  or	
  
pseudo	
  relevance	
  feedback	
  
•  Our	
  solu>on	
  can	
  easily	
  be	
  adopted	
  by	
  loading	
  
lucene-­‐skos	
  with	
  Apache	
  Lucene	
  and	
  Solr	
  
25	
  
Future	
  Work	
  
•  More	
  thorough	
  evalua>on	
  using	
  LoC/LCSH	
  	
  
•  Use	
  corpora,	
  queries,	
  and	
  vocabularies	
  from	
  
other	
  domains	
  
•  Apply	
  this	
  technique	
  for	
  general	
  concept-­‐
based	
  and	
  URI-­‐iden>fied	
  Web	
  data	
  sources	
  
26	
  
Thanks!	
  
@bhaslhofer,	
  @flaviomar>ns	
  
hbp://slideshare.net/bhaslhofer	
  
	
  
hbps://github.com/behas/lucene-­‐skos	
  
27	
  

More Related Content

Viewers also liked

DH101 2013/2014 course 6 - Semantic coding, RDF, CIDOC-CRM
DH101 2013/2014 course 6 - Semantic coding, RDF, CIDOC-CRMDH101 2013/2014 course 6 - Semantic coding, RDF, CIDOC-CRM
DH101 2013/2014 course 6 - Semantic coding, RDF, CIDOC-CRMFrederic Kaplan
 
DH101 2013/2014 course 2
DH101 2013/2014 course 2DH101 2013/2014 course 2
DH101 2013/2014 course 2Frederic Kaplan
 
GraphSense - Real-time Insight into Virtual Currency Ecosystems
GraphSense - Real-time Insight into Virtual Currency EcosystemsGraphSense - Real-time Insight into Virtual Currency Ecosystems
GraphSense - Real-time Insight into Virtual Currency EcosystemsBernhard Haslhofer
 
CIDOC CRM+FRBRoo: an Integrated View of Museum and Library Information
CIDOC CRM+FRBRoo: an Integrated View of Museum and Library InformationCIDOC CRM+FRBRoo: an Integrated View of Museum and Library Information
CIDOC CRM+FRBRoo: an Integrated View of Museum and Library InformationPatrick Le Boeuf
 
Interopérabilité de l'information bibliographique et muséologique
Interopérabilité de l'information bibliographique et muséologiqueInteropérabilité de l'information bibliographique et muséologique
Interopérabilité de l'information bibliographique et muséologiquePatrick Le Boeuf
 
Types and Annotations for CIDOC CRM Properties - Presentation
Types and Annotations for CIDOC CRM Properties - PresentationTypes and Annotations for CIDOC CRM Properties - Presentation
Types and Annotations for CIDOC CRM Properties - PresentationVladimir Alexiev, PhD, PMP
 
DH101 2013/2014 course 5 - Project on Venice / Datafication / Regulated repre...
DH101 2013/2014 course 5 - Project on Venice / Datafication / Regulated repre...DH101 2013/2014 course 5 - Project on Venice / Datafication / Regulated repre...
DH101 2013/2014 course 5 - Project on Venice / Datafication / Regulated repre...Frederic Kaplan
 
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D Frederic Kaplan
 
DH101 2013/2014 course1 - Presentation of the course / Collaborative writing ...
DH101 2013/2014 course1 - Presentation of the course / Collaborative writing ...DH101 2013/2014 course1 - Presentation of the course / Collaborative writing ...
DH101 2013/2014 course1 - Presentation of the course / Collaborative writing ...Frederic Kaplan
 
DH101 2013/2014 course 9 - Crowdsourcing, crowdfunding, Wikipedia, Open Stree...
DH101 2013/2014 course 9 - Crowdsourcing, crowdfunding, Wikipedia, Open Stree...DH101 2013/2014 course 9 - Crowdsourcing, crowdfunding, Wikipedia, Open Stree...
DH101 2013/2014 course 9 - Crowdsourcing, crowdfunding, Wikipedia, Open Stree...Frederic Kaplan
 
DH101 2013/2014 course 7 - OCR, Printed text recognition, Handwriting recogni...
DH101 2013/2014 course 7 - OCR, Printed text recognition, Handwriting recogni...DH101 2013/2014 course 7 - OCR, Printed text recognition, Handwriting recogni...
DH101 2013/2014 course 7 - OCR, Printed text recognition, Handwriting recogni...Frederic Kaplan
 
Lidar for heritage mapping in India
Lidar for heritage mapping in IndiaLidar for heritage mapping in India
Lidar for heritage mapping in IndiaArchana Joshi
 
Cultural Mapping & Digital Storytelling in a Social Context
Cultural Mapping & Digital Storytelling in a Social ContextCultural Mapping & Digital Storytelling in a Social Context
Cultural Mapping & Digital Storytelling in a Social ContextStefan Kolgen
 
Mapping Cultural Heritage Information to CIDOC-CRM
Mapping Cultural Heritage Information to CIDOC-CRMMapping Cultural Heritage Information to CIDOC-CRM
Mapping Cultural Heritage Information to CIDOC-CRMMaria Theodoridou
 
Achille Felicetti - ARIADNE Semantic Integration of Archaeological Information
Achille Felicetti - ARIADNE Semantic Integration of Archaeological InformationAchille Felicetti - ARIADNE Semantic Integration of Archaeological Information
Achille Felicetti - ARIADNE Semantic Integration of Archaeological Informationariadnenetwork
 
Cultural Mapping Dumaguete City
Cultural Mapping Dumaguete CityCultural Mapping Dumaguete City
Cultural Mapping Dumaguete CityMonte Christo
 
Big Data & Text Mining
Big Data & Text MiningBig Data & Text Mining
Big Data & Text MiningMichel Bruley
 

Viewers also liked (20)

DH101 2013/2014 course 6 - Semantic coding, RDF, CIDOC-CRM
DH101 2013/2014 course 6 - Semantic coding, RDF, CIDOC-CRMDH101 2013/2014 course 6 - Semantic coding, RDF, CIDOC-CRM
DH101 2013/2014 course 6 - Semantic coding, RDF, CIDOC-CRM
 
DH101 2013/2014 course 2
DH101 2013/2014 course 2DH101 2013/2014 course 2
DH101 2013/2014 course 2
 
GraphSense - Real-time Insight into Virtual Currency Ecosystems
GraphSense - Real-time Insight into Virtual Currency EcosystemsGraphSense - Real-time Insight into Virtual Currency Ecosystems
GraphSense - Real-time Insight into Virtual Currency Ecosystems
 
CIDOC CRM+FRBRoo: an Integrated View of Museum and Library Information
CIDOC CRM+FRBRoo: an Integrated View of Museum and Library InformationCIDOC CRM+FRBRoo: an Integrated View of Museum and Library Information
CIDOC CRM+FRBRoo: an Integrated View of Museum and Library Information
 
Interopérabilité de l'information bibliographique et muséologique
Interopérabilité de l'information bibliographique et muséologiqueInteropérabilité de l'information bibliographique et muséologique
Interopérabilité de l'information bibliographique et muséologique
 
Types and Annotations for CIDOC CRM Properties - Presentation
Types and Annotations for CIDOC CRM Properties - PresentationTypes and Annotations for CIDOC CRM Properties - Presentation
Types and Annotations for CIDOC CRM Properties - Presentation
 
DH101 2013/2014 course 5 - Project on Venice / Datafication / Regulated repre...
DH101 2013/2014 course 5 - Project on Venice / Datafication / Regulated repre...DH101 2013/2014 course 5 - Project on Venice / Datafication / Regulated repre...
DH101 2013/2014 course 5 - Project on Venice / Datafication / Regulated repre...
 
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
 
DH101 2013/2014 course1 - Presentation of the course / Collaborative writing ...
DH101 2013/2014 course1 - Presentation of the course / Collaborative writing ...DH101 2013/2014 course1 - Presentation of the course / Collaborative writing ...
DH101 2013/2014 course1 - Presentation of the course / Collaborative writing ...
 
DH101 2013/2014 course 9 - Crowdsourcing, crowdfunding, Wikipedia, Open Stree...
DH101 2013/2014 course 9 - Crowdsourcing, crowdfunding, Wikipedia, Open Stree...DH101 2013/2014 course 9 - Crowdsourcing, crowdfunding, Wikipedia, Open Stree...
DH101 2013/2014 course 9 - Crowdsourcing, crowdfunding, Wikipedia, Open Stree...
 
DH101 2013/2014 course 7 - OCR, Printed text recognition, Handwriting recogni...
DH101 2013/2014 course 7 - OCR, Printed text recognition, Handwriting recogni...DH101 2013/2014 course 7 - OCR, Printed text recognition, Handwriting recogni...
DH101 2013/2014 course 7 - OCR, Printed text recognition, Handwriting recogni...
 
Lidar for heritage mapping in India
Lidar for heritage mapping in IndiaLidar for heritage mapping in India
Lidar for heritage mapping in India
 
Cultural Mapping & Digital Storytelling in a Social Context
Cultural Mapping & Digital Storytelling in a Social ContextCultural Mapping & Digital Storytelling in a Social Context
Cultural Mapping & Digital Storytelling in a Social Context
 
Mapping Cultural Heritage Information to CIDOC-CRM
Mapping Cultural Heritage Information to CIDOC-CRMMapping Cultural Heritage Information to CIDOC-CRM
Mapping Cultural Heritage Information to CIDOC-CRM
 
CIDOC CRM in Practice
CIDOC CRM in PracticeCIDOC CRM in Practice
CIDOC CRM in Practice
 
Achille Felicetti - ARIADNE Semantic Integration of Archaeological Information
Achille Felicetti - ARIADNE Semantic Integration of Archaeological InformationAchille Felicetti - ARIADNE Semantic Integration of Archaeological Information
Achille Felicetti - ARIADNE Semantic Integration of Archaeological Information
 
Cultural Mapping Dumaguete City
Cultural Mapping Dumaguete CityCultural Mapping Dumaguete City
Cultural Mapping Dumaguete City
 
Cultural Mapping
Cultural MappingCultural Mapping
Cultural Mapping
 
Big Data & Text Mining
Big Data & Text MiningBig Data & Text Mining
Big Data & Text Mining
 
Things, not Strings
Things, not StringsThings, not Strings
Things, not Strings
 

Similar to Using SKOS Vocabularies for Improving Web Search

From SKOS over SKOS-XL to Custom Ontologies
From SKOS over SKOS-XL to Custom OntologiesFrom SKOS over SKOS-XL to Custom Ontologies
From SKOS over SKOS-XL to Custom OntologiesSemantic Web Company
 
The Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal RegulationsThe Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal Regulationstbruce
 
The state of KOS in the Linked Data movement
The state of KOS in the Linked Data movementThe state of KOS in the Linked Data movement
The state of KOS in the Linked Data movementMarcia Zeng
 
Harmonization of vocabularies for water data
Harmonization of vocabularies for water dataHarmonization of vocabularies for water data
Harmonization of vocabularies for water dataSimon Cox
 
A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOSHeather Hedden
 
Fairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology viewsFairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology viewsTim Clark
 
KOS evolution in Linked Data
KOS evolution in Linked DataKOS evolution in Linked Data
KOS evolution in Linked DataJoachim Neubert
 
Exploring Direct Concept Search
Exploring Direct Concept SearchExploring Direct Concept Search
Exploring Direct Concept SearchSteve Rowe
 
Exploring Direct Concept Search - Steve Rowe, Lucidworks
Exploring Direct Concept Search - Steve Rowe, LucidworksExploring Direct Concept Search - Steve Rowe, Lucidworks
Exploring Direct Concept Search - Steve Rowe, LucidworksLucidworks
 
SKOS - 2007 Open Forum on Metadata Registries - NYC
SKOS - 2007 Open Forum on Metadata Registries - NYCSKOS - 2007 Open Forum on Metadata Registries - NYC
SKOS - 2007 Open Forum on Metadata Registries - NYCjonphipps
 
Linking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsLinking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsJakob .
 
DataScience Lab 2017_From bag of texts to bag of clusters_Терпиль Евгений / П...
DataScience Lab 2017_From bag of texts to bag of clusters_Терпиль Евгений / П...DataScience Lab 2017_From bag of texts to bag of clusters_Терпиль Евгений / П...
DataScience Lab 2017_From bag of texts to bag of clusters_Терпиль Евгений / П...GeeksLab Odessa
 
Expressive Querying of Semantic Databases with Incremental Query Rewriting
Expressive Querying of Semantic Databases with Incremental Query RewritingExpressive Querying of Semantic Databases with Incremental Query Rewriting
Expressive Querying of Semantic Databases with Incremental Query RewritingAlexandre Riazanov
 
Harmonizing services for LOD vocabularies: a case study
Harmonizing services for LOD vocabularies: a case studyHarmonizing services for LOD vocabularies: a case study
Harmonizing services for LOD vocabularies: a case studyGhislain Atemezing
 
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOSSKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOSMarcia Zeng
 
TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from th...
TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from th...TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from th...
TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from th...JohannWanja
 
Leveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsLeveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsJoachim Neubert
 
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...COST Action TD1210
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
 

Similar to Using SKOS Vocabularies for Improving Web Search (20)

From SKOS over SKOS-XL to Custom Ontologies
From SKOS over SKOS-XL to Custom OntologiesFrom SKOS over SKOS-XL to Custom Ontologies
From SKOS over SKOS-XL to Custom Ontologies
 
The Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal RegulationsThe Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal Regulations
 
The state of KOS in the Linked Data movement
The state of KOS in the Linked Data movementThe state of KOS in the Linked Data movement
The state of KOS in the Linked Data movement
 
Harmonization of vocabularies for water data
Harmonization of vocabularies for water dataHarmonization of vocabularies for water data
Harmonization of vocabularies for water data
 
A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOS
 
Fairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology viewsFairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology views
 
KOS evolution in Linked Data
KOS evolution in Linked DataKOS evolution in Linked Data
KOS evolution in Linked Data
 
Exploring Direct Concept Search
Exploring Direct Concept SearchExploring Direct Concept Search
Exploring Direct Concept Search
 
Exploring Direct Concept Search - Steve Rowe, Lucidworks
Exploring Direct Concept Search - Steve Rowe, LucidworksExploring Direct Concept Search - Steve Rowe, Lucidworks
Exploring Direct Concept Search - Steve Rowe, Lucidworks
 
SKOS - 2007 Open Forum on Metadata Registries - NYC
SKOS - 2007 Open Forum on Metadata Registries - NYCSKOS - 2007 Open Forum on Metadata Registries - NYC
SKOS - 2007 Open Forum on Metadata Registries - NYC
 
Linking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsLinking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization Systems
 
DataScience Lab 2017_From bag of texts to bag of clusters_Терпиль Евгений / П...
DataScience Lab 2017_From bag of texts to bag of clusters_Терпиль Евгений / П...DataScience Lab 2017_From bag of texts to bag of clusters_Терпиль Евгений / П...
DataScience Lab 2017_From bag of texts to bag of clusters_Терпиль Евгений / П...
 
Expressive Querying of Semantic Databases with Incremental Query Rewriting
Expressive Querying of Semantic Databases with Incremental Query RewritingExpressive Querying of Semantic Databases with Incremental Query Rewriting
Expressive Querying of Semantic Databases with Incremental Query Rewriting
 
SPIN and Shapes
SPIN and ShapesSPIN and Shapes
SPIN and Shapes
 
Harmonizing services for LOD vocabularies: a case study
Harmonizing services for LOD vocabularies: a case studyHarmonizing services for LOD vocabularies: a case study
Harmonizing services for LOD vocabularies: a case study
 
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOSSKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
 
TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from th...
TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from th...TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from th...
TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from th...
 
Leveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsLeveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
 
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...
 

More from Bernhard Haslhofer

Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...
Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...
Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...Bernhard Haslhofer
 
Token Systems, Payment Channels, and Corporate Currencies
Token Systems, Payment Channels, and Corporate CurrenciesToken Systems, Payment Channels, and Corporate Currencies
Token Systems, Payment Channels, and Corporate CurrenciesBernhard Haslhofer
 
Can a blockchain solve the trust problem?
Can a blockchain solve the trust problem?Can a blockchain solve the trust problem?
Can a blockchain solve the trust problem?Bernhard Haslhofer
 
Measurements in Cryptocurrency Networks
Measurements in Cryptocurrency NetworksMeasurements in Cryptocurrency Networks
Measurements in Cryptocurrency NetworksBernhard Haslhofer
 
Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur...
 Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur... Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur...
Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur...Bernhard Haslhofer
 
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...Bernhard Haslhofer
 
O Bitcoin Where Art Thou? An Introduction to Cryptocurrency Analytics
O Bitcoin Where Art Thou? An Introduction to Cryptocurrency AnalyticsO Bitcoin Where Art Thou? An Introduction to Cryptocurrency Analytics
O Bitcoin Where Art Thou? An Introduction to Cryptocurrency AnalyticsBernhard Haslhofer
 
Mind the Gap - Data Science Meets Software Engineering
Mind the Gap - Data Science Meets Software EngineeringMind the Gap - Data Science Meets Software Engineering
Mind the Gap - Data Science Meets Software EngineeringBernhard Haslhofer
 
BITCOIN - De-anonymization and Money Laundering Detection Strategies
BITCOIN - De-anonymization and Money Laundering Detection StrategiesBITCOIN - De-anonymization and Money Laundering Detection Strategies
BITCOIN - De-anonymization and Money Laundering Detection StrategiesBernhard Haslhofer
 
Bitcoin - Introduction, Technical Aspects and Ongoing Developments
Bitcoin - Introduction, Technical Aspects and Ongoing DevelopmentsBitcoin - Introduction, Technical Aspects and Ongoing Developments
Bitcoin - Introduction, Technical Aspects and Ongoing DevelopmentsBernhard Haslhofer
 
Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...
Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...
Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...Bernhard Haslhofer
 
The value of open data and the OpenGLAM network
The value of open data and the OpenGLAM networkThe value of open data and the OpenGLAM network
The value of open data and the OpenGLAM networkBernhard Haslhofer
 
Offene Daten im Kulturbereich - Die pragmatische Perspektive
Offene Daten im Kulturbereich - Die pragmatische PerspektiveOffene Daten im Kulturbereich - Die pragmatische Perspektive
Offene Daten im Kulturbereich - Die pragmatische PerspektiveBernhard Haslhofer
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and TechniquesBernhard Haslhofer
 
Semantic Tagging on Historical Maps
Semantic Tagging on Historical MapsSemantic Tagging on Historical Maps
Semantic Tagging on Historical MapsBernhard Haslhofer
 
OpenGLAM Intro @ OKFN.AT Meetup Graz
OpenGLAM Intro @ OKFN.AT Meetup GrazOpenGLAM Intro @ OKFN.AT Meetup Graz
OpenGLAM Intro @ OKFN.AT Meetup GrazBernhard Haslhofer
 
Semantic Tagging for old maps...and other things on the Web
Semantic Tagging for old maps...and other things on the WebSemantic Tagging for old maps...and other things on the Web
Semantic Tagging for old maps...and other things on the WebBernhard Haslhofer
 
ResourceSync: Leveraging Sitemaps for Resource Synchronization
ResourceSync: Leveraging Sitemaps for Resource SynchronizationResourceSync: Leveraging Sitemaps for Resource Synchronization
ResourceSync: Leveraging Sitemaps for Resource SynchronizationBernhard Haslhofer
 

More from Bernhard Haslhofer (20)

Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...
Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...
Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...
 
Token Systems, Payment Channels, and Corporate Currencies
Token Systems, Payment Channels, and Corporate CurrenciesToken Systems, Payment Channels, and Corporate Currencies
Token Systems, Payment Channels, and Corporate Currencies
 
Can a blockchain solve the trust problem?
Can a blockchain solve the trust problem?Can a blockchain solve the trust problem?
Can a blockchain solve the trust problem?
 
Measurements in Cryptocurrency Networks
Measurements in Cryptocurrency NetworksMeasurements in Cryptocurrency Networks
Measurements in Cryptocurrency Networks
 
Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur...
 Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur... Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur...
Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur...
 
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
 
O Bitcoin Where Art Thou? An Introduction to Cryptocurrency Analytics
O Bitcoin Where Art Thou? An Introduction to Cryptocurrency AnalyticsO Bitcoin Where Art Thou? An Introduction to Cryptocurrency Analytics
O Bitcoin Where Art Thou? An Introduction to Cryptocurrency Analytics
 
Mind the Gap - Data Science Meets Software Engineering
Mind the Gap - Data Science Meets Software EngineeringMind the Gap - Data Science Meets Software Engineering
Mind the Gap - Data Science Meets Software Engineering
 
BITCOIN - De-anonymization and Money Laundering Detection Strategies
BITCOIN - De-anonymization and Money Laundering Detection StrategiesBITCOIN - De-anonymization and Money Laundering Detection Strategies
BITCOIN - De-anonymization and Money Laundering Detection Strategies
 
Bitcoin - Introduction, Technical Aspects and Ongoing Developments
Bitcoin - Introduction, Technical Aspects and Ongoing DevelopmentsBitcoin - Introduction, Technical Aspects and Ongoing Developments
Bitcoin - Introduction, Technical Aspects and Ongoing Developments
 
Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...
Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...
Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...
 
The value of open data and the OpenGLAM network
The value of open data and the OpenGLAM networkThe value of open data and the OpenGLAM network
The value of open data and the OpenGLAM network
 
Offene Daten im Kulturbereich - Die pragmatische Perspektive
Offene Daten im Kulturbereich - Die pragmatische PerspektiveOffene Daten im Kulturbereich - Die pragmatische Perspektive
Offene Daten im Kulturbereich - Die pragmatische Perspektive
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
 
Semantic Tagging on Historical Maps
Semantic Tagging on Historical MapsSemantic Tagging on Historical Maps
Semantic Tagging on Historical Maps
 
OpenGLAM Intro @ OKFN.AT Meetup Graz
OpenGLAM Intro @ OKFN.AT Meetup GrazOpenGLAM Intro @ OKFN.AT Meetup Graz
OpenGLAM Intro @ OKFN.AT Meetup Graz
 
Semantic Tagging for old maps...and other things on the Web
Semantic Tagging for old maps...and other things on the WebSemantic Tagging for old maps...and other things on the Web
Semantic Tagging for old maps...and other things on the Web
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
ResourceSync: Leveraging Sitemaps for Resource Synchronization
ResourceSync: Leveraging Sitemaps for Resource SynchronizationResourceSync: Leveraging Sitemaps for Resource Synchronization
ResourceSync: Leveraging Sitemaps for Resource Synchronization
 
Maphub and Annotorious
Maphub and AnnotoriousMaphub and Annotorious
Maphub and Annotorious
 

Recently uploaded

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 

Recently uploaded (20)

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 

Using SKOS Vocabularies for Improving Web Search

  • 1. Using  SKOS  Vocabularies  for   Improving  Web  Search   Web  of  Linked  En>>es  Workshop  (WoLE  2013)   WWW  2013,  Rio  de  Janeiro,  May  13th  2013     Bernhard  Haslhofer  |  University  of  Vienna   Flávio  Mar>ns  |  Universidade  Nova  de  Lisboa   João  Magalhães  |  Universidade  Nova  de  Lisboa  
  • 2. Overview   •  A  brief  intro  to  SKOS   •  SKOS-­‐based  term  expansion   •  Lucene-­‐SKOS   •  Evalua>on   •  Conclusions   2  
  • 3. What  is  SKOS?   •  A  language  for  describing  Web  vocabularies   (taxonomies,  classifica>on  schemes,  thesauri)   •  Builds  on  Linked  Data  principles   – Concepts  have  URIs     – Concepts  are  interlinked   – Vocabularies  are  expressed  in  RDF   3  
  • 4. Who  is  using  SKOS?   h"p://www.w3.org/2001/sw/wiki/SKOS/Datasets  4  
  • 5. SKOS  Example  -­‐  UKAT   "Weapons" skos:prefLabel skos:Concept ukat:859 "Military Equipment" skos:prefLabel skos:Concept ukat:5060 skos:broaderskos:narrower skos:broader skos:narrower "Ordnance" skos:altLabel "Armaments"skos:altLabel "Arms" skos:altLabel ukat: http://www.ukat.org.uk/thesaurus/concept/ skos: http://www.w3.org/2004/02/skos/core# 5  
  • 6. Overview   •  A  brief  intro  to  SKOS   •  SKOS-­‐based  term  expansion   •  Lucene-­‐SKOS   •  Evalua>on   •  Conclusions   6  
  • 7. The  big  picture   Retrieval Model Query Documents Analysis Analysis Query Representation Document Representation Scoring Results SKOS-­‐based  term  expansion   7  
  • 8. Label-­‐based  (Query)  Expansion   •  Query:  roman  arms   •  Document:   Title   Spearhead   Descrip>on   Roman  iron  spearhead.  The  spearhead  was   abached  to  one  end  of  a  wooden  shac.  .  .   Subject   Weapons   8  
  • 9. Label-­‐based  (Query)  Expansion   roman arms skos:prefLabel weapons skos:altLabel armaments skos:broader ordnance skos:broader military equipment Title   Spearhead   Descrip>on   Roman  iron  spearhead.  The  spearhead  was  abached   to  one  end  of  a  wooden  shac.  .  .   Subject   Weapons   matches   9  
  • 10. URI-­‐based  Expansion   •  Query:  roman  arms   •  Document:   Title   Spearhead   Descrip>on   Roman  iron  spearhead.  The  spearhead  was   abached  to  one  end  of  a  wooden  shac.  .  .   Subject   hbp://www.ukat.org.uk/thesaurus/concept/859   10  
  • 12. URI-­‐based  Expansion   matches   Title   Spearhead   Descrip>on   Roman  iron  spearhead.  The  spearhead  was   abached  to  one  end  of  a  wooden  shac.  .  .   Subject   hbp://www.ukat.org.uk/thesaurus/concept/859   arms,  weapons,  armaments,  ...   Query:  roman  arms   12  
  • 13. Scoring   •  Apply  regular  text  retrieval  func>ons   •  boostctype:  leverages  explicit  declara>on  of   SKOS  expansion  types  in  term  representa>ons   •  coordq,d:  ensures  that  a  document  with  more   matching  terms  will  score  higher   13  
  • 14. Overview   •  A  brief  intro  to  SKOS   •  SKOS-­‐based  term  expansion   •  Implementa>on:  Lucene-­‐SKOS   •  Evalua>on   •  Conclusions   14  
  • 16. Overview   •  A  brief  intro  to  SKOS   •  SKOS-­‐based  term  expansion   •  Implementa>on:  Lucene-­‐SKOS   •  Evalua>on   •  Conclusions   16  
  • 17. Dataset  1   •  OHSUMED:   – 350K  Pubmed  metadata  records  from  270  journals   – Title,  author,  abstract,  …   – 3  level  relevance  judgments  for  informa>on  needs   •  Medical  Subject  Headings  (MeSH)  in  SKOS   – Maintained  by  US  Na>onal  Library  of  Medicine   – Used  to  index  millions  of  ar>cles  in  PubMed     17  
  • 18. Dataset  2   •  8,905  MODS  metadata  records  harvested   from  the  Library  of  Congress  (LoC)   •  10  queries  from  the  2009/2010  query   collec>on   •  Binary  relevance  judgments  for  queries     •  Library  of  Congress  Subject  Headings  (LCSH)  in   SKOS   18  
  • 19. Method   •  Focus  on  label-­‐based  expansion  at  query  >me   •  Normalized  queries  and  SKOS  vocabularies   •  Expanded  each  query  term  by  collec>ng  SKOS   concepts  that  have  that  term  in  any  of  their   labels  (prefLabel,  altLabel,  hiddenLabel)   •  Applied  pre-­‐defined  boost-­‐weights  that   maximized  query  performance   19  
  • 20. Two  Baselines   •  No  term  expansion  (NoExp)   – queries  are  executed  over  documents  without   SKOS-­‐based  term  expansion.   •  Pseudo  Relevance  Feedback  (PRF):   – Perform  ini>al  search  with  original  query   – Collect  terms  from  k  retrieved  documents   – Resubmit  query  +  collected  terms   20  
  • 21. OHSUMED  query  expansion  example   Query:  fibromyalgia fibrositis, diagnosis and treatment   Expanded  Query:   (fibromyalgia rheumatism muscular^0.5 diffuse myofascial pain syndrome^0.5 fibromyalgia fibromyositis syndrome^0.5 myofascial pain syndrome diffuse^0.5 fibromyositis fibromyalgia syndrome^0.5 fibrositis^0.5)   (fibrositis rheumatism muscular^0.5 diffuse myofascial pain syndrome^0.5 fibromyalgia fibromyositis syndrome^0.5 myofascial pain syndrome diffuse^0.5 fibromyositis fibromyalgia syndrome^0.5 fibromyalgia^0.5)   (diagnosis examinations diagnoses^0.5)   (treatment disease management^0.5 therapy^0.5)   21  
  • 22. Results  OHSUMED/MESH   LTC TF-IDF P@1 P@3 P@10 nDCG@1 nDCG@3 nDCG@10 P NoExp 0.333 0.302 0.260 0.407 0.376 0.356 0.3 PRF 0.322 0.282 0.302 0.379 0.360 0.393 0.3 SKOS 0.419 0.366 0.276 0.484 0.429 0.379 0.5 Table 1: Precision and nDCG results on query set provided with each dataset (63 OHSUMED/MESH, 10 LoC/LCSH). We focused on two measures, precision at rank n (P@n) for both dataset bundles and nDCG at rank n (nDCG@n) for the OHSUMED/MESH bundle, which pro- vides ordinal relevance judgments. Two retrieval models were used for ranking documents: LTC TF-IDF and BM25. 4.1 Dataset Characteristics only outco 25 m witho apart PRF docum BM25 nDCG@10 P@1 P@3 P@10 nDCG@1 nDCG@3 nDCG@10 0.356 0.381 0.344 0.265 0.450 0.414 0.374 0.393 0.377 0.317 0.275 0.443 0.397 0.369 0.379 0.500 0.366 0.282 0.548 0.435 0.397 CG results on the OHSUMED dataset. 22  
  • 23. Ini>al  Results  LoC/LCSH   . f s h . m - e e - e d e . - - f PRF might lead to better recall i.e., return more relevant documents, but hurt precison at top ranks. On the other hand, with SKOS-expansion the terms used in expansion are always from a SKOS vocabulary and not from the cor- pus. Therefore, we are already only expanding using rele- vant terms, since the terms’ existence in the vocabulary is a good hint that the term is important. LTC TF-IDF P@1 P@3 P@10 NoExp 0.500 0.500 0.370 PRF 0.300 0.433 0.430 SKOS 0.600 0.533 0.370 Table 2: Precision with LTC TF-IDF results on the Library of Congress dataset. Table 2 shows the results we obtained from early experi- ments with the Library of Congress Subject Headings. The results are similar to the results with OHSUMED. The PRF 23  
  • 24. Overview   •  A  brief  intro  to  SKOS   •  SKOS-­‐based  term  expansion   •  Implementa>on:  Lucene-­‐SKOS   •  Experiments   •  Conclusions   24  
  • 25. Conclusions   •  Our  experiments  indicated  gains  in  retrieval   effec>veness  compared  to  no-­‐expansion  or   pseudo  relevance  feedback   •  Our  solu>on  can  easily  be  adopted  by  loading   lucene-­‐skos  with  Apache  Lucene  and  Solr   25  
  • 26. Future  Work   •  More  thorough  evalua>on  using  LoC/LCSH     •  Use  corpora,  queries,  and  vocabularies  from   other  domains   •  Apply  this  technique  for  general  concept-­‐ based  and  URI-­‐iden>fied  Web  data  sources   26  
  • 27. Thanks!   @bhaslhofer,  @flaviomar>ns   hbp://slideshare.net/bhaslhofer     hbps://github.com/behas/lucene-­‐skos   27