Keith Alexander (Talis), Richard Cyganiak (DERI),
         Michael Hausenblas (DERI) and Jun Zhao (University of Oxford)


voiD talk at LDOW09


K. Alexander, R. Cyganiak, M. Hausenblas, J. Zhao. Describing Linked Datasets - On the Design and Usage of voiD, the 'Vocabulary of Interlinked Datasets'. Linked Data on the Web Workshop (LDOW 09) at WWW09. Apr 2009.
See http://events.linkeddata.org/ldow2009/

voiD talk at LDOW09

  1. Describing Linked Datasets – On the Design and Usage of voiD, the ‘Vocabulary Of Interlinked Datasets’
       Keith Alexander (Talis), Richard Cyganiak (DERI), Michael Hausenblas (DERI) and Jun Zhao (University of Oxford)
       Linked Data Workshop at WWW09, 2009-04-20, Madrid, Spain
  2. Agenda
       • The Problem
       • Our Proposal – voiD
       • Applications
       • Next Steps
  3. The Problem: 2007, 2008
  4. The Problem: 2008, 2009
  5. The Problem
       • The Linking Open Data (LOD) cloud is currently gathering roughly the same momentum as the Web in the early 1990s
       • How did people deal with the consequences of having a decentralized system back then?
  6. The Problem
  7. The Problem
       • From 2007 on, we have been doing it Yahoo!-catalog-style: manually collecting and representing data about the Linking Open Data cloud:
            – In the LOD cloud diagram, we give a qualitative view in the form of a visual graph
            – In various ESW Wiki pages, we create HTML tables:
                 • http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/Statistics
                 • http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/LinkStatistics
  8. The Problem
       http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/LinkStatistics
       http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/Statistics
  9. The Problem
       • Currently, only human-comprehensible descriptions (the LOD cloud, Wiki pages) are available
       • We can’t automate tasks such as
            – Efficient & effective search
            – Selection of datasets (for apps, interlinking targets)
            – Generation of maps, etc.
  10. The Problem
       • We can’t apply the tools and methods we have experience with, such as editors, engines, stores, etc.
       • Even worse, it doesn’t scale
            – We’d need a Google-style approach that scales like hell and is powerful enough to enable the tasks mentioned above
            – Providing metadata about the LOD cloud in a machine-comprehensible way
  11. Agenda
       The Problem
       • Our Proposal – voiD
       • Applications
       • Next Steps
  12. Our Proposal - voiD
       • Solution: providing a formal description of
            – What a dataset is about (topic, technical details)
            – How and under which conditions to access it
            – How the dataset is interlinked with other datasets
                 • Qualitative level: type of interlinking
                 • Quantitative level: number of links, resources, etc.
            – How to discover the metadata
       • voiD, the “Vocabulary of Interlinked Datasets”, provides precisely this
  13. Our Proposal - voiD
       • A dataset is a set of RDF triples that are published, maintained or aggregated by a single provider.
       • A dataset is authoritative with respect to a certain URI namespace if it contains information about resources named by URIs in this namespace, and is published by the URI owner (URI ownership as defined in the AWWW)
  14. Our Proposal - voiD
       • A linkset LS is a set of RDF triples where, for all triples t_i = ⟨s_i, p_i, o_i⟩ ∈ LS, the subject is in one dataset, i.e. all s_i are described in DS_1, and the object is in another dataset, i.e. all o_i are described in DS_2.
  15. Our Proposal - voiD
  16. Our Proposal - voiD
            3rd-party, non-directed      3rd-party, directed
            classic LOD, non-directed    classic LOD, directed
       voiD offers two orthogonal interlinking types:
       • classic LOD vs. 3rd-party, differing in where the interlinking statements are kept. In the first case the interlinking triples, i.e. the linkset, are hosted in one of the two involved datasets, while in the latter case a third dataset contains the interlinking triples, i.e. the linkset;
       • non-directed vs. directed, depending on whether one wants to state the direction of the interlinking or not (for example with owl:sameAs)
  17. Our Proposal - voiD: classic LOD, non-directed
  18. Our Proposal - voiD: classic LOD, directed
  19. Our Proposal - voiD: 3rd-party, non-directed
  20. Our Proposal - voiD: 3rd-party, directed
  21. Our Proposal - voiD
       • Reusing terms from other vocabularies
            – foaf:homepage/IFP
            – dcterms:subject along with DBpedia URIs http://dbpedia.org/resource/XXX
            – SCOVO for statistics about triples, links, etc.
  22. Our Proposal - voiD
       • Publication & discovery via sitemaps and/or backlinks (dcterms:isPartOf)
  23. Our Proposal - voiD
       • Once dataset providers have published their voiD description in RDF along with their dataset, one can address the following issues:
            – How to find some datasets?
            – How to efficiently find a specific dataset?
            – How to effectively find datasets?
            – How to dynamically select datasets?
            – How to select datasets based on certain preferences?
  24. Agenda
       The Problem
       Our Proposal – voiD
       • Applications
       • Next Steps
  25. Applications
       • Generation (ve, liftSSM, NX parser)
       • Vocabulary Management (Talis)
       • Explorer (RKB, LDE)
       • Query Federation (Clark & Parsia, OpenLink)
       • Dataset ranking (DING! talk)
       • Potential Applications
            – Map of data (Sindice)
            – Dynamic Meshups for Application
  26. Applications
       http://ld2sd.deri.org/ve
  27. Applications
       http://ld2sd.deri.org/lde
  28. Applications
       http://dblp.rkbexplorer.com/models/void.ttl
  29. Applications
       http://linkeddata.uriburner.com/
  30. Agenda
       The Problem
       Our Proposal – voiD
       Applications
       • Next Steps
  31. Next Steps
       • voiD 2.0: see issues at http://code.google.com/p/void-impl/issues/list
       • statistics module (fix/extend re SCOVO)
       • SPARQL endpoints
       • provenance, trust (?)
       • Assist people in publishing voiD
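
The linkset definition on slide 14 and the classic LOD vs. 3rd-party distinction on slide 16 can be illustrated with a small Python sketch. It is not part of the talk and not an implementation of voiD. It models triples as plain string tuples, and it assumes that a resource is "described in" a dataset when it occurs there as a subject. The function names and the example.org URIs are hypothetical.

```python
from typing import Iterable, Set, Tuple

# A triple is modelled as a (subject, predicate, object) tuple of strings.
Triple = Tuple[str, str, str]

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
FOAF_NAME = "http://xmlns.com/foaf/0.1/name"


def described_resources(dataset: Iterable[Triple]) -> Set[str]:
    """Resources 'described in' a dataset.

    Assumption for this sketch: a resource counts as described
    if it appears as the subject of at least one triple.
    """
    return {s for s, _, _ in dataset}


def is_linkset(links: Iterable[Triple], ds1: Iterable[Triple],
               ds2: Iterable[Triple]) -> bool:
    """True if every link has its subject described in ds1 and its object in ds2."""
    subjects = described_resources(ds1)
    objects = described_resources(ds2)
    return all(s in subjects and o in objects for s, _, o in links)


def hosting_type(links: Iterable[Triple], ds1: Iterable[Triple],
                 ds2: Iterable[Triple]) -> str:
    """'classic LOD' if one of the two datasets hosts all link triples, else '3rd-party'."""
    link_set = set(links)
    if link_set <= set(ds1) or link_set <= set(ds2):
        return "classic LOD"
    return "3rd-party"


# Hypothetical example data.
ALICE_A = "http://example.org/a/alice"
ALICE_B = "http://example.org/b/alice"
link = (ALICE_A, OWL_SAME_AS, ALICE_B)
ds1_core = [(ALICE_A, FOAF_NAME, "Alice")]
ds2 = [(ALICE_B, FOAF_NAME, "Alice A.")]

if __name__ == "__main__":
    # The link is hosted inside the first dataset.
    print(is_linkset([link], ds1_core + [link], ds2))
    print(hosting_type([link], ds1_core + [link], ds2))
    # The link is kept outside both datasets.
    print(hosting_type([link], ds1_core, ds2))
```

Because the definition quantifies over all triples in LS, this sketch treats an empty linkset as trivially valid; a real tool would probably want to reject that case.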
