W3C Library Linked Data Incubator Group


Presentation of the Library Linked Data incubator group at W3C: http://www.w3.org/2005/Incubator/lld/

Published in: Technology


  1. 1. W3C Library Linked Data Incubator Group. Antoine Isaac (Vrije Universiteit Amsterdam, Europeana). Talis Open Day: Linked Data and Libraries, London, July 21st 2010
  2. 2. Let’s start with references <ul><li>Landmark Linked Data implementations by library actors </li></ul><ul><li>Swedish National Library’s Libris catalogue and thesaurus: libris.kb.se/ </li></ul><ul><li>Library of Congress’ vocabularies, including LCSH: id.loc.gov/ </li></ul><ul><li>DNB’s Gemeinsame Normdatei: d-nb.info/gnd/ </li></ul><ul><li>BnF’s RAMEAU subject headings: stitch.cs.vu.nl/rameau/ </li></ul><ul><li>OCLC’s DDC classification and VIAF: dewey.info/ and viaf.org/ </li></ul><ul><li>NL of Hungary’s catalogue and thesauri: oszkdk.oszk.hu/resource/DRJ/404 </li></ul><ul><li>Also relevant! </li></ul><ul><li>STW economy thesaurus: zbw.eu/stw </li></ul><ul><li>Social Science thesaurus: lod.gesis.org </li></ul><ul><li>GEMET environmental thesaurus: eionet.europa.eu/gemet </li></ul><ul><li>Agrovoc: aims.fao.org/ </li></ul><ul><li>New York Times subject headings: data.nytimes.com/ </li></ul><ul><li>Scientific publications (among others): dblp.rkbexplorer.com/ </li></ul>
  3. 3. Linked Library Cloud beginning 2008 [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
  4. 4. Linked Library Cloud mid-2010 <ul><li>Plus: </li></ul><ul><li>Germany NL </li></ul><ul><li>Hungary NL </li></ul><ul><li>STW </li></ul><ul><li>GEMET </li></ul><ul><li>NYT </li></ul><ul><li>Agrovoc </li></ul>[Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
  5. 5. Useful tooling <ul><li>Available </li></ul><ul><li>Dublin Core: dublincore.org/ </li></ul><ul><li>SKOS: www.w3.org/2004/02/skos/ </li></ul><ul><li>BIBO: bibliontology.com/ </li></ul><ul><li>OAI-ORE: www.openarchives.org/ore/ </li></ul><ul><li>... </li></ul><ul><li>In progress </li></ul><ul><li>RDA vocabularies: metadataregistry.org/rdabrowse.htm </li></ul><ul><li>[email_address] </li></ul>
  6. 6. <ul><li>2010, Year 1 of library linked data? </li></ul>
  7. 7. Libraries and LD, the perfect match? <ul><li>Libraries have been producing metadata for ages </li></ul><ul><li>Libraries (often) produce high-quality metadata </li></ul>
  8. 8. Libraries and LD, the perfect match? <ul><li>Library metadata is still locked in records </li></ul><ul><li>While it does maintain links to the outside world </li></ul><ul><ul><li>Bibliographic and web references </li></ul></ul><ul><ul><li>Shared vocabularies </li></ul></ul><ul><ul><li>Same books! </li></ul></ul>
  9. 9. Unleash library data!
  10. 10. Johan Stapel, Koninklijke Bibliotheek A dream at the Dutch National Library
  11. 11. <ul><li>But there are obstacles </li></ul>
  12. 12. Emerging best practices? <ul><li>What vocabularies are being used, and is there emerging consensus about which to use? </li></ul><ul><li>What licenses (if any) are associated with the data? </li></ul><ul><li>How much linking and interlinking is going on? </li></ul><ul><li>What sorts of mechanisms does the publisher offer for getting the data: sitemap, feeds, SPARQL, bulk download? </li></ul><ul><li>What is the quality of the data: granularity, link integrity, vocabulary usage? </li></ul><ul><li>What approaches to identifiers for “real world things” have publishers taken: hash, slash, 303, PURLs, reuse of traditional identifiers, etc.? </li></ul><ul><li>What are the relative sizes of the pools of library linked data? </li></ul><ul><li>How are updates being managed? </li></ul>Ed Summers http://inkdroid.org/journal/2010/04/18/research-ideas-for-library-linked-data/
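To illustrate the identifier question on slide 12, hash and slash URIs can be told apart from the URI string alone. The following is a minimal sketch, not a survey tool: the function name `identifier_style` and the `example.org` URIs are assumptions made for illustration. Whether a slash URI answers with a 303 redirect, or is a PURL, cannot be seen from the string and would need an actual HTTP request, which this sketch deliberately leaves out.

```python
from urllib.parse import urlsplit


def identifier_style(uri: str) -> str:
    """Classify a URI as 'hash' (has a fragment) or 'slash' (no fragment).

    Hypothetical helper for illustration only. It looks at the string and
    nothing else, so it cannot detect 303 redirects or PURL resolvers.
    """
    parts = urlsplit(uri)
    return "hash" if parts.fragment else "slash"


if __name__ == "__main__":
    # Placeholder URIs, not real library identifiers.
    for uri in (
        "http://example.org/vocab#concept42",
        "http://example.org/authorities/concept42",
    ):
        print(uri, "->", identifier_style(uri))
```

Run over a list of URIs sampled from a dataset, a helper like this would give a first rough count of which pattern a publisher chose.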
  13. 13. Connecting to more general LD Issues <ul><li>Mike Uschold’s “semantic elephants” </li></ul><ul><li>Proliferation of URIs, Managing Coreference </li></ul><ul><li>Overloading owl:sameAs </li></ul><ul><li>Versioning and URIs </li></ul>http://lists.w3.org/Archives/Public/public-lod/2010May/0012.html
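The coreference and owl:sameAs issues on slide 13 come down to transitivity: once sameAs links are followed, everything reachable ends up in one bundle, so a single loose link merges things that were never meant to be identical. The sketch below groups URIs with a small union-find structure. It is an illustration under stated assumptions: `sameas_bundles` is a hypothetical name, and all URIs are `example.org` placeholders.

```python
from collections import defaultdict


def sameas_bundles(pairs):
    """Group URIs into bundles implied by treating sameAs as transitive.

    `pairs` is an iterable of (uri_a, uri_b) sameAs statements.
    Returns a sorted list of sorted URI lists, one list per bundle.
    """
    parent = {}

    def find(x):
        # Register unseen URIs as their own root, then walk up to the root,
        # shortening the path along the way.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    bundles = defaultdict(set)
    for uri in list(parent):
        bundles[find(uri)].add(uri)
    return sorted((sorted(b) for b in bundles.values()), key=lambda b: b[0])


if __name__ == "__main__":
    # Placeholder data: two person URIs, plus a catalogue *record* that was
    # loosely linked to one of them with sameAs.
    links = [
        ("http://example.org/libA/person/1", "http://example.org/libB/person/9"),
        ("http://example.org/libB/person/9", "http://example.org/libC/record/77"),
        ("http://example.org/libD/place/5", "http://example.org/libE/place/6"),
    ]
    for bundle in sameas_bundles(links):
        print(bundle)
```

In the sample data the record from `libC` lands in the same bundle as both person URIs, which is the kind of unintended merge that "overloading" sameAs produces.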
  14. 14. Also, gospel is needed
  15. 15. What’s this I hear about the Semantic Web? <ul><li>What is the Semantic Web? </li></ul><ul><li>What does it have to do with bibliography? </li></ul><ul><li>Does it make life better for patrons? </li></ul><ul><li>Does it strengthen libraries? </li></ul><ul><li>Is it practical? </li></ul><ul><li>Where can we get some? </li></ul>http://www.slideshare.net/stuartweibel/semantic-web-technologies-changing-bibliographic-descriptions
  16. 16. Determine use cases & business models <ul><li>Libraries may just publish data, but they can do more </li></ul><ul><ul><li>Connect library data to other data </li></ul></ul><ul><ul><li>Integrate data from external sources in library systems </li></ul></ul><ul><ul><li>Crowdsourcing? </li></ul></ul><ul><li>Potential data consumers deserve some help, too </li></ul>
  17. 17. Linking strategy <ul><li>Links to library-originated sources </li></ul><ul><ul><li>VIAF, LCSH, DDC, UDC </li></ul></ul><ul><ul><li>RDA vocabularies </li></ul></ul><ul><ul><li>Worldcat, TEL </li></ul></ul><ul><li>Links to resources from the “natural environment” </li></ul><ul><ul><li>Museums, archives </li></ul></ul><ul><ul><li>Scientific communities: bibliographic data & research data </li></ul></ul><ul><ul><li>Publishers </li></ul></ul><ul><ul><li>Europeana and other aggregators </li></ul></ul>
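The linking strategy on slide 17 amounts to publishing statements whose objects are URIs in other datasets. As a rough sketch, the snippet below serializes a few such links as N-Triples using plain string formatting. The predicates are the standard Dublin Core Terms and OWL URIs; every subject and object URI is an `example.org` placeholder standing in for a real catalogue, VIAF, or LCSH identifier, and `to_ntriples` is a hypothetical helper that only handles URI objects (no literals, no escaping).

```python
DCT_CREATOR = "http://purl.org/dc/terms/creator"
DCT_SUBJECT = "http://purl.org/dc/terms/subject"
OWL_SAMEAS = "http://www.w3.org/2002/07/owl#sameAs"


def to_ntriples(triples):
    """Serialize (subject, predicate, object) URI triples as N-Triples lines."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


if __name__ == "__main__":
    # Placeholder URIs: a local book, a local author entry, and stand-ins
    # for an external name authority and an external subject heading.
    book = "http://example.org/catalogue/book/1"
    author = "http://example.org/catalogue/person/1"
    links = [
        (book, DCT_CREATOR, author),
        (author, OWL_SAMEAS, "http://example.org/external-name-authority/123"),
        (book, DCT_SUBJECT, "http://example.org/external-subject-headings/456"),
    ]
    print(to_ntriples(links))
```

In practice an RDF library would handle literals and escaping; the point here is only that each link is one short statement connecting a local identifier to an external one.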
  18. 18. Need for charting the LLD landscape
  19. 19. W3C incubator (XG) activity <ul><li>Short-lived working groups: 1 year </li></ul><ul><li>Light administrative burden </li></ul><ul><li>Not W3C Recommendations, but “innovative ideas for specifications, guidelines, and applications that are not (or not yet) clear candidates as Web standards” </li></ul><ul><ul><li>Deliverables are reports, not Recommendations, but XGs can trigger further W3C work </li></ul></ul>http://www.w3.org/2005/Incubator/
  20. 20. Example XGs <ul><li>Provenance </li></ul><ul><li>Multimedia semantics </li></ul><ul><li>Social Web </li></ul><ul><li>… </li></ul>
  21. 21. LLD Steps <ul><li>Preparing a charter </li></ul><ul><ul><li>Initial chairs: Tom Baker, Emmanuelle Bermès, Antoine Isaac </li></ul></ul><ul><ul><li>10 W3C initiating members </li></ul></ul><ul><ul><ul><li>Aalto University Helsinki </li></ul></ul></ul><ul><ul><ul><li>DERI Galway </li></ul></ul></ul><ul><ul><ul><li>Competence Centre for Interoperable Metadata (KIM) </li></ul></ul></ul><ul><ul><ul><li>Library of Congress </li></ul></ul></ul><ul><ul><ul><li>Los Alamos National Laboratory </li></ul></ul></ul><ul><ul><ul><li>MIMOS </li></ul></ul></ul><ul><ul><ul><li>OCLC </li></ul></ul></ul><ul><ul><ul><li>Talis </li></ul></ul></ul><ul><ul><ul><li>University of Applied Sciences Potsdam </li></ul></ul></ul><ul><ul><ul><li>Vrije Universiteit Amsterdam </li></ul></ul></ul>
  22. 23. <ul><li>To help increase global interoperability of library data on the Web, by </li></ul><ul><li>bringing together people involved in Semantic Web activities (focusing on Linked Data) in the library community and beyond, </li></ul><ul><li>building on existing initiatives, and </li></ul><ul><li>identifying collaboration tracks for the future. </li></ul>http://www.w3.org/2005/Incubator/lld/charter
  23. 24. Activities <ul><li>Gathering use cases and case studies demonstrating successful implementation of Semantic Web technologies in libraries and related sectors </li></ul><ul><li>Fostering collaboration among actors (libraries, museums, archives, publishers) interested in porting cultural assets to the Linked Data Web </li></ul><ul><li>Identifying relevant data models, vocabularies and ontologies and ways to build or improve interoperability among them </li></ul><ul><li>Identifying the need for the elaboration of new standards, guidelines & best practices </li></ul><ul><li>Identifying the areas of (Semantic) Web technology that could benefit from the expertise of the communities represented in the Group </li></ul><ul><li>Proposing a relevant scope and organization for work that follows on the initial effort carried out by the Group. </li></ul>http://www.w3.org/2005/Incubator/lld/charter
  24. 25. Planned deliverables <ul><li>Report presenting the landscape of Linked Data development in the library domain and related sectors, including: </li></ul><ul><li>A use-case document that describes a number of real-world use cases, case studies, outreach and dissemination initiatives targeted to the library community and related sectors </li></ul><ul><li>A document that describes relevant technology pieces, including vocabularies and ontologies (e.g., SKOS), with the intended goal to identify extension or interoperability requirements, and help determine what other standards may be needed. </li></ul>http://www.w3.org/2005/Incubator/lld/charter
  25. 26. Charter – leaving scope open <ul><li>The incubator group has been initiated by actors from national libraries, university libraries and research units, library vendor companies and other interested stakeholders. Its scope is, however, not limited to libraries as institutions, but is meant to involve other cultural heritage institutions, partners from the publishing industry, and other relevant domains. </li></ul><ul><li>Potential links with other communities </li></ul><ul><li>W3C eGovernment Interest Group </li></ul><ul><li>EDItEUR </li></ul><ul><li>Semuse </li></ul><ul><li>… </li></ul>http://www.w3.org/2005/Incubator/lld/charter
  26. 27. Charter – leaving scope open <ul><li>The Incubator could contribute feedback and ideas on other W3C areas </li></ul><ul><li>Experience in modeling and publishing data… </li></ul>
  27. 28. LLD Steps <ul><li>Preparing a charter </li></ul><ul><li>Launch XG </li></ul><ul><ul><li>May 21st 2010: http://www.w3.org/News/2010#entry-8803 </li></ul></ul>
  28. 29. LLD Steps <ul><li>Preparing a charter </li></ul><ul><li>Launch XG </li></ul><ul><li>Get participants </li></ul><ul><ul><li>43 participants </li></ul></ul><ul><ul><li>20 W3C member organizations </li></ul></ul><ul><ul><li>10 invited experts </li></ul></ul>
  29. 30. <ul><ul><li>Complete list at http://www.w3.org/2000/09/dbwg/details?group=44833&public=1 </li></ul></ul>
  30. 31. Steps <ul><li>Preparing a charter </li></ul><ul><li>Launch XG </li></ul><ul><li>Get participants </li></ul><ul><li>Start work! </li></ul><ul><ul><li>Use cases and case studies </li></ul></ul><ul><ul><li>Issue list </li></ul></ul>
  31. 32. Case Template <ul><li>Background and Current Practice </li></ul><ul><li>Goal </li></ul><ul><li>Use Case Scenario </li></ul><ul><li>Application of linked data for the given use case </li></ul><ul><li>Problems and Limitations </li></ul><ul><li>Library Linked Data Dimensions / Topics </li></ul><ul><li>Plus other optional references </li></ul>http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Template
  32. 33. First Cases <ul><li>Authority data enrichment </li></ul><ul><ul><li>http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Authority_Data_Enrichment </li></ul></ul><ul><li>Digital preservation </li></ul><ul><ul><li>http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Digital_Preservation </li></ul></ul>
  33. 34. Coming Case Work <ul><li>Contributions from XG participants </li></ul><ul><li>And from the wider community </li></ul><ul><ul><li>Calls for cases will be issued, stay tuned! </li></ul></ul>http://www.w3.org/2005/Incubator/lld/wiki/UseCases
  34. 35. Current LLD Issue List <ul><li>Conceptual Models </li></ul><ul><ul><li>FR family, SKOS, non-bibliographic/authority data </li></ul></ul><ul><li>Applying SemWeb Technology to Library Data </li></ul><ul><ul><li>Handling legacy data, available vocabularies (ontologies) </li></ul></ul><ul><li>Semantic Web/LD “Environmental Issues” </li></ul><ul><ul><li>Identifiers, linking across datasets </li></ul></ul><ul><li>Management and distribution of data </li></ul><ul><ul><li>Hosting, preservation, updates, web architecture </li></ul></ul><ul><li>Community and Management Issues </li></ul><ul><ul><li>Outreach, strategic guidance & business models, licenses </li></ul></ul>http://www.w3.org/2005/Incubator/lld/wiki/Topics
  35. 36. Participate?
  36. 37. Core Incubator work <ul><li>Participation in the LLD Incubator is still open </li></ul><ul><li>Teleconferences and work on deliverables </li></ul><ul><ul><li>http://www.w3.org/2005/Incubator/lld/ </li></ul></ul><ul><li>Everyone can follow our work without participating </li></ul><ul><li>Publicly readable LLD XG wiki </li></ul><ul><ul><li>http://www.w3.org/2005/Incubator/lld/wiki/ </li></ul></ul><ul><li>Publicly readable LLD XG mailing list </li></ul><ul><ul><li>http://lists.w3.org/Archives/Public/public-xg-lld/ </li></ul></ul>
  37. 38. Outside the LLD XG <ul><li>We try to provide spaces for the wider LLD community </li></ul><ul><li>LLD community wiki </li></ul><ul><ul><li>http://www.w3.org/2001/sw/wiki/LLD </li></ul></ul><ul><li>LLD community mailing list </li></ul><ul><ul><li>http://lists.w3.org/Archives/Public/public-lld/ </li></ul></ul><ul><li>Twitter hashtag </li></ul><ul><li>#lldata </li></ul>
  38. 39. Thanks! <ul><li>[email_address] </li></ul>
  39. 40. Pictures <ul><li>http://www.flickr.com/photos/nationalarchives/3048286070/ </li></ul><ul><li>http://www.europeana.eu/portal/record/04031/2D6FEB34557045A39A1D62761DAE00FEAF8B48F0.html </li></ul><ul><li>http://www.europeana.eu/portal/record/03903/8C5C6AEFF6B50DCCEDF6A23A99DD3A2D66AEB2CC.html </li></ul><ul><li>http://www.europeana.eu/portal/record/03903/1C123C986FDEBFCD0E307AFF8969F07F95BFCA49.html </li></ul><ul><li>http://www.europeana.eu/portal/record/03903/78FA3F8B4299B45C25C395345D3D16ED24EA7F4F.html </li></ul><ul><li>http://www.europeana.eu/portal/record/04031/CBF262142EAC88529CAA8F8D8A6969B72F8D3541.html </li></ul><ul><li>http://www.europeana.eu/portal/record/03903/95D8DA53C17F227BD27BCC148F79238FD6E2443E.html </li></ul><ul><li>(Europeana links give access to resources on original sites) </li></ul>