W3C Library Linked Data Incubator Group  Antoine Isaac Vrije Universiteit Amsterdam Europeana Talis Open Day: Linked Data and Libraries, London, July 21st 2010
Let’s start with references Landmark Linked Data implementations by library actors Swedish National Library’s Libris catalogue and thesaurus Library of Congress’ vocabularies, including LCSH DNB’s Gemeinsame Normdatei BnF’s RAMEAU subject headings OCLC’s DDC classification and VIAF NL of Hungary’s catalogue and thesauri Also relevant! STW economy thesaurus Social Science thesaurus GEMET environmental thesaurus Agrovoc New York Times subject headings Scientific publications libris.kb.se/ id.loc.gov/ d-nb.info/gnd/ stitch.cs.vu.nl/rameau/ dewey.info/ viaf.org/ oszkdk.oszk.hu/resource/DRJ/404 zbw.eu/stw lod.gesis.org eionet.europa.eu/gemet aims.fao.org/ data.nytimes.com/ (among others) dblp.rkbexplorer.com/
Linked Library Cloud beginning 2008 [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
Linked Library Cloud mid-2010 Plus: Germany NL Hungary NL STW GEMET NYT Agrovoc [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
Useful tooling Available Dublin Core SKOS BIBO OAI-ORE ... In progress RDA vocabularies [email_address] dublincore.org/ www.w3.org/2004/02/skos/ bibliontology.com/ www.openarchives.org/ore/ metadataregistry.org/rdabrowse.htm
2010, Year 1 of library linked data?
Libraries and LD, the perfect match? Libraries have been producing metadata  for ages Libraries (often) produce  high-quality  metadata
Libraries and LD, the perfect match? Library metadata is still  locked  in records While it does maintain links to the outside world Bibliographic and web references Shared vocabularies Same books!
Unleash library data!
Johan Stapel, Koninklijke Bibliotheek A dream at the Dutch National Library
But there are obstacles
Emerging best practices? What vocabularies are being used, and is there emerging consensus about which to use? What licenses (if any) are associated with the data? How much linking and interlinking is going on? What sorts of mechanisms does the publisher offer for getting the data: sitemap, feeds, SPARQL, bulk download? What is the quality of the data: granularity, link integrity, vocabulary usage. What approaches to identifiers for “real world things” have publishers taken: hash, slash, 303, PURLs, reuse of traditional identifiers, etc. What are the relative sizes of the pools of library linked data? How are updates being managed? Ed Summers http://inkdroid.org/journal/2010/04/18/research-ideas-for-library-linked-data/
Connecting to more general LD Issues Mike Uschold’s “semantic elephants” Proliferation of URIs, Managing Coreference Overloading owl:sameAs Versioning and URIs http://lists.w3.org/Archives/Public/public-lod/2010May/0012.html
Also, gospel is needed
What’s this I hear about the  Semantic Web ? What is the Semantic Web? What does it have to do with bibliography? Does it make life better for patrons? Does it strengthen libraries? Is it practical? Where can we get some? http://www.slideshare.net/stuartweibel/semantic-web-technologies-changing-bibliographic-descriptions
Determine use case & business models Libraries may just publish data, but they can do more Connect library data to other data Integrate data from external sources in library systems Crowdsourcing? Potential data consumers deserve some help, too
Linking strategy Links to library-originated sources VIAF, LCSH, DDC, UDC RDA vocabularies Worldcat, TEL Links to resources from the “natural environment” Museums, archives Scientific communities: bibliographic data & research data Publishers Europeana and other aggregators
Need for charting the LLD landscape
W3C incubator (XG) activity Short-lived working groups: 1 year Light administration burden Not  W3C Recommendations, but “innovative ideas for specifications, guidelines, and applications that are not (or not yet) clear candidates as Web standards” Deliverables are, but XGs can trigger further W3C work http://www.w3.org/2005/Incubator/
Example XGs Provenance Multimedia semantics Social Web …
LLD Steps Preparing a charter Initial chairs: Tom Baker, Emmanuelle Bermès, Antoine Isaac 10 W3C initiating members Aalto University Helsinki DERI Galway Competence Centre for Interoperable Metadata (KIM) Library of Congress Los Alamos National Laboratory MIMOS OCLC Talis University of Applied Sciences Potsdam Vrije Universiteit Amsterdam
 
To  help  increase global interoperability of library data on the Web, by bringing together people  involved in Semantic Web  activities —focusing on Linked Data—in the library community and beyond, building on existing initiatives , and  identifying collaboration tracks for the future . http://www.w3.org/2005/Incubator/lld/charter
Activities Gathering use cases and case studies demonstrating successful implementation of Semantic Web technologies in libraries and related sectors Fostering collaboration among actors (libraries, museums, archives, publishers) interested in porting cultural assets to the Linked Data Web Identifying relevant data models, vocabularies and ontologies and ways to build or improve interoperability among them Identifying the need for the elaboration of new standards, guidelines & best practices Identifying the areas of (Semantic) Web technology that could benefit from the expertise of the communities represented in the Group Proposing a relevant scope and organization for work that follows on the initial effort carried by the Group. http://www.w3.org/2005/Incubator/lld/charter
Planned deliverables Report presenting the landscape of Linked data development in the library domain and related sectors, including:  A  use-case document  that describes a number of real-world use cases, case studies, outreach and dissemination initiatives targeted to the library community and related sectors A  document that describes relevant technology pieces , including vocabularies and ontologies (e.g., SKOS), with the intended goal to identify extension or interoperability requirements, and help determine what other standards may be needed. http://www.w3.org/2005/Incubator/lld/charter
Charter – leaving scope open The incubator group has been initiated by actors from national libraries, university libraries and research units, library vendors companies and other interested stakeholders. Its scope is however  not limited to libraries as institutions , but is meant to involve  other cultural heritage institutions , partners from the  publishing industry , and  other relevant domains .  Potential Links with other communities W3C eGovernment Interest Group EDItEUR Semuse … http://www.w3.org/2005/Incubator/lld/charter
Charter – leaving scope open The Incubator could contribute feedback and ideas re. other W3C area Experience in modeling and publishing data…
LLD Steps Preparing a charter Launch XG  May 21 st  2010: http://www.w3.org/News/2010#entry-8803
LLD Steps Preparing a charter Launch XG Get participants 43 participants 20 W3C member organizations 10 invited experts
Complete list at http://www.w3.org/2000/09/dbwg/details?group=44833&public=1
Steps Preparing a charter Launch XG Get participants Start work! Use cases and case studies Issue list
Case Template Background and Current Practice Goal Use Case Scenario Application of linked data for the given use case Problems and Limitations Library Linked Data Dimensions / Topics Plus other optional references http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Template
First Cases Authority data enrichment http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Authority_Data_Enrichment Digital preservation http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Digital_Preservation
Coming Case Work Contributions from XG participants And from the wider community Calls for cases will be issued, stay tuned! http://www.w3.org/2005/Incubator/lld/wiki/UseCases
Current LLD Issue List Conceptual Models FR family, SKOS, non-bibliographic/authority data Applying SemWeb Technology to Library Data Handling legacy data, available vocabularies (ontologies) Semantic Web/LD “Environmental Issues” Identifiers, linking across datasets Management and distribution of data  Hosting, preservation, updates, web architecture Community and Management Issues Outreach, strategic guidance & business models, licenses http://www.w3.org/2005/Incubator/lld/wiki/Topics
Participate?
Core Incubator work Participation to the LLD Incubator is still open Teleconferences and work on deliverables http://www.w3.org/2005/Incubator/lld/ Everyone can follow our work without participating Publicly readable LLD XG wiki http://www.w3.org/2005/Incubator/lld/wiki/ Publicly readable LLD XG mailing list http://lists.w3.org/Archives/Public/public-xg-lld/
Outside the LLD XG We try to provide spaces to the wider LLD community LLD community wiki http://www.w3.org/2001/sw/wiki/LLD LLD community mailing list http://lists.w3.org/Archives/Public/public-lld/ Twitter hashtag #lldata
Thanks! [email_address]
Pictures http://www.flickr.com/photos/nationalarchives/3048286070/ http://www.europeana.eu/portal/record/04031/2D6FEB34557045A39A1D62761DAE00FEAF8B48F0.html http://www.europeana.eu/portal/record/03903/8C5C6AEFF6B50DCCEDF6A23A99DD3A2D66AEB2CC.html http://www.europeana.eu/portal/record/03903/1C123C986FDEBFCD0E307AFF8969F07F95BFCA49.html http://www.europeana.eu/portal/record/03903/78FA3F8B4299B45C25C395345D3D16ED24EA7F4F.html http://www.europeana.eu/portal/record/04031/CBF262142EAC88529CAA8F8D8A6969B72F8D3541.html http://www.europeana.eu/portal/record/03903/95D8DA53C17F227BD27BCC148F79238FD6E2443E.html (Europeana links give access to resources on original sites)

W3C Library Linked Data Incubator Group

  • 1.
    W3C Library LinkedData Incubator Group Antoine Isaac Vrije Universiteit Amsterdam Europeana Talis Open Day: Linked Data and Libraries, London, July 21st 2010
  • 2.
    Let’s start withreferences Landmark Linked Data implementations by library actors Swedish National Library’s Libris catalogue and thesaurus Library of Congress’ vocabularies, including LCSH DNB’s Gemeinsame Normdatei BnF’s RAMEAU subject headings OCLC’s DDC classification and VIAF NL of Hungary’s catalogue and thesauri Also relevant! STW economy thesaurus Social Science thesaurus GEMET environmental thesaurus Agrovoc New York Times subject headings Scientific publications libris.kb.se/ id.loc.gov/ d-nb.info/gnd/ stitch.cs.vu.nl/rameau/ dewey.info/ viaf.org/ oszkdk.oszk.hu/resource/DRJ/404 zbw.eu/stw lod.gesis.org eionet.europa.eu/gemet aims.fao.org/ data.nytimes.com/ (among others) dblp.rkbexplorer.com/
  • 3.
    Linked Library Cloudbeginning 2008 [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
  • 4.
    Linked Library Cloudmid-2010 Plus: Germany NL Hungary NL STW GEMET NYT Agrovoc [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
  • 5.
    Useful tooling AvailableDublin Core SKOS BIBO OAI-ORE ... In progress RDA vocabularies [email_address] dublincore.org/ www.w3.org/2004/02/skos/ bibliontology.com/ www.openarchives.org/ore/ metadataregistry.org/rdabrowse.htm
  • 6.
    2010, Year 1of library linked data?
  • 7.
    Libraries and LD,the perfect match? Libraries have been producing metadata for ages Libraries (often) produce high-quality metadata
  • 8.
    Libraries and LD,the perfect match? Library metadata is still locked in records While it does maintain links to the outside world Bibliographic and web references Shared vocabularies Same books!
  • 9.
  • 10.
    Johan Stapel, KoninklijkeBibliotheek A dream at the Dutch National Library
  • 11.
    But there areobstacles
  • 12.
    Emerging best practices?What vocabularies are being used, and is there emerging consensus about which to use? What licenses (if any) are associated with the data? How much linking and interlinking is going on? What sorts of mechanisms does the publisher offer for getting the data: sitemap, feeds, SPARQL, bulk download? What is the quality of the data: granularity, link integrity, vocabulary usage. What approaches to identifiers for “real world things” have publishers taken: hash, slash, 303, PURLs, reuse of traditional identifiers, etc. What are the relative sizes of the pools of library linked data? How are updates being managed? Ed Summers http://inkdroid.org/journal/2010/04/18/research-ideas-for-library-linked-data/
  • 13.
    Connecting to moregeneral LD Issues Mike Uschold’s “semantic elephants” Proliferation of URIs, Managing Coreference Overloading owl:sameAs Versioning and URIs http://lists.w3.org/Archives/Public/public-lod/2010May/0012.html
  • 14.
  • 15.
    What’s this Ihear about the Semantic Web ? What is the Semantic Web? What does it have to do with bibliography? Does it make life better for patrons? Does it strengthen libraries? Is it practical? Where can we get some? http://www.slideshare.net/stuartweibel/semantic-web-technologies-changing-bibliographic-descriptions
  • 16.
    Determine use case& business models Libraries may just publish data, but they can do more Connect library data to other data Integrate data from external sources in library systems Crowdsourcing? Potential data consumers deserve some help, too
  • 17.
    Linking strategy Linksto library-originated sources VIAF, LCSH, DDC, UDC RDA vocabularies Worldcat, TEL Links to resources from the “natural environment” Museums, archives Scientific communities: bibliographic data & research data Publishers Europeana and other aggregators
  • 18.
    Need for chartingthe LLD landscape
  • 19.
    W3C incubator (XG)activity Short-lived working groups: 1 year Light administration burden Not W3C Recommendations, but “innovative ideas for specifications, guidelines, and applications that are not (or not yet) clear candidates as Web standards” Deliverables are, but XGs can trigger further W3C work http://www.w3.org/2005/Incubator/
  • 20.
    Example XGs ProvenanceMultimedia semantics Social Web …
  • 21.
    LLD Steps Preparinga charter Initial chairs: Tom Baker, Emmanuelle Bermès, Antoine Isaac 10 W3C initiating members Aalto University Helsinki DERI Galway Competence Centre for Interoperable Metadata (KIM) Library of Congress Los Alamos National Laboratory MIMOS OCLC Talis University of Applied Sciences Potsdam Vrije Universiteit Amsterdam
  • 22.
  • 23.
    To help increase global interoperability of library data on the Web, by bringing together people involved in Semantic Web activities —focusing on Linked Data—in the library community and beyond, building on existing initiatives , and identifying collaboration tracks for the future . http://www.w3.org/2005/Incubator/lld/charter
  • 24.
    Activities Gathering usecases and case studies demonstrating successful implementation of Semantic Web technologies in libraries and related sectors Fostering collaboration among actors (libraries, museums, archives, publishers) interested in porting cultural assets to the Linked Data Web Identifying relevant data models, vocabularies and ontologies and ways to build or improve interoperability among them Identifying the need for the elaboration of new standards, guidelines & best practices Identifying the areas of (Semantic) Web technology that could benefit from the expertise of the communities represented in the Group Proposing a relevant scope and organization for work that follows on the initial effort carried by the Group. http://www.w3.org/2005/Incubator/lld/charter
  • 25.
    Planned deliverables Reportpresenting the landscape of Linked data development in the library domain and related sectors, including: A use-case document that describes a number of real-world use cases, case studies, outreach and dissemination initiatives targeted to the library community and related sectors A document that describes relevant technology pieces , including vocabularies and ontologies (e.g., SKOS), with the intended goal to identify extension or interoperability requirements, and help determine what other standards may be needed. http://www.w3.org/2005/Incubator/lld/charter
  • 26.
    Charter – leavingscope open The incubator group has been initiated by actors from national libraries, university libraries and research units, library vendors companies and other interested stakeholders. Its scope is however not limited to libraries as institutions , but is meant to involve other cultural heritage institutions , partners from the publishing industry , and other relevant domains . Potential Links with other communities W3C eGovernment Interest Group EDItEUR Semuse … http://www.w3.org/2005/Incubator/lld/charter
  • 27.
    Charter – leavingscope open The Incubator could contribute feedback and ideas re. other W3C area Experience in modeling and publishing data…
  • 28.
    LLD Steps Preparinga charter Launch XG May 21 st 2010: http://www.w3.org/News/2010#entry-8803
  • 29.
    LLD Steps Preparinga charter Launch XG Get participants 43 participants 20 W3C member organizations 10 invited experts
  • 30.
    Complete list athttp://www.w3.org/2000/09/dbwg/details?group=44833&public=1
  • 31.
    Steps Preparing acharter Launch XG Get participants Start work! Use cases and case studies Issue list
  • 32.
    Case Template Backgroundand Current Practice Goal Use Case Scenario Application of linked data for the given use case Problems and Limitations Library Linked Data Dimensions / Topics Plus other optional references http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Template
  • 33.
    First Cases Authoritydata enrichment http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Authority_Data_Enrichment Digital preservation http://www.w3.org/2005/Incubator/lld/wiki/Use_Case_Digital_Preservation
  • 34.
    Coming Case WorkContributions from XG participants And from the wider community Calls for cases will be issued, stay tuned! http://www.w3.org/2005/Incubator/lld/wiki/UseCases
  • 35.
    Current LLD IssueList Conceptual Models FR family, SKOS, non-bibliographic/authority data Applying SemWeb Technology to Library Data Handling legacy data, available vocabularies (ontologies) Semantic Web/LD “Environmental Issues” Identifiers, linking across datasets Management and distribution of data Hosting, preservation, updates, web architecture Community and Management Issues Outreach, strategic guidance & business models, licenses http://www.w3.org/2005/Incubator/lld/wiki/Topics
  • 36.
  • 37.
    Core Incubator workParticipation to the LLD Incubator is still open Teleconferences and work on deliverables http://www.w3.org/2005/Incubator/lld/ Everyone can follow our work without participating Publicly readable LLD XG wiki http://www.w3.org/2005/Incubator/lld/wiki/ Publicly readable LLD XG mailing list http://lists.w3.org/Archives/Public/public-xg-lld/
  • 38.
    Outside the LLDXG We try to provide spaces to the wider LLD community LLD community wiki http://www.w3.org/2001/sw/wiki/LLD LLD community mailing list http://lists.w3.org/Archives/Public/public-lld/ Twitter hashtag #lldata
  • 39.
  • 40.
    Pictures http://www.flickr.com/photos/nationalarchives/3048286070/ http://www.europeana.eu/portal/record/04031/2D6FEB34557045A39A1D62761DAE00FEAF8B48F0.htmlhttp://www.europeana.eu/portal/record/03903/8C5C6AEFF6B50DCCEDF6A23A99DD3A2D66AEB2CC.html http://www.europeana.eu/portal/record/03903/1C123C986FDEBFCD0E307AFF8969F07F95BFCA49.html http://www.europeana.eu/portal/record/03903/78FA3F8B4299B45C25C395345D3D16ED24EA7F4F.html http://www.europeana.eu/portal/record/04031/CBF262142EAC88529CAA8F8D8A6969B72F8D3541.html http://www.europeana.eu/portal/record/03903/95D8DA53C17F227BD27BCC148F79238FD6E2443E.html (Europeana links give access to resources on original sites)