Linked data and the LOCAH project ILI2011

Slides for a presentation given at the Internet Librarian International Conference (ILI2011), October 2011

  1. 1. LINKED DATA AND THE LOCAH PROJECT Bethan Ruddock, Library and Archival Services, Mimas, University of @bethanar #ILI2011
  2. 2. LINKED OPEN COPAC & ARCHIVES HUB • JISC-funded project (under JISCexpo - exposing digital content for education and research) • September 2010 – August 2011 • Staff from Mimas, UKOLN, Eduserv • Additional expertise from Talis, OCLC, Library of Congress
  3. 3. PROJECT AIMS • Put archival and bibliographic data at the heart of the Linked Data Web, making new links between diverse content sources, enabling the free and flexible exploration of data and enabling researchers to make new connections between subjects, people, organisations and places to reveal more about our history and society. • Make a collection of resources available on the Web as structured data, in particular linked data, where a case can be made that it would benefit teaching, learning, research, administration and/or knowledge transfer in UK higher education. • Develop a prototype with instructional step-by-step demonstration and documentation to show how the structured content can be used by 3rd party tools and services. • Explore and report on the opportunities and barriers in making content structured and exposed on the Web for discovery and use. Such opportunities and barriers may coalesce around licensing implications, trust, provenance, sustainability and usability.
  4. 4. Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.
  5. 5. THE DATA: COPAC • Merged union catalogue of the holdings of over 60 UK libraries • Over 50 million records • Consolidated records • MODS XML (not MARC) [Figure: a Copac consolidated record created from 5 contributed records. Lines show how contributed records match with one another.]
  6. 6. THE DATA: ARCHIVES HUB • Descriptions of archive collections from over 200 UK repositories • Nearly 25,000 descriptions – collection-level and multi-level • EAD (Encoded Archival Description)
  7. 7. CHALLENGES: VARIANCE • Data from many sources – should adhere to ◦ Standards ◦ AACR2 ◦ ISAD(G) BUT ◦ Differences in implementation
  8. 8. CHALLENGES: DATA • 260 $b: unknown • dct:publisher: unknown • dct:publisher definition: ‘entity responsible for making the resource available’
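The problem on slide 8 is that a placeholder such as "unknown" in MARC 260 $b is not an entity responsible for anything, so copying it into dct:publisher produces a misleading statement. The sketch below shows one possible way to filter such values before mapping. The function name and the list of placeholder strings are assumptions made for illustration; they are not taken from the LOCAH transformation.

```python
# Hypothetical placeholder values; a real list would come from profiling the data.
# "[s.n.]" (sine nomine) is the conventional AACR2 marker for an unnamed publisher.
PLACEHOLDER_PUBLISHERS = {"unknown", "s.n.", "publisher not identified"}


def publisher_literal(value):
    """Return a tidied publisher name, or None if the value is only a placeholder."""
    # Drop surrounding brackets, whitespace and ISBD-style trailing punctuation.
    cleaned = value.strip("[] ,:;\t")
    if not cleaned or cleaned.lower() in PLACEHOLDER_PUBLISHERS:
        return None  # emit no dct:publisher statement at all
    return cleaned


if __name__ == "__main__":
    for raw in ["unknown", "[s.n.],", "Chapman and Hall,"]:
        print(repr(raw), "->", publisher_literal(raw))
```

Returning None, rather than a literal "unknown", leaves the decision about how to represent a missing publisher to the caller.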
  9. 9. CHALLENGES: MULTIPLE SOURCES A ‘match graph’ of a consolidated Copac record
  11. 11. LICENSING • Data comes from contributors ◦ Not ours to redistribute! • Concerns ◦ Provenance ◦ Trust ◦ Control • Consulted ◦ Liaised with contributors and stakeholders
  12. 12. THE TECHY STUFF Specifications required a lot of brainstorming… Image used under a CC licence from
  13. 13. ARCHIVES HUB MODEL [Diagram. Entities: Finding Aid, Repository (Agent), Place, Postcode Unit, EAD Document, Level, Biographical History, Language, Archival Resource, Creation, Temporal Entity, Extent, Agent, Concept, Concept Scheme, Object, Person, Family, Organisation, Place, Book, Birth, Death, Genre, Function. Relationships: in, maintainedBy/maintains, administeredBy/administers, hasPart/partOf, encodedAs/encodes, accessProvidedBy/providesAccessTo, hasBiogHist/isBiogHistFor, topic/page, level, language, at time, origination, product of, associatedWith, extent, inScheme, representedBy, is-a, foaf:focus, participates in.]
  14. 14. COPAC MODEL
  15. 15. Node name: BibliographicResource; MODS field: <modscollection>; Ontology: bibo
      cardinality  property                                URI/literal                      ontology
      0 1          copac:creator          Creator          URI                              dc
      0 m          copac:contributor      Contributor      URI                              copac
      0 1          event:producedIn       Production Date  URI                              event
      0 1          dct:issued             Production Date  URI                              dc
      0 m          pode:publicationPlace  Place            URI                              pode
      0 m          isbd:P1016             Place            URI                              isbd
      0 m          dct:publisher          Publisher        URI                              dc
      0 1          dct:isPartOf           Series           URI                              dc
      1 m          copac:HeldBy           Institution      URI with Institution as subject
      1 1          bibo:type              Type             URI                              bibo
      0 m          dct:subject            Subject          URI                              dc
      0 m          skos:subject           subject          URI                              skos
      0 m          dct:language           Language         URI                              dc
      1 1          hub:encodedAs          mods             URI                              hub
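The cardinality column on slide 15 can be read as a minimum and maximum number of statements per property. As a rough illustration of how such a mapping might be checked, the sketch below transcribes the cardinalities and counts the statements made about one record. It assumes that "m" means "many"; the function name and the sample values are hypothetical and not part of the LOCAH code.

```python
from collections import Counter

# (min, max) per property, transcribed from the slide; None stands for "m" (many).
CARDINALITY = {
    "copac:creator": (0, 1),
    "copac:contributor": (0, None),
    "event:producedIn": (0, 1),
    "dct:issued": (0, 1),
    "pode:publicationPlace": (0, None),
    "isbd:P1016": (0, None),
    "dct:publisher": (0, None),
    "dct:isPartOf": (0, 1),
    "copac:HeldBy": (1, None),
    "bibo:type": (1, 1),
    "dct:subject": (0, None),
    "skos:subject": (0, None),
    "dct:language": (0, None),
    "hub:encodedAs": (1, 1),
}


def cardinality_errors(statements):
    """Check (property, value) pairs for one BibliographicResource against the table."""
    counts = Counter(prop for prop, _ in statements)
    errors = []
    for prop, (low, high) in CARDINALITY.items():
        found = counts[prop]
        if found < low:
            errors.append(f"{prop}: expected at least {low}, found {found}")
        elif high is not None and found > high:
            errors.append(f"{prop}: expected at most {high}, found {found}")
    return errors


if __name__ == "__main__":
    # Hypothetical record: the identifiers are placeholders, not real Copac data.
    record = [
        ("bibo:type", "bibo:Book"),
        ("hub:encodedAs", "mods-record-1"),
        ("copac:creator", "agent-1"),
        ("copac:creator", "agent-2"),
    ]
    for problem in cardinality_errors(record):
        print(problem)
```

For the sample record this reports the second creator and the missing copac:HeldBy statement, the two places where it departs from the table.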
  16. 16.
  17. 17. Visualisation Prototype • Using Timemap – Googlemaps and Simile map/ • Early stages with this • Will give location and ‘extent’ of archive • Will link through to Archives Hub
  18. 18. Linking [Diagram: links between BBC:Cranford, Copac:Cranford, Hub:Gaskell, DBPedia:Gaskell, Hub:Dickens, DBPedia:Dickens, VIAF:Dickens and Geonames:Manchester]
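Links such as Hub:Gaskell to DBPedia:Gaskell on slide 18 are usually published as RDF statements that two URIs identify the same thing. The sketch below writes such links as N-Triples using owl:sameAs. The choice of owl:sameAs and the example.org Hub URIs are assumptions made for illustration; the slides do not state which predicate or URI patterns the project used.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

# Left: hypothetical Archives Hub URIs (placeholders). Right: DBpedia resources.
LINKS = [
    ("http://example.org/hub/person/gaskell",
     "http://dbpedia.org/resource/Elizabeth_Gaskell"),
    ("http://example.org/hub/person/dickens",
     "http://dbpedia.org/resource/Charles_Dickens"),
]


def ntriple(subject, predicate, obj):
    """Format one triple of three URIs as a line of N-Triples."""
    return f"<{subject}> <{predicate}> <{obj}> ."


def same_as_lines(pairs):
    """Return one owl:sameAs N-Triples line per (local URI, external URI) pair."""
    return [ntriple(local, OWL_SAME_AS, external) for local, external in pairs]


if __name__ == "__main__":
    print("\n".join(same_as_lines(LINKS)))
```

Keeping the links as plain pairs makes it easy to review them by hand before they are published, which matters given the trust and provenance concerns raised on slide 11.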
  19. 19. CHALLENGES: ANONYMOUS Anonymous, Anon., anon, anon., anonymous, Anon . Mask image used under a CC licence from
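The variants on slide 19 all mean the same thing, but treated as names they would each produce a separate "person". One possible approach, sketched below, is to recognise them with a single pattern before any agent URI is created. The pattern and function name are illustrative assumptions and only cover the spellings shown on the slide.

```python
import re

# "anon" or "anonymous", any case, with an optional full stop and stray spaces.
ANONYMOUS = re.compile(r"^\s*anon(ymous)?\s*\.?\s*$", re.IGNORECASE)


def is_anonymous(name):
    """Return True if the name string is just a spelling of 'anonymous'."""
    return bool(ANONYMOUS.match(name))


if __name__ == "__main__":
    for name in ["Anonymous", "Anon.", "anon", "anon.", "anonymous", "Anon .", "Gaskell"]:
        print(repr(name), is_anonymous(name))
```

The pattern is anchored at both ends so that a real name that merely begins with "Anon" is not swallowed.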
  20. 20. concept/agent/6947473 lacywilliam
  21. 21. agent/rys
  22. 22. doc/concept/unesco/ photography
  23. 23. WHAT NEXT? Linking Lives ◦ name-based approach into the data ◦ integrating archival resource with other resources ◦ DBPedia, VIAF, Copac... ◦ route into archives for different audiences? ◦ issues around trust and provenance to be explored
  24. 24. FINALLY… The LOCAH data is open for use… …please play with it! Image used under a CC licence from
  25. 25. @bethanar LOCAH blog: Image used under a CC licence from