Linked data and the LOCAH project ILI2011
Slides for a presentation given at the Internet Librarian International Conference (ILI2011), October 2011

Published in: Technology, Education
  • 1. LINKED DATA AND THE LOCAH PROJECT Bethan Ruddock, Library and Archival Services, Mimas, University of @bethanar #ILI2011
  • 2. LINKED OPEN COPAC & ARCHIVES HUB
      • JISC-funded project (under JISCexpo - exposing digital content for education and research)
      • September 2010 – August 2011
      • Staff from Mimas, UKOLN, Eduserv
      • Additional expertise from Talis, OCLC, Library of Congress
  • 3. PROJECT AIMS
      • Put archival and bibliographic data at the heart of the Linked Data Web, making new links between diverse content sources, enabling the free and flexible exploration of data and enabling researchers to make new connections between subjects, people, organisations and places to reveal more about our history and society.
      • Make a collection of resources available on the Web as structured data, in particular linked data, where a case can be made that it would benefit teaching, learning, research, administration and/or knowledge transfer in UK higher education.
      • Develop a prototype with instructional step-by-step demonstration and documentation to show how the structured content can be used by 3rd party tools and services.
      • Explore and report on the opportunities and barriers in making content structured and exposed on the Web for discovery and use. Such opportunities and barriers may coalesce around licensing implications, trust, provenance, sustainability and usability.
  • 4. Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.
  • 5. THE DATA: COPAC
      • Merged union catalogue of the holdings of over 60 UK libraries
      • Over 50 million records
      • Consolidated records
      • MODS XML (not MARC)
      [Image caption] A Copac consolidated record created from 5 contributed records. Lines show how contributed records match with one another.
  • 6. THE DATA: ARCHIVES HUB
      • Descriptions of archive collections from over 200 UK repositories
      • Nearly 25,000 descriptions – collection-level and multi-level
      • EAD (Encoded Archival Description)
  • 7. CHALLENGES: VARIANCE
      • Data from many sources – should adhere to
          • Standards
          • AACR2
          • ISAD(G)
      • BUT
          • Differences in implementation
  • 8. CHALLENGES: DATA
      • 260 $b: unknown
      • dct:publisher: unknown
      • dct:publisher definition: ‘entity responsible for making the resource available’
  • 9. CHALLENGES: MULTIPLE SOURCES A ‘match graph’ of a consolidated Copac record
  • 11. LICENSING
      • Data comes from contributors
          • Not ours to redistribute!
      • Concerns
          • Provenance
          • Trust
          • Control
      • Consulted
          • Liaised with contributors and stakeholders
  • 12. THE TECHY STUFF Specifications required a lot of brainstorming… Image used under a CC licence from
  • 13. ARCHIVES HUB MODEL [Diagram of the Archives Hub data model. Entities shown: Repository (Agent), Place, Postcode Unit, Finding Aid, EAD Document, Archival Resource, Biographical History, Level, Language, Extent, Creation, Temporal Entity, Agent, Person, Family, Organisation, Concept, Concept Scheme, Object, Book, Genre, Function, Birth, Death. Relationship labels shown: maintainedBy/maintains, administeredBy/administers, hasPart/partOf, encodedAs/encodes, accessProvidedBy/providesAccessTo, hasBiogHist/isBiogHistFor, topic/page, level, language, extent, origination, product of, at time, associatedWith, inScheme, representedBy, Is-a, foaf:focus, participates in.]
  • 15. Node name: BibliographicResource; MODS field: <modscollection>; Ontology: bibo
      cardinality  property               (unlabelled)     URI/literal                      ontology
      0 1          copac:creator          Creator          URI                              dc
      0 m          copac:contributor      Contributor      URI                              copac
      0 1          event:producedIn       Production Date  URI                              event
      0 1          dct:issued             Production Date  URI                              dc
      0 m          pode:publicationPlace  Place            URI                              pode
      0 m          isbd:P1016             Place            URI                              isbd
      0 m          dct:publisher          Publisher        URI                              dc
      0 1          dct:isPartOf           Series           URI                              dc
      1 m          copac:HeldBy           Institution      URI with Institution as subject
      1 1          bibo:type              Type             URI                              bibo
      0 m          dct:subject            Subject          URI                              dc
      0 m          skos:subject           subject          URI                              skos
      0 m          dct:language           Language         URI                              dc
      1 1          hub:encodedAs          mods             URI                              hub
  • 16.
  • 17. Visualisation Prototype
      • Using Timemap – Googlemaps and Simile map/
      • Early stages with this
      • Will give location and ‘extent’ of archive.
      • Will link through to Archives Hub
  • 18. Linking [Diagram of linked resources: BBC:Cranford, Copac:Cranford, Hub:Gaskell, DBPedia:Gaskell, Geonames:Manchester, Hub:Dickens, DBPedia:Dickens, VIAF:Dickens]
  • 19. CHALLENGES: ANONYMOUS ‘Anonymous’, ‘Anon.’, ‘anonanon.’, ‘anonymous’, ‘Anon .’ Mask image used under a CC licence from
  • 20. concept/agent/6947473 lacywilliam
  • 21. agent/rys
  • 22. doc/concept/unesco/ photography
  • 23. WHAT NEXT? Linking Lives
      • name-based approach into the data
      • integrating archival resource with other resources
      • DBPedia, VIAF, Copac...
      • route into archives for different audiences?
      • issues around trust and provenance to be explored
  • 24. FINALLY… The LOCAH data is open for use… …please play with it! Image used under a CC licence from
  • 25. @bethanar LOCAH blog: used under a CC licence from
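
Slide 19 lists several spellings of the same "anonymous" creator label. The slides do not say how the project handled these, so the following is only a minimal sketch of one possible approach: collapsing the variants to a single key before minting a URI for the agent. The function name `normalise_anonymous` and the set of accepted variants are assumptions made for illustration and are not taken from the LOCAH project.

```python
import re

# Hypothetical helper for illustration only; not part of the LOCAH codebase.
# The accepted spellings below are an assumption based on slide 19.
ANONYMOUS_KEYS = {"anon", "anonymous"}


def normalise_anonymous(name: str) -> str:
    """Return 'anonymous' for known variant spellings, else the name unchanged."""
    # Lowercase and keep letters only, so 'Anon .' and 'Anon.' both become 'anon'.
    key = re.sub(r"[^a-z]", "", name.lower())
    if key in ANONYMOUS_KEYS:
        return "anonymous"
    return name


if __name__ == "__main__":
    for label in ["Anonymous", "Anon.", "anonymous", "Anon .", "Gaskell, Elizabeth"]:
        print(repr(label), "->", repr(normalise_anonymous(label)))
```

Names that are not recognised variants pass through untouched, so a helper like this could sit in front of whatever name-matching the pipeline already does without altering other agents.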