UKOLN is supported  by: Aggregation Using Linked Data – LOCAH Project Experiences 23rd June 2011 OAI7, Geneva, Switzerland...
LOCAH Project <ul><li>L inked  O pen  C opac and  A rchives  H ub </li></ul><ul><li>Funded by #JiscEXPO 2/10 ‘Expose’ call...
Archives Hub and Copac <ul><li>UK National Data Services based at Mimas </li></ul><ul><li>Archives Hub is an aggregation o...
What is LOCAH Doing? <ul><li>Part 1: Exposing Archives Hub & Copac data as Linked Data </li></ul><ul><li>Part 2: Creating ...
We’re Aggregating <ul><li>If something is identified, it can be linked to </li></ul><ul><li>We take  items from one datase...
Enhancing our data <ul><li>Already have some links: </li></ul><ul><ul><li>Time - reference.data.gov.uk URIs </li></ul></ul...
http://data.archiveshub.ac.uk/
‘ Aggregates’ property points to http://www.openarchives.org/ore/terms/aggregates
 
Visualisation Prototype <ul><li>Using Timemap –  </li></ul><ul><ul><li>Googlemaps and Simile </li></ul></ul><ul><ul><li>ht...
 
BBC Music
APIs, Mashups and Linked Data <ul><li>Mashups work against a fixed set of data sources </li></ul><ul><ul><li>Hand crafted ...
Aggregation / Integration Challenges
 
 
 
Sustainability <ul><li>Can you rely on data sources long-term?  </li></ul><ul><li>Ed Summers at the Library of Congress cr...
Library of Congress Subject Headings
Scalability <ul><li>Will the Web of Data scale? </li></ul>Example by Bradley Allen, Elsevier  at LOD LAM Summit, SF, USA
Data Modelling <ul><li>Complexity </li></ul><ul><ul><li>Archival description is hierarchical and multi-level </li></ul></u...
Linked Data the Way for Aggregation? <ul><li>Enables ‘straightforward’ aggregation of wide variety of data sources </li></...
Questions for Discussion <ul><li>Will using vocabularies and ontologies always be too difficult? </li></ul><ul><ul><li>Or ...
<ul><ul><li>What constitutes data worth linking to? </li></ul></ul><ul><ul><li>How to find datasets suitable for interlink...
Attribution and CC License  <ul><li>Sections of this presentation adapted from materials created by other members of the L...
Upcoming SlideShare
Loading in...5
×

Aggregation Using Linked Data – LOCAH Project Experiences

5,867

Published on

Workshop with Paul Walk and Herbert Van De Sompel at OAI7, Geneva,

http://indico.cern.ch/conferenceTimeTable.py?confId=103325#20110622

Published in: Technology, Education
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
5,867
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
8
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide
  • Copac a union catalogue Both successful JISC services running for many years now Locah is a research project – will have to see if go into service with LD interface
  • In hypertext web sites it is considered generally rather bad etiquette not to link to related external material. The value of your own information is very much a function of what it links to, as well as the inherent value of the information within the web page.  So it is also in the Semantic Web. Remember, this is about machines linking – machines need identifiers; humans generally know when something is a place or when it is a person. BBC + DBPedia + GeoNames + Archives Hub + Copac + VIAF = the Web as an exploratory space
  • This also an example of aggregating data
  • More aggregation
  • Data can be integrated from many diff sources
  • Ex
  • “ lower level” units interpreted in context of the higher levels of description Arguably “incomplete” without the contextual data. Relations are asserted, e.g. member-of/component-of But there is no requirement or expectation that data consumers will follow the links describing the relations
  • Aggregation Using Linked Data – LOCAH Project Experiences

    1. 1. UKOLN is supported by: Aggregation Using Linked Data – LOCAH Project Experiences 23rd June 2011 OAI7, Geneva, Switzerland Adrian Stevenson LOCAH Project Manager
    2. 2. LOCAH Project <ul><li>L inked O pen C opac and A rchives H ub </li></ul><ul><li>Funded by #JiscEXPO 2/10 ‘Expose’ call </li></ul><ul><ul><li>1 year project. Started August 2010 </li></ul></ul><ul><li>Partners & Consultants: </li></ul><ul><ul><li>UKOLN – Adrian Stevenson, Julian Cheal </li></ul></ul><ul><ul><li>Mimas – Jane Stevenson, Bethan Ruddock, Yogesh Patel </li></ul></ul><ul><ul><li>Eduserv – Pete Johnston </li></ul></ul><ul><ul><li>Talis – Leigh Dodds, Tim Hodson </li></ul></ul><ul><ul><li>OCLC - Ralph LeVan, Thom Hickey </li></ul></ul><ul><ul><li>Ed Summers </li></ul></ul><ul><li>http://blogs.ukoln.ac.uk/locah/ tag: #locah </li></ul>
    3. 3. Archives Hub and Copac <ul><li>UK National Data Services based at Mimas </li></ul><ul><li>Archives Hub is an aggregation of archival descriptions from archive repositories across the UK </li></ul><ul><ul><li>http://archiveshub.ac.uk </li></ul></ul><ul><li>Copac provides access to the merged library catalogues of libraries throughout the UK, including all national libraries </li></ul><ul><ul><li>http://copac.ac.uk </li></ul></ul>
    4. 4. What is LOCAH Doing? <ul><li>Part 1: Exposing Archives Hub & Copac data as Linked Data </li></ul><ul><li>Part 2: Creating a prototype visualisation </li></ul><ul><li>Part 3: Reporting on opportunities and barriers </li></ul>
    5. 5. We’re Aggregating <ul><li>If something is identified, it can be linked to </li></ul><ul><li>We take items from one dataset and link them to items from other datasets </li></ul>BBC VIAF DBPedia Archives Hub Copac GeoNames
    6. 6. Enhancing our data <ul><li>Already have some links: </li></ul><ul><ul><li>Time - reference.data.gov.uk URIs </li></ul></ul><ul><ul><li>Location - UK Postcodes URIs and Ordnance Survey URIs </li></ul></ul><ul><ul><li>Names - Virtual International Authority File </li></ul></ul><ul><ul><ul><li>Matches and links widely-used authority files - http://viaf.org/ </li></ul></ul></ul><ul><ul><li>Names - DBPedia </li></ul></ul><ul><li>Also looking at: </li></ul><ul><ul><li>Subjects - Library Congress Subject Headings and DBPedia </li></ul></ul>
    7. 7. http://data.archiveshub.ac.uk/
    8. 8. ‘ Aggregates’ property points to http://www.openarchives.org/ore/terms/aggregates
    9. 10. Visualisation Prototype <ul><li>Using Timemap – </li></ul><ul><ul><li>Googlemaps and Simile </li></ul></ul><ul><ul><li>http://code.google.com/p/timemap / </li></ul></ul><ul><li>Early stages with this </li></ul><ul><li>Will give location and ‘extent’ of archive. </li></ul><ul><li>Will link through to Archives Hub </li></ul>
    10. 12. BBC Music
    11. 13. APIs, Mashups and Linked Data <ul><li>Mashups work against a fixed set of data sources </li></ul><ul><ul><li>Hand crafted by humans </li></ul></ul><ul><ul><li>Don’t integrate well </li></ul></ul><ul><li>Linked Data promises an unbound global data space </li></ul><ul><ul><li>Easy dataset integration </li></ul></ul><ul><ul><li>Generic ‘mesh-up’ tools </li></ul></ul>
    12. 14. Aggregation / Integration Challenges
    13. 18. Sustainability <ul><li>Can you rely on data sources long-term? </li></ul><ul><li>Ed Summers at the Library of Congress created http://lcsh.info </li></ul><ul><li>Linked Data interface for LOC subject headings </li></ul><ul><li>People started using it </li></ul>
    14. 19. Library of Congress Subject Headings
    15. 20. Scalability <ul><li>Will the Web of Data scale? </li></ul>Example by Bradley Allen, Elsevier at LOD LAM Summit, SF, USA
    16. 21. Data Modelling <ul><li>Complexity </li></ul><ul><ul><li>Archival description is hierarchical and multi-level </li></ul></ul><ul><li>Dirty Data </li></ul>Licensing <ul><li>‘ Ownership’ of data </li></ul><ul><li>Hard to track attribution </li></ul><ul><li>CC0 for Archives Hub and Copac data </li></ul>
    17. 22. Linked Data the Way for Aggregation? <ul><li>Enables ‘straightforward’ aggregation of wide variety of data sources </li></ul><ul><li>New channels into your data services </li></ul><ul><li>Researchers are more likely to discover sources </li></ul><ul><li>‘ Hidden' collections of repositories become of the Web </li></ul>
    18. 23. Questions for Discussion <ul><li>Will using vocabularies and ontologies always be too difficult? </li></ul><ul><ul><li>Or will the tools appear? – MS Access for Linked Data? </li></ul></ul><ul><li>Will the Web of Data scale? </li></ul>
    19. 24. <ul><ul><li>What constitutes data worth linking to? </li></ul></ul><ul><ul><li>How to find datasets suitable for interlinking? </li></ul></ul><ul><ul><li>How to make my dataset worth linking to? </li></ul></ul><ul><ul><li>How to encourage others to link to my data? </li></ul></ul><ul><ul><li>What is the added value of links? </li></ul></ul><ul><ul><li>How to determine the quality of a link? </li></ul></ul>Questions if you’ve bought in
    20. 25. Attribution and CC License <ul><li>Sections of this presentation adapted from materials created by other members of the LOCAH Project </li></ul><ul><li>This presentation available under creative commons Non Commercial-Share Alike: </li></ul><ul><li>http://creativecommons.org/licenses/by-nc/2.0/uk/ </li></ul>
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×