Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Digital archiving 3.0

3,659 views

Published on

Presentation given at ODW2013 (http://www.w3.org/2013/04/odw/). Goes over the need for institutions doing digital archiving to publish their meta-data as LOD and ensure formats round-tripping for the data

Published in: Technology, Education
  • Be the first to comment

Digital archiving 3.0

  1. 1. Data Archiving and Networked Services Digital Archiving 3.0 “My data open on the Web, ok but how ?” Christophe Guéret (@cgueret) Open Data on the Web, 23 - 24 April 2013DANS is een instituut van KNAW en NWO
  2. 2. A bit of contexthttp://cedar-project.nl http://easy.dans.knaw.nl
  3. 3. Put your data open on the Web! “E-Data & Research”, October 2011 “Sharing knowledge: EC-funded projects on scientific information in the digital age”
  4. 4. Where is your research data ? Just get it from the web site of the research project I think I have have it somewhere on a stick, let me check... It is available as an RDF/XML dump on my test server
  5. 5. All bad answers, really.● We need research data to be – Accessible/readable/usable by anyone – Available in many (>1) years from now – With traceable provenance and usages● Dumping the data on a web site somewhere is not enough
  6. 6. Solution: use a repository “Sharing knowledge: EC-funded projects on scientific information in the digital age”● Data repositories will take over serving the data and have a page for it!● Repository hold two type of data – The data stored – The meta-data about this data
  7. 7. Which format for meta-data ?● LOD is a perfect fit for describing data – Use to refer to and link data items – Facilitates discovery, easy to crawl/index – One description per data item stored – Redirects to actual location of the data● Remaining question: how much meta-data is needed?
  8. 8. Which format for the data?● Many formats around : PDF, SDF, DSPL, XLS, RDF, CSV, SHP, JSON-LD, ...● Translation will imply some extra work for the data owner and not please everyone
  9. 9. Which format for the data? ● Many formats around : PDF, SDF, DSPL, XLS, RDF, CSV, SHP, JSON-LD, ... ● Translation will imply some extra work for the data owner and not please everyoneExpress your data as Buy a DN, decide on a Select vocabularies todescribed resources URI scheme for your data describe your resources
  10. 10. Just get the ● data in theSolution: use a repository repository ● Repositories will take care● Data repositories will take over everything of serving your data ● PS: forget about HTTP URIs for data
  11. 11. Format evolution● Use Content-negotiation to translate and serve different data formats● Ensure everyone gets the format he wants
  12. 12. Format evolution● Use Content-negotiation to translate and serve different data formats● Ensure everyone gets the format he wants ? ?
  13. 13. Next generation archives● Provide long term access to data in several formats● Publish Linked Open Meta-Data about the data stored (DCAT, ...)● Facilitate moving data around archives (LDP, ...)

×