Enhanced publications: an introduction – Arjan Hogenaar, DANS


OpenAIREplus workshop - “Linking Open Access publications to data – policy development and implementation” (June 11, 2012)

Published in: Technology, Education

  1. Data Archiving and Networked Services. Enhanced Publications: an introduction. Arjan Hogenaar, 11 June 2012. DANS is an institute of KNAW and NWO.
  2. DANS: Data Archiving and Networked Services • Sustainable access to digital research data • Promoting trusted digital repositories (DSA) • But also: services like EASY, NARCIS and the National Resolver
  3. Enhanced Publications: (traditional) publications enhanced with: – Datasets – Video fragments – Audio fragments – Images – Detailed information on organisation/author • The enhancements serve to clarify the context • Not all these components have to be included in an EP
  4. Two fundamental ways to compose EPs: A. ‘Machine-based composition’ • The objects brought together already share one or more properties • Examples: – ARVODI (in the Netherlands): project description, publication and dataset share a unique identifier – OpenAIREplus: the EU grant agreement number • Relationships between the objects are ‘unbiased’
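
Machine-based composition as described on slide 4 amounts to grouping objects on a shared property. The following is a minimal sketch, assuming each object is a dict with hypothetical `kind`, `title` and `grant_id` fields; the records and grant numbers are invented for illustration and do not come from ARVODI or OpenAIREplus.

```python
from collections import defaultdict

# Hypothetical records: field names and values are illustrative only.
records = [
    {"kind": "project", "title": "Project description", "grant_id": "000001"},
    {"kind": "publication", "title": "Journal article", "grant_id": "000001"},
    {"kind": "dataset", "title": "Survey data", "grant_id": "000001"},
    {"kind": "publication", "title": "Unrelated paper", "grant_id": "000002"},
]


def compose_by_shared_id(items, key="grant_id"):
    """Group objects that carry the same value for the shared identifier."""
    groups = defaultdict(list)
    for item in items:
        groups[item[key]].append(item)
    # A group with a single object has nothing to be enhanced with, so drop it.
    return {k: v for k, v in groups.items() if len(v) > 1}


enhanced = compose_by_shared_id(records)
```

No human judgement enters the grouping, which is the sense in which the slide calls these relationships ‘unbiased’.
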
  5. Two fundamental ways to compose EPs: B. ‘Man-made composition’ • The objects are brought together based on the opinion of the composer • It is not always clear why a particular object (an image, for instance) has been related to a publication
  6. Is there a relationship between these two teams? Denmark, Netherlands
  7. OAI-ORE (Object Re-use and Exchange) for EPs (some basics) • Resource Map to describe an EP (in OAI-ORE: an aggregation) • Aggregation to describe the components of an EP (in OAI-ORE: the aggregated resources) • Aggregated resources may be documents (Eprints), datasets (data objects), images, metadata records, authors (persons) and so on
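
The three OAI-ORE notions on slide 7 can be sketched as plain data classes: a Resource Map describes one Aggregation, and the Aggregation lists its aggregated resources. This is only an illustration of the relationships; the class names, the `kind` field and the example.org URIs are assumptions, not part of the OAI-ORE vocabulary or of any DANS system.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class AggregatedResource:
    uri: str
    kind: str  # illustrative label, e.g. "eprint", "dataset", "image", "person"


@dataclass
class Aggregation:
    uri: str
    resources: List[AggregatedResource] = field(default_factory=list)


@dataclass
class ResourceMap:
    uri: str
    describes: Aggregation  # a Resource Map describes exactly one Aggregation


# Hypothetical EP made of one publication and one dataset.
aggregation = Aggregation("http://example.org/aggregation/1")
aggregation.resources.append(AggregatedResource("http://example.org/eprint/1", "eprint"))
aggregation.resources.append(AggregatedResource("http://example.org/dataset/1", "dataset"))
resource_map = ResourceMap("http://example.org/rem/1", describes=aggregation)
```
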
  8. Data Model of EPs. Van de Sompel: OAI-ORE Primer
  9. Advantages of EPs • Background information is easy to find • Conclusions in a paper may be verified via the dataset(s) used • Information is presented in context
  10. Additional advantages of man-made EPs • Authors may not simply relate components, but also add a comment on why they have chosen to relate them • An author may allow other researchers to add components to the original EP: an EP is no longer a static document • But be aware: in the case of man-made EPs it is not always clear why objects have been put together into one single EP!
  11. Example of an EP: textual representation. Not very revolutionary! But... see the XML!
  12. RDF/XML serialisation of the OAI-ORE description of an EP:
        <rdf:RDF>
          <rdf:Description rdf:about="http://escape.utwente.nl/rem/561">
            <rdf:type rdf:resource="http://www.openarchives.org/ore/terms/ResourceMap"/>
            <dcterms:created>2011-02-16T14:43:57.756Z</dcterms:created>
            <dcterms:modified>2011-05-05T08:09:09.571Z</dcterms:modified>
            <ore:describes rdf:resource="http://escape.utwente.nl/aggregation/561"/>
            <dcterms:creator rdf:nodeID="183fb582"/>
          </rdf:Description>
          <rdf:Description rdf:nodeID="183fb582">
            <foaf:name>ESCAPE repository</foaf:name>
            <foaf:page rdf:resource="http://escape.utwente.nl/"/>
          </rdf:Description>
          <rdf:Description rdf:about="http://escape.utwente.nl/aggregation/561">
          etc
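
To show how such a serialisation can be read by a machine, here is a minimal sketch using Python's standard `xml.etree.ElementTree`. It works on a shortened copy of the fragment from slide 12; the namespace declarations and the closing tag are added here as an assumption so that the fragment parses, since the slide omits them. A production system would more likely use a dedicated RDF library.

```python
import xml.etree.ElementTree as ET

# Standard namespace URIs for the prefixes used on the slide.
NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "ore": "http://www.openarchives.org/ore/terms/",
    "dcterms": "http://purl.org/dc/terms/",
}

# Shortened fragment; xmlns declarations and </rdf:RDF> are added for parsing.
XML = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:ore="http://www.openarchives.org/ore/terms/"
         xmlns:dcterms="http://purl.org/dc/terms/">
  <rdf:Description rdf:about="http://escape.utwente.nl/rem/561">
    <rdf:type rdf:resource="http://www.openarchives.org/ore/terms/ResourceMap"/>
    <dcterms:created>2011-02-16T14:43:57.756Z</dcterms:created>
    <ore:describes rdf:resource="http://escape.utwente.nl/aggregation/561"/>
  </rdf:Description>
</rdf:RDF>"""

# ElementTree addresses namespaced attributes as "{namespace-uri}localname".
RDF = "{%s}" % NS["rdf"]

root = ET.fromstring(XML)
rem = root.find("rdf:Description", NS)          # the Resource Map description
rem_uri = rem.get(RDF + "about")                # URI of the Resource Map
aggregation_uri = rem.find("ore:describes", NS).get(RDF + "resource")
created = rem.findtext("dcterms:created", namespaces=NS)
```

The `ore:describes` property is the link from the Resource Map to the Aggregation, so following `aggregation_uri` leads to the components of the EP.
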
  13. EPs and DANS. DANS has three major services: – The portal NARCIS – The archiving system for datasets: EASY – The National Resolver
  14. EASY and Resolver • EASY is used both as a system to deposit datasets and as an access point to search for datasets: https://easy.dans.knaw.nl/ui/home • The Resolver is a new service, resolving the persistent identifier of an object to its actual URL. The Resolver is crucial in the realisation of sustainable access to information objects: http://persistent-identifier.nl/
  15. NARCIS • Main national portal to research information, publications and datasets (from EASY and 3TU.Datacentrum) • Information is gathered using the OAI-PMH protocol • It is a starting point for the development of EPs, as most of the EP components (project descriptions, descriptions of researchers and institutions, metadata of publications and datasets) have been incorporated in the portal: www.narcis.nl
  16. EPs in NARCIS • In co-operation with SURF in 2011: – Study into the possibilities to present EPs in NARCIS – Under the following conditions: • OAI-PMH to harvest descriptions of EPs • OAI-ORE to describe the relationships between objects within EPs or between different EPs • RDF/XML for the serialisation of EPs • Incorporation of a visualisation tool to present EPs • Use of persistent identifiers for authors and objects
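
Harvesting EP descriptions over OAI-PMH comes down to issuing HTTP requests with a `verb` argument. The sketch below only builds a `ListRecords` request URL and makes no network call. The argument names (`verb`, `metadataPrefix`, `set`, `resumptionToken`) are those of the OAI-PMH 2.0 protocol; the base URL and the set name `enhanced` are hypothetical, and the metadata prefix a repository uses for EPs is not stated in the slides.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None,
                     resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is exclusive: it replaces the other arguments.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return base_url + "?" + urlencode(params)


# Hypothetical repository endpoint and set name.
url = list_records_url("http://repository.example.org/oai", set_spec="enhanced")
next_url = list_records_url("http://repository.example.org/oai",
                            resumption_token="token123")
```

A harvester would repeat the request with each `resumptionToken` returned by the repository until no further token is given.
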
  17. Visualisation of an EP in NARCIS: from RDF/XML to user-friendliness (screenshot)
  18. Digital Author Identifier: its role in EPs • No doubt about the identity of an author • Centralised system in the Netherlands • May be used in publications, datasets, project descriptions and of course in EPs • Example: R.L. Zijdeman has the DAI 304832960
  19. Persistent Identifiers: separating a resource from its actual URL • The URL of an object is not stable • But: assigning a PID to an object may overcome this problem • The combination of the object, its PID and a resolver service (to translate the PID into the actual URL of the object) facilitates sustainable access • Example: a thesis and its PID as part of an EP in NARCIS
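
The separation of identifier and location on slide 19 can be illustrated with an in-memory lookup table. This is a toy sketch under stated assumptions, not how the National Resolver is implemented: the `Resolver` class, the PID string and the example.org URLs are all made up for illustration.

```python
class Resolver:
    """Toy resolver: maps a persistent identifier to the object's current URL."""

    def __init__(self):
        self._locations = {}

    def register(self, pid, url):
        # Registering an existing PID again simply updates its location.
        self._locations[pid] = url

    def resolve(self, pid):
        # Raises KeyError for an unknown PID.
        return self._locations[pid]


resolver = Resolver()
pid = "urn:nbn:example:0001"  # hypothetical identifier

resolver.register(pid, "http://old.example.org/thesis.pdf")
# The object moves; only the resolver entry changes, citations keep the PID.
resolver.register(pid, "http://new.example.org/files/thesis.pdf")

current_url = resolver.resolve(pid)
```

An EP that refers to its components by PID therefore stays valid when a repository reorganises its URLs, provided the resolver entry is kept up to date.
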
  20. EP: summary 1. A totally new way of disseminating information 2. EPs may be deposited in a repository just like traditional publications 3. OAI-PMH may be used to harvest the metadata descriptions of EPs 4. The description of EPs relies on the OAI-ORE model (with resource maps and aggregations)
  21. Two key aspects in EP development • Digital Preservation • Data Curation
  22. Digital Preservation: components of an EP have to be sustainably available – For publications: electronic depots of national libraries; assigning Persistent Identifiers to the objects; secure information infrastructure – For datasets: Data Curation: Data Management Plan; Data Seal of Approval (DSA)
  23. Digital Curation of Datasets: A. Data Management Plan (at the start of a project): • Describing nature of data • Describing way of data collection • Hard- and software to be used • Describing intellectual property rights and legal requirements • Describing (re-)use of data
  24. Digital Curation of Datasets: B. Data Seal of Approval (DSA) http://datasealofapproval.org/ • Ensuring sustainable access to research data • Guidelines for: – Data producer – Data archive – Data consumer • Self-assessment, peer reviewed by the DSA board, after which the DSA will be granted • Trust is crucial
  25. Open Access (as seen from a funder): NWO (Netherlands Organisation for Scientific Research) promotes Open Access in two ways: 1. Open Access to research publications 2. Open Access to research data
  26. NWO OA policy: publications • Publications as results of research projects supported by NWO should, irrespective of other publication possibilities, be made accessible to the general public as quickly as possible via Open Access. • Copyrights may only be assigned to a third party insofar as this does not block the option to publish via Open Access. • Deviation from this rule is only possible with the explicit prior approval of NWO.
  27. NWO OA policy: research data • Aim: maximise access to data for re-use (‘Open Access unless specified otherwise’) • Means: setting up an Open Access Code of Conduct to make rights and obligations visible to all stakeholders • Legally: NWO is considered to be the co-producer of data generated as a result of an NWO grant • Practice: data collected during a research project have to be stored and documented in a thorough and accessible manner
  28. Other challenges 1. Authors may combine components from different sources 2. Therefore, sustainable access is not guaranteed for all components 3. Maintenance of dynamically composed publications is technically no problem, but versioning is 4. Copyright rules may vary for the different components
  29. Thank you for your attention. For more information please contact arjan.hogenaar@dans.knaw.nl. Data Archiving and Networked Services, Anna van Saksenlaan 10, 2593 HT The Hague. P.O. Box 93067, 2509 AB The Hague. T +31 (0)70 3446 484, F +31 (0)70 3446 482, E info@dans.knaw.nl