Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Long-term data curation, aka data preservation - EUDAT Summer School (Marjan Grootveld, DANS)

520 views

Published on

Marjan will give an overview of the role of data archives in ensuring the safe stewardship and preservation of data over time. She will explain what it means to be a Trustworthy Digital Repository and the associated policies and processes that need to be in place to ensure data provenance and authenticity. This session will link to Monday’s exploration of the re3data.org portal

Visit: https://www.eudat.eu/eudat-summer-school

Published in: Data & Analytics
  • Be the first to comment

  • Be the first to like this

Long-term data curation, aka data preservation - EUDAT Summer School (Marjan Grootveld, DANS)

  1. 1. www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 Long-term data curation, aka data preservation Marjan Grootveld, DANS This work is licensed under the Creative Commons CC-BY 4.0 licence #EUDATschool
  2. 2. Two questions 1. What is the oldest data that you have used or looked at, i.e. not generated by you? 2. Where did you find it?
  3. 3. Long-term preservation * Consultative Committee for Space Data Systems. Reference Model for an Open Archival Information System (OAIS). Recommended Practice -- CCSDS 650.0-M-2. Magenta Book, June 2012. https://public.ccsds.org/pubs/650x0m2.pdf
  4. 4. EUDAT Summer School, 3-7 July 2017, Crete Climatological database for the world’s oceans Image copied from https://www.knmi.nl/kennis-en-datacentrum/achtergrond/cliwoc Every yellow dot represents a ship report. Project web site: http://pendientedemigracion.ucm.es/info/cliwoc/
  5. 5. Viking Mars Lander 5 http://www.dpconline.org/docman/miscellaneous/advocacy/340-mind-the-gap-assessing-digital-preservation- needs-in-the-uk/file Data now available from https://pds-imaging.jpl.nasa.gov/volumes/viking.html
  6. 6. EUDAT Summer School, 3-7 July 2017, Crete Institute of Dutch Academy and Research Funding Organisation (KNAW & NWO) since 2005 First predecessor dates back to 1964 (Steinmetz Foundation), Historical Data Archive 1989 Mission: promote and provide permanent access to digital research resources DANS organisation
  7. 7. EUDAT Summer School, 3-7 July 2017, Crete DANS long-term data archive EASY Certified Long- term Archive https://easy.dans.knaw.nl/
  8. 8. EUDAT Summer School, 3-7 July 2017, Crete DANS and DSA • 2005: DANS to promote and provide permanent access to digital research resources • Formulate quality guidelines for digital data repositories including DANS • 2006: 5 basic principles as basis for 16 DSA guidelines • 2009: international DSA Board • Almost 70 seals acquired around the globe, but with a focus on Europe
  9. 9. EUDAT Summer School, 3-7 July 2017, Crete The Certification Pyramid ISO 16363:2012 - Audit and certification of trustworthy digital repositories http://www.iso16363.org/ DIN 31644 standard “Criteria for trustworthy digital archives” http://www.langzeitarchivierung.de http://www.datasealofapproval.org/ https://www.icsu-wds.org/ http://trusteddigitalrepository.eu/
  10. 10. EUDAT Summer School, 3-7 July 2017, Crete DSA and WDS: look-a-likes Communalities: • Lightweight, self assessment, community review Complementarity: • Geographical spread • Disciplinary spread
  11. 11. EUDAT Summer School, 3-7 July 2017, Crete Coming soon:
  12. 12. EUDAT Summer School, 3-7 July 2017, Crete Part of CTS’s 16 requirements R2. The repository maintains all applicable licenses covering data access and use and monitors compliance. R3. The repository has a continuity plan to ensure ongoing access to and preservation of its holdings. R4. The repository ensures, to the extent possible, that data are created, curated, accessed, and used in compliance with disciplinary and ethical norms. R7. The repository guarantees the integrity and authenticity of the data. R8. The repository accepts data and metadata based on defined criteria to ensure relevance and understandability for data users. R10. The repository assumes responsibility for long-term preservation and manages this function in a planned and documented way. R11. The repository has appropriate expertise to address technical data and metadata quality and ensures that sufficient information is available for end users to make quality-related evaluations. R13. The repository enables users to discover the data and refer to them in a persistent way through proper citation. R14. The repository enables reuse of the data over time, ensuring that appropriate metadata are available to support the understanding and use of the data.
  13. 13. EUDAT Summer School, 3-7 July 2017, Crete Guidance document For aspiring repositories reviewers Also about data producers data users Requirements 1–16
  14. 14. EUDAT Summer School, 3-7 July 2017, Crete Levels of curation Plus R0: context and “Level of Curation Performed” ”All levels of curation assume initial deposits are retained unchanged (…) edits are only made on copies of those originals.” “Annotations/edits must fall within the terms of the licence agreed with the data producer...” “the repository will be expected to demonstrate that any such annotations/edits are undertaken and documented by appropriate experts”
  15. 15. EUDAT Summer School, 3-7 July 2017, Crete Exercise Download the Draft CoreTrustSeal Guidance document, read the guidance about Requirements 10, 7 and 14, and answer the following questions: Ad R10: what does this mean for you as a data producer? Ad R7: next time you look for a repository to deposit or reuse data, will it differ from last Tuesday? How? Ad R14: what does this mean for you as a data reuser?
  16. 16. EUDAT Summer School, 3-7 July 2017, Crete Preservation isn’t rocket science. It’s a profession in the trust business. Knossos – M. Grootveld
  17. 17. www.eudat.eu Acknowledgements: Thanks to Ingrid Dillo (DANS) for slides Outlook: a pilot for scoring the FAIRness of existing datasets Author: Marjan Grootveld, DANS This work is licensed under the Creative Commons CC-BY 4.0 licence F A I R 2 User Reviews 1 Archivist Assessment 24 Downloads

×