www.eudat.eu
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065
Long-term data curation,
aka data preservation
Marjan Grootveld, DANS
This work is licensed under the Creative
Commons CC-BY 4.0 licence
#EUDATschool
Two questions
1. What is the oldest data that you have used or looked at, i.e. not generated by you?
2. Where did you find it?
Long-term preservation
* Consultative Committee for Space Data Systems. Reference Model for an Open Archival Information System (OAIS). Recommended Practice, CCSDS 650.0-M-2. Magenta Book, June 2012.
https://public.ccsds.org/pubs/650x0m2.pdf
EUDAT Summer School, 3-7 July 2017, Crete
Climatological database for the world’s oceans
Image copied from https://www.knmi.nl/kennis-en-datacentrum/achtergrond/cliwoc
Every yellow dot represents a ship report.
Project web site: http://pendientedemigracion.ucm.es/info/cliwoc/
Viking Mars Lander
http://www.dpconline.org/docman/miscellaneous/advocacy/340-mind-the-gap-assessing-digital-preservation-needs-in-the-uk/file
Data now available from https://pds-imaging.jpl.nasa.gov/volumes/viking.html
DANS organisation
Institute of the Dutch Academy and Research Funding Organisation (KNAW & NWO) since 2005
First predecessor dates back to 1964 (Steinmetz Foundation), Historical Data Archive 1989
Mission: promote and provide permanent access to digital research resources
DANS long-term data archive
EASY
Certified Long-term Archive
https://easy.dans.knaw.nl/
DANS and DSA
• 2005: DANS to promote and provide permanent
access to digital research resources
• Formulate quality guidelines for digital data
repositories including DANS
• 2006: 5 basic principles as basis for 16 DSA guidelines
• 2009: international DSA Board
• Almost 70 seals acquired around the globe, but with
a focus on Europe
The Certification Pyramid
ISO 16363:2012 - Audit and certification of trustworthy digital repositories
http://www.iso16363.org/
DIN 31644 standard “Criteria for trustworthy digital archives”
http://www.langzeitarchivierung.de
http://www.datasealofapproval.org/
https://www.icsu-wds.org/
http://trusteddigitalrepository.eu/
DSA and WDS: look-alikes
Commonalities:
• Lightweight, self-assessment, community review
Complementarity:
• Geographical spread
• Disciplinary spread
Coming soon:
Part of the 16 CoreTrustSeal (CTS) requirements
R2. The repository maintains all applicable licenses covering data access and use
and monitors compliance.
R3. The repository has a continuity plan to ensure ongoing access to and
preservation of its holdings.
R4. The repository ensures, to the extent possible, that data are created, curated,
accessed, and used in compliance with disciplinary and ethical norms.
R7. The repository guarantees the integrity and authenticity of the data.
R8. The repository accepts data and metadata based on defined criteria to ensure
relevance and understandability for data users.
R10. The repository assumes responsibility for long-term preservation and manages
this function in a planned and documented way.
R11. The repository has appropriate expertise to address technical data and
metadata quality and ensures that sufficient information is available for end users to
make quality-related evaluations.
R13. The repository enables users to discover the data and refer to them in a
persistent way through proper citation.
R14. The repository enables reuse of the data over time, ensuring that appropriate
metadata are available to support the understanding and use of the data.
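As an illustration of what R7 (integrity) can involve in practice, here is a minimal Python sketch of a fixity check: comparing a file's current checksum with the one recorded at deposit. It is not taken from the CoreTrustSeal requirements; the function names `sha256_of` and `fixity_ok` and the choice of SHA-256 are assumptions made for this example.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large data files do not fill memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path: Path, recorded_digest: str) -> bool:
    """Compare the file's current digest with the digest recorded at deposit."""
    # Hex digests are case-insensitive, so normalise the recorded value.
    return sha256_of(path) == recorded_digest.lower()
```

A repository might run such a check periodically and after every copy or migration; a mismatch signals that the stored file is no longer bit-identical to the original deposit.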
Guidance document
For:
• aspiring repositories
• reviewers
Also about:
• data producers
• data users
Requirements 1–16
Levels of curation
Plus R0: context and “Level of Curation Performed”
“All levels of curation assume initial deposits are retained unchanged (…) edits are only made on copies of those originals.”
“Annotations/edits must fall within the terms of the licence agreed with the data producer...”
“the repository will be expected to demonstrate that any such annotations/edits are undertaken and documented by appropriate experts”
Exercise
Download the Draft CoreTrustSeal Guidance
document, read the guidance about Requirements
10, 7 and 14, and answer the following questions:
Ad R10: what does this mean for you as a data
producer?
Ad R7: next time you look for a repository to deposit
or reuse data, will it differ from last Tuesday? How?
Ad R14: what does this mean for you as a data
reuser?
Preservation isn’t rocket science.
It’s a profession in the trust business.
Knossos – M. Grootveld
Acknowledgements:
Thanks to Ingrid Dillo (DANS) for slides
Outlook:
a pilot for scoring the FAIRness
of existing datasets
Author:
Marjan Grootveld, DANS
Figure: FAIR (F, A, I, R) score display for a dataset, with 2 user reviews, 1 archivist assessment and 24 downloads

Long-term data curation, aka data preservation - EUDAT Summer School (Marjan Grootveld, DANS)