This talk gives an overview of current research data management practice with special emphasis on the role libraries can play as actors within larger information infrastructures. Such infrastructures are being increasingly summarized under the term Research Data Repositories (RDR).
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Making Research Data Repositories visible – the re3data Registry of Research Data Repositories
1. KIT – The Research University in the Helmholtz Association
KIT Library
www.kit.edu
Making Research Data Repositories visible –
the re3data Registry of Research Data Repositories
Frank Scholze, Karlsruhe Institute of Technology
2017 International Conference on Leadership and Innovative Management in Academic
Libraries in the Age of New Technology, Tongji University, Shanghai
2. KIT Library2
KIT - Figures and Facts
5 Campuses – 200 haarea
1,000 International scientists
3,000Doctoral students
9,300 Employees
25,000 Students
470 Trainees
300Buildings with a usable
area of 450,000m2
59Patent applications
18Spinoffs and startups
KIT budget
EUR 860 million
Status: 2015
364Professors and executive
scientists
30%
28%
State
funds
42%
Third-
party
funds
Federal
funds
KIT – The Research University in the Helmholtz Association4/11/2017
3. KIT Library3 KIT – The Research University in the Helmholtz Association4/11/2017
KIT-Library Facts
86.000
E-Books
115.000E-Journals
31.300
Users
5,3Information-budget (Mio. Euros)
4. KIT Library4
Large Research Infrastructures at KIT
KIT – The Research University in the Helmholtz Association4/11/2017
5. KIT Library5
Data challenge
Scientific knowledge gain increasingly based on data
Amount of data and number of files continuously growing
Limits in current approaches
Towards FAIR principles
Findable
Accessible
Interoperable
Re-usable
Novel methods needed
KIT – The Research University in the Helmholtz Association4/11/2017
6. KIT Library6
What is Research Data?
Original sources or material
that is created or collated
during research
Research data are of most
varied nature
Research data is different
from conventional library
media
KIT – The Research University in the Helmholtz Association4/11/2017
intrinsic
findings
observations,
measurement data
simulation
technical documentation,
method descriptions
quantum
physics
sociological research
7. KIT Library7
Research Data Life Cycle
KIT – The Research University in the Helmholtz Association4/11/2017
Research data impact the
whole research process
Research data require
support within existing as
well as new information-
infrastructures
(e.g. Research Data
Repositories - RDR)
8. KIT Library8
Research Data Management (RDM)
KIT – The Research University in the Helmholtz Association4/11/2017
Publication
Archive
RDR / Services
ResearchData
Support
Support of research data
management connects all
tiers
Cooperation of many
service units is needed
RDM reaches beyond the
boundaries of a single
institution
9. KIT Library9
The Role for Libraries in RDM
Institutional research data policy development
Cooperation with researchers, research groups, data archives and data
centers
Create Data Librarian posts and develop professional staff skills for data
librarianship
Data management plans for grant applications, intellectual property rights
advice and information materials
Metadata and data standards
Subject specific data management practice
(Institutional) data repository
Research data citation (persistent identifiers)
Services for storage, discovery, reuse and permanent access
KIT – The Research University in the Helmholtz Association4/11/2017
LIBER - 10 Recommendations on Getting Started in RDM
10. KIT Library10
Service Team RDM@KIT
4/11/2017
Joint facility of KIT Library, Steinbuch Centre for Computing, KIT Research
Office, Centre for Cultural and General Studies, Center for Applied Legal
Studies, Institute for Data Processing and Electronics, KIT Archive
Central services and consulting at KIT for archiving and publishing research
data at every stage of the research process:
Project
planning
Proposal
Data
preparation
and selection
Ingest and
storage
Publication
and reference
Access
Preservation
and curation
Reuse of data
KIT – The Research University in the Helmholtz Association
11. KIT Library11
Survey bwFDM (bwfdm.scc.kit.edu)
The observed research data landscape
in Baden-Württemberg
3000 different research groups
social sciences and humanities (31 %)
natural sciences (18%)
engineering (23%)
life science incl. medicine (27%)
700 interviews with researchers
approx. 1h each researcher
1000 user stories extracted from the interviews
KIT – The Research University in the Helmholtz Association4/11/2017
12. KIT Library12
Research Data Policy
Access, Verification & Reuse
Responsibility
Support for RDM
Provision of infrastructure
Foster open science
KIT – The Research University in the Helmholtz Association4/11/2017
14. KIT Library14
users
Enabling and integrating RDM services
KIT – The Research University in the Helmholtz Association4/11/2017
Archive Interface
Technical
Metadata
Research
data
bwDataDiss RADAR
Descriptive
Metadata
Descriptive
Metadata
bwDataArchiv
Technical,
operative
Metadata
… other
services
… other
services
Mo|Re Data,
Chemotion,
...
End users
services
archive
KITopen / CRIS /
KIT Scientific
Publishing
15. KIT Library15
Partner of scientific communities at KIT
Chemotion: Developement of an Electronic
Notebook with Repository Integration
KCDC KASCADE: Cosmic Ray Data Centre
MO|RE data: Motor Research Data
Networking and Co-operation
4/11/2017 KIT – The Research University in the Helmholtz Association
16. KIT Library16
Networking and Co-operation
4/11/2017
Enhance cross collaboration as partners in state, federal and
international projects
Helmholtz Data Federation
E-Science Projects Baden-Württemberg
re3data / DataCite
Research Data Alliance
EUDAT
KIT – The Research University in the Helmholtz Association
17. KIT Library17 KIT – The Research University in the Helmholtz Association4/11/2017
18. KIT Library18
re3data - Mission
KIT – The Research University in the Helmholtz Association4/11/2017
global registry of research data repositories
covers all academic disciplines
presents repositories and portals for the permanent storage and
access of research data sets to researchers, funding bodies,
publishers and scholarly institutions.
promotes a culture of sharing, increased access and better
visibility of research data
19. KIT Library19
Journal Data Policies
• Nature Publishing Group
• “[...] authors are required to make
materials, data and associated
protocols promptly available to
readers without undue qualifications. “
• PLOS
• “PLOS journals require authors to
make all data underlying the findings
described in their manuscript fully
available without restriction, with rare
exception.“
KIT – The Research University in the Helmholtz Association4/11/2017
NPG (2013). Availability of data and materials. Retrieved from http://www.nature.com/authors/policies/availability.html
PLOS (2014). PLOS Editorial and Publishing Policies. Retrieved from http://www.plosone.org/static/policies.action
20. KIT Library20
Registration Policy
To be registered in re3data.org a research data repository must
be run by a legal entity, such as a sustainable institution
(e.g. library, university)
clarify access conditions to the data and repository as well as
the terms of use
have focus on research data
KIT – The Research University in the Helmholtz Association4/11/2017
21. KIT Library21
Metadata Schema
41 Properties on
General information
Responsibilities
Policies
Legal aspects
Technical standards
Quality standards
KIT – The Research University in the Helmholtz Association4/11/2017
22. KIT Library22
Icons
Facilitating the selection process of appropriate research data
repositories
KIT – The Research University in the Helmholtz Association4/11/2017
The research datarepository provides
additional information on its ser vice.
The research datarepository
provides open/restricted/closed
access to its data.
The terms of use and licenses
of the dataare provided by the
research datarepository.
The research datarepository
provides apolicy.
The research datarepository uses
apersistent identifier system to make its
provided data persistent,unique and citable.
The research datarepository is
either certified or suppor ts a
repository standard. RESEARCH
DATA
REPOSITORY
GENERAL
INFORMATION
POLICY
LEGAL
ASPECTS
TECHNICAL
STANDARDS
QUALITY
STANDARDS
23. KIT Library23
Sustainability
2012-2015 DFG project (German Research Center for
Geosciences, Humboldt University Berlin, Karlsruhe Institute of
Technology KIT)
From 2016 on:
merge with DataBib (new partner: Purdue University)
official service of DataCite
re3data.org working group within DataCite
technical maintenance and development financed and
managed by DataCite
International Editorial Board
Cooperation with RDA, DINI, OpenAIRE, BioSharing
KIT – The Research University in the Helmholtz Association4/11/2017
24. KIT Library24
Editorial Board
Hui Wang (National Science Library, Chinese Academy of
Sciences)
Jiban K. Pal (Indian Statistical Institute Library, Kolkata)
Gail Steinhart (Scholarly Communication Librarian at Cornell
University Library in Ithaca, NY)
Sarah Williams (University of Illinois at Urbana-Champaign)
Catherine Jones (Science and Technology Facilities Council
Harwell Oxford)
+ Core Team at KIT and Purdue
KIT – The Research University in the Helmholtz Association4/11/2017
25. KIT Library25
Technology
Open interfaces
RESTful API
Documentation: http://www.re3data.org/api/doc
OpenSearch
Various usage scenarios,
e.g. European Union Open Science Monitor
KIT – The Research University in the Helmholtz Association4/11/2017
26. KIT Library26
EU Open Science Monitor
KIT – The Research University in the Helmholtz Association4/11/2017
27. KIT Library27
Suggest Form
KIT – The Research University in the Helmholtz Association4/11/2017
Suggesting new
repositories
31. KIT Library31
Types of RDR
KIT – The Research University in the Helmholtz Association4/11/2017
Kindling et al https://doi.org/10.1045/march2017-kindling
n = 1,379, 2 RDR with missing values, multiple values possible
32. KIT Library32
Persistent Identifier systems used by RDR
KIT – The Research University in the Helmholtz Association4/11/2017
n = 1421, multiple values possible
Kindling et al https://doi.org/10.1045/march2017-kindling
33. KIT Library33
Access to RDR
KIT – The Research University in the Helmholtz Association4/11/2017
Kindling et al https://doi.org/10.1045/march2017-kindling
n = 1,381, multiple values possible
(Fee,
membership,
registration)
34. KIT Library34
RDR in Chína
KIT – The Research University in the Helmholtz Association4/11/2017
35. KIT Library35
Badges
KIT – The Research University in the Helmholtz Association4/11/2017
Link to the re3data
entry from
repository website
37. KIT Library37
re3data – Recommendations for RDR
Support PID systems for each dataset
Use a data license to clarify access and usage conditions for the
data sets provided
Make metadata and related research data sets available to other
services and research organizations through an API
Ensure compliance with certificates and standards
The institutional responsibility has to be clarified and
communicated (data policy or mission statement).
Create policies to describe the services offered, the terms under
which the repository may be used
Repository software should support technical standards
KIT – The Research University in the Helmholtz Association4/11/2017
38. KIT Library38
Research Data Life Cycle - reloaded
KIT – The Research University in the Helmholtz Association4/11/2017
39. KIT Library39
Thank you for your attention!
KIT – The Research University in the Helmholtz Association4/11/2017
With the exception of all photos and graphics, this slides are licensed under
the “Attribution 4.0 International (CC BY 4.0)“ Licence:
http://creativecommons.org/licenses/by/4.0/
40. KIT Library40
References
Christensen-Dalsgaard, B. et al. (2012). LIBER - 10 Recommendations
on Getting Started in RDM http://libereurope.eu/wp-
content/uploads/The%20research%20data%20group%202012%20v7%
20final.pdf
Whyte, A., Tedds, J. (2011). ‘Making the Case for Research Data
Management’. DCC Briefing Papers. Edinburgh: Digital Curation
Centre. http://www.dcc.ac.uk/resources/briefing-papers
Pampel, H. et al. (2013). Making Research Data Repositories Visible:
The re3data.org Registry. In: Plos ONE, 8 (11), e78080.
doi:10.1371/journal.pone.0078080
Kindling, M. et al. (2017). The Landscape of Research Data
Repositories in 2015: A re3data Analysis. In: D-Lib Magazine Volume
23, Number 3/4. https://doi.org/10.1045/march2017-kindling
KIT – The Research University in the Helmholtz Association4/11/2017