Data management and
governance -
the re3data.org experience
Frank Scholze | Karlsruhe Institute of Technology (KIT)
European Library Advisory Board | Munich 16.5.2013
Outline
• Background
• Research Data Repositories
• re3data.org
Background
• Research data are valuable and ubiquitous
• New technologies facilitate data-intensive science
• Broad discussion about the permanent access to research
data
• Increasing requirements from funders to make data openly
available
• Growing demand for trustable and sustainable research data
repositories
Trend: data journals
OSTP Directive
„digitally formatted
scientific data resulting
from unclassified
research supported
wholly or in part by
Federal funding should
be stored and publicly
accessible to search,
retrieve, and analyze.“
Office of Science and Technology Policy. (2013). Increasing Access to the Results of Federally Funded Scientific Research.
Retrieved from http://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdf
The Royal Society
„ Where data justify it,
scientists should make
them available in an
appropriate data
repository.“
- Intelligent Openness
(accessible, assessable,
intelligible, useable)
The Royal Society. (2012). Science as an open enterprise. The Royal Society
Science Policy Centre report 02/12.
Retrieved from http://royalsociety.org/uploadedFiles/
Royal_Society_Content/policy/projects/sape/2012-06-20-SAOE.pdf.
Research Data Repositories
• EC: ICT infrastructures for e-science
„The landscape of data repositories
across Europe is fairly heterogeneous,
but there is a solid basis (…) to
overcome the fragmentation and enable
research communities to better manage,
use, share and preserve data.“
Communication from the Commission to the European Parliament, the Council, the European Economic and Social Committee and the Committee
of the Regions. COM(2009) 108 final. Retrieved from http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=COM:2009:0108:FIN:EN:PDF
The Landscape of Research Data
Repositories
funders
researchers
journals
research data
repositories
universities and
research labs
Where can I store my data?
Investigators are expected
to share their data!
Underlying data
must be accessible!
Where can I find data?
Should we offer
repositories
for all disciplines?
Open Data LMU, http://data.ub.uni-muenchen.de
PURR, http://research.hub.purdue.edu
Institutional RDR
Disciplinary RDR
PANGAEA, http://www.pangaea.de
GEO, http://www.ncbi.nlm.nih.gov/geo/
Figshare, http://figshare.com
Multidisciplinary RDR
BDPP, http://www.digitalpantheon.ch
SDDB, http://www.scientificdrilling.org
Project Specific RDR
re3data.org: Objectives
• global registry of research data repositories
• better understanding of the concept of research data
repositories
• help researchers, funding bodies, publishers and scholarly
institutions to find research data repositories
• promote a culture of sharing, increased access and better
visibility of research data
re3data.org: Status quo
• First project phase (2012/13)
• The registry went live in autumn 2012
 web-based registry
 definition of selection criteria
 formulation of a metadata scheme (vocabulary)
re3data.org: Aspects
David Marques (2013)
http://libraryconnect.elsevier.com/ar
ticles/best-practices/2013-
02/research-data-driving-new-
services
re3data.org: Vocabulary
• Vierkant, P., Spier, S., Rücknagel,
J., Gundlach, J., Fichtmüller, D.,
Kindling, M., Pampel, H., et al.
(2012). Vocabulary for the
Registration and Description of
Research Data Repositories.
Version 2.0. doi:10.2312/re3.002
Vocabulary in practice
• Review
• 37 criteria
• 25 controlled vocabularies
• Icons as representation of
sets of criteria
• Value added service for
researchers and repository
managers
A closer look at access icons
Access
Icons
Access to
Repository
open
open or
restricted
closed
Access to
Data
open,
(restricted,
closed)
restricted,
(closed)
closed
Upload
open or
restricted
open or
restricted
-
Orange: values have to be available
Icons and numbers
0
20
40
60
80
100
120
140
160
Certification Open Access Persistent Id All Aspects
From a total of 340 RDR in re3data
Champions
• CLARIN-ERIC
• Archaeology Data Service
• EASY (DANS)
• IQSS Dataverse network
• CrystalEye (beta)
• Durham HepData Project
• figshare
• Global Change Master
Directory (NASA)
• NeuroMorpho
• Neuroscience Information
Framework
• ClinicalTrials.gov
• Ecological Archives (ESA)
• PANGAEA
• WDC for Remote Sensing of
the Atmosphere
RDR by Country
US
GER
UK
CAN
NED
JPN
FRAN
AUS
DEN
Trends and Challenges
• Long tail
• Institutional perspective
• Workflow and information
continuum
• Take intelligent openness seriously
(accessible, assessable, intelligible, useable)
Thank you
With the exception of all photos and graphics, this slides are licensed under
the “Creative Commons Attribution 3.0 Germany (CC BY 3.0)“ Licence.

Elab 16 5-13-re3data-scholze-final

  • 1.
    Data management and governance- the re3data.org experience Frank Scholze | Karlsruhe Institute of Technology (KIT) European Library Advisory Board | Munich 16.5.2013
  • 2.
    Outline • Background • ResearchData Repositories • re3data.org
  • 3.
    Background • Research dataare valuable and ubiquitous • New technologies facilitate data-intensive science • Broad discussion about the permanent access to research data • Increasing requirements from funders to make data openly available • Growing demand for trustable and sustainable research data repositories Trend: data journals
  • 4.
    OSTP Directive „digitally formatted scientificdata resulting from unclassified research supported wholly or in part by Federal funding should be stored and publicly accessible to search, retrieve, and analyze.“ Office of Science and Technology Policy. (2013). Increasing Access to the Results of Federally Funded Scientific Research. Retrieved from http://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdf
  • 5.
    The Royal Society „Where data justify it, scientists should make them available in an appropriate data repository.“ - Intelligent Openness (accessible, assessable, intelligible, useable) The Royal Society. (2012). Science as an open enterprise. The Royal Society Science Policy Centre report 02/12. Retrieved from http://royalsociety.org/uploadedFiles/ Royal_Society_Content/policy/projects/sape/2012-06-20-SAOE.pdf.
  • 6.
    Research Data Repositories •EC: ICT infrastructures for e-science „The landscape of data repositories across Europe is fairly heterogeneous, but there is a solid basis (…) to overcome the fragmentation and enable research communities to better manage, use, share and preserve data.“ Communication from the Commission to the European Parliament, the Council, the European Economic and Social Committee and the Committee of the Regions. COM(2009) 108 final. Retrieved from http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=COM:2009:0108:FIN:EN:PDF
  • 7.
    The Landscape ofResearch Data Repositories funders researchers journals research data repositories universities and research labs Where can I store my data? Investigators are expected to share their data! Underlying data must be accessible! Where can I find data? Should we offer repositories for all disciplines?
  • 8.
    Open Data LMU,http://data.ub.uni-muenchen.de PURR, http://research.hub.purdue.edu Institutional RDR
  • 9.
  • 10.
  • 11.
  • 12.
    re3data.org: Objectives • globalregistry of research data repositories • better understanding of the concept of research data repositories • help researchers, funding bodies, publishers and scholarly institutions to find research data repositories • promote a culture of sharing, increased access and better visibility of research data
  • 13.
    re3data.org: Status quo •First project phase (2012/13) • The registry went live in autumn 2012  web-based registry  definition of selection criteria  formulation of a metadata scheme (vocabulary)
  • 14.
    re3data.org: Aspects David Marques(2013) http://libraryconnect.elsevier.com/ar ticles/best-practices/2013- 02/research-data-driving-new- services
  • 15.
    re3data.org: Vocabulary • Vierkant,P., Spier, S., Rücknagel, J., Gundlach, J., Fichtmüller, D., Kindling, M., Pampel, H., et al. (2012). Vocabulary for the Registration and Description of Research Data Repositories. Version 2.0. doi:10.2312/re3.002
  • 16.
    Vocabulary in practice •Review • 37 criteria • 25 controlled vocabularies • Icons as representation of sets of criteria • Value added service for researchers and repository managers
  • 17.
    A closer lookat access icons Access Icons Access to Repository open open or restricted closed Access to Data open, (restricted, closed) restricted, (closed) closed Upload open or restricted open or restricted - Orange: values have to be available
  • 18.
    Icons and numbers 0 20 40 60 80 100 120 140 160 CertificationOpen Access Persistent Id All Aspects From a total of 340 RDR in re3data
  • 19.
    Champions • CLARIN-ERIC • ArchaeologyData Service • EASY (DANS) • IQSS Dataverse network • CrystalEye (beta) • Durham HepData Project • figshare • Global Change Master Directory (NASA) • NeuroMorpho • Neuroscience Information Framework • ClinicalTrials.gov • Ecological Archives (ESA) • PANGAEA • WDC for Remote Sensing of the Atmosphere
  • 20.
  • 21.
    Trends and Challenges •Long tail • Institutional perspective • Workflow and information continuum • Take intelligent openness seriously (accessible, assessable, intelligible, useable)
  • 22.
    Thank you With theexception of all photos and graphics, this slides are licensed under the “Creative Commons Attribution 3.0 Germany (CC BY 3.0)“ Licence.