The importance of research
data repositories
Varsha Khodiyar, PhD
Indo-GBC Seminar: Data Sharing at a Global
Level
26th October 2020
IllustrationinspiredbytheworkofChienShiungWu
1
Global Data Sharing / 26th Oct 2020
Why share research data?
2
Global Data Sharing / 26th Oct 2020
What do we mean by FAIR data?
3
Global Data Sharing / 26th Oct 2020
How do we enable FAIR data?
Encourage
the use of
appropriate
repositories
Provide guidance
on data licensing
Ensure the
use of
discipline
specific
standards
Facilitate the
capture of
detailed
metadata
4
Global Data Sharing / 26th Oct 2020
The repository landscape is complex...
5
Global Data Sharing / 26th Oct 2020
...but there are essentially two types of repository
Type Features Examples
Discipline
specific
● Specific for a type of data
● Technical curation by data-specific
experts
● Require data-specific metadata from
authors during data deposition
● May provide data-specific discovery,
analysis and visualization tools
PANGAEA
The Cancer Imaging Archive
chEMBL
Generalist
● Accept all types of data
● No limitations on file format
● Provide data archiving option when no
appropriate discipline specific
repositories are available
figshare
Institutional data repositories
6
Global Data Sharing / 26th Oct 2020
Finding an appropriate repository is not a trivial task
7
Global Data Sharing / 26th Oct 2020
Some communities have mandates on repository use
http://www.insdc.org/
Genetic and genomic data are subject to a
community mandate, and must be deposited to a
INSDC repository.
8
Global Data Sharing / 26th Oct 2020
Some communities are working to increase data findability and
interoperability
http://www.copdess.org/enabling-fair-data-project/
Signatories agree to ensure that Earth and Space sciences data
are deposited to domain-specific repository, upon article
publication
9
Global Data Sharing / 26th Oct 2020
The TRUST Principles are a useful framework for digital
repositories
Lin, D., Crabtree, J., Dillo, I. et al. The TRUST Principles for digital repositories. Sci Data 7, 144 (2020).
https://doi.org/10.1038/s41597-020-0486-7
Khodiyar, V. Future-proofing research data – it’s a question of TRUST Springboard blog post (2020)
https://www.springernature.com/gp/advancing-discovery/blog/blogposts/future-proofing-data-trust/18044648
10
Global Data Sharing / 26th Oct 2020
Cross-publisher consensus on key repository criteria
http://doi.org/10.5281/zenodo.4084763
11
Global Data Sharing / 26th Oct 2020
11
The story behind the image
Chien Shiung Wu (1912–1997)
Chien Shiung Wu was a Chinese American experimental
physicist best known for conducting The Wu experiment
that bears her name. This experiment showed that the
conservation of parity was violated by a weak interaction
and it was possible to distinguish between a mirrored
variation of the world and the mirror image of the current
world. This discovery earned Wu the Wolf Prize in Physics
in 1978.
Thank you
Varsha Khodiyar, PhD
Data Curation Manager, Springer Nature
varsha.khodiyar@nature.com
@varsha_khodiyar
For information on Research Data Support and
other data-related activities at Springer Nature
researchdata@springernature.com
http://go.nature.com/ResearchDataServices

The importance of research data repositories

  • 1.
    The importance ofresearch data repositories Varsha Khodiyar, PhD Indo-GBC Seminar: Data Sharing at a Global Level 26th October 2020 IllustrationinspiredbytheworkofChienShiungWu
  • 2.
    1 Global Data Sharing/ 26th Oct 2020 Why share research data?
  • 3.
    2 Global Data Sharing/ 26th Oct 2020 What do we mean by FAIR data?
  • 4.
    3 Global Data Sharing/ 26th Oct 2020 How do we enable FAIR data? Encourage the use of appropriate repositories Provide guidance on data licensing Ensure the use of discipline specific standards Facilitate the capture of detailed metadata
  • 5.
    4 Global Data Sharing/ 26th Oct 2020 The repository landscape is complex...
  • 6.
    5 Global Data Sharing/ 26th Oct 2020 ...but there are essentially two types of repository Type Features Examples Discipline specific ● Specific for a type of data ● Technical curation by data-specific experts ● Require data-specific metadata from authors during data deposition ● May provide data-specific discovery, analysis and visualization tools PANGAEA The Cancer Imaging Archive chEMBL Generalist ● Accept all types of data ● No limitations on file format ● Provide data archiving option when no appropriate discipline specific repositories are available figshare Institutional data repositories
  • 7.
    6 Global Data Sharing/ 26th Oct 2020 Finding an appropriate repository is not a trivial task
  • 8.
    7 Global Data Sharing/ 26th Oct 2020 Some communities have mandates on repository use http://www.insdc.org/ Genetic and genomic data are subject to a community mandate, and must be deposited to a INSDC repository.
  • 9.
    8 Global Data Sharing/ 26th Oct 2020 Some communities are working to increase data findability and interoperability http://www.copdess.org/enabling-fair-data-project/ Signatories agree to ensure that Earth and Space sciences data are deposited to domain-specific repository, upon article publication
  • 10.
    9 Global Data Sharing/ 26th Oct 2020 The TRUST Principles are a useful framework for digital repositories Lin, D., Crabtree, J., Dillo, I. et al. The TRUST Principles for digital repositories. Sci Data 7, 144 (2020). https://doi.org/10.1038/s41597-020-0486-7 Khodiyar, V. Future-proofing research data – it’s a question of TRUST Springboard blog post (2020) https://www.springernature.com/gp/advancing-discovery/blog/blogposts/future-proofing-data-trust/18044648
  • 11.
    10 Global Data Sharing/ 26th Oct 2020 Cross-publisher consensus on key repository criteria http://doi.org/10.5281/zenodo.4084763
  • 12.
    11 Global Data Sharing/ 26th Oct 2020 11 The story behind the image Chien Shiung Wu (1912–1997) Chien Shiung Wu was a Chinese American experimental physicist best known for conducting The Wu experiment that bears her name. This experiment showed that the conservation of parity was violated by a weak interaction and it was possible to distinguish between a mirrored variation of the world and the mirror image of the current world. This discovery earned Wu the Wolf Prize in Physics in 1978. Thank you Varsha Khodiyar, PhD Data Curation Manager, Springer Nature varsha.khodiyar@nature.com @varsha_khodiyar For information on Research Data Support and other data-related activities at Springer Nature researchdata@springernature.com http://go.nature.com/ResearchDataServices