A basic course on research data management for PhD students, consisting of 4 parts. The course was given at Eindhoven University of Technology (TU/e) on 24-01-2017
A basic course on Research data management, part 3: sharing your data
1. A basic course on Research data management
part 3: sharing your data
PROOF course Information Literacy and
Research Data Management
TU/e, 24-01-2017
l.osinski@tue.nl, TU/e IEC/Library
Available under CC BY-SA license, which permits copying
and redistributing the material in any medium or format &
adapting the material for any purpose, provided the original
author and source are credited & you distribute the
adapted material under the same license as the original
2. Research data management
Sharing your data, or making your data findable and accessible
with good data practices
+ protecting your data: back up, access control; file naming, organizing
data, versioning
→ sharing your data via collaboration platforms and archives
Caring for your data, or making your data re-usable and
interoperable with good data practices
+ metadata, tidy data, licenses
Research data management
what was it again
3. Overview research data sharing and storage services
[Figure: services arranged by phase (during research, after research) and by provider (local ICT services, institution, discipline)]
Data sharing per se is pretty straightforward
4. DataverseNL [TU/e only]: data sharing platform for active research data [based on Harvard’s
Dataverse Project] where you may:
store your data in an organized and safe way
clearly describe your data
version control of your data
arrange access to your data
get recognition for your data
[collaborate on your data]
Various disciplinary initiatives: Open Science Framework, OpenML, RodRep, CRCNS…
General data sharing platforms:
SURFdrive [TU/e only]: Dutch academic Dropbox, 100 GB, maximum data transfer 16 GB
every TU/e employee can use SURFdrive
Google Drive, Dropbox, Beehub…
SURF Filesender [secure data transfer up to 500 GB!, WeTransfer up to 2 GB]
Sharing your data
collaboration or sharing platforms (during your research)
Storage and backup of data through DANS [Dutch
Archiving and Networking Services]
Data transfer: up to 2 GB per dataset
Dataverse via 4TU.ResearchData: up to 50 GB free
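The transfer limits quoted on this slide can be turned into a quick check of which service fits a given file. This is only a sketch: the limits are copied from the slide and may have changed since 2017, the unit is assumed to be decimal gigabytes, and the names `TRANSFER_LIMITS_GB` and `services_for` are made up for illustration.

```python
GB = 10**9  # assumption: the slide's sizes are decimal gigabytes

# Per-transfer limits as quoted on the slide (not checked against current service terms).
TRANSFER_LIMITS_GB = {
    "WeTransfer": 2,
    "SURFdrive": 16,
    "SURF Filesender": 500,
}


def services_for(size_bytes: int) -> list[str]:
    """Return the services whose quoted transfer limit covers size_bytes."""
    return [
        name
        for name, limit_gb in TRANSFER_LIMITS_GB.items()
        if size_bytes <= limit_gb * GB
    ]


if __name__ == "__main__":
    for size_gb in (1, 10, 200):
        options = services_for(size_gb * GB)
        print(f"{size_gb} GB ->", ", ".join(options))
```

For a real file, `os.path.getsize(path)` gives the size in bytes to pass to `services_for`.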
5. How to create an account:
Go to: https://dataverse.nl/dvn/
Click ‘Log in’ (at the top right); do not click ‘Log in with DVN account’
Select Eindhoven University of Technology and log on with your TU/e username
and password
When asked, give permission to share your data by answering Yes or clicking
this tab
When asked to create an account, answer Yes or click this tab.
Once you have created an account, your username is your email
address
You now have a user account with DataverseNL, but it is not yet possible for you to
‘do something’ with the account!
Sharing your data
DataverseNL
If you are interested in using DataverseNL, please contact me (Leon Osinski)
6. On request
“I'd like to thank E.J. Masicampo and Daniel LaLande for sharing and allowing me to share
their data…”
Daniël Lakens (2014), What p-hacking really looks like: A comment on Masicampo & LaLande (2012)
On a (personal) website
“Let me start by saying that the reason why I put all excel files online, including all the
detailed excel formulas about data constructions and adjustments, is precisely because I
want to promote an open and transparent debate about these important and sensitive
measurement issues.”
Thomas Piketty, My response to the Financial Times, HuffPost The Blog, 29-05-2014 ;
originally published as Addendum: Response to FT, 28-05-2014
A data journal
Journal of open psychology data, Geoscience data journal,
Data in brief, Scientific data, Data reports
Sharing your data
after your research has ended
Source: www.aukeherrema.nl
7. Choose a repository where other researchers in your discipline are sharing their data, for
example LXcat (for plasma data) or GenBank (for genetic sequence data)
Overview of research data repositories: Re3data.org
Use a repository that at least assigns a persistent identifier to your data (DOI) and requires
that you provide adequate metadata
General or multidisciplinary repositories: Zenodo, Figshare, DANS, Dryad, B2SHARE
4TU.ResearchData
+ small to medium-sized data sets, long tail data
+ static data, ‘frozen’ data sets, ‘milestone’ data sets
+ preferably nonproprietary software formats suitable for long-term preservation
+ DOIs [ persistent identifiers for citability and retrievability ]
+ open access
+ long-term availability, Data Seal of Approval
+ Data Citation Index (Thomson Reuters)
+ self-upload (single data sets < 3 GB)
+ special collections of related data sets
Sharing your data
in an established repository (after your research has ended)
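The two minimum requirements above (a DOI and adequate metadata) can be checked before depositing. The sketch below is an illustration only: the list of required fields is a hypothetical minimum, not the schema of any repository named on this slide, and the DOI pattern is a loose syntax check, not proof that the DOI is registered.

```python
import re

# Hypothetical minimum set of descriptive fields; each repository defines its own.
REQUIRED_FIELDS = ("title", "creator", "year", "publisher", "doi")

# Loose DOI syntax: "10.", a numeric prefix, a slash, then a non-empty suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


def metadata_problems(record: dict) -> list[str]:
    """List the required fields that are empty, plus a note if the DOI looks malformed."""
    problems = [
        field for field in REQUIRED_FIELDS
        if not str(record.get(field, "")).strip()
    ]
    doi = str(record.get("doi", "")).strip()
    if doi and not DOI_PATTERN.match(doi):
        problems.append("doi (malformed)")
    return problems


if __name__ == "__main__":
    # Placeholder record; only the DOI is taken from the URL list of this course.
    record = {
        "title": "Example data set",
        "creator": "A. Researcher",
        "year": 2017,
        "publisher": "Example repository",
        "doi": "10.4121/uuid:26aba40d-8b2d-435b-b5af-6d4bfbd7a270",
    }
    print(metadata_problems(record))
```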
8. Link your data to your publication
Sharing your data
link your data to your publication
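In practice, linking data and publication means citing each one's DOI in the other. A minimal sketch, assuming both the data set and the publication have a DOI: the helper names `doi_url` and `availability_statement` and the wording of the statement are invented for illustration, and the two DOIs in the example are the ones given in the URL list of this course.

```python
# Prefixes that are stripped so that only the bare DOI remains.
_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def doi_url(doi: str) -> str:
    """Turn a bare DOI, a 'doi:' string or an older dx.doi.org link into a https://doi.org/ URL."""
    doi = doi.strip()
    for prefix in _PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    return "https://doi.org/" + doi


def availability_statement(data_doi: str, paper_doi: str) -> str:
    """Build an example sentence that links a publication to its data set."""
    return (
        f"The data underlying this publication ({doi_url(paper_doi)}) "
        f"are available at {doi_url(data_doi)}."
    )


if __name__ == "__main__":
    print(availability_statement(
        data_doi="http://dx.doi.org/10.4121/uuid:26aba40d-8b2d-435b-b5af-6d4bfbd7a270",
        paper_doi="http://dx.doi.org/10.6100/IR780920",
    ))
```

The same link should also appear in the other direction: the metadata of the data set can cite the DOI of the publication.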
9. 1. DataverseNL: https://www.dataverse.nl/dvn/
2. Harvard’s Dataverse Project: http://dataverse.org/
3. Open Science Framework: https://cos.io/osf/
4. OpenML: http://www.openml.org
5. RodRep: http://www.rodrep.com/
6. CRCNS: http://crcns.org/
7. SURFdrive: https://www.surfdrive.nl/
8. Google Drive: https://www.google.com/drive/
9. Dropbox: https://www.dropbox.com/
10. Beehub: https://beehub.nl/system/
11. SURF filesender: https://filesender.surfnet.nl/
12. Data on request (blog post Daniel Lakens): http://daniellakens.blogspot.nl/2014/09/what-p-hacking-really-looks-like.html
13. Data on personal website (Thomas Piketty): http://piketty.pse.ens.fr/en/capital21c2
14. Data journal: Journal of Open Psychology Data: http://openpsychologydata.metajnl.com/
15. Data journal: Geoscience Data Journal: http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2049-6060
URLs of mentioned webpages
in order of appearance #1
10. 16. Data journal: Data in brief: http://www.journals.elsevier.com/data-in-brief
17. Data journal: Scientific data: http://www.nature.com/sdata/
18. Data journal: Data reports: http://www.frontiersin.org/news/Data_Reports_a_new_type_of_peer-reviewed_article_in_Frontiers_journals/1051?utm_source=FRN&utm_medium=ECOM&utm_campaign=TWT_FRN_1502_datareport
19. Research data catalogue: Re3data.org: http://service.re3data.org/search/results?term=
20. Publishing data: Zenodo: http://www.zenodo.org/
21. Publishing data: Figshare: http://www.figshare.com
22. Publishing data: DANS: http://www.dans.knaw.nl/en
23. Publishing data: Dryad: http://datadryad.org/
24. Publishing data: B2SHARE: https://b2share.eudat.eu/
25. Publishing data: 4TU.ResearchData: https://data.4tu.nl/
26. Long tail research data: http://www.nature.com/neuro/journal/v17/n11/fig_tab/nn.3838_F1.html
27. Nonproprietary software formats:
http://datacentrum.3tu.nl/fileadmin/editor_upload/File_formats/Digital_Preservation_Support_levels.pdf
28. Data Seal of Approval: http://www.datasealofapproval.org
URLs of mentioned webpages
in order of appearance #2
11. 29. Data Citation Index (Thomson Reuters): http://wokinfo.com/products_tools/multidisciplinary/dci/
30. Self upload 4TU.ResearchData: https://data.4tu.nl/account/login/?next=/upload/
31. Data sets underlying PhD thesis Joos Buijs: http://dx.doi.org/10.4121/uuid:26aba40d-8b2d-435b-b5af-6d4bfbd7a270
32. PhD thesis Joos Buijs: http://dx.doi.org/10.6100/IR780920
URLs of mentioned webpages
in order of appearance #3