SlideShare a Scribd company logo
Preparing your data for sharing
and publishing
Varsha Khodiyar, PhD
MRC Cognition and Brain Sciences Unit
Open Science Day 20.11.2018
Howchameleonschangecolour
1
7719 respondents
White paper available from
https://doi.org/10.6084/m9.figshare.5975011
Survey data available from
https://doi.org/10.6084/m9.figshare.5971387
What are researchers concerned about when sharing data?
2
3
44
How to organise your data
5
Make sure your data are well organised
• Data files and folders labelled in an understandable way.
• Data files and folders organised in a logical, easy-to-
follow manner.
• Any acronyms used for data file/folder names clearly
defined, ideally in a README file.
• Data files in a format that are easy for others to reuse /
use the standard format used by your discipline.
6
Increasing reproducibility
• Include any additional information needed to understand the data,
methods, parameters, e.g. which instrument (make and model) was
used to measure blood carbon dioxide levels?
• Include availability statements for any code that was used to view,
parse or analyse the data, in support of the conclusions.
7
Help to organise your data is available
Springer Nature Research Data Support
Researchers
submit their
data files
securely
The Research
Data team
curates the data
and metadata
The data are
published and
linked to the
author’s paper
More information is available on our website here:
http://www.springernature.com/gb/group/data-
policy/data-support-services
8
No one other than the
creator can access the
data, or even knows that
it exists
Before data curation: a researcher’s dataset in a desktop
folder
The dataset is
stored as an
Excel file in a
desktop folder
The file title is not
comprehensible to
anyone but the
creator
No description or
keywords
available
9
Before curation begins
Once received, we check to make sure
that the dataset is suitable for our
curation services. Multiple files in any
format are accepted.
After making these checks, we begin
the curation process. If necessary
we may recommend that the
dataset is split into smaller groups
or collections.
Pre-curation data checks:
 The data aren’t sensitive
 The data don’t include
direct or indirect human
identifiers
 The data shouldn’t be in a
community repository
10
After Springer Nature Research Data Support
Working with the researcher’s manuscript or published paper, we draft a comprehensive
metadata record for the dataset which is sent to the researcher for approval before
being published. Embargoes can be applied if necessary.
The curated dataset will be published with
its own metadata record which includes
rich descriptive information, reuse
conditions, licence, DOI, metrics and
keywords
(this example is
https://doi.org/10.6084/m9.figshare.5259
415)
1111
Choosing a repository to store data
12
Selecting a repository for your data
Considerations:
1. Is there a discipline-specific repository for the type of data you
have generated?
2. Will access to the data need to be controlled?
3. If no discipline-specific repository is available for the data,
does your funder or institute mandate deposition to a
particular repository?
13
Indexing services Curated lists
Sources to help choose a data repository
NEW! Tools to help select repositories
www.nature.com/sdata/data-policies/repositories
https://repositoryfinder.test.datacite.org/
1414
Sensitive data
15
• Consider an appropriate patient consent
framework
 Consent to use data in current study
 Consent to use data for future research
 Consent to share data for use by other
research groups
• Don’t collect more than you need
Collecting sensitive data
16
• Remove direct identifiers
• Aggregate indirect identifiers into groups where possible
• Anonymization or de-identification?
• Use controlled access repositories,
and consider:
 Data use agreement?
 Data access conditions?
Sharing sensitive data
1717
Scholarly credit for generating and sharing research data
18
Data Journals at Springer Nature
www.nature.com/scientificdata
https://bmcresnotes.biomedcentral.com
Data Descriptor
Open access
Sound science
Emphasis on enabling
data reuse
Data peer review
Data Note
Open access
Sound science
Short format
19
Scientific Data, a Nature Research journal
Data Descriptor
Primary article type; sound
science and facilitates data
reuse
Analysis
New analyses or meta-
analyses of existing data
Article
Original reports on
advances in data sharing &
reuse
Comment
Announcements of broad
interest; usually invited
www.nature.com/scientificdata
20
Under the hood of a Data Descriptor
• Context for data generation (background)
• How was data generated?
• How was data processed?
• Where is the data?
• Synthesis
• Analysis
• Conclusions
21
Data peer review
www.nature.com/sdata/policies/for-referees
Experimental
Rigor and
Technical Data
Quality
Were data produced in a sound manner?
Technical quality of data – appropriate statistical analyses?
Experimental rigor - appropriate depth, coverage?
Completeness
of the
Description
Sufficient detail to allow others to reproduce these steps?
Sufficient detail to allow others to reuse this data?
Consistent with relevant minimum reporting standards?
Integrity of the
Data Files and
Repository
Record
Do data files appear complete and match manuscript
descriptions?
Are data archived to the most appropriate repository?
22
What types of data can be published?
Decades old
dataset
Standalone
dataset
Data that has been
used in an analysis
article
Large
consortium
dataset
Data from a
single
experiment
Any data that the researcher
finds valuable and that others
might find useful too
Data associated with a
high impact analysis
article
23
When can a data paper be published?
After data
analysis has been
published
Before analysis has
been published
Authors not
intending to
analyse data
Data papers can be
submitted and published at
any point in the research
workflow, i.e. whenever it
makes most sense for your
data
After data
analysis has been
published
Before the
analysis has been
published
Publication alongside
analysis article
2424
Still unsure about research data?
25
What does research data training offer?
• Directly addresses the main challenges of data sharing
• Part of the Nature Research Academies, offering trusted quality
and value
• A unique perspective and trusted experience within the realm of
research data
• Training for both researchers & information professionals,
appropriate for all levels
• Courses are customised to meet your needs, and are brought to
you (and your researchers)
Springer Nature Research Data Training
Source: https://doi.org/10.6084/m9.figshare.5975011
26
Queries are answered within two business days
Run by members of the Springer Nature Research Data team
Expertise in data curation and management, archiving and
digital preservation, copyright and licensing, Open Access
publishing
Always encourage best practices, e.g. the use of community
repositories for specific data types
Email: researchdata@springernature.com
http://www.springernature.com/gp/group/data-policy/helpdesk
Springer Nature Research Data Helpdesk
2727
The story behind the image
How chameleons change colour
Chameleons are well known for their potential to
change colour but recent research on panther
chameleons is the first to find two layers of
crystal containing cells, each with a potentially
different purpose. Researchers from the
University of Geneva have speculated that the
deeper crystal containing cells may help with the
regulation of temperature, whilst the more
superficial layer of colour changing cells could be
responsible for camouflage or mating displays.
Thank you for listening
Varsha Khodiyar, PhD
Data Curation Manager, Springer Nature
(Data Curation Editor, Scientific Data)
varsha.khodiyar@nature.com

More Related Content

What's hot

The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
Varsha Khodiyar
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to Practice
Tom Plasterer
 
Shareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your ResearchShareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your Research
London School of Hygiene and Tropical Medicine
 
How to write a data management plan
How to write a data management planHow to write a data management plan
How to write a data management plan
OpenExeter
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
DataONE
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: Metadata
DataONE
 
Data management plan format
Data management plan formatData management plan format
Data management plan format
Wouter Gerritsma
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to know
Historic Environment Scotland
 
Data Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumData Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access Symposium
Merce Crosas
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
Jisc
 
DataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management Planning
DataONE
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Research Support Team, IT Services, University of Oxford
 
Data management plan template
Data management plan templateData management plan template
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
Tom Plasterer
 
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Natsuko Nicholls
 
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Support Team, IT Services, University of Oxford
 
The DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with ConfidenceThe DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with Confidence
Merce Crosas
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
EDINA, University of Edinburgh
 
Data Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDsData Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDs
Sarah Anna Stewart
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
Marieke Guy
 

What's hot (20)

The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to Practice
 
Shareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your ResearchShareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your Research
 
How to write a data management plan
How to write a data management planHow to write a data management plan
How to write a data management plan
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: Metadata
 
Data management plan format
Data management plan formatData management plan format
Data management plan format
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to know
 
Data Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumData Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access Symposium
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
 
DataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management Planning
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
 
Data management plan template
Data management plan templateData management plan template
Data management plan template
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
 
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
 
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
 
The DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with ConfidenceThe DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with Confidence
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
Data Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDsData Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDs
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 

Similar to Preparing your data for sharing and publishing

Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
Varsha Khodiyar
 
Effective research data management
Effective research data managementEffective research data management
Effective research data management
Catherine Gold
 
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
The University of Edinburgh
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
Anita de Waard
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
dri_ireland
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016
IzzyChad
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
Sarah Jones
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
University of Arizona
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
Projeto RCAAP
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
Sarah Anna Stewart
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
ARDC
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
dancrane_open
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environment
philipdurbin
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflow
Varsha Khodiyar
 
Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13
DataDryad
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
Rebecca Raworth, MLIS
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
Rebecca Raworth, MLIS
 
Practical strategies for RDM
Practical strategies for RDMPractical strategies for RDM
Practical strategies for RDM
dancrane_open
 
Practical Strategies for Research Data Management
Practical Strategies for Research Data ManagementPractical Strategies for Research Data Management
Practical Strategies for Research Data Management
Daniel Crane
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE
 

Similar to Preparing your data for sharing and publishing (20)

Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
 
Effective research data management
Effective research data managementEffective research data management
Effective research data management
 
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environment
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflow
 
Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
 
Practical strategies for RDM
Practical strategies for RDMPractical strategies for RDM
Practical strategies for RDM
 
Practical Strategies for Research Data Management
Practical Strategies for Research Data ManagementPractical Strategies for Research Data Management
Practical Strategies for Research Data Management
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 

More from Varsha Khodiyar

Digital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data scienceDigital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data science
Varsha Khodiyar
 
Lessons from the UK: Data access, patient trust & real-world impact with heal...
Lessons from the UK: Data access, patient trust & real-world impact with heal...Lessons from the UK: Data access, patient trust & real-world impact with heal...
Lessons from the UK: Data access, patient trust & real-world impact with heal...
Varsha Khodiyar
 
COVID-19 variants, vaccines and tests
COVID-19 variants, vaccines and testsCOVID-19 variants, vaccines and tests
COVID-19 variants, vaccines and tests
Varsha Khodiyar
 
COVID-19 variants and vaccines
COVID-19 variants and vaccinesCOVID-19 variants and vaccines
COVID-19 variants and vaccines
Varsha Khodiyar
 
Data citation and sharing during article publication
Data citation and sharing during article publicationData citation and sharing during article publication
Data citation and sharing during article publication
Varsha Khodiyar
 
The importance of research data repositories
The importance of research data repositoriesThe importance of research data repositories
The importance of research data repositories
Varsha Khodiyar
 
What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?
Varsha Khodiyar
 
Five essentials factors for unlocking the potential for Open Research Data
Five essentials factors for unlocking the potential for Open Research Data Five essentials factors for unlocking the potential for Open Research Data
Five essentials factors for unlocking the potential for Open Research Data
Varsha Khodiyar
 
New approaches to data management: supporting FAIR data sharing at Springer N...
New approaches to data management: supporting FAIR data sharing at Springer N...New approaches to data management: supporting FAIR data sharing at Springer N...
New approaches to data management: supporting FAIR data sharing at Springer N...
Varsha Khodiyar
 
The value of data curation as part of the publishing process
The value of data curation as part of the publishing processThe value of data curation as part of the publishing process
The value of data curation as part of the publishing process
Varsha Khodiyar
 
Facilitating good research data management practice as part of scholarly publ...
Facilitating good research data management practice as part of scholarly publ...Facilitating good research data management practice as part of scholarly publ...
Facilitating good research data management practice as part of scholarly publ...
Varsha Khodiyar
 
Practical challenges for researchers in data sharing
Practical challenges for researchers in data sharingPractical challenges for researchers in data sharing
Practical challenges for researchers in data sharing
Varsha Khodiyar
 
Update from Data policy standardisation and implementation IG
Update from Data policy standardisation and implementation IGUpdate from Data policy standardisation and implementation IG
Update from Data policy standardisation and implementation IG
Varsha Khodiyar
 
Peer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journalPeer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journal
Varsha Khodiyar
 
Data Publishing and Institutional Repositories
Data Publishing and Institutional RepositoriesData Publishing and Institutional Repositories
Data Publishing and Institutional Repositories
Varsha Khodiyar
 
Gaining credit for sharing research data
Gaining credit for sharing research dataGaining credit for sharing research data
Gaining credit for sharing research data
Varsha Khodiyar
 
Clinical Data Publishing at Scientific Data
Clinical Data Publishing at Scientific DataClinical Data Publishing at Scientific Data
Clinical Data Publishing at Scientific Data
Varsha Khodiyar
 
Privacy and Publication: challenges and opportunities for clinical data
Privacy and Publication: challenges and opportunities for clinical dataPrivacy and Publication: challenges and opportunities for clinical data
Privacy and Publication: challenges and opportunities for clinical data
Varsha Khodiyar
 
Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?
Varsha Khodiyar
 
Share & Flourish workshop, Leiden, August 2014
Share & Flourish workshop, Leiden, August 2014Share & Flourish workshop, Leiden, August 2014
Share & Flourish workshop, Leiden, August 2014
Varsha Khodiyar
 

More from Varsha Khodiyar (20)

Digital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data scienceDigital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data science
 
Lessons from the UK: Data access, patient trust & real-world impact with heal...
Lessons from the UK: Data access, patient trust & real-world impact with heal...Lessons from the UK: Data access, patient trust & real-world impact with heal...
Lessons from the UK: Data access, patient trust & real-world impact with heal...
 
COVID-19 variants, vaccines and tests
COVID-19 variants, vaccines and testsCOVID-19 variants, vaccines and tests
COVID-19 variants, vaccines and tests
 
COVID-19 variants and vaccines
COVID-19 variants and vaccinesCOVID-19 variants and vaccines
COVID-19 variants and vaccines
 
Data citation and sharing during article publication
Data citation and sharing during article publicationData citation and sharing during article publication
Data citation and sharing during article publication
 
The importance of research data repositories
The importance of research data repositoriesThe importance of research data repositories
The importance of research data repositories
 
What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?What role can publishers play in the open data ecosystem?
What role can publishers play in the open data ecosystem?
 
Five essentials factors for unlocking the potential for Open Research Data
Five essentials factors for unlocking the potential for Open Research Data Five essentials factors for unlocking the potential for Open Research Data
Five essentials factors for unlocking the potential for Open Research Data
 
New approaches to data management: supporting FAIR data sharing at Springer N...
New approaches to data management: supporting FAIR data sharing at Springer N...New approaches to data management: supporting FAIR data sharing at Springer N...
New approaches to data management: supporting FAIR data sharing at Springer N...
 
The value of data curation as part of the publishing process
The value of data curation as part of the publishing processThe value of data curation as part of the publishing process
The value of data curation as part of the publishing process
 
Facilitating good research data management practice as part of scholarly publ...
Facilitating good research data management practice as part of scholarly publ...Facilitating good research data management practice as part of scholarly publ...
Facilitating good research data management practice as part of scholarly publ...
 
Practical challenges for researchers in data sharing
Practical challenges for researchers in data sharingPractical challenges for researchers in data sharing
Practical challenges for researchers in data sharing
 
Update from Data policy standardisation and implementation IG
Update from Data policy standardisation and implementation IGUpdate from Data policy standardisation and implementation IG
Update from Data policy standardisation and implementation IG
 
Peer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journalPeer Reviewing Data: experiences from a data journal
Peer Reviewing Data: experiences from a data journal
 
Data Publishing and Institutional Repositories
Data Publishing and Institutional RepositoriesData Publishing and Institutional Repositories
Data Publishing and Institutional Repositories
 
Gaining credit for sharing research data
Gaining credit for sharing research dataGaining credit for sharing research data
Gaining credit for sharing research data
 
Clinical Data Publishing at Scientific Data
Clinical Data Publishing at Scientific DataClinical Data Publishing at Scientific Data
Clinical Data Publishing at Scientific Data
 
Privacy and Publication: challenges and opportunities for clinical data
Privacy and Publication: challenges and opportunities for clinical dataPrivacy and Publication: challenges and opportunities for clinical data
Privacy and Publication: challenges and opportunities for clinical data
 
Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?
 
Share & Flourish workshop, Leiden, August 2014
Share & Flourish workshop, Leiden, August 2014Share & Flourish workshop, Leiden, August 2014
Share & Flourish workshop, Leiden, August 2014
 

Recently uploaded

EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
AlguinaldoKong
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
Cherry
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
muralinath2
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
plant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptxplant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptx
yusufzako14
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
Michel Dumontier
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 

Recently uploaded (20)

EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
plant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptxplant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptx
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 

Preparing your data for sharing and publishing

  • 1. Preparing your data for sharing and publishing Varsha Khodiyar, PhD MRC Cognition and Brain Sciences Unit Open Science Day 20.11.2018 Howchameleonschangecolour
  • 2. 1 7719 respondents White paper available from https://doi.org/10.6084/m9.figshare.5975011 Survey data available from https://doi.org/10.6084/m9.figshare.5971387 What are researchers concerned about when sharing data?
  • 3. 2
  • 4. 3
  • 5. 44 How to organise your data
  • 6. 5 Make sure your data are well organised • Data files and folders labelled in an understandable way. • Data files and folders organised in a logical, easy-to- follow manner. • Any acronyms used for data file/folder names clearly defined, ideally in a README file. • Data files in a format that are easy for others to reuse / use the standard format used by your discipline.
  • 7. 6 Increasing reproducibility • Include any additional information needed to understand the data, methods, parameters, e.g. which instrument (make and model) was used to measure blood carbon dioxide levels? • Include availability statements for any code that was used to view, parse or analyse the data, in support of the conclusions.
  • 8. 7 Help to organise your data is available Springer Nature Research Data Support Researchers submit their data files securely The Research Data team curates the data and metadata The data are published and linked to the author’s paper More information is available on our website here: http://www.springernature.com/gb/group/data- policy/data-support-services
  • 9. 8 No one other than the creator can access the data, or even knows that it exists Before data curation: a researcher’s dataset in a desktop folder The dataset is stored as an Excel file in a desktop folder The file title is not comprehensible to anyone but the creator No description or keywords available
  • 10. 9 Before curation begins Once received, we check to make sure that the dataset is suitable for our curation services. Multiple files in any format are accepted. After making these checks, we begin the curation process. If necessary we may recommend that the dataset is split into smaller groups or collections. Pre-curation data checks:  The data aren’t sensitive  The data don’t include direct or indirect human identifiers  The data shouldn’t be in a community repository
  • 11. 10 After Springer Nature Research Data Support Working with the researcher’s manuscript or published paper, we draft a comprehensive metadata record for the dataset which is sent to the researcher for approval before being published. Embargoes can be applied if necessary. The curated dataset will be published with its own metadata record which includes rich descriptive information, reuse conditions, licence, DOI, metrics and keywords (this example is https://doi.org/10.6084/m9.figshare.5259 415)
  • 12. 1111 Choosing a repository to store data
  • 13. 12 Selecting a repository for your data Considerations: 1. Is there a discipline-specific repository for the type of data you have generated? 2. Will access to the data need to be controlled? 3. If no discipline-specific repository is available for the data, does your funder or institute mandate deposition to a particular repository?
  • 14. 13 Indexing services Curated lists Sources to help choose a data repository NEW! Tools to help select repositories www.nature.com/sdata/data-policies/repositories https://repositoryfinder.test.datacite.org/
  • 16. 15 • Consider an appropriate patient consent framework  Consent to use data in current study  Consent to use data for future research  Consent to share data for use by other research groups • Don’t collect more than you need Collecting sensitive data
  • 17. 16 • Remove direct identifiers • Aggregate indirect identifiers into groups where possible • Anonymization or de-identification? • Use controlled access repositories, and consider:  Data use agreement?  Data access conditions? Sharing sensitive data
  • 18. 1717 Scholarly credit for generating and sharing research data
  • 19. 18 Data Journals at Springer Nature www.nature.com/scientificdata https://bmcresnotes.biomedcentral.com Data Descriptor Open access Sound science Emphasis on enabling data reuse Data peer review Data Note Open access Sound science Short format
  • 20. 19 Scientific Data, a Nature Research journal Data Descriptor Primary article type; sound science and facilitates data reuse Analysis New analyses or meta- analyses of existing data Article Original reports on advances in data sharing & reuse Comment Announcements of broad interest; usually invited www.nature.com/scientificdata
  • 21. 20 Under the hood of a Data Descriptor • Context for data generation (background) • How was data generated? • How was data processed? • Where is the data? • Synthesis • Analysis • Conclusions
  • 22. 21 Data peer review www.nature.com/sdata/policies/for-referees Experimental Rigor and Technical Data Quality Were data produced in a sound manner? Technical quality of data – appropriate statistical analyses? Experimental rigor - appropriate depth, coverage? Completeness of the Description Sufficient detail to allow others to reproduce these steps? Sufficient detail to allow others to reuse this data? Consistent with relevant minimum reporting standards? Integrity of the Data Files and Repository Record Do data files appear complete and match manuscript descriptions? Are data archived to the most appropriate repository?
  • 23. 22 What types of data can be published? Decades old dataset Standalone dataset Data that has been used in an analysis article Large consortium dataset Data from a single experiment Any data that the researcher finds valuable and that others might find useful too Data associated with a high impact analysis article
  • 24. 23 When can a data paper be published? After data analysis has been published Before analysis has been published Authors not intending to analyse data Data papers can be submitted and published at any point in the research workflow, i.e. whenever it makes most sense for your data After data analysis has been published Before the analysis has been published Publication alongside analysis article
  • 25. 2424 Still unsure about research data?
  • 26. 25 What does research data training offer? • Directly addresses the main challenges of data sharing • Part of the Nature Research Academies, offering trusted quality and value • A unique perspective and trusted experience within the realm of research data • Training for both researchers & information professionals, appropriate for all levels • Courses are customised to meet your needs, and are brought to you (and your researchers) Springer Nature Research Data Training Source: https://doi.org/10.6084/m9.figshare.5975011
  • 27. 26 Queries are answered within two business days Run by members of the Springer Nature Research Data team Expertise in data curation and management, archiving and digital preservation, copyright and licensing, Open Access publishing Always encourage best practices, e.g. the use of community repositories for specific data types Email: researchdata@springernature.com http://www.springernature.com/gp/group/data-policy/helpdesk Springer Nature Research Data Helpdesk
  • 28. 2727 The story behind the image How chameleons change colour Chameleons are well known for their potential to change colour but recent research on panther chameleons is the first to find two layers of crystal containing cells, each with a potentially different purpose. Researchers from the University of Geneva have speculated that the deeper crystal containing cells may help with the regulation of temperature, whilst the more superficial layer of colour changing cells could be responsible for camouflage or mating displays. Thank you for listening Varsha Khodiyar, PhD Data Curation Manager, Springer Nature (Data Curation Editor, Scientific Data) varsha.khodiyar@nature.com