Research Data Management, Open Data and Zenodo - 6th National Open Access Conference and OpenAIRE2020 Workshop - Turkey

Research Data Management,
Open Data and Zenodo
6thNationalOpenAccessConferenceandOpenAIRE2020Workshop-Turkey
PedroPrincipe
UniversityofMinho(OpenAIREsupport&trainingmanager)
October 24, 2017, Izmir, Turkey

33expertnodesalloverEurope
tohelpingwith:OpenScience
training &support, OApolicy
alignment, Technical assistance...
HumanNetwork
DigitalNetwork
IntegratedScientificInformation
Systemwithaccessto:22mi
publications, 590Kdatasets, 1006
data providers, 800Κpublications
linked toprojectsfrom14funders…
Two faceted e-Infrastructure

This is where you type in the event5
Network is our super power

TurkishpresenceinOpenAIREinfrastructure:
+
Substantive growth in the number of repositories
Improving metadata quality and interoperability ofscholarly systems
Increase the visibility and impact ofresearch outputs
Alignment ofopen access policies and practices in research institutions
6

Policies and Practices
handin handfor asustainable OA
From Open Access to Open Science…
more facets to consider
Open Science

This is where you type in the event
Relevance of
Open Data and
Research Data
Management
Funders Data
Management
and Sharing
policies
Publishers Data
availability
requirements
Strengthening
the Role of
Institutions
ZENODO, open
repository
from OpenAIRE
and CERN
TOPICS (1/5)

Notjustaboutopenaccesstopublications…
OPEN RESEARCH DATA

good research needs good data
DigitalCurationCenter
10
http://epicgraphic.com/data-cake

Image:https://www.flickr.com/photos/dmh650/4031607067/in/gallery-wlef70-72157633022909105/
Data management is a part of good
research practice.
RCUK Policy and Code of Conduct on the
Governance of Good Research Conduct
Responsible data management is
part of good research.
NWO – Introduction to the pilot Data Management

Makeresearcheasier
Why manage data?
Savedataforlater
Sharedataforre-use
Getcreditforit
Avoidaccusations offraudor
badscience
Meetfunder orinstitution
requirements

Researchdatalifecycle
CREATING
DATA
PROCESSING
DATA
ANALYSING
DATA
PRESERVING
DATA
GIVING
ACCESS TO
DATA
RE-USING
DATA
CREATING DATA: designing research, DMPs, planning
consent, locate existing data, data collection and
management, capturing and creating metadata
RE-USING DATA: follow-up research,
new research, undertake research
reviews, scrutinising findings,
teaching & learning
ACCESS TO DATA: distributing data,
sharing data, controlling access,
establishing copyright, promoting
data
PRESERVING DATA: data storage, back-up & archiving,
migrating to best format & medium, creating metadata and
documentation
ANALYSING DATA: interpreting, &
deriving data, producing outputs,
authoring publications, preparing for
sharing
PROCESSING DATA: entering,
transcribing, checking, validating and
cleaning data, anonymising data,
describing data, manage and store
data
Ref: UK Data Archive: http://www.data-archive.ac.uk/create-manage/life-cycle

Scheme from University of California- Irvine http://www.lib.uci.edu/dss

I1.(meta)datauseaformal,accessible,shared,and
broadlyapplicablelanguageforknowledge
representation.
I2.(meta)datausevocabulariesthatfollowFAIR
principles;
I3.(meta)dataincludequalifiedreferencestoother
(meta)data;
R1.meta(data)arerichlydescribedwithapluralityof
accurateandrelevantattributes;
R1.1.(meta)dataarereleasedwithaclearand
accessibledatausagelicense;
R1.2.(meta)dataareassociatedwithdetailed
provenance;
R1.3.(meta)datameetdomain-relevantcommunity
standards;
A1.(meta)dataareretrievablebytheiridentifier
usingastandardizedcommunicationsprotocol;
A1.1theprotocolisopen,free,anduniversally
implementable;
A1.2.theprotocolallowsforanauthenticationand
authorizationprocedure,wherenecessary;
A2.metadataareaccessible,evenwhenthedata
arenolongeravailable;
Findable:
F1.(meta)dataareassignedagloballyuniqueand
persistentidentifier;
F2.dataaredescribedwithrichmetadata;
F3.metadataclearlyandexplicitlyincludethe
identifierofthedataitdescribes;
F4.(meta)dataareregisteredorindexedina
searchableresource;
Interoperable:
Accessible:
Reusable:
FAIR DATA PRINCIPLES
16

Relevance of
Open Data and
Research Data
Management
Funders Data
Management
and Sharing
policies
Publishers Data
availability
requirements
Strengthening
the Role of
Institutions
ZENODO, open
repository
from OpenAIRE
and CERN
TOPICS (2/5)

80%
of research is
publicly
funded
Source:“AcademicPublishing:SurveyoffunderssupportsthebenignOpenAccessoutcomepricedinto
shares,HSBCGlobalResearch,”February11,2013:
https://www.research.hsbc.com/midas/Res/RDV?ao=20&key=RxArFbnG1P&n=360010.PDF
www.sparc.arl.org

Ofthe20organizationsthatmakeuptheforum,11have
dedicatedpoliciesondatasharingandmanagement.
1.Datamanagement plans
2.Timeframesfordatasharing
3.Requirements fortheuseofpublic
databases or repositories
4.Ethics andconfidentiality requirements
5.Measures totrackorensurecompliance
Funding agencies RDM guidelines and requirements:
(Reviewoffunders’datasharingpoliciesofthePublicHealthResearchDataForum)

• Descriptionofdatatobecollected/created(i.e.
content,type,format,volume...)
• Standards/methodologiesfordatacollection&
management
• EthicsandIntellectualProperty(highlightrestrictionson
datasharinge.g.embargoes,confidentiality)
• Plansfordatasharingandaccess(i.e.how,when,to
whom)
• Strategyforlong-termpreservation
Some funders that require Data Management Plans
21

22
Principles and Recommendations for
successful Open Data policies:

1. Develop explicit policies foropenaccesstoresearchdatawith
clearrolesandresponsibilities.
2. Adoptacomprehensive approach infunding the
implementation ofopen accesstoandpreservation ofresearch
data.
3. Reinforce thesignificance oftheDataManagement Plan(DMP)
toembedandpromote datamanagement asadistinct activity
within theresearchprocess.
4. Raiseawareness andpromote openresearchdatainviewof
leading anopen science paradigm.
RECODE funders specific recommendations:

#1-Openaccesstoresearchdataisanenablerofhighqualityresearch,
afacilitatorofinnovationandsafeguardsgoodresearchpractice.
#2-Therearesoundreasonswhytheopennessofresearchdatamay
needtoberestrictedbutanyrestrictionsmustbejustifiedand
justifiable.
#3-Openaccesstoresearchdatacarriesasignificantcost,which
shouldberespectedbyallparties.
#4-Therightofthecreatorsofresearchdatatoreasonablefirstuseis
recognized.
#5-Useofothers’datashouldalwaysconformtolegal,ethicaland
regulatoryframeworksincludingappropriateacknowledgement.
Concordat on Open Research Data - principles

#6-Good datamanagement isfundamental toallstages ofthe
researchprocess andshould beestablished attheoutset.
#7-Datacuration isvitaltomakedatauseful forothersandfor
long-term preservation ofdata.
#8-Datasupporting publications should beaccessible bythe
publication dateandshould beinaciteable form.
#9-Support forthedevelopment ofappropriate dataskillsis
recognised asaresponsibility forallstakeholders.
#10-Regular reviewsofprogress towards openresearchdata
should beundertaken.
Concordat on Open Research Data - principles

To make the research data generated
by selected Horizon 2020 projects
accessible with as few restrictions as
possible, while at the same time
protecting sensitive data from
inappropriate access.
Information already paid for by
the public should not be paid for
again. Open data is data that is
free to access and reuse
EC
Open Research Data Pilot: aims

DATA, including metadata,
needed to validate the results in
scientific publications.
Other data, including metadata,
as specified in the Data
Management Plan.
Open Research Data policy requirements
Horizon 2020 grantees are encouraged to also share datasets beyond publication

 IncompatiblewiththeHorizon2020obligationtoprotectresultsiftheycan
reasonablybeexpectedtobecommerciallyorindustriallyexploited;
 Incompatiblewiththeneedforconfidentialityinconnectionwithsecurity
issues;
 Incompatiblewithexistingrulesconcerningtheprotectionofpersonaldata;
 Iftheprojectwillnotgenerate/collectanyresearchdata;
 IfthereareotherlegitimatereasonstonottakepartinthePilot
Reasons for total or partial opting out
30

Write, and keep up-to-date, a
Data Management Plan.
Deposit the data in a research
data repository.
Open Research Data policy requirements
Licensing research data - Horizon 2020 Open Access guidelines point to:

1. Datasummary
2. FAIRdata
1. Makingdatafindable,includingprovisionsfor
metadata
2. Makingdataopenlyaccessible
3. Makingdatainteroperable
4. Increasedatare-use(throughclarifying
licences)
3. Allocationofresources
4. Datasecurity
5. Ethicalaspects
6. Otherissues
H2020 template
ExampleH2020DMPsinZenodo
• HelixNebula–HighEnergyPhysicsexample
https://zenodo.org/record/48171#.WATexnriF40
• Tweether–engineering(micro-electronics)example
https://zenodo.org/record/55791#.WATei3riF40
• AutoPost–ICTexample
https://zenodo.org/record/56107#.WATefXriF40
Morelistedat:
www.dcc.ac.uk/resources/data-management-plans/guidance-
examples
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf

Aweb-basedtooltohelpresearcherswriteDMPs
IncludesatemplateforHorizon2020
https://dmponline.dcc.ac.uk

Moreinformation:
https://www.openaire.eu/opendatapilot-repository
Zenodo:http://www.zenodo.org
Re3data.org:http://www.re3data.org
Where to find a repository?

Relevance of
Open Data and
Research Data
Management
Funders Data
Management
and Sharing
policies
Publishers Data
availability
requirements
Strengthening
the Role of
Institutions
ZENODO, open
repository
from OpenAIRE
and CERN
TOPICS (3/5)

Publishers Data Availability Requirements
38

Data availability policy - publishers
Scenarios:
• sendthedatasettothepublisherand
thepublisher publishesthedataset
online.
• thepublisher askstheauthortodeposit
thedatasetinatrustedrepositoryand
tonotifythepublisher.
• thepublisher askstheauthortogive
contactinformationforthosewhowish
tohaveaccesstothedata.
Therequirementsaregenerallyfoundonthe
journal'swebsite.
Anumberofjournalshaveaspecific
DataAvailabilityor DataArchiving
Policy
This is where you type in the event 39

Relevance of
Open Data and
Research Data
Management
Funders Data
Management
and Sharing
policies
Publishers Data
availability
requirements
Strengthening
the Role of
Institutions
ZENODO, open
repository
from OpenAIRE
and CERN
TOPICS (4/5)

PhD student
university
research teamindividual
researcher
supra-
university
Where do I safely keep my data from
my fieldwork, as I travel home?
How can I best keep years worth
of research data secure and
accessible for when I and others
need to re-use it? How do we ensure compliance to
funders’ requirement for several
years of open access to data?
How do we ensure we have access
to our research data after some of
the team have left?
How can our research
collaborations share data,
and make them available
once complete?
Seeking the real win + win + win + win + win… Tony Weir, Director, IT Infrastructure, UoE (2014)

RESEARCH LIKE CYCLE
DMPs, existing data,
documentation, store,
deposit and share datasets
INFRASTRUCTURE
Data archives, repositories,
access, preservation, DOI,
licensing, protection, cloud
GOVERNANCE
Funder, University, publisher,
Research institutions,
national policy, protocols
Research Data Management:
Institutional Strategies? Services? Roadmap.
TRAINING
LEGAL & ETHICAL SUPPORT

Recommendations for
Supporting the Long Tail
of Research Data
RDA

RECOMMENDATIONS
FOR SUPPORTING
THE LONG TAIL OF
RESEARCH DATA

1.Recognize and understand the diversity ofdata created at your
organization, or through your funding support, and develop appropriate
frameworks for managing those data.
• Theuseofdatamanagementplans,alongwithlocalinstitutionalsupportfordata
managementwillcontribute toensuringthatlongtaildataaremanagedandshared
appropriately.
2.Scale existing funding mechanisms to support research data
management for small research projects
• Funding fordatamanagementisoftenavailableforlargeresearchactivities, butmuchlessso
forsmallerscaleresearchprojects.
7 Recommendations for Supporting the Long Tail of Research Data

3. Expand and strengthen the institutional role inmanaging research data.
• Manylongtaildatasetsareatriskofbeinglostbecausetheyarenotmanagedappropriately.Localsupport
forresearcherswillincreasetheadoptionofstandardsandbestpracticesearlieronintheresearchprocess.
• Weencourageuniversitiesandinstitutionstooffersupportservicesforresearchdatamanagement(RDM).
Inparticular,RDMservicesshouldbecomepartofthestandardserviceprovisionofresearchlibraries.
4. Develop and apply common standards across institutions and domains
to ensure greater interoperability across datasets.
• Adistributednetworkofresearchdatamanagementserviceshasmanyadvantagesincludinggreatersupport
forlocalneedsandrequirements,morecomprehensivecoverageandincreasedresilienceagainstloss.
• Werecommendthedevelopmentofcommon,highlevelmetadataelementsthatwillsupportdataintegration
acrossdiversetypesofresearchdataanddisciplines.

5.Supportreproducibilityandtransparencyofresearchbylinkingdata,software,andliterature.
• Reliablylinkingtheliteraturetotheunderlyingdataandtools,suchassoftwareandcode
supportingresearchconclusions,willmakeiteasierforotherstoverifyclaims,whilstalso
facilitatinggreaterreproducibilityofresearch.Weencouragethecommunitytoworktogetherto
identifybestpracticesforlinkingresearchdatawithrelatedliteratureandassociatedtools.
6.Establishgovernancestructuresthatreflectthediversedimensionsofresearchdata.
• Weneedtoensurethatthediversityoflong-taildata,bothintermsofscopeanddiscipline,are
wellrepresentedintheevolvingRDMgovernancestructures.
7.Developcoherentprinciplesandpoliciesforthecollectionandpreservationoflongtaildata.
• Institutionsandfundersneedguidancetodeterminegoodpracticesforassessingthepotential
valueofresearchdata,anddatarepositoriesneedtodeveloppoliciesfortheselection,collection,
curation,andstewardshipofdataandforevaluatingwhichdatahavelongtermvalue.

Relevance of
Open Data and
Research Data
Management
Funders Data
Management
and Sharing
policies
Publishers Data
availability
requirements
Strengthening
the Role of
Institutions
ZENODO, open
repository
from OpenAIRE
and CERN
TOPICS (5/5)

Cross-disciplinaryrepositories
50
Long tail of research data

• Catch-all repository for EUfunded research
• Upto 50 GBperupload
• Data stored in theCERN Data Center
• Persistentidentifiers (DOIs) for every upload
• Includes article level metrics
• Free for thelong tail of Science
• Opento all research outputsfrom all disciplines
• Easily addEC fundinginformation and report via OpenAIRE
Short Facts about Zenodo

Zenodo (OpenAIRE/CERN repository)
52
www.zenodo.org
H2020: Option to gather,
preserve and share project’s
scientific output

.
upload
..
describe
…
publish

Publish
58
http://www.datacite.org
www.openaire.eu

www.openaire.eu
@openaire_eu
facebook.com/groups/openaire
pedroprincipe@sdum.uminho.pt

Research Data Management, Open Data and Zenodo - 6th National Open Access Conference and OpenAIRE2020 Workshop - Turkey

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Research Data Management, Open Data and Zenodo - 6th National Open Access Conference and OpenAIRE2020 Workshop - Turkey

Similar to Research Data Management, Open Data and Zenodo - 6th National Open Access Conference and OpenAIRE2020 Workshop - Turkey (20)

Recently uploaded

Recently uploaded (20)

Research Data Management, Open Data and Zenodo - 6th National Open Access Conference and OpenAIRE2020 Workshop - Turkey

Editor's Notes