SlideShare a Scribd company logo
Reproducibility in Safe Havens
Louise Corti
Service Director Data Publishing and Access
UK Data Service
UKRN From data to metadata: Ensuring reproducibility in
biomedical research
Online
22 October 2020
Copyright © 2020 UK Data Service. Created by University of Essex
Safe haven data
Sensitive survey and administrative data
• Potential risk of disclosure
• Low levels of geography
Data access/ availability
• Restricted under the 5 SAFES framework
(ONS)
• Access only via Secure Lab (safe haven)
• High access bar
• Process for reproduction not set up by
journals
The UK Data Service
• Social science data service, funded by the UKRI ESRC
• Curates and provides access to data for research and
teaching
• Trusted Digital Repository (TDR), accredited to ISO27001 Info
Security Management standard, Digital Economy Act
Processor application in process
• Work closely with research funders and key data producers
/institutions: research centres; UK NSIs, govt departments,
British Library etc
• Around 8000 data collections, from open to secure
Accredited Researcher status
Social /economic sciences: Accredited Researcher concept
 Managed by the UK Statistics Authority
 Secure data owner requirement e.g. ONS, ESRC
 Based on experience/expertise and responsible attitude
 Understanding of disclosure risk and producing Safe Outputs
 Attend a one-day training course and pass a test
Medical research: focus on ‘bona fide researcher’ concept*
• the professional expertise and experience to conduct bona fide
research
• a formal relationship with a bona fide research organisation that
requires compliance with appropriate research governance and
management systems
* https://mrc.ukri.org/documents/pdf/data-sharing-from-population-and-patient-studies/
Reproducibility factors in safe havens
 FAIR data: Use of DOIs for all UKDS data
 FAIR data: Open data formats; high quality documentation
 Mandated use of DOIs in all checked outputs for publication
 Code can be taken out but is reviewed for risk
✘ No Lab rules for coding
✘ No mandated use of R Markdown /Jupyter Notebook
✘ No mandate from data owners to be research reproducible
✘ Weaker provenance chain for ‘research-ready’ admin data
(creation, cleaning and versioning)
cascad = Certification Agency for Scientific Code and Data
Certifying reproducibility (Safe havens)
cascad reproducibility certification attests that the numerical
results reported in a scientific article can be reproduced from a
set of numerical resources (code and data) provided by the
authors
• cascad.tech
• Non-profit academic initiative
• Founded by researchers
• CNRS, HEC Paris, U. Orléans
7Thanks to Christophe Perignon for cascad images
Step 1: ‘Submission’ of paper and digital resources (code + data)
Step 2: Compliance check by cascad staff:
• Gains access to secure environment
• Verifies whether the submitted resources comply with the guidelines
• Checks presentation/structure of code /data, aiming for
interpretable and reusable resources
• Aligns with guidance from AEA and Social Science Data Editors
• Non-conformity = no certificate
Certification process
8
Step 3: Review process
cascad assigns a reproducibility reviewer who:
• Gains access to the safe data environment
• Runs the code and compares the outputs from the code to
the numerical results of the paper
• Create an execution report: data description, code
description, replication steps, and findings
• Proposes a ‘reproducibility rating’
• RRR to DD (Perfect to Serious Discrepancies)
9
Certification process
Step 4: Final certification
• The reproducibility editor assigns the final rating
based on the results reported in the execution report
• The reproducibility certificate and the execution
report are sent to the author
• Cascad staff uploads the certificate on an open
access repository (e.g. Zenodo) with a DOI
• Full execution report available in safe haven
10
Certification process
UK Data Service Pilot
• 4 economists /sociologists Secure Lab users invited to
Be Reproduced to go onto try for the cascad certificate:
3 month deadline
• None would submit current code for rerunning against
outcomes until ’tidied up’
• 1 agreed to try to make deadline
• Started to rework code, but too much of a challenge, they could
not reproduce themselves
• Outcome = must start from scratch
• Approaches discussed at UKDS/ONS Workshop in Feb
2020 called #LoveYourCode
Longer-term goals
 Cascad –type process for safe havens
 Build reproducible processes into data production /analysis
• Encourage code tracking tools
• Replacement of older unreproducible models with new
documented code
• Mandate cleaning, value-added, final code available in safe
haven to enable reproduction and reuse
• Ensure checking process is established, costed (dedicated staff)
• Ensure data have DOIs and DOIs are used
 Training on creating ‘good code’ (e.g. Turing Way)
 Use of Reproducibility Certificates as incentives
 Early adopters – bare all! How I became reproducible
Questions
Louise Corti
UK Data Service
corti@essex.ac.uk

More Related Content

Similar to UKRN workshop 20201022_Corti

Making the most of Open Data
Making the most of Open DataMaking the most of Open Data
Making the most of Open Data
Louise Corti
 
2010 CLARA Nijmegen - Data Seal of Approval tutorial
2010 CLARA Nijmegen - Data Seal of Approval tutorial2010 CLARA Nijmegen - Data Seal of Approval tutorial
2010 CLARA Nijmegen - Data Seal of Approval tutorial
Dirk Roorda
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing dataWorld Agroforestry (ICRAF)
 
RDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the Data
Robin Rice
 
FAIR play?
FAIR play? FAIR play?
FAIR play?
Sarah Jones
 
The role of open data in enhancing reproducibility
The role of open data in enhancing reproducibility The role of open data in enhancing reproducibility
The role of open data in enhancing reproducibility
Louise Corti
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
Sarah Anna Stewart
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
Karlsruhe Institute of Technology (KIT)
 
Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries
dri_ireland
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
EDINA, University of Edinburgh
 
Ariadne: Lifecycles
Ariadne: LifecyclesAriadne: Lifecycles
Ariadne: Lifecycles
ariadnenetwork
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Blue BRIDGE
 
Accessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five SafesAccessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five Safes
Louise Corti
 
Ingrid Dillo - Trustworthy repositories for open research data
Ingrid Dillo - Trustworthy repositories for open research dataIngrid Dillo - Trustworthy repositories for open research data
Ingrid Dillo - Trustworthy repositories for open research data
dri_ireland
 
How the Core Trust Seal (CTS) Enables FAIR Data
How the Core Trust Seal (CTS) Enables FAIR DataHow the Core Trust Seal (CTS) Enables FAIR Data
How the Core Trust Seal (CTS) Enables FAIR Data
dri_ireland
 
How core trust seal enables FAIR data - Natalie Harrower
How core trust seal enables FAIR data - Natalie HarrowerHow core trust seal enables FAIR data - Natalie Harrower
How core trust seal enables FAIR data - Natalie Harrower
OpenAIRE
 
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
The University of Edinburgh
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the Netherlands
Jisc RDM
 
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
Ciarán Quinn
 

Similar to UKRN workshop 20201022_Corti (20)

Making the most of Open Data
Making the most of Open DataMaking the most of Open Data
Making the most of Open Data
 
2010 CLARA Nijmegen - Data Seal of Approval tutorial
2010 CLARA Nijmegen - Data Seal of Approval tutorial2010 CLARA Nijmegen - Data Seal of Approval tutorial
2010 CLARA Nijmegen - Data Seal of Approval tutorial
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing data
 
RDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the Data
 
FAIR play?
FAIR play? FAIR play?
FAIR play?
 
The role of open data in enhancing reproducibility
The role of open data in enhancing reproducibility The role of open data in enhancing reproducibility
The role of open data in enhancing reproducibility
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
Ariadne: Lifecycles
Ariadne: LifecyclesAriadne: Lifecycles
Ariadne: Lifecycles
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
 
Accessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five SafesAccessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five Safes
 
Ingrid Dillo - Trustworthy repositories for open research data
Ingrid Dillo - Trustworthy repositories for open research dataIngrid Dillo - Trustworthy repositories for open research data
Ingrid Dillo - Trustworthy repositories for open research data
 
How the Core Trust Seal (CTS) Enables FAIR Data
How the Core Trust Seal (CTS) Enables FAIR DataHow the Core Trust Seal (CTS) Enables FAIR Data
How the Core Trust Seal (CTS) Enables FAIR Data
 
How core trust seal enables FAIR data - Natalie Harrower
How core trust seal enables FAIR data - Natalie HarrowerHow core trust seal enables FAIR data - Natalie Harrower
How core trust seal enables FAIR data - Natalie Harrower
 
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the Netherlands
 
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
Core Trust Seal for Trustworthy Data Repositories, 2018-04-19
 

More from Louise Corti

Transparency and reproducibility in research
Transparency and reproducibility in researchTransparency and reproducibility in research
Transparency and reproducibility in research
Louise Corti
 
Love Your Code Workshop Introduction_Corti_Engeli
Love Your Code Workshop Introduction_Corti_EngeliLove Your Code Workshop Introduction_Corti_Engeli
Love Your Code Workshop Introduction_Corti_Engeli
Louise Corti
 
Use of data in safe havens: ethics and reproducibility issues
Use of data in safe havens: ethics and reproducibility issuesUse of data in safe havens: ethics and reproducibility issues
Use of data in safe havens: ethics and reproducibility issues
Louise Corti
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
Louise Corti
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production process
Louise Corti
 
The art of depositing social science data: maximising quality and ensuring go...
The art of depositing social science data: maximising quality and ensuring go...The art of depositing social science data: maximising quality and ensuring go...
The art of depositing social science data: maximising quality and ensuring go...
Louise Corti
 
How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive
Louise Corti
 

More from Louise Corti (7)

Transparency and reproducibility in research
Transparency and reproducibility in researchTransparency and reproducibility in research
Transparency and reproducibility in research
 
Love Your Code Workshop Introduction_Corti_Engeli
Love Your Code Workshop Introduction_Corti_EngeliLove Your Code Workshop Introduction_Corti_Engeli
Love Your Code Workshop Introduction_Corti_Engeli
 
Use of data in safe havens: ethics and reproducibility issues
Use of data in safe havens: ethics and reproducibility issuesUse of data in safe havens: ethics and reproducibility issues
Use of data in safe havens: ethics and reproducibility issues
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production process
 
The art of depositing social science data: maximising quality and ensuring go...
The art of depositing social science data: maximising quality and ensuring go...The art of depositing social science data: maximising quality and ensuring go...
The art of depositing social science data: maximising quality and ensuring go...
 
How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive How metadata drives data sharing; UK Data Archive
How metadata drives data sharing; UK Data Archive
 

Recently uploaded

一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 

Recently uploaded (20)

一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 

UKRN workshop 20201022_Corti

  • 1. Reproducibility in Safe Havens Louise Corti Service Director Data Publishing and Access UK Data Service UKRN From data to metadata: Ensuring reproducibility in biomedical research Online 22 October 2020 Copyright © 2020 UK Data Service. Created by University of Essex
  • 2. Safe haven data Sensitive survey and administrative data • Potential risk of disclosure • Low levels of geography Data access/ availability • Restricted under the 5 SAFES framework (ONS) • Access only via Secure Lab (safe haven) • High access bar • Process for reproduction not set up by journals
  • 3. The UK Data Service • Social science data service, funded by the UKRI ESRC • Curates and provides access to data for research and teaching • Trusted Digital Repository (TDR), accredited to ISO27001 Info Security Management standard, Digital Economy Act Processor application in process • Work closely with research funders and key data producers /institutions: research centres; UK NSIs, govt departments, British Library etc • Around 8000 data collections, from open to secure
  • 4.
  • 5. Accredited Researcher status Social /economic sciences: Accredited Researcher concept  Managed by the UK Statistics Authority  Secure data owner requirement e.g. ONS, ESRC  Based on experience/expertise and responsible attitude  Understanding of disclosure risk and producing Safe Outputs  Attend a one-day training course and pass a test Medical research: focus on ‘bona fide researcher’ concept* • the professional expertise and experience to conduct bona fide research • a formal relationship with a bona fide research organisation that requires compliance with appropriate research governance and management systems * https://mrc.ukri.org/documents/pdf/data-sharing-from-population-and-patient-studies/
  • 6. Reproducibility factors in safe havens  FAIR data: Use of DOIs for all UKDS data  FAIR data: Open data formats; high quality documentation  Mandated use of DOIs in all checked outputs for publication  Code can be taken out but is reviewed for risk ✘ No Lab rules for coding ✘ No mandated use of R Markdown /Jupyter Notebook ✘ No mandate from data owners to be research reproducible ✘ Weaker provenance chain for ‘research-ready’ admin data (creation, cleaning and versioning)
  • 7. cascad = Certification Agency for Scientific Code and Data Certifying reproducibility (Safe havens) cascad reproducibility certification attests that the numerical results reported in a scientific article can be reproduced from a set of numerical resources (code and data) provided by the authors • cascad.tech • Non-profit academic initiative • Founded by researchers • CNRS, HEC Paris, U. Orléans 7Thanks to Christophe Perignon for cascad images
  • 8. Step 1: ‘Submission’ of paper and digital resources (code + data) Step 2: Compliance check by cascad staff: • Gains access to secure environment • Verifies whether the submitted resources comply with the guidelines • Checks presentation/structure of code /data, aiming for interpretable and reusable resources • Aligns with guidance from AEA and Social Science Data Editors • Non-conformity = no certificate Certification process 8
  • 9. Step 3: Review process cascad assigns a reproducibility reviewer who: • Gains access to the safe data environment • Runs the code and compares the outputs from the code to the numerical results of the paper • Create an execution report: data description, code description, replication steps, and findings • Proposes a ‘reproducibility rating’ • RRR to DD (Perfect to Serious Discrepancies) 9 Certification process
  • 10. Step 4: Final certification • The reproducibility editor assigns the final rating based on the results reported in the execution report • The reproducibility certificate and the execution report are sent to the author • Cascad staff uploads the certificate on an open access repository (e.g. Zenodo) with a DOI • Full execution report available in safe haven 10 Certification process
  • 11. UK Data Service Pilot • 4 economists /sociologists Secure Lab users invited to Be Reproduced to go onto try for the cascad certificate: 3 month deadline • None would submit current code for rerunning against outcomes until ’tidied up’ • 1 agreed to try to make deadline • Started to rework code, but too much of a challenge, they could not reproduce themselves • Outcome = must start from scratch • Approaches discussed at UKDS/ONS Workshop in Feb 2020 called #LoveYourCode
  • 12. Longer-term goals  Cascad –type process for safe havens  Build reproducible processes into data production /analysis • Encourage code tracking tools • Replacement of older unreproducible models with new documented code • Mandate cleaning, value-added, final code available in safe haven to enable reproduction and reuse • Ensure checking process is established, costed (dedicated staff) • Ensure data have DOIs and DOIs are used  Training on creating ‘good code’ (e.g. Turing Way)  Use of Reproducibility Certificates as incentives  Early adopters – bare all! How I became reproducible
  • 13. Questions Louise Corti UK Data Service corti@essex.ac.uk