SlideShare a Scribd company logo
1 of 18
Managing sensitive data at the
Australian Data Archive
“Making Data Social” webinar series
29 March 2017
Dr. Steven McEachern
Director, Australian Data Archive
ANU Centre for Social Research and Methods
Australian National University
Overview
• Sensitive data and the 5 Safes model
• Access to sensitive data in Australia
• Applying the 5 Safes model at ADA
• Sensitive data and the data lifecycle
Sensitive data
• “Sensitive data are data that can be
used to identify anindividual, species, object,
process,or locationthat introduces a risk
of discrimination, harm, or unwanted
attention.”
– ANDS Guide on Publishing and Sharing Sensitive
Data, p.7
– http://www.ands.org.au/__data/assets/pdf_file/0
010/489187/Sensitive-data.pdf
The 5 safes
1. Safe people: Can the researchers be trusted to do the
right thing?
2. Safe projects: Is the data to be used for an appropriate
purpose?
3. Safe settings: Is the environment in which the analysis
takes place safe?
4. Safe data: Is the data appropriately protected?
5. Safe output: Is there a low risk of disclosure in research
outputs?
Desai, T., F. Ritchie and R. Welpton (2016) Five Safes: designing data access for
research. Economics Working Paper Series
1601, University of the West of England.
http://www2.uwe.ac.uk/faculties/BBS/Documents/1601.pdf
What do researchers expect?
(or What is wanted? :-)
• “We emphasize that direct access to micro-data
is critical for success. Alternatives such as access
to synthetic data or submission of computer
programs to agency employees will not address
the key problem of restoring US leadership with
cutting-edge policy-relevant research.”
• Card, Chetty, Feldstein and Saez, 2010 (emphasis
in original)
– http://rajchetty.com/chettyfiles/NSFdataaccess.pdf
What is expected?
• “Here's what you need to do if you want an anonymised 1% sample
of the US Census
– Go to Google and type US Census 1% sample, click on link to the
Census.
– Download each of the state files from the FTP site and merge them
yourself. Or just check things out for one of the states. Whatever you
like.
– Start mucking about to test whether your pet theory is plausible.
• Here's what you need to do if you want an anoymised sample of
the NZ Census, or a Confidentialised Unit Record File (CURF) of any
of big Stats series:
– Go to Stats NZ's site, here.
– Follow the instructions below: …”
• (Followed by several pages of instructions, Application Process,
Assessment Criteria, Methods of Access, …)
Eric Crampton, the New Zealand Initiative, Wellington, formerly University of
Canterbury. http://offsettingbehaviour.blogspot.com.au/2015/10/curf-and-
turf.html
Can we bridge depositor and user
expectations?
I think so. Consider Card et al. again:
“We believe that five conditions must be satisfied to make a data
access program sustainable and efficient:
a) fair and open competition for data access based on scientific merit
b) sufficient bandwidth to accommodate a large number of projects
simultaneously
c) inclusion of younger scholars and graduate students in the
research teams that can access the data
d) direct access to de-identified micro data through local statistical
offices or, more preferably, secure remote connections
e) systematic electronic monitoring to allow immediate disclosure
of statistical results and prevent any disclosure of individual
records”
Current models in Australia
ABS:
• Confidentialised Unit
Record Files (CURFs)
• RADL
• ABSDL
• TableBuilder
ADA:
• Confidentialised Unit
Record Files (CURFs)
Shared (often remote
access) infrastructure:
• AURIN
• SURE (PHRN)
• Data linkage facilities
Ad hoc arrangements:
• “Secure rooms”
• Departmental
arrangements
Applying the 5 Safes
People Projects Settings Data Output
CURFs Yes? Yes? Yes? YES YES
TableBuilder No No YES YES YES
RADL Yes? Yes? YES YES YES
ABSDL Yes? Yes? YES YES YES
ABS Remote
Data Lab Yes Yes? YES YES YES
ADA Yes? No No YES No
AURIN No No YES YES Yes?
SURE (PHRN) Yes Yes YES No? ???
Data Linkage
facilities No? YES Yes? YES ???
Secure rooms Yes? Yes? YES No? ???
Australian experience
• Safe data
– Confidentialisation: ADA, ABS, DSS (HILDA, etc.)
– Indirect access to data: TableBuilder, ADA
• Safe settings
– Aggregated data: TableBuilder, AURIN
– Remote: RADL, ABS (Remote) Data Lab
– Secure environments: ABS (On site) Data Lab,
secure rooms
5 safes: lesser emphasis on…
• Safe outputs
– Difficult to scale (e.g. data lab output reviews)
– This is changing – e.g. TableBuilder is automated
– But need to consider replication and reproducibility
• Safe researchers and safe projects
– Considered in most models, but not closely monitored
– May be difficult to monitor? (Similar issues to the
reporting of research outputs in universities)
– Universities could (and do!!) provide imprimatur for
their staff and students
Frameworks for research practice
• There are existing Australian frameworks for
researcher accountability and responsibilities:
– the Australian Code for the Responsible Conduct of
Research (ACRCR), which sets out institutional and
researcher responsibilities for conduct of research
– (Note that this is currently under review)
– Human Research Ethics Committees (HREC)
• Increasingly, professional and journal
requirements for data sharing:
– E.g. PLOS One, AEA, DA-RT (political science)
– https://www.aeaweb.org/aer/data.php
– http://journals.plos.org/plosone/s/data-availability
Relevant content from ACRCR
• S.2: Management of research data and primary materials
– E.g. 2.7 Maintain confidentiality of research data and primary
materials
– Researchers given access to confidential information must
maintain that confidentiality. Primary materials and confidential
research data must be kept in secure storage. Confidential
information must only be used in ways agreed with those who
provided it. Particular care must be exercised when confidential
data are made available for discussion.
• S.4: Publication and dissemination of research findings
– E.g. 4.2.3 Institutions must ensure that the sponsors of research
understand the importance of publication in research and do
not delay publication beyond the time needed to protect
intellectual property and other relevant interests.
• S.9: Breaches of the Code and misconduct in research
ADA model
• Safe data: data is anonymised
(confidentialised) either prior to deposit or by
ADA archivists
• Safe people: virtually all data access is
mediated, and users must be identified and
provide contact and supervisor details
• Safe projects: users provide a project
description
• Safe settings and safe outputs: NOT applied
ABS Remote Data Lab (virtual enclave)
• Safe data: less of a focus – but the lab does not prohibit
use of safe data practices
– Risk: individual researchers can see individual records
– BUT this assumes “unsafe” people (researchers)
• Safe settings: Remote access environment hosted at
ABS
– Challenge: cost of establishing the system
• Safe outputs: outputs limited only to methods
approved through the access environment (i.e. no
printing)
– Risk: photographing the screen, taking notes
– Again, assumes “unsafe” people
– Challenge: managing output checking
• Safe people:
– Institutional support
• Note ACRCR – code of conduct, and HREC
– Training for researchers prior to access
• Intended breaches are uncommon (per background paper)
• Focus therefore on unintended breach
• Highlight also alternate access options to reduce breach due
to limitations of access method
– Assessing people: research background?
• Experience is difficult to evaluate here
• How would you build up a track record?
• Safe projects
– May be necessary for legislative reasons
– Should this matter?
• Basic research might generate just as useful insights
A suite of options
• Different existing models (each a mix of the 5
safes) all have their place
• Safe people can be incorporated into existing
models
– Many current models assume the “intruder”
– International evidence suggests this is not the case
• ADA has 2 “default” options
The principles should enable the right
mix of “safes” for a given data source
Source: http://www.shinyshiny.tv/2009/12/easymix_-_a_mix.html

More Related Content

What's hot

Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypseENUG
 
Developing data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesDeveloping data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesAmanda Whitmire
 
Sharing Qualitative Data - Challenges and Opportunities
Sharing Qualitative Data - Challenges and OpportunitiesSharing Qualitative Data - Challenges and Opportunities
Sharing Qualitative Data - Challenges and OpportunitiesLancaster University Library
 
Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Amanda Whitmire
 
Managing Confidential Information – Trends and Approaches
Managing Confidential Information – Trends and ApproachesManaging Confidential Information – Trends and Approaches
Managing Confidential Information – Trends and ApproachesMicah Altman
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19openEHR-Japan
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
Guideline based CDSS for COVID-19
Guideline based CDSS for COVID-19Guideline based CDSS for COVID-19
Guideline based CDSS for COVID-19openEHR-Japan
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences Pistoia Alliance
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data managementMichael Day
 
June2014 brownbag privacy
June2014 brownbag privacyJune2014 brownbag privacy
June2014 brownbag privacyMicah Altman
 
Reproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveReproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveMicah Altman
 
10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancyHannelore Vanhaverbeke
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) CommonsJames Hendler
 
An Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsAn Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsBrittany Lasseigne, Ph.D.
 

What's hot (20)

Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
Medical image analysis, retrieval and evaluation infrastructures
Medical image analysis, retrieval and evaluation infrastructuresMedical image analysis, retrieval and evaluation infrastructures
Medical image analysis, retrieval and evaluation infrastructures
 
Challenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL modelChallenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL model
 
Developing data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesDeveloping data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universities
 
Sharing Qualitative Data - Challenges and Opportunities
Sharing Qualitative Data - Challenges and OpportunitiesSharing Qualitative Data - Challenges and Opportunities
Sharing Qualitative Data - Challenges and Opportunities
 
openEHR v COVID-19
openEHR v COVID-19openEHR v COVID-19
openEHR v COVID-19
 
Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521
 
Cairo
CairoCairo
Cairo
 
From byte to mind
From byte to mindFrom byte to mind
From byte to mind
 
Managing Confidential Information – Trends and Approaches
Managing Confidential Information – Trends and ApproachesManaging Confidential Information – Trends and Approaches
Managing Confidential Information – Trends and Approaches
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
Guideline based CDSS for COVID-19
Guideline based CDSS for COVID-19Guideline based CDSS for COVID-19
Guideline based CDSS for COVID-19
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
June2014 brownbag privacy
June2014 brownbag privacyJune2014 brownbag privacy
June2014 brownbag privacy
 
Reproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveReproducibility from an infomatics perspective
Reproducibility from an infomatics perspective
 
10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
An Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsAn Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and Genomics
 

Viewers also liked (19)

Internacional. tema 11, 12, 13.
Internacional. tema 11, 12, 13.Internacional. tema 11, 12, 13.
Internacional. tema 11, 12, 13.
 
01 aspectos basicos
01   aspectos basicos01   aspectos basicos
01 aspectos basicos
 
S4 tarea4 grgoe
S4 tarea4 grgoeS4 tarea4 grgoe
S4 tarea4 grgoe
 
Modelo Educativo Adventista
Modelo Educativo AdventistaModelo Educativo Adventista
Modelo Educativo Adventista
 
Presentación postitulo alfabetizacion digital unse
Presentación postitulo alfabetizacion digital unsePresentación postitulo alfabetizacion digital unse
Presentación postitulo alfabetizacion digital unse
 
Ejercicio 2
Ejercicio 2Ejercicio 2
Ejercicio 2
 
Articulo62 cst
Articulo62 cstArticulo62 cst
Articulo62 cst
 
Anorexia y bulimia
Anorexia y bulimiaAnorexia y bulimia
Anorexia y bulimia
 
Gestión Ágil de Proyectos: Scrum, Kanban y XP
Gestión Ágil de Proyectos: Scrum, Kanban y XPGestión Ágil de Proyectos: Scrum, Kanban y XP
Gestión Ágil de Proyectos: Scrum, Kanban y XP
 
Situa aprend didactica critica
Situa aprend didactica criticaSitua aprend didactica critica
Situa aprend didactica critica
 
S4 tarea4 grgoe
S4 tarea4 grgoeS4 tarea4 grgoe
S4 tarea4 grgoe
 
Tercer tema
Tercer temaTercer tema
Tercer tema
 
ORCID for funders webinar -NIH use of ORCID to track outcomes - Richard Iiked...
ORCID for funders webinar -NIH use of ORCID to track outcomes - Richard Iiked...ORCID for funders webinar -NIH use of ORCID to track outcomes - Richard Iiked...
ORCID for funders webinar -NIH use of ORCID to track outcomes - Richard Iiked...
 
Decreto 094 de 1989
Decreto 094 de 1989Decreto 094 de 1989
Decreto 094 de 1989
 
Taller
TallerTaller
Taller
 
Programa de Área CN 2017
Programa de Área CN 2017Programa de Área CN 2017
Programa de Área CN 2017
 
PIKASEN Clasificadora con lector óptico
PIKASEN Clasificadora con lector ópticoPIKASEN Clasificadora con lector óptico
PIKASEN Clasificadora con lector óptico
 
Impact Hub_Abidjan - Columbia Ci3
Impact Hub_Abidjan - Columbia Ci3Impact Hub_Abidjan - Columbia Ci3
Impact Hub_Abidjan - Columbia Ci3
 
Pronunciamiento apo 25 03 2017
Pronunciamiento apo 25 03 2017Pronunciamiento apo 25 03 2017
Pronunciamiento apo 25 03 2017
 

Similar to Managing sensitive data at the Australian Data Archive

Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementJamie Bisset
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6ARDC
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Datacunera
 
Accessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five SafesAccessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five SafesLouise Corti
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto UniversityStephanie Simms
 
Data Management Lab: Session 4 Slides
Data Management Lab: Session 4 SlidesData Management Lab: Session 4 Slides
Data Management Lab: Session 4 SlidesIUPUI
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research RequirementsICPSR
 
Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Jamie Bisset
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017ARDC
 
Getting to grips with Research Data Management
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data ManagementIzzyChad
 
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...Kristin Briney
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamPlatforma Otwartej Nauki
 

Similar to Managing sensitive data at the Australian Data Archive (20)

Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
 
Accessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five SafesAccessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five Safes
 
Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
 
Rdm slides march 2014
Rdm slides march 2014Rdm slides march 2014
Rdm slides march 2014
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
 
Data Management Lab: Session 4 Slides
Data Management Lab: Session 4 SlidesData Management Lab: Session 4 Slides
Data Management Lab: Session 4 Slides
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research Requirements
 
Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction)
 
Researh data management
Researh data managementResearh data management
Researh data management
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
 
Open Science and Open Data for Librarians
Open Science and Open Data for LibrariansOpen Science and Open Data for Librarians
Open Science and Open Data for Librarians
 
Getting to grips with Research Data Management
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data Management
 
The Ethics of Digital Preservation
The Ethics of Digital PreservationThe Ethics of Digital Preservation
The Ethics of Digital Preservation
 
Burton - Security, Privacy and Trust
Burton - Security, Privacy and TrustBurton - Security, Privacy and Trust
Burton - Security, Privacy and Trust
 
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
 
Research-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhDResearch-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhD
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 

More from ARDC

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADAARDC
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and StandardsARDC
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation ARDC
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)ARDC
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveARDC
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domainARDC
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataARDC
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharingARDC
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studiesARDC
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scopeARDC
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things dataARDC
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128ARDC
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical dataARDC
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataARDC
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesARDC
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018ARDC
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintARDC
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataARDC
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018ARDC
 

More from ARDC (20)

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADA
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and Standards
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspective
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domain
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharing
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studies
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scope
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical data
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and Challenges
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of data
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018
 

Recently uploaded

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 

Recently uploaded (20)

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 

Managing sensitive data at the Australian Data Archive

  • 1. Managing sensitive data at the Australian Data Archive “Making Data Social” webinar series 29 March 2017 Dr. Steven McEachern Director, Australian Data Archive ANU Centre for Social Research and Methods Australian National University
  • 2. Overview • Sensitive data and the 5 Safes model • Access to sensitive data in Australia • Applying the 5 Safes model at ADA • Sensitive data and the data lifecycle
  • 3. Sensitive data • “Sensitive data are data that can be used to identify anindividual, species, object, process,or locationthat introduces a risk of discrimination, harm, or unwanted attention.” – ANDS Guide on Publishing and Sharing Sensitive Data, p.7 – http://www.ands.org.au/__data/assets/pdf_file/0 010/489187/Sensitive-data.pdf
  • 4. The 5 safes 1. Safe people: Can the researchers be trusted to do the right thing? 2. Safe projects: Is the data to be used for an appropriate purpose? 3. Safe settings: Is the environment in which the analysis takes place safe? 4. Safe data: Is the data appropriately protected? 5. Safe output: Is there a low risk of disclosure in research outputs? Desai, T., F. Ritchie and R. Welpton (2016) Five Safes: designing data access for research. Economics Working Paper Series 1601, University of the West of England. http://www2.uwe.ac.uk/faculties/BBS/Documents/1601.pdf
  • 5. What do researchers expect? (or What is wanted? :-) • “We emphasize that direct access to micro-data is critical for success. Alternatives such as access to synthetic data or submission of computer programs to agency employees will not address the key problem of restoring US leadership with cutting-edge policy-relevant research.” • Card, Chetty, Feldstein and Saez, 2010 (emphasis in original) – http://rajchetty.com/chettyfiles/NSFdataaccess.pdf
  • 6. What is expected? • “Here's what you need to do if you want an anonymised 1% sample of the US Census – Go to Google and type US Census 1% sample, click on link to the Census. – Download each of the state files from the FTP site and merge them yourself. Or just check things out for one of the states. Whatever you like. – Start mucking about to test whether your pet theory is plausible. • Here's what you need to do if you want an anoymised sample of the NZ Census, or a Confidentialised Unit Record File (CURF) of any of big Stats series: – Go to Stats NZ's site, here. – Follow the instructions below: …” • (Followed by several pages of instructions, Application Process, Assessment Criteria, Methods of Access, …) Eric Crampton, the New Zealand Initiative, Wellington, formerly University of Canterbury. http://offsettingbehaviour.blogspot.com.au/2015/10/curf-and- turf.html
  • 7. Can we bridge depositor and user expectations? I think so. Consider Card et al. again: “We believe that five conditions must be satisfied to make a data access program sustainable and efficient: a) fair and open competition for data access based on scientific merit b) sufficient bandwidth to accommodate a large number of projects simultaneously c) inclusion of younger scholars and graduate students in the research teams that can access the data d) direct access to de-identified micro data through local statistical offices or, more preferably, secure remote connections e) systematic electronic monitoring to allow immediate disclosure of statistical results and prevent any disclosure of individual records”
  • 8. Current models in Australia ABS: • Confidentialised Unit Record Files (CURFs) • RADL • ABSDL • TableBuilder ADA: • Confidentialised Unit Record Files (CURFs) Shared (often remote access) infrastructure: • AURIN • SURE (PHRN) • Data linkage facilities Ad hoc arrangements: • “Secure rooms” • Departmental arrangements
  • 9. Applying the 5 Safes People Projects Settings Data Output CURFs Yes? Yes? Yes? YES YES TableBuilder No No YES YES YES RADL Yes? Yes? YES YES YES ABSDL Yes? Yes? YES YES YES ABS Remote Data Lab Yes Yes? YES YES YES ADA Yes? No No YES No AURIN No No YES YES Yes? SURE (PHRN) Yes Yes YES No? ??? Data Linkage facilities No? YES Yes? YES ??? Secure rooms Yes? Yes? YES No? ???
  • 10. Australian experience • Safe data – Confidentialisation: ADA, ABS, DSS (HILDA, etc.) – Indirect access to data: TableBuilder, ADA • Safe settings – Aggregated data: TableBuilder, AURIN – Remote: RADL, ABS (Remote) Data Lab – Secure environments: ABS (On site) Data Lab, secure rooms
  • 11. 5 safes: lesser emphasis on… • Safe outputs – Difficult to scale (e.g. data lab output reviews) – This is changing – e.g. TableBuilder is automated – But need to consider replication and reproducibility • Safe researchers and safe projects – Considered in most models, but not closely monitored – May be difficult to monitor? (Similar issues to the reporting of research outputs in universities) – Universities could (and do!!) provide imprimatur for their staff and students
  • 12. Frameworks for research practice • There are existing Australian frameworks for researcher accountability and responsibilities: – the Australian Code for the Responsible Conduct of Research (ACRCR), which sets out institutional and researcher responsibilities for conduct of research – (Note that this is currently under review) – Human Research Ethics Committees (HREC) • Increasingly, professional and journal requirements for data sharing: – E.g. PLOS One, AEA, DA-RT (political science) – https://www.aeaweb.org/aer/data.php – http://journals.plos.org/plosone/s/data-availability
  • 13. Relevant content from ACRCR • S.2: Management of research data and primary materials – E.g. 2.7 Maintain confidentiality of research data and primary materials – Researchers given access to confidential information must maintain that confidentiality. Primary materials and confidential research data must be kept in secure storage. Confidential information must only be used in ways agreed with those who provided it. Particular care must be exercised when confidential data are made available for discussion. • S.4: Publication and dissemination of research findings – E.g. 4.2.3 Institutions must ensure that the sponsors of research understand the importance of publication in research and do not delay publication beyond the time needed to protect intellectual property and other relevant interests. • S.9: Breaches of the Code and misconduct in research
  • 14. ADA model • Safe data: data is anonymised (confidentialised) either prior to deposit or by ADA archivists • Safe people: virtually all data access is mediated, and users must be identified and provide contact and supervisor details • Safe projects: users provide a project description • Safe settings and safe outputs: NOT applied
  • 15. ABS Remote Data Lab (virtual enclave) • Safe data: less of a focus – but the lab does not prohibit use of safe data practices – Risk: individual researchers can see individual records – BUT this assumes “unsafe” people (researchers) • Safe settings: Remote access environment hosted at ABS – Challenge: cost of establishing the system • Safe outputs: outputs limited only to methods approved through the access environment (i.e. no printing) – Risk: photographing the screen, taking notes – Again, assumes “unsafe” people – Challenge: managing output checking
  • 16. • Safe people: – Institutional support • Note ACRCR – code of conduct, and HREC – Training for researchers prior to access • Intended breaches are uncommon (per background paper) • Focus therefore on unintended breach • Highlight also alternate access options to reduce breach due to limitations of access method – Assessing people: research background? • Experience is difficult to evaluate here • How would you build up a track record? • Safe projects – May be necessary for legislative reasons – Should this matter? • Basic research might generate just as useful insights
  • 17. A suite of options • Different existing models (each a mix of the 5 safes) all have their place • Safe people can be incorporated into existing models – Many current models assume the “intruder” – International evidence suggests this is not the case • ADA has 2 “default” options
  • 18. The principles should enable the right mix of “safes” for a given data source Source: http://www.shinyshiny.tv/2009/12/easymix_-_a_mix.html