A collaborative approach to
“filling the digital preservation
gap” for Research Data
Management
Jenny Mitcham, University of York
Chris Awre, University of Hull
Sarah Romkey, Artefactual Systems
9 November 2015
Research at Hull and York
• Hull:
– 6 Faculties, 25+ academic departments
– c. 22,000 students
– 62% research classed as 3* or 4* in REF 2014
– In top 50 UK institutions by ‘research power’
• York:
– 30+ academic departments
– c. 16,000 students
– Ranked in the top ten of UK universities for research council
income (THE)
Filling the digital preservation gap:
Project aim
“…to investigate
Archivematica and explore
how it might be used to
provide digital preservation
functionality within a wider
infrastructure for Research
Data Management.”
What is Archivematica?
● Free and open-source digital preservation system
(AGPLv3) designed to maintain standards-based,
long-term access to digital objects
● Allows users to process digital objects from ingest to
access using the ISO-OAIS functional model
● Implements format normalization upon ingest and
preserves originals to support emulation and
migration strategies
What is Archivematica?
● Archivematica is a processing pipeline consisting of
a bundle of open-source tools and Python scripts
which deliver a series of preservation micro-services
● Archivematica is designed to output high-quality,
standards-compliant Archival Information Packages
(AIPs)
● BagIt, METS, PREMIS
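To make the BagIt part of that packaging more concrete, here is a minimal sketch of what a bag looks like on disk and how its fixity can be checked. It uses only the Python standard library and is not Archivematica's own code; the function names `make_minimal_bag` and `verify_bag`, the file name `results.csv`, and the choice of SHA-256 are assumptions made for illustration. A real AIP also carries METS and PREMIS metadata, which this sketch leaves out.

```python
import hashlib
import tempfile
from pathlib import Path


def make_minimal_bag(bag_dir: Path, payload: dict[str, bytes]) -> None:
    """Write a minimal BagIt-style bag: bagit.txt, data/ and a SHA-256 manifest."""
    data_dir = bag_dir / "data"
    data_dir.mkdir(parents=True, exist_ok=True)
    # Bag declaration file; the version number here is illustrative.
    (bag_dir / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    manifest_lines = []
    for name, content in sorted(payload.items()):
        (data_dir / name).write_bytes(content)
        digest = hashlib.sha256(content).hexdigest()
        # Each manifest line pairs a checksum with a path relative to the bag.
        manifest_lines.append(f"{digest}  data/{name}")
    (bag_dir / "manifest-sha256.txt").write_text(
        "\n".join(manifest_lines) + "\n", encoding="utf-8"
    )


def verify_bag(bag_dir: Path) -> list[str]:
    """Return payload paths that are missing or whose checksum has changed."""
    failures = []
    manifest = (bag_dir / "manifest-sha256.txt").read_text(encoding="utf-8")
    for line in manifest.splitlines():
        expected, rel_path = line.split(maxsplit=1)
        file_path = bag_dir / rel_path
        if not file_path.is_file():
            failures.append(rel_path)
            continue
        actual = hashlib.sha256(file_path.read_bytes()).hexdigest()
        if actual != expected:
            failures.append(rel_path)
    return failures


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        bag = Path(tmp) / "example_bag"
        make_minimal_bag(bag, {"results.csv": b"a,b\n1,2\n"})
        print("Files failing fixity check:", verify_bag(bag))
```

Storing a checksum alongside every payload file is what lets a repository detect silent corruption years after ingest, which is one reason the AIP is wrapped in a bag rather than stored as loose files.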
How is Archivematica used?
• For long-term preservation of born-digital and
digitized material
• To unpack and preserve contents of disk images
• To create access copies for AtoM, CONTENTdm,
other access systems
• As a “dark archive” for DSpace content
• Research data preservation
– => Jisc Research Data Spring project
Archivematica development partners
and more!
This is a collaboration
University of Hull:
• Chris Awre – Head of Information Services, Library and
Learning Innovation
• Richard Green – Independent Consultant
• Simon Wilson – University Archivist
University of York:
• Julie Allinson – Manager, Digital York
• Jen Mitcham – Digital Archivist
Artefactual Systems
Jisc - part of Research Data Spring
Project structure
• Phase 1 – explore: testing, research,
thinking, then produce a report (3 months)
• Phase 2 – develop: make
Archivematica better for RDM, plan
implementation (4 months)
• Phase 3 – implement: set up proofs of
concept at York and Hull (6 months)
Why do we need digital preservation?
Why do we need digital preservation
for research data?
• We can’t ignore digital preservation – moving targets for
data retention mean we need to take this seriously
• Funder requirements around retention:
– NERC – data should be retained for a minimum of 10 years but
for projects of major importance this may need to be 20 years or
longer
– STFC – expect data to be retained for a minimum of 10 years and
data that cannot be re-measured should be retained indefinitely
– Wellcome Trust – expect data to be kept for a minimum of 10
years but suggest longer periods for certain types of data
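As a rough illustration of what these minimums mean in practice, the sketch below works out the earliest date at which a dataset's retention could be reviewed. The 10-year figures come from the funder summaries above; the function name `earliest_review_date` and the idea of counting from a single "retention start" date are assumptions for illustration, since each funder's policy defines its own starting point. The result is a floor, not a disposal date: NERC may require 20 years or longer, and STFC expects data that cannot be re-measured to be kept indefinitely.

```python
from datetime import date

# Minimum retention periods in years, as summarised in the list above.
MIN_RETENTION_YEARS = {"NERC": 10, "STFC": 10, "Wellcome Trust": 10}


def earliest_review_date(funder: str, retention_start: date) -> date:
    """Return the earliest date on which retention could be reviewed.

    `retention_start` is a hypothetical anchor date (for example, the end
    of the project); check the funder's policy for the actual trigger.
    """
    years = MIN_RETENTION_YEARS[funder]
    target_year = retention_start.year + years
    try:
        return retention_start.replace(year=target_year)
    except ValueError:
        # 29 February with a non-leap target year: fall back to 28 February.
        return retention_start.replace(year=target_year, day=28)


if __name__ == "__main__":
    print(earliest_review_date("NERC", date(2015, 11, 9)))
```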
University of York RDM questionnaire 2013
• Which data management issues have you come
across in your research over the last five years?
– “Inability to read files in old software formats on old
media or because of expired software licences”
– 24% of 181 researchers who answered this question
admitted this had been a problem for them
What does research data look like?
York RDM questionnaire
2013: Please select the main
types of electronic research
data you generate
Top research data applications at York
What does research data look like?
York RDM questionnaire 2013:
If your project is not yet
complete, can you make an
estimate of the ‘final’ size of
your digital data?
Why would we recommend
Archivematica for RDM?
• It is flexible and can be configured in different ways for different
institutional needs and workflows
• It allows many of the tasks around digital preservation to be
carried out in an automated fashion
• It can be used alongside other existing systems as part of a wider
workflow for research data
• It is a good digital preservation solution for those with limited
resources
• It is an evolving solution that is continually driven and enhanced
by and for the digital preservation community
• It gives institutions greater confidence that they will be able to
continue to provide access to usable copies of research data over
time
What are the downsides?
• It isn’t a magic bullet
• There is no guarantee your data will be
readable in the future
• It can only be as good as current digital
preservation practice
• It can be fiddly to install correctly
• The GUI isn’t that intuitive
• You need staff who understand it
Read all about it!
http://digital-archiving.blogspot.co.uk/
How could you use Archivematica?
• Host it in-house and link it to an existing
repository/access system (for example DSpace,
CONTENTdm, Fedora/Hydra ...or a CRIS)
• Host it in-house and use it as a standalone system
(you would need to have a storage system in place and establish a
way of facilitating access to the data)
• Sign up for a hosted instance of Archivematica
with archivesDIRECT (combines Archivematica with
DuraCloud storage)
• Sign up for a hosted instance of Archivematica
with Arkivum (combines Archivematica with Arkivum storage)
RDM Workflows at York
• We get a copy of data from a researcher
• We transfer it to Archivematica
• Archivematica packages it up for storage and
creates the Archival Information Package (AIP)
• Archivematica sends the AIP to archival storage
• Metadata is published in data catalogue
• If someone requests the data, Archivematica will
create a Dissemination Information Package (DIP)
• DIP will be uploaded to Digital Library for access
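The steps above can be sketched as a small state model. This is not Archivematica's API or York's implementation; the `Dataset` class and the `deposit` and `request_access` functions are hypothetical names used only to show the shape of the workflow, in particular that the AIP is always created at deposit while the DIP is created only when someone asks for the data.

```python
from dataclasses import dataclass, field


@dataclass
class Dataset:
    """Hypothetical record tracking one dataset through the workflow."""
    title: str
    log: list[str] = field(default_factory=list)
    aip_stored: bool = False
    dip_available: bool = False


def deposit(dataset: Dataset) -> Dataset:
    """Steps that happen for every dataset, whether or not it is requested."""
    dataset.log.append("copy of data received from researcher")
    dataset.log.append("data transferred to Archivematica")
    dataset.log.append("AIP created")
    dataset.log.append("AIP sent to archival storage")
    dataset.aip_stored = True
    dataset.log.append("metadata published in data catalogue")
    return dataset


def request_access(dataset: Dataset) -> Dataset:
    """Steps that happen only when someone requests the data."""
    if not dataset.aip_stored:
        raise ValueError("no AIP in archival storage; deposit the dataset first")
    if not dataset.dip_available:
        dataset.log.append("DIP created on request")
        dataset.log.append("DIP uploaded to Digital Library")
        dataset.dip_available = True
    return dataset


if __name__ == "__main__":
    ds = deposit(Dataset(title="Example survey data"))
    print("DIP after deposit:", ds.dip_available)
    request_access(ds)
    print("DIP after request:", ds.dip_available)
```

Deferring the DIP avoids generating and storing access copies for datasets that are never requested.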
How do York plan to use Archivematica?
How can we improve Archivematica?
1. Enable better workflows for RDM (producing a
DIP on request)
2. Allow the DIP (access copy of data) to be
usable by different repository systems
3. Help reduce bottlenecks for big data
4. Provide workflows for unidentified files
5. Enable easier querying of data within
Archivematica by third-party applications
6. Improve documentation
All are in progress in Phase 2 of the project
Where to find out more
http://www.york.ac.uk/borthwick/
Where to find out more
Do talk to us if you are interested in finding
out more about this project
Useful links:
Digital archiving blog: http://digital-archiving.blogspot.co.uk/
Archivematica: https://www.archivematica.org/en/
Report: http://dx.doi.org/10.6084/m9.figshare.1481170

The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 

Recently uploaded (20)

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 

A collaborative approach to "filling the digital preservation gap" for Research Data Management

  • 1. A collaborative approach to “filling the digital preservation gap” for Research Data Management Jenny Mitcham, University of York Chris Awre, University of Hull Sarah Romkey, Artefactual Systems 9 November 2015
  • 2. Research at Hull and York • Hull : – 6 Faculties, 25+ academic departments – c. 22,000 students – 62% research classed as 3* or 4* in REF 2014 – In top 50 UK institutions by ‘research power’ • York: – 30+ academic departments – c. 16,000 students – Ranked in the top ten of UK universities for research council income (THE)
  • 3. Filling the digital preservation gap: Project aim “…to investigate Archivematica and explore how it might be used to provide digital preservation functionality within a wider infrastructure for Research Data Management.”
  • 4. What is Archivematica? ● Free and open-source digital preservation system (AGPLv3) designed to maintain standards-based, long-term access to digital objects ● Allows users to process digital objects from ingest to access using the ISO-OAIS functional model ● Implements format normalization upon ingest and preserves originals to support emulation and migration strategies
  • 5. What is Archivematica? ● Archivematica is a processing pipeline consisting of a bundle of open-source tools and Python scripts which deliver a series of preservation micro-services ● Archivematica is designed to output high-quality, standards-compliant Archival Information Packages (AIPs) ● BagIt, METS, PREMIS
  • 6. How is Archivematica used? • For long term preservation of born-digital and digitized material • To unpack and preserve contents of disk images • To create access copies for AtoM, CONTENTdm, other access systems • As a “dark archive” for DSpace content • Research data preservation – => Jisc Research Data Spring project
  • 8. This is a collaboration University of Hull: • Chris Awre – Head of Information Services, Library and Learning Innovation • Richard Green – Independent Consultant • Simon Wilson – University Archivist University of York: • Julie Allinson – Manager, Digital York • Jen Mitcham – Digital Archivist Artefactual Systems Jisc - part of Research Data Spring
  • 9. Project structure • Phase 1 – explore: testing, research, thinking – produce a report (3 months) • Phase 2 – develop: make Archivematica better for RDM, plan implementation (4 months) • Phase 3 – implement: set up proofs of concept at York and Hull (6 months)
  • 10. Why do we need digital preservation?
  • 11. Why do we need digital preservation for research data? • We can’t ignore digital preservation – moving targets for data retention mean we need to take this seriously • Funder requirements around retention: – NERC - data should be retained for a minimum of 10 years but for projects of major importance this may need to be 20 years or longer – STFC - expect data to be retained for a minimum of 10 years and data that cannot be re-measured should be retained indefinitely – Wellcome Trust – expect data to be kept for a minimum of 10 years but suggest longer periods for certain types of data
  • 12. Why do we need digital preservation for research data? University of York RDM questionnaire 2013 • Which data management issues have you come across in your research over the last five years? – “Inability to read files in old software formats on old media or because of expired software licences” – 24% of 181 researchers who answered this question admitted this had been a problem for them
  • 13. What does research data look like? York RDM questionnaire 2013: Please select the main types of electronic research data you generate
  • 14. Top research data applications at York
  • 15. What does research data look like? York RDM questionnaire 2013: If your project is not yet complete, can you make an estimate of the ‘final’ size of your digital data
  • 16. Why would we recommend Archivematica for RDM? • It is flexible and can be configured in different ways for different institutional needs and workflows • It allows many of the tasks around digital preservation to be carried out in an automated fashion • It can be used alongside other existing systems as part of a wider workflow for research data • It is a good digital preservation solution for those with limited resources • It is an evolving solution that is continually driven and enhanced by and for the digital preservation community • It gives institutions greater confidence that they will be able to continue to provide access to usable copies of research data over time
  • 17. What are the downsides? • It isn’t a magic bullet • There is no guarantee your data will be readable in the future • It can only be as good as current digital preservation practice • It can be fiddly to install correctly • The GUI isn’t that intuitive • You need staff who understand it
  • 18. Read all about it! http://digital-archiving.blogspot.co.uk/
  • 19. How could you use Archivematica? • Host it in-house and link it to an existing repository/access system (for example DSpace, CONTENTdm, Fedora/Hydra ...or a CRIS) • Host it in-house and use as a standalone system (you would need to have a storage system in place and establish a way of facilitating access to the data) • Sign up for a hosted instance of Archivematica with archivesDIRECT (combines Archivematica with DuraCloud storage) • Sign up for a hosted instance of Archivematica with Arkivum (combines Archivematica with Arkivum storage)
  • 20. RDM Workflows at York • We get a copy of data from a researcher • We transfer it to Archivematica • Archivematica packages it up for storage and creates the Archival Information Package (AIP) • Archivematica sends the AIP to archival storage • Metadata is published in data catalogue • If someone requests the data Archivematica will create a Dissemination Information Package (DIP) • DIP will be uploaded to Digital Library for access
  • 21. How do York plan to use Archivematica?
  • 22.
  • 23. How can we improve Archivematica? 1. Enable better workflows for RDM (producing a DIP on request) 2. Allow the DIP (access copy of data) to be usable by different repository systems 3. Help reduce bottlenecks for big data 4. Workflows for unidentified files 5. Enable easier querying of data within Archivematica by third-party applications 6. Better documentation All are in progress in Phase 2 of the project
  • 24. Where to find out more http://www.york.ac.uk/borthwick/
  • 25. Where to find out more
  • 26. Do talk to us if you are interested in finding out more about this project Useful links: Digital archiving blog: http://digital-archiving.blogspot.co.uk/ Archivematica: https://www.archivematica.org/en/ Report: http://dx.doi.org/10.6084/m9.figshare.1481170
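
The York workflow on slide 20 (deposit, package as an AIP, store, publish metadata, create a DIP on request) can be sketched in a few lines of Python. This is only an illustrative model, not Archivematica's API or the project's implementation: the class and function names (`AIP`, `RdmWorkflow`, `deposit`, `request_access`) are hypothetical, storage and the data catalogue are plain in-memory structures, and the SHA-256 fixity check is an assumption standing in for the much richer processing Archivematica performs.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class AIP:
    """Hypothetical stand-in for an Archival Information Package."""
    dataset_id: str
    files: dict                                     # filename -> file content (bytes)
    checksums: dict = field(default_factory=dict)   # filename -> SHA-256 hex digest


def create_aip(dataset_id, files):
    """Package the deposited files and record a checksum for each one."""
    checksums = {name: hashlib.sha256(data).hexdigest() for name, data in files.items()}
    return AIP(dataset_id=dataset_id, files=dict(files), checksums=checksums)


def verify_aip(aip):
    """Return True if every stored file still matches its recorded checksum."""
    return all(
        hashlib.sha256(data).hexdigest() == aip.checksums.get(name)
        for name, data in aip.files.items()
    )


class RdmWorkflow:
    """In-memory model of the steps listed on slide 20 (all names are assumptions)."""

    def __init__(self):
        self.archival_storage = {}   # dataset_id -> AIP
        self.data_catalogue = []     # published metadata records

    def deposit(self, dataset_id, title, files):
        # Steps 1-4: receive the data, package it as an AIP, send it to storage.
        aip = create_aip(dataset_id, files)
        self.archival_storage[dataset_id] = aip
        # Step 5: publish metadata in the data catalogue.
        self.data_catalogue.append(
            {"dataset_id": dataset_id, "title": title, "file_count": len(files)}
        )
        return aip

    def request_access(self, dataset_id):
        # Steps 6-7: a DIP is only created when someone asks for the data.
        aip = self.archival_storage[dataset_id]
        if not verify_aip(aip):
            raise ValueError(f"Fixity check failed for {dataset_id}")
        return {"dataset_id": dataset_id, "files": dict(aip.files)}


workflow = RdmWorkflow()
workflow.deposit("ds-001", "Example survey data", {"results.csv": b"id,score\n1,42\n"})
dip = workflow.request_access("ds-001")
```

The point of the sketch is the ordering: the AIP is created and stored once at deposit time, while the DIP is generated lazily on request, which is the "producing a DIP on request" behaviour listed among the planned improvements on slide 23.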