SlideShare a Scribd company logo
Project update:
A collaborative approach to
“filling the digital preservation
gap” for Research Data
Management
Julie Allinson
Technology Development Manager
Library & Archives
University of York
6 November 2015
Filling the digital preservation gap:
Project aim
“…to investigate
Archivematica and explore
how it might be used to
provide digital preservation
functionality within a wider
infrastructure for Research
Data Management.”
This is a collaboration
University of Hull:
• Chris Awre – Head of Information Services, Library and
Learning Innovation
• Richard Green – Independent Consultant
• Simon Wilson – University Archivist
University of York:
• Julie Allinson – Technology Development Manager
• Jen Mitcham – Digital Archivist
Artefactual Systems
Jisc
Project structure
• Phase 1 – explore: testing, research,
thinking -produce a report (3 months)
• Phase 2 – develop: make
Archivematica better for RDM, plan
implementation (4 months)
• Phase 3 – implement: set up proof of
concepts at York and Hull (6 months)
Phase 1: Read all about it!
http://digital-archiving.blogspot.co.uk/
Why do we need digital preservation
for research data?
• There is a digital preservation gap in current
RDM infrastructures
• We can’t ignore digital preservation – moving
targets for data retention mean we need to take
this seriously
• Funder requirements around retention
University of York RDM questionnaire 2013
• Which data management issues have you come
across in your research over the last five years?
– “Inability to read files in old software formats on old
media or because of expired software licences”
– 24% of 181 researchers who answered this question
admitted this had been a problem for them
Why do we need digital preservation
for research data?
Why Archivematica?
“The goal of the Archivematica project is to
give archivists and librarians with limited
technical and financial capacity the tools,
methodology and confidence to begin
preserving digital information today.”
Why Archivematica?
• Standards-based
• Open Source
• Flexible and customisable
• Compatible with hundreds of file formats
• Advanced search and storage management
• Integrated with third-party systems
From https://ww.archivematica.org/en/
Archivematica for RDM?
• Flexible - can support different institutional needs and
workflows
• Automates many digital preservation tasks
• Can be integrated with other systems
• Good for those with limited resources
• Enhancements driven by and for the digital preservation
community
Archivematica for RDM?
It gives institutions greater confidence
that they will be able to continue to
provide access to usable copies of
research data over time
Phase 2: Improving Archivematica
1. Deliverable 1: Automated DIP regeneration
2. Deliverable 2: METS parsing tools
3. Deliverable 3: Generic search REST API
(proof-of-concept)
4. Deliverable 4: Support multiple checksum
algorithms
5. Deliverable 5: Enhance PRONOM integration
6.Deliverable 6: Automation tools
documentation
Deliverable One
✓Research Data needs to be kept,
but we don’t know if anyone will ever want it
and it might be *massive*
The Solution: enable the DIP to be generated ‘on
request’ and not as part of the initial ingest
Deliverable Two
✓We want to be able to grab the DIP, and
metadata about it for pulling into our
repository
The Solution: a library to help with parsing and
creating METS files
https://github.com/artefactual-labs/mets-reader-writer
Deliverable Three
✓We want to be able to report on what we
have
The Solution: a search API to answer basic
questions about the number of files in storage,
their formats, date of ingest, etc.*
* we’re working with DMAOnline @lancaster
Deliverable Four
✓With large datasets, the current checksum
mechanism in Archivematica could be a
bottleneck
The Solution: support for multiple checksum
algorithms
Deliverable Five
✓What about all those file formats that
Archivematica can’t identify?
The Solution: mechanism for running file
identification with multiple tools and a report of
unidentified formats, working with PRONOM to
improve their coveraage
Deliverable Six
✓We want to make it easier for Institutions to
adopt archivematica
The Solution: documentation and screencasts
for Archivematica automation tools, eg.
https://wiki.archivematica.org/Getting_started#Installation
All of these new features will become
part of the core Archivematica code in
2016
Phase 3
• The plan is to run a third phase of the project to:
✓implement prototype RDM workflows with preservation
using the new Archivematica features at York and Hull
✓use the search API to populate DMAOnline with stats
about datasets
✓do more community outreach
• We will be pitching to Jisc in December for phase
three #fingerscrossed
How do York plan to use Archivematica?
Pure RDMonitor Archivematica
AIP
AIP Store
PURE
Web Services
Archivematica
REST API
DIPRepository
Data
Catalogue
Key:
human to human
machine to machine
human to machine
Where to find out more
http://www.york.ac.uk/borthwick/
The Bigger Picture
•Jisc are looking at building shared services for
RDM
• Our project is inputting into the specification
and discussion
• One area we’d be interested to find out more
about is the appetite for ‘above campus’
options - discussion planned for later.
How could you use Archivematica?
• Host it in-house and link it to an existing
repository/access system (for example DSpace,
CONTENTdm, Fedora/Hydra ...or a CRIS)
• Host it in-house and use as a standalone system
(you would need to have a storage system in place and establish a
way of facilitating access to the data)
• Sign up for a hosted instance of Archivematica
with archivesDIRECT (combines Archivematica with
DuraCloud storage)
• Sign up for a hosted instance of Archivematica
with Arkivum (combines Archivematica with Arkivum storage)
Thanks!
julie.allinson@york.ac.uk
Useful links:
Borthwick website: http://www.york.ac.uk/borthwick/
Digital archiving blog: http://digital-archiving.blogspot.co.uk/
Archivematica: https://www.archivematica.org/en/
Report: http://dx.doi.org/10.6084/m9.figshare.1481170

More Related Content

What's hot

Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
4Science
 
ArchivesSpace-Archivematica-DSpace Workflow Integration
ArchivesSpace-Archivematica-DSpace Workflow IntegrationArchivesSpace-Archivematica-DSpace Workflow Integration
ArchivesSpace-Archivematica-DSpace Workflow Integration
Max Eckard
 
TEAM 4: IoT Technologies for Africa
TEAM 4: IoT Technologies for AfricaTEAM 4: IoT Technologies for Africa
TEAM 4: IoT Technologies for Africa
plan4all
 
Hsrs review 2013_04_v3
Hsrs review 2013_04_v3Hsrs review 2013_04_v3
Hsrs review 2013_04_v3Karel Charvat
 
Digital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An IntroductionDigital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An Introduction
Artefactual Systems - Archivematica
 
Archivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany ScottArchivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany Scott
Artefactual Systems - Archivematica
 
LinkedStat: making ISTAT data more valuable
LinkedStat: making ISTAT data more valuableLinkedStat: making ISTAT data more valuable
LinkedStat: making ISTAT data more valuable
SpazioDati
 
Provenance as a building block for an open science infrastructure
Provenance as a building block for an open science infrastructureProvenance as a building block for an open science infrastructure
Provenance as a building block for an open science infrastructure
Andreas Schreiber
 
ORDS, research data network
ORDS, research data networkORDS, research data network
ORDS, research data network
Jisc RDM
 
Anderson Conglomerating and Collocating Collections
Anderson Conglomerating and Collocating CollectionsAnderson Conglomerating and Collocating Collections
Anderson Conglomerating and Collocating Collections
National Information Standards Organization (NISO)
 
DSpace-CRIS ORCID Integration
DSpace-CRIS ORCID IntegrationDSpace-CRIS ORCID Integration
DSpace-CRIS ORCID Integration
4Science
 
2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities
Dirk Roorda
 
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research CenterElephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
Robert H. McDonald
 
JCDL 2015 Doctoral Consortium - A Framework for Aggregating Private and Publi...
JCDL 2015 Doctoral Consortium - A Framework for AggregatingPrivate and Publi...JCDL 2015 Doctoral Consortium - A Framework for AggregatingPrivate and Publi...
JCDL 2015 Doctoral Consortium - A Framework for Aggregating Private and Publi...
Mat Kelly
 
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
Andrea Bollini
 
iRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan CrabtreeiRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan Crabtree
datascienceiqss
 
A Wiki for Archivists? ArchiefWiki.org
A Wiki for Archivists? ArchiefWiki.orgA Wiki for Archivists? ArchiefWiki.org
A Wiki for Archivists? ArchiefWiki.org
Tom Cobbaert
 
News about DSpace-CRIS Anwendertreffen 2020
News about DSpace-CRIS Anwendertreffen 2020News about DSpace-CRIS Anwendertreffen 2020
News about DSpace-CRIS Anwendertreffen 2020
4Science
 
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
4Science
 

What's hot (20)

Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
 
ArchivesSpace-Archivematica-DSpace Workflow Integration
ArchivesSpace-Archivematica-DSpace Workflow IntegrationArchivesSpace-Archivematica-DSpace Workflow Integration
ArchivesSpace-Archivematica-DSpace Workflow Integration
 
TEAM 4: IoT Technologies for Africa
TEAM 4: IoT Technologies for AfricaTEAM 4: IoT Technologies for Africa
TEAM 4: IoT Technologies for Africa
 
Hsrs review 2013_04_v3
Hsrs review 2013_04_v3Hsrs review 2013_04_v3
Hsrs review 2013_04_v3
 
Archive concept
Archive conceptArchive concept
Archive concept
 
Digital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An IntroductionDigital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An Introduction
 
Archivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany ScottArchivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany Scott
 
LinkedStat: making ISTAT data more valuable
LinkedStat: making ISTAT data more valuableLinkedStat: making ISTAT data more valuable
LinkedStat: making ISTAT data more valuable
 
Provenance as a building block for an open science infrastructure
Provenance as a building block for an open science infrastructureProvenance as a building block for an open science infrastructure
Provenance as a building block for an open science infrastructure
 
ORDS, research data network
ORDS, research data networkORDS, research data network
ORDS, research data network
 
Anderson Conglomerating and Collocating Collections
Anderson Conglomerating and Collocating CollectionsAnderson Conglomerating and Collocating Collections
Anderson Conglomerating and Collocating Collections
 
DSpace-CRIS ORCID Integration
DSpace-CRIS ORCID IntegrationDSpace-CRIS ORCID Integration
DSpace-CRIS ORCID Integration
 
2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities
 
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research CenterElephant in the Room: Scaling Storage for the HathiTrust Research Center
Elephant in the Room: Scaling Storage for the HathiTrust Research Center
 
JCDL 2015 Doctoral Consortium - A Framework for Aggregating Private and Publi...
JCDL 2015 Doctoral Consortium - A Framework for AggregatingPrivate and Publi...JCDL 2015 Doctoral Consortium - A Framework for AggregatingPrivate and Publi...
JCDL 2015 Doctoral Consortium - A Framework for Aggregating Private and Publi...
 
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
 
iRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan CrabtreeiRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan Crabtree
 
A Wiki for Archivists? ArchiefWiki.org
A Wiki for Archivists? ArchiefWiki.orgA Wiki for Archivists? ArchiefWiki.org
A Wiki for Archivists? ArchiefWiki.org
 
News about DSpace-CRIS Anwendertreffen 2020
News about DSpace-CRIS Anwendertreffen 2020News about DSpace-CRIS Anwendertreffen 2020
News about DSpace-CRIS Anwendertreffen 2020
 
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
 

Viewers also liked

"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
Jenny Mitcham
 
"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica
Jenny Mitcham
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
Jenny Mitcham
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
Jenny Mitcham
 
Jisc Shared Service requirements presentation - 18th November 2015
Jisc Shared Service requirements presentation - 18th November 2015Jisc Shared Service requirements presentation - 18th November 2015
Jisc Shared Service requirements presentation - 18th November 2015
Jenny Mitcham
 
“Filling the digital preservation gap” an update from the Jisc Research Data ...
“Filling the digital preservation gap”an update from the Jisc Research Data ...“Filling the digital preservation gap”an update from the Jisc Research Data ...
“Filling the digital preservation gap” an update from the Jisc Research Data ...
Jenny Mitcham
 
Implementing Archivematica, research data network
Implementing Archivematica, research data networkImplementing Archivematica, research data network
Implementing Archivematica, research data network
Jisc RDM
 
Implementing figshare, research data network
Implementing figshare, research data networkImplementing figshare, research data network
Implementing figshare, research data network
Jisc RDM
 
Building a collaborative RDM community, research data network
Building a collaborative RDM community, research data networkBuilding a collaborative RDM community, research data network
Building a collaborative RDM community, research data network
Jisc RDM
 
Clipper, research data network
Clipper, research data networkClipper, research data network
Clipper, research data network
Jisc RDM
 
Grampian safe haven, research data network
Grampian safe haven, research data networkGrampian safe haven, research data network
Grampian safe haven, research data network
Jisc RDM
 
Standardising research data policies, research data network
Standardising research data policies, research data networkStandardising research data policies, research data network
Standardising research data policies, research data network
Jisc RDM
 
DAF Survey Results, research data network
DAF Survey Results, research data networkDAF Survey Results, research data network
DAF Survey Results, research data network
Jisc RDM
 
Gold, silver, bronze - research data network
Gold, silver, bronze - research data networkGold, silver, bronze - research data network
Gold, silver, bronze - research data network
Jisc RDM
 
Caso integrador de itil v3
Caso integrador de itil v3Caso integrador de itil v3
Caso integrador de itil v3
Jose Miguel Coronado
 
PERSPECTIVES OF ONLINE EDUCATION IN THE CZECH REPUBLIC
PERSPECTIVES OF ONLINE EDUCATION IN THE CZECH REPUBLICPERSPECTIVES OF ONLINE EDUCATION IN THE CZECH REPUBLIC
PERSPECTIVES OF ONLINE EDUCATION IN THE CZECH REPUBLIC
International Scientific Events
 
A Child’s Place Casa
A Child’s Place CasaA Child’s Place Casa
A Child’s Place CasaKyra Dillard
 
A POLYPHONIC REANALYSIS INSTRUMENT FOR UNDERSTANDING SCHOOL CULTURES
A POLYPHONIC REANALYSIS INSTRUMENT FOR UNDERSTANDING SCHOOL CULTURESA POLYPHONIC REANALYSIS INSTRUMENT FOR UNDERSTANDING SCHOOL CULTURES
A POLYPHONIC REANALYSIS INSTRUMENT FOR UNDERSTANDING SCHOOL CULTURES
International Scientific Events
 
PURCHASERS CHOICE INC.
PURCHASERS CHOICE INC.PURCHASERS CHOICE INC.
PURCHASERS CHOICE INC.Matthew Davis
 

Viewers also liked (20)

"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
 
"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
 
Jisc Shared Service requirements presentation - 18th November 2015
Jisc Shared Service requirements presentation - 18th November 2015Jisc Shared Service requirements presentation - 18th November 2015
Jisc Shared Service requirements presentation - 18th November 2015
 
“Filling the digital preservation gap” an update from the Jisc Research Data ...
“Filling the digital preservation gap”an update from the Jisc Research Data ...“Filling the digital preservation gap”an update from the Jisc Research Data ...
“Filling the digital preservation gap” an update from the Jisc Research Data ...
 
Implementing Archivematica, research data network
Implementing Archivematica, research data networkImplementing Archivematica, research data network
Implementing Archivematica, research data network
 
Implementing figshare, research data network
Implementing figshare, research data networkImplementing figshare, research data network
Implementing figshare, research data network
 
Building a collaborative RDM community, research data network
Building a collaborative RDM community, research data networkBuilding a collaborative RDM community, research data network
Building a collaborative RDM community, research data network
 
Clipper, research data network
Clipper, research data networkClipper, research data network
Clipper, research data network
 
Grampian safe haven, research data network
Grampian safe haven, research data networkGrampian safe haven, research data network
Grampian safe haven, research data network
 
Standardising research data policies, research data network
Standardising research data policies, research data networkStandardising research data policies, research data network
Standardising research data policies, research data network
 
DAF Survey Results, research data network
DAF Survey Results, research data networkDAF Survey Results, research data network
DAF Survey Results, research data network
 
Gold, silver, bronze - research data network
Gold, silver, bronze - research data networkGold, silver, bronze - research data network
Gold, silver, bronze - research data network
 
Caso integrador de itil v3
Caso integrador de itil v3Caso integrador de itil v3
Caso integrador de itil v3
 
PERSPECTIVES OF ONLINE EDUCATION IN THE CZECH REPUBLIC
PERSPECTIVES OF ONLINE EDUCATION IN THE CZECH REPUBLICPERSPECTIVES OF ONLINE EDUCATION IN THE CZECH REPUBLIC
PERSPECTIVES OF ONLINE EDUCATION IN THE CZECH REPUBLIC
 
A Child’s Place Casa
A Child’s Place CasaA Child’s Place Casa
A Child’s Place Casa
 
A POLYPHONIC REANALYSIS INSTRUMENT FOR UNDERSTANDING SCHOOL CULTURES
A POLYPHONIC REANALYSIS INSTRUMENT FOR UNDERSTANDING SCHOOL CULTURESA POLYPHONIC REANALYSIS INSTRUMENT FOR UNDERSTANDING SCHOOL CULTURES
A POLYPHONIC REANALYSIS INSTRUMENT FOR UNDERSTANDING SCHOOL CULTURES
 
PURCHASERS CHOICE INC.
PURCHASERS CHOICE INC.PURCHASERS CHOICE INC.
PURCHASERS CHOICE INC.
 
4546
45464546
4546
 

Similar to Project update: A collaborative approach to "filling the digital preservation gap" for Research Data Management

Using Archivemedia to preserve research data
Using Archivemedia to preserve research dataUsing Archivemedia to preserve research data
Using Archivemedia to preserve research data
ARDC
 
A collaborative approach to filling the digital preservation gap for RDM
A collaborative approach to filling the digital preservation gap for RDMA collaborative approach to filling the digital preservation gap for RDM
A collaborative approach to filling the digital preservation gap for RDM
northerncollaboration
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Historic Environment Scotland
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
Sarah Anna Stewart
 
From Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaFrom Box to Hydra via Archivematica
From Box to Hydra via Archivematica
Jisc RDM
 
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics ClassCourtney Mumma
 
Caplan and York, 'What It Takes To Make It Last: E-Resources Preservation"
Caplan and York, 'What It Takes To Make It Last:  E-Resources Preservation"Caplan and York, 'What It Takes To Make It Last:  E-Resources Preservation"
Caplan and York, 'What It Takes To Make It Last: E-Resources Preservation"
National Information Standards Organization (NISO)
 
Ariadne: Lifecycles
Ariadne: LifecyclesAriadne: Lifecycles
Ariadne: Lifecycles
ariadnenetwork
 
An Introduction to Digital Preservation
An Introduction to Digital PreservationAn Introduction to Digital Preservation
An Introduction to Digital Preservation
DigitalPreservationEurope
 
UBC Library's Digital Preservation Strategy
UBC Library's Digital Preservation StrategyUBC Library's Digital Preservation Strategy
UBC Library's Digital Preservation Strategy
UBC Library
 
Open source glam tools for building sustainable cultural heritage and digital...
Open source glam tools for building sustainable cultural heritage and digital...Open source glam tools for building sustainable cultural heritage and digital...
Open source glam tools for building sustainable cultural heritage and digital...
LIBIS
 
SCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation EnvironmentsSCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation Environments
SCAPE Project
 
Archivematica Camp Houston Slides Stream1.pdf
Archivematica Camp Houston Slides Stream1.pdfArchivematica Camp Houston Slides Stream1.pdf
Archivematica Camp Houston Slides Stream1.pdf
lcofresi
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
datascienceiqss
 
Archivematica and the digital archival chain of custody
Archivematica and the digital archival chain of custodyArchivematica and the digital archival chain of custody
Archivematica and the digital archival chain of custody
Artefactual Systems - Archivematica
 
The workflows for the ingest of digital objects into a repository/digital l...
The workflows for the ingest of  digital objects into a repository/digital l...The workflows for the ingest of  digital objects into a repository/digital l...
The workflows for the ingest of digital objects into a repository/digital l...
Hong (Jenny) Jing
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
Sarah Anna Stewart
 
OpenStack Swift In the Enterprise
OpenStack Swift In the EnterpriseOpenStack Swift In the Enterprise
OpenStack Swift In the Enterprise
Hostway|HOSTING
 
Integrating collections data to build sustainable online resources
Integrating collections data to build sustainable online resourcesIntegrating collections data to build sustainable online resources
Integrating collections data to build sustainable online resources
Museums Computer Group
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
Nancy Wilkins-Diehr
 

Similar to Project update: A collaborative approach to "filling the digital preservation gap" for Research Data Management (20)

Using Archivemedia to preserve research data
Using Archivemedia to preserve research dataUsing Archivemedia to preserve research data
Using Archivemedia to preserve research data
 
A collaborative approach to filling the digital preservation gap for RDM
A collaborative approach to filling the digital preservation gap for RDMA collaborative approach to filling the digital preservation gap for RDM
A collaborative approach to filling the digital preservation gap for RDM
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
From Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaFrom Box to Hydra via Archivematica
From Box to Hydra via Archivematica
 
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
 
Caplan and York, 'What It Takes To Make It Last: E-Resources Preservation"
Caplan and York, 'What It Takes To Make It Last:  E-Resources Preservation"Caplan and York, 'What It Takes To Make It Last:  E-Resources Preservation"
Caplan and York, 'What It Takes To Make It Last: E-Resources Preservation"
 
Ariadne: Lifecycles
Ariadne: LifecyclesAriadne: Lifecycles
Ariadne: Lifecycles
 
An Introduction to Digital Preservation
An Introduction to Digital PreservationAn Introduction to Digital Preservation
An Introduction to Digital Preservation
 
UBC Library's Digital Preservation Strategy
UBC Library's Digital Preservation StrategyUBC Library's Digital Preservation Strategy
UBC Library's Digital Preservation Strategy
 
Open source glam tools for building sustainable cultural heritage and digital...
Open source glam tools for building sustainable cultural heritage and digital...Open source glam tools for building sustainable cultural heritage and digital...
Open source glam tools for building sustainable cultural heritage and digital...
 
SCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation EnvironmentsSCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation Environments
 
Archivematica Camp Houston Slides Stream1.pdf
Archivematica Camp Houston Slides Stream1.pdfArchivematica Camp Houston Slides Stream1.pdf
Archivematica Camp Houston Slides Stream1.pdf
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
 
Archivematica and the digital archival chain of custody
Archivematica and the digital archival chain of custodyArchivematica and the digital archival chain of custody
Archivematica and the digital archival chain of custody
 
The workflows for the ingest of digital objects into a repository/digital l...
The workflows for the ingest of  digital objects into a repository/digital l...The workflows for the ingest of  digital objects into a repository/digital l...
The workflows for the ingest of digital objects into a repository/digital l...
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
OpenStack Swift In the Enterprise
OpenStack Swift In the EnterpriseOpenStack Swift In the Enterprise
OpenStack Swift In the Enterprise
 
Integrating collections data to build sustainable online resources
Integrating collections data to build sustainable online resourcesIntegrating collections data to build sustainable online resources
Integrating collections data to build sustainable online resources
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 

Recently uploaded

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
AlejandraGmez176757
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 

Recently uploaded (20)

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 

Project update: A collaborative approach to "filling the digital preservation gap" for Research Data Management

  • 1. Project update: A collaborative approach to “filling the digital preservation gap” for Research Data Management Julie Allinson Technology Development Manager Library & Archives University of York 6 November 2015
  • 2. Filling the digital preservation gap: Project aim “…to investigate Archivematica and explore how it might be used to provide digital preservation functionality within a wider infrastructure for Research Data Management.”
  • 3. This is a collaboration University of Hull: • Chris Awre – Head of Information Services, Library and Learning Innovation • Richard Green – Independent Consultant • Simon Wilson – University Archivist University of York: • Julie Allinson – Technology Development Manager • Jen Mitcham – Digital Archivist Artefactual Systems Jisc
  • 4. Project structure • Phase 1 – explore: testing, research, thinking -produce a report (3 months) • Phase 2 – develop: make Archivematica better for RDM, plan implementation (4 months) • Phase 3 – implement: set up proof of concepts at York and Hull (6 months)
  • 5. Phase 1: Read all about it! http://digital-archiving.blogspot.co.uk/
  • 6. Why do we need digital preservation for research data? • There is a digital preservation gap in current RDM infrastructures • We can’t ignore digital preservation – moving targets for data retention mean we need to take this seriously • Funder requirements around retention
  • 7. University of York RDM questionnaire 2013 • Which data management issues have you come across in your research over the last five years? – “Inability to read files in old software formats on old media or because of expired software licences” – 24% of 181 researchers who answered this question admitted this had been a problem for them Why do we need digital preservation for research data?
  • 8. Why Archivematica? “The goal of the Archivematica project is to give archivists and librarians with limited technical and financial capacity the tools, methodology and confidence to begin preserving digital information today.”
  • 9. Why Archivematica? • Standards-based • Open Source • Flexible and customisable • Compatible with hundreds of file formats • Advanced search and storage management • Integrated with third-party systems From https://ww.archivematica.org/en/
  • 10. Archivematica for RDM? • Flexible - can support different institutional needs and workflows • Automates many digital preservation tasks • Can be integrated with other systems • Good for those with limited resources • Enhancements driven by and for the digital preservation community
  • 11. Archivematica for RDM? It gives institutions greater confidence that they will be able to continue to provide access to usable copies of research data over time
  • 12. Phase 2: Improving Archivematica 1. Deliverable 1: Automated DIP regeneration 2. Deliverable 2: METS parsing tools 3. Deliverable 3: Generic search REST API (proof-of-concept) 4. Deliverable 4: Support multiple checksum algorithms 5. Deliverable 5: Enhance PRONOM integration 6.Deliverable 6: Automation tools documentation
  • 13. Deliverable One ✓Research Data needs to be kept, but we don’t know if anyone will ever want it and it might be *massive* The Solution: enable the DIP to be generated ‘on request’ and not as part of the initial ingest
  • 14. Deliverable Two ✓We want to be able to grab the DIP, and metadata about it for pulling into our repository The Solution: a library to help with parsing and creating METS files https://github.com/artefactual-labs/mets-reader-writer
  • 15. Deliverable Three ✓We want to be able to report on what we have The Solution: a search API to answer basic questions about the number of files in storage, their formats, date of ingest, etc.* * we’re working with DMAOnline @lancaster
  • 16. Deliverable Four ✓With large datasets, the current checksum mechanism in Archivematica could be a bottleneck The Solution: support for multiple checksum algorithms
  • 17. Deliverable Five ✓What about all those file formats that Archivematica can’t identify? The Solution: mechanism for running file identification with multiple tools and a report of unidentified formats, working with PRONOM to improve their coveraage
  • 18. Deliverable Six ✓We want to make it easier for Institutions to adopt archivematica The Solution: documentation and screencasts for Archivematica automation tools, eg. https://wiki.archivematica.org/Getting_started#Installation
  • 19. All of these new features will become part of the core Archivematica code in 2016
  • 20. Phase 3 • The plan is to run a third phase of the project to: ✓implement prototype RDM workflows with preservation using the new Archivematica features at York and Hull ✓use the search API to populate DMAOnline with stats about datasets ✓do more community outreach • We will be pitching to Jisc in December for phase three #fingerscrossed
  • 21. How do York plan to use Archivematica? Pure RDMonitor Archivematica AIP AIP Store PURE Web Services Archivematica REST API DIPRepository Data Catalogue Key: human to human machine to machine human to machine
  • 22.
  • 23. Where to find out more http://www.york.ac.uk/borthwick/
  • 24. The Bigger Picture •Jisc are looking at building shared services for RDM • Our project is inputting into the specification and discussion • One area we’d be interested to find out more about is the appetite for ‘above campus’ options - discussion planned for later.
  • 25. How could you use Archivematica? • Host it in-house and link it to an existing repository/access system (for example DSpace, CONTENTdm, Fedora/Hydra ...or a CRIS) • Host it in-house and use as a standalone system (you would need to have a storage system in place and establish a way of facilitating access to the data) • Sign up for a hosted instance of Archivematica with archivesDIRECT (combines Archivematica with DuraCloud storage) • Sign up for a hosted instance of Archivematica with Arkivum (combines Archivematica with Arkivum storage)
  • 26. Thanks! julie.allinson@york.ac.uk Useful links: Borthwick website: http://www.york.ac.uk/borthwick/ Digital archiving blog: http://digital-archiving.blogspot.co.uk/ Archivematica: https://www.archivematica.org/en/ Report: http://dx.doi.org/10.6084/m9.figshare.1481170