Preserving the Fruit of Our Labor: Establishing Digital Preservation Policies and Strategies at the University of Houston Libraries. Santi Thompson, Annie Wu, Drew Krewer, Mary Manning and Rob Spragg
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
To develop a comprehensive digital preservation program for maintaining long-term access to the Libraries’ digital assets and align our practices with national standards and guidelines, the University of Houston (UH) Libraries formed the Digital Preservation Task Force (DPTF) to assess previous digital preservation practices and make recommendations on future efforts. This paper outlines the methodology used, including the task force’s use of existing models and evaluation criteria, to successfully generate new policies and select Archivematica as our system to process and preserve our digital assets. It concludes with recommended strategies for the implementation of the policies and preservation operations.
The University of Hertfordshire (UH) implemented a new
commercial Resource Discovery Service at the same time as it
changed to the Koha Open Source Library Management System. In doing so it moved away from using Google Scholar, as its main platform, at a time when many universities are deciding to only use Google Scholar. Hear about the debate between commercial and non-commercial services and why UH made the decisions it did. After 18 months was it the right decision? What has been the impact on library services and library users?
**Researcher engagement resources: a demonstration**
*Rosie Higman, University of Cambridge/Manchester, Hardy Schwamm, Lancaster University*
Research Data Network
University of St Andrews journal hosting serviceStAndrewsUniLib
Presented as part of an Open Access Week event hosted by University of St Andrews Library: 'The humanities and open access: opportunities and challenges'.
In autumn 2013 the University of Derby Library launched its
discovery service – Library Plus – exclusively for HE students.
Two years later the Library launched a second discovery service
– Discover – for FE and Access students. This presentation will
describe the creation of Discover, the problems encountered
during implementation, and the successes and lessons learnt
from introducing a discovery tool in an FE institution. Discover
presented the Library with new opportunities to look at the
functionality of Library Plus, and how to promote it more
effectively to HE students. It also highlights the challenges of
maintaining two similar but altogether different systems for the
needs of an increasingly diverse student body.
The University of Hertfordshire (UH) implemented a new
commercial Resource Discovery Service at the same time as it
changed to the Koha Open Source Library Management System. In doing so it moved away from using Google Scholar, as its main platform, at a time when many universities are deciding to only use Google Scholar. Hear about the debate between commercial and non-commercial services and why UH made the decisions it did. After 18 months was it the right decision? What has been the impact on library services and library users?
**Researcher engagement resources: a demonstration**
*Rosie Higman, University of Cambridge/Manchester, Hardy Schwamm, Lancaster University*
Research Data Network
University of St Andrews journal hosting serviceStAndrewsUniLib
Presented as part of an Open Access Week event hosted by University of St Andrews Library: 'The humanities and open access: opportunities and challenges'.
In autumn 2013 the University of Derby Library launched its
discovery service – Library Plus – exclusively for HE students.
Two years later the Library launched a second discovery service
– Discover – for FE and Access students. This presentation will
describe the creation of Discover, the problems encountered
during implementation, and the successes and lessons learnt
from introducing a discovery tool in an FE institution. Discover
presented the Library with new opportunities to look at the
functionality of Library Plus, and how to promote it more
effectively to HE students. It also highlights the challenges of
maintaining two similar but altogether different systems for the
needs of an increasingly diverse student body.
Presentation covering the Welsh Ebooks. Part of the Digital literacy in an e-world 2008: the 8th Annual E-Books Conference which took place Thu 30 Oct 2008, delivered by Dr Paul Riley, University of Wales Institute, Cardiff.
The Open to Open Access (O2OA) project, Miggie Pickton, University of Northam...Repository Fringe
The Open to Open Access (O2OA) project, Miggie Pickton, University of Northampton. Presented as part of Repository Fringe 2014, 30-31st July 2014, in Edinburgh.
This session will focus on the development of digital leadership
skills for librarians in the area of collection management.
Within this context digital leadership refers to leadership as a
responsibility as opposed to a role. It will demonstrate a digital
leadership model that can be reused in different work contexts
and the use of online training to develop core competencies.
Illuminating shifting perspectives and roles: the emergence of library staff ...Jen Green
Successfully managing an institution’s scholarly output and meeting the present-day needs of scholars means shifting many aspects of library work. A key driver is that the fundamental concept of digital content management has evolved to include not only resources that are purchased, but also content created locally, found openly, and connected globally.
The Dartmouth Academic Commons (DAC), an open repository, will facilitate long-term public representation of Dartmouth’s scholarly output. Librarians and library staff who collect manage, preserve, and present resources are deeply involved through an evolving process of broadening the scope and nature of collection management work.
Building a Community for Research Data Services: CLIR/DLF E-Research Peer Net...Inna Kouper
Panel at the Digital Library Federation forum, October 27, 2014.
Authors: Chris Kollen (U of Arizona), Sarah Williams (U of Illinois at Urbana-Champaign), Mayu Ishida (U of Manitoba), Kathleen Fear (U of Rochester), Inna Kouper (Indiana U), Kendall Roark (U of Alberta)
LIBER Strategy for libraries and research dataJeannette Frey
A presentation about the new LIBER (Ligue Européenne des bibliothèques de recherche) Strategy 2018-2022, with a special focus on the question of research data.
Regina Raboin introduces the New England Collaborative Data Management Curric...Donna Kafel
Presentation about the New England Collaborative Data Management Curriculum by Regina Raboin of Tufts Tisch Library. This presentation was given at the 2014 ALA ALCTS meeting.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
An increasing amount of scientific work is performed in silico, such that the entire process of investigation, from experiment to publication, is performed by computer. Unfortunately, this has made the problem of scientific reproducibility even harder, due to the complexity and imprecision of specifying and recreating the computing environments needed to run a given piece of software. Here, we consider from a high level what techniques and technologies must be put in place to allow for the accurate preservation of the execution of software. We assume that there exists a suitable digital archive for storing digital objects; what is missing are frameworks for precisely specifying, assembling, and executing software with all of its dependencies. We discuss the fundamental problems of managing implicit dependencies and outline two broad approaches: preserving the mess, and encouraging cleanliness. We introduce three prototype tools for preserving software executions: Parrot, Umbrella, and Prune.
Presentation covering the Welsh Ebooks. Part of the Digital literacy in an e-world 2008: the 8th Annual E-Books Conference which took place Thu 30 Oct 2008, delivered by Dr Paul Riley, University of Wales Institute, Cardiff.
The Open to Open Access (O2OA) project, Miggie Pickton, University of Northam...Repository Fringe
The Open to Open Access (O2OA) project, Miggie Pickton, University of Northampton. Presented as part of Repository Fringe 2014, 30-31st July 2014, in Edinburgh.
This session will focus on the development of digital leadership
skills for librarians in the area of collection management.
Within this context digital leadership refers to leadership as a
responsibility as opposed to a role. It will demonstrate a digital
leadership model that can be reused in different work contexts
and the use of online training to develop core competencies.
Illuminating shifting perspectives and roles: the emergence of library staff ...Jen Green
Successfully managing an institution’s scholarly output and meeting the present-day needs of scholars means shifting many aspects of library work. A key driver is that the fundamental concept of digital content management has evolved to include not only resources that are purchased, but also content created locally, found openly, and connected globally.
The Dartmouth Academic Commons (DAC), an open repository, will facilitate long-term public representation of Dartmouth’s scholarly output. Librarians and library staff who collect manage, preserve, and present resources are deeply involved through an evolving process of broadening the scope and nature of collection management work.
Building a Community for Research Data Services: CLIR/DLF E-Research Peer Net...Inna Kouper
Panel at the Digital Library Federation forum, October 27, 2014.
Authors: Chris Kollen (U of Arizona), Sarah Williams (U of Illinois at Urbana-Champaign), Mayu Ishida (U of Manitoba), Kathleen Fear (U of Rochester), Inna Kouper (Indiana U), Kendall Roark (U of Alberta)
LIBER Strategy for libraries and research dataJeannette Frey
A presentation about the new LIBER (Ligue Européenne des bibliothèques de recherche) Strategy 2018-2022, with a special focus on the question of research data.
Regina Raboin introduces the New England Collaborative Data Management Curric...Donna Kafel
Presentation about the New England Collaborative Data Management Curriculum by Regina Raboin of Tufts Tisch Library. This presentation was given at the 2014 ALA ALCTS meeting.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
An increasing amount of scientific work is performed in silico, such that the entire process of investigation, from experiment to publication, is performed by computer. Unfortunately, this has made the problem of scientific reproducibility even harder, due to the complexity and imprecision of specifying and recreating the computing environments needed to run a given piece of software. Here, we consider from a high level what techniques and technologies must be put in place to allow for the accurate preservation of the execution of software. We assume that there exists a suitable digital archive for storing digital objects; what is missing are frameworks for precisely specifying, assembling, and executing software with all of its dependencies. We discuss the fundamental problems of managing implicit dependencies and outline two broad approaches: preserving the mess, and encouraging cleanliness. We introduce three prototype tools for preserving software executions: Parrot, Umbrella, and Prune.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
Creation and improvement of tools for digital preservation is a difficult task without an established way to assess any progress in their quality. This happens due to low presence of solid evidence and a lack of accessible approaches to create such evidence. Software benchmarking, as an empirical method, is used in various fields to provide objective evidence about the quality of software tools. However, digital preservation field is still missing a proper adoption of that method. This paper establishes a theory of benchmarking of tools in digital preservation as a solid method for gathering and sharing the evidence needed to achieve widespread improvements in tool quality. To this end, we discuss and synthesize literature and experience on the theory and practice of benchmarking as a method and define a conceptual framework for benchmarks in digital preservation. Four benchmarks that address different digital preservation scenarios are presented. We compare existing reports on tool evaluation and how they address the main components of benchmarking, and we discuss the question of whether the field possesses the right combination of social factors that make benchmarking a promising method at this point in time. The conclusions point to significant opportunities for collaborative benchmarks and systematic evidence sharing, but also several major challenges ahead.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract: Web resources are increasingly interactive, resulting in resources that are increasingly difficult to archive. The archival difficulty is based on the use of client-side technologies (e.g., JavaScript) to change the client-side state of a representation after it has initially loaded. We refer to these representations as deferred representations. We can better archive deferred representations using tools like headless browsing clients. We use 10,000 seed Universal Resource Identifiers (URIs) to explore the impact of including PhantomJS – a headless browsing tool – into the crawling process by comparing the performance of wget (the baseline), PhantomJS, and Heritrix. Heritrix crawled 2.065 URIs per second, 12.15 times faster than PhantomJS and 2.4 times faster than wget. However, PhantomJS discovered 531,484 URIs, 1.75 times more than Heritrix and 4.11 times more than wget. To take advantage of the performance benefits of Heritrix and the URI discovery of PhantomJS, we recommend a tiered crawling strategy in which a classifier predicts whether a representation will be deferred or not, and only resources with deferred representations are crawled with PhantomJS while resources without deferred representations are crawled with Heritrix. We show that this approach is 5.2 times faster than using only PhantomJS and creates a frontier (set of URIs to be crawled) 1.8 times larger than using only Heritrix.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
In this paper, we describe an OAIS aligned data model and architectural design that enables us to archive digital information with a single core preservation workflow. The data model allows for normalization of metadata from widely varied domains to ingest and manage the submitted information utilizing only one generalized toolchain and be able to create access platforms that are tailored to designated data consumer communities. The design of the preservation system is not dependent on its components to continue to exist over its lifetime, as we anticipate changes both of technology and environment. The initial implementation depends mainly on the open-source tools Archivematica, Fedora/Islandora, and iRODS.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
Researchers in information science are placing increased attention on data reuse and on what must be preserved with that data to enable meaningful use by scholars within and across disciplines. Although the focus has been on scientific or quantitative data, this paper expands the discussion to qualitative data – specifically digital video records of practice in the field of education. This is an interesting case because researchers and diverse education professionals are interested in reusing this content, though their needs differ. We focus on three issues that raise challenges for preservation and access: file format, context, and dissemination.
Educational Records of Practice: Preservation and Access Concerns. Elizabeth ...
Similar to Preserving the Fruit of Our Labor: Establishing Digital Preservation Policies and Strategies at the University of Houston Libraries. Santi Thompson, Annie Wu, Drew Krewer, Mary Manning and Rob Spragg
RDM Roadmap to the Future, or: Lords and Ladies of the DataRobin Rice
Story of the new 2017-2020 University of Edinburgh RDM Roadmap, with a Tolkienesque theme for IASSIST-CARTO 2018 in Montreal: "Once upon a data point: sustaining our data storytellers".
Are you interesting in offering data management services at your library but aren’t sure where to start? Then this class is for you! During this session, we will
• Outline the data management topics that are commonly offered in libraries
• Present strategies for how to determine what services might be most useful on your campus and create synergistic partnerships with other university entities
• Dive into how to offer support with data management plans
• Present a case study for using an institutional repository to archive and share research data
• Identify additional training opportunities and open educational resources you can use to develop robust DM services
The class will consist of a mix of presentations, hands on activities, and discussion. So come ready to participate!
Jenny Mitcham from the University of York and Chris Awre from the University of Hull share lessons learned from their project to explore the potential of the digital preservation solution Archivematica to help manage research data that academics within the University produce. The project 'Filling the Digital Preservation Gap' has been carried out with funding from Jisc as part of their Research Data Spring program and was a collaboration of the University of York and the University of Hull. The project did not only explore Archivematica as a possible solution but also how it could integrate with the repositories and other systems for the management of research data.
The Series is jointly sponsored by ANDS and CAUL.
Libraries and Research Data Management – What Works? Summary of a Pre-Survey.LIBER Europe
This presentation by Rob Grim was given at the Scholarly Communication and Research Infrastructures Steering Committee Workshop. The workshop title was Libraries and Research Data Management – What Works?
The benefits and challenges of open access: lessons from practice - Helen Bla...Jisc
Led by Helen Blanchett, subject specialist, scholarly communications, Jisc.
With contribution from Andrew Simpson, associate university librarian (procurement and metadata and systems), Portsmouth University.
In this session you’ll hear in this session you’ll hear about the benefits and challenges of open access.
Connect more in London, 28 June 2016
Similar to Preserving the Fruit of Our Labor: Establishing Digital Preservation Policies and Strategies at the University of Houston Libraries. Santi Thompson, Annie Wu, Drew Krewer, Mary Manning and Rob Spragg (20)
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
The role of the National Archives of Australia is to promote the creation, management and preservation of authentic, reliable and usable Commonwealth government records and enable ongoing public access to the archival resources of the Commonwealth. Records that are created by Commonwealth government agencies and transferred to the National Archives are, of course, predominately digital. Digital records bring a range of challenges, but they also potentially present new opportunities in the way archives can conduct their business. This paper outlines a project currently underway at the National Archives, named Project Chrysalis, which is an end-to-end business system that aims to transform the way in which the Archives does its digital business. Project Chrysalis represents not just a technical solution, but also significant business change for the National Archives. However, if implemented successfully, the project should enable the Archives to sustainably harvest, preserve and provide access to digital records in the information age.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
The DataNet Federation Consortium uses a policy-based data management system to apply and enforce preservation requirements. This paper describes the Preservation Policy Toolkit developed by the consortium. In particular, the paper describes the infrastructure needed for preservation, presents examples of computer actionable forms of policies, and provides a generic template for designing actionable preservation policies.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
Memory institutions have already collected a large number of digital objects, predominantly CD-ROMs. Some of them are already inaccessible with current systems, and most of them will be soon. Emulation offers a viable strategy for long-term access to these publications. However, these collections are huge and the objects are missing technical metadata to setup a suitable emulated environment. In this paper we propose a pragmatic approach to technical metadata which we use to implement a characterization tool to suggest a suitable emulated rendering environment.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
This paper describes some of the challenges the National Library of New Zealand has faced in our efforts to maintain the authenticity of born digital collection items from first transfer to the Library through ingest into our digital preservation system. We assume that assuring the authenticity and integrity of digital objects means preserving the binary objects plus metadata about the objects. We discuss the efforts and challenges of the Library to preserve contextual metadata around the binary object, in particular filenames and file dates. We discuss these efforts from the two perspectives of the digital archivist and the digital preservation analyst, and how these two perspectives inform our current thinking.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
The National Library of France (BnF) has recently implemented a new module for its Scalable Preservation and Archiving Repository (SPAR) to set up preservation strategies based on formats, agents, workflows, tools and tests, and managed as reference packages in the Archive. This module aims to fulfill an objective: for SPAR to be fully self-documented. Formats, agents and workflows are formally described and preserved along with the Information packages in which such elements are involved. Although this was a feature that was included from the beginnings of SPAR, the new Preservation Planning module aims to provide a tool that can more easily build these reference packages and that will more closely involve domain experts and the IT department in the processes of preservation planning. But the main innovation lies in the documentation of decisions that directed their selection as standards in SPAR: test data are now preserved as a new kind of reference package.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
We describe a hybrid approach for access to digital objects contained within forensic disk images extracted from physical media. This approach includes the use of emulation-as-a-service (EaaS) to provide web-accessible virtual environments for materials that may not render or execute accuratelyon modern hardware and software, and the use of digital forensics software libraries to produce web-accessible file system views to support single-file access and provide visualizations of the file system.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
This paper describes how the E-ARK project (European Archival Records and Knowledge Preservation) aims to develop an
overarching methodology for curating digital assets. This methodology must address business needs and operational issues, proposing a technical wall-to-wall reference implementation for the core OAIS flow – Ingest, Archival Storage and Access. The focal point of the paper is the Access part of the OAIS flow. The paper first lays out the access vision of the E-ARK project, and secondly describes the method employed to enable information processing and to pin-point the functional and non-functional requirements. These requirements will allow the E-ARK project to create a standardized format for the Dissemination Information Package (DIP), and to develop the access tools that will process this format. The paper then proceeds to describe the actual DIP format before detailing what the access solution will look like, which tools will be developed and, not least, why the E-ARK Access system will be used and work.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
In this paper, we describe the development of a file format migrations framework at Harvard Library, using one migration
case study, Kodak PhotoCD images, to demonstrate implementation of the framework.
Paper presented at the 12th International Conference on Digital Preservation, November 2-6, 2015. University of North Carolina at Chapel Hill.
Abstract:
Digital curation is a complex of actors, policies, practices, and technologies enabling successful consumer engagement with
authentic content of interest across space and time. While digital curation is a rapidly maturing field, it still lacks a convincing unified theoretical foundation. A recent internal evaluation of its programmatic activities by the University of California Curation Center (UC3) led quickly to seemingly simple, yet deceptively difficult-to-answer questions. Too many fundamental terms of curation practice remain overloaded and under-formalized, perhaps none more so than “digital object.” To address these concerns, UC3 is developing a new model for conceptualizing the curation domain. While drawing freely from
many significant prior efforts (e.g., Kahn-Wilensky, FRBR, NAA, OAIS, BRM, etc.), the UC3 Sept model also assumes that digital curation is an inherently semiotic activity. Consequently, the model considers curated content with respect to six distinct analytic dimensions: semantics, syntactics, empirics, pragmatics, diplomatics, and dynamics, which refer respectively to content’s underlying abstract meaning or emotional affect, symbolic encoding structures, physical representations, realizing behaviors, evidential authenticity and reliability, and evolution through time. Correspondingly, the model defines an object typology of increasing consumer utility: blobs, artifacts, exemplars, products, assets, records, and heirlooms, which are respectively existential, intentional, purposeful, interpretable, useful, trustworthy, and resilient digital objects. Content engagement is modeled in terms of producer, owner, manager, and consumer roles acting within a continuum of concerns for originating, organizing, and pluralizing curated content. Content policy and strategy are modeled in terms of six high-level imperatives: predilect, collect, protect, introspect, project, and connect. A consistent, comprehensive, and conceptually parsimonious domain model is important for planning, performing, and evaluating programmatic activities in a rigorous and systematic rather than ad hoc and idiosyncratic manner. The UC3 Sept model can be used to make precise yet concise statements regarding curation intentions, activities, and results.
More from 12th International Conference on Digital Preservation (iPRES 2015) (9)
0x01 - Newton's Third Law: Static vs. Dynamic AbusersOWASP Beja
f you offer a service on the web, odds are that someone will abuse it. Be it an API, a SaaS, a PaaS, or even a static website, someone somewhere will try to figure out a way to use it to their own needs. In this talk we'll compare measures that are effective against static attackers and how to battle a dynamic attacker who adapts to your counter-measures.
About the Speaker
===============
Diogo Sousa, Engineering Manager @ Canonical
An opinionated individual with an interest in cryptography and its intersection with secure software development.
Have you ever wondered how search works while visiting an e-commerce site, internal website, or searching through other types of online resources? Look no further than this informative session on the ways that taxonomies help end-users navigate the internet! Hear from taxonomists and other information professionals who have first-hand experience creating and working with taxonomies that aid in navigation, search, and discovery across a range of disciplines.
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Orkestra
UIIN Conference, Madrid, 27-29 May 2024
James Wilson, Orkestra and Deusto Business School
Emily Wise, Lund University
Madeline Smith, The Glasgow School of Art
This presentation by Morris Kleiner (University of Minnesota), was made during the discussion “Competition and Regulation in Professions and Occupations” held at the Working Party No. 2 on Competition and Regulation on 10 June 2024. More papers and presentations on the topic can be found out at oe.cd/crps.
This presentation was uploaded with the author’s consent.
This presentation, created by Syed Faiz ul Hassan, explores the profound influence of media on public perception and behavior. It delves into the evolution of media from oral traditions to modern digital and social media platforms. Key topics include the role of media in information propagation, socialization, crisis awareness, globalization, and education. The presentation also examines media influence through agenda setting, propaganda, and manipulative techniques used by advertisers and marketers. Furthermore, it highlights the impact of surveillance enabled by media technologies on personal behavior and preferences. Through this comprehensive overview, the presentation aims to shed light on how media shapes collective consciousness and public opinion.
Acorn Recovery: Restore IT infra within minutesIP ServerOne
Introducing Acorn Recovery as a Service, a simple, fast, and secure managed disaster recovery (DRaaS) by IP ServerOne. A DR solution that helps restore your IT infra within minutes.
Announcement of 18th IEEE International Conference on Software Testing, Verif...
Preserving the Fruit of Our Labor: Establishing Digital Preservation Policies and Strategies at the University of Houston Libraries. Santi Thompson, Annie Wu, Drew Krewer, Mary Manning and Rob Spragg
1. Preserving the Fruit of Our Labor:
Establishing Digital Preservation
Policies and Strategies at the
University of Houston Libraries
UH Libraries Digital Preservation Task Force
Santi Thompson (Chair)
Annie Wu (Vice Chair)
Drew Krewer
Mary Manning
Rob Spragg
3. Setting the Scene
• Inconsistent practices
• Transitional period
• Leverage expertise
4. Task Force Charge
• Define Scope
• Articulate Priorities and Policies
• Determine Resources
• Align with TDL
• Recommend Next Steps
5. Action Plan for Developing
a Digital Preservation
Program
• Organizational Infrastructure
• Technological Infrastructure
• Resources Framework
6. Digital Preservation Policy Framework
Purpose
Objectives
Mandate
Scope
Challenges
Principles
Roles and Responsibilities
Collaboration
Selection and Acquisition Criteria
Access/Use Criteria
Review Cycle (added)
Policies and Procedures
Roles and Responsibilities
Digital Assets
Digital Preservation Strategies
Technological Infrastructure
Digital Archive Operations
Platform Requirements/Procedures
Technological Infrastructure
OAIS Reference Model
Information Packages
Functional Entities
Digital Preservation Policy
8. Testing the Systems
What have others done before us?
1. Preserving (Digital) Objects with Restricted
Resources (POWWR) Project
2. The National Library of Medicine (NLM) Digital
Repository Evaluation and Selection Working Group
9. The POWRR White Paper
“From Theory to Action: ‘Good
Enough’ Digital Preservation Solutions
for Under-Resourced Cultural Heritage
Institutions”
http://commons.lib.niu.edu/bitstream/10843/13610/1/FromTheoryToAction_PO
WRR_WhitePaper.pdf
11. NLM Consolidated Repository
Test Plan
Prepared by the NLM Digital Repository
Evaluation and Selection Working Group
http://www.nlm.nih.gov/digitalrepository/Consolidated-DR-Testplan-
Template.xls
● Ingest
● Archival Storage
● Metadata
● Additional Technical Infrastructure
13. Results
Advantages Disadvantages
Sustained by active user
community
More complex system
maintenance
Supports complex archival
workflow
Limited storage locations
Aligns with standards and
best practices
Lacks robust reporting and
notification service
14. TDL Storage Services
… Align priorities with digital preservation
standards, best practices, and Texas Digital
Library (TDL) storage services
15. Project Benchmarks
Date Task Force Activity
April 5, 2015 Task force released draft of final report and DP
Policies to internal UH Libraries’ stakeholders for
public comment
May 15, 2015 Task force submitted final report, DP Policies, and
proposed budget to Library Administration
September 16,
2015
Approved by Library Administration; DP
implementation began
19. Image Credits
● Auntie P, "Jigsaw," CC BY-NC-SA 2.0, https://flic.kr/p/8utKeQ
● Bunches and Bits {Karina}, "Reading," CC BY-NC-ND 2.0, https://flic.kr/p/7fRqZP
● Chlot’s Run, “Puzzle Time,” CC BY 2.0, https://flic.kr/p/aZ8r7P
● Scott Maxwell, “Working Together Teamwork Puzzle Concept,” CC BY-SA 2.0, https://flic.kr/p/4fUsNL
● Poppen, “OAIS Functional Entities,”
http://en.wikipedia.org/wiki/Open_Archival_Information_System#/media/File:OAIS-.gif
● Jonathan Vaughan, “Puzzle Nerd Blog”, https://puzzlenerd.wordpress.com/assembly-piece-by-piece
● Mitchelangelo, “Mixte Madness,” CC BY 2.0, https://flic.kr/p/eLxZ9i
● Horla Varlan, “Fat exclamation mark made from jigsaw puzzle pieces,” CC BY 2.0, https://flic.kr/p/7x9bSS
● ---, “Question mark made of puzzle pieces,” CC BY 2.0, https://flic.kr/p/7vB7fR
Inconsistent Practices:
Some workflows followed process outlined in “Implementing METS, MIX, and DC for Sustaining Digital Preservation at the University of Houston Libraries” MINGYU CHEN and MICHELE REILLY; Some objects had technical metadata generated with METS records;
Data located in various network drives; some of which the originals were backed up to a secure, access-restricted network drive.
Transitional Period: UH Libraries reorganized its digitization and repository units during the first six months of 2014. Great opportunity to create new policies and services around area that we have not addressed
Leverage Expertise: UH Libraries acquired new talent and reassigned other personnel -- both of which brought new digital preservation experience and skills to the table
Image Citation Information: https://flic.kr/p/aZ8r7P
Defined the policy’s scope and levels of preservation
Articulated digital preservation priorities by outlining current practices, identifying preservation gaps and areas for improvement, and establishing goals to address gaps – CREATION OF UH Libraries DP Policy
Determined the tools, infrastructure, and other resources needed to address unmet needs and to sustain preservation activities in the future – SELECTION OF PRESERVATION TOOL
Aligned priorities with digital preservation standards, best practices, and Texas Digital Library (TDL) storage services
Recommended roles, responsibilities, and next steps for implementing the strategy and policy
May 2014 to May 2015
Image Citation Information: https://flic.kr/p/8BPU1e
Digital Preservation Management is a workshop aimed at organizations that are looking to respond to digital preservation challenges.
Taught by Nancy McGovern and Kari Smith from MIT Libraries, though it was also developed at Cornell University Libraries.
The core of the workshop centers around the Action Plan for Developing a Digital Preservation Program, which consists of three parts: Organizational Infrastructure, Technological Infrastructure, and Resources Framework.
The organizational infrastructure offers both a general context of the program as well as more narrow operations, processes, and short-term implementation plans
The technological infrastructure uses the OAIS model as a framework to aid in system selection and to develop pertinent procedures surrounding specific functions of the chosen preservation system.
The resources framework outlines startup, ongoing, and contingency costs.
This is the form our digital preservation policy took after attending the workshop and following the action plan.
It follows the DPM workshop model pretty faithfully, though there are some areas we either added or omitted. I can talk more about this later if others are interested
Added a “Review Cycle” to the Policy Framework to demonstrate a commitment to revisiting document
Temporarily omitted a section called “Plans and Strategies”; we are developing that piece through a Digital Preservation Team, which Santi Thompson will speak about later.
Added “Content Selection Strategies for Cloud-Based Storage” under the Preservation Planning functional entity in case upcoming AV and large-scale projects dramatically affect storage priorities over time.
We have drafted the Resources Framework but have chosen to share that portion of the document with administration only.
Digital Assets category under “Policies and Procedures” contains: Quality Creation & Benchmarking, Selection and Acquisition Policies, Transfer Requirements and Deposit Guidelines, and Access and Use Policies
Functional Entities category under “Technological Infrastructure” contains: Pre-ingest, Common Services, Ingest, Archival Storage, Data Management, Administration, Preservation Planning, and Access
Instead of inventing the wheel, we researched to see what others had done ahead of us. We used two white papers extensively: one from the POWRR Project, the other from the NLM Digital Repository Evaluation and Selection Working Group.
From 2012-2014, the Digital POWRR Project, an Institute of Museum and Library Services (IMLS)-funded study investigated, evaluated, and recommended scalable, sustainable digital preservation solutions for libraries with smaller amounts of data and/or fewer resources
The group researched and produced a white paper that evaluated the following tools: Archivematica, Curator’s Workbench, DuraCloud, MetaArchive, Preservica, Internet Archive
Using the POWWR Project research, we were able to narrow the field of digital preservation tools to Rosetta, Archivematica, and Preservica.
Ex Libris’s DAMS was under consideration for testing by a UH DAMS Task Force that was selecting a DAMS while we were selecting a digital preservation tool, so we also looked at Ex Libris’s Rosetta.
Rosetta came in for a day and presented to both Task Forces. However, when the DAMS task force ruled out Ex Libris’s tool for several reasons, including costs, the Digital Preservation Task Force also ruled it out.
This left Preservica and Archivematica. We had initial discussions with both vendors. In the end, Preservica was cost prohibitive. Additionally, we wanted to use a METS wrapper and were dissatisfied with their proprietary wrapper.
This left Archivematica to test—but by what criteria should we evaluate it?
The NLM Digital Repository Evaluation and Selection Working Group (DRESWG) evaluated commercial systems and open source software. The Working Group developed a set of “Master Evaluation Criteria,”.
Their evaluation tool tested in these areas:
· Ingest, etc. (see slide)
Additional Technical Infrastructure Requirements included testing criteria such: can the system respond to OAI-PMH requests, does it confirm to Z39.50 and Z39.87 NISO standards, does it support Unicode, etc.
We tested Archivematica using the tool modified from NLM and afterward, had Archivematica come to UH for a day’s consultation to answer remaining questions and to discuss how their tool is interoperable with other content management systems and DAMS: DSpace, Content DM, ArchivesSpace, DuraCloud, etc.
Advantages
OAIS compliant, which conforms to digital preservation standards and best practices. It uses open source solutions to perform digital preservation activities.Supports the ingest and technical metadata extraction of a wide array of file formats
Actively developed and has a robust, active user community
Supports versioning through the adoption of the Archival Information Collection (AIC)
Records digital preservation events and places this information into METS record as PREMIS metadata and also in log files
Intuitive interface that makes it easy for administrators to customize rules, settings, and workflows; tracks workflow in a transparent way
Supports complex archival workflows with multiple users having access, if desired
Ability to integrate with other digital asset management systems, including CONTENTdm, Fedora, ATOM, and DuraSpace
“No-cost” solution with a pay structure for software support and/or customized features
Disadvantages
The combination of tools can make long term system maintenance more complex. This is not a “set it and forget it” platform.
Ingest of descriptive metadata is limited toCSV file or manual input
Lack of robust reporting and notification to assist with digital curation tasks. Additionally, error reports are sometimes cryptic. They are also generated by each micro-service, which can vary in its description of the problem and possible solution.
Can only store objects in one specified location*
Polling and self-healing are not services currently offered**
The roles for users and administrators are limited
*The task force has identified a work-around for this issue.
**This service may be offered in DuraSpace
The DPTF is recommending that UH Libraries selects Archivematica as its DP tool
Because we are members of the Texas Digital Library, we also have access to Duracloud @ TDL. This will allow us to deposit some materials to the Cloud. TDL will also be one of 5 nodes in the Digital Preservation Network.
Image Citation Information: https://flic.kr/p/4fUsNL
Make recommendations to stakeholders and admin RE: User Agreements with DuraCloud and Archivematica
Purchase services
Install Archivematica
Inventory of Materials to be preserved (SC & MDS)
Create "core records" for born digital materials that need to be preserved.
Could start patron digitization "core records" if decision has been made about inclusion of such work in the Digital Library/Preservation.
Library commitment to Digital Preservation
Better understanding of current risks
Identified key players in the process and have articulated their roles and responsibilities
Expanded copies and storage to off-site location
TDL + DuraCloud
Best practices
OAIS Compliant
Guided by TRAC
Image Content Information: https://flic.kr/p/7x9bSS