SlideShare a Scribd company logo
1 of 49
Preserving Content from
Your Institutional
Repository
Wendy C Robertson and Carol Ann Borchert
NASIG, Buffalo, N.Y., June 8 2013
This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
The Signal
http://blogs.loc.gov/digitalpreservation/
“
”
a permanent, institution-wide repository of
diverse, locally produced digital works (e.g.,
article preprints and postprints, data sets,
electronic theses and dissertations, learning
objects, and technical reports) that is available for
public use and supports metadata harvesting.
University of Houston Libraries, Institutional Repository Task Force. Institutional Repositories. SPEC
Kit 292. July 2006. p.13
An institutional repository is…
An institutional repository is not…
Most IRs currently are not preservation
repositories; they do not meet all the criteria
in Trustworthy Repositories Audit &
Certification (TRAC) or other audits.
10 basic characteristics of digital
preservation repositories (CRL)
1. The repository commits to continuing maintenance of
digital objects for identified community/communities.
2. Demonstrates organizational fitness (including
financial, staffing, and processes) to fulfill its
commitment.
3. Acquires and maintains requisite contractual and legal
rights and fulfills responsibilities.
4. Has an effective and efficient policy framework.
10 basic characteristics (cont.)
5. Acquires and ingests digital objects based upon stated
criteria that correspond to its commitments and
capabilities.
6. Maintains/ensures the integrity, authenticity and
usability of digital objects it holds over time.
10 basic characteristics (cont.)
7. Creates and maintains requisite metadata about
actions taken on digital objects during preservation as
well as about the relevant production, access support,
and usage process contexts before preservation.
8. Fulfills requisite dissemination requirements.
9. Has a strategic program for preservation planning and
action.
10.Has technical infrastructure adequate to continuing
maintenance and security of its digital objects.
The year is 2100. Can you read your files?
Our questions for you
• Who has an IR?
• What platform are you using?
• Who’s backing it up?
• Who’s part of a PLN?
• Who’s having their IR journals
preserved in LOCKSS or Portico?
Question mark sign by Colin_K, on Flickr
Localized disasters
Fire
Flood
http://chronicle.com/blogs/wiredcampus/what-katrina-
can-teach-libraries-about-sandy-and-other-disasters/40986
Hurricane
Tornado
http://blog.al.com/spotnews/2011/10/plans_to_rebuild_birmingham_li.html
http://www.ncsml.org/Content/About-Us/Museum-History/2008-Flood.aspx http://news.bbc.co.uk/onthisday/hi/dates/stories/august/1/newsid_2526000/2526839.stm
War
Tsunami
Earthquake
© 2011 UMD Libraries
http://savemlak.jp/wiki/saveMLAK/en?lang=en&uselang=en
http://www.flickr.com/photos/umd_libraries/6075914283/in/set-72157627383474133http://www.bostonglobe.com/business/2013/02/17/the-smuggled-hard-drives-timbuktu/rCyv0QL1FdLLkw4tjv6hDO/story.html
Disasters with warning
Moving servers out
of the University of
Iowa Libraries, 2008.
© 2008 The University of Iowahttp://digital.lib.uiowa.edu/cdm/ref/collection/flood/id/3414
Disasters with no warning
University of South
Florida, very
localized flood
http://lib.usf.edu/offtheshelf/tampa-library/the-flood-of-09dedication-in-the-face-of-disaster/
“
”
Disaster recovery strategies and backup
systems are not sufficient to ensure survival
and access to authentic digital resources over
time. A backup is a short-term data recovery
solution following loss or corruption and
is fundamentally different to an electronic
preservation archive.
JISC. Digital Preservation: Continued Access to Authentic Digital Assets
(November 2006)
Backups vs. preservation
Exit strategy
Make sure you can easily migrate all your
content and metadata out of your system in a
usable format.
Test, test and test some more
Test that all files are as expected regarding
structure and completeness.
Persistent identifiers
Using persistent identifiers now will help if
you move to a new repository in the future.
Preserving the Web
You may want archive
institutional content
that is not
appropriate for an IR
but which is
appropriate for the
library’s mission.
http://dx.doi.org/10.7207/twr13-01
Archive-It
Archive-It can
preserve journals
and other
scholarly work
from your
institution that
doesn’t go into
your repository.
http://archive-it.org/collections/824
Internet Archive
“The Montana State Library
(MSL) last year moved a
copy of its collection of
3000 born digital state
publications to the Internet
Archive (IA).”—Chris
Stockwell for Montana State
Library, 12/29/2010
http://archive.org/post/340223/how-montana-state-
library-uploaded-batches-of-digital-objects-to-the-
internet-archive
http://archive.org/details/MontanaStateLibrary
IRs are a bit different…
The copy of the document in the repository
often is the only version you have.
Access copy vs. preservation copy
Digitized content may have a preservation
scan as well as the version which displays to
the public.
IRs have special problems…
Automatically adding a cover page to brand
and identify content has change the file,
perhaps even removing accessibility features.
File formats
When possible, use open file formats so
content will remain accessible long into the
future, but will you turn down content in
other formats?
PDF/A (ISO 19005-1:2005)
PDF/A is an ISO standard
“which provides a
mechanism for
representing electronic
documents in a manner
that preserves their
visual appearance over
time, independent of
the tools and systems
for creating or rending
the files.”
http://www.pdfa.org/publication/pdfa-in-a-nutshell-2-0/
U Iowa electronic theses & dissertations
1931 PDFs and 7 XML documents
Supplemented by:
21 .avi
1 .avp
8 .doc
2 .mov
2 .mp3
1 .mp4
4 .mpg
1 .mxf
3 .NTS
2 .pde
6 .pdf
4 .txt
3 .wmv
18 .xls
2 .zip
Public preservation policy
Make your
preservation and
submission policy
clear so that
contributors
understand the
risks of
contributing a non-
open format.
http://services.ideals.illinois.edu/wiki/bin/view/IDEALS/PreservationSupportPolicy
Preservation metadata
PREMIS (PREservation Metadata
Implementation Strategies)
“Preservation metadata supports
activities intended to ensure the
long-term usability of a digital
resource.”—Caplan, p.3
http://www.loc.gov/standards/premis/understanding-premis.pdf
“
”
Metadata can help support authenticity by
documenting the digital provenance
of the resource — its chain of custody and
authorized change history.
Caplan, Priscilla. Understanding PREMIS. Library of Congress, ©2009. p.3
Digital provenance
Methods of preserving data
• Refreshing data
• Migrating data
• Emulating software platform
• Replicating
• Validating data integrity
• Metadata
Long-term preservation options
• Global LOCKSS Network
• Private LOCKSS Network
• Portico
Global LOCKSS Network
• For e-journal content
• Preserves the format as well as the content
• Light archive
• Adding journals to LOCKSS
• Notify LOCKSS of metadata/file changes
• Not all serials are appropriate for Global
LOCKSS
Private LOCKSS Network
• All material from the IR
• Need at least 7 nodes/destinations
• Each should be a LOCKSS Alliance member
• Set up policies and governance for the PLN
Setting up policies for a PLN
• How long is initial
commitment?
• How much notice to
withdraw?
• How do members remove
data for withdrawn
institution?
• Does the group need a
governing body or steering
committee?
• Will the PLN be a dark or
light archive?
• Do any of the members
have embargoed
materials?
Examples of PLNs
Portico
• For e-books and e-journals
• Source files converted to an archive
format
• Dark archive
• Portico is responsible for future content
migrations
• Adding journals to Portico
• Not all serials are appropriate for Portico
Factors to consider in developing a formal
preservation plan
• Organizational &
financial commitment
• Stakeholders
• Local backups vs. long-
term preservation
• Storage needs
• Roles & responsibilities
• Data ingestion
• Policy on deletion of or
embargoes for materials
• Funding
• Staff
Organizational & financial commitment
•What is the long-term financial commitment
from your library or institution?
•Do you have the support of the organization?
From what level of administration?
Stakeholders
•Producers
•Users
•Owners
•Managers
•Funding authorities
•Other parties?
Local backups vs. long-term preservation
•Definition of backups versus preservation
•Metadata, content, software, or all of these?
•How often and who is responsible?
•PLN or other option for long-term preservation
Storage needs
Disk space
 How much
space do you
need?
 Who is
responsible for
maintaining
disks?
Software
 Which
software will
be required?
 Who migrates
information as
software needs
change?
Equipment
 What
equipment will
you need?
 Who will fund
the equipment,
set it up,
maintain it?
Roles & responsibilities
•Who is implementing the plan?
•Who is maintaining the data and how?
•Who is providing support for accessing
material and troubleshooting issues?
Data ingestion
•How are you getting data into the system
for preservation or backup?
•Will this be done in-house or outsourced to
a third party?
•How frequently and in what format?
Funding vs. staffing
• Is it easier to fund these efforts at your organization or
staff them?
• How well-staffed is your organization?
• What kind of expertise do you have (or not have) in the
library?
• What level of commitment does your organization have
to preserve digital information?
Questions?
Wendy Robertson
Digital Scholarship Librarian
University of Iowa Libraries
wendy-robertson@uiowa.edu
@wendycr_ Carol Ann Borchert
Coordinator for Serials
University of South Florida Libraries
borchert@usf.edu
Sources
Ball, Alex. Preservation and Curation in Institutional Repositories. Digital
Curation Centre, UKOLN, 2010. Version 1.3
http://www.dcc.ac.uk/sites/default/files/documents/reports/irpc-report-
v1.3.pdf
Caplan, Priscilla. Understanding PREMIS. Library of Congress, ©2009.
http://www.loc.gov/standards/premis/understanding-premis.pdf
Digital Repository Audit Method Based On Risk Assessment (DRAMBORA). Glasgow,
2009. http://www.dcc.ac.uk/resources/repository-audit-and-
assessment/drambora
JISC. Digital Preservation: Continued Access to Authentic Digital Assets (Nov.
2006)
http://www.jisc.ac.uk/publications/briefingpapers/2006/pub_digipreservationb
p.aspx
Sources
Nestor Working Group. Catalogue of Criteria for Trusted Digital Repositories.
Frankfurt am Main, Dec. 2006. Urn: de:0008-2006060703
OpenDOAR Policies Tool. http://www.opendoar.org/tools/en/policies.php
Oettler, Alexandra. PDF/A in a Nutshell 2.0: PDF for long-term archiving. Berlin:
Association for Digital Document Standards e. V., ©2013.
http://www.pdfa.org/wp-
content/uploads/2013/04/PDFA_in_a_Nutshell_21.pdf
Pennock, Maureen. Web-Archiving. DPC Technology Watch Report 12-01 March
2013. DOI: http://dx.doi.org/10.7207/twr13-01
Reference Model for an Open Archival Information System (OAIS). Recommended
Practice CCSDS 650.0-M-2. Magenta Book, June 2012.
http://public.ccsds.org/publications/archive/650x0m2.pdf
Sources
Trustworthy Repositories Audit & Certification: Criteria and Checklist (TRAC).
Version 1.0. Feb 2007. http://www.crl.edu/archiving-preservation/digital-
archives/metrics-assessing-and-certifying/trac
University of Houston Libraries, Institutional Repository Task Force. Institutional
Repositories. SPEC Kit 292. July 2006.
http://publications.arl.org/Institutional-Repositories-SPEC-Kit-292/3
University of Illinois at Urbana-Champaign. “IDEALS Digital Preservation Support
Policy.” ©2013
https://services.ideals.illinois.edu/wiki/bin/view/IDEALS/PreservationSuppor
tPolicy
University of Illinois at Urbana-Champaign. “Preparing Items for Deposit into
IDEALS. File Format Recommendations” ©2013
https://services.ideals.illinois.edu/wiki/bin/view/IDEALS/SubmissionPrep#Fil
e_Format_Recommendations

More Related Content

More from NASIG

The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiadThe Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
NASIG
 

More from NASIG (20)

Calculating how much your University spends on Open Access and what to do abo...
Calculating how much your University spends on Open Access and what to do abo...Calculating how much your University spends on Open Access and what to do abo...
Calculating how much your University spends on Open Access and what to do abo...
 
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
 
Analyzing workflows and improving communication across departments
Analyzing workflows and improving communication across departments Analyzing workflows and improving communication across departments
Analyzing workflows and improving communication across departments
 
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...
Supporting Students: OER and Textbook Affordability Initiatives at a Mid-Size...
 
Access to Supplemental Journal Article Materials
Access to Supplemental Journal Article Materials Access to Supplemental Journal Article Materials
Access to Supplemental Journal Article Materials
 
Communications and context: strategies for onboarding new e-resources librari...
Communications and context: strategies for onboarding new e-resources librari...Communications and context: strategies for onboarding new e-resources librari...
Communications and context: strategies for onboarding new e-resources librari...
 
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
 
Bloomsbury digital resources
Bloomsbury digital resourcesBloomsbury digital resources
Bloomsbury digital resources
 
Web accessibility in the institutional repository crafting user centered sub...
Web accessibility in the institutional repository  crafting user centered sub...Web accessibility in the institutional repository  crafting user centered sub...
Web accessibility in the institutional repository crafting user centered sub...
 
Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries
 
Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration
 
Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?
 
Mapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing ChangeMapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing Change
 
When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020
 
Getting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides ContentGetting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides Content
 
A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...
 
Knowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource ManagementKnowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource Management
 
Practical approaches to linked data
Practical approaches to linked dataPractical approaches to linked data
Practical approaches to linked data
 
Transforming library collections and supporting student learning with collect...
Transforming library collections and supporting student learning with collect...Transforming library collections and supporting student learning with collect...
Transforming library collections and supporting student learning with collect...
 
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiadThe Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
 

Recently uploaded

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Recently uploaded (20)

On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 

Preserving Content from Your Institutional Repository

  • 1. Preserving Content from Your Institutional Repository Wendy C Robertson and Carol Ann Borchert NASIG, Buffalo, N.Y., June 8 2013 This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
  • 3. “ ” a permanent, institution-wide repository of diverse, locally produced digital works (e.g., article preprints and postprints, data sets, electronic theses and dissertations, learning objects, and technical reports) that is available for public use and supports metadata harvesting. University of Houston Libraries, Institutional Repository Task Force. Institutional Repositories. SPEC Kit 292. July 2006. p.13 An institutional repository is…
  • 4. An institutional repository is not… Most IRs currently are not preservation repositories; they do not meet all the criteria in Trustworthy Repositories Audit & Certification (TRAC) or other audits.
  • 5. 10 basic characteristics of digital preservation repositories (CRL) 1. The repository commits to continuing maintenance of digital objects for identified community/communities. 2. Demonstrates organizational fitness (including financial, staffing, and processes) to fulfill its commitment. 3. Acquires and maintains requisite contractual and legal rights and fulfills responsibilities. 4. Has an effective and efficient policy framework.
  • 6. 10 basic characteristics (cont.) 5. Acquires and ingests digital objects based upon stated criteria that correspond to its commitments and capabilities. 6. Maintains/ensures the integrity, authenticity and usability of digital objects it holds over time.
  • 7. 10 basic characteristics (cont.) 7. Creates and maintains requisite metadata about actions taken on digital objects during preservation as well as about the relevant production, access support, and usage process contexts before preservation. 8. Fulfills requisite dissemination requirements. 9. Has a strategic program for preservation planning and action. 10.Has technical infrastructure adequate to continuing maintenance and security of its digital objects.
  • 8. The year is 2100. Can you read your files?
  • 9. Our questions for you • Who has an IR? • What platform are you using? • Who’s backing it up? • Who’s part of a PLN? • Who’s having their IR journals preserved in LOCKSS or Portico? Question mark sign by Colin_K, on Flickr
  • 12. War Tsunami Earthquake © 2011 UMD Libraries http://savemlak.jp/wiki/saveMLAK/en?lang=en&uselang=en http://www.flickr.com/photos/umd_libraries/6075914283/in/set-72157627383474133http://www.bostonglobe.com/business/2013/02/17/the-smuggled-hard-drives-timbuktu/rCyv0QL1FdLLkw4tjv6hDO/story.html
  • 13. Disasters with warning Moving servers out of the University of Iowa Libraries, 2008. © 2008 The University of Iowahttp://digital.lib.uiowa.edu/cdm/ref/collection/flood/id/3414
  • 14. Disasters with no warning University of South Florida, very localized flood http://lib.usf.edu/offtheshelf/tampa-library/the-flood-of-09dedication-in-the-face-of-disaster/
  • 15. “ ” Disaster recovery strategies and backup systems are not sufficient to ensure survival and access to authentic digital resources over time. A backup is a short-term data recovery solution following loss or corruption and is fundamentally different to an electronic preservation archive. JISC. Digital Preservation: Continued Access to Authentic Digital Assets (November 2006) Backups vs. preservation
  • 16. Exit strategy Make sure you can easily migrate all your content and metadata out of your system in a usable format.
  • 17. Test, test and test some more Test that all files are as expected regarding structure and completeness.
  • 18. Persistent identifiers Using persistent identifiers now will help if you move to a new repository in the future.
  • 19. Preserving the Web You may want archive institutional content that is not appropriate for an IR but which is appropriate for the library’s mission. http://dx.doi.org/10.7207/twr13-01
  • 20. Archive-It Archive-It can preserve journals and other scholarly work from your institution that doesn’t go into your repository. http://archive-it.org/collections/824
  • 21. Internet Archive “The Montana State Library (MSL) last year moved a copy of its collection of 3000 born digital state publications to the Internet Archive (IA).”—Chris Stockwell for Montana State Library, 12/29/2010 http://archive.org/post/340223/how-montana-state- library-uploaded-batches-of-digital-objects-to-the- internet-archive http://archive.org/details/MontanaStateLibrary
  • 22. IRs are a bit different… The copy of the document in the repository often is the only version you have.
  • 23. Access copy vs. preservation copy Digitized content may have a preservation scan as well as the version which displays to the public.
  • 24. IRs have special problems… Automatically adding a cover page to brand and identify content has change the file, perhaps even removing accessibility features.
  • 25. File formats When possible, use open file formats so content will remain accessible long into the future, but will you turn down content in other formats?
  • 26. PDF/A (ISO 19005-1:2005) PDF/A is an ISO standard “which provides a mechanism for representing electronic documents in a manner that preserves their visual appearance over time, independent of the tools and systems for creating or rending the files.” http://www.pdfa.org/publication/pdfa-in-a-nutshell-2-0/
  • 27. U Iowa electronic theses & dissertations 1931 PDFs and 7 XML documents Supplemented by: 21 .avi 1 .avp 8 .doc 2 .mov 2 .mp3 1 .mp4 4 .mpg 1 .mxf 3 .NTS 2 .pde 6 .pdf 4 .txt 3 .wmv 18 .xls 2 .zip
  • 28. Public preservation policy Make your preservation and submission policy clear so that contributors understand the risks of contributing a non- open format. http://services.ideals.illinois.edu/wiki/bin/view/IDEALS/PreservationSupportPolicy
  • 29. Preservation metadata PREMIS (PREservation Metadata Implementation Strategies) “Preservation metadata supports activities intended to ensure the long-term usability of a digital resource.”—Caplan, p.3 http://www.loc.gov/standards/premis/understanding-premis.pdf
  • 30. “ ” Metadata can help support authenticity by documenting the digital provenance of the resource — its chain of custody and authorized change history. Caplan, Priscilla. Understanding PREMIS. Library of Congress, ©2009. p.3 Digital provenance
  • 31. Methods of preserving data • Refreshing data • Migrating data • Emulating software platform • Replicating • Validating data integrity • Metadata
  • 32. Long-term preservation options • Global LOCKSS Network • Private LOCKSS Network • Portico
  • 33. Global LOCKSS Network • For e-journal content • Preserves the format as well as the content • Light archive • Adding journals to LOCKSS • Notify LOCKSS of metadata/file changes • Not all serials are appropriate for Global LOCKSS
  • 34. Private LOCKSS Network • All material from the IR • Need at least 7 nodes/destinations • Each should be a LOCKSS Alliance member • Set up policies and governance for the PLN
  • 35. Setting up policies for a PLN • How long is initial commitment? • How much notice to withdraw? • How do members remove data for withdrawn institution? • Does the group need a governing body or steering committee? • Will the PLN be a dark or light archive? • Do any of the members have embargoed materials?
  • 37. Portico • For e-books and e-journals • Source files converted to an archive format • Dark archive • Portico is responsible for future content migrations • Adding journals to Portico • Not all serials are appropriate for Portico
  • 38. Factors to consider in developing a formal preservation plan • Organizational & financial commitment • Stakeholders • Local backups vs. long- term preservation • Storage needs • Roles & responsibilities • Data ingestion • Policy on deletion of or embargoes for materials • Funding • Staff
  • 39. Organizational & financial commitment •What is the long-term financial commitment from your library or institution? •Do you have the support of the organization? From what level of administration?
  • 41. Local backups vs. long-term preservation •Definition of backups versus preservation •Metadata, content, software, or all of these? •How often and who is responsible? •PLN or other option for long-term preservation
  • 42. Storage needs Disk space  How much space do you need?  Who is responsible for maintaining disks? Software  Which software will be required?  Who migrates information as software needs change? Equipment  What equipment will you need?  Who will fund the equipment, set it up, maintain it?
  • 43. Roles & responsibilities •Who is implementing the plan? •Who is maintaining the data and how? •Who is providing support for accessing material and troubleshooting issues?
  • 44. Data ingestion •How are you getting data into the system for preservation or backup? •Will this be done in-house or outsourced to a third party? •How frequently and in what format?
  • 45. Funding vs. staffing • Is it easier to fund these efforts at your organization or staff them? • How well-staffed is your organization? • What kind of expertise do you have (or not have) in the library? • What level of commitment does your organization have to preserve digital information?
  • 46. Questions? Wendy Robertson Digital Scholarship Librarian University of Iowa Libraries wendy-robertson@uiowa.edu @wendycr_ Carol Ann Borchert Coordinator for Serials University of South Florida Libraries borchert@usf.edu
  • 47. Sources Ball, Alex. Preservation and Curation in Institutional Repositories. Digital Curation Centre, UKOLN, 2010. Version 1.3 http://www.dcc.ac.uk/sites/default/files/documents/reports/irpc-report- v1.3.pdf Caplan, Priscilla. Understanding PREMIS. Library of Congress, ©2009. http://www.loc.gov/standards/premis/understanding-premis.pdf Digital Repository Audit Method Based On Risk Assessment (DRAMBORA). Glasgow, 2009. http://www.dcc.ac.uk/resources/repository-audit-and- assessment/drambora JISC. Digital Preservation: Continued Access to Authentic Digital Assets (Nov. 2006) http://www.jisc.ac.uk/publications/briefingpapers/2006/pub_digipreservationb p.aspx
  • 48. Sources Nestor Working Group. Catalogue of Criteria for Trusted Digital Repositories. Frankfurt am Main, Dec. 2006. Urn: de:0008-2006060703 OpenDOAR Policies Tool. http://www.opendoar.org/tools/en/policies.php Oettler, Alexandra. PDF/A in a Nutshell 2.0: PDF for long-term archiving. Berlin: Association for Digital Document Standards e. V., ©2013. http://www.pdfa.org/wp- content/uploads/2013/04/PDFA_in_a_Nutshell_21.pdf Pennock, Maureen. Web-Archiving. DPC Technology Watch Report 12-01 March 2013. DOI: http://dx.doi.org/10.7207/twr13-01 Reference Model for an Open Archival Information System (OAIS). Recommended Practice CCSDS 650.0-M-2. Magenta Book, June 2012. http://public.ccsds.org/publications/archive/650x0m2.pdf
  • 49. Sources Trustworthy Repositories Audit & Certification: Criteria and Checklist (TRAC). Version 1.0. Feb 2007. http://www.crl.edu/archiving-preservation/digital- archives/metrics-assessing-and-certifying/trac University of Houston Libraries, Institutional Repository Task Force. Institutional Repositories. SPEC Kit 292. July 2006. http://publications.arl.org/Institutional-Repositories-SPEC-Kit-292/3 University of Illinois at Urbana-Champaign. “IDEALS Digital Preservation Support Policy.” ©2013 https://services.ideals.illinois.edu/wiki/bin/view/IDEALS/PreservationSuppor tPolicy University of Illinois at Urbana-Champaign. “Preparing Items for Deposit into IDEALS. File Format Recommendations” ©2013 https://services.ideals.illinois.edu/wiki/bin/view/IDEALS/SubmissionPrep#Fil e_Format_Recommendations

Editor's Notes

  1. Platforms: Digital Commons/bepress, OJS, CONTENTdm, DSpace, Fedora, Eprints, other?
  2. Bit rot can become a problem—how to handle?Refreshing—transferring data between two types of the same storage medium, particularly storage mediums that deteriorate like CD-ROMsMigration—transfer data to new system environment, convert from one file format or operating system to anotherEmulating—emulates obsolete software platform, imitates old operating systemReplication—duplicate copies of data in one or more storage locationsValidating data integrity—fixity checking; systematically checks data to make sure there’s been no bit rot and that data has not changed/deterioratedMetadata—information on content and creation of file, preservation history, etc.; technical metadata—identifies file characteristicsLOCKSS uses a combo of these methods: copies (replication) are checked against each other (validating data integrity) to make sure they still match and there’s been no data degradation
  3. --Notify LOCKSS that journal is available for preservation, and have IR company/platform work with LOCKSS to allow access to the content for preservation--Light archive; if site goes down, it will be available through LOCKSS quickly--If you make major changes with metadata or files, notify them before making the alterations--Not necessarily all serials appropriate for Global LOCKSS (newsletters, material that is quickly outdated or superseded)
  4. Access vs. preservation copy—sometimes a smaller version of the file is in the IR as an accessible version, with a larger version of the file kept elsewhere as a preservation copy. This came up in the DC PLN discussion.
  5. --Portico has an online form where you can recommend OA journals for them to include, or contact them directly for guidance--must have a trigger event to release content When a publisher ceases operations and titles are no longer available from any other sourceWhen a publisher ceases to publish and offer a title and it is not offered by another publisher or entityWhen back issues are removed from a publisher’s offering and are not available elsewhereUpon catastrophic failure by a publisher’s delivery platform for a sustained period of time
  6. We’ve laid some groundwork on the subject, and this slide could be a whole presentation on its own, but just to get you thinking, here are factors to consider
  7. Ties back to organizational and financial commitment; bookended this discussion with these two slides for a reason.