EPrints Preservation: Why we need Preservation Planning, by Steve Hitchcock. EPrints User Group, OR10, Madrid, 9 July 2010
A first take on digital preservation: DIGITAL PRESERVATION is NOT so DIFFICULT if you WANT to DO IT. You will want to do digital preservation if you have: a lot of digital content collected over years; a specified responsibility and resources for that content; an understanding of how that content is used now, how it will be needed in future, and how the type of content you collect may change going forward.
Another take on digital preservation: Digital preservation is identifying what’s required of your repository content tomorrow, and being ready to serve that requirement - or the day after, or the month after. In other words, you can extend this to whatever timescale matters. You can work out everything else from these parameters. Conversely: what will your content profile look like over the timescale that matters to you? Will it change substantially?
Digital repositories diversifying: institution-wide outputs. KeepIt exemplar preservation repositories: Research, Arts, Science, Teaching.
Slideshare: http://www.slideshare.net/SteveHitchcock/presentations
Source materials (ECS EPrints): http://bit.ly/afof8g
Module 1, Organisational issues, audit, selection and appraisal. School of ECS, University of Southampton, 19 January 2010
Module 2, Institutional and lifecycle preservation costs. School of ECS, University of Southampton, 5 February 2010
Module 3, Primer on preservation workflow, formats and characterisation. Westminster-Kingsway College, London, 2 March 2010
Module 4, Putting storage, format management and preservation planning in the repository. University of Southampton, 18-19 March 2010
Module 5, Trust, of the repository, of the tools and services it chooses. University of Northampton, 30 March 2010
Work with, not against, your authors and contributors: “Preservation begins with the author”. U. Rochester (USA) has written its own repository software, IR+, to give its authors a Web-based authoring workspace, but watch out for the new JISC project DepositMO, which connects the user's computer desktop, especially popular apps such as MS Office, with digital repositories. Which applications are widely used and popular among your authors? Digital content authoring tools are typically chosen on the basis of purpose, utility and familiarity (what is provided and supported by Information Systems?). Rarely are they chosen for format or preservation. Authors will craft their output in the chosen application, but will often throw away that craft if asked to convert to another format.
Preservation workflow
Stages: Check, Analyse, Action
Check: File validation; Virus check; Bit checking and checksum calculation. Tools e.g. DROID, JHOVE, FITS
Analyse: Preservation planning. Characterisation: significant properties and technical characteristics, provenance, format, risk factors. Risk analysis. Tools: Plato (Planets), PRONOM (TNA), P2 risk registry (KeepIt), INFORM (U Illinois), KB
Action: Emulation; Storage selection
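The bit checking and checksum calculation step of the preservation workflow can be illustrated with a minimal sketch. This is not how EPrints or any of the named tools implement it; the function names (file_checksum, build_manifest, changed_files) and the choice of SHA-256 are assumptions made for illustration, using only the Python standard library.

```python
import hashlib
from pathlib import Path


def file_checksum(path, algorithm="sha256", chunk_size=65536):
    """Return the hex digest of a file, read in chunks to bound memory use."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root):
    """Map each file under root (by relative path) to its checksum."""
    root = Path(root)
    return {
        str(p.relative_to(root)): file_checksum(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def changed_files(root, manifest):
    """List manifest entries that are now missing or no longer match."""
    current = build_manifest(root)
    return sorted(
        name for name, digest in manifest.items()
        if current.get(name) != digest
    )
```

A repository might store the manifest at deposit time and rerun changed_files on a schedule; any name it returns points to a file that has been altered, corrupted or lost since the manifest was taken.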
Accepted repository formats: recent survey
What file formats do you accept? Do you convert any to a different format?
ALL: Accept any format.
Two: Convert everything to PDF, but store the source files in the background for preservation reasons.
Four: Mention specifically converting Word to PDF: one seeks permission from the author to do this, and uploads as Word if permission is not granted.
One: Mentions converting ZIP files to PDF.
Sue Ashby, University of Portsmouth Library, Summary of responses to IR questionnaire, JISC-REPOSITORIES, 18 February 2010
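A repository that accepts any format may want a rough picture of what it actually holds. The sketch below tallies files by extension; it is an illustrative assumption, not a method from the survey or the talk, and the function names (format_profile, share_by_format) are hypothetical. Extension counting is only a first approximation: signature-based identification tools such as DROID are more reliable.

```python
from collections import Counter
from pathlib import Path


def format_profile(filenames):
    """Count files per lowercase extension; files without one go under ''."""
    return Counter(Path(name).suffix.lower() for name in filenames)


def share_by_format(profile):
    """Convert counts into fractions of the whole collection."""
    total = sum(profile.values())
    if total == 0:
        return {}
    return {ext: count / total for ext, count in profile.items()}
```

Comparing profiles taken at different dates gives one simple way to see whether the mix of deposited formats is changing.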
Some thoughts about formats: free vs open source vs open standard
Open Office – free – XML – open standard

EPrints Preservation: Why we need Preservation Planning

  • 1. EPrints Preservation: Why we need Preservation Planning, by Steve Hitchcock. EPrints User Group, OR10, Madrid, 9 July 2010
  • 2. A first take on digital preservation: DIGITAL PRESERVATION is NOT so DIFFICULT if you WANT to DO IT. You will want to do digital preservation if you have:
• a lot of digital content collected over years
• a specified responsibility and resources for that content
• an understanding of how that content is used now, how it will be needed in future, and how the type of content you collect may change going forward
  • 3. Another take on digital preservation Digital preservation is identifying what’s required of your repository content tomorrow, and being ready to serve that requirement - or the day after, or the month after. In other words, you can extend this to whatever timescale matters. You can work out everything else from these parameters. Conversely: what will your content profile look like over the timescale that matters to you? Will it change substantially?
  • 4. Digital repositories diversifying: institution-wide outputs. KeepIt exemplar preservation repositories: Research, Arts, Science, Teaching
  • 5. Slideshare: http://www.slideshare.net/SteveHitchcock/presentations Source materials (ECS EPrints): http://bit.ly/afof8g
• Module 1, Organisational issues, audit, selection and appraisal. School of ECS, University of Southampton, 19 January 2010
• Module 2, Institutional and lifecycle preservation costs. School of ECS, University of Southampton, 5 February 2010
• Module 3, Primer on preservation workflow, formats and characterisation. Westminster-Kingsway College, London, 2 March 2010
• Module 4, Putting storage, format management and preservation planning in the repository. University of Southampton, 18-19 March 2010
• Module 5, Trust, of the repository, of the tools and services it chooses. University of Northampton, 30 March 2010
  • 6. Work with, not against, your authors and contributors. “Preservation begins with the author.” U. Rochester (USA) has written its own repository software IR+ to give its authors a Web-based authoring workspace, but watch out for new JISC project DepositMO, connecting the user's computer desktop, especially popular apps such as MS Office, with digital repositories. Which applications are widely used and popular among your authors? Digital content authoring tools are typically chosen on the basis of purpose, utility, familiarity (what is provided, supported by Information Systems?). Rarely are they chosen for format or preservation. Authors will craft their output in the chosen application, but will often throw away that craft if asked to convert to another format.
  • 7.
  • 10.
  • 13. Accepted repository formats: recent survey. What file formats do you accept? Do you convert any to a different format?
• ALL: Accept any format.
• Two: Convert everything to PDF, but store the source files in the background for preservation reasons.
• Four: Mention specifically converting Word to PDF: one seeks permission from the author to do this, and uploads as Word if permission is not granted.
• One: Mentions converting ZIP files to PDF.
Source: Sue Ashby, University of Portsmouth Library, Summary of responses to IR questionnaire, JISC-REPOSITORIES, 18 February 2010
  • 14.
  • 15. Open Office – free – XML - open standard
  • 17.
  • 18. A group task on format risks
• Choose two formats to compare (e.g. Word vs PDF, Word vs ODF, PDF vs XML, TIFF vs JPEG)
• By working through the (surviving) list of format risks, select a winner (or a draw) between your chosen formats for each risk category (1 point for a win)
• Total the scores to find an overall winning format
• Suggest one reason why the winning format using this method may not be the one you would choose for your repository
  • 19. Format risk results (from group thinking)
• PDF 4, Word 1
• TIFF 3, JPEG 1
• XML 6, PDF 1
  • 20. Alternative thoughts on ‘winning’ formats We were then asked to consider why we might choose NOT to use the format that performed better for these criteria:
• PDF/Word – Why not PDF? PDF is essentially a conversion format, not a source authoring format.
• TIFF/JPEG – Why not TIFF? JPEG is compressed, would take up less space in storage. This factor may be crucial. Archival quality copy or a derivative?
• XML/PDF – Why not XML? Many repository resources are deposited in PDF. Do people understand what they need to do with XML?
  • 21. TIFF vs JPEG 2000? Who’s for JPEG? The major players line up:
• The National Library of the Netherlands evaluated JPEG 2000 against uncompressed TIFF (currently used) for storage capacity, image quality, long-term sustainability and functionality. JPEG 2000 is recommended as future archive format.
• The British Library recently moved forward to migrate their 80-terabyte newspaper collection from TIFF to JPEG 2000.
• The Wellcome Library announced they will use JPEG 2000 for their upcoming digitization projects.
Source: Preservation Planning at the Bavarian State Library Using a Collection of Digitized 16th Century Printings, D-Lib Magazine, Vol. 15, No. 11/12, Nov/Dec 2009, http://www.dlib.org/dlib/november09/kulovits/11kulovits.html
  • 22. TIFF vs JPEG 2000? What does Plato say? “At this point in time not migrating the TIFF v6 images is the best alternative.” “However, in one year we'll look at this plan again to see if there are more tools available and whether or not the ones we considered in this year's evaluation have been improved.” Preservation Planning at the Bavarian State Library Using a Collection of Digitized 16th Century Printings, D-Lib Magazine, Vol15 No. 11/12, Nov/Dec 2009, http://www.dlib.org/dlib/november09/kulovits/11kulovits.html
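
The scoring method in the group task on format risks (slide 18) is simple enough to sketch in a few lines of Python. This is a minimal illustration, not something used in the KeepIt course: the function names and the category labels ("risk A" to "risk E") are placeholders, since the list of format risks itself is not reproduced in this transcript. The example judgements are chosen only so that the tally matches the PDF 4, Word 1 result reported on slide 19.

```python
def score_formats(format_a, format_b, judgements):
    """Tally one point per risk category won; a draw (None) scores nothing."""
    scores = {format_a: 0, format_b: 0}
    for category, winner in judgements.items():
        if winner is None:  # the group called this category a draw
            continue
        if winner not in scores:
            raise ValueError(f"{winner!r} is not one of the compared formats")
        scores[winner] += 1
    return scores


def overall_winner(scores):
    """Return the format with the higher total, or None if the totals are equal."""
    (fmt_a, total_a), (fmt_b, total_b) = scores.items()
    if total_a == total_b:
        return None
    return fmt_a if total_a > total_b else fmt_b


# Placeholder category labels; the winners are illustrative only.
judgements = {
    "risk A": "PDF",
    "risk B": "PDF",
    "risk C": "Word",
    "risk D": "PDF",
    "risk E": "PDF",
}

scores = score_formats("PDF", "Word", judgements)
print(scores, overall_winner(scores))
```

A tally like this only records the group's judgements. As slide 20 points out, the winning format may still not be the one you would choose for your repository.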

Editor's Notes

  1. Digital preservation is an important topic, which can be perceived as technical and scary, and although it appears to attract interest and concern in equal measure, it is practiced somewhat less outside the specialist national and commercial institutions. This is because it often begins from a position of little focus, particularly in terms of realistic timescales, and there are problems when it comes to allocating resources in terms of cost, time and effort. To help understand why this might be, and to gauge at what point digital content and repository managers might expect a natural transition from interest/concern to practice, we produced this rough rule-of-thumb metric. If one or more of these criteria apply, then the application of digital preservation is likely to become magically less onerous and more beneficial for your content.
  2. This is another way of saying the same thing as the previous slide about digital preservation, but it’s important to note the converse point. We tend to think of what digital preservation can do for the content already collected, but it matters just as much to anticipate what content you will collect in addition over the specified timescale, and how this will affect your content profile. If your content profile is likely to change, then your preservation measures are likely to have to change as well.
  3. What do we mean by content profile? In the KeepIt project we have four exemplar repositories. Miggie Pickton’s presentation in the main Open Repositories 2010 conference introduced these repositories and their progress with preservation - see this blogged report of the presentation (http://blogs.ecs.soton.ac.uk/keepit/2010/07/14/exemplars-reveal-seven-steps-to-preservation-readiness/). The exemplars each provide different types of content: research, science data, teaching, arts. At a simple level, that could be the content profile of an institutional repository in the future. You may be able to do a similar analysis for your repository. Going back a few years we envisaged the emergence of preservation services and service providers for digital repositories. Essentially what we have now is a range of preservation tools rather than service provider organizations. So once you have done the analysis of the previous three slides, preservation resolves to the application of selected tools – so preservation is no more or less than any other repository application, whether CRIS, REF or other tools you use with your repository.
  4. In the KeepIt project we ran a practical course to introduce repository managers, from our exemplars and others, to selected preservation tools. The materials are all online, in Slideshare for a quick overview of the presentations, or in our repository for all the original source materials and practical exercises. And there are blogs for context, comment and subsequent practice (http://blogs.ecs.soton.ac.uk/keepit/tag/keepit-course/). Broadly, the training course covered: Understanding your institution: what the institution and its people can do for your repository e.g. in terms of providing content; and the context for what you can do for the institution, in terms of e.g. policy framework. Then you need to establish the resources available to you, in terms of the budgets you can acquire and the costs you have to cover. Finally – not least, but I mean finally – you will want to demonstrate trust, that is your way of showing that you have taken account of all these issues, and the risks all this might pose, and are serving the requirements of your institution. There are tools for all these things. The bit in the middle – modules 3 and 4 – is what we are considering today. What we might call technical preservation - understanding and managing the digital bits.
  5. At the heart of technical preservation are file formats. Authors and content creators don’t choose formats, they choose applications based on functionality and what it allows them to create. Machines don’t see functionality, they see bits. Instead they try to represent functionality according to the whole computing environment that is being used at a given moment. Since you can’t control what tools authors use and what systems may be used to access content, you have to get your hands dirty and understand how the machine sees things, in particular how it sees file formats.
  6. Here is our preservation workflow showing three elements introduced in the preceding practical session of the workshop. We know we can classify the formats of digital objects using tools like DROID, and we’ve suggested that some formats might be high risk - without saying much about which ones and why – and that in such cases we have tools, for example, to migrate such formats to another format. The hard part is the bit in the middle that connects the identification with the action, i.e. what to act on, whether to act and, if so, when.
  7. First, let’s try and relate this to your repositories, especially those that focus on open access collections of research papers. Here are the results from a survey posted on JISC-REPOSITORIES earlier this year. We can see the familiar emphasis on the PDF format.
  8. There may be reasons for choosing PDF, and most will probably resolve to the debate about open source and open standard. A few years ago this was a simple case, notably against Microsoft formats. Two things happened: nothing much changed in the high usage share of MS Office applications, despite competition from free and open-standard tools such as Open Office; and MS standardized and opened its format specifications. The case is not so simple now.
  9. There are more factors than ‘open’ to consider when assessing the risks associated with different file formats, as we can see from this list produced by the (UK) National Archives.
  10. During our KeepIt course module 3 we set up a short group exercise to select a pair of formats and think about applying the format risk factors identified on the previous slide.
  11. We had three groups, and here are the results for the formats they each chose to compare. We removed two factors that were slightly more technical and might have required some familiarity with documentation. Groups were asked to think about the remaining factors based on no more information than you have here. It’s meant to be that intuitive. How surprising are the results? Perhaps the first result is no surprise given published repository preferences, but the quick reverse for PDF against XML should give pause for thought. The image formats are relevant to our preservation planning exercises at this workshop; the TIFF vs JPEG result might be surprising in the context of information that follows in a slide or two. The table reproduced here comes from this report on KeepIt course module 3: http://blogs.ecs.soton.ac.uk/keepit/2010/03/31/digital-preservation-tools-for-repository-managers-primer-on-preservation-workflow-formats-and-characterisation/
  12. In addition, we asked groups to provide a reason why the result of their format considerations might not stand up in all situations. You see, these are not definitive results.
  13. Before we get back to the main workshop let’s consider the image format comparison. In the preceding exercise you imported some images in the GIF format, and next in the workshop we will consider possible migration to other formats. Here we consider TIFF vs JPEG. Remember the group result favoured TIFF over JPEG. Now let’s look at what some large archival organizations are doing with image formats. Basing the scores on the factors given, were the group right or wrong?
  14. The group result was not necessarily wrong. Here’s another view performed by an expert archival group using Plato. It’s about using tools to provide information and expertise, but ultimately it’s your - the content manager’s - judgment and your decision. Try the exercise. It’s not as daunting as it seems. That’s where Plato comes in. We aren’t going to learn how to use Plato in this workshop, merely how to import a preservation plan from Plato, which is designed to act on selected formats, in this case on the GIF image format. Essentially Plato allows you to apply the expert information but control the parameters which lead to the final outcome – that’s a decision on what to do with formats that are identified as at-risk, and taking the consequent action. At this point, having considered the role of file formats and the essentials of preservation workflow and preservation planning, we are ready to rejoin the practical workshop.
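
The three-part workflow described in note 6 (identify the format, decide whether it is at risk, then act) can be illustrated with a small Python sketch. Everything here is an assumption made for illustration: the `PLAN` table, the `identify` and `decide` functions, and the action names are hypothetical, and nothing is taken from DROID or Plato. In particular, `identify` only reads the file extension, which is a crude stand-in for the signature-based identification a tool such as DROID performs.

```python
from pathlib import PurePath

# Hypothetical preservation plan: format label -> action.
# A real plan would come from a planning tool, not a hand-written table.
PLAN = {"gif": "migrate"}


def identify(filename):
    """Crude stand-in for format identification: uses the file extension only."""
    return PurePath(filename).suffix.lower().lstrip(".")


def decide(filename, plan):
    """Connect identification to action: look the format up in the plan.

    Formats the plan does not mention are left alone ("keep").
    """
    fmt = identify(filename)
    return fmt, plan.get(fmt, "keep")


for name in ["scan.GIF", "paper.pdf"]:
    print(name, decide(name, PLAN))
```

The point of the sketch is the middle step: the plan, not the identification tool, carries the judgment about what to act on and whether to act at all.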