Cost of digitisation and preservation (24-10-07)


Published on

LIBER Conference 24-10-07

  • Be the first to comment

Cost of digitisation and preservation (24-10-07)

  1. 1. LIBER Digitisation Conference, Copenhagen The cost of digitisation and preservation: The LIFE Project 24-26 October 2007 Richard Davies LIFE 2 Project Manager, The British Library
  2. 2. Overview <ul><li>What is the LIFE Project? </li></ul><ul><li>LIFE 1 and LIFE 2 </li></ul><ul><li>LIFE Models </li></ul><ul><li>Burney Case Study </li></ul><ul><li>Benefits </li></ul><ul><li>Further Information </li></ul>
  3. 3. Lifecycle Information for E-literature <ul><li>Project phases: </li></ul><ul><li>LIFE 1 (12 months) </li></ul><ul><li>LIFE 2 (18 months) </li></ul>
  4. 4. LIFE starts to answer the question: <ul><li>What is the long term cost of preserving digital material? </li></ul>£
  5. 5. Why use lifecycle costing? <ul><ul><li>Enables evaluation of all the financial commitments for an item in a collection </li></ul></ul><ul><ul><li>Important for digital collections, where many costs are largely unknown </li></ul></ul>
  6. 6. Aims <ul><ul><li>Better understanding of the digital lifecycle </li></ul></ul><ul><ul><li>Plan and prepare for digital preservation activities </li></ul></ul><ul><ul><li>Evaluate and improve efforts </li></ul></ul><ul><ul><li>Compare analogue and digital </li></ul></ul>
  7. 7. LIFE 1 project <ul><ul><li>Literature Review </li></ul></ul><ul><ul><li>Economic Lifecycle Model </li></ul></ul><ul><ul><li>Generic Preservation Model </li></ul></ul><ul><ul><li>Case Studies </li></ul></ul><ul><ul><li>International Conference </li></ul></ul>
  8. 8. LIFE 1 Case Studies e-Journals Web Archiving Voluntary Deposit
  9. 9. <ul><li>LIFE 1 </li></ul>LIFE 2
  10. 10. Aim of LIFE 2 <ul><ul><li>To evaluate, refine and </li></ul></ul><ul><ul><li>further develop the techniques </li></ul></ul><ul><ul><li>developed in phase one of LIFE </li></ul></ul>
  11. 11. LIFE 2 deliverables <ul><li>Economic Evaluation of LIFE 1 </li></ul><ul><li>Revision of the LIFE Model </li></ul><ul><ul><ul><ul><li>Version 1.1 (October 2007) </li></ul></ul></ul></ul><ul><ul><ul><ul><li>Version 2 (Summer 2008) </li></ul></ul></ul></ul><ul><li>Updated Preservation Model (Summer 2008) </li></ul><ul><li>Final report </li></ul><ul><li>End of project conference </li></ul>
  12. 12. The LIFE Model v1.1 Inspection Re-ingest Backup Reference Linking User Support Preservation Action Refreshment Metadata Extraction Holdings Update Access Control Preservation Planning Storage Provision Metadata Creation Deposit Access Provision Preservation Watch Repository Admin Re-use Existing Metadata Quality Assurance Lifecycle Elements Access Content Preservation Bit-stream Preservation Metadata Creation Ingest Lifecycle Stage Check-in Obtaining Ordering & Invoicing IPR & Licensing Submission Agreement Selection Acquisition .... .... .... .... Creation or Purchase Access Content Preservation Bit-stream Preservation Metadata Creation Ingest Acquisition Creation or Purchase
  13. 13. LIFE Model v1.1: Non-lifecycle Elements Discounting Administration Inflation Repository Software Management Non-Lifecycle Elements Economic Adjustments Systems / Infrastructure Management and Administration Non-Lifecycle Stage
  14. 14. Generic LIFE Preservation Model <ul><li>The GPM predicted large cost and much activity - the challenge is reducing both. </li></ul><ul><li>Preservation Actions: </li></ul><ul><li>Preservation Tool Cost </li></ul><ul><li>Preservation Metadata </li></ul><ul><li>Performing preservation action </li></ul><ul><li>Quality Assurance </li></ul>
  15. 15. Generic LIFE Preservation Model Frequency of action Tech Watch Preservation action Preservation cost of n objects of a particular format for the period 0 to t. Preservation = + * e.g. 200000 objects of the GIF format for a period of 10 years. <ul><li>Monitoring formats and software for obsolescence </li></ul><ul><li>Preservation planning </li></ul><ul><li>Updating metadata </li></ul><ul><li>The number of preservation actions within the time period calculated </li></ul>Q/A Update object and event metadata Perform preservation action Cost of Preservation tool
  16. 16. Complexity of file formats Q/A Update metadata Perform preservation action Cost of Preservation tool <ul><li>Size </li></ul><ul><li>Complexity </li></ul><ul><li>Proprietary </li></ul><ul><li>Open </li></ul><ul><li>Standardised </li></ul>Frequency of action Tech Watch Preservation action Preservation = + * = Format Complexity Oracle database dump 1 Complex Word, PDF 0.8 Document MPEG3, WAV 0.6 Multimedia EMF, Draw 0.4 Vector XML, HTML 0.3 Mark-up JPEG, GIF 0.2 Bitmap ASCII, Unicode 0.1 Simple Examples Complexity Category
  17. 17. LIFE 2 Case Studies Institutional Repositories Primary Data Digitised Newspapers 01101101010101011001110100110110101010101100111010011011010101010110011101001101101010101011001110100110110101010101100111010110
  18. 18. The Burney Collection <ul><ul><li>Purchased by the British Library in 1818 for £13,500 </li></ul></ul><ul><ul><li>1,100 volumes of the earliest known newspapers </li></ul></ul><ul><ul><li>1,000,000 pages from 17 th , 18 th and 19 th Centuries. </li></ul></ul><ul><ul><li>Re-scanning or re-microfilming is not possible. </li></ul></ul><ul><ul><li>Microfilmed in the 1970s </li></ul></ul><ul><ul><li>Digitisation started in 1995-96 and ran until 2004. </li></ul></ul>
  19. 19. Questions that arise from Burney <ul><ul><li>Comparing digital and analogue lifecycles </li></ul></ul><ul><ul><li>What is the lifecycle cost to an institution of producing digitised surrogates? </li></ul></ul><ul><ul><li>What are the key preservation issues common across digitisation projects of differing scales? </li></ul></ul>
  20. 20. Benefits of LIFE <ul><li>Assess the financial commitment for acquiring or creating new digital materials </li></ul><ul><li>More effective planning for preservation activities </li></ul><ul><li>Comparison of digital lifecycles across collections </li></ul><ul><li>Evaluation and optimisation of existing digital lifecycles </li></ul><ul><li>Predictive future cost of digital preservation </li></ul>
  21. 21. LIFE Website & Blog <ul><li>Website </li></ul><ul><li>LIFE Blog /blog </li></ul>
  22. 22. Thank you. e [email_address] t +44 (0) 20 7412 7182 w x 01101101010101011001110100110110101010101100111010011011010101010110011101001101101010101011001110100110110101010101100111010110
  23. 23. Acknowledgements: <ul><ul><li>LIFE Team (Paul Ayris, Rory McLeod, Helen Shenton & Paul Wheatley) </li></ul></ul><ul><ul><li>Special thanks to Ulla Bøgvad Kejser </li></ul></ul><ul><ul><li>Comments & questions… </li></ul></ul><ul><ul><li>[email_address] </li></ul></ul><ul><ul><li> </li></ul></ul>