Digital Preservation  The Saga Continues
SCAPEhttp://www.dpconline.org/
SCAPE‘Digital Preservation: what Iwish I knew before I started’ • It won’t do itself • It won’t go away • Don’t wait for p...
SCAPEDigital preservation makes bleak reading …
SCAPELet’s restate the problem …•Digital stuff has value. It is an asset.•It has potential and creates new opportunities.•...
SCAPEKey responses1. MigrationChanging the format of a file to ensurethe information content can be read2. EmulationInterv...
SCAPEAccess and long term usedepends on theconfiguration of hardwareand software and thecapacity of the operator.Change is...
SCAPETechnology continues tochange creating theconditions for obsolescence.Need to become a learninginstitution
SCAPEStorage media have a short lifeand storage devices are subjectto obsolescence.Be mobile and format neutral
SCAPEDigital preservation systemsare subject to the sameobsolescence as the objectsthey safeguard.Standards and modularity
SCAPEDigital resources are intolerantof gaps in preservation.Ongoing process
SCAPEThe problems are moresubtle than we realised adecade ago…e.g. file formatobsolescenceChanging file formats?Conformant...
SCAPEHow to pick a winner ...beyond and potentially over-writing the criteria ...repository managers should align the reco...
SCAPEHow to pick a winner...                       You ain’t seen nothing yet                       Data growth on 3 axes ...
SCAPEDigital Preservation as a ‘discipline’                                  Daunting challenge                           ...
SCAPEHow much does it cost?                    Lifecycle costs of digital objects                    vs                   ...
SCAPEGe= ng$      started…$                  www.dpconline.org-
SCAPEThe reality?You don’t need to understandor do all of this.... and it doesn’t all have to exist at the same time
SCAPEThe reality?Get started nownot later
SCAPE        Preservation LifecycleIdentification            DROID, FIDO, FILE, FITS, TIKA…       Characterisation        ...
SCAPE             This TrainingIdentification           DROID            FITS           FIDO             TIKA           FI...
So you have dug a hole?
Stage 2• What did you find?• Is it worth preserving?• What are the problems?
Aim of Training
SCAPE            Time to get married1.   Luis Bravo               1.   Jose Carvalho (SDUM)2.   Jose Casanova            2...
SCAPE              Getting Started (1)•   Wifi Network = SMS, password = Sarmento1881127•   Download Virtualbox (if you do...
SCAPE          Getting Started (2)• Once done, click the machine and press the  settings button (maybe in right click)• Cl...
SCAPE         Getting Started (3)• Password is training.• Ignore update manager if it appears• Press the top left ubuntu h...
SCAPE                Bundle or Not?• Pros                    • Cons  – Single Input/Output     – Out of date  – Consistent...
SCAPE             Questions (1)• What tool would you use?
SCAPE           Training  Keeping Control - ScalableEnvironments for Identification     and Characterisation
SCAPE                     AimsThis training course will cover elements dealing  with scalable identification, characterisa...
SCAPE        Learning Outcomes (1)• Distinguish between different file types and  identify the requirements for characteri...
SCAPE        Learning Outcomes (2)• Critically evaluate characterisation and  identification tools and assess their  advan...
Learning Outcomes (3)• Conduct an in-depth analysis of large volumes  of identification and characterisation data and  fin...
SCAPE         Learning Outcomes (4)• Compare and contrast the differences in  running characterisation and identification ...
SCAPE     Our Last CommitmentSlides will be available Monday!
SCAPEThank You     Franz San Galli
SCAPEThank-You
SCAPEThank You
SCAPE           Next Time…Building Applications Infrastructures         for Action Services     London, September 2013    ...
SCAPE               Then….Critical Path: Effective Evidence Based         Preservation Planning     Denmark, November 2013...
SCAPE                    Tonight   www.goo.gl/q6wKB7:15pm Eleven Bar@ Hotel Fundador   Free Beer*       Our Table         ...
Upcoming SlideShare
Loading in …5
×

Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012

502 views
456 views

Published on

This presentation is an introduction to Digital Preservation given by David Tarrant, Open Planets Foundation, at the first SCAPE Training event, ‘Keeping Control: Scalable Preservation Environments for Identification and Characterisation’, in Guimarães, Portugal on 6-7 December 2012.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
502
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
3
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Digital Preservation - The Saga Continues - SCAPE Training event, Guimarães 2012

  1. 1. Digital Preservation The Saga Continues
  2. 2. SCAPEhttp://www.dpconline.org/
  3. 3. SCAPE‘Digital Preservation: what Iwish I knew before I started’ • It won’t do itself • It won’t go away • Don’t wait for perfection
  4. 4. SCAPEDigital preservation makes bleak reading …
  5. 5. SCAPELet’s restate the problem …•Digital stuff has value. It is an asset.•It has potential and creates new opportunities.•Use gives rise to direct and indirect outcomes....but...•Deployment depends on software, hardware and people.•Software, hardware and people change....therefore...•Access is not guaranteed without (some) action•Value, opportunity, impact not guaranteed
  6. 6. SCAPEKey responses1. MigrationChanging the format of a file to ensurethe information content can be read2. EmulationIntervening in the operating system toensure that old software can functionand information content can be read3. Hardware preservationMaintaining access to data and processesby maintaining the physical computingenvironment including hardware andperipherals.4. etcResearch and development field, newsolutions and new approaches continue toemerge, eg virtualisation for preservation
  7. 7. SCAPEAccess and long term usedepends on theconfiguration of hardwareand software and thecapacity of the operator.Change is not a bug.
  8. 8. SCAPETechnology continues tochange creating theconditions for obsolescence.Need to become a learninginstitution
  9. 9. SCAPEStorage media have a short lifeand storage devices are subjectto obsolescence.Be mobile and format neutral
  10. 10. SCAPEDigital preservation systemsare subject to the sameobsolescence as the objectsthey safeguard.Standards and modularity
  11. 11. SCAPEDigital resources are intolerantof gaps in preservation.Ongoing process
  12. 12. SCAPEThe problems are moresubtle than we realised adecade ago…e.g. file formatobsolescenceChanging file formats?Conformant containers?Units of information?
  13. 13. SCAPEHow to pick a winner ...beyond and potentially over-writing the criteria ...repository managers should align the recognition andweighting of criteria with a clear preservation strategythat articulates the purpose of the repository and theneeds of its designated community;Todd, M 2009 ‘File formats for preservation’, DPC Technology Watch Report02/09, online at http://www.dpconline.org/advice/technology-watch-reports.html
  14. 14. SCAPEHow to pick a winner... You ain’t seen nothing yet Data growth on 3 axes •volume •complexity •expectation ... it’s not going to be about obsolescence so much as workflow and capacity
  15. 15. SCAPEDigital Preservation as a ‘discipline’ Daunting challenge Decade of research and development Replete with jargon and acronyms Turf war between professions? A whole new barrier Courtesy NASA/JPL-CaltechThe last decade has shown definitively that usingfancy words are not the same as solving problems
  16. 16. SCAPEHow much does it cost? Lifecycle costs of digital objects vs Lifecycle costs of books vs Lifecycle costs of museum objects vs Lifecycle costs of archives vs Lifecycles costs of historic environment
  17. 17. SCAPEGe= ng$ started…$ www.dpconline.org-
  18. 18. SCAPEThe reality?You don’t need to understandor do all of this.... and it doesn’t all have to exist at the same time
  19. 19. SCAPEThe reality?Get started nownot later
  20. 20. SCAPE Preservation LifecycleIdentification DROID, FIDO, FILE, FITS, TIKA… Characterisation JHOVE, JPYLYZER, exiftool, FITS… Risk Assessment Knowledge + Policy + Risk = Continue Planning Plato Action Migration, Emulation
  21. 21. SCAPE This TrainingIdentification DROID FITS FIDO TIKA FILE Characterisation JHOVE, Exiftool JPYLYZER FITS
  22. 22. So you have dug a hole?
  23. 23. Stage 2• What did you find?• Is it worth preserving?• What are the problems?
  24. 24. Aim of Training
  25. 25. SCAPE Time to get married1. Luis Bravo 1. Jose Carvalho (SDUM)2. Jose Casanova 2. Carlos Duarte3. Vitor Fernandes 3. Luis Ferreira4. Sebastien Leroux 4. Cristiana Freitas5. Joao Pereira 5. Claire Johnson6. Rui Rodrigues 6. Anthony Laerdahl7. Carlos Velentim 7. Helena Medeiros8. Jose Carvalho (Papiro) 8. Antonio Rodrigues9. Omar Coelho 9. Cidalia Ferreira Column 1 Column 2
  26. 26. SCAPE Getting Started (1)• Wifi Network = SMS, password = Sarmento1881127• Download Virtualbox (if you don’t have it)• Start Virtualbox• Plug-in USB memory key• Open the memory key folder and double click the extension pack file to install it (follow instructions at this point)• Return to virtual box:• From the main menu (file), select “Import Appliance”• Browse to the memory key and select the only file• Wait for this to import• Once done you can safely remove the key.
  27. 27. SCAPE Getting Started (2)• Once done, click the machine and press the settings button (maybe in right click)• Click shared folders• Click add• Add a shared folder (e.g. your desktop or downloads folder)• Tick auto-mount!• Click OK to return to the main screen• Start the machine• Wait..
  28. 28. SCAPE Getting Started (3)• Password is training.• Ignore update manager if it appears• Press the top left ubuntu home button and type terminal (select and run the app)• Type: cd /media/sf_Desktop (where Desktop is the folder you shared previously) and press enter• Type: fido *
  29. 29. SCAPE Bundle or Not?• Pros • Cons – Single Input/Output – Out of date – Consistent – Doesn’t Scale – Easy
  30. 30. SCAPE Questions (1)• What tool would you use?
  31. 31. SCAPE Training Keeping Control - ScalableEnvironments for Identification and Characterisation
  32. 32. SCAPE AimsThis training course will cover elements dealing with scalable identification, characterisationand validation of large collections of varying file types. Users will be introduced to a number of tools designed for each of these purposes and involved in problem solving scenarios. Further, users will be required to evaluate the use of scalable and cloud based technologies in developing solutions for given scenarios.
  33. 33. SCAPE Learning Outcomes (1)• Distinguish between different file types and identify the requirements for characterising each.• Carry out a number of identification, characterisation, and duplication detection experiments on example files.
  34. 34. SCAPE Learning Outcomes (2)• Critically evaluate characterisation and identification tools and assess their advantages and disadvantages when used in different scenarios.
  35. 35. Learning Outcomes (3)• Conduct an in-depth analysis of large volumes of identification and characterisation data and find representative sample records suitable for preservation planning experiments.
  36. 36. SCAPE Learning Outcomes (4)• Compare and contrast the differences in running characterisation and identification tools both stand-alone and within workflows.• Envisage a system that combines workflows with identification, characterisation and validation tools to suit a variety of scenarios.
  37. 37. SCAPE Our Last CommitmentSlides will be available Monday!
  38. 38. SCAPEThank You Franz San Galli
  39. 39. SCAPEThank-You
  40. 40. SCAPEThank You
  41. 41. SCAPE Next Time…Building Applications Infrastructures for Action Services London, September 2013 (wet)
  42. 42. SCAPE Then….Critical Path: Effective Evidence Based Preservation Planning Denmark, November 2013 (cold)
  43. 43. SCAPE Tonight www.goo.gl/q6wKB7:15pm Eleven Bar@ Hotel Fundador Free Beer* Our Table * 1 Free Beer subject to completion of online survey!

×