Digital Preservation Chris Smart
Why bother?
Why bother? <ul><li>Paper – already human readable </li></ul><ul><li>Digital – not human readable </li></ul>
<ul><li>Computers are dumb. </li></ul>
<ul><li>Computers are dumb. </li></ul><ul><li>Really. </li></ul>
<ul><li>They are fast. </li></ul>
<ul><li>Everything is ones and zeros. </li></ul><ul><li>(010101100101100101010111001...) </li></ul>
<ul><li>Every little one or zero is called a “bit” </li></ul><ul><li>0 = a bit </li></ul><ul><li>1 = a bit </li></ul>
<ul><li>A group of 8 bits is called a “byte” </li></ul><ul><li>01010110 = a byte </li></ul>
<ul><li>This system is called “binary” (base-2) </li></ul><ul><li>We count in denary (decimal, base-10) </li></ul>
<ul><li>Here's the problem: </li></ul><ul><li>Bits mean  nothing  on their own. </li></ul>
<ul><li>01100001 </li></ul><ul><li>= ? </li></ul>
<ul><li>01100001 </li></ul><ul><li>= 97 </li></ul>
<ul><li>01100001 </li></ul><ul><li>= “a” </li></ul>
<ul><li>01100001 </li></ul><ul><li>=  anything </li></ul>
<ul><li>How do you know  what  it means? </li></ul>
<ul><li>Without a specification, there is no way to know what the binary data means. </li></ul>
<ul><li>File formats are specifications. </li></ul>
<ul><li>01100001 </li></ul><ul><li>= “a” </li></ul>
http://www.flickr.com/photos/janoma/4472147302/
http://www.flickr.com/photos/janoma/4472147302/
<ul><li>Choose your file formats carefully. </li></ul>
So, why bother? <ul><li>Because digital records are easily lost. </li></ul><ul><li>Forever. </li></ul>
<ul><li>Microsoft Office 2003 Service Pack 3 disabled dozens of file formats. </li></ul><ul><li>http://support.microsoft.c...
<ul><li>Relying on a single vendor for file format support is bad, mmm'k. </li></ul>
<ul><li>Using a free and open format </li></ul><ul><li>avoids this problem. </li></ul>
Migration <ul><li>The approach NAA takes. </li></ul>
Digital Preservation Tools <ul><li>Xena </li></ul><ul><li>Detects file format </li></ul><ul><li>Migrates to open format </...
 
<ul><li>That's only half the story. </li></ul>
Management Tools <ul><li>Manifest Maker </li></ul><ul><li>DPR (Digital Preservation Recorder) </li></ul><ul><li>Checksum C...
 
 
 
Digital Preservation Software Platform (DPSP) <ul><li>Free & open source software (GPLv3) </li></ul><ul><li>Single install...
Resources <ul><li>We want to collaborate. </li></ul>
Digital Archive <ul><li>Runs on open source software </li></ul><ul><li>Vendor agnostic hardware </li></ul><ul><li>Refreshe...
 
Questions? <ul><li>ODF made with Linux and LibreOffice (OpenOffice.org). </li></ul><ul><li>Licensed under Creative Commons...
Resources <ul><li>http:// dpsp.sourceforge.net </li></ul><ul><li>[email_address] </li></ul><ul><li>[email_address] </li></...
 
Upcoming SlideShare
Loading in …5
×

Digital preservation by Chris Smart

1,320 views

Published on

Published in: Education, Technology
  • Be the first to comment

  • Be the first to like this

Digital preservation by Chris Smart

  1. 1. Digital Preservation Chris Smart
  2. 2. Why bother?
  3. 3. Why bother? <ul><li>Paper – already human readable </li></ul><ul><li>Digital – not human readable </li></ul>
  4. 4. <ul><li>Computers are dumb. </li></ul>
  5. 5. <ul><li>Computers are dumb. </li></ul><ul><li>Really. </li></ul>
  6. 6. <ul><li>They are fast. </li></ul>
  7. 7. <ul><li>Everything is ones and zeros. </li></ul><ul><li>(010101100101100101010111001...) </li></ul>
  8. 8. <ul><li>Every little one or zero is called a “bit” </li></ul><ul><li>0 = a bit </li></ul><ul><li>1 = a bit </li></ul>
  9. 9. <ul><li>A group of 8 bits is called a “byte” </li></ul><ul><li>01010110 = a byte </li></ul>
  10. 10. <ul><li>This system is called “binary” (base-2) </li></ul><ul><li>We count in denary (decimal, base-10) </li></ul>
  11. 11. <ul><li>Here's the problem: </li></ul><ul><li>Bits mean nothing on their own. </li></ul>
  12. 12. <ul><li>01100001 </li></ul><ul><li>= ? </li></ul>
  13. 13. <ul><li>01100001 </li></ul><ul><li>= 97 </li></ul>
  14. 14. <ul><li>01100001 </li></ul><ul><li>= “a” </li></ul>
  15. 15. <ul><li>01100001 </li></ul><ul><li>= anything </li></ul>
  16. 16. <ul><li>How do you know what it means? </li></ul>
  17. 17. <ul><li>Without a specification, there is no way to know what the binary data means. </li></ul>
  18. 18. <ul><li>File formats are specifications. </li></ul>
  19. 19. <ul><li>01100001 </li></ul><ul><li>= “a” </li></ul>
  20. 20. http://www.flickr.com/photos/janoma/4472147302/
  21. 21. http://www.flickr.com/photos/janoma/4472147302/
  22. 22. <ul><li>Choose your file formats carefully. </li></ul>
  23. 23. So, why bother? <ul><li>Because digital records are easily lost. </li></ul><ul><li>Forever. </li></ul>
  24. 24. <ul><li>Microsoft Office 2003 Service Pack 3 disabled dozens of file formats. </li></ul><ul><li>http://support.microsoft.com/kb/938810 </li></ul>
  25. 25. <ul><li>Relying on a single vendor for file format support is bad, mmm'k. </li></ul>
  26. 26. <ul><li>Using a free and open format </li></ul><ul><li>avoids this problem. </li></ul>
  27. 27. Migration <ul><li>The approach NAA takes. </li></ul>
  28. 28. Digital Preservation Tools <ul><li>Xena </li></ul><ul><li>Detects file format </li></ul><ul><li>Migrates to open format </li></ul><ul><li>Encodes binary in base64 </li></ul><ul><li>Wraps in XML metadata </li></ul>
  29. 30. <ul><li>That's only half the story. </li></ul>
  30. 31. Management Tools <ul><li>Manifest Maker </li></ul><ul><li>DPR (Digital Preservation Recorder) </li></ul><ul><li>Checksum Checker </li></ul>
  31. 35. Digital Preservation Software Platform (DPSP) <ul><li>Free & open source software (GPLv3) </li></ul><ul><li>Single installer (get set up in 10 min) </li></ul><ul><li>Includes digital preservation software </li></ul><ul><li>Includes all third party software </li></ul><ul><li>Runs on a laptop </li></ul>
  32. 36. Resources <ul><li>We want to collaborate. </li></ul>
  33. 37. Digital Archive <ul><li>Runs on open source software </li></ul><ul><li>Vendor agnostic hardware </li></ul><ul><li>Refreshed every few years </li></ul>
  34. 39. Questions? <ul><li>ODF made with Linux and LibreOffice (OpenOffice.org). </li></ul><ul><li>Licensed under Creative Commons Attribution 3.0 Australia License. </li></ul>
  35. 40. Resources <ul><li>http:// dpsp.sourceforge.net </li></ul><ul><li>[email_address] </li></ul><ul><li>[email_address] </li></ul><ul><li>[email_address] </li></ul>

×