# Digital preservation by Chris Smart

Published in: Education, Technology

1. Digital Preservation, Chris Smart
2. Why bother?
4. Computers are dumb.
5. Computers are dumb. Really.
6. They are fast.
7. Everything is ones and zeros. (010101100101100101010111001...)
8. Every little one or zero is called a “bit”.
   - 0 = a bit
   - 1 = a bit
9. A group of 8 bits is called a “byte”.
   - 01010110 = a byte
10. This system is called “binary” (base-2). We count in denary (decimal, base-10).
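
To make the base-2 versus base-10 point concrete, here is a minimal Python sketch that reads a string of bits the way a person would read a decimal number, one position at a time. The function name `bits_to_int` is made up for this illustration; Python's built-in `int(bits, 2)` does the same job.

```python
def bits_to_int(bits: str) -> int:
    """Convert a string of '0' and '1' characters to an integer.

    Each step doubles the running total (shifting it one binary place
    to the left) and adds the next bit, just as reading a decimal
    number multiplies by ten and adds the next digit.
    """
    value = 0
    for bit in bits:
        if bit not in "01":
            raise ValueError(f"not a binary digit: {bit!r}")
        value = value * 2 + int(bit)
    return value


# The example byte from the slides: 64 + 16 + 4 + 2
byte = "01010110"
decimal_value = bits_to_int(byte)
print(f"{byte} (base-2) = {decimal_value} (base-10)")
```
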
11. Here's the problem: bits mean nothing on their own.
12. 01100001 = ?
13. 01100001 = 97
14. 01100001 = “a”
15. 01100001 = anything
16. How do you know what it means?
17. Without a specification, there is no way to know what the binary data means.
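
The "01100001 = anything" point can be sketched in a few lines of Python. The interpretations below are illustrative choices, not an exhaustive list, and the `interpret` helper is hypothetical: the same eight bits come out as a number, a letter, a different character under another text encoding, a grey level, or a set of on/off switches, depending only on which specification you apply.

```python
def interpret(byte_value: int) -> dict:
    """Read one byte under several different 'specifications'.

    The ASCII reading only works for values below 128, which is
    enough for this example.
    """
    raw = bytes([byte_value])
    return {
        "unsigned integer": byte_value,
        "ASCII text": raw.decode("ascii"),
        # EBCDIC is an older IBM text encoding; the same byte is not "a" there.
        "EBCDIC text (cp037)": raw.decode("cp037"),
        "grey level (0.0 to 1.0)": round(byte_value / 255, 3),
        # Most significant bit first, matching how the byte is written.
        "eight on/off flags": [bool(byte_value >> i & 1) for i in range(7, -1, -1)],
    }


result = interpret(0b01100001)
for spec, meaning in result.items():
    print(f"{spec:>24}: {meaning!r}")
```

Nothing in the byte itself says which reading is the right one; that knowledge lives outside the data.
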
18. File formats are specifications.
19. 01100001 = “a”
20. http://www.flickr.com/photos/janoma/4472147302/
21. http://www.flickr.com/photos/janoma/4472147302/
22. Choose your file formats carefully.
23. So, why bother? Because digital records are easily lost. Forever.
24. Microsoft Office 2003 Service Pack 3 disabled dozens of file formats. http://support.microsoft.com/kb/938810
25. Relying on a single vendor for file format support is bad, mmm'k.
26. Using a free and open format avoids this problem.
27. Migration
    - The approach NAA takes.
28. Digital Preservation Tools
    - Xena
    - Detects file format
    - Migrates to open format
    - Encodes binary in base64
    - Wraps in XML metadata
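
The steps listed for Xena can be illustrated with a rough Python sketch. This is not Xena's code or its output format: the element names, the small signature table, and the `detect_format` and `wrap` helpers are all assumptions made for illustration, and the migration step (converting to an open format) is left out because it depends entirely on the formats involved.

```python
import base64
import xml.etree.ElementTree as ET

# A tiny, illustrative table of file signatures ("magic numbers").
# Real format identification uses far larger registries.
MAGIC_NUMBERS = {
    b"\x89PNG\r\n\x1a\n": "image/png",
    b"%PDF-": "application/pdf",
    b"GIF87a": "image/gif",
    b"GIF89a": "image/gif",
}


def detect_format(data: bytes) -> str:
    """Guess a format from the leading bytes, falling back to 'unknown binary'."""
    for magic, mime in MAGIC_NUMBERS.items():
        if data.startswith(magic):
            return mime
    return "application/octet-stream"


def wrap(data: bytes, source_name: str) -> str:
    """Base64-encode the bytes and wrap them in XML with basic metadata.

    All element and attribute names here are hypothetical.
    """
    root = ET.Element("preserved-object")
    meta = ET.SubElement(root, "metadata")
    ET.SubElement(meta, "source-name").text = source_name
    ET.SubElement(meta, "detected-format").text = detect_format(data)
    ET.SubElement(meta, "size-bytes").text = str(len(data))
    content = ET.SubElement(root, "content", encoding="base64")
    content.text = base64.b64encode(data).decode("ascii")
    return ET.tostring(root, encoding="unicode")


sample = b"%PDF-1.4 example payload"
wrapped = wrap(sample, "report.pdf")
print(wrapped)

# Round trip: the original bytes can be recovered from the XML text.
recovered = base64.b64decode(ET.fromstring(wrapped).find("content").text)
```

Base64 turns arbitrary bytes into plain text, so the result is a single text file that records what the content was believed to be alongside the content itself.
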
30. That's only half the story.
31. Management Tools
    - Manifest Maker
    - DPR (Digital Preservation Recorder)
    - Checksum Checker
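
A checksum check of the kind the last tool's name suggests can be sketched as follows. This is a generic fixity check in Python, not the Checksum Checker tool itself; the choice of SHA-256, the file name, and the `file_checksum` and `verify` helpers are assumptions for the example.

```python
import hashlib
import tempfile
from pathlib import Path


def file_checksum(path: Path, algorithm: str = "sha256") -> str:
    """Hash a file in chunks so large files never need to fit in memory."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: Path, recorded: str) -> bool:
    """Return True when the file still matches the checksum recorded earlier."""
    return file_checksum(path) == recorded


with tempfile.TemporaryDirectory() as tmp:
    record = Path(tmp) / "record.bin"
    record.write_bytes(b"01100001")
    recorded = file_checksum(record)       # stored when the file is accepted

    intact = verify(record, recorded)      # later check, nothing has changed

    record.write_bytes(b"01100011")        # simulate one corrupted character
    damaged = verify(record, recorded)     # later check, after the change

print("unchanged file passes:", intact)
print("altered file passes:", damaged)
```

Recording the checksum when a file is accepted and recomputing it on a schedule is what lets an archive notice silent corruption before the last good copy is gone.
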
35. Digital Preservation Software Platform (DPSP)
    - Free & open source software (GPLv3)
    - Single installer (get set up in 10 min)
    - Includes digital preservation software
    - Includes all third party software
    - Runs on a laptop
36. Resources
    - We want to collaborate.
37. Digital Archive
    - Runs on open source software
    - Vendor agnostic hardware
    - Refreshed every few years