Archival Technologies


Published on

Archives work is messy -- in many cases archivists have to organize and make accessible large amounts of mixed data in a variety of formats, both physical and digital. Thankfully, there are a variety of technology tools available to help solve the messiness problem and make collections more accessible. In this session, audience members will learn about current and emerging archival technology tools, the pros and cons of the major tools, and resources for further education.

Published in: Education, Technology

Archival Technologies

  1. 1. Archival TechnologiesCliff LandisWeb Services LibrarianGeorgia State University46th Annual Georgia Archives InstituteJune 10 – 21, 2013.
  2. 2. "Good grief, what have we gottenourselves into..."
  3. 3. Learning Objectives● Identify existing and emerging areas ofarchival technology development.● Learn about the capabilities, pros, and consof major archival management tools, such asArchon and Archivists Toolkit.● Learn about the capabilities, pros, and consof major digital collection management tools,such as CONTENTdm and Islandora.● Discover resources for further professionaldevelopment in archival technology areassuch as software, hardware, and standards.
  4. 4. IntroductionsEGO TIME!● Library (and Archival)Technologist● Author of A SocialNetworking Primer forLibrarians (2010)● Professional Geek● I work as a translatorbetween several librarydialects including:Student, Techie,Librarian, Archivist andAdministrator!
  5. 5. Why does this stuff matter?
  6. 6. Two questions1) What one thing do you hope to learntoday?2) What one thing do you hope to do witharchival technology?
  7. 7. The problem of "hidden collections"
  8. 8. The problem of "hidden collections"
  9. 9. The problem of "hidden collections"
  10. 10.
  11. 11. Its all about using the right tool forthe job...
  12. 12. Preliminary Considerations● Free vs. paid● Open source vs. closed source● Local server vs. cloud hosted● Few features vs. many features (vs. somefeatures)● Web-based vs. client-based● Ease of setup, ease of use● Degree of technical support● Standards compliance
  13. 13. Archival managementsoftwareaka, "Whats all this old stuff, where did weput it, and what can we do with it?"
  14. 14. Archon
  15. 15. Archon
  16. 16. Archon● Developed by the University of Illinois atUrbana-Champaign (2006-2011).● Free, Open-Source Software (FOSS), locallyhosted, many features, limited exports.● Has both a back-end (for managing records)and a front-end (for access).● Full life-cycle management. Lacks somefeatures (some metadata exports,deaccessioning, etc.).
  17. 17. Archivists Toolkit
  18. 18. Archivists Toolkit
  19. 19. Archivists Toolkit
  20. 20. Archivists Toolkit● Developed with a Mellon Foundation grant andcontinued by Five Colleges, Inc., New York UniversityLibraries, and the UC San Diego Libraries (2006-2009).● Free, Open-Source Software (FOSS), locally hosted,many features, exports in many standards/formats.● Server and client software● Has a back-end (for managing records). No webpublishing available.● Full life-cycle management. Lacks some features(backup/restore, publishing finding aids, etc.)
  21. 21. Up next: ArchivesSpace
  22. 22. Up next: ArchivesSpace● Funded by a Mellon Foundation grant, created by NewYork University, the University of California San Diego,and the University of Illinois at Urbana-Champaign.Hmmmm...those names look familiar...● The best of both worlds?● Version 1.0 in "late July" 2013● Membership option, free option● "Organizational home" at LYRASIS●●
  23. 23. ICA-AtoM;isad
  24. 24. ICA-AtoM● ICA-AtoM is web-based archival description softwarethat is based on International Council on Archives(ICA) standards. AtoM is an acronym for Access toMemory (2008-2013).● Developed by Artefactual Systems in collaboration withthe ICA Program Commission (PCOM) and a growingnetwork of international partners.● Free, Open-Source Software (FOSS). Web-based, sorequires server or virtual appliance setup.
  25. 25. Others● Adlib Archive● Calm for Archives● Cuadra STAR / Archives● Eloquent Archives● MINISIS M2A● Collective Access● PastPerfect...and many more
  26. 26. Digital collections/assetmanagement softwareaka, "Isnt all that old stuff online by now?"
  27. 27. Fedora
  28. 28. Fedora● NOT the Linux operating system....● aka: Fedora Repository / Fedora Commons● Developed by Cornell University and the University ofVirginia Library, currently supported by DuraSpace● FOSS, server-side.● Flexible architecture, allowing you to customize it (addon components) to meet local needs. Requires morework.● Ingest, management, and basic delivery -- not a full-fledged system for managing digital assets.
  29. 29. Islandora
  30. 30. Islandora● Fedora (asset management), Drupal (websitefunctionality) and Solr (search). Additional "SolutionPacks" of software to manage particular data types(books, PDFs, large images, etc.).● Developed by Prince Edward Island University.● FOSS, server-side. Has to be assembled byprogrammers / systems folks. Requires a LOT of workand maintenance at this point. Not a "download anddouble-click" software.
  31. 31. CONTENTdm
  32. 32. CONTENTdm
  33. 33. CONTENTdm (and a lot of work...)
  34. 34. CONTENTdm● Closed source, OCLC, and paid (expensive!).● A full system for managing digital collections. Can behosted by OCLC or run on your own servers (hostedversion limits customization).● Server-side software, web interface and project clientsoftware. Lots of moving pieces to get to work togetherwith limited documentation and slow technical supportresponse time.
  35. 35. Greenstone
  36. 36. Greenstone
  37. 37. Greenstone● Developed by New Zealand Digital Library Project at theUniversity of Waikato, with support from UNESCO.● FOSS, server-side.● Multi-lingual and multi-national.● Unsure how active the development community is, as Ihavent seen much work on it since 2012.
  38. 38. Dspace
  39. 39. Dspace
  40. 40. Dspace● Developed by the MIT Libraries and Hewlett-PckardLabs● FOSS, server-side. Hosted option available(DSpaceDirect)● Manakin add-on for improved user interface● Not easy to set up or customize, but effective
  41. 41. Others● Tripod2 (Duke University, in-house)● Keystone (Index Data)● EPrints (University of Southampton)● and many more...
  42. 42. Display and Accessaka, "All this old stuff can tell a story..."
  43. 43. Omeka
  44. 44. Omeka
  45. 45. Omeka● Web publishing of narratives around digital collections.● Center for History and New Media (CHNM) at GeorgeMason University● FOSS, server-side. Hosted versions also available.● Designed to be relatively easy to use for non-technicalfolks.● Has plugins available for additional functionality (OAI-PMH, CSV import, Dublin Core, etc.)
  46. 46. Others● Collective Access● Virtual Exhibit (for Past Perfect)● Internet Archive● Picasa/Flickr● Blogs/Websites● and many more...
  47. 47. OTHER STUFF!Cliff says "hang on to your brain..."
  48. 48. Digital Preservation Hardware● Media readers (drives, connections)○ Floppy Discs○ Zip○ Jaz○ CD / DVD / BluRay / Laserdisc○ Cartridges○ Microcards● Write-blockers / Forensic Bridges○ Tableau○ Weibe TechSee: Webinar: “Intro to Digital Preservation #3 — “Management of Incoming Born-Digital Special Collections”
  49. 49. Digital Preservation Software● FITS & JHOVE: used to identify file formatsand extract metadata● IdentityFinder: searches for PersonallyIdentifiable Information (PII)● PREMIS: manage metadata of digital objects● Bagit: file transfers● BitCurator & Archivematica: accessioningthrough accessSee: Intro to Digital Preservation websinar series
  50. 50. Formats & Protocols & Standards!● XML: eXtensible Markup Language● DTD: Document Type Definition (aka "Schema")● EAD: Encoded Archival Description● OAI-PMH: The Open Archives Initiative Protocol forMetadata Harvesting● OAI-ORE: The Open Archives Initiative Protocol forObject Reuse and Exchange● RSS: Really Simple Syndication● DC: Dublin Core (also DCMI)● RDF: Resource Description Framework● SQL: Structured Query Language● MODS: Metadata Object Description Schema● METS: Metadata Encoding and Transmission Standard
  51. 51. Linked Open Data (2010-09)
  52. 52. Linked Open Data
  53. 53. Linked Jazz
  54. 54.
  55. 55. Semantic WebBUT WHAT DOES IT ALL MEAN?!?!!● Microformats: a way of adding human- and machine-readable metadata into existing HTML webpages.○ COinS: ContextObjects in Spans. Allows users toembed machine-readable bibliographic metadata inHTML webpages.● RDFa Lite: Resource Description Framework inattributes - another way of adding human- and machine-readable metadata into existing HTML pages.
  56. 56. Why does this stuff matter?
  57. 57. Resources:● Spiro, Lisa (2009). Archival Management Software: AReport for the Council on Library and InformationResources. and● Bean, Carol (2010). Comparing Digital Library Systems(BeanWorks).● Association of Southeastern Research Libraries.Archived Webinars / Materials.● Digital Preservation - Tools Showcase.● W3C Schools.
  58. 58. Not that it has to be said, but...Disclaimer!All images and excerpts included are beingused under the auspices of Fair Use for thepurposes of nonprofit education, criticism, andcomment as outlined in 17 U.S.C. § 107.
  59. 59. Questions?Cliff LandisWeb Services LibrarianGeorgia State University46th Annual Georgia Archives InstituteJune 10 – 21, 2013.