Archiving and Preserving Born Digital Government Documents - Presentation Transcript
Gone Today, Here Tomorrow: Archiving and Preserving Born Digital Government Documents Molly Bragg, Partner Specialist Internet Archive [email_address] Federal Depository Library Conference Arlington, Virginia October 20, 2008
Internet Archive
Founded in 1996 by Brewster Kahle
Largest public web archive in existence
Designated as a library by the state of California in 2007
Digitized collections of books, audio, moving images
www.archive.org
Partner Needs for Web Capture
Libraries and Archives need web capture beyond general web archive
Partners need to create focused collections
Harvest at specific frequencies
Reporting Features
Hosting, Access and full text search
Archiving Big and Small
Domain crawls for the most comprehensive collections, ex .fr, .au
Curated crawls for large collections, Iraq war, Election Collections
Archive-It service, for smaller sized collections (automated harvesting)
Archiving the U.S. Federal Government
Library of Congress
Congressional Harvests (107 th – 110 th )
NARA
End of Presidental term (2004)
Congressional Election Harvest (2006, 2008)
End of Term 2008 harvest
Collaborative project (LoC, CDL, UNT, GPO)
www.loc.gov/ minerva/
www.webharvest.gov
Archive-It
Subscription service for smaller collection needs
Includes collection management, harvesting, full text search, hosting and access
Collections publicly available at www.archive-it.org
Over 65 partners (State Archive/Libraries, Universities, Federal institutions, Museums, Public Libraries)
Archiving with Archive-It
Publications in born digital formats only
Web archiving allows archivist to capture more than just the publications
At risk content needs to be preserved before it is lost
Supplement paper collections
Builds relationships between archives/libraries and government agencies
Federal Institutions and Archive-It
National Institutes of Health: capture select NIH websites and records
Department of Energy, Office of Scientific and Technical Information: archiving the E-Print Network, a web-based library of published papers, research groups, and electronic documents.
Department of Labor: create an archive of their web presence.
US State Government: North Carolina
State Library / State Archive partnership
1 main collection for all state agencies
Websites for the collection are selected using specific appraisal guidelines
Provide special access portal for the web archives from their own site to brand and market the collection
0 comments
Post a comment