4. WAS … is
The Web Archiving Service (WAS): a service of the UC Curation Center to
collect, manage, preserve, and publish websites and documents.
5. WAS Snapshot
53 public archives
120+ archives total
7,500+ sites
50+ TB
23 institutions
6. WAS Institutions
• Institute of Governmental Studies Library, UCB
• UC Berkeley Office of Public Affairs
• UC Davis Libraries
• UC Irvine Libraries
• UC Los Angeles Libraries
• UC Riverside Libraries
• UC San Diego Libraries
• UC San Francisco Libraries
• UC Santa Barbara
• UC Santa Cruz McHenry Library
• Emory University Library
• Institute for Research on Labor and Employment
• New York University
• Northwestern University Library
• Purdue University
• Stanford University Libraries
• Temple University
• University of Arkansas Libraries
• University of Illinois at Urbana-Champaign Libraries
• University of Michigan, Bentley Historical Library
• USDA Economic Research Service
• Water Resources Collections and Archives
22. Event
• Sudden action required
• May need many selectors
• Start date / end date
23. Researcher’s Perspective
• Building collections for research
– Study the topic / event
– Study site change or web-based communication
– Websites are datasets for analysis and data mining
• Preservation of research
– Archive grant-funded websites
– Selected sites
• Create stable citations for publications
24. Get started!
• Each library has WAS administrator(s)
• Unlimited number of curators per account
• What’s the cost?
– UC does not pay a service fee
– Storage only: $1,040 per TB (the average site costs
$1.46 annually); storage costs are expected to go down
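A rough sketch of the storage arithmetic behind these figures. It assumes the $1,040 per TB price is an annual rate and that 1 TB = 1,000 GB; neither is stated on the slide. The function names and the 250 GB example archive are hypothetical, chosen only for illustration.

```python
# Figures from the slide. Treating the per-TB price as an annual rate
# and using decimal units (1 TB = 1,000 GB) are assumptions.
PRICE_PER_TB = 1040.00   # USD per TB of storage
AVG_SITE_COST = 1.46     # USD per site per year
GB_PER_TB = 1000


def storage_cost(size_gb: float, price_per_tb: float = PRICE_PER_TB) -> float:
    """Return the storage cost in USD for an archive of size_gb gigabytes."""
    return size_gb / GB_PER_TB * price_per_tb


def implied_size_gb(cost: float, price_per_tb: float = PRICE_PER_TB) -> float:
    """Return the archive size in GB that a given storage cost corresponds to."""
    return cost / price_per_tb * GB_PER_TB


if __name__ == "__main__":
    # Size implied by the quoted average cost per site.
    print(f"Implied average site size: {implied_size_gb(AVG_SITE_COST):.2f} GB")
    # Cost of a hypothetical 250 GB archive at the quoted rate.
    print(f"Cost of a 250 GB archive: ${storage_cost(250):.2f}")
```

Under these assumptions, the quoted average of $1.46 per site corresponds to roughly 1.4 GB of stored content per site.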
25. Challenges
• Shared collection development
• Metadata issues
• Workflow and cost models for faculty projects
• Time!
• Limitations of web crawlers
• Websites are messy
26. Contact me!
Rosalie Lack
WAS Service Manager
rosalie.lack@ucop.edu
27. Imagine a world …
“Imagine a world in which libraries and archives had
never existed. No institutions had ever systematically
collected or preserved our collective cultural past: every
book, letter, or document was created, read and then
immediately thrown away. What would we know about
our past?”
28. This is our world …
“Yet, that is precisely what is happening with the web:
more and more of our daily lives occur within the digital
world, yet more than two decades after the birth of the
modern web, the “libraries” and “archives” of this world
are still just being formed.”
A Vision Of The Role And Future Of Web Archives
Kalev H. Leetaru, Graduate School of Library and Information Science,
University of Illinois. Presented as the keynote address at the 2012 IIPC
General Assembly in Washington, DC.
http://netpreserve.org/sites/default/files/resources/VisionRoles.pdf