  1. Web Archiving Service (WAS)
     Rosalie Lack, rosalie.lack@ucop.edu
     Data Curation for Practitioners 2012 Workshop
  2. Imagine a world …
  3. This is our world …
  4. WAS … is a service of the UC Curation Center to collect, manage, preserve and publish websites and documents.
  5. WAS Snapshot
     • 53 public archives
     • 120+ archives total
     • 7,500+ sites
     • 50+ TB
     • 23 institutions
  6. WAS Institutions
     • Institute of Governmental Studies Library, UCB
     • UC Berkeley Office of Public Affairs
     • UC Davis Libraries
     • UC Irvine Libraries
     • UC Los Angeles Libraries
     • UC Riverside Libraries
     • UC San Diego Libraries
     • UC San Francisco Libraries
     • UC Santa Barbara
     • UC Santa Cruz McHenry Library
     • Emory University Library
     • Institute for Research on Labor and Employment
     • New York University
     • Northwestern University Library
     • Purdue University
     • Stanford University Libraries
     • Temple University
     • University of Arkansas Libraries
     • University of Illinois at Urbana-Champaign Libraries
     • University of Michigan, Bentley Historical Library
     • USDA Economic Research Service
     • Water Resources Collections and Archives
  7. WAS Overview: A) Curator Tools
  8. Curator Workflow
  9. Step 1. Create Site
     • Enter site name, URL and description
     • Scope
     • Capture frequency
     • Robots.txt
  10. Step 2. Capture Sites
  11. Step 3. View Captures
     • View captures
     • QA
     • Compare
  12. Step 4. Public Access
     • Customize the archive
     • Write description
     • Create custom banner and icon
  13. WAS Overview: B) Public Archives
  14. Web Archive ‘home page’
  15. Browse: Site List + Tags
  16. Search: All Sites in an Archive
  17. Integration with your Systems
  18. How are people using WAS?
  19. Institution’s website
     • Preserve institutional history
     • Capture university news and events
  20. Geographically focused
  21. Topical
     • Support special research collections
  22. Event
     • Sudden action required
     • May need many selectors
     • Start date / end date
  23. Researcher’s Perspective
     • Building collections for research
       – Study the topic / event
       – Study site change or web-based communication
       – Websites are datasets for analysis and data mining
     • Preservation of research
       – Archive grant-funded websites
       – Selected sites
     • Create stable citations for publications
  24. Get started!
     • Each library has WAS administrator(s)
     • Unlimited number of curators per account
     • What’s the cost?
       – UC does not pay a service fee
       – Storage only: $1,040 per TB (the average site costs $1.46 annually); storage costs are expected to go down
  25. Challenges
     • Shared collection development
     • Metadata issues
     • Workflow and cost models for faculty projects
     • Time!
     • Limitations of web crawlers
     • Websites are messy
  26. Contact me!
     Rosalie Lack, WAS Service Manager
     rosalie.lack@ucop.edu
  27. Imagine a world …
     “Imagine a world in which libraries and archives had never existed. No institutions had ever systematically collected or preserved our collective cultural past: every book, letter, or document was created, read and then immediately thrown away. What would we know about our past?”
  28. This is our world …
     “Yet, that is precisely what is happening with the web: more and more of our daily lives occur within the digital world, yet more than two decades after the birth of the modern web, the ‘libraries’ and ‘archives’ of this world are still just being formed.”
     Source: A Vision Of The Role And Future Of Web Archives, Kalev H. Leetaru, Graduate School of Library and Information Science, University of Illinois. Presented as the keynote address at the 2012 IIPC General Assembly in Washington, DC. http://netpreserve.org/sites/default/files/resources/VisionRoles.pdf
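
The storage pricing on the "Get started!" slide can be sanity-checked with a short calculation. The sketch below is not part of the original deck. It assumes that the $1,040 per TB figure is an annual rate, that cost scales linearly with size, and that 1 TB = 1000 GB; the slides state none of these explicitly. The function names and the 50 GB example collection are hypothetical.

```python
# Figures quoted on the "Get started!" slide.
PRICE_PER_TB_USD = 1040.00   # storage price per terabyte (assumed to be per year)
AVG_SITE_COST_USD = 1.46     # average cost per site, annually

# Assumption: decimal terabytes. The slides do not say which convention applies.
GB_PER_TB = 1000


def implied_site_size_gb(site_cost_usd, price_per_tb_usd=PRICE_PER_TB_USD):
    """Size in GB that would cost `site_cost_usd` under linear pricing."""
    return site_cost_usd / price_per_tb_usd * GB_PER_TB


def storage_cost_usd(size_gb, price_per_tb_usd=PRICE_PER_TB_USD):
    """Cost in USD of storing `size_gb` gigabytes under linear pricing."""
    return size_gb / GB_PER_TB * price_per_tb_usd


if __name__ == "__main__":
    avg_size = implied_site_size_gb(AVG_SITE_COST_USD)
    print(f"Implied average site size: {avg_size:.2f} GB")   # 1.40 GB

    # Hypothetical collection of 50 GB, for illustration only.
    print(f"Cost of a 50 GB collection: ${storage_cost_usd(50):.2f}")   # $52.00
```

Under these assumptions the two quoted figures imply an average archived site of roughly 1.4 GB, which gives curators a rough way to estimate storage cost from an expected collection size.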