Archiving The Deepwater Horizon Oil Spill

238 views

Published on

Seneca, Tracy. Archiving The Deepwater Horizon Oil Spill. International Internet Preservation Consortium. The Hague,Netherlands. May 2011.

Published in: Technology, Business
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
238
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Gave subject experts access to WAS: # site nominations: 0 Gave subject experts access to external site nomination tool: # nominations: 6 Pulled librarian-nominated sites from Delicious: 400+
  • Archiving The Deepwater Horizon Oil Spill

    1. 1. Archiving the Deepwater Horizon Oil Spill http://was.cdlib.org Tracy Seneca California Digital Library
    2. 2. Archive Scope 527 sites 10402 captures May 5 to present tapering to less frequent captures of key sites, about 200 captures per month 76 million + documents 2 TB
    3. 4. Archive Selection & Context <ul><li>Planned archives </li></ul><ul><li>Advance subject expertise </li></ul><ul><li>Time for evaluation </li></ul><ul><li>Time for QA </li></ul><ul><li>Focus on comprehensive capture </li></ul><ul><li>Traditional collection development </li></ul><ul><li>Control over scale </li></ul><ul><li>Event archives </li></ul><ul><li>Act quickly </li></ul><ul><li>No one is the expert </li></ul><ul><li>Collaboration required </li></ul><ul><li>Every efficiency matters </li></ul><ul><li>Frequent shallow captures / rapidly changing sites </li></ul><ul><li>Massive scale </li></ul>http://was.cdlib.org
    4. 5. 3 Challenges <ul><li>Site selection </li></ul><ul><li>Site / capture management </li></ul><ul><li>Quality assurance </li></ul>
    5. 6. Getting Volunteers <ul><li>Tried bringing volunteers into service </li></ul><ul><ul><li>“Add to WAS” browser button </li></ul></ul><ul><li>Tried external nomination tool </li></ul><ul><li>TAP INTO WHAT USERS ARE ALREADY DOING </li></ul>http://was.cdlib.org
    6. 7. LSU tags relevant sites in Delicious CDL imports Delicious JSON feed into WAS ~ 50% delicious ~ 45% 1 curator ~5% everything else http://was.cdlib.org
    7. 8. Site Management - From: Fixed table Not enough control Few batch actions
    8. 9. To
    9. 10. To (2)
    10. 13. Collection Observations <ul><li>Of ~350 sites from the Hurricane Katrina archive, only about 120 were initially relevant to the oil spill </li></ul><ul><ul><li>Different responding organizations </li></ul></ul><ul><li>The relevant sites </li></ul><ul><ul><li>Political offices / government agencies in the region </li></ul></ul><ul><ul><li>News sources in the region </li></ul></ul><ul><ul><li>Environmental organizations </li></ul></ul>
    11. 15. Reminders Use the tools you build At larger scale than your users Take advantage of existing workflows Collection building drives innovation
    12. 16. Next Steps <ul><li>Web Archiving Service </li></ul><ul><ul><li>http://was.cdlib.org </li></ul></ul><ul><ul><li>www.facebook.com/webarchiving </li></ul></ul>Release public archive Review with Louisiana State University librarians

    ×