Archiving the Deepwater Horizon Oil Spill
Presentation for Spring 2011 International Internet Preservation Consortium

  • Gave subject experts access to WAS: # site nominations: 0
    Gave subject experts access to external site nomination tool: # nominations: 6
    Pulled librarian-nominated sites from Delicious: 400+

Archiving the Deepwater Horizon Oil Spill Presentation Transcript

  • 1. Archiving the Deepwater Horizon Oil Spill
    Tracy Seneca
    California Digital Library
    http://was.cdlib.org
  • 2. Archive Scope
    527 sites
    10402 captures
    May 5 to present
    tapering to less frequent captures of key sites, about 200 captures per month
    76+ million documents
    2 TB
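The scope figures on this slide can be turned into rough per-site and per-document averages. The sketch below is only back-of-the-envelope arithmetic on the slide's numbers: it treats the document count as exactly 76 million (the slide gives it as a lower bound) and assumes 2 TB means 2 × 10^12 bytes. The function name `scope_ratios` is made up for illustration.

```python
def scope_ratios(sites, captures, documents, total_bytes):
    """Derive simple averages from the archive scope figures."""
    return {
        "captures_per_site": captures / sites,
        "documents_per_capture": documents / captures,
        "bytes_per_document": total_bytes / documents,
    }


# Figures from the slide; 76 million is a lower bound and
# the decimal interpretation of "2 TB" is an assumption.
ratios = scope_ratios(
    sites=527,
    captures=10_402,
    documents=76_000_000,
    total_bytes=2 * 10**12,
)

for name, value in ratios.items():
    print(f"{name}: {value:,.1f}")
```

Under those assumptions this works out to roughly 20 captures per site, about 7,300 documents per capture, and about 26 KB per document.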
  • 3.
  • 4. Archive Selection & Context
    Planned archives:
      Advance subject expertise
      Time for evaluation
      Time for QA
      Focus on comprehensive capture
      Traditional collection development
      Control over scale
    Event archives:
      Act quickly
      No one is the expert
      Collaboration required
      Every efficiency matters
      Frequent shallow captures / rapidly changing sites
      Massive scale
  • 5. 3 Challenges
    Site selection
    Site / capture management
    Quality assurance
  • 6. Getting Volunteers
    Tried bringing volunteers into service
    “Add to WAS” browser button
    Tried external nomination tool
    TAP INTO WHAT USERS ARE ALREADY DOING
  • 7. LSU tags relevant sites in Delicious
    CDL imports Delicious JSON feed into WAS
    ~50% Delicious
    ~45% one curator
    ~5% everything else
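The Delicious-to-WAS import described on this slide could look something like the sketch below. It is not CDL's actual importer. It assumes the feed is a JSON array of bookmark objects with the short keys `u` (URL), `d` (title) and `t` (tag list), which is how Delicious JSON feeds were commonly laid out. The function name `extract_nominations`, the sample feed, and the `oilspill` tag are hypothetical.

```python
import json
from urllib.parse import urlsplit


def extract_nominations(feed_json, required_tag=None):
    """Turn a Delicious-style JSON feed into a de-duplicated list of site nominations."""
    nominations = []
    seen = set()
    for entry in json.loads(feed_json):
        url = (entry.get("u") or "").strip()   # assumed key for the bookmark URL
        tags = entry.get("t") or []            # assumed key for the tag list
        if not url:
            continue
        if required_tag is not None and required_tag not in tags:
            continue
        parts = urlsplit(url)
        if parts.scheme not in ("http", "https") or not parts.netloc:
            continue
        # Normalise host case and trailing slash for duplicate detection only;
        # the nominated URL itself is kept as the librarian bookmarked it.
        key = (parts.netloc.lower(), parts.path.rstrip("/"), parts.query)
        if key in seen:
            continue
        seen.add(key)
        nominations.append({"url": url, "title": entry.get("d", ""), "tags": tags})
    return nominations


# Hypothetical sample feed, not real archive data.
SAMPLE_FEED = """[
  {"u": "http://example.org/spill-response/", "d": "Response page", "t": ["oilspill", "government"]},
  {"u": "http://EXAMPLE.org/spill-response", "d": "Same page again", "t": ["oilspill"]},
  {"u": "http://example.com/news", "d": "Regional news", "t": ["news"]},
  {"u": "", "d": "Bookmark without a URL", "t": ["oilspill"]}
]"""

tagged = extract_nominations(SAMPLE_FEED, required_tag="oilspill")
everything = extract_nominations(SAMPLE_FEED)
print(len(tagged), len(everything))
```

Filtering on a single agreed tag keeps the librarians' workflow unchanged, in line with the "tap into what users are already doing" point: they keep bookmarking as usual, and the import decides what becomes a nomination.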
  • 8. Site Management - From:
    Fixed table
    Not enough control
    Few batch actions
  • 9. To
  • 10. To (2)
  • 11.
  • 12.
  • 13. Collection Observations
    Of ~350 sites from the Hurricane Katrina archive, only about 120 were initially relevant to the oil spill
    Different responding organizations
    The relevant sites
    Political offices / government agencies in the region
    News sources in the region
    Environmental organizations
  • 14.
  • 15. Reminders
    Use the tools you build
    At larger scale than your users
    Take advantage of existing workflows
    Collection building drives innovation
  • 16. Next Steps
    Release public archive
    Review with Louisiana State University librarians
    Web Archiving Service
    http://was.cdlib.org
    www.facebook.com/webarchiving