How I spend my summer vacations

  • 1,946 views
Uploaded on

WADL2013 presentation by Justin F. Brunelle on his current research projects.

WADL2013 presentation by Justin F. Brunelle on his current research projects.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
1,946
On Slideshare
0
From Embeds
0
Number of Embeds
36

Actions

Shares
Downloads
2
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. How I spend my summer vacations Justin F. Brunelle WS-DL Research Group Department of Computer Science Old Dominion University WADL 2013
  • 2. Justin in a nutshell • PhD Student at ODU • Dynamic representations –in the archives –Improved quality from –archived data –Alter-ego: Application Developer • at The MITRE Corporation –Big data & cloud computing
  • 3. How much can we archive?
  • 4. The setup • 1,000 URIs from Twitter • 1,000 URIs from Archive-it • Capture with tools • Study the archivability
  • 5. Good
  • 6. Good
  • 7. Good
  • 8. Meh… Zombies in the Archives
  • 9. Bad
  • 10. Bad
  • 11. Bad
  • 12. Bad
  • 13. Bad
  • 14. Why?
  • 15. Losing the Moment • What we share != What we curate • 4.2% of Twitter is perfectly archived –Losing My Revolution: 11% gone in 2 years • 34.2% of Archive-it is perfectly archived • Accessibility? Gov vs. non-Gov?
  • 16. Measuring memento damage
  • 17. VS.
  • 18. Not all embedded resources are created equal
  • 19. Not all embedded resources are created equal
  • 20. Planned Work • Evaluate importance of missing stuff –Size, position –# CSS Classes –Not all stylesheets created equal – Missing border vs missing functionality – “Whitespace” –Provide Web service • Mechanical Turk evaluation of “damage” • Evaluate collections of mementos
  • 21. What does it all mean? • Archivability is measurable • Damage is measurable • If we can predict archivability…. –We can try new methods of archiving on “hard to capture” mementos –Attempt repairs on existing mementos –Gauge our successes in real-time • Next step: capturing dynamic content