How I spend my summer vacations

2,743 views

Published on

WADL2013 presentation by Justin F. Brunelle on his current research projects.

Published in: Spiritual, Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
2,743
On SlideShare
0
From Embeds
0
Number of Embeds
1,445
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

How I spend my summer vacations

  1. 1. How I spend my summer vacations Justin F. Brunelle WS-DL Research Group Department of Computer Science Old Dominion University WADL 2013
  2. 2. Justin in a nutshell • PhD Student at ODU • Dynamic representations –in the archives –Improved quality from –archived data –Alter-ego: Application Developer • at The MITRE Corporation –Big data & cloud computing
  3. 3. How much can we archive?
  4. 4. The setup • 1,000 URIs from Twitter • 1,000 URIs from Archive-it • Capture with tools • Study the archivability
  5. 5. Good
  6. 6. Good
  7. 7. Good
  8. 8. Meh… Zombies in the Archives
  9. 9. Bad
  10. 10. Bad
  11. 11. Bad
  12. 12. Bad
  13. 13. Bad
  14. 14. Why?
  15. 15. Losing the Moment • What we share != What we curate • 4.2% of Twitter is perfectly archived –Losing My Revolution: 11% gone in 2 years • 34.2% of Archive-it is perfectly archived • Accessibility? Gov vs. non-Gov?
  16. 16. Measuring memento damage
  17. 17. VS.
  18. 18. Not all embedded resources are created equal
  19. 19. Not all embedded resources are created equal
  20. 20. Planned Work • Evaluate importance of missing stuff –Size, position –# CSS Classes –Not all stylesheets created equal – Missing border vs missing functionality – “Whitespace” –Provide Web service • Mechanical Turk evaluation of “damage” • Evaluate collections of mementos
  21. 21. What does it all mean? • Archivability is measurable • Damage is measurable • If we can predict archivability…. –We can try new methods of archiving on “hard to capture” mementos –Attempt repairs on existing mementos –Gauge our successes in real-time • Next step: capturing dynamic content

×