Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Generating stories from Archive-It collections

237 views

Published on

This was presented in Internet Archive Jan 22, 2016. It demonstrates the possible types of stories and how to manually create them from Archive-It collections.

Published in: Science
  • Be the first to comment

  • Be the first to like this

Generating stories from Archive-It collections

  1. 1. Generating stories from Archive-It collections Yasmin AlNoamany 11/22/16
  2. 2. Outlines • Motivation • Framework • Benefit of the work • Possible types of stories • Criteria of the pages in the stories • Generating a story 2
  3. 3. There is more than one collection about “Egyptian Revolution” 3 • “2010-2011 Arab Spring” https://archive-it.org/collections/3101 • “North Africa & the Middle East 2011-2013” https://archive-it.org/collections/2349 • “Egypt Revolution and Politics” https://archive-it.org/collections/2358
  4. 4. Which collection I choose if I want to know about the Egyptian Revolution? 4 • “2010-2011 Arab Spring” https://archive-it.org/collections/3101 • “North Africa & the Middle East 2011-2013” https://archive-it.org/collections/2349 • “Egypt Revolution and Politics” https://archive-it.org/collections/2358
  5. 5. Our goal: web archives + storytelling services à archived enriched stories 5 Archived collectionsStorytelling services Archived enriched stories
  6. 6. How the generated stories will be integrated in Archive-It 6 Proposed “Collection Overviews” added to collection metadata
  7. 7. The archived collection is two dimensions 7 Time URI
  8. 8. Possible types of stories can be generated Fixed, Fixed Fixed Page, Sliding Time Sliding Page, Fixed Time Sliding Page, Sliding Time Same Different Same Different Time URI 8
  9. 9. Possible types of stories can be generated • Not supported yet 9 Fixed, Fixed Fixed Page, Sliding Time Sliding Page, Fixed Time Sliding Page, Sliding Time Same Different Same Different Time URI
  10. 10. Fixed Page, Sliding Time: Same Website at different times https://storify.com/yasmina_anwar/boston-marathon-bombing-story-from- archive-it-same http://www.guardian.co.uk/world/2013/apr/15/boston-marathon-explosion-live 10 R R R R R R t1 t3t2 t5t4 t6
  11. 11. Sliding Page, Fixed Time: Different URIs in the same day (April 15, 2013) https://storify.com/yasmina_anwar/boston-marathon-bombing-story- from-archive-it-coll 11 R1 R2 R3 R4 t1 t3t2 t5t4 t6
  12. 12. Sliding Page, Sliding Time: Different URIs through time https://storify.com/yasmina_anwar/boston-marathon-bombing-from-archive-it-collection 12 R1 R2 R1 R3 R4 R2 t1 t3t2 t5t4 t6
  13. 13. Criteria of choosing the mementos 13
  14. 14. The content language should be English 14
  15. 15. The memento should be on-topic 15 Occupy the U.P. on Jan. 10, 2012 Expired on August 14, 2012
  16. 16. The memento should be on-topic 16 http://wayback.archive- it.org/2358/20110204123927/http://www.bbc.co.uk/news/world/middle_east/ http://wayback.archive- it.org/2358/20130301084729/http://www.bbc.co.uk/news/world/middle_east/
  17. 17. A news article gives better snippet on Storify than a blog post 17news.blogs.cnn.com cnn.com
  18. 18. Deep links gives better snippet on Storify 18http://www.bbc.co.uk/news/world/middle_east/ http://www.bbc.co.uk/news/world-middle-east-12433045
  19. 19. The mementos should not be a Twitter account 19 http://wayback.archive- it.org/1784/20100131023240/http://twitter.com/Haitifeed/
  20. 20. The mementos should not be a Facebook page/group 20 http://wayback.archive- it.org/2358/20141225080305/https:/www.facebook.com/elshaheeed.co.uk
  21. 21. How to generate a story from a collection http://wayback.archive-it.org/3649/20130419171216/http://www.newyorker.com/online/blogs/books/2013/04/come-and-see-the-blood-in-the-streets.html Boston Marathon Bombing: 3649 Story 1: Different URIs through time https://wayback.archive- it.org/3649/20130419171338/http://news.nationalpost .com/2013/04/15/two-explosions-at-boston-marathon- finish-line-injure-dozens-reports/ http://wayback.archive- it.org/3649/20130419171216/http://www.newyorker.com /online/blogs/books/2013/04/come-and-see-the-blood- in-the-streets.html https://wayback.archive- it.org/3649/20130419171237/http://news.nationalpost .com/2013/04/15/boston-marathon-bombing/ https://wayback.archive- it.org/3649/20130419171216/http://www.nbcnews.com/b usiness/economywatch/boston-braces-economic-impact- bomb-blasts-1C9373154 https://wayback.archive- it.org/3649/20130419171416/http://www.guardian.co.u k/world/2013/apr/17/us-muslims-fear-profiling- boston 21
  22. 22. Suggested collections to generate stories from Collection Name Collection URI # seeds 2013 Government Shutdown https://archive-it.org/collections/3936 186 Wikileaks 2010 Document Release Collection https://archive-it.org/collections/2017 41 Earthquake in Haiti https://archive-it.org/collections/1784 132 April 16 Archive https://archive-it.org/collections/694 88 Brazilian School Shooting https://archive-it.org/collections/2535 650 Global Health Events https://archive-it.org/collections/4887 169 2013 Boston Marathon Bombing https://archive-it.org/collections/3649 318 Occupy Movement 2011/2012 https://archive-it.org/collections/2950 955 Egypt Revolution and Politics https://archive-it.org/collections/2358 1112 Russia Plane Crash Sept 7,2011 https://archive-it.org/collections/2823 104 22
  23. 23. We expect three stories for each collection 23 Same Website at different times Different URIs in the same day Different URIs through time

×