Diata12 ARCOMEM

617 views
579 views

Published on

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
617
On SlideShare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
6
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Diata12 ARCOMEM

  1. 1. ARCOMEMSocialmediaarchivingDominik Frey (SWR) | CosminCabulea (DW) DIATA12, 21.03.2012
  2. 2. SocialmediaarchivingARchiveCOMmunityMEMories: Howtoidentifyandpreserve relevant socialmediacontent? 2
  3. 3. Project consortium01/2011 - 12/2013, fundedbythe EC 3
  4. 4. Usecases Broadcaster: Rock festivals Parliament: Euro Crisis 4
  5. 5. Talk about Rock am Ring News, opinions, facts, rumors, … Links tovideos, images, blogs, … 5
  6. 6. Images 6
  7. 7. Videos 7
  8. 8. What content is relevant? Social web anlysis: popularity, influence, trust, diversity Semanticanalysis: entities, topics, events, opinions 8
  9. 9. Usagescenarios Forarchivistssupportcontentselection&contextualize web archives Forjournalistsfind relevant contentfortheir stories &followthediscussionsaboutit 9
  10. 10. ArchivingworkflowCollect Analyse Archive Present Two stage archiving strategy: web  analyzing storage  archive Archivist describes target HTML and API crawlers fetch content 10
  11. 11. ArchivingworkflowCollect Analyse Archive Present Different modules analyse semantic information & social context to filter relevant content HBase and RDF triple storage 11
  12. 12. ArchivingworkflowCollect Analyse Archive Present Only relevant content is preserved in (W)ARC format Semiautomatic content selection Heritrix and Wayback compatible 12
  13. 13. ArchivingworkflowCollect Analyse Archive Present Fulltext search and facet browsing Semantic and social contextualization Visualizations to be developed on top (not in ARCOMEM sope) 13
  14. 14. TheJournalisticScenario 14
  15. 15. TheJournalisticUseCase 15
  16. 16. The Story 16
  17. 17. Data 17
  18. 18. TheChallenges 18
  19. 19. The Data Layers Social web 19
  20. 20. TheChallenges 20
  21. 21. Vox Civitas User Interface 21
  22. 22. SRSR (Seriously Rapid SourceReview) 22
  23. 23. Riotrumours: howmisinformationspread onTwitterduring a time of crisis 23
  24. 24. ARCOMEM Graphic User Interface (Draft) 24
  25. 25. Third-Party-Brain 25
  26. 26. THANK YOU CONTACT DETAILS Dominik Frey dominik.frey@swr.de CosminCabulea cosmin.cabulea@dw.de www.arcomem.eu 26

×