Your SlideShare is downloading. ×
  • Like
Diata12 ARCOMEM
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Now you can save presentations on your phone or tablet

Available for both IPhone and Android

Text the download link to your phone

Standard text messaging rates apply

Diata12 ARCOMEM

  • 343 views
Published

 

Published in Technology , Education
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
343
On SlideShare
0
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
5
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. ARCOMEMSocialmediaarchivingDominik Frey (SWR) | CosminCabulea (DW) DIATA12, 21.03.2012
  • 2. SocialmediaarchivingARchiveCOMmunityMEMories: Howtoidentifyandpreserve relevant socialmediacontent? 2
  • 3. Project consortium01/2011 - 12/2013, fundedbythe EC 3
  • 4. Usecases Broadcaster: Rock festivals Parliament: Euro Crisis 4
  • 5. Talk about Rock am Ring News, opinions, facts, rumors, … Links tovideos, images, blogs, … 5
  • 6. Images 6
  • 7. Videos 7
  • 8. What content is relevant? Social web anlysis: popularity, influence, trust, diversity Semanticanalysis: entities, topics, events, opinions 8
  • 9. Usagescenarios Forarchivistssupportcontentselection&contextualize web archives Forjournalistsfind relevant contentfortheir stories &followthediscussionsaboutit 9
  • 10. ArchivingworkflowCollect Analyse Archive Present Two stage archiving strategy: web  analyzing storage  archive Archivist describes target HTML and API crawlers fetch content 10
  • 11. ArchivingworkflowCollect Analyse Archive Present Different modules analyse semantic information & social context to filter relevant content HBase and RDF triple storage 11
  • 12. ArchivingworkflowCollect Analyse Archive Present Only relevant content is preserved in (W)ARC format Semiautomatic content selection Heritrix and Wayback compatible 12
  • 13. ArchivingworkflowCollect Analyse Archive Present Fulltext search and facet browsing Semantic and social contextualization Visualizations to be developed on top (not in ARCOMEM sope) 13
  • 14. TheJournalisticScenario 14
  • 15. TheJournalisticUseCase 15
  • 16. The Story 16
  • 17. Data 17
  • 18. TheChallenges 18
  • 19. The Data Layers Social web 19
  • 20. TheChallenges 20
  • 21. Vox Civitas User Interface 21
  • 22. SRSR (Seriously Rapid SourceReview) 22
  • 23. Riotrumours: howmisinformationspread onTwitterduring a time of crisis 23
  • 24. ARCOMEM Graphic User Interface (Draft) 24
  • 25. Third-Party-Brain 25
  • 26. THANK YOU CONTACT DETAILS Dominik Frey dominik.frey@swr.de CosminCabulea cosmin.cabulea@dw.de www.arcomem.eu 26