ARCOMEMSocialmediaarchivingDominik Frey (SWR) | CosminCabulea (DW)           DIATA12, 21.03.2012
SocialmediaarchivingARchiveCOMmunityMEMories: Howtoidentifyandpreserve relevant socialmediacontent?                       ...
Project consortium01/2011 - 12/2013, fundedbythe EC                                    3
Usecases Broadcaster: Rock festivals Parliament: Euro Crisis                                4
Talk about Rock am Ring News, opinions, facts, rumors, … Links tovideos, images, blogs, …                               ...
Images         6
Videos         7
What content is relevant? Social web anlysis:  popularity, influence, trust, diversity Semanticanalysis:  entities, topi...
Usagescenarios Forarchivistssupportcontentselection&contextualize  web archives Forjournalistsfind relevant contentforth...
ArchivingworkflowCollect   Analyse    Archive    Present Two stage archiving strategy: web  analyzing storage  archive...
ArchivingworkflowCollect    Analyse     Archive     Present Different modules analyse semantic information & social conte...
ArchivingworkflowCollect    Analyse     Archive     Present Only relevant content is preserved in (W)ARC format Semiauto...
ArchivingworkflowCollect    Analyse    Archive     Present Fulltext search and facet browsing Semantic and social contex...
TheJournalisticScenario                          14
TheJournalisticUseCase                         15
The Story            16
Data       17
TheChallenges                18
The Data Layers                  Social web                               19
TheChallenges                20
Vox Civitas User Interface                             21
SRSR (Seriously Rapid SourceReview)                                      22
Riotrumours: howmisinformationspread onTwitterduring a time of crisis                                          23
ARCOMEM Graphic User Interface (Draft)                                         24
Third-Party-Brain                    25
THANK YOU        CONTACT DETAILS              Dominik Frey       dominik.frey@swr.de            CosminCabulea     cosmin.c...
Upcoming SlideShare
Loading in...5
×

Diata 2012 ARCOMEM

137

Published on

Presentation of Arcomem on Diata 2012.

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
137
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
0
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Diata 2012 ARCOMEM

  1. 1. ARCOMEMSocialmediaarchivingDominik Frey (SWR) | CosminCabulea (DW) DIATA12, 21.03.2012
  2. 2. SocialmediaarchivingARchiveCOMmunityMEMories: Howtoidentifyandpreserve relevant socialmediacontent? 2
  3. 3. Project consortium01/2011 - 12/2013, fundedbythe EC 3
  4. 4. Usecases Broadcaster: Rock festivals Parliament: Euro Crisis 4
  5. 5. Talk about Rock am Ring News, opinions, facts, rumors, … Links tovideos, images, blogs, … 5
  6. 6. Images 6
  7. 7. Videos 7
  8. 8. What content is relevant? Social web anlysis: popularity, influence, trust, diversity Semanticanalysis: entities, topics, events, opinions 8
  9. 9. Usagescenarios Forarchivistssupportcontentselection&contextualize web archives Forjournalistsfind relevant contentfortheir stories &followthediscussionsaboutit 9
  10. 10. ArchivingworkflowCollect Analyse Archive Present Two stage archiving strategy: web  analyzing storage  archive Archivist describes target HTML and API crawlers fetch content 10
  11. 11. ArchivingworkflowCollect Analyse Archive Present Different modules analyse semantic information & social context to filter relevant content HBase and RDF triple storage 11
  12. 12. ArchivingworkflowCollect Analyse Archive Present Only relevant content is preserved in (W)ARC format Semiautomatic content selection Heritrix and Wayback compatible 12
  13. 13. ArchivingworkflowCollect Analyse Archive Present Fulltext search and facet browsing Semantic and social contextualization Visualizations to be developed on top (not in ARCOMEM sope) 13
  14. 14. TheJournalisticScenario 14
  15. 15. TheJournalisticUseCase 15
  16. 16. The Story 16
  17. 17. Data 17
  18. 18. TheChallenges 18
  19. 19. The Data Layers Social web 19
  20. 20. TheChallenges 20
  21. 21. Vox Civitas User Interface 21
  22. 22. SRSR (Seriously Rapid SourceReview) 22
  23. 23. Riotrumours: howmisinformationspread onTwitterduring a time of crisis 23
  24. 24. ARCOMEM Graphic User Interface (Draft) 24
  25. 25. Third-Party-Brain 25
  26. 26. THANK YOU CONTACT DETAILS Dominik Frey dominik.frey@swr.de CosminCabulea cosmin.cabulea@dw.de www.arcomem.eu 26

×