• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
ECIR 2013 Keynote - Time for Events
 

ECIR 2013 Keynote - Time for Events

on

  • 2,425 views

 

Statistics

Views

Total Views
2,425
Views on SlideShare
1,591
Embed Views
834

Actions

Likes
2
Downloads
22
Comments
0

5 Embeds 834

http://digitaloctober.ru 720
https://twitter.com 105
http://ru.vingrad.com 6
http://www.digitaloctober.ru 2
https://abs.twimg.com 1

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

CC Attribution-NonCommercial LicenseCC Attribution-NonCommercial License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    ECIR 2013 Keynote - Time for Events ECIR 2013 Keynote - Time for Events Presentation Transcript

    • time for eventstelling the world’s stories from social media Mor Naaman Rutgers SC&I & Mahaya, Inc. @informor
    • enter: social media
    • (JCDL 2007)
    • (JCDL 2007)
    • (SIGIR 2007) yes.
    • organize the world’s memories
    • people, together
    • BYOBW
    • outside lands festival
    • organize the world’s memories
    • detectidentifyorganize objectives
    • objectivesdetect ICWSM 2011a JASIST 2011 WebDB 2009 SIGIR 2007
    • objectivesidentify WSDM 2012 ICWSM 2011b WSDM 2010
    • objectives organize ICMR 2012 CHI 2012CSCW 2012 MTAP 2012 VAST 2010WWW 2009
    • today organize identifydetect Vox! multi-site Multiplayer
    • overview E Multi-site content Vox Civitas Multiplayer
    • Egoaleffectively retrieve social media contentfor known events from multiple services[with Hila Becker, Luis Gravano]
    • E
    • Echallengesevent descriptor not well-formedbrief textual descriptorsnoiseformats/conventions/metadata differ
    • Eapproachtwo-step query formulation precision-based recall-basedvalidate queries based on known/extracted event model
    • E Estep 1term extraction from event descriptorsgenerates “high precision” queriese. g. “andrew bird, opening gala,celebrate brooklyn, prospect park”
    • E Estep 2use “high precision” corpus to generatemore general queries to improve recalle. g. “andrew bird concert”, “state farminsurance”
    • E Erecall-oriented queriesBenefits:- Works cross-site- Works with short contentChallenges:- Introduces noise- Potentially large set of queries
    • E Epost-filteringuse known event model (topics, time,location)use queries with a result set thatmatches known model
    • E Efor example...120"100" 80" 60" 40" 20" 0" 6/7/11" 6/8/11" 6/9/11" 6/10/11" 6/11/11" 6/12/11" 6/13/11" [andrew"bird"concert]" [state"farm"insurance]"
    • Eevaluation 1.1"query generation4" 1" 4" 0.9" 0.8" 5" 5" Precision" 0.7"relevance of36"retrieved documentsNDCG% 0.6" 39" 34" 34" Twi7er8MS" 0.5" 0.4" 0.3" YouTube8MS" 7" 0.2" 9" 8" 8" 0.1" 0" 0" 5" 10" 15" 20" 25" Number%of%Documents%k%
    • Etakeawayscan aggregate content fragmentedacross platformsimprove recall, not rely on site-specificfeatures
    • overview E Multi-site content (WSDM 2012) Vox Civitas Multiplayer
    • research questionscan Twitter content around broadcastnews events inform journalistic inquiry?what insights and analyses can weenable through visual analytic tools?[with postdoctoral fellow Nick Diakopoulos]
    • supporting analysisdirect attention to relevant informationautomatic content analysis for filtering – relevance – uniqueness / novelty – sentiment – keyword extraction
    • how to evaluate?directly evaluate the output of thealgorithms (quantitative)deep, extensive evaluation of users’interaction with the system (qualitative)   read more: Olsen (UIST ’07) Naaman (MTAP ’12)
    • Vox evaluation goals•  How effective for generating story ideas?•  What kind of insights/analysis are supported?•  Shortcomings and how features are used?
    • takeawayscan extract reliable event structure fromsocial media
    • overview E Multi-site content Vox Civitas (VAST 2010) Multiplayer
    • what the hell?[with: Lyndon Kennedy, Dan Ellis, Kai Su]
    • supporting analysisextract the signal from people’sattention:find overlapping momentscompute and rank scenesextract scene descriptors
    • audio fingerprinting Wang et al. (ISMIR ’03)
    • two clips, aligned 0:18 3:320:000:00 2:32
    • a story of n clips time
    • from clips to scenesHigher GroundEncore time Happy Birthday, Birthday
    • evaluationquantitative: evaluated matching, sceneextraction…qualitative: evaluated deploymentscenario/task
    • takeawayscan create an event presentation thatgets better them more content is added
    • overview E Multi-site content Vox Civitas Multiplayer (NM&S 2012, ICMR 2012, MTAP 2012, WWW 2009)
    • towards better models oflarge-scale human attention
    • printing press
    • è knowledge archive
    • digital documents
    • èdigital archive
    • the web
    • ènetworked archive
    • social media
    • èexperience archive
    • new methods?
    • search by subject code?
    • explore.new information seeking tasks (andmodels)new applications for social mediacontent
    • explore.beyond real-timepersonal and social
    • questions? mor@rutgers.edu @informorhttp://mornaaman.com
    • thanksLuis GravanoHila BeckerNick DiakopoulosKai SuDan EllisMunmun de ChoudhuryTarikh Korula…