ECIR 2013 Keynote - Time for Events
Upcoming SlideShare
Loading in...5
×
 

ECIR 2013 Keynote - Time for Events

on

  • 2,492 views

 

Statistics

Views

Total Views
2,492
Views on SlideShare
1,658
Embed Views
834

Actions

Likes
2
Downloads
22
Comments
0

5 Embeds 834

http://digitaloctober.ru 720
https://twitter.com 105
http://ru.vingrad.com 6
http://www.digitaloctober.ru 2
https://abs.twimg.com 1

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

CC Attribution-NonCommercial LicenseCC Attribution-NonCommercial License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

ECIR 2013 Keynote - Time for Events ECIR 2013 Keynote - Time for Events Presentation Transcript

  • time for eventstelling the world’s stories from social media Mor Naaman Rutgers SC&I & Mahaya, Inc. @informor
  • enter: social media
  • (JCDL 2007)
  • (JCDL 2007)
  • (SIGIR 2007) yes.
  • organize the world’s memories
  • people, together
  • BYOBW
  • outside lands festival
  • organize the world’s memories
  • detectidentifyorganize objectives
  • objectivesdetect ICWSM 2011a JASIST 2011 WebDB 2009 SIGIR 2007
  • objectivesidentify WSDM 2012 ICWSM 2011b WSDM 2010
  • objectives organize ICMR 2012 CHI 2012CSCW 2012 MTAP 2012 VAST 2010WWW 2009
  • today organize identifydetect Vox! multi-site Multiplayer
  • overview E Multi-site content Vox Civitas Multiplayer
  • Egoaleffectively retrieve social media contentfor known events from multiple services[with Hila Becker, Luis Gravano]
  • E
  • Echallengesevent descriptor not well-formedbrief textual descriptorsnoiseformats/conventions/metadata differ
  • Eapproachtwo-step query formulation precision-based recall-basedvalidate queries based on known/extracted event model
  • E Estep 1term extraction from event descriptorsgenerates “high precision” queriese. g. “andrew bird, opening gala,celebrate brooklyn, prospect park”
  • E Estep 2use “high precision” corpus to generatemore general queries to improve recalle. g. “andrew bird concert”, “state farminsurance”
  • E Erecall-oriented queriesBenefits:- Works cross-site- Works with short contentChallenges:- Introduces noise- Potentially large set of queries
  • E Epost-filteringuse known event model (topics, time,location)use queries with a result set thatmatches known model
  • E Efor example...120"100" 80" 60" 40" 20" 0" 6/7/11" 6/8/11" 6/9/11" 6/10/11" 6/11/11" 6/12/11" 6/13/11" [andrew"bird"concert]" [state"farm"insurance]"
  • Eevaluation 1.1"query generation4" 1" 4" 0.9" 0.8" 5" 5" Precision" 0.7"relevance of36"retrieved documentsNDCG% 0.6" 39" 34" 34" Twi7er8MS" 0.5" 0.4" 0.3" YouTube8MS" 7" 0.2" 9" 8" 8" 0.1" 0" 0" 5" 10" 15" 20" 25" Number%of%Documents%k%
  • Etakeawayscan aggregate content fragmentedacross platformsimprove recall, not rely on site-specificfeatures
  • overview E Multi-site content (WSDM 2012) Vox Civitas Multiplayer
  • research questionscan Twitter content around broadcastnews events inform journalistic inquiry?what insights and analyses can weenable through visual analytic tools?[with postdoctoral fellow Nick Diakopoulos]
  • supporting analysisdirect attention to relevant informationautomatic content analysis for filtering – relevance – uniqueness / novelty – sentiment – keyword extraction
  • how to evaluate?directly evaluate the output of thealgorithms (quantitative)deep, extensive evaluation of users’interaction with the system (qualitative)   read more: Olsen (UIST ’07) Naaman (MTAP ’12)
  • Vox evaluation goals•  How effective for generating story ideas?•  What kind of insights/analysis are supported?•  Shortcomings and how features are used?
  • takeawayscan extract reliable event structure fromsocial media
  • overview E Multi-site content Vox Civitas (VAST 2010) Multiplayer
  • what the hell?[with: Lyndon Kennedy, Dan Ellis, Kai Su]
  • supporting analysisextract the signal from people’sattention:find overlapping momentscompute and rank scenesextract scene descriptors
  • audio fingerprinting Wang et al. (ISMIR ’03)
  • two clips, aligned 0:18 3:320:000:00 2:32
  • a story of n clips time
  • from clips to scenesHigher GroundEncore time Happy Birthday, Birthday
  • evaluationquantitative: evaluated matching, sceneextraction…qualitative: evaluated deploymentscenario/task
  • takeawayscan create an event presentation thatgets better them more content is added
  • overview E Multi-site content Vox Civitas Multiplayer (NM&S 2012, ICMR 2012, MTAP 2012, WWW 2009)
  • towards better models oflarge-scale human attention
  • printing press
  • è knowledge archive
  • digital documents
  • èdigital archive
  • the web
  • ènetworked archive
  • social media
  • èexperience archive
  • new methods?
  • search by subject code?
  • explore.new information seeking tasks (andmodels)new applications for social mediacontent
  • explore.beyond real-timepersonal and social
  • questions? mor@rutgers.edu @informorhttp://mornaaman.com
  • thanksLuis GravanoHila BeckerNick DiakopoulosKai SuDan EllisMunmun de ChoudhuryTarikh Korula…