• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Leveraging the Semantics of Tweets for Adaptive Faceted Search on Twitter
 

Leveraging the Semantics of Tweets for Adaptive Faceted Search on Twitter

on

  • 1,959 views

Slides presented at ISWC 2011, Bonn, Germany. Corresponding paper: http://iswc2011.semanticweb.org/fileadmin/iswc/Papers/Research_Paper/12/70310001.pdf

Slides presented at ISWC 2011, Bonn, Germany. Corresponding paper: http://iswc2011.semanticweb.org/fileadmin/iswc/Papers/Research_Paper/12/70310001.pdf

Statistics

Views

Total Views
1,959
Views on SlideShare
1,936
Embed Views
23

Actions

Likes
5
Downloads
41
Comments
0

8 Embeds 23

http://a0.twimg.com 12
http://www.valuedfriends.eu 3
http://florian.altherr.name 2
https://bbdev02.ic.uva.nl 2
http://tweetedtimes.com 1
http://bbdev01.ic.uva.nl 1
http://www.onlydoo.com 1
https://twitter.com 1
More...

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Motivation:Information overloadPersonalised “better” search
  • Why do people search on Twitter rather than Google?Real time info & opinion about almost anything
  • Example: HT’11 @Eindhoven, looking for some entertainment events...http://search.twitter.com/http://search.twitter.com/advanced
  • Space limitation + selecting keywords (abbreviations –shorthand notations + colloquial expressions)
  • Highlight 60
  • Very time consuming and overwhelming indeed!
  • Very time consuming and overwhelming indeed!
  • entity extraction and semantic enrichment and relation discovery.
  • Might be better to remove the Costs column...?
  • Our framework extracts typed entities from enriched tweets/news and provides strategies for detecting semantic (trending) relationships between entities. We:investigated the precision and recall of the relation detection strategies,analyzed how the strategies perform for each type of relationships andWhich strategy performs best in detecting relationships between entities?Does the accuracy depend on the type of entities which are involved in a relation?How do the strategies perform for discovering relationships which have temporal constraints, and how fast can the strategies detect (trending) relationships?evaluated the quality and speed for discovering trending relationships that possibly have a limited temporal validity.

Leveraging the Semantics of Tweets for Adaptive Faceted Search on Twitter Leveraging the Semantics of Tweets for Adaptive Faceted Search on Twitter Presentation Transcript

  • Leveraging the Semantics of Tweets for Adaptive Faceted Search on Twitter ISWC, Bonn, Germany, Oct 27th 2011Fabian Abel1, Ilknur Celik 1, Geert-Jan Houben, Patrick Siehndel2 1Web Information Systems, TU Delft, the Netherlands 2L3S Research Center, Hannover, Germany Delft University of Technology
  • What we do: Science and Engineering for the Personal Webdomains: news social mediacultural heritage public datae-learning Personalized Personalized Adaptive Systems Recommendations Search Analysis and User Modeling Semantic Enrichment, Linkage and Alignment user/usage data Social Web Adaptive Faceted Search on Twitter 2
  • 200,000,000 number of tweets published per day Adaptive Faceted Search on Twitter 3
  • 1number of tweets that are interesting for me now Adaptive Faceted Search on Twitter 4
  • Searching on Twitter Adaptive Faceted Search on Twitter 5
  • Issues with Multiple Keywords Search Adaptive Faceted Search on Twitter 6
  • Let’s try to search with One Keyword Adaptive Faceted Search on Twitter 7
  • Page 1Adaptive Faceted Search on Twitter 8
  • Page 2Adaptive Faceted Search on Twitter 9
  • Page 3Adaptive Faceted Search on Twitter 10
  • Music Artist Page 60!! Next Saturday @thatsimpsonguyaka Guilty Simpson will be performing attweet I was Area51 in my hometwonEindhoven. #realliveshit #iwillspinrecords looking hours ago via Blackberry about 9 forLocations Adaptive Faceted Search on Twitter 11
  • Is there an easier way? Faceted Search can help(hypothesis) Expand Query: Current Query: Locations more... Eindhoven Music Events more... Results: 1. Yskiddd: Next Music Artists: saturday@thatsimpsonguy aka Guilty Simpson will be performing at Area51 + Guilty Simpson in my homeytown Eindhoven. + Bryan Adams #realliveshit#iwillspinrecords2 + Elton John + Golden Earring 2. Usee123: Cool #EV3door7980 !!! + Rihanna http://bit.ly/igyyRhL + The eagles 3. sanmiquelmusic: This Saturday Im + 3 Doors Down joining @KrusadersMusic to Intents more... Adaptive Faceted Search on Twitter 12
  • Challenges Adaptive Faceted Search on Twitter 13
  • Facets of a Tweet @bob: JulianAssange got arrested Facet type Facet Value Creator @bob Location Delft, the Netherlands Creation time Nov 29 th 2011Challenge 1: How to infer facets that describe the content of a tweet? Adaptive Faceted Search on Twitter 14
  • Faceted Search: selecting facet-value pairs Expand Query: Current Query: Locations Music + Aachen + Aalborg Number of selectable Results: + Aalesund facet values may be 1. Yskiddd: Next + Aarhus saturday@thatsimpsonguy aka Guilty + Aasiaat very high! Simpson will be performing at Area51 + Abaiang in my homeytown Eindhoven. + Abakan #realliveshit#iwillspinrecords2 more... 2. Usee123: Cool #EV3door7980 !!! http://bit.ly/igyyRhLChallenge more... Events 2: How to adapt the faceted search 3. sanmiquelmusic: This Saturday Im interface to the current demands ofIntents joining @KrusadersMusic to a user? Music Artists more… Adaptive Faceted Search on Twitter 15
  • Adaptive Faceted Search Framework Adaptive Faceted Search on Twitter 16
  • Adaptive Faceted Search Framework user Adaptive Faceted SearchHow to represent How to adapt thethe content of a facet-value pair User and Context Modeling ranking to the tweet? facet extraction current demands of the user? Semantic Enrichment Twitter posts Adaptive Faceted Search on Twitter 17
  • Facet Extraction and Semantic Enrichment powered by Julian Assange @bob: Julian Assange got JulianAssange Tweet-based arrested enrichmentJulian Assange Julian Assange JulianAssangearrested Link-basedLondon JulianAssange, the founder of Julian Assange enrichment WikiLeaks, is under arrest in WikiLeaks London… LondonWikiLeaks Adaptive Faceted Search on Twitter 18
  • Impact of Link-based enrichment Representation of tweets: significantly more facets per tweet with link-based enrichment Adaptive Faceted Search on Twitter 19
  • Faceted Search Strategies• Challenge: most-relevant facet-value pair should appear at the top of the ranking Locations Locations 1. Aachen 1. Eindhoven 2. Aalborg 2. Delft 3. Aalesund 3. Amsterdam• Baseline: hashtag-based 4. Aarhus … 4. 5. Rotterdam London keyword search 2145. Eindhoven … Adaptive Faceted Search on Twitter 20
  • Faceted Search Strategies number of tweets that• Challenge:facet-value pair facet-value pair should appear at the most-relevant contain the FVP top of the ranking Locations Locations 1. Aachen 1. Eindhoven 2. Aalborg 2. Delft 3. Aalesund 3. Amsterdam• Baseline: hashtag-based 4. Aarhus … 4. 5. Rotterdam London keyword search of matching tweets current hit list 2145. Eindhoven …• Faceted Search Strategies: 1. Occurrence frequency: count occurrence frequencies of FVP (baseline) Adaptive Faceted Search on Twitter 21
  • Faceted Search Strategiesstratey Profile Personalized FVP ranking weight in user profile User FVP weight = number of tweets that (location, Delft) 6• Challenge:facet-value pair facet-value pair shouldJazzBaltica) at the most-relevant the FVP rank of (event, appear contain the FVP 4 top of the ranking Locations (person, ChetBaker) 3 Locations 1. Aachen 1. Eindhoven 2. Aalborg 2. Delft 3. Aalesund 3. Amsterdam• Baseline: hashtag-based 4. Aarhus 4. Rotterdam current hit June 27 matching tweets time … 5. London user keyword search of list 2145. Eindhoven July 4 …• Faceted Search Strategies: 1. Occurrence frequency: count occurrence frequencies of FVP (baseline) 2. Personalization: adapt ranking to user profile ( different user modeling strategies possible; here: entire tweeting history of the user) Adaptive Faceted Search on Twitter 22
  • Faceted Search Strategiesstratey Profile Personalized FVP ranking Genre weight in user profile User Genre FVP weight + Blues = number of tweets that + Blues (location, Delft) 6• Challenge:facet-value pair facet-value pair shouldJazzBaltica) at the + Jazz most-relevant the FVP rank of + Jazz appear contain the FVP 4 (event, + JazzMusic + Rock top of the ranking + Rock Locations (person, ChetBaker) 3 + Classic Locations 1. Aachen 1. Eindhoven more... 2. Aalborg more... 2. Delft 3. Aalesund 3. Amsterdam• Baseline: hashtag-based 4. Aarhus 4. Rotterdam current hit June 27 minimize overlaps time … 5. London user keyword search of matching tweets list 2145. Eindhoven July 4 …• Faceted Search Strategies: 1. Occurrence frequency: count occurrence frequencies of FVP (baseline) 2. Personalization: adapt ranking to user profile ( different user modeling strategies possible; here: entire tweeting history of the user) 3. Diversification: increase variety among the top-ranked FVPs Adaptive Faceted Search on Twitter 23
  • Faceted Search Strategiesstratey Profile Personalized FVP ranking weight in user profile Genre (event,FrenchOpen) User search Genre FVP weight + Blues = number of tweets that + Blues 6occurrence (location, Delft)frequency• Challenge:facet-value pair facet-value pair shouldJazzBaltica) at the + Jazz most-relevant the FVP Event + Jazz appear rank of JazzBaltica) contain the + JazzBaltica FVP 4 of FVP (event, (event, + JazzMusic + Rock top of the ranking + Rock Locations (person, ChetBaker) 3 + FrenchOpen + Classic Locations 1. Aachen 1. Eindhoven more... 2. Aalborg more... more... 2. Delft 3. Aalesund 3. Amsterdam• Baseline: hashtag-based 4. Aarhus 4. Rotterdam current hit June 27list27 minimize overlaps time currentlist of matching tweets time … 5. London user June keyword search of matching tweets 20 hit June 2145. Eindhoven July 4 July 4 …• Faceted Search Strategies: 1. Occurrence frequency: count occurrence frequencies of FVP (baseline) 2. Personalization: adapt ranking to user profile ( different user modeling strategies possible; here: entire tweeting history of the user) 3. Diversification: increase variety among the top-ranked FVPs 4. Time-sensitivity:adapt FVP ranking to temporal context• Semantic enrichment: (i) tweet-based and (ii) link-based enrichment Adaptive Faceted Search on Twitter 24
  • Research Questions1. How well does faceted search that is supported by the semantic enrichment perform in comparison to keyword search?2. What strategy performs best in ranking facet-value pairs that allow users to find relevant tweets on Twitter?3. How do the different building blocks of the faceted search framework influence the performance? Adaptive Faceted Search on Twitter 25
  • Dataset more than: 20,000 Twitter users 4 months30,000,000 tweets Egyptian revolution Jan 25 Nov 15 Dec 15 Jan 15 Feb 15 time Adaptive Faceted Search on Twitter 26
  • Evaluation Framework• User Simulation Model [cf. Koren et al., WWW’08]: • Input: search settings = { (user who searches, relevant target tweet) } • Drill down search result list until no more FVPscan be applied or less than 10 tweets match the query • Simulating click behavior: first-matching FVP is selected ( user knows target resource)• Ground truth  relevant target tweet = tweet that has been re-tweeted by the user• Metrics: • Succes@k: probability that relevant FVP appears in the top k (the higher the Succes@k, the faster the search and fewer the user effort) • MRR: mean reciprocal rank of the target tweet when the user selected it Adaptive Faceted Search on Twitter 27
  • Faceted-search vs. hashtag-based (keyword) search Faceted search based on semantic enrichment of tweets outperforms hashtgag-based search significantly. Adaptive Faceted Search on Twitter 28
  • Personalized strategyResults: Overview achieves ~12% better performance than other semantic strategies (and 2 x better than hashtag-based) Adaptive Faceted Search on Twitter 29
  • Impact of link-based enrichment Personalized strategy outperforms baseline significantly Link-based enrichment improves quality for both strategies Adaptive Faceted Search on Twitter 30
  • Impact of time-sensitivity Time-sensitivity based ranking improves quality for both frequency and diversification strategies Adaptive Faceted Search on Twitter 31
  • Application of the Faceted Search Framework Adaptive Faceted Search on Twitter 32
  • 1.2. Twitcident.com Twitter-based crisis management system Semantic enrichment allows for: 1. Grouping tweets3. 4. into incidents 2. Faceted search 3. Thematic Views 4. Analysis Adaptive Faceted Search on Twitter 33
  • ConclusionsWhat we did:• Adaptive Faceted Search on Twitter + Evaluation Framework• Analysis and Evaluation (+ Application in Twitcident)Findings:1. Semantic Enrichment allows for structured representation of the content of tweets  basis for faceted search2. Faceted search performs significantly better than hashtag-based keyword search3. Different building blocks for making faceted search on Twitter adaptive improve the search quality: a) Link-based enrichment: more discoverable tweets, better search performance b) Personalization leads to significant improvements c) Time-sensitivity improves performance as well Adaptive Faceted Search on Twitter 34
  • Thank you!Twitter: @fabianabelhttp://wis.ewi.tudelft.nl/iswc2011/Adaptive Faceted Search on Twitter 35