Mining Surprise

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Favorites, Groups & Events

    Mining Surprise - Presentation Transcript

    1. Surprise, You’ve Got Friends! Problem Definition: Want to present the serendipitous connections? How do we define and mine surprise? Then Aspects of Surprise: Uncanny vs Unexpected Surprise can be measured as the change in prior to posterior probability, is time dependent. Approaches/Main Idea Explicit communities, e.g. flickr group, are also connected by implicit groups, e.g. personomy measures, co-location via geo tags. Can find two people at this moment, this place, with the greatest Explicit Distance, and smallest Implicit Distance, this may surprise or abhor these people. Now Weighting Consider geo similarity. Inverse weight geo location by radius of bounding box, or “fame” of location. Test Case Use geo tagged flickr pictures to find two strangers at a gathering that have a visited the same unusual location in the past.
    2. A unit of surprise --- a wow may be defined for a single model M as the amount of surprise corresponding to a two-fold variation between P(M|D) and P(M), i.e., as log P(M|D)/P(M) (with log taken in base 2) • L. Itti & P. Baldi, Proc. IEEE-CVPR, 2005 • L. Itti & P. Baldi, Proc. NIPS, 2006
    3. First approximation Took the flickr group ISWC      1.  Extract the tag distribution of the event based on the photos in the group(E) 2. For each user: Took the user vector and computer the TF-IDF using the group factor as the IDF 3. For each pair of users Compute the cosine similarity
    4. Results The two “Closest - furthest” people in this group are from the same research institute in Ireland, DERI
    5. Base Case The base-case, using all the information, gives the same result, however the fist pass analysis was naive
    6. Plenty of signals that we wanted to take into account: Network of friends (two individuals were friends in flickr) All tags from all users Geo Coordinates etc etc etc
    7. Some Applications Dating Service For Academics
    8. Some Broader Issues Surprise is time dependent, current signals from some SNS are highly bursty, e.g. batch upload in flickr Connecting this analysis to real world sensors would seem to offer a method for extracting better information on calculating priors. Connecting across data sources certainly will A Pleasure Device “Surprise Engine” may become a tunable mess, much like current search engines.

    + Ian MulvanyIan Mulvany, 2 years ago

    custom

    739 views, 0 favs, 0 embeds more stats

    How do you define "surprise", a potential method fo more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 739
      • 739 on SlideShare
      • 0 from embeds
    • Comments 0
    • Favorites 0
    • Downloads 0
    Most viewed embeds

    more

    All embeds

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories