Extracting event and place semantics from Flickr tags

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

2 comments

Comments 1 - 2 of 2 previous next Post a comment

Post a comment
Embed Video
Edit your comment Cancel

Notes on slide 1

Mor… star interns One way to extract knowledge from Flickr tags. This is not about image search - it’s not even about photos

6 Favorites

Extracting event and place semantics from Flickr tags - Presentation Transcript

  1. Tye Rattenbury, Nathan Good, Mor Naaman* Yahoo! Research Berkeley (Yahoo! Advanced Development Division) Towards Automatic Extraction of Event and Place Semantics from Flickr Tags
  2. This Talk is Not About Photos
    • Geo-referenced data
      • Photos
      • Blog posts (GeoRSS)
      • Geo-annotated Web pages
      • KML-based collections
    • Similar issues, similar potential
  3. Find the Differences “ Yahoo! Headquarters” “ Dog” “ SIGIR 2007”
  4. Data Description
  5. Simple Model (photo_id, user_id, time, latitude, longitude) (photo_id, tag)
  6. Information?
      • Flickr “geotagged” in San Francisco
  7. Tag Maps - San Francisco
  8. Tag-based Semantics
    • Derive meaningful data about individual tags
    • Based on the tag’s metadata patterns
    • E.g., ”Yahoo! Headquarters” , “SIGIR2007” , “Dog” .
  9. Tag Semantics: Why?
    • Improved image (and web!) search through query semantics
    • Automatic place- and event-gazetteers
    • Association of missing time/place data based on tags
    • Collection visualization
  10. Next Few Minutes
    • Extended model
    • Sample data
    • Problem definition
  11. Extended Model (photo_id, user_id, time, latitude, longitude) (photo_id, tag) (tag, location) (tag, time)
  12. Tag Patterns
  13. Tag Patterns
  14. Tag Patterns
  15. Definitions
    • Event tag: expected to exhibit significant time-based patterns
    • Location tag: expected to exhibit significant place-based patterns
  16. Problem Definition
    • Can we extract event/place semantics from unstructured tags given their usage distribution (time and location patterns)?
  17. Fundamental Issues
    • “ Regions” of semantic relevance
      • “ church” in a neighborhood is a place
      • “ carnival” in Rio de Janeiro is an event
      • “ California” in San Francsico is not a place
    • Semantic ambiguity
      • “ apple”
      • “ turkey”
      • “ pride”
  18. Proposed Solutions
    • Naïve Scan
      • Burst detection
    • Spatial Scan Statistics
      • Borrowed from disease outbreak detection
    • Scale-structure method
      • Examine the “structure” of the tag’s patterns in multiple scales
  19. Next Few Minutes
    • Intuition behind the three methods
    • Evaluation
    • Summary
  20. Naïve Scan
  21. Spatial Scan
  22. Issues
    • First two methods just look for a local burst in usage (in either time or space)
  23. Scale-structure Identification
    • Select a scale
      • Find the structure of the data for the scale
      • Calculate the entropy of the structure
    • Increase scale, repeat
    • Is the information “well organized”?
  24. Scale-structure Identification
  25. Scale-structure Identification Entropy = 1.421621 Entropy = 0.766263 Entropy = 0.555915 Entropy = 0.062760 “Barry Bonds” tag
  26. San Francisco Experiments
    • ~43k photos
    • ~800 tags
    San Francisco Dataset: 42,000 Photos 800+ popular tags
  27. Experiments byobw Ground Truth: BYOBW?
  28. Experiments
    • Given a ground truth:
      • Run the three methods, vary thresholds
      • Precision vs. recall
  29. The Obligatory Slide Where I Show You that Our Curve is Above the Other Curves
  30. Experiments
  31. Future Work
    • Multiple regions
    • Different spatially and temporally encoded data (e.g., blogs)
    • Other metadata domains besides time and space
      • Visual features: shape primitives, color
      • Semantic features
  32. Questions?
    • Tye Rattenbury, Nathan Good, Mor Naaman
    • Y!RB / Y!A.D.D.
    • Read more, follow: http://www.whyrb.com
    • Slides: http://slideshare.net/mor
    • Contact: mor@yahoo-inc.com
  33. Tag Maps - Paris - Les Blogs?
  34. Issues
    • Noisy data
    • Photographer biases
      • In locations
      • In Tags
    • Wrong data
    Whatever, color, city, spectrum, santa barbara, california, usa, Lookatme, Herbert Bayer Chromatic Gate

+ mormor, 3 years ago

custom

2524 views, 6 favs, 8 embeds more stats

Presentation at SIGIR 2007.

More info about this document

CC Attribution License

Go to text version

  • Total Views 2524
    • 2341 on SlideShare
    • 183 from embeds
  • Comments 2
  • Favorites 6
  • Downloads 0
Most viewed embeds
  • 94 views on http://yahooresearchberkeley.com
  • 79 views on http://www.yahooresearchberkeley.com
  • 4 views on http://wildfire.gigya.com
  • 2 views on http://feeds.feedburner.com
  • 1 views on http://66.249.91.104

more

All embeds
  • 94 views on http://yahooresearchberkeley.com
  • 79 views on http://www.yahooresearchberkeley.com
  • 4 views on http://wildfire.gigya.com
  • 2 views on http://feeds.feedburner.com
  • 1 views on http://66.249.91.104
  • 1 views on http://blog.yahooresearchberkeley.com
  • 1 views on http://www.netvibes.com
  • 1 views on http://209.85.173.132

less

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel
File a copyright complaint
Having problems? Go to our helpdesk?

Categories