Foursquare data quick talk @ BBQ Check in 3/21/12

402 views

Published on

Published in: Business, Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
402
On SlideShare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
3
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Foursquare data quick talk @ BBQ Check in 3/21/12

  1. 1. E foursquare explore + assorted data fun@benlee
  2. 2. Overviewfoursquare data stats explore similar places interesting phrases venue transitions 2
  3. 3. Data15,000,000+ people30,000,000+ places1,500,000,000+ check-ins
  4. 4. 4
  5. 5. E foursquare explore
  6. 6. Explore Flow• Query parsing• Retrieval• Ranking• Reason generation
  7. 7. Overview: Retrieval Venue Index Check-in Candidate Venues History Venue Social Personalattributes Interactions Interactions
  8. 8. Overview: Ranking / Reasons Venue Social Personal attributes Interactions Interactions PersonalGlobal scores Social Scores Scores Top K venues Global Social Personal reasons reasons reasons
  9. 9. Global attributes• Time decayed popularity• Time of day / day of week filtering• Total interactions (# tips, todos etc)
  10. 10. Blue bottle
  11. 11. Social / Personal scores• # Friends that have checkins• Tips left by friends, etc• Prior visits by user• Similar places user has been in the past
  12. 12. Venue Similarity vi vj Venues for all i,j sim(vi, vj)Users Incredibly Sparse Matrix
  13. 13. Venue Similarity w/ MapReducekeyuser visited venues map vi, vj score emit “all” pairs of visited venues for each user vi, vj score ...keyvi, vj score score ... score reduce Sum up each user’s score contribution final score to this pair of venues
  14. 14. 17
  15. 15. 18
  16. 16. Reason generation- “Dennis left a tip here”- “You and 4 friends have been here”- “This place is on your To-do list” 19
  17. 17. Improving exploreNew signals – Tourist / Local – Everyday / Once-and-awhile – Expert rankRetrieval improvment –Synonyms / query rewriting –Query category matching –Menu text 20
  18. 18. E assorted data fun
  19. 19. Interesting phrases• Hack day project on tip text• Build a n-gram model, find “improbable” phrases relative to background corpus• Inspired by Amazon’s statistically improbable phrases (SIPs) feature
  20. 20. 69 Modern ClassicsCondensed via Amazon’s SIPsbarn cellar, famous pig, grey spider, old sheep,egg sac
  21. 21. 69 Modern ClassicsCondensed via Amazon’s SIPsstillsuit manufacturer, panoplia propheticus,gom jabbar, inkvine scar, ducal signet, factorycrawler, poison snooper
  22. 22. Interesting Venue Phrasesdrip coffees, best coffee, jack dorsey, no wifi,using square
  23. 23. Interesting Venue Phrasesbootie sf, hubba hubba revue, death guild
  24. 24. Interesting Venue Phrasessalted caramel, balsamic strawberry, roastedbanana, honey lavender,
  25. 25. City level phrasesfried chicken, blue bottle, french toastbreakfast tacos, mexican martinis, friedavocadodeep dish pizza, celery salt, hot dogclam chowder, lobster roll, red soxbread pudding, cafe au lait, bloody mary
  26. 26. Phrases by time
  27. 27. Venue transitionsUsers check in ~3 times a dayWhat can we learn from where they go next? 30
  28. 28. 31
  29. 29. 32
  30. 30. Thanksfoursquare.com/jobs@benlee

×