• Save
Big Data, Big Local
Upcoming SlideShare
Loading in...5

Big Data, Big Local



Where 2.0 2011 presentation discussion the use of local businesses and POI as topographical nodes in a complex network

Where 2.0 2011 presentation discussion the use of local businesses and POI as topographical nodes in a complex network



Total Views
Views on SlideShare
Embed Views



10 Embeds 992

http://blog.factual.com 652
http://radar.oreilly.com 284
http://www.linkedin.com 38
http://www.slideshare.net 6
http://static.slidesharecdn.com 3
http://web1.conversationminer.com 3
http://drzubirstation.com 2
https://www.linkedin.com 2
url_unknown 1
http://localhost 1



Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
Post Comment
Edit your comment
  • These coordinates can map to the US, which has its own array of contextual associations… http://open.mapquestapi.com/nominatim/v1/details.php?place_id=4476130 http://www.flickr.com/photos/mav16/3557076001/ http://www.flickr.com/photos/expressmonorail/2531144122/ http://www.flickr.com/photos/video4net/4079991429/
  • They also map to California, which has its own, different context… http://open.mapquestapi.com/nominatim/v1/details.php?place_id=79413431 http://www.flickr.com/photos/muftirythm/5181074455/ http://www.flickr.com/photos/caccamo/1253844134 http://www.flickr.com/photos/the_tahoe_guy/4415371647/
  • And of course to San Francisco, which also has its own context independent of others… http://open.mapquestapi.com/nominatim/v1/details.php?place_id=36061747 http://www.flickr.com/photos/alex-s/80040426/ http://www.flickr.com/photos/salim/402618628/ http://www.flickr.com/photos/26063464@N03/3633118346/
  • The coordinates actual map directly onto an Adult ‘Novelty’ shop, which of course has entirely different associations… Google streetview Image http://www.flickr.com/photos/mmemarilyn/2021853367/ http://www.flickr.com/photos/netzanette/3822981633/ http://www.flickr.com/photos/preppybyday/5076899310/
  • Diff. between grid and graph Coordinates provide location, Businesses and POI provide context Semantic hooks on which we hang activity http://www.flickr.com/photos/iconolith/253426954/
  • http://www.flickr.com/photos/silvery/4461519535/ http://www.flickr.com/photos/brettstark/4386550082/
  • So we got that going for us…
  • Nomalization and canonicalization are huge problem Across all attributes, varying by country. 10 core attributes, 35 countries = 350 rule sets
  • Same store on 8 different sites
  • http://developers.facebook.com/docs/opengraph/
  • Uniform Resource Identifiers accessible via HTTP Dereference: to obtain a copy or representation of the resource it identifies.
  • http://blog.fwix.com/our-geodata-just-got-even-better
  • Large-scale data engineering is a royal PITA We address this so that your efforts go on the application layer – where differentiation counts

Big Data, Big Local Big Data, Big Local Presentation Transcript

  • Big Data, Big Local Tyler Bell @twbell
  • 37.7632,-122.4213 Great for machines Coordinates: For people, less so
  • http://www.flickr.com/photos/mav16/3557076001/ http://www.flickr.com/photos/expressmonorail/2531144122/ http://www.flickr.com/photos/video4net/4079991429/
  • http://www.flickr.com/photos/muftirythm/5181074455/ http://www.flickr.com/photos/caccamo/1253844134 http://www.flickr.com/photos/the_tahoe_guy/4415371647/
  • http://www.flickr.com/photos/alex-s/80040426/ http://www.flickr.com/photos/salim/402618628/ http://www.flickr.com/photos/26063464@N03/3633118346/
  • http://www.flickr.com/photos/preppybyday/5076899310/ http://www.flickr.com/photos/netzanette/3822981633/ http://www.flickr.com/photos/mmemarilyn/2021853367/
  • While coordinates are regular and convenient, they lack context and character Square
  • http://www.flickr.com/photos/iconolith/253426954/
  • The Evolving Local Use Case Yellow Pages Local Search Recommendations Social Engagement Brand Engagement Commercial Engagement Navigation Interaction
  • Local Businesses and POI – An irregular , but extremely rich topographic network
  • Employing POI and Business Listings as topographical nodes brings its own problems…
  • Subway Restaurants Subway Sandwich and Salad Subway Sandwich and Salad Shop Subway Subs and Salads Subway Restaurants Subway Subs Subway Shop Subway Sandwich Shops Subway Sandwichs Subway Sandwiches and Salads Subway Restaurant Subway Sandwiches Subway Sandwiches and Salads Subway Sandwich Shop Subway Subway Sandwiches and Salad Subway Sandwich and Salads Subway Sandwich Poor/Absent Canonicalization
  • Multiple Electronic Representations of one physical entity
  • Webpage URLs have become URIs Identifiers for people, places, things http://developers.facebook.com/docs/opengraph/
  • 14.5m entities pointing to over… 1.5b references found across… 4.7m domains US Local Dataset
  • http://continuations.com/post/4365211963/the-web-stp-challenge-making-apis-useful We need more STP [Straight Through Processing] for the web so that we have fewer stove pipe services and can move to a seamless web instead. The obstacle is no longer a lack of APIs […] the problem is a lack of data mapping/unification services. - Albert Wenger http://twitter.com/#!/cdixon/status/49906284492881920
  • We are able to focus on our core vision of geotagging the web’s content and information while also providing our developers with a great Places Database that is open and free to use.
  • How easily men could make things much better than they are -- if they only all tried together - Winston Churchill
  • Tyler Bell @twbell