Mark Watkins Big Data Presentation
Upcoming SlideShare
Loading in...5

Mark Watkins Big Data Presentation






Total Views
Views on SlideShare
Embed Views



2 Embeds 98 97 1



Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
Post Comment
Edit your comment
  • How familiar are you with mobile?What mobile initiatives have you undertaken, what are your overall corporate mobile goals?

Mark Watkins Big Data Presentation Mark Watkins Big Data Presentation Presentation Transcript

  • BIG DATA AT TELENAVUSING DATA TO IMPROVE YOUR LIFEMark Watkins, general manager, entertainment content@viking2917 2/21/2012 © 2012 Telenav, Proprietary and Confidential 1
  • A PIONEER IN LOCATION SERVICES OUR GPSPublic company: $200M+ revenue, 11 NAVIGATION PARTNERS years in businessLeader in Personalized Mobile Navigation: 30MM+ subscribersLeader in Drive To Mobile Advertising: 750K local advertisersLeader in Mobile Distribution Platforms: 900+ devicesGrowing Global Carrier Audience Reach: 14 carriers in 29 countries 2
  • KEY PROBLEMS WE ARE WORKING ONTraffic & MappingLocal Search for businesses, events, points of interestLifestyle content & recommendation engineCombination of “traditional” big data processing, machine learning and proprietary algorithmsPeople are drowning in information – use “big data” signals to condense to something manageable
  • TRAFFIC & MAPSTraffic-aware routing engine – Navigation is core competency – 1.3B routes/trips since 2007Routes generate traffic/motion data – “probe data” from app (billions/month) – Anonymized & summarized to power routing – Persisted in aggregate form for historical traffic metricsUsed to augment Open Street Map – Turn restrictions, stop signs, road geometry – Deduced from probe patternsTechnology set – Hadoop + Hive
  • AUTOMATED DEVELOPMENT OF RICH LOCAL CONTENT(YOU MAY KNOW THIS AS GOBY) Categorized to taxonomy (“blues”, “hiking trails”) all entities geotagged OTHER FEATURES WORTH NOTING • automatic entity/place creation • aggregated ratings & reviews • proprietary result ranking formula venues automatically recognized; events • domain-specific metadata extraction mapped to venues • sorting by metadata (e.g. price, rating)
  • AUTOMATED DEVELOPMENT OF RICH LOCAL DATAData space is large, but not immense – Tens or Hundreds of millions (or smaller), not billionsBut very complex – Thousands of data sources – attribute space is 10,000 wide – E.g. how many holes in the golf course; how long is the hiking trail?Generates a large, sparse matrix – Ambiguous, conflicting data – Unstructured or semi-structured data – Need to recognize entities & merge/dedup
  • SOME LEARNINGSLots of data sources / signals generate “goodness” – Ranking, Confidence, importance, comprehensiveness“Interesting” ≠ “Most Popular”Frequency of occurrence Museum of Bad Art The Middle East NightclubFred’s dry cleaners Museum of Science 2/21/2012 © 2012 Telenav, Proprietary and Confidential 7
  • COMPOSITE, STRUCTURED LOCAL DATA 2/21/2012 © 2012 Telenav, Proprietary and Confidential 8
  • PERSONALIZED RECOMMENDATIONS 2/21/2012 © 2012 Telenav, Proprietary and Confidential 9
  • RECOMMENDATIONS – WORK IN PROGRESSKey signals – Personalized “interest graph” – “Drive to” data (where are people driving to?) – Entity-level “page rank” – Web/mobile clickstream dataIntegrated with social media – Facebook actions influencing recommendationsKey technology enablers – Large amounts of user-generated data – Proprietary algorithms; machine learning / SVM
  • TELENAV.COM – SCOUT 2/21/2012 © 2012 Telenav, Proprietary and Confidential 11