SlideShare a Scribd company logo
1 of 27
Download to read offline
(A Few) Key Lessons Learned
Building Recommender Systems
For Large-scale Social Networks




                              1
World’s Largest Professional Network

                                     175M+           2/sec
                                     62% non U.S.


                                                    25th
                                                    Most visit website worldwide
                               90
                                                    (Comscore 6-12)



                          55                        >2M
                                                    Company pages



                                                    85%
                    32

               17
           8
 2    4                                             Fortune 500 Companies use
                                                    LinkedIn to hire
2004 2005 2006 2007 2008 2009 2010 2011
          LinkedIn Members (Millions)

                                                                                   2
LinkedIn Homepage




Powered by
Recommendations!




                             3
The Recommendations Opportunity
                         Similar Profiles




                              Connections




       Network updates
                                Events You May
                                Be Interested In




                             News




                                                   4
What are they worth? Think 50%

!  > 50% of connections are from recommendations (PYMK)
!  > 50% of job applications are from recommendations
  (JYMBII)

!  > 50% of group joins are from recommendations (GYML)




                                                          5
What is a Recommender System?




A Recommender selects a product that if
acquired by the “buyer” maximizes value of
both “buyer” and “seller” at a given point in time




                                                 6
Lesson 1 : Recommendations must make
strategic sense…
!  Conflicts of interest between ‘buyers’
   !  E.g. job posters vs. job seekers




!  What is the (long-term) value of an action?
   !  How do we compare an invitation from a job application from a
      group join or the reading of a news?


                                                                      7
Ingredients of a Recommender System


A Recommender processes information and
transforms it into actionable knowledge




     Data
                             Business Logic
   (Feature     Algorithms                    User Experience
                              and Analytics
 Engineering)




                                                            8
Lesson 2: User Experience Matters Most

1.  Understand user intent
         " Define, model, leverage
2.  Be in the user flow
3.  Optimize location on the page
4.  Set right expectations (“You May…”)
5.  Explain recommendations
6.  Interact with the user
7.  Leverage Social Referral
                                          9
User Intent, User Flow, Location, Message

  Job recommendation use cases
User            Optimization                 Impact on job
experience                                   application rate
User intent /   Homepage                          2.5X
location        personalization
User flow /     Before vs. after having            7X
user intent     applied to a job
User flow       LinkedIn homepage vs.             10X
                Jobs homepage
Location        Center rail vs. right rail         5X
Message         Followers vs. leaders              2X


                                                                Item based collaborative filtering:
                                                                " Follower audience
                                                                                                      Content based:
                                                                                                      " Leader audience


                                                                                                                10
User Intent Modeling


1.  Intent labeling
  •    Job seeker, recruiter, outbound professional, content
       consumer, content creator, networker, profile curator


2.  Build normalized propensity model for
    each intent



                                                           11
Job Seeker Intent
!    Features
     !    Behavior: job searches, views &
          applications, job related email
          replies, has a job seeker sub, …
     !    Social graphs: # of colleagues in
          network who recently left, …
     !    Content: title, industry,
          anniversary date, …
!    Propensity Score
     !    Parametric accelerated failure
          time survival model
          !  log Ti = !k"kxik+#$i
     !    Score: P(switch job next month)
!    Evaluation
     !    Gold standard
     !    Directional validation

                                                12
Lesson 3: Recommendations often cater to
        multiple competing objectives
                                            Suggest relevant groups …
                                            that one is more likely to
                                            participate in

TalentMatch
(Top 24 matches of a posted job for sale)
                                            Suggest skilled candidates …
                                            who will likely respond to hiring
                                            managers inquiries



        Semantic + engagement objectives
                                                                         13
Multiple Objective Optimization

Constraint optimization problem

1.  Rank top K’ > K results wrt to primary objective (e.g.
    relevance)
2.  Perturb the ranking with a parametric function which
    leads to the inclusion of secondary objective(s) (e.g.
    engagement)
    !  Measure the perturbation using a distance
        function wrt primary objective
    !  Create a framework to quantify the tradeoff
        between the objectives


                                                             14
TalentMatch Use Case I

•  Proxy for likelihood to answer hiring managers inquiries:
   •  Job seekers are 16X more likely to answer than non-seekers
" Increase % of job seekers in in TalentMatch results
•  Control for TalentMatch booking rate and sent emails
   rate




                                 Distribution of min TalentMatch scores
                                over 1 month of jobs posted on LinkedIn

                                                                          15
TalentMatch Use Case II




                          16
Data Sources for Feature Engineering
                Social Graphs

Content



                      http://inmaps.linkedinlabs.com/
                                                        Behavior


                                                        PVs Queries
                                                        Actions (clicks)




          …
                                                                    17
Lesson 4: The Unreasonable Effectiveness of
Big Data
!  Data jujitsu: slice and dice (smartly), then count
!  Rich features engineered from profiles jujitsu
   !    Job tenure distributions
   !    Job transition probabilities
   !    Related titles, industries, companies
   !    Profile & job seniority
         !  Impact on job recommendations:
             !  25% lift in views, viewers, applications, applicants
             !  90% drop in negative feedback




                                                                       18
Lesson 4: The Unreasonable Effectiveness of
Big Data (cont.)
  !  Region stickiness
  !  Related regions
     !  Impact on job recommendations: 20% lift in views, viewers,
        applications, applicants
  !  Individuals propensity to migrate




                                                                     19
Lesson 5: Data Labeling Can Be Daunting
!  Historical data
   !    Similar objective
   !    Unrelated processes, e.g., same session search selection
        !  reduce presentation bias, position bias
        !  What about intent bias?
!  Random suggestions
   !    Great with ads, company follows
   !    Not for products with high cost of bad recommendations
        !  jobs, alumni groups, …
   !    Not for similar recommendations
!  Crowdsourced manual labeling
   !    Very challenging
   !    Pairwise comparison more suited than absolute rating " higher cost
   !    Expert-sourced manual labeling


                                                                       20
Lesson 6: Measure Everything
1.  Implicit user feedback
   !    E.g. impressions, immediate actions (clicks), secondary actions
   !    Understand and optimize flows, e.g.,
        !      Impact of ‘See more’ link and landing page
        !      Conversion rate of a job view into a job application from various
               channels
              !    E.g. Homepage vs email vs mobile




                                                                                   21
Lesson 6: Measure Everything (cont.)
2.  Explicit user feedback
   !    Understand usefulness of the recommendation with unequal depth
        !    Text based A/B test!
   !    Help prioritize future improvements
   !    Reveal unexpected complexities
        !    E.g. ‘Local’ means different things for different members
   !    Prevent misinterpretation of implicit user feedback!




                                                                         22
Lesson 7: Beware of some A/B testing pitfalls

1.  Novelty effect
   !    E.g., new job recommendation algorithms
        have week-long novelty effect that shows
        lifts twice the stationary (real) one
                                      !"#$%&'()$*'+$,-$#./0'1$+234'$5$67,788$                          !"#$%&'()$*+,-+,,$
                        ,$!!!"
                                                                      *$!!!"
                        +$!!!"

                        *$!!!"
                                                                      )$!!!"

                        )$!!!"                                        ($!!!"
                        ($!!!"                                        '$!!!"
                                                                    -./"01234"526"(7"/89:2;"6<=>2"?"
                        '$!!!"                                      )@(@##"
                                                                       &$!!!"                                                  +,-"./012")3#43##"
                        &$!!!"

                                 1 week lifts                                             2weeks lifts
                                                                      %$!!!"
                        %$!!!"
                                                                      #$!!!"
                        #$!!!"

                            !"                                              !"
                                 !"    ("   #!"   #("   %!"   %("                !"         ("         #!"   #("   %!"   %("




2.  Cannibalization
   !    Zero-sum game or real lift?
3.  Random sampling destroys
    network effect
                                                                                                                                                    23
Lesson 8: One Unified Platform, One API

!  Scaling innovation
   !  Cross-leverage improvements between products
   !  Shared knowledgebase

!  Maintainability
   !  Production serving and tracking
   !  Infrastructure for complete upgrades

!  Performance
   !  Billions of sub second computations

                                                     24
Open Source Technologies



Zoie      Bobo




Kafka    Voldemort


         http://sna-projects.com/sna/   25
Conclusion

!  Social/professional networks are a new
   frontier for recommender systems
!  Still many open questions:
  !    How do we define and measure engagement?
  !    What is the utility of a recommendation for the member?
  !    What is the value of a recommendation for the network?
  !    How do we reconcile utility and value when they conflict?
  !    How do we network A/B test without tears?
  !    …
!  Learn as you go
  !    Track everything and invest in forensics analytics
       !  Breadth – understand holistic impact
       !  Depth – understand flows

                                                                   26
References
•  M. Rodriguez, C. Posse and E. Zhang. 2012. Multiple Objective
   Optimization in Recommendation Systems. To appear in Proceedings of
   the Sixth ACM conference on Recommender systems (RecSys '12)
•  M. Amin, B. Yan, S. Sriram, A. Bhasin and C. Posse. 2012. Social
   Referral : Using Network Connections to Deliver Recommendations. To
   appear in Proceedings of the Sixth ACM conference on Recommender
   systems (RecSys '12)
•  A. Reda, Y. Park, M. Tiwari, C. Posse and S. Shah. 2012. Metaphor:
   Related Search Recommendations on a Social Network. 2012. To appear
   in Proceedings of the 21st International Conference on Information and
   Knowledge Management (CIKM ‘12)
•  B. Yan, l. Bajaj and A. Bhasin. 2011. Entity Resolution Using Social Graphs
   for Business Applications. In Proceedings of the International Conference
   on Advances in Social Network Analysis and Mining (ASONAM ‘11)




                                                                            27

More Related Content

What's hot

Anyone Can Build A Recommendation Engine With Solr: Presented by Doug Turnbul...
Anyone Can Build A Recommendation Engine With Solr: Presented by Doug Turnbul...Anyone Can Build A Recommendation Engine With Solr: Presented by Doug Turnbul...
Anyone Can Build A Recommendation Engine With Solr: Presented by Doug Turnbul...Lucidworks
 
Enhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic searchEnhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic searchTrey Grainger
 
Self-learned Relevancy with Apache Solr
Self-learned Relevancy with Apache SolrSelf-learned Relevancy with Apache Solr
Self-learned Relevancy with Apache SolrTrey Grainger
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Lucidworks
 
Balancing the Dimensions of User Intent
Balancing the Dimensions of User IntentBalancing the Dimensions of User Intent
Balancing the Dimensions of User IntentTrey Grainger
 
Searching on Intent: Knowledge Graphs, Personalization, and Contextual Disamb...
Searching on Intent: Knowledge Graphs, Personalization, and Contextual Disamb...Searching on Intent: Knowledge Graphs, Personalization, and Contextual Disamb...
Searching on Intent: Knowledge Graphs, Personalization, and Contextual Disamb...Trey Grainger
 
The Next Generation of AI-powered Search
The Next Generation of AI-powered SearchThe Next Generation of AI-powered Search
The Next Generation of AI-powered SearchTrey Grainger
 
Extending Solr: Building a Cloud-like Knowledge Discovery Platform
Extending Solr: Building a Cloud-like Knowledge Discovery PlatformExtending Solr: Building a Cloud-like Knowledge Discovery Platform
Extending Solr: Building a Cloud-like Knowledge Discovery PlatformTrey Grainger
 
Reflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemReflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemTrey Grainger
 
Crowdsourced query augmentation through the semantic discovery of domain spec...
Crowdsourced query augmentation through the semantic discovery of domain spec...Crowdsourced query augmentation through the semantic discovery of domain spec...
Crowdsourced query augmentation through the semantic discovery of domain spec...Trey Grainger
 
The Apache Solr Smart Data Ecosystem
The Apache Solr Smart Data EcosystemThe Apache Solr Smart Data Ecosystem
The Apache Solr Smart Data EcosystemTrey Grainger
 
Natural Language Search with Knowledge Graphs (Activate 2019)
Natural Language Search with Knowledge Graphs (Activate 2019)Natural Language Search with Knowledge Graphs (Activate 2019)
Natural Language Search with Knowledge Graphs (Activate 2019)Trey Grainger
 
Semantic & Multilingual Strategies in Lucene/Solr
Semantic & Multilingual Strategies in Lucene/SolrSemantic & Multilingual Strategies in Lucene/Solr
Semantic & Multilingual Strategies in Lucene/SolrTrey Grainger
 
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...Lucidworks
 
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...Lucidworks
 
Building Search & Recommendation Engines
Building Search & Recommendation EnginesBuilding Search & Recommendation Engines
Building Search & Recommendation EnginesTrey Grainger
 
Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Xun Wang
 
How to Build a Semantic Search System
How to Build a Semantic Search SystemHow to Build a Semantic Search System
How to Build a Semantic Search SystemTrey Grainger
 
The Intent Algorithms of Search & Recommendation Engines
The Intent Algorithms of Search & Recommendation EnginesThe Intent Algorithms of Search & Recommendation Engines
The Intent Algorithms of Search & Recommendation EnginesTrey Grainger
 
Intent Algorithms: The Data Science of Smart Information Retrieval Systems
Intent Algorithms: The Data Science of Smart Information Retrieval SystemsIntent Algorithms: The Data Science of Smart Information Retrieval Systems
Intent Algorithms: The Data Science of Smart Information Retrieval SystemsTrey Grainger
 

What's hot (20)

Anyone Can Build A Recommendation Engine With Solr: Presented by Doug Turnbul...
Anyone Can Build A Recommendation Engine With Solr: Presented by Doug Turnbul...Anyone Can Build A Recommendation Engine With Solr: Presented by Doug Turnbul...
Anyone Can Build A Recommendation Engine With Solr: Presented by Doug Turnbul...
 
Enhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic searchEnhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic search
 
Self-learned Relevancy with Apache Solr
Self-learned Relevancy with Apache SolrSelf-learned Relevancy with Apache Solr
Self-learned Relevancy with Apache Solr
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
 
Balancing the Dimensions of User Intent
Balancing the Dimensions of User IntentBalancing the Dimensions of User Intent
Balancing the Dimensions of User Intent
 
Searching on Intent: Knowledge Graphs, Personalization, and Contextual Disamb...
Searching on Intent: Knowledge Graphs, Personalization, and Contextual Disamb...Searching on Intent: Knowledge Graphs, Personalization, and Contextual Disamb...
Searching on Intent: Knowledge Graphs, Personalization, and Contextual Disamb...
 
The Next Generation of AI-powered Search
The Next Generation of AI-powered SearchThe Next Generation of AI-powered Search
The Next Generation of AI-powered Search
 
Extending Solr: Building a Cloud-like Knowledge Discovery Platform
Extending Solr: Building a Cloud-like Knowledge Discovery PlatformExtending Solr: Building a Cloud-like Knowledge Discovery Platform
Extending Solr: Building a Cloud-like Knowledge Discovery Platform
 
Reflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemReflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data system
 
Crowdsourced query augmentation through the semantic discovery of domain spec...
Crowdsourced query augmentation through the semantic discovery of domain spec...Crowdsourced query augmentation through the semantic discovery of domain spec...
Crowdsourced query augmentation through the semantic discovery of domain spec...
 
The Apache Solr Smart Data Ecosystem
The Apache Solr Smart Data EcosystemThe Apache Solr Smart Data Ecosystem
The Apache Solr Smart Data Ecosystem
 
Natural Language Search with Knowledge Graphs (Activate 2019)
Natural Language Search with Knowledge Graphs (Activate 2019)Natural Language Search with Knowledge Graphs (Activate 2019)
Natural Language Search with Knowledge Graphs (Activate 2019)
 
Semantic & Multilingual Strategies in Lucene/Solr
Semantic & Multilingual Strategies in Lucene/SolrSemantic & Multilingual Strategies in Lucene/Solr
Semantic & Multilingual Strategies in Lucene/Solr
 
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
Searching and Querying Knowledge Graphs with Solr/SIREn - A Reference Archite...
 
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
 
Building Search & Recommendation Engines
Building Search & Recommendation EnginesBuilding Search & Recommendation Engines
Building Search & Recommendation Engines
 
Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market
 
How to Build a Semantic Search System
How to Build a Semantic Search SystemHow to Build a Semantic Search System
How to Build a Semantic Search System
 
The Intent Algorithms of Search & Recommendation Engines
The Intent Algorithms of Search & Recommendation EnginesThe Intent Algorithms of Search & Recommendation Engines
The Intent Algorithms of Search & Recommendation Engines
 
Intent Algorithms: The Data Science of Smart Information Retrieval Systems
Intent Algorithms: The Data Science of Smart Information Retrieval SystemsIntent Algorithms: The Data Science of Smart Information Retrieval Systems
Intent Algorithms: The Data Science of Smart Information Retrieval Systems
 

Viewers also liked

Google_Controlled Experimentation_Panel_The Hive
Google_Controlled Experimentation_Panel_The HiveGoogle_Controlled Experimentation_Panel_The Hive
Google_Controlled Experimentation_Panel_The HiveThe Hive
 
Topic 3: Large-scale Distributed Systems
Topic 3: Large-scale Distributed SystemsTopic 3: Large-scale Distributed Systems
Topic 3: Large-scale Distributed SystemsZubair Nabi
 
Committing to Recommendations Strata 2013
Committing to Recommendations Strata 2013Committing to Recommendations Strata 2013
Committing to Recommendations Strata 2013Eric Colson
 
The Science and the Magic of User Feedback for Recommender Systems
The Science and the Magic of User Feedback for Recommender SystemsThe Science and the Magic of User Feedback for Recommender Systems
The Science and the Magic of User Feedback for Recommender SystemsXavier Amatriain
 
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsBIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsXavier Amatriain
 
Machine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleMachine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleXavier Amatriain
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectiveXavier Amatriain
 
Machine Learning to Grow the World's Knowledge
Machine Learning to Grow  the World's KnowledgeMachine Learning to Grow  the World's Knowledge
Machine Learning to Grow the World's KnowledgeXavier Amatriain
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialXavier Amatriain
 
Surviving and Thriving in a World of Change
Surviving and Thriving in a World of ChangeSurviving and Thriving in a World of Change
Surviving and Thriving in a World of ChangeWorkday
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisitedXavier Amatriain
 
Deep learning: the future of recommendations
Deep learning: the future of recommendationsDeep learning: the future of recommendations
Deep learning: the future of recommendationsBalázs Hidasi
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Xavier Amatriain
 
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...Amazon Web Services
 
Approximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetupApproximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetupErik Bernhardsson
 
Recommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS FunctionRecommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS FunctionWill Johnson
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architectureLiang Xiang
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with SparkChris Johnson
 

Viewers also liked (20)

Google_Controlled Experimentation_Panel_The Hive
Google_Controlled Experimentation_Panel_The HiveGoogle_Controlled Experimentation_Panel_The Hive
Google_Controlled Experimentation_Panel_The Hive
 
Topic 3: Large-scale Distributed Systems
Topic 3: Large-scale Distributed SystemsTopic 3: Large-scale Distributed Systems
Topic 3: Large-scale Distributed Systems
 
Committing to Recommendations Strata 2013
Committing to Recommendations Strata 2013Committing to Recommendations Strata 2013
Committing to Recommendations Strata 2013
 
The Science and the Magic of User Feedback for Recommender Systems
The Science and the Magic of User Feedback for Recommender SystemsThe Science and the Magic of User Feedback for Recommender Systems
The Science and the Magic of User Feedback for Recommender Systems
 
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsBIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
 
Machine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleMachine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora Example
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry Perspective
 
Machine Learning to Grow the World's Knowledge
Machine Learning to Grow  the World's KnowledgeMachine Learning to Grow  the World's Knowledge
Machine Learning to Grow the World's Knowledge
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
 
Surviving and Thriving in a World of Change
Surviving and Thriving in a World of ChangeSurviving and Thriving in a World of Change
Surviving and Thriving in a World of Change
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisited
 
Deep learning: the future of recommendations
Deep learning: the future of recommendationsDeep learning: the future of recommendations
Deep learning: the future of recommendations
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
 
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
Using AWS to Build a Graph-Based Product Recommendation System (BDT303) | AWS...
 
Approximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetupApproximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetup
 
Recommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS FunctionRecommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS Function
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with Spark
 
Google Big Table
Google Big TableGoogle Big Table
Google Big Table
 

Similar to Key Lessons Learned Building Recommender Systems for Large-Scale Social Networks (KDD 2012)

Linked In Corporate Presentation 050312
Linked In Corporate Presentation 050312Linked In Corporate Presentation 050312
Linked In Corporate Presentation 050312Andy Solty
 
Silicon halton meetup_#25_linked_in
Silicon halton meetup_#25_linked_inSilicon halton meetup_#25_linked_in
Silicon halton meetup_#25_linked_inSilicon Halton
 
High level lcs
High level lcsHigh level lcs
High level lcsfarro73
 
LinkedIn Overview Feb 2012
LinkedIn Overview Feb 2012LinkedIn Overview Feb 2012
LinkedIn Overview Feb 2012Dan Green
 
Rock your CIO’s world with a sensational SharePoint strategy
Rock your CIO’s world with a sensational SharePoint strategyRock your CIO’s world with a sensational SharePoint strategy
Rock your CIO’s world with a sensational SharePoint strategyJacobs Australia
 
LinkedIn Hiring Solutions Nonprofit are Using.
LinkedIn Hiring Solutions Nonprofit are Using.LinkedIn Hiring Solutions Nonprofit are Using.
LinkedIn Hiring Solutions Nonprofit are Using.Box
 
Staying Productive with Social Streams
Staying Productive with Social StreamsStaying Productive with Social Streams
Staying Productive with Social StreamsLuis Benitez
 
The evolution of a global workplace connect 2012 (cnw001)
The evolution of a global workplace connect 2012 (cnw001)The evolution of a global workplace connect 2012 (cnw001)
The evolution of a global workplace connect 2012 (cnw001)Mark Heid
 
Gamification - Toolkit and Process
Gamification - Toolkit and ProcessGamification - Toolkit and Process
Gamification - Toolkit and ProcessZachary Bodnar
 
Linked In Corporate Presentation
Linked In Corporate PresentationLinked In Corporate Presentation
Linked In Corporate PresentationTony A. Majidi
 
Connecting Talent to Opportunity.. at scale @ LinkedIn
Connecting Talent to Opportunity.. at scale @ LinkedInConnecting Talent to Opportunity.. at scale @ LinkedIn
Connecting Talent to Opportunity.. at scale @ LinkedInAnmol Bhasin
 
Building a semantic enterprise content management system v2
Building a semantic enterprise content management system v2Building a semantic enterprise content management system v2
Building a semantic enterprise content management system v2Ron Michael Zettlemoyer
 
Find and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedInFind and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedInDaniel Tunkelang
 
Loving the intranet - User adoption. Sam Marshall ClearBox
Loving the intranet - User adoption. Sam Marshall ClearBoxLoving the intranet - User adoption. Sam Marshall ClearBox
Loving the intranet - User adoption. Sam Marshall ClearBoxSam Marshall
 
Addressing Top CEO Priorities through Social Media Marketing and Metrics
Addressing Top CEO Priorities through Social Media Marketing and MetricsAddressing Top CEO Priorities through Social Media Marketing and Metrics
Addressing Top CEO Priorities through Social Media Marketing and MetricsJacques Pavlenyi
 
Aiinpractice2017deepaklongversion
Aiinpractice2017deepaklongversionAiinpractice2017deepaklongversion
Aiinpractice2017deepaklongversionDeepak Agarwal
 
Web 2.0 Beyond the Hype: Presentation March 27th, 2009
Web 2.0 Beyond the Hype: Presentation March 27th, 2009Web 2.0 Beyond the Hype: Presentation March 27th, 2009
Web 2.0 Beyond the Hype: Presentation March 27th, 2009MarvSmith
 
Maximizing your presence on linked in edited
Maximizing your presence on linked in   editedMaximizing your presence on linked in   edited
Maximizing your presence on linked in editedPhil Mogilev
 
Developing a LinkedIn Recruiting Strategy
Developing a LinkedIn Recruiting StrategyDeveloping a LinkedIn Recruiting Strategy
Developing a LinkedIn Recruiting Strategyaparton
 

Similar to Key Lessons Learned Building Recommender Systems for Large-Scale Social Networks (KDD 2012) (20)

Linked In Corporate Presentation 050312
Linked In Corporate Presentation 050312Linked In Corporate Presentation 050312
Linked In Corporate Presentation 050312
 
Silicon halton meetup_#25_linked_in
Silicon halton meetup_#25_linked_inSilicon halton meetup_#25_linked_in
Silicon halton meetup_#25_linked_in
 
High level lcs
High level lcsHigh level lcs
High level lcs
 
LinkedIn Overview Feb 2012
LinkedIn Overview Feb 2012LinkedIn Overview Feb 2012
LinkedIn Overview Feb 2012
 
Rock your CIO’s world with a sensational SharePoint strategy
Rock your CIO’s world with a sensational SharePoint strategyRock your CIO’s world with a sensational SharePoint strategy
Rock your CIO’s world with a sensational SharePoint strategy
 
Ddu for ap ms edit
Ddu for ap ms   editDdu for ap ms   edit
Ddu for ap ms edit
 
LinkedIn Hiring Solutions Nonprofit are Using.
LinkedIn Hiring Solutions Nonprofit are Using.LinkedIn Hiring Solutions Nonprofit are Using.
LinkedIn Hiring Solutions Nonprofit are Using.
 
Staying Productive with Social Streams
Staying Productive with Social StreamsStaying Productive with Social Streams
Staying Productive with Social Streams
 
The evolution of a global workplace connect 2012 (cnw001)
The evolution of a global workplace connect 2012 (cnw001)The evolution of a global workplace connect 2012 (cnw001)
The evolution of a global workplace connect 2012 (cnw001)
 
Gamification - Toolkit and Process
Gamification - Toolkit and ProcessGamification - Toolkit and Process
Gamification - Toolkit and Process
 
Linked In Corporate Presentation
Linked In Corporate PresentationLinked In Corporate Presentation
Linked In Corporate Presentation
 
Connecting Talent to Opportunity.. at scale @ LinkedIn
Connecting Talent to Opportunity.. at scale @ LinkedInConnecting Talent to Opportunity.. at scale @ LinkedIn
Connecting Talent to Opportunity.. at scale @ LinkedIn
 
Building a semantic enterprise content management system v2
Building a semantic enterprise content management system v2Building a semantic enterprise content management system v2
Building a semantic enterprise content management system v2
 
Find and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedInFind and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedIn
 
Loving the intranet - User adoption. Sam Marshall ClearBox
Loving the intranet - User adoption. Sam Marshall ClearBoxLoving the intranet - User adoption. Sam Marshall ClearBox
Loving the intranet - User adoption. Sam Marshall ClearBox
 
Addressing Top CEO Priorities through Social Media Marketing and Metrics
Addressing Top CEO Priorities through Social Media Marketing and MetricsAddressing Top CEO Priorities through Social Media Marketing and Metrics
Addressing Top CEO Priorities through Social Media Marketing and Metrics
 
Aiinpractice2017deepaklongversion
Aiinpractice2017deepaklongversionAiinpractice2017deepaklongversion
Aiinpractice2017deepaklongversion
 
Web 2.0 Beyond the Hype: Presentation March 27th, 2009
Web 2.0 Beyond the Hype: Presentation March 27th, 2009Web 2.0 Beyond the Hype: Presentation March 27th, 2009
Web 2.0 Beyond the Hype: Presentation March 27th, 2009
 
Maximizing your presence on linked in edited
Maximizing your presence on linked in   editedMaximizing your presence on linked in   edited
Maximizing your presence on linked in edited
 
Developing a LinkedIn Recruiting Strategy
Developing a LinkedIn Recruiting StrategyDeveloping a LinkedIn Recruiting Strategy
Developing a LinkedIn Recruiting Strategy
 

Key Lessons Learned Building Recommender Systems for Large-Scale Social Networks (KDD 2012)

  • 1. (A Few) Key Lessons Learned Building Recommender Systems For Large-scale Social Networks 1
  • 2. World’s Largest Professional Network 175M+ 2/sec 62% non U.S. 25th Most visit website worldwide 90 (Comscore 6-12) 55 >2M Company pages 85% 32 17 8 2 4 Fortune 500 Companies use LinkedIn to hire 2004 2005 2006 2007 2008 2009 2010 2011 LinkedIn Members (Millions) 2
  • 4. The Recommendations Opportunity Similar Profiles Connections Network updates Events You May Be Interested In News 4
  • 5. What are they worth? Think 50% !  > 50% of connections are from recommendations (PYMK) !  > 50% of job applications are from recommendations (JYMBII) !  > 50% of group joins are from recommendations (GYML) 5
  • 6. What is a Recommender System? A Recommender selects a product that if acquired by the “buyer” maximizes value of both “buyer” and “seller” at a given point in time 6
  • 7. Lesson 1 : Recommendations must make strategic sense… !  Conflicts of interest between ‘buyers’ !  E.g. job posters vs. job seekers !  What is the (long-term) value of an action? !  How do we compare an invitation from a job application from a group join or the reading of a news? 7
  • 8. Ingredients of a Recommender System A Recommender processes information and transforms it into actionable knowledge Data Business Logic (Feature Algorithms User Experience and Analytics Engineering) 8
  • 9. Lesson 2: User Experience Matters Most 1.  Understand user intent " Define, model, leverage 2.  Be in the user flow 3.  Optimize location on the page 4.  Set right expectations (“You May…”) 5.  Explain recommendations 6.  Interact with the user 7.  Leverage Social Referral 9
  • 10. User Intent, User Flow, Location, Message Job recommendation use cases User Optimization Impact on job experience application rate User intent / Homepage 2.5X location personalization User flow / Before vs. after having 7X user intent applied to a job User flow LinkedIn homepage vs. 10X Jobs homepage Location Center rail vs. right rail 5X Message Followers vs. leaders 2X Item based collaborative filtering: " Follower audience Content based: " Leader audience 10
  • 11. User Intent Modeling 1.  Intent labeling •  Job seeker, recruiter, outbound professional, content consumer, content creator, networker, profile curator 2.  Build normalized propensity model for each intent 11
  • 12. Job Seeker Intent !  Features !  Behavior: job searches, views & applications, job related email replies, has a job seeker sub, … !  Social graphs: # of colleagues in network who recently left, … !  Content: title, industry, anniversary date, … !  Propensity Score !  Parametric accelerated failure time survival model !  log Ti = !k"kxik+#$i !  Score: P(switch job next month) !  Evaluation !  Gold standard !  Directional validation 12
  • 13. Lesson 3: Recommendations often cater to multiple competing objectives Suggest relevant groups … that one is more likely to participate in TalentMatch (Top 24 matches of a posted job for sale) Suggest skilled candidates … who will likely respond to hiring managers inquiries Semantic + engagement objectives 13
  • 14. Multiple Objective Optimization Constraint optimization problem 1.  Rank top K’ > K results wrt to primary objective (e.g. relevance) 2.  Perturb the ranking with a parametric function which leads to the inclusion of secondary objective(s) (e.g. engagement) !  Measure the perturbation using a distance function wrt primary objective !  Create a framework to quantify the tradeoff between the objectives 14
  • 15. TalentMatch Use Case I •  Proxy for likelihood to answer hiring managers inquiries: •  Job seekers are 16X more likely to answer than non-seekers " Increase % of job seekers in in TalentMatch results •  Control for TalentMatch booking rate and sent emails rate Distribution of min TalentMatch scores over 1 month of jobs posted on LinkedIn 15
  • 17. Data Sources for Feature Engineering Social Graphs Content http://inmaps.linkedinlabs.com/ Behavior PVs Queries Actions (clicks) … 17
  • 18. Lesson 4: The Unreasonable Effectiveness of Big Data !  Data jujitsu: slice and dice (smartly), then count !  Rich features engineered from profiles jujitsu !  Job tenure distributions !  Job transition probabilities !  Related titles, industries, companies !  Profile & job seniority !  Impact on job recommendations: !  25% lift in views, viewers, applications, applicants !  90% drop in negative feedback 18
  • 19. Lesson 4: The Unreasonable Effectiveness of Big Data (cont.) !  Region stickiness !  Related regions !  Impact on job recommendations: 20% lift in views, viewers, applications, applicants !  Individuals propensity to migrate 19
  • 20. Lesson 5: Data Labeling Can Be Daunting !  Historical data !  Similar objective !  Unrelated processes, e.g., same session search selection !  reduce presentation bias, position bias !  What about intent bias? !  Random suggestions !  Great with ads, company follows !  Not for products with high cost of bad recommendations !  jobs, alumni groups, … !  Not for similar recommendations !  Crowdsourced manual labeling !  Very challenging !  Pairwise comparison more suited than absolute rating " higher cost !  Expert-sourced manual labeling 20
  • 21. Lesson 6: Measure Everything 1.  Implicit user feedback !  E.g. impressions, immediate actions (clicks), secondary actions !  Understand and optimize flows, e.g., !  Impact of ‘See more’ link and landing page !  Conversion rate of a job view into a job application from various channels !  E.g. Homepage vs email vs mobile 21
  • 22. Lesson 6: Measure Everything (cont.) 2.  Explicit user feedback !  Understand usefulness of the recommendation with unequal depth !  Text based A/B test! !  Help prioritize future improvements !  Reveal unexpected complexities !  E.g. ‘Local’ means different things for different members !  Prevent misinterpretation of implicit user feedback! 22
  • 23. Lesson 7: Beware of some A/B testing pitfalls 1.  Novelty effect !  E.g., new job recommendation algorithms have week-long novelty effect that shows lifts twice the stationary (real) one !"#$%&'()$*'+$,-$#./0'1$+234'$5$67,788$ !"#$%&'()$*+,-+,,$ ,$!!!" *$!!!" +$!!!" *$!!!" )$!!!" )$!!!" ($!!!" ($!!!" '$!!!" -./"01234"526"(7"/89:2;"6<=>2"?" '$!!!" )@(@##" &$!!!" +,-"./012")3#43##" &$!!!" 1 week lifts 2weeks lifts %$!!!" %$!!!" #$!!!" #$!!!" !" !" !" (" #!" #(" %!" %(" !" (" #!" #(" %!" %(" 2.  Cannibalization !  Zero-sum game or real lift? 3.  Random sampling destroys network effect 23
  • 24. Lesson 8: One Unified Platform, One API !  Scaling innovation !  Cross-leverage improvements between products !  Shared knowledgebase !  Maintainability !  Production serving and tracking !  Infrastructure for complete upgrades !  Performance !  Billions of sub second computations 24
  • 25. Open Source Technologies Zoie Bobo Kafka Voldemort http://sna-projects.com/sna/ 25
  • 26. Conclusion !  Social/professional networks are a new frontier for recommender systems !  Still many open questions: !  How do we define and measure engagement? !  What is the utility of a recommendation for the member? !  What is the value of a recommendation for the network? !  How do we reconcile utility and value when they conflict? !  How do we network A/B test without tears? !  … !  Learn as you go !  Track everything and invest in forensics analytics !  Breadth – understand holistic impact !  Depth – understand flows 26
  • 27. References •  M. Rodriguez, C. Posse and E. Zhang. 2012. Multiple Objective Optimization in Recommendation Systems. To appear in Proceedings of the Sixth ACM conference on Recommender systems (RecSys '12) •  M. Amin, B. Yan, S. Sriram, A. Bhasin and C. Posse. 2012. Social Referral : Using Network Connections to Deliver Recommendations. To appear in Proceedings of the Sixth ACM conference on Recommender systems (RecSys '12) •  A. Reda, Y. Park, M. Tiwari, C. Posse and S. Shah. 2012. Metaphor: Related Search Recommendations on a Social Network. 2012. To appear in Proceedings of the 21st International Conference on Information and Knowledge Management (CIKM ‘12) •  B. Yan, l. Bajaj and A. Bhasin. 2011. Entity Resolution Using Social Graphs for Business Applications. In Proceedings of the International Conference on Advances in Social Network Analysis and Mining (ASONAM ‘11) 27