Large-Scale Social Recommender Systems at LinkedIn
Mitul Tiwari
Search, Network, and Analytics (SNA)
LinkedIn

Outline
• About LinkedIn
• Social Recommender Systems at LinkedIn
• Social Graph Analysis
• Virality in Social Recommender Systems
• Scaling Challenges

LinkedIn by the numbers
• 259M members
• 2 new members/sec

Outline
• About LinkedIn
• Social Graph Analysis

LinkedIn Homepage
• Powered by recommendations

Recommender Ecosystem
• Similar Profiles
• Connections
• News
• Skill Endorsements

Outline
• LinkedIn Today: Recommend News
• People You May Know and Social Graph Analysis
• Related Searches Recommendation
• Skills Endorsements Suggestions and Social Virality

LinkedIn Today: News Recommendation
• Objective: serve valuable professional news, leading to higher engagement as measured by metrics such as click-through rate (CTR)

News Recommendation: Explore/Exploit
Agarwal et al., 2012

News Recommendation: Challenges
• CTR drops over time

News Recommendation: Challenges
• Showing the same item to the same user repeatedly causes a drop in CTR

News Recommendation: Revised Algorithm
• Explore/Exploit scheme
• Explore: with a small probability (e.g., 5%), choose an item at random
• Exploit: otherwise (e.g., 95% of the time), choose the item with the highest estimated CTR
• Temporal smoothing: give more weight to recent data
• Impression discounting: discount items the user has already viewed repeatedly
• Segmented model: estimate CTR separately for each user segment
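The revised scheme can be sketched roughly as follows. This is a minimal illustration, not LinkedIn's implementation: the function names, the exponential form of the smoothing and discounting, and the parameter values (`decay`, `discount`, `epsilon`) are all assumptions made for the example. The segmented model is left out; it would amount to keeping one set of statistics per user segment.

```python
import random


def smoothed_ctr(daily_stats, decay=0.5):
    """Temporal smoothing: exponentially down-weight older (clicks, views) pairs.

    daily_stats is ordered oldest to newest; decay is a hypothetical parameter.
    """
    clicks = views = 0.0
    for day_clicks, day_views in daily_stats:
        clicks = decay * clicks + day_clicks
        views = decay * views + day_views
    return clicks / views if views else 0.0


def discounted_score(ctr, times_seen, discount=0.8):
    """Impression discounting: shrink the score for each prior view by this user."""
    return ctr * discount ** times_seen


def choose_item(scores, epsilon=0.05, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise exploit."""
    items = list(scores)
    if rng.random() < epsilon:
        return rng.choice(items)        # explore: random item
    return max(items, key=scores.get)   # exploit: highest-scoring item
```

With `epsilon=0.05` the random branch is taken on about 5% of requests, matching the example split on the slide.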

PYMK: Link Prediction over the Social Graph

People You May Know
• Accounts for > 50% of total connections and invitations
• Challenges
• Feature Engineering
• Machine Learning
• Scaling

People You May Know: Feature Engineering
How do people know each other?
Alice
Bob Carol
Triangle closing
• Prob(Bob knows Carol) ~ the # of common connections

Triangle Closing in Pig
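-- connections in (source_id, dest_id) format in both directions
connections = LOAD 'connections' USING PigStorage()
    AS (source_id, dest_id);
group_conn = GROUP connections BY source_id;
-- generatePair is assumed to be a UDF (not shown) that returns a bag of
-- (id1, id2) pairs among one member's connections; FLATTEN emits one row per pair
pairs = FOREACH group_conn GENERATE
    FLATTEN(generatePair(connections.dest_id)) AS (id1, id2);
-- each member shared by id1 and id2 contributes one row, so COUNT = common connections
pair_groups = GROUP pairs BY (id1, id2);
common_conn = FOREACH pair_groups GENERATE
    FLATTEN(group) AS (source_id, dest_id),
    COUNT(pairs) AS common_connections;
STORE common_conn INTO 'common_conn' USING PigStorage();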
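For readers who do not use Pig, the same triangle-closing count can be sketched in plain Python. This is an in-memory illustration under assumptions: the function name is hypothetical, pairs are emitted once with `id1 < id2`, and nothing here reflects how the production job is distributed.

```python
from collections import Counter, defaultdict
from itertools import combinations


def count_common_connections(edges):
    """Count common connections for every pair of members sharing a neighbor.

    edges: iterable of (source_id, dest_id) pairs listed in both directions.
    Returns a Counter keyed by (id1, id2) with id1 < id2.
    """
    neighbors = defaultdict(set)
    for source_id, dest_id in edges:
        neighbors[source_id].add(dest_id)

    common = Counter()
    for dests in neighbors.values():
        # Every pair of one member's connections shares that member.
        for id1, id2 in combinations(sorted(dests), 2):
            common[(id1, id2)] += 1
    return common
```

Like the Pig script, this counts pairs that are already connected as well; a real pipeline would presumably filter those out before recommending.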

• Member profiles contain various types of organizations
• Companies, schools, groups, ...
• Can we compute edge affinity based on this organizational information?
• Useful for many applications:
• Recommending members to connect (link prediction)
• Recommending other entities from the same community (community detection)

Organizational Overlap: Feature Engineering
• Insight 1: Connection density increases with organizational time overlap
Hsieh et al., WWW’13

Organizational Overlap: Feature Engineering
• Insight 2: Connection density decreases with the size of the organization
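To make the two insights concrete, here is a toy sketch of an overlap feature. It is not the model from Hsieh et al.: the function names and the ratio used for the score are assumptions chosen only so that the score rises with time overlap and falls with organization size.

```python
from datetime import date


def overlap_days(start_a, end_a, start_b, end_b):
    """Days during which two members belonged to the same organization."""
    latest_start = max(start_a, start_b)
    earliest_end = min(end_a, end_b)
    return max(0, (earliest_end - latest_start).days)


def overlap_affinity(days, org_size):
    """Illustrative score only: grows with overlap, shrinks with organization size."""
    return days / org_size if org_size > 0 else 0.0
```

For example, two members whose stints overlap for a year at a 100-person company would score higher than two with the same overlap at a 1,000-person company.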

Organizational Overlap Model
• Empirical connection density fits our model

How does PYMK work?
• Combine features using a machine learning model
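The slide does not say which model is used. As one possible illustration, a logistic model could combine features such as common connections and organizational overlap into a single score. The function name, feature names, and weights below are hypothetical.

```python
import math


def pymk_score(features, weights, bias=0.0):
    """Logistic model: map weighted features to a connection probability."""
    z = bias + sum(weights[name] * value for name, value in features.items())
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical weights; in practice these would be learned from past invitations.
example_weights = {"common_connections": 0.3, "org_overlap": 0.1}
example_score = pymk_score(
    {"common_connections": 5, "org_overlap": 2.0}, example_weights, bias=-2.0
)
```

Candidates would then be ranked by this score, with the highest-scoring members shown first.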

How Does Diversity Affect Conversion in PYMK?
• Graph structural diversity study
• Measure the effects of structural diversity in PYMK recommendations
• Conversion: a connection invitation is sent to one of the PYMK recommendations
Huang et al., RecSys RSWeb’13

How Does Diversity Affect Conversion in PYMK?
• Members in the recommendation set are mapped to a graph G
• Vertices represent members in the recommendation set
• Edges are the connections between those members on the LinkedIn social graph
• 3 measures of structural diversity
• Number of connected components
• Number of triangles
• Average local node degree
Huang et al., RecSys RSWeb’13
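The three measures can be computed on the recommendation-set graph along these lines. This is a sketch under assumptions: the function name is hypothetical, and "average local node degree" is read here as the mean degree over all vertices in G, which may differ from the exact definition in the study.

```python
from itertools import combinations


def structural_diversity(members, connections):
    """Return (components, triangles, average degree) for a recommendation set.

    members: ids in the recommendation set.
    connections: (a, b) pairs among those members, one entry per undirected edge.
    """
    adjacency = {m: set() for m in members}
    for a, b in connections:
        if a in adjacency and b in adjacency and a != b:
            adjacency[a].add(b)
            adjacency[b].add(a)

    # Connected components via depth-first search.
    seen, components = set(), 0
    for start in adjacency:
        if start in seen:
            continue
        components += 1
        stack = [start]
        seen.add(start)
        while stack:
            node = stack.pop()
            for nxt in adjacency[node] - seen:
                seen.add(nxt)
                stack.append(nxt)

    # Each triangle is found once from each of its three vertices.
    triangles = sum(
        1
        for nbrs in adjacency.values()
        for u, v in combinations(nbrs, 2)
        if v in adjacency[u]
    ) // 3

    average_degree = (
        sum(len(nbrs) for nbrs in adjacency.values()) / len(adjacency)
        if adjacency else 0.0
    )
    return components, triangles, average_degree
```

An isolated member counts as its own component, consistent with the definition of a connected component used in the study.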

Structural Diversity in PYMK
• A connected component
• a set of vertices in which every pair is connected by a path, or a single isolated vertex
• Number of connected components
• a measure of structural diversity [Ugander et al. 2012]
• Smaller number of components => less structural diversity
• Effect on invitation rate (conversion rate)
• ratio of the number of invitations sent to the size of the recommended set

• Invitation rate increases as the number of components decreases

• Lower structural diversity in the recommendation set results in a higher invitation rate
• Different from the Facebook data study [Ugander et al. 2012]
• Use case is slightly different
• The effect of structural diversity on a recommender system depends heavily on the use case
• Don’t generalize structural diversity effects from one recommender system to all

Related Searches Recommendation
• Millions of searches every day
• Help users explore and refine their queries
Reda et al., CIKM’12
