SlideShare a Scribd company logo
1 of 59
Recruiting Solutions 
1 
Structure, Personalization, Scale: 
A Deep Dive into LinkedIn Search
Watch the video with slide 
synchronization on InfoQ.com! 
http://www.infoq.com/presentations 
/linkedin-search 
InfoQ.com: News & Community Site 
• 750,000 unique visitors/month 
• Published in 4 languages (English, Chinese, Japanese and Brazilian 
Portuguese) 
• Post content from our QCon conferences 
• News 15-20 / week 
• Articles 3-4 / week 
• Presentations (videos) 12-15 / week 
• Interviews 2-3 / week 
• Books 1 / month
Presented at QCon New York 
www.qconnewyork.com 
Purpose of QCon 
- to empower software development by facilitating the spread of 
knowledge and innovation 
Strategy 
- practitioner-driven conference designed for YOU: influencers of 
change and innovation in your teams 
- speakers and topics driving the evolution and innovation 
- connecting and catalyzing the influencers and innovators 
Highlights 
- attended by more than 12,000 delegates since 2007 
- held in 9 cities worldwide
Overview 
What is LinkedIn search and why should you care? 
What are our systems challenges? 
What are our relevance challenges? 
2
3
4
Search helps members find and be found. 
5
Search for people, jobs, groups, and more. 
6
7
A separate product for recruiters. 
8
Search is the core of key LinkedIn use cases. 
9
What’s unique 
 Personalized 
 Part of a larger product experience 
– Many products 
– Big part 
 Task-centric 
– Find a job, hire top talent, find a person, … 
10
Systems Challenges 
11
Evolution of LinkedIn’ Search Architecture 
2004: 
 No Search Engine 
 Iterate through your network 
and filter 
12 
2004
13 
2007: Introducing Lucene (single shard, multiple replicas) 
Updates Queries 
2004 
2007 
Lucene 
Lucene 
L uLcuecneen e 
(Single Shard) 
Results
14 
2008: Zoie - real-time search (search without commits/shutdown) 
Updates Queries 
2004 
2007 
Lucene 
Lucene 
Lucene 
2008 
Results 
Lucene 
Zoie
15 
2008: Content Store (aggregating multiple input sources) 
2004 
2007 
2008 
Lucene 
Lucene 
Lucene 
Source 1 
Queries 
Results 
Lucene 
Zoie 
Source 2 
…. 
Source N 
Content 
Store 
….
16 
2004 
2007 
2008 
Source 1 
Queries 
Results 
Source 2 
…. 
Source N 
Content 
Store 
…. 
Broker 
Sharded 
2008: Sharded search
17 
2004 
2007 
2008 
Broker 
2009 
Source 1 
Queries 
Results 
Source 2 
…. 
Source N 
Content 
Store 
…. Sensei 
Lucene 
Zoie 
Bobo 
2009: Bobo – Faceted Search
2010: SenseiDB (cluster management, new query language, wrapping existing pieces) 
18 
2004 
2007 
2008 
2009 
2010
19 
2004 
2007 
2008 
2009 
2010 
2011 
2011: Cleo (instant typeahead results)
20 
2004 
2007 
2008 
2009 
2010 
2011 
2013 
2013: Too many stacks 
Group Search 
Article/Post Search 
And more…
Challenges 
 Index rebuilding very difficult 
 Live updates are at an entity granularity 
 Scoring is inflexible 
 Lucene limitations 
 Fragmentation – too many components, too many stacks 
 Economic Graph 
21 
Opportunity
22 
2004 
2007 
2008 
2009 
2010 
2011 
2013 
2014 
2014: Introducing Galene
Life of a Query 
23 
Query Rewriter/ 
Planner 
Results 
Merging 
User 
Query 
Search 
Results 
Search Shard 
Search Shard
Life of a Query – Within A Search Shard 
24 
Rewritten 
Query 
Top 
Results 
From 
Shard 
INDEX 
Top 
Results 
Retrieve a 
Document 
Score the 
Document
Life of a Query – Within A Rewriter 
25 
Query 
DATA 
MODEL 
Rewriter 
State 
Rewriter 
Module 
DATA 
MODEL 
DATA 
MODEL 
Rewritten 
Query 
Rewriter 
Module 
Rewriter 
Module
Life of Data - Offline 
26 
DATA 
MODEL 
INDEX 
Raw Data Derived Data 
DATA 
MODEL 
DATA 
MODEL 
DATA 
MODEL 
DATA 
MODEL
Improvements 
 Regular full index builds using Hadoop 
– Easier to reshard, add fields 
 Improved Relevance 
– Offline relevance, query rewriting frameworks 
 Partial Live Updates Support 
– Allows efficient updates of high frequency fields (no sync) 
– Goodbye Content Store, Goodbye Zoie 
 Early termination 
– Ultra low latency for instant results 
– Goodbye Cleo 
 Indexing and searching across graph entities/attributes 
 Single engine, single stack 
27
Galene Deep dive 
28
Primer on Search 
29
Lucene 
An open source API that supports search functionality: 
 Add new documents to index 
 Delete documents from the index 
 Construct queries 
 Search the index using the query 
 Score the retrieved documents 
30
The Search Index 
 Inverted Index: Mapping from (search) terms to list of 
documents (they are present in) 
 Forward Index: Mapping from documents to metadata 
about them 
31
32 
BLAH BLAH BLAH DanielBLAH BLAH LinkedIn BLAH BLAH BLAH BLAH 
1. 
BLAH BLAH AsifBLAH LinkedIn BLAH BLAH BLAH BLAH 2. BLAH BLAH BLAH 
Daniel Asif LinkedIn 
1 
2 
Inverted Index Forward Index
The Search Index 
 The lists are called posting lists 
 Upto hundreds of millions of posting lists 
 Upto hundreds of millions of documents 
 Posting lists may contain as few as a single hit and as 
many as tens of millions of hits 
 Terms can be 
– words in the document 
– inferred attributes about the document 
33
Lucene Queries 
 term:“asif makhani” 
 term:asif term:daniel 
 +term:daniel +prefix:tunk 
 +asif +linkedIn 
 +term:daniel connection:50510 
 +term:daniel industry:software connection:50510^4 
34
Early termination 
 We order documents in the index based on a static rank – 
from most important to least important 
 An offline relevance algorithm assigns a static rank to 
each document on which the sorting is performed 
 This allows retrieval to be early-terminated (assuming a 
strong correlation between static rank and importance of 
result for a specific query) 
 Also works well with personalized search 
– +term:asif +prefix:makh 
+(connection:35176 connection:418001 connection:1520032) 
35
Partial Updates 
 Lucene segments are “document-partitioned” 
 We have enhanced Lucene with “term-partitioned” 
segments 
 We use 3 term-partitioned segments: 
– Base index (never changed) 
– Live update buffer 
– Snapshot index 
36
37 
Base Index 
Snapshot 
Index 
Live Update 
Buffer
Going Forward 
 Consolidation across verticals 
 Improved Relevance Support 
– Machine-learned models, query rewriting, relevant snippets,… 
 Improved Performance 
 Search as a Service (SeaS) 
 Exploring the Economic Graph 
38
Quality Challenges 
39
The Search Quality Pipeline 
40 
spellcheck query tagging vertical intent query expansion 
New 
document 
Fe 
atures 
Machin 
e 
learning 
model 
score 
New 
document 
Features 
Machin 
e 
learning 
model 
score 
New 
document 
Fe 
atures 
Machine 
learning 
model 
score 
Ordered 
Ordered 
list 
Ordered 
list 
list
Spellcheck 
41 
PEOPLE NAMES 
COMPANIES 
TITLES 
PAST QUERIES 
n-grams 
marissa => ma ar ri is ss sa 
metaphone 
mark/marc => MRK 
co-occurrence counts 
marissa:mayer = 1000 
marisa meyer yahoo 
marissa 
marisa 
meyer 
mayer 
yahoo
Query Tagging 
42 
machine learning data scientist brooklyn
Vertical Intent: Results Blending 
43 
[company] 
[employees] 
[jobs] 
[name search]
Vertical Intent: Typeahead 
44 
P(mongodb | mon) = 5% 
P(monsanto | mons): 50% 
P(mongodb | mong): 80%
Query Expansion 
45
Ranking 
46 
New 
document 
Fe 
atures 
Machin 
e 
learning 
model 
score 
New 
document 
Fe 
atures 
Machin 
e 
learning 
model 
score 
New 
document 
Fe 
atures 
Machine 
learning 
model 
score 
Ordered 
Ordered 
list 
Ordered 
list 
list
Ranking is highly personalized. 
47 
kevin scott
Not just for name search. 
48
Relevance Model 
49 
keywords 
document 
Fe 
atures 
Machine 
learning 
model
Examples of Features 
50 
Search keywords matching title = 3 
Searcher location = Result location 
Searcher network distance to result = 2 
…
Model Training: Traditional Approach 
51 
Documents for 
training 
Features 
Human 
evaluation 
La 
bels 
Machine 
learning 
model
Model Training: LinkedIn’s Approach 
52 
Documents for 
training 
Features 
Human 
evaluation 
Search logs 
La 
bels 
Machine 
learning 
model
Fair Pairs and Easy Negatives 
53 
Flipped 
[Radlinski and Joachims, 2006] 
• Sample negatives from bottom results 
• But watch out for variable length result sets. 
• Compromise, e.g., sample from page 10.
Model Selection 
 Select model based on user and query features. 
– e.g., person name queries, recruiters making skills queries 
 Resulting model is a tree with logistic regression leaves. 
 Only one regression model evaluated for each document. 
54 
X2=? 
b0 +b1T(x1)+...+bnxn 
X10< 0.1234 ? 
a0 +a1P(x1)+...+anQ(xn ) 
g 0 +g1R(x1)+...+g nQ(xn )
Summary 
What is LinkedIn search and why should you care? 
LinkedIn search enables the participants in 
the economic graph to find and be found. 
What are our systems challenges? 
Indexing rich, structured content; retrieving 
using global and social factors; real-time updates. 
What are our relevance challenges? 
Query understanding, personalized 
machine-learned ranking models. 
55
Asif Makhani Daniel Tunkelang 
amakhani@linkedin.com dtunkelang@linkedin.com 
https://linkedin.com/in/asifmakhani https://linkedin.com/in/dtunkelang 
56
Watch the video with slide synchronization on 
InfoQ.com! 
http://www.infoq.com/presentations/linkedin-search

More Related Content

Viewers also liked

id4 rendered floorplan
id4 rendered floorplanid4 rendered floorplan
id4 rendered floorplanMeli Apone
 
009 esercitazione 1 2016
009 esercitazione 1 2016009 esercitazione 1 2016
009 esercitazione 1 2016Luca Marescotti
 
Dedicato ai miei cuccioli
Dedicato ai miei cuccioliDedicato ai miei cuccioli
Dedicato ai miei cuccioliimartini
 
Dalbavancin Journal Club
Dalbavancin Journal ClubDalbavancin Journal Club
Dalbavancin Journal ClubAmy Yeh
 
Digitalstorytelling for math
Digitalstorytelling for mathDigitalstorytelling for math
Digitalstorytelling for mathMaestra Laura
 
diff in matematica
 diff in matematica diff in matematica
diff in matematicaimartini
 
2016 Fundamentals of Planning. 4 - The history of urbanism and the protection...
2016 Fundamentals of Planning. 4 - The history of urbanism and the protection...2016 Fundamentals of Planning. 4 - The history of urbanism and the protection...
2016 Fundamentals of Planning. 4 - The history of urbanism and the protection...Luca Marescotti
 
Birth of modernity Adolf Loos
Birth of modernity Adolf LoosBirth of modernity Adolf Loos
Birth of modernity Adolf LoosSandra Draskovic
 

Viewers also liked (18)

id4 rendered floorplan
id4 rendered floorplanid4 rendered floorplan
id4 rendered floorplan
 
009 esercitazione 1 2016
009 esercitazione 1 2016009 esercitazione 1 2016
009 esercitazione 1 2016
 
La responsabilidad
La responsabilidadLa responsabilidad
La responsabilidad
 
Lote
LoteLote
Lote
 
Dedicato ai miei cuccioli
Dedicato ai miei cuccioliDedicato ai miei cuccioli
Dedicato ai miei cuccioli
 
CFFS promo brochure
CFFS promo brochureCFFS promo brochure
CFFS promo brochure
 
Dalbavancin Journal Club
Dalbavancin Journal ClubDalbavancin Journal Club
Dalbavancin Journal Club
 
013 discussione 2016
013 discussione 2016013 discussione 2016
013 discussione 2016
 
Slide share
Slide shareSlide share
Slide share
 
Digitalstorytelling for math
Digitalstorytelling for mathDigitalstorytelling for math
Digitalstorytelling for math
 
diff in matematica
 diff in matematica diff in matematica
diff in matematica
 
2016 Fundamentals of Planning. 4 - The history of urbanism and the protection...
2016 Fundamentals of Planning. 4 - The history of urbanism and the protection...2016 Fundamentals of Planning. 4 - The history of urbanism and the protection...
2016 Fundamentals of Planning. 4 - The history of urbanism and the protection...
 
Il teatro greco
Il teatro grecoIl teatro greco
Il teatro greco
 
23. Architettura bizantina in Calabria
23. Architettura bizantina in Calabria23. Architettura bizantina in Calabria
23. Architettura bizantina in Calabria
 
17. Architettura romana - Colosseo - Teatro - Circo
17. Architettura romana - Colosseo - Teatro - Circo17. Architettura romana - Colosseo - Teatro - Circo
17. Architettura romana - Colosseo - Teatro - Circo
 
19. Architettura romana - Le Terme
19. Architettura romana - Le Terme19. Architettura romana - Le Terme
19. Architettura romana - Le Terme
 
27. Architettura Gotica in Francia, Inghilterra e Italia
27. Architettura Gotica in Francia, Inghilterra e Italia27. Architettura Gotica in Francia, Inghilterra e Italia
27. Architettura Gotica in Francia, Inghilterra e Italia
 
Birth of modernity Adolf Loos
Birth of modernity Adolf LoosBirth of modernity Adolf Loos
Birth of modernity Adolf Loos
 

Similar to Structure, Personalization, Scale: A Deep Dive into LinkedIn Search

Search at Linkedin by Sriram Sankar and Kumaresh Pattabiraman
Search at Linkedin by Sriram Sankar and Kumaresh PattabiramanSearch at Linkedin by Sriram Sankar and Kumaresh Pattabiraman
Search at Linkedin by Sriram Sankar and Kumaresh PattabiramanThe Hive
 
Recommender Systems @ Scale - PyData 2019
Recommender Systems @ Scale - PyData 2019Recommender Systems @ Scale - PyData 2019
Recommender Systems @ Scale - PyData 2019Sonya Liberman
 
Fried connecting across silos seminar
Fried connecting across silos seminarFried connecting across silos seminar
Fried connecting across silos seminarJeff Fried
 
Recommender Systems @ Scale, Big Data Europe Conference 2019
Recommender Systems @ Scale, Big Data Europe Conference 2019Recommender Systems @ Scale, Big Data Europe Conference 2019
Recommender Systems @ Scale, Big Data Europe Conference 2019Sonya Liberman
 
Enterprise Search Using SharePoint 2010 and FAST
Enterprise Search Using SharePoint 2010 and FASTEnterprise Search Using SharePoint 2010 and FAST
Enterprise Search Using SharePoint 2010 and FASTBert Johnson
 
Enhancing Relevancy & User Experience with #SharePoint Search sps-philly 2015
Enhancing Relevancy & User Experience with #SharePoint Search   sps-philly 2015Enhancing Relevancy & User Experience with #SharePoint Search   sps-philly 2015
Enhancing Relevancy & User Experience with #SharePoint Search sps-philly 2015Gina Montgomery, V-TSP
 
Leverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platformLeverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platformAndrea Bollini
 
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint SummitSearch Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint SummitJoel Oleson
 
Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesTao Xie
 
A Recruiter's Checklist (Strings Attached)
A Recruiter's Checklist (Strings Attached)A Recruiter's Checklist (Strings Attached)
A Recruiter's Checklist (Strings Attached)Maysoun Mohamed
 
AI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementAI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementTrey Grainger
 
Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersDebdoot Mukherjee
 
ONA (organizational network analysis) - enabling individuals to impact their ...
ONA (organizational network analysis) - enabling individuals to impact their ...ONA (organizational network analysis) - enabling individuals to impact their ...
ONA (organizational network analysis) - enabling individuals to impact their ...Agron Fazliu
 
Candidate selection tutorial
Candidate selection tutorialCandidate selection tutorial
Candidate selection tutorialYiqun Liu
 
SIGIR 2017 - Candidate Selection for Large Scale Personalized Search and Reco...
SIGIR 2017 - Candidate Selection for Large Scale Personalized Search and Reco...SIGIR 2017 - Candidate Selection for Large Scale Personalized Search and Reco...
SIGIR 2017 - Candidate Selection for Large Scale Personalized Search and Reco...Aman Grover
 
Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010bgerman
 
Scaling Recommendations, Semantic Search, & Data Analytics with solr
Scaling Recommendations, Semantic Search, & Data Analytics with solrScaling Recommendations, Semantic Search, & Data Analytics with solr
Scaling Recommendations, Semantic Search, & Data Analytics with solrTrey Grainger
 
Data council sf amundsen presentation
Data council sf    amundsen presentationData council sf    amundsen presentation
Data council sf amundsen presentationTao Feng
 
Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
 Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDBMongoDB
 
Data Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and FutureData Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and Futurefeiwin
 

Similar to Structure, Personalization, Scale: A Deep Dive into LinkedIn Search (20)

Search at Linkedin by Sriram Sankar and Kumaresh Pattabiraman
Search at Linkedin by Sriram Sankar and Kumaresh PattabiramanSearch at Linkedin by Sriram Sankar and Kumaresh Pattabiraman
Search at Linkedin by Sriram Sankar and Kumaresh Pattabiraman
 
Recommender Systems @ Scale - PyData 2019
Recommender Systems @ Scale - PyData 2019Recommender Systems @ Scale - PyData 2019
Recommender Systems @ Scale - PyData 2019
 
Fried connecting across silos seminar
Fried connecting across silos seminarFried connecting across silos seminar
Fried connecting across silos seminar
 
Recommender Systems @ Scale, Big Data Europe Conference 2019
Recommender Systems @ Scale, Big Data Europe Conference 2019Recommender Systems @ Scale, Big Data Europe Conference 2019
Recommender Systems @ Scale, Big Data Europe Conference 2019
 
Enterprise Search Using SharePoint 2010 and FAST
Enterprise Search Using SharePoint 2010 and FASTEnterprise Search Using SharePoint 2010 and FAST
Enterprise Search Using SharePoint 2010 and FAST
 
Enhancing Relevancy & User Experience with #SharePoint Search sps-philly 2015
Enhancing Relevancy & User Experience with #SharePoint Search   sps-philly 2015Enhancing Relevancy & User Experience with #SharePoint Search   sps-philly 2015
Enhancing Relevancy & User Experience with #SharePoint Search sps-philly 2015
 
Leverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platformLeverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platform
 
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint SummitSearch Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
 
Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and Challenges
 
A Recruiter's Checklist (Strings Attached)
A Recruiter's Checklist (Strings Attached)A Recruiter's Checklist (Strings Attached)
A Recruiter's Checklist (Strings Attached)
 
AI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementAI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge Management
 
Serving Information Needs of Knowledge Workers
Serving Information Needs of Knowledge WorkersServing Information Needs of Knowledge Workers
Serving Information Needs of Knowledge Workers
 
ONA (organizational network analysis) - enabling individuals to impact their ...
ONA (organizational network analysis) - enabling individuals to impact their ...ONA (organizational network analysis) - enabling individuals to impact their ...
ONA (organizational network analysis) - enabling individuals to impact their ...
 
Candidate selection tutorial
Candidate selection tutorialCandidate selection tutorial
Candidate selection tutorial
 
SIGIR 2017 - Candidate Selection for Large Scale Personalized Search and Reco...
SIGIR 2017 - Candidate Selection for Large Scale Personalized Search and Reco...SIGIR 2017 - Candidate Selection for Large Scale Personalized Search and Reco...
SIGIR 2017 - Candidate Selection for Large Scale Personalized Search and Reco...
 
Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010
 
Scaling Recommendations, Semantic Search, & Data Analytics with solr
Scaling Recommendations, Semantic Search, & Data Analytics with solrScaling Recommendations, Semantic Search, & Data Analytics with solr
Scaling Recommendations, Semantic Search, & Data Analytics with solr
 
Data council sf amundsen presentation
Data council sf    amundsen presentationData council sf    amundsen presentation
Data council sf amundsen presentation
 
Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
 Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
 
Data Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and FutureData Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and Future
 

More from C4Media

Streaming a Million Likes/Second: Real-Time Interactions on Live Video
Streaming a Million Likes/Second: Real-Time Interactions on Live VideoStreaming a Million Likes/Second: Real-Time Interactions on Live Video
Streaming a Million Likes/Second: Real-Time Interactions on Live VideoC4Media
 
Next Generation Client APIs in Envoy Mobile
Next Generation Client APIs in Envoy MobileNext Generation Client APIs in Envoy Mobile
Next Generation Client APIs in Envoy MobileC4Media
 
Software Teams and Teamwork Trends Report Q1 2020
Software Teams and Teamwork Trends Report Q1 2020Software Teams and Teamwork Trends Report Q1 2020
Software Teams and Teamwork Trends Report Q1 2020C4Media
 
Understand the Trade-offs Using Compilers for Java Applications
Understand the Trade-offs Using Compilers for Java ApplicationsUnderstand the Trade-offs Using Compilers for Java Applications
Understand the Trade-offs Using Compilers for Java ApplicationsC4Media
 
Kafka Needs No Keeper
Kafka Needs No KeeperKafka Needs No Keeper
Kafka Needs No KeeperC4Media
 
High Performing Teams Act Like Owners
High Performing Teams Act Like OwnersHigh Performing Teams Act Like Owners
High Performing Teams Act Like OwnersC4Media
 
Does Java Need Inline Types? What Project Valhalla Can Bring to Java
Does Java Need Inline Types? What Project Valhalla Can Bring to JavaDoes Java Need Inline Types? What Project Valhalla Can Bring to Java
Does Java Need Inline Types? What Project Valhalla Can Bring to JavaC4Media
 
Service Meshes- The Ultimate Guide
Service Meshes- The Ultimate GuideService Meshes- The Ultimate Guide
Service Meshes- The Ultimate GuideC4Media
 
Shifting Left with Cloud Native CI/CD
Shifting Left with Cloud Native CI/CDShifting Left with Cloud Native CI/CD
Shifting Left with Cloud Native CI/CDC4Media
 
CI/CD for Machine Learning
CI/CD for Machine LearningCI/CD for Machine Learning
CI/CD for Machine LearningC4Media
 
Fault Tolerance at Speed
Fault Tolerance at SpeedFault Tolerance at Speed
Fault Tolerance at SpeedC4Media
 
Architectures That Scale Deep - Regaining Control in Deep Systems
Architectures That Scale Deep - Regaining Control in Deep SystemsArchitectures That Scale Deep - Regaining Control in Deep Systems
Architectures That Scale Deep - Regaining Control in Deep SystemsC4Media
 
ML in the Browser: Interactive Experiences with Tensorflow.js
ML in the Browser: Interactive Experiences with Tensorflow.jsML in the Browser: Interactive Experiences with Tensorflow.js
ML in the Browser: Interactive Experiences with Tensorflow.jsC4Media
 
Build Your Own WebAssembly Compiler
Build Your Own WebAssembly CompilerBuild Your Own WebAssembly Compiler
Build Your Own WebAssembly CompilerC4Media
 
User & Device Identity for Microservices @ Netflix Scale
User & Device Identity for Microservices @ Netflix ScaleUser & Device Identity for Microservices @ Netflix Scale
User & Device Identity for Microservices @ Netflix ScaleC4Media
 
Scaling Patterns for Netflix's Edge
Scaling Patterns for Netflix's EdgeScaling Patterns for Netflix's Edge
Scaling Patterns for Netflix's EdgeC4Media
 
Make Your Electron App Feel at Home Everywhere
Make Your Electron App Feel at Home EverywhereMake Your Electron App Feel at Home Everywhere
Make Your Electron App Feel at Home EverywhereC4Media
 
The Talk You've Been Await-ing For
The Talk You've Been Await-ing ForThe Talk You've Been Await-ing For
The Talk You've Been Await-ing ForC4Media
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data EngineeringC4Media
 
Automated Testing for Terraform, Docker, Packer, Kubernetes, and More
Automated Testing for Terraform, Docker, Packer, Kubernetes, and MoreAutomated Testing for Terraform, Docker, Packer, Kubernetes, and More
Automated Testing for Terraform, Docker, Packer, Kubernetes, and MoreC4Media
 

More from C4Media (20)

Streaming a Million Likes/Second: Real-Time Interactions on Live Video
Streaming a Million Likes/Second: Real-Time Interactions on Live VideoStreaming a Million Likes/Second: Real-Time Interactions on Live Video
Streaming a Million Likes/Second: Real-Time Interactions on Live Video
 
Next Generation Client APIs in Envoy Mobile
Next Generation Client APIs in Envoy MobileNext Generation Client APIs in Envoy Mobile
Next Generation Client APIs in Envoy Mobile
 
Software Teams and Teamwork Trends Report Q1 2020
Software Teams and Teamwork Trends Report Q1 2020Software Teams and Teamwork Trends Report Q1 2020
Software Teams and Teamwork Trends Report Q1 2020
 
Understand the Trade-offs Using Compilers for Java Applications
Understand the Trade-offs Using Compilers for Java ApplicationsUnderstand the Trade-offs Using Compilers for Java Applications
Understand the Trade-offs Using Compilers for Java Applications
 
Kafka Needs No Keeper
Kafka Needs No KeeperKafka Needs No Keeper
Kafka Needs No Keeper
 
High Performing Teams Act Like Owners
High Performing Teams Act Like OwnersHigh Performing Teams Act Like Owners
High Performing Teams Act Like Owners
 
Does Java Need Inline Types? What Project Valhalla Can Bring to Java
Does Java Need Inline Types? What Project Valhalla Can Bring to JavaDoes Java Need Inline Types? What Project Valhalla Can Bring to Java
Does Java Need Inline Types? What Project Valhalla Can Bring to Java
 
Service Meshes- The Ultimate Guide
Service Meshes- The Ultimate GuideService Meshes- The Ultimate Guide
Service Meshes- The Ultimate Guide
 
Shifting Left with Cloud Native CI/CD
Shifting Left with Cloud Native CI/CDShifting Left with Cloud Native CI/CD
Shifting Left with Cloud Native CI/CD
 
CI/CD for Machine Learning
CI/CD for Machine LearningCI/CD for Machine Learning
CI/CD for Machine Learning
 
Fault Tolerance at Speed
Fault Tolerance at SpeedFault Tolerance at Speed
Fault Tolerance at Speed
 
Architectures That Scale Deep - Regaining Control in Deep Systems
Architectures That Scale Deep - Regaining Control in Deep SystemsArchitectures That Scale Deep - Regaining Control in Deep Systems
Architectures That Scale Deep - Regaining Control in Deep Systems
 
ML in the Browser: Interactive Experiences with Tensorflow.js
ML in the Browser: Interactive Experiences with Tensorflow.jsML in the Browser: Interactive Experiences with Tensorflow.js
ML in the Browser: Interactive Experiences with Tensorflow.js
 
Build Your Own WebAssembly Compiler
Build Your Own WebAssembly CompilerBuild Your Own WebAssembly Compiler
Build Your Own WebAssembly Compiler
 
User & Device Identity for Microservices @ Netflix Scale
User & Device Identity for Microservices @ Netflix ScaleUser & Device Identity for Microservices @ Netflix Scale
User & Device Identity for Microservices @ Netflix Scale
 
Scaling Patterns for Netflix's Edge
Scaling Patterns for Netflix's EdgeScaling Patterns for Netflix's Edge
Scaling Patterns for Netflix's Edge
 
Make Your Electron App Feel at Home Everywhere
Make Your Electron App Feel at Home EverywhereMake Your Electron App Feel at Home Everywhere
Make Your Electron App Feel at Home Everywhere
 
The Talk You've Been Await-ing For
The Talk You've Been Await-ing ForThe Talk You've Been Await-ing For
The Talk You've Been Await-ing For
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data Engineering
 
Automated Testing for Terraform, Docker, Packer, Kubernetes, and More
Automated Testing for Terraform, Docker, Packer, Kubernetes, and MoreAutomated Testing for Terraform, Docker, Packer, Kubernetes, and More
Automated Testing for Terraform, Docker, Packer, Kubernetes, and More
 

Recently uploaded

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

Structure, Personalization, Scale: A Deep Dive into LinkedIn Search

  • 1. Recruiting Solutions 1 Structure, Personalization, Scale: A Deep Dive into LinkedIn Search
  • 2. Watch the video with slide synchronization on InfoQ.com! http://www.infoq.com/presentations /linkedin-search InfoQ.com: News & Community Site • 750,000 unique visitors/month • Published in 4 languages (English, Chinese, Japanese and Brazilian Portuguese) • Post content from our QCon conferences • News 15-20 / week • Articles 3-4 / week • Presentations (videos) 12-15 / week • Interviews 2-3 / week • Books 1 / month
  • 3. Presented at QCon New York www.qconnewyork.com Purpose of QCon - to empower software development by facilitating the spread of knowledge and innovation Strategy - practitioner-driven conference designed for YOU: influencers of change and innovation in your teams - speakers and topics driving the evolution and innovation - connecting and catalyzing the influencers and innovators Highlights - attended by more than 12,000 delegates since 2007 - held in 9 cities worldwide
  • 4. Overview What is LinkedIn search and why should you care? What are our systems challenges? What are our relevance challenges? 2
  • 5. 3
  • 6. 4
  • 7. Search helps members find and be found. 5
  • 8. Search for people, jobs, groups, and more. 6
  • 9. 7
  • 10. A separate product for recruiters. 8
  • 11. Search is the core of key LinkedIn use cases. 9
  • 12. What’s unique  Personalized  Part of a larger product experience – Many products – Big part  Task-centric – Find a job, hire top talent, find a person, … 10
  • 14. Evolution of LinkedIn’ Search Architecture 2004:  No Search Engine  Iterate through your network and filter 12 2004
  • 15. 13 2007: Introducing Lucene (single shard, multiple replicas) Updates Queries 2004 2007 Lucene Lucene L uLcuecneen e (Single Shard) Results
  • 16. 14 2008: Zoie - real-time search (search without commits/shutdown) Updates Queries 2004 2007 Lucene Lucene Lucene 2008 Results Lucene Zoie
  • 17. 15 2008: Content Store (aggregating multiple input sources) 2004 2007 2008 Lucene Lucene Lucene Source 1 Queries Results Lucene Zoie Source 2 …. Source N Content Store ….
  • 18. 16 2004 2007 2008 Source 1 Queries Results Source 2 …. Source N Content Store …. Broker Sharded 2008: Sharded search
  • 19. 17 2004 2007 2008 Broker 2009 Source 1 Queries Results Source 2 …. Source N Content Store …. Sensei Lucene Zoie Bobo 2009: Bobo – Faceted Search
  • 20. 2010: SenseiDB (cluster management, new query language, wrapping existing pieces) 18 2004 2007 2008 2009 2010
  • 21. 19 2004 2007 2008 2009 2010 2011 2011: Cleo (instant typeahead results)
  • 22. 20 2004 2007 2008 2009 2010 2011 2013 2013: Too many stacks Group Search Article/Post Search And more…
  • 23. Challenges  Index rebuilding very difficult  Live updates are at an entity granularity  Scoring is inflexible  Lucene limitations  Fragmentation – too many components, too many stacks  Economic Graph 21 Opportunity
  • 24. 22 2004 2007 2008 2009 2010 2011 2013 2014 2014: Introducing Galene
  • 25. Life of a Query 23 Query Rewriter/ Planner Results Merging User Query Search Results Search Shard Search Shard
  • 26. Life of a Query – Within A Search Shard 24 Rewritten Query Top Results From Shard INDEX Top Results Retrieve a Document Score the Document
  • 27. Life of a Query – Within A Rewriter 25 Query DATA MODEL Rewriter State Rewriter Module DATA MODEL DATA MODEL Rewritten Query Rewriter Module Rewriter Module
  • 28. Life of Data - Offline 26 DATA MODEL INDEX Raw Data Derived Data DATA MODEL DATA MODEL DATA MODEL DATA MODEL
  • 29. Improvements  Regular full index builds using Hadoop – Easier to reshard, add fields  Improved Relevance – Offline relevance, query rewriting frameworks  Partial Live Updates Support – Allows efficient updates of high frequency fields (no sync) – Goodbye Content Store, Goodbye Zoie  Early termination – Ultra low latency for instant results – Goodbye Cleo  Indexing and searching across graph entities/attributes  Single engine, single stack 27
  • 32. Lucene An open source API that supports search functionality:  Add new documents to index  Delete documents from the index  Construct queries  Search the index using the query  Score the retrieved documents 30
  • 33. The Search Index  Inverted Index: Mapping from (search) terms to list of documents (they are present in)  Forward Index: Mapping from documents to metadata about them 31
  • 34. 32 BLAH BLAH BLAH DanielBLAH BLAH LinkedIn BLAH BLAH BLAH BLAH 1. BLAH BLAH AsifBLAH LinkedIn BLAH BLAH BLAH BLAH 2. BLAH BLAH BLAH Daniel Asif LinkedIn 1 2 Inverted Index Forward Index
  • 35. The Search Index  The lists are called posting lists  Upto hundreds of millions of posting lists  Upto hundreds of millions of documents  Posting lists may contain as few as a single hit and as many as tens of millions of hits  Terms can be – words in the document – inferred attributes about the document 33
  • 36. Lucene Queries  term:“asif makhani”  term:asif term:daniel  +term:daniel +prefix:tunk  +asif +linkedIn  +term:daniel connection:50510  +term:daniel industry:software connection:50510^4 34
  • 37. Early termination  We order documents in the index based on a static rank – from most important to least important  An offline relevance algorithm assigns a static rank to each document on which the sorting is performed  This allows retrieval to be early-terminated (assuming a strong correlation between static rank and importance of result for a specific query)  Also works well with personalized search – +term:asif +prefix:makh +(connection:35176 connection:418001 connection:1520032) 35
  • 38. Partial Updates  Lucene segments are “document-partitioned”  We have enhanced Lucene with “term-partitioned” segments  We use 3 term-partitioned segments: – Base index (never changed) – Live update buffer – Snapshot index 36
  • 39. 37 Base Index Snapshot Index Live Update Buffer
  • 40. Going Forward  Consolidation across verticals  Improved Relevance Support – Machine-learned models, query rewriting, relevant snippets,…  Improved Performance  Search as a Service (SeaS)  Exploring the Economic Graph 38
  • 42. The Search Quality Pipeline 40 spellcheck query tagging vertical intent query expansion New document Fe atures Machin e learning model score New document Features Machin e learning model score New document Fe atures Machine learning model score Ordered Ordered list Ordered list list
  • 43. Spellcheck 41 PEOPLE NAMES COMPANIES TITLES PAST QUERIES n-grams marissa => ma ar ri is ss sa metaphone mark/marc => MRK co-occurrence counts marissa:mayer = 1000 marisa meyer yahoo marissa marisa meyer mayer yahoo
  • 44. Query Tagging 42 machine learning data scientist brooklyn
  • 45. Vertical Intent: Results Blending 43 [company] [employees] [jobs] [name search]
  • 46. Vertical Intent: Typeahead 44 P(mongodb | mon) = 5% P(monsanto | mons): 50% P(mongodb | mong): 80%
  • 48. Ranking 46 New document Fe atures Machin e learning model score New document Fe atures Machin e learning model score New document Fe atures Machine learning model score Ordered Ordered list Ordered list list
  • 49. Ranking is highly personalized. 47 kevin scott
  • 50. Not just for name search. 48
  • 51. Relevance Model 49 keywords document Fe atures Machine learning model
  • 52. Examples of Features 50 Search keywords matching title = 3 Searcher location = Result location Searcher network distance to result = 2 …
  • 53. Model Training: Traditional Approach 51 Documents for training Features Human evaluation La bels Machine learning model
  • 54. Model Training: LinkedIn’s Approach 52 Documents for training Features Human evaluation Search logs La bels Machine learning model
  • 55. Fair Pairs and Easy Negatives 53 Flipped [Radlinski and Joachims, 2006] • Sample negatives from bottom results • But watch out for variable length result sets. • Compromise, e.g., sample from page 10.
  • 56. Model Selection  Select model based on user and query features. – e.g., person name queries, recruiters making skills queries  Resulting model is a tree with logistic regression leaves.  Only one regression model evaluated for each document. 54 X2=? b0 +b1T(x1)+...+bnxn X10< 0.1234 ? a0 +a1P(x1)+...+anQ(xn ) g 0 +g1R(x1)+...+g nQ(xn )
  • 57. Summary What is LinkedIn search and why should you care? LinkedIn search enables the participants in the economic graph to find and be found. What are our systems challenges? Indexing rich, structured content; retrieving using global and social factors; real-time updates. What are our relevance challenges? Query understanding, personalized machine-learned ranking models. 55
  • 58. Asif Makhani Daniel Tunkelang amakhani@linkedin.com dtunkelang@linkedin.com https://linkedin.com/in/asifmakhani https://linkedin.com/in/dtunkelang 56
  • 59. Watch the video with slide synchronization on InfoQ.com! http://www.infoq.com/presentations/linkedin-search