Data-Driven Approach to Search Relevance
Eric Melz
Measured Search
Praveena Subrahmanyam
Ticketmaster
Los Angeles Search, Data, and Analytics Meetup
June 26, 2017
1
2
About the Speakers
Praveena Subrahmanyam
• Senior Architect and Search Lead at Ticketmaster

• ~ 2 years at Ticketmaster

• Geek, Mom, Travel enthusiast

Eric Melz
• Head of Engineering at Measured Search

• Over 20 years in Tech - LinkedIn, Google, Oracle, etc

• Used to work at Ticketmaster
3
About Ticketmaster
The World’s Leading Live Entertainment Company
• A Live Nation Company

• Founded over 40 years ago

• Selling over 400 million tickets each year

• Supporting 240K events, 200K attractions and 100K venues across 80+ countries

• Open APIs

• Follow us @ticketmaster
4
Search at Ticketmaster
• From the homepage, search is the most-used feature

• 50-60% of sessions use search
5
Challenges
• Relevancy

• Text Relevancy

• Popularity

• Geo

• Personalization

• Fix one thing, break another!

• Long tail

• Performance 

• Index

• Query

• Scale

• Documents

• QPS

• Multilingual Documents

• Storing

• Querying
6
Approaches
• Exploratory

• Manual Testing

• Reports

• Feedback

• Social Media

• Internal

• Dev Jams

• Data Driven
7
Measured Search Overview
• Intro

• A/B Testing

• A/B Testing for Search

• Model Simulation

• Ticketmaster Model Simulator
8
Measured Search
Measured Search® enables companies to elevate the experience of search-based applications faster and with more confidence.

Accelerate your timeline
SearchStax: Open Source based Platform-as-a-Service
Accelerate your time to market by flattening the Solr learning curve and going straight to development. Focus on your search application and save months of headaches in setup, provisioning, production readiness and administration.

Peace of Mind
Managed Services and Support
Our always-ready Solr experts are only a call or an email away – every day, all day and night, all year round. Enjoy peace of mind with fully managed Solr-as-a-Service.

On-Demand Expertise
Highly Skilled and Experienced Open Source Search Experts
Our engineers have decades of experience and have delivered numerous engagements in the fields of search, analytics and machine learning. These same search experts are available on an ad hoc basis to help ensure your project’s success.
9
A / B Testing
10
A / B Testing - Fundamentals
Split User population into Segments
Each Segment sees a different variant
• Control - existing version (“A”)

• Treatment - proposed version (“B”)
Variable - metric we hope improves in the treatment group
11
A / B Testing - Example
Split Users into Segments
• segmentId = userId mod 2
Each Segment sees a different variant
• Control - existing version (“A”)

• Blue Button

• Treatment - proposed version (“B”)

• Green Button
Variable - metric we hope improves in the treatment group

• Click rate
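The split on this slide can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and mapping segment 0 to the control and segment 1 to the treatment is an assumption, since the slide gives only segmentId = userId mod 2.

```python
def assign_segment(user_id: int, num_segments: int = 2) -> int:
    """Return the segment index for a user: segmentId = userId mod num_segments."""
    return user_id % num_segments


def variant_for(user_id: int) -> str:
    """Assumed mapping: segment 0 sees the control ("A"), segment 1 the treatment ("B")."""
    return "control" if assign_segment(user_id) == 0 else "treatment"


# Example user ids are made up for illustration.
for uid in (1001, 1002, 1003):
    print(uid, variant_for(uid))
```

Because the assignment depends only on the user id, a given user keeps seeing the same variant across sessions.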
12
Search - Fundamentals
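Query (aka Search): Paul M
Result Set:
1. Paul McCartney
2. Paul Manafort
3. Justin Bieber
Rank (aka Position): the number next to each result (1, 2, 3)
Result Item: a single entry in the result set (e.g. Paul McCartney)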
13
Search A / B Testing - Variants
Variant parameters: Search Index + Ranking Algorithm
Control: Index A + Ranking A
Treatment: Index B + Ranking B
Query: Paul M -> ? (Control or Treatment)
14
Search A / B Testing - Variables

Click Through Rate
Paul M
1. Paul McCartney
2. Paul Manafort
3. Justin Bieber
Click!
15
Search A / B Testing - Variables

Click Through Rate (CTR)
Score = # Clicks / # Searches
Higher scores are better

Control
Query: Paul M
1. Paul McCartney
2. Paul Manafort
3. Justin Bieber
4 searches: 3 clicked, 1 not clicked
CTR = 3/4

Treatment
Query: Paul M
1. Justin Bieber
2. Paul Manafort
3. Paul McCartney
4 searches: 1 clicked, 3 not clicked
CTR = 1/4
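As a minimal sketch, CTR can be computed from one clicked/not-clicked flag per search. The function name and the list representation are assumptions for illustration. The flags mirror the slide's four control and four treatment searches, and their order is arbitrary.

```python
def click_through_rate(clicked_flags):
    """CTR = number of searches with a click / number of searches."""
    if not clicked_flags:
        raise ValueError("need at least one search")
    # True counts as 1 and False as 0 when summed.
    return sum(clicked_flags) / len(clicked_flags)


control = [True, True, False, True]       # 3 of 4 searches clicked
treatment = [False, False, False, True]   # 1 of 4 searches clicked

print(click_through_rate(control))    # 0.75
print(click_through_rate(treatment))  # 0.25
```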
16
Search A / B Testing - Variables

Manual (aka Human) Relevance Ranking
Foreach Query Q

• Foreach Item I

• Manually assign Relevance(Q,I)
Query     Item             Relevance
Paul M    Justin Bieber    5
Paul M    Paul Manafort    20
Paul M    Paul McCartney   98
Paul Ma   Justin Bieber    5
Paul Ma   Paul Manafort    90
Paul Ma   Paul McCartney   70
17
Search A / B Testing - Variables

Human Ranking - Example
Score = Sum(Relevance / Rank)
Higher scores are better

Control (query: Paul M)
Rank   Item             Relevance   Relevance / Rank
1      Paul McCartney   98          98 / 1
2      Paul Manafort    20          20 / 2
3      Justin Bieber    5           5 / 3
Total                               109.7

Treatment (query: Paul M)
Rank   Item             Relevance   Relevance / Rank
1      Justin Bieber    5           5 / 1
2      Paul Manafort    20          20 / 2
3      Paul McCartney   98          98 / 3
Total                               47.7
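The score on this slide, Sum(Relevance / Rank), can be reproduced with a short sketch. The function name and the dictionary of human ratings are illustrative assumptions. The relevance values and result orders are the ones shown for the query Paul M.

```python
def ranking_score(ranked_items, relevance):
    """Sum of relevance / rank over a ranked result list (ranks start at 1)."""
    return sum(relevance[item] / rank
               for rank, item in enumerate(ranked_items, start=1))


# Human-assigned relevance for the query "Paul M", taken from the slides.
relevance_paul_m = {"Paul McCartney": 98, "Paul Manafort": 20, "Justin Bieber": 5}

control = ["Paul McCartney", "Paul Manafort", "Justin Bieber"]
treatment = ["Justin Bieber", "Paul Manafort", "Paul McCartney"]

print(round(ranking_score(control, relevance_paul_m), 1))    # 109.7
print(round(ranking_score(treatment, relevance_paul_m), 1))  # 47.7
```

Dividing by rank rewards putting highly rated items near the top, which is why the control ordering scores higher.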
18
Search A / B Testing - Variables

Human Ranking - Issue
Foreach Query Q

• Foreach Item I

• Manually assign Relevance(Q,I)
100K queries x 100K items = 10,000,000,000 ratings!
19
Search A / B Testing - Variables

Average Click Position
Score = Average(Click Pos)
Lower scores are better

Control
Query: Paul M
1. Paul McCartney
2. Paul Manafort
3. Justin Bieber
4 searches, all clicked: positions 1, 2, 1, 1
Avg Click Pos = (1 + 2 + 1 + 1) / 4 = 1.25

Treatment
Query: Paul M
1. Justin Bieber
2. Paul Manafort
3. Paul McCartney
4 searches: 3 clicked at positions 3, 2, 3; 1 not clicked
Avg Click Pos = (3 + 2 + 3) / 3 = 2.67
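A minimal sketch of the Average Click Position calculation follows. The function name is an assumption. As on the slide, searches without a click are left out of the average, so the treatment divides by 3 rather than 4.

```python
from statistics import mean


def average_click_position(click_positions):
    """Mean 1-based position of clicked results; unclicked searches are excluded."""
    if not click_positions:
        raise ValueError("need at least one click")
    return mean(click_positions)


control_clicks = [1, 2, 1, 1]   # four searches, all clicked
treatment_clicks = [3, 2, 3]    # four searches, one without a click

print(average_click_position(control_clicks))    # 1.25
print(average_click_position(treatment_clicks))
```

Since unclicked searches drop out, this metric is best read alongside CTR: a variant could show a good average position while losing clicks overall.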
20
Search A / B Testing - Variables

Mean Reciprocal Rank (MRR)
Score = Average(1 / Click Pos)
Higher scores are better (will be in range (0,1])

Control
Query: Paul M
1. Paul McCartney
2. Paul Manafort
3. Justin Bieber
4 searches, all clicked: positions 1, 2, 1, 1
MRR = (1/1 + 1/2 + 1/1 + 1/1) / 4 = 0.88

Treatment
Query: Paul M
1. Justin Bieber
2. Paul Manafort
3. Paul McCartney
4 searches: 3 clicked at positions 3, 2, 3; 1 not clicked
MRR = (1/3 + 1/2 + 1/3) / 3 = 0.39
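The MRR formula on this slide, Average(1 / Click Pos), can be sketched as below. The function name is an assumption, and the clicked position stands in for the rank of the relevant result, as the slide does. Searches without a click are again excluded.

```python
def mean_reciprocal_rank(click_positions):
    """Average of 1 / position over clicked searches (positions are 1-based)."""
    if not click_positions:
        raise ValueError("need at least one click")
    return sum(1 / pos for pos in click_positions) / len(click_positions)


control_clicks = [1, 2, 1, 1]
treatment_clicks = [3, 2, 3]

print(mean_reciprocal_rank(control_clicks))    # 0.875
print(mean_reciprocal_rank(treatment_clicks))
```

Every term lies in (0, 1], so the average does too. A click at position 1 contributes the full 1.0, and contributions fall off quickly for lower positions.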
21
A / B Testing - Variables - No Results Searches
Score = # No-Result-Searches / # Searches
Lower scores are better (will be in range [0,1])

Control
Query         Results
Paul M        1. Paul McCartney, 2. Paul Manafort
Paul          NO RESULTS!
Justin Beeb   1. Justin Bieber
Justin Bieb   1. Justin Bieber
No Results = 1/4

Treatment
Query         Results
Paul M        1. Paul McCartney, 2. Paul Manafort
Paul          NO RESULTS!
Justin Beeb   NO RESULTS!
Justin Bieb   1. Justin Bieber
No Results = 2/4
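A short sketch of the no-results score follows. The function name and the per-query result counts are illustrative. The counts follow one reading of the slide, in which the control returns a result for all queries except "Paul" and the treatment also fails on "Justin Beeb".

```python
def no_result_rate(result_counts):
    """Share of searches that returned zero results."""
    if not result_counts:
        raise ValueError("need at least one search")
    return sum(1 for n in result_counts if n == 0) / len(result_counts)


# Number of results returned per query (assumed reading of the slide).
control = {"Paul M": 2, "Paul": 0, "Justin Beeb": 1, "Justin Bieb": 1}
treatment = {"Paul M": 2, "Paul": 0, "Justin Beeb": 0, "Justin Bieb": 1}

print(no_result_rate(control.values()))    # 0.25
print(no_result_rate(treatment.values()))  # 0.5
```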
22
A / B Testing - Issues
• Need adequate sample sizes to achieve statistical significance
• Treatment should…
  • Have negligible impact to business
    • Revenue
    • Goodwill
  • Be production ready
    • Secure
    • Performant
    • Acceptable UX
    • Compatible with prod tech stack
  • Have org approval for prod release
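To illustrate the sample-size point, here is a sketch of one common significance check, a pooled two-proportion z-test on CTR. The talk does not name a specific test, so the choice of test and the function name are assumptions. The normal approximation is also unreliable for very small samples, which is part of the point.

```python
import math


def two_proportion_p_value(clicks_a, searches_a, clicks_b, searches_b):
    """Two-sided p-value for a difference in CTR, using a pooled z-test."""
    p_a = clicks_a / searches_a
    p_b = clicks_b / searches_b
    pooled = (clicks_a + clicks_b) / (searches_a + searches_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / searches_a + 1 / searches_b))
    if std_err == 0:
        return 1.0  # no variation at all, so no evidence of a difference
    z = (p_a - p_b) / std_err
    # Standard normal CDF evaluated at |z| via the error function.
    cdf = 0.5 * (1 + math.erf(abs(z) / math.sqrt(2)))
    return 2 * (1 - cdf)


# Same CTRs (3/4 vs 1/4) at two different sample sizes.
p_small = two_proportion_p_value(3, 4, 1, 4)
p_large = two_proportion_p_value(300, 400, 100, 400)
print(p_small, p_large)
```

With four searches per variant, the 3/4 versus 1/4 gap does not clear a conventional 0.05 threshold, while the same rates over 400 searches per variant do.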
23
Model Simulation - Fundamentals
• Alternative to A/B testing - Simulation
• Don’t direct traffic to different variants
• Single variant - control
• Record requests to control
• Replay recorded requests against treatment (in dev environment)
• Measure performance of treatment against control
24
Search Model Simulation - Specifics
• Record (from control)
  • Searches (queries)
  • Searchclicks (query + item + item position)
• Replay (to treatment)
  • Searches - used to compute
    • % of No-Result Searches
  • Searchclicks - used to compute
    • Average Click Position
    • MRR
• Report
  • Metrics
    • Average Click Pos
    • MRR
    • % of No-Result Searches
  • Items clicked on in control, but not found in treatment
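The record / replay / report steps above can be sketched roughly as follows. This is not the Ticketmaster Model Simulator itself. The `SearchClick` record, the `simulate` function and the dict-backed `treatment_search` stub are all hypothetical. It also assumes that a click's position under the treatment is the rank at which the clicked item appears in the treatment's results for the same query.

```python
from dataclasses import dataclass


@dataclass
class SearchClick:
    query: str
    item: str
    position: int  # 1-based position of the clicked item in the control results


def simulate(searches, search_clicks, treatment_search):
    """Replay recorded control traffic against a treatment search function."""
    # Replay searches to measure the share of no-result searches.
    no_result = sum(1 for q in searches if not treatment_search(q))
    # Replay search clicks to find where each clicked item ranks in treatment.
    positions, missing = [], []
    for click in search_clicks:
        results = treatment_search(click.query)
        if click.item in results:
            positions.append(results.index(click.item) + 1)
        else:
            missing.append((click.query, click.item))
    found = len(positions)
    return {
        "no_result_rate": no_result / len(searches) if searches else None,
        "avg_click_pos": sum(positions) / found if found else None,
        "mrr": sum(1 / p for p in positions) / found if found else None,
        "missing_items": missing,  # clicked in control, not found in treatment
    }


# Hypothetical treatment index, stubbed as a dict of query -> ranked items.
TREATMENT = {
    "Paul M": ["Justin Bieber", "Paul Manafort", "Paul McCartney"],
    "Justin Bieb": ["Justin Bieber"],
}


def treatment_search(query):
    return TREATMENT.get(query, [])


searches = ["Paul M", "Paul", "Justin Beeb", "Justin Bieb"]
clicks = [
    SearchClick("Paul M", "Paul McCartney", 1),
    SearchClick("Paul M", "Paul Manafort", 2),
    SearchClick("Justin Beeb", "Justin Bieber", 1),
]
report = simulate(searches, clicks, treatment_search)
print(report)
```

In a real setup the stub would be replaced by a call to the treatment index, and the recorded control positions could be used to compute the same metrics for the control side of the comparison.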
25
Model Simulation - Flow
[Flow diagram. Components: Searcher, Analyst, Control Index (A), Treatment Index (B), SearchStax Event Data, Model Simulator. Labeled steps: Searches, Track Events, Start Simulation, Fetch Data, Run Queries, Upload Results, Fetch Results.]
26
Model Simulation - Tech Stack
• Search Indexes
  • Elasticsearch
• SearchStax
  • Python/Django
  • MongoDB
  • RDS/MySQL
  • Deployed in Measured Search AWS VPC
• Model Simulator
  • Jenkins
  • Python/Django
  • SQLite
  • Docker
  • Deployed in Ticketmaster AWS VPC
27
Model Simulator - Jenkins
28
Model Simulator - Reporting - Top
29
Model Simulator - Reporting - Metrics
30
Model Simulator - Reporting - Missing Items
31
Relevance Refinement Process
• Gather Data
• Categorize
• Explore
• Evaluate
32
CATEGORIZE
Try to find patterns and categorize poorly performing queries
33
EXPLORE
• Attack top queries
• Low hanging fruit
• Examine impact of changes
• Does it come with a cost?
34
EVALUATE
• Run the Model Simulator
• Regression Test
• Performance Test
• Did we become better?
35
What’s Next?
• Anticipatory Testing
• Automated Relevance
36
Q&A
37
Contact Info
Eric Melz
@ericmelz
eric@measuredsearch.com
https://www.measuredsearch.com
Praveena Subrahmanyam
@askpraveena
praveena.subrahmanyam@ticketmaster.com
https://www.ticketmaster.com

Data-Driven Approach to Search Relevance

  • 1. Data-Driven Approach to Search Relevance Eric Melz Measured Search Praveena Subrahmanyam Ticketmaster Los Angeles Search, Data, and Analytics Meetup June 26, 2017 1
  • 2. 2 About the Speakers Praveena Subrahmanyam • Senior Architect and Search Lead at Ticketmaster • ~ 2 years at Ticketmaster • Geek, Mom, Travel enthusiast Eric Melz • Head of Engineering at Measured Search • Over 20 years in Tech - LinkedIn, Google, Oracle, etc • Used to work at TicketMaster
  • 3. 3 About Ticketmaster The World’s Leading Live Entertainment Company • A Live Nation Company • Founded over 40 years ago • Selling over 400 million tickets each year • Supporting 240K events, 200K attractions and 100K venues across 80+ countries • Open API’s • Follow us @ticketmaster
  • 4. 4 • From the homepage, Search is the Top used feature • 50-60% of sessions use search Search at Ticketmaster
  • 5. 5 Challenges • Relevancy • Text Relevancy • Popularity • Geo • Personalization • Fix one thing break another thing! • Long tail • Performance • Index • Query • Scale • Documents • QPS • Multilingual Documents • Storing • Querying
  • 6. 6 • Exploratory • Manual Testing • Reports • Feedback • Social Media • Internal • Dev Jams • Data Driven Approaches
  • 7. 7 Measured Search Overview •Intro •A/B Testing •A/B Testing for Search •Model Simulation •TicketMaster Model Simulator
  • 8. SearchStax: Open Source based Platform-as-a-Service Accelerate your time to market by flattening the Solr learning curve and going straight to development. Focus on your search application and save months of headaches in setup, provisioning, production readiness and administration. Managed Services and Support Our always-ready Solr experts are only a call or an email away – every day, all day and night, all year round. Enjoy peace of mind with fully managed Solr-as-a-Service. Highly Skilled and Experienced Open Source Search Experts Our engineers have decades of experience and delivered numerous engagements in the field of search, analytics and machine learning. These same search experts are available on an ad hoc basis to help ensure your projects success. Measured Search 8 Accelerate your timeline Peace of Mind On-Demand Expertise Measured Search® enables companies to elevate the experience of Search based applications faster and with more confidence.
  • 9. 9 A / B Testing
  • 10. 10 A / B Testing - Fundamentals Split User population into Segments Each Segment sees a different variant • Control - existing version (“A”) • Treatment - proposed version (“B”) Variable - metric we hope improves in the treatment group
  • 11. 11 A / B Testing - Example Split Users into Segments • segmentId = userId mod 2 Each Segment sees a different variant • Control - existing version (“A”) • Blue Button • Treatment - proposed version (“B”) • Green Button Variable - metric we hope improves in the treatment group • Click rate
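The split rule on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the speakers' code: the function names and the choice that segment 0 is the control are assumptions.

```python
def assign_segment(user_id: int, num_segments: int = 2) -> int:
    """Apply the slide's rule: segmentId = userId mod N (N = 2 on the slide)."""
    return user_id % num_segments


def variant_for(user_id: int) -> str:
    """Map a segment to a variant.

    Assumption: segment 0 is the control (blue button) and
    segment 1 is the treatment (green button).
    """
    return "control" if assign_segment(user_id) == 0 else "treatment"


def button_color(user_id: int) -> str:
    """Return the button color shown to this user in the example experiment."""
    return {"control": "blue", "treatment": "green"}[variant_for(user_id)]
```

Because the assignment depends only on the user id, a given user always lands in the same segment across sessions.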
  • 12. 12 Search - Fundamentals Paul M 1. Paul McCartney 2. Paul Manafort 3. Justin Bieber Query (aka Search) Result Set Rank (aka Position) Result Item
  • 13. 13 Search A / B Testing - Variants Variant parameters: Search Index + Ranking Algorithm Index A + Ranking A Index B + Ranking B Paul M ? Control Treatment
  • 14. 14 Search A / B Testing - Variables Click Through Rate Paul M 1. Paul McCartney 2. Paul Manafort 3. Justin Bieber Click!
  • 15. 15 Search A / B Testing - Variables Click Through Rate (CTR) Score = # Clicks / # Searches Higher scores are better • Control: query “Paul M” returns 1. Paul McCartney 2. Paul Manafort 3. Justin Bieber; 3 of 4 searches clicked, CTR = 3/4 • Treatment: query “Paul M” returns 1. Justin Bieber 2. Paul Manafort 3. Paul McCartney; 1 of 4 searches clicked, CTR = 1/4
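The CTR formula (# Clicks / # Searches) can be sketched as follows. The function name and the list-of-booleans input are illustrative assumptions; each entry records whether one search led to a click.

```python
def click_through_rate(clicked: list) -> float:
    """CTR = number of searches with a click / number of searches."""
    if not clicked:
        raise ValueError("need at least one search")
    return sum(1 for c in clicked if c) / len(clicked)


# One entry per search in the slide's example (True = the searcher clicked).
control_searches = [True, True, False, True]
treatment_searches = [False, False, False, True]

control_ctr = click_through_rate(control_searches)
treatment_ctr = click_through_rate(treatment_searches)
```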
  • 16. 16 Search A / B Testing - Variables Manual (aka Human) Relevance Ranking Foreach Query Q • Foreach Item I • Manually assign Relevance(Q,I) Example ratings (Query, Item, Relevance): (Paul M, Justin Bieber, 5); (Paul M, Paul Manafort, 20); (Paul M, Paul McCartney, 98); (Paul Ma, Justin Bieber, 5); (Paul Ma, Paul Manafort, 90); (Paul Ma, Paul McCartney, 70)
  • 17. 17 Search A / B Testing - Variables Human Ranking - Example Score = Sum(Relevance / Rank) Higher scores are better • Control, query “Paul M” (Rank, Item, Relevance, Relevance / Rank): (1, Paul McCartney, 98, 98 / 1); (2, Paul Manafort, 20, 20 / 2); (3, Justin Bieber, 5, 5 / 3); Total 109.7 • Treatment, query “Paul M” (Rank, Item, Relevance, Relevance / Rank): (1, Justin Bieber, 5, 5 / 1); (2, Paul Manafort, 20, 20 / 2); (3, Paul McCartney, 98, 98 / 3); Total 47.7
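The human-ranking score, Sum(Relevance / Rank), can be sketched as below. The function name and data layout are assumptions for illustration; the relevance values are the manual ratings for the query "Paul M" from the slides.

```python
def human_ranking_score(ranked_items: list, relevance: dict) -> float:
    """Sum each item's manual relevance divided by its 1-based rank."""
    return sum(
        relevance[item] / rank
        for rank, item in enumerate(ranked_items, start=1)
    )


# Manual relevance ratings for the query "Paul M".
relevance_paul_m = {"Paul McCartney": 98, "Paul Manafort": 20, "Justin Bieber": 5}

control_ranking = ["Paul McCartney", "Paul Manafort", "Justin Bieber"]
treatment_ranking = ["Justin Bieber", "Paul Manafort", "Paul McCartney"]

control_score = human_ranking_score(control_ranking, relevance_paul_m)
treatment_score = human_ranking_score(treatment_ranking, relevance_paul_m)
```

Dividing by rank rewards rankings that put highly rated items near the top, which is why the control ordering scores higher here.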
  • 18. 18 Search A / B Testing - Variables Human Ranking - Issue Foreach Query Q • Foreach Item I • Manually assign Relevance(Q,I) 100K queries x 100K items = 10,000,000,000 ratings!
  • 19. 19 Search A / B Testing - Variables Average Click Position Score = Average(Click Pos) Lower scores are better • Control: query “Paul M” returns 1. Paul McCartney 2. Paul Manafort 3. Justin Bieber; clicks at positions 1, 2, 1, 1; Avg Click Pos = (1 + 2 + 1 + 1) / 4 = 1.25 • Treatment: query “Paul M” returns 1. Justin Bieber 2. Paul Manafort 3. Paul McCartney; clicks at positions 3, 2, 3 and one search not clicked; Avg Click Pos = (3 + 2 + 3) / 3 = 2.67
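Average click position can be sketched as follows. The function name is illustrative, and using `None` to mark a search without a click is an assumption; as in the slide's arithmetic, searches without a click are left out of the average.

```python
from typing import Optional, Sequence


def average_click_position(click_positions: Sequence[Optional[int]]) -> float:
    """Average the 1-based positions of clicked results.

    Entries of None (search with no click) are skipped, matching the
    slide, which divides only by the number of clicks.
    """
    clicks = [p for p in click_positions if p is not None]
    if not clicks:
        raise ValueError("no clicks to average")
    return sum(clicks) / len(clicks)


control_clicks = [1, 2, 1, 1]
treatment_clicks = [3, 2, 3, None]  # the fourth search had no click

control_avg = average_click_position(control_clicks)
treatment_avg = average_click_position(treatment_clicks)
```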
  • 20. 20 Search A / B Testing - Variables Mean Reciprocal Rank (MRR) Score = Average(1 / Click Pos) Higher scores are better (will be in range (0,1]) • Control: query “Paul M” returns 1. Paul McCartney 2. Paul Manafort 3. Justin Bieber; clicks at positions 1, 2, 1, 1; MRR = (1/1 + 1/2 + 1/1 + 1/1) / 4 = 0.88 • Treatment: query “Paul M” returns 1. Justin Bieber 2. Paul Manafort 3. Paul McCartney; clicks at positions 3, 2, 3 and one search not clicked; MRR = (1/3 + 1/2 + 1/3) / 3 = 0.39
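MRR as defined on this slide, Average(1 / Click Pos), can be sketched as below. The function name and the `None` marker for a search without a click are assumptions. Note that this follows the slide in averaging over clicked searches only; other formulations of MRR count a search with no click as a reciprocal rank of 0.

```python
from typing import Optional, Sequence


def mean_reciprocal_rank(click_positions: Sequence[Optional[int]]) -> float:
    """Average of 1 / position over clicked searches (1-based positions).

    Searches with no click (None) are skipped, as in the slide's arithmetic.
    """
    clicks = [p for p in click_positions if p is not None]
    if not clicks:
        raise ValueError("no clicks to score")
    return sum(1.0 / p for p in clicks) / len(clicks)


control_mrr = mean_reciprocal_rank([1, 2, 1, 1])
treatment_mrr = mean_reciprocal_rank([3, 2, 3, None])
```

Because every click position is at least 1, each term is at most 1, so the score stays in the range (0, 1].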
  • 21. 21 A / B Testing - Variables - No Results Searches Score = # No-Result-Searches / # Searches Lower scores are better (will be in range [0,1]) • Control: “Paul M” returns 1. Paul McCartney 2. Paul Manafort; “Paul” returns NO RESULTS; “Justin Beeb” returns 1. Justin Bieber; “Justin Bieb” returns 1. Justin Bieber; No Results = 1/4 • Treatment: “Paul M” returns 1. Paul McCartney 2. Paul Manafort; “Paul” returns NO RESULTS; “Justin Beeb” returns NO RESULTS; “Justin Bieb” returns 1. Justin Bieber; No Results = 2/4
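The no-results score can be sketched as follows. The function name and the dictionary layout are assumptions, and the per-query result counts are one reading of the slide's example (query mapped to the number of results returned).

```python
def no_result_rate(result_counts: list) -> float:
    """Score = number of searches with zero results / number of searches."""
    if not result_counts:
        raise ValueError("need at least one search")
    return sum(1 for n in result_counts if n == 0) / len(result_counts)


# Query -> number of results returned (assumed reading of the slide).
control_results = {"Paul M": 2, "Paul": 0, "Justin Beeb": 1, "Justin Bieb": 1}
treatment_results = {"Paul M": 2, "Paul": 0, "Justin Beeb": 0, "Justin Bieb": 1}

control_rate = no_result_rate(list(control_results.values()))
treatment_rate = no_result_rate(list(treatment_results.values()))
```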
  • 22. 22 A / B Testing - Issues •Need adequate sample sizes to achieve statistical significance •Treatment should… •Have negligible impact on the business •Revenue •Goodwill •Be production ready •Secure •Performant •Acceptable UX •Compatible with prod tech stack •Have org approval for prod release
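The sample-size point can be illustrated with a significance check on two click-through rates. The slides do not name a statistical test; the two-proportion z-test below is an assumed choice for illustration, and its normal approximation is itself unreliable for very small samples.

```python
import math


def two_proportion_z_test(clicks_a: int, n_a: int, clicks_b: int, n_b: int):
    """Return (z, two-sided p-value) for the difference between two CTRs.

    Uses the pooled-variance z-test with a normal approximation.
    """
    p_a = clicks_a / n_a
    p_b = clicks_b / n_b
    pooled = (clicks_a + clicks_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if std_err == 0:
        raise ValueError("no variation in the data; the test is undefined")
    z = (p_b - p_a) / std_err
    # Two-sided p-value under the standard normal: 2 * (1 - Phi(|z|)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Same click rates (3/4 vs 1/4), very different sample sizes.
z_small, p_small = two_proportion_z_test(3, 4, 1, 4)
z_large, p_large = two_proportion_z_test(300, 400, 100, 400)
```

With four searches per variant the difference between 3/4 and 1/4 does not reach the conventional 0.05 threshold, while the same rates over 400 searches per variant do.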
  • 23. 23 Model Simulation - Fundamentals •Alternative to A/B testing - Simulation •Don’t direct traffic to different variants •Single variant - control •Record requests to control •Replay recorded requests against treatment (in dev environment) •Measure performance of treatment against control
  • 24. 24 Search Model Simulation - Specifics • Record (from control) • Searches (queries) • Searchclicks (queries + item + item position) • Replay (to treatment) • Searches - used to compute • % of No-Result searches • Searchclicks - used to compute •Average Click Position •MRR • Report • Metrics • Average Click Pos • MRR • % of No-Result Searches •Items clicked on in control, but not found in treatment
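The record-and-replay steps above can be sketched as a small function. This is a hedged illustration, not the Model Simulator's implementation: `SearchClick`, `simulate`, and `treatment_search` are hypothetical names, and the sketch assumes the treatment's click position is the rank at which the item clicked in control appears in the treatment results.

```python
from typing import Callable, List, NamedTuple


class SearchClick(NamedTuple):
    """One recorded click from control: query, clicked item, its position."""
    query: str
    item: str
    position: int  # 1-based position in the control result set


def simulate(searches: List[str],
             searchclicks: List[SearchClick],
             treatment_search: Callable[[str], List[str]]) -> dict:
    """Replay recorded control traffic against a treatment search function."""
    no_results = sum(1 for q in searches if not treatment_search(q))

    positions = []  # rank of each clicked item in the treatment results
    missing = []    # items clicked in control but absent from treatment
    for click in searchclicks:
        results = treatment_search(click.query)
        if click.item in results:
            positions.append(results.index(click.item) + 1)
        else:
            missing.append((click.query, click.item))

    found = len(positions)
    return {
        "no_result_rate": no_results / len(searches) if searches else None,
        "avg_click_position": sum(positions) / found if found else None,
        "mrr": sum(1 / p for p in positions) / found if found else None,
        "missing_items": missing,
    }


# Hypothetical treatment index: query -> ranked result list.
treatment_index = {
    "Paul M": ["Justin Bieber", "Paul Manafort", "Paul McCartney"],
    "Justin Bieb": ["Justin Bieber"],
}

report = simulate(
    searches=["Paul M", "Paul", "Justin Bieb", "Justin Beeb"],
    searchclicks=[
        SearchClick("Paul M", "Paul McCartney", 1),
        SearchClick("Paul M", "Paul Manafort", 2),
        SearchClick("Justin Beeb", "Justin Bieber", 1),
    ],
    treatment_search=lambda q: treatment_index.get(q, []),
)
```

Passing the treatment as a plain callable keeps the sketch independent of any particular search engine client.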
  • 25. 25 Model Simulation - Flow A Control Index B Treatment Index Event DataSearchStax Searches Start Simulation Fetch Results Model Simulator Fetch Data Upload Results Track Events Run Queries Searcher Analyst
  • 26. 26 Model Simulation - Tech Stack • Search Indexes • Elasticsearch • SearchStax • Python/Django • MongoDB • RDS/MySQL • Deployed in Measured Search AWS VPC • Model Simulator • Jenkins • Python/Django • SQLite • Docker • Deployed in Ticketmaster AWS VPC
  • 28. 28 Model Simulator - Reporting - Top
  • 29. 29 Model Simulator - Reporting - Metrics
  • 30. 30 Model Simulator - Reporting - Missing Items
  • 32. 32 CATEGORIZE Try to find patterns and categorize poorly performing queries
  • 33. 33 EXPLORE • Attack top queries • Low hanging fruit • Examine impact of changes • Does it come with a cost?
  • 34. 34 EVALUATE • Run the Model Simulator • Regression Test • Performance Test • Did we become better?
  • 35. 35 What’s Next? • Anticipatory Testing • Automated Relevance
  • 37. 37 Contact Info Eric Melz @ericmelz eric@measuredsearch.com https://www.measuredsearch.com Praveena Subrahmanyam @askpraveena praveena.subrahmanyam@ticketmaster.com https://www.ticketmaster.com