SlideShare a Scribd company logo
1 of 16
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
Practical Relevance Measurement
Walter Underwood
Principal Software Engineer
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Measure
• Explain
• Diagnose
• Fix
• Repeat
Data-Driven Improvement
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
Actionable
Measure: What is a good metric?
10
Common
interpretation
Accessible,
credible
data
Transparent,
simple
calculation
Juice Analytics: http://www.juiceanalytics.com/writing/choosing-right-metric
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Click-through rate (CTR)
• Great for navigational search (one correct answer)
• Great for simple search UI
• Success rate
• With multiple clicks, identify the result that satisfied
• Dwell time, conversion, …
• Needed for informational search
• Useful for complex UI: facets, filters, …
Measure: What is Success?
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Use these measures to explain search success to others
• 0.50 CTR or success rate is pretty good
• A one percentage point improvement is very good
• Do not over-promise
• Report CTR daily and graph it
Explain: Overall Success
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• What are the problem queries?
• Per-query CTR reports
Diagnose
2
Query CTR Frequency
gorilla 0.45 10,000
orangutan 0.10 100
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• What should the queries’ CTR be?
• How many clicks would that be?
• How many times were people less happy than expected?
Diagnose: Click Residual
2
Query CTR Frequency Actual Expected Residual
TOTAL 0.50 100,000 50,000
gorilla 0.45 10,000 4,500 5,000 -500
orangutan 0.10 100 10 50 -40
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Customer uses different word than content
• The Vocabulary Problem (Furnas, et al; 1987)
• Add synonyms: coat, jacket, parka, anorak, …
• Content doesn’t exist
• We don’t sell that — add “no hits” page with
recommendations
• Add to site/inventory
• Misspellings (about 10% of queries)
• Fuzzy search
• Query suggestions
Fix: What is the cause?
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Count attempts (queries)
• Count clicks or successes
• Associate clicks with queries
• Handle anomalies
• Hash size
Implementing Metrics
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Log the query
• Log a hash or unique ID for this attempt
• Will be used to link clicks to attempts
• Random bits generated in server (session log)
• Random bits generated in browser (URL param)
• Hash calculated from user ID, query, and time
Implementing: Count Attempts
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Search results page will have decorated URLs on results
• Tracking hash
• Rank (1-based)
• Query is optional, you can join with the attempts log
• Result links look like:
/product-1234?tk=ABCD&r=1
/product-5678?tk=ABCD&r=2
Implementing: Count Clicks
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Multiple clicks from one SRP
• Opening results in new tabs
• Cached SRP
• Solution: Only count one, maybe the last one
• Clicks that don’t match a query attempt
• Bookmarked results
• Clicks from a page loaded a previous day
• Solution: Ignore them
• Bots hitting SRP
• Remove from stats
• JavaScript-generated token helps filter bots
Implementing: Anomalies
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
• Birthday Paradox — you’ll need more bits than you think
• Chegg uses:
• 48 random bits
• Encoded in URL-safe Base64
• Eight characters
• Looks like: tk=NmAwoq5e
• 1% chance of collision with 2.4 million searches
• See “Birthday Problem” on Wikipedia
Implementing: Hash Size
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
Questions?
2
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.
Search Analytics for Your Site — Louis Rosenfeld
3
http://rosenfeldmedia.com/books/search-analytics-for-your-site/
Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.Confidential Material – Chegg Inc. © 2005 – 2015 by Chegg Inc. All Rights Reserved.
Thank you!
wunder@wunderwood.org
5

More Related Content

Viewers also liked

Prévisions des crises
Prévisions des crises Prévisions des crises
Prévisions des crises Jibin Lin
 
Javed sayyed resume
Javed sayyed resumeJaved sayyed resume
Javed sayyed resumeJaved Sayyed
 
Kt product catalogue s s
Kt product catalogue s sKt product catalogue s s
Kt product catalogue s sSohail Ahmed
 
Happy birthday quote pictures
Happy birthday quote picturesHappy birthday quote pictures
Happy birthday quote pictureskayum1
 
Gabor Frames for Quasicrystals and K-theory
Gabor Frames for Quasicrystals and K-theoryGabor Frames for Quasicrystals and K-theory
Gabor Frames for Quasicrystals and K-theoryMichael Kreisel
 
презентация команды мбоу сш 15 5 класс
презентация команды мбоу сш 15 5 класспрезентация команды мбоу сш 15 5 класс
презентация команды мбоу сш 15 5 класслена резник
 

Viewers also liked (13)

Question 6
Question 6Question 6
Question 6
 
Prévisions des crises
Prévisions des crises Prévisions des crises
Prévisions des crises
 
Praktek TIK
Praktek TIKPraktek TIK
Praktek TIK
 
Javed sayyed resume
Javed sayyed resumeJaved sayyed resume
Javed sayyed resume
 
Normal Labour by Dr Salman
Normal Labour by Dr SalmanNormal Labour by Dr Salman
Normal Labour by Dr Salman
 
Kt product catalogue s s
Kt product catalogue s sKt product catalogue s s
Kt product catalogue s s
 
Magazine Genre Research
Magazine Genre ResearchMagazine Genre Research
Magazine Genre Research
 
VENKATRAMANI RESUME
VENKATRAMANI RESUMEVENKATRAMANI RESUME
VENKATRAMANI RESUME
 
Happy birthday quote pictures
Happy birthday quote picturesHappy birthday quote pictures
Happy birthday quote pictures
 
Gabor Frames for Quasicrystals and K-theory
Gabor Frames for Quasicrystals and K-theoryGabor Frames for Quasicrystals and K-theory
Gabor Frames for Quasicrystals and K-theory
 
Profile Brij Mohan Bhatt
Profile Brij Mohan BhattProfile Brij Mohan Bhatt
Profile Brij Mohan Bhatt
 
Pinagmulan Ng Tao
Pinagmulan Ng Tao Pinagmulan Ng Tao
Pinagmulan Ng Tao
 
презентация команды мбоу сш 15 5 класс
презентация команды мбоу сш 15 5 класспрезентация команды мбоу сш 15 5 класс
презентация команды мбоу сш 15 5 класс
 

Similar to Practical Relevance Measurement for Search Analytics

You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...Concept Searching, Inc
 
Measuring ROI on Enterprise Search - John Lenker, Lucidworks
Measuring ROI on Enterprise Search - John Lenker, Lucidworks Measuring ROI on Enterprise Search - John Lenker, Lucidworks
Measuring ROI on Enterprise Search - John Lenker, Lucidworks Lucidworks
 
Data Driven Website Decisions for SEO and CRO
Data Driven Website Decisions for SEO and CROData Driven Website Decisions for SEO and CRO
Data Driven Website Decisions for SEO and CROAmanda King
 
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...Vishrut Shukla
 
Tips & Tricks for Getting Things Done Using Analytics Data
Tips & Tricks for Getting Things Done Using Analytics DataTips & Tricks for Getting Things Done Using Analytics Data
Tips & Tricks for Getting Things Done Using Analytics DataCharles Meaden
 
Digital Velocity 2014: "R-Oh-My! Quantifying the ROI of Tag Management"
Digital Velocity 2014: "R-Oh-My! Quantifying the ROI of Tag Management" Digital Velocity 2014: "R-Oh-My! Quantifying the ROI of Tag Management"
Digital Velocity 2014: "R-Oh-My! Quantifying the ROI of Tag Management" Tealium
 
Ericriesleanstartuppresentationforweb2
Ericriesleanstartuppresentationforweb2Ericriesleanstartuppresentationforweb2
Ericriesleanstartuppresentationforweb2Edmund FOng
 
Web Analytics and SEO for Startups
Web Analytics and SEO for StartupsWeb Analytics and SEO for Startups
Web Analytics and SEO for StartupsAdvance Metrics
 
Web Analytics and SEO for Startups
Web Analytics and SEO for StartupsWeb Analytics and SEO for Startups
Web Analytics and SEO for StartupsChristina Meyer
 
SEO in the Age of Artificial Intelligence | How AI influences Search
SEO in the Age of Artificial Intelligence | How AI influences SearchSEO in the Age of Artificial Intelligence | How AI influences Search
SEO in the Age of Artificial Intelligence | How AI influences SearchPhilipp Klöckner
 
"Taming Advanced Analytics Implementations at EA Scale" - Electronic Arts, Di...
"Taming Advanced Analytics Implementations at EA Scale" - Electronic Arts, Di..."Taming Advanced Analytics Implementations at EA Scale" - Electronic Arts, Di...
"Taming Advanced Analytics Implementations at EA Scale" - Electronic Arts, Di...Tealium
 
Search and Social Media Marketing Course Slides - Salford Universtiy
Search and Social Media Marketing Course Slides - Salford UniverstiySearch and Social Media Marketing Course Slides - Salford Universtiy
Search and Social Media Marketing Course Slides - Salford UniverstiyTom Mason
 
Four Laws of Tech Product Economics - Rich Mironov
Four Laws of Tech Product Economics - Rich MironovFour Laws of Tech Product Economics - Rich Mironov
Four Laws of Tech Product Economics - Rich MironovProductCampPortland
 
April 2018 Digital Marketing Roundtable
April 2018 Digital Marketing RoundtableApril 2018 Digital Marketing Roundtable
April 2018 Digital Marketing RoundtableMartha Herdendorf
 
UX Webinar: Always Be Testing
UX Webinar: Always Be TestingUX Webinar: Always Be Testing
UX Webinar: Always Be TestingCharity Dynamics
 
Digital Marketing Training
Digital Marketing TrainingDigital Marketing Training
Digital Marketing TrainingPiyushPahadi
 
Google WorkShop for STC LoneStar Chapter
Google WorkShop for STC LoneStar ChapterGoogle WorkShop for STC LoneStar Chapter
Google WorkShop for STC LoneStar ChapterChristopher Ward
 
SEO Audit Workshop : Frameworks , Techniques and Tools
SEO Audit Workshop : Frameworks , Techniques and Tools SEO Audit Workshop : Frameworks , Techniques and Tools
SEO Audit Workshop : Frameworks , Techniques and Tools NEW MEDIA GURU
 

Similar to Practical Relevance Measurement for Search Analytics (20)

You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
 
Measuring ROI on Enterprise Search - John Lenker, Lucidworks
Measuring ROI on Enterprise Search - John Lenker, Lucidworks Measuring ROI on Enterprise Search - John Lenker, Lucidworks
Measuring ROI on Enterprise Search - John Lenker, Lucidworks
 
Data Driven Website Decisions for SEO and CRO
Data Driven Website Decisions for SEO and CROData Driven Website Decisions for SEO and CRO
Data Driven Website Decisions for SEO and CRO
 
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
Getting Started with Product Analytics - A 101 Implementation Guide for Begin...
 
The Death of The Unpaid Internship
The Death of The Unpaid Internship The Death of The Unpaid Internship
The Death of The Unpaid Internship
 
Tips & Tricks for Getting Things Done Using Analytics Data
Tips & Tricks for Getting Things Done Using Analytics DataTips & Tricks for Getting Things Done Using Analytics Data
Tips & Tricks for Getting Things Done Using Analytics Data
 
Digital Velocity 2014: "R-Oh-My! Quantifying the ROI of Tag Management"
Digital Velocity 2014: "R-Oh-My! Quantifying the ROI of Tag Management" Digital Velocity 2014: "R-Oh-My! Quantifying the ROI of Tag Management"
Digital Velocity 2014: "R-Oh-My! Quantifying the ROI of Tag Management"
 
Ericriesleanstartuppresentationforweb2
Ericriesleanstartuppresentationforweb2Ericriesleanstartuppresentationforweb2
Ericriesleanstartuppresentationforweb2
 
Web Analytics and SEO for Startups
Web Analytics and SEO for StartupsWeb Analytics and SEO for Startups
Web Analytics and SEO for Startups
 
Web Analytics and SEO for Startups
Web Analytics and SEO for StartupsWeb Analytics and SEO for Startups
Web Analytics and SEO for Startups
 
SEO in the Age of Artificial Intelligence | How AI influences Search
SEO in the Age of Artificial Intelligence | How AI influences SearchSEO in the Age of Artificial Intelligence | How AI influences Search
SEO in the Age of Artificial Intelligence | How AI influences Search
 
"Taming Advanced Analytics Implementations at EA Scale" - Electronic Arts, Di...
"Taming Advanced Analytics Implementations at EA Scale" - Electronic Arts, Di..."Taming Advanced Analytics Implementations at EA Scale" - Electronic Arts, Di...
"Taming Advanced Analytics Implementations at EA Scale" - Electronic Arts, Di...
 
Search and Social Media Marketing Course Slides - Salford Universtiy
Search and Social Media Marketing Course Slides - Salford UniverstiySearch and Social Media Marketing Course Slides - Salford Universtiy
Search and Social Media Marketing Course Slides - Salford Universtiy
 
Four Laws of Tech Product Economics - Rich Mironov
Four Laws of Tech Product Economics - Rich MironovFour Laws of Tech Product Economics - Rich Mironov
Four Laws of Tech Product Economics - Rich Mironov
 
April 2018 Digital Marketing Roundtable
April 2018 Digital Marketing RoundtableApril 2018 Digital Marketing Roundtable
April 2018 Digital Marketing Roundtable
 
UX Webinar: Always Be Testing
UX Webinar: Always Be TestingUX Webinar: Always Be Testing
UX Webinar: Always Be Testing
 
Digital Marketing Training
Digital Marketing TrainingDigital Marketing Training
Digital Marketing Training
 
Needle in a Haystack_ACS
Needle in a Haystack_ACSNeedle in a Haystack_ACS
Needle in a Haystack_ACS
 
Google WorkShop for STC LoneStar Chapter
Google WorkShop for STC LoneStar ChapterGoogle WorkShop for STC LoneStar Chapter
Google WorkShop for STC LoneStar Chapter
 
SEO Audit Workshop : Frameworks , Techniques and Tools
SEO Audit Workshop : Frameworks , Techniques and Tools SEO Audit Workshop : Frameworks , Techniques and Tools
SEO Audit Workshop : Frameworks , Techniques and Tools
 

More from enterprisesearchmeetup

More from enterprisesearchmeetup (6)

Cisco meetup-25 april2017
Cisco meetup-25 april2017Cisco meetup-25 april2017
Cisco meetup-25 april2017
 
Algolia - Hosted Search API
Algolia - Hosted Search API Algolia - Hosted Search API
Algolia - Hosted Search API
 
ElasticSearch - Introduction to Aggregations
ElasticSearch - Introduction to AggregationsElasticSearch - Introduction to Aggregations
ElasticSearch - Introduction to Aggregations
 
The Elastic ELK Stack
The Elastic ELK StackThe Elastic ELK Stack
The Elastic ELK Stack
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
 
Scalable Search Analytics
Scalable Search AnalyticsScalable Search Analytics
Scalable Search Analytics
 

Recently uploaded

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 

Recently uploaded (20)

My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 

Practical Relevance Measurement for Search Analytics

  • 1. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. Practical Relevance Measurement Walter Underwood Principal Software Engineer
  • 2. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Measure • Explain • Diagnose • Fix • Repeat Data-Driven Improvement 2
  • 3. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. Actionable Measure: What is a good metric? 10 Common interpretation Accessible, credible data Transparent, simple calculation Juice Analytics: http://www.juiceanalytics.com/writing/choosing-right-metric
  • 4. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Click-through rate (CTR) • Great for navigational search (one correct answer) • Great for simple search UI • Success rate • With multiple clicks, identify the result that satisfied • Dwell time, conversion, … • Needed for informational search • Useful for complex UI: facets, filters, … Measure: What is Success? 2
  • 5. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Use these measures to explain search success to others • 0.50 CTR or success rate is pretty good • A one percentage point improvement is very good • Do not over-promise • Report CTR daily and graph it Explain: Overall Success 2
  • 6. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • What are the problem queries? • Per-query CTR reports Diagnose 2 Query CTR Frequency gorilla 0.45 10,000 orangutan 0.10 100
  • 7. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • What should the queries’ CTR be? • How many clicks would that be? • How many times were people less happy than expected? Diagnose: Click Residual 2 Query CTR Frequency Actual Expected Residual TOTAL 0.50 100,000 50,000 gorilla 0.45 10,000 4,500 5,000 -500 orangutan 0.10 100 10 50 -40
  • 8. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Customer uses different word than content • The Vocabulary Problem (Furnas, et al; 1987) • Add synonyms: coat, jacket, parka, anorak, … • Content doesn’t exist • We don’t sell that — add “no hits” page with recommendations • Add to site/inventory • Misspellings (about 10% of queries) • Fuzzy search • Query suggestions Fix: What is the cause? 2
  • 9. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Count attempts (queries) • Count clicks or successes • Associate clicks with queries • Handle anomalies • Hash size Implementing Metrics 2
  • 10. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Log the query • Log a hash or unique ID for this attempt • Will be used to link clicks to attempts • Random bits generated in server (session log) • Random bits generated in browser (URL param) • Hash calculated from user ID, query, and time Implementing: Count Attempts 2
  • 11. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Search results page will have decorated URLs on results • Tracking hash • Rank (1-based) • Query is optional, you can join with the attempts log • Result links look like: /product-1234?tk=ABCD&r=1 /product-5678?tk=ABCD&r=2 Implementing: Count Clicks 2
  • 12. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Multiple clicks from one SRP • Opening results in new tabs • Cached SRP • Solution: Only count one, maybe the last one • Clicks that don’t match a query attempt • Bookmarked results • Clicks from a page loaded a previous day • Solution: Ignore them • Bots hitting SRP • Remove from stats • JavaScript-generated token helps filter bots Implementing: Anomalies 2
  • 13. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. • Birthday Paradox — you’ll need more bits than you think • Chegg uses: • 48 random bits • Encoded in URL-safe Base64 • Eight characters • Looks like: tk=NmAwoq5e • 1% chance of collision with 2.4 million searches • See “Birthday Problem” on Wikipedia Implementing: Hash Size 2
  • 14. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. Questions? 2
  • 15. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved. Search Analytics for Your Site — Louis Rosenfeld 3 http://rosenfeldmedia.com/books/search-analytics-for-your-site/
  • 16. Confidential Material – Chegg Inc. © 2005 - 2015. All Rights Reserved.Confidential Material – Chegg Inc. © 2005 – 2015 by Chegg Inc. All Rights Reserved. Thank you! wunder@wunderwood.org 5

Editor's Notes

  1. Abstract: How do we measure and improve site and intranet search without tons of data and large teams of analysts? How to start instrumenting your site? Walter Underwood will talk about metrics he’s developed at Ultraseek, Netflix, and Chegg. They are straightforward to implement, direct your efforts toward the most improvement, and are easy to explain to executives.