SlideShare a Scribd company logo
1 of 17
Download to read offline
Using AI to Understand Search Intent
MICES 2021
Aritra Mandal
eBay Search
June, 2021
About Me
Aritra Mandal is an applied researcher on the Search team
at eBay. He focuses on search quality and is leveraging
AI/ML, structured data, and knowledge graphs to improve
the search engine that powers eBay’s marketplace. Aritra
received his BEng in computer science from Birla Institute of
Technology and his MS in computer and information science
from Indiana University–Purdue University Indianapolis.
Agenda
• What is Query Understanding?
• Using AI to Understand Search Intent
– Query Categorization
– Query Equivalence and Similarity
• Summary
What is Query Understanding?
Query understanding is the process of inferring the intent of a search query from the
searcher’s keywords.
It takes place before the search engine retrieves and ranks results.
Query understanding is useful not only for retrieval and ranking, but also for interface
decisions, recommendations, promotions, analytics, etc.
Spectrum of Query Intent
• Coarse-Grained
– Query Categorization
– Broad vs. Narrow
• Fine-Grained
– Entity Recognition
– Query Equivalence
Query Categorization
Benefits of Query Categorization
• Relevance filtering: excludes out-of-category results that match keywords.
– Especially useful for non-default sorts, like sorting by price.
• Helps provide searchers accurate summary information about query results.
– Accurate numbers for total number of results and category / facet counts.
– Contextual appropriate sub-categories and facets, sorts, filters, etc.
vs.
Using AI for Query Categorization
• Create labeled training data from clicks or purchases.
– Query => clicked or purchased product => product category.
– Identify the appropriate category with sufficient demand for each query.
– Use fastText, BERT, or any text classification model.
• Apply model to other queries -- especially torso and tail queries.
– Compute categories for medium to low frequency queries online.
Collect Query
Engaged Items
pairs
Queries with
Few or No
engagements
Use category
Hierarchy to roll
up and create
training records
Train Model
Bert/FastText
Category
Prediction for
Queries
Query Similarity based on Intent
• Multiple queries often map to the same intent.
– e.g., mens shoes = shoes for men.
Why group of queries with similar intent ?
• If we can recognize queries with the same (or nearly the same) intent, we can:
– Map poorly performing queries to better-performing equivalent ones.
– Intelligently recover from queries that return few or no results.
– Analyze search behavior grouping by intent, rather than by query.
– Obtain better signals to train machine learning models, e.g., for ranking.
• Recognizing query equivalence and similarity allows us to transform search queries
into canonical representations of search intent, establishing a more robust foundation
to optimize the search experience.
Recognizing Query Equivalence: 2 Strategies
• How do we recognize that two queries represent the same search intent?
– Surface Query Similarity
• stemming, word order, compounds, noise words
• e.g., mens wristwatch = wrist watches for men
– Post-Search Behavior
• engagement (clicks, conversions) with similar results
• requires a way to measure similar results (hold that thought!)
Surface Query Similarity
• The Good:
– Easy to recognize queries that differ only in stemming, word order, etc.
– High precision as a standalone indicator of query equivalence.
– Decent coverage that can be extended (e.g., low-edit-distance query pairs).
– Simple and explainable!
• The Bad:
– There are false positives, e.g., kiss != kisses; dress shirt != shirt dress.
– Need guardrails to avoid embarrassing mistakes.
– Limited coverage; extending it significantly increases risk of false positives.
Post-Search Behavior
• The Good:
– Higher coverage than can be obtained from surface query similarity.
– Learn from user behavior, which can look far beyond literal query tokens.
– Use whole-query context for contextual expansion and relaxation.
• The Bad:
– Complex compared to surface similarity, stochastic, and difficult to explain.
– Replace simple binary query equivalence with continuous similarity metric.
– Picking the right similarity threshold is as much an art as a science.
– Rely on a pipeline of black-box approaches, starting with embeddings.
Representing Queries as Vectors
• Think of a query as a bag of products associated with the query.
– Conversions, clicks, or even impressions.
– Trade-off between signal strength and sparsity.
• Compute query vector as average (mean pooling) of associated product vectors.
– Assumes that products can be mapped to vectors!
– Use pre-trained embeddings, fine-tuning, or can train from scratch.
• Product titles tend to be longer and more self-contained than queries.
– Especially on a marketplace where sellers optimize for findability.
Representing Queries as Vectors
The Devil is in the Details!
• Challenging to decide how similar is similar enough for equivalence.
– Threshold depends on data, application, and specific query.
– Similarity may not be the only goal, e.g., may be trying to increase recall.
• Evaluation of similarity model is critical -- but also challenging.
– Use human judgements of query pairs or end-to-end results.
– Ultimately need to A/B test the end-to-end search application.
• This approach only works when you can pre-compute query vectors offline.
– Associating queries with results at query time is too slow and expensive.
– Offline approach works for head and maybe torso, but not for tail.
Summary
• Query understanding is the process of inferring searcher’s intent from keywords.
• Spectrum from coarse-grained categorization to fine-grained equivalence.
• Categorization improves relevance, navigation, presentation, and analytics.
• Train query categorization using the categories of clicked and purchased products.
• Recognizing query equivalence establishes canonical representation of search intent.
• Model a query as average of associated product vectors from an embedding model.
For More information : arimandal@ebay.com LinkedIn : linkedin.com/in/aritram

More Related Content

What's hot

TPC-H Column Store and MPP systems
TPC-H Column Store and MPP systemsTPC-H Column Store and MPP systems
TPC-H Column Store and MPP systemsMostafa Mokhtar
 
Fuzzy Matching on Apache Spark with Jennifer Shin
Fuzzy Matching on Apache Spark with Jennifer ShinFuzzy Matching on Apache Spark with Jennifer Shin
Fuzzy Matching on Apache Spark with Jennifer ShinDatabricks
 
GraphFrames: DataFrame-based graphs for Apache® Spark™
GraphFrames: DataFrame-based graphs for Apache® Spark™GraphFrames: DataFrame-based graphs for Apache® Spark™
GraphFrames: DataFrame-based graphs for Apache® Spark™Databricks
 
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...Lucidworks
 
Modularized ETL Writing with Apache Spark
Modularized ETL Writing with Apache SparkModularized ETL Writing with Apache Spark
Modularized ETL Writing with Apache SparkDatabricks
 
Outrageous Ideas for Graph Databases
Outrageous Ideas for Graph DatabasesOutrageous Ideas for Graph Databases
Outrageous Ideas for Graph DatabasesMax De Marzi
 
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...Amazon Web Services
 
Neo4j Training Cypher
Neo4j Training CypherNeo4j Training Cypher
Neo4j Training CypherMax De Marzi
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialXavier Amatriain
 
Fine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsFine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsDatabricks
 
Creating an end-to-end Recommender System with Apache Spark and Elasticsearch...
Creating an end-to-end Recommender System with Apache Spark and Elasticsearch...Creating an end-to-end Recommender System with Apache Spark and Elasticsearch...
Creating an end-to-end Recommender System with Apache Spark and Elasticsearch...sparktc
 
AI Modernization at AT&T and the Application to Fraud with Databricks
AI Modernization at AT&T and the Application to Fraud with DatabricksAI Modernization at AT&T and the Application to Fraud with Databricks
AI Modernization at AT&T and the Application to Fraud with DatabricksDatabricks
 
The Ultimate Pagination for SEO
The Ultimate Pagination for SEOThe Ultimate Pagination for SEO
The Ultimate Pagination for SEOCatalyst
 
Real-time Analytics with Trino and Apache Pinot
Real-time Analytics with Trino and Apache PinotReal-time Analytics with Trino and Apache Pinot
Real-time Analytics with Trino and Apache PinotXiang Fu
 
엘라스틱 서치 세미나
엘라스틱 서치 세미나엘라스틱 서치 세미나
엘라스틱 서치 세미나종현 김
 
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...Koray Tugberk GUBUR
 
Spark SQL Deep Dive @ Melbourne Spark Meetup
Spark SQL Deep Dive @ Melbourne Spark MeetupSpark SQL Deep Dive @ Melbourne Spark Meetup
Spark SQL Deep Dive @ Melbourne Spark MeetupDatabricks
 

What's hot (20)

How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
 
TPC-H Column Store and MPP systems
TPC-H Column Store and MPP systemsTPC-H Column Store and MPP systems
TPC-H Column Store and MPP systems
 
Fuzzy Matching on Apache Spark with Jennifer Shin
Fuzzy Matching on Apache Spark with Jennifer ShinFuzzy Matching on Apache Spark with Jennifer Shin
Fuzzy Matching on Apache Spark with Jennifer Shin
 
Anatomy Of Ia
Anatomy Of IaAnatomy Of Ia
Anatomy Of Ia
 
GraphFrames: DataFrame-based graphs for Apache® Spark™
GraphFrames: DataFrame-based graphs for Apache® Spark™GraphFrames: DataFrame-based graphs for Apache® Spark™
GraphFrames: DataFrame-based graphs for Apache® Spark™
 
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
 
Modularized ETL Writing with Apache Spark
Modularized ETL Writing with Apache SparkModularized ETL Writing with Apache Spark
Modularized ETL Writing with Apache Spark
 
Outrageous Ideas for Graph Databases
Outrageous Ideas for Graph DatabasesOutrageous Ideas for Graph Databases
Outrageous Ideas for Graph Databases
 
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
Migrate Your Hadoop/Spark Workload to Amazon EMR and Architect It for Securit...
 
Neo4j Training Cypher
Neo4j Training CypherNeo4j Training Cypher
Neo4j Training Cypher
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
 
Fine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsFine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark Jobs
 
Power BI - Power Query
Power BI - Power QueryPower BI - Power Query
Power BI - Power Query
 
Creating an end-to-end Recommender System with Apache Spark and Elasticsearch...
Creating an end-to-end Recommender System with Apache Spark and Elasticsearch...Creating an end-to-end Recommender System with Apache Spark and Elasticsearch...
Creating an end-to-end Recommender System with Apache Spark and Elasticsearch...
 
AI Modernization at AT&T and the Application to Fraud with Databricks
AI Modernization at AT&T and the Application to Fraud with DatabricksAI Modernization at AT&T and the Application to Fraud with Databricks
AI Modernization at AT&T and the Application to Fraud with Databricks
 
The Ultimate Pagination for SEO
The Ultimate Pagination for SEOThe Ultimate Pagination for SEO
The Ultimate Pagination for SEO
 
Real-time Analytics with Trino and Apache Pinot
Real-time Analytics with Trino and Apache PinotReal-time Analytics with Trino and Apache Pinot
Real-time Analytics with Trino and Apache Pinot
 
엘라스틱 서치 세미나
엘라스틱 서치 세미나엘라스틱 서치 세미나
엘라스틱 서치 세미나
 
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
 
Spark SQL Deep Dive @ Melbourne Spark Meetup
Spark SQL Deep Dive @ Melbourne Spark MeetupSpark SQL Deep Dive @ Melbourne Spark Meetup
Spark SQL Deep Dive @ Melbourne Spark Meetup
 

Similar to Using AI to understand search intent

Search Behavior Patterns
Search Behavior PatternsSearch Behavior Patterns
Search Behavior PatternsRamzi Alqrainy
 
Query Understanding and Ecommerce
Query Understanding and EcommerceQuery Understanding and Ecommerce
Query Understanding and EcommerceDaniel Tunkelang
 
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comEnhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comSimon Hughes
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation SystemsRumman Chowdhury
 
Keyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEOKeyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEONeeraj Reddy
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionPerumalPitchandi
 
Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Duncan MacGruer
 
Tuning Up Site Search - IA Summit 2007
Tuning Up Site Search - IA Summit 2007Tuning Up Site Search - IA Summit 2007
Tuning Up Site Search - IA Summit 2007Chris Farnum
 
Data warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniquesData warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniquesVaibhav Khanna
 
Research Theory Pro-Forma (1).pptx
Research Theory Pro-Forma (1).pptxResearch Theory Pro-Forma (1).pptx
Research Theory Pro-Forma (1).pptxBenjaminBirney1
 
Great Survey Design
Great Survey DesignGreat Survey Design
Great Survey DesignSurveyGizmo
 
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppte3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
e3_chapter__5_evaluation_technics_HCeVpPLCvE.pptappstore15
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise SearchFindwise
 
ACIS 2015 Bibliographical-based Facets for Expertise Search
ACIS 2015 Bibliographical-based Facets for Expertise SearchACIS 2015 Bibliographical-based Facets for Expertise Search
ACIS 2015 Bibliographical-based Facets for Expertise SearchGan Keng Hoon
 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluationeShikshak
 
detailed notes AMR.pdf
detailed notes AMR.pdfdetailed notes AMR.pdf
detailed notes AMR.pdfRameshG767908
 

Similar to Using AI to understand search intent (20)

Search Behavior Patterns
Search Behavior PatternsSearch Behavior Patterns
Search Behavior Patterns
 
MMM, Search!
MMM, Search!MMM, Search!
MMM, Search!
 
Query Understanding and Ecommerce
Query Understanding and EcommerceQuery Understanding and Ecommerce
Query Understanding and Ecommerce
 
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comEnhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation Systems
 
Search Analytics - Comperio
Search Analytics - ComperioSearch Analytics - Comperio
Search Analytics - Comperio
 
Keyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEOKeyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEO
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
 
Haystacks slides
Haystacks slidesHaystacks slides
Haystacks slides
 
Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)
 
Tuning Up Site Search - IA Summit 2007
Tuning Up Site Search - IA Summit 2007Tuning Up Site Search - IA Summit 2007
Tuning Up Site Search - IA Summit 2007
 
Data warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniquesData warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniques
 
Research Theory Pro-Forma (1).pptx
Research Theory Pro-Forma (1).pptxResearch Theory Pro-Forma (1).pptx
Research Theory Pro-Forma (1).pptx
 
Great Survey Design
Great Survey DesignGreat Survey Design
Great Survey Design
 
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppte3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
meetup-talk
meetup-talkmeetup-talk
meetup-talk
 
ACIS 2015 Bibliographical-based Facets for Expertise Search
ACIS 2015 Bibliographical-based Facets for Expertise SearchACIS 2015 Bibliographical-based Facets for Expertise Search
ACIS 2015 Bibliographical-based Facets for Expertise Search
 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluation
 
detailed notes AMR.pdf
detailed notes AMR.pdfdetailed notes AMR.pdf
detailed notes AMR.pdf
 

Recently uploaded

一比一原版田纳西大学毕业证如何办理
一比一原版田纳西大学毕业证如何办理一比一原版田纳西大学毕业证如何办理
一比一原版田纳西大学毕业证如何办理F
 
💚 Call Girls Bahraich 9332606886 High Profile Call Girls You Can Get The S...
💚 Call Girls Bahraich   9332606886  High Profile Call Girls You Can Get The S...💚 Call Girls Bahraich   9332606886  High Profile Call Girls You Can Get The S...
💚 Call Girls Bahraich 9332606886 High Profile Call Girls You Can Get The S...Sareena Khatun
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfJOHNBEBONYAP1
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdfMatthew Sinclair
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查ydyuyu
 
一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书F
 
Washington Football Commanders Redskins Feathers Shirt
Washington Football Commanders Redskins Feathers ShirtWashington Football Commanders Redskins Feathers Shirt
Washington Football Commanders Redskins Feathers Shirtrahman018755
 
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样ayvbos
 
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查ydyuyu
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理F
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsMonica Sydney
 
Local Call Girls in Gomati 9332606886 HOT & SEXY Models beautiful and charmi...
Local Call Girls in Gomati  9332606886 HOT & SEXY Models beautiful and charmi...Local Call Girls in Gomati  9332606886 HOT & SEXY Models beautiful and charmi...
Local Call Girls in Gomati 9332606886 HOT & SEXY Models beautiful and charmi...Sareena Khatun
 
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime BalliaBallia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Balliameghakumariji156
 
Local Call Girls in Jharsuguda 9332606886 HOT & SEXY Models beautiful and ch...
Local Call Girls in Jharsuguda  9332606886 HOT & SEXY Models beautiful and ch...Local Call Girls in Jharsuguda  9332606886 HOT & SEXY Models beautiful and ch...
Local Call Girls in Jharsuguda 9332606886 HOT & SEXY Models beautiful and ch...Sareena Khatun
 
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...meghakumariji156
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsMonica Sydney
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"growthgrids
 
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdfMatthew Sinclair
 
一比一原版澳大利亚迪肯大学毕业证如何办理
一比一原版澳大利亚迪肯大学毕业证如何办理一比一原版澳大利亚迪肯大学毕业证如何办理
一比一原版澳大利亚迪肯大学毕业证如何办理SS
 

Recently uploaded (20)

一比一原版田纳西大学毕业证如何办理
一比一原版田纳西大学毕业证如何办理一比一原版田纳西大学毕业证如何办理
一比一原版田纳西大学毕业证如何办理
 
💚 Call Girls Bahraich 9332606886 High Profile Call Girls You Can Get The S...
💚 Call Girls Bahraich   9332606886  High Profile Call Girls You Can Get The S...💚 Call Girls Bahraich   9332606886  High Profile Call Girls You Can Get The S...
💚 Call Girls Bahraich 9332606886 High Profile Call Girls You Can Get The S...
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
 
一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书
 
Washington Football Commanders Redskins Feathers Shirt
Washington Football Commanders Redskins Feathers ShirtWashington Football Commanders Redskins Feathers Shirt
Washington Football Commanders Redskins Feathers Shirt
 
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
 
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
 
Local Call Girls in Gomati 9332606886 HOT & SEXY Models beautiful and charmi...
Local Call Girls in Gomati  9332606886 HOT & SEXY Models beautiful and charmi...Local Call Girls in Gomati  9332606886 HOT & SEXY Models beautiful and charmi...
Local Call Girls in Gomati 9332606886 HOT & SEXY Models beautiful and charmi...
 
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime BalliaBallia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
 
Local Call Girls in Jharsuguda 9332606886 HOT & SEXY Models beautiful and ch...
Local Call Girls in Jharsuguda  9332606886 HOT & SEXY Models beautiful and ch...Local Call Girls in Jharsuguda  9332606886 HOT & SEXY Models beautiful and ch...
Local Call Girls in Jharsuguda 9332606886 HOT & SEXY Models beautiful and ch...
 
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
 
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
 
一比一原版澳大利亚迪肯大学毕业证如何办理
一比一原版澳大利亚迪肯大学毕业证如何办理一比一原版澳大利亚迪肯大学毕业证如何办理
一比一原版澳大利亚迪肯大学毕业证如何办理
 

Using AI to understand search intent

  • 1. Using AI to Understand Search Intent MICES 2021 Aritra Mandal eBay Search June, 2021
  • 2. About Me Aritra Mandal is an applied researcher on the Search team at eBay. He focuses on search quality and is leveraging AI/ML, structured data, and knowledge graphs to improve the search engine that powers eBay’s marketplace. Aritra received his BEng in computer science from Birla Institute of Technology and his MS in computer and information science from Indiana University–Purdue University Indianapolis.
  • 3. Agenda • What is Query Understanding? • Using AI to Understand Search Intent – Query Categorization – Query Equivalence and Similarity • Summary
  • 4. What is Query Understanding? Query understanding is the process of inferring the intent of a search query from the searcher’s keywords. It takes place before the search engine retrieves and ranks results. Query understanding is useful not only for retrieval and ranking, but also for interface decisions, recommendations, promotions, analytics, etc.
  • 5. Spectrum of Query Intent • Coarse-Grained – Query Categorization – Broad vs. Narrow • Fine-Grained – Entity Recognition – Query Equivalence
  • 7. Benefits of Query Categorization • Relevance filtering: excludes out-of-category results that match keywords. – Especially useful for non-default sorts, like sorting by price. • Helps provide searchers accurate summary information about query results. – Accurate numbers for total number of results and category / facet counts. – Contextual appropriate sub-categories and facets, sorts, filters, etc. vs.
  • 8. Using AI for Query Categorization • Create labeled training data from clicks or purchases. – Query => clicked or purchased product => product category. – Identify the appropriate category with sufficient demand for each query. – Use fastText, BERT, or any text classification model. • Apply model to other queries -- especially torso and tail queries. – Compute categories for medium to low frequency queries online. Collect Query Engaged Items pairs Queries with Few or No engagements Use category Hierarchy to roll up and create training records Train Model Bert/FastText Category Prediction for Queries
  • 9. Query Similarity based on Intent • Multiple queries often map to the same intent. – e.g., mens shoes = shoes for men.
  • 10. Why group of queries with similar intent ? • If we can recognize queries with the same (or nearly the same) intent, we can: – Map poorly performing queries to better-performing equivalent ones. – Intelligently recover from queries that return few or no results. – Analyze search behavior grouping by intent, rather than by query. – Obtain better signals to train machine learning models, e.g., for ranking. • Recognizing query equivalence and similarity allows us to transform search queries into canonical representations of search intent, establishing a more robust foundation to optimize the search experience.
  • 11. Recognizing Query Equivalence: 2 Strategies • How do we recognize that two queries represent the same search intent? – Surface Query Similarity • stemming, word order, compounds, noise words • e.g., mens wristwatch = wrist watches for men – Post-Search Behavior • engagement (clicks, conversions) with similar results • requires a way to measure similar results (hold that thought!)
  • 12. Surface Query Similarity • The Good: – Easy to recognize queries that differ only in stemming, word order, etc. – High precision as a standalone indicator of query equivalence. – Decent coverage that can be extended (e.g., low-edit-distance query pairs). – Simple and explainable! • The Bad: – There are false positives, e.g., kiss != kisses; dress shirt != shirt dress. – Need guardrails to avoid embarrassing mistakes. – Limited coverage; extending it significantly increases risk of false positives.
  • 13. Post-Search Behavior • The Good: – Higher coverage than can be obtained from surface query similarity. – Learn from user behavior, which can look far beyond literal query tokens. – Use whole-query context for contextual expansion and relaxation. • The Bad: – Complex compared to surface similarity, stochastic, and difficult to explain. – Replace simple binary query equivalence with continuous similarity metric. – Picking the right similarity threshold is as much an art as a science. – Rely on a pipeline of black-box approaches, starting with embeddings.
  • 14. Representing Queries as Vectors • Think of a query as a bag of products associated with the query. – Conversions, clicks, or even impressions. – Trade-off between signal strength and sparsity. • Compute query vector as average (mean pooling) of associated product vectors. – Assumes that products can be mapped to vectors! – Use pre-trained embeddings, fine-tuning, or can train from scratch. • Product titles tend to be longer and more self-contained than queries. – Especially on a marketplace where sellers optimize for findability.
  • 16. The Devil is in the Details! • Challenging to decide how similar is similar enough for equivalence. – Threshold depends on data, application, and specific query. – Similarity may not be the only goal, e.g., may be trying to increase recall. • Evaluation of similarity model is critical -- but also challenging. – Use human judgements of query pairs or end-to-end results. – Ultimately need to A/B test the end-to-end search application. • This approach only works when you can pre-compute query vectors offline. – Associating queries with results at query time is too slow and expensive. – Offline approach works for head and maybe torso, but not for tail.
  • 17. Summary • Query understanding is the process of inferring searcher’s intent from keywords. • Spectrum from coarse-grained categorization to fine-grained equivalence. • Categorization improves relevance, navigation, presentation, and analytics. • Train query categorization using the categories of clicked and purchased products. • Recognizing query equivalence establishes canonical representation of search intent. • Model a query as average of associated product vectors from an embedding model. For More information : arimandal@ebay.com LinkedIn : linkedin.com/in/aritram