SlideShare a Scribd company logo
1 of 22
Download to read offline
Extending BM25 with multiple query 
operators 
Roi Blanco 
Yahoo! Research 
Paolo Boldi 
Universitá degli studi di Milano
Overview 
• Motivation 
• Virtual regions and BM25’s event space 
• Experiments 
• Conclusions
Motivation 
• From document understanding to query 
understanding 
• Query rewrites (gazetteers, spell 
correction), named entity recognition, 
query suggestions, query categories, query 
segmentation ... 
• Detecting query intent, triggering verticals 
• direct target towards answers 
• richer interfaces
Signals for Ranking 
• Signals for ranking: matches of query terms 
in documents, query-independent quality 
measures, CTR, among others 
• Probabilistic IR models are all about 
counting 
• occurrences of terms in documents, in 
sets of documents, etc. 
• How to aggregate efficiently a large number 
of “different” counts 
• coming from the same terms 
• no double counts!
Searching for food 
• New York’s greatest pizza 
‣ New OR York’s OR greatest OR pizza 
‣ New AND York’s AND greatest AND 
pizza 
‣ New OR York OR great OR pizza 
‣ “New York” OR “great pizza” 
‣ “New York” AND “great pizza” 
‣ York < New AND great OR pizza 
• among many more.
“Refined” matching 
• Extract a number of virtual regions in the 
document that match some version of the 
query (operators) 
• Each region provides a different evidence 
of relevance (i.e. signal) 
• Aggregate the scores over the different 
regions 
• Ex. :“at least any two words in the query 
appear either consecutively or with an extra 
word between them”
Probability of Relevance 
0 10 20 30 40 50 
0.0000 0.0005 0.0010 0.0015 
tf 
P[relevant] 
!!!!!!! 
!! 
!! 
! 
!!! 
! 
!! 
! 
! 
! 
! 
!! 
!! 
! 
! 
! 
! 
! 
!! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! or 
2!gram 
2!and
R 
Et 
tft 
t ! q 
BM25 
• Term (tf) independence 
• Vague Prior over terms not 
appearing in the query 
• Eliteness - topical model that 
perturbs the word 
distribution 
• 2-poisson distribution of 
term frequencies over 
relevant and non-relevant 
documents 
Figure 2: BM25 plate diagram representing the over variable independence 
wBM25 
t,d = 
according to p(r | Q = q,D = d), or equivalently 
p(r | Q = q,D = d) 
p(r | Q = q,D = d) !q 
p(D = d | r,Q = q) 
p(D = d | r,Q = q) 
Y 
tft,ˆd 
widf 
tft,ˆd + k1 · t
Virtual regions 
• Different parts of the 
document provide 
different evidence of 
relevance 
• Create a (finite) set of 
(latent) artificial regions 
and re-weight 
p(tfj 
t,d = x | r) = 
! 
e!{0,1} 
p(tfj 
t,d = x | e, r)p(e | r) = 
! 
q!Q 
! 
e,!!{0,1} 
p(tfj 
t,d = x | !j = !, e, r)p(!j = ! | e, r)p(e | r) 
Et 
!j 
j !M 
t ! q 
R 
tfj 
t
Implementation 
• An operator maps a query to a set of 
queries, which could match a document 
• Each operator has a weight 
• The average term frequency in a document 
is 
ˆ tf t,d = 
! 
j!M 
wj · tf j 
t,d 
(1 − bj) + bj · |d|/avdl 
.
Remarks 
• Different saturation (eliteness) function? 
• learn the real functional shape! 
• log-logistic is good if the class-conditional 
distributions are drawn from an exp. 
family 
• Positions as variables? 
• kernel-like method or exp. #parameters 
• Apply operators on a per query or per 
query class basis?
Operator examples 
• BOW: maps a raw query to the set of 
queries whose elements are the single 
terms 
• p-grams: set of all p-gram of consecutive 
terms 
• p-and: all conjunctions of p arbitrary 
terms 
• segments: match only the “concepts” 
• Enlargement: some words might sneak in 
between the phrases/segments
Experimental set-up 
• TREC GOV2 
• 150 Topics (3 sets) 
• Web sample (~100GB) 
• 1000 + 400 Topics (2 sets) 
• Weight learning only (coordinate ascent) 
• Train/Test split, 10-fold CV
Research questions 
• R1: How do the operators work 
individually combined with BM25(F)? 
• R2: How does the combination of 
operators work? 
• R3: Is there a difference in difficult queries?
Single operators results 
TERA04 TERA05 TERA06 
MAP P@10 P@20 MAP P@10 P@20 MAP P@10 P@20 
BM25 0.2648 0.5327 0.5071 0.3228 0.6140 0.5600 0.2928 0.5380 0.5140 
BM25F 0.2697 0.5510 0.5143 0.3284 0.6200 0.5570 0.2935 0.5460 0.5160 
p-AND p = 2 0.2679 0.5429 0.5286 0.3396* 0.6260 0.5920 0.3069 0.5900 0.5300 
phrasal μ = 3 0.2673 0.5306 0.4939 0.3369* 0.6060 0.5790 0.3082* 0.5720 0.5290 
segment μ = 1 0.2685* 0.5143 0.4970 0.3274 0.6080 0.5580 0.3180* 0.5860 0.5350 
segment μ = 1.5 0.2695* 0.5347 0.5010 0.3272 0.6140 0.5620 0.3122* 0.5940 0.5460 
segment μ = 3 0.2690* 0.5143 0.5204 0.3295* 0.6300 0.5840 0.3277* 0.5940 0.5460 
p-grams p = 2,μ = 1 0.2786* 0.5530 0.5120 0.3294 0.5860 0.5470 0.3137* 0.5860 0.5470 
p-grams p = 3,μ = 1 0.2670 0.5060 0.4950 0.3272 0.5980 0.5560 0.3198* 0.6160 0.5600 
Table 2: Performance of single operators (* = statistical significance at p < 0.05 using a one-sided t-test with 
respect to BM25, MAP only). 
operator. The purpose of this experiment is to determine 
whether the sigmoid-like term-frequency normalizing func-tion 
is able to accommodate for different features which stem 
from the same source of evidence (matches of the query term 
in the document). Results are significantly better than the 
baseline and outperform state-of-the-art ranking functions 
that just use matching of query terms (note we are not 
adding query-independent evidence, like link information, 
click-through data, etc.). For instance, the best MAP value 
on each of the TREC topics 701-850 and examined 
the ten queries that produced the largest difference in pre-cision 
between our system and the BM25 baseline. Table shows the queries that obtained the greatest benefit from the 
use of segmentation and/or p-gram operators: in the right-most 
column of the table, we identified the segments (or the 
p-grams) that most contributed to the increased precision, 
allowing to retrieve relevant documents that BM25 missed 
(because they were ranked too low).
Combination of 
operators 
TERA04 TERA05 TERA06 
MAP P@10 P@20 MAP P@10 P@20 MAP P@10 P@20 
BM25 0.2648 0.5327 0.5071 0.3228 0.6140 0.5600 0.2928 0.5380 0.5140 
BM25F 0.2697 0.5510 0.5143 0.3284 0.6200 0.5570 0.2935 0.5460 0.5160 
BM25 + p-grams 0.2898* 0.5939 0.5531 0.3368* 0.6260 0.5800 0.3361* 0.6000 0.5048 
BM25 + segment 0.2866* 0.5653 0.5306 0.3285 0.5980 0.5710 0.3188 0.5720 0.5100 
BM25F + p-grams 0.2908* 0.5959 0.5541 0.3398* 0.6280 0.5830 0.3319* 0.5940 0.5350 
BM25F + segment 0.2869* 0.5653 0.5306 0.3282* 0.5920 0.5510 0.3315* 0.5940 0.5350 
Table 3: Performance of a combination of operators (* = statistical significance at p < 0.05 using a one-sided 
t-test with respect to BM25, MAP only). Configuration for p-gram is p = 2, μ = {1, 2, 3}, and for segment is 
μ = {1, 2, 3}. 
WEB WEB-phrasal 
MAP NDCG P@10 MAP NDCG P@10 
BM25 0.1553 0.2728 0.3892 0.1350 0.2345 0.3133 
BM25F 0.1722* 0.3008 0.4285 0.1575* 0.2842 0.3747 
BM25 + p-grams 0.1822* 0.3126 0.4390 0.1750* 0.3078 0.4010 
BM25 + segment 0.1810* 0.3126 0.4344 0.1725* 0.3029 0.4040 
BM25F + p-grams 0.1854* 0.3216 0.4400 0.1769* 0.3123 0.4101 
BM25F + segment 0.1842* 0.3190 0.4392 0.1749* 0.3165 0.4005 
Table 4: Performance of a combination of operators on the WEB collection (* = statistical significance at
TERA04 TERA05 TERA06 
Phrases vs regular 
MAP P@10 P@20 MAP P@10 P@20 MAP P@10 P@0.2648 0.5327 0.5071 0.3228 0.6140 0.5600 0.2928 0.5380 0.5140 
0.2697 0.5510 0.5143 0.3284 0.6200 0.5570 0.2935 0.5460 0.5160 
queries 
p-grams 0.2898* 0.5939 0.5531 0.3368* 0.6260 0.5800 0.3361* 0.6000 0.5048 
segment 0.2866* 0.5653 0.5306 0.3285 0.5980 0.5710 0.3188 0.5720 0.5100 
p-grams 0.2908* 0.5959 0.5541 0.3398* 0.6280 0.5830 0.3319* 0.5940 0.5350 
segment 0.2869* 0.5653 0.5306 0.3282* 0.5920 0.5510 0.3315* 0.5940 0.5350 
Performance of a combination of operators (* = statistical significance at p < 0.05 using a one-respect to BM25, MAP only). Configuration for p-gram is p = 2, μ = {1, 2, 3}, and for segment WEB WEB-phrasal 
MAP NDCG P@10 MAP NDCG P@10 
BM25 0.1553 0.2728 0.3892 0.1350 0.2345 0.3133 
BM25F 0.1722* 0.3008 0.4285 0.1575* 0.2842 0.3747 
BM25 + p-grams 0.1822* 0.3126 0.4390 0.1750* 0.3078 0.4010 
BM25 + segment 0.1810* 0.3126 0.4344 0.1725* 0.3029 0.4040 
BM25F + p-grams 0.1854* 0.3216 0.4400 0.1769* 0.3123 0.4101 
BM25F + segment 0.1842* 0.3190 0.4392 0.1749* 0.3165 0.4005 
Performance of a combination of operators on the WEB collection (* = statistical significance using a one-sided t-test with respect to BM25, MAP only). Configuration for p-gram is p for segment is μ = {1, 2, 3}. 
on Recommender systems, pages 5–6. 
[11] S. B¨uttcher, C. L. A. Clarke, and B. Lushman.
Examples (that worked) 
Topic no. BM25 MAP Operators MAP Query Which segment / p-gram helped? 
810 0.2759 0.5243 Timeshare resales “Timeshare resales” 
750 0.0903 0.3251 John Edwards womens’ issues “John Edwards” and “womens’ issue” 
806 0.1279 0.3446 Doctors Without Borders “Doctors Without Borders” 
834 0.2819 0.4859 Global positioning system earthquakes “Global positioning” 
745 0.1801 0.3828 Doomsday cults “Doomsday cult” 
835 0.1306 0.3222 Big Dig pork “Big Dig” 
732 0.1438 0.3309 U.S. cheese production “U.S.” and “cheese production” 
723 0.1187 0.3030 Executive privilege “Executive privilege” 
843 0.2420 0.4228 Pol Pot “Pol Pot” 
785 0.2717 0.4323 Ivory billed woodpecker “Ivory billed” 
Table 5: The ten queries that produced the largest gap between standard BM25 and our operator-based 
scoring. The methods in this table use a combination of BM25, segments and 2-grams. 
Research and development in information retrieval, 
pages 797–798. ACM, 2010. 
[20] Y. Li, B.-j. P. Hsu, C. Zhai, and K. Wang. 
Unsupervised Query Segmentation Using Clickthrough 
for Information Retrieval. Proceedings of the 34th 
international ACM SIGIR conference on Research and 
development in Information Retrieval, pages 285–294, 
2011. 
[21] T.-Y. Liu. Learning to Rank for Information Retrieval. 
[30] S. Robertson. The probability ranking principle in IR. 
Journal of documentation, 1977. 
[31] S. Robertson. On event spaces and probabilistic 
models in information retrieval. Information Retrieval, 
8(2):319–329, 2005. 
[32] S. Robertson and S. Walker. Some simple effective 
approximations to the 2-poisson model for 
probabilistic weighted retrieval. Proceedings of the 
SIGIR conference on Research and development in
Conclusions 
• Developed an extension for BM25 that 
accounts for different query operators 
(versions of the query) 
• The main idea is to score individually virtual 
regions that match each operator 
• aggregate the evidence per query term 
• squash the different signals 
• Future work 
• apply other query-dependent signals 
• select the right operators for a query
BM25 
p(r | Q = q,D = d) 
p(r | Q = q,D = d) !q 
p(D = d | r,Q = q) 
p(D = d | r,Q = q) 
!q 
! 
t!q 
p(Dt = dt | r) 
p(Dt = dt | r) 
!q 
" 
t!q,dt>0 
log 
p(Dt = dt | r) · p(Dt = 0 | r) 
p(Dt = dt | r) · p(Dt = 0 | r) 
!q 
" 
t!q,dt>0 
wt,d
Feature dependencies 
• Class-linearly dependent (or affine) features 
• add no extra evidence/signal 
• model overfitting (vs capacity) 
• Still, it is desirable to enrich the model with 
more involved features 
• Some features are surprisingly correlated 
• Positional information requires a large 
number of parameters to estimate 
! O(|D||q|)
Query concept segmentation 
• Queries are made up of basic conceptual 
units, comprising many words 
• “Indian summer victor herbert” 
• Spurious matches: “san jose airport” -> “san 
jose city airport” 
• Model to detect segments based on 
generative language models and Wikipedia 
• Relax matches using factors of the max 
ratio between span length and segment 
length

More Related Content

Viewers also liked

Document similarity with vector space model
Document similarity with vector space modelDocument similarity with vector space model
Document similarity with vector space modeldalal404
 
similarity measure
similarity measure similarity measure
similarity measure ZHAO Sam
 
Deep Learning for Information Retrieval
Deep Learning for Information RetrievalDeep Learning for Information Retrieval
Deep Learning for Information RetrievalRoelof Pieters
 
Genetic Algorithm by Example
Genetic Algorithm by ExampleGenetic Algorithm by Example
Genetic Algorithm by ExampleNobal Niraula
 
Physical Science Ch 2, sec 1 tools in science
Physical Science Ch 2, sec 1 tools in sciencePhysical Science Ch 2, sec 1 tools in science
Physical Science Ch 2, sec 1 tools in sciencemshenry
 
Boris Chan - FITC SCREENS - Becoming Social By Default on Mobile
Boris Chan - FITC SCREENS - Becoming Social By Default on MobileBoris Chan - FITC SCREENS - Becoming Social By Default on Mobile
Boris Chan - FITC SCREENS - Becoming Social By Default on MobileBoris Chan
 
Harry potter and the deathly hallows review.
Harry potter and the deathly hallows review.Harry potter and the deathly hallows review.
Harry potter and the deathly hallows review.Becca McPartland
 
Prunus africana “No chop um, no kill um, but keep um”: From an endangered spe...
Prunus africana “No chop um, no kill um, but keep um”: From an endangered spe...Prunus africana “No chop um, no kill um, but keep um”: From an endangered spe...
Prunus africana “No chop um, no kill um, but keep um”: From an endangered spe...Verina Ingram
 
Challenging Myths on HRM's Attitudes Towards Growth and Development 2008
Challenging Myths on HRM's Attitudes Towards Growth and Development 2008Challenging Myths on HRM's Attitudes Towards Growth and Development 2008
Challenging Myths on HRM's Attitudes Towards Growth and Development 2008Halifax Partnership
 
F:\Itag48 (53011810065)
F:\Itag48 (53011810065)F:\Itag48 (53011810065)
F:\Itag48 (53011810065)BenjamasS
 
Entity Linking via Graph-Distance Minimization
Entity Linking via Graph-Distance MinimizationEntity Linking via Graph-Distance Minimization
Entity Linking via Graph-Distance MinimizationRoi Blanco
 
The health benefits of walking and playing paskins
The health benefits of walking and playing   paskinsThe health benefits of walking and playing   paskins
The health benefits of walking and playing paskinsUCL
 
#ForoEGovAR | Construyendo Políticas Públicas para las Sociedades del Conoci...
 #ForoEGovAR | Construyendo Políticas Públicas para las Sociedades del Conoci... #ForoEGovAR | Construyendo Políticas Públicas para las Sociedades del Conoci...
#ForoEGovAR | Construyendo Políticas Públicas para las Sociedades del Conoci...CESSI ArgenTIna
 
Blue wrap recycling
Blue wrap recyclingBlue wrap recycling
Blue wrap recyclingCapital G
 
Sm250rink
Sm250rinkSm250rink
Sm250rinkRegina
 
Greater Halifax Partnership: A Catalyst for Economic Growth
Greater Halifax Partnership: A Catalyst for Economic GrowthGreater Halifax Partnership: A Catalyst for Economic Growth
Greater Halifax Partnership: A Catalyst for Economic GrowthHalifax Partnership
 
Dissertation lyn woodward cwm jan 2010
Dissertation   lyn woodward cwm jan 2010Dissertation   lyn woodward cwm jan 2010
Dissertation lyn woodward cwm jan 2010Francis Alamina
 

Viewers also liked (20)

Document similarity with vector space model
Document similarity with vector space modelDocument similarity with vector space model
Document similarity with vector space model
 
similarity measure
similarity measure similarity measure
similarity measure
 
Deep Learning for Information Retrieval
Deep Learning for Information RetrievalDeep Learning for Information Retrieval
Deep Learning for Information Retrieval
 
Genetic Algorithm by Example
Genetic Algorithm by ExampleGenetic Algorithm by Example
Genetic Algorithm by Example
 
Physical Science Ch 2, sec 1 tools in science
Physical Science Ch 2, sec 1 tools in sciencePhysical Science Ch 2, sec 1 tools in science
Physical Science Ch 2, sec 1 tools in science
 
Gic2011 aula3-ingles
Gic2011 aula3-inglesGic2011 aula3-ingles
Gic2011 aula3-ingles
 
Boris Chan - FITC SCREENS - Becoming Social By Default on Mobile
Boris Chan - FITC SCREENS - Becoming Social By Default on MobileBoris Chan - FITC SCREENS - Becoming Social By Default on Mobile
Boris Chan - FITC SCREENS - Becoming Social By Default on Mobile
 
Gic2012 aula7-ingles
Gic2012 aula7-inglesGic2012 aula7-ingles
Gic2012 aula7-ingles
 
Harry potter and the deathly hallows review.
Harry potter and the deathly hallows review.Harry potter and the deathly hallows review.
Harry potter and the deathly hallows review.
 
Prunus africana “No chop um, no kill um, but keep um”: From an endangered spe...
Prunus africana “No chop um, no kill um, but keep um”: From an endangered spe...Prunus africana “No chop um, no kill um, but keep um”: From an endangered spe...
Prunus africana “No chop um, no kill um, but keep um”: From an endangered spe...
 
Gic2011 aula7-ingles-theory
Gic2011 aula7-ingles-theoryGic2011 aula7-ingles-theory
Gic2011 aula7-ingles-theory
 
Challenging Myths on HRM's Attitudes Towards Growth and Development 2008
Challenging Myths on HRM's Attitudes Towards Growth and Development 2008Challenging Myths on HRM's Attitudes Towards Growth and Development 2008
Challenging Myths on HRM's Attitudes Towards Growth and Development 2008
 
F:\Itag48 (53011810065)
F:\Itag48 (53011810065)F:\Itag48 (53011810065)
F:\Itag48 (53011810065)
 
Entity Linking via Graph-Distance Minimization
Entity Linking via Graph-Distance MinimizationEntity Linking via Graph-Distance Minimization
Entity Linking via Graph-Distance Minimization
 
The health benefits of walking and playing paskins
The health benefits of walking and playing   paskinsThe health benefits of walking and playing   paskins
The health benefits of walking and playing paskins
 
#ForoEGovAR | Construyendo Políticas Públicas para las Sociedades del Conoci...
 #ForoEGovAR | Construyendo Políticas Públicas para las Sociedades del Conoci... #ForoEGovAR | Construyendo Políticas Públicas para las Sociedades del Conoci...
#ForoEGovAR | Construyendo Políticas Públicas para las Sociedades del Conoci...
 
Blue wrap recycling
Blue wrap recyclingBlue wrap recycling
Blue wrap recycling
 
Sm250rink
Sm250rinkSm250rink
Sm250rink
 
Greater Halifax Partnership: A Catalyst for Economic Growth
Greater Halifax Partnership: A Catalyst for Economic GrowthGreater Halifax Partnership: A Catalyst for Economic Growth
Greater Halifax Partnership: A Catalyst for Economic Growth
 
Dissertation lyn woodward cwm jan 2010
Dissertation   lyn woodward cwm jan 2010Dissertation   lyn woodward cwm jan 2010
Dissertation lyn woodward cwm jan 2010
 

Similar to Extending BM25 with multiple query operators

Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering odsc
 
Can Short Queries be Even Shorter?
Can Short Queries be Even Shorter?Can Short Queries be Even Shorter?
Can Short Queries be Even Shorter?Twitter Inc.
 
Scientific table type classification in digital library (DocEng 2012)
Scientific table type classification in digital library (DocEng 2012)Scientific table type classification in digital library (DocEng 2012)
Scientific table type classification in digital library (DocEng 2012)Seongchan Kim
 
Creating a Custom Serialization Format (Gophercon 2017)
Creating a Custom Serialization Format (Gophercon 2017)Creating a Custom Serialization Format (Gophercon 2017)
Creating a Custom Serialization Format (Gophercon 2017)Scott Mansfield
 
Another Way to Attack the BLOB: Server-side Access via PL/SQL and Perl
Another Way to Attack the BLOB: Server-side Access via PL/SQL and PerlAnother Way to Attack the BLOB: Server-side Access via PL/SQL and Perl
Another Way to Attack the BLOB: Server-side Access via PL/SQL and PerlRoy Zimmer
 
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...Cheng Chen
 
SciPy 2011 pandas lightning talk
SciPy 2011 pandas lightning talkSciPy 2011 pandas lightning talk
SciPy 2011 pandas lightning talkWes McKinney
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with RYanchang Zhao
 
10. Getting Spatial
10. Getting Spatial10. Getting Spatial
10. Getting SpatialFAO
 
Reif Regression Diagnostics I and II
Reif Regression Diagnostics I and IIReif Regression Diagnostics I and II
Reif Regression Diagnostics I and IIMegan Reif
 
qc-tools.ppt
qc-tools.pptqc-tools.ppt
qc-tools.pptAlpharoot
 
Clean, Learn and Visualise data with R
Clean, Learn and Visualise data with RClean, Learn and Visualise data with R
Clean, Learn and Visualise data with RBarbara Fusinska
 
Analysis update for GENEVA meeting 2011
Analysis update for GENEVA meeting 2011Analysis update for GENEVA meeting 2011
Analysis update for GENEVA meeting 2011USC
 
Analisis gap dan thurstone
Analisis gap dan thurstoneAnalisis gap dan thurstone
Analisis gap dan thurstoneM Taufiq Budi H
 
Clean, Learn and Visualise data with R
Clean, Learn and Visualise data with RClean, Learn and Visualise data with R
Clean, Learn and Visualise data with RBarbara Fusinska
 
Unsupervised learning
Unsupervised learning Unsupervised learning
Unsupervised learning AlexAman1
 

Similar to Extending BM25 with multiple query operators (20)

Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering
 
Can Short Queries be Even Shorter?
Can Short Queries be Even Shorter?Can Short Queries be Even Shorter?
Can Short Queries be Even Shorter?
 
Instructions
InstructionsInstructions
Instructions
 
Scientific table type classification in digital library (DocEng 2012)
Scientific table type classification in digital library (DocEng 2012)Scientific table type classification in digital library (DocEng 2012)
Scientific table type classification in digital library (DocEng 2012)
 
Creating a Custom Serialization Format (Gophercon 2017)
Creating a Custom Serialization Format (Gophercon 2017)Creating a Custom Serialization Format (Gophercon 2017)
Creating a Custom Serialization Format (Gophercon 2017)
 
Statistic report
Statistic reportStatistic report
Statistic report
 
Another Way to Attack the BLOB: Server-side Access via PL/SQL and Perl
Another Way to Attack the BLOB: Server-side Access via PL/SQL and PerlAnother Way to Attack the BLOB: Server-side Access via PL/SQL and Perl
Another Way to Attack the BLOB: Server-side Access via PL/SQL and Perl
 
Exome Sequencing
Exome SequencingExome Sequencing
Exome Sequencing
 
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
Big Data Competition: maximizing your potential
 exampled with the 2014 Higgs...
 
Assignment
AssignmentAssignment
Assignment
 
SciPy 2011 pandas lightning talk
SciPy 2011 pandas lightning talkSciPy 2011 pandas lightning talk
SciPy 2011 pandas lightning talk
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with R
 
10. Getting Spatial
10. Getting Spatial10. Getting Spatial
10. Getting Spatial
 
Reif Regression Diagnostics I and II
Reif Regression Diagnostics I and IIReif Regression Diagnostics I and II
Reif Regression Diagnostics I and II
 
qc-tools.ppt
qc-tools.pptqc-tools.ppt
qc-tools.ppt
 
Clean, Learn and Visualise data with R
Clean, Learn and Visualise data with RClean, Learn and Visualise data with R
Clean, Learn and Visualise data with R
 
Analysis update for GENEVA meeting 2011
Analysis update for GENEVA meeting 2011Analysis update for GENEVA meeting 2011
Analysis update for GENEVA meeting 2011
 
Analisis gap dan thurstone
Analisis gap dan thurstoneAnalisis gap dan thurstone
Analisis gap dan thurstone
 
Clean, Learn and Visualise data with R
Clean, Learn and Visualise data with RClean, Learn and Visualise data with R
Clean, Learn and Visualise data with R
 
Unsupervised learning
Unsupervised learning Unsupervised learning
Unsupervised learning
 

More from Roi Blanco

From Queries to Answers in the Web
From Queries to Answers in the WebFrom Queries to Answers in the Web
From Queries to Answers in the WebRoi Blanco
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataRoi Blanco
 
Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Roi Blanco
 
Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement Roi Blanco
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information RetrievalRoi Blanco
 
Searching over the past, present and future
Searching over the past, present and futureSearching over the past, present and future
Searching over the past, present and futureRoi Blanco
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Roi Blanco
 
Keyword Search over RDF Graphs
Keyword Search over RDF GraphsKeyword Search over RDF Graphs
Keyword Search over RDF GraphsRoi Blanco
 
Large-Scale Semantic Search
Large-Scale Semantic SearchLarge-Scale Semantic Search
Large-Scale Semantic SearchRoi Blanco
 
Energy-Price-Driven Query Processing in Multi-center Web Search Engines
Energy-Price-Driven Query Processing in Multi-center WebSearch EnginesEnergy-Price-Driven Query Processing in Multi-center WebSearch Engines
Energy-Price-Driven Query Processing in Multi-center Web Search EnginesRoi Blanco
 
Effective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataEffective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataRoi Blanco
 
Caching Search Engine Results over Incremental Indices
Caching Search Engine Results over Incremental IndicesCaching Search Engine Results over Incremental Indices
Caching Search Engine Results over Incremental IndicesRoi Blanco
 
Finding support sentences for entities
Finding support sentences for entitiesFinding support sentences for entities
Finding support sentences for entitiesRoi Blanco
 

More from Roi Blanco (13)

From Queries to Answers in the Web
From Queries to Answers in the WebFrom Queries to Answers in the Web
From Queries to Answers in the Web
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Mining Web content for Enhanced Search
Mining Web content for Enhanced Search
 
Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement Influence of Timeline and Named-entity Components on User Engagement
Influence of Timeline and Named-entity Components on User Engagement
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Searching over the past, present and future
Searching over the past, present and futureSearching over the past, present and future
Searching over the past, present and future
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations
 
Keyword Search over RDF Graphs
Keyword Search over RDF GraphsKeyword Search over RDF Graphs
Keyword Search over RDF Graphs
 
Large-Scale Semantic Search
Large-Scale Semantic SearchLarge-Scale Semantic Search
Large-Scale Semantic Search
 
Energy-Price-Driven Query Processing in Multi-center Web Search Engines
Energy-Price-Driven Query Processing in Multi-center WebSearch EnginesEnergy-Price-Driven Query Processing in Multi-center WebSearch Engines
Energy-Price-Driven Query Processing in Multi-center Web Search Engines
 
Effective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF dataEffective and Efficient Entity Search in RDF data
Effective and Efficient Entity Search in RDF data
 
Caching Search Engine Results over Incremental Indices
Caching Search Engine Results over Incremental IndicesCaching Search Engine Results over Incremental Indices
Caching Search Engine Results over Incremental Indices
 
Finding support sentences for entities
Finding support sentences for entitiesFinding support sentences for entities
Finding support sentences for entities
 

Recently uploaded

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 

Recently uploaded (20)

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 

Extending BM25 with multiple query operators

  • 1. Extending BM25 with multiple query operators Roi Blanco Yahoo! Research Paolo Boldi Universitá degli studi di Milano
  • 2. Overview • Motivation • Virtual regions and BM25’s event space • Experiments • Conclusions
  • 3. Motivation • From document understanding to query understanding • Query rewrites (gazetteers, spell correction), named entity recognition, query suggestions, query categories, query segmentation ... • Detecting query intent, triggering verticals • direct target towards answers • richer interfaces
  • 4. Signals for Ranking • Signals for ranking: matches of query terms in documents, query-independent quality measures, CTR, among others • Probabilistic IR models are all about counting • occurrences of terms in documents, in sets of documents, etc. • How to aggregate efficiently a large number of “different” counts • coming from the same terms • no double counts!
  • 5. Searching for food • New York’s greatest pizza ‣ New OR York’s OR greatest OR pizza ‣ New AND York’s AND greatest AND pizza ‣ New OR York OR great OR pizza ‣ “New York” OR “great pizza” ‣ “New York” AND “great pizza” ‣ York < New AND great OR pizza • among many more.
  • 6. “Refined” matching • Extract a number of virtual regions in the document that match some version of the query (operators) • Each region provides a different evidence of relevance (i.e. signal) • Aggregate the scores over the different regions • Ex. :“at least any two words in the query appear either consecutively or with an extra word between them”
  • 7. Probability of Relevance 0 10 20 30 40 50 0.0000 0.0005 0.0010 0.0015 tf P[relevant] !!!!!!! !! !! ! !!! ! !! ! ! ! ! !! !! ! ! ! ! ! !! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! or 2!gram 2!and
  • 8. R Et tft t ! q BM25 • Term (tf) independence • Vague Prior over terms not appearing in the query • Eliteness - topical model that perturbs the word distribution • 2-poisson distribution of term frequencies over relevant and non-relevant documents Figure 2: BM25 plate diagram representing the over variable independence wBM25 t,d = according to p(r | Q = q,D = d), or equivalently p(r | Q = q,D = d) p(r | Q = q,D = d) !q p(D = d | r,Q = q) p(D = d | r,Q = q) Y tft,ˆd widf tft,ˆd + k1 · t
  • 9. Virtual regions • Different parts of the document provide different evidence of relevance • Create a (finite) set of (latent) artificial regions and re-weight p(tfj t,d = x | r) = ! e!{0,1} p(tfj t,d = x | e, r)p(e | r) = ! q!Q ! e,!!{0,1} p(tfj t,d = x | !j = !, e, r)p(!j = ! | e, r)p(e | r) Et !j j !M t ! q R tfj t
  • 10. Implementation • An operator maps a query to a set of queries, which could match a document • Each operator has a weight • The average term frequency in a document is ˆ tf t,d = ! j!M wj · tf j t,d (1 − bj) + bj · |d|/avdl .
  • 11. Remarks • Different saturation (eliteness) function? • learn the real functional shape! • log-logistic is good if the class-conditional distributions are drawn from an exp. family • Positions as variables? • kernel-like method or exp. #parameters • Apply operators on a per query or per query class basis?
  • 12. Operator examples • BOW: maps a raw query to the set of queries whose elements are the single terms • p-grams: set of all p-gram of consecutive terms • p-and: all conjunctions of p arbitrary terms • segments: match only the “concepts” • Enlargement: some words might sneak in between the phrases/segments
  • 13. Experimental set-up • TREC GOV2 • 150 Topics (3 sets) • Web sample (~100GB) • 1000 + 400 Topics (2 sets) • Weight learning only (coordinate ascent) • Train/Test split, 10-fold CV
  • 14. Research questions • R1: How do the operators work individually combined with BM25(F)? • R2: How does the combination of operators work? • R3: Is there a difference in difficult queries?
  • 15. Single operators results TERA04 TERA05 TERA06 MAP P@10 P@20 MAP P@10 P@20 MAP P@10 P@20 BM25 0.2648 0.5327 0.5071 0.3228 0.6140 0.5600 0.2928 0.5380 0.5140 BM25F 0.2697 0.5510 0.5143 0.3284 0.6200 0.5570 0.2935 0.5460 0.5160 p-AND p = 2 0.2679 0.5429 0.5286 0.3396* 0.6260 0.5920 0.3069 0.5900 0.5300 phrasal μ = 3 0.2673 0.5306 0.4939 0.3369* 0.6060 0.5790 0.3082* 0.5720 0.5290 segment μ = 1 0.2685* 0.5143 0.4970 0.3274 0.6080 0.5580 0.3180* 0.5860 0.5350 segment μ = 1.5 0.2695* 0.5347 0.5010 0.3272 0.6140 0.5620 0.3122* 0.5940 0.5460 segment μ = 3 0.2690* 0.5143 0.5204 0.3295* 0.6300 0.5840 0.3277* 0.5940 0.5460 p-grams p = 2,μ = 1 0.2786* 0.5530 0.5120 0.3294 0.5860 0.5470 0.3137* 0.5860 0.5470 p-grams p = 3,μ = 1 0.2670 0.5060 0.4950 0.3272 0.5980 0.5560 0.3198* 0.6160 0.5600 Table 2: Performance of single operators (* = statistical significance at p < 0.05 using a one-sided t-test with respect to BM25, MAP only). operator. The purpose of this experiment is to determine whether the sigmoid-like term-frequency normalizing func-tion is able to accommodate for different features which stem from the same source of evidence (matches of the query term in the document). Results are significantly better than the baseline and outperform state-of-the-art ranking functions that just use matching of query terms (note we are not adding query-independent evidence, like link information, click-through data, etc.). For instance, the best MAP value on each of the TREC topics 701-850 and examined the ten queries that produced the largest difference in pre-cision between our system and the BM25 baseline. Table shows the queries that obtained the greatest benefit from the use of segmentation and/or p-gram operators: in the right-most column of the table, we identified the segments (or the p-grams) that most contributed to the increased precision, allowing to retrieve relevant documents that BM25 missed (because they were ranked too low).
  • 16. Combination of operators TERA04 TERA05 TERA06 MAP P@10 P@20 MAP P@10 P@20 MAP P@10 P@20 BM25 0.2648 0.5327 0.5071 0.3228 0.6140 0.5600 0.2928 0.5380 0.5140 BM25F 0.2697 0.5510 0.5143 0.3284 0.6200 0.5570 0.2935 0.5460 0.5160 BM25 + p-grams 0.2898* 0.5939 0.5531 0.3368* 0.6260 0.5800 0.3361* 0.6000 0.5048 BM25 + segment 0.2866* 0.5653 0.5306 0.3285 0.5980 0.5710 0.3188 0.5720 0.5100 BM25F + p-grams 0.2908* 0.5959 0.5541 0.3398* 0.6280 0.5830 0.3319* 0.5940 0.5350 BM25F + segment 0.2869* 0.5653 0.5306 0.3282* 0.5920 0.5510 0.3315* 0.5940 0.5350 Table 3: Performance of a combination of operators (* = statistical significance at p < 0.05 using a one-sided t-test with respect to BM25, MAP only). Configuration for p-gram is p = 2, μ = {1, 2, 3}, and for segment is μ = {1, 2, 3}. WEB WEB-phrasal MAP NDCG P@10 MAP NDCG P@10 BM25 0.1553 0.2728 0.3892 0.1350 0.2345 0.3133 BM25F 0.1722* 0.3008 0.4285 0.1575* 0.2842 0.3747 BM25 + p-grams 0.1822* 0.3126 0.4390 0.1750* 0.3078 0.4010 BM25 + segment 0.1810* 0.3126 0.4344 0.1725* 0.3029 0.4040 BM25F + p-grams 0.1854* 0.3216 0.4400 0.1769* 0.3123 0.4101 BM25F + segment 0.1842* 0.3190 0.4392 0.1749* 0.3165 0.4005 Table 4: Performance of a combination of operators on the WEB collection (* = statistical significance at
  • 17. TERA04 TERA05 TERA06 Phrases vs regular MAP P@10 P@20 MAP P@10 P@20 MAP P@10 P@0.2648 0.5327 0.5071 0.3228 0.6140 0.5600 0.2928 0.5380 0.5140 0.2697 0.5510 0.5143 0.3284 0.6200 0.5570 0.2935 0.5460 0.5160 queries p-grams 0.2898* 0.5939 0.5531 0.3368* 0.6260 0.5800 0.3361* 0.6000 0.5048 segment 0.2866* 0.5653 0.5306 0.3285 0.5980 0.5710 0.3188 0.5720 0.5100 p-grams 0.2908* 0.5959 0.5541 0.3398* 0.6280 0.5830 0.3319* 0.5940 0.5350 segment 0.2869* 0.5653 0.5306 0.3282* 0.5920 0.5510 0.3315* 0.5940 0.5350 Performance of a combination of operators (* = statistical significance at p < 0.05 using a one-respect to BM25, MAP only). Configuration for p-gram is p = 2, μ = {1, 2, 3}, and for segment WEB WEB-phrasal MAP NDCG P@10 MAP NDCG P@10 BM25 0.1553 0.2728 0.3892 0.1350 0.2345 0.3133 BM25F 0.1722* 0.3008 0.4285 0.1575* 0.2842 0.3747 BM25 + p-grams 0.1822* 0.3126 0.4390 0.1750* 0.3078 0.4010 BM25 + segment 0.1810* 0.3126 0.4344 0.1725* 0.3029 0.4040 BM25F + p-grams 0.1854* 0.3216 0.4400 0.1769* 0.3123 0.4101 BM25F + segment 0.1842* 0.3190 0.4392 0.1749* 0.3165 0.4005 Performance of a combination of operators on the WEB collection (* = statistical significance using a one-sided t-test with respect to BM25, MAP only). Configuration for p-gram is p for segment is μ = {1, 2, 3}. on Recommender systems, pages 5–6. [11] S. B¨uttcher, C. L. A. Clarke, and B. Lushman.
  • 18. Examples (that worked) Topic no. BM25 MAP Operators MAP Query Which segment / p-gram helped? 810 0.2759 0.5243 Timeshare resales “Timeshare resales” 750 0.0903 0.3251 John Edwards womens’ issues “John Edwards” and “womens’ issue” 806 0.1279 0.3446 Doctors Without Borders “Doctors Without Borders” 834 0.2819 0.4859 Global positioning system earthquakes “Global positioning” 745 0.1801 0.3828 Doomsday cults “Doomsday cult” 835 0.1306 0.3222 Big Dig pork “Big Dig” 732 0.1438 0.3309 U.S. cheese production “U.S.” and “cheese production” 723 0.1187 0.3030 Executive privilege “Executive privilege” 843 0.2420 0.4228 Pol Pot “Pol Pot” 785 0.2717 0.4323 Ivory billed woodpecker “Ivory billed” Table 5: The ten queries that produced the largest gap between standard BM25 and our operator-based scoring. The methods in this table use a combination of BM25, segments and 2-grams. Research and development in information retrieval, pages 797–798. ACM, 2010. [20] Y. Li, B.-j. P. Hsu, C. Zhai, and K. Wang. Unsupervised Query Segmentation Using Clickthrough for Information Retrieval. Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, pages 285–294, 2011. [21] T.-Y. Liu. Learning to Rank for Information Retrieval. [30] S. Robertson. The probability ranking principle in IR. Journal of documentation, 1977. [31] S. Robertson. On event spaces and probabilistic models in information retrieval. Information Retrieval, 8(2):319–329, 2005. [32] S. Robertson and S. Walker. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. Proceedings of the SIGIR conference on Research and development in
  • 19. Conclusions • Developed an extension for BM25 that accounts for different query operators (versions of the query) • The main idea is to score individually virtual regions that match each operator • aggregate the evidence per query term • squash the different signals • Future work • apply other query-dependent signals • select the right operators for a query
  • 20. BM25 p(r | Q = q,D = d) p(r | Q = q,D = d) !q p(D = d | r,Q = q) p(D = d | r,Q = q) !q ! t!q p(Dt = dt | r) p(Dt = dt | r) !q " t!q,dt>0 log p(Dt = dt | r) · p(Dt = 0 | r) p(Dt = dt | r) · p(Dt = 0 | r) !q " t!q,dt>0 wt,d
  • 21. Feature dependencies • Class-linearly dependent (or affine) features • add no extra evidence/signal • model overfitting (vs capacity) • Still, it is desirable to enrich the model with more involved features • Some features are surprisingly correlated • Positional information requires a large number of parameters to estimate ! O(|D||q|)
  • 22. Query concept segmentation • Queries are made up of basic conceptual units, comprising many words • “Indian summer victor herbert” • Spurious matches: “san jose airport” -> “san jose city airport” • Model to detect segments based on generative language models and Wikipedia • Relax matches using factors of the max ratio between span length and segment length