SlideShare a Scribd company logo
1 of 27
Download to read offline
10/1/2015
Axiomatic Analysis of Smoothing Methods in
Language Models for Pseudo-Relevance Feedback
HUSSEIN HAZIMEH AND CHENGXIANG ZHAI
UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
1
Pseudo Relevance Feedback
Judgments:
d1 +
d2 -
d3 +
…
dk -
...
Query Retrieval
Engine
Results:
d1 3.5
d2 2.4
…
dk 0.5
...
User
Document
collection
Judgments:
d1 +
d2 +
d3 +
…
dk -
...
top 10
Pseudo feedback
Assume top 10 docs
are relevant
Relevance feedback User judges documents
New
Query
Feedback
Learn from
Examples
2
Pseudo-Relevance Feedback
It’s blind!
Good for high recall information needs
A Blind Superhero. Courtesy of iStock
3
Collection-based Smoothing
Collection-based smoothing is generally used for LM-based retrieval
functions and for PRF models
A commonly used collection-based smoothing scheme is Dirichlet
prior smoothing:
Dirichlet Prior
(Smoothing Parameter)
Document Length
Count of Word in
Document
4
Study of Smoothing Methods in PRF
We will establish both analytically and empirically that collection-
based smoothing is not a good choice for PRF:
◦ It forces PRF models to select very common words
Additive smoothing will be shown to outperform the collection-
based counterpart
5
How Do LM PRF Models Work?
D1
Dn
… Averaging
Function: 𝑨
Scoring
Function: 𝒇
𝑃(𝑤|𝜃1)
𝑃(𝑤|𝜃 𝑛)
𝑃(𝑤|𝜃 𝐶)
𝑃(𝑤|𝜃 𝐹)
6
How Do LM PRF Models Work?
 The feedback LM, 𝜃 𝐹, would generally have the following form:
𝐴: ℝ 𝑛
→ ℝ is an averaging function, e.g. geometric mean
𝑓: ℝ2 → ℝ is a function increasing in the first argument and
decreasing in the second
Rewards common
words in feedback set
Penalizes common
words in collection
7
Problem!
The first argument rewards common words in the collection while the
second penalizes them. The analysis shows that the first argument
usually “wins”!
Rewards common
words in feedback set
and collection
Penalizes common
words in collection
Proportional to
𝑃(𝑤|Θ 𝐶)
8
Overview of the Analysis
We considered three PRF models in the study:
◦ Divergence Minimization Model
◦ Relevance Model
◦ Geometric Relevance Model
Next, we will briefly discuss how the DMM and GRM work and then
give an overview of the axiomatic analysis.
The analysis of the RM is very similar to the GRM and the same
results apply
9
Divergence Minimization Model
(Zhai and Lafferty, 2001)
The DMM solves the following optimization problem:
The solution has a closed form and is given by:
10
Geometric Relevance Model
(Seo and Croft, 2010)
An enhanced form of the Relevance Model (RM) that replaces the
arithmetic mean used in RM by the geometric mean:
Note that the function above is not is not affected by 𝑃(𝑤|𝜃𝑐), i.e.,
the model is not designed to penalize common words.
11
Main Axiom: IDF Effect
(Clinchant and Gaussier, 2013)
 Rationale: A PRF model is expected to penalize common words in
the collection in order to select high quality discriminative terms.
 Given any two words 𝑤1and 𝑤2 from the feedback set 𝐷1, 𝐷2,
12
DMM with Collection-based smoothing: IDF
Effect
Study the sign of:
Not straightforward. Strategy:
◦ Find an attainable lower bound on the expression above
◦ Study the sign of the lower bound
◦ If the lower bound is strictly positive, then DMM supports the IDF effect
13
DMM: Results of Analysis
Conclusion: Using collection-based smoothing the DMM will be either
consistently reward common terms or will select only one feedback term
14
GRM with Collection-based smoothing: IDF
Effect
The GRM cannot support the IDF effect:
It consistently rewards favors common words in the collection
15
Proposed Solution: Additive Smoothing
 Words get additional pseudo-counts:
 Next, we show how additive smoothing prevents the models from
rewarding common terms
16
DMM with Additive Smoothing: IDF Effect
The DMM unconditionally supports the IDF Effect:
Now it is performing the intended objective!
17
DMM: Empirical Validation
Query: “Computer”
18
GRM with Additive Smoothing: IDF Effect
Although the IDF effect is still not supported:
However, common terms are no longer being rewarded!
19
GRM: Empirical Validation
Query: “Computer”
20
Empirical Evaluation: Retrieval Measures
21
Empirical Evaluation: Robustness of
Additive Smoothing
22
Measuring the Discrimination of PRF Models
In previous studies, the average of the IDF of the top terms was used
as an indicator of how discriminative the terms selected by a PRF
method are
Such a measure might not work well in some cases
We propose the Discrimination Measure (DM):
≈ Expected Document Frequency
Constant
23
Empirical Evaluation: Discrimination Measure
A several-fold decrease in the expected document frequency
24
Conclusion
Collection-based smoothing forces PRF models to select very
common terms
◦ The same problem might exist in other applications where LMs are aggregated
Additive smoothing prevents PRF models from rewarding common
terms and increases the retrieval performance significantly
A new measure for quantifying PRF Discrimination
25
Future Work
Should PRF models penalize common words?
Analysis of other smoothing methods such as topic-based smoothing
Inspect areas, other than PRF, where collection-based smoothing is
used in aggregating language models
26
Thanks to SIGIR for the Student Travel Grant!
Thank you for Listening!
27

More Related Content

Similar to Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-Relevance Feedback

LRP for hand gesture recogntion.pptx
LRP for hand gesture recogntion.pptxLRP for hand gesture recogntion.pptx
LRP for hand gesture recogntion.pptxshamim806425
 
Low rank models for recommender systems with limited preference information
Low rank models for recommender systems with limited preference informationLow rank models for recommender systems with limited preference information
Low rank models for recommender systems with limited preference informationEvgeny Frolov
 
Case2_Best_Model_Final
Case2_Best_Model_FinalCase2_Best_Model_Final
Case2_Best_Model_FinalEric Esajian
 
Two Level Disambiguation Model for Query Translation
Two Level Disambiguation Model for Query TranslationTwo Level Disambiguation Model for Query Translation
Two Level Disambiguation Model for Query TranslationIJECEIAES
 
Machine learning - session 4
Machine learning - session 4Machine learning - session 4
Machine learning - session 4Luis Borbon
 
A Content Boosted Hybrid Recommendation System
A Content Boosted Hybrid Recommendation SystemA Content Boosted Hybrid Recommendation System
A Content Boosted Hybrid Recommendation SystemSeval Çapraz
 
An Integrated Solver For Optimization Problems
An Integrated Solver For Optimization ProblemsAn Integrated Solver For Optimization Problems
An Integrated Solver For Optimization ProblemsMonica Waters
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network ModelEric Esajian
 
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...Daniel Valcarce
 
Instance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural NetworksInstance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural NetworksITIIIndustries
 
(Gaurav sawant & dhaval sawlani)bia 678 final project report
(Gaurav sawant & dhaval sawlani)bia 678 final project report(Gaurav sawant & dhaval sawlani)bia 678 final project report
(Gaurav sawant & dhaval sawlani)bia 678 final project reportGaurav Sawant
 
Machine learning in credit risk modeling : a James white paper
Machine learning in credit risk modeling : a James white paperMachine learning in credit risk modeling : a James white paper
Machine learning in credit risk modeling : a James white paperJames by CrowdProcess
 
Implications of Ceiling Effects in Defect Predictors
Implications of Ceiling Effects in Defect PredictorsImplications of Ceiling Effects in Defect Predictors
Implications of Ceiling Effects in Defect Predictorsgregoryg
 
A Comparison of Game Theory and Multiple Objective Dynamic Prioritization 201...
A Comparison of Game Theory and Multiple Objective Dynamic Prioritization 201...A Comparison of Game Theory and Multiple Objective Dynamic Prioritization 201...
A Comparison of Game Theory and Multiple Objective Dynamic Prioritization 201...James McGalliard
 
Machine learning - session 3
Machine learning - session 3Machine learning - session 3
Machine learning - session 3Luis Borbon
 
Programming Paradigms Seminar 1
Programming Paradigms Seminar 1Programming Paradigms Seminar 1
Programming Paradigms Seminar 1neoxiuting
 
The effect of number of concepts on readability of schemas 2
The effect of number of concepts on readability of schemas 2The effect of number of concepts on readability of schemas 2
The effect of number of concepts on readability of schemas 2Saman Sara
 

Similar to Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-Relevance Feedback (20)

LRP for hand gesture recogntion.pptx
LRP for hand gesture recogntion.pptxLRP for hand gesture recogntion.pptx
LRP for hand gesture recogntion.pptx
 
Low rank models for recommender systems with limited preference information
Low rank models for recommender systems with limited preference informationLow rank models for recommender systems with limited preference information
Low rank models for recommender systems with limited preference information
 
Case2_Best_Model_Final
Case2_Best_Model_FinalCase2_Best_Model_Final
Case2_Best_Model_Final
 
Two Level Disambiguation Model for Query Translation
Two Level Disambiguation Model for Query TranslationTwo Level Disambiguation Model for Query Translation
Two Level Disambiguation Model for Query Translation
 
Developing Movie Recommendation System
Developing Movie Recommendation SystemDeveloping Movie Recommendation System
Developing Movie Recommendation System
 
Machine learning - session 4
Machine learning - session 4Machine learning - session 4
Machine learning - session 4
 
Decision tables
Decision tablesDecision tables
Decision tables
 
A Content Boosted Hybrid Recommendation System
A Content Boosted Hybrid Recommendation SystemA Content Boosted Hybrid Recommendation System
A Content Boosted Hybrid Recommendation System
 
An Integrated Solver For Optimization Problems
An Integrated Solver For Optimization ProblemsAn Integrated Solver For Optimization Problems
An Integrated Solver For Optimization Problems
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network Model
 
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
 
MIXHS12-Zhe
MIXHS12-ZheMIXHS12-Zhe
MIXHS12-Zhe
 
Instance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural NetworksInstance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural Networks
 
(Gaurav sawant & dhaval sawlani)bia 678 final project report
(Gaurav sawant & dhaval sawlani)bia 678 final project report(Gaurav sawant & dhaval sawlani)bia 678 final project report
(Gaurav sawant & dhaval sawlani)bia 678 final project report
 
Machine learning in credit risk modeling : a James white paper
Machine learning in credit risk modeling : a James white paperMachine learning in credit risk modeling : a James white paper
Machine learning in credit risk modeling : a James white paper
 
Implications of Ceiling Effects in Defect Predictors
Implications of Ceiling Effects in Defect PredictorsImplications of Ceiling Effects in Defect Predictors
Implications of Ceiling Effects in Defect Predictors
 
A Comparison of Game Theory and Multiple Objective Dynamic Prioritization 201...
A Comparison of Game Theory and Multiple Objective Dynamic Prioritization 201...A Comparison of Game Theory and Multiple Objective Dynamic Prioritization 201...
A Comparison of Game Theory and Multiple Objective Dynamic Prioritization 201...
 
Machine learning - session 3
Machine learning - session 3Machine learning - session 3
Machine learning - session 3
 
Programming Paradigms Seminar 1
Programming Paradigms Seminar 1Programming Paradigms Seminar 1
Programming Paradigms Seminar 1
 
The effect of number of concepts on readability of schemas 2
The effect of number of concepts on readability of schemas 2The effect of number of concepts on readability of schemas 2
The effect of number of concepts on readability of schemas 2
 

Recently uploaded

Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 

Recently uploaded (20)

Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 

Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-Relevance Feedback

  • 1. 10/1/2015 Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-Relevance Feedback HUSSEIN HAZIMEH AND CHENGXIANG ZHAI UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN 1
  • 2. Pseudo Relevance Feedback Judgments: d1 + d2 - d3 + … dk - ... Query Retrieval Engine Results: d1 3.5 d2 2.4 … dk 0.5 ... User Document collection Judgments: d1 + d2 + d3 + … dk - ... top 10 Pseudo feedback Assume top 10 docs are relevant Relevance feedback User judges documents New Query Feedback Learn from Examples 2
  • 3. Pseudo-Relevance Feedback It’s blind! Good for high recall information needs A Blind Superhero. Courtesy of iStock 3
  • 4. Collection-based Smoothing Collection-based smoothing is generally used for LM-based retrieval functions and for PRF models A commonly used collection-based smoothing scheme is Dirichlet prior smoothing: Dirichlet Prior (Smoothing Parameter) Document Length Count of Word in Document 4
  • 5. Study of Smoothing Methods in PRF We will establish both analytically and empirically that collection- based smoothing is not a good choice for PRF: ◦ It forces PRF models to select very common words Additive smoothing will be shown to outperform the collection- based counterpart 5
  • 6. How Do LM PRF Models Work? D1 Dn … Averaging Function: 𝑨 Scoring Function: 𝒇 𝑃(𝑤|𝜃1) 𝑃(𝑤|𝜃 𝑛) 𝑃(𝑤|𝜃 𝐶) 𝑃(𝑤|𝜃 𝐹) 6
  • 7. How Do LM PRF Models Work?  The feedback LM, 𝜃 𝐹, would generally have the following form: 𝐴: ℝ 𝑛 → ℝ is an averaging function, e.g. geometric mean 𝑓: ℝ2 → ℝ is a function increasing in the first argument and decreasing in the second Rewards common words in feedback set Penalizes common words in collection 7
  • 8. Problem! The first argument rewards common words in the collection while the second penalizes them. The analysis shows that the first argument usually “wins”! Rewards common words in feedback set and collection Penalizes common words in collection Proportional to 𝑃(𝑤|Θ 𝐶) 8
  • 9. Overview of the Analysis We considered three PRF models in the study: ◦ Divergence Minimization Model ◦ Relevance Model ◦ Geometric Relevance Model Next, we will briefly discuss how the DMM and GRM work and then give an overview of the axiomatic analysis. The analysis of the RM is very similar to the GRM and the same results apply 9
  • 10. Divergence Minimization Model (Zhai and Lafferty, 2001) The DMM solves the following optimization problem: The solution has a closed form and is given by: 10
  • 11. Geometric Relevance Model (Seo and Croft, 2010) An enhanced form of the Relevance Model (RM) that replaces the arithmetic mean used in RM by the geometric mean: Note that the function above is not is not affected by 𝑃(𝑤|𝜃𝑐), i.e., the model is not designed to penalize common words. 11
  • 12. Main Axiom: IDF Effect (Clinchant and Gaussier, 2013)  Rationale: A PRF model is expected to penalize common words in the collection in order to select high quality discriminative terms.  Given any two words 𝑤1and 𝑤2 from the feedback set 𝐷1, 𝐷2, 12
  • 13. DMM with Collection-based smoothing: IDF Effect Study the sign of: Not straightforward. Strategy: ◦ Find an attainable lower bound on the expression above ◦ Study the sign of the lower bound ◦ If the lower bound is strictly positive, then DMM supports the IDF effect 13
  • 14. DMM: Results of Analysis Conclusion: Using collection-based smoothing the DMM will be either consistently reward common terms or will select only one feedback term 14
  • 15. GRM with Collection-based smoothing: IDF Effect The GRM cannot support the IDF effect: It consistently rewards favors common words in the collection 15
  • 16. Proposed Solution: Additive Smoothing  Words get additional pseudo-counts:  Next, we show how additive smoothing prevents the models from rewarding common terms 16
  • 17. DMM with Additive Smoothing: IDF Effect The DMM unconditionally supports the IDF Effect: Now it is performing the intended objective! 17
  • 19. GRM with Additive Smoothing: IDF Effect Although the IDF effect is still not supported: However, common terms are no longer being rewarded! 19
  • 22. Empirical Evaluation: Robustness of Additive Smoothing 22
  • 23. Measuring the Discrimination of PRF Models In previous studies, the average of the IDF of the top terms was used as an indicator of how discriminative the terms selected by a PRF method are Such a measure might not work well in some cases We propose the Discrimination Measure (DM): ≈ Expected Document Frequency Constant 23
  • 24. Empirical Evaluation: Discrimination Measure A several-fold decrease in the expected document frequency 24
  • 25. Conclusion Collection-based smoothing forces PRF models to select very common terms ◦ The same problem might exist in other applications where LMs are aggregated Additive smoothing prevents PRF models from rewarding common terms and increases the retrieval performance significantly A new measure for quantifying PRF Discrimination 25
  • 26. Future Work Should PRF models penalize common words? Analysis of other smoothing methods such as topic-based smoothing Inspect areas, other than PRF, where collection-based smoothing is used in aggregating language models 26
  • 27. Thanks to SIGIR for the Student Travel Grant! Thank you for Listening! 27