Embed presentation
Download to read offline









The document discusses alternative set theoretic models in information retrieval, focusing on the extended boolean model which integrates features of vector models with boolean algebra through partial matching and term weighting. It highlights the limitations of the traditional boolean model, proposing a more nuanced approach that accounts for relevance ranking by considering normalized tf-idf factors and generalizing query representations in a multi-dimensional space. The document also touches on the flexibility of the model, indicating its ability to behave as various types of distance-based models by adjusting parameters.
Overview of information retrieval by Prof Neeraj Bhargava and Vaibhav Khanna from MDSU.
Introduction to three models: Set-Based, Extended Boolean, and Fuzzy Set Models.
The Extended Boolean Model introduces partial matching and term weighting, combining Vector model properties.
Conjunctive Boolean queries lack flexibility; a single term's presence can misrepresent relevance.
Two-dimensional representation of queries and documents based on term presence.
Weights in queries are computed using normalized tf-idf factors for better representation.
Extension of the model to t-dimensional space using p-norms for distance measurement.
Varying p allows behaviors of vector and fuzzy models, enhancing query generality and processing.
Task to explain the Extended Boolean Model further, reinforcing learning objectives.








