Publication | Closed Access
Relevance based language models
Citations: 1K · References: 16 · Year: 2001 · Venue: unknown
Topics: Engineering, Intelligent Information Retrieval, Query Model, Semantics, Text Mining, Natural Language Processing, Information Retrieval, Data Science, Computational Linguistics, Relevance Feedback, Query Expansion, Language Studies, Language Models, Machine Translation, Knowledge Discovery, Classical Probabilistic Models, Retrieval Augmented Generation, Language Modeling Approaches, Linguistics
TL;DR: The paper relates classical probabilistic IR models to the emerging language-modeling approaches, observing that the main obstacle for the classical models is estimating relevance-model word probabilities. It proposes a technique that estimates these probabilities from the query alone, yielding relevance models that outperform baseline language-modeling systems on TREC retrieval and TDT tracking without any training data.
We explore the relation between classical probabilistic models of information retrieval and the emerging language modeling approaches. It has long been recognized that the primary obstacle to effective performance of classical models is the need to estimate a relevance model: probabilities of words in the relevant class. We propose a novel technique for estimating these probabilities using the query alone. We demonstrate that our technique can produce highly accurate relevance models, addressing important notions of synonymy and polysemy. Our experiments show relevance models outperforming baseline language modeling systems on TREC retrieval and TDT tracking tasks. The main contribution of this work is an effective formal method for estimating a relevance model with no training data.
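The abstract does not spell out the estimation formula, but the query-only relevance-model estimate is commonly written as P(w|R) ≈ Σ_D P(w|D)·P(Q|D), i.e. a mixture of document language models weighted by each document's query likelihood. A minimal sketch of that idea, assuming Dirichlet-smoothed unigram document models (the smoothing choice, the `mu` value, and all function names here are illustrative assumptions, not details from the paper):

```python
from collections import Counter

def unigram_model(doc_tokens, collection_prob, mu=2000.0):
    """Dirichlet-smoothed unigram model P(w|D) for one document.
    collection_prob maps word -> P(w|C); mu is the smoothing parameter."""
    counts = Counter(doc_tokens)
    n = len(doc_tokens)
    return lambda w: (counts[w] + mu * collection_prob.get(w, 0.0)) / (n + mu)

def relevance_model(query, docs, collection_prob, vocab):
    """Estimate P(w|R) from the query alone by marginalizing over
    document models: P(w|R) ~ sum_D P(w|D) * P(Q|D)."""
    models = [unigram_model(d, collection_prob) for d in docs]
    # Query likelihood P(Q|D) under each document model.
    weights = []
    for m in models:
        p = 1.0
        for q in query:
            p *= m(q)
        weights.append(p)
    total = sum(weights) or 1.0
    weights = [w / total for w in weights]  # posterior over documents
    # Mix the document models with those posterior weights.
    return {w: sum(wt * m(w) for wt, m in zip(weights, models))
            for w in vocab}
```

Because the mixture weights come only from query likelihoods, no relevance judgments or training data are needed, which matches the paper's stated contribution; words that co-occur with query terms in high-likelihood documents receive probability mass even if absent from the query, which is how the approach addresses synonymy.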
| Year | Citations |
|---|---|
| 1993 | 4.1K |
| 1998 | 1.7K |
| 1997 | 1.4K |
| 1990 | 1.2K |
| 1996 | 754 |
| 1999 | 679 |
| 2000 | 532 |
| 1998 | 523 |
| 1999 | 469 |
| 1977 | 465 |