The Probabilistic Relevance Framework (PRF) is a formal framework for document retrieval, grounded in work done in the 1970–1980s, which led to the development of one of the most successful text-retrieval algo¬rithms, BM25. In recent years, research in the PRF has yielded new retrieval models capable of taking into account document meta-data (especially structure and link-graph information). Again, this has led to one of the most successful Web-search and corporate-search algo¬rithms, BM25F. This work presents the PRF from a conceptual point of view, describing the probabilistic modelling assumptions behind the framework and the different ranking algorithms that result from its application: the binary independence model, relevance feedback mod¬els, BM25 and BM25F. It also discusses the relation between the PRF and other statistical models for IR, and covers some related topics, such as the use of non-textual features, and parameter optimisation for models with free parameters.
Article navigation
29 November 2009
Research Article|
September 17 2009
The Probabilistic Relevance Framework: BM25 and Beyond
Stephen Robertson;
Stephen Robertson
Microsoft Research
, 7 J J Thomson Avenue, Cambridge CB3 0FB, UK
Search for other works by this author on:
Hugo Zaragoza
Hugo Zaragoza
Yahoo! Research, Av. Diagonal 177, Barcelona 08028,
Spain
Search for other works by this author on:
Online ISSN: 1554-0677
Print ISSN: 1554-0669
© 2010 S. Robertson and H. Zaragoza
2010
S. Robertson and H. Zaragoza
Licensed re-use rights only
Foundations and Trends in Information Retrieval (2009) 4 (1-2): 1–174.
Citation
Robertson S, Zaragoza H (2009), "The Probabilistic Relevance Framework: BM25 and Beyond". Foundations and Trends in Information Retrieval, Vol. 4 No. 1-2 pp. 1–174, doi: https://doi.org/10.1561/1500000019
Download citation file:
Suggested Reading
Selecting a text similarity measure for a content-based recommender system: A comparison in two corpora
The Electronic Library (July,2019)
RETRACTED: Queries related to COVID-19: a more effective retrieval through finetuned ALBERT with BM25L question answering system
World Journal of Engineering (May,2021)
Finding target and constraint concepts for XML query construction
International Journal of Web Information Systems (November,2015)
A new term‐weighting scheme for naïve Bayes text categorization
International Journal of Web Information Systems (March,2012)
A study of user profile representation for personalized cross-language information retrieval
Aslib Journal of Information Management (July,2016)
Related Chapters
Emotional Analysis of Online Reviews on E-Business Platforms
Internet+ and Electronic Business in China: Innovation and Applications
E-Business Platform Information Search Services
Internet+ and Electronic Business in China: Innovation and Applications
Wonder and the Rhythm of Becoming (Healthy and Vital)
Post-Qualitative Inquiry in Sport, Health and Physical Education
Recommended for you
These recommendations are informed by your reading behaviors and indicated interests.
