A new and promising approach to document clustering uses previously formed clusters of queries to cluster documents. To employ this approach in practice, a similarity measure for queries must be available. This requirement poses no problem for information retrieval systems in which both the search request formulations and the document representations are sets of weighted or unweighted index terms. However, in most operational retrieval systems, search request formulations are Boolean combinations of index terms. Research into similarity measures for search request formulations of this type has already been undertaken by the author and reported elsewhere. The present paper provides further results of investigations in this area. The novelty of the approach discussed is that it incorporates, within the methodology described earlier, a weighting mechanism that indicates the relative importance of particular attributes of a given Boolean search request formulation. The suggested modification is based on the standard probabilistic approach to information retrieval.
1 January 1982
Review Article
ON A PROBABILISTIC APPROACH TO DETERMINING THE SIMILARITY BETWEEN BOOLEAN SEARCH REQUEST FORMULATIONS
TADEUSZ RADECKI
Postgraduate School of Librarianship and Information Science, University of Sheffield. The author's permanent address: Main Library and Scientific Information Centre, Technical University of Wroclaw, Wybrzeze Wyspianskiego 27, 50–370 Wroclaw, Poland.
Publisher: Emerald Publishing
Online ISSN: 1758-7379
Print ISSN: 0022-0418
© MCB UP Limited 1982
Journal of Documentation (1982) 38 (1): 14–28.
Citation
RADECKI T (1982), "ON A PROBABILISTIC APPROACH TO DETERMINING THE SIMILARITY BETWEEN BOOLEAN SEARCH REQUEST FORMULATIONS". Journal of Documentation, Vol. 38 No. 1 pp. 14–28, doi: https://doi.org/10.1108/eb026719
