Full‐text documents are usually searched by means of a Boolean retrieval algorithm that requires the user to specify the logical relationships between the terms of a query. In this paper, we summarise the results to date of a continuing programme of research at the University of Sheffield to investigate the use of nearest‐neighbour retrieval algorithms for full‐text searching. Given a natural‐language query statement, our methods result in a ranking of the paragraphs comprising a full‐text document in order of decreasing similarity with the query, where the similarity for each paragraph is determined by the number of keyword stems that it has in common with the query. A full‐text document test collection has been created to allow systematic tests of retrieval effectiveness to be carried out. Experiments with this collection demonstrate that nearest‐neighbour searching provides a means for paragraph‐based access to full‐text documents that is of comparable effectiveness to both Boolean and hypertext searching and that index term weighting schemes which have been developed for the searching of bibliographical databases can also be used to improve the effectiveness of retrieval from full‐text databases. A current project is investigating the extent to which a paragraph‐based full‐text retrieval system can be used to augment the explication facilities of an expert system on welding.
Article navigation
Review Article|
March 01 1991
Using nearest‐neighbour searching techniques to access full‐text documents Available to Purchase
Suliman Al‐Hawamdeh;
Suliman Al‐Hawamdeh
Department of Information Studies, University of Sheffield, Western Bank, Sheffield S10 2TN, UK
Search for other works by this author on:
Rachel de Vere;
Rachel de Vere
Department of Information Studies, University of Sheffield, Western Bank, Sheffield S10 2TN, UK
Search for other works by this author on:
Geoff Smith;
Geoff Smith
Department of Information Studies, University of Sheffield, Western Bank, Sheffield S10 2TN, UK
Search for other works by this author on:
Peter Willett
Peter Willett
Department of Information Studies, University of Sheffield, Western Bank, Sheffield S10 2TN, UK
Search for other works by this author on:
Publisher: Emerald Publishing
Online ISSN: 2396-9091
Print ISSN: 0309-314X
© MCB UP Limited
1991
Online Review (1991) 15 (3-4): 173–191.
Citation
Al‐Hawamdeh S, de Vere R, Smith G, Willett P (1991), "Using nearest‐neighbour searching techniques to access full‐text documents". Online Review, Vol. 15 No. 3-4 pp. 173–191, doi: https://doi.org/10.1108/eb024372
Download citation file:
201
Views
Suggested Reading
A METHOD FOR DETERMINING k‐NEAREST NEIGHBOURS
Kybernetes (April,1978)
NEAREST NEIGHBOUR SEARCHING IN BINARY SEARCH TREES: SIMULATION OF A MULTIPROCESSOR SYSTEM
Journal of Documentation (February,1987)
Binary k‐nearest neighbor for text categorization
Online Information Review (August,2005)
Robust dual-tone multi-frequency tone detection using k-nearest neighbour classifier for a noisy environment
Applied Computing and Informatics (April,2021)
Not reasonably practicable: are there now greater opportunities for abuse by a nearest relative?
The Journal of Adult Protection (February,2015)
Related Chapters
Nearest Neighbor Imputation for General Parameter Estimation in Survey Sampling
The Econometrics of Complex Survey Data: Theory and Applications
Educational Data Mining for Peer Assessment in Communities of Learners
The Future of Innovation and Technology in Education: Policies and Practices for Teaching and Learning Excellence
Three-dimensional soil profiles based on the geotechnical probes data
clustering
Geotechnical Engineering for Infrastructure and Development: XVI European Conference on Soil Mechanics and Geotechnical Engineering
Recommended for you
These recommendations are informed by your reading behaviors and indicated interests.
