For the storage of information for subsequent retrieval of desired items, two stages of analysis are essential. The first is the determination of the subject content of a given article or paper; the second is the selection of certain words, groups of words, or classification headings by which the subject content is to be represented, either directly or by a suitable coding. Some workers still look forward to the day when it will be possible for the whole of a text to be read and ‘understood’ automatically by a machine; the ‘understanding’ process has been envisaged either as a process of selection of terms by the measure of word frequency, or word‐pair frequency (adjacent terms or terms not too far separated in one sentence) in the text, or by some process of automatic linguistic analysis. Such methods appear unsuitable for several reasons: language, as normally used, is a very difficult medium for exact expression (hence the value of mathematics) and few authors write well enough to avoid all ambiguities; a human reader accustomed to the subject can easily overcome any difficulties due to poor grammar, badly expressed arguments, excess brevity or prolixity in writing and even, sometimes, actual errors; a machine can not do so. Furthermore, the content of a paper is rarely of uniform importance throughout, and it is not worth recording, for subsequent retrieval, details which are merely repetitions of matters described earlier and better elsewhere, and not essential to the main purpose of the paper; for example, in a paper on evaporator design, a description of a standard method of analysis, applied to the contents of the evaporator in determining the efficiency of the design, will not be worth indexing; in a search for analytical methods, retrieval of such a paper would hardly be considered pertinent. A human reader, though far from infallible, can usefully make such judgments.
Article navigation
1 April 1965
Review Article|
April 01 1965
PROBLEMS IN ANALYSIS AND TERMINOLOGY FOR INFORMATION RETRIEVAL Available to Purchase
J. FARRADANE;
J. FARRADANE
Northampton College of Advanced Technology
Search for other works by this author on:
R.K. POULTON;
R.K. POULTON
Northampton College of Advanced Technology
Search for other works by this author on:
MRS S. DATTA
MRS S. DATTA
Northampton College of Advanced Technology
Search for other works by this author on:
Publisher: Emerald Publishing
Online ISSN: 1758-7379
Print ISSN: 0022-0418
© MCB UP Limited
1965
Journal of Documentation (1965) 21 (4): 287–290.
Citation
FARRADANE J, POULTON R, DATTA MS (1965), "PROBLEMS IN ANALYSIS AND TERMINOLOGY FOR INFORMATION RETRIEVAL". Journal of Documentation, Vol. 21 No. 4 pp. 287–290, doi: https://doi.org/10.1108/eb026380
Download citation file:
Suggested Reading
Multiple terminologies: an obstacle to information retrieval
Library Review (August,2004)
Telematics and retribalisation
Aslib Proceedings (March,1987)
Towards a procedure model in terminology management
Journal of Documentation (April,2005)
Bilingual terminology extraction using multi‐level termhood
The Electronic Library (April,2012)
Challenges and issues in terminology mapping: a digital library perspective
The Electronic Library (December,2005)
Related Chapters
Effects of Terminology on Health Queries: An Analysis by User’s Health Literacy and Topic Familiarity
Current Issues in Libraries, Information Science and Related Fields
Chapter 5 Ground clock: stratigraphy and terminology
A Short Course in Geology for Civil Engineers
Recommended for you
These recommendations are informed by your reading behaviors and indicated interests.
