Term conflation methods in information retrievalNon‐linguistic and linguistic approaches

Galvez, Carmen; de Moya‐Anegón, Félix; Solana, Víctor H.

doi:10.1108/00220410510607507

Article navigation

Conceptual Paper| August 01 2005

Term conflation methods in information retrieval: Non‐linguistic and linguistic approaches

Carmen Galvez;

Carmen Galvez

Department of Information Science, University of Granada, Granada, Spain

Search for other works by this author on:

This Site

PubMed

Google Scholar

Félix de Moya‐Anegón;

Félix de Moya‐Anegón

Department of Information Science, University of Granada, Granada, Spain

Search for other works by this author on:

This Site

PubMed

Google Scholar

Víctor H. Solana

Department of Information Science, University of Granada, Granada, Spain

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Publisher: Emerald Publishing

Online ISSN: 1758-7379

Print ISSN: 0022-0418

2005

Journal of Documentation (2005) 61 (4): 520–547.

https://doi.org/10.1108/00220410510607507

Purpose

To propose a categorization of the different conflation procedures at the two basic approaches, non‐linguistic and linguistic techniques, and to justify the application of normalization methods within the framework of linguistic techniques.

Design/methodology/approach

Presents a range of term conflation methods, that can be used in information retrieval. The uniterm and multiterm variants can be considered equivalent units for the purposes of automatic indexing. Stemming algorithms, segmentation rules, association measures and clustering techniques are well evaluated non‐linguistic methods, and experiments with these techniques show a wide variety of results. Alternatively, the lemmatisation and the use of syntactic pattern‐matching, through equivalence relations represented in finite‐state transducers (FST), are emerging methods for the recognition and standardization of terms.

Findings

The survey attempts to point out the positive and negative effects of the linguistic approach and its potential as a term conflation method.

Originality/value

Outlines the importance of FSTs for the normalization of term variants.

2005

You do not currently have access to this content.

Don't already have an account? Register

Term conflation methods in information retrieval: Non‐linguistic and linguistic approaches

Email Alerts

Cited By

Term conflation methods in information retrieval: Non‐linguistic and linguistic approaches Available to Purchase

Sign in

Client Account

ICE Member Sign In

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Term conflation methods in information retrieval: Non‐linguistic and linguistic approaches