Change search
ReferencesLink to record
Permanent link

Direct link
Swedish full text retrieval: Effectiveness of different combinations of indexing strategies with query terms
University College of Borås.ORCID iD: 0000-0003-0229-3073
2006 (English)In: Information retrieval (Boston), ISSN 1386-4564, E-ISSN 1573-7659, Vol. 9, no 6, 681-697 p.Article in journal (Refereed) Published
Abstract [en]

In this paper, which treats Swedish full text retrieval, the problem of morphological variation of query terms in the document database is studied. The Swedish CLEF 2003 test collection was used, and the effects of combination of indexing strategies with query terms on retrieval effectiveness were studied. Four of the seven tested combinations involved indexing strategies that used normalization, a form of conflation. All of these four combinations employed compound splitting, both during indexing and at query phase. SWETWOL, a morphological analyzer for the Swedish language, was used for normalization and compound splitting. A fifth combination used stemming, while a sixth attempted to group related terms by right hand truncation of query terms. The truncation was performed by a search expert. These six combinations were compared to each other and to a baseline combination, where no attempt was made to counteract the problem of morphological variation of query terms in the document database. Both the truncation combination, the four combinations based on normalization and the stemming combination outperformed the baseline. Truncation had the best performance. The main conclusion of the paper is that truncation, normalization and stemming enhanced retrieval effectiveness in comparison to the baseline. Further, normalization and stemming were not far below truncation.

Place, publisher, year, edition, pages
2006. Vol. 9, no 6, 681-697 p.
National Category
Information Studies
URN: urn:nbn:se:kth:diva-171392DOI: 10.1007/s10791-006-9009-1ISI: 000240803200003OAI: diva2:843563

NR 20150817

Available from: 2015-07-29 Created: 2015-07-29 Last updated: 2015-08-17Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full text

Search in DiVA

By author/editor
Ahlgren, Per
In the same journal
Information retrieval (Boston)
Information Studies

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 50 hits
ReferencesLink to record
Permanent link

Direct link