Change search
ReferencesLink to record
Permanent link

Direct link
Dictionary-based amharic - English information retrieval
KTH, School of Information and Communication Technology (ICT), Computer and Systems Sciences, DSV.
KTH, School of Information and Communication Technology (ICT), Computer and Systems Sciences, DSV.
2005 (English)In: MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES / [ed] Peters, C; Clough, P; Gonzalo, J; Jones, GJF; Kluck, M; Magnini, B, BERLIN: SPRINGER-VERLAG BERLIN , 2005, Vol. 3491, 143-149 p.Conference paper (Refereed)
Abstract [en]

We present two approaches to the Amharic - English bilingual track in CLEF 2004. Both experiments use a dictionary based approach to translate the Amharic queries into English Bags-of-words, but while one approach removes non-content bearing words from the Amharic queries based on their IDF value, the other uses a list of English stop words to perform the same task. The resulting translated (English) terms are then submitted to a retrieval engine that supports the Boolean and vector-space models. In our experiments, the second approach (based on a list of English stop words) performs slightly better than the one based on IDF values for the Amharic terms.

Place, publisher, year, edition, pages
BERLIN: SPRINGER-VERLAG BERLIN , 2005. Vol. 3491, 143-149 p.
Keyword [en]
Computer aided language translation, Linguistics, Mathematical models, Query languages, Search engines, Vectors
National Category
Computer Science
URN: urn:nbn:se:kth:diva-43209ISI: 000231117600014ScopusID: 2-s2.0-24944554947ISBN: 3-540-27420-0OAI: diva2:447821
5th Workshop of the Cross-Language Evaluation Forum. Bath, ENGLAND. SEP 15-17, 2004
QC 20111013Available from: 2011-10-13 Created: 2011-10-13 Last updated: 2011-10-13Bibliographically approved

Open Access in DiVA

No full text


Search in DiVA

By author/editor
Argaw, Atelach AlemuAsker, Lars
By organisation
Computer and Systems Sciences, DSV
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 45 hits
ReferencesLink to record
Permanent link

Direct link