Dictionary based Amharic - English information retrieval
2004 (English)In: CEUR Workshop Proceedings, CEUR-WS , 2004Conference paper (Refereed)
We present two approaches to the Amharic - English bilingual track in CLEF 2004. Both experiments use a dictionary based approach to translate the Amharic queries into English Bags-of-words, but while one approach removes non-content bearing words from the Amharic queries based on their IDF value, the other uses a list of English stop words to perform the same task. The resulting translated (English) terms are then submitted to a retrieval engine that supports the Boolean and vector-space models. In our experiments, the second approach (based on a list of English stop words) performs slightly better than the one based on IDF values for the Amharic terms.
Place, publisher, year, edition, pages
CEUR-WS , 2004.
Vector spaces, Amharic queries, Bilingual tracks, CLEF 2004, Retrieval engines, Stop word, Vector space models, Digital libraries
Electrical Engineering, Electronic Engineering, Information Engineering
IdentifiersURN: urn:nbn:se:kth:diva-196210ScopusID: 2-s2.0-84978086860OAI: oai:DiVA.org:kth-196210DiVA: diva2:1046412
2004 Cross Language Evaluation Forum Workshop, CLEF 2004, co-located with the 8th European Conference on Digital Libraries, ECDL 2004, 15 September 2004 through 17 September 2004
Conference Paper. QC 201611142016-11-142016-11-142016-11-14Bibliographically approved