Dictionary-based Amharic-French information retrieval
2006 (English)In: Accessing Multilingual Information Repositories / [ed] Peters, C; Gey, FC; Gonzalo, J; Muller, H; Jones, GJF; Kluck, M; Magnini, B; DeRijke, M, 2006, Vol. 4022, 83-92 p.Conference paper (Refereed)
We present four approaches to the Amharic - French bilingual track at CLEF 2005. All experiments use a dictionary based approach to translate the Amharic queries into French Bags-of-words, but while one approach uses word sense discrimination on the translated side of the queries, the other one includes all senses of a translated word in the query for searching. We used two search engines: The SICS experimental engine and Lucene, hence four runs with the two approaches. Non-content bearing words were removed both before and after the dictionary lookup. TF/IDF values supplemented by a heuristic function was used to remove the stop words from the Amharic queries and two French stopwords lists were used to remove them from the French translations. In our experiments, we found that the SICS search engine performs better than Lucene and that using the word sense discriminated keywords produce a slightly better result than the full set of non discriminated keywords.
Place, publisher, year, edition, pages
2006. Vol. 4022, 83-92 p.
, Lecture Notes in Computer Science, ISSN 0302-9743 ; 4022
Computer and Information Science
IdentifiersURN: urn:nbn:se:kth:diva-42016ISI: 000241359000009ScopusID: 2-s2.0-33749642299ISBN: 3-540-45697-XOAI: oai:DiVA.org:kth-42016DiVA: diva2:446060
6th Workshop of the Cross-Language Evaluation Forum Location: Vienna, Austria, Date: SEP 21-23, 2005
QC 201110062011-10-062011-10-052011-10-06Bibliographically approved