Amharic-English Information Retrieval with Pseudo Relevance Feedback
2008 (English)In: Advances In Multilingual And Multimodal Information Retrieval / [ed] Peters, C; Jikoun, V; Mandl, T; Muller, H; Oard, DW; Penas, A; Petras, V; Santos, D, 2008, Vol. 5152, 119-126 p.Conference paper (Refereed)
We describe cross language retrieval experiments using Amharic queries and English language d ocument collection. Two monolingual and eight bilingual runs were submitted with variations in terms of 1 sage of long and short queries, presence of pseudo relevance feedback (PRF), and approaches for word sense disambiguation (WSD). We used an Amharic-English machine readable dictionary (MRD), and an online Amharic-English dictionary for lookup translation of query terms. Out of dictionary Amharic query terms were considered as possible named entities, and further filtering was attained through restricted fuzzy matching based on edit distance which is calculated against automatically extracted English proper names. The obtained results indicate that longer queries tend to perform similar to short ones, PRF improves performance considerably, and that queries tend to fare better with WSD rather than using maximal expansion of terms by taking all the translations given in the MRD.
Place, publisher, year, edition, pages
2008. Vol. 5152, 119-126 p.
, Lecture Notes in Computer Science, ISSN 0302-9743 ; 5152
IdentifiersURN: urn:nbn:se:kth:diva-38411DOI: 10.1007/978-3-540-85760-0-15ISI: 000260420000015ScopusID: 2-s2.0-70349837963ISBN: 978-3-540-85759-4OAI: oai:DiVA.org:kth-38411DiVA: diva2:437931
8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007; Budapest; 19 September 2007 through 21 September 2007