The Impact of Phrases in Document Clustering for Swedish
2005 (English). In: Proceedings of the 15th NODALIDA conference, Joensuu 2005 / [ed] Werner, S., 2005, 173-179 p. Conference paper (Refereed)
We have investigated the impact of using phrases, in different ways, in the vector space model for clustering documents in Swedish. The investigation is carried out on two text sets from different domains: one set of newspaper articles and one set of medical papers. The use of phrases does not improve results relative to the ordinary use of words. The results differ significantly between the text types. This indicates that one could benefit from different text representations for different domains, although a fundamentally different approach would probably be needed.
Place, publisher, year, edition, pages
2005. 173-179 p.
Identifiers
URN: urn:nbn:se:kth:diva-7122, ISBN: 952-458-771-8, OAI: oai:DiVA.org:kth-7122, DiVA: diva2:12038
NoDaLiDa 2005, Joensuu, Finland, 2005
QC 20100806. 2005-09-29, 2005-09-29, 2010-12-20. Bibliographically approved