Change search
ReferencesLink to record
Permanent link

Direct link
Sentiment Classification Techniques Applied to Swedish Tweets Investigating the Effects of translation on Sentiments from Swedish into English
KTH, School of Computer Science and Communication (CSC).
KTH, School of Computer Science and Communication (CSC).
2016 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesisAlternative title
Sentimentklassificeringstekniker applicerade på svenska Tweets för att underöka översättningens påverkan på sentiment vid översättning från svenska till engelska (Swedish)
Abstract [en]

Sentiment classification is generally used for many purposes such as business related aims and opinion gathering. In overall, since most text sources in the world wide web were written in English, available sentiments classifiers were trained on datasets written in English but rarely in other languages. This raised a curiosity and interest in investigating Sentiment Classification methods to implement on Swedish data. Therefor, this bachelor thesis examined to what extent the connotation of Swedish sentiments would be maintained/retained when translated into English. The research question was investigated by comparing the results given by applying Sentiment Classifications techniques. Further, an investigation of the outcomes of a combination of a lexicon based approach and a machine learning based approach by using machine translation on Swedish Tweets was made. The source data was in Swedish and gathered from Twitter, a naive lexicon based approach was used to score the polarity of the Tweets word by word and then a sum of polaritie was calculated.The swedish source data was translated into English, it was run through a supervised machine learning based classifier to where it was scored. In short, the outcomes of this investigation have shown promising results e.g. the translation did not affect the sentiments in a text but rather other circumstances did. These other circumstances was mostly due to cross-lingual sentiment classification problems and supervised machine learning classifiers character.

Place, publisher, year, edition, pages
National Category
Computer Science
URN: urn:nbn:se:kth:diva-186239OAI: diva2:926472
Available from: 2016-05-18 Created: 2016-05-07 Last updated: 2016-05-18Bibliographically approved

Open Access in DiVA

fulltext(960 kB)57 downloads
File information
File name FULLTEXT01.pdfFile size 960 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
School of Computer Science and Communication (CSC)
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 57 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 260 hits
ReferencesLink to record
Permanent link

Direct link