Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Viability of Sentiment Analysis for Troll Detection on Twitter: A Comparative Study Between the Naive Bayes and Maximum Entropy Algorithms
KTH, School of Computer Science and Communication (CSC).
KTH, School of Computer Science and Communication (CSC).
2016 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesisAlternative title
Undersökning av sentimental analys för att hitta troll på Twitter : En jämförande studie mellan sentimentalanalysalgoritmerna Naive Bayes och Maximum Entropy (Swedish)
Abstract [en]

In this study, we investigated whether sentiment analysis could prove to be a viable tool for troll detection on Twitter. The reason why sentiment analysis was analyzed as a possible tool was because of earlier work recognizing it as a feature that could be interesting to examine. By performing two different sentiment analysis methods, Naive Bayes and Maximum Entropy, an idea could be gathered of how well these approaches perform and whether they are viable for troll detection.

The data set used was a set of 3000 tweets under the hashtag #BlackLivesMatter. Sentiment analysis was performed on the data set with both the Naive Bayes and Maximum Entropy approaches. The data was then clustered and visually presented in graphs.

The results showed that sentiment analysis was not viable as a metric alone. However, together with other metrics it could prove to be useful. Ultimately, for k-means clustering, Maximum Entropy seemed to be the preferable sentiment analysis approach when looking at specific users whereas Naive Bayes performed better when researching individual tweets. As for finding trolls, a general conclusion on the viability of the algorithms could not be drawn, however Maximum Entropy was concluded to be preferable in this specific study.

Place, publisher, year, edition, pages
2016.
National Category
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-186443OAI: oai:DiVA.org:kth-186443DiVA: diva2:927326
Supervisors
Examiners
Available from: 2016-05-12 Created: 2016-05-11 Last updated: 2016-05-12Bibliographically approved

Open Access in DiVA

fulltext(1032 kB)1895 downloads
File information
File name FULLTEXT01.pdfFile size 1032 kBChecksum SHA-512
bdbfe03a15e7caa87fb5027be48b2a2d739c894e1f3280ba5a1fa05e4ccb6f30918cc54088e25c1ef2d219a43e205d68e74c4bf2faf70bf88c422ce09ddb692d
Type fulltextMimetype application/pdf

By organisation
School of Computer Science and Communication (CSC)
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 1895 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 1072 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf