Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Detecting Sockpuppets in Social Media with Plagiarism Detection Algorithms
KTH, School of Computer Science and Communication (CSC).
2017 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesisAlternative title
Identifikation av Strumpdockor inom Social Media med Plagiatkontrollalgoritmer (Swedish)
Abstract [en]

As new forms of propaganda and information control spread across the internet, the need for novel ways of identifying them increases as well. One increasingly popular method of spreading false messages on microblogs like Twitter is to disseminate them from seemingly ordinary, but centrally controlled and coordinated user accounts – sockpuppets. In this paper we examine a number of potential methods for identifying these by way of applying plagiarism detection algorithms for text, and evaluate their performance against this type of threat. We identify one type of algorithm in particular – that using vector space modeling of text – as particularly useful in this regard.

Abstract [sv]

Allteftersom  nya  former  av  propaganda  och  informationskontroll  sprider sig över internet krävs också nya sätt att identifiera dessa. En  allt mer populär metod för att sprida falsk information på mikrobloggar  som  Twitter  är  att  göra  det  från  till  synes  ordinära,  men  centralt  kontrollerade och koordinerade användarkonton – på engelska kända  som “sockpuppets”. I denna undersökning testar vi ett antal potentiella  metoder  för  att  identifiera  dessa  genom  att  applicera  plagiatkontrollalgoritmer  ämnade  för  text,  och  utvärderar  deras prestanda mot denna sortens hot. Vi identifierar framför allt en typ av  algoritm  –  den  som  nyttjar  vektorrymdsmodellering  av  text  –  som speciellt användbar i detta avseende. 

Place, publisher, year, edition, pages
2017.
Keywords [en]
NLP, plagiarism detection, Twitter, sockpuppets
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-208553OAI: oai:DiVA.org:kth-208553DiVA, id: diva2:1107323
Supervisors
Examiners
Available from: 2017-06-19 Created: 2017-06-09 Last updated: 2018-01-13Bibliographically approved

Open Access in DiVA

fulltext(337 kB)175 downloads
File information
File name FULLTEXT01.pdfFile size 337 kBChecksum SHA-512
700c8f290e97a805c8685909c7b7ae3e34dcee3c5b5ee0581795453c202a300f5ff9a8bd495c63a9c57678d6753ec08dd866310318304881cf7b7bd95f971e5e
Type fulltextMimetype application/pdf

By organisation
School of Computer Science and Communication (CSC)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 175 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 609 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf