kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Detection of Emerging Cyberthreats Through Active Learning
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Theoretical Computer Science, TCS. FOI Swedish Defence Research Agency, Stockholm, Sweden.ORCID iD: 0000-0002-2677-9759
AI Vision Sweden AB, Sweden.
FOI Swedish Defence Research Agency, Stockholm, Sweden.
2025 (English)In: Recent Advances in Deep Learning Applications: New Techniques and Practical Examples / [ed] Uche Onyekpe, Vasile Palade, M. Arif Wani, Informa UK Limited , 2025, p. 123-144Chapter in book (Other academic)
Abstract [en]

In the realm of cybersecurity, leveraging machine learning holds promise for advancing threat detection capabilities. Yet, the sheer volume of unlabeled data presents a challenging hurdle to efficient data management. This chapter delves into the efficacy of active learning methodologies in alleviating the burden of manual data labeling. By employing various query strategies, the study identifies the most informative unlabeled data points suitable for labeling. Examining the performance across different query strategies involved testing a transformer model's ability in discerning tweets referencing advanced persistent threats. In scenarios where labeled training data is scarce, the results suggest that the K-means diversity-based query strategy outperforms both the uncertainty-based approach and the random data point selection. Furthermore, the study investigated the cost-effective active learning paradigm, which integrates high-confidence data points into the training dataset. Surprisingly, this approach emerged as the least effective strategy. In summary, the findings not only explain the potential of active learning in cybersecurity, but also underscore the importance of strategic data selection in optimizing model performance. 

Place, publisher, year, edition, pages
Informa UK Limited , 2025. p. 123-144
National Category
Security, Privacy and Cryptography
Identifiers
URN: urn:nbn:se:kth:diva-374103DOI: 10.1201/9781003570882-9Scopus ID: 2-s2.0-105022862964OAI: oai:DiVA.org:kth-374103DiVA, id: diva2:2022167
Note

Part of ISBN 9781032944623, 9781040323977

QC 20251216

Available from: 2025-12-16 Created: 2025-12-16 Last updated: 2025-12-16Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Brynielsson, Joel

Search in DiVA

By author/editor
Brynielsson, Joel
By organisation
Theoretical Computer Science, TCS
Security, Privacy and Cryptography

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 80 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf