kth.sePublikationer KTH
Ändra sökning
Länk till posten
Permanent länk

Direktlänk
Publikationer (10 of 75) Visa alla publikationer
Lindström, M., Brynielsson, J., Cohen, M., Kamrani, F., Lavebrink, S., Limér, C. & Vangeli, M. (2026). Outsmarting Willful-Thinking Opponents: Bayesian Belief Revision for Adversarial Reasoning in Large Language Models. In: Social Networks Analysis and Mining - 17th International Conference, ASONAM 2025, Proceedings: . Paper presented at 17th International Conference on Social Networks Analysis and Mining, ASONAM 2025, Niagara Falls, Canada, Aug 25 2025 - Aug 28 2025 (pp. 559-578). Springer Nature, 16324
Öppna denna publikation i ny flik eller fönster >>Outsmarting Willful-Thinking Opponents: Bayesian Belief Revision for Adversarial Reasoning in Large Language Models
Visa övriga...
2026 (Engelska)Ingår i: Social Networks Analysis and Mining - 17th International Conference, ASONAM 2025, Proceedings, Springer Nature , 2026, Vol. 16324, s. 559-578Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

In adversarial contexts, success often hinges on understanding not just what the opponent knows, but what they believe and how they revise those beliefs. This study investigates how large language models can be made more resilient and strategically capable by modeling the opponent’s reasoning using Bayesian belief revision. By formalizing negotiations as Bayesian games of incomplete information, it is shown that models equipped with belief revision are better able to counter deceptive or willful-thinking adversaries. The findings underscore the role of second-order reasoning in adversarial settings, with implications for social manipulation in the context of, for example, online communication and intelligence gathering.

Ort, förlag, år, upplaga, sidor
Springer Nature, 2026
Serie
Lecture Notes in Computer Science, ISSN 03029743
Nyckelord
Adversarial modeling, Bayesian belief revision, Behavioral learning, Game theory, Social manipulation
Nationell ämneskategori
Filosofi
Identifikatorer
urn:nbn:se:kth:diva-377811 (URN)10.1007/978-3-032-14107-1_44 (DOI)2-s2.0-105029897517 (Scopus ID)
Konferens
17th International Conference on Social Networks Analysis and Mining, ASONAM 2025, Niagara Falls, Canada, Aug 25 2025 - Aug 28 2025
Anmärkning

Part of ISBN 9783032141064

QC 20260312

Tillgänglig från: 2026-03-12 Skapad: 2026-03-12 Senast uppdaterad: 2026-03-12Bibliografiskt granskad
Lavebrink, S., Brynielsson, J., Cohen, M., Kamrani, F., Limér, C., Lindström, M. & Vangeli, M. (2026). Strategic Steering of Large Language Models via Game-Theoretic Action Space Optimization. In: Social Networks Analysis and Mining: 17th International Conference, ASONAM 2025, Niagara Falls, ON, Canada, August 25–28, 2025, Proceedings, Part III. Paper presented at 17th International Conference on Social Networks Analysis and Mining, ASONAM 2025, Niagara Falls, Canada, Aug 25 2025 - Aug 28 2025 (pp. 508-528). Springer Nature, 16324
Öppna denna publikation i ny flik eller fönster >>Strategic Steering of Large Language Models via Game-Theoretic Action Space Optimization
Visa övriga...
2026 (Engelska)Ingår i: Social Networks Analysis and Mining: 17th International Conference, ASONAM 2025, Niagara Falls, ON, Canada, August 25–28, 2025, Proceedings, Part III, Springer Nature , 2026, Vol. 16324, s. 508-528Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

This paper investigates how large language models can be steered to act more strategically in text-based negotiation settings. Two prompt-based action space designs are compared, namely emotional tone prompts and explicit offer prompts, within a negotiation environment, and outcomes are compared in simulated dialogues. The results show that both approaches improve strategic outcomes compared to a baseline, with tone-based actions yielding higher agreement rates and offer-based actions providing more stable tradeoffs. These findings demonstrate how action space design influences agent behavior, providing insights for deployment of large language models in strategic negotiation scenarios to gain an advantage in, for example, online influence operations.

Ort, förlag, år, upplaga, sidor
Springer Nature, 2026
Serie
Lecture Notes in Computer Science
Nyckelord
Action space optimization, Adversarial dialogue, Game theory, Influence operations, Large language models
Nationell ämneskategori
Data- och informationsvetenskap
Identifikatorer
urn:nbn:se:kth:diva-377816 (URN)10.1007/978-3-032-14107-1_41 (DOI)2-s2.0-105029907544 (Scopus ID)
Konferens
17th International Conference on Social Networks Analysis and Mining, ASONAM 2025, Niagara Falls, Canada, Aug 25 2025 - Aug 28 2025
Anmärkning

Part of ISBN 9783032141064

QC 20260309

Tillgänglig från: 2026-03-09 Skapad: 2026-03-09 Senast uppdaterad: 2026-03-09Bibliografiskt granskad
Limér, C., Brynielsson, J., Cohen, M. & Rydell, F. (2025). Anti-Submarine Warfare Planning Using Public Belief States and Self-Play. In: Proceedings - 2025 24th International Conference on Machine Learning and Applications, ICMLA 2025: . Paper presented at 24th International Conference on Machine Learning and Applications, ICMLA 2025, Boca Raton, United States of America, December 3-5, 2025 (pp. 1211-1218). Institute of Electrical and Electronics Engineers (IEEE)
Öppna denna publikation i ny flik eller fönster >>Anti-Submarine Warfare Planning Using Public Belief States and Self-Play
2025 (Engelska)Ingår i: Proceedings - 2025 24th International Conference on Machine Learning and Applications, ICMLA 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, s. 1211-1218Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

We consider the problem of how to move active sonars unpredictably in pursuit of a stealthy underwater vehicle. The search problem is formalized as an imperfect-information game played on a discretized nautical chart with fine-grained hydroacoustics. The game is solved approximately using public belief states and self-play following a game-theoretically sound approach. The solution method is shown empirically to approximate the Nash equilibrium in a restricted scenario small enough to be solvable with tabular methods from algorithmic game theory.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2025
Nyckelord
Anti-submarine warfare, game theory, mobile sensor planning, public belief state, self-play
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:kth:diva-382372 (URN)10.1109/ICMLA66185.2025.00185 (DOI)2-s2.0-105037002762 (Scopus ID)
Konferens
24th International Conference on Machine Learning and Applications, ICMLA 2025, Boca Raton, United States of America, December 3-5, 2025
Anmärkning

Part of ISBN 9798331559809

QC 20260526

Tillgänglig från: 2026-05-26 Skapad: 2026-05-26 Senast uppdaterad: 2026-05-26Bibliografiskt granskad
Andreasson, A., Artman, H., Brynielsson, J. & Franke, U. (2025). Cyber situation awareness during an emerging cyberthreat: a case study. International Journal of Information Security, 24(5), Article ID 217.
Öppna denna publikation i ny flik eller fönster >>Cyber situation awareness during an emerging cyberthreat: a case study
2025 (Engelska)Ingår i: International Journal of Information Security, ISSN 1615-5262, E-ISSN 1615-5270, Vol. 24, nr 5, artikel-id 217Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

The digitalization of our societies makes them increasingly vulnerable to emerging cyberthreats. These cyberthreats can manifest themselves in the form of organized, sophisticated, and persistent threat actors, as well as nonadversarial mistakes. Staff involved in responding to cyberthreats and handling incidents in organizations need cyber situation awareness. This paper presents a case study on what challenges members of staff involved in cybersecurity in a large, complex organization experience when developing cyber situation awareness while handling a remote code execution vulnerability in the form of Log4Shell. Two types of qualitative empirical material were used for the case study, data collected through semi-structured interviews with ten informants, and internal documentation. The empirical material was analyzed to create a timeline of events in the organization. The results show how information about the threat spread throughout the organization, the types of artifacts that served as common operational pictures, and the role played by information sharing in maintaining staff cyber situation awareness. Three major challenges to the organization were found: (i) information sharing among staff was not effortless, (ii) there was no organization-wide common operational picture established, and (iii) inaccurate information was shared. This study adds a real-world contribution to the literature on organizational handling of cyberthreats.

Ort, förlag, år, upplaga, sidor
Springer Nature, 2025
Nyckelord
Common operational picture, Cyber situation awareness, Cybersecurity, Log4j, Log4Shell, Public sector
Nationell ämneskategori
Systemvetenskap, informationssystem och informatik med samhällsvetenskaplig inriktning Systemvetenskap, informationssystem och informatik Företagsekonomi
Identifikatorer
urn:nbn:se:kth:diva-372052 (URN)10.1007/s10207-025-01106-z (DOI)001581739200001 ()2-s2.0-105017586059 (Scopus ID)
Anmärkning

Not duplicate with DiVA 1955293

QC 20251023

Tillgänglig från: 2025-10-23 Skapad: 2025-10-23 Senast uppdaterad: 2025-10-23Bibliografiskt granskad
Brynielsson, J., Carp, A. & Tegen, A. (2025). Detection of Emerging Cyberthreats Through Active Learning. In: Uche Onyekpe, Vasile Palade, M. Arif Wani (Ed.), Recent Advances in Deep Learning Applications: New Techniques and Practical Examples (pp. 123-144). Informa UK Limited
Öppna denna publikation i ny flik eller fönster >>Detection of Emerging Cyberthreats Through Active Learning
2025 (Engelska)Ingår i: Recent Advances in Deep Learning Applications: New Techniques and Practical Examples / [ed] Uche Onyekpe, Vasile Palade, M. Arif Wani, Informa UK Limited , 2025, s. 123-144Kapitel i bok, del av antologi (Övrigt vetenskapligt)
Abstract [en]

In the realm of cybersecurity, leveraging machine learning holds promise for advancing threat detection capabilities. Yet, the sheer volume of unlabeled data presents a challenging hurdle to efficient data management. This chapter delves into the efficacy of active learning methodologies in alleviating the burden of manual data labeling. By employing various query strategies, the study identifies the most informative unlabeled data points suitable for labeling. Examining the performance across different query strategies involved testing a transformer model's ability in discerning tweets referencing advanced persistent threats. In scenarios where labeled training data is scarce, the results suggest that the K-means diversity-based query strategy outperforms both the uncertainty-based approach and the random data point selection. Furthermore, the study investigated the cost-effective active learning paradigm, which integrates high-confidence data points into the training dataset. Surprisingly, this approach emerged as the least effective strategy. In summary, the findings not only explain the potential of active learning in cybersecurity, but also underscore the importance of strategic data selection in optimizing model performance. 

Ort, förlag, år, upplaga, sidor
Informa UK Limited, 2025
Nationell ämneskategori
Säkerhet, integritet och kryptologi
Identifikatorer
urn:nbn:se:kth:diva-374103 (URN)10.1201/9781003570882-9 (DOI)2-s2.0-105022862964 (Scopus ID)
Anmärkning

Part of ISBN 9781032944623, 9781040323977

QC 20251216

Tillgänglig från: 2025-12-16 Skapad: 2025-12-16 Senast uppdaterad: 2025-12-16Bibliografiskt granskad
Rosengren, J., Brynielsson, J., Johansson, F. & Jonell, P. (2025). Jailbreaking Large Language Models: Safety Alignment, Response Quality, Computational Cost. In: Proceedings - 2025 24th International Conference on Machine Learning and Applications, ICMLA 2025: . Paper presented at 24th International Conference on Machine Learning and Applications, ICMLA 2025, Boca Raton, United States of America, December 3-5, 2025 (pp. 1116-1123). Institute of Electrical and Electronics Engineers (IEEE)
Öppna denna publikation i ny flik eller fönster >>Jailbreaking Large Language Models: Safety Alignment, Response Quality, Computational Cost
2025 (Engelska)Ingår i: Proceedings - 2025 24th International Conference on Machine Learning and Applications, ICMLA 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, s. 1116-1123Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Large language models are often equipped with safety alignment mechanisms designed to prevent generation of harmful or other unwanted content. However, an increasing number of jailbreaking techniques attempt to circumvent these safeguards, raising significant safety concerns. This paper introduces an open-source evaluation framework that analyzes jailbreaking effectiveness in several dimensions: refusal bypass rate, harmful response quality, impact on general model capabilities, and computational cost. In the study, prompt injection, sampling exploits, and model manipulation techniques are examined across four open-weight instruction-tuned large language models. The results demonstrate that high refusal bypass does not necessarily equate to practical safety compromise. Specifically, model manipulation methods like single refusal direction ablation achieve a high attack success rate, but often degrade general capabilities and require significant computational resources. Meanwhile, sampling-based exploits show a minimal practical threat when assessed with a robust model classifier. The findings emphasize the importance of comprehensive, multi-dimensional evaluation to accurately characterize jailbreaking effectiveness and safety risks in large language models.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2025
Nyckelord
jailbreaking, Large language models, model manipulation, prompt injection, safety alignment, sampling exploit
Nationell ämneskategori
Artificiell intelligens Säkerhet, integritet och kryptologi
Identifikatorer
urn:nbn:se:kth:diva-382381 (URN)10.1109/ICMLA66185.2025.00172 (DOI)2-s2.0-105036984227 (Scopus ID)
Konferens
24th International Conference on Machine Learning and Applications, ICMLA 2025, Boca Raton, United States of America, December 3-5, 2025
Anmärkning

Part of ISBN 9798331559809

QC 20260526

Tillgänglig från: 2026-05-26 Skapad: 2026-05-26 Senast uppdaterad: 2026-05-26Bibliografiskt granskad
Oscarsson, M., Brynielsson, J., Cohen, M., Kamrani, F. & Limér, C. (2025). The Cost of Uncertainty in Self-play Reinforcement Learning and Search. In: 2025 IEEE International Conference on Intelligence and Security Informatics (ISI): . Paper presented at 21st Annual IEEE International Conference on Intelligence and Security Informatics, ISI 2025, Hong Kong, China, July 12-13, 2025 (pp. 113-120). Institute of Electrical and Electronics Engineers (IEEE)
Öppna denna publikation i ny flik eller fönster >>The Cost of Uncertainty in Self-play Reinforcement Learning and Search
Visa övriga...
2025 (Engelska)Ingår i: 2025 IEEE International Conference on Intelligence and Security Informatics (ISI), Institute of Electrical and Electronics Engineers (IEEE) , 2025, s. 113-120Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

The combination of reinforcement learning and look-ahead search introduced in AlphaGo, has revolutionized our understanding of tactics and strategy in classical strategy games such as Go and chess. Until recently, this pioneering approach has been limited to perfect information games, where players have full information about the current state of the game. This paper investigates the recent generalization of reinforcement learning with search to imperfect information games, such as poker, where parts of the game state, e.g., the opponent’s hand, is hidden from the player. The paper explores how well this approach scales as the amount of hidden information increases. To this end, the current state of the art in reinforcement learning with search, the student of games general learning algorithm, is reproduced and evaluated across three variants of a custom poker game, each differing by the number of hidden cards dealt to players. It is found that games with less hidden information are learned more effectively, and that computational demands scale sublinearly with increasing hidden information.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2025
Nyckelord
computer poker, counterfactual regret minimization, imperfect information games, Reinforcement learning, student of games algorithm, tree search
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:kth:diva-377986 (URN)10.1109/ISI65680.2025.11201174 (DOI)2-s2.0-105030660994 (Scopus ID)
Konferens
21st Annual IEEE International Conference on Intelligence and Security Informatics, ISI 2025, Hong Kong, China, July 12-13, 2025
Anmärkning

Part of ISBN 9798331512767

QC 20260312

Tillgänglig från: 2026-03-12 Skapad: 2026-03-12 Senast uppdaterad: 2026-03-12Bibliografiskt granskad
Andreasson, A., Artman, H., Brynielsson, J. & Franke, U. (2024). Cybersecurity work at Swedish administrative authorities: taking action or waiting for approval. Cognition, Technology & Work, 26(4), 709-731
Öppna denna publikation i ny flik eller fönster >>Cybersecurity work at Swedish administrative authorities: taking action or waiting for approval
2024 (Engelska)Ingår i: Cognition, Technology & Work, ISSN 1435-5558, E-ISSN 1435-5566, Vol. 26, nr 4, s. 709-731Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

In recent years, the Swedish public sector has undergone rapid digitalization, while cybersecurity efforts have not kept even steps. This study investigates conditions for cybersecurity work at Swedish administrative authorities by examining organizational conditions at the authorities, what cybersecurity staff do to acquire the cyber situation awareness required for their role, as well as what experience cybersecurity staff have with incidents. In this study, 17 semi-structured interviews were held with respondents from Swedish administrative authorities. The results showed the diverse conditions for cybersecurity work that exist at the authorities and that a variety of roles are involved in that work. It was found that national-level support for cybersecurity was perceived as somewhat lacking. There were also challenges in getting access to information elements required for sufficient cyber situation awareness.

Ort, förlag, år, upplaga, sidor
Springer Nature, 2024
Nationell ämneskategori
Data- och informationsvetenskap
Forskningsämne
Människa-datorinteraktion
Identifikatorer
urn:nbn:se:kth:diva-354123 (URN)10.1007/s10111-024-00779-1 (DOI)001321655700001 ()2-s2.0-85205049306 (Scopus ID)
Forskningsfinansiär
Försvarsmakten
Anmärkning

QC 20240930

Tillgänglig från: 2024-09-29 Skapad: 2024-09-29 Senast uppdaterad: 2025-04-29Bibliografiskt granskad
Carp, A., Brynielsson, J. & Tegen, A. (2023). Active Learning for Improvement of Classification of Cyberthreat Actors in Text Fragments. In: Proceedings - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023: . Paper presented at 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023, Jacksonville, United States of America, Dec 15 2023 - Dec 17 2023 (pp. 1279-1286). Institute of Electrical and Electronics Engineers (IEEE)
Öppna denna publikation i ny flik eller fönster >>Active Learning for Improvement of Classification of Cyberthreat Actors in Text Fragments
2023 (Engelska)Ingår i: Proceedings - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023, Institute of Electrical and Electronics Engineers (IEEE) , 2023, s. 1279-1286Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

In the domain of cybersecurity, machine learning can offer advanced threat detection. However, the volume of unlabeled data poses challenges for efficient data management. This study investigates the potential for active learning to reduce the effort required for manual data labeling. Through different query strategies, the most informative unlabeled data points were selected for labeling. The performance of different query strategies was assessed by testing a transformer model's ability to accurately distinguish tweets mentioning names of advanced persistent threats. The findings suggest that the K-means diversity-based query strategy outperformed both the uncertainty-based approach and the random data point selection, when the amount of labeled training data was limited. This study also evaluated the cost-effective active learning approach, which incorporates high-confidence data points into the training dataset. However, this was shown to be the least effective strategy.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2023
Nyckelord
Active learning, advanced persistent threat, cybersecurity, natural language processing
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:kth:diva-350002 (URN)10.1109/ICMLA58977.2023.00193 (DOI)2-s2.0-85190143463 (Scopus ID)
Konferens
22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023, Jacksonville, United States of America, Dec 15 2023 - Dec 17 2023
Anmärkning

Part of ISBN 9798350345346

QC 20240705

Tillgänglig från: 2024-07-05 Skapad: 2024-07-05 Senast uppdaterad: 2024-08-01Bibliografiskt granskad
Brynielsson, J., Cohen, M., Hansen, P., Lavebrink, S., Lindström, M. & Tjörnhammar, E. (2023). Comparison of Strategies for Honeypot Deployment. In: Prakash, BA Wang, D Weninger, T (Ed.), Proceedings Of The 2023 Ieee/Acm International Conference On Advances In Social Networks Analysis And Mining, Asonam 2023: . Paper presented at 15th IEEE/ACM Annual International Conference on Advances in Social Networks Analysis and Mining (ASONAM), NOV 06-09, 2023, Kusadasi, Turkey (pp. 612-619). Association for Computing Machinery (ACM)
Öppna denna publikation i ny flik eller fönster >>Comparison of Strategies for Honeypot Deployment
Visa övriga...
2023 (Engelska)Ingår i: Proceedings Of The 2023 Ieee/Acm International Conference On Advances In Social Networks Analysis And Mining, Asonam 2023 / [ed] Prakash, BA Wang, D Weninger, T, Association for Computing Machinery (ACM) , 2023, s. 612-619Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Recent experimental studies have explored how well adaptive honeypot allocation strategies defend against human adversaries. As the experimental subjects were drawn from an unknown, nondescript pool of subjects using Amazon Mechanical Turk, the relevance to defense against real-world adversaries is unclear. The present study reproduces the experiments with more relevant experimental subjects. The results suggest that the strategies considered are less effective against attackers from the current population. In particular, their ability to predict the next attack decreased steadily over time, that is, the human subjects from this population learned to attack less and less predictably.

Ort, förlag, år, upplaga, sidor
Association for Computing Machinery (ACM), 2023
Serie
Proceedings of the IEEE-ACM International Conference on Advances in Social Networks Analysis and Mining, ISSN 2473-9928
Nyckelord
Cybersecurity, honeypot, game theory, defense strategy, behavioral learning
Nationell ämneskategori
Data- och informationsvetenskap
Identifikatorer
urn:nbn:se:kth:diva-345925 (URN)10.1145/3625007.3631602 (DOI)001191293500097 ()2-s2.0-85190627573 (Scopus ID)
Konferens
15th IEEE/ACM Annual International Conference on Advances in Social Networks Analysis and Mining (ASONAM), NOV 06-09, 2023, Kusadasi, Turkey
Anmärkning

Part of proceedings ISBN: 979-840070409-3

QC 20240426

Tillgänglig från: 2024-04-26 Skapad: 2024-04-26 Senast uppdaterad: 2024-04-26Bibliografiskt granskad
Organisationer
Identifikatorer
ORCID-id: ORCID iD iconorcid.org/0000-0002-2677-9759

Sök vidare i DiVA

Visa alla publikationer