Endre søk
Link to record
Permanent link

Direct link
Publikasjoner (10 av 75) Visa alla publikasjoner
Lindström, M., Brynielsson, J., Cohen, M., Kamrani, F., Lavebrink, S., Limér, C. & Vangeli, M. (2026). Outsmarting Willful-Thinking Opponents: Bayesian Belief Revision for Adversarial Reasoning in Large Language Models. In: Social Networks Analysis and Mining - 17th International Conference, ASONAM 2025, Proceedings: . Paper presented at 17th International Conference on Social Networks Analysis and Mining, ASONAM 2025, Niagara Falls, Canada, Aug 25 2025 - Aug 28 2025 (pp. 559-578). Springer Nature, 16324
Åpne denne publikasjonen i ny fane eller vindu >>Outsmarting Willful-Thinking Opponents: Bayesian Belief Revision for Adversarial Reasoning in Large Language Models
Vise andre…
2026 (engelsk)Inngår i: Social Networks Analysis and Mining - 17th International Conference, ASONAM 2025, Proceedings, Springer Nature , 2026, Vol. 16324, s. 559-578Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

In adversarial contexts, success often hinges on understanding not just what the opponent knows, but what they believe and how they revise those beliefs. This study investigates how large language models can be made more resilient and strategically capable by modeling the opponent’s reasoning using Bayesian belief revision. By formalizing negotiations as Bayesian games of incomplete information, it is shown that models equipped with belief revision are better able to counter deceptive or willful-thinking adversaries. The findings underscore the role of second-order reasoning in adversarial settings, with implications for social manipulation in the context of, for example, online communication and intelligence gathering.

sted, utgiver, år, opplag, sider
Springer Nature, 2026
Serie
Lecture Notes in Computer Science, ISSN 03029743
Emneord
Adversarial modeling, Bayesian belief revision, Behavioral learning, Game theory, Social manipulation
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-377811 (URN)10.1007/978-3-032-14107-1_44 (DOI)2-s2.0-105029897517 (Scopus ID)
Konferanse
17th International Conference on Social Networks Analysis and Mining, ASONAM 2025, Niagara Falls, Canada, Aug 25 2025 - Aug 28 2025
Merknad

Part of ISBN 9783032141064

QC 20260312

Tilgjengelig fra: 2026-03-12 Laget: 2026-03-12 Sist oppdatert: 2026-03-12bibliografisk kontrollert
Lavebrink, S., Brynielsson, J., Cohen, M., Kamrani, F., Limér, C., Lindström, M. & Vangeli, M. (2026). Strategic Steering of Large Language Models via Game-Theoretic Action Space Optimization. In: Social Networks Analysis and Mining: 17th International Conference, ASONAM 2025, Niagara Falls, ON, Canada, August 25–28, 2025, Proceedings, Part III. Paper presented at 17th International Conference on Social Networks Analysis and Mining, ASONAM 2025, Niagara Falls, Canada, Aug 25 2025 - Aug 28 2025 (pp. 508-528). Springer Nature, 16324
Åpne denne publikasjonen i ny fane eller vindu >>Strategic Steering of Large Language Models via Game-Theoretic Action Space Optimization
Vise andre…
2026 (engelsk)Inngår i: Social Networks Analysis and Mining: 17th International Conference, ASONAM 2025, Niagara Falls, ON, Canada, August 25–28, 2025, Proceedings, Part III, Springer Nature , 2026, Vol. 16324, s. 508-528Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

This paper investigates how large language models can be steered to act more strategically in text-based negotiation settings. Two prompt-based action space designs are compared, namely emotional tone prompts and explicit offer prompts, within a negotiation environment, and outcomes are compared in simulated dialogues. The results show that both approaches improve strategic outcomes compared to a baseline, with tone-based actions yielding higher agreement rates and offer-based actions providing more stable tradeoffs. These findings demonstrate how action space design influences agent behavior, providing insights for deployment of large language models in strategic negotiation scenarios to gain an advantage in, for example, online influence operations.

sted, utgiver, år, opplag, sider
Springer Nature, 2026
Serie
Lecture Notes in Computer Science
Emneord
Action space optimization, Adversarial dialogue, Game theory, Influence operations, Large language models
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-377816 (URN)10.1007/978-3-032-14107-1_41 (DOI)2-s2.0-105029907544 (Scopus ID)
Konferanse
17th International Conference on Social Networks Analysis and Mining, ASONAM 2025, Niagara Falls, Canada, Aug 25 2025 - Aug 28 2025
Merknad

Part of ISBN 9783032141064

QC 20260309

Tilgjengelig fra: 2026-03-09 Laget: 2026-03-09 Sist oppdatert: 2026-03-09bibliografisk kontrollert
Limér, C., Brynielsson, J., Cohen, M. & Rydell, F. (2025). Anti-Submarine Warfare Planning Using Public Belief States and Self-Play. In: Proceedings - 2025 24th International Conference on Machine Learning and Applications, ICMLA 2025: . Paper presented at 24th International Conference on Machine Learning and Applications, ICMLA 2025, Boca Raton, United States of America, December 3-5, 2025 (pp. 1211-1218). Institute of Electrical and Electronics Engineers (IEEE)
Åpne denne publikasjonen i ny fane eller vindu >>Anti-Submarine Warfare Planning Using Public Belief States and Self-Play
2025 (engelsk)Inngår i: Proceedings - 2025 24th International Conference on Machine Learning and Applications, ICMLA 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, s. 1211-1218Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

We consider the problem of how to move active sonars unpredictably in pursuit of a stealthy underwater vehicle. The search problem is formalized as an imperfect-information game played on a discretized nautical chart with fine-grained hydroacoustics. The game is solved approximately using public belief states and self-play following a game-theoretically sound approach. The solution method is shown empirically to approximate the Nash equilibrium in a restricted scenario small enough to be solvable with tabular methods from algorithmic game theory.

sted, utgiver, år, opplag, sider
Institute of Electrical and Electronics Engineers (IEEE), 2025
Emneord
Anti-submarine warfare, game theory, mobile sensor planning, public belief state, self-play
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-382372 (URN)10.1109/ICMLA66185.2025.00185 (DOI)2-s2.0-105037002762 (Scopus ID)
Konferanse
24th International Conference on Machine Learning and Applications, ICMLA 2025, Boca Raton, United States of America, December 3-5, 2025
Merknad

Part of ISBN 9798331559809

QC 20260526

Tilgjengelig fra: 2026-05-26 Laget: 2026-05-26 Sist oppdatert: 2026-05-26bibliografisk kontrollert
Andreasson, A., Artman, H., Brynielsson, J. & Franke, U. (2025). Cyber situation awareness during an emerging cyberthreat: a case study. International Journal of Information Security, 24(5), Article ID 217.
Åpne denne publikasjonen i ny fane eller vindu >>Cyber situation awareness during an emerging cyberthreat: a case study
2025 (engelsk)Inngår i: International Journal of Information Security, ISSN 1615-5262, E-ISSN 1615-5270, Vol. 24, nr 5, artikkel-id 217Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

The digitalization of our societies makes them increasingly vulnerable to emerging cyberthreats. These cyberthreats can manifest themselves in the form of organized, sophisticated, and persistent threat actors, as well as nonadversarial mistakes. Staff involved in responding to cyberthreats and handling incidents in organizations need cyber situation awareness. This paper presents a case study on what challenges members of staff involved in cybersecurity in a large, complex organization experience when developing cyber situation awareness while handling a remote code execution vulnerability in the form of Log4Shell. Two types of qualitative empirical material were used for the case study, data collected through semi-structured interviews with ten informants, and internal documentation. The empirical material was analyzed to create a timeline of events in the organization. The results show how information about the threat spread throughout the organization, the types of artifacts that served as common operational pictures, and the role played by information sharing in maintaining staff cyber situation awareness. Three major challenges to the organization were found: (i) information sharing among staff was not effortless, (ii) there was no organization-wide common operational picture established, and (iii) inaccurate information was shared. This study adds a real-world contribution to the literature on organizational handling of cyberthreats.

sted, utgiver, år, opplag, sider
Springer Nature, 2025
Emneord
Common operational picture, Cyber situation awareness, Cybersecurity, Log4j, Log4Shell, Public sector
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-372052 (URN)10.1007/s10207-025-01106-z (DOI)001581739200001 ()2-s2.0-105017586059 (Scopus ID)
Merknad

Not duplicate with DiVA 1955293

QC 20251023

Tilgjengelig fra: 2025-10-23 Laget: 2025-10-23 Sist oppdatert: 2025-10-23bibliografisk kontrollert
Brynielsson, J., Carp, A. & Tegen, A. (2025). Detection of Emerging Cyberthreats Through Active Learning. In: Uche Onyekpe, Vasile Palade, M. Arif Wani (Ed.), Recent Advances in Deep Learning Applications: New Techniques and Practical Examples (pp. 123-144). Informa UK Limited
Åpne denne publikasjonen i ny fane eller vindu >>Detection of Emerging Cyberthreats Through Active Learning
2025 (engelsk)Inngår i: Recent Advances in Deep Learning Applications: New Techniques and Practical Examples / [ed] Uche Onyekpe, Vasile Palade, M. Arif Wani, Informa UK Limited , 2025, s. 123-144Kapittel i bok, del av antologi (Annet vitenskapelig)
Abstract [en]

In the realm of cybersecurity, leveraging machine learning holds promise for advancing threat detection capabilities. Yet, the sheer volume of unlabeled data presents a challenging hurdle to efficient data management. This chapter delves into the efficacy of active learning methodologies in alleviating the burden of manual data labeling. By employing various query strategies, the study identifies the most informative unlabeled data points suitable for labeling. Examining the performance across different query strategies involved testing a transformer model's ability in discerning tweets referencing advanced persistent threats. In scenarios where labeled training data is scarce, the results suggest that the K-means diversity-based query strategy outperforms both the uncertainty-based approach and the random data point selection. Furthermore, the study investigated the cost-effective active learning paradigm, which integrates high-confidence data points into the training dataset. Surprisingly, this approach emerged as the least effective strategy. In summary, the findings not only explain the potential of active learning in cybersecurity, but also underscore the importance of strategic data selection in optimizing model performance. 

sted, utgiver, år, opplag, sider
Informa UK Limited, 2025
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-374103 (URN)10.1201/9781003570882-9 (DOI)2-s2.0-105022862964 (Scopus ID)
Merknad

Part of ISBN 9781032944623, 9781040323977

QC 20251216

Tilgjengelig fra: 2025-12-16 Laget: 2025-12-16 Sist oppdatert: 2025-12-16bibliografisk kontrollert
Rosengren, J., Brynielsson, J., Johansson, F. & Jonell, P. (2025). Jailbreaking Large Language Models: Safety Alignment, Response Quality, Computational Cost. In: Proceedings - 2025 24th International Conference on Machine Learning and Applications, ICMLA 2025: . Paper presented at 24th International Conference on Machine Learning and Applications, ICMLA 2025, Boca Raton, United States of America, December 3-5, 2025 (pp. 1116-1123). Institute of Electrical and Electronics Engineers (IEEE)
Åpne denne publikasjonen i ny fane eller vindu >>Jailbreaking Large Language Models: Safety Alignment, Response Quality, Computational Cost
2025 (engelsk)Inngår i: Proceedings - 2025 24th International Conference on Machine Learning and Applications, ICMLA 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, s. 1116-1123Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Large language models are often equipped with safety alignment mechanisms designed to prevent generation of harmful or other unwanted content. However, an increasing number of jailbreaking techniques attempt to circumvent these safeguards, raising significant safety concerns. This paper introduces an open-source evaluation framework that analyzes jailbreaking effectiveness in several dimensions: refusal bypass rate, harmful response quality, impact on general model capabilities, and computational cost. In the study, prompt injection, sampling exploits, and model manipulation techniques are examined across four open-weight instruction-tuned large language models. The results demonstrate that high refusal bypass does not necessarily equate to practical safety compromise. Specifically, model manipulation methods like single refusal direction ablation achieve a high attack success rate, but often degrade general capabilities and require significant computational resources. Meanwhile, sampling-based exploits show a minimal practical threat when assessed with a robust model classifier. The findings emphasize the importance of comprehensive, multi-dimensional evaluation to accurately characterize jailbreaking effectiveness and safety risks in large language models.

sted, utgiver, år, opplag, sider
Institute of Electrical and Electronics Engineers (IEEE), 2025
Emneord
jailbreaking, Large language models, model manipulation, prompt injection, safety alignment, sampling exploit
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-382381 (URN)10.1109/ICMLA66185.2025.00172 (DOI)2-s2.0-105036984227 (Scopus ID)
Konferanse
24th International Conference on Machine Learning and Applications, ICMLA 2025, Boca Raton, United States of America, December 3-5, 2025
Merknad

Part of ISBN 9798331559809

QC 20260526

Tilgjengelig fra: 2026-05-26 Laget: 2026-05-26 Sist oppdatert: 2026-05-26bibliografisk kontrollert
Oscarsson, M., Brynielsson, J., Cohen, M., Kamrani, F. & Limér, C. (2025). The Cost of Uncertainty in Self-play Reinforcement Learning and Search. In: 2025 IEEE International Conference on Intelligence and Security Informatics (ISI): . Paper presented at 21st Annual IEEE International Conference on Intelligence and Security Informatics, ISI 2025, Hong Kong, China, July 12-13, 2025 (pp. 113-120). Institute of Electrical and Electronics Engineers (IEEE)
Åpne denne publikasjonen i ny fane eller vindu >>The Cost of Uncertainty in Self-play Reinforcement Learning and Search
Vise andre…
2025 (engelsk)Inngår i: 2025 IEEE International Conference on Intelligence and Security Informatics (ISI), Institute of Electrical and Electronics Engineers (IEEE) , 2025, s. 113-120Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

The combination of reinforcement learning and look-ahead search introduced in AlphaGo, has revolutionized our understanding of tactics and strategy in classical strategy games such as Go and chess. Until recently, this pioneering approach has been limited to perfect information games, where players have full information about the current state of the game. This paper investigates the recent generalization of reinforcement learning with search to imperfect information games, such as poker, where parts of the game state, e.g., the opponent’s hand, is hidden from the player. The paper explores how well this approach scales as the amount of hidden information increases. To this end, the current state of the art in reinforcement learning with search, the student of games general learning algorithm, is reproduced and evaluated across three variants of a custom poker game, each differing by the number of hidden cards dealt to players. It is found that games with less hidden information are learned more effectively, and that computational demands scale sublinearly with increasing hidden information.

sted, utgiver, år, opplag, sider
Institute of Electrical and Electronics Engineers (IEEE), 2025
Emneord
computer poker, counterfactual regret minimization, imperfect information games, Reinforcement learning, student of games algorithm, tree search
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-377986 (URN)10.1109/ISI65680.2025.11201174 (DOI)2-s2.0-105030660994 (Scopus ID)
Konferanse
21st Annual IEEE International Conference on Intelligence and Security Informatics, ISI 2025, Hong Kong, China, July 12-13, 2025
Merknad

Part of ISBN 9798331512767

QC 20260312

Tilgjengelig fra: 2026-03-12 Laget: 2026-03-12 Sist oppdatert: 2026-03-12bibliografisk kontrollert
Andreasson, A., Artman, H., Brynielsson, J. & Franke, U. (2024). Cybersecurity work at Swedish administrative authorities: taking action or waiting for approval. Cognition, Technology & Work, 26(4), 709-731
Åpne denne publikasjonen i ny fane eller vindu >>Cybersecurity work at Swedish administrative authorities: taking action or waiting for approval
2024 (engelsk)Inngår i: Cognition, Technology & Work, ISSN 1435-5558, E-ISSN 1435-5566, Vol. 26, nr 4, s. 709-731Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

In recent years, the Swedish public sector has undergone rapid digitalization, while cybersecurity efforts have not kept even steps. This study investigates conditions for cybersecurity work at Swedish administrative authorities by examining organizational conditions at the authorities, what cybersecurity staff do to acquire the cyber situation awareness required for their role, as well as what experience cybersecurity staff have with incidents. In this study, 17 semi-structured interviews were held with respondents from Swedish administrative authorities. The results showed the diverse conditions for cybersecurity work that exist at the authorities and that a variety of roles are involved in that work. It was found that national-level support for cybersecurity was perceived as somewhat lacking. There were also challenges in getting access to information elements required for sufficient cyber situation awareness.

sted, utgiver, år, opplag, sider
Springer Nature, 2024
HSV kategori
Forskningsprogram
Människa-datorinteraktion
Identifikatorer
urn:nbn:se:kth:diva-354123 (URN)10.1007/s10111-024-00779-1 (DOI)001321655700001 ()2-s2.0-85205049306 (Scopus ID)
Forskningsfinansiär
Swedish Armed Forces
Merknad

QC 20240930

Tilgjengelig fra: 2024-09-29 Laget: 2024-09-29 Sist oppdatert: 2025-04-29bibliografisk kontrollert
Carp, A., Brynielsson, J. & Tegen, A. (2023). Active Learning for Improvement of Classification of Cyberthreat Actors in Text Fragments. In: Proceedings - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023: . Paper presented at 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023, Jacksonville, United States of America, Dec 15 2023 - Dec 17 2023 (pp. 1279-1286). Institute of Electrical and Electronics Engineers (IEEE)
Åpne denne publikasjonen i ny fane eller vindu >>Active Learning for Improvement of Classification of Cyberthreat Actors in Text Fragments
2023 (engelsk)Inngår i: Proceedings - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023, Institute of Electrical and Electronics Engineers (IEEE) , 2023, s. 1279-1286Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

In the domain of cybersecurity, machine learning can offer advanced threat detection. However, the volume of unlabeled data poses challenges for efficient data management. This study investigates the potential for active learning to reduce the effort required for manual data labeling. Through different query strategies, the most informative unlabeled data points were selected for labeling. The performance of different query strategies was assessed by testing a transformer model's ability to accurately distinguish tweets mentioning names of advanced persistent threats. The findings suggest that the K-means diversity-based query strategy outperformed both the uncertainty-based approach and the random data point selection, when the amount of labeled training data was limited. This study also evaluated the cost-effective active learning approach, which incorporates high-confidence data points into the training dataset. However, this was shown to be the least effective strategy.

sted, utgiver, år, opplag, sider
Institute of Electrical and Electronics Engineers (IEEE), 2023
Emneord
Active learning, advanced persistent threat, cybersecurity, natural language processing
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-350002 (URN)10.1109/ICMLA58977.2023.00193 (DOI)2-s2.0-85190143463 (Scopus ID)
Konferanse
22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023, Jacksonville, United States of America, Dec 15 2023 - Dec 17 2023
Merknad

Part of ISBN 9798350345346

QC 20240705

Tilgjengelig fra: 2024-07-05 Laget: 2024-07-05 Sist oppdatert: 2024-08-01bibliografisk kontrollert
Brynielsson, J., Cohen, M., Hansen, P., Lavebrink, S., Lindström, M. & Tjörnhammar, E. (2023). Comparison of Strategies for Honeypot Deployment. In: Prakash, BA Wang, D Weninger, T (Ed.), Proceedings Of The 2023 Ieee/Acm International Conference On Advances In Social Networks Analysis And Mining, Asonam 2023: . Paper presented at 15th IEEE/ACM Annual International Conference on Advances in Social Networks Analysis and Mining (ASONAM), NOV 06-09, 2023, Kusadasi, Turkey (pp. 612-619). Association for Computing Machinery (ACM)
Åpne denne publikasjonen i ny fane eller vindu >>Comparison of Strategies for Honeypot Deployment
Vise andre…
2023 (engelsk)Inngår i: Proceedings Of The 2023 Ieee/Acm International Conference On Advances In Social Networks Analysis And Mining, Asonam 2023 / [ed] Prakash, BA Wang, D Weninger, T, Association for Computing Machinery (ACM) , 2023, s. 612-619Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Recent experimental studies have explored how well adaptive honeypot allocation strategies defend against human adversaries. As the experimental subjects were drawn from an unknown, nondescript pool of subjects using Amazon Mechanical Turk, the relevance to defense against real-world adversaries is unclear. The present study reproduces the experiments with more relevant experimental subjects. The results suggest that the strategies considered are less effective against attackers from the current population. In particular, their ability to predict the next attack decreased steadily over time, that is, the human subjects from this population learned to attack less and less predictably.

sted, utgiver, år, opplag, sider
Association for Computing Machinery (ACM), 2023
Serie
Proceedings of the IEEE-ACM International Conference on Advances in Social Networks Analysis and Mining, ISSN 2473-9928
Emneord
Cybersecurity, honeypot, game theory, defense strategy, behavioral learning
HSV kategori
Identifikatorer
urn:nbn:se:kth:diva-345925 (URN)10.1145/3625007.3631602 (DOI)001191293500097 ()2-s2.0-85190627573 (Scopus ID)
Konferanse
15th IEEE/ACM Annual International Conference on Advances in Social Networks Analysis and Mining (ASONAM), NOV 06-09, 2023, Kusadasi, Turkey
Merknad

Part of proceedings ISBN: 979-840070409-3

QC 20240426

Tilgjengelig fra: 2024-04-26 Laget: 2024-04-26 Sist oppdatert: 2024-04-26bibliografisk kontrollert
Organisasjoner
Identifikatorer
ORCID-id: ORCID iD iconorcid.org/0000-0002-2677-9759