kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Temporal Intent-Aware Multi-agent Learning for Network Optimization
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL. Ericsson Research, Stockholm, Sweden.ORCID iD: 0009-0007-9146-0027
Ericsson Research, Stockholm, Sweden.
Ericsson Research, Stockholm, Sweden.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0003-4173-2593
2026 (English)In: Computer Safety, Reliability, and Security. SAFECOMP 2025 Workshops - CoC3CPS, DECSoS, SASSUR, SENSEI, SRToITS, and WAISE, 2025, Proceedings, Springer Nature , 2026, p. 29-40Conference paper, Published paper (Refereed)
Abstract [en]

Cellular networks have grown in size and complexity in recent years. To meet increasing traffic demands, new approaches are needed to replace legacy rule-based controllers and network management systems. Among these, learning-based methods are appealing because they can discover control policies without relying on expert knowledge. Intent-based networking, which describes desired network behavior rather than specific configurations, introduces a new level of abstraction. However, satisfying network intents under temporal constraints remains an open challenge. In this paper, we present a reinforcement learning approach that leverages Signal Temporal Logic (STL) to quantitatively translate network intents into a reward signal. We combine this with a transformer-based neural network architecture to handle temporal dependencies and multi-agent coordination. We evaluate our method in a high-fidelity telecommunications simulator, demonstrating that it outperforms state-of-the-art baselines. Our experiments show an improvement in satisfying temporally dependent intents compared to prior methods.

Place, publisher, year, edition, pages
Springer Nature , 2026. p. 29-40
Keywords [en]
Intent-driven control, Network optimization, Reinforcement learning, Temporal logic
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-370457DOI: 10.1007/978-3-032-02018-5_3Scopus ID: 2-s2.0-105014755551OAI: oai:DiVA.org:kth-370457DiVA, id: diva2:2002067
Conference
Co-Design of Communication, Computing and Control in Cyber-Physical Systems, CoC3CPS 2025 held in conjunction with the 44th International Conference on Computer Safety, Reliability, and Security, SAFECOMP 2025, Stockholm, Sweden, September 9, 2025
Note

Part of ISBN 9783032020178

QC 20250929

Available from: 2025-09-29 Created: 2025-09-29 Last updated: 2025-09-29Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Larsson Forsberg, AlbinTumova, Jana

Search in DiVA

By author/editor
Larsson Forsberg, AlbinTumova, Jana
By organisation
Robotics, Perception and Learning, RPL
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 102 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf