kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Reinforcement Learning for Optimizing FACTS Setpoints With Limited Set of Measurements
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL. Hitachi Energy Sweden AB, Västerås, Sweden.ORCID iD: 0000-0002-3138-9915
Hitachi Energy Ltd, Baden-Dättwil, Switzerland.ORCID iD: 0000-0001-5423-2550
Hitachi Energy Sweden AB, Västerås, Sweden.ORCID iD: 0009-0000-1964-2458
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0003-0579-3372
2026 (English)In: IEEE Open Access Journal of Power and Energy, E-ISSN 2687-7910, Vol. 13, p. 51-63Article in journal (Refereed) Published
Abstract [en]

Coordinated control of Flexible AC Transmission Systems (FACTS) setpoints can significantly enhance power flow and voltage control. However, optimizing the setpoints of multiple FACTS devices in real-world systems remains uncommon, partly due to challenges in model-based control. Data-driven approaches, such as reinforcement learning (RL), offer a promising alternative for coordinated control. In this work, we address a setting where a useful real-time network model is unavailable. Recognizing the increasing deployment of Phasor Measurement Units (PMUs) for advanced monitoring and control, we consider having access to a few but reliable measurements and a constraint violation signal. Under these assumptions, we demonstrate on several scenarios on the IEEE 14-bus and IEEE 57-bus systems that an RL-based optimization of FACTS setpoints can substantially reduce voltage deviations compared to a fixed-setpoint baseline. To improve robustness and prevent unobserved constraint violations, we show that a complete, albeit simple, constraint violation signal is necessary. As an alternative to relying on such a signal, Dynamic Mode Decomposition is proposed to determine new PMU placements, thereby reducing the risk of unobserved constraint violations. Finally, to assess the gap to an optimal policy, we benchmark the RL-based agent against a model-based optimal controller with perfect information.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2026. Vol. 13, p. 51-63
Keywords [en]
Decision support systems, Flexible AC Transmission Systems (FACTS), power system control, reinforcement learning
National Category
Control Engineering Other Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:kth:diva-374972DOI: 10.1109/OAJPE.2025.3645591Scopus ID: 2-s2.0-105025685055OAI: oai:DiVA.org:kth-374972DiVA, id: diva2:2026409
Note

QC 20260127

Available from: 2026-01-09 Created: 2026-01-09 Last updated: 2026-01-27Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Tarle, MagnusBjörkman, Mårten

Search in DiVA

By author/editor
Tarle, MagnusLarsson, MatsIngeström, GunnarBjörkman, Mårten
By organisation
Robotics, Perception and Learning, RPL
Control EngineeringOther Electrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 20 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf