kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Finite-time regret minimization for linear quadratic adaptive controllers: An experiment design approach
Université de Lorraine, CNRS, CRAN, F-54000 Nancy, France.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control). KTH, School of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), Centres, Centre for Advanced BioProduction by Continuous Processing, AdBIOPRO.ORCID iD: 0000-0002-9368-3079
Laboratoire Ampère, UMR CNRS 5005, Ecole Centrale de Lyon, Université de Lyon, France; Centre National de la Recherche Scientifique (CNRS), France.
2025 (English)In: Automatica, ISSN 0005-1098, E-ISSN 1873-2836, Vol. 180, article id 112459Article in journal (Refereed) Published
Abstract [en]

We tackle the problem of finite-time regret minimization in linear quadratic adaptive control. Regret minimization is a scientific field in both adaptive control and reinforcement learning research communities which studies the so-called trade-off between exploration and exploitation. Even though a large focus has been on linear quadratic adaptive control with theoretical finite-time bound guarantees on the expected regret growth rate, most of the proposed optimal exploration strategies do not take into account the scaling constant associated with the growth rate. Moreover, the exploration strategies are limited to white noise excitation. Using tools from experiment design, we propose a computationally tractable solution for the design of the external excitation chosen as a white noise filtered by a finite impulse response filter which is adapted on-line. In a numerical example it is shown that this approach results in a lower regret in comparison with available strategies.

Place, publisher, year, edition, pages
Elsevier BV , 2025. Vol. 180, article id 112459
Keywords [en]
Adaptive control, Experiment design, Linear quadratic regulator, Regret minimization, Reinforcement learning
National Category
Control Engineering
Identifiers
URN: urn:nbn:se:kth:diva-368671DOI: 10.1016/j.automatica.2025.112459ISI: 001522113100001Scopus ID: 2-s2.0-105008907880OAI: oai:DiVA.org:kth-368671DiVA, id: diva2:1990951
Note

QC 20250821

Available from: 2025-08-21 Created: 2025-08-21 Last updated: 2025-10-03Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Hjalmarsson, Håkan

Search in DiVA

By author/editor
Hjalmarsson, Håkan
By organisation
Decision and Control Systems (Automatic Control)Centre for Advanced BioProduction by Continuous Processing, AdBIOPRO
In the same journal
Automatica
Control Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 62 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf