kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Optimal exploration strategies for finite horizon regret minimization in some adaptive control problems
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control).ORCID iD: 0000-0003-2008-0127
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control).ORCID iD: 0000-0002-9368-3079
Laboratoire Ampère, UMR CNRS 5005, Ecole Centrale de Lyon, Université de Lyon, France; Centre National de la Recherche Scientifique (CNRS), France.
2023 (English)In: 22nd IFAC World Congress Yokohama, Japan, July 9-14, 2023, Elsevier BV , 2023, Vol. 56, p. 2564-2569Conference paper, Published paper (Refereed)
Abstract [en]

In this work, we consider the problem of regret minimization in adaptive minimum variance and linear quadratic control problems. Regret minimization has been extensively studied in the literature for both types of adaptive control problems. Most of these works give results of the optimal rate of the regret in the asymptotic regime. In the minimum variance case, the optimal asymptotic rate for the regret is log(T) which can be reached without any additional external excitation. On the contrary, for most adaptive linear quadratic problems, it is necessary to add an external excitation in order to get the optimal asymptotic rate of √T. In this paper, we will actually show from a theoretical study, as well as, in simulations that when the control horizon is pre-specified a lower regret can be obtained with either no external excitation or a new exploration type termed immediate.

Place, publisher, year, edition, pages
Elsevier BV , 2023. Vol. 56, p. 2564-2569
Series
IFAC-PapersOnLine, ISSN 2405-8963 ; 56
Keywords [en]
adaptive control, linear quadratic regulator, linear systems, minimum variance controller, Regret minimization
National Category
Control Engineering
Identifiers
URN: urn:nbn:se:kth:diva-343694DOI: 10.1016/j.ifacol.2023.10.1339Scopus ID: 2-s2.0-85184960468OAI: oai:DiVA.org:kth-343694DiVA, id: diva2:1839889
Conference
22nd IFAC World Congress, Yokohama, Japan, Jul 9 2023 - Jul 14 2023
Note

QC 20240222

Part of ISBN 9781713872344

Available from: 2024-02-22 Created: 2024-02-22 Last updated: 2024-03-25Bibliographically approved

Open Access in DiVA

fulltext(348 kB)35 downloads
File information
File name FULLTEXT01.pdfFile size 348 kBChecksum SHA-512
8b51b546b62578228cc273db95804a4cae60149074bcc8b9970512c42ee893e99c98b73a6d4e59351dd8e29af5e972c0c5b85aaa3de73b67e2446eef8f165e26
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Colin, KevinHjalmarsson, Håkan

Search in DiVA

By author/editor
Colin, KevinHjalmarsson, Håkan
By organisation
Decision and Control Systems (Automatic Control)
Control Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 35 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 49 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf