kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Zeroth-Order Momentum Method for Risk-Averse Online Convex Games
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control). Digital Futures, SE-10044 Stockholm, Sweden..
Duke Univ, Dept Mech Engn & Mat Sci, Durham, NC 27706 USA..
Air Force Res Lab, Eglin AFB, FL USA..
Air Force Res Lab, Eglin AFB, FL USA..
Show others and affiliations
2022 (English)In: 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), Institute of Electrical and Electronics Engineers (IEEE) , 2022, p. 5179-5184Conference paper, Published paper (Refereed)
Abstract [en]

We consider risk-averse learning in repeated unknown games where the goal of the agents is to minimize their individual risk of incurring significantly high cost. Specifically, the agents use the conditional value at risk (CVaR) as a risk measure and rely on bandit feedback in the form of the cost values of the selected actions at every episode to estimate their CVaR values and update their actions. A major challenge in using bandit feedback to estimate CVaR is that the agents can only access their own cost values, which, however, depend on the actions of all agents. To address this challenge, we propose a new risk-averse learning algorithm with momentum that utilizes the full historical information on the cost values. We show that this algorithm achieves sub-linear regret and matches the best known algorithms in the literature. We provide numerical experiments for a Cournot game that show that our method outperforms existing methods.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2022. p. 5179-5184
Series
IEEE Conference on Decision and Control, ISSN 0743-1546
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-326430DOI: 10.1109/CDC51059.2022.9992630ISI: 000948128104054Scopus ID: 2-s2.0-85147035163OAI: oai:DiVA.org:kth-326430DiVA, id: diva2:1754293
Conference
IEEE 61st Conference on Decision and Control (CDC), DEC 06-09, 2022, Cancun, MEXICO
Note

QC 20230503

Available from: 2023-05-03 Created: 2023-05-03 Last updated: 2023-05-03Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Wang, ZifanJohansson, Karl H.

Search in DiVA

By author/editor
Wang, ZifanJohansson, Karl H.
By organisation
Decision and Control Systems (Automatic Control)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 38 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf