kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning of Nash Equilibria in Risk-Averse Games
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control).
Duke University, Department of Mechanical Engineering and Materials Science, Durham, NC, USA.
Duke University, Department of Mechanical Engineering and Materials Science, Durham, NC, USA.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control).ORCID iD: 0000-0001-9940-5929
2024 (English)In: 2024 American Control Conference, ACC 2024, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 3270-3275Conference paper, Published paper (Refereed)
Abstract [en]

This paper considers risk-averse learning in convex games involving multiple agents that aim to minimize their individual risk of incurring significantly high costs. Specifically, the agents adopt the conditional value at risk (CVaR) as a risk measure with possibly different risk levels. To solve this problem, we propose a first-order risk-averse leaning algorithm, in which the CVaR gradient estimate depends on an estimate of the Value at Risk (VaR) value combined with the gradient of the stochastic cost function. Although estimation of the CVaR gradients using finitely many samples is generally biased, we show that the accumulated error of the CVaR gradient estimates is bounded with high probability. Moreover, assuming that the risk-averse game is strongly monotone, we show that the proposed algorithm converges to the risk-averse Nash equilibrium. We present numerical experiments on a Cournot game example to illustrate the performance of the proposed method.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2024. p. 3270-3275
National Category
Computer Sciences Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:kth:diva-354317DOI: 10.23919/ACC60939.2024.10644891Scopus ID: 2-s2.0-85204439865OAI: oai:DiVA.org:kth-354317DiVA, id: diva2:1902976
Conference
2024 American Control Conference, ACC 2024, Toronto, Canada, Jul 10 2024 - Jul 12 2024
Note

QC 20241003

Part of ISBN 979-8-3503-8265-5

Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-10-03Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Wang, ZifanJohansson, Karl H.

Search in DiVA

By author/editor
Wang, ZifanJohansson, Karl H.
By organisation
Decision and Control Systems (Automatic Control)
Computer SciencesProbability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 37 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf