Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Statistical and Computational Trade-off in Multi-Agent Multi-Armed Bandits
KTH, Skolan för elektroteknik och datavetenskap (EECS), Intelligenta system, Reglerteknik. Ericsson Research Stockholm, Sweden.ORCID-id: 0000-0002-7668-0650
KTH, Skolan för elektroteknik och datavetenskap (EECS), Intelligenta system, Reglerteknik.ORCID-id: 0000-0002-4679-4673
Ericsson Research Stockholm, Sweden.ORCID-id: 0000-0003-4533-3418
2023 (engelsk)Inngår i: Advances in Neural Information Processing Systems 36 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023, Neural Information Processing Systems Foundation , 2023, Vol. 36Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

We study the problem of regret minimization in Multi-Agent Multi-Armed Bandits (MAMABs) where the rewards are defined through a factor graph.We derive an instance-specific regret lower bound and characterize the minimal expected number of times each global action should be explored.This bound and the corresponding optimal exploration process are obtained by solving a combinatorial optimization problem whose set of variables and constraints exponentially grow with the number of agents, and cannot be exploited in the design of efficient algorithms.Inspired by Mean Field approximation techniques used in graphical models, we provide simple upper bounds of the regret lower bound.The corresponding optimization problems have a reduced number of variables and constraints.By tuning the latter, we may explore the trade-off between the achievable regret and the complexity of computing the corresponding exploration process.We devise Efficient Sampling for MAMAB (ESM), an algorithm whose regret asymptotically matches the approximated lower bounds.The regret and computational complexity of ESM are assessed numerically, using both synthetic and real-world experiments in radio communications networks.

sted, utgiver, år, opplag, sider
Neural Information Processing Systems Foundation , 2023. Vol. 36
Serie
Advances in Neural Information Processing Systems, ISSN 1049-5258 ; 36
HSV kategori
Identifikatorer
URN: urn:nbn:se:kth:diva-346142Scopus ID: 2-s2.0-85191164473OAI: oai:DiVA.org:kth-346142DiVA, id: diva2:1855927
Konferanse
37th Conference on Neural Information Processing Systems, NeurIPS 2023, New Orleans, United States of America, Dec 10 2023 - Dec 16 2023
Merknad

QC 20240507

Tilgjengelig fra: 2024-05-03 Laget: 2024-05-03 Sist oppdatert: 2024-07-04bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Scopus

Person

Vannella, FilippoProutiere, Alexandre

Søk i DiVA

Av forfatter/redaktør
Vannella, FilippoProutiere, AlexandreJeong, Jaeseong
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric

urn-nbn
Totalt: 65 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf