kth.sePublikationer KTH
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Model-Based Reinforcement Learning for Cavity Filter Tuning
Ericsson GAIA, Sweden.
Qualcomm, Sweden.
Ericsson GAIA, Sweden.
Ericsson GAIA, Sweden.
Visa övriga samt affilieringar
2023 (Engelska)Ingår i: Proceedings of the 5th Annual Learning for Dynamics and Control Conference, L4DC 2023, ML Research Press , 2023, s. 1297-1307Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

The ongoing development of telecommunication systems like 5G has led to an increase in demand of well calibrated base transceiver station (BTS) components. A pivotal component of every BTS is cavity filters, which provide a sharp frequency characteristic to select a particular band of interest and reject the rest. Unfortunately, their characteristics in combination with manufacturing tolerances make them difficult for mass production and often lead to costly manual post-production fine tuning. To address this, numerous approaches have been proposed to automate the tuning process. One particularly promising one, that has emerged in the past few years, is to use model free reinforcement learning (MFRL); however, the agents are not sample efficient. This poses a serious bottleneck, as utilising complex simulators or training with real filters is prohibitively time demanding. This work advocates for the usage of model based reinforcement learning (MBRL) and showcases how its utilisation can significantly decrease sample complexity, while maintaining similar levels of success rate. More specifically, we propose an improvement over a state-of-the-art (SoTA) MBRL algorithm, namely the Dreamer algorithm. This improvement can serve as a template for applications in other similar, high-dimensional non-image data problems. We carry experiments on two complex filter types, and show that our novel modification on the Dreamer architecture reduces sample complexity by a factor of 4 and 10, respectively. Our findings pioneer the usage of MBRL which paves the way for utilising more precise and accurate simulators which was previously prohibitively time demanding.

Ort, förlag, år, upplaga, sidor
ML Research Press , 2023. s. 1297-1307
Nyckelord [en]
Model Based Reinforcement Learning, Reinforcement Learning, Telecommunication
Nationell ämneskategori
Datavetenskap (datalogi) Signalbehandling
Identifikatorer
URN: urn:nbn:se:kth:diva-338028ISI: 001221742900099Scopus ID: 2-s2.0-85172912389OAI: oai:DiVA.org:kth-338028DiVA, id: diva2:1804631
Konferens
5th Annual Conference on Learning for Dynamics and Control, L4DC 2023, Philadelphia, United States of America, Jun 16 2023 - Jun 15 2023
Anmärkning

QC 20231013

Tillgänglig från: 2023-10-13 Skapad: 2023-10-13 Senast uppdaterad: 2024-09-05Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Scopus

Person

Ögren, Petter

Sök vidare i DiVA

Av författaren/redaktören
Ögren, Petter
Av organisationen
Robotik, perception och lärande, RPL
Datavetenskap (datalogi)Signalbehandling

Sök vidare utanför DiVA

GoogleGoogle Scholar

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 112 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf