kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Intelligent reflecting surface-assisted UAV inspection system based on transfer learning
Department of Electrical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China.
Department of Electrical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China; National Mobile Communications Research Laboratory, Southeast University, NanJing, China.
Department of Electrical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Information Science and Engineering.ORCID iD: 0000-0002-5407-0835
Show others and affiliations
2024 (English)In: IET Communications, ISSN 1751-8628, E-ISSN 1751-8636, Vol. 18, no 3, p. 214-224Article in journal (Refereed) Published
Abstract [en]

Intelligent reflective surface (IRS) provides an effective solution for reconfiguring air-to-ground wireless channels, and intelligent agents based on reinforcement learning can dynamically adjust the reflection coefficient of IRS to adapt to changing channels. However, most exiting IRS configuration schemes based on reinforcement learning require long training time and are difficult to be industrially deployed. This paper, proposes a model-free IRS control scheme based on reinforcement learning and adopts transfer learning to accelerate the training process. A knowledge base of the source tasks has been constructed for transfer learning, allowing accumulation of experience from different source tasks. To mitigate potential negative effects of transfer learning, quantitative analysis of task similarity through unmanned aerial vehicle (UAV) flight path is conducted. After identifying the most similar source task to the target task, parameters of the source task model are used as the initial values for the target task model to accelerate the convergence process of reinforcement learning. Simulation results demonstrate that the proposed method can increase the convergence speed of the traditional DDQN algorithm by up to 60%.

Place, publisher, year, edition, pages
Institution of Engineering and Technology (IET) , 2024. Vol. 18, no 3, p. 214-224
Keywords [en]
6G, learning (artificial intelligence)
National Category
Robotics and automation Control Engineering
Identifiers
URN: urn:nbn:se:kth:diva-366976DOI: 10.1049/cmu2.12718ISI: 001147115700001Scopus ID: 2-s2.0-85182818285OAI: oai:DiVA.org:kth-366976DiVA, id: diva2:1983878
Note

QC 20250714

Available from: 2025-07-14 Created: 2025-07-14 Last updated: 2025-07-14Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Xiao, Ming

Search in DiVA

By author/editor
Xiao, Ming
By organisation
Information Science and Engineering
In the same journal
IET Communications
Robotics and automationControl Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 24 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf