Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Optimal UAV Base Station Trajectories Using Flow-Level Models for Reinforcement Learning
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Information Science and Engineering.ORCID iD: 0000-0001-7974-5096
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Information Science and Engineering.ORCID iD: 0000-0001-6630-243X
Int Comp Sci Inst, Edge Comp Serv Continu & Aerial Wireless Network, Berkeley, CA 94704 USA.;Univ Calif Berkeley, Berkeley, CA 94720 USA..ORCID iD: 0000-0003-4730-9921
2019 (English)In: IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, ISSN 2332-7731, Vol. 5, no 4, p. 1101-1112Article in journal (Refereed) Published
Abstract [en]

Cellular base stations (BS) and remote radio heads can be mounted on unmanned aerial vehicles (UAV) for flexible, traffic-aware deployment. These UAV base station networks (UAVBSN) promise an unprecendented degree of freedom that can be exploited for spectral efficiency gains as well as optimal network utilization. However, the current literature lacks realistic radio and traffic models for UAVBSN deployment planning and for performance evaluation. In this paper, we propose flow-level models (FLM) for realistically characterizing the UAVBSN performance in terms of a broad range of flow- and system-level metrics. Further, we propose a deep reinforcement learning (DRL) approach that relies on the UAVBSN FLM for learning the optimal traffic-aware UAV trajectories. For a given user traffic density and starting UAV locations, our RL approach learns the optimal UAV trajectories offline that maximizes a cumulative performance metric. We then execute the learned UAV trajectories in a discrete event simulator to evaluate online UAVBSN performance. For M = 9 UAVs deployed in a simulated Downtown San Francisco model, where the UAV trajectories are defined by N = 20 discrete actions, our approach achieves approximately a three-fold increase in the average user throughput compared to the initial UAV placement, while simultaneously balancing traffic loads across the BSs.

Place, publisher, year, edition, pages
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC , 2019. Vol. 5, no 4, p. 1101-1112
Keywords [en]
UAV base stations, flow-level models, reinforcement learning, proximal policy optimization
National Category
Communication Systems
Identifiers
URN: urn:nbn:se:kth:diva-266404DOI: 10.1109/TCCN.2019.2948324ISI: 000502789700021Scopus ID: 2-s2.0-85073708152OAI: oai:DiVA.org:kth-266404DiVA, id: diva2:1387803
Note

QC 20200122

Available from: 2020-01-22 Created: 2020-01-22 Last updated: 2020-01-22Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records BETA

Saxena, ViditJaldén, Joakim

Search in DiVA

By author/editor
Saxena, ViditJaldén, JoakimKlessig, Henrik
By organisation
Information Science and Engineering
Communication Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 10 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf