kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (10 of 14) Show all publications
Wang, Q., Zhang, S., Özger, M. & Cavdar, C. (2026). Independent PPO-based Robust and Scalable Three-dimensional Configuration for Green Massive MIMO Networks. IEEE Transactions on Green Communications and Networking, 10, 2666-2679
Open this publication in new window or tab >>Independent PPO-based Robust and Scalable Three-dimensional Configuration for Green Massive MIMO Networks
2026 (English)In: IEEE Transactions on Green Communications and Networking, E-ISSN 2473-2400, Vol. 10, p. 2666-2679Article in journal (Refereed) Published
Abstract [en]

In green massive MIMO networks, reducing power consumption (PC) while ensuring user quality of service (QoS) is critical for sustainable operation. To this end, we propose a robust and scalable reinforcement learning framework based on independent proximal policy optimization (IPPO), enabling intelligent base station (BS) control through a three-dimensional configuration of antenna activation, sleep mode transitions, and user offloading. Compared to a non-learning simple energy-saving policy, our proposed IPPO algorithm achieves approximately a 20.3% reduction in PC and a 49% improvement in energy efficiency (EE). In addition, it demonstrates significantly faster convergence and better scalability than multi-agent PPO (MAPPO), reducing convergence time by approximately 75% with 49 BSs and by around 90% with 81 BSs.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2026
Keywords
advanced sleep modes, antenna switching, Base station control, energy saving, generalization, IPPO, scalability
National Category
Telecommunications Communication Systems Computer Sciences
Identifiers
urn:nbn:se:kth:diva-379839 (URN)10.1109/TGCN.2026.3677287 (DOI)001746870000003 ()2-s2.0-105034543060 (Scopus ID)
Note

QC 20260420

Available from: 2026-04-20 Created: 2026-04-20 Last updated: 2026-05-04Bibliographically approved
Deng, Y., Zhang, S., Meer, I. A., Özger, M. & Cavdar, C. (2026). Joint Trajectory and Handover Management for UAVs Co-existing with Terrestrial Users: A Multi-Agent DRL Approach. IEEE Transactions on Cognitive Communications and Networking, 12, 1195-1210
Open this publication in new window or tab >>Joint Trajectory and Handover Management for UAVs Co-existing with Terrestrial Users: A Multi-Agent DRL Approach
Show others...
2026 (English)In: IEEE Transactions on Cognitive Communications and Networking, E-ISSN 2332-7731, Vol. 12, p. 1195-1210Article in journal (Refereed) Published
Abstract [en]

Despite increasing interest in cellular-connected unmanned aerial vehicles (UAVs), their integration into existing cellular networks poses substantial challenges, including intense interference from UAVs to terrestrial user equipments (UEs) and numerous redundant handovers. To jointly reduce the generated interference and redundant handovers of cellular-connected UAVs while keeping their low transmission delay, we define an optimization problem subject to constraints on total available bandwidth and quality of service (QoS). Then, we formulate the optimization problem as a decentralized partially observable Markov decision process (Dec-POMDP) in the context of a cooperative game. We further develope a collaborative trajectory and handover management scheme using a multi-agent deep reinforcement learning algorithm, specifically the Q-learning with a MIXer network (QMIX) algorithm, to jointly optimize the aforementioned three metrics. Simulation results demonstrate that QMIX significantly outperforms two benchmark schemes: the conventional handover management (CHM) scheme and the independent dueling double deep recurrent Q-network (ID3RQN) scheme. Compared with the CHM scheme, QMIX reduces the delay, interference, and number of handovers for UAVs by an average of 46.9%, 70.0% and 90.5%, respectively. Compared with the ID3RQN scheme, QMIX reduces the three metrics by an average of 90.0%, 43.0% and 41.7%, respectively.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2026
Keywords
Cellular-connected UAVs, Handover management, Multi-agent deep reinforcement learning, Multi-objective optimization, Trajectory design
National Category
Communication Systems Computer Sciences Robotics and automation Telecommunications
Identifiers
urn:nbn:se:kth:diva-368544 (URN)10.1109/TCCN.2025.3578506 (DOI)001650420100015 ()2-s2.0-105008145025 (Scopus ID)
Note

QC 20260127

Available from: 2025-08-20 Created: 2025-08-20 Last updated: 2026-01-27Bibliographically approved
Zhang, S., Özger, M., Sri Ganesh Seeram, S. S., Godor, I., Feltrin, L., Nordloew, A., . . . Cavdar, C. (2025). 6G for Connected Sky: Holistic Adaptive Combined Airspace and Non Terrestrial Network Architecture. IEEE wireless communications, 32(5), 204-211
Open this publication in new window or tab >>6G for Connected Sky: Holistic Adaptive Combined Airspace and Non Terrestrial Network Architecture
Show others...
2025 (English)In: IEEE wireless communications, ISSN 1536-1284, E-ISSN 1558-0687, Vol. 32, no 5, p. 204-211Article in journal (Refereed) Published
Abstract [en]

The evolution toward 6G networks introduces unprecedented challenges and opportunities, particularly in the realm of serving both aerial and ground users seamlessly. In this article, we propose a holistic adaptive combined airspace and non-terrestrial network (NTN) architecture designed to address the unique requirements of the 6G era. Three principle features - joint sensing, communication, and computation (JSCC) in three dimensions (3D), cloud-native and artificial intelligence (AI) native, and the flexibility of radio access network (RAN) and core functions of the proposed architecture - are presented. Next, two application scenarios are analyzed: one catering to aerial users and the other supporting ground users, each, in particular, supporting communication links. Finally, we look into the network management and control aspects of the proposed architecture and discuss challenges and future research directions.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
Computer architecture, Sensors, 6G mobile communication, Three-dimensional displays, Artificial intelligence, Satellites, Satellite broadcasting, Cloud computing, Radio access networks, Airplanes
National Category
Communication Systems
Identifiers
urn:nbn:se:kth:diva-367871 (URN)10.1109/MWC.004.2400232 (DOI)001504163400001 ()2-s2.0-105007503088 (Scopus ID)
Note

QC 20251103

Available from: 2025-08-01 Created: 2025-08-01 Last updated: 2025-11-03Bibliographically approved
Sri Ganesh Seeram, S. S., Feltrin, L., Özger, M., Zhang, S. & Cavdar, C. (2025). Handover challenges in disaggregated open RAN for LEO Satellites: tradeoff between handover delay and onboard processing. FRONTIERS IN SPACE TECHNOLOGIES, 6, Article ID 1580005.
Open this publication in new window or tab >>Handover challenges in disaggregated open RAN for LEO Satellites: tradeoff between handover delay and onboard processing
Show others...
2025 (English)In: FRONTIERS IN SPACE TECHNOLOGIES, ISSN 2673-5075, Vol. 6, article id 1580005Article, review/survey (Refereed) Published
Abstract [en]

Given the advancements in next-generation low Earth orbit (LEO) satellites, there is an expected shift from transparent architectures (acting as radio repeaters) to regenerative architectures (hosting a part or all of the gNodeB (gNB) onboard). Such regenerative architectures enable disaggregation and distribution of radio access network (RAN) functions between the ground and space. Open RAN is a promising approach for non-terrestrial networks and offers flexible function placement through open interfaces. The present study examines three open RAN-based regenerative architectures, namely, Split 7.2x (low-layer physical functions onboard), Split 2 (Layers 1 and 2 onboard), and a gNB onboard the satellite. Handover (HO) management becomes increasingly complex in this disaggregated RAN, particularly for LEO satellites, where the part of the gNB is constantly in motion. The choice of regenerative architecture and its dynamic topology influence the additional HO control signals required between the satellite and ground stations. Using a realistic dynamic LEO constellation model, we analyze the interplay among conditional handover (CHO) delay, computational complexity, and control signaling overhead under different network architectures. Our findings reveal that transitioning from a transparent architecture to Split 7.2x does not reduce CHO delay despite the introduction of additional onboard processing. The gNB onboard the satellite minimizes cumulative CHO delay but demands 55%-70% more computational resources than the Split 7.2x architecture. Conversely, although Split 7.2x is computationally more efficient, it increases the cumulative CHO delay by 25%-30%. Additionally, we observed that under limited onboard processing conditions, only the transparent and Split 7.2x architectures supported delay-sensitive services up to 100 ms. In contrast, under ample processing conditions, gNB was suitable for stringent 50 ms requirements, while Split 2 best supported delay-tolerant services with 200 ms requirements.

Place, publisher, year, edition, pages
Frontiers Media SA, 2025
Keywords
open radio access network, non-terrestrial network, functional split, conditional handover, low Earth orbit satellite, regenerative architecture
National Category
Signal Processing
Identifiers
urn:nbn:se:kth:diva-370967 (URN)10.3389/frspt.2025.1580005 (DOI)001522296300001 ()
Note

Correction in DOI 10.3389/frspt.2026.1824394

QC 20251003

Available from: 2025-10-03 Created: 2025-10-03 Last updated: 2026-06-09Bibliographically approved
Sri Ganesh Seeram, S. S., Feltrin, L., Özger, M., Zhang, S. & Cavdar, C. (2025). Handover Delay Minimization in Non-Terrestrial Networks: Impact of Open RAN Functional Splits. In: 2025 12th Advanced Satellite Multimedia Systems Conference and the 18th Signal Processing for Space Communications Workshop, ASMS/SPSC 2025: . Paper presented at 12th Advanced Satellite Multimedia Systems Conference and the 18th Signal Processing for Space Communications Workshop, ASMS/SPSC 2025, Sitges, Spain, Feb 26 2025 - Feb 28 2025. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Handover Delay Minimization in Non-Terrestrial Networks: Impact of Open RAN Functional Splits
Show others...
2025 (English)In: 2025 12th Advanced Satellite Multimedia Systems Conference and the 18th Signal Processing for Space Communications Workshop, ASMS/SPSC 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025Conference paper, Published paper (Refereed)
Abstract [en]

This paper addresses the challenge of optimizing handover (HO) performance in non-terrestrial networks (NTNs) to enhance user equipment (UE) effective service time, defined as the active service time excluding HO delays and radio link failure (RLF) periods. Availability is defined as the normalized effective service time which is effected by different HO scenarios: Intra-satellite HO is the HO from one beam to another within the same satellite; inter-satellite HO refers to the HO from one satellite to another where satellites can be connected to the same or different GSs. We investigate the impact of open radio access network (O-RAN) functional splits (FSs) between ground station (GS) and LEO satellites on HO delay and assess how beam configurations affect RLF rates and intra- and inter-satellite HO rates. This work focuses on three O-RAN FSs - split 7.2x (low layer 1 functions on the satellite), split 2 (layer 1 and layer 2 functions on the satellite), and gNB onboard the satellite - and two beam configurations (19-beam and 127-beam). In a realistic dynamic LEO satellite constellation where different types of HO scenarios are simulated, we maximize effective service time by tuning the time-to-trigger (TTT) and HO margin (HOM) parameters. Our findings reveal that the gNB onboard the satellite achieves the highest availability, approximately 95.4%, while the split 7.2x exhibits the lowest availability, around 92.8% due to higher intra-satellite HO delays.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
conditional handover (CHO), low earth orbit (LEO) satellite, non-terrestrial network (NTN), open radio access network (O-RAN), radio link failure (RLF)
National Category
Signal Processing Telecommunications
Identifiers
urn:nbn:se:kth:diva-363094 (URN)10.1109/ASMS/SPSC64465.2025.10946034 (DOI)001479663300004 ()2-s2.0-105002906801 (Scopus ID)
Conference
12th Advanced Satellite Multimedia Systems Conference and the 18th Signal Processing for Space Communications Workshop, ASMS/SPSC 2025, Sitges, Spain, Feb 26 2025 - Feb 28 2025
Note

Part of ISBN 979-8-3315-2235-3

QC 20250506

Available from: 2025-05-06 Created: 2025-05-06 Last updated: 2026-01-06Bibliographically approved
Deng, Y., Zhang, S., Meer, I. A., Özger, M. & Cavdar, C. (2025). Joint Trajectory and Handover Management for UAVs Co-existing with Terrestrial Users: A Multi-Agent DRL Approach.
Open this publication in new window or tab >>Joint Trajectory and Handover Management for UAVs Co-existing with Terrestrial Users: A Multi-Agent DRL Approach
Show others...
2025 (English)In: Article in journal (Other academic) Submitted
Abstract [en]

Despite increasing interest in cellular-connected unmanned aerial vehicles (UAVs), their integration into existing cellular networks poses substantial challenges, including intense interference from UAVs to terrestrial user equipments (UEs) and numerous redundant handovers. To jointly reduce the generated interference and redundant handovers of cellular-connected UAVs while keeping their low transmission delay, we define an optimization problem with total available bandwidth and quality of service (QoS) constraints. Then, we formulate the optimization problem as a partially observable Markov decision process (POMDP) within a cooperative game. We have further developed a collaborative trajectory and handover management scheme using a Multi-Agent Deep Reinforcement Learning (MADRL) algorithm, specifically the Q-learning with a MIXer network (QMIX) algorithm, to optimize the aforementioned three metrics jointly. To demonstrate the superiority of our proposed scheme, we compare it with two benchmarks, namely the conventional handover management (CHM) scheme and the independent dueling double deep recurrent Q-network (ID3RQN) scheme. Simulation results show that QMIX outperforms the other schemes. Compared with the CHM scheme, QMIX reduces the delay, interference, and number of handovers for UAVs by an average of 46.9%, 70.0% and 90.5%, respectively. Compared with the ID3RQN scheme, QMIX reduces the three metrics by an average of 90.0%, 43.0% and 41.7%, respectively. 

Keywords
Cellular-connected UAVs, Trajectory design, Handover management, Multi-agent deep reinforcement learning, Multi-objective optimization.
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Computer Science; Telecommunication
Identifiers
urn:nbn:se:kth:diva-358035 (URN)
Funder
Vinnova
Note

Submitted to: IEEE Transactions on Cognitive Communications and Networking 

Available from: 2025-01-02 Created: 2025-01-02 Last updated: 2025-01-03Bibliographically approved
Zhang, S., Cai, T., Demir, O. T. & Cavdar, C. (2025). Multi-Agent RL for Sleep Mode and Antenna Configuration With User Offloading Under Dynamic Traffic in Massive MIMO Networks. IEEE Transactions on Vehicular Technology, 74(6), 9734-9749
Open this publication in new window or tab >>Multi-Agent RL for Sleep Mode and Antenna Configuration With User Offloading Under Dynamic Traffic in Massive MIMO Networks
2025 (English)In: IEEE Transactions on Vehicular Technology, ISSN 0018-9545, E-ISSN 1939-9359, Vol. 74, no 6, p. 9734-9749Article in journal (Refereed) Published
Abstract [en]

In this paper, we focus on minimizing the total energy consumption of multi-cell massive multiple-input multiple-output (MIMO) networks while simultaneously guaranteeing user quality of service (QoS). This is achieved by optimizing the multi-level advanced sleep modes (ASM), antenna switching, and user association of the base stations (BSs). Due to the interdependence of user association and inter-cell interference in the network, collaborative efforts among individual BSs become imperative. The problem is modeled as a decentralized partially observable Markov decision process (DEC-POMDP) and a multi-agent proximal policy optimization (MAPPO) algorithm is proposed to obtain a collaborative BS control policy. Simulation results demonstrate that the obtained policy can significantly improve network energy efficiency, adaptively switch the BSs into different depths of sleep, reduce inter-cell interference, and maintain good QoS compared to the two benchmark algorithms. The results also validate that enabling user offloading among BSs can improve both user QoS and system performance. The superiority of MAPPO is further affirmed by comparing it with the single-agent deep Q network (DQN) algorithm.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
Quality of service, Optimization, Massive MIMO, Energy conservation, Energy efficiency, Energy consumption, Antennas, 5G mobile communication, Switches, Interference, Green networks, base station control for energy savings, antenna switching, multi-agent reinforcement learning
National Category
Communication Systems
Identifiers
urn:nbn:se:kth:diva-370524 (URN)10.1109/TVT.2025.3541136 (DOI)001513230700006 ()2-s2.0-85217894312 (Scopus ID)
Note

QC 20251021

Available from: 2025-10-21 Created: 2025-10-21 Last updated: 2025-10-21Bibliographically approved
Liu, S., Zhang, S. & Cavdar, C. (2025). Task Offloading Strategy for Dynamic LEO Satellite and Cloud Networks: A Deep Reinforcement Learning-Based Approach. In: ICC 2025 - IEEE International Conference on Communications: . Paper presented at 2025 IEEE International Conference on Communications, ICC 2025, Montreal, Canada, June 8-12, 2025 (pp. 2454-2459). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Task Offloading Strategy for Dynamic LEO Satellite and Cloud Networks: A Deep Reinforcement Learning-Based Approach
2025 (English)In: ICC 2025 - IEEE International Conference on Communications, Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 2454-2459Conference paper, Published paper (Refereed)
Abstract [en]

In this paper, we study the task-offloading problem for Internet of Remote Things (IoRT) devices. In the considered scenario, the Low Earth Orbit (LEO) satellite nearest to the devices collects their generated tasks and makes offloading decisions for each task, including selecting the offloading destination and allocating computational resources. Moreover, we consider the motion of satellites within a constellation and assume that a random fraction of computing resources on each edge server is occupied during each time slot. Our objective is to minimize the total longterm latency for all IoRT devices. To address this problem, we propose a Proximal Policy Optimization (PPO)-based algorithm to learn the near-optimal policy. The simulation results demonstrate that our proposed algorithm reduces latency by an average of 21.49% and 50.79% compared to the two benchmark algorithms, perspectively.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
deep reinforcement learning, LEO, minimum hop-count analysis, mobile edge computing, task offloading
National Category
Communication Systems Signal Processing Computer Sciences
Identifiers
urn:nbn:se:kth:diva-372508 (URN)10.1109/ICC52391.2025.11161995 (DOI)001701279800350 ()2-s2.0-105018474483 (Scopus ID)
Conference
2025 IEEE International Conference on Communications, ICC 2025, Montreal, Canada, June 8-12, 2025
Note

Part of ISBN 9798331505219

QC 20251110

Available from: 2025-11-10 Created: 2025-11-10 Last updated: 2026-05-29Bibliographically approved
Sri Ganesh Seeram, S. S., Feltrin, L., Özger, M., Zhang, S. & Cavdar, C. (2024). Feasibility Study of Function Splits in RAN Architectures with LEO Satellites. In: 2024 Joint European Conference on Networks and Communications and 6G Summit, EuCNC/6G Summit 2024: . Paper presented at 2024 Joint European Conference on Networks and Communications and 6G Summit, EuCNC/6G Summit 2024, Antwerp, Belgium, Jun 3 2024 - Jun 6 2024 (pp. 622-627). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Feasibility Study of Function Splits in RAN Architectures with LEO Satellites
Show others...
2024 (English)In: 2024 Joint European Conference on Networks and Communications and 6G Summit, EuCNC/6G Summit 2024, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 622-627Conference paper, Published paper (Refereed)
Abstract [en]

This paper explores the evolution of Radio Access Network (RAN) architectures and their integration into Non-Terrestrial Networks (NTN) to address escalating mobile traffic demands. Focusing on Low Earth Orbit (LEO) satellites as key components of NTN, we examine the feasibility of RAN function splits (FSs) in terms of fronthaul (FH) latency, elevation angle, and bandwidth (BW) across LEO satellites and ground stations (GS), alongside evaluating performance of Conditional Handover (CHO) procedures under diverse scenarios. By assessing performance metrics such as handover duration, disconnection time, and control traffic volume, we provide insights on several aspects such as stringent constraints for Low Layer Splits (LLSs), leading to longer delays during mobility procedures and increased control traffic across the feeder link in comparison with the case when gNodeB is onboard satellite. Despite challenges, LLSs demonstrate minimal onboard satellite computational requirements, promising reduced power consumption and payload weight. These findings underscore the architectural possibilities and challenges within the telecommunications industry, paving the way for future advancements in NTN RAN design and operation.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
conditional handover, fronthaul, function splits, LEO satellites, radio access networks
National Category
Communication Systems Signal Processing
Identifiers
urn:nbn:se:kth:diva-351747 (URN)10.1109/EuCNC/6GSummit60053.2024.10597025 (DOI)001275093600027 ()2-s2.0-85199905197 (Scopus ID)
Conference
2024 Joint European Conference on Networks and Communications and 6G Summit, EuCNC/6G Summit 2024, Antwerp, Belgium, Jun 3 2024 - Jun 6 2024
Note

Part of ISBN [9798350344998]

QC 20240820

Available from: 2024-08-13 Created: 2024-08-13 Last updated: 2024-09-10Bibliographically approved
Zhang, S., Cai, T., Wu, D., Schupke, D., Ansari, N. & Cavdar, C. (2024). IoRT Data Collection With LEO Satellite-Assisted and Cache-Enabled UAV: A Deep Reinforcement Learning Approach. IEEE Transactions on Vehicular Technology, 73(4), 5872-5884
Open this publication in new window or tab >>IoRT Data Collection With LEO Satellite-Assisted and Cache-Enabled UAV: A Deep Reinforcement Learning Approach
Show others...
2024 (English)In: IEEE Transactions on Vehicular Technology, ISSN 0018-9545, E-ISSN 1939-9359, Vol. 73, no 4, p. 5872-5884Article in journal (Refereed) Published
Abstract [en]

Space air ground integrated network (SAGIN), leveraging low earth orbit (LEO) satellites and Unmanned Aerial Vehicles (UAVs), is expected to play a key role in providing services to Internet of Remote Things (IoRT) in the sixth generation (6G) communications. Our considered SAGIN incorporates a cache node on the UAV to cope with the data rate fluctuation in the backhaul link (UAV to satellite), allowing temporary storage of collected data during low data rate periods. In this paper, we aim to minimize the completion time of data collection in SAGIN by optimizing the UAV trajectory, IoRT device association scheme, and data caching policy (whether to store data temporarily or not in the UAV). Since the formulated problem is challenging to solve by using traditional optimization methods due to the unknown number of decision variables and the changing environment, we propose a deep reinforcement learning (DRL)-based algorithm to efficiently solve it. Simulation results demonstrate that our proposed algorithm requires less time to complete data collection compared to both the circular trajectory scheme and the no-cache node scheme under various setups. Moreover, our proposed algorithm can adapt to uneven data distribution by approaching closer to the IoRT nodes with large data sizes, and it can also mitigate the influence of backhaul link fluctuations with the aid of the cache node.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Low Earth orbit (LEO), unmanned aerial vehicle (UAV), deep reinforcement learning (DRL), trajectory optimization
National Category
Robotics and automation
Identifiers
urn:nbn:se:kth:diva-346879 (URN)10.1109/TVT.2023.3336262 (DOI)001205788900043 ()2-s2.0-85178048638 (Scopus ID)
Note

QC 20240527

Available from: 2024-05-27 Created: 2024-05-27 Last updated: 2025-02-09Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-3519-9182

Search in DiVA

Show all publications