kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Practical Machine Learning for Predictions in Mobile Networks
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Software and Computer systems, SCS.ORCID iD: 0000-0003-1992-4740
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Sustainable development
SDG 9: Industry, innovation and infrastructure
Abstract [en]

The use of machine learning (ML) in mobile networks has surged to tackle the challenges arising from their growing complexity, dynamic behavior, and application demands.However, more support through prediction models for key performance metrics that cannot be directly observed at the base station are needed to better adapt network and application parameters to the changing network state. Moreover, the growing adoption of ML models in telecom networks brings practical challenges, like the scarcity of labeled data for training, heterogeneity of hardware, and evolving conditions, all of which pose hurdles to their seamless deployment in real-world scenarios.

In this thesis we investigate ML-driven methods to predict user performance in dynamic environments and address the practical challenges encountered over a model’s life cycle. First, we present supervised ML approaches for predicting key metrics such as round-trip time and one-way delay, using data collected from 5G mmWave testbed networks.Further, we propose a mobile edge-assisted framework for improving quality of service (QoS) and reducing backhaul load for video streaming applications, by prefetching video segments, under backhaul bandwidth and cache storage restrictions. This framework is based on ML predictions of video segment bitrates in a dynamically changing mobile network. 

Next, we address domain shifts, particularly from the arrival of new user equipment (UE) types, by applying adversarial adaptation strategies that allow models to adapt without requiring labeled samples from the new domain.Further, we apply self-supervised learning approaches to harness unlabeled data from telecom network datasets. This is shown to improve model accuracy in label-scarce scenarios. Finally, to further mitigate data-collection challenges, we introduce an open-source simulation framework for generating large-scale, customizable datasets. Overall, this thesis contributes to the careful design of ML solutions for predicting key performance metrics, and shows how they can, along with robust data generation, model adaptation, and model pretraining techniques, elevate user experience and network automation in modern telecom networks.

Abstract [sv]

Användningen av maskininlärning (ML) inom mobila telekomnätverk har ökat kraftigt för att tackla utmaningar som uppstått med nätverkens växande komplexitet och applikationskrav. Mer hjälp från modeller som kan ge prognoser över viktiga prestandavärden, som inte är direkt mätbara vid basstationen behövs dock för att anpassa nätverk och applikationsparametrar till kontinuerligt skiftande nätverkstillstånd.Den växande tillämpningen av ML-modeller för telekomnätverk för med sig praktiska utmaningar, så som brist på annoterad data för träning, varierande hårdvaruarkitekturer och ständigt förändrade nätverkstillstånd. Dessa hinder försvårar driftsättning av ML i verkliga system.I denna avhandling studerar vi ML-modeller för telekomnätverk samt flera praktiska utmaningar som uppstår under en modells livscykel. Först så presenterar vi vägledda ML-metoder som förutspår viktiga mätvärden inom 5G-nätverk. Dessa metoder tränas med data insamlad  från 5G mmWave testbädds-nätverk.Vidare så utvecklar vi en ML-driven mobile edge-lösning som förbättrar videokvaliteten för videoströmningstjänster i nätverket. Vår lösning bygger på smart förhandsinladdning av videosegment, där ML används för att förutspå lämplig kvalitet på videosegmenten.

Vi studerar också hur ML-modeller påverkas när nätverk ändras till en så stor grad att ett domänskifte sker. Vi tittar speciellt på situationen som uppstår när nya typer av användarutrustnings ansluter sig till nätverk. Genom att tillämpa adaptiva strategier visar vi att ML-modeller kan anpassas utan användandet av annoterad data från den nya domänen.

Vi undersöker också hur icke-vägledda ML-modeller kan utnyttja stora mängder oannoterad data som annars inte används. Vi visar hur dessa modeller kan integreras med vägledda modeller för att förbättra prognosprestandan, speciellt när det råder brist på annoterad data.

Slutligen så introducerar vi ett ramverk, skriven i öppen källkod, för att generera stora, anpassningsbara datamängder för mobila telekomnätverk. Allt inräknat så bidrar denna avhandling med ML-lösningar för bättre användarupplevelser samt drift av moderna telekomnätverk.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2025. , p. 54
Series
TRITA-EECS-AVL ; 2025:64
Keywords [en]
Telecom networks, Mobile networks, User performance prediction, Machine learning, 5G networks, DASH, Mobile edge, Domain adaptation, Self-supervised learning
Keywords [sv]
Telekomnätverk, mobila nätverk, användarprestanda prognoser, maskininlärning, 5G-nätverk, DASH, mobil edge, domänadaption, icke-väglett lärande
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-363883ISBN: 978-91-8106-315-8 (print)OAI: oai:DiVA.org:kth-363883DiVA, id: diva2:1960497
Public defence
2025-08-21, Kollegiesalen, Brinellvägen 6, plan 4, KTH campus, Stockholm, 14:00 (English)
Opponent
Supervisors
Note

QC 20250523

Available from: 2025-05-23 Created: 2025-05-22 Last updated: 2025-06-30Bibliographically approved
List of papers
1. Prediction and exposure of delays from a base station perspective in 5G and beyond networks
Open this publication in new window or tab >>Prediction and exposure of delays from a base station perspective in 5G and beyond networks
Show others...
2022 (English)In: 5G-MeMU '22: Proceedings of the ACM SIGCOMM Workshop on 5G and Beyond Network Measurements, Modeling, and Use Cases, Association for Computing Machinery (ACM) , 2022, p. 8-14Conference paper, Published paper (Refereed)
Abstract [en]

The inherent flexibility of 5G networks come with a high degree of configuration and management complexity. This makes the performance outcome for UEs, more than ever, dependent on intricate configurations and interplay between algorithms at various network components. In this paper, we take initial steps towards a performance exposure system at the base station using a data-driven approach for predicting performance violations in terms of RTT, as observed by the UE, in a 5G mmWave network. We present ML models to predict RTT using low-level and high-frequency base station metrics from a 5G mmWave testbed based on commercially available equipment. Predicting UE performance from a base station perspective, and exposing this knowledge, is valuable for applications to proactively address performance violations. We also compare several methods for feature reduction, which have a significant impact on monitoring load. We demonstrate our model's ability to identify RTT violations, paving the way for network providers towards an intelligent performance exposure system. 

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2022
Keywords
5G, delay prediction, machine learning, measurements, 5G mobile communication systems, Forecasting, Millimeter waves, Configuration and managements, Delay predictions, Exposure system, Inherent flexibility, Machine-learning, Management complexity, Mm waves, Performance, Performance outcome, Base stations
National Category
Communication Systems
Identifiers
urn:nbn:se:kth:diva-327303 (URN)10.1145/3538394.3546039 (DOI)001451287200002 ()2-s2.0-85138281433 (Scopus ID)
Conference
5G-MeMU 2022 - Proceedings of the ACM SIGCOMM 2022 Workshop on 5G and Beyond Network Measurements, Modeling, and Use Cases - Part of SIGCOMM 2022
Note

QC 20230524

Available from: 2023-05-24 Created: 2023-05-24 Last updated: 2025-05-22Bibliographically approved
2. Machine Learning at the Mobile Edge: The Case of Dynamic Adaptive Streaming Over HTTP (DASH)
Open this publication in new window or tab >>Machine Learning at the Mobile Edge: The Case of Dynamic Adaptive Streaming Over HTTP (DASH)
Show others...
2022 (English)In: IEEE Transactions on Network and Service Management, E-ISSN 1932-4537, Vol. 19, no 4, p. 4779-4793Article in journal (Refereed) Published
Abstract [en]

Dynamic Adaptive Streaming over HTTP (DASH) is a standard for delivering video in segments and adapting each segment's bitrate (quality), to adjust to changing and limited network bandwidth. We study segment prefetching, informed by machine learning predictions of bitrates of client segment requests, implemented at the network edge. We formulate this client segment request prediction problem as a supervised learning problem of predicting the bitrate of a client's next segment request, in order to prefetch it at the mobile edge, with the objective of jointly improving the video streaming experience for the users and network bandwidth utilization for the service provider. The results of extensive evaluations showed a segment request prediction accuracy of close to 90% and reduced video segment access delay with a cache hit ratio of 58%, and reduced transport network load by lowering the backhaul link utilization by 60.91%.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2022
Keywords
Streaming media, Bit rate, Prefetching, Servers, Measurement, Bandwidth, Prediction algorithms, Video streaming, DASH, caching, machine learning, MEC, 5G
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-324906 (URN)10.1109/TNSM.2022.3193856 (DOI)000965284400001 ()2-s2.0-85135765992 (Scopus ID)
Note

QC 20230920

Available from: 2023-03-21 Created: 2023-03-21 Last updated: 2025-08-25Bibliographically approved
3. Generalizable One-Way Delay Prediction Models for Heterogeneous UEs in 5G Networks
Open this publication in new window or tab >>Generalizable One-Way Delay Prediction Models for Heterogeneous UEs in 5G Networks
Show others...
2024 (English)In: Proceedings of 2024 IEEE/IFIP Network Operations and Management Symposium, NOMS 2024 / [ed] Hong, JWK Seok, SJ Nomura, Y Wang, YC Choi, BY Kim, MS Riggio, R Tsai, MH DosSantos, CRP, Institute of Electrical and Electronics Engineers (IEEE) , 2024Conference paper, Published paper (Refereed)
Abstract [en]

From a 5G operator's perspective, accurate estimates of key User Equipments (UEs) performance metrics, especially One-Way Delay (OWD), can provide valuable information. These estimates can trigger management tasks such as reconfiguration to prevent violations of Service Level Objectives (SLOs). Moreover, such insights into UE performance can empower applications to adapt their services to end-users in a more effective manner. We use advanced machine learning over data gathered at the base stations to predict OWD from UEs and show that we are able to predict OWD with over a 2x reduction in percentage error compared to the considered baseline. We discover the close coupling between the performance of the OWD model and the type of UE, which poses a model generalization challenge. Addressing this problem, we demonstrate the shortcomings of the commonly used fine-tuning approach and develop a novel method based on domain adversarial neural networks, that can adapt to a target domain without compromising on the performance of the source domain. Our results show that we can adapt our source model to provide OWD prediction performance within 1-4 percentage points of the ideal scenario when the source and the target domains are the same. Also, our work is grounded in empirical experiments conducted within a 5G testbed, using commercially available hardware.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Series
IEEE IFIP Network Operations and Management Symposium, ISSN 1542-1201
National Category
Telecommunications
Identifiers
urn:nbn:se:kth:diva-354093 (URN)10.1109/NOMS59830.2024.10574985 (DOI)001270140300015 ()2-s2.0-85197951478 (Scopus ID)
Conference
IEEE/IFIP Network Operations and Management Symposium (NOMS), MAY 06-10, 2024, Seoul, SOUTH KOREA
Note

Part of ISBN 979-8-3503-2793-9, 979-8-3503-2794-6

QC 20241003

Available from: 2024-10-03 Created: 2024-10-03 Last updated: 2025-05-23Bibliographically approved
4. Self-supervised Pretraining for User Performance Prediction under Scarce Data Conditions
Open this publication in new window or tab >>Self-supervised Pretraining for User Performance Prediction under Scarce Data Conditions
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Predicting user performance at the base station in telecom networks is a critical task that can significantly benefit from advanced machine learning techniques. However, labeled data for user performance are scarce and costly to collect, while unlabeled data consisting of base station metrics, are more readily accessible. Self-supervised learning provides a means to leverage this unlabeled data, and has seen remarkable success in the domains of computer vision and natural language processing, with unstructured data. Recently, these methods have been adapted to structured data as well, making them particularly relevant to the telecom domain. We apply self-supervised learning to predict user performance in telecom networks. Our results demonstrate that even with simple self-supervised approaches, the percentage of variance of the target values explained by the model, in low-labeled scenarios (e.g., only 100 labeled samples) can be improved fourfold, from 15% to 60%. Moreover, to promote reproducibility and further research in the domain, we open-source a dataset creation framework and a specific dataset created from it that captures scenarios that have been deemed to be challenging for future networks.

Keywords
user performance prediction, telecom networks, mobile networks, machine learning, self-supervised learning, structured data, tabular data, generalizability, sample efficiency
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-363702 (URN)
Note

QC 20250523

Available from: 2025-05-21 Created: 2025-05-21 Last updated: 2025-05-23Bibliographically approved

Open Access in DiVA

fulltext(1259 kB)353 downloads
File information
File name FULLTEXT01.pdfFile size 1259 kBChecksum SHA-512
4205043ef19d259df00a5f22873f7f0fac4250d42d5ca61e3587ad63cc9c5560d740d43f86b216b4c8ae6ec6ee3574a483a2f6b039c24d40d8ef4dae2af0bc1d
Type fulltextMimetype application/pdf

Authority records

Rao, Akhila

Search in DiVA

By author/editor
Rao, Akhila
By organisation
Software and Computer systems, SCS
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 354 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1197 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf