kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Systematic Data-Driven Continual Self-Learning
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Software and Computer systems, SCS.ORCID iD: 0000-0003-1558-4670
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

There is a lot of unexploited potential in using data-driven and self-learning methods to dramatically improve automatic decision-making and control in complex industrial systems. So far, and on a relatively small scale, these methods have demonstrated some potential to achieve performance gains for the automated tuning of complex distributed systems. However, many difficult questions and challenges remain in relation to how to design methods and organise their deployment and operation into large-scale real-world systems. For systematic and scalable integration of state-of-the-art machine learning into such systems, we propose a structured architectural approach.

To understand the essential elements of this architecture, we identify a set of foundational challenges and then derive a set of five research questions. These questions drill into the essential and complex interdependency between data streams, self-learning algorithms that never stop learning and the supporting reference and run-time architectural structures. While there is a need for traditional one-shot supervised models, pushing the technical boundaries of automating all classes of machine learning model training will require a continual approach. 

To support continual learning, real-time data streams are complemented with accurate synthetic data generated for use in model training. By developing and integrating advanced simulations, models can be trained before deployment into a live system, for which system accuracy is then measured quantitatively in realistic scenarios. Reinforcement learning, exploring an action space and qualifying effective dynamic action combinations, is here employed for effective network policy learning. While single-agent and centralised model training may be appropriate in some cases, distributed multi-agent self-learning is essential in industrial scale systems, and thus such a scalable and energy-efficient approach is developed, implemented and analysed in detail. 

Energy usage minimisation in software and hardware intense communication systems, such as the 5G radio access system, is an important and difficult problem in its own right. Our work has focused on energy-aware approaches to applying self-learning methods both to energy reduction applications and algorithms. Using this approach, we can demonstrate clear energy savings while at the same time improving system performance.

Perhaps most importantly, our work attempts to form an understanding of the broader industrial system issues of applying self-learning approaches at scale. Our results take some clear, formative, steps towards large-scale industrialisation of self-learning approaches in communication systems such as 5G.

Abstract [sv]

Datadrivna och självlärande system besitter en mycket stor outnyttjad potential för att förbättra automatisk kontroll och automatiskt beslutsfattande i komplexa industriella system. I mindre skala så har dessa metoder visats ha en viss potential rörande förbättrad prestanda för  automatisk justering av komplexa distribuerande system. Trots detta återstår många svåra frågor och utmaningar kring hur man utformar metoder och hur man organiserar implementering och drift för dessa i storskaliga realtidssystem. 

För systematisk och skalbar integrering av moderna maskininlärningstekniker i dessa verkliga och kommersiellt fungerande system föreslår vi här en strukturerad metod. För att förstå de viktigaste beståndsdelarna och arkitektoniska utmaningarna så namnger och förklarar vi en uppsättning sådana. Ur dessa härleder vi sedan fem forskningsfrågor, vilka undersöker det komplexa beroendeförhållandet mellan dataströmmar, självlärande algoritmer med kontinuerlig inlärning, samt stödjande referens- och driftstrukturer.Det finns fortfarande ett behov av övervakade ''one-shot''-modeller, men för att tänja på de tekniska gränserna avseende automatiserad träning av alla olika slags självlärande system så krävs en kontinuerlig metod. För att främja kontinuerlig inlärning kompletteras realtidsdataströmmar med adekvata syntetiska data, genererade för att möjliggöra träning av modellerna.Genom att utveckla och integrera avancerade simuleringar kan system och modeller tränas innan de implementeras för att användas ''live'', där systemets prestanda and korrekthet kan mätas kvantitativt i realistiska scenarier. För effektiv inlärning av en policy för nätverk så används förstärkningsinlärning (''reinforcement learning''), som utforskar en rymd av möjliga handlingar, ofta i kvalificerade kombinationer.

Medan centraliserad träning kan vara lämpligt i vissa fall så är distribuerade och självlärande agenter nödvändiga komponenter i industriellt storskaliga system. Därför utvecklar, implementerar och detaljanalyserar vi en sådan skalbar och energieffektiv metod.Att minska energianvändningen i mjuk- och hårdvaruintensiva kommunikationssystem, som 5G-radiosystemet, är en svår och viktig utmaning i sig. Vårt arbete har fokuserat på en energimedveten ansats med självlärande metoder, både  för tillämpningarna och för de grundläggande algoritmerna. Genom denna ansats har vi lyckats påvisa avsevärda energibesparingar samtidigt som systemets prestanda förbättrats. Till sist så är nyckelresultatet i vårt arbete analysen av de största utmaningarna för självlärande system i industriell skala och vi har därmed  tagit ett stort steg emot storskalig industrialisering av självlärande metoder inom kommunikationssystem 

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2023. , p. xxvii, 154
Series
TRITA-EECS-AVL ; 2023:29
Keywords [en]
Data-Driven Methods, Self-Learning Systems, Reinforcement Learning Algorithms, Implementation Architectures
Keywords [sv]
Datadrivna metoder, Självlärande system, Reinforcement Learning-algoritmer, Implementeringsarkitekturer
National Category
Communication Systems Computer Systems
Identifiers
URN: urn:nbn:se:kth:diva-325733ISBN: 978-91-8040-534-8 (print)OAI: oai:DiVA.org:kth-325733DiVA, id: diva2:1750762
Public defence
2023-05-09, Ka-Sal C, KTH, Kistagången 16, Kista, Stockholm, 15:00 (English)
Opponent
Supervisors
Note

QC 20230414

Available from: 2023-04-17 Created: 2023-04-14 Last updated: 2023-04-24Bibliographically approved
List of papers
1. Data Driven Selection of DRX for Energy Efficient 5G RAN
Open this publication in new window or tab >>Data Driven Selection of DRX for Energy Efficient 5G RAN
Show others...
2017 (English)In: 13th International Conference on Network and Service Management (CNSM), 2017, 2017, p. 1-9Conference paper, Published paper (Refereed)
Abstract [en]

The number of connected mobile devices is increasing rapidly with more than 10 billion expected by 2022. Their total aggregate energy consumption poses a significant concern to society. The current 3gpp (3rd Generation Partnership Project) LTE/LTE-Advanced standard incorporates an energy saving technique called discontinuous reception (DRX). It is expected that 5G will use an evolved variant of this scheme. In general, the single selection of DRX parameters per device is non trivial. This paper describes how to improve energy efficiency of mobile devices by selecting DRX based on the traffic profile per device. Our particular approach uses a two phase data-driven strategy which tunes the selection of DRX parameters based on a smart fast energy model. The first phase involves the off-line selection of viable DRX combinations for a particular traffic mix. The second phase involves an on-line selection of DRX from this viable list. The method attempts to guarantee that latency is not worse than a chosen threshold. Alternatively, longer battery life for a device can be traded against increased latency. We built a lab prototype of the system to verify that the technique works and scales on a real LTE system. We also designed a sophisticated traffic generator based on actual user data traces. Complementary method verification has been made by exhaustive off-line simulations on recorded LTE network data. Our approach shows significant device energy savings, which has the aggregated potential over billions of devices to make a real contribution to green, energy efficient networks.

National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-225402 (URN)10.23919/CNSM.2017.8255972 (DOI)000427961400004 ()2-s2.0-85046680815 (Scopus ID)
Conference
13th International Conference on Network and Service Management, CNSM 2017, Tokyo, Japan, November 26-30, 2017
Note

QC 20180507

Available from: 2018-04-19 Created: 2018-04-19 Last updated: 2023-04-14Bibliographically approved
2. Efficient Real-Time Traffic Generation for 5G RAN
Open this publication in new window or tab >>Efficient Real-Time Traffic Generation for 5G RAN
2020 (English)In: Proceedings of IEEE/IFIP Network Operations and Management Symposium 2020: Management in the Age of Softwarization and Artificial Intelligence, NOMS 2020, Institute of Electrical and Electronics Engineers (IEEE) , 2020, article id 9110314Conference paper, Published paper (Refereed)
Abstract [en]

Modern telecommunication and mobile networks are increasingly complex from a resource management perspective, with diverse combinations of software and infrastructure elements that need to be configured and tuned for efficient operation with high quality of service. Increased real-time automation at all levels and time-frames is a critical tool in controlling this complexity. A key component in automation is practical and accurate simulation methods that can be used in live traffic scenarios. This paper introduces a new method with supporting algorithms for sampling key parameters from live or recorded traffic which can be used to generate large volumes of synthetic traffic with very similar rate distributions and temporal characteristics. Multiple spatial renewal processes are used to generate fractional Gaussian noise, which is scaled and transformed into a log-normal rate distribution with discrete arrival events, fitted to the properties observed in given recorded traces. This approach works well for modelling large user aggregates but is especially useful for medium sized and relatively small aggregates, where existing methods struggle to reproduce the most important properties of recorded traces. The technique is demonstrated through experimental comparisons with data collected from an operational LTE network to be highly useful in supporting self-learning and automation algorithms which can ultimately reduce complexity, increase energy efficiency, and reduce total network operation costs.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2020
Series
IEEE IFIP Network Operations and Management Symposium, ISSN 1542-1201
National Category
Computer Systems
Research subject
Computer Science
Identifiers
urn:nbn:se:kth:diva-296231 (URN)10.1109/NOMS47738.2020.9110314 (DOI)000716920500042 ()2-s2.0-85086765703 (Scopus ID)
Conference
2020 IEEE/IFIP Network Operations and Management Symposium, Network Operations and Management Symposium, NOMS 2020 Budapest20 April 2020 through 24 April 2020
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

QC 20210609

Part of proceeding: ISBN 978-1-7281-4973-8

Available from: 2021-06-01 Created: 2021-06-01 Last updated: 2023-04-14Bibliographically approved
3. Reinforcement Learning for Automated Energy Efficient Mobile Network Performance Tuning
Open this publication in new window or tab >>Reinforcement Learning for Automated Energy Efficient Mobile Network Performance Tuning
2021 (English)In: Proceedings of the 2021 17th International Conference on Network and Service Management: Smart Management for Future Networks and Services, CNSM 2021, Institute of Electrical and Electronics Engineers (IEEE) , 2021, p. 216-224Conference paper, Published paper (Refereed)
Abstract [en]

Modern mobile networks are increasingly complex from a resource management perspective, with diverse combinations of software, infrastructure elements and services that need to be configured and tuned for correct and efficient operation. It is well accepted in the communications community that appropriately dimensioned, efficient and reliable configurations of systems like 5G or indeed its predecessor 4G is a massive technical challenge. One promising avenue is the application of machine learning methods to apply a data-driven and continuous learning approach to automated system performance tuning. We demonstrate the effectiveness of policy-gradient reinforcement learning as a way to learn and apply complex interleaving patterns of radio resource block usage in 4G and 5G, in order to automate the reduction of cell edge interference. We show that our method can increase overall spectral efficiency up to 25% and increase the overall system energy efficiency up to 50% in very challenging scenarios by learning how to do more with less system resources. We also introduce a flexible phased and continuous learning approach that can be used to train a bootstrap model in a simulated environment after which the model is transferred to a live system for continuous contextual learning. 

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2021
Keywords
Communication system traffic, Inter-cell interference coordination, Learning systems, Machine learning, Radio resource scheduling, Self-organization, System simulation, Automation, Complex networks, Energy efficiency, Radio interference, Reinforcement learning, Tuning, Wireless networks, Continuous learning, Learning approach, Performance tuning, Radio resources, Resource-scheduling, Self organizations, System simulations, 5G mobile communication systems
National Category
Computer Sciences Computer Systems
Identifiers
urn:nbn:se:kth:diva-316392 (URN)10.23919/CNSM52442.2021.9615550 (DOI)000836226700032 ()2-s2.0-85123399258 (Scopus ID)
Conference
17th International Conference on Network and Service Management, CNSM 2021, Virtual/Online, 25-29 October 2021
Note

Part of proceedings: ISBN 978-3-903176-36-2

QC 20220914

Available from: 2022-08-16 Created: 2022-08-16 Last updated: 2023-04-14Bibliographically approved
4. A Sample Efficient Multi-Agent Approach to Continuous Reinforcement Learning
Open this publication in new window or tab >>A Sample Efficient Multi-Agent Approach to Continuous Reinforcement Learning
2022 (English)In: 2022 18Th International Conference On Network And Service Management (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES / [ed] Charalambides, M Papadimitriou, P Cerroni, W Kanhere, S Mamatas, L, IEEE , 2022, p. 338-344Conference paper, Published paper (Refereed)
Abstract [en]

As design, deployment and operation complexity increase in mobile systems, adaptive self-learning techniques have become essential enablers in mitigation and control of the complexity problem. Artificial intelligence and, in particular, reinforcement learning has shown great potential in learning complex tasks through observations. The majority of ongoing reinforcement learning research activities focus on single-agent problem settings with an assumption of accessibility to a globally observable state and action space. In many real-world settings, such as LTE or 5G, decision making is distributed and there is often only local accessibility to the state space. In such settings, multi-agent learning may be preferable, with the added challenge of ensuring that all agents collaboratively work towards achieving a common goal. We present a novel cooperative and distributed actor-critic multi-agent reinforcement learning algorithm. We claim the approach is sample efficient, both in terms of selecting observation samples and in terms of assignment of credit between subsets of collaborating agents.

Place, publisher, year, edition, pages
IEEE, 2022
Series
International Conference on Network and Service Management, ISSN 2165-9605
Keywords
Machine learning, Radio resource scheduling
National Category
Computer Sciences Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-323581 (URN)10.23919/CNSM55787.2022.9965060 (DOI)000903721000044 ()2-s2.0-85143886726 (Scopus ID)
Conference
18th International Conference on Network and Service Management (CNSM) - Intelligent Management of Disruptive Network Technologies and Services, OCT 31-NOV 04, 2022, Thessaloniki, GREECE
Note

Part of proceedings: ISBN 978-3-903176-51-5, QC 20230208

Available from: 2023-02-08 Created: 2023-02-08 Last updated: 2025-02-01Bibliographically approved
5. Autonomous load balancing of heterogeneous networks
Open this publication in new window or tab >>Autonomous load balancing of heterogeneous networks
Show others...
2015 (English)In: IEEE Vehicular Technology Conference, Institute of Electrical and Electronics Engineers Inc. , 2015Conference paper, Published paper (Refereed)
Abstract [en]

This paper presents a method for load balancing heterogeneous networks by dynamically assigning values to the LTE cell range expansion (CRE) parameter. The method records hand-over events online and adapts flexibly to changes in terminal traffic and mobility by maintaining statistical estimators that are used to support autonomous assignment decisions. The proposed approach has low overhead and is highly scalable due to a modularised and completely distributed design that exploits self-organisation based on local inter-cell interactions. An advanced simulator that incorporates terminal traffic patterns and mobility models with a radio access network simulator has been developed to validate and evaluate the method. 

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc., 2015
Keywords
Autonomous network management, Distributed algorithms, Selforganising heterogenous networks, Statistical modelling, Balloons, Heterogeneous networks, Network management, Parallel algorithms, Autonomous networks, Cell range expansions, Distributed design, Heterogenous network, Radio access networks, Self organisation, Statistical estimators, Wireless telecommunication systems
National Category
Communication Systems
Identifiers
urn:nbn:se:kth:diva-296615 (URN)10.1109/VTCSpring.2015.7145712 (DOI)2-s2.0-84940399308 (Scopus ID)9781479980888 (ISBN)
Conference
81st IEEE Vehicular Technology Conference, VTC Spring 2015, 11 May 2015 through 14 May 2015
Note

QC 20220124

Available from: 2021-06-09 Created: 2021-06-09 Last updated: 2023-04-14Bibliographically approved

Open Access in DiVA

thesis(5530 kB)795 downloads
File information
File name FULLTEXT02.pdfFile size 5530 kBChecksum SHA-512
df2d3f14b5d6bbbf24c79eddb03ee4dd0a0eec07e7455c158aa240d62f94457e416577519072bc600c332df4725e6b26cf59bc0170d7b69c5e32c3fb030663b7
Type fulltextMimetype application/pdf

Authority records

Corcoran, Diarmuid

Search in DiVA

By author/editor
Corcoran, Diarmuid
By organisation
Software and Computer systems, SCS
Communication SystemsComputer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 796 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1378 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf