Graph attention networks and deep q-learning for service mesh optimization: a digital twinning approach
2024 (English)In: ICC 2024 - IEEE International Conference on Communications, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 2913-2918Conference paper, Published paper (Refereed)
Abstract [en]
In the realm of cloud native environments, Ku-bernetes has emerged as the de facto orchestration system for containers, and the service mesh architecture, with its interconnected microservices, has become increasingly prominent. Efficient scheduling and resource allocation for these microservices play a pivotal role in achieving high performance and maintaining system reliability. In this paper, we introduce a novel approach for container scheduling within Kubernetes clusters, leveraging Graph Attention Networks (GATs) for representation learning. Our proposed method captures the intricate dependencies among containers and services by constructing a representation graph. The deep Q-learning algorithm is then employed to optimize scheduling decisions, focusing on container-to-node placements, CPU request-response allocation, and adherence to node affinity and anti-affinity rules. Our experiments demonstrate that our GATs-based method outperforms traditional scheduling strategies, leading to enhanced resource utilization, reduced service latency, and improved overall system throughput. The insights gleaned from this study pave the way for a new frontier in cloud native performance optimization and offer tangible benefits to industries adopting microservice-based architectures.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2024. p. 2913-2918
Keywords [en]
component, formatting, insert, style, styling
National Category
Computer Sciences Computer Systems
Identifiers
URN: urn:nbn:se:kth:diva-353522DOI: 10.1109/ICC51166.2024.10622616Scopus ID: 2-s2.0-85202817543OAI: oai:DiVA.org:kth-353522DiVA, id: diva2:1899197
Conference
59th Annual IEEE International Conference on Communications, ICC 2024, June 9-13, 2024, Denver, United States of America
Note
Part of ISBN: 9781728190549
QC 20240926
2024-09-192024-09-192024-09-26Bibliographically approved