kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Tütüncüoglu, FeridunORCID iD iconorcid.org/0000-0001-5050-2373
Alternative names
Publications (10 of 11) Show all publications
Tütüncüoglu, F. & Dán, G. (2026). RAPTOR: Rate-adaptive Pricing and Optimal Resource Allocation in Serverless Edge Computing. IEEE Transactions on Networking, 34, 4333-4344
Open this publication in new window or tab >>RAPTOR: Rate-adaptive Pricing and Optimal Resource Allocation in Serverless Edge Computing
2026 (English)In: IEEE Transactions on Networking, E-ISSN 2998-4157, Vol. 34, p. 4333-4344Article in journal (Refereed) Published
Abstract [en]

Edge computing (EC) is emerging as a key enabler for latency-sensitive applications such as Augmented Reality (AR), autonomous driving, and industrial IoT, by bringing computational resources closer to Wireless Devices (WDs). However, the limited computational capacity inherent to EC presents challenges in resource allocation and in designing pricing mechanisms that provide the right incentives and are aligned with the user-perceived service quality. This paper addresses these challenges by formulating a Stackelberg game that models WDs’ valuation of EC services based on their offloading rates and the service quality they receive. We prove the existence of Stackelberg equilibria and we propose a tractable approximation technique based on log-barrier functions for computing approximate equilibria. Furthermore, to overcome computational issues, we build on the concept of a Differential Stackelberg Equilibrium (DSE) and we propose Stackelberg Gradient Play (SGP), an implicit gradient-based algorithm that ensures convergence to DSE while maintaining efficiency. Extensive simulations show that our approach significantly outperforms existing methods, achieving up to 70% higher revenue for the edge operator while reducing computational overhead substantially. These results underscore the viability of our framework for use in EC systems that require fast, adaptive, and service-aware joint resource management and pricing.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2026
Keywords
Differential Stackelberg Equilibrium, Edge Computing, Gradient Play, Pricing, Rate Adaptation, Stackelberg Game
National Category
Electrical Engineering, Electronic Engineering, Information Engineering Information Systems
Identifiers
urn:nbn:se:kth:diva-378532 (URN)10.1109/TON.2026.3669358 (DOI)001727127200006 ()2-s2.0-105031962751 (Scopus ID)
Note

Not duplicate with DiVA 1958339

QC 20260327

Available from: 2026-03-27 Created: 2026-03-27 Last updated: 2026-04-08Bibliographically approved
Guo, Y., Tütüncüoglu, F., Javeed, A. & Dán, G. (2026). Revenue Optimal Orchestration of ML-Based Services With Dependencies Under Delay and Quality Constraints in Beyond 5G RAN. IEEE Transactions on Networking, 34, 4403-4415
Open this publication in new window or tab >>Revenue Optimal Orchestration of ML-Based Services With Dependencies Under Delay and Quality Constraints in Beyond 5G RAN
2026 (English)In: IEEE Transactions on Networking, E-ISSN 2998-4157, Vol. 34, p. 4403-4415Article in journal (Refereed) Published
Abstract [en]

Effective service deployment and orchestration will be essential to accommodate user workloads with diverse requirements in cloud-native beyond 5G Radio Access Networks (RAN). Orchestration will have to take into account individual service quality requirements, latency constraints, and dependencies, while leveraging unique characteristics of dominant workloads, such as machine learning (ML) models. In this work, we address the orchestration of ML-based services, considering users that request application services that rely on network services, such as localization, positioning, etc. Each service is composed of functions, at potentially different quality levels. The objective is to maximize the network operator's revenue by determining service deployment, quality selection and computational resource allocation. The resulting problem is a mixed-integer non-convex problem, which we show is NP-hard. We provide sufficient conditions for the problem to be submodular, and for the general case we propose JADES, which relies on linear relaxation and convexification to decompose the problem into two subproblems, which are solved iteratively until convergence, followed by dependent randomized rounding. Our evaluation based on synthetic workloads shows that JADES outperforms baselines in terms of operator revenue and computational efficiency.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2026
Keywords
application service, block coordinate descent, function placement, ML-based service, O-RAN, RAN orchestration
National Category
Computer Systems Computer Sciences Communication Systems
Identifiers
urn:nbn:se:kth:diva-379276 (URN)10.1109/TON.2026.3676930 (DOI)001730977400002 ()2-s2.0-105034066428 (Scopus ID)
Note

QC 20260417

Available from: 2026-04-17 Created: 2026-04-17 Last updated: 2026-04-17Bibliographically approved
Tütüncüoglu, F. (2025). Joint Optimization of Pricing and Resource Allocation in Serverless Edge Computing: A Game-Theoretic Perspective. (Doctoral dissertation). Stockholm, Sweden: KTH Royal Institute of Technology
Open this publication in new window or tab >>Joint Optimization of Pricing and Resource Allocation in Serverless Edge Computing: A Game-Theoretic Perspective
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The rapid advancement of Internet of Things (IoT), Augmented Reality (AR), autonomous systems, and intelligent automation is transforming daily life and revolutionizing industrial processes. These technologies demand significant computational resources while also imposing stringent latency requirements. A common approach to meet computational resource demand is leveraging Cloud Computing (CC), which offers scalable processing capabilities through centralized data centers. Nonetheless, this centralized approach often fails to meet stringent latency requirements due to communication delays caused by the geographical distance between cloud servers and end users. This limitation has led to the emergence of the novel paradigm of Edge Computing (EC), which addresses the latency issue by placing compute units closer to end users.

EC server clusters are expected to be smaller in scale and more geographically dispersed compared to CC data centers. This introduces new challenges, such as limited computational and storage capacity, making efficient resource allocation crucial. Additionally, the network operator managing edge infrastructure must maintain its financial sustainability under different workload characteristics and application requirements of users, necessitating joint and adaptable resource management and pricing strategies. Function-as-a-Service (FaaS) offers a promising approach in this regard, as its pay-as-you-go pricing model allows users to pay only for the resources they consume, while also enabling dynamic resource management by shifting the entire responsibility of application deployment to the operator. However, this flexibility also makes the choice of computation, memory, and bandwidth resources price-dependent, further complicating resource management and pricing.

The papers included in this thesis are organized into three parts, each addressing distinct challenges related to pricing, resource allocation, and system dynamics in EC. In the first part of the thesis, we first consider a setting where Wireless Devices (WDs) minimize energy and the monetary costs of computing tasks, while the operator maximizes revenue by optimizing pricing and application caching under memory constraints. We consider a dynamic setting where the operator has no prior knowledge of the varying availability of WDs over time. We model this interaction as a Stackelberg Game (SG) and demonstrate the existence of an equilibrium. To address information asymmetry, we use Bayesian optimization to learn pricing strategies, establish an upper bound on its asymptotic regret, and propose a greedy approximation algorithm for application caching. We then investigate the joint optimization of compute, communication, and memory resources in a static network setting, where WDs minimize the costs of executing the tasks of their applications, including monetary and energy expenses. We model this interaction as a SG, show the existence of an equilibrium, and prove that computing an equilibrium is NP-hard. We propose an efficient approximation algorithm with a bounded approximation ratio. An interesting feature of our solution is that the operator's revenue is maximized when the WDs maximize their energy savings through computation offloading. Furthermore, we investigate the rate-adaptation problem, where WDs adjust their offloading rates based on available compute resources and pricing. We model the interaction as a SG and propose a Stackelberg gradient play algorithm that computes the operator’s implicit revenue function with respect to the rate selection of the WDs.

The second part of this thesis explores a dynamic network and pricing setting where WDs arrive at the edge cell according to a non-homogeneous stochastic process, and the operator sets prices based on the availability of WDs and their heterogeneous workload characteristics. We formulate the problem of maximizing the revenue ofthe operator as a sequential decision-making problem under uncertainty, where the operator's price can be piecewise linear or non-linear and could vary over time. In a Markovian steady-state setting, we derive analytical results for the optimal pricing strategy, which also serve as a heuristic for the general case. To address the general case, we introduce a Generalized Hidden Parameter Markov Decision Process and propose a dual Bayesian neural network approximator that approximates the state transitions and the revenue to accelerate the learning of the optimal pricing policy. This approach enables pre-training on synthetic traces while adapting quickly to unseen workload patterns.

The third part addresses computational challenges by examining the impact of server contention on both operator revenue and application latency constraints. To address this, we propose a contention model validated through experiments across applications with varying compute demands, including L1/L2/L3 caches, I/O, and memory bus usage. We develop a novel model-based Bayesian optimization algorithm to maximize operator revenue while ensuring that latency and resource capacity constraints are met.

The algorithmic contributions of this thesis in pricing and resource management are intended to provide efficient, deployable, and scalable solutions that strengthen the robustness and efficiency of resource allocation and pricing in EC.

Abstract [sv]

Den snabba utvecklingen av Internet of Things (IoT), Augmented Reality (AR), autonoma system och intelligent automation ger upphov till förändrade levnadsmönster och revolutionerar industriella processer. Dessa teknologier kräver betydande beräkningsresurser samtidigt som de ställer strikta krav på fördröjning (latens). Ett vanligt tillvägagångssätt för att möta behovet av beräkningsresurser är att utnyttja Cloud Computing (CC), som erbjuder skalbara bearbetningsmöjligheter genom centraliserade datacenter. Detta centraliserade tillvägagångssätt misslyckas dock ofta med att uppfylla strikta latenskrav på grund av kommunikationsfördröjningar orsakade av det geografiska avståndet mellan molnservrar och slutanvändare. Denna begränsning har lett till framväxten av det nya paradigmet Edge Computing (EC), som hanterar latensproblemet genom att placera beräkningsenheter närmare slutanvändarna.

EC-serverkluster förväntas vara mindre i skala och geografiskt mer spridda jämfört med CC-datacenter. Detta introducerar nya utmaningar, inklusive begränsad beräknings- och lagringskapacitet, vilket gör effektiv resursallokering avgörande. Dessutom måste nätverksoperatören som hanterar edge-infrastrukturen säkerställa en ekonomiskt hållbar drift, trots varierande arbetsbelastningar och applikationskrav från användare, vilket kräver gemensamma och anpassningsbara strategier för resursstyrning och prissättning. Function-as-a-Service (FaaS) är ett lovande tillvägagångssätt i detta avseende, eftersom dess betala-per-användning-prismodell gör det möjligt för användare att endast betala för de resurser de förbrukar, samtidigt som det möjliggör dynamisk resursstyrning genom att hela ansvaret för applikationsdistribution överförs till operatören. Denna flexibilitet gör dock att valet av beräkning, minne och bandbreddsresurser blir priskänsligt, vilket ytterligare komplicerar resursstyrning och prissättning.

Artiklarna som ingår i denna avhandling är organiserade i tre delar, där varje del behandlar olika utmaningar relaterade till prissättning, resursallokering och systemdynamik i edge computing. I den första delen av avhandlingen betraktar vi ett sammanhang där Wireless Devices (WD:er) minimerar energi- och penningkostnaderna för beräkningsuppgifter, medan operatören maximerar intäkterna genom att optimera prissättning och applikationscaching under minnesbegränsningar. Vi betraktar ett dynamiskt sammanhang där operatören inte har någon förkunskap om den varierande tillgängligheten av WD över tid. Vi modellerar detta som ett Stackelberg Game (SG) och visar att det finns ett jämviktsläge. För att hantera informationsasymmetri använder vi Bayesiansk optimering för att lära oss prissättningsstrategier, fastställer en övre gräns för dess asymptotiska ånger (en: regret), och föreslår en girig approximationsalgoritm för applikationscaching. Vi undersöker sedan att gemensamt optimera beräknings-, kommunikations- och minnesresurser i ett statiskt nätverkssammanhang, där WDs minimerar kostnaderna för att köra applikationsuppgifter, inklusive penning- och energikostnader. Vi modellerar denna interaktion som ett SG, visar existensen av ett jämviktsläge och bevisar att beräkningen av ett sådant är NP-svårt. Vi föreslår en effektiv approximationsalgoritm med en begränsad approximationskvot. En intressant egenskap hos vår lösning är att operatörens intäkter maximeras när WDs maximerar sina energibesparingar genom beräkningsavlastning. Vidare undersöker vi problemet med hastighetsanpassning, där WDs justerar sina avlastningshastigheter baserat på tillgängliga beräkningsresurser och prissättning. Vi modellerar interaktionen som ett SG och föreslår en Stackelberg gradient play-algoritm som beräknar operatörens implicita intäktsfunktion med avseende på WDs val av avlastningshastighet.

Den andra delen av denna avhandling undersöker en dynamisk nätverks- och prissättningsmodell där WDs anländer till edge-cellen enligt en icke-homogen stokastisk process. Operatören sätter priser baserat på tillgången på WDs och deras heterogena arbetsbelastningskarakteristiska. Vi formulerar problemet att maximera operatörens intäkter som ett sekventiellt beslutsproblem under osäkerhet, där operatörens pris kan vara styckvis linjära eller icke-linjära och kan variera över tid. I ett Markovskt stationärt tillstånd härleder vi analytiska resultat för den optimala prissättningsstrategin, denna fungerar också som en heuristik för det allmänna fallet. För att hantera det allmänna fallet introducerar vi en Generaliserad Markovbeslutsprocess med dolda parametrar och föreslår en dubbel Bayesiansk neurala nätverksapproximator som approximerar tillståndsövergångar och intäkter för att påskynda inlärningen av den optimala prissättningspolicyn. Detta tillvägagångssätt möjliggör förträning på syntetisk data samtidigt som det snabbt anpassar sig till nya belastningsmönster.

Den tredje delen tar upp förbisedda beräkningsutmaningar genom att undersöka påverkan av serverkonkurrens på både operatörens intäkter och applikationers latensbegränsningar. För att hantera detta föreslår vi en konkurrensmodell som valideras genom experiment över applikationer med varierande beräkningsbehov, inklusive L1/L2/L3-cache, I/O och minnesbussanvändning. Vi utvecklar en ny modellbaserad Bayesiansk optimeringsalgoritm för att maximera operatörens intäkter samtidigt som latens- och resurskapacitetskrav uppfylls. De algoritmiska bidragen inom området prissättning och resursstyrning är avsedda att fungera som effektiva, implementerbara och skalbara lösningar som förbättrar robustheten och effektiviteten i resursallokering och prissättning inom EC.

Place, publisher, year, edition, pages
Stockholm, Sweden: KTH Royal Institute of Technology, 2025. p. 275
Series
TRITA-EECS-AVL ; 2025:58
Keywords
Edge Computing, Resource Allocation, Pricing Strategies, Stackelberg Games, Combinatorial Optimization, Non-linear Optimization, Bayesian Optimization, Reinforcement Learning, Transfer Learning, edge-beräkning, Resursallokering, Prissättningsstrategier, Stackelbergspel, Kombinatorisk optimering, Icke-linjär optimering, Bayesiansk optimering, Förstärkningsinlärning, Överföringsinlärning
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Electrical Engineering; Computer Science
Identifiers
urn:nbn:se:kth:diva-363353 (URN)978-91-8106-289-2 (ISBN)
Public defence
2025-06-10, https://kth-se.zoom.us/j/68670265353, F3, Lindstedtsvägen 26 & 28, floor 2, KTH Campus, Stockholm, 14:00 (English)
Opponent
Supervisors
Note

QC 20250514

Available from: 2025-05-15 Created: 2025-05-14 Last updated: 2025-12-16Bibliographically approved
Tütüncüoglu, F., Ben-Ameur, A., Dán, G., Araldo, A. & Chahed, T. (2024). Dynamic Time-of-Use Pricing for Serverless Edge Computing with Generalized Hidden Parameter Markov Decision Processes. In: Proceedings - 2024 IEEE 44th International Conference on Distributed Computing Systems, ICDCS 2024: . Paper presented at 44th IEEE International Conference on Distributed Computing Systems, ICDCS 2024, Jersey City, United States of America, Jul 23 2024 - Jul 26 2024 (pp. 668-679). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Dynamic Time-of-Use Pricing for Serverless Edge Computing with Generalized Hidden Parameter Markov Decision Processes
Show others...
2024 (English)In: Proceedings - 2024 IEEE 44th International Conference on Distributed Computing Systems, ICDCS 2024, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 668-679Conference paper, Published paper (Refereed)
Abstract [en]

The commercial adoption of Edge Computing (EC) will require pricing schemes that cater to the financial interests of the operators and of the users. Pricing in EC is particularly challenging as it has to take into account the limited amount of edge resources as well as the stochasticity of user workloads due to location-specific workload characteristics and differences in user activity. We formulate the problem of maximizing the revenue of a serverless edge operator through dynamically pricing compute and memory resources under time varying workloads as a sequential decision making problem under uncertainty. We provide analytical results for the optimal pricing strategy in a Markovian setting in steady state. For the general case, we propose a novel Generalized Hidden Parameter Markov Decision Process (GHP-MDP) formulation of the revenue maximization problem, and we propose a dual Bayesian neural network approximator as a solution. The key novelty of the proposed solution is that it can be pre-trained on synthetic traces and adapts fast to previously unseen workload characteristics. We use simulations based on synthetic and real traffic traces to show that the proposed solution is sample-efficient thanks to effective transfer learning, and it outperforms state-of-the-art learning approaches in terms of revenue and learning rate by up to 50% on real traces.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
dynamic pricing, queuing theory, resource management, Serverless edge computing, transfer learning
National Category
Computer Sciences Communication Systems Computational Mathematics
Identifiers
urn:nbn:se:kth:diva-353498 (URN)10.1109/ICDCS60910.2024.00068 (DOI)001304430200059 ()2-s2.0-85203126163 (Scopus ID)
Conference
44th IEEE International Conference on Distributed Computing Systems, ICDCS 2024, Jersey City, United States of America, Jul 23 2024 - Jul 26 2024
Note

Part of ISBN 9798350386059

QC 20241111

Available from: 2024-09-19 Created: 2024-09-19 Last updated: 2025-05-14Bibliographically approved
Tütüncüoğlu, F. & Dán, G. (2024). Joint Resource Management and Pricing for Task Offloading in Serverless Edge Computing. IEEE Transactions on Mobile Computing, 23(6), 7438-7452
Open this publication in new window or tab >>Joint Resource Management and Pricing for Task Offloading in Serverless Edge Computing
2024 (English)In: IEEE Transactions on Mobile Computing, ISSN 1536-1233, E-ISSN 1558-0660, Vol. 23, no 6, p. 7438-7452Article in journal (Refereed) Published
Abstract [en]

We consider the problem of resource allocation, pricing and application caching for latency sensitive task offloading in serverless edge computing. We model the interaction between a profit-maximizing operator and cost-minimizing Wireless Devices (WDs) as a Stackelberg game where the operator is the leader and decides the price, resource allocation and set of applications to cache, while the WDs are the followers and decide whether to offload their tasks. We first show that the game has a Subgame Perfect Equilibrium (SPE), but computing it, is NP-hard. Importantly, we show that an SPE, which maximizes the operator's revenue, results in minimal energy consumption among the WDs. For computing an approximate SPE, we propose a linear time approximation algorithm with bounded approximation ratio for resource allocation and pricing, and we propose an efficient heuristic based on the utility density of individual applications for the joint optimization of caching, resource allocation and pricing. Our results show that the proposed algorithm outperforms state-of-the-art methods by up to an order of magnitude both in terms of revenue and total energy savings and has small computational overhead. An interesting feature of our results is that the utility of the operator is maximized by a solution that maximizes the WDs' energy savings through computation offloading, which makes it a promising candidate for energy efficient edge cloud deployments.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Combinatorial optimization, convex optimization, edge computing, function as a service, stackelberg game
National Category
Communication Systems
Identifiers
urn:nbn:se:kth:diva-348551 (URN)10.1109/TMC.2023.3334914 (DOI)001216462000063 ()2-s2.0-85179035220 (Scopus ID)
Note

QC 20240701

Available from: 2024-07-01 Created: 2024-07-01 Last updated: 2025-05-14Bibliographically approved
Tutuncuoglu, F. & Dán, G. (2024). Optimal Service Caching and Pricing in Edge Computing: a Bayesian Gaussian Process Bandit Approach. IEEE Transactions on Mobile Computing, 23(1), 705-718
Open this publication in new window or tab >>Optimal Service Caching and Pricing in Edge Computing: a Bayesian Gaussian Process Bandit Approach
2024 (English)In: IEEE Transactions on Mobile Computing, ISSN 1536-1233, E-ISSN 1558-0660, Vol. 23, no 1, p. 705-718Article in journal (Refereed) Published
Abstract [en]

Motivated by the emergence of function-as-a-service (FaaS) as a programming abstraction for edge computing, we consider the problem of caching and pricing applications for edge computation offloading in a dynamic environment where (WDs) can be active or inactive at any point in time. We model the problem as a single leader multiple-follower Stackelberg game, where the service operator is the leader and decides what applications to cache and how much to charge for their use, while the WDs are the followers and decide whether or not to offload their computations. We show that the WDs' interaction can be modeled as a player-specific congestion game and show the existence and computability of equilibria. We then show that under perfect and complete information the equilibrium price of the service operator can be computed in polynomial time for any cache placement. For the incomplete information case, we propose a Bayesian Gaussian Process Bandit algorithm for learning an optimal price for a cache placement and provide a bound on its asymptotic regret. We then propose a Gaussian process approximation-based greedy heuristic for computing the cache placement. We use extensive simulations to evaluate the proposed learning scheme, and show that it outperforms state of the art algorithms by up to 50% at little computational overhead.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Computational modeling, Costs, Games, Gaussian processes, Pricing, Servers, Task analysis, computation offloading, Computer games, Computer programming, Gaussian distribution, Gaussian noise (electronic), Learning algorithms, Optimization, Polynomial approximation, Bayesian Gaussian process, Cache placement, Computational modelling, Dynamic environments, Edge computing, Game, Programming abstractions
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-328948 (URN)10.1109/TMC.2022.3221465 (DOI)001136301500005 ()2-s2.0-85141646376 (Scopus ID)
Note

QC 20250611

Available from: 2023-06-14 Created: 2023-06-14 Last updated: 2025-06-11Bibliographically approved
Tütüncüoglu, F. & Dán, G. (2024). Sample-efficient Learning for Edge Resource Allocation and Pricing with BNN Approximators. In: IEEE INFOCOM 2024 - IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2024: . Paper presented at 2024 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2024, Vancouver, Canada, May 20 2024. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Sample-efficient Learning for Edge Resource Allocation and Pricing with BNN Approximators
2024 (English)In: IEEE INFOCOM 2024 - IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2024, Institute of Electrical and Electronics Engineers (IEEE) , 2024Conference paper, Published paper (Refereed)
Abstract [en]

Edge computing (EC) is expected to provide low latency access to computing and storage resources to autonomous Wireless Devices (WDs). Pricing and resource allocation in EC thus have to cope with stochastic workloads, on the one hand offering resources at a price that is attractive to WDs, one the other hand ensuring revenue to the edge operator. In this paper, we formulate the strategic interaction between an edge operator and WDs as a Bayesian Stackelberg Markov game. We characterize the optimal strategy of the WDs that minimizes their costs. We then show that the operator's problem can be formulated as a Markov Decision Process and propose a model-based reinforcement learning approach, based on a novel approximation of the workload dynamics at the edge cell environment. The proposed approximation leverages two Bayesian Neural Networks (BNNs) to facilitate efficient policy learning, and enables sample efficient transfer learning from simulated environments to a real edge environment. Our extensive simulation results demonstrate the superiority of our approach in terms of sample efficiency, outperforming state-of-the-art methods 30 times in terms of learning rate and by 50% in terms of operator revenue.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Bayesian neural networks, Edge computing, Markov decision process
National Category
Computational Mathematics
Identifiers
urn:nbn:se:kth:diva-353568 (URN)10.1109/INFOCOMWKSHPS61880.2024.10620829 (DOI)001300418400120 ()2-s2.0-85202342075 (Scopus ID)
Conference
2024 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2024, Vancouver, Canada, May 20 2024
Note

Part of ISBN 9798350384475

QC 20240924

Available from: 2024-09-19 Created: 2024-09-19 Last updated: 2025-05-06Bibliographically approved
Tutuncuoglu, F., Josilo, S. & Dán, G. (2023). Online Learning for Rate-Adaptive Task Offloading Under Latency Constraints in Serverless Edge Computing. IEEE/ACM Transactions on Networking, 31(2), 695-709
Open this publication in new window or tab >>Online Learning for Rate-Adaptive Task Offloading Under Latency Constraints in Serverless Edge Computing
2023 (English)In: IEEE/ACM Transactions on Networking, ISSN 1063-6692, E-ISSN 1558-2566, Vol. 31, no 2, p. 695-709Article in journal (Refereed) Published
Abstract [en]

We consider the interplay between latency constrained applications and function-level resource management in a serverless edge computing environment. We develop a game theoretic model of the interaction between rate adaptive applications and a load balancing operator under a function-oriented pay-as-you-go pricing model. We show that under perfect information, the strategic interaction between the applications can be formulated as a generalized Nash equilibrium problem, and use variational inequality theory to prove that the game admits an equilibrium. For the case of imperfect information, we propose an online learning algorithm for applications to maximize their utility through rate adaptation and resource reservation. We show that the proposed algorithm can converge to equilibria and achieves zero regret asymptotically, and our simulation results show that the algorithm achieves good system performance at equilibrium, ensures fast convergence, and enables applications to meet their latency constraints.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023
Keywords
Generalized Nash equilibrium problem, online learning, resource allocation, serverless edge computing, Computation theory, Data structures, E-learning, Edge computing, Game theory, Job analysis, Learning algorithms, Variational techniques, Wireless sensor networks, Computational modelling, FAA, Generalized Nash equilibrium problems, Resources allocation, Task analysis, Wireless communications
National Category
Computer Systems
Identifiers
urn:nbn:se:kth:diva-327031 (URN)10.1109/TNET.2022.3197669 (DOI)000849231300001 ()2-s2.0-85137579713 (Scopus ID)
Note

QC 20230523

Available from: 2023-05-23 Created: 2023-05-23 Last updated: 2023-05-23Bibliographically approved
Tutuncuoglu, F. & Dán, G. (2021). Optimal Pricing for Service Caching and Task Offloading in Edge Computing. In: 17Th Conference On Wireless On-Demand Network Systems And Services (WONS 2022): . Paper presented at 17th Conference on Wireless On-Demand Network Systems and Services (WONS), MAR 30-APR 01, 2022, ELECTR NETWORK. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Optimal Pricing for Service Caching and Task Offloading in Edge Computing
2021 (English)In: 17Th Conference On Wireless On-Demand Network Systems And Services (WONS 2022), Institute of Electrical and Electronics Engineers (IEEE) , 2021Conference paper, Published paper (Refereed)
Abstract [en]

Motivated by the emergence of function-as-a-service (FaaS) as a programming abstraction for edge computing, we consider the problem of caching and pricing applications for edge computation offloading. We model the problem as a multiplefollower Stackelberg game, where the operator is the leader and decides what applications to cache and how much to charge for their use, while the wireless devices (WDs) are the followers and decide whether or not to offload their computations. We show that the WDs' interaction can be modeled as a player-specific congestion game and show the existence and computability of equilibria. We then show that the equilibrium price of the operator can be computed in polynomial time for any cache placement, and propose a greedy algorithm for computing the applications to be cached. We use extensive simulations to show that the proposed heuristic performs close to optimal at negligible computational overhead.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2021
National Category
Computer Engineering Telecommunications Computer Sciences
Identifiers
urn:nbn:se:kth:diva-316711 (URN)10.23919/wons54113.2022.9764593 (DOI)000838599200018 ()2-s2.0-85130302017 (Scopus ID)
Conference
17th Conference on Wireless On-Demand Network Systems and Services (WONS), MAR 30-APR 01, 2022, ELECTR NETWORK
Note

Part of proceedings: ISBN 978-3-903176-46-1, QC 20230222

Available from: 2022-08-31 Created: 2022-08-31 Last updated: 2023-02-22Bibliographically approved
Tütüncüoglu, F., Dán, G., Balador, A. & Williams, A. COPES: Contention-aware Pricing and Service Placement in Serverless Edge Computing.
Open this publication in new window or tab >>COPES: Contention-aware Pricing and Service Placement in Serverless Edge Computing
(English)Manuscript (preprint) (Other academic)
Abstract [en]

The commercial adoption of Edge Computingn (EC) will necessitate pricing schemes that align with the economic interests of Network Operator (NO), Service Providers (SPs) and mobile users. Pricing in EC is particularly challenging as it has to take into account resource constraints and contention among different workloads, which can lead to performance degradation and potentially to latency violations. We propose to model pricing subject to latency and resource constraints as an extensive game played between a NO that charges SP based on resource use, SP that charge their users a service specific price, and users that can decide to use the SP's services. We show that equilibria do exist, but computing an equilibrium is NP-hard even under complete information. We then propose an approximation algorithm for computing equilibrium prices for given application placement and we propose a novel, model-assisted Bayesian Optimization (BO) scheme combining probabilistic reparametrization and our approximation scheme. Extensive simulations show that our proposed approach achieves up to an order of magnitude higher revenue compared to state-of-the-art approaches, with a small computational overhead, and highlight the importance of taking into account contention among different workloads.

Keywords
Edge Computing, Computation Offloading, Resource Allocation, Bayesian Optimization
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:kth:diva-363366 (URN)
Note

Submitted to Submitted to IEEE Transactions on Mobile Computing

QC 20251230

Available from: 2025-05-14 Created: 2025-05-14 Last updated: 2025-12-30Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-5050-2373

Search in DiVA

Show all publications