kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Optimal Control and Coordination of Autonomous Intelligent Systems by Edge Computing
KTH, School of Industrial Engineering and Management (ITM), Engineering Design, Mechatronics and Embedded Control Systems.ORCID iD: 0000-0003-4535-3849
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Sustainable development
SDG 9: Industry, innovation and infrastructure, SDG 11: Sustainable cities and communities
Abstract [en]

Autonomous Intelligent Systems (AIS) are transforming various sectors by integrating advanced control theories, artificial intelligence, and cyber-physical systems. However, AIS control development faces significant challenges, including ensuring real-time responsiveness, designing adaptive controllers for dynamic environments, and coordinating multi-agent systems under uncertainties. These issues are exacerbated in resource-constrained settings, where balancing computational demands and real-time performance is critical.

To mitigate these challenges, this thesis leverages edge computing to enhance system performance, so that data-driven methods and optimal control technologies become feasible for complex AIS applications. Edge computing is a scheme that brings computation, communication, and storage resources closer to data sources, to achieve low-latency processing, real-time adaptability, and scalable solutions for AIS applications. It provides two key benefits: (1) offloading computationally intensive tasks to nearby edge servers, so as to ensure responsive and efficient operations despite constrained resources onboard, and (2) facilitating decentralized coordination among multiple agents by exploiting the edge server as a trustworthy node, so as to improve system scalability, reliability, and collaborative decision-making.

Building on the advantages of the offloading and coordination capabilities inherent in edge computing, this thesis investigates how these features can be harnessed to overcome the limitations of AIS in achieving optimal control and coordination. Primary contributions of this thesis include: (1) the development of state estimation and data-driven optimal control algorithms, which enables more precise estimation and control in nonlinear, time-variant systems; (2) the design of edge-based computational task offloading algorithms to achieve real-time adaptive control and learning by distributing computationally intensive tasks, which effectively balances latency and resource constraints; and (3) the introduction of decentralized optimization frameworks for multi-agent systems, which enhances scalability, robustness, and coordination under communication constraints by leveraging edge servers as trustworthy nodes for efficient collaboration and decision-making. All contributions have been validated through case studies in soft robotics and connected and autonomous vehicles, demonstrating their effectiveness and advancements over existing methods.

In summary, this thesis advances AIS capabilities by addressing real-time computational challenges and enabling optimal, data-driven control and decentralized coordination. The integration of edge computing improves the efficiency, scalability, and adaptability of AIS, offering promising opportunities for applications in autonomous mobility and other dynamic domains.

Abstract [sv]

Autonoma intelligenta system (AIS) omvandlar olika sektorer genom att integrera avancerad reglerteknik, artificiell intelligens och cyberfysiska system. Utvecklingen av styrsystem för AIS står dock inför betydande utmaningar, såsom att säkerställa realtidsrespons, utforma adaptiva regulatorer för dynamiska miljöer och samordna multiagentsystem under osäkerheter. Dessa utmaningar är särskilt framträdande i resursbegränsade miljöer, där det är avgörande att balansera beräkningskrav och realtidsprestanda.

För att hantera dessa utmaningar utnyttjar denna avhandling edge computing för att förbättra systemprestanda och möjliggöra data-drivna metoder samt optimal styrning för komplexa AIS-applikationer. Edge computing innebär att beräknings-, kommunikations- och lagringsresurser flyttas närmare datakällorna, vilket möjliggör låglatensbearbetning, realtidsanpassning och skalbara lösningar. Detta tillvägagångssätt erbjuder två betydelsefulla fördelar: (1) avlastning av beräkningstunga uppgifter till närliggande edge-servrar för att säkerställa responsiva och effektiva operationer trots begränsade resurser ombord, och (2) decentraliserad samordning mellan flera agenter genom att utnyttja edge-servrar som tillförlitliga noder, vilket förbättrar systemets skalbarhet, robusthet och collaborative decision-making.

Genom att dra nytta av edge computing och dess möjligheter till beräkningsavlastning och decentraliserad samordning undersöker denna avhandling hur dessa funktioner kan användas för att övervinna AIS-begränsningar inom optimal reglering och koordination. De huvudsakliga bidragen i avhandlingen inkluderar: (1) utveckling av algoritmer för tillståndsskattning och data-driven optimal styrning, vilket möjliggör mer exakt skattning och styrning av icke-linjära, tidsvarierande system; (2) design av edge-baserade algoritmer för beräkningsavlastning, vilket möjliggör realtidsanpassad reglering och inlärning genom att distribuera beräkningstunga uppgifter och därmed balansera latens och resursbegränsningar; samt (3) introduktion av decentraliserade optimeringsramverk för multiagentsystem, vilket förbättrar skalbarhet, robusthet och koordination under kommunikationsbegränsningar genom att utnyttja edge-servrar som pålitliga noder för effektivt samarbete och beslutsfattande. Alla bidrag har validerats genom fallstudier inom mjukrobotik och uppkopplade autonoma fordon, vilket påvisar deras effektivitet och förbättringar jämfört med befintliga metoder.

Sammanfattningsvis stärker denna avhandling AIS genom att adressera realtidsberäkningsutmaningar och möjliggöra optimal, data-driven styrning och decentraliserad koordination. Genom att integrera edge computing förbättras AIS effektivitet, skalbarhet och anpassningsförmåga, vilket öppnar för lovande tillämpningar inom autonom mobilitet och andra dynamiska områden.

Place, publisher, year, edition, pages
Stockholm: Kungliga Tekniska högskolan, 2025. , p. 77
Series
TRITA-ITM-AVL ; 2025:8
Keywords [en]
Optimal Control, Decentralized Coordination, Real-Time Control, Edge Computing, Soft Robotics, Connected and Autonomous Vehicles
Keywords [sv]
Optimal Reglering, Decentraliserad Koordinering, Realtidsstyrning, Edge Computing, Mjukrobotik, Uppkopplade och Autonoma Fordon vi
National Category
Control Engineering Robotics and automation Vehicle and Aerospace Engineering
Research subject
Machine Design
Identifiers
URN: urn:nbn:se:kth:diva-360950ISBN: 978-91-8106-220-5 (print)OAI: oai:DiVA.org:kth-360950DiVA, id: diva2:1942893
Public defence
2025-03-31, Gladan / https://kth-se.zoom.us/j/69313070114, Brinellvägen 85, Stockholm, 13:00 (English)
Opponent
Supervisors
Note

QC 20250320

Available from: 2025-03-07 Created: 2025-03-06 Last updated: 2025-12-17Bibliographically approved
List of papers
1. Shape Estimation of a 3D Printed Soft Sensor Using Multi-Hypothesis Extended Kalman Filter
Open this publication in new window or tab >>Shape Estimation of a 3D Printed Soft Sensor Using Multi-Hypothesis Extended Kalman Filter
2022 (English)In: IEEE Robotics and Automation Letters, E-ISSN 2377-3766, Vol. 7, no 3, p. 8383-8390Article in journal (Refereed) Published
Abstract [en]

This study develops a multi-hypothesis extended Kalman filter (MH-EKF) for the online estimation of the bending angle of a 3D printed soft sensor attached to soft actuators. Despite the advantage of compliance and low interference, the 3D printed soft sensor is susceptible to the hysteresis property and nonlinear effects. Improving measurement accuracy for sensors with hysteresis is a common challenge. Current studies mainly apply complex models and highly nonlinear functions to characterize the hysteresis, requiring a complicated parameter identification process and challenging real-time applications. This study enhances the model simplicity and the real-time performance for the hysteresis characterization. We identify the hysteresis by combining multiple polynomial functions and improving the sensor estimation with the proposed MH-EKF. We examine the performance of the filter in the real-time closed-loop control system. Compared with the baseline methods, the proposed approach shows improvements in the estimation accuracy with low computational complexity.

Place, publisher, year, edition, pages
IEEE, 2022
Keywords
Soft Sensors and Actuators, Modeling, Control, and Learning for Soft Robots, Hydraulic/Pneumatic Actuators
National Category
Control Engineering Signal Processing
Research subject
Electrical Engineering; Machine Design; Production Engineering
Identifiers
urn:nbn:se:kth:diva-315620 (URN)10.1109/lra.2022.3187832 (DOI)000838455200008 ()2-s2.0-85133735530 (Scopus ID)
Projects
TECoSA
Funder
KTH Royal Institute of TechnologyVinnova, TECoSAEU, Horizon 2020, InSecTT
Note

QC 20220912

Available from: 2022-07-14 Created: 2022-07-14 Last updated: 2025-03-06Bibliographically approved
2. Edge-enabled Adaptive Shape Estimation of 3D Printed Soft Actuators with Gaussian Processes and Unscented Kalman Filters
Open this publication in new window or tab >>Edge-enabled Adaptive Shape Estimation of 3D Printed Soft Actuators with Gaussian Processes and Unscented Kalman Filters
2024 (English)In: IEEE Transactions on Industrial Electronics, ISSN 0278-0046, E-ISSN 1557-9948, Vol. 71, no 3, p. 3044-3054Article in journal (Refereed) Published
Abstract [en]

Soft actuators have the advantages of compliance and adaptability when working with vulnerable objects, but the deformation shape of the soft actuators is difficult to measure or estimate. Soft sensors made of highly flexible and responsive materials are promising new approaches to the shape estimation of soft actuators, but suffer from highly nonlinear, hysteresis, and time-variant properties. A nonlinear and adaptive state observer is essential for the shape estimation from soft sensors. Current state estimation methods rely on complex nonlinear data-fitting models, and the robustness of the estimation methods is questionable. This study investigates the soft actuator dynamics and the soft sensor model as a stochastic process characterized by the Gaussian Process (GP) model. The unscented Kalman filter (UKF) is applied to the GP model for more reliable variance adjustment during the sequential state estimation process than conventional methods. In addition, a major limitation of the GP model is its computational complexity during online inference. To improve the real-time performance while guaranteeing accuracy, we introduce an edge server to decrease the onboard computational and memory overhead. The experiments showcase a significant improvement in estimation accuracy and real-time performance compared to baseline methods.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Soft Sensors and Actuators; soft robotics; Gaussian process; Unscented Kalman filter
National Category
Control Engineering Signal Processing
Research subject
Applied and Computational Mathematics, Optimization and Systems Theory; Machine Design; Industrial Information and Control Systems
Identifiers
urn:nbn:se:kth:diva-326512 (URN)10.1109/tie.2023.3270505 (DOI)001080899800082 ()2-s2.0-85159841244 (Scopus ID)
Projects
TECoSA
Funder
Vinnova, Tecosa
Note

QC 20230508

Available from: 2023-05-03 Created: 2023-05-03 Last updated: 2025-03-06Bibliographically approved
3. Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning
Open this publication in new window or tab >>Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning
(English)Manuscript (preprint) (Other academic)
Abstract [en]

This study presents an innovative approach tooptimal gait control for a soft quadruped robot enabled by four compressible tendon-driven soft actuators. Soft quadruped robots, compared to their rigid counterparts, are widely recognized for offering enhanced safety, lower weight, and simpler fabrication and control mechanisms. However, their highly deformable structure introduces nonlinear dynamics, making precise gait locomotion control complex. To solve this problem, we propose a novel model-based reinforcement learning (MBRL) method. The study employs a multi-stage approach, including state space restriction, data-driven surrogate model training, and MBRL development. Compared to benchmark methods, the proposed approach significantly improves the efficiency and performance of gait control policies. The developed policy is both robust and adaptable to the robot’s deformable morphology. The study concludes by highlighting the practical applicability of these findings in real-world scenarios.

National Category
Control Engineering Robotics and automation
Identifiers
urn:nbn:se:kth:diva-360941 (URN)
Note

Accepted by the 2025 IEEE International Conference on Robotics & Automation, 19–23 May, Atlanta, USA

This research has been carried out as part of the TECoSA Vinnova Com-petence Center for Trustworthy Edge Computing Systems and Applicationsat KTH Royal Institute of Technology

QC 20250307

Available from: 2025-03-06 Created: 2025-03-06 Last updated: 2025-03-07Bibliographically approved
4. Optimal gait design for a soft quadruped robot via multi-fidelity Bayesian optimization
Open this publication in new window or tab >>Optimal gait design for a soft quadruped robot via multi-fidelity Bayesian optimization
Show others...
2025 (English)In: Applied Soft Computing, ISSN 1568-4946, E-ISSN 1872-9681, Vol. 169, article id 112568Article in journal (Refereed) Published
Abstract [en]

This study focuses on the locomotion capability improvement in a tendon-driven soft quadruped robot through an online adaptive learning approach. Leveraging the inverse kinematics model of the soft quadruped robot, we employ a central pattern generator to design a parametric gait pattern, and use Bayesian optimization (BO) to find the optimal parameters. Further, to address the challenges of modeling discrepancies, we implement a multi-fidelity BO approach, combining data from both simulation and physical experiments throughout training and optimization. This strategy enables the adaptive refinement of the gait pattern and ensures a smooth transition from simulation to real-world deployment for the controller. Compared to previous result using a fixed gait pattern, the multi-fidelity BO approach improves the robot’s average walking speed from 0.14 m/s to 0.214 m/s, an increase of 52.7%. Moreover, we integrate a computational task off-loading architecture by edge computing, which reduces the onboard computational and memory overhead, to improve real-time control performance and facilitate an effective online learning process. The proposed approach successfully achieves optimal walking gait design for physical deployment with high efficiency, effectively addressing challenges related to the reality gap in soft robotics.

Place, publisher, year, edition, pages
Elsevier BV, 2025
Keywords
soft quadruped robot; Reality gap; Multi-fidelity Bayesian optimization; Edge computing
National Category
Robotics and automation Control Engineering Other Mechanical Engineering
Research subject
Applied and Computational Mathematics, Optimization and Systems Theory; Information and Communication Technology; Machine Design
Identifiers
urn:nbn:se:kth:diva-357777 (URN)10.1016/j.asoc.2024.112568 (DOI)001383577700001 ()2-s2.0-85211232861 (Scopus ID)
Projects
TECoSAKTH XPRES
Funder
Vinnova, TecosaXPRES - Initiative for excellence in production research
Note

QC 20250204

Available from: 2024-12-17 Created: 2024-12-17 Last updated: 2025-03-06Bibliographically approved
5. Decentralized Convex Optimization for Joint Task Offloading and Resource Allocation of Vehicular Edge Computing Systems
Open this publication in new window or tab >>Decentralized Convex Optimization for Joint Task Offloading and Resource Allocation of Vehicular Edge Computing Systems
2022 (English)In: IEEE Transactions on Vehicular Technology, ISSN 0018-9545, E-ISSN 1939-9359, Vol. 71, no 12, p. 13226-13241Article in journal (Refereed) Published
Abstract [en]

Vehicular Edge Computing (VEC) systems exploit resources on both vehicles and Roadside Units (RSUs) to provide services for real-time vehicular applications that cannot be completed in the vehicles alone. Two types of decisions are critical for VEC: one is for task offloading to migrate vehicular tasks to suitable RSUs, and the other is for resource allocation at the RSUs to provide the optimal amount of computational resource to the migrated tasks under constraints on response time and energy consumption. Most of the published optimization-based methods determine the optimal solutions of the two types of decisions jointly within one optimization problem at RSUs, but the complexity of solving the optimization problem is extraordinary, because the problem is not convex and has discrete variables. Meanwhile, the nature of centralized solutions requires extra information exchange between vehicles and RSUs, which is challenged by the additional communication delay and security issues. The contribution of this paper is to decompose the joint optimization problem into two decoupled subproblems: task offloading and resource allocation. Both subproblems are reformulated for efficient solutions. The resource allocation problem is simplified by dual decomposition and can be solved at vehicles in a decentralized way. The task offloading problem is transformed from a discrete problem to a continuous convex one by a probability-based solution. Our new method efficiently achieves a near-optimal solution through decentralized optimizations, and the error bound between the solution and the true optimum is analyzed. Simulation results demonstrate the advantage of the proposed approach.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2022
Keywords
Vehicular Edge Computing, Task offloading, Multi-server resource allocation, Hierarchical decomposition, Decentralized convex optimization
National Category
Telecommunications Control Engineering Communication Systems
Research subject
Applied and Computational Mathematics, Optimization and Systems Theory; Information and Communication Technology; Transport Science, Transport Infrastructure
Identifiers
urn:nbn:se:kth:diva-316311 (URN)10.1109/tvt.2022.3197627 (DOI)000908826000059 ()2-s2.0-85136055475 (Scopus ID)
Projects
TECoSAInSecTT
Funder
KTH Royal Institute of Technology, XPRESVinnova, TECoSAEU, Horizon 2020, InSecTT
Note

QC 20251222

Available from: 2022-08-11 Created: 2022-08-11 Last updated: 2025-12-22Bibliographically approved
6. Collaborative Collision Avoidance of Connected Vehicles Using ADMM with PI-Regulated Lagrangian Multipliers
Open this publication in new window or tab >>Collaborative Collision Avoidance of Connected Vehicles Using ADMM with PI-Regulated Lagrangian Multipliers
2023 (English)In: 2023 IEEE 19th International Conference on Automation Science and Engineering, CASE 2023, Institute of Electrical and Electronics Engineers (IEEE) , 2023Conference paper, Published paper (Refereed)
Abstract [en]

The decentralized approach is popular for the collaborative collision avoidance of connected vehicles in many scenarios. By modeling the task as a collaborative optimal control problem, Lagrangian methods are widely used to decouple the constraints and enable the decentralized solution. However, potential constraint-violating behavior will lead to oscillations during the Lagrangian update, resulting in more iterations and lower real-time efficiency. Existing methods generally neither address this shortcoming, nor explore the Lagrangian update mechanism. This study takes a control perspective, and solves this collaborative optimal control problem based on an extension of the Alternating Directions Method of Multipliers (ADMM) algorithm by performing the iteration update with a Proportional-Integral-(PI-) regulated controller. The link between the Lagrangian optimization and the PI controller improves the convergence performance during iterations. Simulation results in traffic intersection scenarios demonstrate the advantage of the proposed approach.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023
National Category
Control Engineering
Identifiers
urn:nbn:se:kth:diva-350186 (URN)10.1109/CASE56687.2023.10260658 (DOI)2-s2.0-85174395188 (Scopus ID)
Conference
19th IEEE International Conference on Automation Science and Engineering, CASE 2023, Auckland, New Zealand, Aug 26 2023 - Aug 30 2023
Note

Part of ISBN 9798350320695

QC 20240708

Available from: 2024-07-08 Created: 2024-07-08 Last updated: 2025-03-06Bibliographically approved
7. Towards Mitigating Communication Latency Influence in Connected Vehicle Networks by Stochastic Decentralized Model Predictive Control
Open this publication in new window or tab >>Towards Mitigating Communication Latency Influence in Connected Vehicle Networks by Stochastic Decentralized Model Predictive Control
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Communication delays in Connected and Auto-mated Vehicle (CAV) networks significantly impact decentralized optimization-based coordination, increasing risks of collisions and degrading system performance. Existing methods are limited by real-time computational challenges, vulnerability to outdated data, scalability constraints, and difficulties in managing uncertainties. This paper presents a Stochastic Decentralized Model Predictive Control (SDMPC) framework to mitigate the adverse effects of communication delays by incorporating a novel stochastic approximation method for modeling uncertainties. Our approach provides a tight probabilistic bound on safety constraints, ensuring accurate trajectory predictions and improved coordination. Simulation results show that the proposed SDMPC framework reduces the average trajectory deviation and lowers collision risks compared to conventional methods under various communication latency conditions. These improvements make SDMPC an effective solution for large-scale CAV networks, enhancing both safety and efficiency.

National Category
Control Engineering
Identifiers
urn:nbn:se:kth:diva-360940 (URN)
Note

This research has been carried out as part of the TECoSA Vinnova Com-petence Center for Trustworthy Edge Computing Systems and Applicationsat KTH Royal Institute of Technology, and also supported by Enablersfor trustworthy, infrastructure supported, autonomous vehicles (ENTICE)project.

QC 20250307

Available from: 2025-03-06 Created: 2025-03-06 Last updated: 2025-03-07Bibliographically approved

Open Access in DiVA

summary(13890 kB)5006 downloads
File information
File name FULLTEXT01.pdfFile size 13890 kBChecksum SHA-512
0d33667513362797d6284f37841a52a6aa22d74834d6c224195af4a00f518b9e0d670b1b6af79e9575545715b0e3b420f90343de0388a325e974eee9e42cb335
Type fulltextMimetype application/pdf

Authority records

Tan, Kaige

Search in DiVA

By author/editor
Tan, Kaige
By organisation
Mechatronics and Embedded Control Systems
Control EngineeringRobotics and automationVehicle and Aerospace Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 5006 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1381 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf