kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning
Cyber-physical Systems Lab, Department of Information Technology, Uppsala University, Uppsala, Sweden.ORCID iD: 0009-0003-2381-1850
KTH, School of Industrial Engineering and Management (ITM), Engineering Design, Mechatronics and Embedded Control Systems.ORCID iD: 0000-0003-4535-3849
Cyber-physical Systems Lab, Department of Information Technology, Uppsala University, Uppsala, Sweden.ORCID iD: 0000-0002-8853-4159
KTH, School of Industrial Engineering and Management (ITM), Engineering Design, Mechatronics and Embedded Control Systems.ORCID iD: 0000-0001-5703-5923
(English)Manuscript (preprint) (Other academic)
Abstract [en]

This study presents an innovative approach tooptimal gait control for a soft quadruped robot enabled by four compressible tendon-driven soft actuators. Soft quadruped robots, compared to their rigid counterparts, are widely recognized for offering enhanced safety, lower weight, and simpler fabrication and control mechanisms. However, their highly deformable structure introduces nonlinear dynamics, making precise gait locomotion control complex. To solve this problem, we propose a novel model-based reinforcement learning (MBRL) method. The study employs a multi-stage approach, including state space restriction, data-driven surrogate model training, and MBRL development. Compared to benchmark methods, the proposed approach significantly improves the efficiency and performance of gait control policies. The developed policy is both robust and adaptable to the robot’s deformable morphology. The study concludes by highlighting the practical applicability of these findings in real-world scenarios.

National Category
Control Engineering Robotics and automation
Identifiers
URN: urn:nbn:se:kth:diva-360941OAI: oai:DiVA.org:kth-360941DiVA, id: diva2:1942854
Note

Accepted by the 2025 IEEE International Conference on Robotics & Automation, 19–23 May, Atlanta, USA

This research has been carried out as part of the TECoSA Vinnova Com-petence Center for Trustworthy Edge Computing Systems and Applicationsat KTH Royal Institute of Technology

QC 20250307

Available from: 2025-03-06 Created: 2025-03-06 Last updated: 2025-03-07Bibliographically approved
In thesis
1. Optimal Control and Coordination of Autonomous Intelligent Systems by Edge Computing
Open this publication in new window or tab >>Optimal Control and Coordination of Autonomous Intelligent Systems by Edge Computing
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Autonomous Intelligent Systems (AIS) are transforming various sectors by integrating advanced control theories, artificial intelligence, and cyber-physical systems. However, AIS control development faces significant challenges, including ensuring real-time responsiveness, designing adaptive controllers for dynamic environments, and coordinating multi-agent systems under uncertainties. These issues are exacerbated in resource-constrained settings, where balancing computational demands and real-time performance is critical.

To mitigate these challenges, this thesis leverages edge computing to enhance system performance, so that data-driven methods and optimal control technologies become feasible for complex AIS applications. Edge computing is a scheme that brings computation, communication, and storage resources closer to data sources, to achieve low-latency processing, real-time adaptability, and scalable solutions for AIS applications. It provides two key benefits: (1) offloading computationally intensive tasks to nearby edge servers, so as to ensure responsive and efficient operations despite constrained resources onboard, and (2) facilitating decentralized coordination among multiple agents by exploiting the edge server as a trustworthy node, so as to improve system scalability, reliability, and collaborative decision-making.

Building on the advantages of the offloading and coordination capabilities inherent in edge computing, this thesis investigates how these features can be harnessed to overcome the limitations of AIS in achieving optimal control and coordination. Primary contributions of this thesis include: (1) the development of state estimation and data-driven optimal control algorithms, which enables more precise estimation and control in nonlinear, time-variant systems; (2) the design of edge-based computational task offloading algorithms to achieve real-time adaptive control and learning by distributing computationally intensive tasks, which effectively balances latency and resource constraints; and (3) the introduction of decentralized optimization frameworks for multi-agent systems, which enhances scalability, robustness, and coordination under communication constraints by leveraging edge servers as trustworthy nodes for efficient collaboration and decision-making. All contributions have been validated through case studies in soft robotics and connected and autonomous vehicles, demonstrating their effectiveness and advancements over existing methods.

In summary, this thesis advances AIS capabilities by addressing real-time computational challenges and enabling optimal, data-driven control and decentralized coordination. The integration of edge computing improves the efficiency, scalability, and adaptability of AIS, offering promising opportunities for applications in autonomous mobility and other dynamic domains.

Abstract [sv]

Autonoma intelligenta system (AIS) omvandlar olika sektorer genom att integrera avancerad reglerteknik, artificiell intelligens och cyberfysiska system. Utvecklingen av styrsystem för AIS står dock inför betydande utmaningar, såsom att säkerställa realtidsrespons, utforma adaptiva regulatorer för dynamiska miljöer och samordna multiagentsystem under osäkerheter. Dessa utmaningar är särskilt framträdande i resursbegränsade miljöer, där det är avgörande att balansera beräkningskrav och realtidsprestanda.

För att hantera dessa utmaningar utnyttjar denna avhandling edge computing för att förbättra systemprestanda och möjliggöra data-drivna metoder samt optimal styrning för komplexa AIS-applikationer. Edge computing innebär att beräknings-, kommunikations- och lagringsresurser flyttas närmare datakällorna, vilket möjliggör låglatensbearbetning, realtidsanpassning och skalbara lösningar. Detta tillvägagångssätt erbjuder två betydelsefulla fördelar: (1) avlastning av beräkningstunga uppgifter till närliggande edge-servrar för att säkerställa responsiva och effektiva operationer trots begränsade resurser ombord, och (2) decentraliserad samordning mellan flera agenter genom att utnyttja edge-servrar som tillförlitliga noder, vilket förbättrar systemets skalbarhet, robusthet och collaborative decision-making.

Genom att dra nytta av edge computing och dess möjligheter till beräkningsavlastning och decentraliserad samordning undersöker denna avhandling hur dessa funktioner kan användas för att övervinna AIS-begränsningar inom optimal reglering och koordination. De huvudsakliga bidragen i avhandlingen inkluderar: (1) utveckling av algoritmer för tillståndsskattning och data-driven optimal styrning, vilket möjliggör mer exakt skattning och styrning av icke-linjära, tidsvarierande system; (2) design av edge-baserade algoritmer för beräkningsavlastning, vilket möjliggör realtidsanpassad reglering och inlärning genom att distribuera beräkningstunga uppgifter och därmed balansera latens och resursbegränsningar; samt (3) introduktion av decentraliserade optimeringsramverk för multiagentsystem, vilket förbättrar skalbarhet, robusthet och koordination under kommunikationsbegränsningar genom att utnyttja edge-servrar som pålitliga noder för effektivt samarbete och beslutsfattande. Alla bidrag har validerats genom fallstudier inom mjukrobotik och uppkopplade autonoma fordon, vilket påvisar deras effektivitet och förbättringar jämfört med befintliga metoder.

Sammanfattningsvis stärker denna avhandling AIS genom att adressera realtidsberäkningsutmaningar och möjliggöra optimal, data-driven styrning och decentraliserad koordination. Genom att integrera edge computing förbättras AIS effektivitet, skalbarhet och anpassningsförmåga, vilket öppnar för lovande tillämpningar inom autonom mobilitet och andra dynamiska områden.

Place, publisher, year, edition, pages
Stockholm: Kungliga Tekniska högskolan, 2025. p. 77
Series
TRITA-ITM-AVL ; 2025:8
Keywords
Optimal Control, Decentralized Coordination, Real-Time Control, Edge Computing, Soft Robotics, Connected and Autonomous Vehicles, Optimal Reglering, Decentraliserad Koordinering, Realtidsstyrning, Edge Computing, Mjukrobotik, Uppkopplade och Autonoma Fordon vi
National Category
Control Engineering Robotics and automation Vehicle and Aerospace Engineering
Research subject
Machine Design
Identifiers
urn:nbn:se:kth:diva-360950 (URN)978-91-8106-220-5 (ISBN)
Public defence
2025-03-31, Gladan / https://kth-se.zoom.us/j/69313070114, Brinellvägen 85, Stockholm, 13:00 (English)
Opponent
Supervisors
Available from: 2025-03-07 Created: 2025-03-06 Last updated: 2025-03-20Bibliographically approved

Open Access in DiVA

fulltext(7752 kB)82 downloads
File information
File name FULLTEXT01.pdfFile size 7752 kBChecksum SHA-512
36aa0509fa2976424071b46c7d538527792ee1957022cfd4778e6621c66b2e22dd66033d0c7bd29bc79afe6f06abac9aef56270524a591f1512ef1ba472d036c
Type fulltextMimetype application/pdf

Authority records

Tan, KaigeFeng, Lei

Search in DiVA

By author/editor
Niu, XuezhiTan, KaigeGürdür Broo, DidemFeng, Lei
By organisation
Mechatronics and Embedded Control Systems
Control EngineeringRobotics and automation

Search outside of DiVA

GoogleGoogle Scholar
Total: 84 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 373 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf