Reinforcement Learning with World Models for Autonomous Excavation Optimization in Wheel Loaders
KTH. Volvo CE, Bolindervägen 5, Eskilstuna 63185, Sweden.
Volvo CE, Bolindervägen 5, Eskilstuna 63185, Sweden.
Volvo CE, Bolindervägen 5, Eskilstuna 63185, Sweden.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Decision and Control Systems (Automatic Control). ORCID iD: 0000-0002-4679-4673
2025 (English). Conference paper, Published paper (Refereed)
Abstract [en]

Automating the bucket-filling task in wheel loaders is challenging due to the complex, nonlinear interaction between the bucket and granular material. This work presents a model-based reinforcement learning approach to optimize the bucket-filling strategy for Zeux, Volvo's autonomous electric wheel loader concept. A Long Short-Term Memory (LSTM) surrogate model is trained on data from Volvo's high-fidelity simulator to emulate realistic dynamics, enabling efficient policy training using Proximal Policy Optimization (PPO) with imagined rollouts. This reduces computational cost and eliminates the need for direct interaction with the high-fidelity simulator during policy training. Compared to Volvo's current rule-based driver model, the learned policy achieves an 89% improvement in productivity and a 56% increase in energy efficiency. Our results show that world models can accelerate reinforcement learning for heavy machinery control, enabling the discovery of strategies that outperform controllers based on human expert behavior.
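The idea of training on imagined rollouts can be illustrated with a minimal Python sketch. It is not the authors' implementation: `ToySurrogate` is a hypothetical hand-written recurrent cell standing in for the trained LSTM, `toy_policy` is a placeholder for the PPO policy, and the state, action, and reward definitions are illustrative assumptions. The sketch only shows how trajectories can be generated from a learned model without calling the high-fidelity simulator; the PPO update itself is omitted.

```python
import math
import random
from dataclasses import dataclass


@dataclass
class ToySurrogate:
    """Hypothetical stand-in for a trained LSTM world model (fixed weights)."""
    decay: float = 0.9  # fraction of the hidden memory carried to the next step

    def step(self, hidden, state, action):
        # Recurrent update: blend the previous memory with the new input.
        hidden = self.decay * hidden + (1.0 - self.decay) * math.tanh(state + action)
        # Assumed state: a normalized bucket fill level clamped to [0, 1].
        next_state = min(1.0, max(0.0, state + 0.1 * action + 0.05 * hidden))
        # Assumed reward: fill gained minus a quadratic effort penalty.
        reward = (next_state - state) - 0.01 * action * action
        return hidden, next_state, reward


def toy_policy(state, rng):
    """Placeholder stochastic policy: push harder while the bucket is emptier."""
    action = 2.0 * (1.0 - state) + rng.gauss(0.0, 0.1)
    return max(-1.0, min(1.0, action))


def imagined_rollout(model, policy, start_state, horizon, rng):
    """Roll the policy forward inside the surrogate model only."""
    hidden, state = 0.0, start_state
    trajectory = []
    for _ in range(horizon):
        action = policy(state, rng)
        hidden, next_state, reward = model.step(hidden, state, action)
        trajectory.append((state, action, reward))
        state = next_state
    return trajectory


def discounted_return(trajectory, gamma=0.99):
    """Discounted sum of rewards, accumulated from the last step backwards."""
    total = 0.0
    for _, _, reward in reversed(trajectory):
        total = reward + gamma * total
    return total


if __name__ == "__main__":
    rng = random.Random(0)
    rollout = imagined_rollout(ToySurrogate(), toy_policy, 0.0, 20, rng)
    print(len(rollout), discounted_return(rollout))
```

In a full pipeline, batches of such imagined trajectories would feed a policy-gradient update (PPO in the paper), and the surrogate's weights would come from supervised training on simulator data rather than being fixed by hand.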

Place, publisher, year, edition, pages
Elsevier BV, 2025. Vol. 59, p. 72-77
Keywords [en]
Autonomous Systems, Bucket-Filling, Deep Learning, Heavy Machinery Simulation, Reinforcement Learning, Wheel loaders, World Models
National Category
Control Engineering; Computer Sciences; Robotics and automation; Computer Systems
Identifiers
URN: urn:nbn:se:kth:diva-375953
DOI: 10.1016/j.ifacol.2025.12.184
Scopus ID: 2-s2.0-105026936365
OAI: oai:DiVA.org:kth-375953
DiVA, id: diva2:2033397
Conference
66th International Conference of Scandinavian Simulation Society, SIMS 2025, Stavanger, Norway, Sep 22 2025 - Sep 24 2025
Note

QC 20260129

Available from: 2026-01-29. Created: 2026-01-29. Last updated: 2026-01-29. Bibliographically approved.

Open Access in DiVA

No full text in DiVA

Authority records

Proutiere, Alexandre

Search in DiVA

By author/editor
Morais, Duarte; Proutiere, Alexandre