A phased robotic assembly policy based on a PL-LSTM-SAC algorithmShow others and affiliations
2025 (English)In: Journal of manufacturing systems, ISSN 0278-6125, E-ISSN 1878-6642, Vol. 78, p. 351-369Article in journal (Refereed) Published
Abstract [en]
In order to address the problems with current robotic automated assembly such as limitations of model-based methods in unstructured assembly scenarios, low training efficiency of learning-based methods, and limited performance of policy generalization, this paper proposes two modeling methodologies based on deep reinforcement learning under the overall framework of phased assembly for complex robotic assembly tasks, i.e., separated-phased policy modeling (SPM) and integrated policy modeling (IPM). Regarding policy learning with deep reinforcement learning, we present a refined SAC algorithm that merges a policy-lead mechanism and an LSTM network (i.e., PL-LSTM-SAC). A comprehensive testbed based on the assembly of a triple-task planetary gear train is designed to validate the framework and the proposed approach. Experimental results indicate that the trained assembly policies for each task are effective under both policy modeling methodologies, but SPM has higher stability and policy convergence efficiency than IPM. Physical tests indicate the sim-to-real transferability of the trained policies with both SPM and IPM and an average success rate of 92.0 % is achieved. The PL-LSTM-SAC algorithm proposed can significantly accelerate training speed and enhance compliance and overall performance of assembly actions by a 13.9 % reduction in the average contact force during assembly processes.
Place, publisher, year, edition, pages
Elsevier BV , 2025. Vol. 78, p. 351-369
Keywords [en]
Deep reinforcement learning, LSTM network, Policy-lead mechanism, Robot learning, Robotic assembly
National Category
Robotics and automation
Identifiers
URN: urn:nbn:se:kth:diva-358235DOI: 10.1016/j.jmsy.2024.12.008ISI: 001403370200001Scopus ID: 2-s2.0-85212823822OAI: oai:DiVA.org:kth-358235DiVA, id: diva2:1924869
Note
QC 20250114
2025-01-072025-01-072025-12-05Bibliographically approved