kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (7 of 7) Show all publications
Zhong, R., Hu, B., Liu, Z., Qin, Q., Feng, Y., Wang, X. V., . . . Tan, J. (2026). A two-stage framework for learning human-to-robot object handover policy from 4D spatiotemporal flow. Robotics and Computer-Integrated Manufacturing, 98, Article ID 103171.
Open this publication in new window or tab >>A two-stage framework for learning human-to-robot object handover policy from 4D spatiotemporal flow
Show others...
2026 (English)In: Robotics and Computer-Integrated Manufacturing, ISSN 0736-5845, E-ISSN 1879-2537, Vol. 98, article id 103171Article in journal (Refereed) Published
Abstract [en]

Natural and safe Human-to-Robot (H2R) object handover is a critical capability for effective Human–Robot Collaboration (HRC). However, learning a robust handover policy for this task is often hindered by the prohibitive cost of collecting physical robot demonstrations and the limitations of simplistic state representations that inadequately capture the complex dynamics of the interaction. To address these challenges, a two-stage learning framework is proposed that synthesizes substantially augmented, synthetically diverse handover demonstrations without requiring a physical robot and subsequently learns a handover policy from a rich 4D spatiotemporal flow. First, an offline, physical robot-free data-generation pipeline is introduced that produces augmented and diverse handover demonstrations, thereby eliminating the need for costly physical data collection. Second, a novel 4D spatiotemporal flow is defined as a comprehensive representation consisting of a skeletal kinematic flow that captures high-level motion dynamics and a geometric motion flow that characterizes fine-grained surface interactions. Finally, a diffusion-based policy conditioned on this spatiotemporal representation is developed to generate coherent and anticipatory robot actions. Extensive experiments demonstrate that the proposed method significantly outperforms state-of-the-art baselines in task success, efficiency, and motion quality, thereby paving the way for safer and more intuitive collaborative robots.

Place, publisher, year, edition, pages
Elsevier BV, 2026
Keywords
Diffusion models, Human-to-robot object handovers, Human–robot collaboration, Industry 5.0, Robot learning, Spatiotemporal representation
National Category
Robotics and automation
Identifiers
urn:nbn:se:kth:diva-372611 (URN)10.1016/j.rcim.2025.103171 (DOI)001606292700001 ()2-s2.0-105019705885 (Scopus ID)
Note

QC 20251113

Available from: 2025-11-13 Created: 2025-11-13 Last updated: 2025-11-13Bibliographically approved
Liu, Z., Silva, J., Zhong, R., Qin, Q., Roy, N., Nan Fernandez-Ayala, V., . . . Wang, L. (2026). ConstrucTwin: Digital Twin-Driven Multirobot Construction System Toward Industry 5.0. IEEE Transactions on Systems, Man & Cybernetics. Systems, 56(4), 2924-2939
Open this publication in new window or tab >>ConstrucTwin: Digital Twin-Driven Multirobot Construction System Toward Industry 5.0
Show others...
2026 (English)In: IEEE Transactions on Systems, Man & Cybernetics. Systems, ISSN 2168-2216, E-ISSN 2168-2232, Vol. 56, no 4, p. 2924-2939Article in journal (Refereed) Published
Abstract [en]

Rapid advancements in digitalization and artificial intelligence (AI) have catalyzed the adoption of digital twin technologies in the construction sector, enabling real-time synchronization between virtual models and physical systems. Simultaneously, on-site robotic automation has shown promise for reducing physical workloads, enhancing productivity, and contributing to sustainability goals that are key values of Industry 5.0. However, current digital twin implementations rarely incorporate multirobot construction systems, often relying on single-robot approaches or purely offline simulations. This gap hinders the realization of truly integrated construction environments that combine sensing, data analytics, wireless communications, and multirobot coordination. In response, this article proposes ConstrucTwin, a digital twin-driven multirobot construction framework designed to support complex construction tasks in real-world settings. By combining a 5G communication estimation-involved architecture and a cross-level planning strategy, ConstrucTwin streamlines interactions between physical robots and their digital counterparts. Essential tasks such as motion and task-level planning, as well as remote human-in-the-loop (HIL) oversight, are orchestrated within a single unified architecture. Through case studies involving rebar cage and brick wall construction, we demonstrate how an integrated approach to vision-based servoing and multirobot coordination enhances execution speed, precision, and scalability. The results underscore the system’s potential to advance human-centric, resilient, and sustainable construction, thereby aligning with the broader vision of Industry 5.0.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2026
Keywords
Digital twin, Industry 5.0, multirobot construction, smart construction
National Category
Construction Management Computer Sciences Robotics and automation
Identifiers
urn:nbn:se:kth:diva-377918 (URN)10.1109/TSMC.2026.3658622 (DOI)001696642500001 ()2-s2.0-105030692936 (Scopus ID)
Note

QC 20260320

Available from: 2026-03-11 Created: 2026-03-11 Last updated: 2026-03-20Bibliographically approved
Zhong, R., Hu, B., Feng, Y., Liu, Z., Qin, Q., Wang, X. V., . . . Tan, J. (2026). FineMLD: A fine-grained motion latent diffusion for human motion prediction in Human-robot Collaboration. Advanced Engineering Informatics, 70, Article ID 104119.
Open this publication in new window or tab >>FineMLD: A fine-grained motion latent diffusion for human motion prediction in Human-robot Collaboration
Show others...
2026 (English)In: Advanced Engineering Informatics, ISSN 1474-0346, E-ISSN 1873-5320, Vol. 70, article id 104119Article in journal (Refereed) Published
Abstract [en]

Human-robot Collaboration (HRC) in unstructured environments increasingly underscores enabling robots to perceive, understand, and predict human motions to enhance its safety and efficiency, aligning with the principles of Industry 5.0. However, existing studies about human motion prediction primarily rely on deterministic methods, which struggle to capture and quantify the uncertainty in human movements, posing significant safety risks in collaborative environments. To this end, we propose a novel Fine-grained Motion Latent Diffusion (FineMLD) model tailored to generate precise and diverse human motion predictions, providing a crucial foundation for safe HRC. First, we introduce a scale-aware temporal motion latent space to capture spatio-temporal dependencies by using a time-aware human motion encoder and hierarchical motion disentanglement. The hierarchical motion disentanglement achieves a compact latent representation of the motion while enabling more precise motion decoding. Second, we perform mask-based fine-grained motion latent diffusion that incorporates trajectory-guided denoised prediction and motion mask-driven resampling modulation to refine motion predictions. The trajectory-guided denoised prediction integrates observed robot motion trajectories to guide human motion predictions, while the motion mask-driven resampling modulation aligns observed and predicted motions by using a motion mask, thereby improving the consistency and coherence of motion sequences. Finally, we validate the performance of FineMLD on a public dataset, demonstrating its state-of-the-art performance in prediction accuracy and diversity. Furthermore, we conduct physical experiments on a robotic platform to confirm its real-time applicability and robustness in a real-world laboratory setting, highlighting its potential for deployment in unstructured HRC tasks.

Place, publisher, year, edition, pages
Elsevier BV, 2026
Keywords
Human-robot Collaboration, Human motion prediction, Latent diffusion, Industry 5.0
National Category
Robotics and automation
Identifiers
urn:nbn:se:kth:diva-376683 (URN)10.1016/j.aei.2025.104119 (DOI)001632701000001 ()2-s2.0-105023473320 (Scopus ID)
Note

QC 20260223

Available from: 2026-02-23 Created: 2026-02-23 Last updated: 2026-02-23Bibliographically approved
Qin, Q., Liu, Z., Zhong, R., Wang, X. V., Wang, L., Wiktorsson, M. & Wang, W. (2026). Robot digital twin systems in manufacturing: Technologies, applications, trends and challenges. Robotics and Computer-Integrated Manufacturing, 97, Article ID 103103.
Open this publication in new window or tab >>Robot digital twin systems in manufacturing: Technologies, applications, trends and challenges
Show others...
2026 (English)In: Robotics and Computer-Integrated Manufacturing, ISSN 0736-5845, E-ISSN 1879-2537, Vol. 97, article id 103103Article, review/survey (Refereed) Published
Abstract [en]

The manufacturing industry is undergoing a profound transformation toward smart, digital, and flexible production systems under the Industry 4.0 framework. Within this paradigm, Digital Twin (DT) serves as a key enabler, bridging physical and digital domains to simulate, analyse, and optimise manufacturing operations. Concurrently, robotic systems, enhanced by smart sensor perception, Industrial Internet of Things connectivity, and adaptive control mechanisms, are increasingly deployed to handle complex and dynamic tasks. However, the evolving demands of the modern manufacturing industry require a high degree of flexibility and responsiveness, necessitating more intelligent solutions. The Robot Digital Twin (RDT) has emerged as a transformative approach, facilitating dynamic adaptation and continuous operational improvement. This review offers a comprehensive examination of the literature on RDT in manufacturing from both technology and application perspectives, aiming to provide insight for researchers and practitioners in Industry 4.0. The paper introduces a four-layer RDT system architecture and summarises how Industry 4.0 technologies, e.g., the Industrial Internet of Things, Cloud/Edge Computing, 5 G, Virtual Reality, Modelling and Simulation, and Artificial Intelligence, converge and influence the RDT system based on this architecture. Furthermore, the review covers domain-specific and system-level applications, such as assembly, machining, grasping, material handling, human-robot interaction, predictive maintenance, and additive manufacturing systems, with an analysis of their development status. Finally, the trends, practical challenges, and future research directions for RDT systems in manufacturing are summarised at different levels.

Place, publisher, year, edition, pages
Elsevier BV, 2026
Keywords
Advanced robotics, Digital twin, Industry 4.0, Smart manufacturing
National Category
Production Engineering, Human Work Science and Ergonomics Robotics and automation
Identifiers
urn:nbn:se:kth:diva-369277 (URN)10.1016/j.rcim.2025.103103 (DOI)001582099600001 ()2-s2.0-105013503596 (Scopus ID)
Note

QC 20250903

Available from: 2025-09-03 Created: 2025-09-03 Last updated: 2025-12-05Bibliographically approved
Liu, S., Guo, D., Liu, Z., Wang, T., Qin, Q., Wang, X. V. & Wang, L. (2025). A Digital Twin-Enabled Approach to Reliable Human–robot Collaborative Assembly. In: Human Centric Smart Manufacturing Towards Industry 5 0: (pp. 281-304). Springer Nature
Open this publication in new window or tab >>A Digital Twin-Enabled Approach to Reliable Human–robot Collaborative Assembly
Show others...
2025 (English)In: Human Centric Smart Manufacturing Towards Industry 5 0, Springer Nature , 2025, p. 281-304Chapter in book (Other academic)
Abstract [en]

The conventional automation approach has shown bottlenecks in the era of component assembly. What could be automated has been automated in some high tech industrial production, leaving manual work performed by humans. To achieve ergonomic working environments and better productivity, human–robot collabora tion has been adopted for this purpose through combining the strength, accuracy and repeatability of robots with adaptability, high-level cognition, and flexibility. A reliable human–robot collaborative setting should be supported by dynamically updated and precise models. For this purpose, the digital twin can realise the digital representation of physical collaborative settings through simulation modelling and data synchronisation but is limited by communication delay and constraints. This chapter will develop a digital twin-enabled approach to human–robot collaborative assembly. Within this approach, a sensor-driven 3D modelling of the physical devices of interest is developed to realise the physical-to-digital transformation of human–robot workcell, and a Wise-ShopFloor-based platform enabled by sensor data is used to develop a digital twin model of the physical human–robot workcell. Then, function blocks with embedded algorithms are used for assembly planning, decision making and robot control, and a time-ahead execution and planning approach is developed for reliable human–robot collaborative assembly. Finally, the performance of the developed system is demonstrated by a case study of a partial car engine assembly.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Assembly, Digital twin, Robot
National Category
Production Engineering, Human Work Science and Ergonomics
Identifiers
urn:nbn:se:kth:diva-368723 (URN)10.1007/978-3-031-82170-7_12 (DOI)2-s2.0-105012012683 (Scopus ID)
Note

Part of ISBN 9783031821691, 9783031821707

QC 20250820

Available from: 2025-08-20 Created: 2025-08-20 Last updated: 2025-08-20Bibliographically approved
Liu, S., Liu, Z., Qin, Q., Wang, X. V. & Wang, L. (2025). Multimodal human-robot collaboration: advancements and future directions. International Journal of Manufacturing Research, 20(5), 1-47
Open this publication in new window or tab >>Multimodal human-robot collaboration: advancements and future directions
Show others...
2025 (English)In: International Journal of Manufacturing Research, ISSN 1750-0591, Vol. 20, no 5, p. 1-47Article in journal (Refereed) Published
Abstract [en]

Human-robot collaboration (HRC) envisioned for future factories has been actively explored to facilitate higher overall productivity. The wide applications of HRC in multiple fields, such as manufacturing and production, have seen a series of milestones. In recent years, a shift towards intuitive and natural collaboration between humans and robots has been investigated and discussed for symbiotic scenarios and complex tasks. For this purpose, advancements of multimodality in HRC enable multimodal human-robot interactions and collaboration by utilising different communication channels such as auditory, vision, gestures, haptics, and even brain signals. In addition, understanding human behaviours in terms of intent and motion can be beneficial in achieving mutual human-robot assistance. Within an HRC setting, the digital twin of such a physical collaborative workcell offers a promising tool to implement sim-to-real transformation and on-demand support for real practice. Within the context, this study provides an overview of the past and current status of multimodal HRC and its applications and highlights future research directions.

Place, publisher, year, edition, pages
Inderscience Publishers, 2025
Keywords
assembly, large language model, LLM, multimodality, robot
National Category
Robotics and automation Production Engineering, Human Work Science and Ergonomics
Identifiers
urn:nbn:se:kth:diva-373681 (URN)10.1504/IJMR.2025.149872 (DOI)001616375500001 ()2-s2.0-105022420879 (Scopus ID)
Note

QC 20251208

Available from: 2025-12-08 Created: 2025-12-08 Last updated: 2025-12-08Bibliographically approved
Wang, S., Sun, G., Ma, F., Hu, T., Qin, Q., Song, Y., . . . Liang, J. (2024). DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving. In: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024: . Paper presented at 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024, Abu Dhabi, United Arab Emirates, Oct 14 2024 - Oct 18 2024 (pp. 14241-14247). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving
Show others...
2024 (English)In: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 14241-14247Conference paper, Published paper (Refereed)
Abstract [en]

Evaluating and training autonomous driving systems require diverse and scalable corner cases. However, most existing scene generation methods lack controllability, accuracy, and versatility, resulting in unsatisfactory generation results. Inspired by DragGAN in image generation, we propose DragTraffic, a generalized, interactive, and controllable traffic scene generation framework based on conditional diffusion. DragTraffic enables non-experts to generate a variety of realistic driving scenarios for different types of traffic agents through an adaptive mixture expert architecture. We employ a regression model to provide a general initial solution and a refinement process based on the conditional diffusion model to ensure diversity. User-customized context is introduced through cross-attention to ensure high controllability. Experiments on a real-world driving dataset show that DragTraffic outperforms existing methods in terms of authenticity, diversity, and freedom. Demo videos and code are available at https://chantsss.github.io/Dragtraffic/.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
National Category
Computer graphics and computer vision Computer Systems
Identifiers
urn:nbn:se:kth:diva-359875 (URN)10.1109/IROS58592.2024.10801623 (DOI)001433985300862 ()2-s2.0-85216467705 (Scopus ID)
Conference
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024, Abu Dhabi, United Arab Emirates, Oct 14 2024 - Oct 18 2024
Note

Part of ISBN 9798350377705]

QC 20250213

Available from: 2025-02-12 Created: 2025-02-12 Last updated: 2025-06-12Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0009-0008-5481-3484

Search in DiVA

Show all publications