kth.sePublications
Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (10 of 81) Show all publications
Mohamed, Y., Lemaignan, S., Güneysu, A., Jensfelt, P. & Smith, C. (2025). Are You an Expert? Instruction Adaptation Using Multi-Modal Affect Detections with Thermal Imaging and Context. In: : . Paper presented at IEEE International Conference on Robot and Human Interactive Communication, Eindhoven University of Technology, Eindhoven, The Netherlands, Aug 25-29, 2025..
Open this publication in new window or tab >>Are You an Expert? Instruction Adaptation Using Multi-Modal Affect Detections with Thermal Imaging and Context
Show others...
2025 (English)Conference paper, Published paper (Refereed)
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-369162 (URN)
Conference
IEEE International Conference on Robot and Human Interactive Communication, Eindhoven University of Technology, Eindhoven, The Netherlands, Aug 25-29, 2025.
Available from: 2025-08-29 Created: 2025-08-29 Last updated: 2025-09-05Bibliographically approved
Styrud, J., Iovino, M., Norrlöf, M., Björkman, M. & Smith, C. (2025). Automatic Behavior Tree Expansion with LLMs for Robotic Manipulation. In: 2025 IEEE International Conference on Robotics and Automation, ICRA 2025: . Paper presented at 2025 IEEE International Conference on Robotics and Automation, ICRA 2025, Atlanta, United States of America, May 19 2025 - May 23 2025 (pp. 1225-1232). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Automatic Behavior Tree Expansion with LLMs for Robotic Manipulation
Show others...
2025 (English)In: 2025 IEEE International Conference on Robotics and Automation, ICRA 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 1225-1232Conference paper, Published paper (Refereed)
Abstract [en]

Robotic systems for manipulation tasks are increasingly expected to be easy to configure for new tasks or unpredictable environments, while keeping a transparent policy that is readable and verifiable by humans. We propose the method BEhavior TRee eXPansion with Large Language Models (BETR-XP-LLM) to dynamically and automatically expand and configure Behavior Trees as policies for robot control. The method utilizes an LLM to resolve errors outside the task planner's capabilities, both during planning and execution. We show that the method is able to solve a variety of tasks and failures and permanently update the policy to handle similar problems in the future.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
National Category
Robotics and automation Computer Sciences Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-371379 (URN)10.1109/ICRA55743.2025.11127942 (DOI)2-s2.0-105016707385 (Scopus ID)
Conference
2025 IEEE International Conference on Robotics and Automation, ICRA 2025, Atlanta, United States of America, May 19 2025 - May 23 2025
Note

Part of ISBN 9798331541392

QC 20251010

Available from: 2025-10-10 Created: 2025-10-10 Last updated: 2025-10-10Bibliographically approved
Iovino, M., Förster, J., Falco, P., Jen Chung, J., Siegwart, R. & Smith, C. (2025). Comparison between Behavior Trees and Finite State Machines. IEEE Transactions on Automation Science and Engineering, 22, 21098-21117
Open this publication in new window or tab >>Comparison between Behavior Trees and Finite State Machines
Show others...
2025 (English)In: IEEE Transactions on Automation Science and Engineering, ISSN 1545-5955, E-ISSN 1558-3783, Vol. 22, p. 21098-21117Article in journal (Refereed) Published
Abstract [en]

Behavior Trees (BTs) were first conceived in the computer games industry as a tool to model agent behavior, but they received interest also in the robotics community as an alternative policy design to Finite State Machines (FSMs). The advantages of BTs over FSMs had been highlighted in many works, but there is no thorough practical comparison of the two designs. Such a comparison is particularly relevant in the robotic industry, where FSMs have been the state-of-The-Art policy representation for robot control for many years. In this work we shed light on this matter by comparing how BTs and FSMs behave when controlling a robot in a mobile manipulation task. The comparison is made in terms of reactivity, modularity, readability, and design. We propose metrics for each of these properties, being aware that while some are tangible and objective, others are more subjective and implementation dependent. The practical comparison is performed in a simulation environment with validation on a real robot. We find that although the robot's behavior during task solving is independent on the policy representation, maintaining a BT rather than an FSM becomes easier as the task increases in complexity.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
Behavior Trees, Collaborative Robotics, Finite State Machines, Mobile Manipulation, Robot Control
National Category
Robotics and automation
Identifiers
urn:nbn:se:kth:diva-371275 (URN)10.1109/TASE.2025.3610090 (DOI)001579020600013 ()2-s2.0-105016561233 (Scopus ID)
Note

QC 20251013

Available from: 2025-10-13 Created: 2025-10-13 Last updated: 2025-10-13Bibliographically approved
Mohamed, Y., Lemaignan, S., Güneysu, A., Jensfelt, P. & Smith, C. (2025). Context Matters: Understanding Socially Appropriate Affective Responses Via Sentence Embeddings. In: Social Robotics - 16th International Conference, ICSR + AI 2024, Proceedings: . Paper presented at 16th International Conference on Social Robotics, ICSR + AI 2024, Odense, Denmark, October 23-26, 2024 (pp. 78-91). Springer Nature
Open this publication in new window or tab >>Context Matters: Understanding Socially Appropriate Affective Responses Via Sentence Embeddings
Show others...
2025 (English)In: Social Robotics - 16th International Conference, ICSR + AI 2024, Proceedings, Springer Nature , 2025, p. 78-91Conference paper, Published paper (Refereed)
Abstract [en]

As AI systems increasingly engage in social interactions, comprehending human social dynamics is crucial. Affect recognition enables systems to respond appropriately to emotional nuances in social situations. However, existing multimodal approaches lack accounting for the social appropriateness of detected emotions within their contexts. This paper presents a novel methodology leveraging sentence embeddings to distinguish socially appropriate and inappropriate interactions for more context-aware AI systems. Our approach measures the semantic distance between facial expression descriptions and predefined reference points. We evaluate our method using a benchmark dataset and a real-world robot deployment in a library, combining GPT-4(V) for expression descriptions and ada-2 for sentence embeddings to detect socially inappropriate interactions. Our results underscore the importance of considering contextual factors for effective social interaction understanding through context-aware affect recognition, contributing to the development of socially intelligent AI capable of interpreting and responding to human affect appropriately.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
embeddings, human-robot interaction, machine learning, Social representation
National Category
Sociology (Excluding Social Work, Social Anthropology, Demography and Criminology) Robotics and automation
Identifiers
urn:nbn:se:kth:diva-362501 (URN)10.1007/978-981-96-3522-1_9 (DOI)2-s2.0-105002016733 (Scopus ID)
Conference
16th International Conference on Social Robotics, ICSR + AI 2024, Odense, Denmark, October 23-26, 2024
Note

Part of ISBN 9789819635214

QC 20250428

Available from: 2025-04-16 Created: 2025-04-16 Last updated: 2025-08-25Bibliographically approved
Mohamed, Y., Séverin, L., Güneysu, A., Jensfelt, P. & Smith, C. (2025). Fusion in Context: A Multimodal Approach to Affective State Recognition. In: : . Paper presented at 34th IEEE International Conference on Robot and Human Interactive Communication, Eindhoven University of Technology, Eindhoven, The Netherlands, Aug 25-29, 2025..
Open this publication in new window or tab >>Fusion in Context: A Multimodal Approach to Affective State Recognition
Show others...
2025 (English)Conference paper, Published paper (Refereed)
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-369160 (URN)
Conference
34th IEEE International Conference on Robot and Human Interactive Communication, Eindhoven University of Technology, Eindhoven, The Netherlands, Aug 25-29, 2025.
Note

QC 20250905

Available from: 2025-08-29 Created: 2025-08-29 Last updated: 2025-09-05Bibliographically approved
Khanna, P., Naoum, A., Yadollahi, E., Björkman, M. & Smith, C. (2025). REFLEX Dataset: A Multimodal Dataset of Human Reactions to Robot Failures and Explanations. In: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction: . Paper presented at ACM/IEEE International Conference on Human-Robot Interaction, HRI, Melbourne, Australia, March 4-6, 2025 (pp. 1032-1036). IEEE
Open this publication in new window or tab >>REFLEX Dataset: A Multimodal Dataset of Human Reactions to Robot Failures and Explanations
Show others...
2025 (English)In: Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, IEEE , 2025, p. 1032-1036Conference paper, Published paper (Refereed)
Abstract [en]

This work presents REFLEX: Robotic Explanations to FaiLures and Human EXpressions, a comprehensive multimodal dataset capturing human reactions to robot failures and subsequent explanations in collaborative settings. It aims to facilitate research into human-robot interaction dynamics, addressing the need to study reactions to both initial failures and explanations, as well as the evolution of these reactions in long-term interactions. By providing rich, annotated data on human responses to different types of failures, explanation levels, and explanation varying strategies, the dataset contributes to the development of more robust, adaptive, and satisfying robotic systems capable of maintaining positive relationships with human collaborators, even during challenges like repeated failures

Place, publisher, year, edition, pages
IEEE, 2025
Keywords
Human Robot Interaction, Dataset, Robotic Failures, Explainable AI.
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:kth:diva-360946 (URN)10.5555/3721488.3721616 (DOI)
Conference
ACM/IEEE International Conference on Human-Robot Interaction, HRI, Melbourne, Australia, March 4-6, 2025
Note

QC 20250310

Available from: 2025-03-06 Created: 2025-03-06 Last updated: 2025-03-10Bibliographically approved
Khanna, P., Naoum, A., Yadollahi, E., Björkman, M. & Smith, C. (2025). REFLEX Dataset: A Multimodal Dataset of Human Reactions to Robot Failures and Explanations. In: HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction: . Paper presented at 20th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2025, Melbourne, Australia, March 4-6, 2025 (pp. 1032-1036). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>REFLEX Dataset: A Multimodal Dataset of Human Reactions to Robot Failures and Explanations
Show others...
2025 (English)In: HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 1032-1036Conference paper, Published paper (Refereed)
Abstract [en]

This work presents REFLEX: Robotic Explanations to FaiLures and Human EXpressions, a comprehensive multimodal dataset capturing human reactions to robot failures and subsequent explanations in collaborative settings. It aims to facilitate research into human-robot interaction dynamics, addressing the need to study reactions to both initial failures and explanations, as well as the evolution of these reactions in long-term interactions. By providing rich, annotated data on human responses to different types of failures, explanation levels, and explanation varying strategies, the dataset contributes to the development of more robust, adaptive, and satisfying robotic systems capable of maintaining positive relationships with human collaborators, even during challenges like repeated failures.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
Dataset, Explainable AI, Human Robot Interaction, Robotic Failures
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-363769 (URN)10.1109/HRI61500.2025.10974185 (DOI)2-s2.0-105004877597 (Scopus ID)
Conference
20th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2025, Melbourne, Australia, March 4-6, 2025
Note

Part of ISBN 9798350378931

QC 20250526

Available from: 2025-05-21 Created: 2025-05-21 Last updated: 2025-05-26Bibliographically approved
Styrud, J., Mayr, M., Hellsten, E., Krueger, V. & Smith, C. (2024). BeBOP - Combining Reactive Planning and Bayesian Optimization to Solve Robotic Manipulation Tasks. In: 2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024): . Paper presented at IEEE International Conference on Robotics and Automation (ICRA), MAY 13-17, 2024, Yokohama, JAPAN (pp. 16459-16466). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>BeBOP - Combining Reactive Planning and Bayesian Optimization to Solve Robotic Manipulation Tasks
Show others...
2024 (English)In: 2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 16459-16466Conference paper, Published paper (Refereed)
Abstract [en]

Robotic systems for manipulation tasks are increasingly expected to be easy to configure for new tasks. While in the past, robot programs were often written statically and tuned manually, the current, faster transition times call for robust, modular and interpretable solutions that also allow a robotic system to learn how to perform a task. We propose the method Behavior-based Bayesian Optimization and Planning (BeBOP) that combines two approaches for generating behavior trees: we build the structure using a reactive planner and learn specific parameters with Bayesian optimization. The method is evaluated on a set of robotic manipulation benchmarks and is shown to outperform state-of-the-art reinforcement learning algorithms by being up to 46 times faster while simultaneously being less dependent on reward shaping. We also propose a modification to the uncertainty estimate for the random forest surrogate models that drastically improves the results.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Series
IEEE International Conference on Robotics and Automation ICRA, ISSN 1050-4729
Keywords
Behavior Trees, Bayesian Optimization, Task Planning, Robotic manipulation
National Category
Robotics and automation
Identifiers
urn:nbn:se:kth:diva-360969 (URN)10.1109/ICRA57147.2024.10611468 (DOI)001369728005084 ()2-s2.0-85190848983 (Scopus ID)
Conference
IEEE International Conference on Robotics and Automation (ICRA), MAY 13-17, 2024, Yokohama, JAPAN
Note

Part of ISBN 979-8-3503-8458-1, 979-8-3503-8457-4

QC 20250310

Available from: 2025-03-10 Created: 2025-03-10 Last updated: 2025-03-10Bibliographically approved
Hallen, M., Iovino, M., Sander-Tavallaey, S. & Smith, C. (2024). Behavior Trees in Industrial Applications: A Case Study in Underground Explosive Charging. In: 2024 IEEE 20th International Conference on Automation Science and Engineering, CASE 2024: . Paper presented at 20th IEEE International Conference on Automation Science and Engineering, CASE 2024, Bari, Italy, Aug 28 2024 - Sep 1 2024 (pp. 156-162). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Behavior Trees in Industrial Applications: A Case Study in Underground Explosive Charging
2024 (English)In: 2024 IEEE 20th International Conference on Automation Science and Engineering, CASE 2024, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 156-162Conference paper, Published paper (Refereed)
Abstract [en]

In industrial applications Finite State Machines (FSMs) are often used to implement decision making policies for autonomous systems. In recent years, the use of Behavior Trees (BT) as an alternative policy representation has gained considerable attention. The benefits of using BTs over FSMs are modularity and reusability, enabling a system that is easy to extend and modify. However, there exists few published studies on successful implementations of BTs for industrial applications. This paper contributes with the lessons learned from implementing BTs in a complex industrial use case, where a robotic system assembles explosive charges and places them in holes on the rock face. The main result of the paper is that even if it is possible to model the entire system as a BT, combining BTs with FSMs can increase the readability and maintainability of the system. The benefit of such combination is remarked especially in the use case studied in this paper, where the full system cannot run autonomously but human supervision and feedback are needed.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
behavior Trees, Behavior Trees in Robotics Applications, Finite State Machines, Modularity
National Category
Robotics and automation
Identifiers
urn:nbn:se:kth:diva-367272 (URN)10.1109/CASE59546.2024.10711822 (DOI)001361783100023 ()2-s2.0-85206356835 (Scopus ID)
Conference
20th IEEE International Conference on Automation Science and Engineering, CASE 2024, Bari, Italy, Aug 28 2024 - Sep 1 2024
Note

Part of ISBN 9798350358513

QC 20250717

Available from: 2025-07-17 Created: 2025-07-17 Last updated: 2025-07-17Bibliographically approved
Khanna, P., Fredberg, J., Björkman, M., Smith, C. & Linard, A. (2024). Hand it to me formally! Data-driven control for human-robot handovers with signal temporal logic. IEEE Robotics and Automation Letters, 9(10), 9039-9046
Open this publication in new window or tab >>Hand it to me formally! Data-driven control for human-robot handovers with signal temporal logic
Show others...
2024 (English)In: IEEE Robotics and Automation Letters, E-ISSN 2377-3766, Vol. 9, no 10, p. 9039-9046Article in journal (Refereed) Published
Abstract [en]

To facilitate human-robot interaction (HRI), we aim for robot behavior that is efficient, transparent, and closely resembles human actions. Signal Temporal Logic (STL) is a formal language that enables the specification and verification of complex temporal properties in robotic systems, helping to ensure their correctness. STL can be used to generate explainable robot behaviour, the degree of satisfaction of which can be quantified by checking its STL robustness. In this letter, we use data-driven STL inference techniques to model human behavior in human-human interactions, on a handover dataset. We then use the learned model to generate robot behavior in human-robot interactions. We present a handover planner based on inferred STL specifications to command robotic motion in human-robot handovers. We also validate our method in a human-to-robot handover experiment.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Handover, Robots, Robot kinematics, Behavioral sciences, Trajectory, Logic, Robustness, Human-robot handovers, Signal Temporal Logic (STL)
National Category
Robotics and automation Control Engineering
Identifiers
urn:nbn:se:kth:diva-354524 (URN)10.1109/LRA.2024.3447476 (DOI)001316210300020 ()2-s2.0-85201769650 (Scopus ID)
Note

QC 20241011

Available from: 2024-10-11 Created: 2024-10-11 Last updated: 2025-02-05Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-2078-8854

Search in DiVA

Show all publications