Open this publication in new window or tab >>2024 (English)In: 2024 33rd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 2024Conference paper, Published paper (Refereed)
Abstract [en]
A crucial part of traditional reinforcement learning (RL) is the initial exploration phase, in which trying available actions randomly is a critical element. As random behavior might be detrimental to a social interaction, this work proposes a novel paradigm for learning social robot behavior--the use of shielding to ensure socially appropriate behavior during exploration and learning. We explore how a data-driven approach for shielding could be used to generate listening behavior. In a video-based user study (N=110), we compare shielded exploration to two other exploration methods. We show that the shielded exploration is perceived as more comforting and appropriate than a straightforward random approach. Based on our findings, we discuss the potential for future work using shielded and socially guided approaches for learning idiosyncratic social robot behaviors through RL.
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-350432 (URN)
Conference
2024 33rd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), Pasadena, California, USA August 26th-30th, 2024
Note
Paper will be published later this year (accepted camera-ready version available).
QC 20240717
2024-07-112024-07-112025-02-07Bibliographically approved