kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Users and Wizards in Conversations: How WoZ Interface Choices Define Human-Robot Interactions
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0001-5066-7186
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0009-0006-2058-0112
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0003-2428-0468
2025 (English)In: Proceedings of Robotics: Science and Systems / [ed] Luca Carlone; Dana Kulic; Gentiane Venture; Jared Strader, Robotics: Science and Systems Foundation , 2025Conference paper, Published paper (Refereed)
Abstract [en]

In this paper, we investigated how the choice of a Wizard-of-Oz (WoZ) interface affects communication with a robot from both the user's and the wizard's perspective. In a conversational setting, we used three WoZ interfaces with varying levels of dialogue input and output restrictions: a) a restricted perception GUI that showed fixed-view video and ASR transcripts and let the wizard trigger pre-scripted utterances and gestures; b) an unrestricted perception GUI that added real-time audio from the participant and the robot c) a VR telepresence interface that streamed immersive stereo video and audio to the wizard and forwarded the wizard's spontaneous speech, gaze and facial expressions to the robot. We found that the interaction mediated by the VR interface was preferred by users in terms of robot features and perceived social presence. For the wizards, the VR condition turned out to be the most demanding but elicited a higher social connection with the users. VR interface also induced the most connected interaction in terms of inter-speaker gaps and overlaps, while Restricted GUI induced the least connected flow and the largest silences. Given these results, we argue for more WoZ studies using telepresence interfaces. These studies better reflect the robots of tomorrow and offer a promising path to automation based on naturalistic contextualized verbal and non-verbal behavioral data. 

Place, publisher, year, edition, pages
Robotics: Science and Systems Foundation , 2025.
Keywords [en]
VR, Wizard-of-Oz, teleoperation, social robotics
National Category
Robotics and automation Human Computer Interaction
Research subject
Human-computer Interaction
Identifiers
URN: urn:nbn:se:kth:diva-379050DOI: 10.15607/RSS.2025.XXI.085OAI: oai:DiVA.org:kth-379050DiVA, id: diva2:2051029
Conference
Robotics: Science and Systems XXI, University of Southern California, Los Angeles, CA, USA, June 21-25, 2025
Note

Part of ISBN 9798990284814

QC 20260415

Available from: 2026-04-07 Created: 2026-04-07 Last updated: 2026-04-15Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Torubarova, EkaterinaMiniotaitė, JūraAbelho Pereira, André Tiago

Search in DiVA

By author/editor
Torubarova, EkaterinaMiniotaitė, JūraAbelho Pereira, André Tiago
By organisation
Speech, Music and Hearing, TMH
Robotics and automationHuman Computer Interaction

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 26 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf