kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
LLM-Driven Augmented Reality Puppeteer: Controller-Free Voice-Commanded Robot Teleoperation
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0003-1804-6296
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0003-3827-3824
KTH, School of Electrical Engineering and Computer Science (EECS).
Show others and affiliations
2025 (English)In: Social Computing and Social Media - 17th International Conference, SCSM 2025, Held as Part of the 27th HCI International Conference, HCII 2025, Proceedings, Springer Nature , 2025, p. 97-112Conference paper, Published paper (Refereed)
Abstract [en]

The integration of robotics and augmented reality (AR) presents transformative opportunities for advancing human-robot interaction (HRI) by improving usability, intuitiveness, and accessibility. This work introduces a controller-free, LLM-driven voice-commanded AR puppeteering system, enabling users to teleoperate a robot by manipulating its virtual counterpart in real-time. By leveraging natural language processing (NLP) and AR technologies, our system—prototyped using Meta Quest 3—eliminates the need for physical controllers, enhancing ease of use while minimizing potential safety risks associated with direct robot operation. A preliminary user demonstration successfully validated the system’s functionality, demonstrating its potential for safer, more intuitive, and immersive robotic control.

Place, publisher, year, edition, pages
Springer Nature , 2025. p. 97-112
Keywords [en]
AR puppeteer, Controller-free, LLM-driven
National Category
Robotics and automation Other Engineering and Technologies
Identifiers
URN: urn:nbn:se:kth:diva-364402DOI: 10.1007/978-3-031-93539-8_7ISI: 001551225200007Scopus ID: 2-s2.0-105007131127OAI: oai:DiVA.org:kth-364402DiVA, id: diva2:1968216
Conference
17th International Conference on Social Computing and Social Media, SCSM 2025, held as part of the 27th HCI International Conference, HCII 2025, Gothenburg, Sweden, Jun 22 2025 - Jun 27 2025
Note

 Part of ISBN 9783031935381

QC 20250613

Available from: 2025-06-12 Created: 2025-06-12 Last updated: 2025-12-08Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Zhang, YuchongOrthmann, BastianWelle, Michael C.van Haastregt, JonneKragic Jensfelt, Danica

Search in DiVA

By author/editor
Zhang, YuchongOrthmann, BastianWelle, Michael C.van Haastregt, JonneKragic Jensfelt, Danica
By organisation
Robotics, Perception and Learning, RPLSchool of Electrical Engineering and Computer Science (EECS)
Robotics and automationOther Engineering and Technologies

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 169 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf