kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A framework for integrating gesture generation models into interactive conversational agents
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0002-9653-6699
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0001-9838-8848
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0003-2428-0468
Show others and affiliations
2021 (English)In: Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS) , 2021, p. 1767-1769Conference paper, Published paper (Refereed)
Abstract [en]

Embodied conversational agents (ECAs) benefit from non-verbal behavior for natural and efficient interaction with users. Gesticulation - hand and arm movements accompanying speech - is an essential part of non-verbal behavior. Gesture generation models have been developed for several decades: starting with rule-based and ending with mainly data-driven methods. To date, recent end to- end gesture generation methods have not been evaluated in a real-time interaction with users. We present a proof-of-concept framework, which is intended to facilitate evaluation of modern gesture generation models in interaction. We demonstrate an extensible open-source framework that contains three components: 1) a 3D interactive agent; 2) a chatbot backend; 3) a gesticulating system. Each component can be replaced, making the proposed framework applicable for investigating the effect of different gesturing models in real-time interactions with different communication modalities, chatbot backends, or different agent appearances. The code and video are available at the project page https://nagyrajmund.github.io/project/gesturebot. 

Place, publisher, year, edition, pages
International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS) , 2021. p. 1767-1769
Keywords [en]
Conversational embodied agents, Non-verbal behavior synthesis, Multi agent systems, Open systems, Speech, Communication modalities, Conversational agents, Data-driven methods, Efficient interaction, Embodied conversational agent, Interactive agents, Open source frameworks, Real time interactions, Autonomous agents
National Category
Human Computer Interaction Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-311130Scopus ID: 2-s2.0-85112311041OAI: oai:DiVA.org:kth-311130DiVA, id: diva2:1653872
Conference
20th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2021, 3 May 2021 through 7 May 2021
Note

Part of proceedings: ISBN 978-1-7138-3262-1

QC 20220425

Available from: 2022-04-25 Created: 2022-04-25 Last updated: 2023-01-17Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Authority records

Nagy, RajmundKucherenko, TarasMoell, BirgerAbelho Pereira, André TiagoKjellström, Hedvig

Search in DiVA

By author/editor
Nagy, RajmundKucherenko, TarasMoell, BirgerAbelho Pereira, André TiagoKjellström, Hedvig
By organisation
Speech, Music and Hearing, TMHRobotics, Perception and Learning, RPL
Human Computer InteractionComputer Sciences

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 54 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf