Open this publication in new window or tab >>Show others...
2020 (English)In: ICMI '20: Proceedings of the 2020 International Conference on Multimodal Interaction, Association for Computing Machinery (ACM) , 2020Conference paper, Published paper (Refereed)
Abstract [en]
During speech, people spontaneously gesticulate, which plays akey role in conveying information. Similarly, realistic co-speechgestures are crucial to enable natural and smooth interactions withsocial agents. Current end-to-end co-speech gesture generationsystems use a single modality for representing speech: either au-dio or text. These systems are therefore confined to producingeither acoustically-linked beat gestures or semantically-linked ges-ticulation (e.g., raising a hand when saying “high”): they cannotappropriately learn to generate both gesture types. We present amodel designed to produce arbitrary beat and semantic gesturestogether. Our deep-learning based model takes both acoustic andsemantic representations of speech as input, and generates gesturesas a sequence of joint angle rotations as output. The resulting ges-tures can be applied to both virtual agents and humanoid robots.Subjective and objective evaluations confirm the success of ourapproach. The code and video are available at the project page svito-zar.github.io/gesticula
Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2020
Keywords
Gesture generation; virtual agents; socially intelligent systems; co-speech gestures; multi-modal interaction; deep learning
National Category
Human Computer Interaction
Research subject
Computer Science
Identifiers
urn:nbn:se:kth:diva-286282 (URN)10.1145/3382507.3418815 (DOI)001437992100029 ()2-s2.0-85096710861 (Scopus ID)
Conference
ICMI '20: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION Virtual Event Netherlands October 25 - 29, 2020
Projects
EACare
Funder
Swedish Foundation for Strategic Research , RIT15-0107
Note
ICMI 2020 Best Paper Award
Part of Proceedings: ISBN 978-1-4503-7581-8
QC 20211109
2020-11-242020-11-242025-12-05Bibliographically approved