Timing and entrainment of multimodal backchanneling behavior for an embodied conversational agent
2013 (English), Conference paper (Refereed), Text
We report on an analysis of feedback behavior in an Active Listening Corpus as produced verbally, visually (head movement) and bimodally. The behavior is modeled in an embodied conversational agent and shown, in conversation with a real human, to human participants for perceptual evaluation. Five strategies for the timing of backchannels are compared: copying the timing of the original human listener, producing backchannels at randomly selected times, producing backchannels according to high-level timing distributions relative to the interlocutor's utterance and pauses, producing them according to local entrainment to the interlocutor's vowels, or producing them according to both. Human observers judge that models with global timing distributions miss fewer opportunities for backchanneling than random timing does.
Place, publisher, year, edition, pages
ACM SIGCHI, 2013.
Proceedings of the 15th International Conference on Multimodal Interaction, ICMI’13 - Sydney
Identifiers
URN: urn:nbn:se:kth:diva-185484
DOI: 10.1145/2522848.2522890
ScopusID: 2-s2.0-84892585468
ISBN: 978-145032129-7
OAI: oai:DiVA.org:kth-185484
DiVA: diva2:921092
QC 20160428
2016-04-19, 2016-04-19, 2016-04-28
Bibliographically approved