Change search
ReferencesLink to record
Permanent link

Direct link
Towards rich multimodal behavior in spoken dialogues with embodied agents
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
2013 (English)In: 4th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2013 - Proceedings, IEEE Computer Society, 2013, 817-822 p.Conference paper (Refereed)
Abstract [en]

Spoken dialogue frameworks have traditionally been designed to handle a single stream of data - the speech signal. Research on human-human communication has been providing large evidence and quantifying the effects and the importance of a multitude of other multimodal nonverbal signals that people use in their communication, that shape and regulate their interaction. Driven by findings from multimodal human spoken interaction, and the advancements of capture devices and robotics and animation technologies, new possibilities are rising for the development of multimodal human-machine interaction that is more affective, social, and engaging. In such face-to-face interaction scenarios, dialogue systems can have a large set of signals at their disposal to infer context and enhance and regulate the interaction through the generation of verbal and nonverbal facial signals. This paper summarizes several design decision, and experiments that we have followed in attempts to build rich and fluent multimodal interactive systems using a newly developed hybrid robotic head called Furhat, and discuss issues and challenges that this effort is facing.

Place, publisher, year, edition, pages
IEEE Computer Society, 2013. 817-822 p.
Keyword [en]
Dialogue Systems, Facial Synthesis, Furhat Robot, Multimodal communication, Social Robots
National Category
Computer Science Language Technology (Computational Linguistics)
URN: urn:nbn:se:kth:diva-137415DOI: 10.1109/CogInfoCom.2013.6719212ISI: 000349770000140ScopusID: 2-s2.0-84894193853ISBN: 978-1-4799-1543-9OAI: diva2:678887
4th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2013; Budapest; Hungary; 2 December 2013 through 5 December 2013

QC 20140603

Available from: 2013-12-13 Created: 2013-12-13 Last updated: 2015-12-07Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Al Moubayed, Samer
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 206 hits
ReferencesLink to record
Permanent link

Direct link