Human interaction as a model for spoken dialogue system behaviour
2010 (English)Doctoral thesis, monograph (Other academic)
This thesis is a step towards the long-term and high-reaching objec-tive of building dialogue systems whose behaviour is similar to a human dialogue partner. The aim is not to build a machine with the same conversational skills as a human being, but rather to build a machine that is human enough to encourage users to interact with it accordingly. The behaviours in focus are cue phrases, hesitations and turn-taking cues. These behaviours serve several important communicative functions such as providing feedback and managing turn-taking. Thus, if dialogue systems could use interactional cues similar to those of humans, these systems could be more intuitive to talk to. A major part of this work has been to collect, identify and analyze the target behaviours in human-human interaction in order to gain a better understanding of these phenomena. Another part has been to reproduce these behaviours in a dialogue system context and explore listeners’ perceptions of these phenomena in empirical experiments.
The thesis is divided into two parts. The first part serves as an overall background. The issues and motivations of humanlike dialogue systems are discussed. This part also includes an overview of research on human language production and spoken language generation in dialogue systems.
The next part presents the data collections, data analyses and empirical experiments that this thesis is concerned with. The first study presented is a listening test that explores human behaviour as a model for dialogue systems. The results show that a version based on human behaviour is rated as more humanlike, polite and intelligent than a constrained version with less variability. Next, the DEAL dia-logue system is introduced. DEAL is used as a platform for the re-search presented in this thesis. The domain of the system is a trade domain and the target audience are second language learners of Swedish who want to practice conversation. Furthermore, a data collection of human-human dialogues in the DEAL domain is presented. Analyses of cue phrases in these data are provided as well as an experimental study of turn-taking cues. The results from the turn-taking experiment indicate that turn-taking cues realized with a di-phone synthesis affect the expectations of a turn change similar to the corresponding human version.
Finally, an experimental study that explores the use of talkspurtinitial cue phrases in an incremental version of DEAL is presented. The results show that the incremental version had shorter response times and was rated as more efficient, more polite and better at indicating when to speak than a non-incremental implementation of the same system.
Place, publisher, year, edition, pages
Stockholm: KTH , 2010. , xi, 226 p.
Trita-CSC-A, ISSN 1653-5723 ; 2010:10
Fluid Mechanics and Acoustics Communication Studies
IdentifiersURN: urn:nbn:se:kth:diva-24258ISBN: 978-91-7415-703-1OAI: oai:DiVA.org:kth-24258DiVA: diva2:346096
2010-09-03, Sal F3, Lindstedtsvägen 26, KTH, Stockholm, 10:00 (English)
Schlangen, David, Associate Professor
Carlson, Rolf, Professor