Human interaction as a model for spoken dialogue system behaviour
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology. ORCID iD: 0000-0003-3585-8077
2010 (English). Doctoral thesis, monograph (Other academic)
Abstract [en]

This thesis is a step towards the long-term and high-reaching objective of building dialogue systems whose behaviour is similar to that of a human dialogue partner. The aim is not to build a machine with the same conversational skills as a human being, but rather to build a machine that is human enough to encourage users to interact with it accordingly. The behaviours in focus are cue phrases, hesitations and turn-taking cues. These behaviours serve several important communicative functions, such as providing feedback and managing turn-taking. Thus, if dialogue systems could use interactional cues similar to those of humans, these systems could be more intuitive to talk to. A major part of this work has been to collect, identify and analyze the target behaviours in human-human interaction in order to gain a better understanding of these phenomena. Another part has been to reproduce these behaviours in a dialogue system context and explore listeners’ perceptions of these phenomena in empirical experiments.

The thesis is divided into two parts. The first part serves as an overall background. The issues and motivations of humanlike dialogue systems are discussed. This part also includes an overview of research on human language production and spoken language generation in dialogue systems.

The next part presents the data collections, data analyses and empirical experiments that this thesis is concerned with. The first study presented is a listening test that explores human behaviour as a model for dialogue systems. The results show that a version based on human behaviour is rated as more humanlike, polite and intelligent than a constrained version with less variability. Next, the DEAL dialogue system is introduced. DEAL is used as a platform for the research presented in this thesis. The system operates in a trade domain, and the target audience is second language learners of Swedish who want to practice conversation. Furthermore, a data collection of human-human dialogues in the DEAL domain is presented. Analyses of cue phrases in these data are provided, as well as an experimental study of turn-taking cues. The results from the turn-taking experiment indicate that turn-taking cues realized with a diphone synthesis affect the expectations of a turn change in a way similar to the corresponding human version.

Finally, an experimental study that explores the use of talkspurt-initial cue phrases in an incremental version of DEAL is presented. The results show that the incremental version had shorter response times and was rated as more efficient, more polite and better at indicating when to speak than a non-incremental implementation of the same system.

Place, publisher, year, edition, pages
Stockholm: KTH, 2010. xi, 226 p.
Trita-CSC-A, ISSN 1653-5723 ; 2010:10
National Category
Fluid Mechanics and Acoustics; Communication Studies
URN: urn:nbn:se:kth:diva-24258
ISBN: 978-91-7415-703-1
OAI: diva2:346096
Public defence
2010-09-03, Sal F3, Lindstedtsvägen 26, KTH, Stockholm, 10:00 (English)
QC20100830
Available from: 2010-08-30. Created: 2010-08-30. Last updated: 2010-08-31. Bibliographically approved.

Open Access in DiVA

fulltext (3080 kB), 591 downloads
File information
File name: FULLTEXT02.pdf
File size: 3080 kB
Checksum: SHA-512
Type: fulltext
Mimetype: application/pdf

By author/editor
Hjalmarsson, Anna