The additive effect of turn-taking cues in human and synthetic voice
2011 (English)In: Speech Communication, ISSN 0167-6393, Vol. 53, no 1, 23-35 p.Article in journal (Refereed) Published
A previous line of research suggests that interlocutors identify appropriate places to speak by cues in the behaviour of the preceding speaker. If used in combination, these cues have an additive effect on listeners' turn-taking attempts. The present study further explores these findings by examining the effect of such turn-taking cues experimentally. The objective is to investigate the possibilities of generating turn-taking cues with a synthetic voice. Thus, in addition to stimuli realized with a human voice, the experiment included dialogues where one of the speakers is replaced with a synthesis. The turn-taking cues investigated include intonation, phrase-final lengthening, semantic completeness, stereotyped lexical expressions and non-lexical speech production phenomena such as lexical repetitions, breathing and lip-smacks. The results show that the turn-taking cues realized with a synthetic voice affect the judgements similar to the corresponding human version and there is no difference in reaction times between these two conditions. Furthermore, the results support Duncan's findings: the more turn-taking cues with the same pragmatic function, turn-yielding or turn-holding, the higher the agreement among subjects on the expected outcome. In addition, the number of turn-taking cues affects the reaction times for these decisions. Thus, the more cues, the faster the reaction time.
Place, publisher, year, edition, pages
2011. Vol. 53, no 1, 23-35 p.
Turn-taking, Speech synthesis, Human-like interaction, Conversational interfaces
Other Computer and Information Science
IdentifiersURN: urn:nbn:se:kth:diva-30498DOI: 10.1016/j.specom.2010.08.003ISI: 000285663700003ScopusID: 2-s2.0-78649334679OAI: oai:DiVA.org:kth-30498DiVA: diva2:404178
QC 201103162011-03-162011-02-282011-03-16Bibliographically approved