Utterance segmentation and turn-taking in spoken dialogue systems
2005 (English)In: Computer Studies in Language and Speech / [ed] Fisseni, B.; Schmitz, H-C.; Schröder, B.; Wagner, P., Frankfurt am Main, Germany: Peter Lang , 2005, p. 576-587Chapter in book (Refereed)
Abstract [en]
A widely used method for finding places to take turn in spoken dialogue systems is to assume that an utterance ends where the user ceases to speak. Such endpoint detection normally triggers on a certain amount of silence, or non-speech. However, spontaneous speech frequently contains silent pauses inside sentencelike units, for example when the speaker hesitates. This paper presents /nailon/, an on-line, real-time prosodic analysis tool, and a number of experiments in which end-point detection has been augmented with prosodic analysis in order to segment the speech signal into what humans intuitively perceive as utterance-like units.
Place, publisher, year, edition, pages
Frankfurt am Main, Germany: Peter Lang , 2005. p. 576-587
National Category
Computer Sciences Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-51857ISBN: 3-631-53874-X (print)OAI: oai:DiVA.org:kth-51857DiVA, id: diva2:465150
Note
QC 20120111. tmh_import_11_12_14
2011-12-142011-12-142022-06-24Bibliographically approved