Utterance segmentation and turn-taking in spoken dialogue systems
2005 (English)In: Computer Studies in Language and Speech / [ed] Fisseni, B.; Schmitz, H-C.; Schröder, B.; Wagner, P., Frankfurt am Main, Germany: Peter Lang , 2005, 576-587 p.Chapter in book (Refereed)
A widely used method for finding places to take turn in spoken dialogue systems is to assume that an utterance ends where the user ceases to speak. Such endpoint detection normally triggers on a certain amount of silence, or non-speech. However, spontaneous speech frequently contains silent pauses inside sentencelike units, for example when the speaker hesitates. This paper presents /nailon/, an on-line, real-time prosodic analysis tool, and a number of experiments in which end-point detection has been augmented with prosodic analysis in order to segment the speech signal into what humans intuitively perceive as utterance-like units.
Place, publisher, year, edition, pages
Frankfurt am Main, Germany: Peter Lang , 2005. 576-587 p.
Computer Science Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:kth:diva-51857ISBN: 3-631-53874-XOAI: oai:DiVA.org:kth-51857DiVA: diva2:465150
QC 20120111. tmh_import_11_12_142011-12-142011-12-142012-01-11Bibliographically approved