Towards incremental speech generation in conversational systems
2013 (English)In: Computer speech & language (Print), ISSN 0885-2308, E-ISSN 1095-8363, Vol. 27, no 1, 243-262 p.Article in journal (Refereed) Published
This paper presents a model of incremental speech generation in practical conversational systems. The model allows a conversational system to incrementally interpret spoken input, while simultaneously planning, realising and self-monitoring the system response. If these processes are time consuming and result in a response delay, the system can automatically produce hesitations to retain the floor. While speaking, the system utilises hidden and overt self-corrections to accommodate revisions in the system. The model has been implemented in a general dialogue system framework. Using this framework, we have implemented a conversational game application. A Wizard-of-Oz experiment is presented, where the automatic speech recognizer is replaced by a Wizard who transcribes the spoken input. In this setting, the incremental model allows the system to start speaking while the user's utterance is being transcribed. In comparison to a non-incremental version of the same system, the incremental version has a shorter response time and is perceived as more efficient by the users.
Place, publisher, year, edition, pages
2013. Vol. 27, no 1, 243-262 p.
Conversational systems, Incremental processing, Speech generation, Wizard-of-Oz
Computer Science Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:kth:diva-107012DOI: 10.1016/j.csl.2012.05.004ISI: 000311524800014ScopusID: 2-s2.0-84867329282OAI: oai:DiVA.org:kth-107012DiVA: diva2:574362
FunderSwedish Research Council, 2011-6237 2011-6152ICT - The Next Generation
QC 201301092012-12-052012-12-052013-04-11Bibliographically approved