Is it really worth it?: Cost-based selection of system responses to speech-in-overlap
2012 (English)In: Proc. of the IVA 2012 workshop on Realtime Conversational Virtual Agents (RCVA 2012), Santa Crux, CA, USA, 2012Conference paper (Refereed)
For purposes of discussion and feedback, we present a preliminary version of a simple yet powerful cost-based framework for spoken dialogue sys-tems to continuously and incrementally decide whether to speak or not. The framework weighs the cost of producing speech in overlap against the cost of not speaking when something needs saying. Main features include a small number of parameters controlling characteristics that are readily understood, al-lowing manual tweaking as well as interpretation of trained parameter settings; observation-based estimates of expected overlap which can be adapted dynami-cally; and a simple and general method for context dependency. No evaluation has yet been undertaken, but the effects of the parameters; the observation-based cost of expected overlap trained on Switchboard data; and the context de-pendency using inter-speaker intensity differences from the same corpus are demonstrated with generated input data in the context of user barge-ins.
Place, publisher, year, edition, pages
Santa Crux, CA, USA, 2012.
Computer Science Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:kth:diva-107009OAI: oai:DiVA.org:kth-107009DiVA: diva2:574350
the IVA 2012 workshop on Realtime Conversational Virtual Agents (RCVA 2012)
tmh_import_12_12_05, tmh_id_3780. QC 201212172012-12-052012-12-052012-12-17Bibliographically approved