Change search
ReferencesLink to record
Permanent link

Direct link
A single-port non-parametric model of turn-taking in multi-party conversation
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.ORCID iD: 0000-0001-9327-9482
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
2011 (English)In: Proc. of ICASSP 2011, Prague, Czech Republic, 2011, 5600-5603 p.Conference paper (Refereed)
Abstract [en]

The taking of turns to speak is an intrinsic property of conversation. It is therefore expected that models of turn-taking, providing a prior distribution over conversational form, can usefully reduce the perplexity of what is observed and processed in real-time spoken dialogue systems. We propose a conversation-independent single-port model of multi-party turn-taking, one which allows conversants to undertake independent actions but to condition them on the past behavior of all participants. The model is shown to generally out perform an existing multi-port model on a measure of perplexity over subsequently observed speech activity. We quantify the effect of history truncation and the success of predicting distant conversational futures, and argue that the framework is sufficiently accessible and has significant potential to usefully inform thedesignandbehaviorofspokendialoguesystems.

Place, publisher, year, edition, pages
Prague, Czech Republic, 2011. 5600-5603 p.
, International Conference on Acoustics Speech and Signal Processing ICASSP, ISSN 1520-6149
National Category
Computer Science Language Technology (Computational Linguistics)
URN: urn:nbn:se:kth:diva-52185DOI: 10.1109/ICASSP.2011.5947629ISI: 000296062406077ScopusID: 2-s2.0-80051653954OAI: diva2:465482
tmh_import_11_12_14. QC 20111228Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2011-12-28Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Edlund, JensHeldner, Mattias
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 10 hits
ReferencesLink to record
Permanent link

Direct link