Change search
ReferencesLink to record
Permanent link

Direct link
An information-theoretic framework for automated discovery of prosodic cues to conversational structure
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.ORCID iD: 0000-0003-3585-8077
2015 (English)In: ICASSP, IEEE conference proceedings, 2015Conference paper (Refereed)
Abstract [en]

Interaction timing in conversation exhibits myriad variabilities, yet it is patently not random. However, identifying consistencies is a manually labor-intensive effort, and findings have been limited. We propose a conditonal mutual information measure of the influence of prosodic features, which can be computed for any conversation at any instant, with only a speech/non-speech segmentation as its requirement. We evaluate the methodology on two segmental features: energy and speaking rate. Results indicate that energy, the less controversial of the two, is in fact better on average at predicting conversational structure. We also explore the temporal evolution of model 'surprise', which permits identifying instants where each feature's influence is operative. The method corroborates earlier findings, and appears capable of large-scale data-driven discovery in future research.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2015.
, Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, ISSN 1520-6149
National Category
Computer Science Language Technology (Computational Linguistics)
URN: urn:nbn:se:kth:diva-180401DOI: 10.1109/ICASSP.2015.7178998ScopusID: 2-s2.0-84946040439ISBN: 978-146736997-8OAI: diva2:893735

QC 20160303

Available from: 2016-01-13 Created: 2016-01-13 Last updated: 2016-03-03Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Hjalmarsson, Anna
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 12 hits
ReferencesLink to record
Permanent link

Direct link