Expression of Affect in Spontaneous Speech: Acoustic Correlates and Automatic Detection of Irritation and ResignationShow others and affiliations
2011 (English)In: Computer speech & language (Print), ISSN 0885-2308, E-ISSN 1095-8363, Vol. 25, no 1, p. 84-104Article in journal (Refereed) Published
Abstract [en]
The majority of previous studies on vocal expression have been conducted on posed expressions. In contrast, we utilized a large corpus of authentic affective speech recorded from real-life voice controlled telephone services. Listeners rated a selection of 200 utterances from this corpus with regard to level of perceived irritation, resignation, neutrality, and emotion intensity. The selected utterances came from 64 different speakers who each provided both neutral and affective stimuli. All utterances were further automatically analyzed regarding a comprehensive set of acoustic measures related to F0, intensity, formants, voice source, and temporal characteristics of speech. Results first showed that several significant acoustic differences were found between utterances classified as neutral and utterances classified as irritated or resigned using a within-persons design. Second, listeners' ratings on each scale were associated with several acoustic measures. In general the acoustic correlates of irritation, resignation, and emotion intensity were similar to previous findings obtained with posed expressions, though the effect sizes were smaller for the authentic expressions. Third, automatic classification (using LDA classifiers both with and without speaker adaptation) of irritation, resignation, and neutral performed at a level comparable to human performance, though human listeners and machines did not necessarily classify individual utterances similarly. Fourth, clearly perceived exemplars of irritation and resignation were rare in our corpus. These findings were discussed in relation to future research.
Place, publisher, year, edition, pages
2011. Vol. 25, no 1, p. 84-104
Keywords [en]
Acoustic features, Automatic speech classification, Emotion recognition, Human-computer interaction, Spontaneous speech
National Category
Computer Sciences Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-52044DOI: 10.1016/j.csl.2010.03.004ISI: 000282563500006Scopus ID: 2-s2.0-77955421904OAI: oai:DiVA.org:kth-52044DiVA, id: diva2:465338
Funder
Swedish Research Council
Note
tmh_import_11_12_14 QC 20111220
2011-12-142011-12-142024-03-18Bibliographically approved