Automatic Recognition of Anger in Spontaneous Speech
2008 (English). In: INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC, 2008, p. 2755-2758
Conference paper, Published paper (Refereed)
Abstract [en]
Automatic detection of real-life negative emotions in speech has been evaluated using two classifiers: Linear Discriminant Analysis (LDA) with "classic" emotion features, and a classifier based on Gaussian Mixture Models (GMMs). The latter uses Mel-Frequency Cepstral Coefficients (MFCCs) from a filter bank covering the 300-3400 Hz region to capture spectral shape and formants, and from another filter bank covering the 20-600 Hz region to capture prosody. Both classifiers have been tested on an extensive corpus from Swedish voice-controlled telephone services. The results indicate that anger can be detected in natural speech with reasonable accuracy (average recall 83%) and that the GMM method performed better than the LDA one.
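The abstract reports its result as average recall. The record does not define the metric, so the following is only a minimal sketch under the assumption that it means the unweighted mean of the per-class recalls. The function name `average_recall` and the label lists are hypothetical and invented for illustration; they are not data from the paper.

```python
from collections import defaultdict


def average_recall(true_labels, predicted_labels):
    """Unweighted mean of per-class recall (assumed meaning of 'average recall')."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")

    totals = defaultdict(int)  # number of utterances per true class
    hits = defaultdict(int)    # correctly classified utterances per true class
    for truth, guess in zip(true_labels, predicted_labels):
        totals[truth] += 1
        if truth == guess:
            hits[truth] += 1

    # Recall of a class = correct predictions for it / its true occurrences.
    recalls = [hits[label] / totals[label] for label in totals]
    return sum(recalls) / len(recalls)


# Invented toy labels: 4 angry and 16 neutral utterances.
truth = ["anger"] * 4 + ["neutral"] * 16
# Hypothetical classifier output: finds 2 of the 4 angry utterances,
# and labels every neutral utterance correctly.
guess = ["anger"] * 2 + ["neutral"] * 2 + ["neutral"] * 16

score = average_recall(truth, guess)
accuracy = sum(t == g for t, g in zip(truth, guess)) / len(truth)
print(f"average recall: {score:.2f}, accuracy: {accuracy:.2f}")
```

On this toy data the plain accuracy is higher than the average recall, because the rarer anger class counts as much as the neutral class in the average. That property is one reason such a metric suits corpora where angry utterances are a minority.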
Place, publisher, year, edition, pages
BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC, 2008. p. 2755-2758
Keywords [en]
spontaneous speech, natural emotions, anger
National Category
Computer and Information Sciences Communication Studies
Identifiers
URN: urn:nbn:se:kth:diva-29858
ISI: 000277026101268
Scopus ID: 2-s2.0-84867218213
ISBN: 978-1-61567-378-0 (print)
OAI: oai:DiVA.org:kth-29858
DiVA, id: diva2:399555
Conference
9th Annual Conference of the International Speech Communication Association (INTERSPEECH 2008), Brisbane, Australia, September 22-26, 2008
Note
QC 20110222
2011-02-22, 2011-02-17, 2022-06-25. Bibliographically approved