kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Björkner, Eva
Publications (3 of 3) Show all publications
Sundberg, J., Patel, S., Björkner, E. & Scherer, K. (2011). Interdependencies among voice source parameters in emotional speech. IEEE Transactions on Affective Computing, 2(3), 162-174
Open this publication in new window or tab >>Interdependencies among voice source parameters in emotional speech
2011 (English)In: IEEE Transactions on Affective Computing, ISSN 1949-3045, Vol. 2, no 3, p. 162-174Article in journal (Refereed) Published
Abstract [en]

Emotions have strong effects on the voice production mechanisms and consequently on voice characteristics. The magnitude of these effects, measured using voice source parameters, and the interdependencies among parameters have not been examined. To better understand these relationships, voice characteristics were analyzed in 10 actors' productions of a sustained/a/vowel in five emotions. Twelve acoustic parameters were studied and grouped according to their physiological backgrounds, three related to subglottal pressure, five related to the transglottal airflow waveform derived from inverse filtering the audio signal, and four related to vocal fold vibration. Each emotion appeared to possess a specific combination of acoustic parameters reflecting a specific mixture of physiologic voice control parameters. Features related to subglottal pressure showed strong within-group and between-group correlations, demonstrating the importance of accounting for vocal loudness in voice analyses. Multiple discriminant analysis revealed that a parameter selection that was based, in a principled fashion, on production processes could yield rather satisfactory discrimination outcomes (87.1 percent based on 12 parameters and 78 percent based on three parameters). The results of this study suggest that systems to automatically detect emotions use a hypothesis-driven approach to selecting parameters that directly reflect the physiological parameters underlying voice and speech production.

Keywords
Paralanguage analysis, affect sensing and analysis, affective computing, voice source, vocal physiology
National Category
Computer Sciences Natural Language Processing
Identifiers
urn:nbn:se:kth:diva-52246 (URN)10.1109/T-AFFC.2011.14 (DOI)000208758300004 ()2-s2.0-80054843364 (Scopus ID)
Funder
EU, European Research Council, ERC-2008-AdG-230331-PROPEREMO
Note

QC 20111220

Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2025-02-01Bibliographically approved
Patel, S., Scherer, K. R., Björkner, E. & Sundberg, J. (2011). Mapping emotions into acoustic space: The role of voice production. Biological Psychology, 87(1), 93-98
Open this publication in new window or tab >>Mapping emotions into acoustic space: The role of voice production
2011 (English)In: Biological Psychology, ISSN 0301-0511, E-ISSN 1873-6246, Vol. 87, no 1, p. 93-98Article in journal (Refereed) Published
Abstract [en]

Research on the vocal expression of emotion has long since used a "fishing expedition" approach to find acoustic markers for emotion categories and dimensions. Although partially successful, the underlying mechanisms have not yet been elucidated. To illustrate that this research can profit from considering the underlying voice production mechanism, we specifically analyzed short affect bursts (sustained/a/vowels produced by 10 professional actors for five emotions) according to physiological variations in phonation (using acoustic parameters derived from the acoustic signal and the inverse filter estimated voice source waveform). Results show significant emotion main effects for 11 of 12 parameters. Subsequent principal components analysis revealed three components that explain acoustic variations due to emotion, including "tension," "perturbation," and "voicing frequency." These results suggest that future work may benefit from theory-guided development of parameters to assess differences in physiological voice production mechanisms in the vocal expression of different emotions. (C) 2011 Elsevier B.V. All rights reserved.

Keywords
Emotion dimensions, Vocal expression, Affective prosody, Voice quality, Vocal physiology, Glottal waveform, Affect bursts
National Category
Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-34240 (URN)10.1016/j.biopsycho.2011.02.010 (DOI)000290195100011 ()21354259 (PubMedID)2-s2.0-79953328709 (Scopus ID)
Note
QC 20110613Available from: 2011-06-13 Created: 2011-05-30 Last updated: 2025-02-09Bibliographically approved
Patel, S., Scherer, K. R., Sundberg, J. & Björkner, E. (2010). Acoustic markers of emotions based on voice physiology. In: Proceedings of the International Conference on Speech Prosody: . Paper presented at 5th International Conference on Speech Prosody: Every Language, Every Style SP 2010, Chicago, USA, 10-14 May, 2010.. International Speech Communications Association
Open this publication in new window or tab >>Acoustic markers of emotions based on voice physiology
2010 (English)In: Proceedings of the International Conference on Speech Prosody, International Speech Communications Association , 2010Conference paper, Published paper (Refereed)
Abstract [en]

Acoustic models of emotions may benefit from considering the underlying voice production mechanism. This study sought to describe emotional expressions according to physiological variations measured from the inverse-filtered glottal waveform in addition to standard parameter extraction. An acoustic analysis was performed on a subset of the /a/ vowels within the GEMEP database (10 speakers, 5 emotions). of the 12 acoustic features computed, repeated measures ANOVA showed significant main effects for 11 parameters. Subsequent principal components analysis revealed the three components that explain acoustic variations due to emotion, including “tension” (CQ, H1-H2, MFDR, LTAS) “perturbation” (jitter, shimmer, HNR), and “voicing” (fundamental frequency).

Place, publisher, year, edition, pages
International Speech Communications Association, 2010
Keywords
Acoustic cues, Affect bursts, Emotion, Glottal waveform, Physiology, Vocal expression, Voice quality, Acoustic analysis, Acoustic features, Emotional expressions, Fundamental frequencies, Principal components analysis, Repeated measures, Three component, Voice production, Physiological models
National Category
Natural Language Processing
Identifiers
urn:nbn:se:kth:diva-304740 (URN)2-s2.0-84959118677 (Scopus ID)9780000000002 (ISBN)
Conference
5th International Conference on Speech Prosody: Every Language, Every Style SP 2010, Chicago, USA, 10-14 May, 2010.
Note

QC 20211110

Available from: 2021-11-10 Created: 2021-11-10 Last updated: 2025-02-07Bibliographically approved
Organisations

Search in DiVA

Show all publications