kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Acoustic markers of emotions based on voice physiology
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0002-7234-7551
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
2010 (English)In: Proceedings of the International Conference on Speech Prosody, International Speech Communications Association , 2010Conference paper, Published paper (Refereed)
Abstract [en]

Acoustic models of emotions may benefit from considering the underlying voice production mechanism. This study sought to describe emotional expressions according to physiological variations measured from the inverse-filtered glottal waveform in addition to standard parameter extraction. An acoustic analysis was performed on a subset of the /a/ vowels within the GEMEP database (10 speakers, 5 emotions). of the 12 acoustic features computed, repeated measures ANOVA showed significant main effects for 11 parameters. Subsequent principal components analysis revealed the three components that explain acoustic variations due to emotion, including “tension” (CQ, H1-H2, MFDR, LTAS) “perturbation” (jitter, shimmer, HNR), and “voicing” (fundamental frequency).

Place, publisher, year, edition, pages
International Speech Communications Association , 2010.
Keywords [en]
Acoustic cues, Affect bursts, Emotion, Glottal waveform, Physiology, Vocal expression, Voice quality, Acoustic analysis, Acoustic features, Emotional expressions, Fundamental frequencies, Principal components analysis, Repeated measures, Three component, Voice production, Physiological models
National Category
Natural Language Processing
Identifiers
URN: urn:nbn:se:kth:diva-304740Scopus ID: 2-s2.0-84959118677ISBN: 9780000000002 (print)OAI: oai:DiVA.org:kth-304740DiVA, id: diva2:1610363
Conference
5th International Conference on Speech Prosody: Every Language, Every Style SP 2010, Chicago, USA, 10-14 May, 2010.
Note

QC 20211110

Available from: 2021-11-10 Created: 2021-11-10 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Authority records

Sundberg, JohanBjörkner, Eva

Search in DiVA

By author/editor
Sundberg, JohanBjörkner, Eva
By organisation
Speech, Music and Hearing, TMHSpeech, Music and Hearing, TMH
Natural Language Processing

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 103 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf