Change search
ReferencesLink to record
Permanent link

Direct link
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
Show others and affiliations
2016 (English)In: IEEE Transactions on Affective Computing, ISSN 1949-3045, E-ISSN 1949-3045, Vol. 7, no 2, 190-202 p.Article in journal (Refereed) Published
Abstract [en]

Work on voice sciences over recent decades has led to a proliferation of acoustic parameters that are used quite selectively and are not always extracted in a similar fashion. With many independent teams working in different research areas, shared standards become an essential safeguard to ensure compliance with state-of-the-art methods allowing appropriate comparison of results across studies and potential integration and combination of extraction and recognition systems. In this paper we propose a basic standard acoustic parameter set for various areas of automatic voice analysis, such as paralinguistic or clinical speech analysis. In contrast to a large brute-force parameter set, we present a minimalistic set of voice parameters here. These were selected based on a) their potential to index affective physiological changes in voice production, b) their proven value in former studies as well as their automatic extractability, and c) their theoretical significance. The set is intended to provide a common baseline for evaluation of future research and eliminate differences caused by varying parameter sets or even different implementations of the same parameters. Our implementation is publicly available with the openSMILE toolkit. Comparative evaluations of the proposed feature set and large baseline feature sets of INTERSPEECH challenges show a high performance of the proposed set in relation to its size.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2016. Vol. 7, no 2, 190-202 p.
Keyword [en]
Affective computing, acoustic features, standard, emotion recognition, speech analysis, geneva minimalistic parameter set
National Category
Computer Science
URN: urn:nbn:se:kth:diva-194029DOI: 10.1109/TAFFC.2015.2457417ISI: 000383995300007ScopusID: 2-s2.0-84973513831OAI: diva2:1037599
EU, FP7, Seventh Framework Programme, 230331-PROPEREMO

QC 20161017

Available from: 2016-10-17 Created: 2016-10-14 Last updated: 2016-10-17Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Sundberg, JohanLaukka, Petri
By organisation
Speech, Music and Hearing, TMH
In the same journal
IEEE Transactions on Affective Computing
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

ReferencesLink to record
Permanent link

Direct link