Expression control in singing voice synthesis: Features, approaches, evaluation, and challenges
2015 (English)In: IEEE signal processing magazine (Print), ISSN 1053-5888, E-ISSN 1558-0792, Vol. 32, no 6, 55-73 p.Article in journal (Refereed) Published
In the context of singing voice synthesis, expression control manipulates a set of voice features related to a particular emotion, style, or singer. Also known as performance modeling, it has been approached from different perspectives and for different purposes, and different projects have shown a wide extent of applicability. The aim of this article is to provide an overview of approaches to expression control in singing voice synthesis. We introduce some musical applications that use singing voice synthesis techniques to justify the need for an accurate control of expression. Then, expression is defined and related to speech and instrument performance modeling. Next, we present the commonly studied set of voice parameters that can change perceptual aspects of synthesized voices. After that, we provide an up-to-date classification, comparison, and description of a selection of approaches to expression control. Then, we describe how these approaches are currently evaluated and discuss the benefits of building a common evaluation framework and adopting perceptually-motivated objective measures. Finally, we discuss the challenges that we currently foresee.
Place, publisher, year, edition, pages
IEEE Press, 2015. Vol. 32, no 6, 55-73 p.
Computer Science Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:kth:diva-180424DOI: 10.1109/MSP.2015.2424572ISI: 000363239200006ScopusID: 2-s2.0-84960404277OAI: oai:DiVA.org:kth-180424DiVA: diva2:893712
tmh_import_16_01_13, tmh_id_4032. QC 201602262016-01-132016-01-132016-02-26Bibliographically approved