Using a Biomechanical Model and Articulatory Data for the Numerical Production of Vowels
2016 (English). In: Interspeech 2016, 2016, 3569-3573 p. Conference paper (Refereed)
We introduce a framework for studying speech production using ArtiSynth, a biomechanical model of the human vocal tract. Electromagnetic articulography data were used as input to an inverse tracking simulation that estimates the muscle activations needed to generate 3D jaw and tongue postures corresponding to the target articulator positions. Acoustic simulations require the vocal tract geometry, but since the vocal tract is a cavity rather than a physical object, its geometry does not exist explicitly in a biomechanical model. We have therefore developed a fully automatic method that extracts the 3D geometry (surface mesh) of the vocal tract by blending the geometries of the relevant articulators. This automatic extraction procedure is essential, since a method requiring manual intervention is not feasible for large numbers of simulations or for the generation of dynamic sounds, such as diphthongs. We then simulated the vocal tract acoustics using the Finite Element Method (FEM), which requires a high-quality vocal tract mesh without irregular geometry or self-intersections. We demonstrate that the framework is applicable to acoustic FEM simulations of a wide range of vocal tract deformations. In particular, we present results for cardinal vowel production, including muscle activations, vocal tract geometry, and acoustic simulations.
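As a rough illustration of the acoustic FEM step only, the sketch below computes the lowest resonances of a uniform tube that is closed at the glottis and open at the lips, using 1D linear finite elements with a lumped mass matrix. This is a toy stand-in, not the paper's method: the paper applies FEM to an extracted 3D vocal tract mesh, and the function name `tube_resonances`, the tube length of 0.17 m, and the sound speed of 350 m/s are all assumptions chosen for illustration.

```python
import math


def tube_resonances(length_m=0.17, sound_speed=350.0, n_elements=200, n_modes=3):
    """Lowest resonance frequencies (Hz) of a uniform tube, closed at one end
    (glottis) and open at the other (lips), from a 1D finite element model.

    All default values are illustrative assumptions, not values from the paper.
    """
    if n_elements < 2:
        raise ValueError("n_elements must be at least 2")
    h = length_m / n_elements
    n = n_elements  # unknowns are nodes 0..n-1; node n (lips) has pressure 0

    # Linear elements with a lumped (diagonal) mass matrix give the generalized
    # problem K p = lam * M p with lam = (2*pi*f / c)**2. Scaling by M**-0.5
    # yields a symmetric tridiagonal standard eigenproblem.
    diag = [2.0 / h**2] * n
    off = [-1.0 / h**2] * (n - 1)
    off[0] = -math.sqrt(2.0) / h**2  # the closed-end node carries half the mass

    def count_below(x):
        # Sturm sequence count: number of eigenvalues smaller than x.
        count = 0
        d = diag[0] - x
        if d < 0.0:
            count += 1
        for i in range(1, n):
            if d == 0.0:
                d = -1e-14 / h**2  # avoid division by zero
            d = diag[i] - x - off[i - 1] ** 2 / d
            if d < 0.0:
                count += 1
        return count

    def kth_eigenvalue(k):
        # Bisection between 0 and a Gershgorin upper bound on the spectrum.
        lo, hi = 0.0, 5.0 / h**2
        for _ in range(100):
            mid = 0.5 * (lo + hi)
            if count_below(mid) >= k:
                hi = mid
            else:
                lo = mid
        return 0.5 * (lo + hi)

    return [
        sound_speed * math.sqrt(kth_eigenvalue(k)) / (2.0 * math.pi)
        for k in range(1, n_modes + 1)
    ]


# Compare with the quarter-wavelength formula f_m = (2m - 1) * c / (4 * L).
formants = tube_resonances()
analytic = [(2 * m - 1) * 350.0 / (4 * 0.17) for m in (1, 2, 3)]
```

For these assumed values the analytic resonances are roughly 515, 1544, and 2574 Hz, close to the formants of a neutral vowel. A check of this kind against a known solution is a common sanity test before running FEM on a deformed 3D geometry.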
Keywords: speech production, biomechanical articulatory model, vocal tract geometry, vocal tract acoustics, Finite Element Method
Computer Science; Fluid Mechanics and Acoustics
Identifiers
URN: urn:nbn:se:kth:diva-192602
DOI: 10.21437/Interspeech.2016-1500
OAI: oai:DiVA.org:kth-192602
DiVA: diva2:971288
Interspeech, 8-12 Sep 2016, San Francisco
QC 20160920
2016-09-15, 2016-09-15, 2016-09-20. Bibliographically approved