kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Synthesis of VV utterances from muscle activation to sound with a 3d model
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.ORCID iD: 0000-0002-8991-1016
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
2017 (English)In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, The International Speech Communication Association (ISCA), 2017, p. 3497-3501Conference paper, Published paper (Refereed)
Abstract [en]

We propose a method to automatically generate deformable 3D vocal tract geometries from the surrounding structures in a biomechanical model. This allows us to couple 3D biomechanics and acoustics simulations. The basis of the simulations is muscle activation trajectories in the biomechanical model, which move the articulators to the desired articulatory positions. The muscle activation trajectories for a vowel-vowel utterance are here defined through interpolation between the determined activations of the start and end vowel. The resulting articulatory trajectories of flesh points on the tongue surface and jaw are similar to corresponding trajectories measured using Electromagnetic Articulography, hence corroborating the validity of interpolating muscle activation. At each time step in the articulatory transition, a 3D vocal tract tube is created through a cavity extraction method based on first slicing the geometry of the articulators with a semi-polar grid to extract the vocal tract contour in each plane and then reconstructing the vocal tract through a smoothed 3D mesh-generation using the extracted contours. A finite element method applied to these changing 3D geometries simulates the acoustic wave propagation. We present the resulting acoustic pressure changes on the vocal tract boundary and the formant transitions for the utterance [Ai].

Place, publisher, year, edition, pages
The International Speech Communication Association (ISCA), 2017. p. 3497-3501
National Category
Language Technology (Computational Linguistics)
Research subject
Speech and Music Communication
Identifiers
URN: urn:nbn:se:kth:diva-212993DOI: 10.21437/Interspeech.2017-1614ISI: 000457505000728Scopus ID: 2-s2.0-85039149051OAI: oai:DiVA.org:kth-212993DiVA, id: diva2:1136223
Conference
18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 through 24 August 2017
Note

QC 20170828

Available from: 2017-08-25 Created: 2017-08-25 Last updated: 2024-03-18Bibliographically approved

Open Access in DiVA

fulltext(461 kB)430 downloads
File information
File name FULLTEXT01.pdfFile size 461 kBChecksum SHA-512
76cc1adf8fc384d3ceb0cdd023b6c7fdc597cdecd5c44fca69c1841d999597806d437b2405fe9364846a0b19b1b4bbcfdab0a304b11d6c51763c1fc752a412e0
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Dabbaghchian, SaeedEngwall, Olov

Search in DiVA

By author/editor
Dabbaghchian, SaeedEngwall, Olov
By organisation
Speech Communication and Technology
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 430 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 255 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf