Predicting Unseen Articulations from Multi-speaker Articulatory Models
2010 (English)
In: Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, Japan, 2010, p. 1588-1591
Conference paper, Published paper (Refereed)
Abstract [en]
In order to study inter-speaker variability, this work aims to assess the generalization capabilities of data-based multi-speaker articulatory models. We use various three-mode factor analysis techniques to model the variations of midsagittal vocal tract contours obtained from MRI images for three French speakers articulating 73 vowels and consonants. Articulations of a given speaker for phonemes not present in the training set are then predicted by inversion of the models from measurements of these phonemes articulated by the other subjects. On the average, the prediction RMSE was 5.25 mm for tongue contours, and 3.3 mm for 2D midsagittal vocal tract distances. Besides, this study has established a methodology to determine the optimal number of factors for such models.
Place, publisher, year, edition, pages
Makuhari, Japan, 2010. p. 1588-1591
Keywords [en]
Factor analysis, Multi-speaker articulatory model
National Category
Computer Sciences
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-52154
ISI: 000313086500009
Scopus ID: 2-s2.0-79959825917
ISBN: 978-1-61782-123-3 (print)
OAI: oai:DiVA.org:kth-52154
DiVA, id: diva2:465449
Conference
11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010; Makuhari, Chiba; 26 September 2010 through 30 September 2010
Note
tmh_import_11_12_14. QC 20111220
2011-12-14, 2011-12-14, 2024-03-18. Bibliographically approved