Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Speaker adaptation of a three-dimensional tongue model
KTH, Superseded Departments, Speech, Music and Hearing.ORCID iD: 0000-0003-4532-014X
2004 (English)In: INTERSPEECH 2004: ICSLP 8th International Conference on Spoken Language Processing / [ed] Kim, S. H.; Young, D. H., 2004, 465-468 p.Conference paper, Published paper (Refereed)
Abstract [en]

Magnetic Resonance Images of nine subjects have been collected to determine scaling factors that can adapt a 3D tongue model to new subjects. The aim is to define few and simple measures that will allow for an automatic, but accurate, scaling of the model. The scaling should be automatic in order to be useful in an application for articulation training, in which the model must replicate the user's articulators without involving the user in a complicated speaker adaptation. It should further be accurate enough to allow for correct acoustic-to-articulatory inversion. The evaluation shows that the defined scaling technique is able to estimate a tongue shape that was not included in the training with an accuracy of 1.5 mm in the midsagittal plane and 1.7 mm for the whole 3D tongue, based on four articulatory measures.

Place, publisher, year, edition, pages
2004. 465-468 p.
National Category
Computer Science Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-51811Scopus ID: 2-s2.0-85009061149OAI: oai:DiVA.org:kth-51811DiVA: diva2:465106
Conference
NTERSPEECH 2004 - ICSLP 8th International Conference on Spoken Language Processing. Jeju Island, Korea. October 4-8, 2004
Note
QC 20120111. tmh_import_11_12_14Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2012-01-11Bibliographically approved

Open Access in DiVA

No full text

Scopus

Search in DiVA

By author/editor
Engwall, Olov
By organisation
Speech, Music and Hearing
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 36 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf