Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Speaker adaptation of a three-dimensional tongue model
KTH, Tidigare Institutioner, Tal, musik och hörsel.ORCID-id: 0000-0003-4532-014X
2004 (Engelska)Ingår i: INTERSPEECH 2004: ICSLP 8th International Conference on Spoken Language Processing / [ed] Kim, S. H.; Young, D. H., 2004, s. 465-468Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Magnetic Resonance Images of nine subjects have been collected to determine scaling factors that can adapt a 3D tongue model to new subjects. The aim is to define few and simple measures that will allow for an automatic, but accurate, scaling of the model. The scaling should be automatic in order to be useful in an application for articulation training, in which the model must replicate the user's articulators without involving the user in a complicated speaker adaptation. It should further be accurate enough to allow for correct acoustic-to-articulatory inversion. The evaluation shows that the defined scaling technique is able to estimate a tongue shape that was not included in the training with an accuracy of 1.5 mm in the midsagittal plane and 1.7 mm for the whole 3D tongue, based on four articulatory measures.

Ort, förlag, år, upplaga, sidor
2004. s. 465-468
Nationell ämneskategori
Datavetenskap (datalogi) Språkteknologi (språkvetenskaplig databehandling)
Identifikatorer
URN: urn:nbn:se:kth:diva-51811Scopus ID: 2-s2.0-85009061149OAI: oai:DiVA.org:kth-51811DiVA, id: diva2:465106
Konferens
NTERSPEECH 2004 - ICSLP 8th International Conference on Spoken Language Processing. Jeju Island, Korea. October 4-8, 2004
Anmärkning
QC 20120111. tmh_import_11_12_14Tillgänglig från: 2011-12-14 Skapad: 2011-12-14 Senast uppdaterad: 2018-01-12Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Scopus

Sök vidare i DiVA

Av författaren/redaktören
Engwall, Olov
Av organisationen
Tal, musik och hörsel
Datavetenskap (datalogi)Språkteknologi (språkvetenskaplig databehandling)

Sök vidare utanför DiVA

GoogleGoogle Scholar

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 53 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf