Change search
ReferencesLink to record
Permanent link

Direct link
Inducing Decision Tree Pronunciation Variation Models from Annotated Speech Data
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
2005 (English)In: 9th European Conference on Speech Communication and Technology, Lisbon, Portugal, 2005, 1945-1948 p.Conference paper (Refereed)
Abstract [en]

Amodelofpronunciationof words in discourse context has been induced from the annotation of a spoken language corpus. The information included in the annotation is a set of variables hypothesised to be important for thepronunciationof words in discourse context. The annotation is connected to segmentally defined units on tiers corresponding to linguistically relevant units: the discourse, the utterance, the phrase, the word, the syllable and the phoneme. Themodelis represented as atreestructure, making it transparent for analysis and easy to use in aspeechsynthesis system. Using phonemic canonicalpronunciationrepresentations to estimate the segmental string of theannotateddatagives a 22.1% phone error rate.Decisiontreepronunciationvariationmodelsgenerated in a tenfold cross validation procedure showed an average phone error rate of 9.9%. Using multiple context variables for modellingpronunciationvariationcould thus reduce the error rate by 55%, compared to a baseline using canonicalpronunciationrepresentations.

Place, publisher, year, edition, pages
Lisbon, Portugal, 2005. 1945-1948 p.
National Category
Computer Science Language Technology (Computational Linguistics)
URN: urn:nbn:se:kth:diva-51873ScopusID: 2-s2.0-33745220964OAI: diva2:465167
9th European Conference on Speech Communication and Technology; Lisbon; 4 September 2005 through 8 September 2005
tmh_import_11_12_14. QC 20120201Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2012-02-01Bibliographically approved

Open Access in DiVA

No full text


Search in DiVA

By author/editor
Jande, Per-Anders
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 6 hits
ReferencesLink to record
Permanent link

Direct link