Change search
ReferencesLink to record
Permanent link

Direct link
Annotating Speech Data for Pronunciation Variation Modelling
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
2005 (English)In: Proceedings, FONETIK 2005, Göteborg, Sweden, 2005, 25-27 p.Conference paper (Other academic)
Abstract [en]

This paper describes methods for annotating recorded speech with information hypothesised to be important for the pronunciation of words in discourse context. Annotation is structured into six hierarchically ordered tiers, each tier corresponding to a segmentally defined linguis-tic unit. Automatic methods are used to seg-ment and annotate the respective annotation tiers. Decision tree models trained on annota-tion from elicited monologue showed a pho-neme error rate of 9.91%, corresponding to a 55.25% error reduction compared to using a canonical pronunciation representation from a lexicon for estimating the phonetic realisation.

Place, publisher, year, edition, pages
Göteborg, Sweden, 2005. 25-27 p.
National Category
Computer Science Language Technology (Computational Linguistics)
URN: urn:nbn:se:kth:diva-51892OAI: diva2:465186
FONETIK 2005, Department of Linguistics, Göteborg University
tmh_import_11_12_14. QC 20120201Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2012-02-01Bibliographically approved

Open Access in DiVA

No full text

Other links

Search in DiVA

By author/editor
Jande, Per-Anders
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 5 hits
ReferencesLink to record
Permanent link

Direct link