Adaptation and Normalization Experiments in Speech Recognition for 4 to 8 Year old Children.
2005 (English), Conference paper (Refereed)
An experimental offline investigation of the performance of connected digits recognition was performed on children in the age range four to eight years. Poor performance using adult models was improved significantly by adaptation and vocal tract length normalization but not to the same level as training on children. Age-dependent models were tried with limited advantage. A combined adult and child training corpus maintained the performance for the separately trained categories. Linear frequency compression for vocal tract length normalization was attempted but estimation of the warping factor was sensitive to non-speech segments and background noise. Phoneme-based word modeling outperformed the whole-word models, even though the vocabulary only consisted of digits.
Place, publisher, year, edition, pages
Lisboa, 2005. 2749-2752 p.
MAP, MLLR, VTLN, speech recognition, child
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-12253
ScopusID: 2-s2.0-33745256261
OAI: oai:DiVA.org:kth-12253
DiVA: diva2:306703
QC 20110502
2010-03-30, 2010-03-30, 2011-05-02, Bibliographically approved