On the Benefit of Using Auditory Modeling for Diagnostic Evaluation of Pronunciations
2012 (English)In: International Symposium on Automatic Detection of Errors in Pronunciation Training (IS ADEPT), Stockholm, Sweden, June 6-8, 2012 / [ed] Olov Engwall, 2012, 59-64 p.Conference paper (Refereed)
In this paper we demonstrate that a psychoacoustic model-based distance measure performs better than a speech signal distance measure in assessing the pronunciation of individual foreign speakers. The experiments show that the perceptual based-method performs not only quantitatively better than a speech spectrum-based method, but also qualitatively better, hence showing that auditory information is beneficial in the task of pronunciation error detection. We first present the general approach of the method, which is using the dissimilarity between the native perceptual domain and the non-native speech power spectrum domain. The problematic phonemes for a given non-native speaker are determined by the degree of disparity between the dissimilarity measure for the non-native and a group of native speakers. The two methods compared here are applied to different groups of non-native speakers of various language backgrounds and validated against a theoretical linguistic study.
Place, publisher, year, edition, pages
2012. 59-64 p.
second language learning, auditory model, distortion measure, perceptual assessment, phoneme
Signal Processing Computer Science
IdentifiersURN: urn:nbn:se:kth:diva-95752OAI: oai:DiVA.org:kth-95752DiVA: diva2:529162
International Symposium on Automatic Detection of Errors in Pronunciation Training (IS ADEPT), Stockholm, Sweden, June, 2012
ProjectsSwedish Research Council project 80449001 Computer-Animated LAnguage TEAchers (CALATEA)
QC 201301102012-05-292012-05-292013-01-10Bibliographically approved