Auditory and Dynamic Modeling Paradigms to Detect L2 Mispronunciations
2012 (English)In: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol 1, 2012, 898-901 p.Conference paper (Refereed)
This paper expands our previous work on automatic pronunciation error detection that exploits knowledge from psychoacoustic auditory models. The new system has two additional important features, i.e., auditory and acoustic processing of the temporal cues of the speech signal, and classification feedback from a trained linear dynamic model. We also perform a pronunciation analysis by considering the task as a classification problem. Finally, we evaluate the proposed methods conducting a listening test on the same speech material and compare the judgment of the listeners and the methods. The automatic analysis based on spectro-temporal cues is shown to have the best agreement with the human evaluation, particularly with that of language teachers, and with previous plenary linguistic studies.
Place, publisher, year, edition, pages
2012. 898-901 p.
L2 pronunciation error, auditory model, linear dynamic model, distortion measure, phoneme
Signal Processing Other Computer and Information Science Computer Science
IdentifiersURN: urn:nbn:se:kth:diva-102317ISI: 000320827200225ScopusID: 2-s2.0-84878407679ISBN: 978-1-62276-759-5OAI: oai:DiVA.org:kth-102317DiVA: diva2:552320
13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; Portland, OR; United States; 9 September 2012 through 13 September 2012
QC 201209142012-09-132012-09-132013-08-22Bibliographically approved