kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Phoneme Level Non-Native Pronunciation Analysis by an Auditory Model-based Native Assessment Scheme
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology. KTH, School of Computer Science and Communication (CSC), Centres, Centre for Speech Technology, CTT.
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology. KTH, School of Computer Science and Communication (CSC), Centres, Centre for Speech Technology, CTT.ORCID iD: 0000-0003-4532-014X
2011 (English)In: 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011, International Speech Communication Association, INTERSPEECH , 2011, p. 1157-1160Conference paper, Published paper (Refereed)
Abstract [en]

We introduce a general method for automatic diagnostic evaluation of the pronunciation of individual non-native speakers based on a model of the human auditory system trained with native data stimuli. For each phoneme class, the Euclidean geometry similarity between the native perceptual domain and the non-native speech power spectrum domain is measured. The problematic phonemes for a given second language speaker are found by comparing this measure to the Euclidean geometry similarity for the same phonemes produced by native speakers only. The method is applied to different groups of non-native speakers of various language backgrounds and the experimental results are in agreement with theoretical findings of linguistic studies.

Place, publisher, year, edition, pages
International Speech Communication Association, INTERSPEECH , 2011. p. 1157-1160
Series
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISSN 1990-9772
Keywords [en]
second language learning, auditory model, distortion measure, perceptual assessment, phoneme
National Category
Other Computer and Information Science Computer Sciences Signal Processing
Identifiers
URN: urn:nbn:se:kth:diva-39054ISI: 000316502200293Scopus ID: 2-s2.0-84865793022ISBN: 978-1-61839-270-1 (print)OAI: oai:DiVA.org:kth-39054DiVA, id: diva2:439334
Conference
12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011. Florence, Italy. 28-31 August 2011
Note

QC 20111118.

Shortlisted for the best student paper award prize.

Available from: 2011-09-07 Created: 2011-09-07 Last updated: 2024-03-18Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Authority records

Engwall, Olov

Search in DiVA

By author/editor
Koniaris, ChristosEngwall, Olov
By organisation
Speech Communication and TechnologyCentre for Speech Technology, CTT
Other Computer and Information ScienceComputer SciencesSignal Processing

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 548 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf