Enhancing Subjective Speech Intelligibility Using a Statistical Model of Speech
2012 (English)In: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol 1, 2012, 166-169 p.Conference paper (Refereed)
The intelligibility of speech in adverse noise conditions can be improved by modifying the characteristics of the clean speech prior to its presentation. An effective and flexible paradigm is to select the modification by optimizing a measure of objective intelligibility. Here we apply this paradigm at the text level and optimize a measure related to the classification error probability in an automatic speech recognition system. The proposed method was applied to a simple but powerful band-energy modification mechanism under an energy preservation constraint. Subjective evaluation results provide a clear indication of a significant gain in subjective intelligibility. In contrast to existing methods, the proposed approach is not restricted to a particular modification strategy and treats the notion of optimality at a level closer to that of subjective intelligibility. The computational complexity of the method is sufficiently low to enable its use in on-line applications.
Place, publisher, year, edition, pages
2012. 166-169 p.
speech modification, subjective intelligibility, statistical model of speech
IdentifiersURN: urn:nbn:se:kth:diva-113296ISI: 000320827200042ScopusID: 2-s2.0-84878403433ISBN: 978-1-62276-759-5OAI: oai:DiVA.org:kth-113296DiVA: diva2:587591
13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; Portland, OR; United States; 9 September 2012 through 13 September 2012
ProjectsListening Talker (LISTA)
FunderEU, FP7, Seventh Framework Programme
QC 201301152013-01-142013-01-142013-08-23Bibliographically approved