A Speech Preprocessing Strategy For Intelligibility Improvement In Noise Based On A Perceptual Distortion Measure
2012 (English)In: 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, 4061-4064 p.Conference paper (Refereed)
A speech pre-processing algorithm is presented to improve the speech intelligibility in noise for the near-end listener. The algorithm improves the intelligibility by optimally redistributing the speech energy over time and frequency for a perceptual distortion measure, which is based on a spectro-temporal auditory model. In contrast to spectral-only models, short-time information is taken into account. As a consequence, the algorithm is more sensitive to transient regions, which will therefore receive more amplification compared to stationary vowels. It is known from literature that changing the vowel-transient energy ratio is beneficial for improving speech-intelligibility in noise. Objective intelligibility prediction results show that the proposed method has higher speech intelligibility in noise compared to two other reference methods, without modifying the global speech energy.
Place, publisher, year, edition, pages
2012. 4061-4064 p.
Near-end speech enhancement, intelligibility improvement, transients
IdentifiersURN: urn:nbn:se:kth:diva-98824ISI: 000312381404033OAI: oai:DiVA.org:kth-98824DiVA: diva2:539512
IEEE International Conference on Acoustics, Speech and Signal Processing
QC 201305242012-07-042012-07-032013-05-24Bibliographically approved