Codebook-based Bayesian speech enhancement for nonstationary environments
2007 (English). In: IEEE transactions on speech and audio processing, ISSN 1063-6676, E-ISSN 1558-2353, Vol. 15, no 2, 441-452 p. Article in journal (Refereed). Published.
In this paper, we propose a Bayesian minimum mean squared error approach for the joint estimation of the short-term predictor parameters of speech and noise, from the noisy observation. We use trained codebooks of speech and noise linear predictive coefficients to model the a priori information required by the Bayesian scheme. In contrast to current Bayesian estimation approaches that consider the excitation variances as part of the a priori information, in the proposed method they are computed online for each short-time segment, based on the observation at hand. Consequently, the method performs well in nonstationary noise conditions. The resulting estimates of the speech and noise spectra can be used in a Wiener filter or any state-of-the-art speech enhancement system. We develop both memoryless (using information from the current frame alone) and memory-based (using information from the current and previous frames) estimators. Estimation of functions of the short-term predictor parameters is also addressed, in particular one that leads to the minimum mean squared error estimate of the clean speech signal. Experiments indicate that the scheme proposed in this paper performs significantly better than competing methods.
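The memoryless estimator described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several parts are assumptions made for illustration: the function names are hypothetical, the excitation variances are fitted per frame by a clipped least-squares match to the noisy periodogram, and each codebook pair is weighted by the exponential of the negative Itakura-Saito distortion as a stand-in for the likelihood. The abstract specifies neither of these choices. Codebook entries are assumed to be linear predictive coefficient vectors of the form `[1, a1, ..., ap]`.

```python
import numpy as np

def lpc_envelope(a, n_bins):
    """Spectral envelope 1/|A(e^{jw})|^2 on n_bins frequencies in [0, pi]."""
    A = np.fft.rfft(a, 2 * (n_bins - 1))
    return 1.0 / np.maximum(np.abs(A) ** 2, 1e-12)

def fit_variances(P_y, env_s, env_w):
    """Assumed per-frame fit: P_y ~ g_s*env_s + g_w*env_w, clipped at zero."""
    M = np.stack([env_s, env_w], axis=1)
    g, *_ = np.linalg.lstsq(M, P_y, rcond=None)
    return np.maximum(g, 0.0)

def itakura_saito(P, P_hat):
    """Mean Itakura-Saito distortion between two power spectra."""
    r = P / P_hat
    return np.mean(r - np.log(r) - 1.0)

def codebook_mmse_spectra(P_y, speech_cb, noise_cb):
    """Weighted average of speech and noise spectra over all codebook pairs."""
    P_y = np.maximum(P_y, 1e-12)  # avoid log(0) in the distortion
    n_bins = P_y.shape[0]
    log_w, pairs = [], []
    for a_s in speech_cb:
        env_s = lpc_envelope(a_s, n_bins)
        for a_w in noise_cb:
            env_w = lpc_envelope(a_w, n_bins)
            # Excitation variances come from the current frame, not from a prior.
            g_s, g_w = fit_variances(P_y, env_s, env_w)
            P_s, P_w = g_s * env_s, g_w * env_w
            model = np.maximum(P_s + P_w, 1e-12)
            log_w.append(-itakura_saito(P_y, model))  # assumed likelihood proxy
            pairs.append((P_s, P_w))
    log_w = np.array(log_w)
    w = np.exp(log_w - log_w.max())
    w /= w.sum()
    P_s_hat = sum(wi * p[0] for wi, p in zip(w, pairs))
    P_w_hat = sum(wi * p[1] for wi, p in zip(w, pairs))
    return P_s_hat, P_w_hat

def wiener_gain(P_s, P_w):
    """Frequency-domain Wiener gain built from the estimated spectra."""
    return P_s / np.maximum(P_s + P_w, 1e-12)
```

Because the two variances are refitted for every frame, the estimated noise level can follow changes in the observation, which is the property the abstract credits for the performance in nonstationary noise. A memory-based variant would additionally carry information from previous frames into the weights; that is not sketched here.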
Bayesian, Codebooks, Linear predictive coding, Noise estimation, Speech enhancement, Speech processing, Wiener filtering
Identifiers
URN: urn:nbn:se:kth:diva-7735
DOI: 10.1109/TASL.2006.881696
ISI: 000243914800007
ScopusID: 2-s2.0-51449109652
OAI: oai:DiVA.org:kth-7735
DiVA: diva2:12850
QC 20100903. Updated from Submitted to Published (20100903).
2005-10-20, 2005-10-20, 2011-08-25. Bibliographically approved