Change search
ReferencesLink to record
Permanent link

Direct link
Noise Correlation Matrix Estimation for Multi-Microphone Speech Enhancement
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
2012 (English)In: IEEE Transactions on Audio, Speech, and Language Processing, ISSN 1558-7916, Vol. 20, no 1, 223-233 p.Article in journal (Refereed) Published
Abstract [en]

For multi-channel noise reduction algorithms like the minimum variance distortionless response (MVDR) beamformer, or the multi-channel Wiener filter, an estimate of the noise correlation matrix is needed. For its estimation, it is often proposed in the literature to use a voice activity detector (VAD). However, using a VAD the estimated matrix can only be updated in speech absence. As a result, during speech presence the noise correlation matrix estimate does not follow changing noise fields with an appropriate accuracy. This effect is further increased, as in nonstationary noise voice activity detection is a rather difficult task, and false-alarms are likely to occur. In this paper, we present and analyze an algorithm that estimates the noise correlation matrix without using a VAD. This algorithm is based on measuring the correlation of the noisy input and a noise reference which can be obtained, e. g., by steering a null towards the target source. When applied in combination with an MVDR beamformer, it is shown that the proposed noise correlation matrix estimate results in a more accurate beamformer response, a larger signal-to-noise ratio improvement and a larger instrumentally predicted speech intelligibility when compared to competing algorithms such as the generalized sidelobe canceler, a VAD-based MVDR beamformer, and an MVDR based on the noisy correlation matrix.

Place, publisher, year, edition, pages
2012. Vol. 20, no 1, 223-233 p.
Keyword [en]
Multi-microphone, noise correlation matrix, noise reduction, speech enhancement
National Category
Signal Processing
URN: urn:nbn:se:kth:diva-63240DOI: 10.1109/TASL.2011.2159711ISI: 000298325600023ScopusID: 2-s2.0-81155154446OAI: diva2:484676
QC 20120127Available from: 2012-01-27 Created: 2012-01-23 Last updated: 2012-01-27Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Gerkmann, Timo
By organisation
Sound and Image Processing
In the same journal
IEEE Transactions on Audio, Speech, and Language Processing
Signal Processing

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 41 hits
ReferencesLink to record
Permanent link

Direct link