The Synchronized Short-Time-Fourier-Transform: Properties and Definitions for Multichannel Source Separation
2011 (English)In: IEEE Transactions on Signal Processing, ISSN 1053-587X, E-ISSN 1941-0476, ISSN 1053-587X, Vol. 59, no 1, 91-103 p.Article in journal (Refereed) Published
This paper proposes the use of a synchronized linear transform, the synchronized short-time-Fourier-transform (sSTFT), for time-frequency analysis of anechoic mixtures. We address the short comings of the commonly used time-frequency linear transform in multichannel settings, namely the classical short-time-Fourier-transform (cSTFT). We propose a series of desirable properties for the linear transform used in a multichannel source separation scenario: stationary invertibility, relative delay, relative attenuation, and finally delay invariant relative windowed-disjoint orthogonality (DIRWDO). Multisensor source separation techniques which operate in the time-frequency domain, have an inherent error unless consideration is given to the multichannel properties proposed in this paper. The sSTFT preserves these relationships for multichannel data. The crucial innovation of the sSTFT is to locally synchronize the analysis to the observations as opposed to a global clock. Improvement in separation performance can be achieved because assumed properties of the time-frequency transform are satisfied when it is appropriately synchronized. Numerical experiments show the sSTFT improves instantaneous subsample relative parameter estimation in low noise conditions and achieves good synthesis.
Place, publisher, year, edition, pages
IEEE Signal Processing Society, 2011. Vol. 59, no 1, 91-103 p.
Fourier transforms;parameter estimation;source separation;synchronisation;anechoic mixtures;delay invariant relative windowed-disjoint orthogonality;multichannel source separation;parameter estimation;short time Fourier transform;synchronized linear transform;time frequency domain;time frequency linear transform;Delay;Fourier transforms;Signal processing algorithms;Speech;Synchronization;Time frequency analysis;Signal analysis;source separation
Research subject Applied and Computational Mathematics
IdentifiersURN: urn:nbn:se:kth:diva-174150DOI: 10.1109/TSP.2010.2088392ISI: 000285519200009ScopusID: 2-s2.0-84877759831OAI: oai:DiVA.org:kth-174150DiVA: diva2:858213
QC 201510052015-10-012015-10-012015-10-05Bibliographically approved