Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs
2007 (English)In: IEEE Transactions on Audio, Speech, and Language Processing, ISSN 1558-7916, Vol. 15, no 5, 1564-1578 p.Article in journal (Refereed) Published
Probabilistic approaches can offer satisfactory solutions to source separation with a single channel, provided that the models of the sources match accurately the statistical properties of the mixed signals. However, it is not always possible to train such models. To overcome this problem, we propose to resort to an adaptation scheme for adjusting the source models with respect to the actual properties of the signals observed in the mix. In this paper; we introduce a general formalism for source model-adaptation which is expressed in the framework of Bayesian models. Particular cases of the proposed approach are then investigated experimentally on the problem of separating voice from music in popular songs. The obtained results show that an adaptation scheme can improve consistently and significantly the separation performance in comparison with nonadapted models.
Place, publisher, year, edition, pages
2007. Vol. 15, no 5, 1564-1578 p.
adaptive Wiener filtering, Bayesian model, expectation maximization (EM), Gaussian mixture model (GMM), maximum a posteriori (MAP), model adaptation, single-channel source separation, time-frequency masking
Engineering and Technology
IdentifiersURN: urn:nbn:se:kth:diva-37113DOI: 10.1109/TASL.2007.899291ISI: 000247547000007ScopusID: 2-s2.0-51449094735OAI: oai:DiVA.org:kth-37113DiVA: diva2:432144