Single channel speech enhancement using Bayesian NMF with recursive temporal updates of prior distributions
Mohammadiha, Nasser; Taghia, Jalil; Leijon, Arne
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
2012 (English). In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, IEEE conference proceedings, 2012, 4561-4564 p. Conference paper, Published paper (Refereed)
Abstract [en]

We present a speech enhancement algorithm which is based on a Bayesian Nonnegative Matrix Factorization (NMF). Both Minimum Mean Square Error (MMSE) and Maximum a-Posteriori (MAP) estimates of the magnitude of the clean speech DFT coefficients are derived. To exploit the temporal continuity of the speech and noise signals, a proper prior distribution is introduced by widening the posterior distribution of the NMF coefficients at the previous time frames. To do so, a recursive temporal update scheme is proposed to obtain the mean value of the prior distribution; also, the uncertainty of the prior information is governed by the shape parameter of the distribution which is learnt automatically based on the nonstationarity of the signals. Simulations show a considerable improvement compared to the maximum likelihood NMF based speech enhancement algorithm for different input SNRs.
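
For orientation, the maximum likelihood NMF baseline that the abstract compares against can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes pre-trained nonnegative basis matrices for speech and noise, uses the standard KL-divergence multiplicative update for the activations, and applies a Wiener-type gain to the noisy magnitude spectrogram. The function names `kl_nmf_activations` and `enhance` and all parameters are hypothetical. The Bayesian priors and the recursive temporal update described in the abstract are not modelled here.

```python
import numpy as np

def kl_nmf_activations(V, W, n_iter=50, eps=1e-12, seed=0):
    """Estimate H >= 0 such that V ~ W @ H, keeping the basis W fixed.

    Uses the standard multiplicative update for the KL divergence.
    V: (freq, frames) nonnegative spectrogram, W: (freq, K) nonnegative basis.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    col_sums = W.sum(axis=0)[:, None]  # W^T 1, one value per basis vector
    for _ in range(n_iter):
        WH = W @ H + eps
        H *= (W.T @ (V / WH)) / (col_sums + eps)
    return H

def enhance(noisy_mag, W_speech, W_noise, n_iter=50, eps=1e-12):
    """Wiener-type enhancement from a joint speech-plus-noise NMF model.

    Assumption: W_speech and W_noise were learnt beforehand from training data.
    """
    W = np.hstack([W_speech, W_noise])
    H = kl_nmf_activations(noisy_mag, W, n_iter=n_iter, eps=eps)
    k = W_speech.shape[1]
    speech_est = W_speech @ H[:k]   # speech part of the model
    noise_est = W_noise @ H[k:]     # noise part of the model
    gain = speech_est / (speech_est + noise_est + eps)
    return gain * noisy_mag
```

In this sketch each frame is processed independently, so temporal continuity is ignored; the paper's contribution, per the abstract, is to carry information across frames through the prior distribution of the NMF coefficients.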

Place, publisher, year, edition, pages
IEEE conference proceedings, 2012. 4561-4564 p.
Series
IEEE International Conference on Acoustics, Speech and Signal Processing. Proceedings, ISSN 1520-6149
Keyword [en]
Speech enhancement, NMF, MMSE, MAP
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:kth:diva-75458
DOI: 10.1109/ICASSP.2012.6288933
ISI: 000312381404158
Scopus ID: 2-s2.0-84867609546
ISBN: 978-1-4673-0045-2 (print)
OAI: oai:DiVA.org:kth-75458
DiVA: diva2:490501
Conference
IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012; Kyoto; 25 March 2012 through 30 March 2012
Funder
ICT - The Next Generation
Note

QC 20121015

Available from: 2012-02-05. Created: 2012-02-05. Last updated: 2013-04-15. Bibliographically approved.

Open Access in DiVA

fulltext (207 kB), 363 downloads
File information
File name: FULLTEXT01.pdf
File size: 207 kB
Checksum (SHA-512): 19a998a9f2ae1f4a76a1c472deb646c77092c3f85fcdcbda76f45d495eeca74fdbbfcaf7ec503a0c3aa8733101a0afb63777276bc0573058fc4752f1734c4353
Type: fulltext
Mimetype: application/pdf

Other links

Publisher's full text, Scopus, IEEEXplore

Search in DiVA

By author/editor
Mohammadiha, Nasser; Taghia, Jalil; Leijon, Arne
By organisation
Sound and Image Processing
Electrical Engineering, Electronic Engineering, Information Engineering

Total: 363 downloads
The number of downloads is the sum of all downloads of full texts. It may include, e.g., previous versions that are no longer available.

Total: 374 hits