Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Generalized postfilter for speech quality enhancement
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
2008 (English)In: IEEE Transactions on Audio, Speech and Language Processing, ISSN 1558-7916, Vol. 16, no 1, 57-64 p.Article in journal (Refereed) Published
Abstract [en]

Postfilters are commonly used in speech coding for the attenuation of quantization noise. In the presence of acoustic background noise or distortion due to tandeming operations, the postfilter parameters are not adjusted and the performance is, therefore, not optimal. We propose a modification that consists of replacing the nonadaptive postfilter parameters with parameters that adapt to variations in spectral flatness, obtained from the noisy speech. This generalization of the postfiltering concept can handle a larger range of noise conditions, but has the same computational complexity and memory requirements as the conventional postfilter. Test results indicate that the presented algorithm improves on the standard postfilter, as well as on the combination of a noise attenuation preprocessor and the conventional postfilter.

Place, publisher, year, edition, pages
2008. Vol. 16, no 1, 57-64 p.
Keyword [en]
Additive noise; Distortion measure; Multiplicative noise; Noise reduction; Perceptually optimal processing; Postfilter; Speech coding; Speech enhancement; Tandeming
National Category
Telecommunications
Identifiers
URN: urn:nbn:se:kth:diva-5947DOI: 10.1109/TASL.2007.909327ISI: 000251947000006Scopus ID: 2-s2.0-64849092071OAI: oai:DiVA.org:kth-5947DiVA: diva2:10489
Note
QC 20100824Available from: 2006-06-02 Created: 2006-06-02 Last updated: 2011-08-25Bibliographically approved
In thesis
1. Human perception in speech processing
Open this publication in new window or tab >>Human perception in speech processing
2006 (English)Doctoral thesis, comprehensive summary (Other scientific)
Abstract [en]

The emergence of heterogeneous networks and the rapid increase of Voice over IP (VoIP) applications provide important opportunities for the telecommunications market. These opportunities come at the price of increased complexity in the monitoring of the quality of service (QoS) and the need for adaptation of transmission systems to the changing environmental conditions. This thesis contains three papers concerned with quality assessment and enhancement of speech communication systems in adverse environments.

In paper A, we introduce a low-complexity, non-intrusive algorithm for monitoring speech quality over the network. In the proposed algorithm, speech quality is predicted from a set of features that capture important structural information from the speech signal.

Papers B and C describe improvements in the conventional pre- and post-processing speech enhancement techniques. In paper B, we demonstrate that the causal Kalman filter implementation is in conflict with the key properties in human perception and propose solutions to the problem. In paper C, we propose adaptation of the conventional postfilter parameters to changes in the noisy conditions. A perceptually motivated distortion measure is used in the optimization of postfilter parameters. Significant improvement over nonadaptive system is obtained.

Place, publisher, year, edition, pages
Stockholm: KTH, 2006
Series
Trita-EE, ISSN 1653-5146 ; 2006:016
Keyword
quality assessment, speech enhancement, postfilter
National Category
Telecommunications
Identifiers
urn:nbn:se:kth:diva-4032 (URN)91-628-6864-0 (ISBN)
Public defence
2006-06-15, E2, Lindstedtsvägen 3, 09:00
Opponent
Supervisors
Note
QC 20100824Available from: 2006-06-02 Created: 2006-06-02 Last updated: 2010-08-24Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Grancharov, VolodyaPlasberg, Jan H.Samuelsson, JonasKleijn, Bastiaan
By organisation
Sound and Image Processing
Telecommunications

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 94 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf