Anthropomorphic coding of speech and audio: A model inversion approach
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
2005 (English). In: EURASIP Journal on Applied Signal Processing, ISSN 1110-8657, Vol. 2005, no 9, 1334-1349 p. Article in journal (Refereed). Published
Abstract [en]

Auditory modeling is a well-established methodology that provides insight into human perception and that facilitates the extraction of signal features that are most relevant to the listener. The aim of this paper is to provide a tutorial on perceptual speech and audio coding using an invertible auditory model. In this approach, the audio signal is converted into an auditory representation using an invertible auditory model. The auditory representation is quantized and coded. Upon decoding, it is then transformed back into the acoustic domain. This transformation converts a complex distortion criterion into a simple one, thus facilitating quantization with low complexity. We briefly review past work on auditory models and describe in more detail the components of our invertible model and its inversion procedure, that is, the method to reconstruct the signal from the output of the auditory model. We summarize attempts to use the auditory representation for low-bit-rate coding. Our approach also allows the exploitation of the inherent redundancy of the human auditory system for the purpose of multiple description (joint source-channel) coding.

Place, publisher, year, edition, pages
2005. Vol. 2005, no 9, 1334-1349 p.
Keyword [en]
speech and audio coding, auditory representation, auditory model inversion, auditory synthesis, perceptual domain coding, multiple description coding, auditory filter, frequency, transform
National Category
Signal Processing
URN: urn:nbn:se:kth:diva-15178
DOI: 10.1155/ASP.2005.1334
ISI: 000233296600005
ScopusID: 2-s2.0-27844544161
OAI: diva2:333219
QC 20100525 QC 20111011
Available from: 2010-08-05 Created: 2010-08-05 Last updated: 2011-10-11 Bibliographically approved

By author/editor
Kleijn, W. Bastiaan