Change search
ReferencesLink to record
Permanent link

Direct link
Bit-rate scalable intraframe sinusoidal audio coding based on rate-distortion optimization
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
Show others and affiliations
2006 (English)In: Journal of the Audio Engineering Society, ISSN 1549-4950, Vol. 54, no 3, 167-188 p.Article in journal (Refereed) Published
Abstract [en]

A coding methodology that aims at rate-distortion optimal sinusoid + noise coding of audio and speech signals is presented. The coder divides the input signal into variable-length time segments and distributes sinusoidal components over the segments such that the resulting distortion (as measured by a perceptual distortion measure) is minimized subject to a prespecified rate constraint. The coder is bit-rate scalable. For a given target bit budget it automatically adapts the segmentation and distribution of sinusoids in a rate-distortion optimal manner. The coder uses frequency-differential coding techniques in order to exploit intrasegment correlations for efficient quantization and encoding of the sinusoidal model parameters. This technique makes the coder more robust toward packet losses when used in a lossy-packet channel environment as compared to time-differential coding techniques, which are commonly used in audio or speech coders. In a subjective listening experiment the present coder showed similar or better performance than a set of four MPEG-4 coders operating at bit rates of 16, 24, 32, and 48 kbit/s, each of which was state of the art for the given target bit rate.

Place, publisher, year, edition, pages
2006. Vol. 54, no 3, 167-188 p.
Keyword [en]
speech analysis synthesis, matching pursuits, model, representation
URN: urn:nbn:se:kth:diva-15635ISI: 000237030700004ScopusID: 2-s2.0-33646166772OAI: diva2:333677
QC 20100525Available from: 2010-08-05 Created: 2010-08-05Bibliographically approved

Open Access in DiVA

No full text


Search in DiVA

By author/editor
Kleijn, W. Bastiaan
By organisation
Sound and Image Processing

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 33 hits
ReferencesLink to record
Permanent link

Direct link