Audio coding with power spectral density preserving quantization
2012 (English)In: Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, IEEE , 2012, 413-416 p.Conference paper (Refereed)
The coding of audio-visual signals is generally based on different paradigms for high and low rates. At high rates the signal is approximated directly and at low rates only signal features are transmitted. The recently introduced distribution preserving quantization (DPQ) paradigm provides a seamless transition between these two regimes. In this paper we present a simplified scheme that preserves the power spectral density (PSD) rather than the probability distribution. In a practical system the PSD must be estimated. We show that both forward adaptive and backward adaptive PSD estimation are possible. Our experimental results confirm that preservation of PSD at finite precision leads to a unified coding paradigm that provides effective coding at both high and low rates. An audio coding application shows the perceptual benefits of PSD preserving quantization.
Place, publisher, year, edition, pages
IEEE , 2012. 413-416 p.
, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, ISSN 1520-6149
audio coding, backward adaptation, forward adaptation, predictive coding, Quantization
Other Electrical Engineering, Electronic Engineering, Information Engineering
IdentifiersURN: urn:nbn:se:kth:diva-104961DOI: 10.1109/ICASSP.2012.6287904ISI: 000312381400102ScopusID: 2-s2.0-84867604674ISBN: 978-146730046-9OAI: oai:DiVA.org:kth-104961DiVA: diva2:570407
2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012, 25 March 2012 through 30 March 2012, Kyoto
FunderICT - The Next Generation
QC 201211192012-11-192012-11-142013-04-15Bibliographically approved