A low-complexity spectro-temporal based perceptual model
2009 (English). In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, IEEE, 2009, pp. 153-156. Conference paper (Refereed)
The use of psychoacoustical masking models for audio coding applications has been widespread over the past decades. In such applications, it is typically assumed that the original input signal serves as a masker for the distortions introduced by the lossy coding method. Up to now, these masking models have mostly been based on spectral masking. In this paper, we propose a new perceptual model for audio and speech processing algorithms based on spectro-temporal masking. A sophisticated perceptual model is simplified such that the resulting distortion measure can be written as a frequency-weighted l2-norm. This yields the same computational complexity as conventional spectral-based methods while preserving the temporal fine structure of the clean signal. It is shown that the new model can successfully avoid pre-echoes and can correctly predict masking curves for various maskers.
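To make the phrase "frequency-weighted l2-norm" concrete, the following is a minimal sketch of the generic form such a distortion measure can take: the squared magnitude of the error spectrum, weighted per frequency bin and summed. It is not the model from the paper. The function names `dft` and `weighted_l2_distortion` and the `weights` argument are hypothetical, and how the paper derives its weights from the clean signal is not stated in the abstract.

```python
import cmath


def dft(x):
    """Naive O(N^2) discrete Fourier transform (unnormalized)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def weighted_l2_distortion(clean, coded, weights):
    """Return sum_k weights[k] * |E[k]|^2, where E is the DFT of (clean - coded).

    `weights` is a placeholder for per-bin perceptual weights (assumption:
    in a real coder these would come from a masking model, not be hand-set).
    """
    if not (len(clean) == len(coded) == len(weights)):
        raise ValueError("clean, coded and weights must have equal length")
    error = [c - d for c, d in zip(clean, coded)]
    spectrum = dft(error)
    return sum(w * abs(e) ** 2 for w, e in zip(weights, spectrum))
```

With all weights set to 1, this reduces by Parseval's relation to N times the time-domain squared error, so any perceptual behaviour comes entirely from the choice of weights.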
Place, publisher, year, edition, pages
IEEE, 2009. pp. 153-156.
IEEE International Conference on Acoustics, Speech and Signal Processing. Proceedings, ISSN 1520-6149
Audio coding, Auditory masking, Psychoacoustics
Identifiers
URN: urn:nbn:se:kth:diva-98831
ISBN: 978-142442354-5
OAI: oai:DiVA.org:kth-98831
DiVA: diva2:539505
2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009; Taipei; 19 April 2009 through 24 April 2009
QC 20120808
2012-07-04, 2012-07-03, 2012-08-08. Bibliographically approved