Change search
ReferencesLink to record
Permanent link

Direct link
A Low-Complexity Spectro-Temporal Distortion Measure for Audio Processing Applications
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
2012 (English)In: IEEE Transactions on Audio, Speech, and Language Processing, ISSN 1558-7916, Vol. 20, no 5, 1553-1564 p.Article in journal (Refereed) Published
Abstract [en]

Perceptual models exploiting auditory masking are frequently used in audio and speech processing applications like coding and watermarking. In most cases, these models only take into account spectral masking in short-time frames. As a consequence, undesired audible artifacts in the temporal domain may be introduced (e.g., pre-echoes). In this article we present a new low-complexity spectro-temporal distortion measure. The model facilitates the computation of analytic expressions for masking thresholds, while advanced spectro-temporal models typically need computationally demanding adaptive procedures to find an estimate of these masking thresholds. We show that the proposed method gives similar masking predictions as an advanced spectro-temporal model with only a fraction of its computational power. The proposed method is also compared with a spectral-only model by means of a listening test. From this test it can be concluded that for non-stationary frames the spectral model underestimates the audibility of introduced errors and therefore overestimates the masking curve. As a consequence, the system of interest incorrectly assumes that errors are masked in a particular frame, which leads to audible artifacts. This is not the case with the proposed method which correctly detects the errors made in the temporal structure of the signal.

Place, publisher, year, edition, pages
2012. Vol. 20, no 5, 1553-1564 p.
Keyword [en]
Audio coding, auditory modeling, perceptual model
National Category
Engineering and Technology
URN: urn:nbn:se:kth:diva-93900DOI: 10.1109/TASL.2012.2184753ISI: 000302083400001ScopusID: 2-s2.0-84858952789OAI: diva2:524939
QC 20120504Available from: 2012-05-04 Created: 2012-05-03 Last updated: 2012-05-04Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Taal, Cees H.
By organisation
Sound and Image Processing
In the same journal
IEEE Transactions on Audio, Speech, and Language Processing
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 18 hits
ReferencesLink to record
Permanent link

Direct link