Singing Voice Detection using Modulation Frequency Features
2008 (English)In: Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition (SAPA), 2008, 7-10 p.Conference paper (Refereed)
In this paper, a feature set derived from modulation spectra is applied to the task of detecting singing voice in historical and recent recordings of Greek Rembetiko. A generalization of SVD to tensors, Higher Order SVD (HOSVD), is applied to reduce the dimensions of the feature vectors. Projection onto the “significant” principal axes of the acoustic and modulation frequency subspaces, results in a compact feature set, which is evaluated using an SVM classifier on a set of hand labeled musical mixtures. Fusion of the proposed features with MFCCs and delta coefficients reduces the optimal detection cost from 11.11% to 9.01%.
Place, publisher, year, edition, pages
2008. 7-10 p.
audio classification, modulation spectrum, singing voice activity detection.
IdentifiersURN: urn:nbn:se:kth:diva-193771OAI: oai:DiVA.org:kth-193771DiVA: diva2:1040342
ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition (SAPA)
QC 201611012016-10-272016-10-102016-11-17Bibliographically approved