Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Estimation of vocal duration in monaural mixtures
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.ORCID iD: 0000-0002-4957-2128
KTH.
KTH.
KTH, School of Engineering Sciences (SCI), Aeronautical and Vehicle Engineering, Marcus Wallenberg Laboratory MWL.ORCID iD: 0000-0001-5723-9571
Show others and affiliations
2014 (English)In: Proceedings - 40th International Computer Music Conference, ICMC 2014 and 11th Sound and Music Computing Conference, SMC 2014 - Music Technology Meets Philosophy: From Digital Echos to Virtual Ethos, National and Kapodistrian University of Athens , 2014, 1172-1177 p.Conference paper, Published paper (Refereed)
Abstract [en]

In this study, the task of vocal duration estimation in monaural music mixtures is explored. We show how presently available algorithms for source separation and predominant f0 estimation can be used as a front end from which features can be extracted. A large set of features is presented, devised to connect different vocal cues to the presence of vocals. Two main cues are utilized; the voice is neither stable in pitch nor in timbre. We evaluate the performance of the model by estimating the length of the vocal regions of the mixtures. To facilitate this, a new set of annotations to a widely adopted data set is developed and made available to the community. The proposed model is able to explain about 78 % of the variance in vocal region length. In a classification task, where the excerpts are classified as either vocal or non-vocal, the model has an accuracy of about 0.94.

Place, publisher, year, edition, pages
National and Kapodistrian University of Athens , 2014. 1172-1177 p.
Keyword [en]
Algorithms, Computer music, Mixtures, Classification tasks, Data set, F0 estimations, Front end
National Category
Other Engineering and Technologies
Identifiers
URN: urn:nbn:se:kth:diva-157969Scopus ID: 2-s2.0-84908895278ISBN: 978-960466137-4 (print)OAI: oai:DiVA.org:kth-157969DiVA: diva2:774125
Conference
40th International Computer Music Conference, ICMC 2014, Joint with the 11th Sound and Music Computing Conference, SMC 2014 - Music Technology Meets Philosophy: From Digital Echos to Virtual Ethos, 14 September 2014 through 20 September 2014, Athens, Greece
Note

QC 20141222

Available from: 2014-12-22 Created: 2014-12-18 Last updated: 2014-12-22Bibliographically approved

Open Access in DiVA

No full text

Scopus

Authority records BETA

Zea, ElíasFriberg, Anders

Search in DiVA

By author/editor
Elowsson, AndersSchön, RagnarHöglund, MattsZea, ElíasFriberg, Anders
By organisation
Speech, Music and Hearing, TMHKTHMarcus Wallenberg Laboratory MWLMusic Acoustics
Other Engineering and Technologies

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 66 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf