kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Localisation in virtual choirs: outcomes of simplified binaural rendering
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH, Music Acoustics.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH, Music Acoustics. KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH, Speech Communication and Technology.ORCID iD: 0000-0002-3362-7518
2023 (English)Conference paper, Published paper (Refereed)
Abstract [en]

A virtual choir would find several uses in choral pedagogy and research, but it would need a relatively small computational footprint for wide uptake. On the premise that very accurate localisation might not be needed for virtual rendering of the character of the sound inside an ensemble of singers, a localisation test was conducted using binaural stimuli created using a simplified approach, with parametrically controlled delays and variable low-pass filters (historically known as a ‘shuffler’ circuit) instead of head-related impulse responses. The direct sound from a monophonic anechoic recording of a soprano was processed (1) by sending it to a reverb algorithm for making a room-acoustic diffuse field with unchanging properties, (2) with a second-order low-pass filter with a cut-off frequency descending to 3 kHz for sources from behind, (3) with second-order low-pass head-shading filters with an angle-dependent cut-off frequency for the left/right lateral shadings of the head, and (4) with the gain of the direct sound being inversely proportional to virtual distance. The recorded singer was modelled as always facing the listener; no frequency-dependent directivity was implemented. Binaural stimuli corresponding to 24 different singer positions (8 angles and 3 distances) were synthesized. 30 participants heard the stimuli in randomized order, and indicated the perceived location of the singer on polar plot response sheets, with categories to indicate the possible responses. The listeners’ discrimination of the distance categories 0.5, 1 and 2 meters (1 correct out of 3 possible) was good, at about 80% correct. Discrimination of the angle of incidence, in 45-degreecategories (1 correct out of 8 possible) was fair, at 47% correct. Angle errors were mostly on the ‘cone of confusion’ (back-front symmetry), suggesting that the back-front cue was not very salient. The correct back-front responses (about 50%) dominated only somewhat over the incorrect ones (about 38%). In an ongoing follow-up study, multi-singer scenarios will be tested, and a more detailed yet still parametric filtering scheme will be explored.

Place, publisher, year, edition, pages
2023. article id 41
Keywords [en]
singing, choir, localisation, binaural
National Category
Media and Communication Technology
Research subject
Speech and Music Communication
Identifiers
URN: urn:nbn:se:kth:diva-335329OAI: oai:DiVA.org:kth-335329DiVA, id: diva2:1798566
Conference
Audio Engineering Society Conference: AES 2023 International Conference on Spatial and Immersive Audio, Huddersfield, UK, 23-25 Aug 2023.
Note

QC 20230919

Available from: 2023-09-19 Created: 2023-09-19 Last updated: 2023-09-19Bibliographically approved

Open Access in DiVA

fulltext(457 kB)56 downloads
File information
File name FULLTEXT01.pdfFile size 457 kBChecksum SHA-512
30c038002ec8e7e0aed089f3fb751e2539908b80aeb6a7340b2fdbb1d6bc9d0ae4e3c9680294c0de75b9e391fce4e07007160dc7d564607146dee780040bff77
Type fulltextMimetype application/pdf

Other links

http://www.aes.org/e-lib/browse.cfm?elib=22177

Authority records

Kittimathaveenan, KajornsakTernström, Sten

Search in DiVA

By author/editor
Kittimathaveenan, KajornsakTernström, Sten
By organisation
Music AcousticsSpeech Communication and Technology
Media and Communication Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 56 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 224 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf