Who am I speaking at?: perceiving the head orientation of speakers from acoustic cues alone
2012 (English)In: Proc. of LREC Workshop on Multimodal Corpora 2012, Istanbul, Turkey, 2012Conference paper (Refereed)
The ability of people, and of machines, to determine the position of a sound source in a room is well studied. The related ability to determine the orientation of a directed sound source, on the other hand, is not, but the few studies there are show people to be surprisingly skilled at it. This has bearing for studies of face-to-face interaction and of embodied spoken dialogue systems, as sound source orientation of a speaker is connected to the head pose of the speaker, which is meaningful in a number of ways. We describe in passing some preliminary findings that led us onto this line of investigation, and in detail a study in which we extend an experiment design intended to measure perception of gaze direction to test instead for perception of sound source orientation. The results corroborate those of previous studies, and further show that people are very good at performing this skill outside of studio conditions as well.
Place, publisher, year, edition, pages
Istanbul, Turkey, 2012.
Computer Science Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:kth:diva-106998OAI: oai:DiVA.org:kth-106998DiVA: diva2:574368
LREC Workshop on Multimodal Corpora 2012
FunderICT - The Next Generation
tmh_import_12_12_05, tmh_id_3721. QC 201212172012-12-052012-12-052013-04-19Bibliographically approved