On the effect of the acoustic environment on the accuracy of perception of speaker orientation from auditory cues alone
Edlund, Jens. KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication. ORCID iD: 0000-0001-9327-9482
Heldner, Mattias. Stockholm University, Faculty of Humanities, Department of Linguistics.
Gustafson, Joakim. KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication. ORCID iD: 0000-0002-0397-6442
2012 (English). In: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 2, 2012, pp. 1482-1485. Conference paper, published paper (refereed)
Abstract [en]

The ability of people, and of machines, to determine the position of a sound source in a room is well studied. The related ability to determine the orientation of a directed sound source, on the other hand, is not, but the few studies there are show people to be surprisingly skilled at it. This has bearing on studies of face-to-face interaction and of embodied spoken dialogue systems, as the sound source orientation of a speaker is connected to the head pose of the speaker, which is meaningful in a number of ways. The feature most often implicated in the detection of sound source orientation is the inter-aural level difference - a feature which is assumed to be more easily exploited in anechoic chambers than in everyday surroundings. We expand here on our previous studies and compare detection of speaker orientation within and outside of the anechoic chamber. Our results show that listeners find the task easier, rather than harder, in everyday surroundings, which suggests that inter-aural level differences are not the only feature at play.

Place, publisher, year, edition, pages
2012. pp. 1482-1485
Keywords [en]
turn-taking, head pose, gaze, acoustic directionality
National subject category
Computer Science; Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-107005
ISI: 000320827200371
Scopus ID: 2-s2.0-84878379315
ISBN: 978-1-62276-759-5 (print)
OAI: oai:DiVA.org:kth-107005
DiVA, id: diva2:574353
Conference
13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; Portland, OR; United States; 9 September 2012 through 13 September 2012
Research funder
ICT - The Next Generation
Note

QC 20130822

Available from: 2012-12-05. Created: 2012-12-05. Last updated: 2018-01-12. Bibliographically reviewed.

Open Access in DiVA

No full text in DiVA

Search further in DiVA

By author/editor
Edlund, Jens; Heldner, Mattias; Gustafson, Joakim
By organisation
Speech Communication
Computer Science; Language Technology (Computational Linguistics)
