kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Effects of Visual Prominence Cues on Speech Intelligibility
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology. KTH, School of Computer Science and Communication (CSC), Centres, Centre for Speech Technology, CTT.
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology. KTH, School of Computer Science and Communication (CSC), Centres, Centre for Speech Technology, CTT.ORCID iD: 0000-0003-1399-6604
2009 (English)In: Proceedings of Auditory-Visual Speech Processing AVSP'09, Norwich, England, 2009Conference paper, Published paper (Refereed)
Abstract [en]

This study reports experimental results on the effect of visual prominence, presented as gestures, on speech intelligibility. 30 acoustically vocoded sentences, permutated into different gestural conditions were presented audio-visually to 12 subjects. The analysis of correct word recognition shows a significant increase in intelligibility when focally-accented (prominent) words are supplemented with head-nods or with eye-brow raise gestures. The paper also examines coupling other acoustic phenomena to brow-raise gestures. As a result, the paper introduces new evidence on the ability of the non-verbal movements in the visual modality to support audio-visual speech perception.

Place, publisher, year, edition, pages
Norwich, England, 2009.
Keywords [en]
prominence, head-nod, eye-brow, speech intelligibility, talking heads, lip-reading, gesture, visual prosody
National Category
Computer Sciences Natural Language Processing
Identifiers
URN: urn:nbn:se:kth:diva-52099OAI: oai:DiVA.org:kth-52099DiVA, id: diva2:465394
Conference
Auditory-Visual Speech Processing (AVSP) 2009, University of East Anglia, Norwich, UK, September 10-13, 2009
Note
tmh_import_11_12_14. QC 20111222Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2025-02-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

http://www.speech.kth.se/prod/publications/files/3357.pdf

Authority records

Beskow, Jonas

Search in DiVA

By author/editor
Al Moubayed, SamerBeskow, Jonas
By organisation
Speech Communication and TechnologyCentre for Speech Technology, CTT
Computer SciencesNatural Language Processing

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 1666 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf