Integrating Audio and Visual Cues for Speaker Friendliness in Multimodal Speech Synthesis
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology. ORCID iD: 0000-0002-4628-3769
2007 (English). In: INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC, 2007, 1461-1464 p. Conference paper, Published paper (Refereed)
Abstract [en]

This paper investigates interactions between audio and visual cues to friendliness in questions in two perception experiments. In the first experiment, manually edited parametric audio-visual synthesis was used to create the stimuli. Results were consistent with earlier findings in that a late, high final focal accent peak was perceived as friendlier than an earlier, lower focal accent peak. Friendliness was also effectively signaled by visual facial parameters such as a smile, head nod and eyebrow raising synchronized with the final accent. Consistent additive effects were found between the audio and visual cues for the subjects both as a group and individually, showing that subjects integrate the two modalities. The second experiment used data-driven visual synthesis where the database was recorded by an actor instructed to portray anger and happiness. Friendliness was correlated with the happy database, but the effect was not as strong as for the parametric synthesis.

Place, publisher, year, edition, pages
BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC, 2007. 1461-1464 p.
Keyword [en]
audio-visual speech perception, multimodal integration, human-machine interaction, audio-visual speech synthesis
National Category
Computer and Information Science; General Language Studies and Linguistics
Identifiers
URN: urn:nbn:se:kth:diva-30673
ISI: 000269998600366
Scopus ID: 2-s2.0-56149101236
ISBN: 978-1-60560-316-2 (print)
OAI: oai:DiVA.org:kth-30673
DiVA: diva2:403095
Conference
Interspeech Conference 2007, Antwerp, BELGIUM, AUG 27-31, 2007
Note
Book Group Author(s): ISCA
Available from: 2011-03-11. Created: 2011-03-04. Last updated: 2011-09-13. Bibliographically approved.

Author
House, David
