kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Evaluation of the expressivity of a Swedish talking head in the context of human-machine interaction
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.ORCID iD: 0000-0003-1399-6604
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
2008 (English)In: Comunicazione parlatae manifestazione delle emozioni: Atti del I Convegno GSCP, Padova 29 novembre - 1 dicembre 2004 / [ed] Emanuela Magno Caldognetto, Federica Cavicchio e Piero Cosi, 2008Conference paper, Published paper (Refereed)
Abstract [en]

ABSTRACTThis paper describes a first attempt at synthesis and evaluation of expressive visualarticulation using an MPEG-4 based virtual talking head. The synthesis is data-driven,trained on a corpus of emotional speech recorded using optical motion capture. Eachemotion is modelled separately using principal component analysis and a parametriccoarticulation model.In order to evaluate the expressivity of the data driven synthesis two tests wereconducted. Our talking head was used in interactions with a human being in a givenrealistic usage context.The interactions were presented to external observers that were asked to judge theemotion of the talking head. The participants in the experiment could only hear the voice ofthe user, which was a pre-recorded female voice, and see and hear the talking head. Theresults of the evaluation, even if constrained by the results of the implementation, clearlyshow that the visual expression plays a relevant role in the recognition of emotions.

Abstract [it]

Una delle piu’ recenti sfide nell´ambito dello sviluppo di sistemi automatici per lariproduzione di parlato audio-visivo è quella di riuscire a sviluppare un modello per laproduzione di parlato espressivo bi-modale. In questa comunicazione verranno presentati irisultati del primo tentativo di far produrre ad una testa parlante svedese una sintesi audiovisiva di parlato espressivo e si discuterá della messa a punto e dei risultati di due testpercettivi condotti allo scopo di valutare l´espressivitá di questa testa parlante, inserita in uncontesto simulato di interazione uomo-macchina.

Place, publisher, year, edition, pages
2008.
National Category
Computer Sciences Natural Language Processing
Identifiers
URN: urn:nbn:se:kth:diva-52016ISBN: 978-88-207-4019-1 (print)OAI: oai:DiVA.org:kth-52016DiVA, id: diva2:465310
Conference
GSCP Padova 29 novembre - 1 dicembre 2004
Note
tmh_import_11_12_14. QC 20120113Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2025-02-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

http://www.speech.kth.se/prod/publications/files/3156.pdf

Authority records

Beskow, Jonas

Search in DiVA

By author/editor
Beskow, JonasCerrato, Loredana
By organisation
Speech Communication and Technology
Computer SciencesNatural Language Processing

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 179 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf