Facial expression-based affective speech translation
2014 (English)In: Journal on Multimodal User Interfaces, ISSN 1783-7677, E-ISSN 1783-8738, Vol. 8, no 1, 87-96 p.Article in journal (Refereed) PublishedText
One of the challenges of speech-to-speech trans- lation is to accurately preserve the paralinguistic informa- tion in the speaker’s message. Information about affect and emotional intent of a speaker are often carried in more than one modality. For this reason, the possibility of multimodal interaction with the system and the conversation partner may greatly increase the likelihood of a successful and gratifying communication process. In this work we explore the use of automatic facial expression analysis as an input annotation modality to transfer paralinguistic information at a symbolic level from input to output in speech-to-speech translation. To evaluate the feasibility of this approach, a prototype sys- tem, FEAST (facial expression-based affective speech trans- lation) has been developed. FEAST classifies the emotional state of the user and uses it to render the translated output in an appropriate voice style, using expressive speech synthesis.
Place, publisher, year, edition, pages
2014. Vol. 8, no 1, 87-96 p.
Engineering and Technology
IdentifiersURN: urn:nbn:se:kth:diva-185522DOI: 10.1007/s12193-013-0128-xISI: 000337753000008ScopusID: 2-s2.0-84902376723OAI: oai:DiVA.org:kth-185522DiVA: diva2:922786
QC 201604252016-04-252016-04-212016-04-25Bibliographically approved