Emotional Audio-Visual Arabic Text to Speech
2006 (English)In: Proceedings of the XIV European Signal Processing Conference (EUSIPCO), Florence, Italy, 2006Conference paper (Refereed)
The goal of this paper is to present an emotional audio-visual. Text to speech system for the Arabic Language. The system is based on two entities: un emotional audio text to speech system which generates speech depending on the input text and the desired emotion type, and un emotional Visual model which generates the talking heads, by forming the corresponding visemes. The phonemes to visemes mapping, and the emotion shaping use a 3-paramertic face model, based on the Abstract Muscle Model. We have thirteen viseme models and five emotions as parameters to the face model. The TTS produces the phonemes corresponding to the input text, the speech with the suitable prosody to include the prescribed emotion. In parallel the system generates the visemes and sends the controls to the facial model to get the animation of the talking head in real time.
Place, publisher, year, edition, pages
Florence, Italy, 2006.
, European Signal Processing Conference, ISSN 2219-5491
Computer Science Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:kth:diva-52075OAI: oai:DiVA.org:kth-52075DiVA: diva2:465369
the XIV European Signal Processing Conference (EUSIPCO)
tmh_import_11_12_14. QC 201112152011-12-142011-12-142011-12-15Bibliographically approved