Multi-level exemplar-based duration generation for expressive speech synthesis
2012 (English)In: Proceedings of Speech Prosody, 2012, Vol. 2012Conference paper (Refereed)Text
The generation of duration of speech units from linguistic in- formation, as one component of a prosody model, is consid- ered to be a requirement for natural sounding speech synthesis. This paper investigates the use of a multi-level exemplar-based model for duration generation for the purposes of expressive speech synthesis. The multi-level exemplar-based model has been proposed in the literature as a cognitive model for the pro- duction of duration. The implementation of this model for dura- tion generation for speech synthesis is not straightforward and requires a set of modifications to the model and that the linguis- tically related units and the context of the target units should be taken into consideration. The work presented in this paper implements this model and presents a solution to these issues through the use of prosodic-syntactic correlated data, full con- text information of the input example and corpus exemplars.
Place, publisher, year, edition, pages
2012. Vol. 2012
Other Engineering and Technologies
IdentifiersURN: urn:nbn:se:kth:diva-185800OAI: oai:DiVA.org:kth-185800DiVA: diva2:923948