A COMPARATIVE EVALUATION OF VOCODING TECHNIQUES FOR HMM-BASED LAUGHTER SYNTHESIS
2014 (English). Conference paper (Refereed)
This paper presents an experimental comparison of leading vocoders for HMM-based laughter synthesis. Four vocoders commonly used in HMM-based speech synthesis are applied to copy-synthesis and HMM-based synthesis of both male and female laughter. Subjective evaluations are conducted to assess the performance of the vocoders. The results show that all vocoders perform relatively well in copy-synthesis. In HMM-based laughter synthesis using original phonetic transcriptions, all synthesized laughter voices were significantly lower in quality than in copy-synthesis, indicating a challenging task and room for improvement. Interestingly, the two vocoders using rather simple and robust excitation modeling performed best, indicating that robustness in speech parameter extraction and simple parameter representation in statistical modeling are key factors in successful laughter synthesis.
Place, publisher, year, edition, pages
2014. 255-259 p.
International Conference on Acoustics Speech and Signal Processing ICASSP, ISSN 1520-6149
Laughter synthesis, vocoder, mel-cepstrum, STRAIGHT, DSM, GlottHMM, HTS, HMM
Fluid Mechanics and Acoustics
Identifiers
URN: urn:nbn:se:kth:diva-158336
DOI: 10.1109/ICASSP.2014.6853597
ISI: 000343655300052
ScopusID: 2-s2.0-84905269196
ISBN: 978-1-4799-2893-4
ISBN: 978-147992892-7
OAI: oai:DiVA.org:kth-158336
DiVA: diva2:783078
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), MAY 04-09, 2014, Florence, ITALY
QC 20150123
2015-01-23, 2015-01-07, 2015-01-23. Bibliographically approved