Clustering Expressive Speech Styles in Audiobooks Using Glottal Source Parameters.
2011 (English). In: 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), ISCA-INT SPEECH COMMUNICATION ASSOC, 2011, p. 2409-2412. Conference paper, Published paper (Refereed)
Resource type
Text
Abstract [en]

A great challenge for text-to-speech synthesis is to produce expressive speech. The main problem is that it is difficult to synthesise high-quality speech using expressive corpora. With the increasing interest in audiobook corpora for speech synthesis, there is a demand to synthesise speech which is rich in prosody, emotions and voice styles. In this work, Self-Organising Feature Maps (SOFM) are used for clustering the speech data using voice quality parameters of the glottal source, in order to map out the variety of voice styles in the corpus. Subjective evaluation showed that this clustering method successfully separated the speech data into groups of utterances associated with different voice characteristics. This work can be applied in unit-selection synthesis by selecting appropriate data sets to synthesise utterances with specific voice styles. It can also be used in parametric speech synthesis to model different voice styles separately.
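The clustering idea described in the abstract can be illustrated with a minimal self-organising map sketch. This is not the authors' implementation: the map size, decay schedules, and the three-dimensional feature vectors are all assumptions chosen for illustration, and the input is synthetic data standing in for per-utterance voice quality parameters. The names `train_som` and `best_matching_unit` are hypothetical helpers.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the grid position (row, col) of the node whose weights are closest to x."""
    return min(weights, key=lambda pos: math.dist(weights[pos], x))


def train_som(data, rows=4, cols=4, epochs=20, lr0=0.5, sigma0=1.5, seed=0):
    """Train a small self-organising map; returns {(row, col): weight vector}."""
    rng = random.Random(seed)
    dim = len(data[0])
    weights = {(r, c): [rng.gauss(0.0, 1.0) for _ in range(dim)]
               for r in range(rows) for c in range(cols)}
    total_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in rng.sample(data, len(data)):  # visit samples in random order
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)               # learning rate decays linearly
            sigma = sigma0 * (1.0 - frac) + 1e-3  # neighbourhood radius shrinks
            br, bc = best_matching_unit(weights, x)
            for (r, c), w in weights.items():
                # Gaussian neighbourhood measured on the map grid, not in feature space.
                grid_dist2 = (r - br) ** 2 + (c - bc) ** 2
                h = math.exp(-grid_dist2 / (2.0 * sigma ** 2))
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
            step += 1
    return weights


# Synthetic stand-in for per-utterance voice quality features (NOT data from the paper).
data_rng = random.Random(1)
group_a = [[data_rng.gauss(-2.0, 0.3) for _ in range(3)] for _ in range(30)]
group_b = [[data_rng.gauss(2.0, 0.3) for _ in range(3)] for _ in range(30)]
features = group_a + group_b

som = train_som(features)
# Each utterance is assigned to the map node that best matches its feature vector.
clusters = [best_matching_unit(som, x) for x in features]
```

In this sketch, utterances that share a map node (or neighbouring nodes) are treated as belonging to a similar voice style; how nodes are grouped into styles in practice is a design choice not specified here.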

Place, publisher, year, edition, pages
ISCA-INT SPEECH COMMUNICATION ASSOC, 2011. p. 2409-2412
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:kth:diva-185519
ISI: 000316502201094
Scopus ID: 2-s2.0-84865709194
ISBN: 978-1-61839-270-1 (print)
OAI: oai:DiVA.org:kth-185519
DiVA: diva2:922790
Conference
12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), Aug 27-31
Note

QC 20160426

Available from: 2016-04-25
Created: 2016-04-21
Last updated: 2016-04-26
Bibliographically approved

Open Access in DiVA

fulltext (348 kB), 62 downloads
File information
File name: FULLTEXT01.pdf
File size: 348 kB
Checksum (SHA-512): 02370488f5ee24db4ceab1b875cfb2e4a27f39d23211515b52e49b70704acfab9479bb1170254e0f5f04162d1056933dc85781ee7f02c5218fcd391ead9d505c
Type: fulltext
Mimetype: application/pdf

Authority records BETA

Székely, Éva
