Cues for Hesitation in Speech Synthesis
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
2006 (English). In: INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC, 2006, 1300-1303 p. Conference paper, Published paper (Refereed)
Abstract [en]

The current study investigates acoustic correlates to perceived hesitation based on previous work showing that pause duration and final lengthening both contribute to the perception of hesitation. It is the total duration increase that is the valid cue rather than the contribution by either factor. The present experiment using speech synthesis was designed to evaluate F0 slope and presence vs. absence of creaky voice before the inserted hesitation in addition to durational cues. The manipulations occurred in two syntactic positions, within a phrase and between two phrases, respectively. The results showed that in addition to durational increase, variation of both F0 slope and creaky voice had perceptual effects, although to a much lesser degree. The results have a bearing on efforts to model spontaneous speech including disfluencies, to be explored, for example, in spoken dialogue systems.

Place, publisher, year, edition, pages
BAIXAS: ISCA-INST SPEECH COMMUNICATION ASSOC, 2006. 1300-1303 p.
Keyword [en]
hesitation, perception, speech synthesis
National Category
Computer and Information Science; General Language Studies and Linguistics
Identifiers
URN: urn:nbn:se:kth:diva-30679
ISI: 000269965901062
Scopus ID: 2-s2.0-44949121869
OAI: oai:DiVA.org:kth-30679
DiVA: diva2:403013
Conference
9th International Conference on Spoken Language Processing/INTERSPEECH 2006, Pittsburgh, PA, 2006
Note
QC 20110310
Available from: 2011-03-10. Created: 2011-03-04. Last updated: 2011-03-10. Bibliographically approved.

By author/editor
Carlson, Rolf; Gustafson, Kjell; Strangert, Eva