kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
What makes a good pause? Investigating the turn-holding effects of fillers
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0003-3513-4132
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0002-8579-1790
2023 (English)In: Proceedings 20th International Congress of Phonetic Sciences (ICPhS), Prague: International Phonetic Association , 2023, p. 3512-3516, article id 828Conference paper, Published paper (Refereed)
Abstract [en]

Filled pauses (or fillers), such as uh and um, are frequent in spontaneous speech and can serve as a turn-holding cue for the listener, indicating that the current speaker is not done yet. In this paper, we use the recently proposed Voice Activity Projection (VAP) model, which is a deep learning model trained to predict the dynamics of conversation, to analyse the effects of filled pauses on the expected turn-hold probability. The results show that, while filled pauses do indeed have a turn-holding effect, it is perhaps not as strong as could be expected, probably due to the redundancy of other cues. We also find that the prosodic properties and position of the filler has a significant effect on the turn-hold probability. However, contrary to what has been suggested in previous work, there is no difference between uh and um in this regard.

Place, publisher, year, edition, pages
Prague: International Phonetic Association , 2023. p. 3512-3516, article id 828
Series
ICPhS Proceedings, ISSN 2412-0669
Keywords [en]
Hesitation, fillers, turn-taking, spoken dialog, computational modelling
National Category
Natural Language Processing
Identifiers
URN: urn:nbn:se:kth:diva-341383OAI: oai:DiVA.org:kth-341383DiVA, id: diva2:1821229
Conference
20th International Congress of Phonetic Sciences (ICPhS). August 7-11 2023, Prague, Czech Republic
Projects
tmh_turntaking
Note

Part of ISBN 978-80-908 114-2-3

QC 20241028

Available from: 2023-12-19 Created: 2023-12-19 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

PaperConference proceedingsConference website

Authority records

Jiang, Bing'erEkstedt, ErikSkantze, Gabriel

Search in DiVA

By author/editor
Jiang, Bing'erEkstedt, ErikSkantze, Gabriel
By organisation
Speech, Music and Hearing, TMH
Natural Language Processing

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 84 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf