Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Projection of Turn Completion in Incremental Spoken Dialogue Systems
KTH, Skolan för elektroteknik och datavetenskap (EECS), Intelligenta system, Tal, musik och hörsel, TMH.ORCID-id: 0000-0003-3513-4132
KTH, Skolan för elektroteknik och datavetenskap (EECS), Intelligenta system, Tal, musik och hörsel, TMH.ORCID-id: 0000-0002-8579-1790
2021 (engelsk)Inngår i: SIGDIAL 2021: SIGDIAL 2021 - 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference, Virtual, Singapore 29 July 2021 through 31 July 2021, ASSOC COMPUTATIONAL LINGUISTICS , 2021, s. 431-437Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

The ability to take turns in a fluent way (i.e., without long response delays or frequent interruptions) is a fundamental aspect of any spoken dialog system. However, practical speech recognition services typically induce a long response delay, as it takes time before the processing of the user's utterance is complete. There is a considerable amount of research indicating that humans achieve fast response times by projecting what the interlocutor will say and estimating upcoming turn completions. In this work, we implement this mechanism in an incremental spoken dialog system, by using a language model that generates possible futures to project upcoming completion points. In theory, this could make the system more responsive, while still having access to semantic information not yet processed by the speech recognizer. We conduct a small study which indicates that this is a viable approach for practical dialog systems, and that this is a promising direction for future research.

sted, utgiver, år, opplag, sider
ASSOC COMPUTATIONAL LINGUISTICS , 2021. s. 431-437
HSV kategori
Identifikatorer
URN: urn:nbn:se:kth:diva-304761DOI: 10.18653/v1/2021.sigdial-1.45ISI: 000707001800045Scopus ID: 2-s2.0-85136067428OAI: oai:DiVA.org:kth-304761DiVA, id: diva2:1610961
Konferanse
22nd Annual Meeting of the Special-Interest-Group-on-Discourse-and-Dialogue (SIGDIAL), JUL 29-31, 2021, Singapore, SINGAPORE
Prosjekter
tmh_turntaking
Merknad

Part of proceedings: ISBN 978-1-954085-81-7, QC 20230117

Tilgjengelig fra: 2021-11-12 Laget: 2021-11-12 Sist oppdatert: 2025-05-27bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopus

Person

Ekstedt, ErikSkantze, Gabriel

Søk i DiVA

Av forfatter/redaktør
Ekstedt, ErikSkantze, Gabriel
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 181 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf