Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
A Comparison of Disfluency Distribution in a Unimodal and a Multimodal Speech Interface
KTH, Tidigare Institutioner (före 2005), Tal, musik och hörsel.
KTH, Tidigare Institutioner (före 2005), Tal, musik och hörsel.ORCID-id: 0000-0002-0397-6442
2000 (Engelska)Ingår i: Proceedings of ICSLP 00, 2000Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

In this paper, we compare the distribution of disfluencies in two human--computer dialogue corpora. One corpus consists of unimodal travel booking dialogues, which were recorded over the telephone. In this unimodal system, all components except the speech recognition were authentic. The other corpus was collected using a semi-simulated multi-modal dialogue system with an animated talking agent and a clickable map. The aim of this paper is to analyze and discuss the effects of modality, task and interface design on the distribution and frequency of disfluencies in these two corpora.

Ort, förlag, år, upplaga, sidor
2000.
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
URN: urn:nbn:se:kth:diva-13331OAI: oai:DiVA.org:kth-13331DiVA, id: diva2:323699
Konferens
International Conference on Spoken Language Processing
Anmärkning

QC 20100611

Tillgänglig från: 2010-06-11 Skapad: 2010-06-11 Senast uppdaterad: 2018-05-21Bibliografiskt granskad
Ingår i avhandling
1. Developing Multimodal Spoken Dialogue Systems: Empirical Studies of Spoken Human–Computer Interaction
Öppna denna publikation i ny flik eller fönster >>Developing Multimodal Spoken Dialogue Systems: Empirical Studies of Spoken Human–Computer Interaction
2002 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

This thesis presents work done during the last ten years on developing five multimodal spoken dialogue systems, and the empirical user studies that have been conducted with them. The dialogue systems have been multimodal, giving information both verbally with animated talking characters and graphically on maps and in text tables. To be able to study a wider rage of user behaviour each new system has been in a new domain and with a new set of interactional abilities. The five system presented in this thesis are: The Waxholm system where users could ask about the boat traffic in the Stockholm archipelago; the Gulan system where people could retrieve information from the Yellow pages of Stockholm; the August system which was a publicly available system where people could get information about the author Strindberg, KTH and Stockholm; the AdAptsystem that allowed users to browse apartments for sale in Stockholm and the Pixie system where users could help ananimated agent to fix things in a visionary apartment publicly available at the Telecom museum in Stockholm. Some of the dialogue systems have been used in controlled experiments in laboratory environments, while others have been placed inpublic environments where members of the general public have interacted with them. All spoken human-computer interactions have been transcribed and analyzed to increase our understanding of how people interact verbally with computers, and to obtain knowledge on how spoken dialogue systems canutilize the regularities found in these interactions. This thesis summarizes the experiences from building these five dialogue systems and presents some of the findings from the analyses of the collected dialogue corpora.

Ort, förlag, år, upplaga, sidor
Stockholm: KTH, 2002. s. x, 96
Serie
Trita-TMH ; 2002:8
Nyckelord
Spoken dialogue system, multimodal, speech, GUI, animated agents, embodied conversational characters, talking heads, empirical user studies, speech corpora, system evaluation, system development, Wizard of Oz simulations, system architecture, linguis
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
urn:nbn:se:kth:diva-3460 (URN)
Disputation
2002-12-20, 00:00
Anmärkning
QC 20100611Tillgänglig från: 2002-12-11 Skapad: 2002-12-11 Senast uppdaterad: 2010-06-11Bibliografiskt granskad

Open Access i DiVA

fulltext(470 kB)15 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 470 kBChecksumma SHA-512
f4b736849f70addfc5ed36fb21c79eb6671cb1765df2482b19a4a8945b69df72c3af53ebacf83caa6e0893d3839cd0335f970ebcc1c29043049d8d28a8efa63e
Typ fulltextMimetyp application/pdf

Personposter BETA

Gustafson, Joakim

Sök vidare i DiVA

Av författaren/redaktören
Bell, LindaGustafson, Joakim
Av organisationen
Tal, musik och hörsel
Teknik och teknologier

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 15 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 281 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf