Furhat goes to Robotville: a large-scale multiparty human-robot interaction data collection in a public space
2012 (English). In: Proc. of LREC Workshop on Multimodal Corpora, Istanbul, Turkey, 2012. Conference paper (Refereed)
In the four days of the Robotville exhibition at the London Science Museum, UK, during which the back-projected head Furhat in a situated spoken dialogue system was seen by almost 8 000 visitors, we collected a database of 10 000 utterances spoken to Furhat in situated interaction. The data collection is an example of a particular kind of corpus collection of human-machine dialogues in public spaces that has several interesting and specific characteristics, both with respect to the technical details of the collection and with respect to the resulting corpus contents. In this paper, we take the Furhat data collection as a starting point for a discussion of the motives for this type of data collection, its technical peculiarities and prerequisites, and the characteristics of the resulting corpus.
Place, publisher, year, edition, pages
Istanbul, Turkey, 2012.
Computer Science; Language Technology (Computational Linguistics)
Identifiers: URN: urn:nbn:se:kth:diva-107011; OAI: oai:DiVA.org:kth-107011; DiVA: diva2:574365
LREC Workshop on Multimodal Corpora
Funder: ICT - The Next Generation
tmh_import_12_12_05, tmh_id_3723. QC 20121210. 2012-12-05, 2012-12-05, 2013-04-11. Bibliographically approved