FARMI: A Framework for Recording Multi-Modal InteractionsVisa övriga samt affilieringar
2018 (Engelska)Ingår i: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Paris: European Language Resources Association, 2018, s. 3969-3974Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]
In this paper we present (1) a processing architecture used to collect multi-modal sensor data, both for corpora collection and real-time processing, (2) an open-source implementation thereof and (3) a use-case where we deploy the architecture in a multi-party deception game, featuring six human players and one robot. The architecture is agnostic to the choice of hardware (e.g. microphones, cameras, etc.) and programming languages, although our implementation is mostly written in Python. In our use-case, different methods of capturing verbal and non-verbal cues from the participants were used. These were processed in real-time and used to inform the robot about the participants’ deceptive behaviour. The framework is of particular interest for researchers who are interested in the collection of multi-party, richly recorded corpora and the design of conversational systems. Moreover for researchers who are interested in human-robot interaction the available modules offer the possibility to easily create both autonomous and wizard-of-Oz interactions.
Ort, förlag, år, upplaga, sidor
Paris: European Language Resources Association, 2018. s. 3969-3974
Nationell ämneskategori
Naturvetenskap Teknik och teknologier
Identifikatorer
URN: urn:nbn:se:kth:diva-230237ISI: 000725545004009Scopus ID: 2-s2.0-85058179983OAI: oai:DiVA.org:kth-230237DiVA, id: diva2:1217276
Konferens
The Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, 7-12 May 2018
Anmärkning
Part of proceedings ISBN 979-10-95546-00-9
QC 20180618
2018-06-132018-06-132022-09-22Bibliografiskt granskad