D64: A corpus of richly recorded conversational interaction
2013 (English)In: Journal on Multimodal User Interfaces, ISSN 1783-7677, Vol. 7, no 1-2, 19-28 p.Article in journal (Refereed) Published
In recent years there has been a substantial debate about the need for increasingly spontaneous, conversational corpora of spoken interaction that are not controlled or task directed. In parallel the need has arisen for the recording of multi-modal corpora which are not restricted to the audio domain alone. With a corpus that would fulfill both needs, it would be possible to investigate the natural coupling, not only in turn-taking and voice, but also in the movement of participants. In the following paper we describe the design and recording of such a corpus and we provide some illustrative examples of how such a corpus might be exploited in the study of dynamic interaction. The D64 corpus is a multimodal corpus recorded over two successive days. Each day resulted in approximately 4 h of recordings. In total five participants took part in the recordings of whom two participants were female and three were male. Seven video cameras were used of which at least one was trained on each participant. The Optitrack motion capture kit was used in order to enrich information. The D64 corpus comprises annotations on conversational involvement, speech activity and pauses as well as information of the average degree of change in the movement of participants.
Place, publisher, year, edition, pages
2013. Vol. 7, no 1-2, 19-28 p.
Multimodality corpus, Conversational involvement, Spontaneous speech
Computer Science Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:kth:diva-109373DOI: 10.1007/s12193-012-0108-6ISI: 000316062300003ScopusID: 2-s2.0-84874773796OAI: oai:DiVA.org:kth-109373DiVA: diva2:581743
FunderSwedish Research Council, 2009-1766
QC 201304152013-01-022013-01-022013-04-15Bibliographically approved