Change search
ReferencesLink to record
Permanent link

Direct link
D64: A corpus of richly recorded conversational interaction
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.ORCID iD: 0000-0001-9327-9482
Show others and affiliations
2013 (English)In: Journal on Multimodal User Interfaces, ISSN 1783-7677, Vol. 7, no 1-2, 19-28 p.Article in journal (Refereed) Published
Abstract [en]

In recent years there has been a substantial debate about the need for increasingly spontaneous, conversational corpora of spoken interaction that are not controlled or task directed. In parallel the need has arisen for the recording of multi-modal corpora which are not restricted to the audio domain alone. With a corpus that would fulfill both needs, it would be possible to investigate the natural coupling, not only in turn-taking and voice, but also in the movement of participants. In the following paper we describe the design and recording of such a corpus and we provide some illustrative examples of how such a corpus might be exploited in the study of dynamic interaction. The D64 corpus is a multimodal corpus recorded over two successive days. Each day resulted in approximately 4 h of recordings. In total five participants took part in the recordings of whom two participants were female and three were male. Seven video cameras were used of which at least one was trained on each participant. The Optitrack motion capture kit was used in order to enrich information. The D64 corpus comprises annotations on conversational involvement, speech activity and pauses as well as information of the average degree of change in the movement of participants.

Place, publisher, year, edition, pages
2013. Vol. 7, no 1-2, 19-28 p.
Keyword [en]
Multimodality corpus, Conversational involvement, Spontaneous speech
National Category
Computer Science Language Technology (Computational Linguistics)
URN: urn:nbn:se:kth:diva-109373DOI: 10.1007/s12193-012-0108-6ISI: 000316062300003ScopusID: 2-s2.0-84874773796OAI: diva2:581743
Swedish Research Council, 2009-1766

QC 20130415

Available from: 2013-01-02 Created: 2013-01-02 Last updated: 2013-04-15Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Oertel, CatharineEdlund, Jens
By organisation
Speech Communication and Technology
In the same journal
Journal on Multimodal User Interfaces
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 86 hits
ReferencesLink to record
Permanent link

Direct link