Collecting Visually-Grounded Dialogue with A Game Of Sorts
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH. ORCID iD: 0000-0001-7327-3059
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH. ORCID iD: 0000-0002-8579-1790
2022 (English). In: Proceedings of the 13th Conference on Language Resources and Evaluation / [ed] Calzolari, N; Bechet, F; Blache, P; Choukri, K; Cieri, C; Declerck, T; Goggi, S; Isahara, H; Maegaard, B; Mazo, H; Odijk, H; Piperidis, S. European Language Resources Association (ELRA), 2022, p. 2257-2268. Conference paper, Published paper (Refereed)
Abstract [en]

An idealized, though simplistic, view of the referring expression production and grounding process in (situated) dialogue assumes that a speaker must merely appropriately specify their expression so that the target referent may be successfully identified by the addressee. However, referring in conversation is a collaborative process that cannot be aptly characterized as an exchange of minimally-specified referring expressions. Concerns have been raised regarding assumptions made by prior work on visually-grounded dialogue that reveal an oversimplified view of conversation and the referential process. We address these concerns by introducing a collaborative image ranking task, a grounded agreement game we call “A Game Of Sorts”. In our game, players are tasked with reaching agreement on how to rank a set of images given some sorting criterion through a largely unrestricted, role-symmetric dialogue. By putting emphasis on the argumentation in this mixed-initiative interaction, we collect discussions that involve the collaborative referential process. We describe results of a small-scale data collection experiment with the proposed task. All discussed materials, which include the collected data, the codebase, and a containerized version of the application, are publicly available.

Place, publisher, year, edition, pages
European Language Resources Association (ELRA), 2022. p. 2257-2268
National Category
Natural Language Processing
Identifiers
URN: urn:nbn:se:kth:diva-323116
ISI: 000889371702039
Scopus ID: 2-s2.0-85144393073
OAI: oai:DiVA.org:kth-323116
DiVA, id: diva2:1727580
Conference
13th Conference on Language Resources and Evaluation, 20-25 June, Marseille, France, 2022
Projects
tmh_grounding
Note

Part of proceedings: ISBN 9791095546726

QC 20230125

Available from: 2023-01-16. Created: 2023-01-16. Last updated: 2025-02-07. Bibliographically approved

Open Access in DiVA

fulltext (1612 kB), 47 downloads
File information
File name: FULLTEXT01.pdf
File size: 1612 kB
Checksum (SHA-512):
c9fe46f516f6afb30066c8404798f3a4b2d4534b574c2e668b3595e1016e4a5502626c7cd2c87957675c48723c3ea973e0b5c0b3b044b79f6ea8d0f7ca9e5810
Type: fulltext
Mimetype: application/pdf

Authority records

Willemsen, Bram; Kalpakchi, Dmytro; Skantze, Gabriel
