kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Crowdsource-based validation of the audio cocktail as a sound browsing tool
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0003-1262-4876
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0001-9327-9482
2023 (English)In: Interspeech 2023, International Speech Communication Association , 2023, p. 2178-2182Conference paper, Published paper (Refereed)
Abstract [en]

We conduct two crowdsourcing experiments designed to examine the usefulness of audio cocktails to quickly find out information on the contents of large audio data. Several thousand crowd workers were engaged to listen to audio cocktails with systematically varied composition. They were then asked to state either which sound out of four categories (Children, Women, Men, Orchestra) they heard the most of, or if they heard anything of a specific category at all. The results show that their responses have high reliability and provide information as to whether a specific task can be performed using audio cocktails. We also propose that the combination of crowd workers and audio cocktails can be used directly as a tool to investigate the contents of large audio data.

Place, publisher, year, edition, pages
International Speech Communication Association , 2023. p. 2178-2182
Keywords [en]
annotation, exploration, found speech, hearing, human-in-the-loop
National Category
Natural Language Processing Other Humanities not elsewhere specified
Identifiers
URN: urn:nbn:se:kth:diva-337834DOI: 10.21437/Interspeech.2023-2473ISI: 001186650302072Scopus ID: 2-s2.0-85171584146OAI: oai:DiVA.org:kth-337834DiVA, id: diva2:1803463
Conference
24th International Speech Communication Association, Interspeech 2023, August 20-24, 2023, Dublin, Ireland
Note

QC 20241014

Available from: 2023-10-09 Created: 2023-10-09 Last updated: 2025-02-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Fallgren, PerEdlund, Jens

Search in DiVA

By author/editor
Fallgren, PerEdlund, Jens
By organisation
Speech, Music and Hearing, TMH
Natural Language ProcessingOther Humanities not elsewhere specified

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 82 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf