kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Green MIR?: Investigating computational cost of recent music-Ai research in ISMIR
KTH, School of Electrical Engineering and Computer Science (EECS), Human Centered Technology, Media Technology and Interaction Design, MID.ORCID iD: 0000-0003-1679-6018
KTH, School of Electrical Engineering and Computer Science (EECS), Human Centered Technology, Media Technology and Interaction Design, MID.ORCID iD: 0000-0002-7605-0093
KTH, School of Electrical Engineering and Computer Science (EECS), Human Centered Technology, Media Technology and Interaction Design, MID.ORCID iD: 0000-0002-0028-9030
2024 (English)In: Proceedings of the 25th International Society for Music Information Retrieval (ISMIR) Conference, 2024, p. 371-380Conference paper, Published paper (Refereed)
Abstract [en]

The environmental footprint of Generative AI and other Deep Learning (DL) technologies is increasing. To understand the scale of the problem and to identify solutions for avoiding excessive energy use in DL research at communities such as ISMIR, more knowledge is needed of the current energy cost of the undertaken research. In this paper, we provide a scoping inquiry of how the ISMIR research concerning automatic music generation (AMG) and computing-heavy music analysis currently discloses information related to environmental impact. We present a study based on two corpora that document 1) ISMIR papers published in the years 2017–2023 that introduce an AMG model, and 2) ISMIR papers from the years 2022–2023 that propose music analysis models and include heavy computations with GPUs. Our study demonstrates a lack of transparency in model training documentation. It provides the first estimates of energy consumption related to model training at ISMIR, as a baseline for making more systematic estimates about the energy footprint of the ISMIR conference in relation to other machine learning events. Furthermore, we map the geographical distribution of generative model contributions and discuss the corporate role in the funding and model choices in this body of work.

Place, publisher, year, edition, pages
2024. p. 371-380
Keywords [en]
music information retrieval, MIR, sustainability, energy, generative AI, deep learning
National Category
Music Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-356910DOI: 10.5281/zenodo.14877351Scopus ID: 2-s2.0-85219635914OAI: oai:DiVA.org:kth-356910DiVA, id: diva2:1916342
Conference
International Society for Music Information Retrieval Conference (ISMIR), San Francisco, California, United States of America, November 10-14, 2024
Funder
Marianne and Marcus Wallenberg Foundation
Note

Part of ISBN 978-1-7327299-4-0

QC 20250313

Available from: 2024-11-27 Created: 2024-11-27 Last updated: 2025-03-13Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Holzapfel, AndreKaila, Anna-KaisaJääskeläinen, Petra

Search in DiVA

By author/editor
Holzapfel, AndreKaila, Anna-KaisaJääskeläinen, Petra
By organisation
Media Technology and Interaction Design, MID
MusicComputer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 439 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf