kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Sparks of Musical AGI? Challenges and perspectives in music co-creation with LLMs: A qualitative exploration of the music knowledge of LLMs and their use for music creation
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH. (MUSAiC)ORCID iD: 0000-0002-3468-6974
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0009-0003-8553-3542
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0003-2549-6367
2024 (English)Conference paper, Published paper (Refereed)
Abstract [en]

In the paper Sparks of Artificial General Intelligence, the authors show how OpenAI’s GPT-4 is able do well in variety of tasks that be represented with text and claim it to have “a more general intelligence than previous AI models.” One of the tasks they explore is symbolic music generation. In this paper we critically analyze their results and extend the discourse around the capabilities of LLMs for music by exploring additional musical tasks and LLMs. Furthermore, we will investigate the viability of smaller models when used in conjunction with Retrieval Augmented Generation, as well as finetuning on programmatically written prompts using Quantized Low Rank Adapters. Finally, we discuss some critical aspects of LLMs as a tool for music generation.

Place, publisher, year, edition, pages
2024.
Keywords [en]
Large Language Models, Music Co-Creation, Music Understanding, Finetuning
National Category
Computer and Information Sciences Music
Identifiers
URN: urn:nbn:se:kth:diva-352705OAI: oai:DiVA.org:kth-352705DiVA, id: diva2:1895325
Conference
International Conference on AI and Musical Creativity (AIMC) 2024, Oxford UK, 9 - 11 September 2024
Note

QC 20240906

Available from: 2024-09-05 Created: 2024-09-05 Last updated: 2025-02-21Bibliographically approved

Open Access in DiVA

casini_aimc24_soagi(1037 kB)204 downloads
File information
File name FULLTEXT01.pdfFile size 1037 kBChecksum SHA-512
f20aa8a00eb70e002cf7b187b32d7a82493ca057120b5a4a938c00ef422db331b5e05911c5d264aa759890176ae4ed113bef6dcc82a70a2f6f2029a8215ec8ce
Type fulltextMimetype application/pdf

Other links

Paper

Authority records

Casini, LucaJonason, NicolasSturm, Bob

Search in DiVA

By author/editor
Casini, LucaJonason, NicolasSturm, Bob
By organisation
Speech, Music and Hearing, TMH
Computer and Information SciencesMusic

Search outside of DiVA

GoogleGoogle Scholar
Total: 204 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 195 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf