kth.sePublications
Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
ChromaFlow: Modeling And Generating Harmonic Progressions With a Transformer And Voicing Encoding
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0002-3262-4091
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH. Univ. Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, F-59000 Lille, France.ORCID iD: 0000-0001-7919-3463
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0003-2549-6367
2024 (English)In: MML 2024: 15th International Workshop on Machine Learning and Music, 2024, Vilnius, Lithuania, Vilnius, Lithuania, 2024Conference paper, Published paper (Refereed)
Abstract [en]

Modeling harmonic progressions in symbolic music is a complex task that requires generating musically coherent and varied chord sequences. In this study, we employ a transformer-based architecture trained on a comprehensive dataset of 48,072 songs, which includes an augmented set of 4,300 original pieces from the iReal Pro application transposed across all chromatic keys. We introduce a novel tokenization and voicing encoding strategy designed to enhance the musicality of the generated chord progressions. Our approach not only generates chord progression suggestions but also provides corresponding voicings tailored for instruments such as piano and guitar. To evaluate the effectiveness of our model, we conducted a listening test comparing the harmonic progressions produced by our approach against those from a baseline model. The results indicate that our model generates progressions with more fluid voicings, coherent harmonic motion, and plausible chord suggestions, effectively utilizing repetition and variation to enhance musicality.

Place, publisher, year, edition, pages
Vilnius, Lithuania, 2024.
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-356209OAI: oai:DiVA.org:kth-356209DiVA, id: diva2:1912305
Conference
15th International Workshop on Machine Learning and Music. ECML PKDD 2024, September 9, 2024
Note

QC 20241115

Available from: 2024-11-11 Created: 2024-11-11 Last updated: 2024-11-15Bibliographically approved

Open Access in DiVA

fulltext(804 kB)21 downloads
File information
File name FULLTEXT01.pdfFile size 804 kBChecksum SHA-512
6c359e7c2f5c239d42228f19b69b9a676943288e11b942c1804acd0180b5d17539875a1bc8fe5fe2d025bcb09561a5728c1d9dec945758e6cf6399f5200542e8
Type fulltextMimetype application/pdf

Other links

PaperAbstractsConference website

Authority records

Dalmazzo, DavidDéguernel, KenSturm, Bob

Search in DiVA

By author/editor
Dalmazzo, DavidDéguernel, KenSturm, Bob
By organisation
Speech, Music and Hearing, TMH
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 21 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 71 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf