kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Analyzing Music Improvisations Using Unsupervised Machine Learning: Towards Automatically Discovering Creative Cognition Principles
KTH, School of Electrical Engineering and Computer Science (EECS).
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesisAlternative title
Analysera musikaliska improvisationer utan tillsyn Maskininlärning : Mot automatisk upptäckt av principer för kreativ kognition (Swedish)
Abstract [en]

In the field of musical expression, the complex relationship between improvisation and the cognitive processes that underlie creativity presents a fascinating yet challenging puzzle, prompting this thesis to explore the connection between musical improvisation and creative cognition among musicians. Focusing on the development of robust methods for feature extraction and representation, it utilizes unsupervised Machine Learning (ML) techniques to project improvisations from a prime melody into a high-level latent space. The methodology involves iterative analysis employing Variational Autoencoder (VAE) models, initially pre-trained with a larger dataset and fine-tuned with a musical improvisation dataset provided by the Max Plank Institute. Evaluation encompasses Evidence Lower Bound (ELBO) loss metric and dimensionality reduction techniques like Principal Component Analysis (PCA), t-distributed Stochastic Neighbor Embedding (t-SNE), Multidimensional Scaling (MDS), and Uniform Manifold Approximation and Projection (UMAP) to explore latent space representations. The results reveal that experienced musicians exhibit a greater divergence from the prime melody compared to amateurs. Moreover, professionals’ samples demonstrate more refined clustering and nuanced adjustments between improvisations projected in the latent space.

Abstract [sv]

Inom musikaliska uttryck är det komplexa förhållandet mellan improvisation och de kognitiva processer som ligger till grund för kreativitet ett fascinerande men utmanande pussel, vilket föranleder denna avhandling att utforska sambandet mellan musikalisk improvisation och kreativ kognition bland musiker. Avhandlingen fokuserar på utvecklingen av robusta metoder för extraktion och representation av funktioner och använder oövervakade maskininlärningstekniker (ML) för att projicera improvisationer från en huvudmelodi till ett latent utrymme på hög nivå. Metoden innebär iterativ analys med hjälp av VAE-modeller (Variational Autoencoder), som ursprungligen förutbildades med ett större dataset och finjusterades med ett dataset för musikalisk improvisation från Max Plank Institute. Utvärderingen omfattar förlustmåttet Evidence Lower Bound (ELBO) och dimensionalitetsreducerande tekniker som Principal Component Analysis (PCA), t-distributed Stochastic Neighbor Embedding (t-SNE), Multidimensional Scaling (MDS) och Uniform Manifold Approximation and Projection (UMAP) för att utforska latenta rymdrepresentationer. Resultaten visar att erfarna musiker uppvisar en större avvikelse från huvudmelodin jämfört med amatörer. Dessutom visar professionella musiker mer raffinerade kluster och nyanserade justeringar mellan improvisationer som projiceras i den latenta rymden.

Place, publisher, year, edition, pages
2024. , p. 44
Series
TRITA-EECS-EX ; 2024:23
Keywords [en]
Music Improvisation, Unsupervised Machine Learning, Creative Cognition
Keywords [sv]
Musikimprovisation, oövervakad maskininlärning, kreativ kognition
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:kth:diva-345079OAI: oai:DiVA.org:kth-345079DiVA, id: diva2:1849279
Supervisors
Examiners
Available from: 2024-05-07 Created: 2024-04-05 Last updated: 2024-05-07Bibliographically approved

Open Access in DiVA

fulltext(1333 kB)142 downloads
File information
File name FULLTEXT01.pdfFile size 1333 kBChecksum SHA-512
4d69941a1a2c6a54adde3aba51a93aedf8fe1c7f8c6e81bd86835bbd2816a847dbe7308a598a4c5f08475e5fa0817cde4c6b4752bded75a38c383928418d3dc5
Type fulltextMimetype application/pdf

By organisation
School of Electrical Engineering and Computer Science (EECS)
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 142 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 218 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf