kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.
National Institute of Informatics.
National Institute of Informatics.
Aalto University.
Show others and affiliations
2024 (English)In: Proceedings of the 27th International Conference on Digital Audio Effects (DAFx24), 2024Conference paper, Published paper (Refereed)
Abstract [en]

We explore the use of neural synthesis for acoustic guitar from string-wise MIDI input. We propose four different systems and compare them with both objective metrics and subjective evaluation against natural audio and a sample-based baseline. We aiteratively develop these four systems by making various considerations on the architecture and intermediate tasks, such as predicting pitch and loudness control features. We find that formulating the control feature prediction task as a classification task rather than a regression task yields better results. Furthermore, we find that our simplest proposed system, which directly predicts synthesis parameters from MIDI input performs the best out of the four proposed systems. Audio examples and code are available.

Place, publisher, year, edition, pages
2024.
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:kth:diva-350217Scopus ID: 2-s2.0-85210235985OAI: oai:DiVA.org:kth-350217DiVA, id: diva2:1883034
Conference
Proceedings of the 27th International Conference on Digital Audio Effects (DAFx24), Guildford, United Kingdom, 3 - 7 September, 2024
Funder
EU, Horizon 2020, 864189
Note

QC 20241205

Available from: 2024-07-08 Created: 2024-07-08 Last updated: 2024-12-05Bibliographically approved

Open Access in DiVA

fulltext(1671 kB)28 downloads
File information
File name FULLTEXT01.pdfFile size 1671 kBChecksum SHA-512
8ed7455630f6ca571b71c12565294c418019fc6b51aac038b02fccfe310b581f544a745b101cbf9768811a26a87422a81cf843711066dd2eee485301ad0d2755
Type fulltextMimetype application/pdf

Other links

ScopusConference websitefulltext

Authority records

Jonason, NicolasSturm, Bob

Search in DiVA

By author/editor
Jonason, NicolasSturm, Bob
By organisation
Speech, Music and Hearing, TMH
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 28 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 100 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf