Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards a voice conversion system based on frame selection
KTH, School of Computer Science and Communication (CSC), Media Technology and Interaction Design, MID. (Sound and Music Computing)ORCID iD: 0000-0003-1679-6018
Show others and affiliations
2007 (English)In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE Press, 2007, Vol. IV, p. 513-516Conference paper, Published paper (Refereed)
Abstract [en]

The subject of this paper is the conversion of a given speaker's voice (the source speaker) into another identified voice (the target one). We assume we have at our disposal a large amount of speech samples from source and target voice with at least a part of them being parallel. The proposed system is built on a mapping function between source and target spectral envelopes followed by a frame selection algorithm to produce final spectral envelopes. Converted speech is produced by a basic LP analysis of the source and LP synthesis using the converted spectral envelopes. We compared three types of conversion: without mapping, with mapping and using the excitation of the source speaker and finally with mapping using the excitation of the target. Results show that the combination of mapping and frame selection provide the best results, and underline the interest to work on methods to convert the LP excitation.

Place, publisher, year, edition, pages
IEEE Press, 2007. Vol. IV, p. 513-516
Keyword [en]
voice conversion; frame selection; voice mapping
National Category
Media and Communication Technology Signal Processing
Identifiers
URN: urn:nbn:se:kth:diva-193738ISI: 000248909200129Scopus ID: 2-s2.0-34547496196OAI: oai:DiVA.org:kth-193738DiVA, id: diva2:1040439
Conference
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Note

QC 20161031

Available from: 2016-10-27 Created: 2016-10-10 Last updated: 2018-01-14Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Search in DiVA

By author/editor
Holzapfel, André
By organisation
Media Technology and Interaction Design, MID
Media and Communication TechnologySignal Processing

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 10 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf