Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Waveform quantization of speech using Gaussian mixture models
KTH, Superseded Departments, Signals, Sensors and Systems. KTH, Superseded Departments, Wireless at KTH.
2004 (English)In: 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, IEEE , 2004, 165-168 p.Conference paper, Published paper (Refereed)
Abstract [en]

Waveform quantization of speech using Gaussian mixture models (GMMs) is proposed. GMMs are trained directly on the speech waveform, and high dimensional vector quantizers (VQs) that efficiently exploit the redundancy are constructed based on the GMM parameters. Two types of GMMs are studied. The complexity of the scheme is independent of the rate, and the rate can be changed without retraining the VQ. A shape-gain structure improves performance and robustness. Pre- and post-processing using spectral amplitude warping further improves perceptual quality. A 32-dimensional VQ operating at 2 bits/sample reproduces speech sampled at 8 kHz with a PESQ score of 4.2.

Place, publisher, year, edition, pages
IEEE , 2004. 165-168 p.
Series
IEEE International Conference on Acoustics, Speech and Signal Processing. Proceedings, ISSN 1520-6149
Keyword [en]
Computational complexity, Mathematical models, Parameter estimation, Random processes, Robustness (control systems), Vector quantization, Waveform analysis
National Category
Computer and Information Science
Identifiers
URN: urn:nbn:se:kth:diva-44753ISI: 000222173500042Scopus ID: 2-s2.0-4544284645ISBN: 0-7803-8484-9 (print)OAI: oai:DiVA.org:kth-44753DiVA: diva2:451419
Conference
IEEE International Conference on Acoustics, Speech, and Signal Processing Location: Montreal, CANADA Date: MAY 17-21, 2004
Note

QC 20111025

Available from: 2011-10-25 Created: 2011-10-25 Last updated: 2014-12-15Bibliographically approved

Open Access in DiVA

No full text

Scopus

Search in DiVA

By author/editor
Samuelsson, Jonas
By organisation
Signals, Sensors and SystemsWireless at KTH
Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 65 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf