Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
PDF-optimized LSF vector quantization based on beta mixture models
KTH, School of Electrical Engineering (EES), Sound and Image Processing (Closed 130101).
KTH, School of Electrical Engineering (EES), Sound and Image Processing (Closed 130101).
2010 (English)In: Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, 2010, 2374-2377 p.Conference paper, Published paper (Refereed)
Abstract [en]

The line spectral frequencies (LSF) are known to be the mostefficient representation of the linear predictive coding (LPC) parametersfrom both the distortion and perceptual point of view.By considering the bounded property of the LSF parameters,we apply beta mixture models (BMM) to model the distributionof the LSF parameters. Meanwhile, by following the principlesof probability density function (PDF) optimized vector quantization(VQ), we derive the bit allocation strategy for the BMM.The LSF parameters are obtained from the TIMIT database anda practical VQ is designed. By taking the Bayesian informationcriterion (BIC), the square error (SE) and the spectral distortion(SD) as the criteria, the BMM based VQ outperforms theGaussian mixture model based VQ with uncorrelated Gaussiancomponent (UGMVQ) by about 1-2 bits/vector.

Place, publisher, year, edition, pages
2010. 2374-2377 p.
Keyword [en]
speech coding, vector quantization, line spectral frequencies, beta mixture model, Gaussian mixture model
National Category
Other Electrical Engineering, Electronic Engineering, Information Engineering Computer Science
Research subject
SRA - ICT
Identifiers
URN: urn:nbn:se:kth:diva-33678ISI: 000313086500206Scopus ID: 2-s2.0-79959854021ISBN: 978-1-61782-123-3 (print)OAI: oai:DiVA.org:kth-33678DiVA: diva2:416996
Conference
11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010; Makuhari, Chiba; 26 September 2010 through 30 September 2010
Note

QC 20111117

Available from: 2011-05-13 Created: 2011-05-13 Last updated: 2014-01-09Bibliographically approved

Open Access in DiVA

No full text

Other links

Scopushttp://www.isca-speech.org

Search in DiVA

By author/editor
Ma, ZhanyuLeijon, Arne
By organisation
Sound and Image Processing (Closed 130101)
Other Electrical Engineering, Electronic Engineering, Information EngineeringComputer Science

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 47 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf