kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Estimation of general identifiable linear dynamic models with an application in speech recognition
Technical University of Crete, Department of Electronic & Computer Engineering.
Technical University of Crete, Department of Electronic & Computer Engineering.
Technical University of Crete, Department of Electronic & Computer Engineering.
Technical University of Crete, Department of Electronic & Computer Engineering.
2007 (English)In: 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3 / [ed] IEEE, 2007, p. 453-456Conference paper, Published paper (Refereed)
Abstract [en]

Although Hidden Markov Models (HMMs) provide a relatively efficient modeling framework for speech recognition, they suffer from several shortcomings which set upper bounds in the performance that can be achieved. Alternatively, linear dynamic models (LDM) can be used to model speech segments. Several implementations of LDM have been proposed in the literature. However, all had a restricted structure to satisfy identifiability constraints. In this paper, we relax all these constraints and use a general, canonical form for a linear state-space system that guarantees identifiability for arbitrary state and observation vector dimensions. For this system,we present a novel, element-wise Maximum Likelihood (ML) estimation method. Classification experiments on the AURORA2 speech database show performance gains compared to HMMs, particularly on highly noisy conditions.

Place, publisher, year, edition, pages
2007. p. 453-456
Series
International Conference on Acoustics Speech and Signal Processing (ICASSP), ISSN 1520-6149
Keywords [en]
Speech Recognition, Modeling, Identification
National Category
Signal Processing Computer Engineering
Identifiers
URN: urn:nbn:se:kth:diva-49542DOI: 10.1109/ICASSP.2007.366947ISI: 000248909200114ISBN: 1-4244-0727-3 (print)OAI: oai:DiVA.org:kth-49542DiVA, id: diva2:459789
Conference
32nd IEEE International Conference on Acoustics, Speech, and Signal Processing. Honolulu, HI, USA. 15 April 2007 - 20 April 2007
Note
QC 20111130Available from: 2011-11-28 Created: 2011-11-28 Last updated: 2022-06-24Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Search in DiVA

By author/editor
Koniaris, Christos
Signal ProcessingComputer Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 184 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf