Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Intermediate-State HMMs to Capture Continuously-Changing Signal Features
KTH, School of Electrical Engineering (EES), Sound and Image Processing (Closed 130101).
KTH, School of Electrical Engineering (EES), Sound and Image Processing (Closed 130101).
2011 (English)In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2011, 1828-1831 p.Conference paper, Published paper (Refereed)
Abstract [en]

Traditional discrete-state HMMs are not well suited for describing steadily evolving, path-following natural processes like motion capture data or speech. HMMs cannot represent incremental progress between behaviors, and sequences sampled from the models have unnatural segment durations, unsmooth transitions, and excessive rapid variation. We propose to address these problems by permitting the state variable to occupy positions between the discrete states, and present a concrete left-right model incorporating this idea. We call this intermediate-state HMMs. The state evolution remains Markovian. We describe training using the generalized EM-algorithm and present associated update formulas. An experiment shows that the intermediate-state model is capable of gradual transitions, with more natural durations and less noise in sampled sequences compared to a conventional HMM.

Place, publisher, year, edition, pages
2011. 1828-1831 p.
Keyword [en]
Markov models, HMMs, speech synthesis
National Category
Telecommunications
Identifiers
URN: urn:nbn:se:kth:diva-44027ISI: 000316502200459Scopus ID: 2-s2.0-84865717483ISBN: 978-1-61839-270-1 (print)OAI: oai:DiVA.org:kth-44027DiVA: diva2:449055
Conference
12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011; Florence; Italy; 27 August 2011 through 31 August 2011
Projects
LISTA
Funder
EU, FP7, Seventh Framework Programme, 256230
Note

QC 20111108

Available from: 2012-02-03 Created: 2011-10-19 Last updated: 2014-01-16Bibliographically approved

Open Access in DiVA

No full text

Other links

ScopusISCA Archive page

Search in DiVA

By author/editor
Henter, Gustav EjeKleijn, W. Bastiaan
By organisation
Sound and Image Processing (Closed 130101)
Telecommunications

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 138 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf