Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A general-purpose 32 ms prosodic vector for Hidden Markov Modeling
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.ORCID iD: 0000-0001-9327-9482
2009 (English)In: Proceedings of Interspeech 2009, Brighton, UK: ISCA , 2009, 724-729 p.Conference paper, Published paper (Refereed)
Abstract [en]

Prosody plays a central role in communicating via speech, making it important for speech technologies to model. Unfortunately, the application of standard modeling techniques to the acoustics of prosody has been hindered by difficulties in modeling intonation. In this work, we explore the suitability of the recently introduced fundamental frequency variation (FFV) spectrum as a candidate general representation of tone. Experimentson 4 tasks demontrate that FFV features are complimentary to other acoustic measures of prosody and that hidden Markov models offer a suitable modeling paradigm. Proposed improvements yield a 35% relative decrease in error on unseen data and simultaneously reduce time complexity by more than an order of magnitude. The resulting is sufficiently mature for general deployment in a broad range of automatic speech processing applications.

Place, publisher, year, edition, pages
Brighton, UK: ISCA , 2009. 724-729 p.
National Category
Computer Science Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-52011Scopus ID: 2-s2.0-70450194699OAI: oai:DiVA.org:kth-52011DiVA: diva2:465304
Conference
Interspeech 2009, Brighton, UK
Note
tmh_import_11_12_14 QC 20111216Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2011-12-16Bibliographically approved

Open Access in DiVA

No full text

Scopus

Authority records BETA

Edlund, Jens

Search in DiVA

By author/editor
Heldner, MattiasEdlund, Jens
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 32 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf