Within-utterance correlation for speech recognition
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
1999 (English). Conference paper (Refereed)
Abstract [en]

Relations between non-adjacent parts of an utterance are commonly regarded as an important source of information for speech recognition. However, they have not been widely used in speech recognition systems. In this paper, we include this information through joint distributions of pairs of phones occurring in the same utterance. In addition to relations between acoustic events, we have also incorporated relations between spectral and prosodically oriented information, such as phone duration, position in the utterance, and fundamental frequency. Preliminary recognition results on N-best rescoring show a 10% word error reduction compared to a baseline Viterbi decoder.

Place, publisher, year, edition, pages
1999. 2479-2482 p.
National Category
Computer and Information Science
URN: urn:nbn:se:kth:diva-91224
OAI: diva2:508910
Sixth European Conference on Speech Communication and Technology
NR 20140805
Available from: 2012-03-10. Created: 2012-03-10. Bibliographically approved

Author: Blomberg, Mats