Within-utterance correlation for speech recognition
1999 (English). Conference paper (Refereed)
Relations between non-adjacent parts of an utterance are commonly regarded as an important source of information for speech recognition. However, they have seldom been used in speech recognition systems. In this paper, we include this information through joint distributions of pairs of phones occurring in the same utterance. In addition to relations between acoustic events, we have also incorporated relations between spectral and prosodically oriented information, such as phone duration, position in the utterance, and fundamental frequency. Preliminary recognition results on N-best rescoring show a 10% word error reduction compared to a baseline Viterbi decoder.
Place, publisher, year, edition, pages
1999. 2479-2482 p.
Computer and Information Science
Identifiers
URN: urn:nbn:se:kth:diva-91224
OAI: oai:DiVA.org:kth-91224
DiVA: diva2:508910
Sixth European Conference on Speech Communication and Technology
NR 20140805
2012-03-10
2012-03-10
Bibliographically approved