Synthetic phoneme prototypes and source adaptation in a speech recognition system
1989 (English)In: STL-QPSR, Vol. 30, no 1, 131-135 p.Article in journal (Refereed) Published
A recognition system based on a reference library of synthetic phoneme prototypes is described. The phoneme templates are specified in terms of formant synthesis parameters. The vocabulary and grammar is described in a finite-state network where each state represents a phoneme. Atransition between two phonemes in the net is expanded to a number of new states using interpolation on the synthesis parameters or at the spectrum level. Ateach state, a 16channel filter bank section is computed from the synthesis parameters. Adaptation to each speaker's individual voice source spectrum is performed during recognition. Without adaptation, the average recognition for ten male speakers was 88% on an isolated-word task using a 26-word vocabulary. On a vocabulary of3connected digits, !he recognition rate for six male speakers was 87.7%.Adding the voice source adaptation feature raised the performance to 96% and 92.8%, respectively. The improvement varied considerably between the speakers, indicating the usefulness of the voice source adaptation for certain voices
Place, publisher, year, edition, pages
1989. Vol. 30, no 1, 131-135 p.
Computer and Information Science
IdentifiersURN: urn:nbn:se:kth:diva-91466OAI: oai:DiVA.org:kth-91466DiVA: diva2:510366
NR 201408052012-03-152012-03-15Bibliographically approved