Segment boundaries in low latency phonetic recognition
2005 (English)In: NONLINEAR ANALYSES AND ALGORITHMS FOR SPEECH PROCESSING / [ed] Faundez Zanuy M; Janer L; Esposito A; Satue Villar A; Roure J; Espinosa Duro V, 2005, Vol. 3817, 267-276 p.Conference paper (Refereed)
The segment boundaries produced by the Synface low latency phoneme recogniser are analysed. The precision in placing the boundaries is an important factor in the Synface system as the aim is to drive the lip movements of a synthetic face for lip-reading support. The recogniser is based on a hybrid of recurrent neural networks and hidden Markov models. In this paper we analyse the look-ahead length in the Viterbi-like decoder affects the precision of boundary placement. The properties of the entropy of the posterior probabilities estimated by the neural network are also investigated in relation to the distance of the frame from a phonetic transition.
Place, publisher, year, edition, pages
2005. Vol. 3817, 267-276 p.
, Lecture Notes in Artificial Intelligence, ISSN 0302-9743 ; 3817
IdentifiersURN: urn:nbn:se:kth:diva-6152ISI: 000235839300023ScopusID: 2-s2.0-33745452923ISBN: 3-540-31257-9OAI: oai:DiVA.org:kth-6152DiVA: diva2:10781
International Conference on Non-Linear Speech Processing Barcelona, SPAIN, APR 19-22, 2005
QC 201006302006-09-212006-09-212015-08-03Bibliographically approved