Low-Latency Incremental Speech Transcription in the Synface Project
2003 (English)In: Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), Geneva, Switzerland, 2003: vol 2, 2003, 1141-1144 p.Conference paper (Other academic)
In this paper, a real-time decoder for low-latency onlinespeech transcription is presented. The system was developedwithin the Synface project, which aims to improve thepossibilities for hard of hearing people to use conventionaltelephony by providing speech-synchronized multimodalfeedback. This paper addresses the specific issues related toHMM-based incremental phone classification with real-timeconstraints. The decoding algorithm described in this workenables a trade-off to be made between improved recognitionaccuracy and reduced latency. By accepting a longer latencyper output increment, more time can be ascribed tohypothesis look-ahead and by that improve classificationaccuracy. Experiments performed on the Swedish SpeechDatdatabase show that it is possible to generate the sameclassification as is produced by non-incremental decodingusing HTK, by adopting a latency of approx. 150 ms ormore.
Place, publisher, year, edition, pages
2003. 1141-1144 p.
Computer and Information Science
IdentifiersURN: urn:nbn:se:kth:diva-24046OAI: oai:DiVA.org:kth-24046DiVA: diva2:342981
QC 201008112010-08-112010-08-112010-08-12Bibliographically approved