Change search
ReferencesLink to record
Permanent link

Direct link
Speech recogniser sensitivity to the variation of different control parameters in synthetic speech
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
1989 (English)Conference paper (Refereed)
Abstract [en]

Knowledge of a speech recognizer's sensitivity to different speech production parameters can be used to improve the system or to predict its behaviour in a given application. In this report, a speech recognition system has been tested using manipulated synthetic speech. A text-to-speech system was used for producing words with the 9 Swedish long vowels in CVC context. A "normal" production of each word served as reference template for the recognition system. The test set consisted of the same words where the value of one control parameter at a time was changed from its original position. The mel cepstrum distance between the reference and the manipulated word was measured. Modifying the pitch, voice source spectral slope and the first four formant frequencies had large influence on the distance, while varying formant bandwiths resulted in small effects. The relation between individual formants is different to results from experiments using natural listeners. The results indicate that the sensitivity to pitch and voice source spectrum variation will degrade the recognizer's performance in speaker-independent applications and during stress and that some form of normalisation is needed.

Place, publisher, year, edition, pages
National Category
Computer and Information Science
URN: urn:nbn:se:kth:diva-93597OAI: diva2:517140
In Proceedings of ESCA Workshop on Speech Input/Output Assessment and Speech Databases
NR 20140805Available from: 2012-04-21 Created: 2012-04-21Bibliographically approved

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Blomberg, Mats
By organisation
Speech, Music and Hearing, TMH
Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 7 hits
ReferencesLink to record
Permanent link

Direct link