Signal processing representations of speech
KTH, Superseded Departments, Signals, Sensors and Systems.
2003 (English) In: IEICE transactions on information and systems, ISSN 0916-8532, E-ISSN 1745-1361, Vol. E86D, no 3, 359-376 p.
Article, review/survey (Refereed) Published
Abstract [en]

Synergies in processing requirements and knowledge of human speech production and perception have led to a similarity of the speech signal representations used for the tasks of recognition, coding, and modification. The representations are generally composed of a description of the vocal-tract transfer function and, in the case of coding and modification, a description of the excitation signal. This paper provides an overview of commonly used representations. For coding and modification, autoregressive models represented by line spectral frequencies perform well for the vocal tract, and pitch-synchronous filter banks and modulation-domain filters perform well for the excitation. For recognition, good representations are based on a smoothed magnitude response of the vocal tract.

Place, publisher, year, edition, pages
2003. Vol. E86D, no 3, 359-376 p.
Keyword [en]
speech, features, representation, warped frequency scale, linear prediction, word recognition, cepstral coefficients, voiced speech, spectrum, enhancement, sounds, noise
URN: urn:nbn:se:kth:diva-22303
ISI: 000181421800002
OAI: diva2:341001
QC 20100525
Available from: 2010-08-10 Created: 2010-08-10 Bibliographically approved

By author/editor
Kleijn, W. Bastiaan