Representing speech
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
2015 (English). In: European Signal Processing Conference, European Signal Processing Conference, EUSIPCO, 2015, no March. Conference paper (Refereed)
Abstract [en]

The properties of the speech production process and the auditory periphery have led to the use of similar speech signal representations for various processing tasks such as speech and speaker recognition, speech synthesis, and speech coding. The representation is generally divided into a description of the vocal-tract transfer function and a description of the excitation source. For recognition purposes, the biased characterization of the vocal-tract transfer function by a time sequence of low-dimensional cepstral vectors performs well. For coding and synthesis, we argue that autoregressive (AR) models are more effective than filter banks for the vocal-tract transfer function, while pitch-synchronous filter banks and modulation-domain filters are most effective for the excitation source. A clear trend exists towards exploiting the time variation of both the vocal-tract transfer function and the excitation source.

Place, publisher, year, edition, pages
European Signal Processing Conference, EUSIPCO, 2015. no March
Keyword [en]
Filter banks, Modulation, Signal processing, Speech, Speech recognition, Speech synthesis, Auto regressive models, Cepstral vectors, Excitation sources, Modulation domains, Pitch synchronous, Speaker recognition, Speech production, Vocal tract transfer functions, Transfer functions
National Category
Signal Processing
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-175098
ScopusID: 2-s2.0-84937054255
OAI: oai:DiVA.org:kth-175098
DiVA: diva2:881771
Conference
2000 10th European Signal Processing Conference, EUSIPCO 2000, 4 September 2000 through 8 September 2000
Note

QC 20151211

Available from: 2015-12-11 Created: 2015-10-09 Last updated: 2015-12-11 Bibliographically approved

By author/editor
Kleijn, W. Bastiaan
