Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Enhancing Subjective Speech Intelligibility Using a Statistical Model of Speech
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
2012 (English)In: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol 1, 2012, 166-169 p.Conference paper, Published paper (Refereed)
Abstract [en]

The intelligibility of speech in adverse noise conditions can be improved by modifying the characteristics of the clean speech prior to its presentation. An effective and flexible paradigm is to select the modification by optimizing a measure of objective intelligibility. Here we apply this paradigm at the text level and optimize a measure related to the classification error probability in an automatic speech recognition system. The proposed method was applied to a simple but powerful band-energy modification mechanism under an energy preservation constraint. Subjective evaluation results provide a clear indication of a significant gain in subjective intelligibility. In contrast to existing methods, the proposed approach is not restricted to a particular modification strategy and treats the notion of optimality at a level closer to that of subjective intelligibility. The computational complexity of the method is sufficiently low to enable its use in on-line applications.

Place, publisher, year, edition, pages
2012. 166-169 p.
Keyword [en]
speech modification, subjective intelligibility, statistical model of speech
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:kth:diva-113296ISI: 000320827200042Scopus ID: 2-s2.0-84878403433ISBN: 978-1-62276-759-5 (print)OAI: oai:DiVA.org:kth-113296DiVA: diva2:587591
Conference
13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; Portland, OR; United States; 9 September 2012 through 13 September 2012
Projects
Listening Talker (LISTA)
Funder
EU, FP7, Seventh Framework Programme
Note

QC 20130115

Available from: 2013-01-14 Created: 2013-01-14 Last updated: 2013-08-23Bibliographically approved

Open Access in DiVA

No full text

Scopus

Search in DiVA

By author/editor
Petkov, Petko N.Kleijn, W. BastiaanHenter, Gustav Eje
By organisation
Sound and Image Processing
Signal Processing

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 279 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf