A sparsity based preprocessing for noise robust speech recognition
KTH, School of Electrical Engineering (EES), Communication Theory. KTH, School of Electrical Engineering (EES), Centres, ACCESS Linnaeus Centre. ORCID iD: 0000-0003-2638-6047
2014 (English). In: 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings, 2014, p. 513-518. Conference paper, Published paper (Refereed)
Abstract [en]

We present a method for sparsifying the speech input that improves the robustness of an automatic speech recognizer. The proposed scheme is added to the system as a preprocessing module placed before acoustic feature extraction. The module passes the input speech signal through a linear predictive (LP) analysis filter and enforces sparsity in the LP residue domain. The sparsified prediction residue is then filtered to regenerate a speech signal, from which the sequence of conventional feature vectors used in automatic speech recognition (ASR) is computed. Using standard feature vectors, our experiments show that sparsification in the LP residue domain improves the robustness of ASR performance.
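
The pipeline described in the abstract (LP analysis, sparsification of the residue, filtering back to a speech signal) can be sketched roughly as follows. This is only an illustrative sketch and not the authors' implementation: the abstract does not say how sparsity is enforced, so the sketch assumes simple hard thresholding that keeps the largest-magnitude residue samples in each frame. The function names, the LP order of 10, the `keep_fraction` parameter, and the use of the autocorrelation method for estimating the LP coefficients are all assumptions made for illustration.

```python
import numpy as np

def lp_coefficients(frame, order):
    """Autocorrelation-method LP; returns a with a[0] = 1 (analysis filter A(z))."""
    n = len(frame)
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz normal equations R a = -r[1:], with a small ridge for stability.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * (r[0] + 1.0) * np.eye(order)
    a = np.linalg.solve(R, -r[1:])
    return np.concatenate(([1.0], a))

def analysis_filter(x, a):
    """LP residue e[n] = sum_k a[k] x[n-k] (FIR filter, zero initial state)."""
    return np.convolve(x, a)[:len(x)]

def synthesis_filter(e, a):
    """Inverse of the analysis filter: y[n] = e[n] - sum_{k>=1} a[k] y[n-k]."""
    order = len(a) - 1
    y = np.zeros(len(e))
    for n in range(len(e)):
        acc = e[n]
        for k in range(1, min(order, n) + 1):
            acc -= a[k] * y[n - k]
        y[n] = acc
    return y

def sparsify(e, keep):
    """Assumed sparsifier: keep the `keep` largest-magnitude samples, zero the rest."""
    out = np.zeros_like(e)
    idx = np.argsort(np.abs(e))[-keep:]
    out[idx] = e[idx]
    return out

def preprocess_frame(frame, order=10, keep_fraction=0.25):
    """Analysis -> sparsify residue -> synthesis, for one frame of speech."""
    a = lp_coefficients(frame, order)
    e = analysis_filter(frame, a)
    keep = max(1, int(keep_fraction * len(e)))
    return synthesis_filter(sparsify(e, keep), a)
```

With `keep_fraction=1.0` nothing is discarded and the synthesis filter reproduces the input frame up to rounding error, which is a convenient sanity check. In a full system the processed frames would be overlap-added and passed on to a conventional feature extractor; that step is left out here.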

Place, publisher, year, edition, pages
2014. p. 513-518
Keywords [en]
Feature extraction, Linear predictive analysis, Residue signal, Robust speech recognition, Sparsity, Extraction, Speech, Speech communication, Acoustic feature extraction, Automatic speech recognition, Automatic speech recognizers, Noise robust speech recognition, Preprocessing modules, Speech recognition
National Category
Media and Communication Studies
Identifiers
URN: urn:nbn:se:kth:diva-167610
DOI: 10.1109/SLT.2014.7078627
ISI: 000380375100087
Scopus ID: 2-s2.0-84946689467
ISBN: 9781479971299 (print)
OAI: oai:DiVA.org:kth-167610
DiVA, id: diva2:814775
Conference
2014 IEEE Workshop on Spoken Language Technology, SLT 2014, 7 December 2014 - 10 December 2014
Note

QC 20150528

Available from: 2015-05-28. Created: 2015-05-22. Last updated: 2025-02-17. Bibliographically approved.


Authority records

Chatterjee, Saikat
