kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
An investigation on mutual information for the linear predictive system and the extrapolation of speech signals
KTH, School of Electrical Engineering (EES), Sound and Image Processing.
KTH, School of Electrical Engineering (EES), Sound and Image Processing.ORCID iD: 0000-0001-7957-5103
2012 (English)In: Proceedings of 10th ITG Symposium on Speech Communication, Institute of Electrical and Electronics Engineers (IEEE), 2012, article id 6309620Conference paper, Published paper (Refereed)
Abstract [en]

Mutual information (MI) is an important information theoretic concept which has many applications in telecommunications, in blind source separation, and in machine learning. More recently, it has been also employed for the instrumental assessment of speech intelligibility where traditionally correlation based measures are used. In this paper, we address the difference between MI and correlation from the viewpoint of discovering dependencies between variables in the context of speech signals. We perform our investigation by considering the linear predictive approximation and the extrapolation of speech signals as examples. We compare a parametric MI estimation approach based on a Gaussian mixture model (GMM) with the k-nearest neighbor (KNN) approach which is a well-known non-parametric method available to estimate the MI. We show that the GMM-based MI estimator leads to more consistent results.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2012. article id 6309620
Keywords [en]
Artificial intelligence, Blind source separation, Extrapolation, Gaussian distribution, Information theory, Learning systems, Nearest neighbor search, Speech, Speech intelligibility, Estimation approaches, Gaussian Mixture Model, K nearest neighbor (KNN), Mutual informations, Nonparametric methods, Predictive systems, Speech signals, Speech communication
National Category
Other Engineering and Technologies
Identifiers
URN: urn:nbn:se:kth:diva-314739Scopus ID: 2-s2.0-84930399266OAI: oai:DiVA.org:kth-314739DiVA, id: diva2:1675800
Conference
10th ITG Symposium on Speech Communication, ITGspeech 2012, 26 September 2012 through 28 September 2012, Braunschweig, Germany
Note

QC 20220623

Part of proceedings: ISBN 978-380073455-9

Available from: 2022-06-23 Created: 2022-06-23 Last updated: 2024-01-08Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Authority records

Taghia, JalilLeijon, Arne

Search in DiVA

By author/editor
Taghia, JalilLeijon, Arne
By organisation
Sound and Image Processing
Other Engineering and Technologies

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 38 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf