Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Autoregressive Model-based Speech Packet-Loss Concealment
KTH, School of Electrical Engineering (EES), Sound and Image Processing. KTH, School of Electrical Engineering (EES), Centres, ACCESS Linnaeus Centre.
KTH, School of Electrical Engineering (EES), Sound and Image Processing. KTH, School of Electrical Engineering (EES), Centres, ACCESS Linnaeus Centre.
2008 (English)In: 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, 2008, 4797-4800 p.Conference paper, Published paper (Refereed)
Abstract [en]

We study packet-loss concealment for speech based on autoregressivemodelling using a rigorous minimum mean square error (MMSE) approach.The effect of the model estimation error on predicting the missing segment isstudied and an upper bound on the mean square error is derived. Our exper-iments show that the upper bound is tight when the estimation error is lessthan the signal variance. We also consider the usage of perceptual weightingon prediction to improve speech quality. A rigorous argument is presentedto show that perceptual weighting is not useful in this context. We createsimple and practical MMSE-based systems using two signal models: a basicmodel capturing the short-term correlation and a more sophisticated modelthat also captures the long-term correlation. Subjective quality comparisontests show that the proposed MMSE-based system provides state-of-the-artperformance.

Place, publisher, year, edition, pages
2008. 4797-4800 p.
Series
International Conference on Acoustics Speech and Signal Processing (ICASSP), ISSN 1520-6149
National Category
Telecommunications
Identifiers
URN: urn:nbn:se:kth:diva-24219DOI: 10.1109/ICASSP.2008.4518730ISI: 000257456703183Scopus ID: 2-s2.0-51449119893OAI: oai:DiVA.org:kth-24219DiVA: diva2:345501
Conference
33rd IEEE International Conference on Acoustics, Speech and Signal Processing Las Vegas, NV, MAR 30-APR 04, 2008
Note
QC20100830Available from: 2010-08-25 Created: 2010-08-25 Last updated: 2011-07-08Bibliographically approved
In thesis
1. Robust Multimedia Communications over Packet Networks
Open this publication in new window or tab >>Robust Multimedia Communications over Packet Networks
2010 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Multimedia communications over packet networks, and in particular the voice over IP (VoIP) application, have become an integral part of society. However, the unreliable and heterogeneous nature of packet networks has led to a best-effort delivery of services. Delay, limitation of bandwidth, and packet-loss rate all affect the quality of service (QoS). In this thesis, we address two important network impairments in the design of robust multimedia communication systems: packet delay-variation and packet-loss.

Paper A considers the mitigation of the effect of packet delay-variation for audio communications by introducing a buffer at the receiver side. A new adaptive playout scheduling approach is proposed to control the buffering length, or, equivalently, the packet playout deadlines, in response to varying network conditions. A Wiener process is used to model the fluctuation of the buffering length without any playout adjustment. The playout scheduling problem is then reformulated as a stochastic impulse control problem by taking the playout adjustment as the control signal. The proposed approach is shown to be the optimal solution to the new control problem. It is demonstrated experimentally that the proposed approach provides improved perceived conversional quality.

Papers B, C and D address the packet-loss issue. Paper B focuses on the design of a low-complexity packet-loss concealment (PLC) method that is compatible with existing speech codecs for VoIP application. The new method is rigorously motivated based on the autoregressive (AR) speech model and the minimum mean squared error (MMSE) criterion. The effect of model estimation error on the prediction of the missing speech segment is also considered and an upper bound for the prediction error is derived. Both the theoretical and experimental results provide insight in the performance of the heuristically designed PLC methods. On the other hand, Paper C and D consider an active packet-loss-resilient coding scheme, namely multiple description coding (MDC). In general, MDC can be used for the transmission of any media data. Paper C derives a simple and accurate approximation of the rate-distortion lower bound of a particular multiple- description scenario and then demonstrates that the performance loss of some practical MD systems can be evaluated easily with the new approximation. Paper D studies the performance limit of a vector Gaussian multiple description scenario. An outer bound to the rate-distortion region is derived, and the outer bound is tight when the problem specializes to the scalar Gaussian case.

 

Place, publisher, year, edition, pages
Stockholm: KTH, 2010. xii, 37 p.
Series
Trita-EE, ISSN 1653-5146 ; 2010:036
National Category
Telecommunications
Identifiers
urn:nbn:se:kth:diva-24223 (URN)
Public defence
2010-08-30, Salongen, KTHB, Osquars backe 25, Stockholm, 10:00 (English)
Opponent
Supervisors
Note
QC20100830Available from: 2010-08-30 Created: 2010-08-25 Last updated: 2010-08-31Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Guoqiang, ZhangW. Bastiaan, Kleijn
By organisation
Sound and Image ProcessingACCESS Linnaeus Centre
Telecommunications

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 87 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf