Probabilistic Modelling of Hearing: Speech Recognition and Optimal Audiometry
2009 (English)Licentiate thesis, comprehensive summary (Other academic)
Hearing loss afflicts as many as 10\% of our population.Fortunately, technologies designed to alleviate the effects ofhearing loss are improving rapidly, including cochlear implantsand the increasing computing power of digital hearing aids. Thisthesis focuses on theoretically sound methods for improvinghearing aid technology. The main contributions are documented inthree research articles, which treat two separate topics:modelling of human speech recognition (Papers A and B) andoptimization of diagnostic methods for hearing loss (Paper C).Papers A and B present a hidden Markov model-based framework forsimulating speech recognition in noisy conditions using auditorymodels and signal detection theory. In Paper A, a model of normaland impaired hearing is employed, in which a subject's pure-tonehearing thresholds are used to adapt the model to the individual.In Paper B, the framework is modified to simulate hearing with acochlear implant (CI). Two models of hearing with CI arepresented: a simple, functional model and a biologically inspiredmodel. The models are adapted to the individual CI user bysimulating a spectral discrimination test. The framework canestimate speech recognition ability for a given hearing impairmentor cochlear implant user. This estimate could potentially be usedto optimize hearing aid settings.Paper C presents a novel method for sequentially choosing thesound level and frequency for pure-tone audiometry. A Gaussianmixture model (GMM) is used to represent the probabilitydistribution of hearing thresholds at 8 frequencies. The GMM isfitted to over 100,000 hearing thresholds from a clinicaldatabase. After each response, the GMM is updated using Bayesianinference. The sound level and frequency are chosen so as tomaximize a predefined objective function, such as the entropy ofthe probability distribution. It is found through simulation thatan average of 48 tone presentations are needed to achieve the sameaccuracy as the standard method, which requires an average of 135presentations.
Place, publisher, year, edition, pages
Stockholm: KTH , 2009. , ix, 35 p.
Trita-EE, ISSN 1653-5146 ; 2009:023
auditory models, probabilistic modelling, speech modelling, human speech recognition, hearing aids, cochlear implants, psychoacoustics, diagnostic methods, optimal experiments, audiometry
IdentifiersURN: urn:nbn:se:kth:diva-10386ISBN: 978-91-7415-310-1OAI: oai:DiVA.org:kth-10386DiVA: diva2:216427
2009-05-20, E2, Lindstedtsvägen 3, 11428 Stockholm, 13:00 (English)
Stenfelt, Stefan, Professor
Leijon, Arne, Professor
List of papers