Estimation of the instantaneous pitch in speech
2007 (English). In: IEEE Transactions on Audio, Speech and Language Processing, ISSN 1558-7916, Vol. 15, no 3, 813-822 p. Article in journal (Refereed), Published
An accurate estimate of the pitch is essential for many speech processing applications, such as speech synthesis, speech coding, and speech enhancement. Most common pitch estimation methods assume that the pitch is constant over a segment of short duration. This assumption does not hold in reality and leads to inaccurate pitch estimates. In this paper, we present a method for continuous pitch estimation that is able to track fast changes. In the presented framework, the pitch is modeled by a B-spline expansion and optimized in a multistage procedure for increased robustness. The performance of the continuous optimization procedure is compared to state-of-the-art pitch estimation methods and is evaluated both for artificial speech-like signals with known pitch and for real speech signals. The results of the experiments show that our method yields more accurate pitch estimates than state-of-the-art methods.
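The modeling idea in the abstract, a pitch track written as a B-spline expansion instead of a piecewise-constant value, can be illustrated with a minimal sketch. This is not the paper's estimator and does not reproduce its multistage optimization; it only evaluates a pitch contour from a set of spline coefficients and synthesizes a speech-like harmonic signal from it. The function names, knot placement, coefficient values, sampling rate, and number of harmonics are all assumptions chosen for illustration.

```python
import math

def bspline_basis(i, k, t, knots):
    """Cox-de Boor recursion: i-th B-spline basis function of degree k at time t."""
    if k == 0:
        return 1.0 if knots[i] <= t < knots[i + 1] else 0.0
    left = right = 0.0
    d1 = knots[i + k] - knots[i]
    if d1 > 0:  # skip terms with zero-width support (repeated knots)
        left = (t - knots[i]) / d1 * bspline_basis(i, k - 1, t, knots)
    d2 = knots[i + k + 1] - knots[i + 1]
    if d2 > 0:
        right = (knots[i + k + 1] - t) / d2 * bspline_basis(i + 1, k - 1, t, knots)
    return left + right

def pitch_contour(t, coeffs, knots, degree=3):
    """Instantaneous pitch in Hz at time t: weighted sum of B-spline basis functions."""
    return sum(c * bspline_basis(i, degree, t, knots) for i, c in enumerate(coeffs))

def synthesize(coeffs, knots, duration, fs=8000, harmonics=3):
    """Harmonic signal whose phase is the running integral of the pitch contour."""
    n = round(duration * fs)
    phase = 0.0
    samples, f0_track = [], []
    for m in range(n):
        f0 = pitch_contour(m / fs, coeffs, knots)
        f0_track.append(f0)
        samples.append(sum(math.cos(h * phase) / h for h in range(1, harmonics + 1)))
        phase += 2.0 * math.pi * f0 / fs  # rectangle-rule integration of the pitch
    return samples, f0_track

# Hypothetical example: a 50 ms segment, cubic spline with clamped knots.
duration = 0.05
knots = [0.0] * 4 + [0.025] + [duration] * 4   # 9 knots, degree 3 -> 5 coefficients
coeffs = [100.0, 110.0, 130.0, 120.0, 105.0]   # spline coefficients in Hz (assumed)
samples, f0_track = synthesize(coeffs, knots, duration)
```

In this sketch the five coefficients fully describe how the pitch moves within the segment, so an estimator built on such a model would search over coefficients instead of a single pitch value per frame.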
Keywords: instantaneous pitch, pitch estimation, pitch-synchronous processing, splines
Engineering and Technology
Identifiers: URN: urn:nbn:se:kth:diva-9085; DOI: 10.1109/TASL.2006.885242; ISI: 000244318600007; ScopusID: 2-s2.0-37649002185; OAI: oai:DiVA.org:kth-9085; DiVA: diva2:14645
QC 20100914. Updated from Submitted to Published (20100914). 2006-02-10; 2006-02-10; 2010-09-14. Bibliographically approved