Search system for an audio archive
KTH, School of Computer Science and Communication (CSC).
2015 (English)
Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Abstract [en]

Speed and accuracy of information retrieval are of utmost importance in the contemporary world. Multimedia data is usually indexed not by its content but by a few keywords that only approximate the content.

The purpose of the study is to explore the possibility of using speech recognition algorithms to improve the quality of retrieval of human speech audio files or of live media content analysis. The project focuses on the ability of phonetic algorithms to compensate for imperfections of speech recognition.

The project has examined several speech recognition tools, language models and phonetic matching algorithms. The results can be used for further research and for developing or improving commercial products.

National Category
Computer Science
URN: urn:nbn:se:kth:diva-164391
OAI: diva2:805690
Educational program
Master of Science - Machine Learning
Available from: 2015-04-20
Created: 2015-04-16
Last updated: 2015-04-20
Bibliographically approved

Open Access in DiVA

fulltext (1316 kB), 83 downloads
File information
File name: FULLTEXT01.pdf
File size: 1316 kB
Checksum: SHA-512
Type: fulltext
Mimetype: application/pdf
