Change search
ReferencesLink to record
Permanent link

Direct link
Speech Recognition API
KTH, Superseded Departments, Teleinformatics. KTH, School of Information and Communication Technology (ICT). (CCSlab)
1997 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Speech technology has the potential to change the way in which we interact with computers and other technical devices. This paper describes the work involved in creating a speech recognition API.

A speech recognition API is a user interface for computer programmers who want to create applications employing speech recognition. Particularly one application, the so called AudioBrowser, is referred to in this report. The AudioBrowser is an application which enables remote access to the World Wide Web by using speech technology. To show how an AudioBrowser could interact with the user, and to evaluate the performance of the speech recognition system, two demos were implemented.

An appropriate platform had to be deployed to be able to run speech technology applications on a PC or a server. The hardware and software used for this platform are described as well as problems encountered when building the platform.

This paper also gives a general background to speech recognition and to various speech recognition systems and software solutions.

The considerations that had to be made when designing the API are discussed, as well as the software implementation of the API.

The API was supposed to support applications running on the VoiceServer, which is a server implementing various telephony and speech technology services. For this reason the API had to conform to the general software architecture of the VoiceServer. This software architecture and the final API are described at the end of this report.

Place, publisher, year, edition, pages
1997. , 68 p.
National Category
Communication Systems
URN: urn:nbn:se:kth:diva-96663OAI: diva2:531917
Subject / course
Educational program
Master of Science in Engineering - Electrical Engineering
1997-08-27, Seminar room "Telegrafen", Isafjordsgatan 22, Kista, 09:00 (English)
Available from: 2012-06-14 Created: 2012-06-08 Last updated: 2013-09-09Bibliographically approved

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Källgården, Ola
By organisation
TeleinformaticsSchool of Information and Communication Technology (ICT)
Communication Systems

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 628 hits
ReferencesLink to record
Permanent link

Direct link