Change search
Refine search result
1 - 4 of 4
CiteExportLink to result list
Permanent link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Rows per page
  • 5
  • 10
  • 20
  • 50
  • 100
  • 250
Sort
  • Standard (Relevance)
  • Author A-Ö
  • Author Ö-A
  • Title A-Ö
  • Title Ö-A
  • Publication type A-Ö
  • Publication type Ö-A
  • Issued (Oldest first)
  • Issued (Newest first)
  • Created (Oldest first)
  • Created (Newest first)
  • Last updated (Oldest first)
  • Last updated (Newest first)
  • Disputation date (earliest first)
  • Disputation date (latest first)
  • Standard (Relevance)
  • Author A-Ö
  • Author Ö-A
  • Title A-Ö
  • Title Ö-A
  • Publication type A-Ö
  • Publication type Ö-A
  • Issued (Oldest first)
  • Issued (Newest first)
  • Created (Oldest first)
  • Created (Newest first)
  • Last updated (Oldest first)
  • Last updated (Newest first)
  • Disputation date (earliest first)
  • Disputation date (latest first)
Select
The maximal number of hits you can export is 250. When you want to export more records please use the Create feeds function.
  • 1.
    Ericsson, Chistina
    et al.
    Talboks- och Punktskriftsbiblioteket.
    Klein, Jesper
    Talboks- och Punktskriftsbiblioteket.
    Sjölander, Kåre
    Talboks- och Punktskriftsbiblioteket.
    Sönnebo, Lars
    Talboks- och Punktskriftsbiblioteket.
    Filibuster: a new Swedish text-to-speech system2007In: Proceedings of Fonetik 2007, Stockholm: KTH Royal Institute of Technology, 2007, Vol. 50, no 1, p. 33-36Conference paper (Other academic)
    Abstract [en]

    A Swedish text-to-speech system has been developed at the Swedish Library of Talking Books and Braille (TPB). The system, named Filibuster, is open and extensible and makes it possible to generate synthetic speech with a high degree of control. The Filibuster system is used in the production of talking books in TPBs service for print handicapped students at university level. Through the use of text-to-speech, students can receive their talking books much faster. Also, each book costs less to produce. The system was deployed in February 2007 and during this year the plan is to produce a total of 200 titles. The system has been designed specifically for creating talking book versions of university textbooks. It has a large lexicon, covering some 573,000 words and names. Filibuster includes a comprehensive text pre-processor to write out nonword entities, such as numbers, characters and expressions. So far, one male voice, Folke, has been created, but more are planned.

  • 2.
    Gustafson, Joakim
    et al.
    Voice Technologies, Expert Functions, TeliaSonera, Farsta, Sweden.
    Sjölander, Kåre
    KTH, Superseded Departments, Speech, Music and Hearing.
    Voice creations for conversational fairy-tale characters2004In: Proc 5th ISCA speech synthesis workshop, Pittsburgh, 2004, p. 145-150Conference paper (Refereed)
  • 3.
    Gustafson, Joakim
    et al.
    KTH, Superseded Departments (pre-2005), Speech, Music and Hearing. Telia Research AB, Sweden.
    Sjölander, Kåre
    KTH, Superseded Departments (pre-2005), Speech, Music and Hearing.
    Voice Transformations For Improving Children's Speech Recognition In A Publicly Available Dialogue System2002In: Proceedings of ICSLP 02, International Speech Communication Association , 2002, p. 297-300Conference paper (Refereed)
    Abstract [en]

    To be able to build acoustic models for children, that can beused in spoken dialogue systems, speech data has to be collected. Commercial recognizers available for Swedish are trained on adult speech, which makes them less suitable for children’s computer-directed speech. This paper describes some experiments with on-the-fly voice transformation of children’s speech. Two transformation methods were tested, one inspired by the Phase Vocoder algorithm and another by the Time-Domain Pitch-Synchronous Overlap-Add (TD-PSOLA)algorithm. The speech signal is transformed before being sent to the speech recognizer for adult speech. Our results show that this method reduces the error rates in the order of thirty to fortyfive percent for children users.

  • 4.
    Sjölander, Kåre
    et al.
    KTH, Superseded Departments, Speech, Music and Hearing.
    Heldner, Mattias
    KTH, Superseded Departments, Speech, Music and Hearing.
    Word level precision of the NALIGN automatic segmentation algorithm2004In: Proc of The XVIIth Swedish Phonetics Conference, Fonetik 2004 / [ed] Peter Branderud, Hartmut Traunmüller, 2004, p. 116-119Conference paper (Other academic)
1 - 4 of 4
CiteExportLink to result list
Permanent link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf