Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.
KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Speech Communication and Technology.ORCID iD: 0000-0002-3323-5311
2014 (English)In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), 2014Conference paper, Published paper (Refereed)
Abstract [en]

This paper presents results for large vocabulary continuous speech recognition (LVCSR) in Swedish. We trained acoustic models on the public domain NST Swedish corpus and made them freely available to the community. The training procedure corresponds to the reference recogniser (RefRec) developed for the SpeechDat databases during the COST249 action. We describe the modifications we made to the procedure in order to train on the NST database, and the language models we created based on the N-gram data available at the Norwegian Language Council. Our tests include medium vocabulary isolated word recognition and LVCSR. Because no previous results are available for LVCSR in Swedish, we use as baseline the performance of the SpeechDat models on the same tasks. We also compare our best results to the ones obtained in similar conditions on resource rich languages such as American English. We tested the acoustic models with HTK and Julius and plan to make them available in CMU Sphinx format as well in the near future. We believe that the free availability of these resources will boost research in speech and language technology in Swedish, even in research groups that do not have resources to develop ASR systems.

Place, publisher, year, edition, pages
2014.
Keyword [en]
Speech Resource, Database, Language Modelling
National Category
Computer Science Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:kth:diva-158151ISI: 000355611001171ISBN: 978-2-9517408-8-4 (print)OAI: oai:DiVA.org:kth-158151DiVA: diva2:774993
Conference
Ninth International Conference on Language Resources and Evaluation (LREC'14), May, 26-31, 2014, Reykjavik, Iceland
Note

QC 20150211

Available from: 2014-12-30 Created: 2014-12-30 Last updated: 2015-09-18Bibliographically approved

Open Access in DiVA

No full text

Other links

http://www.lrec-conf.org/proceedings/lrec2014/summaries/312.html

Authority records BETA

Salvi, Giampiero

Search in DiVA

By author/editor
Vanhainen, NiklasSalvi, Giampiero
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 151 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf