Change search
ReferencesLink to record
Permanent link

Direct link
Spontal-N: A Corpus of Interactional Spoken Norwegian
Show others and affiliations
2010 (English)In: Proc. of the Seventh conference on International Language Resources and Evaluation (LREC'10) / [ed] Calzolari, Nicoletta; Choukri, Khalid; Maegaard, Bente; Mariani, Joseph; Odjik, Jan; Piperidis, Stelios; Rosner, Mike; Tapias, Daniel, 2010, 2986-2991 p.Conference paper (Refereed)
Abstract [en]

Spontal-N is a corpus of spontaneous, interactional Norwegian. To our knowledge, it is the first corpus of Norwegian in which the majority of speakers have spent significant parts of their lives in Sweden, and in which the recorded speech displays varying degrees of interference from Swedish. The corpus consists of studio quality audio- and video-recordings of four 30-minute free conversations between acquaintances, and a manual orthographic transcription of the entire material. On basis of the orthographic transcriptions, we automatically annotated approximately 50 percent of thematerial on the phoneme level, by means of a forced alignment between the acoustic signal and pronunciations listed in a dictionary. Approximately seven percent of the automatic transcription was manually corrected. Taking the manual correction as a gold standard, we evaluated several sources of pronunciation variants for the automatic transcription. Spontal-N is intended as a general purpose speech resource that is also suitable for investigating phonetic detail.

Place, publisher, year, edition, pages
2010. 2986-2991 p.
National Category
Computer Science Language Technology (Computational Linguistics)
URN: urn:nbn:se:kth:diva-52124ISBN: 2-9517408-6-7OAI: diva2:465419
the Seventh conference on International Language Resources and Evaluation (LREC'10). Valetta, Malta. 19-20-21 May 2010
tmh_import_11_12_14. QC 20111219Available from: 2011-12-14 Created: 2011-12-14 Last updated: 2011-12-19Bibliographically approved

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Edlund, Jens
By organisation
Speech Communication and Technology
Computer ScienceLanguage Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 11 hits
ReferencesLink to record
Permanent link

Direct link