kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Word prominence ratings in Swedish television news readings: Effects of pitch accents and head movements
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0002-4628-3769
2020 (English)In: Proceedings of the International Conference on Speech Prosody, International Speech Communication Association , 2020, Vol. 2020, p. 314-318Conference paper, Published paper (Refereed)
Abstract [en]

Prosodic prominence is a multimodal phenomenon where pitch accents are frequently aligned with visible movements by the hands, head, or eyebrows. However, little is known about how such movements function as visible prominence cues in multimodal speech perception with most previous studies being restricted to experimental settings. In this study, we are piloting the acquisition of multimodal prominence ratings for a corpus of natural speech (Swedish television news readings). Sixteen short video clips (218 words) of news readings were extracted from a larger corpus and rated by 44 native Swedish adult volunteers using a web-based set-up. The task was to rate each word in a clip as either non-prominent, moderately prominent or strongly prominent based on audiovisual cues. The corpus was previously annotated for pitch accents and head movements. We found that words realized with a pitch accent and head movement tended to receive higher prominence ratings than words with a pitch accent only. However, we also examined ratings for a number of carefully selected individual words, and these case studies suggest that ratings are affected by complex relations between the presence of a head movement and its type of alignment, the word's F0 profile, and semantic and pragmatic factors.

Place, publisher, year, edition, pages
International Speech Communication Association , 2020. Vol. 2020, p. 314-318
Series
Proceedings of the International Conference on Speech Prosody, ISSN 2333-2042 ; 2020
Keywords [en]
Audiovisual prosody, Multimodal prominence, Multimodal speech perception, Case-studies, Head movements, Multi-modal, Natural speech, Pitch accents, Speech perception, Video clips, Web based, Semantics
National Category
General Language Studies and Linguistics
Identifiers
URN: urn:nbn:se:kth:diva-290421DOI: 10.21437/SpeechProsody.2020-64Scopus ID: 2-s2.0-85093884721OAI: oai:DiVA.org:kth-290421DiVA, id: diva2:1530372
Conference
10th International Conference on Speech Prosody 2020; Communicative and Interactive Prosody, Tokyo; Japan; 25 May 2020 through 28 May 2020
Note

QC 20210222

Available from: 2021-02-22 Created: 2021-02-22 Last updated: 2022-06-25Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

House, David

Search in DiVA

By author/editor
House, David
By organisation
Speech, Music and Hearing, TMH
General Language Studies and Linguistics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 136 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf