Change search
Refine search result
1 - 19 of 19
CiteExportLink to result list
Permanent link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Rows per page
  • 5
  • 10
  • 20
  • 50
  • 100
  • 250
Sort
  • Standard (Relevance)
  • Author A-Ö
  • Author Ö-A
  • Title A-Ö
  • Title Ö-A
  • Publication type A-Ö
  • Publication type Ö-A
  • Issued (Oldest first)
  • Issued (Newest first)
  • Created (Oldest first)
  • Created (Newest first)
  • Last updated (Oldest first)
  • Last updated (Newest first)
  • Disputation date (earliest first)
  • Disputation date (latest first)
  • Standard (Relevance)
  • Author A-Ö
  • Author Ö-A
  • Title A-Ö
  • Title Ö-A
  • Publication type A-Ö
  • Publication type Ö-A
  • Issued (Oldest first)
  • Issued (Newest first)
  • Created (Oldest first)
  • Created (Newest first)
  • Last updated (Oldest first)
  • Last updated (Newest first)
  • Disputation date (earliest first)
  • Disputation date (latest first)
Select
The maximal number of hits you can export is 250. When you want to export more records please use the Create feeds function.
  • 1. Camargo, Zuleica
    et al.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Pinho, Sílvia
    Análise acústica e aerodinâmica da voz (Acoustic and aerodynamic assessment of voice)2011In: Tratado de Otorrinolaringologia (Treatise on Otorhinolaryngology) / [ed] Caldas, S.; Melo, J.F.; Martins, R.H.; Selaimen, S., Sao Paulo: Editora Roca , 2011, p. 794-804Chapter in book (Other academic)
  • 2. Eyben, Florian
    et al.
    Salomao, Glaucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Sundberg, Johan
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Scherer, Klaus R.
    Schuller, Bjorn W.
    Emotion in the singing voice-a deeper look at acoustic features in the light of automatic classification2015In: EURASIP Journal on Audio, Speech, and Music Processing, ISSN 1687-4714, E-ISSN 1687-4722, ISSN 1687-4714, article id 19Article in journal (Refereed)
    Abstract [en]

    We investigate the automatic recognition of emotions in the singing voice and study the worth and role of a variety of relevant acoustic parameters. The data set contains phrases and vocalises sung by eight renowned professional opera singers in ten different emotions and a neutral state. The states are mapped to ternary arousal and valence labels. We propose a small set of relevant acoustic features basing on our previous findings on the same data and compare it with a large-scale state-of-the-art feature set for paralinguistics recognition, the baseline feature set of the Interspeech 2013 Computational Paralinguistics ChallengE (ComParE). A feature importance analysis with respect to classification accuracy and correlation of features with the targets is provided in the paper. Results show that the classification performance with both feature sets is similar for arousal, while the ComParE set is superior for valence. Intra singer feature ranking criteria further improve the classification accuracy in a leave-one-singer-out cross validation significantly.

  • 3. Gerholm, Tove
    et al.
    Gustavsson, L.
    Schwarz, I.
    Marklund, U.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH. Institutionen för Lingvistiken, Stockholm Universitet.
    Kallioinen, P.
    Andersson, S.
    Eriksson, F.
    Pagmar, D.
    Tahbaz, S.
    The Swedish MINT Project modelling infant language acquisition2015Conference paper (Refereed)
  • 4.
    Salomao, Glaucia Lais
    et al.
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH. Pontifical Catholic Univ, Brazil.
    Sundberg, Johan
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Music Acoustics.
    What do male singers mean by modal and falsetto register? An investigation of the glottal voice source2009In: Logopedics, Phoniatrics, Vocology, ISSN 1401-5439, E-ISSN 1651-2022, ISSN 1401-5439, Vol. 34, no 2, p. 73-83Article in journal (Refereed)
    Abstract [en]

    The voice source differs between modal and falsetto registers, but singers often try to reduce the associated timbral differences, some even doubting that there are any. A total of 54 vowel sounds sung in falsetto and modal register by 13 male more or less experienced choir singers were analyzed by inverse filtering and electroglottography. Closed quotient, maximum flow declination rate, peak-to-peak airflow amplitude, normalized amplitude quotient, and level difference between the two lowest source spectrum partials were determined, and systematic differences were found in all singers, regardless of experience of singing. The observations seem compatible with previous observations of thicker vocal folds in modal register.

  • 5.
    Salomão, Glaucia Lais
    et al.
    Pontifical Catholic University, Sao Paolo, Brazil.
    Sundberg, Johan
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Music Acoustics.
    Relation between perceived voice register and flow glottogram parameters in males2008In: Journal of the Acoustical Society of America, ISSN 0001-4966, E-ISSN 1520-8524, Vol. 124, no 1, p. 546-551Article in journal (Refereed)
    Abstract [en]

    The perception of modal and falsetto registers was analyzed in a material consisting of a total of 104 vowel sounds sung by 13 choir singers, 52 sung in modal register, and 52 in falsetto register. These vowel sounds were classified by 16 expert listeners in a forced choice test and the number of votes for modal was compared to the voice source parameters: (1) closed quotient (Q(closed)), (2) level difference between the two lowest source spectrum partials (H1-H2), (3) AC amplitude, (4) maximum flow declination rate (MFDR), and (5) normalized amplitude quotient (NAQ, AC amplitude/MFDR* fundamental frequency). Tones with a high value of Q(closed) and low values of H1-H2 and of NAQ were typically associated with high number of votes for modal register, and vice versa, Q(closed) showing the strongest correlation. Some singer subjects produced tones that could not be classified as either falsetto or modal register, suggesting that classification of registers is not always feasible.

  • 6.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    O impacto da pesquisa científica na clínica vocal (Impact of voice science in the voice clinic)2009Conference paper (Refereed)
  • 7.
    Salomão, Gláucia Laís
    Pontifícia Universidade Católica de São Paulo (PUC-SP), Departamento de Linguística Aplicada e Estudos da Linguagem (LAEL).
    Registros Vocais no Canto: Aspectos perceptivos, acústicos, aerodinâmicos e fisiológicos da voz modal e de falsete2008Doctoral thesis, monograph (Other academic)
  • 8.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Relationship between perceived vocal registers and flow glottogram parameters: preliminary results2008Conference paper (Refereed)
  • 9.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Music Acoustics. Pontifical Catholical University, Brazil.
    Relationship between perceived vocal registers and glottal flow parameters: preliminary results2008Conference paper (Refereed)
    Abstract [en]

    The perception of modal and falsetto registers was analyzedin a material consisting of a total of 80 vowel sounds sung by10 choir singers, 40 sung in modal register and 40 in falsettoregister. These vowel sounds were classified by sixteenexpert listeners in a force choice test and the number of votesfor modal was compared with the voice source parameters (1)Closed Quotient (Qclosed)and (2) level difference between thetwo lowest source spectrum partials (H1 – H2). Tones with ahigh value of Qclosed and low values of H1-H2 weretypically associated with high number of votes for modalregister, and vice versa, Qclosed showing the strongestcorrelation. Some singer subjects produced tones that couldnot be classified as either falsetto or modal register,suggesting that category perception of registers is not alwaysfeasible.

  • 10.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Relationships between glottal flow parameters and the perception of the voice quality2013Conference paper (Refereed)
  • 11.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Relevance of glottal flow parameters to the perception of voice quality2013Conference paper (Refereed)
  • 12.
    Salomão, Gláucia Laís
    et al.
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Sundberg, Johan
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Emotional Coloring in Singing: An in-depth spectral analysis if a tenor's voice2015In: IV International Conference on Music and Emotion ICME IV Book of Abstracts: Emotion in the Singing Voice: Convergent approaches from phonetics, psychology, and computer sciences / [ed] University of Geneva, Geneva University of Music, 2015, p. 65-Conference paper (Refereed)
  • 13.
    Salomão, Gláucia Laís
    et al.
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Sundberg, Johan
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Music Acoustics. Swiss Centre for Affective Sciences, University of Geneva .
    Emotional Coloring of the Singing Voice2015In: PEVOC 11th PAN-EUROPEAN VOICE CONFERENCE, Firenze, 2015, p. 80-Conference paper (Refereed)
  • 14.
    Salomão, Gláucia Laís
    et al.
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Sundberg, Johan
    Emotional coloringin singing: An in-depth spectral analysis of a tenor's voice: In: Emotion in the Singing Voice: Convergent approaches from phonetics, psychology, and computer sciences2015Conference paper (Refereed)
  • 15.
    Salomão, Gláucia Laís
    et al.
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Sundberg, Johan
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Perceptual relevance of voice source characteristics in male singers' modal and falsetto registers2010Conference paper (Refereed)
    Abstract [en]

    It is commonly assumed that the voice source differs between modal and falsetto registers. However, singers often try to reduce the timbral differences between registers, some even doubting that they even exist. The purpose of this study was to investigate the relation between the perceived voice register and the voice source parameters in modal and in falsetto registers as analyzed by inverse filtering and electroglottography. A total of 52 falsetto register tones and 52 modal register tones, sung by 13 male more or less experienced choir singers, were classified by 16 expert listeners in a forced choice test. The listeners’ classifications mostly agreed with the registers intended by the singers. For some tones classification differed substantially, presumably because the singers had learnt to reduce the timbral contrasts between the registers. The number of votes for modal was compared to the following voice source parameters: duration of the closed phase and closed quotient (Qclosed), peak-to-peak airflow amplitude, maximum flow declination rate (MFDR), normalized amplitude quotient (NAQ, defined as the ratio between peak-to-peak airflow amplitude and MFDR multiplied by the fundamental frequency) and level difference between the two lowest source spectrum partials. The results showed that: (1) tones with higher values of Qclosed and of MFDR, and lower values of H1-H2 and NAQ were typically associated with higher number of votes to the modal register, and viceversa, (2) NAQ was the single voice source parameter that presented the strongest correlation with the register classification data, while (3) the combination of Qclosed and H1-H2 parameters was the strongest predictor of the voice source perceptual data.

  • 16.
    Salomão, Gláucia Laís
    et al.
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Sundberg, Johan
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Voice-source differences between male singers’ modal and falsetto registers2008Conference paper (Refereed)
  • 17. Scherer, Klaus R.
    et al.
    Sundberg, Johan
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Music Acoustics.
    Tamarit, Lucas
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Comparing the acoustic expression of emotion in the speaking and the singing voice2015In: Computer speech & language (Print), ISSN 0885-2308, E-ISSN 1095-8363, ISSN 0885-2308, Vol. 29, no 1, p. 218-235Article in journal (Refereed)
    Abstract [en]

    We examine the similarities and differences in the expression of emotion in the singing and the speaking voice. Three internationally renowned opera singers produced “vocalises” (using a schwa vowel) and short nonsense phrases in different interpretations for 10 emotions. Acoustic analyses of emotional expression in the singing samples show significant differences between the emotions. In addition to the obvious effects of loudness and tempo, spectral balance and perturbation make significant contributions (high effect sizes) to this differentiation. A comparison of the emotion-specific patterns produced by the singers in this study with published data for professional actors portraying different emotions in speech generally show a very high degree of similarity. However, singers tend to rely more than actors on the use of voice perturbation, specifically vibrato, in particular in the case of high arousal emotions. It is suggested that this may be due to by the restrictions and constraints imposed by the musical structure.

  • 18.
    Södersten, M.
    et al.
    Karolinska Institutet, Sweden.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH. Karolinska Institutet, Sweden.
    McAllister, A.
    Karolinska Institutet, Sweden.
    Ternström, Sten
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Music Acoustics.
    Natural Voice Use in Patients With Voice Disorders and Vocally Healthy Speakers Based on 2 Days Voice Accumulator Information From a Database2015In: Journal of Voice, ISSN 0892-1997, E-ISSN 1873-4588, ISSN 0892-1997, Vol. 29, no 5, p. 646.e11-646.e19Article in journal (Refereed)
    Abstract [en]

    Objectives and Study Design. Information about how patients with voice disorders use their voices in natural communicative situations is scarce. Such long-term data have for the first time been uploaded to a central database from different hospitals in Sweden. The purpose was to investigate the potential use of a large set of long-term data for establishing reference values regarding voice use in natural situations. Methods. VoxLog (Sonvox AB, Umeå, Sweden) was tested for deployment in clinical practice by speech-language pathologists working at nine hospitals in Sweden. Files from 20 patients (16 females and 4 males) with functional, organic, or neurological voice disorders and 10 vocally healthy individuals (eight females and two males) were uploaded to a remote central database. All participants had vocally demanding occupations and had been monitored for more than 2 days. The total recording time was 681 hours and 50 minutes. Data on fundamental frequency (F0, Hz), phonation time (seconds and percentage), voice sound pressure level (SPL, dB), and background noise level (dB) were analyzed for each recorded day and compared between the 2 days. Variations across each day were measured using coefficients of variation. Results. Average F0, voice SPL, and especially the level of background noise varied considerably for all participants across each day. Average F0 and voice SPL were considerably higher than reference values from laboratory recordings. Conclusions. The use of a remote central database and strict protocols can accelerate data collection from larger groups of participants and contribute to establishing reference values regarding voice use in natural situations and from patients with voice disorders. Information about activities and voice symptoms would supplement the objective data and is recommended in future studies.

  • 19. Södersten, M.
    et al.
    Salomão, Gláucia Laís
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH.
    Ternström, Sten
    KTH, School of Computer Science and Communication (CSC), Speech, Music and Hearing, TMH, Music Acoustics.
    McAllister, A.
    Voice use during two days in patients with voice disorders and vocally healthy speakers based on voice accumulator information from a database2013In: Occupational Voice Symposium 2013, London, England, 2013Conference paper (Refereed)
1 - 19 of 19
CiteExportLink to result list
Permanent link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf