Ändra sökning
Avgränsa sökresultatet
2345 201 - 205 av 205
RefereraExporteraLänk till träfflistan
Permanent länk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Träffar per sida
  • 5
  • 10
  • 20
  • 50
  • 100
  • 250
Sortering
  • Standard (Relevans)
  • Författare A-Ö
  • Författare Ö-A
  • Titel A-Ö
  • Titel Ö-A
  • Publikationstyp A-Ö
  • Publikationstyp Ö-A
  • Äldst först
  • Nyast först
  • Skapad (Äldst först)
  • Skapad (Nyast först)
  • Senast uppdaterad (Äldst först)
  • Senast uppdaterad (Nyast först)
  • Disputationsdatum (tidigaste först)
  • Disputationsdatum (senaste först)
  • Standard (Relevans)
  • Författare A-Ö
  • Författare Ö-A
  • Titel A-Ö
  • Titel Ö-A
  • Publikationstyp A-Ö
  • Publikationstyp Ö-A
  • Äldst först
  • Nyast först
  • Skapad (Äldst först)
  • Skapad (Nyast först)
  • Senast uppdaterad (Äldst först)
  • Senast uppdaterad (Nyast först)
  • Disputationsdatum (tidigaste först)
  • Disputationsdatum (senaste först)
Markera
Maxantalet träffar du kan exportera från sökgränssnittet är 250. Vid större uttag använd dig av utsökningar.
  • 201. Zangger Borch, D.
    et al.
    Sundberg, Johan
    KTH, Tidigare Institutioner, Tal, musik och hörsel.
    Lindestad, P.
    Thalén, M.
    Vocal fold vibration and voice source aperiodicity in "dist" tones: a study of a timbral ornament in rock singing2004Ingår i: Logopedics, Phoniatrics, Vocology, ISSN 1401-5439, E-ISSN 1651-2022, Vol. 29, nr 4, s. 147-153Artikel i tidskrift (Refereegranskat)
    Abstract [en]

    The acoustic characteristics of so-called 'dist' tones, commonly used in singing rock music, are analyzed in a case study. In an initial experiment a professional rock singer produced examples of 'dist' tones. The tones were found to contain aperiodicity, SPL at 0.3 m varied between 90 and 96 dB, and subglottal pressure varied in the range of 20-43 cm H2O, a doubling yielding, on average, an SPL increase of 2.3 dB. In a second experiment, the associated vocal fold vibration patterns were recorded by digital high-speed imaging of the same singer. Inverse filtering of the simultaneously recorded audio signal showed that the aperiodicity was caused by a low frequency modulation of the flow glottogram pulse amplitude. This modulation was produced by an aperiodic or periodic vibration of the supraglottic mucosa. This vibration reduced the pulse amplitude by obstructing the airway for some of the pulses produced by the apparently periodically vibrating vocal folds. The supraglottic mucosa vibration can be assumed to be driven by the high airflow produced by the elevated subglottal pressure.

  • 202.
    Zetterholm, Elisabeth
    et al.
    Department of Philosophy & Linguistics, Umeå University.
    Blomberg, Mats
    KTH, Tidigare Institutioner, Tal, musik och hörsel.
    Elenius, Daniel
    KTH, Tidigare Institutioner, Tal, musik och hörsel.
    A comparison between human perception and a speaker verification system score of a voice imitation2004Ingår i: Proc of Tenth Australian International Conference on Speech Science & Technology, 2004, s. 393-397Konferensbidrag (Refereegranskat)
    Abstract [en]

    A professional impersonator has been studied when training his voice tomimic two target speakers. A three-fold investigation has been conducted; acomputer-based speaker verification system was used, phonetic-acousticmeasurements were made and a perception test was conducted. Our ideabehind using this type of system is to measure how close to the target voice aprofessional impersonation might be able to reach and to relate this tophonetic-acoustic analyses of the mimic speech and human perception. Thesignificantly increased verification scores and the phonetic-acoustic analysesshow that the impersonator really changes his natural voice and speech in hisimitations. The results of the perception test show that there is no, or only asmall, correlation between the verification system and the listeners whenestimating the voice imitations and how close they are to one of the targetspeakers.

  • 203.
    Zhang, Xi
    et al.
    KTH, Tidigare Institutioner, Signaler, sensorer och system.
    Ottersten, Björn
    KTH, Tidigare Institutioner, Tal, musik och hörsel.
    Power allocation and bit loading for spatial multiplexing in MIMO systems2003Ingår i: 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS - SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, NEW YORK: IEEE , 2003, s. 53-56Konferensbidrag (Refereegranskat)
    Abstract [en]

    The power assignment problem is important for Multiple-Input-Multiple-Output (MIMO) systems to achieve high capacity. Although this problem is solved by well-known water filling algorithms, this does not provide an optimal solution if the system is constrained to a fixed raw bit error rate threshold and to discrete modulation orders. In this work an approximate approach, called QoS based WF, is proposed to solve the power assignment problem with such constrains. It is shown to outperform quantization of the conventional water filling solution and a well known bit loading algorithm (Chow's algorithm) used in the Digital Subscriber Lines (DSL).

  • 204. Öhlin, David
    et al.
    Carlson, Rolf
    KTH, Tidigare Institutioner, Tal, musik och hörsel.
    Data-driven formant synthesis2004Ingår i: Proceedings FONETIK 2004: The XVIIth Swedish Phonetics Conference / [ed] Peter Branderud, Hartmut Traunmüller, Stockholm University, 2004, s. 160-163Konferensbidrag (Övrigt vetenskapligt)
  • 205. Öhman, Tobias
    et al.
    Salvi, Giampiero
    KTH, Tidigare Institutioner, Tal, musik och hörsel.
    Using HMMs and ANNs for mapping acoustic to visual speech1999Ingår i: TMH-QPSR, Vol. 40, nr 1-2, s. 45-50Artikel i tidskrift (Övrigt vetenskapligt)
    Abstract [en]

    In this paper we present two different methods for mapping auditory, telephonequality speech to visual parameter trajectories, specifying the movements of ananimated synthetic face. In the first method, Hidden Markov Models (HMMs)where used to obtain phoneme strings and time labels. These where thentransformed by rules into parameter trajectories for visual speech synthesis. In thesecond method, Artificial Neural Networks (ANNs) were trained to directly mapacoustic parameters to synthesis parameters. Speaker independent HMMs weretrained on a phonetically transcribed telephone speech database. Differentunderlying units of speech were modelled by the HMMs, such as monophones,diphones, triphones, and visemes. The ANNs were trained on male, female , andmixed speakers.The HMM method and the ANN method were evaluated through audio-visualintelligibility tests with ten hearing impaired persons, and compared to “ideal”articulations (where no recognition was involved), a natural face, and to theintelligibility of the audio alone. It was found that the HMM method performsconsiderably better than the audio alone condition (54% and 34% keywordscorrect, respectively), but not as well as the “ideal” articulating artificial face(64%). The intelligibility for the ANN method was 34% keywords correct.

2345 201 - 205 av 205
RefereraExporteraLänk till träfflistan
Permanent länk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf