kth.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Listener sensitivity to deviating obstruents in WaveNet
Sigmedia Lab, ADAPT Centre, School of Engineering, Trinity College Dublin, Ireland.
KTH, Skolan för elektroteknik och datavetenskap (EECS), Intelligenta system, Tal, musik och hörsel, TMH.ORCID-id: 0000-0001-9327-9482
Sigmedia Lab, ADAPT Centre, School of Engineering, Trinity College Dublin, Ireland.
Sigmedia Lab, ADAPT Centre, School of Engineering, Trinity College Dublin, Ireland.
2023 (Engelska)Ingår i: Interspeech 2023, International Speech Communication Association , 2023, s. 1080-1084Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

This paper investigates the perceptual significance of the deviation in obstruents previously observed in WaveNet vocoders. The study involved presenting stimuli of varying lengths to 128 participants, who were asked to identify whether each stimulus was produced by a human or a machine. The participants' responses were captured using a 2-alternative forced choice task. The study found that while the length of the stimuli did not reliably affect participants' accuracy in the task, the concentration of obstruents did have a significant effect. Participants were consistently more accurate in identifying WaveNet stimuli as machine when the phrases were obstruent-rich. These findings show that the deviation in obstruents reported in WaveNet voices is perceivable by human listeners. The test protocol may be of wider utility in TTS.

Ort, förlag, år, upplaga, sidor
International Speech Communication Association , 2023. s. 1080-1084
Nyckelord [en]
distortion, obstruents, perception, TTS evaluation, WaveNet
Nationell ämneskategori
Psykologi (exklusive tillämpad psykologi)
Identifikatorer
URN: urn:nbn:se:kth:diva-337831DOI: 10.21437/Interspeech.2023-1843Scopus ID: 2-s2.0-85171585188OAI: oai:DiVA.org:kth-337831DiVA, id: diva2:1803491
Konferens
24th International Speech Communication Association, Interspeech 2023, Dublin, Ireland, Aug 20 2023 - Aug 24 2023
Anmärkning

QC 20231009

Tillgänglig från: 2023-10-09 Skapad: 2023-10-09 Senast uppdaterad: 2023-10-09Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Edlund, Jens

Sök vidare i DiVA

Av författaren/redaktören
Edlund, Jens
Av organisationen
Tal, musik och hörsel, TMH
Psykologi (exklusive tillämpad psykologi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 54 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf