Exploring expressivity and emotion with artificial voice and speech technologies
Department of Theatre, Film and Television, University of York, East Campus, Baird Lane, York YO10 5GB, United Kingdom. ORCID iD: 0000-0002-9404-851X
2013 (English) In: Logopedics, Phoniatrics, Vocology, ISSN 1401-5439, E-ISSN 1651-2022, Vol. 38, no 3, p. 115-125
Article in journal (Refereed) Published
Abstract [en]

Emotion in audio-voice signals, as synthesized by text-to-speech (TTS) technologies, was investigated to formulate a theory of expression for user interface design. Emotional parameters were specified with markup tags, and the resulting audio was further modulated with post-processing techniques. Software was then developed to link a selected TTS synthesizer with an automatic speech recognition (ASR) engine, producing a chatbot that could speak and listen. Using these two artificial voice subsystems, investigators explored both artistic and psychological implications of artificial speech emotion. Goals of the investigation were interdisciplinary, with interest in musical composition, augmentative and alternative communication (AAC), commercial voice announcement applications, human-computer interaction (HCI), and artificial intelligence (AI). The work-in-progress points towards an emerging interdisciplinary ontology for artificial voices. As one study output, HCI tools are proposed for future collaboration.

Place, publisher, year, edition, pages
Taylor & Francis, 2013. Vol. 38, no 3, p. 115-125
Keywords [en]
speech synthesis, speech technology
National Category
Media Engineering
Identifiers
URN: urn:nbn:se:kth:diva-249414
DOI: 10.3109/14015439.2013.810303
ISI: 000325007700004
PubMedID: 24024543
Scopus ID: 2-s2.0-84885016119
OAI: oai:DiVA.org:kth-249414
DiVA, id: diva2:1313796
Note

QC 20190614

Available from: 2019-05-06 Created: 2019-05-06 Last updated: 2019-06-14 Bibliographically approved

By author/editor
Pauletto, Sandra