Take a Look, it's in a Book, a Reading Robot
School of Computing Science, Simon Fraser University, Canada.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.ORCID iD: 0000-0002-1886-681X
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH.
School of Computing Science, Simon Fraser University, Canada.
2025 (English). In: HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction, Institute of Electrical and Electronics Engineers (IEEE), 2025, p. 1803-1805. Conference paper, Published paper (Refereed)
Abstract [en]

We demonstrate EmojiVoice, a free, customizable text-to-speech (TTS) toolkit for expressive speech on social robots. We demonstrate our voices through storytelling, a task intended for deployment in classrooms or libraries, where the robot can read a story aloud to children. Moreover, we introduce adaptive clarity for noisy environments and for listeners with reduced comprehension ability. This storytelling robot voice allows us to demonstrate how, using our lightweight and customizable TTS, we can produce a voice that is expressive, engaging, clear, and socially appropriate for the task, improving interactions with and perceptions of social robots.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025. p. 1803-1805
Keywords [en]
clear speech synthesis, education robots, Expressive speech synthesis, human robot interaction, noise robust speech synthesis, second language speakers, social robotics, storytelling robots
National Category
Robotics and automation; Other Engineering and Technologies; Human Computer Interaction; Natural Language Processing
Identifiers
URN: urn:nbn:se:kth:diva-363761
DOI: 10.1109/HRI61500.2025.10973801
Scopus ID: 2-s2.0-105004876693
OAI: oai:DiVA.org:kth-363761
DiVA, id: diva2:1959856
Conference
20th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2025, Melbourne, Australia, Mar 4 2025 - Mar 6 2025
Note

 Part of ISBN 9798350378931 QC 20250526

Available from: 2025-05-21 Created: 2025-05-21 Last updated: 2025-05-26 Bibliographically approved

Authority records

Mehta, Shivam; Henter, Gustav Eje

By author/editor
Mehta, Shivam; Syvenky, Zachary; Henter, Gustav Eje
By organisation
Speech, Music and Hearing, TMH; Robotics, Perception and Learning, RPL