kth.sePublikationer KTH
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Evaluation of generative models for emotional 3D animation generation in VR
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap, Beräkningsvetenskap och beräkningsteknik (CST).ORCID-id: 0000-0002-7414-845X
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap, Beräkningsvetenskap och beräkningsteknik (CST).ORCID-id: 0000-0003-1206-5701
KTH, Skolan för elektroteknik och datavetenskap (EECS), Människocentrerad teknologi, Medieteknik och interaktionsdesign, MID.ORCID-id: 0000-0002-6571-0623
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap, Beräkningsvetenskap och beräkningsteknik (CST).ORCID-id: 0000-0002-7257-0761
2025 (Engelska)Ingår i: Frontiers in Computer Science, E-ISSN 2624-9898, Vol. 7, artikel-id 1598099Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Introduction: Social interactions incorporate various nonverbal signals to convey emotions alongside speech, including facial expressions and body gestures. Generative models have demonstrated promising results in creating full-body nonverbal animations synchronized with speech; however, evaluations using statistical metrics in 2D settings fail to fully capture user-perceived emotions, limiting our understanding of the effectiveness of these models. Methods: To address this, we evaluate emotional 3D animation generative models within an immersive Virtual Reality (VR) environment, emphasizing user—centric metrics-emotional arousal realism, naturalness, enjoyment, diversity, and interaction quality—in a real-time human-agent interaction scenario. Through a user study (N = 48), we systematically examine perceived emotional quality for three state-of-the-art speech-driven 3D animation methods across two specific emotions: happiness (high arousal) and neutral (mid arousal). Additionally, we compare these generative models against real human expressions obtained via a reconstruction-based method to assess both their strengths and limitations and how closely they replicate real human facial and body expressions. Results: Our results demonstrate that methods explicitly modeling emotions lead to higher recognition accuracy compared to those focusing solely on speech-driven synchrony. Users rated the realism and naturalness of happy animations significantly higher than those of neutral animations, highlighting the limitations of current generative models in handling subtle emotional states. Discussion: Generative models underperformed compared to reconstruction-based methods in facial expression quality, and all methods received relatively low ratings for animation enjoyment and interaction quality, emphasizing the importance of incorporating user-centric evaluations into generative model development. Finally, participants positively recognized animation diversity across all generative models.

Ort, förlag, år, upplaga, sidor
Frontiers Media SA , 2025. Vol. 7, artikel-id 1598099
Nyckelord [en]
3D emotional animation, generative models, nonverbal communication, user-centric evaluation, virtual reality
Nationell ämneskategori
Människa-datorinteraktion (interaktionsdesign) Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:kth:diva-369923DOI: 10.3389/fcomp.2025.1598099ISI: 001549678200001Scopus ID: 2-s2.0-105013367950OAI: oai:DiVA.org:kth-369923DiVA, id: diva2:1999042
Anmärkning

QC 20250918

Tillgänglig från: 2025-09-18 Skapad: 2025-09-18 Senast uppdaterad: 2025-09-18Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Chhatre, KiranGuarese, RenanMatviienko, AndriiPeters, Christopher

Sök vidare i DiVA

Av författaren/redaktören
Chhatre, KiranGuarese, RenanMatviienko, AndriiPeters, Christopher
Av organisationen
Beräkningsvetenskap och beräkningsteknik (CST)Medieteknik och interaktionsdesign, MID
I samma tidskrift
Frontiers in Computer Science
Människa-datorinteraktion (interaktionsdesign)Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 111 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf