kth.sePublications KTH
Operational message
There are currently operational disruptions. Troubleshooting is in progress.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Dessie: Disentanglement for Articulated 3D Horse Shape and Pose Estimation from Images
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0002-7627-0125
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL. KTH, Stockholm, Sweden; Scania, Södertälje, Sweden.ORCID iD: 0000-0002-6679-4021
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL. KTH, Stockholm, Sweden.ORCID iD: 0000-0002-9486-9238
SLU, Uppsala, Sweden.
Show others and affiliations
2025 (English)In: Computer Vision – ACCV 2024 - 17th Asian Conference on Computer Vision, Proceedings, Springer Science and Business Media Deutschland GmbH , 2025, p. 268-288Conference paper, Published paper (Refereed)
Abstract [en]

In recent years, 3D parametric animal models have been developed to aid in estimating 3D shape and pose from images and video. While progress has been made for humans, it’s more challenging for animals due to limited annotated data. To address this, we introduce the first method using synthetic data generation and disentanglement to learn to regress 3D shape and pose. Focusing on horses, we use text-based texture generation and a synthetic data pipeline to create varied shapes, poses, and appearances, learning disentangled spaces. Our method, Dessie, surpasses existing 3D horse reconstruction methods and generalizes to other large animals like zebras, cows, and deer. See the project website at: https://celiali.github.io/Dessie/.

Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH , 2025. p. 268-288
Keywords [en]
Animal 3D reconstruction, disentanglement
National Category
Computer graphics and computer vision Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:kth:diva-358262DOI: 10.1007/978-981-96-0972-7_16ISI: 001542340100016Scopus ID: 2-s2.0-85213389101OAI: oai:DiVA.org:kth-358262DiVA, id: diva2:1925462
Conference
17th Asian Conference on Computer Vision, ACCV 2024, Hanoi, Viet Nam, Dec 8 2024 - Dec 12 2024
Note

Part of ISBN 9789819609710]

QC 20250113

Available from: 2025-01-08 Created: 2025-01-08 Last updated: 2025-12-08Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Li, CiYang, YiWeng, ZehangKjellström, Hedvig

Search in DiVA

By author/editor
Li, CiYang, YiWeng, ZehangKjellström, Hedvig
By organisation
Robotics, Perception and Learning, RPL
Computer graphics and computer visionBioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 657 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf