kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Relative Representations: Topological and Geometric Perspectives
Amsterdam Machine Learning Lab, University of Amsterdam, Netherlands.
KTH, School of Engineering Sciences (SCI), Mathematics (Dept.), Algebra, Combinatorics and Topology.ORCID iD: 0009-0004-8248-229X
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0003-2965-2953
KTH, School of Engineering Sciences (SCI), Mathematics (Dept.), Algebra, Combinatorics and Topology.ORCID iD: 0000-0001-6007-9273
2024 (English)In: Proceedings of UniReps: 2nd Edition of the Workshop on Unifying Representations in Neural Models / [ed] Marco Fumero; Clementine Domine; Zorah Lähner; Donato Crisostomi; Luca Moschella; Kimberly Stachenfeld, ML Research Press , 2024Conference paper, Published paper (Refereed)
Abstract [en]

Relative representations are an established approach to zero-shot model stitching, consisting of a non-trainable transformation of the latent space of a deep neural network. Based on insights of topological and geometric nature, we propose two improvements to relative representations. First, we introduce a normalization procedure in the relative transformation, resulting in invariance to non-isotropic rescalings and permutations. The latter coincides with the symmetries in parameter space induced by common activation functions. Second, we propose to deploy topological densification when fine-tuning relative representations, a topological regularization loss encouraging clustering within classes. We provide an empirical investigation on a natural language task, where both the proposed variations yield improved performance on zero-shot model stitching.

Place, publisher, year, edition, pages
ML Research Press , 2024.
Series
Proceedings of Machine Learning Research ; 285
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-370458Scopus ID: 2-s2.0-105014754343OAI: oai:DiVA.org:kth-370458DiVA, id: diva2:2002074
Conference
2nd Edition of the Workshop on Unifying Representations in Neural Models, UniReps 2024, Vancouver, Canada, December 14, 2024
Note

QC 20250929

Available from: 2025-09-29 Created: 2025-09-29 Last updated: 2025-09-29Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Scopusfulltext

Authority records

Marchetti, Giovanni LucaKragic Jensfelt, DanicaScolamiero, Martina

Search in DiVA

By author/editor
Marchetti, Giovanni LucaKragic Jensfelt, DanicaScolamiero, Martina
By organisation
Algebra, Combinatorics and TopologyRobotics, Perception and Learning, RPL
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 16 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf