kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Adjustable Visual Appearance for Generalizable Novel View Synthesis
Computer Vision Group, Chalmers University of Technology, Gothenburg, Sweden.
Computer Vision Group, Chalmers University of Technology, Gothenburg, Sweden.
Computer Vision Group, Chalmers University of Technology, Gothenburg, Sweden.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0001-9296-9166
Show others and affiliations
2025 (English)In: Pattern Recognition and Artificial Intelligence - 4th International Conference, ICPRAI 2024, Proceedings, Springer Nature , 2025, p. 157-171Conference paper, Published paper (Refereed)
Abstract [en]

We present a generalizable novel view synthesis method which enables modifying the visual appearance of an observed scene so rendered views match a target weather or lighting condition, without any scene specific training or access to reference views at the target condition. Our method is based on a pretrained generalizable transformer architecture and is fine-tuned on synthetically generated scenes under different appearance conditions. This allows for rendering novel views in a consistent manner for 3D scenes that were not included in the training set, along with the ability to (i) modify their appearance to match the target condition and (ii) smoothly interpolate between different conditions. Experiments on real and synthetic scenes show that our method is able to generate 3D consistent renderings while making realistic appearance changes, including qualitative and quantitative comparisons. Please refer to our project page for video results: https://ava-nvs.github.io.

Place, publisher, year, edition, pages
Springer Nature , 2025. p. 157-171
Keywords [en]
3D Style Transfer, Generalizable Novel View Synthesis, NeRFs
National Category
Computer graphics and computer vision Signal Processing
Identifiers
URN: urn:nbn:se:kth:diva-361150DOI: 10.1007/978-981-97-8702-9_11Scopus ID: 2-s2.0-85219213349OAI: oai:DiVA.org:kth-361150DiVA, id: diva2:1944105
Conference
4th International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2024, Jeju Island, Korea, Jul 3 2024 - Jul 6 2024
Note

Part of ISBN 9789819787012

QC 20250313

Available from: 2025-03-12 Created: 2025-03-12 Last updated: 2025-03-13Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Büsching, Marcel

Search in DiVA

By author/editor
Büsching, Marcel
By organisation
Robotics, Perception and Learning, RPL
Computer graphics and computer visionSignal Processing

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 37 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf