kth.sePublikationer KTH
Ändra sökning
Länk till posten
Permanent länk

Direktlänk
Gutha, Sai bharath chandraORCID iD iconorcid.org/0000-0002-9337-564X
Alternativa namn
Publikationer (2 of 2) Visa alla publikationer
Gutha, S. B., Vinuesa, R. & Azizpour, H. (2025). Inverse Problems with Diffusion Models: A MAP Estimation Perspective. In: Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025: . Paper presented at 2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025, Tucson, United States of America, Feb 28 2025 - Mar 4 2025 (pp. 4153-4162). Institute of Electrical and Electronics Engineers (IEEE)
Öppna denna publikation i ny flik eller fönster >>Inverse Problems with Diffusion Models: A MAP Estimation Perspective
2025 (Engelska)Ingår i: Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, s. 4153-4162Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Inverse problems have many applications in science and engineering. In Computer vision, several image restoration tasks such as inpainting, deblurring, and super-resolution can be formally modeled as inverse problems. Recently, methods have been developed for solving inverse problems that only leverage a pre-trained unconditional diffusion model and do not require additional task-specific training. In such methods, however, the inherent intractability of determining the conditional score function during the reverse diffusion process poses a real challenge, leaving the methods to settle with an approximation instead, which affects their performance in practice. Here, we propose a MAP estimation framework to model the reverse conditional generation process of a continuous time diffusion model as an optimization process of the underlying MAP objective, whose gradient term is tractable. In theory, the proposed framework can be applied to solve general inverse problems using gradient-based optimization methods. However, given the highly non-convex nature of the loss objective, finding a perfect gradient-based optimization algorithm can be quite challenging, nevertheless, our framework offers several potential research directions. We use our proposed formulation to develop empirically effective algorithms for image restoration. We validate our proposed algorithms with extensive experiments over multiple datasets across several restoration tasks.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2025
Nyckelord
conditional generation, consistency models, diffusion models, inverse problems, map estimation, optimization
Nationell ämneskategori
Beräkningsmatematik Datorgrafik och datorseende
Identifikatorer
urn:nbn:se:kth:diva-363209 (URN)10.1109/WACV61041.2025.00408 (DOI)001481328900398 ()2-s2.0-105003630084 (Scopus ID)
Konferens
2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025, Tucson, United States of America, Feb 28 2025 - Mar 4 2025
Anmärkning

Part of ISBN 9798331510831

QC 20250509

Tillgänglig från: 2025-05-07 Skapad: 2025-05-07 Senast uppdaterad: 2025-12-08Bibliografiskt granskad
Nilsson, A., Wijk, K., Gutha, S. b., Englesson, E., Hotti, A., Saccardi, C., . . . Azizpour, H. (2024). Indirectly Parameterized Concrete Autoencoders. In: International Conference on Machine Learning, ICML 2024: . Paper presented at 41st International Conference on Machine Learning, ICML 2024, Vienna, Austria, Jul 21 2024 - Jul 27 2024 (pp. 38237-38252). ML Research Press
Öppna denna publikation i ny flik eller fönster >>Indirectly Parameterized Concrete Autoencoders
Visa övriga...
2024 (Engelska)Ingår i: International Conference on Machine Learning, ICML 2024, ML Research Press , 2024, s. 38237-38252Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Feature selection is a crucial task in settings where data is high-dimensional or acquiring the full set of features is costly. Recent developments in neural network-based embedded feature selection show promising results across a wide range of applications. Concrete Autoencoders (CAEs), considered state-of-the-art in embedded feature selection, may struggle to achieve stable joint optimization, hurting their training time and generalization. In this work, we identify that this instability is correlated with the CAE learning duplicate selections. To remedy this, we propose a simple and effective improvement: Indirectly Parameterized CAEs (IP-CAEs). IP-CAEs learn an embedding and a mapping from it to the Gumbel-Softmax distributions' parameters. Despite being simple to implement, IP-CAE exhibits significant and consistent improvements over CAE in both generalization and training time across several datasets for reconstruction and classification. Unlike CAE, IP-CAE effectively leverages non-linear relationships and does not require retraining the jointly optimized decoder. Furthermore, our approach is, in principle, generalizable to Gumbel-Softmax distributions beyond feature selection.

Ort, förlag, år, upplaga, sidor
ML Research Press, 2024
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:kth:diva-353956 (URN)2-s2.0-85203808876 (Scopus ID)
Konferens
41st International Conference on Machine Learning, ICML 2024, Vienna, Austria, Jul 21 2024 - Jul 27 2024
Anmärkning

QC 20240926

Tillgänglig från: 2024-09-25 Skapad: 2024-09-25 Senast uppdaterad: 2024-09-26Bibliografiskt granskad
Organisationer
Identifikatorer
ORCID-id: ORCID iD iconorcid.org/0000-0002-9337-564X

Sök vidare i DiVA

Visa alla publikationer