kth.sePublications KTH
Operational message
There are currently operational disruptions. Troubleshooting is in progress.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science. RISE Res Inst Sweden, Borås, Sweden.ORCID iD: 0009-0004-3798-8603
CISPA Helmholtz Ctr Informat Secur, Saarbrucken, Germany.
RISE Res Inst Sweden, Borås, Sweden; Halmstad Univ, Halmstad, Sweden.
RISE Res Inst Sweden, Borås, Sweden.
Show others and affiliations
2025 (English)In: 2025 IEEE/CVF Winter Conference On Applications Of Computer Vision (Wacv), Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 6602-6611Conference paper, Published paper (Refereed)
Abstract [en]

In the ever-evolving adversarial machine learning landscape, developing effective defenses against patch attacks has become a critical challenge, necessitating reliable solutions to safeguard real-world AI systems. Although diffusion models have shown remarkable capacity in image synthesis and have been recently utilized to counter l(p)-norm bounded attacks, their potential in mitigating localized patch attacks remains largely underexplored. In this work, we propose DiffPAD, a novel framework that harnesses the power of diffusion models for adversarial patch decontamination. DiffPAD first performs super-resolution restoration on downsampled input images, then adopts binarization, dynamic thresholding scheme and sliding window for effective localization of adversarial patches. Such a design is inspired by the theoretically derived correlation between patch size and diffusion restoration error that is generalized across diverse patch attack scenarios. Finally, DiffPAD applies inpainting techniques to the original input images with the estimated patch region being masked. By integrating closed-form solutions for super-resolution restoration and image inpainting into the conditional reverse sampling process of a pre-trained diffusion model, DiffPAD obviates the need for text guidance or finetuning. Through comprehensive experiments, we demonstrate that DiffPAD not only achieves state-of-the-art adversarial robustness against patch attacks but also excels in recovering naturalistic images without patch remnants. The source code is available at https://github.com/JasonFu1998/DiffPAD.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2025. p. 6602-6611
Series
IEEE Winter Conference on Applications of Computer Vision, ISSN 2472-6737
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:kth:diva-374026DOI: 10.1109/WACV61041.2025.00643ISI: 001521272600154Scopus ID: 2-s2.0-105003628690ISBN: 979-8-3315-1084-8 (print)ISBN: 979-8-3315-1083-1 (print)OAI: oai:DiVA.org:kth-374026DiVA, id: diva2:2023040
Conference
2025 Winter Conference on Applications of Computer Vision-WACV, FEB 28-MAR 04, 2025, Tucson, AZ
Note

QC 20251218

Available from: 2025-12-18 Created: 2025-12-18 Last updated: 2025-12-18Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Fu, JiaHolst, Anders

Search in DiVA

By author/editor
Fu, JiaHolst, Anders
By organisation
Computer ScienceComputational Science and Technology (CST)
Electrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 5 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf