kth.sePublications KTH
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Modeling and analysis of the 8 filters from the “master key filters hypothesis” for depthwise-separable deep networks in relation to idealized receptive fields based on scale-space theory
KTH, School of Electrical Engineering and Computer Science (EECS), Computational Science and Technology.ORCID iD: 0000-0002-9081-2170
TU Wien.ORCID iD: 0000-0002-8219-005X
TU Wien.
2026 (English)In: Journal of Mathematical Imaging and Vision, ISSN 0924-9907, E-ISSN 1573-7683, Vol. 68, no 3, p. 22:1-22:26, article id 22Article in journal (Refereed) Published
Abstract [en]

This paper presents the results of analyzing and modeling a set of 8 “master key filters”, which have been extracted by applying a clustering approach to the receptive fields learned in depthwise-separable deep networks based on the ConvNeXt architecture. For this purpose, we first compute spatial spread measures in terms of weighted mean values and weighted variances of the absolute values of the learned filters, which support the working hypotheses that: (i) the learned filters can be modeled by separable filtering operations over the spatial domain, and that (ii) the spatial offsets of the those learned filters that are non-centered are rather close to half a grid unit. Then, we model the clustered “master key filters” in terms of difference operators applied to a spatial smoothing operation in terms of the discrete analog of the Gaussian kernel, and demonstrate that the resulting idealized models of the receptive fields show good qualitative similarity to the learned filters.

This modeling is performed in two different ways: (i) using possibly different values of the scale parameters in the coordinate directions for each filter, and (ii) using the same value of the scale parameter in both coordinate directions. Then, we perform the actual model fitting by either (i) requiring spatial spread measures in terms of spatial variances of the absolute values of the receptive fields to be equal, or (ii) minimizing the discrete l1- or l2-norms between the idealized receptive field models and the learned filters. Complementary experimental results then demonstrate that the idealized models of receptive fields have very good predictive properties for replacing the learned filters by idealized filters in depthwise-separable deep networks, thus showing that the learned filters in depthwise-separable deep networks can be well approximated by discrete scale-space filters.

Notably, we show that, for a reduced version of the ConvNeXt architecture, using a set of only 8 discrete scale-space filters leads to almost as good accuracy as for the receptive fields trained from scratch on the ImageNet dataset.

Place, publisher, year, edition, pages
Springer Nature , 2026. Vol. 68, no 3, p. 22:1-22:26, article id 22
Keywords [en]
receptive field, deep learning, discrete, continuous, Gaussian kernel, Gaussian derivative, depthwise-separable networks, scale space
National Category
Computer graphics and computer vision
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-382525DOI: 10.1007/s10851-026-01290-0OAI: oai:DiVA.org:kth-382525DiVA, id: diva2:2063052
Projects
Covariant and invariant deep networks
Funder
Swedish Research Council, 2022-02969
Note

QC 20260528

Available from: 2026-05-27 Created: 2026-05-27 Last updated: 2026-05-28Bibliographically approved

Open Access in DiVA

fulltext(1134 kB)21 downloads
File information
File name FULLTEXT01.pdfFile size 1134 kBChecksum SHA-512
a652fdd08b51d7bfbf9965a7b39aa9e798e01d1ee3194fb2d6b0fd254ab7069b17984b67b498f5c72a4785acfd42a85ba31732da06330cab5d5f8f56c8465bac
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Lindeberg, Tony

Search in DiVA

By author/editor
Lindeberg, TonyBabaiee, Zahra
By organisation
Computational Science and Technology
In the same journal
Journal of Mathematical Imaging and Vision
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 454 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf