kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Covariance properties under natural image transformations for the generalized Gaussian derivative model for visual receptive fields
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Computational Science and Technology (CST). (Computational Brain Science Lab)ORCID iD: 0000-0002-9081-2170
2023 (English)Report (Other academic)
Abstract [en]

The property of covariance, also referred to as equivariance, means that an image operator is well-behaved under image transformations, in the sense that the result of applying the image operator to a transformed input image gives essentially a similar result as applying the same image transformation to the output of applying the image operator to the original image. This paper presents a theory of geometric covariance properties in vision, developed for a generalized Gaussian derivative model of receptive fields in the primary visual cortex and the lateral geniculate nucleus, which, in turn, enable geometric invariance properties at higher levels in the visual hierarchy.

It is shown how the studied generalized Gaussian derivative model for visual receptive fields obeys true covariance properties under spatial scaling transformations, spatial affine transformations, Galilean transformations and temporal scaling transformations. These covariance properties imply that a vision system, based on image and video measurements in terms of the receptive fields according to the generalized Gaussian derivative model, can, to first order of approximation, handle the image and video deformations between multiple views of objects delimited by smooth surfaces, as well as between multiple views of spatio-temporal events, under varying relative motions between the objects and events in the world and the observer.

We conclude by describing implications of the presented theory for biological vision, regarding connections between the variabilities of the shapes of biological visual receptive fields and the variabilities of spatial and spatio-temporal image structures under natural image transformations. Specifically, we formulate experimentally testable biological hypotheses as well as needs for measuring population statistics of receptive field characteristics, originating from predictions from the presented theory, concerning the extent to which the shapes of the biological receptive fields in the primary visual cortex span the variabilities of spatial and spatio-temporal image structures induced by natural image transformations, based on geometric covariance properties.

Place, publisher, year, edition, pages
2023. , p. 38
Keywords [en]
receptive field, image transformations, scale covariance, affine covariance, Galilean covariance, primary visual cortex, lateral geniculate nucleus, vision, theoretical neuroscience, theoretical biology
National Category
Bioinformatics (Computational Biology) Neurosciences Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-324883OAI: oai:DiVA.org:kth-324883DiVA, id: diva2:1744469
Projects
Covariant and invariant deep networks
Funder
Swedish Research Council, 2022-02969
Note

QC 20230328

Available from: 2023-03-20 Created: 2023-03-20 Last updated: 2025-02-01Bibliographically approved

Open Access in DiVA

fulltext(7782 kB)61 downloads
File information
File name FULLTEXT04.pdfFile size 7782 kBChecksum SHA-512
86ad32971196d4665768e4238799b144be22147cf1c2205f8b6f2c925a0f40f338564d473fe3663f69e6cd6c569ba49b0843593c7f4758cc27dd889c2a2dbfd4
Type fulltextMimetype application/pdf

Other links

arXiv:2303.09803

Authority records

Lindeberg, Tony

Search in DiVA

By author/editor
Lindeberg, Tony
By organisation
Computational Science and Technology (CST)
Bioinformatics (Computational Biology)NeurosciencesComputer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar
Total: 92 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 929 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf