Covariance properties under natural image transformations for the generalized Gaussian derivative model for visual receptive fields
2023 (English)In: Frontiers in Computational Neuroscience, E-ISSN 1662-5188, Vol. 17, p. 1189949-1-1189949-23
Article in journal (Refereed) Published
Abstract [en]
The property of covariance, also referred to as equivariance, means that an image operator is well-behaved under image transformations, in the sense that the result of applying the image operator to a transformed input image gives essentially a similar result as applying the same image transformation to the output of applying the image operator to the original image. This paper presents a theory of geometric covariance properties in vision, developed for a generalized Gaussian derivative model of receptive fields in the primary visual cortex and the lateral geniculate nucleus, which, in turn, enable geometric invariance properties at higher levels in the visual hierarchy.
It is shown how the studied generalized Gaussian derivative model for visual receptive fields obeys true covariance properties under spatial scaling transformations, spatial affine transformations, Galilean transformations and temporal scaling transformations. These covariance properties imply that a vision system, based on image and video measurements in terms of the receptive fields according to the generalized Gaussian derivative model, can, to first order of approximation, handle the image and video deformations between multiple views of objects delimited by smooth surfaces, as well as between multiple views of spatio-temporal events, under varying relative motions between the objects and events in the world and the observer.
We conclude by describing implications of the presented theory for biological vision, regarding connections between the variabilities of the shapes of biological visual receptive fields and the variabilities of spatial and spatio-temporal image structures under natural image transformations. Specifically, we formulate experimentally testable biological hypotheses as well as needs for measuring population statistics of receptive field characteristics, originating from predictions from the presented theory, concerning the extent to which the shapes of the biological receptive fields in the primary visual cortex span the variabilities of spatial and spatio-temporal image structures induced by natural image transformations, based on geometric covariance properties.
Place, publisher, year, edition, pages
Frontiers Media SA , 2023. Vol. 17, p. 1189949-1-1189949-23
Keywords [en]
receptive field, image transformations, scale covariance, affine covariance, Galilean covariance, primary visual cortex, vision, theoretical neuroscience
National Category
Bioinformatics (Computational Biology) Neurosciences Computer graphics and computer vision
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-327330DOI: 10.3389/fncom.2023.1189949ISI: 001016501200001PubMedID: 37398936Scopus ID: 2-s2.0-85164263587OAI: oai:DiVA.org:kth-327330DiVA, id: diva2:1758831
Projects
Covariant and invariant deep networks
Funder
Swedish Research Council, 2018-03586, 2022-02969
Note
Not duplicate with DiVA 1744469 which is a report.
QC 20230529
2023-05-242023-05-242025-02-01Bibliographically approved