kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Covariance properties under natural image transformations for the generalized Gaussian derivative model for visual receptive fields
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Computational Science and Technology (CST). (Computational Brain Science Lab)ORCID iD: 0000-0002-9081-2170
2023 (English)In: Frontiers in Computational Neuroscience, E-ISSN 1662-5188, Vol. 17, p. 1189949-1-1189949-23Article in journal (Refereed) Published
Abstract [en]

The property of covariance, also referred to as equivariance, means that an image operator is well-behaved under image transformations, in the sense that the result of applying the image operator to a transformed input image gives essentially a similar result as applying the same image transformation to the output of applying the image operator to the original image. This paper presents a theory of geometric covariance properties in vision, developed for a generalized Gaussian derivative model of receptive fields in the primary visual cortex and the lateral geniculate nucleus, which, in turn, enable geometric invariance properties at higher levels in the visual hierarchy.

It is shown how the studied generalized Gaussian derivative model for visual receptive fields obeys true covariance properties under spatial scaling transformations, spatial affine transformations, Galilean transformations and temporal scaling transformations. These covariance properties imply that a vision system, based on image and video measurements in terms of the receptive fields according to the generalized Gaussian derivative model, can, to first order of approximation, handle the image and video deformations between multiple views of objects delimited by smooth surfaces, as well as between multiple views of spatio-temporal events, under varying relative motions between the objects and events in the world and the observer.

We conclude by describing implications of the presented theory for biological vision, regarding connections between the variabilities of the shapes of biological visual receptive fields and the variabilities of spatial and spatio-temporal image structures under natural image transformations. Specifically, we formulate experimentally testable biological hypotheses as well as needs for measuring population statistics of receptive field characteristics, originating from predictions from the presented theory, concerning the extent to which the shapes of the biological receptive fields in the primary visual cortex span the variabilities of spatial and spatio-temporal image structures induced by natural image transformations, based on geometric covariance properties.

Place, publisher, year, edition, pages
Frontiers Media SA , 2023. Vol. 17, p. 1189949-1-1189949-23
Keywords [en]
receptive field, image transformations, scale covariance, affine covariance, Galilean covariance, primary visual cortex, vision, theoretical neuroscience
National Category
Bioinformatics (Computational Biology) Neurosciences Computer graphics and computer vision
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-327330DOI: 10.3389/fncom.2023.1189949ISI: 001016501200001PubMedID: 37398936Scopus ID: 2-s2.0-85164263587OAI: oai:DiVA.org:kth-327330DiVA, id: diva2:1758831
Projects
Covariant and invariant deep networks
Funder
Swedish Research Council, 2018-03586, 2022-02969
Note

Not duplicate with DiVA 1744469 which is a report.

QC 20230529

Available from: 2023-05-24 Created: 2023-05-24 Last updated: 2025-02-01Bibliographically approved

Open Access in DiVA

fulltext(2849 kB)100 downloads
File information
File name FULLTEXT02.pdfFile size 2849 kBChecksum SHA-512
2f23b2878544131301db1f8dbb1df9cef0c565dbe1e6b30888f630a8810d84e7bbd4e6c1592271b2f2cf013675623e7582c0924f2157c233222c834f618e4455
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Lindeberg, Tony

Search in DiVA

By author/editor
Lindeberg, Tony
By organisation
Computational Science and Technology (CST)
In the same journal
Frontiers in Computational Neuroscience
Bioinformatics (Computational Biology)NeurosciencesComputer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar
Total: 102 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 1668 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf