Invariance of visual operations at the level of receptive fields
Lindeberg, Tony
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. ORCID iD: 0000-0002-9081-2170
2013 (English). In: PLoS ONE, ISSN 1932-6203, Vol. 8, no 7, e66990-1-e66990-33 p. Article in journal (Refereed), Published
Abstract [en]

The brain is able to maintain a stable perception even though the visual stimuli on the retina vary substantially due to geometric transformations and lighting variations in the environment. This paper presents a theory for achieving basic invariance properties already at the level of receptive fields. Specifically, the presented framework comprises (i) local scaling transformations caused by objects of different size and at different distances to the observer, (ii) locally linearized image deformations caused by variations in the viewing direction in relation to the object, (iii) locally linearized relative motions between the object and the observer and (iv) local multiplicative intensity transformations caused by illumination variations. The receptive field model can be derived by necessity from symmetry properties of the environment and leads to predictions about receptive field profiles in good agreement with receptive field profiles measured by cell recordings in mammalian vision. Indeed, the receptive field profiles in the retina, LGN and V1 are close to ideal with respect to the idealized requirements. By complementing receptive field measurements with selection mechanisms over the parameters in the receptive field families, it is shown how true invariance of receptive field responses can be obtained under scaling transformations, affine transformations and Galilean transformations. Thereby, the framework provides a mathematically well-founded and biologically plausible model for how basic invariance properties can be achieved already at the level of receptive fields and can support invariant recognition of objects and events under variations in viewpoint, retinal size, object motion and illumination.
The theory can explain the different shapes of receptive field profiles found in biological vision, which are tuned to different sizes and orientations in the image domain as well as to different image velocities in space-time, from a requirement that the visual system should be invariant to the natural types of image transformations that occur in its environment.
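
To make two of the invariance claims in the abstract concrete, the following is a minimal 1-D sketch in Python. It is not taken from the paper. It assumes Gaussian derivative kernels as the receptive field family (in line with the "Gaussian Derivative Model" keyword) and a scale-normalized second-derivative response with scale selection by maximization over scales. For the illumination part it additionally assumes that derivatives are computed on a logarithmic brightness scale. All function and variable names are hypothetical.

```python
import numpy as np

def gaussian(x, t):
    """1-D Gaussian with variance t (the scale parameter)."""
    return np.exp(-x**2 / (2.0 * t)) / np.sqrt(2.0 * np.pi * t)

def centre_response(signal, t, order):
    """Gaussian derivative response of the given order at the centre sample."""
    radius = len(signal) // 2
    x = np.arange(-radius, radius + 1, dtype=float)
    if order == 1:
        kernel = (-x / t) * gaussian(x, t)
        # Convolution at the centre flips the kernel; it is odd, hence the sign.
        return -float(np.dot(signal, kernel))
    if order == 2:
        kernel = (x**2 / t**2 - 1.0 / t) * gaussian(x, t)
        return float(np.dot(signal, kernel))  # even kernel, no flip needed
    raise ValueError("only orders 1 and 2 are sketched here")

def select_scale(signal, scales):
    """Assumed selection mechanism: maximize |t * L_xx| over the scale grid."""
    magnitudes = np.array([abs(t * centre_response(signal, t, 2)) for t in scales])
    best = int(np.argmax(magnitudes))
    return float(scales[best]), float(magnitudes[best])

x = np.arange(-400, 401, dtype=float)
scales = np.geomspace(1.0, 4000.0, 400)

# (i) Scaling: the same unit-height blob at two sizes (widths differ by a factor 2).
blob_small = np.exp(-x**2 / (2.0 * 16.0))
blob_large = np.exp(-x**2 / (2.0 * 64.0))
t_small, r_small = select_scale(blob_small, scales)
t_large, r_large = select_scale(blob_large, scales)

# (iv) Illumination: multiply a positive edge-like intensity profile by a constant.
intensity = 1.5 + np.tanh(x / 5.0)
edge_dim = centre_response(np.log(intensity), 20.0, 1)
edge_bright = centre_response(np.log(3.0 * intensity), 20.0, 1)

print("selected scales:", t_small, t_large)
print("responses at selected scales:", r_small, r_large)
print("log-brightness edge responses:", edge_dim, edge_bright)
```

In this toy setting the selected scale follows the size of the blob (for a 1-D Gaussian blob of variance t0 the normalized response peaks at t = 2 t0, up to the resolution of the scale grid), while the response magnitude at the selected scale stays the same for both sizes. The first-derivative response on log brightness is unchanged by the multiplicative factor because the odd kernel sums to zero, so the added constant log 3 contributes nothing.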

Place, publisher, year, edition, pages
2013. Vol. 8, no 7, e66990-1-e66990-33 p.
Keyword [en]
Monkey Inferotemporal Cortex, Gaussian Derivative Model, Inferior Temporal Cortex, Cat Striate Cortex, Object Recognition, Scale-Space, Natural Images, Orientation Selectivity, Response Properties, Size Invariance
National Category
Neurosciences; Computer Vision and Robotics (Autonomous Systems)
URN: urn:nbn:se:kth:diva-124629
DOI: 10.1371/journal.pone.0066990
ISI: 000322391400002
Scopus ID: 2-s2.0-84880399459
OAI: diva2:637630
Swedish Research Council, 2010-4766; Knut and Alice Wallenberg Foundation

QC 20130813

Available from: 2013-07-20. Created: 2013-07-20. Last updated: 2013-09-10. Bibliographically approved

By author/editor
Lindeberg, Tony