Change search
ReferencesLink to record
Permanent link

Direct link
Invariance of visual operations at the level of receptive fields
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB.ORCID iD: 0000-0002-9081-2170
2013 (English)Conference paper, Poster (Refereed)
Abstract [en]

Receptive field profiles measured by cell recordings have shown that mammalian vision has developed receptive fields tuned to different sizes and orientations in the image domain as well as to different image velocities in space-time [1, 2]. This article presents a theory by which families of idealized receptive field profiles can be derived mathematically from a small set of basic assumptions that correspond to structural properties of the environment [3, 4]. The article also presents a theory for how basic invariance properties to variations in scale, viewing direction and relative motion can be obtained from the output of such receptive fields, using complementary selection mechanisms that operate over the output of families of receptive fields tuned to different parameters [4]. Thereby, the theory shows how basic invariance properties of a visual system can be obtained already at the level of receptive fields, and we can explain the different shapes of receptive field profiles found in biological vision from a requirement that the visual system should be invariant to the natural types of image transformations that occur in its environment.


The brain is able to maintain a stable perception although the visual stimuli vary substantially on the retina due to geometric transformations and lighting variations in the environment. These transformations comprise (i) local scaling transformations caused by objects of different size and at different distances to the observer, (ii) locally linearized image deformations caused by variations in the viewing direction in relation to the object, (iii) locally linearized relative motions between the object and the observer and (iv) local multiplicative intensity transformations caused by illumination variations. Let us assume that receptive fields should be constructed by linear operations that are shift-invariant over space and/or space-time, with an additional requirement that receptive fields must not create new image structures at coarser scales that do not correspond to simplifications of corresponding structures at finer scales.


Given the above structural conditions, we derive idealized families of spatial and spatio-temporal receptive fields that satisfy these structural requirements by necessity, based on Gaussian kernels, Gaussian derivatives or closely related operators [3, 4].  We show that there are very close similarities between the receptive fields predicted from this theory and receptive fields found by cell recordings in biological vision, including (i) spatial on-center-off-surround and off-center-on-surround receptive fields in the fovea and the LGN, (ii) simple cells with spatial directional preference in V1, (iii) space-time separable spatio-temporal receptive fields in the LGN and V1 and (iv) non-separable space-time tilted receptive fields in V1 [3, 4]. Indeed, from kernels predicted by this theory it is possible to generate receptive fields similar to all the basic types of monocular receptive fields reported by DeAngelis et al [2] in their survey of classical receptive fields.

By complementing such receptive field measurements with selection mechanisms over the parameters in the receptive field families, we show how true invariance of receptive field responses can be obtained under scaling transformations, affine transformations and Galilean transformations [4]. Thereby, the framework provides a mathematically well-founded and biologically plausible model for how basic invariance properties can be achieved already at the level of receptive fields. In this way, the presented theory supports invariant recognition of objects and events under variations in viewpoint, retinal size, object motion and illumination.


1. Hubel DH, Wiesel TN: Brain and Visual Perception, Oxford University Press, 2005. 

2. DeAngelis GC, Anzai A: A modern view of the classical receptive field: Linear and non-linear spatio-temporal processing by V1 neurons. The Visual Neurosciences, MIT Press, vol 1, 705-719, 2004.

3. Lindeberg T: Generalized Gaussian scale-space axiomatics comprising linear scale-space, affine scale-space and spatio-temporal scale-space. J Math Imaging Vis, 2011, 40(1):36-81.

4. Lindeberg T: Invariance of visual operations at the level of receptive fields. PLOS One, in press, doi:10.1371/journal.pone.0066990, preprint available from arXiv:1210.0754.

Place, publisher, year, edition, pages
2013. P242- p.
National Category
Neurosciences Bioinformatics (Computational Biology) Computer Vision and Robotics (Autonomous Systems)
URN: urn:nbn:se:kth:diva-124378DOI: 10.1186/1471-2202-14-S1-P242OAI: diva2:634558
CNS 2013: 22nd Annual Computational Neuroscience Meeting, July 13-18, Paris, France. BMC Neuroscience 14(Suppl 1)
Swedish Research Council, 2010-4766

QC 20130702

Available from: 2013-07-01 Created: 2013-07-01 Last updated: 2013-07-23Bibliographically approved

Open Access in DiVA

fulltext(137 kB)239 downloads
File information
File name FULLTEXT01.pdfFile size 137 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Lindeberg, Tony
By organisation
Computational Biology, CB
NeurosciencesBioinformatics (Computational Biology)Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar
Total: 239 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 443 hits
ReferencesLink to record
Permanent link

Direct link