Change search
ReferencesLink to record
Permanent link

Direct link
Scale-Space Theory: A Basic Tool for Analysing Structures at Different Scales
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB.ORCID iD: 0000-0002-9081-2170
1994 (English)In: Journal of Applied Statistics, ISSN 0266-4763, E-ISSN 1360-0532, Vol. 21, 225-270 p.Article in journal (Refereed) Published
Abstract [en]

An inherent property of objects in the world is that they only exist as meaningful entities over certain ranges of scale. If one aims at describing the structure of unknown real-world signals, then a multi-scale representation of data is of crucial importance.

This article gives a tutorial review of a special type of multi-scale representation, linear scale-space representation, which has been developed by the computer vision community in order to handle image structures at different scales in a consistent manner. The basic idea is to embed the original signal into a one-parameter family of gradually smoothed signals, in which the fine scale details are successively suppressed.

Under rather general conditions on the type of computations that are to performed at the first stages of visual processing, in what can be termed the visual front end, it can be shown that the Gaussian kernel and its derivatives are singled out as the only possible smoothing kernels. The conditions that specify the Gaussian kernel are, basically, linearity and shift-invariance combined with different ways of formalizing the notion that structures at coarse scales should correspond to simplifications of corresponding structures at fine scales --- they should not be accidental phenomena created by the smoothing method. Notably, several different ways of choosing scale-space axioms give rise to the same conclusion.

The output from the scale-space representation can be used for a variety of early visual tasks; operations like feature detection, feature classification and shape computation can be expressed directly in terms of (possibly non-linear) combinations of Gaussian derivatives at multiple scales. In this sense, the scale-space representation can serve as a basis for early vision.

During the last few decades a number of other approaches to multi-scale representations have been developed, which are more or less related to scale-space theory, notably the theories of pyramids, wavelets and multi-grid methods. Despite their qualitative differences, the increasing popularity of each of these approaches indicates that the crucial notion of scaleis increasingly appreciated by the computer vision community and by researchers in other related fields.

An interesting similarity with biological vision is that the scale-space operators closely resemble receptive field profiles registered in neurophysiological studies of the mammalian retina and visual cortex.

Place, publisher, year, edition, pages
1994. Vol. 21, 225-270 p.
National Category
Computer Vision and Robotics (Autonomous Systems)
URN: urn:nbn:se:kth:diva-40216DOI: 10.1080/757582976OAI: diva2:457189

QC 20111117

Available from: 2013-04-19 Created: 2011-09-13 Last updated: 2013-04-19Bibliographically approved

Open Access in DiVA

fulltext(911 kB)6799 downloads
File information
File name FULLTEXT01.pdfFile size 911 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full textAt author's home page

Search in DiVA

By author/editor
Lindeberg, Tony
By organisation
Computational Biology, CB
In the same journal
Journal of Applied Statistics
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar
Total: 6799 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 459 hits
ReferencesLink to record
Permanent link

Direct link