Change search
ReferencesLink to record
Permanent link

Direct link
Human Action Recognition Based on Linear and Non-linear Dimensionality Reduction using PCA and ISOMAP
KTH, School of Electrical Engineering (EES), Automatic Control.
2006 (English)Student paper other, 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Understanding and interpreting dynamic human actions is an important area of research in the field of computer vision and robotics. In robotics, it is closely related to task programming. Traditionally, robot task programming has required an experienced programmer and tedious work. By contrast, Programming by Demonstration is an intuitive method that allows to program a robot in a very flexible way. The programmer demonstrates or shows how a particular task is performed and the robot learns in an efficient and natural manner how to imitate or reproduce the human actions. Here, we develop a general policy for learning the relevant features of a demonstrated activity and we restrict our study to imitation of object manipulation activities. A Nest of Birds magnetic tracker is used for activity recognition and two different dimensionality reduction techniques are applied.

The first one uses linear dimensionality reduction in order to find the underlying structure of the data. Particularly, Principal Component Analysis (PCA) is used to learn a set of principal components (PCs) to characterize the data. The main problem using PCA is that linear PCs cannot represent the non-linear nature of human motion. The second method uses a non-linear dimensionality reduction technique. Specifically, spatio-temporal Isomap is applied to uncover the intrinsic non-linear geometry of the data, and it is captured through computing the geodesic manifold distances between all pairs of data points.

For classification purposes, both PCA and ST-Isomap can be viewed as a preprocessing step. When the dimensionality of the input data is so high that becomes intractable, most classification methods will suffer and even fail in their goals due to their sensitivity to the input data dimensionality. Fortunately, high dimensional data often represent phenomena that are intrinsically low dimensional. Thus, the problem of high dimensional data classification can be solved by first mapping the original data into a lower dimensional space using a dimensionality reduction method such as PCA or ST-Isomap and then applying K-nearest neighbors (K-NN), radial basis functions (RBF) or any other classification method to classify of the query sequence.

In the first stage of our work, PCA combined with k-means clustering is applied.

In the second stage of our work, spatio temporal Isomap (ST-Isomap) combined with Shepard’s interpolation is applied.

For classification purposes, simple Euclidean distances are used.

The experimental evaluation shows that a linear dimensionality reduction technique can not find the intrinsic structure of human motions due to their non-linear nature. In contrary, a non-linear one, such as spatio-temporal Isomap is able to uncover a low dimensional space in which the data lies facilitating the classification step in a much better way than PCA.

Place, publisher, year, edition, pages
2006. , 163 p.
National Category
Control Engineering
URN: urn:nbn:se:kth:diva-107500OAI: diva2:576309
Available from: 2012-12-13 Created: 2012-12-12 Last updated: 2012-12-13Bibliographically approved

Open Access in DiVA

fulltext(1781 kB)188 downloads
File information
File name FULLTEXT01.pdfFile size 1781 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Automatic Control
Control Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 188 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 74 hits
ReferencesLink to record
Permanent link

Direct link