Change search
ReferencesLink to record
Permanent link

Direct link
Enhancing Object RecognitionBased on Contextual InformationUsing Markov Logic Networks
KTH, School of Computer Science and Communication (CSC), Computer Vision and Active Perception, CVAP.
2016 (English)Independent thesis Advanced level (degree of Master (Two Years)), 80 credits / 120 HE creditsStudent thesis
Abstract [en]

Perception is a crucial part of an autonomous robotic system, as it processes sensory input and extracts useful information for action planning and execution such as recognizing objects in the environment where the robot is to act. Although object recognition has been studied extensively and a lot of progress has been made, current systems often face difficulties in dealing with ambiguities and uncertainties in the raw sensory data. It has been shown that using contextual information can reduce these ambiguities, however many different types of context measures exist and it is not always clear which type is the most effective for classification purposes.In this thesis we study how and to what extentMarkov Logic Networks (MLN) can be used to increase robustness in object classification by making use of context. MLNs consist of a combination of first-order logic and Markov Random fields, allowing for a solid framework for defining soft and hard rules that can be used efficiently in classification. Structure learning methods for MLNs allow for automatic improvement of the structure as well as flexibility when expanding the classification space. Therefore, it was of particular interest to study how the learning of the structure of an MLN performed against a manually constructed counterpart. We propose spatial relations, e.g.,’isRightTo’, ’isLeftTo’ and ’isAbove’, as a measure of context in order to reduce classification errors of items in various household scenes. Our experimental evaluations start with a comparison with a commonly used probabilistic classifier, the NaiveBayes classifier. Furthermore, we use a publicly available dataset to compare structure learning with a state-of-the-art system which uses MLNs with a manually designed structure. In addition, we test our approach with and without spatial relations on this dataset. Overall, the results show that MLNs outperform conventional classification algorithms and that spatial relations and structure learning increase the classification accuracy.

Place, publisher, year, edition, pages
National Category
Computer and Information Science
URN: urn:nbn:se:kth:diva-190015OAI: diva2:950444
Educational program
Master of Science in Engineering - Electrical Engineering
2016-06-20, 13:00 (English)
Available from: 2016-08-18 Created: 2016-07-29 Last updated: 2016-08-18Bibliographically approved

Open Access in DiVA

fulltext(2968 kB)6 downloads
File information
File name FULLTEXT01.pdfFile size 2968 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Computer Vision and Active Perception, CVAP
Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 6 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 16 hits
ReferencesLink to record
Permanent link

Direct link