Change search
ReferencesLink to record
Permanent link

Direct link
Interactive Perception: From Scenes to Objects
KTH, School of Computer Science and Communication (CSC), Computer Vision and Active Perception, CVAP. KTH, School of Computer Science and Communication (CSC), Centres, Centre for Autonomous Systems, CAS.
2012 (English)Doctoral thesis, monograph (Other academic)
Abstract [en]

This thesis builds on the observation that robots, like humans, do not have enough experience to handle all situations from the start. Therefore they need tools to cope with new situations, unknown scenes and unknown objects. In particular, this thesis addresses objects. How can a robot realize what objects are if it looks at a scene and has no knowledge about objects? How can it recover from situations where its hypotheses about what it sees are wrong? Even if it has built up experience in form of learned objects, there will be situations where it will be uncertain or mistaken, and will therefore still need the ability to correct errors. Much of our daily lives involves interactions with objects, and the same will be true robots existing among us. Apart from being able to identify individual objects, the robot will therefore need to manipulate them.

Throughout the thesis, different aspects of how to deal with these questions is addressed. The focus is on the problem of a robot automatically partitioning a scene into its constituting objects. It is assumed that the robot does not know about specific objects, and is therefore considered inexperienced. Instead a method is proposed that generates object hypotheses given visual input, and then enables the robot to recover from erroneous hypotheses. This is done by the robot drawing from a human's experience, as well as by enabling it to interact with the scene itself and monitoring if the observed changes are in line with its current beliefs about the scene's structure.

Furthermore, the task of object manipulation for unknown objects is explored. This is also used as a motivation why the scene partitioning problem is essential to solve. Finally aspects of monitoring the outcome of a manipulation is investigated by observing the evolution of flexible objects in both static and dynamic scenes. All methods that were developed for this thesis have been tested and evaluated on real robotic platforms. These evaluations show the importance of having a system capable of recovering from errors and that the robot can take advantage of human experience using just simple commands.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2012. , viii, 135 p.
Trita-CSC-A, ISSN 1653-5723 ; 2012:11
National Category
Computer Vision and Robotics (Autonomous Systems)
URN: urn:nbn:se:kth:diva-103061ISBN: 978-91-7501-495-1OAI: diva2:559818
Public defence
2012-10-29, F3, Lindstedsvägen 26, KTH, Stockholm, 10:00 (English)
ICT - The Next Generation

QC 20121011

Available from: 2012-10-11 Created: 2012-10-02 Last updated: 2013-04-15Bibliographically approved

Open Access in DiVA

thesis_niklas_bergstrom(9796 kB)645 downloads
File information
File name FULLTEXT01.pdfFile size 9796 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Bergström, Niklas
By organisation
Computer Vision and Active Perception, CVAPCentre for Autonomous Systems, CAS
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar
Total: 645 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 357 hits
ReferencesLink to record
Permanent link

Direct link