Exploiting structure in man-made environments
2012 (English)Doctoral thesis, monograph (Other academic)
Robots are envisioned to take on jobs that are dirty, dangerous and dull, the three D's of robotics. With this mission, robotic technology today is ubiquitous on the factory floor. However, the same level of success has not occurred when it comes to robots that operate in everyday living spaces, such as homes and offices.
A big part of this is attributed to domestic environments being complex and unstructured as opposed to factory settings which can be set up and precisely known in advance. In this thesis we challenge the point of view which regards man-made environments as unstructured and that robots should operate without prior assumptions about the world. Instead, we argue that robots should make use of the inherent structure of everyday living spaces across various scales and applications, in the form of contextual and prior information, and that doing so can improve the performance of robotic tasks.
To investigate this premise, we start by attempting to solve a hard and realistic problem, active visual search. The particular scenario considered is that of a mobile robot tasked with finding an object on an entire unexplored building floor. We show that a search strategy which exploits the structure of indoor environments offers significant improvements on state of the art and is comparable to humans in terms of search performance. Based on the work on active visual search, we present two specific ways of making use of the structure of space. First, we propose to use the local 3D geometry as a strong indicator of objects in indoor scenes. By learning a 3D context model for various object categories, we demonstrate a method that can reliably predict the location of objects. Second, we turn our attention to predicting what lies in the unexplored part of the environment at the scale of rooms and building floors. By analyzing a large dataset, we propose that indoor environments can be thought of as being composed out of frequently occurring functional subparts. Utilizing these, we present a method that can make informed predictions about the unknown part of a given indoor environment.
The ideas presented in this thesis explore various sides of the same idea: modeling and exploiting the structure inherent in indoor environments for the sake of improving robot's performance on various applications. We believe that in addition to contributing some answers, the work presented in this thesis will generate additional, fruitful questions.
Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2012. , vi, 103 p.
Trita-CSC-A, ISSN 1653-5723 ; 2012:14
robotics, mapping, computer vision
IdentifiersURN: urn:nbn:se:kth:diva-104410ISBN: 978-91-7501-549-1OAI: oai:DiVA.org:kth-104410DiVA: diva2:564466
2012-11-20, F3, Lindstedtsvägen 26, KTH, Stockholm, 10:00 (English)
Duckett, Tom, Professor
Jensfelt, Patric, Ass. Proffessor
FunderEU, FP7, Seventh Framework Programme, ICT-215181
QC 201211052012-11-052012-11-012012-11-05Bibliographically approved