Change search
ReferencesLink to record
Permanent link

Direct link
Representation and Learning for Robotic Grasping, Caging, and Planning
KTH, School of Computer Science and Communication (CSC), Computer Vision and Active Perception, CVAP.
2016 (English)Doctoral thesis, monograph (Other academic)
Abstract [en]

Robots need to grasp, handle, and manipulate objects, navigate their environment, and understand the state of the world around them. Like all artificial intelligence agents, they have to make predictions, formulate goals, reason about actions, and make plans. Expressive, informative, and compact representations of their state, task, or environment are therefore essential, because they allow us to address these problems by computational means. To create suitable representations, we need to consider the agent’s goals, means or resources, external performance requirements, and have to decide what is relevant to the task.

This thesis investigates the construction, learning, and application of representations in different robotic scenarios. We study representations and algorithms for agents that have the goal to reliably grasp an object, prevent an object from escaping by caging, or learn a model of their interaction with the environment to be able to plan actions and follow the state of the world. Each of the scenarios considers different aspects of representation: Efficient computation and optimization, tractable reasoning, relating different parameterizations, or autonomous learning and execution of behavior under uncertainty.

For the grasping agent, we introduce an embedding space that allows us to associate contact locations with hand postures and derive a hierarchical representation of object surfaces which together give rise to an efficient fingertip grasp synthesis algorithm. For the caging agent, we only consider objects with holes through their body which allows us to focus on caging configurations that mechanically interlock objects and hands similar to links of a chain. Further, we change from a geometric to a topology- based representation which allows us to construct caging configurations by control-based optimization and sampling-based search. For the learning agent, we consider the environment and robot as a dynamical system and learn predictive state representations that are directly based on observable data. We demonstrate two contrasting methods to influence the resulting model. For an in-hand manipulation task, we consider training sequences as strings of symbols and introduce feature functions that integrate both actions and observations to reduce state ambiguity. For a simulated visual navigation task, we learn a feature embedding with prior information and training labels to enhance model interpretability while at the same time improving planning performance. 

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2016. , 358 p.
Series
TRITA-CSC-A, ISSN 1653-5723 ; 2016:16
Keyword [en]
Grasping, Caging, Planning, State Representation, Optimization, Topology, Manipulation, Reinforcement Learning
National Category
Computer Vision and Robotics (Autonomous Systems)
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-186424ISBN: 978-91-7729-000-1OAI: oai:DiVA.org:kth-186424DiVA: diva2:927290
Public defence
2016-06-14, F3, F3 Lindstedtsvägen 26, KTH Campus, Stockholm, 13:00 (English)
Opponent
Supervisors
Note

QC 20160518

Available from: 2016-05-18 Created: 2016-05-11 Last updated: 2016-07-08Bibliographically approved

Open Access in DiVA

fulltext(68304 kB)10 downloads
File information
File name FULLTEXT01.pdfFile size 68304 kBChecksum SHA-512
9b984bcbc077aae3af5362574228d3fdc250cbb129ee86f81cbc02ab176f7ccf9c5ada59383dc3962a10afe47bc44927c5f50d2c8bec716664c0695f9aed94a4
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Stork, Johannes Andreas
By organisation
Computer Vision and Active Perception, CVAP
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar
Total: 10 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 102 hits
ReferencesLink to record
Permanent link

Direct link