Full-text not available in DiVA
Author:
Elfwing, Stefan (KTH, School of Computer Science and Communication (CSC), Centres, Centre for Autonomous Systems, CAS)
Uchibe, E. (ATR Computational Neuroscience Labs, Japan)
Doya, K. (ATR Computational Neuroscience Labs, Japan)
Christensen, Henrik (KTH, School of Computer Science and Communication (CSC), Centres, Centre for Autonomous Systems, CAS)
Title:
Multi-Agent Reinforcement Learning: Using Macro Actions to Learn a Mating Task
Department:
KTH, School of Computer Science and Communication (CSC), Centres, Centre for Autonomous Systems, CAS
Publication type:
Conference paper (Refereed)
Language:
English
In:
2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); Sendai
Conference:
2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); Sendai; 28 Sept. - 2 Oct. 2004
Pages:
3164-3169
Year of publ.:
2004
URI:
urn:nbn:se:kth:diva-7565
Permanent link:
http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-7565
ISBN:
0780384636
Subject category:
Computer Science
SVEP category:
Computer science
Keywords(en) :
Approximation theory; Autonomous agents; Computer simulation; Mathematical models; Multi agent systems; Parameter estimation; Problem solving
Abstract(en) :

Standard reinforcement learning methods are inefficient and often inadequate for learning cooperative multi-agent tasks. For these kinds of tasks the behavior of one agent strongly depends on dynamic interaction with other agents, not only with the interaction with a static environment as in standard reinforcement learning. The success of the learning is therefore coupled to the agents' ability to predict the other agents' behaviors. In this study we try to overcome this problem by adding a few simple macro actions, actions that are extended in time for more than one time step. The macro actions improve the learning by making search of the state space more effective and thereby making the behavior more predictable for the other agent. In this study we have considered a cooperative mating task, which is the first step towards our aim to perform embodied evolution, where the evolutionary selection process is an integrated part of the task. We show, in simulation and hardware, that in the case of learning without macro actions, the agents fail to learn a meaningful behavior. In contrast, for the learning with macro action the agents learn a good mating behavior in reasonable time, in both simulation and hardware.

Note:
QC 20100706
Available from:
2007-10-23
Created:
2007-10-23
Last updated:
2010-07-06
Statistics:
19 hits