ACM Home Page
Please provide us with feedback. Feedback
Analysis of an evolutionary reinforcement learning method in a multiagent domain
Full text PdfPdf (2.12 MB)
Source
International Conference on Autonomous Agents archive
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1 table of contents
Estoril, Portugal
SESSION: Agent and multi-agent learning table of contents
Pages 291-298  
Year of Publication: 2008
ISBN:978-0-9817381-0-9
Authors
Jan Hendrik Metzen  German Research Center for Artificial Intelligence (DFKI), Bremen, Germany
Mark Edgington  University of Bremen, Bremen, Germany
Yohannes Kassahun  University of Bremen, Bremen, Germany
Frank Kirchner  German Research Center for Artificial Intelligence (DFKI), Bremen, Germany
Sponsors
ACM: Association for Computing Machinery
AAAI : Association for the Advancement of Artifical Intelligence
Publisher
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 110,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Many multiagent problems comprise subtasks which can be considered as reinforcement learning (RL) problems. In addition to classical temporal difference methods, evolutionary algorithms are among the most promising approaches for such RL problems. The relative performance of these approaches in certain subdomains (e. g. multiagent learning) of the general RL problem remains an open question at this time. In addition to theoretical analysis, benchmarks are one of the most important tools for comparing different RL methods in certain problem domains. A recently proposed multiagent RL benchmark problem is the RoboCup Keepaway benchmark. This benchmark is one of the most challenging multiagent learning problems because its state-space is continuous and high dimensional, and both the sensors and the actuators are noisy. In this paper we analyze the performance of the neuroevolutionary approach called Evolutionary Acquisition of Neural Topologies (EANT) in the Keepaway benchmark, and compare the results obtained using EANT with the results of other algorithms tested on the same benchmark.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
P. J. Angeline, G. M. Saunders, and J. B. Pollack. An evolutionary algorithm that constructs recurrent neural networks. IEEE Transactions on Neural Networks, 5:54--65, 1994.
 
2
 
3
P. Bentley and S. Kumar. Three ways to grow designs: A comparison of embryogenies for an evolutionary design problem. In Proceedings of the Genetic and Evolutionary Computation Conference, volume 1, pages 35--43, Orlando, Florida, USA, 13--17 July 1999. Morgan Kaufmann.
 
4
J. C. Bongard and R. Pfeifer. Repeated structure and dissociation of genotypic and phenotypic complexity in artificial ontogeny. In Proceedings of the Genetic and Evolutionary Computation Conference, pages 829--836, 2001.
 
5
F. Gomez, J. Schmidhuber, and R. Miikkulainen. Efficient non-linear control through neuroevolution. In Proceedings of the European Conference on Machine Learning, 2006.
 
6
F. Gruau. Neural Network Synthesis Using Cellular Encoding and the Genetic Algorithm. PhD thesis, Ecole Normale Superieure de Lyon, Laboratoire de l'Informatique du Parallelisme, France, January 1994.
 
7
Y. Kassahun. Towards a Unified Approach to Learning and Adaptation. PhD thesis, Technical Report 0602, Institute of Computer Science and Applied Mathematics, Christian-Albrechts University, Kiel, Germany, Feb 2006.
8
 
9
 
10
H. Kitano. Designing neural networks using genetic algorithms with graph generation system. Complex Systems, 4:461--476, 1990.
 
11
A. Lindenmayer. Mathematical models for cellular interactions in development, parts I and II. Journal of Theoretical Biology, 18:280--315, 1968.
 
12
 
13
S. Nolfi and D. Parisi. Growing neural networks. Technical Report PCIA-91-15, Institute of Psychology, Rome, 1991.
14
 
15
 
16
B. Sendhoff and M. Kreutz. Variable encoding of modular neural networks for time series prediction. In Congress on Evolutionary Computation, pages 259--266, 1999.
 
17
N. Siebel, J. Krause, and G. Sommer. Efficient learning of neural networks with evolutionary algorithms. In Proceedings of the 29th German Symposium for Pattern Recognition, pages 466--475, 2007.
 
18
 
19
 
20
P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer Keepaway. Adaptive Behavior, 13(3):165--188, 2005.
 
21
22
 
23
J. Vaario, A. Onitsuka, and K. Shimohara. Formation of neural structures. In Proceedings of the Fourth European Conference on Articial Life, pages 214--223, 1997.
 
24
 
25
 
26
 
27
X. Yao. Evolving artificial neural networks. Proceedings of the IEEE, 87(9):1423--1447, 1999.


Collaborative Colleagues:
Jan Hendrik Metzen: colleagues
Mark Edgington: colleagues
Yohannes Kassahun: colleagues
Frank Kirchner: colleagues