ABSTRACT
Many multiagent problems comprise subtasks that can be treated as reinforcement learning (RL) problems. Alongside classical temporal difference methods, evolutionary algorithms are among the most promising approaches for such RL problems. The relative performance of these approaches in certain subdomains of the general RL problem (e.g., multiagent learning) remains an open question. In addition to theoretical analysis, benchmarks are one of the most important tools for comparing different RL methods in a given problem domain. A recently proposed multiagent RL benchmark problem is the RoboCup Keepaway benchmark. This benchmark is one of the most challenging multiagent learning problems because its state space is continuous and high-dimensional, and both the sensors and the actuators are noisy. In this paper we analyze the performance of the neuroevolutionary approach called Evolutionary Acquisition of Neural Topologies (EANT) in the Keepaway benchmark, and compare the results obtained using EANT with the results of other algorithms tested on the same benchmark.