ABSTRACT
This paper presents a Learning Classifier System (LCS) in which each classifier condition is represented by a feed-forward multilayer perceptron (MLP) network. Adaptive behavior is realized through self-adaptive parameters and neural constructivism, giving the system a flexible knowledge representation. The approach allows networks of appropriate complexity to evolve to solve a continuous maze environment using discrete-valued actions, continuous-valued actions, or continuous-valued actions of continuous duration. In each case, the neural LCS is shown to be capable of developing optimal solutions to the reinforcement learning task.