| XCS with computed prediction in multistep environments |
| Full text |
Pdf
(324 KB)
|
| Source
|
Genetic And Evolutionary Computation Conference
archive
Proceedings of the 2005 conference on Genetic and evolutionary computation
table of contents
Washington DC, USA
SESSION: Learning classifier systems and other genetics-based machine learning
table of contents
Pages: 1859 - 1866
Year of Publication: 2005
ISBN:1-59593-010-8
|
|
Authors
|
|
Pier Luca Lanzi
|
Artificial Intelligence and Robotics Laboratory (AIRLab), Milano, Italy and University of Illinois at Urbana Champaign, Urbana, IL
|
|
Daniele Loiacono
|
Artificial Intelligence and Robotics Laboratory (AIRLab), Milano, Italy
|
|
Stewart W. Wilson
|
University of Illinois at Urbana Champaign, Urbana, IL and Prediction Dynamics, Concord, MA
|
|
David E. Goldberg
|
University of Illinois at Urbana Champaign, Urbana, IL
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 22, Citation Count: 4
|
|
|
ABSTRACT
XCSF extends the typical concept of learning classifier systems through the introduction of computed classifier prediction. Initial results show that XCSF's computed prediction can be used to evolve accurate piecewise linear approximations of simple functions. In this paper, we take XCSF one step further and apply it to typical reinforcement learning problems involving delayed rewards. In essence, we use XCSF as a method of generalized (linear) reinforcement learning to evolve piecewise linear approximations of the payoff surfaces of typical multistep problems. Our results show that XCSF can easily evolve optimal and near optimal solutions for problems introduced in the literature to test linear reinforcement learning methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Advances in Neural Information Processing Systems 7, pages 369--376, Cambridge, MA, 1995. The MIT Press.
|
| |
2
|
M. V. Butz and S. W. Wilson. An algorithmic description of xcs. Journal of Soft Computing, 6(3--4):144--153, 2002.
|
| |
3
|
P. L. Lanzi and M. Colombetti. An Extension to the XCS Classifier System for Stochastic Environments. In W. Banzhaf, J. Daida, A. E. Eiben, M. H. Garzon, V. Honavar, M. Jakiela, and R. E. Smith, editors, Proceedings of the Genetic and Evolutionary Computation Conference (GECCO), pages 353--360, Orlando (FL), July 1999. Morgan Kaufmann.
|
| |
4
|
T. J. Perkins and D. Precup. A convergent form of approximate policy iteration. pages 1595--1602, 2003.
|
| |
5
|
R. S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1038--1044. The MIT Press, Cambridge, MA., 1996.
|
| |
6
|
|
| |
7
|
|
| |
8
|
S. Thrun and A. Schwartz. Issues in Using Function Approximation for Reinforcement Learning. 1993.
|
| |
9
|
|
| |
10
|
S. W. Wilson. Classifier Fitness Based on Accuracy. Evolutionary Computation, 3(2):149--175, 1995. http://prediction-dynamics.com/.
|
| |
11
|
|
| |
12
|
|
| |
13
|
S. W. Wilson. Classifier systems for continuous payoff environments. In K. Deb, R. Poli, W. Banzhaf, H.-G. Beyer, E. Burke, P. Darwen, D. Dasgupta, D. Floreano, J. Foster, M. Harman, O. Holland, P. L. Lanzi, L. Spector, A. Tettamanzi, D. Thierens, and A. Tyrrell, editors, Genetic and Evolutionary Computation -- GECCO-2004, Part II, volume 3103 of Lecture Notes in Computer Science, pages 824--835, Seattle, WA, USA, 26-30 June 2004. Springer-Verlag.
|
CITED BY 4
|
|
Pier Luca Lanzi , Daniele Loiacono , Stewart W. Wilson , David E. Goldberg, Classifier prediction based on tile coding, Proceedings of the 8th annual conference on Genetic and evolutionary computation, July 08-12, 2006, Seattle, Washington, USA
|
|
|
|
|
|
Pier Luca Lanzi , Daniele Loiacono , Stewart W. Wilson , David E. Goldberg, Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension, Evolutionary Computation, v.15 n.2, p.133-168, Summer 2007
|
|
|
|
|