ACM Home Page
Please provide us with feedback. Feedback
XCS with computed prediction in multistep environments
Full text PdfPdf (324 KB)
Source Genetic And Evolutionary Computation Conference archive
Proceedings of the 2005 conference on Genetic and evolutionary computation table of contents
Washington DC, USA
SESSION: Learning classifier systems and other genetics-based machine learning table of contents
Pages: 1859 - 1866  
Year of Publication: 2005
ISBN:1-59593-010-8
Authors
Pier Luca Lanzi  Artificial Intelligence and Robotics Laboratory (AIRLab), Milano, Italy and University of Illinois at Urbana Champaign, Urbana, IL
Daniele Loiacono  Artificial Intelligence and Robotics Laboratory (AIRLab), Milano, Italy
Stewart W. Wilson  University of Illinois at Urbana Champaign, Urbana, IL and Prediction Dynamics, Concord, MA
David E. Goldberg  University of Illinois at Urbana Champaign, Urbana, IL
Sponsors
SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 22,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1068009.1068323
What is a DOI?

ABSTRACT

XCSF extends the typical concept of learning classifier systems through the introduction of computed classifier prediction. Initial results show that XCSF's computed prediction can be used to evolve accurate piecewise linear approximations of simple functions. In this paper, we take XCSF one step further and apply it to typical reinforcement learning problems involving delayed rewards. In essence, we use XCSF as a method of generalized (linear) reinforcement learning to evolve piecewise linear approximations of the payoff surfaces of typical multistep problems. Our results show that XCSF can easily evolve optimal and near optimal solutions for problems introduced in the literature to test linear reinforcement learning methods.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Advances in Neural Information Processing Systems 7, pages 369--376, Cambridge, MA, 1995. The MIT Press.
 
2
M. V. Butz and S. W. Wilson. An algorithmic description of xcs. Journal of Soft Computing, 6(3--4):144--153, 2002.
 
3
P. L. Lanzi and M. Colombetti. An Extension to the XCS Classifier System for Stochastic Environments. In W. Banzhaf, J. Daida, A. E. Eiben, M. H. Garzon, V. Honavar, M. Jakiela, and R. E. Smith, editors, Proceedings of the Genetic and Evolutionary Computation Conference (GECCO), pages 353--360, Orlando (FL), July 1999. Morgan Kaufmann.
 
4
T. J. Perkins and D. Precup. A convergent form of approximate policy iteration. pages 1595--1602, 2003.
 
5
R. S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1038--1044. The MIT Press, Cambridge, MA., 1996.
 
6
 
7
 
8
S. Thrun and A. Schwartz. Issues in Using Function Approximation for Reinforcement Learning. 1993.
 
9
 
10
S. W. Wilson. Classifier Fitness Based on Accuracy. Evolutionary Computation, 3(2):149--175, 1995. http://prediction-dynamics.com/.
 
11
 
12
 
13
S. W. Wilson. Classifier systems for continuous payoff environments. In K. Deb, R. Poli, W. Banzhaf, H.-G. Beyer, E. Burke, P. Darwen, D. Dasgupta, D. Floreano, J. Foster, M. Harman, O. Holland, P. L. Lanzi, L. Spector, A. Tettamanzi, D. Thierens, and A. Tyrrell, editors, Genetic and Evolutionary Computation -- GECCO-2004, Part II, volume 3103 of Lecture Notes in Computer Science, pages 824--835, Seattle, WA, USA, 26-30 June 2004. Springer-Verlag.


Collaborative Colleagues:
Pier Luca Lanzi: colleagues
Daniele Loiacono: colleagues
Stewart W. Wilson: colleagues
David E. Goldberg: colleagues