ACM Home Page
Please provide us with feedback. Feedback
Classifier prediction based on tile coding
Full text PdfPdf (490 KB)
Source Genetic And Evolutionary Computation Conference archive
Proceedings of the 8th annual conference on Genetic and evolutionary computation table of contents
Seattle, Washington, USA
SESSION: Learning Classifier systems and other genetics-based machine learning: papers table of contents
Pages: 1497 - 1504  
Year of Publication: 2006
ISBN:1-59593-186-4
Authors
Pier Luca Lanzi  Politecnico di Milano, Milano, Italy and University of Illinois at Urbana Champaign, Urbana, IL
Daniele Loiacono  Politecnico di Milano, Milano, Italy
Stewart W. Wilson  University of Illinois at Urbana Champaign, Urbana, IL and Prediction Dynamics, Concord, MA
David E. Goldberg  University of Illinois at Urbana Champaign, Urbana, IL
Sponsors
SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 36,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1143997.1144242
What is a DOI?

ABSTRACT

This paper introduces XCSF extended with tile coding prediction: each classifier implements a tile coding approximator; the genetic algorithm is used to adapt both classifier conditions (i.e., to partition the problem) and the parameters of each approximator; thus XCSF evolves an ensemble of tile coding approximators instead of the typical monolithic approximator used in reinforcement learning. The paper reports a comparison between (i) XCSF with tile coding prediction and (ii) plain tile coding. The results show that XCSF with tile coding always reaches optimal performance, it usually learns as fast as the best parametrized tile coding, and it can be faster than the typical tile coding setting. In addition, the analysis of the evolved tile coding ensembles shows that XCSF actually adapts local approximators following what is currently considered the best strategy to adapt the tile coding parameters in a given problem.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
L. Booker. Approximating value functions in classifier systems. volume 183 of Studies in Fuzziness and Soft Computing, pages 45--61. Springer, 2005.
 
2
J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Advances in Neural Information Processing Systems 7, pages 369--376, Cambridge, MA, 1995. The MIT Press.
 
3
M. V. Butz and S. W. Wilson. An algorithmic description of XCS. Journal of Soft Computing, 6(3--4):144--153, 2002.
 
4
S. A. Glantz and B. K. Slinker. Primer of Applied Regression & Analysis of Variance. McGraw Hill, 2001. second edition.
5
 
6
P. L. Lanzi, D. Loiacono, S. W. Wilson, and D. E. Goldberg. XCS with Computed Prediction for the Learning of Boolean Functions. In Proceedings of the IEEE Congress on Evolutionary Computation -- CEC-2005, Edinburgh, UK, 2005. IEEE.
7
 
8
 
9
S. I. Reynolds. Reinforcement Learning with Exploration. PhD thesis, School of Computer Science. The University of Birmingham, Birmingham, B15 2TT, Dec. 2002.
 
10
A. A. Sherstov and P. Stone. Function approximation via tile coding: Automating parameter choice. In Proc. Symposium on Abstraction, Reformulation, and Approximation (SARA-05), Edinburgh, Scotland, UK, 2005.
 
11
R. S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1038--1044. The MIT Press, Cambridge, MA., 1996.
 
12
 
13
 
14


Collaborative Colleagues:
Pier Luca Lanzi: colleagues
Daniele Loiacono: colleagues
Stewart W. Wilson: colleagues
David E. Goldberg: colleagues