| Classifier prediction based on tile coding |
| Full text |
Pdf
(490 KB)
|
| Source
|
Genetic And Evolutionary Computation Conference
archive
Proceedings of the 8th annual conference on Genetic and evolutionary computation
table of contents
Seattle, Washington, USA
SESSION: Learning Classifier systems and other genetics-based machine learning: papers
table of contents
Pages: 1497 - 1504
Year of Publication: 2006
ISBN:1-59593-186-4
|
|
Authors
|
|
Pier Luca Lanzi
|
Politecnico di Milano, Milano, Italy and University of Illinois at Urbana Champaign, Urbana, IL
|
|
Daniele Loiacono
|
Politecnico di Milano, Milano, Italy
|
|
Stewart W. Wilson
|
University of Illinois at Urbana Champaign, Urbana, IL and Prediction Dynamics, Concord, MA
|
|
David E. Goldberg
|
University of Illinois at Urbana Champaign, Urbana, IL
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 12, Downloads (12 Months): 47, Citation Count: 7
|
|
|
ABSTRACT
This paper introduces XCSF extended with tile coding prediction: each classifier implements a tile coding approximator; the genetic algorithm is used to adapt both classifier conditions (i.e., to partition the problem) and the parameters of each approximator; thus XCSF evolves an ensemble of tile coding approximators instead of the typical monolithic approximator used in reinforcement learning. The paper reports a comparison between (i) XCSF with tile coding prediction and (ii) plain tile coding. The results show that XCSF with tile coding always reaches optimal performance, it usually learns as fast as the best parametrized tile coding, and it can be faster than the typical tile coding setting. In addition, the analysis of the evolved tile coding ensembles shows that XCSF actually adapts local approximators following what is currently considered the best strategy to adapt the tile coding parameters in a given problem.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
L. Booker. Approximating value functions in classifier systems. volume 183 of Studies in Fuzziness and Soft Computing, pages 45--61. Springer, 2005.
|
| |
2
|
J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Advances in Neural Information Processing Systems 7, pages 369--376, Cambridge, MA, 1995. The MIT Press.
|
| |
3
|
M. V. Butz and S. W. Wilson. An algorithmic description of XCS. Journal of Soft Computing, 6(3--4):144--153, 2002.
|
| |
4
|
S. A. Glantz and B. K. Slinker. Primer of Applied Regression & Analysis of Variance. McGraw Hill, 2001. second edition.
|
 |
5
|
|
| |
6
|
P. L. Lanzi, D. Loiacono, S. W. Wilson, and D. E. Goldberg. XCS with Computed Prediction for the Learning of Boolean Functions. In Proceedings of the IEEE Congress on Evolutionary Computation -- CEC-2005, Edinburgh, UK, 2005. IEEE.
|
 |
7
|
|
| |
8
|
|
| |
9
|
S. I. Reynolds. Reinforcement Learning with Exploration. PhD thesis, School of Computer Science. The University of Birmingham, Birmingham, B15 2TT, Dec. 2002.
|
| |
10
|
A. A. Sherstov and P. Stone. Function approximation via tile coding: Automating parameter choice. In Proc. Symposium on Abstraction, Reformulation, and Approximation (SARA-05), Edinburgh, Scotland, UK, 2005.
|
| |
11
|
R. S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1038--1044. The MIT Press, Cambridge, MA., 1996.
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
|