| Learning classifier system equivalent with reinforcement learning with function approximation |
| Full text |
Pdf
(57 KB)
|
| Source
|
Genetic And Evolutionary Computation Conference
archive
Proceedings of the 2005 workshops on Genetic and evolutionary computation
table of contents
Washington, D.C.
SESSION: IWLCS contributions
table of contents
Pages: 92 - 93
Year of Publication: 2005
|
|
Authors
|
|
Atsushi Wada
|
ATR NIS, Hikaridai, Seika-cho, Soraku-gun, Kyoto, Japan
|
|
Keiki Takadama
|
Tokyo Institute of Technology, Nagatsuta-cho, Midori-ku, Kanagawa, Japan
|
|
Katsunori Shimohara
|
ATR NIS, Hikaridai, Seika-cho, Soraku-gun, Kyoto, Japan
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 25, Citation Count: 1
|
|
|
ABSTRACT
We present an experimental comparison of the reinforcement process between Learning Classifier System (LCS) and Reinforcement Learning (RL) with function approximation (FA) method, regarding their generalization mechanisms. To validate our previous theoretical analysis that derived equivalence of reinforcement process between LCS and RL, we introduce a simple test environment named Gridworld, which can be applied to both LCS and RL with three different classes of generalization: (1) tabular representation; (2) state aggregation; and (3) linear approximation. From the simulation experiments comparing LCS with its GA-inactivated and corresponding RL method, all the cases regarding the class of generalization showed identical results with the criteria of performance and temporal difference (TD) error, thereby verifying the equivalence predicted from the theory.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Butz, M., Kovacs, T., Lanzi, P. L., Wilson, S. W.: Toward a theory of generalization and learning in xcs. IEEE Transactions on Evolutionary Computation 8 (2004) 28--46
|
| |
2
|
M. V. Butz, D. E. Goldberg, and P. L. Lanzi. Bounding learning time in XCS. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004), 2004.
|
| |
3
|
|
| |
4
|
P. L. Lanzi. Learning classifier systems from a reinforcement learning perspective. Soft Computing, 6:162--170, 2002.
|
| |
5
|
|
| |
6
|
S. W. Wilson. ZCS: A zeroth level classifier system. Evolutionary Computation, 2(1):1--18, 1994.
|
| |
7
|
S. W. Wilson. Classifier fitness based on accuracy. Evolutionary Computation, 3(2):149--175, 1995.
|
|