| Fuzzy CMAC with automatic state partition for reinforcementlearning |
| Full text |
Pdf
(1.71 MB)
|
Source
|
ACM/SIGEVO Summit on Genetic and Evolutionary Computation
archive
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
table of contents
Shanghai, China
SESSION: Full papers
table of contents
Pages 421-428
Year of Publication: 2009
ISBN:978-1-60558-326-6
|
|
Authors
|
|
Huaqing Min
|
South China University of Technology, Guangzhou, China
|
|
Jiaan Zeng
|
South China University of Technology, Guangzhou, China
|
|
Ronghua Luo
|
South China University of Technology, Guangzhou, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Citation Count: 0
|
|
|
ABSTRACT
Most of reinforcement learning (RL) algorithms use value function to seek the optimal policy. In large or even continuous states, function approximation approaches must be used to represent value function. The structures of function approximators influence the learning performance greatly. However, the design of structures relies too much on human designer and inappropriate design can lead to poor performance. In this paper, we propose a novel function approximator called Fuzzy CMAC (FCMAC) with automatic state partition (ASP-FCMAC) to automate the structure design for FCMAC. Based on CMAC (also known as tile coding), ASP-FCMAC employs fuzzy membership function to lower the computation load, and analyzes Bellman error as well as learning weights to partition the state automatically so as to generate the structure of FCMAC. Empirical results in both mountain car and RoboCup Keepaway domains demonstrate that ASP-FCMAC can automatically generate the structure of FCMAC and agent using it can learn efficiently.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Peter Stone, Richard S. Sutton, and Gregory Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165--188, 2005.
|
| |
4
|
Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu. Keepaway soccer: From machine learning testbed to benchmark. In Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld, and Yasutake Takahashi, editors, RoboCup-2005: Robot Soccer World Cup IX, volume 4020, pages 93--105, Berlin, 2006. Springer Verlag.
|
| |
5
|
|
 |
6
|
|
| |
7
|
L. Tokarchuk. Fuzzy and tile coding approximation techniques for coevolution in reinforcement learning. Technical report, University of London, 2006.
|
| |
8
|
Shimon Whiteson, Matthew E. Taylor, and Peter Stone. Adaptive tile coding for value function approximation. Technical Report AI-TR-07-339, University of Texas at Austin, 2007.
|
| |
9
|
Alexander A. Sherstov and Peter Stone. Function approximation via tile coding: Automating parameter choice. In J.-D. Zucker and I. Saitta, editors, SARA 2005, volume 3607 of Lecture Notes in Artificial Intelligence, pages 194--205, Berlin, 2005. Springer Verlag.
|
| |
10
|
|
| |
11
|
|
| |
12
|
S.F. Su, Z.J. Lee, and Y.P. Wang. Robust and fast learning for fuzzy cerebellar model articulation controllers. IEEE Transactions on Systems, Man, and Cybernetics Part B: Cybernetics, 36(1):203 -- 208, 2006.
|
 |
13
|
Pier Luca Lanzi , Daniele Loiacono , Stewart W. Wilson , David E. Goldberg, Classifier prediction based on tile coding, Proceedings of the 8th annual conference on Genetic and evolutionary computation, July 08-12, 2006, Seattle, Washington, USA
[doi> 10.1145/1143997.1144242]
|
| |
14
|
Sridhar Mahadevan. Samuel meets amarel: Automating value function approximation using global state space analysis. In Proceedings of AAAI, pages 1000--1005, 2005.
|
| |
15
|
Ishai Menache, Shie Mannor, and Nahum Shimkin. Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research, 134:215--238, 2005.
|
|