ACM Home Page
Please provide us with feedback. Feedback
Fuzzy CMAC with automatic state partition for reinforcementlearning
Full text PdfPdf (1.71 MB)
Source
ACM/SIGEVO Summit on Genetic and Evolutionary Computation archive
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation table of contents
Shanghai, China
SESSION: Full papers table of contents
Pages 421-428  
Year of Publication: 2009
ISBN:978-1-60558-326-6
Authors
Huaqing Min  South China University of Technology, Guangzhou, China
Jiaan Zeng  South China University of Technology, Guangzhou, China
Ronghua Luo  South China University of Technology, Guangzhou, China
Sponsors
SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1543834.1543891
What is a DOI?

ABSTRACT

Most of reinforcement learning (RL) algorithms use value function to seek the optimal policy. In large or even continuous states, function approximation approaches must be used to represent value function. The structures of function approximators influence the learning performance greatly. However, the design of structures relies too much on human designer and inappropriate design can lead to poor performance. In this paper, we propose a novel function approximator called Fuzzy CMAC (FCMAC) with automatic state partition (ASP-FCMAC) to automate the structure design for FCMAC. Based on CMAC (also known as tile coding), ASP-FCMAC employs fuzzy membership function to lower the computation load, and analyzes Bellman error as well as learning weights to partition the state automatically so as to generate the structure of FCMAC. Empirical results in both mountain car and RoboCup Keepaway domains demonstrate that ASP-FCMAC can automatically generate the structure of FCMAC and agent using it can learn efficiently.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Peter Stone, Richard S. Sutton, and Gregory Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165--188, 2005.
 
4
Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu. Keepaway soccer: From machine learning testbed to benchmark. In Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld, and Yasutake Takahashi, editors, RoboCup-2005: Robot Soccer World Cup IX, volume 4020, pages 93--105, Berlin, 2006. Springer Verlag.
 
5
6
 
7
L. Tokarchuk. Fuzzy and tile coding approximation techniques for coevolution in reinforcement learning. Technical report, University of London, 2006.
 
8
Shimon Whiteson, Matthew E. Taylor, and Peter Stone. Adaptive tile coding for value function approximation. Technical Report AI-TR-07-339, University of Texas at Austin, 2007.
 
9
Alexander A. Sherstov and Peter Stone. Function approximation via tile coding: Automating parameter choice. In J.-D. Zucker and I. Saitta, editors, SARA 2005, volume 3607 of Lecture Notes in Artificial Intelligence, pages 194--205, Berlin, 2005. Springer Verlag.
 
10
 
11
 
12
S.F. Su, Z.J. Lee, and Y.P. Wang. Robust and fast learning for fuzzy cerebellar model articulation controllers. IEEE Transactions on Systems, Man, and Cybernetics Part B: Cybernetics, 36(1):203 -- 208, 2006.
13
 
14
Sridhar Mahadevan. Samuel meets amarel: Automating value function approximation using global state space analysis. In Proceedings of AAAI, pages 1000--1005, 2005.
 
15
Ishai Menache, Shie Mannor, and Nahum Shimkin. Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research, 134:215--238, 2005.

Collaborative Colleagues:
Huaqing Min: colleagues
Jiaan Zeng: colleagues
Ronghua Luo: colleagues