| Recursive least squares and quadratic prediction in continuous multistep problems |
| Full text |
Pdf
(6.49 MB)
|
Source
|
Genetic And Evolutionary Computation Conference
archive
Proceedings of the 2008 GECCO conference companion on Genetic and evolutionary computation
table of contents
Atlanta, GA, USA
WORKSHOP SESSION: Learning classifier systems
table of contents
Pages 1985-1992
Year of Publication: 2008
ISBN:978-1-60558-131-6
|
|
Authors
|
|
Daniele Loiacono
|
Politecnico di Milano, Milan, Italy
|
|
Pier Luca Lanzi
|
Politecnico di Milano, Milan, Italy and University of Illinois at Urbana Champaign, Urbana, IL
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 36, Citation Count: 0
|
|
|
ABSTRACT
XCS with computed prediction, namely XCSF, has been recently extended in several ways. In particular, a novel prediction update algorithm based on recursive least squares and the extension to polynomial prediction led to significant improvements of XCSF. However, these extensions have been studied so far only on single step problems and it is currently not clear if these findings might be extended also to multistep problems. In this paper we investigate this issue by analyzing the performance of XCSF with recursive least squares and with quadratic prediction on continuous multistep problems. Our results show that both these extensions improve the convergence speed of XCSF toward an optimal performance. As showed by the analysis reported in this paper, these improvements are due to the capabilities of recursive least squares and of polynomial prediction to provide a more accurate approximation of the problem value function after the first few learning problems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Justin A. Boyan and Andrew W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In Advances in Neural Information Processing Systems 7, pages 369--376, Cambridge, MA, 1995. The MIT Press.
|
| |
2
|
Martin V. Butz and Martin Pelikan. Analyzing the evolutionary pressures in XCS. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), pages 935--942, San Francisco, California, USA, 7-11 July 2001. Morgan Kaufmann.
|
| |
3
|
Martin V. Butz and Stewart W. Wilson. An algorithmic description of XCS. Journal of Soft Computing, 6(3-4):144--153, 2002.
|
| |
4
|
Graham C. Goodwin and Kwai Sang Sin. Adaptive Filtering: Prediction and Control. Prentice-Hall information and system sciences series, March 1984.
|
| |
5
|
|
| |
6
|
P. L. Lanzi and D. Loiacono. XCSF with neural prediction. In Evolutionary Computation, 2006. CEC 2006. IEEE Congress on, pages 2270--2276, 2006.
|
 |
7
|
|
 |
8
|
|
 |
9
|
|
 |
10
|
|
| |
11
|
Pier Luca Lanzi , Daniele Loiacono , Stewart W. Wilson , David E. Goldberg, Generalization in the XCSF Classifier System: Analysis, Improvement, and Extension, Evolutionary Computation, v.15 n.2, p.133-168, Summer 2007
[doi> 10.1162/evco.2007.15.2.133]
|
 |
12
|
|
| |
13
|
C.J.C.H. Watkins. Learning from delayed reward. PhD thesis, 1989.
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
Stewart W. Wilson. Classifier systems for continuous payoff environments. In Genetic and Evolutionary Computation -- GECCO-2004, Part II, volume 3103 of Lecture Notes in Computer Science, pages 824--835, Seattle, WA, USA, 26-30 June 2004. Springer-Verlag.
|
|