ACM Home Page
Please provide us with feedback. Feedback
Towards efficient online reinforcement learning using neuroevolution
Full text PdfPdf (324 KB)
Source
Genetic And Evolutionary Computation Conference archive
Proceedings of the 10th annual conference on Genetic and evolutionary computation table of contents
Atlanta, GA, USA
POSTER SESSION: Genetics-based machine learning and learning classifier systems posters table of contents
Pages 1425-1426  
Year of Publication: 2008
ISBN:978-1-60558-130-9
Authors
Jan Hendrik Metzen  DFKI GmbH, Bremen, Germany
Frank Kirchner  DFKI GmbH, Bremen, Germany
Mark Edgington  University of Bremen, Bremen, Germany
Yohannes Kassahun  University of Bremen, Bremen, Germany
Sponsors
ACM: Association for Computing Machinery
SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 48,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1389095.1389371
What is a DOI?

ABSTRACT

For many complex Reinforcement Learning (RL) problems with large and continuous state spaces, neuroevolution has achieved promising results. This is especially true when there is noise in sensor and/or actuator signals. These results have mainly been obtained in offline learning settings, where the training and the evaluation phases of the systems are separated. In contrast, for online RL tasks, the actual performance of a system matters during its learning phase. In these tasks, neuroevolutionary systems are often impaired by their purely exploratory nature, meaning that they usually do not use (i.e. exploit) their knowledge of a single individual's performance to improve performance during learning. In this paper we describe modifications that significantly improve the online performance of the neuroevolutionary method Evolutionary Acquisition of Neural Topologies and discuss the results obtained in the Mountain Car benchmark.



Collaborative Colleagues:
Jan Hendrik Metzen: colleagues
Frank Kirchner: colleagues
Mark Edgington: colleagues
Yohannes Kassahun: colleagues