ACM Home Page
Please provide us with feedback. Feedback
Reward allotment in an event-driven hybrid learning classifier system for online soccer games
Full text PdfPdf (146 KB)
Source Genetic And Evolutionary Computation Conference archive
Proceedings of the 8th annual conference on Genetic and evolutionary computation table of contents
Seattle, Washington, USA
SESSION: Real-world applications: papers table of contents
Pages: 1753 - 1760  
Year of Publication: 2006
ISBN:1-59593-186-4
Authors
Yuji Sato  Hosei University, Tokyo, JAPAN
Yosuke Akatsuka  Hosei University, Tokyo, JAPAN
Takenori Nishizono  Hosei University, Tokyo, JAPAN
Sponsors
SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 34,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1143997.1144282
What is a DOI?

ABSTRACT

This paper describes our study into the concept of using rewards in a classifier system applied to the acquisition of decision-making algorithms for agents in a soccer game. Our aim is to respond to the changing environment of video gaming that has resulted from the growth of the Internet, and to provide bug-free programs in a short time. We have already proposed a bucket brigade algorithm (a reinforcement learning method for classifiers) and a procedure for choosing what to learn depending on the frequency of events with the aim of facilitating real-time learning while a game is in progress. We have also proposed a hybrid system configuration that combines existing algorithm strategies with a classifier system, and we have reported on the effectiveness of this hybrid system. In this paper, we report on the results of performing reinforcement learning with different reward values assigned to reflect differences in the roles performed by forward, midfielder and defense players, and we describe the results obtained when learning is performed with different combinations of success rewards for various type of play such as dribbling and passing. In 200 matches played against an existing soccer game incorporating an algorithm devised by humans, a better win ratio and better convergence were observed compared with the case where learning was performed with no roles assigned to all of the in-game agents.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Barry, A. Limits in Long Path Learning with XCS. In Proceedings of the Fifth Annual Genetic and Evolutionary Computation Conference. Vol. 2, LNCS 2724, Springer-Verlag, Berlin, Heidelberg, 2003, 1832--1843.
 
2
 
3
 
4
Butz, M.V., Goldberg, D.E., and Lanzi, P.L. Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems. In Proceedings of the Sixth Annual Genetic and Evolutionary Computation Conference. Vol. 2, LNCS 3103, Springer-Verlag, Berlin, Heidelberg, 2004, 751--762.
 
5
Dawson, D. Improving Performance in Size-Constrained Extended Classifier Systems. In Proceedings of the Fifth Annual Genetic and Evolutionary Computation Conference. Vol. 2, LNCS 2724, Springer-Verlag, Berlin, Heidelberg, 2003, 1870--1881.
 
6
 
7
 
8
 
9
Holland, J.H. Escaping brittleness: The possibilities of general-purpose learning algorithms applied to parallel rule-based systems. In Michalski, R.S. et al. (eds.): Machine Learning II, Morgan Kaufmann Publishers, CA, 1986, 593--623.
 
10
 
11
Huang, C-H. and Sun, C-T. Parameter Adaptation within Co-adaptive Learning Classifier Systems. In Proceedings of the Sixth Annual Genetic and Evolutionary Computation Conference. Vol. 2, LNCS 3103, Springer-Verlag, Berlin, Heidelberg, 2004, 774--784.
 
12
Kitano, H., Asada, M., Kuniyoshi, Y., Noda, I., Osawa, E., and Matsubara, H. RoboCup: A challenge problem for AI. AI Magazine, Vol. 18, 1997, 73--85.
 
13
Kovacs, T. What Should a Classifier System Learn and How Should We Measure It? Journal of Soft Computing, Vol. 6, No. 3-4, 2002, 171--182.
 
14
Luke, S. Genetic Programming Produced Competitive Soccer Softbot Teams for RoboCup 97. In Proceedings of the Third Annual Genetic Programming Conference. Morgan Kaufmann Publishers, San Francisco, CA, 1998, 204--222.
 
15
 
16
 
17
 
18
 
19
RoboCup web page. http://www.robocup.org/
 
20
Sato, Y., and Kanno, R. Event-driven Hybrid Learning Classifier Systems for Online Soccer Games. In Proceedings of the 2005 IEEE Congress on Evolutionary Computation. IEEE Press, Edinburgh, 2005, 2091--2098.
 
21
 
22
Wilson, S.W. Classifier Fitness Based on Accuracy. Evolutionary Computation, Vol. 3, No. 2, 1995, 149--175.
 
23
Wilson, S.W. Generalization in the XCS Classifier System. In Proceedings of the Third Annual Genetic Programming Conference. Morgan Kaufmann Publishers, San Francisco, CA, 1998, 665--674.


Collaborative Colleagues:
Yuji Sato: colleagues
Yosuke Akatsuka: colleagues
Takenori Nishizono: colleagues