ACM Home Page
Please provide us with feedback. Feedback
Simulating human grandmasters: evolution and coevolution of evaluation functions
Full text PdfPdf (323 KB)
Source
Genetic And Evolutionary Computation Conference archive
Proceedings of the 11th Annual conference on Genetic and evolutionary computation table of contents
Montreal, Québec, Canada
SESSION: Track 13: real world application table of contents
Pages 1483-1490  
Year of Publication: 2009
ISBN:978-1-60558-325-9
Authors
Omid David-Tabibi  Bar-Ilan University, Ramat-Gan, Israel
H. Jaap van den Herik  Tilburg University, Tilburg, Netherlands
Moshe Koppel  Bar-Ilan University, Ramat-Gan, Israel
Nathan S. Netanyahu  Bar-Ilan University, Ramat-Gan, Israel
Sponsors
SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 38,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1569901.1570100
What is a DOI?

ABSTRACT

This paper demonstrates the use of genetic algorithms for evolving a grandmaster-level evaluation function for a chess program. This is achieved by combining supervised and unsupervised learning. In the supervised learning phase the organisms are evolved to mimic the behavior of human grandmasters, and in the unsupervised learning phase these evolved organisms are further improved upon by means of coevolution.

While past attempts succeeded in creating a grandmaster-level program by mimicking the behavior of existing computer chess programs, this paper presents the first successful attempt at evolving a state-of-the-art evaluation function by learning only from databases of games played by humans. Our results demonstrate that the evolved program outperforms a two-time World Computer Chess Champion.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
P. Aksenov.phGenetic algorithms for optimising chess position scoring. M. Sc. Thesis, University of Joensuu, Finland, 2004.
 
3
T.S. Anantharaman.Extension heuristics. ICCA Journal, 14(2):47--65, 1991.
 
4
 
5
D.F. Beal.Experiments with the null move. Advances in Computer Chess 5, ed. D.F. Beal, pages 65--79. Elsevier Science, Amsterdam, 1989.
 
6
D.F. Beal and M.C. Smith. Quantification of search extension benefits. ICCA Journal, 18(4):205--218, 1995.
 
7
 
8
 
9
M.S. Campbell and T.A. Marsland. A comparison of minimax tree search algorithms. Artificial Intelligence, 20(4):347--367, 1983.
 
10
J.R. Capablanca. Chess Fundamentals, ed. N. de Firmian, Random House, Revised ed., 2006.
 
11
S. Chinchalkar. An upper bound for the number of reachable positions. ICCA Journal, 19(3):181--183, 1996.
12
 
13
 
14
C. Donninger. Null move and deep search: Selective search heuristics for obtuse chess programs. ICCA Journal, 16(3):137--143, 1993.
 
15
J.J. Gillogly. The technology chess program. Artificial Intelligence, 3(1-3):145--163, 1972.
 
16
 
17
A. Hauptman and M. Sipper. Using genetic programming to evolve chess endgame players. In Proceedings of the 2005 European Conference on Genetic Programming, pages 120--131. Springer, Lausanne, Switzerland, 2005.
 
18
A. Hauptman and M. Sipper. Evolution of an efficient search algorithm for the Mate-in-N problem in chess. In Proceedings of the 2007 European Conference on Genetic Programming, pages 78--89. Springer, Valencia, Spain, 2007.
 
19
E.A. Heinz. Extended futility pruning. ICCA Journal, 21(2):75--83, 1998.
 
20
 
21
G. Kendall and G. Whitwell. An evolutionary approach for the tuning of a chess evaluationfunction using population dynamics. In Proceedings of the 2001 Congress on Evolutionary Computation, pages 995--1002. IEEE Press, World Trade Center, Seoul, Korea, 2001.
 
22
H.L. Nelson. Hash tables in Cray Blitz.phICCA Journal, 8(1):3--13, 1985.
 
23
A. Reinfeld. An improvement to the Scout tree-search algorithm. ICCA Journal, 6(4):4--14, 1983.
 
24
J. Schaeffer. The history heuristic. ICCA Journal, 6(3):16--19, 1983.
 
25
 
26
J. Schaeffer, M. Hlynka, and V. Jussila. Temporal difference learning applied to a high-performance game-playing program. In Proceedings of the 2001 International Joint Conference on Artificial Intelligence, pages 529--534. Seattle, WA, 2001.
 
27
J.J. Scott. A chess-playing program. Machine Intelligence 4, eds. B. Meltzer and D. Michie, pages 255--265. Edinburgh University Press, Edinburgh, 1969.
 
28
D.J. Slate and L.R. Atkin. Chess 4.5 -- The Northwestern University chess program. Chess Skill in Man and Machine, ed. P.W. Frey, pages 82--118. Springer-Verlag, New York, 2nd ed., 1983.
 
29
 
30
 
31
W. Tunstall-Pedoe. Genetic algorithms optimising evaluation functions. ICCA Journal, 14(3):119--128, 1991.
 
32
M.A. Wiering. TD learning of game evaluation functions with hierarchical neural architectures. Master's Thesis, University of Amsterdam, 1995.

Collaborative Colleagues:
Omid David-Tabibi: colleagues
H. Jaap van den Herik: colleagues
Moshe Koppel: colleagues
Nathan S. Netanyahu: colleagues