Skip to main content
Log in

GP-Gammon: Genetically Programming Backgammon Players

  • Published:
Genetic Programming and Evolvable Machines Aims and scope Submit manuscript

Abstract

We apply genetic programming to the evolution of strategies for playing the game of backgammon. We explore two different strategies of learning: using a fixed external opponent as teacher, and letting the individuals play against each other. We conclude that the second approach is better and leads to excellent results: Pitted in a 1000-game tournament against a standard benchmark player—Pubeval—our best evolved program wins 62.4% of the games, the highest result to date. Moreover, several other evolved programs attain win percentages not far behind the champion, evidencing the repeatability of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. J. R. Anderson, and C. Lebiere, The Atomic Components of Thought, Lawrence Erlbaum Associates: Mahwah, NJ, 1998.

    Google Scholar 

  2. K. Chellapilla, “A preliminary investigation into evolving modular programs without subtree crossover,”. in, Genetic Programming 1998: Proceedings of the Third Annual Conference, J. R. Koza, W. Banzhaf, K. Chellapilla, K. Deb, M. Dorigo, D. B. Fogel, M. H. Garzon, D. E. Goldberg, H. Iba, and R. Riolo (Eds.). University of Wisconsin, Madison, Wisconsin, USA, 1998, pp. 23–31.

  3. F. Dahl, “JellyFish Backgammon,” 1998–2004. http://www.jellyfish-backgammon.com.

  4. P. Darwen, “Why co-evolution beats temporal-difference learning at backgammon for a linear architecture, but not a non-linear architecture,” in Proceedings of the 2001 Congress on Evolutionary Computation (CEC-01). Seoul Korea, 2001, pp. 1003–1010.

  5. R. Gross, K. Albrecht, W. Kantschik, and W. Banzhaf, “Evolving chess playing programs,” in, GECCO 2002: Proceedings of the Genetic and Evolutionary Computation Conference, W. B. Langdon, E. Cantú-Paz, K. Mathias, R. Roy, D. Davis, R. Poli, K. Balakrishnan, V. Honavar, G. Rudolph, J. Wegener, L. Bull, M. A. Potter, A. C. Schultz, J. F. Miller, E. Burke, and N. Jonoska (Eds.). New York, 2002, pp. 740–747.

  6. A. Hauptman, and M. Sipper, “GP-EndChess: Using genetic programming to evolve chess endgame players,” in Proceedings of 8th European Conference on Genetic Programming (EuroGP2005), M. Keijzer, A. Tettamanzi, P. Collet, J. van Hemert, and M. Tomassini, (Eds), vol. 3447 of Lecture Notes in Computer Science, Springer-Verlag, Heidelberg, 2005, pp. 120–131.

  7. J. R. Koza, Genetic programming: On the Programming of Computers by Means of Natural Selection. MIT Press: Cambridge, MA, 1992.

    Google Scholar 

  8. J. R. Koza, Genetic Programming II: Automatic Discovery of Reusable Programs. MIT Press: Cambridge, Massachusetts, 1994.

    Google Scholar 

  9. J. R. Koza, F. H. Bennett III, D. Andre, and M. A. Keane, Genetic Programming III: Darwinian Invention and Problem Solving, Morgan Kaufmann: San Francisco, California, 1999.

    Google Scholar 

  10. D. J. Montana, “Strongly typed genetic programming,” Evolutionary Computation, vol. 3, no. 2, pp. 199–230, 1995.

    Google Scholar 

  11. J. B. Pollack, A. D. Blair, and M. Land, “Coevolution of a backgammon player,” in, Artificial Life V: Proceedings of the Fifth International Workshop on the Synthesis and Simulation of Living Systems, C. G. Langton and K. Shimohara (Eds.), Cambridge, MA, 1997, pp. 92–98.

  12. J. B. Pollack, A. D. Blair, and M. Land, “DEMO Lab”s HC-Gammon”, 1997. http://demo.cs.brandeis.edu/bkg.html.

  13. D. Qi, and R. Sun, “Integrating reinforcement learning, bidding and genetic algorithms,” in Proceedings of the International Conference on Intelligent Agent Technology (IAT-2003), 2003, pp. 53–59.

  14. E. M. A. Ronald, M. Sipper, and M. S. Capcarrère, “Design, Observation, Surprise! A Test of Emergence,” Artificial Life vol. 5, no. 3, pp. 225–239, 1999.

    Article  Google Scholar 

  15. S. Sanner, J. R. Anderson, C. Lebiere, and M. Lovett, “Achieving efficient and cognitively plausible learning in backgammon,” in, Proceedings of the 17th International Conference on Machine Learning (ICML-2000), P. Langley (Ed.), Stanford, CA, 2000, pp. 823–830.

  16. Y. Shichel, E. Ziserman, and M. Sipper, “GP-Robocode: Using genetic programming to evolve robocode players,” in Proceedings of 8th European Conference on Genetic Programming (Euro GP2005), M. Keijzer, A. Tettamanzi, P. Collet, J. van Hemert, and M. Tomassini, (Eds), vol. 3447 of Lecture Notes in Computer Science, Springer-Verlag, Heidelberg, 2005, pp. 143–154.

  17. M. Sipper, “A success story or an old wives' Tale? On judging experiments in evolutionary computation,” Complexity, vol. 5, no. 4, pp. 31–33, 2000.

    Article  Google Scholar 

  18. G. Tesauro, “NEUROGAMMON: A neural-network backgammon learning program,” Heuristic Programming in Artificial Intelligence, vol. 1, no. 7, pp. 78–80, 1989.

    Google Scholar 

  19. G. Tesauro, “Software–Source Code Benchmark player ‘pubeval.c’”. http://www.bkgm.com/rgb/rgb.cgi?view+610 1993.

  20. G. Tesauro, “Temporal difference learning and TD-Gammon,” Communications of the ACM, vol. 38, no. 3, pp. 58–68, 1995.

    Article  Google Scholar 

  21. X. Yao, “Evolving artificial neural networks,” Proceedings of the IEEE, vol. 87, no. 9, pp. 1423–1447, 1999.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yaniv Azaria.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Azaria, Y., Sipper, M. GP-Gammon: Genetically Programming Backgammon Players. Genet Program Evolvable Mach 6, 283–300 (2005). https://doi.org/10.1007/s10710-005-2990-0

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10710-005-2990-0

Keywords

Navigation