skip to main content
10.1145/3071178.3071316acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Coevolving deep hierarchies of programs to solve complex tasks

Published:01 July 2017Publication History

ABSTRACT

Scaling genetic programming to organize large complex combinations of programs remains an under investigated topic in general. This work revisits the issue by first demonstrating the respective contributions of coevolution and diversity maintenance. Competitive coevolution is employed to organize a task in such a way that the most informative training cases are retained. Cooperative coevolution helps discover modularity in the solutions discovered and, in this work, is fundamental to constructing complex structures of programs that still execute efficiently (the policy tree). The role of coevolution and diversity maintenance is first independently established under the task of discovering reinforcement learning policies for solving Rubik's Cubes scrambled with 5-twists. With this established, a combined approach is then adopted for building large organizations of code for representing policies that solve 5 to 8-twist combinations of the Cube. The resulting 'deep' policy tree organizes hundreds of programs to provide optimal solutions to tens of millions of test cube configurations.

References

  1. Isidro M. Alvarez, Will N. Browne, and Mengjie Zhang. 2016. Human-inspired Scaling in Learning Classifier Systems: Case Study on the n-bit Multiplexer Problem Set. In ACM Genetic and Evolutionary Computation Conference. 429--436. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. J. Arfaee, S. Zilies, and R. C. Holte. 2011. Learning heuristic functions for large state spaces. Artificial Intelligence 175 (2011), 2075--2098. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. E. B. Baum and I. Durdanovic. 2000. Evolution of cooperative problem solving in an artificial economy. Neural Computation 12 (2000), 2743--2775. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. E. D. de Jong. 2007. A monotonic archive for Pareto-coevolution. Evolutionary Computation 15, 1 (2007), 61--93. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Demšar. 2006. Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research 7, 1 (2006), 1--30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. A. Doucette, P. Lichodzijewski, and M. I. Heywood. 2012. Hierarchical task decomposition through symbiosis in reinforcement learning. In ACM Genetic and Evolutionary Computation Conference. 97--104. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. A. Doucette, A. R. McIntyre, P. Lichodzijewski, and M. I. Heywood. 2012. Symbiotic coevolutionary genetic programming: A benchmarking study under large attribute spaces. Genetic Programming and Evolvable Machines 13 (2012), 71--101. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. N. El-Sourani, S. Hauke, and M. Borschbach. 2010. An evolutionary approach for solving the Rubik's Cube incorporating exact methods. In EvoApplications: Part I (LNCS), Vol. 6024. 80--89. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. G. Ficici and J. B. Pollack. 2001. Pareto optimality in coevolutionary learning. In European Conference on Advances in Artificial Life (LNAI), Vol. 2159. 316--325. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. F. Gomez and R. Miikkulainen. 1997. Incremental evolution of complex general behavior. Adaptive Behavior 5, 3--4 (1997), 317--342. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Hauptman, A. Elyasaf, M. Sipper, and A. Karmon. 2009. GP-Rush: Using genetic programming to evolve solvers for the rush hour puzzle. In ACM Genetic and Evolutionary Computation Conference. 955--962. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. William H. Hsu, Scott J. Harmon, Edwin Rodriguez, and Christopher Zhong. 2004. Empirical comparison of incremental reuse strategies in genetic programming for keep-away soccer. In Genetic and Evolutionary Computation Conference - Late Breaking Papers.Google ScholarGoogle Scholar
  13. W. Jaśkowski, K. Krawiec, and B. Wieloch. 2007. Knowledge reuse in genetic programming applied to visual learning. In Proceedings of the ACM Genetic and Evolutionary Computation Conference. 1790--1797. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Keijzer, C. Ryan, and M. Cattolico. 2004. Run transferable libraries - learning functional bias in problem domains. In Proceedings of the Genetic and Evolutionary Computation Conference (LNCS), Vol. 3103. 531--542.Google ScholarGoogle Scholar
  15. S. Kelly and M. I. Heywood. 2014. On Diversity, Teaming, and Hierarchical Policies: Observations from the Keepaway Soccer Task. In European Conference on Genetic Programming (LNCS), Vol. 8599. Springer, 75--86. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Kelly and M. I. Heywood. 2015. Knowledge Transfer from Keepaway Soccer to Half-field Offense Through Program Symbiosis: Building Simple Programs for a Complex Task. In ACM Genetic and Evolutionary Computation Conference. 1143--1150. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Kelly, P. Lichodzijewski, and M. I. Heywood. 2012. On run time libraries and hierarchical symbiosis. In IEEE Congress on Evolutionary Computation. 3245--3252.Google ScholarGoogle Scholar
  18. R. E. Korf. 1997. Finding optimal solutions to Rubik's Cube using pattern databases. In AAAI Conference on Artificial Intelligence. 700--705. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Y. LeCun, Y. Bengio, and G. Hinton. 2015. Deep Learning. Nature 521 (2015), 436--444.Google ScholarGoogle ScholarCross RefCross Ref
  20. P. Lichodzijewski and M. Heywood. 2010. The Rubic's Cube and GP temporal sequence learning: An initial study. In Genetic Programming Theory and Practice VIII. Springer, Chapter 3, 35--54.Google ScholarGoogle Scholar
  21. P. Lichodzijewski and M. I. Heywood. 2008. Managing team-based problem solving with Symbiotic Bid-based Genetic Programming. In ACM Genetic and Evolutionary Computation Conference. 363--370. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C. D. Rosin and R. K. Belew. 1997. New methods for competitive coevolution. Evolutionary Computation 5 (1997), 1--29. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Robert J. Smith, Stephen Kelly, and Malcolm I. Heywood. 2016. Discovering Rubik's Cube Subgroups using Coevolutionary GP: A Five Twist Experiment. In ACM Genetic and Evolutionary Computation Conference. 789--796. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. L. Spector, K. Harrington, and T. Helmuth. 2012. Tag-based modularity in tree-based genetic programming. In Proceedings of the ACM Genetic and Evolutionary Computation Conference. 815--826. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. P. Stone. 2000. Layered learning in multiagent systems. MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Matthew E. Taylor and Peter Stone. 2011. An Introduction to Inter-task Transfer for Reinforcement Learning. AI Magazine 32, 1 (2011), 15--34.Google ScholarGoogle ScholarCross RefCross Ref
  27. S. Wu and W. Banzhaf. 2011. Rethinking multilevel selection in genetic programming. In Proceedings of the ACM Genetic and Evolutionary Computation Conference. 1403--1410. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Coevolving deep hierarchies of programs to solve complex tasks

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference
      July 2017
      1427 pages
      ISBN:9781450349208
      DOI:10.1145/3071178

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 July 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      GECCO '17 Paper Acceptance Rate178of462submissions,39%Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader