Skip to main content

Genetic Programming for Classification: An Analysis of Convergence Behaviour

  • Conference paper
  • First Online:
Book cover AI 2002: Advances in Artificial Intelligence (AI 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2557))

Included in the following conference series:

Abstract

This paper investigates the unexpected convergence behaviour of genetic Programming (GP) for classification problems. Firstly the paper investigates the relationship between computational effort and attainable classification accuracy. Secondly we attempt to understand why GP classifiers sometimes fail to reach satisfactory levels of accuracy for certain problems regardless of computational effort. The investigation uses an artificially generated dataset for which certain properties are known in advance for the exploration of these areas.

Results from this artificial problem show that by increasing computational effort, in the form of larger population sizes and more generations, the probability of success for a run does improve, but that the computational cost far outweighs the rate of this success. Also, some runs, even with very large populations running for many generations, became stagnant and were unable to find an acceptable solution. These results are also reflected in real world classification problems.

From analysis of sub-tree components making up successful and unsuccessful programs it was noted that a small number of particular components were almost always present in successful programs, and that these components were often absent from unsuccessful programs. Also a variety of components appeared in unsuccessful programs that were never present in successful ones. Evidence from runs suggests that these components represent paths leading to optimal and sub-optimal branches in the evolutionary search space. Additionally, results suggest that if suboptimal components (which mirror the concept of deception in genetic algorithms) are relatively greater in number than the optimal components for the problem, then the chances of GP finding a successful solution are reduced.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wolfgang Banzhaf, Peter Nordin, Robert E. Keller and Frank D. Francone. Genetic Programming — An Introduction; On the Automatic Evolution of Computer Programs and its Applications. Morgan Kaufmann, dpunkt.verlag, January 1998.

    Google Scholar 

  2. C.L. Blake and C.J. Merz. UCI repository of machine learning databases http://www.ics.uci.edu/?mlearn/mlrepository.html.

  3. Robert Feldt and Peter Nordin. Using factorial experiments to evaluate the effect of genetic programming parameters. In Riccardo Poli, Wolfgang Banzhaf, William B. Langdon, Julian F. Miller, Peter Nordin and Terence C. Fogarty (editors), Genetic Programming, Proceedings of EuroGP’2000, Volume 1802 of LNCS, pages 271–282, Edinburgh, 15–16 April 2000. Springer-Verlag.

    Google Scholar 

  4. Matthias Fuchs. Large populations are not always the best choice in genetic programming. In Wolfgang Banzhaf, Jason Daida, Agoston E. Eiben, Max H. Garzon, Vasant Honavar, Mark Jakiela and Robert E. Smith (editors), Proceedings of the Genetic and Evolutionary Computation Conference, Volume 2, pages 1033–1038, Orlando, Florida, USA, 13–17 July 1999. Morgan Kaufmann.

    Google Scholar 

  5. Chris Gathercole. An Investigation of Supervised Learning in Genetic Programming. Ph.D. thesis, University of Edinburgh, 1998.

    Google Scholar 

  6. D. Goldberg. Simple genetic algorithms and the minimal, deceptive problem. In L. Davis (editor), Genetic Algorithms and Simulated Annealing, pages 74–88. Morgan Kaufmann, 1987.

    Google Scholar 

  7. John R. Koza. Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, 1992.

    Google Scholar 

  8. Thomas Loveard and Victor Ciesielski. Representing classification problems in genetic programming. In Proceedings of the Congress on Evolutionary Computation, Volume 2, pages 1070–1077, COEX, Seoul, Korea, 27–30 May 2001. IEEE Press.

    Google Scholar 

  9. Sean Luke. When short runs beat long runs. In Lee Spector, Erik D. Goodman, Annie Wu, W. B. Langdon, Hans-Michael Voigt, Mitsuo Gen, Sandip Sen, Marco Dorigo, Shahram Pezeshk, Max H. Garzon and Edmund Burke (editors), Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), pages 74–80, San Francisco, California, USA, 7–11 July 2001. Morgan Kaufmann.

    Google Scholar 

  10. Riccardo Poli. General schema theory for genetic programming with subtree-swapping crossover. In Julian F. Miller, Marco Tomassini, Pier Luca Lanzi, Conor Ryan, Andrea G. B. Tettamanzi and William B. Langdon (editors), Genetic Programming, Proceedings of EuroGP’2001, Volume 2038 of LNCS, pages 143–159, Lake Como, Italy, 18–20 April 2001. Springer-Verlag.

    Chapter  Google Scholar 

  11. Riccardo Poli and W. B. Langdon. A new schema theory for genetic programming with one-point crossover and point mutation. In John R. Koza, Kalyanmoy Deb, Marco Dorigo, David B. Fogel, Max Garzon, Hitoshi Iba and Rick L. Riolo (editors), Genetic Programming 1997: Proceedings of the Second Annual Conference, pages 278–285, Stanford University, CA, USA, 13–16 July 1997. Morgan Kaufmann.

    Google Scholar 

  12. Walter Alden Tackett. Genetic programming for feature discovery and image discrimination. In Stephanie Forrest (editor), Proceedings of the 5th International Conference on Genetic Algorithms, ICGA-93, pages 303–309, University of Illinois at Urbana-Champaign, 17–21 July 1993. Morgan Kaufmann.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Loveard, T., Ciesielski, V. (2002). Genetic Programming for Classification: An Analysis of Convergence Behaviour. In: McKay, B., Slaney, J. (eds) AI 2002: Advances in Artificial Intelligence. AI 2002. Lecture Notes in Computer Science(), vol 2557. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36187-1_27

Download citation

  • DOI: https://doi.org/10.1007/3-540-36187-1_27

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00197-3

  • Online ISBN: 978-3-540-36187-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics