Genetic Programming for Classification: An Analysis of Convergence Behaviour

Loveard, Thomas; Ciesielski, Vic

doi:10.1007/3-540-36187-1_27

Thomas Loveard³ &
Vic Ciesielski³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2557))

Included in the following conference series:

Australian Joint Conference on Artificial Intelligence

1127 Accesses
2 Citations

Abstract

This paper investigates the unexpected convergence behaviour of genetic Programming (GP) for classification problems. Firstly the paper investigates the relationship between computational effort and attainable classification accuracy. Secondly we attempt to understand why GP classifiers sometimes fail to reach satisfactory levels of accuracy for certain problems regardless of computational effort. The investigation uses an artificially generated dataset for which certain properties are known in advance for the exploration of these areas.

Results from this artificial problem show that by increasing computational effort, in the form of larger population sizes and more generations, the probability of success for a run does improve, but that the computational cost far outweighs the rate of this success. Also, some runs, even with very large populations running for many generations, became stagnant and were unable to find an acceptable solution. These results are also reflected in real world classification problems.

From analysis of sub-tree components making up successful and unsuccessful programs it was noted that a small number of particular components were almost always present in successful programs, and that these components were often absent from unsuccessful programs. Also a variety of components appeared in unsuccessful programs that were never present in successful ones. Evidence from runs suggests that these components represent paths leading to optimal and sub-optimal branches in the evolutionary search space. Additionally, results suggest that if suboptimal components (which mirror the concept of deception in genetic algorithms) are relatively greater in number than the optimal components for the problem, then the chances of GP finding a successful solution are reduced.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wolfgang Banzhaf, Peter Nordin, Robert E. Keller and Frank D. Francone. Genetic Programming — An Introduction; On the Automatic Evolution of Computer Programs and its Applications. Morgan Kaufmann, dpunkt.verlag, January 1998.
Google Scholar
C.L. Blake and C.J. Merz. UCI repository of machine learning databases http://www.ics.uci.edu/?mlearn/mlrepository.html.
Robert Feldt and Peter Nordin. Using factorial experiments to evaluate the effect of genetic programming parameters. In Riccardo Poli, Wolfgang Banzhaf, William B. Langdon, Julian F. Miller, Peter Nordin and Terence C. Fogarty (editors), Genetic Programming, Proceedings of EuroGP’2000, Volume 1802 of LNCS, pages 271–282, Edinburgh, 15–16 April 2000. Springer-Verlag.
Google Scholar
Matthias Fuchs. Large populations are not always the best choice in genetic programming. In Wolfgang Banzhaf, Jason Daida, Agoston E. Eiben, Max H. Garzon, Vasant Honavar, Mark Jakiela and Robert E. Smith (editors), Proceedings of the Genetic and Evolutionary Computation Conference, Volume 2, pages 1033–1038, Orlando, Florida, USA, 13–17 July 1999. Morgan Kaufmann.
Google Scholar
Chris Gathercole. An Investigation of Supervised Learning in Genetic Programming. Ph.D. thesis, University of Edinburgh, 1998.
Google Scholar
D. Goldberg. Simple genetic algorithms and the minimal, deceptive problem. In L. Davis (editor), Genetic Algorithms and Simulated Annealing, pages 74–88. Morgan Kaufmann, 1987.
Google Scholar
John R. Koza. Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, 1992.
Google Scholar
Thomas Loveard and Victor Ciesielski. Representing classification problems in genetic programming. In Proceedings of the Congress on Evolutionary Computation, Volume 2, pages 1070–1077, COEX, Seoul, Korea, 27–30 May 2001. IEEE Press.
Google Scholar
Sean Luke. When short runs beat long runs. In Lee Spector, Erik D. Goodman, Annie Wu, W. B. Langdon, Hans-Michael Voigt, Mitsuo Gen, Sandip Sen, Marco Dorigo, Shahram Pezeshk, Max H. Garzon and Edmund Burke (editors), Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), pages 74–80, San Francisco, California, USA, 7–11 July 2001. Morgan Kaufmann.
Google Scholar
Riccardo Poli. General schema theory for genetic programming with subtree-swapping crossover. In Julian F. Miller, Marco Tomassini, Pier Luca Lanzi, Conor Ryan, Andrea G. B. Tettamanzi and William B. Langdon (editors), Genetic Programming, Proceedings of EuroGP’2001, Volume 2038 of LNCS, pages 143–159, Lake Como, Italy, 18–20 April 2001. Springer-Verlag.
Chapter Google Scholar
Riccardo Poli and W. B. Langdon. A new schema theory for genetic programming with one-point crossover and point mutation. In John R. Koza, Kalyanmoy Deb, Marco Dorigo, David B. Fogel, Max Garzon, Hitoshi Iba and Rick L. Riolo (editors), Genetic Programming 1997: Proceedings of the Second Annual Conference, pages 278–285, Stanford University, CA, USA, 13–16 July 1997. Morgan Kaufmann.
Google Scholar
Walter Alden Tackett. Genetic programming for feature discovery and image discrimination. In Stephanie Forrest (editor), Proceedings of the 5th International Conference on Genetic Algorithms, ICGA-93, pages 303–309, University of Illinois at Urbana-Champaign, 17–21 July 1993. Morgan Kaufmann.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, RMIT University, GPO Box 2476V, 3001, Melbourne, Victoria, Australia
Thomas Loveard & Vic Ciesielski

Authors

Thomas Loveard
View author publications
You can also search for this author in PubMed Google Scholar
Vic Ciesielski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Australian Defence Force Academy, University of New South Wales, ACT 2600, Canberra, Australia
Bob McKay
Computer Science Laboratory, Australian National University, RSISE Building, ACT 0200, Canberra, Australia
John Slaney

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Loveard, T., Ciesielski, V. (2002). Genetic Programming for Classification: An Analysis of Convergence Behaviour. In: McKay, B., Slaney, J. (eds) AI 2002: Advances in Artificial Intelligence. AI 2002. Lecture Notes in Computer Science(), vol 2557. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36187-1_27

Download citation

DOI: https://doi.org/10.1007/3-540-36187-1_27
Published: 08 November 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00197-3
Online ISBN: 978-3-540-36187-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics