A comparison of tournament and lexicase selection paradigms in regression problems: error-based fitness versus correlation fitness
Created by W.Langdon from
gp-bibliography.bib Revision:1.8469
- @InProceedings{bakurov:2025:GECCO,
-
author = "Illya Bakurov and Aidan Murphy and Charles Ofria and
Wolfgang Banzhaf",
-
title = "A comparison of tournament and lexicase selection
paradigms in regression problems: error-based fitness
versus correlation fitness",
-
booktitle = "Proceedings of the 2025 Genetic and Evolutionary
Computation Conference",
-
year = "2025",
-
editor = "Aniko Ekart and Nelishia Pillay",
-
pages = "970--979",
-
address = "Malaga, Spain",
-
series = "GECCO '25",
-
month = "14-18 " # jul,
-
organisation = "SIGEVO",
-
publisher = "Association for Computing Machinery",
-
publisher_address = "New York, NY, USA",
-
keywords = "genetic algorithms, genetic programming",
-
isbn13 = "979-8-4007-1465-8",
-
URL = "
https://doi.org/10.1145/3712256.3726448",
-
DOI = "
doi:10.1145/3712256.3726448",
-
size = "10 pages",
-
abstract = "Lexicase parent selection considers training cases
separately, postulating that aggregated fitness reduces
the information about the behavior of individuals.
Originally lexicase was proposed in the context of
program synthesis, characterized by uncompromising
problems that require qualitatively different actions
for different inputs, but it has since been extended to
regression problems. To facilitate valley-crossing a
relaxation parameter epsilon was added broadening the
pass condition at a given training case. Although
epsilon-lexicase has demonstrated superior
effectiveness, it was compared against selection
methods that aggregated squared (or absolute) errors.
Recent contributions, however, demonstrate that
correlation fitness functions can lead to significant
performance gains over the root mean square error
(RMSE) in tournament-guided evolution for symbolic
regression. Here we compare epsilon-lexicase (with and
without down-sampling) against tournament selection
using both error- and correlation-based fitness to
guide Genetic Programming (GP). We also assess batch
epsilon-lexicase selection as an intermediate
condition. Finally, we explore different selection
pressures to assess the exploration-exploitation
trade-off. We analyze the experimental results using
different metrics, including code redundancy,
sharpness-awareness and selection impact. Our results
demonstrate that tournament selection with correlation
fitness function significantly outperforms
epsilon-lexicase on regression problems and that its
batch variant also benefits from correlation-based
aggregation.",
-
notes = "GECCO-2025 GP A Recombination of the 34th
International Conference on Genetic Algorithms (ICGA)
and the 30th Annual Genetic Programming Conference
(GP)",
- }
Genetic Programming entries for
Illya Bakurov
Aidan Murphy
Charles Ofria
Wolfgang Banzhaf
Citations