Fitness Landscape Analysis of Automated Machine Learning Search Spaces
Created by W.Langdon from
gp-bibliography.bib Revision:1.7917
- @InProceedings{Pimenta:2020:evoCOP,
-
author = "Cristiano G. Pimenta and Alex G. C. {de Sa} and
Gabriela Ochoa and Gisele L. Pappa",
-
title = "Fitness Landscape Analysis of Automated Machine
Learning Search Spaces",
-
booktitle = "European Conference on Evolutionary Computation in
Combinatorial Optimization (EvoCOP 2020)",
-
year = "2020",
-
editor = "L. Paquete and C. Zarges",
-
volume = "12102",
-
series = "Lecture Notes in Computer Science",
-
pages = "114--130",
-
address = "Seville, Spain",
-
month = "15-17 " # apr,
-
organisation = "EvoStar, Species",
-
publisher = "Springer",
-
keywords = "genetic algorithms, genetic programming, TPOT, AutoML,
Fitness landscape analysis, Automated Machine Learning,
Fitness distance correlation, Neutrality",
-
isbn13 = "978-3-030-43679-7",
-
DOI = "doi:10.1007/978-3-030-43680-3_8",
-
abstract = "The field of Automated Machine Learning (AutoML) has
as its main goal to automate the process of creating
complete Machine Learning (ML) pipelines to any dataset
without requiring deep user expertise in ML. Several
AutoML methods have been proposed so far, but there is
not a single one that really stands out. Furthermore,
there is a lack of studies on the characteristics of
the fitness landscape of AutoML search spaces. Such
analysis may help to understand the performance of
different optimization methods for AutoML and how to
improve them. This paper adapts classic fitness
landscape analysis measures to the context of AutoML.
This is a challenging task, as AutoML search spaces
include discrete, continuous, categorical and
conditional hyperparameters. We propose an ML pipeline
representation, a neighborhood definition and a
distance metric between pipelines, and use them in the
evaluation of the fitness distance correlation (FDC)
and the neutrality ratio for a given AutoML search
space. Results of FDC are counter-intuitive and require
a more in-depth analysis of a range of search spaces.
Results of neutrality, in turn, show a strong positive
correlation between the mean neutrality ratio and the
fitness value.",
-
notes = "http://www.evostar.org/2020/ EvoCOP2020 held in
conjunction with EuroGP'2020, EvoMusArt2020 and
EvoApplications2020",
- }
Genetic Programming entries for
Cristiano Guimaraes Pimenta
Alex G C de Sa
Gabriela Ochoa
Gisele L Pappa
Citations