Towards fast approximations for the hypervolume indicator for multi-objective optimization problems by Genetic Programming

doi:10.1016/j.asoc.2022.109103

Applied Soft Computing

Volume 125, August 2022, 109103

https://doi.org/10.1016/j.asoc.2022.109103 Get rights and content

Highlights

•
A model that approximates the Hypervolume indicator is obtained using GP.
•
Models are highly efficient, orders of magnitude faster in total computation time.
•
Models are highly accurate, and can be used to guide an indicator-based MOEA.

Abstract

Hypervolume (HV) has become one of the most popular indicators to assess the quality of Pareto front approximations. However, the best algorithm for computing these values has a computational complexity of $O (N^{k / 3} polylog (N))$ for $N$ candidate solutions and $k$ objectives. In this study, we propose a regression-based approach to learn new mathematical expressions to approximate the HV value and improve at the same time their computational efficiency. In particular, Genetic Programming is used as the modeling technique, because it can produce compact and efficient symbolic models. To evaluate this approach, we exhaustively measure the deviation of the new models against the real HV values using the DTLZ and WFG benchmark suites. We also test the new models using them as a guiding mechanism within the indicator-based algorithm SMS-EMOA. The results are very consistent and promising since the new models report very low errors and a high correlation for problems with 3, 4, and 5 objectives. What is more striking is the execution time achieved by these models, which in a direct comparison against standard HV calculation achieved extremely high speedups of close to 100X for a single front and over 1000X for all the HV contributions in a population, speedups reach over 10X in full runs of SMS-EMOA compared with the standard Monte Carlo approximations of the HV, particularly for large population sizes. Finally, the evolved models generalize across multiple complex problems, using only two problems to train the problems from the DTLZ benchmark and performing efficiently and effectively on all remaining DTLZ and WFG benchmark problems.

Introduction

Real-world problems often require the simultaneous optimization of several competing objectives, leading to multi-objective optimization problems (MOPs). One important characteristic of MOPs is that their solution sets, the so-called Pareto sets, as well as their images, the Pareto fronts, typically form objects of dimension $(k - 1)$ , where $k$ is the number of objectives considered in the given problem. For the numerical treatment of such problems, multi-objective evolutionary algorithms (MOEAs) have caught many researchers’ and practitioners’ interest during the last two decades. Reasons for this include that MOEAs are of global nature, very robust, require minimal assumptions on the model, and are capable of computing finite-size approximations of the entire Pareto set/front of the given MOP in a single run of the algorithm. Since the outcome of every MOEA is an entire set of candidate solutions (population) that ideally resembles the solution set (mainly the Pareto front), one question that naturally arises is how to measure the obtained approximation quality. This is needed to compare different solution sets and guide the MOEA towards the “best” Pareto front approximation.

One performance indicator that is widely used is the Hypervolume indicator (HV, [1], [2]). Although this indicator has several valuable properties [2], [3], [4], it has one critical weakness: the cost for evaluating the HV value of given candidate sets grows exponentially with the number of objectives. That is, while this cost is relatively low for bi-objective problems (compared to the overall cost of a MOEA), the computation of the HV values become the bottleneck for MOPs with more objectives, which represents a severe drawback for the applicability of the HV in modern applications. Since decision-making processes are getting more sophisticated, it is a natural consequence that also the related MOPs increase their number of optimization objectives. And this is not only valid for the quality assessment of a given solution set, but even for the correct functioning of those MOEAs that are based on computing thousands of HV values and HV contributions (i.e., the contribution of an individual of a given population to the HV value) within one run of the algorithm. A straightforward implementation of the HV value of a given set $S$ with a magnitude $N$ leads to a complexity of $O (N^{k + 1})$ , while the best algorithm has a complexity of $O (N^{k / 3} polylog (N))$ . Literature reports several methods that aim for a reduction of the computational cost for the HV. For instance, some methods proposed algorithms that reduce the complexity of the computation for specific cases [3], [5]; others employ techniques to approximate the values of the Hypervolume [6]; and finally, algorithms specialized on the Hypervolume contributions have also been proposed [7], [8], [9].

In this work, we propose using a machine learning regression technique that can produce relatively simple and efficient models that approximate the Hypervolume indicator’s behavior. The goal is to approximate the real indicator value, with minimal deviation, for any given problem. The modeling strategy considered is Genetic Programming (GP), which can produce models expressed as symbolic mathematical expressions. The GP system is set up to obtain efficient and straightforward models, avoiding unnecessarily large or complex structures. Thus, the resulting expressions’ main advantage is their computational complexity, which significantly speeds up the computational times (mostly runs in linear time) while keeping the quality in the obtained approximation.

Accordingly, we can summarize the main contributions of this study as follows.

•
We pose the problem of deriving approximate models of the HV indicator as a supervised learning problem that we approach through GP regression.
•
We show that the learned models are highly efficient, particularly when combined with an adequate updating process, achieving large speedups relative to the state-of-the-art.
•
We present results that show that the evolved models effectively approximate the HV indicator, allowing them to be used in two common scenarios: (1) quality indicators and (2) guiding the search of an indicator-based MOEA, both tasks tested for 3-objectives, 4-objective, and 5-objective MOPs.
•
The evolved models are quite general, since models trained on two benchmarks can be used to guide an indicator based MOEA on a variety of different MOPs.

The remainder of this paper is organized as follows. In Section 2, we briefly present some definitions and related work on multi-objective optimization and GP, respectively. In Section 3, we present the problem formulation, posing it as a supervised learning problem on which to apply GP. Afterwards, we outline our proposed approach in Section 4 and provide details of the main algorithms used. In Section 5 we present the main results of our study. Finally, Section 6 contains the conclusions and future work.

Section snippets

Background on multi-objective optimization and performance indicators

A continuous MOP can be mathematically defined as follows: $\begin{matrix} min_{x \in D} & F (x), \\ s.t. & G (x) \leq 0 \\ H (x) = 0 . \end{matrix}$ Hereby, $F : D \subset ℜ^{n} \to ℜ^{k}$ , $F (x) = (f_{1} (x), \dots, f_{k} (x))$ is the objective function that is defined by the individual objectives $f_{i} : D \subset ℜ^{n} \to ℜ$ . The domain $D$ of $F$ is defined by the subset of the $ℜ^{n}$ that satisfies all inequality and equality constraints, $D ≔ {x \in ℜ^{n} : G (x) \leq 0 and H (x) = 0} .$

The optimality of a MOP is defined by the concept of dominance. Let $v, w \in ℜ^{k}$ , then we say that the vector $v$ is less than $w$ ( $v <_{p} w$ ), if $v_{i} < w_{i}$ for all $i \in {1, \dots, k}$

Computational problem statement

In this work, the goal of deriving a MOEA performance indicator is posed as a synthesis or learning problem instead of a traditional analytical or formal derivation. While this could be modeled in different ways, we propose defining a supervised machine learning problem to build a new model to compute a performance indicator. To do so, it is necessary to define a target functionality that a learning algorithm will attempt to match, contained in a set of training instances. From this, it is then

Methodology

This section outlines our proposed approach to derive models that can efficiently approximate the HV indicator. The proposed methodology includes the following main stages.

1.
Generate the learning dataset. From a set of widely used MO benchmarks, we obtain a sample of approximations to the Pareto front. We made this by running a MOEA on these problems, and used a heuristic sampling policy. To compute the HV for each data set (ground truth), we apply the WFG implementation.¹
2.

Experiments and results

The experiments were carried out on a Dell R730 Power Edge Server with 2X Intel Xeon E5-2650 processors and 512 GB RAM running KVM virtual machines over Ubuntu Linux. The GPTIPS 2.0 software was downloaded from https://sites.google.com/site/gptips4matlab, running on MATLAB Version: 9.3.0.713579 (R2017b). To compute the HV indicator,we used the WFG implementation,³ the SMS-EMOA code was obtained from PlatEMO,⁴ with the

Conclusions and future work

In this work, we have presented a new methodology that allows approximating the Hypervolume (HV) values for MOPs. In particular, we have, for the first time in related literature, a supervised learning problem and used GP to evolve solutions for it. We have confirmed the reliability of our models through a comprehensive set of experimental evaluations. Numerical results show that our models approximate the real HV value of the selected benchmark problems, DTLZ and WFG, with great accuracy but

CRediT authorship contribution statement

Cristian Sandoval: Methodology, Software, Validation, Investigation, Data curation, Visualization. Oliver Cuate: Formal analysis, Investigation, Writing – review & editing, Methodology. Luis C. González: Conceptualization, Writing – original draft, Writing – review & editing, Supervision, Project administration, Funding acquisition. Leonardo Trujillo: Conceptualization, Methodology, Resources, Writing – original draft, Writing – review & editing, Supervision, Project administration, Funding

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The first author was supported by CONACYT (Mexico) doctoral scholarship with CVU number 789493. Oliver Cuate acknowledges Instituto Politécnico Nacional and funding from project SIP 20221947.

References (60)

JaszkiewiczA.
Improved quick hypervolume algorithm
Comput. Oper. Res.
(2018)
BeumeN. et al.
SMS-EMOA: MUltiobjective selection based on dominated hypervolume
European J. Oper. Res.
(2007)
BringmannK. et al.
Approximating the least hypervolume contributor: NP-hard in general, but fast in practice
Theoret. Comput. Sci.
(2012)
BringmannK. et al.
Approximating the volume of unions and intersections of high-dimensional geometric objects
Comput. Geom.
(2010)
TangW. et al.
Fast hypervolume approximation scheme based on a segmentation strategy
Inform. Sci.
(2020)
AguileraJ. et al.
From neighbors to strengths - the k-strongest strengths (kSS) classification algorithm
Pattern Recognit. Lett.
(2020)
TayJ.C. et al.
Evolving dispatching rules using genetic programming for solving multi-objective flexible job-shop problems
Comput. Ind. Eng.
(2008)
OlagueG. et al.
Evolutionary-computer-assisted design of image operators that detect interest points using genetic programming
Image Vis. Comput.
(2011)
Z-FloresE. et al.
Regularity and matching pursuit feature extraction for the detection of epileptic seizures
J. Neurosci. Methods
(2016)
Z-FloresE. et al.
Modeling the adsorption of phenols and nitrophenols by activated carbon using genetic programming
J. Cleaner Prod.
(2017)

SoteloA. et al.

Identification of epilepsy stages from ECoG using genetic programming classifiers

Comput. Biol. Med.

(2013)

ZitzlerE. et al.

Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach

IEEE Trans. Evol. Comput.

(1999)

ZitzlerE. et al.

The hypervolume indicator revisited: On the design of Pareto-compliant indicators via weighted integration

BeumeN. et al.

On the complexity of computing the hypervolume indicator

IEEE Trans. Evol. Comput.

(2009)

IshibuchiH. et al.

Comparison of hypervolume, IGD and IGD+ from the viewpoint of optimal distributions of solutions

BaderJ. et al.

HypE: An algorithm for fast hypervolume-based many-objective optimization

Evol. Comput.

(2011)

BradstreetL. et al.

Updating exclusive hypervolume contributions cheaply

BringmannK. et al.

An efficient algorithm for computing hypervolume contributions

Evol. Comput.

(2010)

EmmerichM.T.M. et al.

Computing hypervolume contributions in low dimensions: Asymptotically optimal algorithm and complexity results

HillermeierC.

Nonlinear Multiobjective Optimization: A Generalized Homotopy Approach, Vol. 135

(2001)

Van VeldhuizenD.A.

Multiobjective Evolutionary Algorithms: Classifications, Analyses, and New InnovationsTech. Rep.

(1999)

CoelloC.A.C. et al.

Solving multiobjective optimization problems using an artificial immune system

Genet. Program. Evol. Mach.

(2005)

SchützeO. et al.

Using the averaged Hausdorff distance as a performance measure in evolutionary multiobjective optimization

IEEE Trans. Evol. Comput.

(2012)

BogoyaJ.M. et al.

A (p,q)-averaged Hausdorff distance for arbitrary measurable sets

Math. Comput. Appl.

(2018)

BogoyaJ.M. et al.

The averaged Hausdorff distances in multi-objective optimization: A review

Mathematics

(2019)

BrockhoffD. et al.

On the properties of the R2 indicator

HansenM.P. et al.

Evaluating the Quality of Approximations to the Non-Dominated Set

(1994)

ZitzlerE. et al.

Quality assessment of Pareto set approximations

DilettosoE. et al.

A weakly Pareto compliant quality indicator

Math. Comput. Appl.

(2017)

IshibuchiH. et al.

Modified distance calculation in generational distance and inverted generational distance

Cited by (11)

A self-adaptive joint optimization framework for marine hybrid energy storage system design considering load fluctuation characteristics
2024, Applied Energy
Recently, with the development of new energy technologies, all-electric ships (AESs) with hybrid energy storage system (HESS) are becoming a promising solution to reduce fuel consumption and emissions. However, the high maneuverability of ships during the actual navigation places higher performance requirements on the HESS, which presents a nonlinear and multi-objective challenge for the HESS design. Therefore, it is necessary to consider the coupling between HESS sizing and energy management strategy (EMS). In this paper, a self-adaptive joint optimization framework (SJOF) for marine HESS design considering load fluctuation characteristics is proposed, which can find the optimal decision solution with excellent system economic and battery life performance for AES HESS design. Based on the rain flow counting (RFC) method, a multi-objective joint optimization method considering life cycle cost (LCC) and battery degradation index (BDI) is introduced into SJOF. Besides, a novel EMS including the self-adaptive segmentation mechanism (SSM) and power allocation is proposed, which can achieve the most efficient energy scheduling.
Multi-objective optimization model for railway heavy-haul traffic: Addressing carbon emissions reduction and transport efficiency improvement
2024, Energy
This paper establishes a multi-objective optimization model for railway heavy-haul trains, focusing on reducing carbon emissions and improving transport efficiency. The model integrates optimization of the route and the vehicle load rate, significantly reducing carbon emissions and enhancing transport efficiency. It addresses the challenges and characteristics of heavy-haul trains, introducing multi-objective optimization problems related to transport carbon emissions and efficiency. Using a pigeon-inspired optimization algorithm, the model considers joint constraints between carbon emissions and transport efficiency objectives. To overcome challenges in multi-objective transportation problems, the paper proposes a forward-learning pigeon-inspired optimization algorithm based on a surrogate-assisted model. This approach calculates the quality of the candidate solution using a surrogate model, reducing time costs. The algorithm employs a forward-learning strategy to enhance learning from non-dominant solutions. Experimental validation with benchmark functions confirms the effectiveness of the model and offers optimized solutions. The proposed method reduces carbon emissions while maintaining transport efficiency, contributing innovative ideas for the development of sustainable heavy-duty trains.
Multi-objective evolutionary optimization of extreme gradient boosting regression models of the internal turning of PEEK tubes
2024, Expert Systems with Applications
Internal turning or boring is an attractive machining process for hole enlarging as it presents low costs, good flexibility, and adaptability. The process presents some instability problems due to the high relation length/diameter ratio of the boring bar. To achieve the best process conditions, predictive models must be estimated and optimization must be conducted. This work presents a statistical learning approach for modeling and optimization of the internal turning process in PEEK tubes. Cutting speed, feed rate, and fixture position were considered input parameters. Cross-validation is used for learning and model selection, including k-fold and bootstrap approaches. The results pointed out that the extreme gradient boosting model was the best for all predictors. For $R_{a}$ the final prediction metrics results were $R M S E = 0.1395$ , $M A E = 0.1126$ , and $R^{2} = 1.0031$ , for $F_{c}$ , $R M S E = 1.8609$ , $M A E = 0.9311$ , and $R^{2} = 0.9280$ , and for $R o n_{t}$ , $R M S E = 21.3084$ , $M A E = 17.8053$ , and $R^{2} = 0.6562$ . These results were much superior to the other methods. Multi-objective evolutionary optimization was performed considering the extreme gradient boosting models for roughness, total roundness, and cutting force, besides the deterministic model of the material removal rate. The NSGA-II method was selected considering the hypervolume metric. The pseudo-weight approach is used for select high trade-off solutions to be used in practical production scenarios.
Improving multi-objective evolutionary algorithms using Grammatical Evolution
2024, Swarm and Evolutionary Computation
Multi-objective evolutionary algorithms (MOEAs) have become an effective choice to solve multi-objective optimization problems (MOPs). However, it is well known that Pareto dominance-based MOEAs struggle in MOPs with four or more objective functions due to a lack of selection pressure in high dimensional spaces. The main choices for dealing with such problems are decomposition-based and indicator-based MOEAs. In this work, we propose the use of Grammatical Evolution (an evolutionary computation search technique) to generate functions that can improve decomposition-based and indicator-based MOEAs. Namely, we propose a methodology to generate new scalarizing functions, which are known to have a great impact in the performance of decomposition-based MOEAs and in some indicator-based MOEAs. Additionally, we propose another methodology to generate hypervolume approximations, since the hypervolume is a popular performance indicator used not only in indicator-based MOEAs but also to assess performance of MOEAs. Using our first methodology, we generate two new scalarizing functions and provide their corresponding experimental validation to show that they exhibit a competitive behavior when compared against some well-known scalarizing functions such as ASF, PBI and the Tchebycheff scalarizing function. Using our second methodology, we produce 4 different hypervolume approximations and compare their performance against the Monte Carlo method and against two other state-of-the-art hypervolume approximations. The experimental results show that our functions exhibit a good compromise in terms of quality and execution time.
An adaptive multi-objective joint optimization framework for marine hybrid energy storage system design considering energy management strategy
2023, Journal of Energy Storage
The electric propulsion ship with the hybrid energy storage system (HESS) has environmental friendliness and significant advantages in terms of low fuel consumption. Due to the high maneuverability and load fluctuation of vessels, marine HESS design problem is nonlinear and multi-objective. Aiming at HESS design problem with complex working conditions during vessel practical operation process, and since there is a strong coupling between HESS and power allocation, in this paper an adaptive multi-objective joint optimization framework (AMJOF) for HESS design considering energy management strategy (EMS) is proposed. A multi-objective joint optimization method is introduced into the framework to achieve strong working condition adaptability and reach the most suitable energy management for marine HESS design with the lowest investment cost and minimum battery degradation. In addition, based on adaptive segmentation and frequency control, a novel EMS is proposed, which achieves efficient power allocation, and the evaluation indicator is introduced to quantify the performance of different optimization. The results indicate that the AMJOF can achieve excellent performance and find the optimal solution for operation of the ship.
Multi-objective Dwarf Mongoose Optimization Algorithm with Leader Guidance and Dominated Solution Evolution Mechanism
2024, Journal of Frontiers of Computer Science and Technology

View all citing articles on Scopus

View full text

Towards fast approximations for the hypervolume indicator for multi-objective optimization problems by Genetic Programming

Highlights

Abstract

Introduction

Section snippets

Background on multi-objective optimization and performance indicators

Computational problem statement

Methodology

Experiments and results

Conclusions and future work

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Comput. Oper. Res.

European J. Oper. Res.

Theoret. Comput. Sci.

Comput. Geom.

Inform. Sci.

Pattern Recognit. Lett.

Comput. Ind. Eng.

Image Vis. Comput.

J. Neurosci. Methods

J. Cleaner Prod.

Comput. Biol. Med.

Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach

IEEE Trans. Evol. Comput.

The hypervolume indicator revisited: On the design of Pareto-compliant indicators via weighted integration

On the complexity of computing the hypervolume indicator

IEEE Trans. Evol. Comput.

Comparison of hypervolume, IGD and IGD+ from the viewpoint of optimal distributions of solutions

HypE: An algorithm for fast hypervolume-based many-objective optimization

Evol. Comput.

Updating exclusive hypervolume contributions cheaply

An efficient algorithm for computing hypervolume contributions

Evol. Comput.

Computing hypervolume contributions in low dimensions: Asymptotically optimal algorithm and complexity results

Nonlinear Multiobjective Optimization: A Generalized Homotopy Approach, Vol. 135

Multiobjective Evolutionary Algorithms: Classifications, Analyses, and New InnovationsTech. Rep.

Solving multiobjective optimization problems using an artificial immune system

Genet. Program. Evol. Mach.

Using the averaged Hausdorff distance as a performance measure in evolutionary multiobjective optimization

IEEE Trans. Evol. Comput.

A (p,q)-averaged Hausdorff distance for arbitrary measurable sets

Math. Comput. Appl.

The averaged Hausdorff distances in multi-objective optimization: A review

Mathematics

On the properties of the R2 indicator

Evaluating the Quality of Approximations to the Non-Dominated Set

Quality assessment of Pareto set approximations

A weakly Pareto compliant quality indicator

Math. Comput. Appl.

Modified distance calculation in generational distance and inverted generational distance