Knowledge discovery in multiobjective optimization problems in engineering via Genetic Programming

doi:10.1016/j.eswa.2017.12.008

Expert Systems with Applications

Volume 99, 1 June 2018, Pages 93-102

https://doi.org/10.1016/j.eswa.2017.12.008 Get rights and content

Highlights

•
A Genetic Programming approach is proposed for knowledge discovery in Innovization.
•
An alternative solution for the treatment of consistency of units is proposed.
•
An external file is used to maintain all potential design principles of interest.
•
A procedure to avoid obtaining trivial solutions is also included.
•
Results indicate that the proposals contribute to the discovery of new solutions.

Abstract

Recently, the interest of the researchers has grown in post-optimality analyses, with the search for intrinsic properties of the optimal solutions of a given problem. Innovization has been defined as a process of knowledge discovery, in the form of mathematical relationships between variables, objectives, constraints, and parameters, from the output of an optimization problem. Genetic Programming (GP) is a bio-inspired metaheuristic capable of automatically evolving programs that can be used in this process. In spite of its wide applicability, GP techniques can present some issues when tackling knowledge discovery problems. Here, three modifications are proposed in a GP technique available in the literature for Innovization problems: (i) a method to ensure the consistency of the units of the principles using protected operations that ignore invalid terms, (ii) a strategy to avoid trivial solutions, and (iii) the use of an external archive to store the solutions of interest found during the search. Computational experiments are presented using four engineering case studies (namely, a two-member-truss, a welded beam, the cutting of a metal part, and a composite gear) to verify the capacity of the proposed GP method in finding design principles.

Introduction

Optimization involves the study and the use of methods to determine the parameters that lead to the best solutions according to the objectives of interest. A problem is classified as multiobjective when it presents multiple and conflicting objectives which must be simultaneously optimized. According to Deb and Srinivasan (2006), in engineering one can expect some similarities among solutions of an optimization problem, related to their optimum conditions. Nowadays, the interest of the researchers has grown in post-optimality studies for the search of intrinsic properties of the optimal solutions of a given optimization problem.

The term Innovization (innovation through optimization) was proposed in Deb and Srinivasan (2006) to refer to a process of knowledge discovery from the output of an optimization problem, in the form of mathematical relations between variables, objectives, constraints, and parameters, that could be thought of as rules of thumb for creating optimal designs. Such relations, or design principles, provide (i) the discovery of promising regions of the search space, (ii) the creation of new solutions without running a new optimization process, and (iii) a deeper understanding of the problem.

The idea of finding knowledge about the solutions can be applied to single objective problems. However, the plurality of solutions usually obtained when solving multiobjective problems provide more data to search for properties that may reveal novel characteristics about the problem (Deb & Srinivasan, 2006).

The Innovization process was extended in Bandaru and Deb (2010), Bandaru, Aslam, Ng, and Deb (2015); Bandaru and Deb (2013). In the automated Innovation (Bandaru & Deb, 2010) a Genetic Algorithm (GA) is used to search for design principles that may reveal characteristics of the solutions present along the Pareto Front. The search space is formed by the decision variables, objectives, constraints, and additional functions that can be indicated by the user.

Several other studies, based on machine learning methods, visualization techniques, or even analytical methods, were carried out to reveal information in multiobjective problems.

Kohonen self organizing maps (SOMs) (Kohonen, 1990) were used by Chiba, Imamura, Amemiya, Jeong, and Yamamoto (2006); Doncieux and Hamdaoui (2011); Obayashi and Sasaki (2003) to extract information from Pareto-Optimal Solutions. SOMs are a type of unsupervised recurrent neural network capable of spatially separating multidimensional data in groups with similar characteristics, keeping the most related groups close to each other. Obayashi and Sasaki (2003) used SOMs to search for patterns in supersonic aircraft fusion designs. SOMs were also used by Doncieux and Hamdaoui (2011) to identify patterns that affect the velocity in the design of an ornithopter’s wing.

A visualization based technique was proposed by Pryke, Mostaghim, and Nazemi (2007), where heatmaps were used to visualize the decision and objective spaces simultaneously, making it possible to identify correlations between them. An advantage of this method is that it can be applied to problems with more than three objectives, a limitation usually observed in visualization techniques.

Ulrich, Brockhoff, and Zitzler (2008) proposed the use of dendrograms to cluster non-dominated solutions for the discovery of design principles. The technique was successfully applied to the knapsack problem and also to the design of an embedded processor.

In order to extract knowledge from non-dominated solutions, (Kudo & Yoshikawa, 2012) proposed the use of the Isomap visualization method, a nonlinear dimensionality reduction technique based on the geodetic distance. The proposed method calculates the geodetic distance among objectives and decision variables.

Ulrich (2013) proposed a bi-objective formulation to the problem of finding correlations between decision variables and the objective space, along with an algorithm capable of solving it, called Pareto-front Analyzer (PAN).

A search method called Multiobjective Robust Design Exploration was proposed by Sugimura, Jeong, Obayashi, and Kimura (2009), which consists of a multiobjective optimization followed by the analysis of the solutions using association rules, searching for correlations between decision variables and objective values. Association rules are expressions of the form “if-then” which can be used to infer causality.

An extensive review of techniques and applications regarding data mining in multiobjective problems can be found in Bandaru, Ng, and Deb (2017a); 2017b).

One step towards discovering more general design principles was the adoption of Genetic Programming (GP) (Bandaru & Deb, 2013), a technique widely used for the evolution of programs, such as symbolic expressions and classifier models.

A modification was necessary to use GP as the search algorithm: besides finding principles that reflect the similarities present in the Pareto Front, the models found must correctly relate the decision variables with respect to the basic units involved. Thus, (Bandaru & Deb, 2013) proposed the use of a GP in which the consistency of the units is verified during the search process, in order to encourage the generation of valid candidate solutions. The approach chosen by the authors was penalization, which assigns an arbitrarily large penalty value to a candidate solution that presents inconsistent physical units.

Here, an alternative solution for enforcing the consistency of units is proposed, which involves performing protected operations that ignore the invalid terms of the expressions. The use of protected operations is commonly adopted in GP to handle the execution of arithmetic operations involving invalid values. For example, it is common to use a protected division operation, that returns a pre-defined value when the denominator is equal to zero.

In order to obtain more diverse solutions, the introduction of an external archive is proposed here to maintain all potential design principles of interest, which could be otherwise lost throughout the search process.

It has also been observed that some design principles can be generated which, given their simplicity, do not add knowledge about the problem, being therefore irrelevant. Such solutions are called here trivial solutions. A procedure is proposed to avoid the presence of this type of principle in the population.

The techniques proposed here are applied to four case studies in engineering that have already been studied in the Innovization literature. The problems involve the designs of: a two-member-truss, a welded beam, the cutting of a metal bar, and a composite gear.

Section snippets

Innovization

An automated Innovization process was proposed in Bandaru and Deb (2010) that consists of using a Genetic Algorithm (GA) in the search for invariant properties, that is, design principles that may reveal characteristics present in the Pareto-optimal solutions. Those principles can be expressed as symbolic expressions containing decision variables, objectives and constraints. Thus, it is desired to find functions of the form $Ψ_{i} (x, f (x), g (x)) = c_{i},$ where f(x) are the objective functions, g(x) are

Proposed methods

Here an alternative solution is proposed to handle the consistency among units, in which protected operations are created to ignore the invalid terms of the models. Thereby, invalid models are not eliminated via penalty, but repaired. It is also proposed a strategy to avoid obtaining trivial solutions when a GP technique is used. Finally, it is proposed to use an external archive to maintain the solutions of interest found during the search process.

Computational experiments

Computational experiments were carried out to evaluate if the methods proposed here contribute to finding more diverse design principles. For each multiobjective problem, 30 independent runs were performed using ¹ the NSGA-II algorithm (Deb, Pratap, Agarwal, & Meyarivan, 2002). The Pareto Fronts obtained in each run of a given problem are joined and a final Pareto Front is then generated containing all

Concluding remarks

In this work three modifications were proposed for the Innovization process based on Genetic Programming: (i) an alternative way to promote consistency in the use of units, (ii) the use of an external archive to maintain the promising solutions, and (iii) a procedure to avoid obtaining trivial, and therefore irrelevant, solutions.

From the results obtained in four case studies in engineering it can be concluded that (i) and (iii) contribute to obtaining a larger number of good solutions and

Acknowledgements

The authors thank the support of CAPES, CNPq (grant 310778/2013-1), FAPEMIG (grant APQ-03414-15), and PPGMC/UFJF.

References (21)

S. Bandaru et al.
Generalized higher-level automated innovization with application to inventory management
European Journal of Operational Research
(2015)
S. Bandaru et al.
Data mining methods for knowledge discovery in multi-objective optimization: Part a - survey
Expert Systems with Applications
(2017)
K. Chiba et al.
Design exploration of shielding effect for aircraft engine noise
Eccomas cfd 2006: Proceedings of the european conference on computational fluid dynamics, egmond aan zee, the netherlands, september 5–8, 2006
(2006)
S. Bandaru et al.
Automated discovery of vital knowledge from Pareto-optimal solutions: First results from engineering design
Ieee congress on evolutionary computation
(2010)
S. Bandaru et al.
Towards automating the discovery of certain innovative design principles through a clustering-based optimization technique
Engineering Optimization
(2011)
S. Bandaru et al.
A dimensionally-aware genetic programming architecture for automated innovization
International conference on evolutionary multi-criterion optimization
(2013)
S. Bandaru et al.
Data mining methods for knowledge discovery in multi-objective optimization: Part b - new developments and applications
Expert Systems with Applications
(2017)
K. Deb
Optimal design of a welded beam via genetic algorithms
AIAA Journal
(1991)
K. Deb et al.
A fast and elitist multiobjective genetic algorithm: nsga-ii
IEEE Transactions on Evolutionary Computation
(2002)
K. Deb et al.
Innovization: innovating design principles through optimization

There are more references available in the full text version of this article.

Cited by (9)

Discovering generalized design knowledge using a multi-objective evolutionary algorithm with generalization operators
2020, Expert Systems with Applications
Citation Excerpt :
Logical rules are easy to understand, as they can easily be used for building mental models of the knowledge they convey. Popular data mining methods to extract design rules include decision trees (Graening et al., 2008; Jahr, Calborean, Vintan, & Ungerer, 2012; Yan, Qiao, Simpson, Li, & Zhang, 2012), association rule mining (ARM) (Bandaru, Ng, & Deb, 2017; Ng, Bandaru, & Frantzén, 2016; Watanabe, Chiba, & Kanazaki, 2014), and evolutionary algorithms (EA) (Bandaru & Deb, 2010; Russo, Bernardino, & Barbosa, 2018). These methods can be used to identify design features that are common among good (e.g., non-dominated) designs.
The early-phase design of complex systems is a challenging task, as a decision maker has to take into account the intricate relationships among different design variables. A popular way to help decision makers easily identify important design features is to use data mining. However, many of the existing algorithms output design features that are too complex (e.g., conjunction of many literals with unrelated predicates), making it difficult for a user to understand, remember, and apply these features to find better designs. In this paper, we introduce a new data mining method that extracts compact design features through knowledge generalization. The proposed method performs a search over the space of features using a multi-objective evolutionary algorithm that contains a set of generalization operators in addition to conventional evolutionary operators. Both variables and feature types are generalized by using an ontology defining a set of domain-specific concepts and relationships. Generalization leads to more compact and insightful features, as generalized knowledge encompasses wider concepts. A comparative experiment is conducted on a real-world system architecting problem to demonstrate the gain in compactness of the extracted features without significant reductions in predictive power.
Self-adaptive MRPBIL-DE for 6D robot multiobjective trajectory planning
2019, Expert Systems with Applications
Citation Excerpt :
From literature, it was found that most of research work are focusing on single objective optimisation while studies on multiobjective optimisation for such a robot design problem have been rarely presented. Over the last decade, numerous MOMHs have been proposed (Gao, Zhou, Li, Pan, & Yi, 2015; Hidalgo-Paniagua, Vega-Rodríguez, & Ferruz, 2016; Liu, Zhang, He, & Jiang, 2018; Nuaekaew, Artrit, Pholdee, & Bureerat, 2017; Onan, Korukoğlu, & Bulut, 2016; Pholdee & Bureerat, 2013a,b; Qingfu & Hui, 2007; Robič & Filipič, 2005; Russo, Bernardino, & Barbosa, 2018; Wang, Jiao, & Yao, 2015; Wang, Purshouse, & Fleming, 2013; Zareizadeh, Helfroush, Rahideh, & Kazemi, 2018; Zhang, Tian, & Jin, 2015). Some of effective and efficient MOMHs are an improved two-archive algorithm (Two_Arch2) (Wang et al., 2015), a preference-inspired co-evolutionary algorithm using goal vectors (PICEA-g), a knee point driven evolutionary algorithm for many-objective optimization (KnEA) (Zhang et al., 2015), an unrestricted population size evolutionary multiobjective optimisation algorithm (UPS-EMOA) (Aittokoski & Miettinen, 2010), hybridisation of real-code population-based incremental learning and differential evolution (MRPBIL-DE) (Pholdee & Bureerat, 2013b).
This work presents self-adaptive multiobjective real-code population-based incremental learning hybridised with differential evolution (MRPBIL-DE) for solving a 6D robot trajectory planning multiobjective optimisation problem. The objective functions are assigned to minimise travelling time and minimise maximum jerk taking place during motion while the constraints are velocity, acceleration and jerk constraints. A five order polynomial function is used to represent a motion equation while the motion path is divided into two sub-paths; from initial to intermediate positions and from intermediate to final positions. The optimiser is used to find a set of design variables including joint positions, velocities and accelerations at intermediate positions, moving time from the initial to intermediate positions, and that from the intermediate to final positions. Several multiobjective meta-heuristics (MOMHs) along with the proposed algorithm are used to solve the trajectory optimisation problem of robot manipulators while their performances are investigated. The results indicated that the proposed algorithm is effective and efficient for multiobjective robot trajectory planning optimisation problem. The results obtained from such a method are set as the baseline for further study of robot trajectory planning optimisation.
A novel Error-Correcting Output Codes algorithm based on genetic programming
2019, Swarm and Evolutionary Computation
Citation Excerpt :
GP is a widely deployed evolutionary algorithm, proposed by Koza [43]. Up to now, it has been successfully applied to different optimization problems in diverse fields, such as interpreting reinforcement learning policies [44] and different types of knowledge discovery [45,46] by treating individuals as symbolic expressions. And some GP based learning algorithms treated individuals as learners, so as to applied to rainfall prediction [47], building ensemble learning systems [48] and outlier elimination [49].
Error-Correcting Output Codes (ECOC) is widely used in the field of multiclass classification. As an optimal codematrix is key to the performance of an ECOC algorithm, this paper proposes a genetic programming (GP) based ECOC algorithm (GP-ECOC). In the design of individual of our GP, each terminal node represents a class, and nonterminal nodes combine the classes in their child nodes. In this way, an individual is a class combination tree, and represents an ECOC codematrix. A legality checking process is embedded in our algorithm to check each codematrix, so as to ensure each codematrix satisfying ECOC constraints. Those violating the constraints will be corrected by a proposed Guided Mutation operator. Before fitness evaluation, a local optimization algorithm is proposed to append new columns for tough classes, so as to improve the generalization ability of each individual and accelerate the evolutionary speed. In this way, our GP can evolve optimal codematrices through the evolutionary process. Experiments show that compared with other ensemble algorithms, our algorithm can achieve stable and high performances with relatively small ensemble scales on various UCI data sets. To the best of our knowledge, it is the first time that GP has been applied to implement the ECOC encoding algorithm. Our Python code is available at https://github.com/samuellees/gpecoc.
Heat production optimization using bio-inspired algorithms
2018, Engineering Applications of Artificial Intelligence
Citation Excerpt :
While Dubey et al. (2018) presented a comparative overview of recent advances in using bio-inspired methods for managing wind power dispatch. Evolutionary computing is also reported to be efficient in computer aided detection (Morra et al., 2018) and multi-objective problems of engineering systems (Russo et al., 2018). Meanwhile in Liao et al. (2018) bio-inspired methods were used to minimize the power consumption for an application of luminance control.
Energy efficiency of industrial systems is one of key features for optimal use of resources and the lowest costs of energy for users. In the recent time optimization of heating plants and heat distribution systems becomes an important venue for novel methods and innovative constructions. Various proposals can be seen for more efficient performance of heating systems in changing weather conditions.
In this article results of using bio-inspired methods for intensification of the district heating plant to work with maximum efficiency at the lowest costs are presented. The research is focused on developing bio-inspired approaches for a mathematical model of a district heating plant in various weather conditions. The research model represents a sample district heating plant, in which circulation of hot water is performed in two heat exchangers supplied by controlled pumps. The system was calibrated with the use of proposed Polar Bear Optimization and the results were compared to one of best known heuristics, Particle Swarm Optimization. An objective function describing the operation of the plant was developed and found applicable for proposed bio-inspired approach. The research results have shown that proposed methodology is efficient for all simulated weather conditions and various boundary conditions. Comparison the obtained results with non-optimal parameters confirms huge profits from applying right settings of the system.
Black Hole Mechanics Optimization: a novel meta-heuristic algorithm
2020, Asian Journal of Civil Engineering
Global Optimization Using Mixed Integer Quadratic Programming On Non-Convex Two-Way Interaction Truncated Linear Multivariate Adaptive Regression Splines
2020, arXiv

View all citing articles on Scopus

View full text

Knowledge discovery in multiobjective optimization problems in engineering via Genetic Programming

Highlights

Abstract

Introduction

Section snippets

Innovization

Proposed methods

Computational experiments

Concluding remarks

Acknowledgements

European Journal of Operational Research

Expert Systems with Applications

Automated discovery of vital knowledge from Pareto-optimal solutions: First results from engineering design

Ieee congress on evolutionary computation

Towards automating the discovery of certain innovative design principles through a clustering-based optimization technique

Engineering Optimization

A dimensionally-aware genetic programming architecture for automated innovization

International conference on evolutionary multi-criterion optimization

Data mining methods for knowledge discovery in multi-objective optimization: Part b - new developments and applications

Expert Systems with Applications

Optimal design of a welded beam via genetic algorithms

AIAA Journal

A fast and elitist multiobjective genetic algorithm: nsga-ii

IEEE Transactions on Evolutionary Computation

Innovization: innovating design principles through optimization