Identification of response surface models using genetic programming

https://doi.org/10.1016/j.ymssp.2005.12.003Get rights and content

Abstract

There is a move in modern research in Structural Dynamics towards analysing the inherent uncertainty in a given problem. This may be quantifying or fusing uncertainty models, or can be propagation of uncertainty through a system or calculation. If the system of interest is represented by, e.g. a large Finite Element (FE) model the large number of computations involved can rule out many approaches due to the expense of carrying out many runs. One way of circumnavigating this problem is to replace the true system by an approximate surrogate/replacement model, which is fast-running compared to the original. In traditional approaches using response surfaces a simple least-squares multinomial model is often adopted. The objective of this paper is to extend the class of possible models considerably by carrying out a general symbolic regression using a Genetic Programming approach. The approach is demonstrated on both univariate and multivariate problems with both computational and experimental data.

Introduction

The design and optimisation of structures and systems has become increasingly sophisticated and challenging in recent years by having to face more inherent uncertainties. Alongside Finite Element Methods (FEM), there are many other packages that help in reducing the modelling workload. However, in terms of understanding the effects of uncertainty the computational effort for running many analyses remains non-trivial and this raises serious problems for uncertainty propagation methods which rely on sampling from the population of parameters like Monte-Carlo methods. In addition to this, the query-and-response procedure is often by trial and error and the researcher may never fully understand the input–output relationship and therefore never be able to identify the best setting for input values.

The basic approach to this problem is to develop an approximation of the analyses that is more efficient to execute, and to obtain more ‘insight’ into the relationship between the input and output. This approximate/surrogate model of the original is also sometimes called a metamodel.

The usual methods for creation of a metamodel involve choosing a function or basis of functions in order to represent the data in the most appropriate way. Whether this is linear regression, quadratic or even a Fourier series, all require some form of background knowledge of the experimental/simulation data that one is attempting to fit. Once the proper function is chosen, one proceeds to a series of analyses which determine the ‘best’ values for the free parameters of the model.

A traditional metamodelling approach often involves using a multivariate response surface model. A general response surface method would be first to screen the data to reduce the set of design variables that are most influential to the output. Then with the chosen function, one would start to perform a general fitting, followed by some refining procedures. If the researcher decided that such a function does not give a promising fit, he/she may then consider the option of using different functions.

The research in traditional metamodelling reached its mature state some time ago, but there are still some known problems, such as difficulties with the polynomial basis functions, problems in trying to fit a highly non-linear model, explosion of coefficients, etc.

In short, the creation of the surrogate model requires both the discovery of the correct function that fits the data and the appropriate numeric coefficients of the function. The problem can be seen as finding a proper function, in symbolic form, which fits a given finite sample of data, or symbolic regression [1].

An alternative approach to the problem is to adopt an evolutionary algorithm, such as genetic programming (GP). Researchers and software developers apply computers to a wide variety of design problems. Rather than spending time and effort to choose the proper function for every new problem, a preferable choice would be to use an algorithm proven through extensive trials to be reusable and robust [2]. Genetic programming is a subclass of evolutionary computation techniques but unlike the other evolutionary algorithms that evolve numerical values; it evolves functions as a solution for a given problem.

In this work, the genetic programming was developed using the Java programming language [3] and applied to fit both computational and experimental data. The computational data were obtained according to the theoretical formulae defining the structural modulus and Poisson's ratio of the hexagonal honeycomb [4]. The experimental data was from Los Alamos National Laboratory, where a 9 DOF mass-spring system was studied for damage detection purposes [5]. (These examples are rather arbitrary, they were chosen simply as data sets of appropriate complexity to illustrate the effectiveness of the Genetic Program for fitting response surfaces via symbolic regression.) The results also showed that Genetic Programming could identify the most influential design variables with respect to the output.

Section snippets

Theory of genetic programming

Genetic Programming (GP) shares its dominant concept with the more well-known Genetic Algorithm (GA) in that it allows a population of solutions to a given problem to evolve under a model of Darwinian natural selection and iteratively improve in search of the optimum. However, GP increases the functionality (and complexity) of the algorithm by also allowing the structure of the solution to undergo adaptation. The structure of an individual in a GP is typically a hierarchical computer program or

Surface fitting

The Java coded GP algorithm was first tested on a computational model derived from a classical honeycomb structural modulus and Poisson's ratio [4]; 171 data points were generated from Eqs. (1), (2) below. The purpose of this example is to justify the concept of the GP in representing the data points. As the variables are a function of only two input parameters, they can be represented as two-dimensional surfaces for ease of visualisation. All the data were used for the surface fitting, i.e.

Modelling of data from an experimental structure

The following modelling is based on experimental data obtained from Los Alamos National Laboratory from a 9 degree-of-freedom mass-spring system. (Although it is not particularly relevant to the current study, the structure was intended for an investigation of damage simulation of an A340 airbus, it is clearly highly simplified.). The inputs are 9 normalised torsional and bending stiffnesses taking values from –1, 0 and +1. The system is ‘damaged’ by having the nominal stiffness (0) varied. The

Results for natural frequency

The above data were tested 10 times (i.e. with different randomly generated initial populations) at each different combination of genetic operators, with population of 300 and 2000 generations. A smaller number of generations than in the simple honeycomb example was used because of the considerable overhead on training induced by cross-validation. The best validation fitnesses for the 10 simulations are selected and presented in Fig. 7. The best combination of all was determined to have a

Conclusions

The general conclusion is that Genetic Programming appears to provide a viable alternative to more traditional structures for the purpose of metamodelling. This study shows that the GP can outperform neural network models and polynomials (e.g. the natural frequency model) while having the positive advantage that the structure of the model is determined. There is also evidence that the GP also determines the relevance of the inputs. From Fig. 7, Fig. 9, there is an indication that the GP

References (15)

  • J.R. Koza, Genetic Programming on the Programming of Computers by Means of Natural Selection, Seventh Printing, MIT...
  • P. Bentley, An introduction to evolutionary design by computers, in: Evolutionary Design by Computers, Morgan Kaufmann...
  • D. Flanagan

    JAVA In a Nutshell A Desktop Quick Reference

    (2002)
  • L.J. Gibson et al.

    Cellular Solids Structure and Properties

    (1997)
  • A.L. Cundy, Use of response surface metamodels in damage identification of dynamic structures, M.Sc. Thesis in...
  • L. Panait, S. Luke, Alternative Bloat Control Methods, GECCO 2004, Lecture Notes in Computer Science, vol. 3103,...
  • T. Soule et al.

    An analysis of the causes of code growth in genetic programming

    Genetic Programming and Evolvable Machines

    (2002)
There are more references available in the full text version of this article.

Cited by (41)

  • Dynamic parameter inverse analysis of concrete dams based on Jaya algorithm with Gaussian processes surrogate model

    2021, Advanced Engineering Informatics
    Citation Excerpt :

    The iterative parameter inversion method requires a considerable number of finite element calculations; hence, the calculation expense increases significantly. An efficient way to reduce the computational cost of inverse analysis is to replace the FE model with a surrogate model that is faster and has fewer parameters [34]. The response surface model is a commonly used surrogate model.

  • HYGP-MSAM based model for slewing bearing residual useful life prediction

    2019, Measurement: Journal of the International Measurement Confederation
  • A genetic programming approach based on Lévy flight applied to nonlinear identification of a poppet valve

    2014, Applied Mathematical Modelling
    Citation Excerpt :

    The identification of dynamic nonlinear systems is a hard task as demonstrated by the great effort devoted by researchers in the last decade. A lot of identification methods have been presented up to now [1–20], even if their performances seem to depend on the type of the unknown system and on the training data. This paper focuses the application of a proposed GP approach based on Lévy flight and OLS applied to NARX modeling.

  • Statistical modeling of frequency response function estimation for uncertainty quantification

    2013, Mechanical Systems and Signal Processing
    Citation Excerpt :

    Point-wise uncertainty intervals and uncertainty envelope function are developed by considering, respectively, the uncertainty at discrete frequency lines and continuous frequency range. Response surface and first-order reliability method (FORM) are also good surrogates to expedite the evaluation of original FE model [4,6]. A general symbolic regression using genetic programming is deployed in Ref. [6], in which the improvement has been demonstrated on both univariate and multivariate problems with both computational and experimental data.

  • Probabilistic uncertainty analysis of an FRF of a structure using a Gaussian process emulator

    2011, Mechanical Systems and Signal Processing
    Citation Excerpt :

    A metamodel may be statistical or non-statistical. Techniques used for building non-statistical metamodels include neural networks [9], support vector machines [10,11], and genetic programming [12]. For a statistical metamodel, any statistical regression technique may be used.

View all citing articles on Scopus
View full text