Smooth fitting with a method for determining the regularization parameter under the genetic programming algorithm

doi:10.1016/S0020-0255(01)00084-6

Information Sciences

Volume 133, Issues 3–4, April 2001, Pages 175-194

https://doi.org/10.1016/S0020-0255(01)00084-6

Abstract

This paper deals with the smooth fitting problem under the genetic programming (GP) algorithm. To reduce the computational cost of evaluating the fitness value of GP trees, the numerical weights of GP trees are estimated by combining linear associative memories (LAM) with the Hooke and Jeeves (HJ) method. The quality of smooth fitting depends critically on the choice of the regularization parameter, so we present a novel method for choosing it. Two numerical examples are given and compared with generalized cross-validation (GCV) B-splines.

Introduction

We consider a univariate function approximation. The underlying model is $y(t)=\mu(t)+\varepsilon(t)$, where μ(t) is an unknown function and ε(t) is noise with E(ε(t))=0 and variance σ². y(t) is observed at $t=t_{0},t_{1},\dots,t_{n}$ $(t_{0}<t_{1}<\dots<t_{n})$, and from these observations a learning set $L=\{(t_i,y_i)\mid y_i\equiv y(t_i)\}_{i=0,\dots,n}$ is constructed. The goal is to find $\bar{\mu}(t)$, the estimate of μ(t), using L. Many methods can produce $\bar{\mu}(t)$: for instance, polynomials, locally weighted regressions, splines, and neural networks.
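
As a minimal sketch of this setup, the following Python snippet builds a learning set L from a known test function. Several details are assumptions for illustration only: the noise is drawn from a Gaussian distribution (the model only requires zero mean and variance σ²), the sample points are equally spaced, and the names `make_learning_set` and `bell` are hypothetical.

```python
import math
import random


def make_learning_set(mu, a, b, n, sigma, seed=0):
    """Return [(t_i, y_i)] for i = 0..n, where y_i = mu(t_i) + eps_i."""
    rng = random.Random(seed)            # fixed seed so the sketch is repeatable
    learning_set = []
    for i in range(n + 1):
        t = a + i * (b - a) / n          # equally spaced, so t_0 < t_1 < ... < t_n
        eps = rng.gauss(0.0, sigma)      # assumed Gaussian: mean 0, variance sigma**2
        learning_set.append((t, mu(t) + eps))
    return learning_set


def bell(t):
    # Hypothetical bell-shaped stand-in for the unknown mu(t).
    return math.exp(-t * t)


L = make_learning_set(bell, a=-3.0, b=3.0, n=60, sigma=0.05)
```

Any estimator of $\bar{\mu}(t)$ would then be fitted to `L` alone, without access to `bell`.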

In this paper, we use genetic programming (GP) [1], [2] to seek a good estimate of μ(t). To do this, two major problems must be resolved.

First, the numerical weights attached to nodes of the GP tree should be estimated in a computationally efficient way. The choice of an optimal GP tree requires the estimation of numerical parameters, or weights, that are attached to some nodes of a GP tree [3], [4]. A GP tree whose numerical weights are poorly estimated could earn a very low grade and be readily excluded from the next generation, even though it is a potentially good candidate for μ(t). The original GP algorithm mainly focuses on dynamically modifying the structure of GP trees, and thus lacks estimation techniques for numerical weights. If nonlinear optimization methods are applied to estimate the weights of the GP tree [3], [4], [5], [6], [7], [8], [9], the process is very time-consuming, since the population usually consists of several hundred or even thousands of GP trees. The approach taken in this paper is to combine linear associative memories (LAM) [10], [11], [12], [13], [14], [15] with the Hooke and Jeeves (HJ) search method [16], [17]. This significantly reduces the computational cost.
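
For readers unfamiliar with the HJ method, here is a minimal sketch of the textbook Hooke and Jeeves pattern search, a derivative-free minimizer that alternates exploratory moves along each coordinate with pattern moves along the improving direction. It is not the variant used in the paper, and it does not include the LAM stage; the function name, the default step sizes, and the toy loss are assumptions for illustration.

```python
def hooke_jeeves(f, x0, step=0.5, shrink=0.5, tol=1e-6, max_iter=10_000):
    """Minimize f(x) over a list of floats by Hooke-Jeeves pattern search."""

    def explore(start, f_start, h):
        # Exploratory move: try +h and -h on each coordinate, keep improvements.
        x, fx = list(start), f_start
        for i in range(len(x)):
            for delta in (h, -h):
                trial = list(x)
                trial[i] += delta
                f_trial = f(trial)
                if f_trial < fx:
                    x, fx = trial, f_trial
                    break
        return x, fx

    base, f_base = list(x0), f(list(x0))
    h = step
    for _ in range(max_iter):
        if h < tol:
            break
        x, fx = explore(base, f_base, h)
        if fx < f_base:
            # Pattern move: jump along base -> x and explore around the landing
            # point, repeating for as long as the result keeps improving.
            while True:
                pattern = [2.0 * xi - bi for xi, bi in zip(x, base)]
                base, f_base = x, fx
                x, fx = explore(pattern, f(pattern), h)
                if fx >= f_base:
                    break
        else:
            h *= shrink                  # no improvement: refine the step size
    return base, f_base


def loss(w):
    # Hypothetical two-weight loss with its minimum at w = (1, -2).
    return (w[0] - 1.0) ** 2 + (w[1] + 2.0) ** 2


weights, value = hooke_jeeves(loss, [0.0, 0.0])
```

Because every call evaluates `f` many times, running such a search on each of several hundred trees is costly, which is the motivation given above for obtaining rough weights cheaply first.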

Second, since L is corrupted by noise, the fitness function should contain a regularization term for smooth fitting. Thus, the fitness function consists of two terms: one that accounts for how well the GP tree fits L, and another, called the regularization term, that represents the degree of smoothness [18]. Here, the most important task is to select a regularization parameter that yields a solution that is close to the data in L and, at the same time, as smooth as possible. If the parameter is too small, the solution shows unwanted oscillations, and if the parameter is too large, the solution is oversmoothed. Since the most popular parameter selection methods [19], [20], [21], [22], [23], [24] are difficult to use under the GP algorithm with the estimation of numerical weights, we have devised a simple heuristic method. This method is computationally very efficient and sufficient for selecting good GP trees. As far as we know, this paper is the first to address smooth fitting with the choice of an adequate regularization parameter in the GP algorithm.
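
The two-term structure can be illustrated with a short sketch. The exact regularization term used in the paper is not given in this excerpt, so the code below assumes a common choice: a squared second-derivative penalty, approximated by finite differences on equally spaced points and weighted by a parameter `lam` (standing in for λ). All names are hypothetical.

```python
import math


def regularized_fitness(model, learning_set, lam):
    """Sum of squared residuals plus lam times an approximate roughness."""
    ts = [t for t, _ in learning_set]
    ys = [y for _, y in learning_set]
    fitted = [model(t) for t in ts]

    # Data-fit term: how closely the model follows the learning set.
    residual = sum((y - f) ** 2 for y, f in zip(ys, fitted))

    # Smoothness term: approximate integral of the squared second derivative,
    # using central differences (assumes equally spaced t values).
    roughness = 0.0
    for i in range(1, len(ts) - 1):
        h = ts[i + 1] - ts[i]
        d2 = (fitted[i + 1] - 2.0 * fitted[i] + fitted[i - 1]) / (h * h)
        roughness += d2 * d2 * h

    return residual + lam * roughness


# Toy data lying exactly on a straight line.
data = [(float(i), 2.0 * i + 1.0) for i in range(11)]


def line(t):
    return 2.0 * t + 1.0


def wiggly(t):
    return 2.0 * t + 1.0 + math.sin(3.0 * t)


# The line has zero roughness, so its fitness does not depend on lam;
# the oscillating model is penalized more heavily as lam grows.
line_fitness = regularized_fitness(line, data, lam=10.0)
wiggly_fitness = regularized_fitness(wiggly, data, lam=10.0)
```

With `lam = 0` the function reduces to plain least squares, which is the regime where unwanted oscillations go unpenalized.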

Numerical examples, with a comparison against generalized cross-validation (GCV) B-splines [20], [25], are given in this paper. The results show that the GP tree outperforms the GCV B-spline in most cases. In particular, the GP tree estimates derivatives far better than the GCV B-spline does.

The paper is organized as follows. In Section 2, we discuss the regularized fitness function and an efficient way of estimating the numerical weights of GP trees. The same section also considers the appropriate function set for generating GP trees. The method for choosing the regularization parameter is presented in Section 3. Section 4 contains a brief description of the overall framework of smooth fitting in the GP algorithm. Numerical examples are given in Section 5. Finally, Section 6 summarizes the results of the paper and offers concluding remarks.

Section snippets

Genetic programming for smooth fitting of noisy data

GP [1], [2], which is an extension of genetic algorithms, deals with tree structures representing computer programs as individuals. Here, a computer program refers to a GP tree, which is a candidate model for μ(t) in this paper. A GP tree is generated as a combination of functions and terminals, which are defined in a function set and a terminal set, respectively. The structure of a GP tree is dynamically modified by genetic operators in order to minimize its fitness function value during the evolving

The choice of the regularization parameter

There are various methods for selecting λ. These include the discrepancy principle (DP) [19], cross-validation (CV), the composite residual and smoothing operator (CRESO) [21], the L-curve method [22], [23], and the zero-crossing method (ZC) [24]. The DP method requires knowledge of the noise variance σ², which is a major disadvantage in practice. CV, typically leave-one-out CV, does not require σ², but the estimation of CV errors is prohibitively expensive even if the size of

Overall framework

The overall process for smooth fitting under the GP algorithm is briefly described as follows.

(a) Once a population is randomly created, or a new population is generated from the previous generation by applying genetic operators, the weights of all trees are initialized to 1 so that the trees become standard GP trees as introduced by Koza [1]. For each tree in the population, add a random number, whose magnitude is less than 0.5, to each weight of the tree, and then start estimating the weights by applying LAM. In

Numerical results

In this section, two examples are presented: a bell-shaped function and a function having two peaks. For comparison, the results of the GCV B-spline [20] are also given. We have set the degree of the splines to 5, because in many cases quintic splines give the best results. The well-known GCVSPL package [25], which implements the GCV B-spline, is used throughout this paper.

The learning set is in the form of $\{(t_i,\mu(t_i)+\varepsilon_i)\mid t_i=a+i(b-a)/n\}_{i=0,\dots,n}$, where a and b are the starting

Conclusion

In this paper, we have proposed a smooth fitting method based on GP. The key elements of its success are the fast estimation of the weights of GP trees and the proper choice of the regularization parameter. For estimating weights, we have introduced LAMs, which roughly estimate the weights of all trees in a population at greatly reduced computational cost, and the HJ method with the regularization process, which seeks more accurate weights for trees that are potentially good candidates for $\bar{\mu}$

Acknowledgements

This work is supported in part by LG CalTex Oil Company and Research Institute of Marine Systems Engineering (RIMSE) of Seoul National University.

References (25)

R.E. Kalaba et al., Linear and nonlinear associative memories for parameter estimation, Information Sciences (1992)
Y.S. Yeun et al., Function approximations by coupling neural networks and genetic programming trees with oblique decision trees, Artificial Intelligence in Engineering (1999)
H. Woltring, A Fortran package for generalized cross-validation spline smoothing and differentiation, Adv. Eng. Soft (1986)
J.R. Koza, Genetic Programming: On the Programming of Computers by Means of Natural Selection (1992)
J.R. Koza, Genetic Programming II: Automatic Discovery of Reusable Programs (1994)
K.C. Sharman, A.I. Esparcia-Alcazar, Y. Li, Evolving signal processing algorithms by genetic programming, in: Proc....
A.H. Watson, I.C. Parmee, Identification of fluid systems using genetic programming, in: Proceedings of the Fourth...
G.J. Gray et al., Structural system identification using genetic programming and a block diagram oriented simulation tool, Electronics Letters (1996)
K.D. Bettenhausen, P. Marenbach, S. Freyer, H. Rettenmaier, U. Nieken, Self-organizing structured modeling of a...
B. McKay, M.J. Willis, H.G. Hiden, G.A. Montague, G.W. Barton, Identification of industrial processes using genetic...
T. Weinbrenner, Genetic programming techniques applied to measurement data, Diploma Thesis, Department of Electronics...
Y.S. Yang, Y.S. Yeun, Design knowledge acquisition using genetic programming for the midship section design, in: Proc....


⁴: URL: http://road.daejin.ac.kr/~yeonyun

¹: Tel.: +82-42-868-7239.

²: Tel.: +82-2-880-7338; fax: +82-2-888-9298.

³: Tel.: +82-2-880-7330; fax: +82-2-888-9298.

⁵: URL: http://insdel.snu.ac.kr
