Smooth fitting with a method for determining the regularization parameter under the genetic programming algorithm

https://doi.org/10.1016/S0020-0255(01)00084-6Get rights and content

Abstract

This paper deals with the smooth fitting problem under the genetic programming (GP) algorithm. To reduce the computational cost required for evaluating the fitness value of GP trees, numerical weights of GP trees are estimated by adopting both linear associative memories (LAM) and the Hook and Jeeves (HJ) method. The quality of smooth fitting is critically dependent on the choice of the regularization parameter. So, we present a novel method for choosing the regularization parameter. Two numerical examples are given with the comparison of generalized cross-validation (GCV) B-splines.

Introduction

We consider an univariate function approximation. The underlying model for approximations isy(t)=μ(t)+ε(t),where μ(t) is an unknown function, and ε(t) is noise such that E(ε(t))=0, and variance is σ2. y(t) is observed for t=t0,t1,…,tn(t0<t1<⋯<tn), and from this, a learning set L {(ti,yi)∣yiy(yi)}i=0,…,n is constructed. The goal is to find μ̄(t), which is the estimate of μ(t), using L. There may be a lot of methods for μ̄(t). For instance, polynomials, locally weighted regressions, splines, and neural networks can be used.

In the paper, we try to use genetic programming (GP) [1], [2] for seeking the good estimate of μ(t). To do this, two major problems should be resolved.

First, numerical weights attached to nodes of the GP tree should be estimated in a computationally efficient way. The choice of an optimal GP tree requires the estimation of numerical parameters or weights that are attached to some nodes of a GP tree [3], [4]. GP trees with the poor estimation of its numerical weights could earn a very low grade, and be readily excluded in the next evolving process, although they are potentially good candidates for μ(t). The original GP algorithm mainly focuses on dynamically modifying the structure of GP trees, and thus suffers from a lack of estimation techniques for numerical weights. Provided that nonlinear optimization methods are applied to estimating weights of the GP tree [3], [4], [5], [6], [7], [8], [9], it is a very time-consuming process since usually the population consists of several hundreds or even thousands of GP trees. The approach taken in this paper is to use both linear associative memories (LAM) [10], [11], [12], [13], [14], [15] and the Hook and Jeeves (HJ) search method [16], [17] in a combined manner. This allows to reduce significant amounts of the computational cost.

Second, since L is corrupted by noise, the fitness function should contain the regularization term for smooth fitting. Thus, the fitness function consists of two terms; one that takes account into how well the GP tree is fitted against L, and another one, called the regularization term that represents the degree of smoothness [18]. Here, the most important task is to select a proper regularization parameter enabling to attain a solution that is near the data given in L and, at the same time, is as smooth as possible. If the parameter is too small, the solution shows unwanted oscillations, and if the parameter is too large, the solution shows oversmooths. Since, most popular parameter selection methods [19], [20], [21], [22], [23], [24] pose difficulties in being used under the GP algorithm with the estimation of numerical weight, so we have devised a simple heuristic method. This method is very computationally efficient, and sufficient for selecting good GP trees. As far as we know, our paper is the first reported one that concerns with smooth fitting with the choice of the adequate regularization parameter in the GP algorithm.

Numerical examples with the comparison of generalized cross-validation (GCV) B-spline [20], [25] are given in this paper. The results show that the GP tree outperforms GCV B-spline in most cases. Especially, the estimation of differentiation by the GP tree is far better than that of GCV B-spline.

The paper is organized as follows. In Section 2, we discuss the regularized fitness function and the efficient way of estimating numerical weights of GP trees. Also, in this section, the appropriate function set used in generating GP trees is considered. The method for choosing the regularization parameter is presented in Section 3. Section 4 contains brief descriptions for the overall framework of smooth fitting in the GP algorithm. Numerical examples are given in Section 5. Finally, Section 6 summarizes the results of the paper and offers concluding remarks.

Section snippets

Genetic programming for smooth fitting of noisy data

GP [1], [2], that is an extension of genetic algorithms, deals with tree structures representing computer programs as individuals. Here, computer program refers to a GP tree, a candidate model for μ(t) in this paper. The GP tree is generated as the combination of functions and terminals, which are defined in a function set and a terminal set, respectively. The structure of GP tree is dynamically modified by genetic operators in order to minimize its fitness function value during the evolving

The choice of the regularization parameter

There are various methods for selecting λ. These include discrepancy principle (DP) [19], cross-validation (CV), the composite residual and smoothing operator (CRESO) [21], the L-curve method [22], [23], and the zero-crossing method (ZC) [24]. The DP method demands for knowledge of the noise variance σ2. This is a major disadvantage for the practical usage. CV, typically leave-one-out CV, does not require σ2, but the estimation of CV errors is prohibitively expensive even if the size of

Overall framework

The overall process for smooth fitting under the GP algorithm is shortly described as follows.

(a) Once a population is randomly created or a new population is generated from the previous generation by applying genetic operators, weights of all trees are initialized as 1 so as to become standard GP trees introduced by Koza [1]. For each tree in the population, add the random number, whose size is less than 0.5, to whole weights of the tree, and then start to estimate weights by applying LAM. In

Numerical results

In this section, two examples are presented; the bell-shaped function, the function having two peaks. Also, for the purpose of comparison, the results of GCV B-spline [20] are given. We have determined the degree of splines as 5, because in many cases quintic splines give the best results. Also, the most well-known GCVSPL package [25], which adopts GCV B-spline, is used throughout this paper.

The learning set is in the form of {(ti,μ(ti)+εi)∣ti=a+i(ba)/n}i=0,…,n, where a and b are the starting

Conclusion

In this paper, we have proposed a smooth fitting method based on GP. Key elements for the success are the fast estimation of weights of GP trees, and the proper choice of the regularization parameter. For estimating weights, we have intoduced LAMs used in roughly estimating weights of whole trees in a population with the greatly reduced computational cost, and the HJ method with the regularization process used in seeking more accurate weights of trees that are potentially good candidates of μ̄

Acknowledgements

This work is supported in part by LG CalTex Oil Company and Research Institute of Marine Systems Engineering (RIMSE) of Seoul National University.

References (25)

  • T. Weinbrenner, Genetic programming techniques applied to measurement data, Diploma Thesis, Department of Electronics...
  • Y.S. Yang, Y.S. Yeun, Design knowledge acquisition using genetic programming for the midship section design, in: Proc....
  • Cited by (10)

    View all citing articles on Scopus
    4

    URL: http://road.daejin.ac.kr/∼yeonyun

    1

    Tel.: +82-42-868-7239.

    2

    Tel.: +82-2-880-7338; fax: +82-2-888-9298.

    3

    Tel.: +82-2-880-7330; fax: +82-2-888-9298.

    5

    URL: http://insdel.snu.ac.kr

    View full text