Abstract
This paper describes a technique for optimizing tree-structure data by means of a multi-objective evolutionary algorithm, that is, multi-objective genetic programming (GP). Bloat of the tree structure is one of the major problems of GP. Bloat occurs because the tree structures produced by the crossover operator grow larger and larger while their evaluation does not improve. To avoid the risk of bloat, a partial sampling operator is proposed as a mating operator. The size of the tree and a structural distance (SD) are introduced as objective functions in addition to the index of the goodness of the tree structure, so that GP is defined as a three-objective optimization problem. SD is also applied to the ranking of parent individuals instead of the crowding distance of the conventional NSGA-II. When there are two or more indices of the goodness of the tree-structure data, the number of objective functions in the above problem becomes four or more. We also propose an effective many-objective EA applicable to such many-objective GP. We focus on NSGA-II based on Pareto partial dominance (NSGA-II-PPD). NSGA-II-PPD requires, in advance, a combination list of the number of objective functions to be used for Pareto partial dominance (PPD). The contents of the combination list greatly influence the optimization result. We propose to schedule a parameter r, the subset size of objective functions for PPD, and to eliminate individuals created by the mating that have the same contents as an individual in the archive set.
Introduction
Genetic programming (GP) [17, 18] is an algorithm that optimizes structured data on the basis of an evolutionary algorithm (EA) [11, 25]. GP has been applied to various fields such as program synthesis [5], function generation [14] and rule set discovery [30]. Although GP is very effective for optimizing structured data, it has several problems: bloat, inadequate optimization of constant nodes, and a tendency to be trapped in a local optimum region when applied to complicated problems. This paper focuses on the optimization of tree-structure data by means of GP, where the main cause of bloat is the crossover operator, which exchanges partial trees of parent individuals [2, 3, 7, 27]. Several techniques that reduce bloat by improving the simple crossover operation have been proposed [6, 10, 13, 18, 19, 26]. Although these methods have inhibited bloat to a certain extent, they have not necessarily achieved an effective search. Moreover, there is no theoretical basis for the claim that crossover is effective for optimizing tree-structure data.
Apart from the reduction of bloat, a search method for optimizing graph structures has been proposed [15]. Although this method is suitable for searching structured data of various sizes consisting of nodes and branches, the algorithm is complicated and its computational cost is very high. In this paper, we exclude the crossover operator, which is the cause of bloat in GP, and propose a partial sampling (PS) operator [29] as a new mating operator. In the PS operator, a partial tree structure is first extracted from several individuals of the parent group by a procedure called proliferation. Next, the partial tree structure obtained by the proliferation is combined with a new tree structure by a procedure called metastasis. Two types of metastasis are prepared for GP: one that depends on the original upper node and one that does not. Repeating the proliferation and the metastasis generates new tree-structure data for the next generation.
Moreover, a multi-objective EA (MOEA) technique is applied to GP in order to suppress bloat and to acquire a wide variety of tree-structure data, by adding two more objective functions. One of the newly added objective functions is the size of the tree-structure data. The other is the relative position of the target individual in the population in terms of the structural distance (SD). The optimization of the tree-structure data is then formulated as a multi-objective optimization problem (MOP) based on these three objective functions, and NSGA-II [8, 9] is applied to this MOP. In the conventional NSGA-II, a crowding distance (CD) is used for ranking the front set that overflows from the parent group. Because the conventional NSGA-II sorts the individuals of the overflowing front set by CD and selects extreme solutions, the diversity of the tree structures is not maintained. In this paper, SD is used instead of CD for ranking the front set that overflows from the parent group.
The proposed technique and the conventional techniques are applied to a double spiral problem [6, 38] for verification. This is a classification problem in which two classes of point sets arranged in a spiral shape are to be classified by a function. It is well known as a problem that is difficult to solve with a neural network.
When there are two or more indices of the goodness of the tree-structure data, the number of objective functions in the above problem becomes four or more. We therefore also propose an effective many-objective EA (MaOEA) applicable to such many-objective GP. Many-objective optimization problems (MaOPs) are difficult to solve and have been tackled by many researchers [4, 8, 9, 39,40,41]. Although SPEA2 [39,40,41] and NSGA-II [8, 9] are well known as powerful algorithms for MOPs, they do not work so effectively for MaOPs [1, 12, 33]. In this paper, we handle the case of solving an MaOP by an NSGA-II-based algorithm.
When NSGA-II or SPEA2 is applied to an MaOP, most of the solutions in the solution set, or population, become neither superior nor inferior to each other as the number of objectives increases. This relation is called a nondominated (ND) relationship. As a result, the convergence of the obtained set of Pareto optimal solutions (\({\mathcal{POS}}\)) to the optimum Pareto front decreases remarkably. For such MaOPs, Sato et al. [34] have proposed the concept of Pareto partial dominance, which makes it easier to determine the superiority/inferiority relationship between solutions by using several objective functions instead of all of them. Since NSGA-II based on Pareto partial dominance (NSGA-II-PPD) focuses on a relatively small number of objectives, the superiority/inferiority of solutions is easy to decide even on an MaOP, and an effective selection pressure can be expected.
SPEA2 with a shift-based density estimation (SDE) strategy [20, 21, 23] is also a very strong algorithm for solving multi-objective optimization problems. However, this method requires a large computational cost to forcibly rank subsets of individuals in nondominated relationships. In addition, when the tree structure is optimized by SPEA2 with SDE, it has been difficult to suppress bloat. Therefore, this research focuses on CD, which is advantageous in terms of simplicity and low computational cost, and proposes SD for the purpose of suppressing bloat.
NSGA-II-PPD has the following three problems. The first problem is that a combination list of the number of objectives to be used for Pareto partial dominance must be specified before the optimization. The second is that the appropriate number of selected objectives for the complexity of a given problem is unknown. Moreover, the contents of the combination list greatly influence the optimization result. NSGA-II-PPD performs ND sorting using all objective functions at a specific generation cycle, and preserves the parents as an archive set for the next generation. In some cases, this process generates child individuals that have the same contents as an individual already existing in the archive set. As a result, identical individuals increase in the first front set, which disturbs effective ranking in the front selection. This is the third problem. In consideration of these problems, this paper proposes a simple technique for scheduling the partial objective set used for Pareto partial dominance and a technique for killing individuals that have the same contents when preserving the archive set [28]. In order to verify the effectiveness of the proposed techniques, we examine a many-objective 0/1 knapsack problem [41].
Partial sampling operator for mating
One of the main causes of bloat is the crossover operator, which is generally applied in conventional GP to generate new tree-structure data. This paper proposes to exclude the crossover operator from the conventional GP and to apply the PS operator instead. The PS operator creates new tree-structure data by partially sampling tree structures from parent individuals and joining them together. The sampling procedure is called a proliferation. The proliferation is terminated according to the probability \(p_{\mathrm{t}}\). The subtree structures partially sampled by the proliferation are combined by a metastasis. Two types of metastasis are prepared: one that depends on the original upper node and one that does not. We call the former an upper-node-dependent metastasis and the latter a random metastasis.
In the initial proliferation, the root node, \(n_{i,{\mathrm{root}}}\), of an individual, \({{\mathrm{indiv}}}_i\), randomly selected from the parent group \({\mathbf{P}}^{g}\) is copied to a set \({\mathbf{T}}_{\mathrm{sub}}\) as shown in Fig. 1. The initial proliferation thus starts from the root node, \(n_{i,{\mathrm{root}}}\), of the individual, \({{\mathrm{indiv}}}_i\). In this example, the starting root node has the identifier A. Let \({\mathbf{N}}_{\mathrm{cand}}\) be the set of all lower nodes under the nodes of \({\mathbf{T}}_{\mathrm{sub}}\) that have not yet been selected as nodes of \({\mathbf{T}}_{\mathrm{sub}}\). One node is randomly selected from \({\mathbf{N}}_{\mathrm{cand}}\) and copied to \({\mathbf{T}}_{\mathrm{sub}}\). The proliferation terminates according to the proliferation termination probability, \(p_{\mathrm{t}}\), or when \({\mathbf{N}}_{\mathrm{cand}} = \emptyset\). When the proliferation is terminated, the set \({\mathbf{T}}_{\mathrm{sub}}\) thus obtained is copied to \({\mathbf{T}}_{\mathrm{new}}\), which is the set of nodes forming the new tree-structure data, and \({\mathbf{T}}_{\mathrm{sub}}\) is reset to \(\emptyset\). In addition, in the initial proliferation, the root node of the partial tree structure \({\mathbf{T}}_{\mathrm{new}}\) is randomly generated with a low probability.
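The proliferation described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the `Node` class, the function names, and the choice to test the termination probability before each selection are assumptions, and "lower nodes" is read here as the direct children of nodes already sampled. Unsampled children are kept as `None` to mark empty branches that a later metastasis could fill.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical tree node: a label and an ordered list of children."""
    label: str
    children: list = field(default_factory=list)


def proliferate(start, p_t, rng):
    """Sample a connected partial tree rooted at `start`.

    Assumption: before each selection the procedure stops with
    probability p_t; it also stops when no candidate remains.
    """
    selected = {id(start)}              # plays the role of T_sub
    candidates = list(start.children)   # plays the role of N_cand
    while candidates and rng.random() >= p_t:
        node = candidates.pop(rng.randrange(len(candidates)))
        selected.add(id(node))
        candidates.extend(node.children)

    def copy(node):
        # Children that were not sampled become empty branches (None).
        kids = [copy(c) if id(c) in selected else None for c in node.children]
        return Node(node.label, kids)

    return copy(start)


def size(node):
    """Number of sampled nodes in a (partial) tree."""
    return 0 if node is None else 1 + sum(size(c) for c in node.children)


parent = Node("A", [Node("B"), Node("D", [Node("x"), Node("y")])])
partial = proliferate(parent, p_t=0.3, rng=random.Random(1))
print(size(partial))
```

With `p_t = 0` the whole subtree under the start node is copied, and with `p_t = 1` only the start node is taken, which shows how the termination probability controls the size of the sampled partial structure.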
In the conventional GP with variable structure length, small partial structures are assembled by a regenerating operator, for example, crossover or mutation, and these partial structures are combined to generate new tree-structure data of a large size [31, 32]. When the conventional GP increases the average size of the tree-structure data, the size of the partial structures preserved for the next generation also increases. Therefore, the probability \(p_{\mathrm{t}}\) is scheduled as follows to prevent the size of the partial tree structure from increasing explosively.
where \({\mathbf{R}}^{g}\) denotes the population at the \(g\)-th generation, \({\mathbf{P}}^{g} \subset {\mathbf{R}}^{g}\) denotes the parent set at the \(g\)-th generation,
\({\mathrm{AverageSize}}({\mathbf{R}}^{g})\) denotes a function returning the average size of the tree structures in the population, and \({\mathrm{Succ}}( \cdot )\) denotes a function returning the average size of the partial tree structures that the argument set has taken over from the previous generation.
A partial tree structure is grown by applying one of two kinds of metastasis to the partial tree structure obtained by the initial proliferation. The first is the random metastasis, which is activated with the metastasis selection probability \(p_{{\mathrm{met}}}\). The second is the upper-node-dependent metastasis, which is activated with probability \(1-p_{{\mathrm{met}}}\). The partial tree structure \({\mathbf{T}}_{\mathrm{new}}\) shown in Fig. 1 has three empty branches numbered 1, 2 and 3. Branch 1 has the upper node A, and branches 2 and 3 have the upper node D. Now, suppose that branch 1 is selected as the target of the upper-node-dependent metastasis. In the next proliferation, a node whose upper node is A is selected from the parent group, \({\mathbf{P}}^{g}\). On the other hand, if the random metastasis is applied to the partial tree structure \({\mathbf{T}}_{\mathrm{new}}\), the starting node for the next proliferation is randomly selected from the parent group, \({\mathbf{P}}^{g}\).
A new node is selected from the parent group, \({\mathbf{P}}^{g}\), according to the decided metastasis type. This node is not necessarily a root node. The proliferation is started from the selected node again.
By repeating the proliferation and the metastasis, new tree-structure data is generated as shown in Fig. 2. However, when the metastasis has drawn on only one parent individual, or when a parent individual has the same structure as the generated tree structure, the generated tree structure is eliminated and the PS operator is performed again. In addition, following the idea of conventional mutation, terminal nodes are given random numbers with a low probability.
Multi-objective GP with structural distance
Optimizing tree-structure data based only on the index of its own goodness not only causes bloat but also lets the optimization get caught in a local optimum region. Depending on the structure of the local optimum region, the optimization stagnates, giving the illusion that the obtained solutions are the ultimate optimum. To avoid the risk of such problems, this paper therefore proposes a technique that optimizes tree-structure data based on the size of the tree structure and on SD in the population, in addition to the index of the goodness of the tree structure.
In this paper, three objective functions, \(h_1\), \(h_2\) and \(h_3\), are defined as follows for use in the multi-objective optimization. The objective function for the goodness of an individual, \({\mathrm{indiv}}_i\), is described by the following equation.
where \({\mathrm{root}}_i\) denotes the root node of the individual, \({\mathrm{indiv}}_i\), and \({\mathrm{performance}}({\mathrm{root}}_i)\) denotes a function that returns the value of the goodness of the tree structure beginning from the root node, \({\mathrm{root}}_i\).
An objective function according to the size of tree structure is defined by the following equation.
where \({\mathrm{Size}}({\mathrm{root}}_i)\) denotes a function that returns the number of nodes of the tree structure beginning from the root node, \({\mathrm{root}}_i\).
An objective function according to average of SD in the population is defined by the following equation.
where \(N_{\mathrm{pop}}\) denotes the size of the population, and
\({\mathrm{SD}}({\mathrm{root}}_i , {\mathrm{root}}_k )\) denotes a function that returns SD between \({\mathrm{indiv}}_i\) and \({\mathrm{indiv}}_k\). In order to calculate SD, weights are given to all the nodes of the tree structured data by means of the following steps, when the tree structured data is initially generated. An example of giving weights to the tree structure is shown in Fig. 3.

(Step 1) Give weight 1 to the root node.

(Step 2) Assume that W is a weight given to the current node.

(Step 3) Equally distribute weights to the lower nodes of the current node so that the total is W/2.
Two tree structures are compared in order from the root node to check whether corresponding nodes match, as shown in Fig. 3. The distance, \({\mathrm{SD}}({\mathrm{root}}_i , {\mathrm{root}}_k )\), is initialized to zero. When a pair of differing nodes is found in the comparison, the weight of that node is added to the distance, and all the lower nodes below the differing node are ignored. In particular, when the two tree structures are completely different, \({\mathrm{SD}}({\mathrm{root}}_i , {\mathrm{root}}_k )\) takes its maximum value of 1.
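The weighting steps and the comparison rule above can be sketched as a short recursive function. This is an illustrative reading only: the `Tree` class and function name are hypothetical, and the sketch assumes that two nodes with the same label always have the same number of lower nodes, so that the weights assigned in both trees coincide along every matching path.

```python
from dataclasses import dataclass, field


@dataclass
class Tree:
    """Hypothetical tree node: a label and an ordered list of children."""
    label: str
    children: list = field(default_factory=list)


def structural_distance(a, b, weight=1.0):
    """SD between two trees, comparing nodes from the root downward.

    The root carries weight 1 (Step 1). The lower nodes of a node with
    weight W share a total of W/2 equally (Steps 2 and 3). When two
    nodes differ, the weight of that node is added and everything below
    it is ignored.
    """
    if a.label != b.label:
        return weight
    k = len(a.children)
    if k == 0:
        return 0.0
    child_weight = weight / (2 * k)
    return sum(
        structural_distance(ca, cb, child_weight)
        for ca, cb in zip(a.children, b.children)
    )


t1 = Tree("A", [Tree("B"), Tree("C")])
t2 = Tree("A", [Tree("B"), Tree("D")])
print(structural_distance(t1, t2))  # one of two children differs: 0.25
```

Because each level can contribute at most half the weight of the level above, a mismatch near the root dominates the distance, and trees with different roots reach the maximum value of 1.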
Now, we have defined the three-objective optimization problem. NSGA-II, shown in Fig. 4, is applied to solve this problem. When several objective functions express the goodness of the tree structure and should not be combined into one, the total number of objective functions in the proposed method is the number of those functions plus the two objective functions indicated above, \(h_2\) and \(h_3\). NSGA-II selects parent individuals by using nondominated sorting and ranking with CD. Since tree-structure data is to be optimized in this paper, CD, which is based only on the values of the objective functions, does not necessarily bring diversity to the tree structures. Therefore, this paper proposes to use SD when selecting parents from the rank set that overflows from the parent group. A block chart of the modified NSGA-II with SD is shown in Fig. 5.
Many-objective evolutionary algorithm for MaOGP
An MOP is a problem that optimizes, or in this paper maximizes, multiple objective functions under several constraints. Since the objective functions are in a trade-off relationship with each other, it is not possible, in general, to obtain a single solution that completely satisfies all the objective functions. Therefore, we need to obtain \({\mathcal{POS}}\), a set of compromise solutions that are neither superior nor inferior to each other. For the objective function vector \({\mathbf{f}}\) consisting of m objective functions, \(f_i\), the problem of finding the variable vector \({\mathbf{x}}\) that maximizes the values of \(f_i\) in the feasible region S of the solution space is defined as follows.
When the values of the objective function, \(f_i\), of two solutions x and y satisfy the following relation, we say that the solution x dominates the solution y.
where \({\mathbf{M}}\) denotes the set of indexes of the objective functions, \(\{ 1,2, \ldots , m\}\). When no solution dominates a solution \({\mathbf{x}}\), the solution \({\mathbf{x}}\) is called a noninferior solution. The set of such noninferior solutions is defined as the following \({\mathcal{POS}}\).
A Pareto front showing the trade-off relation between the objective functions is defined as follows.
Several effective studies [4, 8, 9, 39,40,41] have been made on the MOP defined by Eq.(6). NSGA-II, shown in Fig. 4, is a powerful multi-objective optimization scheme proposed in one of these studies. NSGA-II applies nondominated sorting (ND sorting) to the population \({\mathbf{Q}}\), and the individuals are classified into several ranked subsets, \({\mathbf{F}}_1,\) \({\mathbf{F}}_2,\) \({\mathbf{F}}_3,\) \(\ldots\). As long as the size of the parent set \({\mathbf{P}}\) is not exceeded, the individuals of each subset are moved to the parent set in order. The individuals of the subset that would exceed the size of the parent set are sorted using the crowding distance (CD sorting) and then moved to the parent set. The individuals not selected are culled. The mating step generates the child set \({\mathbf{C}}\) from the parent set \({\mathbf{P}}\) by using the crossover and mutation operators.
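For reference, the CD sorting step can be illustrated with a sketch of the crowding distance as it is commonly defined for NSGA-II. The function name and the list-of-tuples representation of a front are assumptions made for this example; the text above does not give the formula.

```python
def crowding_distance(front):
    """Crowding distance of each objective vector in one front.

    `front` is a list of equal-length tuples of objective values.
    Boundary solutions in any objective receive infinity, so extreme
    solutions are always preferred when the overflowing front is cut.
    """
    n = len(front)
    if n == 0:
        return []
    dist = [0.0] * n
    for k in range(len(front[0])):
        order = sorted(range(n), key=lambda i: front[i][k])
        lo, hi = front[order[0]][k], front[order[-1]][k]
        dist[order[0]] = dist[order[-1]] = float("inf")
        if hi == lo:
            continue  # all values equal: this objective adds nothing
        for pos in range(1, n - 1):
            i = order[pos]
            gap = front[order[pos + 1]][k] - front[order[pos - 1]][k]
            dist[i] += gap / (hi - lo)
    return dist


front = [(0, 4), (1, 3), (2, 2), (4, 0)]
print(crowding_distance(front))  # [inf, 1.0, 1.5, inf]
```

Individuals of the overflowing subset would then be taken in descending order of this value. The preference for boundary points is what the paper refers to as the ability of CD to preserve extreme solutions.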
Although NSGA-II effectively solves MOPs with fewer than four objective functions, an appropriate \({\mathcal{POS}}\) cannot be obtained by methods such as the conventional NSGA-II as the number of objectives m increases. When ND sorting is performed based on the conventional Pareto dominance using all m objective functions, a subset of solutions satisfying Eq.(7) becomes difficult to obtain as the number of objective functions increases [37]. Most solutions of the population then become noninferior solutions. As a result, the superiority/inferiority relationship between solutions is difficult to determine, and the selection pressure in the optimization is significantly reduced. This paper focuses on NSGA-II with Pareto partial dominance, shown in Fig. 6, for solving MaOPs. Pareto partial dominance is based on the concept of applying Pareto dominance partially, to r objective functions extracted from all m objective functions. Pareto partial dominance is defined by the following formula.
where \({\mathbf{R}}\) denotes a set of r indexes selected from \({\mathbf{M}}\). Since conditions satisfying Pareto partial dominance are relaxed as compared with the conventional dominance using all m objective functions, the population is easier to rank finely in MaOP with large m.
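The difference between conventional dominance over \({\mathbf{M}}\) and partial dominance over \({\mathbf{R}}\) can be illustrated with a small sketch. It assumes the usual definition of Pareto dominance for maximization (no worse in every considered objective and strictly better in at least one); the function name and the zero-based index convention are choices made for this example.

```python
def dominates(fx, fy, indices=None):
    """Return True if objective vector fx dominates fy (maximization).

    With indices=None all objectives are compared (conventional Pareto
    dominance). Passing a subset of indices gives Pareto partial
    dominance restricted to those objectives.
    """
    idx = list(range(len(fx))) if indices is None else list(indices)
    no_worse = all(fx[i] >= fy[i] for i in idx)
    strictly_better = any(fx[i] > fy[i] for i in idx)
    return no_worse and strictly_better


x = (3, 1, 5)
y = (2, 2, 4)
print(dominates(x, y))          # False: x is worse in the second objective
print(dominates(x, y, [0, 2]))  # True: x dominates y on the subset {0, 2}
```

In the example the two solutions are nondominated under all three objectives, but restricting the comparison to two of them lets one solution dominate the other, which is how partial dominance restores selection pressure when m is large.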
In NSGA-II-PPD, given r, the number of objective functions to be considered in the partial ND sorting, a combination list of \(_{m} C _{r}\) selections is first prepared. Every \(I_g\) generations, the combination of objective functions considered for Pareto partial dominance is changed, and \({\mathbf{R}}_{g+1}\) is selected by performing ND sorting on \({\mathbf{P}}_{g} + {\mathbf{C}}_{g} + {\mathbf{A}}\) using all m objective functions and is copied to the archive set \({\mathbf{A}}\), where + denotes the direct sum.
Improvement of NSGA-II based on Pareto partial dominance
NSGA-II-PPD has the following three problems. The first problem is that the subset size of the objective functions to be used for Pareto partial dominance must be specified before the optimization in the form of a list, namely the combination list. The second is that the appropriate value of the subset size for the complexity of a given problem is unknown. The contents of the combination list greatly influence the optimization result, while creating the combination list is a very troublesome and difficult task for the user. NSGA-II-PPD performs ND sorting using all objective functions at a specific generation cycle, and preserves the parents as an archive set for the next generation. In some cases, this process generates child individuals that have the same contents as an individual already existing in the archive set. As a result, identical individuals increase in the first front set, which disturbs effective ranking in the front selection. This is the third problem. In order to avoid these problems, this paper proposes two improvements. A block chart of the improved NSGA-II-PPD is shown in Fig. 7.
As the first improvement, subset size scheduling (SSS) is proposed for NSGA-II-PPD. The NSGA-II-PPD treated in this paper does not use the combination list at each \(I_g\)-generation cycle. Instead, the parameter r is given by the following equations.
where m denotes the number of objective functions, \({\text{rand}}\_{\text{int}}(\cdot )\) denotes a function that returns a random integer less than its argument, B denotes an integer parameter larger than 1 and less than m/2, and G denotes the final generation. Figure 8 shows the possible values of the selection number, r.
In NSGA-II-PPD, the mating generates and stores several individuals that have the same contents as an individual already existing in the children, \({\mathbf{C}}_{g}\), or the archive set, \({\mathbf{A}}\). If the optimization proceeds while such individuals with relatively good evaluations are retained, their duplicates increase within the population. If the problem to be optimized is relatively simple, individuals with the same contents are generated frequently during the optimization. The second improvement is the elimination, after the mating, of individuals that have the same contents as an individual already existing in the children, \({\mathbf{C}}_{g}\), or the archive set, \({\mathbf{A}}\). We call this elimination of duplicates (EoD). Since the optimization problem treated in this paper is a maximization problem, such an individual is eliminated by setting the values of all its objective functions to 0, which makes the duplicate the worst individual. After EoD, the mating does not reproduce the individual.
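EoD as described above can be sketched in a few lines. The representation of an individual as a `(genome, objectives)` pair and the function name are assumptions made for illustration; the sketch keeps the first occurrence of each genome among the children and zeroes the objectives of any later copy or of any child already present in the archive.

```python
def eliminate_duplicates(children, archive):
    """Zero the objectives of children that duplicate earlier individuals.

    `children` and `archive` are lists of (genome, objectives) pairs.
    A child counts as a duplicate if its genome already appears in the
    archive or in an earlier child. For a maximization problem with
    non-negative objectives, all-zero objectives make it the worst
    individual, so selection discards it.
    """
    seen = {tuple(genome) for genome, _ in archive}
    result = []
    for genome, objectives in children:
        key = tuple(genome)
        if key in seen:
            objectives = [0.0] * len(objectives)
        else:
            seen.add(key)
        result.append((genome, objectives))
    return result


archive = [([1, 0, 1], [5.0, 3.0])]
children = [([1, 0, 1], [5.0, 3.0]), ([0, 1, 1], [4.0, 6.0]), ([0, 1, 1], [4.0, 6.0])]
for genome, objectives in eliminate_duplicates(children, archive):
    print(genome, objectives)
```

Marking duplicates through their objective values, rather than deleting them, leaves the population size unchanged and lets the ordinary ND sorting remove them.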
Verification of the proposed techniques
Double spiral problem
A double spiral problem is used to verify the effectiveness of the proposed techniques. The double spiral problem is the problem of classifying two data sets arranged in a spiral shape, and it is known to be difficult to solve even with neural networks [6, 38]. These two data sets are arranged as shown in Fig. 9 and are to be classified by the following function f.
where (x, y) denotes the coordinates of each point on the two-dimensional plane, and \({\mathbf{D}}_1\) and \({\mathbf{D}}_2\) denote the data sets expressed with the red crosses and the blue circles shown in Fig. 9, respectively. In this paper, the case \(f(x,y)=0\) is treated as a classification failure at the point (x, y).
The following nodes are prepared as elements for constituting the classifying function f.
where \({\mathcal{N}}_N\) denotes the set of nonterminal nodes, \({\mathcal{N}}_T\) denotes the set of terminal nodes, and “\({\mathrm{ifltz}}\)” denotes a function with three arguments representing a conditional branch as follows,
The objective function, \(h_1\), according to the goodness of an individual is defined by the following equation.
In this double spiral problem, the function, \({\mathrm{Size}}({\mathrm{root}}_i)\), applied for the objective function according to the size of tree structure is defined as the number of nodes of the tree structure.
In order to verify the effectiveness, the following four combinations are applied to the double spiral problem: the conventional operators with CD (denoted “CO+MU & CD”), the conventional operators with SD (denoted “CO+MU & SD”), the PS operator with CD (denoted “PS & CD”) and the PS operator with SD (denoted “PS & SD”). The conventional operators are the conventional crossover and the conventional mutations [13, 17, 18, 36]. The size of the population, \(N_{\mathrm{pop}}\), the number of generations and the number of points in the double spiral problem, \(\left| {\mathbf{D}}_1\cup {\mathbf{D}}_2\right|\), are set to 100, 1,000,000 and 190, respectively. The probability, \(p_{\mathrm{met}}\), for selecting the type of metastasis is set to 0.5, 0.25 and 0.75 in turn.
Figure 10 shows the distributions on the \(h_2\)-\(h_1\) plane of the first-front set in the final generation. As shown in Fig. 10, NSGA-II combining the PS operator and SD has given the best solution set, distributed toward the upper right over the widest range. The solutions obtained by NSGA-II combining the PS operator and CD have relatively high diversity, but their evaluations are not so good. NSGA-II combining the conventional operators and CD has given relatively good solutions, but their diversity is low. NSGA-II combining the conventional operators and SD has given the worst solution set with the lowest diversity.
Figure 11 compares the distributions on the \(h_2\)-\(h_1\) plane of the first-front set in the final generation when 3-objective and 2-objective optimizations are executed using the PS operator with \(p_{\mathrm{met}}=0.50\) and SD for the ranking. Compared with the distribution of solutions given by the 2-objective optimization, the 3-objective optimization has acquired far better solutions over a wider range. When the PS operator with \(p_{\mathrm{met}}=0.50\) and CD are combined, the same result has been obtained, as shown in Fig. 12. This shows the effectiveness of the multi-objective optimization of tree-structure data proposed in this paper.
Figure 13 shows the values of Norm [35] and MS [39] given by each method. In this figure, \({\mathrm{PS}}*.**\) denotes that the PS operator with the metastasis selection probability, \(p_{\mathrm{met}}\), equal to \(*.**\) is used for the mating. For both Norm and MS, the best result has been obtained by the method using the PS operator with \(p_{\mathrm{met}}=0.50\) and SD. The results using the PS operator are gathered in the upper right of the figure, whereas the results using the conventional crossover and the conventional mutation are gathered in the lower left. This shows the superiority of the PS operator. On the other hand, the advantage of SD has not been clearly shown by this experiment. SD has given relatively better results only when combined with the PS operator, and NSGA-II with SD has given the worst results when combined with the conventional operators. The reason is considered to be that SD, unlike CD, has a low ability to preserve extreme solutions. In the multi-objective optimization of tree structures, the ability to retain the diversity of tree structures, as provided by the ranking with SD, is necessary, so an improvement that adds the ability to preserve extreme solutions, as CD does, should be considered.
Many-objective 0/1 knapsack problem
In order to verify the effectiveness of the improved technique, a many-objective 0/1 knapsack problem (MaOKSP) [42] is solved. MaOKSP is composed of m knapsacks and n items. The capacity of the \(i\)-th knapsack is \(c_i\). The weight and the price of the \(j\)-th item in the \(i\)-th knapsack are \(w_{ij}\) and \(p_{ij}\), respectively. Let an individual \({\mathbf{x}} \in \{0,1\}^n\) be the n-dimensional vector that selects the items. MaOKSP is defined by the following formula.
where the number of items n, the weight matrix \(w_{ij}\), the profit matrix \(p_{ij}\) and the knapsack capacity vector \(c_i\) are defined as follows,
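As an illustration of the problem structure, the evaluation of one individual can be sketched as below. The sketch assumes the common multi-objective knapsack formulation, in which objective i is the total profit \(\sum_j p_{ij} x_j\) and knapsack i must satisfy \(\sum_j w_{ij} x_j \le c_i\); the function name and the nested-list layout of the matrices are hypothetical, and no repair of infeasible individuals is shown.

```python
def evaluate_knapsack(x, profits, weights, capacities):
    """Evaluate a 0/1 selection vector x on m knapsacks.

    profits[i][j] and weights[i][j] are the profit and weight of item j
    in knapsack i, and capacities[i] is the capacity of knapsack i.
    Returns the m objective values and a feasibility flag.
    """
    objectives = [sum(p * xi for p, xi in zip(row, x)) for row in profits]
    loads = [sum(w * xi for w, xi in zip(row, x)) for row in weights]
    feasible = all(load <= cap for load, cap in zip(loads, capacities))
    return objectives, feasible


profits = [[3, 4], [1, 5]]
weights = [[2, 3], [4, 1]]
capacities = [5, 4]
print(evaluate_knapsack([0, 1], profits, weights, capacities))  # ([4, 5], True)
print(evaluate_knapsack([1, 1], profits, weights, capacities))  # ([7, 6], False)
```

Each knapsack contributes one objective, so the number of objectives m can be raised simply by adding rows to the profit and weight matrices, which is why this problem is convenient for studying the effect of m.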
\({\mathcal{POS}}\) obtained by the optimization is evaluated by using Norm [35] and Maximum Spread (MS) [39]. Norm is a measure of the convergence of the population to the Pareto front \({\mathcal{PF}}\) and is defined by the following equation.
where \({\mathbf{x}}_{j}\) denotes the \(j\)-th individual of the Pareto front, \({\mathcal{PF}}\). The larger the Norm value, the closer the obtained solutions are to the Pareto front, \({\mathcal{PF}}\). MS is a measure of the spread of the first front at the final generation [39] and is defined by the following equation.
The larger the MS value, the wider the spread of the population given by the optimization.
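The two measures can be sketched as follows. The formulas are assumptions based on the way these metrics are commonly stated: Norm as the mean Euclidean norm of the objective vectors of the obtained front, and MS as the Euclidean length of the per-objective ranges. The function names are illustrative, and the exact normalization used in [35] and [39] may differ.

```python
import math


def norm_metric(front):
    """Mean Euclidean norm of the objective vectors (larger is better
    for a maximization problem with non-negative objectives)."""
    return sum(math.sqrt(sum(f * f for f in v)) for v in front) / len(front)


def maximum_spread(front):
    """Euclidean length of the per-objective value ranges of the front."""
    m = len(front[0])
    return math.sqrt(
        sum(
            (max(v[i] for v in front) - min(v[i] for v in front)) ** 2
            for i in range(m)
        )
    )


front = [(3.0, 4.0), (0.0, 0.0)]
print(norm_metric(front))     # 2.5
print(maximum_spread(front))  # 5.0
```

Under these definitions a front can score well on MS while scoring poorly on Norm, which is why the two measures are read together in the figures that follow.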
The conventional NSGA-II, NSGA-II-PPD with \(r=3\), \(r=6\) and \(r=8\), NSGA-II-PPD with the combination list shown in Table 1, and the improved technique are compared for the verification. The optimization is performed with the number of objectives set to \(m=4,6,8,10\) and the number of generations set to \(G=1{,}000{,}000\).
Figure 14 shows the transition of the number of individuals in the first-front set over the generations in the case that \(m=10\) and \(I_g = 500\). In the figure, “NSGA-II” denotes the results of the conventional NSGA-II, “PPD(\(r=*\))” denotes the results of NSGA-II-PPD with the constant value \(r=*\), “PPD(list)” denotes the results of NSGA-II-PPD with the combination list shown in Table 1, and “Improved” denotes the results of the algorithm proposed in this paper. The conventional NSGA-II and NSGA-II-PPD with \(r=8\) have given a large number of individuals in the first-front set throughout the optimization. NSGA-II-PPD with \(r=6\) has given the next largest number. At the end of the optimization, the improved technique has caught up with these values. NSGA-II-PPD with the combination list behaves similarly.
Figure 15 shows the Norm values after the optimization against the number of objectives m in the case that \(I_g = 500\). For every technique, the convergence to \({\mathcal{POS}}\) increases as the number of objectives increases. Regarding the convergence, NSGA-II-PPD with \(r=3\), NSGA-II-PPD with the combination list and the improved technique have given almost equivalent results, whereas the conventional NSGA-II has given relatively poor results.
Figure 16 shows the MS values after the optimization against the number of objectives m in the case that \(I_g = 500\). The MS value, or the diversity of \({\mathcal{POS}}\), given by NSGA-II-PPD with \(r=3\) decreases as the number of objectives increases, whereas it increases with the other three techniques. In the improved technique, since r increases as the generations progress, the superiority/inferiority relationship of solutions becomes difficult to decide by Pareto partial dominance at the end of the optimization, and many individuals belong to the first-front set. As a result, most individuals of the parents are ranked by the CD sorting, and it is considered that the diversity has increased. NSGA-II-PPD with the combination list has shown diversity equal to or less than that of the improved technique. The reason that sufficient diversity has not been obtained by NSGA-II-PPD with \(r=3\) is considered to be that partial dominance by using all objectives has not been performed only between 900,000 and 1 million generations. Regarding the diversity of solutions, the conventional NSGA-II has given the highest value.
Figure 17 shows the Norm values against the generation g for \(m=10\) and \(I_g = 500\). In NSGA-II-PPD, the convergence to \({\mathcal{POS}}\) tends to decrease as the value of the parameter r increases. In this technique, when r approaches m, solutions are hard to dominate by partial dominance, so a large number of individuals are selected into the first-front set. As a result, the nondominated sorting does not rank the individuals sufficiently, and the convergence deteriorates. On the other hand, although the improved technique showed the highest convergence at the beginning of the optimization, its convergence declined at the final stage. Since the value of r in the improved technique increases as the generations progress, the same effect appears there: solutions become hard to dominate by partial dominance, the ranking by the nondominated sorting becomes insufficient, and the convergence deteriorates in the final stage.
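The effect of r on the size of the first-front set can be illustrated with a minimal Python sketch. It assumes maximization, uses a toy population of three-objective vectors, and simply takes the first r objectives as the subset used for Pareto partial dominance. The function names `dominates` and `first_front_size` and the population values are illustrative assumptions and do not come from the implementation used in the experiments.

```python
def dominates(a, b, objs):
    """Return True if a dominates b on the objective indices in objs (maximization)."""
    objs = list(objs)
    no_worse = all(a[i] >= b[i] for i in objs)
    better = any(a[i] > b[i] for i in objs)
    return no_worse and better


def first_front_size(pop, objs):
    """Count individuals that no other individual dominates on objs."""
    objs = list(objs)
    return sum(
        not any(dominates(q, p, objs) for q in pop if q is not p)
        for p in pop
    )


# Toy population (hypothetical values), m = 3 objectives to be maximized.
population = [(3, 1, 2), (2, 3, 1), (1, 2, 3), (0, 0, 0)]
m = 3

# Pareto partial dominance restricted to the first r objectives.
sizes = [first_front_size(population, range(r)) for r in range(1, m + 1)]
print(sizes)  # [1, 2, 3]
```

With r = m the partial dominance coincides with ordinary Pareto dominance, and three of the four toy individuals become mutually nondominated, which mirrors the weakened ranking discussed above.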
Figure 18 shows the MS values against the generation g for \(m=10\) and \(I_g = 500\). Although the diversity of the conventional NSGA-II and of NSGA-II-PPD with \(r=8\) remains high throughout, their convergence is low, as shown in Fig. 17, so these results do not merit attention. In contrast, the diversity of the improved technique rises as the optimization progresses. Moreover, the improved technique achieves relatively high convergence, as shown in Fig. 17, so it is superior overall.
Conclusion
In this paper, multiobjective optimization of tree-structure data, or MOGP, has been proposed, in which the tree size and the structural distance (SD) are introduced as objective functions in addition to the measure of the goodness of the tree structure. Furthermore, the partial sampling (PS) operator has been proposed to search tree structures effectively while avoiding bloat. To verify their effectiveness, the proposed techniques were applied to the double spiral problem. By means of the multiobjective optimization of tree-structure data, we found that more diverse and better tree structures are acquired. The proposed method incorporating the PS operator and SD into NSGA-II gave relatively good results. However, since the PS operator has little ability to numerically optimize constant nodes in the tree structure, it did not work effectively for function optimization. In addition, since ranking with SD in NSGA-II has little ability to preserve extreme solutions in the objective function space, solutions were not selected effectively.
When there are two or more indices of the goodness of tree-structure data, the number of objective functions in MOGP becomes four or more, and the problem becomes many-objective GP (MaOGP). An improved NSGA-II-PPD applicable to such MaOGP has also been proposed in this paper. The improvement consists of subset size scheduling (SSS) and elimination of duplicates (EoD).
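The elimination of duplicates can be sketched in a few lines of Python. The sketch assumes that individuals are 0/1 bit lists, such as those of a knapsack problem, and that having the same contents as an archive individual means having an identical genotype. The name `eliminate_duplicates` and the sample data are illustrative assumptions, not the implementation used in this paper.

```python
def eliminate_duplicates(offspring, archive):
    """Keep only offspring whose genotype does not already exist in the archive."""
    archive_keys = {tuple(ind) for ind in archive}
    return [child for child in offspring if tuple(child) not in archive_keys]


# Hypothetical archive and offspring for a three-item 0/1 knapsack.
archive = [[1, 0, 1], [0, 0, 1]]
offspring = [[1, 0, 1], [1, 1, 0], [0, 0, 1], [1, 1, 1]]
print(eliminate_duplicates(offspring, archive))  # [[1, 1, 0], [1, 1, 1]]
```

Storing the archive genotypes in a set keeps each membership check at constant expected cost, so the filter scales linearly with the number of offspring.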
The improved NSGA-II-PPD with SSS and EoD and other conventional techniques were applied to the many-objective 0/1 knapsack problem to verify its effectiveness. The improved NSGA-II-PPD gave higher diversity than other techniques as the number of objective functions of the problem increases. In addition, the improved NSGA-II-PPD gave convergence equal to or higher than that of the other techniques even when the number of objective functions becomes large. With the proposed simple scheduling of the parameter r, sufficient convergence is obtained in the early generations with smaller r, and the diversity is supplemented in the later generations with larger r at the end of the optimization.
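The scheduling of r summarized above can be sketched as follows. The sketch assumes a linear, stepwise schedule that divides the run into equal stages and raises r from a hypothetical starting value `r_start` to m. The function name, the starting value and the stage lengths are assumptions for illustration and are not necessarily the settings used in the experiments.

```python
def scheduled_r(g, total_generations, m, r_start=2):
    """Subset size r at generation g (0-based), rising stepwise from r_start to m."""
    if not 0 <= g < total_generations:
        raise ValueError("g must satisfy 0 <= g < total_generations")
    if not 1 <= r_start <= m:
        raise ValueError("r_start must satisfy 1 <= r_start <= m")
    n_stages = m - r_start + 1                 # one stage per value of r
    stage = g * n_stages // total_generations  # integer in 0 .. n_stages - 1
    return r_start + stage


# Hypothetical run: m = 10 objectives, 900 generations, r rising from 2 to 10.
for g in (0, 99, 100, 450, 899):
    print(g, scheduled_r(g, 900, 10))
```

Integer arithmetic is used for the stage index so that the stage boundaries do not depend on floating-point rounding.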
In future work, a technique that incorporates a numerical optimization ability, such as particle swarm optimization [16], and mutation into the PS operator, as well as a ranking selection technique combining SD and CD, should be considered. The PS operator proposed in this paper has a mechanism to terminate the proliferation, but it has no mechanism to forcibly exit from the PS operator. Such a mechanism should also be considered.
Since the improved NSGA-II-PPD still gives insufficient results in terms of diversity, we need to improve this point while maintaining the current convergence. Although each technique was applied to the relatively simple many-objective 0/1 knapsack problem in this paper, we need to apply them to more complicated problems and verify their effectiveness. We also need to search for the optimal combination list and to compare the further improved NSGA-II-PPD with the conventional NSGA-II-PPD using the optimal combination list. We then need to propose an effective MaOGP by combining these improved techniques.
In this paper, the quality indicator MS is applied to assess the diversity of the final solutions. However, MS can easily be affected by the convergence of the solutions and tends to favor poorly converged solutions. In this sense, MS is not necessarily effective for assessing diversity. In future research, it is necessary to consider techniques such as the diversity comparison indicator (DCI) [22] to assess the diversity of the solutions. In addition, the convergence of the solutions has been evaluated using Norm only. In this regard, future research needs to visualize the solutions of the multiobjective optimization with parallel coordinates [24], which can partially reflect the convergence, spread and uniformity.
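The sensitivity of MS to poorly converged solutions can be illustrated with a small Python sketch. It assumes commonly used definitions, namely MS as the length of the diagonal of the bounding box of the obtained objective vectors and Norm as the mean Euclidean norm of the objective vectors in a maximization problem. These definitions, the function names and the two toy solution sets are assumptions for illustration and may differ in detail from the indicators used in the experiments.

```python
import math


def maximum_spread(front):
    """Diagonal length of the bounding box of the objective vectors."""
    m = len(front[0])
    return math.sqrt(sum(
        (max(f[i] for f in front) - min(f[i] for f in front)) ** 2
        for i in range(m)
    ))


def mean_norm(front):
    """Mean Euclidean norm of the objective vectors (larger is better when maximizing)."""
    return sum(math.sqrt(sum(v * v for v in f)) for f in front) / len(front)


# Two hypothetical solution sets for a two-objective maximization problem.
converged = [(9, 10), (10, 9)]   # close to the Pareto front, narrow spread
scattered = [(0, 8), (8, 0)]     # dominated by the set above, wide spread

for name, front in (("converged", converged), ("scattered", scattered)):
    print(name, round(maximum_spread(front), 2), round(mean_norm(front), 2))
```

In this toy case the scattered set obtains the larger MS although each of its members is dominated by a member of the converged set, whereas Norm ranks the two sets in the opposite order.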
References
[1] Aguirre HE, Tanaka K (2007) Working principles, behavior, and performance of MOEAs on MNK-landscapes. Eur J Oper Res 181(3):1670–1690
[2] Angeline PJ (1997) Subtree crossover: building block engine or macromutation. Genet Program 97:9–17
[3] Angeline PJ (1998) Subtree crossover causes bloat. In: Genetic programming 1998: proceedings on 3rd annual conference. Morgan Kaufmann
[4] Coello CAC, Lamont GB, Van Veldhuizen DA et al (2007) Evolutionary algorithms for solving multiobjective problems, vol 5. Springer, Berlin
[5] David C, Kroening D (2017) Program synthesis: challenges and opportunities. Philos Trans R Soc A 375(2104):20150403
[6] De Bonet JS, Isbell Jr CL, Viola PA (1997) Mimic: finding optima by estimating probability densities. In: Advances in neural information processing systems, pp 424–430
[7] De Jong ED, Watson RA, Pollack JB (2001) Reducing bloat and promoting diversity using multiobjective methods. In: Proceedings of the 3rd annual conference on genetic and evolutionary computation. Morgan Kaufmann Publishers Inc, pp 11–18
[8] Deb K (2001) Multiobjective optimization using evolutionary algorithms, vol 16. Wiley, Hoboken
[9] Deb K, Agrawal S, Pratap A, Meyarivan T (2000) A fast elitist nondominated sorting genetic algorithm for multiobjective optimization: NSGA-II. In: International conference on parallel problem solving from nature. Springer, pp 849–858
[10] Francone FD, Conrads M, Banzhaf W, Nordin P (1999) Homologous crossover in genetic programming. In: Proceedings of the 1st annual conference on genetic and evolutionary computation, vol 2. Morgan Kaufmann Publishers Inc, pp 1021–1026
[11] Goldberg D (1989) Genetic algorithms in optimization, search and machine learning. Addison-Wesley, Reading
[12] Hughes EJ (2005) Evolutionary many-objective optimisation: many once or one many? In: The 2005 IEEE congress on evolutionary computation, 2005, vol 1. IEEE, pp 222–227
[13] Ito T, Iba H, Sato S (1998) Non-destructive depth-dependent crossover for genetic programming. In: European conference on genetic programming. Springer, pp 71–82
[14] Jamali A, Khaleghi E, Gholaminezhad I, Nariman-Zadeh N, Gholaminia B, Jamal-Omidi A (2017) Multiobjective genetic programming approach for robust modeling of complex manufacturing processes having probabilistic uncertainty in experimental data. J Intell Manuf 28(1):149–163
[15] Karger D (1995) Random sampling in graph optimization problems. Ph.D. thesis, Stanford University
[16] Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of 1995 IEEE international conference on neural networks, pp 1942–1948
[17] Koza JR (1992) Genetic programming II, automatic discovery of reusable subprograms. MIT Press, Cambridge
[18] Koza JR (1994) Genetic programming as a means for programming computers by natural selection. Stat Comput 4(2):87–112
[19] Langdon WB (1999) Size fair and homologous tree crossovers. Centrum voor Wiskunde en Informatica
[20] Li K, Wang R, Zhang T, Ishibuchi H (2018) Evolutionary many-objective optimization: a comparative study of the state-of-the-art. IEEE Access 6:26194–26214
[21] Li M, Grosan C, Yang S, Liu X, Yao X (2018) Multiline distance minimization: a visualized many-objective test problem suite. IEEE Trans Evolut Comput 22(1):61–78. https://doi.org/10.1109/TEVC.2017.2655451
[22] Li M, Yang S, Liu X (2014) Diversity comparison of Pareto front approximations in many-objective optimization. IEEE Trans Cybern 44(12):2568–2584. https://doi.org/10.1109/TCYB.2014.2310651
[23] Li M, Yang S, Liu X (2014) Shift-based density estimation for Pareto-based algorithms in many-objective optimization. IEEE Trans Evolut Comput 18(3):348–365
[24] Li M, Zhen L, Yao X (2017) How to read many-objective solution sets in parallel coordinates [educational forum]. IEEE Comput Intell Mag 12(4):88–100
[25] Mitchell M, Crutchfield JP, Das R et al (1996) Evolving cellular automata with genetic algorithms: a review of recent work. In: Proceedings of the first international conference on evolutionary computation and its applications (EvCA'96), vol 8. Moscow
[26] Mühlenbein H, Paass G (1996) From recombination of genes to the estimation of distributions I. Binary parameters. In: International conference on parallel problem solving from nature. Springer, pp 178–187
[27] Nordin P, Francone F, Banzhaf W (1995) Explicitly defined introns and destructive crossover in genetic programming. Adv Genet Program 2:111–134
[28] Ohki M (2018) Linear subset scheduling for many-objective optimization using NSGA-II based on Pareto partial dominance. In: 15th international conference on informatics in control, automation and robotics, vol 1. INSTIC, pp 277–283
[29] Ohki M (2018) Partial sampling operator and structural distance ranking for multiobjective GP. In: 2018 international conference on computer and applications (ICCA), pp 265–270
[30] Ohmoto S, Takehana Y, Ohki M (2013) A consideration on relationship between optimizing interval and dealing day in optimization of stock day trading rules. IEICE technical report, vol 113, no 2103, pp 67–70
[31] Poli R, Langdon WB (1998) On the search properties of different crossover operators in genetic programming. In: University of Wisconsin. Morgan Kaufmann, pp 293–301
[32] Poli R, McPhee NF, Vanneschi L (2008) The impact of population size on code growth in GP: analysis and empirical validation. In: Proceedings of the 10th annual conference on Genetic and evolutionary computation. ACM, pp 1275–1282
[33] Purshouse RC, Fleming PJ (2003) Conflict, harmony, and independence: relationships in evolutionary multicriterion optimisation. In: International conference on evolutionary multicriterion optimization. Springer, pp 16–30
[34] Sato H, Aguirre HE, Kiyoshi T (2010) Effects of MOEA temporally switching Pareto partial dominance on many-objective 0/1 knapsack problems. Trans Jpn Soc Artif Intell 25:320–331
[35] Sato M, Aguirre HE, Tanaka K (2006) Effects of \(\delta\)-similar elimination and controlled elitism in the NSGA-II multiobjective evolutionary algorithm. In: IEEE congress on evolutionary computation, 2006. CEC 2006. IEEE, pp 1164–1171
[36] Sawada K, Kano H (2003) Structured evolution strategy for optimization problems using multiobjective methods. In: Proceedings of the 30th symposium, society of instrument and control engineers, March 2003. Society of Instrument and Control Engineering, pp 1–6
[37] Tsuchida K, Sato H, Aguirre HE, Tanaka K (2009) Analysis of NSGA-II and NSGA-II with CDAS, and proposal of an enhanced CDAS mechanism. JACIII 13(4):470–480
[38] Yang JM, Kao CY (2000) An evolutionary algorithm to training neural networks for a two-spiral problem. In: Proceedings of the 2nd annual conference on genetic and evolutionary computation. Morgan Kaufmann Publishers Inc, pp 1025–1032
[39] Zitzler E (1999) Evolutionary algorithms for multiobjective optimization: methods and applications, vol 63. Citeseer
[40] Zitzler E, Laumanns M, Thiele L (2001) SPEA2: improving the strength Pareto evolutionary algorithm. TIK-report 103
[41] Zitzler E, Thiele L (1998) Multiobjective optimization using evolutionary algorithms - a comparative case study. In: International conference on parallel problem solving from nature. Springer, pp 292–301
[42] Zitzler E, Thiele L (1999) Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach. IEEE Trans Evolut Comput 3(4):257–271
Acknowledgements
This research work has been supported by JSPS KAKENHI Grant No. JP17K00339. The author would like to thank her family, the late Miss Blackin’, Miss Blanc, Miss Caramel, Mr. Civita, Miss Marron, Miss Markin’, Mr. Yukichi and Mr. Ojarumaru, for bringing her daily healing and a good research environment.
Ethics declarations
Conflict of interest
The author declares that she has no conflicts of interest.
Cite this article
Ohki, M. Multiobjective genetic programming with partial sampling and its extension to manyobjective. SN Appl. Sci. 1, 207 (2019). https://doi.org/10.1007/s424520190208y
Keywords
 Many-objective genetic programming
 Partial sampling
 Tree structural distance
 Pareto partial dominance
 Subset size scheduling
 Elimination of duplicates