Abstract
In the past two decades, multi-objective evolutionary algorithms (MOEAs) have achieved great success in solving two or three multi-objective optimization problems. As pointed out in some recent studies, however, MOEAs face many difficulties when dealing with many-objective optimization problems(MaOPs) on account of the loss of the selection pressure of the non-dominant candidate solutions toward the Pareto front and the ineffective design of the diversity maintenance mechanism. This paper proposes a many-objective evolutionary algorithm based on vector guidance. In this algorithm, the value of vector angle distance scaling(VADS) is applied to balance convergence and diversity in environmental selection. In addition, tournament selection based on the aggregate fitness value of VADS is applied to generate a high quality offspring population. Besides, we adopt an adaptive strategy to adjust the reference vector dynamically according to the scales of the objective functions. Finally, the performance of the proposed algorithm is compared with five state-of-the-art many-objective evolutionary algorithms on 52 instances of 13 MaOPs with diverse characteristics. Experimental results show that the proposed algorithm performs competitively when dealing many-objective with different types of Pareto front.
Introduction
Multi-objective evolutionary algorithm (MOEAs) is a population-based heuristic algorithm that allows the search of Pareto approximation sets in a single run, becoming a universal and effective method to solve multi-objective optimization problems (MOPs) with two or three objectives [1, 2]. Because MOPs exists widely in the real world especially in optimal back-propagation artificial neural network [3, 4], MOEAs may generate contribution in missing data management task in smart systems [5, 6]. Therefore, multi-objective optimization has become an interesting research field. In the past decades, many classical Pareto dominance-based algorithms have been proposed to solve multi-objective optimization problems, such as NSGA-II [7], SPEA2 [8] and PESA-II [9]. However, Pareto-dominance-based algorithms are not well-suited to tackle MOPs with more than three objectives called as the many-objective problems(MaOPs) [10]. The selection pressure towards Pareto front based on Pareto dominance decreases sharply with the increase of the number of objectives. When the number of objectives is large, almost all of the solutions in the population are non-dominant, which makes the Pareto-based selection indistinguishable from the individuals [11].
Faced with such challenges, various strategies are put forward to improve the performance of multi-objective algorithm in solving many-objective problems. In the existing research, the most common methods can be divided into the following five categories [12]. Dominance-based methods Indicator-based methods Objective reduction-based methods Preference-based methods Decomposition-based methods
In recent years, several different forms of dominance-based methods have been proposed to improve the selective pressure. Several new dominance concepts have been well studied and applied, such as, ɛ-dominance [13, 14], α-dominance [15], fuzzy Pareto dominance [16, 17] and grid-based dominance [18]. These algorithms modify the dominance criterion to accelerate the convergence in solving many-objective problems. However, the main disadvantage of these Pareto advantages in heuristic selection is that it often involves one or more parameters and the parameters associated with these problems. Another typical method is to combine Pareto-dominance-based method with convergence-based metrics. For example, ε-indicator preference [19], shift-based density estimation strategy [20], a mating selection based on favorable convergence [21] and knee point-based secondary selection [22].
The second category is known as the indicator-based methods which convert a MOP into the problem of optimizing an indicator by evaluating the solutions using performance indicators. For example, methods based on HV indictors include S-metric selection-based multi-objective evolutionary algorithm(SMS-EMOA)[23], HV contribution based on Monte Carlo sampling algorithm(HypE) [24], HV contribution-based multi-objective particle swarm optimizer and HV-based dynamic neighborhood method [25]. Unfortunately, when the number of objective increases, the computational cost for the calculation of hypervolume is prohibitively more expensive. Apart from the hypervolume indicator, R2 indicator, which achieves a comprehensive balance between convergence and diversity, can be regarded as the mutual preference based on the contribution to each reference vector, such as R2-EMOA [26], R2MOPSO [27], R2-IBEA [28], MOMOBI-II [29] and TS-R2EA [30]. Besides, some researchers are exploring other indicators to replace the dominance-based approach, such as GD [31], IGD [32], IGD+ [33] and ▵ p [34].
The third category is known as reduce the number of objectives, which finds the relevant objectives and eliminate these redundant ones that do not need to describe the Pareto front. some techniques, such as feature selection [35], principal component analysis [37] and Pareto corner search [38], are proposed in recent years. However, objective reduction is inevitably accompanied by the loss of information of the original problem. Therefore, whether this technique is suitable for solving MaOPs depends on the characteristics of the problem itself.
The forth category is based on preference-based approaches [38]. In this case, preference ordering-based approaches include k-optimality relation [16] and maximum ranking [39]. The main characteristic of the preference based ranking method is that it emphasizes certain target areas to perform better, but the main shortcomings is that it performs worse overall. Another method is preference incorporation-based approaches. In this situation, decision makers usually preferred only the Pareto frontier in which they were most interested and can obtain the effective work, such as PBEA [40], r-NSGA-II [41] and TKR-NSGA-II [42]. However, premature convergence could contribute to the inferior results.
The fifth category is the most popular decomposition-based approaches. MOEA/D [43] is the classical decomposition-based multi-objective evolutionary algorithm which decomposes a MOP into a number of subproblems and use a set of weight vectors to optimize these subproblems based on a aggregated function. This method becomes a turning point on the path of multi-objective optimization algorithm exploration, and some decomposition-based algorithms are proposed gradually, such as MOEA/D-DE [44], MOEA/D-DD [45] and MOEA/D-M2M [46]. These algorithms are variations on the MOEA/D framework to solve many-objective problems and the researchers are not satisfied with this single research direction. Another category based on decomposition method is NSGA-III [47] which need a set of predefined evenly distributed reference points to maintain the diversity of the population. Then, reference vectors is referred as vectors that is use to decompose the original objective space, such as [48, 49]. In recent years, Cheng [50] proposed a reference vector-guided EA, which designed angle-penalized distance to balance the convergence and diversity of the candidate solutions. Maha [51] proposed a new decomposition-based NSGA-II for MaOPs, in this case, reference point-based dominance enables a set of well-distributed reference points to create a strict partial order over a set of non-dominant solutions. Jiang [52] introduced an efficient reference direction-based density estimator to enhance the diversity of candidate solutions. Liu [53] proposes a fuzzy decomposition-based MOEA to address the problem of the similarity metric affection.
Most of the existing MOEAs focus on enhancing convergence and maintaining diversity, it is noted that the use of reference vector will become particulary important for MaOEAs, not only can users choose a solution with better distribution based on the distribution of the reference vector, but they can also choose a solution with better convergence based on the fitness value generated by the reference vector.
As shown in the existing research [47, 50], reference points or reference vectors are evenly distributed in the whole objective space that can guide the candidate solutions to toward the Pareto front and evenly distributed. In addition, the value of vector angle distance scaling is applied in [54] to assign scores for each function ranking the individuals. Motivated by the methods of reference vectors and the aim to use the fitness value to achieve the convergence and diversity, we propose a many-objective evolutionary algorithm based on vector angle distance scaling. Compared with existing decomposition-based methods, the main contribution of this algorithm can be summarized as follows. A scalarization method, termed as the new fitness value of vector angle distance function. In the proposed method, the distance between the candidate solutions and the ideal point is used as the convergence criterion, and the cosine of the angle between the candidate solutions and reference vector is regarded as the diversity criterion. Scaling the fitness of vector angle distance, relative to the best performance solutions on the reference vector. In this process, scale each column by the minimum found, and then use the second minimum to scale the result. This method can deal with many-objective problems by using a compact analytic method to obtain the well solutions. A final aggregate fitness will be found in each population and will be designed into tournament selection. This method improve the tournament selection in both convergence and diversity. An adaptation strategy is applied to adjust the distribution of reference vector, which can improve the ability of algorithm to solve the PF with asymmetric objective function.
The rest of this paper is organized as follows. Section 2 describes some background knowledge. Section 3 introduces the proposed MaOEA-VADS algorithm. Then, section 4 presents and discusses the experimental results. Finally, section 5 concludes this paper.
Background
Basic definition
Generally, a minimization multi-objective optimization problem(MOP) with m objectives and n-dimensional decision can be mathematically defined as
The aggregation-based methods uses an aggregation function to assign scores for each solution, and then these solutions will be selected according to the scores. MSOPS-II [54] proposed that the combination of weighted min-max function with vector angle distance scaling is used to assign scores for each function to rank the individuals. In MSOPS-II, the design of weighted min-max focuses on the convergence and the vector angles distance scaling contributes to the diversity. In this case, current population is used to generate target vectors to improve the distribution of solutions. MOEA/D [43] is proposed that decomposes a multi-objective problem into a set of scalar subproblems using uniformly distributed weight vectors, where weighted sum method, weighted Tchebycheff method and penalty-based boundary intersection were discussed to solve MOPs and MaOPs. In MOEA/D, neighborhood information defined according to the Euclidean distance between weight vectors is used to optimize subproblems, where the distributed weight vectors contributes to diversity of solutions.
The reference vectors-based methods uses the association between reference vectors and solutions to select these solutions with good convergence and distribution. NSGA-III [47] based on the framework of NSGA-II is proposed that each solution is associated with a reference vector according to the perpendicular distance defined by a solution to reference vector. In this case, these solutions with small niche counts will have more changes to be selected, which improve the diversity performance in solving MaOPs. RVEA [50] uses a set reference vectors to partition the objective space into a number of subspaces and the angle penalized distance is calculated in each subspace for one solution. Then, a reference vector adaptation strategy is adopted to deal with scaled problems with no-well normalized objective functions for MaOPs.
Existing references describe that uniformly distributed reference vector paly the key role which contribute algorithm to achieving the good convergence and distribution. Motivated by MSOPS-II, MOEA/D and RVEA, the selection pressure is improved by calculating the vector angles distance scaling between these reference vectors and solutions. In addition, a aggregation function based on vector angles distance scaling is applied into the mating selection for improve the selection pressure and the quality of offspring population. Moreover, a well distributed reference vectors in the whole objective space is necessary to adjust adaptively for various problems.
The framework of MaOEA-VADS algorithm
Framework of MaOEA-VADS aglorithm
The main framework of the proposed MaOEA-VADS is given in the Algorithm 1, where the population size N, the number of objectives M and the maximal number of iterations t max are given. Firstly, we initialize a population P0 with N individuals randomly and a set of unit reference vectors. In the main loop, a tournament selection is applied to select some promising solutions for generating offspring O t . Then, environmental selection is performed to select a new population Pt+1 from the union of current population P t and their offspring Qt+1. The main loop of MaOEA-VADS is similar to most MaOEAs, but the proposed algorithm is improved in mating selection and environmental selection.
1:
2: Initializes the population P0 with N individuals Initializes a set of unit reference vector (W0 = w1, w2, . . . , w N )
3: [P0, r]= Environmental selection(P0,W,N)
4:
5:
6: Qt+1= Create offspring(P t , N, r)
7: R t = P t ∪ Qt+1
8: [Pt+1, r]= Environmental selection(R t ,W,N)
9: t = t + 1
10: if mod (t, f r * t max ) =0
11: W = W0 * (z max - z min )
12:
13: end
14:
Initialization procedure
The initialization procedure of MaOEA-VADS includes the initial population P0, a set of reference vectors W. In this paper, the initial population is randomly sampled from the decision space using a uniformly distribution. Then, a set of uniform distributed reference vectors ia generated. In most existing algorithms, a systematic approach from Das and Dennis’s method is used to generate reference vectors. The number of reference vectors is N =
Environmental selection
Environmental selection is applied to survive N better candidate solutions to regenerate new population for the next generation from the union of the current parent population and offspring population. In this situation, the selected candidate solutions should have the good convergence and diversity performance. However, it is not easy to maintain convergence and diversity in solving high dimensional problems. In this paper, we propose vector angle distance scaling method based on reference vectors. The procedure of environmental selection is listed in Algorithm 2.
1:
2: R t = P t
3:
4: R t = P t ∪ Q t
5:
6: R t = P t ∪ Q t
7:
8:
9:
10: Scale each column by the minimum value found S1 (f)
11: Scale the rows which have the minimum value by the second minimum value (S1 (f))
12: Calculate the aggregate fitness r of each solutions
13: [Pt+1, r′]=VADS-based selection(S1, W, N)
14:
In the proposed MaOEAs-VADS, environmental selection operation is based on a set of reference vector to calculate the fitness to select quality solutions with good convergence and diversity. Firstly, the offspring population and the parent population are combined to form set R t . Then, the metric of the vector angle distance scaling between each solution and vector is calculated. After obtaining the vector angle distance scaling metric S, the improved associated metric S1 is generated, and the aggregate fitness r of each solution are calculated. In Algorithm 2, lines 10 through lines 12 describe the calculation process. First of all, we find the minimum value and index for each reference vector, and all the value of each reference vector are scaled by the minimum value found. Then, the rows which have the minimum value is scaled by the second minimum value, and the scaled metric S1 is obtained. Secondly, the final aggregate fitness r can be found by the minimum scaled value. Finally, the next generation population will be selected based on the scaled metric S1, and the detail selection process is shown in Algorithm 3.
1: Set Pt+1 =∅, r′ =∅
2:
3: [temp, index] = min S1 (f)
4: Find a solution closest to the reference vector x i based on index and temp
5: Pt+1 = Pt+1 ∪ {x i }
6: Delete the reference vector corresponding to the solution x i and reference vector w j , and get the new S1
7: Aggregate values corresponding to selected individuals are also selected to be retained
8: r′ = r′ ∪ {r i }
9:
10:
In Algorithm 3, the selection operation of the population is carried out based on the scaled vector angle distance scale value(VADS). Vector angle distance scaling(VADS) has been introduced in [54]. The VADS value is calculated by the following formula:
In this formula, the numerator is the magnitude of the vector of objectives |F| and the denominator is the cosine value of the angle between the objective vector and reference vector, where the constant factor b is contributed to scaling the ability of the angle cosine. The distance along the objective vector is assigned as the direction of algorithm convergence. The magnitude of angle cosine can not only affect the VADS value, but also represent the distance between a solution and the reference vector, so as to show the diversity between solution sets. In Fig. 1, an example showing how to associate a solution with a reference vector is listed.

Example showing how to associate a solution with a reference vector.
In this example, solution A and solution B are two objective vector, and w i is a reference vector. θ1 and θ2 are the angles between w i and A and B, respectively. Since θ2 < θ1, the solution B is associated with reference vector w i . Low θ for a solution represents the a high angle cosine value, then the low VADS value will be obtained. Therefore, low VADS value may lead to a solution with good convergence and diversity.
Firstly, the solution will be found in S1 based on the minimum VADS value, the solution x i will be stored into next generation population Pt+1 and the corresponding reference vector w j will be deleted. Then the aggregate fitness corresponding to the selected individual is selected to generate the offspring population. This process ensures the selection of the solution for the corresponding VADS value of each vector and indicates the solution with good convergence and distribution. In this situation, this selection process is carried out until the population size |Pt+1| is satisfied to the population size N.
After the initialization is completed, the main loop begins to renew the population in each generation. In this paper, a tournament selection based on the new aggregate fitness r is designed to select better parent individuals. A low r value represents the solution with good convergence and diversity so as to improve the quality of parent generation matching and effect the quality of offspring population. Then, the simulated binary crossover(SBX) and the polynomial mutation(PM) are used to generate the offspring population [7]. The simulated binary crossover(SBX) is calculated as follows:
Besides, polynomial mutation(PM) is used to enhance the diversity and the formula is as follows. δ is calculated as follows:
The offspring individual is generated when a random ind ≤ p
m
is true, where p
m
= 1/n is mutation probability.
Test problems
The thirteen test problems incorporate with two test suits: Deb-Thiele-Laumanns-Zitzler(DTLZ) [55] and Walking Fish Group(WFG) [56] are applied to conduct the experiments. The DTLZ test suite consists of four problems (DTLZ1-4) while WFG test suite consists of nine problems (WFG1-9). In addition, the test problems needs to set a certain number of objectives and dimensions during the experiment, so the number of objectives m, the decision variables n and the parameters k and l and reference point z ref are listed in Table 1. Therefore, using such a set of test problems can reflect the universality of the algorithm.
The characteristics and parameter settings of 13 test problems
The characteristics and parameter settings of 13 test problems
Some metrics are needed to evaluate the quality of the optimization algorithm. In the existing research, two metrics, the inverted generation distance (IGD) [57] and Hypervolume(HV) [24], are widely used performance metrics, which can measure both convergence and diversity.
Inverted Generation Distance (IGD) measures te average Euclidean distance from the a set of uniformly distributed points across PF to the closet solution in its approximation set.
Hypervolume (HV) is defined as tge volume of the hypercube enclosed in the objective space by the reference point and every vector in the Pareto approximation set A. This is mathematically defined as:
In this section, we set the same parameters including the number of objectives m, population size N, the maximal number of generation t max , the parameters of generating uniformly distributed reference vectors H1 and H2 in Table 2.
Parameters setting of algorithm running
Parameters setting of algorithm running
Besides, we employ the standard toolkits to implement the peer algorithms and use their original study to set the parameters of compared algorithms. Parameters include neighborhood size T, crossover probability p c , mutation probability p m , distribution index η c of simulated binary crossover (SBX), distribution index η m of mutation operation. Then, some other special parameters are also indicated in Table 3.
Parameter settings of compared algorithms
In this section, aiming to demonstrate the superiority of MaOEA-VADS implementation required for convergence and diversity in dealing with many-objective problems, the proposed algorithm is to be validated to compare with MOEA/D [43], MOMOBI-II [29], NSGA-III [47], RVEA [50] and RPD-NSGA-II [51] on DTLZ and WFG benchmark test problems. In addition, we analyze the experimental results from performance comparisons on many-objective test problems and parameter b.
Performance comparisons
The analysis of data
Firstly, we run all the algorithms 20 times for each problem with different number of objectives independently and compare the performance metrics of the algorithms. Table 4 and 6 list the mean and standard deviation of IGD and HV respectively of the six algorithms on DTLZ and WFG problems suits. In addition, Wilconxon’s rank sum test [58] is carried out between MaOEA-VADS and each compared algorithm on each test problem at 0.05 significance level, which is aimed to investigate whether MaOEA-VADS is superior to compared algorithms in a statistical sense. In this case, +, ≈ and - is applied to indicate that MaOEA-VADS is superior to similar to and inferior to the compared algorithm in the corresponding column, respectively. In the statistical, the minimum IGD and maximum HV are recorded, where the best result based on the mean values for each test problem is highlighted in boldface and Bold italic font represents that the indicator value of the compared algorithm is superior to MaOEA-VADS when there is no significant difference.
Comparison of the MOEAs on IGD value with means and standard deviations for DTLZ and WFG benchmark with 4 different dimensions
Comparison of the MOEAs on IGD value with means and standard deviations for DTLZ and WFG benchmark with 4 different dimensions
Comparison of the MOEAs on HV value with means and standard deviations for DTLZ and WFG benchmark with 4 different dimensions
The comparison statistical summary of MaOEA-VADS and five algorithms on the DTLZ and WFG test problems
Comparison of the MOEAs on HV value with means and standard deviations for WFG benchmark with 4 different dimensions
Besides, the comparison statistical summary of MaOEA-VADS and five algorithms on the DTLZ and WFG test problems in Table 8. In this table, the number of test problems where MaOEA-VADS is significantly better than, similar to and worse than the compared algorithms is recorded and the statistics of the best results for each algorithm in all test problems are described. As an be seen from Table 8, the optimal value of MaOEA-VADS occupies half or more of the quantity, and has obvious advantages compared with other algorithms.
From Table 4, it can be seen that MaOEA-VADS obtained 33 best IGD values out of 52 test problems, lager than the IGD values obtained by the compared algorithms(i.e., MOEA/D, MOMOBI-II, NSGA-III, RVEA, and RPD-NSGA-II got the number of best result are 1, 5, 4, 9, 0, respectively). The penultimate row of Table 4 records that MaOEA-VADS shows significantly better performance than any the compared algorithm over more than half of the test problems. From Table 6, it can be seen that MaOEA-VADS has achieved 23 best HV values out of 52 test problems, lager than the HV values the other algorithms(i.e., MOEA/D, MOMOBI-II, NSGA-III, RVEA, and RPD-NSGA-II got the number of best result are 2, 10, 6, 8, 3, respectively). In addition, MaOEA-VADS performs significantly preponderance than any the compared algorithm. Therefore, the results of IGD and HV value statistics prove that the algorithm has certain competitiveness when dealing with high-dimensional problems.
DTLZ1 is characterized by linear and multi-modal, and its biggest challenge is to jump out of multiple fronts and converge to the true front. As evidenced by statistical results of IGD and HV in Table 4 and Table 6, MaOEA-VADS algorithm does not show significant competitiveness over other algorithms especially compared to RVEA. In HV statistics, MaOEA-VADS performs better than MOEA/D for the four objectives, better than NSGA-III for 8- and 10-objective and similar to MOMOBI-II for 8- and 10-objective but worse than MOMOBI-II and NSGA-III for 3- and 5-objective. However, MaOEA-VADS shows the poor performance in dealing with DTLZ1 when compared with RVEA and RPD-NSGA-II. DTLZ2 is a simple concave problem. For the statical results in Table 4, MaOEA-VADS shows the competitive performance in IGD statistics than the compared algorithms for 3-, 5-, 8- and 10-objectives expect RVEA for 5- and 10-objective. However, for the statical results in Table 6, MaOEA-VADS obtains the HV mean values that are all superior to and similar to other algorithms for 3-, 5-, 8- and 10-objectives. What’s more, the best results come from MaOEA-VADS expect 3-objective. The statistical results indicate that MaOEA-VADS performs well in convergence and diversity for dealing with concave problems. Concerning DTLZ3, which is featured by concave and multi-modal. Jumping out of local optimality is a great challenge for all algorithms to solve this problem. As shown in Table 4 and Table 6, MaOEA-VADS behaves the superior performance for the four objectives than other algorithms expect RVEA for 8- and 10-objective. On the remaining problems, MaOEA-VADS presents the advantage over the compared algorithms, which indicates that MaOEA-VADS has certain advantages in handling concave and multi-modal problem. DTLZ4 is characterized by concave and biased, where the challenge lies in how to get the biased solutions. In Table 4, MaOEA-VADS shows the superior performances than the other algorithms but worse than RVEA for 5-objective and 10-objective. In Table 6, MaOEA-VADS shows the superior and similar performance to compared algorithms. These results indicate that MaOEA-VADS obtains the competitiveness for solving concave and biased PF.
WFG1 is introduced to test the ability of algorithm to solve mixed and biased PF. As shown in In Table 4, MaOEA-VADS shows inferior performance than other five algorithms for 5-, 8- and 10-objective. In Table 6. MaOEA-VADS behaves the superior performance to NSGA-III and RVEA but inferior to MOEA/D and MOMOBI-II for 3-, 5- and 10-objective. In general, MaOEA-VADS still need to be improved in handling the WFG1 problem. Regarding WFG2, which is a disconnected test problem. MaOEA-VADS displays the competitive performance than other algorithms for four objective expect NSGA-III 3-objective in Table 4. As indicated by the statistical results in Table 6, MaOEA-VADS performs superior and similar performance than compared algorithms but inferior to NSGA-III for 3- and 10-objective and RPD-NSGA-II for 5-objective. Overall, MaOEA-VADS algorithm can be quite competitive in dealing with discontinuous problems. However, as for WFG3, MaOEA-VADS behaves worse performance than MOEA/D, MOMOBI-II, NSGA-III and RPD-NSGA-II and superior to RVEA. Form this results, MaOEA-VADS have yet to be improved in dealing with liner and non-separable problems. WFG4 is recommended to test the ability of algorithm to deal with concave and multi-modal problem. As shown in Table 4, MaOEA-VADS displays the superior and similar performance to compared algorithms. And in Table 6, MaOEA-VADS shows that almost all of these problems are superior to other algorithms expect for RDP-NSGA-II for 10-objective. Considering WFG5, which is a concave and deceptive test problem. As shown in Table 4 and Table 6, MaOEA-VADS behaves the better results than other algorithms and only slightly worse than RVEA in HV data statistics for 8- and 10-objective. WFG6 is introduced to concave and non-separable PF. As recorded in Table 4, MaOEA-VADS performs better than MOEA/D, MOMOBI-II, RVEA and RPD-NSGA-II, but it shows the worse performance than NSGA-III for 8-objective. In Table 6, MaOEA-VADS shows similar performance than MOMOBI-II for 3-objective and all of the four objectives are superior to other algorithms. Regarding WFG7, which is adopted to test ability of algorithm to solve concave and biased PF. As shown by the statistical results in Table 4 and Table 6, MaOEA-VADS performs the best results for 3-, 5-, 8- and 10-objective than other algorithms and obtains the best results for four objectives. WFG8 is featured by concave, biased and non-separable, MaOEA-VADS only shows the worse HV value than MOMOBI-II for 3-objective in Table 6. WFG9 is characterized by concave, biased, deceptive and multi-modal. As shown in Table 4 and Table 6, MaOEA-VADS achieves competitive performance for 3-, 5-, 8- and 10-objective than other algorithms, which indicate the proposed algorithm has good convergence and distribution when solving the problem of high dimensional correlation.
The statistical results can only be compared from the accuracy of the algorithm, but it is difficult to see the convergence process and distribution of the algorithm. Therefore, we analyze the figures about Pareto front and convergence of six algorithms in some distinct problems. In the process of drawing the convergence graph, the x-coordinate is the maximum iterative algebra, and the y-coordinate is log(IGD) and log(HV), which is used to make the graph clearer.
For 3-objective test problems, Fig. 2 lists the optimal solutions obtained by six algorithms on DTLZ3 and WFG5 for the 3-objective. As can be seen in Fig. 2 (a)-(f), the obtained PF of MaOEA-VADS on DTLZ3 is significantly more uniform, and the distribution of solutions is better. However, in Fig. 2 (g)-(l), the Pareto fronts of MaOEA-VADS, NSGA-III and RVEA show similar distribution, and the distribution of solutions is relatively uniform. Fig. 3 and Fig. 4 list the representation of IGD and HV convergence trend by six algorithms on partial test problems for 3-objective. MaOEA-VADS obtains the best IGD accuracy on DTLZ1, DTLZ2, DTLZ3, DTLZ4, WFG1, WFG4, WFG5 and WFG7. However, MaOEA-VADS obtains the best HV accuracy on DTLZ2, DTLZ3, DTLZ4, WFG1, WFG5 and WFG7, the second best values on WFG4 and WFG6. Although MaOEA-VADS algorithm may not have the fastest convergence speed in the process of convergence, but it has more advantages in convergence accuracy, which indicates the possibility of this algorithm in dealing with the 3-objective problems.

The representation of the optimal solutions obtained by each algorithm on the 3-objective DTLZ3(a)-(f) and WFG5(g)-(l).

The representation of IGD convergence trend by each algorithm on the 3-objective of partial functions.

The representation of HV convergence trend by each algorithm on the 3-objective of partial functions.
For 5-objective test problems, Fig. 5 lists the optimal solutions obtained by six algorithms on WFG2 (a)-(f) and WFG8 (g)-(l) for the 5-objective. MOEA/D and MOMOBI-II are obviously very poorly distributed, MaOEA-VADS performs the similar convergence and distribution to RVEA. Fig. 6 and Fig. 7 list the representation of IGD and HV convergence trend by six algorithms on partial test problems for 5-objective. MaOEA-VADS obtains the best IGD accuracy on DTLZ1, DTLZ2, DTLZ4, WFG2, WFG7, WFG8 and WFG9 and achieves the best HV accuracy on DTLZ2, DTLZ4, WFG4, WFG5, WFG7, WFG8 and WFG9. Although the convergence accuracy of the two indicators is different on the same problem, which is caused by the different calculation methods, it also indicates that the algorithm has good convergence. In addition, as can be seen from Fig. 6 and Fig. 7, MaOEA-VADS also has a certain advantage in convergence rate for WFG problems.

The representation of the optimal solutions obtained by each algorithm on the 3-objective WFG2(a)-(f) and WFG8(g)-(l).

The representation of IGD convergence trend by each algorithm on the 5-objective of partial functions.

The representation of HV convergence trend by each algorithm on the 5-objective of partial functions.
For 8-objective test problems, Fig. 8 shows the representation of optimal PF by six algorithms on DTLZ4 (a)-(f) and WFG7 (g)-(l) for 8-objective. It is obvious that the distribution of RPD-NSGA-II is more uniform than MOEA/D, MOMOBI-II, NSGA-III and RVEA, it is still slightly worse than MaOEA-VADS. As shown in Fig. 9, MaOEA-VADS achieves the best IGD accuracy on DTLZ4, WFG2, WFG7, WFG8 and WFG9. RVEA converges the fastest on DTLZ1 and DTLZ2 but with the same accuracy as MaOEA-VADS. As shown in Fig. 10, MaOEA-VADS displays significant superiority in HV accuracy on WFG4, WFG6, WFG7 and WFG9. For 10-objective test problems, Fig. 11 lists the optimal PF on DTLZ2 (a)-(f) and WFG10 (g)-(l) for 10-objective. Fig. 11(a) and (g) shows that the distribution of DTLZ2 and WFG9 in MOEA/D was poor. MOMOBI-II, NSGA-III, RVEA and RPD-NSGA-II can obtain the relatively uniform solutions when solving DTLZ2, but some solutions have not converged to the true Pareto front when solving WFG9. However, MaOEA-VADS performs the good convergence and distribution on DTLZ2 and WFG9. As can be seen from Fig. 12, the convergence accuracy and convergence rate of MaOEA-VADS are the highest on WFG2, WFG7, WFG9. For the problems of DTLZ1, DTZL2, DTLZ3 and DTLZ4, MaOEA-VADS can maintain the accuracy of RVEA, but it is inferior to the convergence speed. As shown in Fig. 13, MaOEA-VADS obtains the best HV accuracy on WFG7 and WFG9. In the course of convergence diagram of DTLZ2, DTLZ4, WFG2, WFG5 and WFG6, MaOEA-VADS show slightly worse convergence speed, but it is superior to other algorithms. For WFG5, RVEA is superior to MaOEA-VADS and RPD-NSGA-II in convergence accuracy, but inferior to them in convergence speed.

The representation of the optimal solutions obtained by each algorithm on the 3-objective DTLZ4(a)-(f) and WFG7(g)-(l).

The representation of IGD convergence trend by each algorithm on the 8-objective of partial functions.

The representation of HV convergence trend by each algorithm on the 8-objective of partial functions.

The representation of the optimal solutions obtained by each algorithm on the 3-objective DTLZ2(a)-(f) and WFG9(g)-(l).

The representation of IGD convergence trend by each algorithm on the 10-objective of partial functions.

The representation of HV convergence trend by each algorithm on the 10-objective of partial functions.
To sum up, we can conclude that MaOEA-VADS has greater potential when dealing with higher dimensional problems and can achieve good convergence and distribution in most problems. However, according to the law of no free lunch [59], MaOEA-VADS algorithm is not guaranteed to be superior to other algorithms on all problems. At least for most problems, the proposed algorithm obtains more best results.
In this experiment, the parameter b ia a key factor for the calculation of vector angle distance. Therefore, in order to study the sensitivity of performance to the parameter b, WFG1-WFG9 characterized by different types have been taken to verify the impact of parameter b. The mean IGD and HV of 20 independent runs with varying the parameter value b=10, 50, 100. In order to observe the impact of b on the performance of the algorithm, Table 9 shows the mean IGD and HV obtained using different values of b for WDFG1-WFG9.
In this experiment, b = 10 has the significant advantage in solving WFG1. For WFG2, b = 100 obviously obtains the best results in both IGD and HV, but b = 100 performs the worst performance for WFG3. Due to the different calculation methods of indicators, b = 50 has a certain advantage in IGD statistics, while b = 10 occupies the best value in 8- and 10-objective for WFG3. Considering WFG4, b = 50 produced the best results in IGD statistics for 5- and 8-objectives, but it performs slightly worse than b = 10 for 3-objective and b = 100 for 10-objective. Taking into WFG5 and WFG6, b = 10 shows the superior performance for 3-objective in both IGD and HV, b = 50 only performs the best values for 10-objective in IGD statistics. In view of WFG7, b = 50 shows the best IGD value for 3-, 5- and 8-objective and obtains the better HV value for 5- and 10-objective. However, b = 50 performs worse than b = 10 for WFG8 but better than b = 100. Then, in consideration of WFG9, b = 50 obtains the best IGD value for 5- and 8-objective and the best HV values for 3-, 5- and 10-objectives.
In the last row of Table 9, we list the number of problems that have the best results out of 36 functions. As can be seen from the statistical results, the number of the best IGD and best HV for b = 50 are 14 and 16, respectively. However, the number of the best IGD and best HV for b = 10 are 15 and 11, respectively. In order to filter b = 100 and b = 50 clearly, we counts the number of superior solutions and inferior solutions between b = 50 and b = 100 and b = 10, respectively. In the Table 9, +/- represents that b = 50 is superior to and inferior to other parameters, respectively. As can be seen from the penultimate row in the Table 9, there are 29 and 21 IGD values better than b = 100 for b = 50, and there are 27 and 25 HV values better than b = 100 for b = 50. Therefore, through the overall data analysis, we finally choose b = 50 as the VADS calculation, even though b = 100 is the common value in this article [54].
Conclusions and future work
In order to deal with MaOPs with various types of Pareto fronts, a many-objective evolutionary algorithm based on vector angle distance scaling is presented, termed MaOEA-VADS. It contains three main improvements, i.e., calculate the new fitness value of vector angle distance function and scaling the fitness in environmental selection, the final aggregate fitness update the population. In this experiment, the calculation of vector angle distance reflects the convergence and diversity criterion and the reference vectors are adopted to guide solutions towards to accurate Pareto front. Then, scaling fitness of vector angle distance process is a compact analytic method to obtain better solutions. Besides, a final aggregate fitness will be applied into mating selection, which improve matching process to select better parent generation to produce quality offspring. What’s more, vector angle distance scaling strategy guarantees the effectiveness of the proposed algorithm on MaOPs. MaOEA-VADS is compared with five algorithms, i.e. one classical MOEAs and four MaOEAs. Experimental results have demonstrated that MaOEAs-VADS outperforms other compared algorithms on overall 52 test problems. Therefore, MaOEA-VADS is a promising and versatile algorithm with good convergence and diversity.
It is worth mentioning that MaOEA-VADS is a promising algorithm to solve MaOPs. However, it is a hindrance when handling PFs with complicated mixed geometries, non-separable and deceptive(e.g. WFG1, WFG2 and WFG3). In the future, we would like to focus on dealing with more complex PF shapes.
Footnotes
Acknowledgment
The authors would like to thank the editor and reviewers for their helpful comments and suggestions to improve the quality of this paper. This work was supported by National Key Research and Development Project under Grant 2018YFC1602704, and National Natural Science Foundation of China under Grant 61873006.
