Abstract
Differential evolution (DE) is one of the most effective ways to solve global optimization problems. However, considering the traditional DE has lower search efficiency and easily traps into local optimum, a novel DE variant named hybrid DE and simulated annealing (SA) algorithm for global optimization (HDESA) is proposed in this paper. This algorithm introduces the concept of “ranking” into the mutation operation of DE and adds the idea of SA to the selection operation. The former is to improve the exploitation ability and increase the search efficiency, and the latter is to enhance the exploration ability and prevent the algorithm from trapping into the local optimal state. Therefore, a better balance can be achieved. The experimental results and analysis have shown its better or at least equivalent performance on the exploitation and exploration capability for a set of 24 benchmark functions. It is simple but efficient.
Introduction
Global optimization problems are very common in the process of theoretical research and practical application. Most of the traditional analytical optimization methods need to calculate the derivative value of the objective function and some other auxiliary information to determine the search direction. What’s more, there are many restrictions on the convexity and linearity of the function. However, in science or engineering practice, the objective function is not always differentiable, or the problem simply cannot be expressed by a function. Therefore, many of the optimization problems cannot be addressed well by traditional methods. Since the 1980 s, intelligent optimization algorithms have developed rapidly. Especially, evolutionary algorithms (EAs) have become increasingly popular due to it is derivative-free mechanism and excellent optimization ability to avoid local optimum. Differential evolution (DE) [1, 2] is one of the most effective EAs to solve continuous global optimization problems, whether the function is convex or non-convex, which was proposed by Storn and Price in 1995. The general idea of DE is to generate fresh individuals from the randomly generated initial individuals after each mutation and crossover operation, and then compare their fitness values with the previous ones, and the better ones are selected to save as offspring. Continue to iterate according to this process, and ultimately the optimal solution is obtained. The simple coding, high optimization efficiency and excellent robustness of the DE algorithm make it one of the most frequently used algorithms for solving various test and practical problems in diverse domains, such as chemical engineering [3], image processing [4], power engineering [5, 6], industrial engineering [7], electrical engineering [8] and so on.
However, when solving some complex optimization problems with high dimensions, the traditional DE is likely to appear premature and trap into the local optima. Besides, the convergence rate of DE is not high enough, so that it will affect the versatility of DE [9, 10]. In response to the two shortcomings mentioned above, various work about DE variants have been done to improve the performance of the traditional DE, which are mainly reflected in the following aspects: the setting of control parameters (i.e., population size NP, the crossover rate C r and the scaling factor F), the improvement of evolution strategies (i.e., mutation, crossover and selection operators) and the application of hybrid algorithms.
Plenty of work can be found in the literature on finding out the proper control parameters settings to improve the versatility of DE. In order to obtain a good solution, Storn and Price suggested that NP should be 5D - 10D, F should be 0.5 and C r should be 0.1 or 0.9, where D represents the dimension of the problem [2]. Gamlerle et al. [11] believed that NP should be selected from 3D to 8D, F should be set to 0.6 and C r should lie in the interval [0.3, 0.9]. Ronkkonen et al. [12] recommended that NP should be selected from 2D to 4D, F should lie in the interval [0.4, 0.95] with the initial value of 0.9, and C r should lie between [0, 0.2] for a separable function while for multimodal or non-separable function it should be in the range of [0.9, 1]. Zielinski and Laur [13] suggested that the setting F and C r ⩾ 0.6 are good choices to obtain good results in many cases. In summary, it is not difficult to find that researchers mentioned above have reached a consistent conclusion that F should lie between [0.4, 1], C r should be either close to 1 or 0 [14]. As for the setting of NP, there is no consensus yet.
Many attempts have also been made on the modifications of evolution strategies to improve the convergence rate and population diversity. According to the order of DE steps, the first is mutation operation, which is also the core step of DE. Zhong et al. [15] proposed a novel DE variant with a fresh mutation operator which took the influence of reference individuals into account. Yu et al. [16] raised a novel adaptive mutation operator for DE, which was adaptive only when the individuals clustered around the local optimum. Tan et al. [17] put forward a new DE variant equipped with an adaptive mutation operator based on the fitness landscape. Mezura-Montes et al. [18] introduced a new mutation operator adopting the information of the individual who performed best in the current population to generate a new trial vector. Biswas et al. [19] introduced a fresh mutation operator called parent-centric mutation based on neighborhoods to decrease niching behavior in DE. Sun and Cai [20] used a dynamic neighborhood learning (DNL) aiming at enhancing the efficiency of mutation operator. Mohamed [21] proposed a triangular mutation operator, where three vectors were randomly selected, and the difference between best and worst vector was added to its mutation rule. Kaelo and Ali [22] presented a mutation operator that applied the attraction-repulsion idea of electromagnetism to enhance DE performance. Das et al. [23] adopted the concept of neighborhood to improve the “current-to-best/1” strategy by using “target-to-best/1” strategy. Zhang et al. [24] introduced the “current-to-pbest/1” strategy, where the top 100 p % individuals have the possibility to enter into the next generation. Lu et al. [25] proposed an improved elite archive mutation operator according to the quantitative research on chromosome variability as well as population diversity.
Crossover operation is the next step, where a new trial vector is created. Zaharie [26] was committed to the study of comparing binomial and exponential crossover operator. Zhao and Suganthan [27] proposed that the exponential crossover operator was better for optimization problems with high dimensions. Wang et al. [28] introduced a novel crossover operator named orthogonal crossover operator. Fan et al. [29] proposed an enhanced DE variant with adaptive crossover strategies. Deng et al. [30] introduced a new crossover operator with Jrand number decreasing mechanism and feedback guide technique.
The selection operation is the last step of the evolution process. Since the selection operation is a special step that makes it different from other algorithms, there are not many variants based on this step. Yi et al. [31] introduced a p-best selection operator to improve population diversity. In this operator, the p-best of the offspring with a better fitness value is suitable for exploitation. Liang et al. [32] proposed a distance-based elite selection mechanism.
In addition to the above modifications, hybridization is increasingly popular. Arab and Alfi [33] hybridized memetic algorithm (MA) with local search (LS) called MA-ALS to improve the convergence rate and speed. Mousavi and Alfi [34] proposed a hybrid algorithm based on fractional particle swarm optimization (FPSO) and memetic algorithm (MA) called FPSOMA to solve optimization problems. Moreover, many hybridizations based on DE and other promising global optimization algorithms have been proposed to improve the performance of traditional DE. Zhang et al. [35] proposed a new hybridization of DE and cuckoo search (CS) called CSDE. In CSDE, the population are divided into two groups and adopted DE and CS respectively to exchange information and make full use of the advantages of each other. Shimpi Singh Jadon et al. [36] proposed a HABCDE algorithm which is a hybridization of the artificial bee colony algorithm (ABC) and DE. The onlooker bee operator of ABC adopted the DE/best/1/bin evolution strategy of DE in HABCDE. Liu et al. [37] proposed a hybrid whale optimization algorithm (WOA) enhanced by Lévy flight and DE (WOA-LFDE), which employed the “DE/current-to-pbest/1” strategy to help WOA to escape from the local optima and improve the robustness of WOA. Babak Mohammadi et al. [38] proposed a MLP-PSODE algorithm. In MLP-PSODE, multi-layer perceptron (MLP) is hybridized with PSO and then, adding DE.
In the search process, simulated annealing (SA) accepts inferior solutions with a certain probability that decreases as the temperature reduces and eventually approaches to zero [39]. Through affecting the probability, the temperature can control the algorithm’s progress. In the beginning, the random search accounts for more, and as the temperature is reduced, the greedy search starts to prevail. This feature of SA can effectively avoid the procedure trapping into the local optima and ultimately achieve the global optima. Furthermore, the strong local exploitation capability of SA algorithm is conducive to improving the convergence rate of DE. These advantages urge many scholars to choose SA as an appropriate algorithm to hybridize with DE to enhance its performance. Hu et al. [40] proposed a SaDESA algorithm, which is based on self-adaptive DE and SA. Olenšek et al. [41] introduced a parallel algorithm based on DE and SA, which used a greedy strategy to accept individuals in local search and Metropolis criterion in SA to accept individuals in the rest. Zhao et al. [42] proposed a iSADE which adopted the idea of SA to jump out of the possible local optimum attraction. Zeng et al. [43] introduced a hybrid algorithm using the Metropolis criterion in SA to further optimize the solution after the selecting operation of DE, aiming at improving the global optimization ability. Assadi et al. [44] proposed a hybrid DE with population-based SA. He et al. [45] employed SA algorithm to the mutation operator of DE by using an adaptive selection mechanism where the vectors were chosen adaptively according to their historical performance. Mirsadeghi et al. [46] proposed a novel hybrid algorithm, where the concept of temperature in SA was utilized to strike a balance between the exploration and exploitation capability.
Although a lot of work on improving the performance of DE has been done, the defects such as complex coding, excessive control parameters, and inability to balance the exploration and exploitation capability prompted us to continue to study the improvement of DE performance. It is also observed that most of the work mentioned above focuses on the improvement of mutation operations, whereas few variants on selection operations. Moreover, as mentioned previously, hybrid DE with SA can make full use of the advantages of both algorithms to improve the global and local searching ability at the same time. Based on these considerations, this paper proposes a hybrid DE and SA algorithm for global optimization (HDESA). Concerning the previous research, the control parameter F and C r are respectively set to 0.5 and 0.9 in this algorithm, and mutation operator and selection operator are modified simultaneously. The coding of the algorithm is very simple, and it is relatively efficient. First of all, the concept of ranking is added when the parent individuals are randomly selected in the mutation operation. The probability of being selected for each individual is related to the ranking of their fitness values at the current generation [47]. The higher the ranking of the vector, the more likely it is selected as the parent vector, which improves the exploitation ability of DE. Furthermore, to strike a balance between the exploration and exploitation capability of DE, considering the Metropolis criterion in SA can primarily prevent premature convergence, our algorithm replaces the greedy selection operator in DE with the accept with a certain probability one in SA. In other words, Metropolis criterion is used to determine whether to accept the trial vector after mutation operation. This modification aims to increase the population diversity as well as improve the exploration capability. According to the experimental results, HDESA can primarily improve the performance of DE.
The rest of paper is organized as follows. The basic procedures of DE and SA are briefly introduced in Section 2. Section 3 gives the novel DE variant HDESA. Experimental results and analysis are presented in Section 4. At last, Section 5 draws the conclusion from this paper.
Differential evolution (DE) and simulated annealing (SA)
In this section, the basic steps of traditional differential evolution (DE) and simulated annealing (SA) are introduced.
Differential evolution (DE)
For the optimization problem:
Where D denotes the dimension of the solution space,
(1) Generate initial population
The initial population
Where xj,i (0) represents the j th gene of the i th individual of the 0 th eneration and NP is the population size.
(2) Mutation operation
At each generation g, a mutant vector v
i
(g) is created in this operation based on the parent population x
i
(g) according to:
The above strategy is the most classic mutation strategy named DE/rand/1 [1, 2]. Besides, the mutation vectors can also be generated in some other common ways:
DE/best/1:
DE/current-to-best/1:
DE/rand/2:
DE/best/2:
Where F represents the mutation scaling factor, x best denotes the best component in the current generation, r1∼ r5 ∈ { 1, 2, …, NP } ∖ { i }, and r1 ≠ r2 ≠ r3 ≠ r4 ≠ r5. DE is good at global searching but a little weak in local searching in that r1 ∼ r5 are randomly selected, especially when the mutation operator does not adopt the best individual (i.e., x best ) [23].
(3) Crossover operation
At each generation g, a trial vector uj,i (g) is created in this operation based on the parent population x
i
(g) and the middle generation v
i
(g):
Where C r denotes the crossover rate, j rand denotes an integer randomly selected between 1 and D.
(4) Selection operation
DE adopts a greedy algorithm to select individuals who enter the next generation:
For each individual, the solution obtained is better or equal to the global optimum achieved by the individual through mutation, crossover, and selection operations [1, 2]. Following Fig. 1 explains the schematic diagram of the traditional DE.

The schematic diagram of the traditional DE.
SA is inspired by the solid annealing process, and the cooling schedule is used to control the algorithm progress [48]. When the control parameter T slowly drops and tends to zero, the algorithm will obtain the global optimal solution. The basic steps of SA are listed as follows: Set the cooling schedule parameters and randomly generate an initial solution x0, where the parameters in the cooling schedule are initial temperature T0, decay function, end temperature T
f
, and Markov chain length L. Perturb the current solution x1 to generate a new solution x2, the increment df = f (x2) - f (x1). If df < 0, then p = 1, else If rand (0, 1) < p, x2 is accepted as the new current solution x1 = x2, else keep the current solution x1, the above judgment criterion is Metropolis criterion. If the termination condition is met, the current solution x1 is output as the optimal solution, else go to step (2) after reducing the temperature.
SA does not depend on the initial value, which means that there is no necessary connection between the solution obtained by the algorithm and the initial solution. Therefore, SA has good robustness. Furthermore, the use of Metropolis criterion improves the exploration ability of SA. To illustrate the method more clearly, a schematic diagram of SA is shown as Fig. 2.

The schematic diagram of SA.
The hybrid differential evolution and simulated annealing algorithm, HDESA, is proposed in this paper. The motivation of the work and the construction process of HDESA are given in this section.
Motivation of the work
Many DE variants add self-adaptive approaches to enhance the performance of DE. Although the performance has been improved to a certain extent, the defects such as complex coding and too many control parameters appear [6, 30]. Similarly, some hybrid algorithms also ignore the simplicity of the code due to the pursuit of algorithm performance. Moreover, most hybrid algorithms have disadvantages such as too many control parameters and dependence on specific problems [45, 46]. It can also be observed that most of the DE variants focus on improving mutation operations, whereas few variants focus on the selection operation, so some of them cannot achieve a balance between exploration and exploitation capability. Based on the considerations mentioned above, a new DE variant named hybrid DE and SA algorithm (HDESA) is proposed to solve continuous global optimization problems. Unlike some other hybrid algorithms, HDESA has a relatively simple coding and has almost no additional parameters. The algorithm introduces the concept of ranking into the mutation operation of DE and adds the idea of SA to the selection operation. The former is to enhance the exploitation capability of DE and increase the convergence rate, and the latter is to enhance the exploration ability and prevent it from trapping into the local optimum, then an excellent balance can be achieved.
Framework of hybrid DE and SA algorithm (HDESA)
HDESA adopts the same initial population generation operation and crossover operation as described in (1) and (3) of the traditional DE steps in Section 2. The details of the mutation and selection operations of our algorithm are described as follows.
Ranking based mutation operation
In the conventional DE, three individuals are randomly selected from the current population, which results in the poor exploitation. According to the selection pressure widely used in the conventional DE, the better the individual, the higher the chance of being offspring. The current population is ranked according to their fitness values. Choosing each vector based on the ranking of their fitness values in the current population instead of selecting it randomly in this step. That is to say, the higher the ranking of the vector, the more likely it is selected as the parent vector [47]. The pseudocode of the ranking-based mutation operator is given in Table 1.
Pseudocode of the ranking-based mutation operator
Pseudocode of the ranking-based mutation operator
(1) Weight assignment
To enhance the local search capability of DE, weights are assigned for each vector according to the ranking of their fitness values. First, based on the fitness value of each vector, sort the individuals in the population in ascending order (i.e., from the best to the worst). Then, calculate the weights of the vectors as follows:
Where NP denotes the population size.
(2) Selection probability
After obtaining the weight for each vector, calculate the selection probability p
i
of the i
th
vector x
i
according to:
According to the above formula, the better the vectors, the higher the selection probability they will get.
(3) Vector selection
After obtaining the selection probability of each vector, another question is, will all of the vectors be selected based on the selection probability? The answer is NO. In this paper, the selection of r1 and r2 is linked to their rankings, whereas r3 should be selected randomly as the traditional DE does in Section 2. The reason why the selection of r3 is not also dependent on its ranking is that the population diversity should be ensured. Otherwise, it may cause the algorithm to premature convergence and easily trap into local optimum. The specific operations for selecting r1 and r2 are as follows: Randomly select different r1, r2∈ { 1, 2, …, NP } ∖ { i }, and calculate their selection probability p
r
1
, p
r
2
; Compare the random number r randomly generated between 0 and 1 with the selection probability p
r
1
, p
r
2
. If p
r
1
, p
r
2
> r, select r1 and r2, otherwise, return to step a.
In the mutation operation, we add the concept of ranking to enhance the local search capability of the algorithm. In this operation, what we need is to avoid premature convergence. The Metropolis criterion of SA can primarily prevent the algorithm from trapping into the local optima and achieve the global optimum by designing a constantly changing probability during the search process. Besides, the competitive local search capability of SA is also helpful to increase the convergence rate of DE. Based on the above considerations, the original greedy selection operator in DE is replaced with the “accept with a certain probability” operator in SA. It is consistent with the cooling progress of SA. The improved selection operation steps are presented as follows:
At each generation g: Calculate the fitness value f (x
i
(g)) and f (u
i
(g)) of the parent vector x
i
(g) and the trial vector u
i
(g) and the increment delta (i) = f (u
i
(g)) - f (x
i
(g)); Compare the value of f (u
i
(g)) and f (x
i
(g)), that is, if delta (i) ⩽ 0, then accept the test vector u
i
(g) into the next generation, otherwise, compare the value of exp (- delta/T) with the randomly generated number r between 0 and 1. If exp (- delta/T) ⩾ rand (0, 1), also accept the test vector u
i
(g) into the next generation, otherwise, keep x
i
(g):
Where T is the current temperature. Besides, it should be noted that to keep this process consistent with the cooling progress in SA, we set DE to de-temperature every 20 iterations. In this way, the population diversity of DE can be increased, and the competitive local search ability of SA can be retained as well. The pseudocode of this operator is given in Table 2.
Pseudocode of SA-based Selection operator
Adding the ranking-based mutation operator and the SA-based selection operator we previously proposed, a hybrid DE and SA algorithm (HDESA) is presented subsequently. The pseudocode of HDESA is given in Table 3. With these two novel operations, DE will perform better.
Pseudocode of HDESA
In this subsection, both time and space complexity of HDESA are presented [49, 50].
Time complexity
If NP is the number of the population, D is the dimension of the solution space, G is the maximum iteration number, then the time complexity of each operation of HDESA is analyzed as follows: The HDESA initializes the population in O(1) time. Mutation operator requires O(NP×(D + log(NP)+1)) time, where population sorting and probability calculation are added. Crossover operator requires O(NP×D) time. Selection operator requires O(NP×(D + 1)) time, where probability calculation is added.
In general, the value of D is larger than the value of log(NP). Thus, the total time complexity of HDESA is equal to O(NP×D×G) for maximum number of iterations G.
Space complexity
HDESA uses the number of population which is NP to calculate the space complexity, and the dimension of the search space is D. Therefore, the total space complexity of HDESA is O(NP×D).
Experimental results
Some experimental results and analysis are given in this section to study the performance of the proposed algorithm HDESA.
Benchmark functions
Twenty-four benchmark functions proposed at the CEC2005 special session were used to study the performance of HDESA. If the number of test instances is smaller, the results may not be comprehensive enough, and the algorithm may be biased towards certain types of problems. More details for each function are given in [51]. These 24 functions can be divided into four groups: Unimodal functions (F1 - F5) Basic Multimodal functions (F6 - F12) Expanded multimodal functions (F13 - F14) Hybrid composition functions (F15 - F24)
In the experimental research of this paper, to compare the proposed HDESA with other algorithms about their performance fairly, unified parameters are set in this study: the population size NP = 50, the dimension of the functions D = 30 and 50, the maximum number of evaluations for each function FES = 500000. Furthermore, the initial temperature in SA involved in this paper T0 = 10000; the de-temperature parameter k = 0.9; the number of iterations at each temperature m = 20 (which is similar to Markov chain length L in SA). More details on parameter settings for algorithms mentioned in this study are given in Table 4.
Parameter settings for algorithms mentioned in this study
Parameter settings for algorithms mentioned in this study
The evaluation of the performance of HDESA is obtained by comparing with other algorithms, which are SA, DE and rank-DE [47].
To avoid the large deviation caused by one run, the number of independent runs of the algorithm is set to 30. Each function performs 30 independent evolutions, and the optimal value obtained in each run is obtained and the mean of these results can be computed. The mean and standard deviation function error values of SA, DE, rank-DE and HDESA over 30 independent runs on 24 benchmark functions at D = 30 and D = 50 are presented in Table 5 and 6. To further analyze the performance of the algorithm, the Wilcoxon signed-rank test at α = 0.05 was adopted to compare the HDESA with the traditional DE, classic SA and rank-DE, respectively. The comparison results can also be obtained in Tables 5 and 6. In addition, the final rankings of the four algorithms mentioned above calculated by the Friedman test for all functions are given in Table 7.
Experimental results of SA, DE, rank-DE and HDESA at D = 30
Experimental results of SA, DE, rank-DE and HDESA at D = 30
* “Mean Error” and “Std Dev” denote the average and standard deviation of the function error values obtained 30 runs, respectively. “+”, “–”, “≈” represent that the performance of our approach is better than, worse than, and the same as that of its competitors according to the Wilcoxon signed-rank test at α= 0.05, respectively.
Experimental results of SA, DE, rank-DE and HDESA at D = 50
* “Mean Error” and “Std Dev” denote the average and standard deviation of the function error values obtained 30 runs, respectively. “+”, “–”, “≈”represent that the performance of our approach is better than, worse than, and the same as that of its competitors according to the Wilcoxon signed-rank test at α= 0.05, respectively.
Average rankings of SA, DE, rank-DE and HDESA according to the Friedman test for all functions
Obviously, HDESA has better performance than the other three algorithms as a whole according to the last three rows of Tables 5 and 6 and the average rankings showed in Table 7. The detailed description is presented as follows: Unimodal functions (F1 - F5). On these five unimodal functions, no matter D = 30 or 50, HDESA performs the best. For D = 30, HDESA is significantly better than DE and rank-DE on three benchmark functions, respectively. SA cannot outperform HDESA on any unimodal function. For D = 50, HDESA outperforms SA, DE and rank-DE on 5, 4 and 5 benchmark functions, which performs even better than it does at D = 30. On the one hand, the possibility of choosing better solutions among the population is increased by using the ranking-based mutation operator, which can speed up the convergence rate when solving unimodal functions. On the other hand, the SA-based selection operator increases the population diversity and avoids over-exploitation. Hence, it can help strike a balance between the exploitation ability and the exploration ability. Basic multimodal functions (F6 - F12). Clearly, on these seven basic multimodal functions, HDESA is the best among the four algorithms at both D = 30 and 50. For D = 30, it outperforms SA, DE and rank-DE on 5, 5 and 6 benchmark functions, respectively. SA and DE outperform HDESA on one function, and rank-DE cannot perform better than HDESA on any problems. For D = 50, HDESA is significantly better than SA, DE and rank-DE on four benchmark functions, respectively. We can also observe that rank-DE outperforms DE only on one test function (1 out of 7) no matter D = 30 or 50, which confirms that the ranking-based mutation operator only may lead to over-exploitation and local optima. Our proposed HDESA adopts SA-based selection operator to balance the exploitation and exploration ability, proving very useful. Expanded multimodal functions (F13 - F14). It is clear that HDESA remains the winner of the four algorithms on these two expanded multimodal functions at both D = 30 and 50. SA, DE and rank-DE cannot outperform HDESA on any test function. Hybrid composition functions (F15 - F24). These functions are far more complicated than other test functions mentioned above because each of them contains ten sub-functions. So, it is challenging to solve them for almost all algorithms. On these ten test functions, HDESA outperforms SA, DE and rank-DE on 7, 8 and 8 tests at D = 30, respectively. The performance of SA, DE and rank-DE can only surpass HDESA on 0, 2 and 3 tests at D = 50. Thus, HDESA is still the winner on the ten hybrid composition functions. The superior performance of HDESA has been further demonstrated here.
With respect to the Friedman test showed in Table 7, the average rankings of SA, DE, rank-DE and HDESA for all functions are presented. From the results, it can be seen that HDESA obtains the best ranking at both D = 30 and 50, which also tells us that HDESA performs best among these four algorithms.
To further clarify the comparison of the HDESA algorithm with other algorithms, the logarithm of mean function error values of SA, DE, rank-DE and HDESA over 30 independent runs on 24 test functions at D = 30 versus the number of FES is plotted in Fig. 3. It can be seen from Fig. 3 that the overall convergence speed of HDESA is faster than the other three algorithms on most functions, and there is no local optimum.

Convergence graph of SA, DE, rank-DE and HDESA on 24 test functions.
Moreover, the search history and trajectory are given in Fig. 4. The first column of Fig. 4 is the benchmark function. The second column of Fig. 4 is the search history, which tells that the solution is searched extensively at the beginning and the optimal solution is exploited at the end. Besides, the third column of Fig. 4 gives the trajectory, from which we can draw the conclusion that the search ability of HDESA is quite strong and the diversity of the population is guaranteed. Finally, the fourth and fifth columns of Fig. 4 prove the high convergence rate of the algorithm once again.

Search history and trajectory of HDESA.
In summary, according to the results and analysis, it is not difficult to notice that the proposed HDESA can vastly improve the performance of DE on various problems, which are either unimodal or multimodal, separable or non-separable, clean or noisy, continuous or non-continuous. This is because HDESA uses ranking-based mutation operator, which speeds up the convergence of the algorithm and introduces the idea of SA in the selection operation. Metropolis criterion is employed to choose the individuals as offspring rather than selecting them greedily, enabling the algorithm to have a strong global search capability without losing its exploitation capability.
Different settings of the number of iterations at each temperature m may influence the performance of HDESA. The m is an important parameter, which may influence the performance of the algorithm. It is necessary to test the influence of the parameter. To verify the performance of HDESA with different value of m, twelve typical benchmark functions are adopted to perform at m = 20 and 25, respectively. The twelve functions selected cover four types of functions, which are unimodal functions, basic multimodal functions, expanded multimodal functions and hybrid composition functions. Table 8 gives the average function error values of HDESA over 30 independent runs on twelve benchmark functions at m = 20 and 25, respectively.
Experimental results of HDESA over 30 independent runs on 12 test functions at m = 20 and 25, respectively
Experimental results of HDESA over 30 independent runs on 12 test functions at m = 20 and 25, respectively
There are no significant differences in the performance of the proposed algorithm HDESA when m takes 20 and 25, which shows that the setting of m has little influence on the performance of the HDESA algorithm. So, we simply set m to 20.
In the ranking-based mutation operation, the higher the ranking of the vector, the more likely it is selected as the parent vector, which can enhance the exploitation capability of DE. However, it may lead to trapping in local optimum for some cases. The Metropolis criterion in SA can effectively improve the population diversity and ultimately achieve the global optimum, replacing the original greedy selection operator in DE. Inspired by this, we propose a hybrid DE and SA algorithm to balance the exploration and exploitation ability, which is simple but efficient. The main innovation points of the algorithm are listed as follows: The concept of ranking is added to the mutation operation to make full use of the information of the better individuals in the current population, enhancing the exploitation ability of DE greatly. The greedy selection operator in DE is replaced with the accept with a certain probability one in SA, increasing the population diversity and balancing the exploitation and exploration capability. Unlike some other hybrid algorithms, HDESA has a relatively simple coding and has almost no additional parameters without losing its efficiency.
The experimental research has been conducted on 24 benchmark global functions proposed at the CEC2005 special session. HDESA was compared with three other algorithms, i.e., SA, DE and rank-DE. The experimental results and analysis have shown its better or at least equivalent performance on exploitation and exploration ability. Besides, the influence of the number of iterations at each temperature m on the algorithm is experimentally studied.
In this paper, we introduce SA-based selection operator to DE algorithm. There remain other effective algorithms that we can utilize to enhance the performance of DE. Thus, we will try to hybrid some other advanced algorithms such as PSO or some ideas from other algorithms with DE to get better results in our near future work.
Footnotes
Acknowledgments
This research was funded by the China Natural Science Foundation (No.71974100), Natural Science Foundation (No. BK20191402) in Jiangsu Province, Major Project of Philosophy and Social Science Research in Colleges and Universities in Jiangsu province (2019SJZDA039), Qing Lan Project (R2019Q05), and philosophy and Social Sciences in Universities of Jiangsu (No. 2020SJA0182).
