Abstract
Teaching-learning-based optimization algorithm (TLBO) is a swarm intelligence optimization algorithm that simulates classroom teaching phenomenon. In order to solve the problem that TLBO algorithm is easy to fall into local optimum and has poor stability, an improved teaching-learning-based optimization algorithm based on fusion difference mutation (IDMTLBO) is proposed. Firstly, adaptive teaching factors are introduced. Secondly, in the teaching stage, each student studies according to the gap between himself and the teacher, which improves the convergence speed and convergence accuracy of the algorithm. Finally, in the learning stage, students are divided into two levels according to their learning level, and two students are randomly selected to improve the iterative equation in the learning stage with the difference mutation strategy, It improves the disadvantage that the algorithm is easy to fall into local optimum. Numerical experiments show that the convergence speed and convergence accuracy of the algorithm are obviously better than TLBO algorithm, DMTLBO algorithm, DSTLBO algorithm.
Keywords
Introduction
Teaching-learning-based optimization (TLBO) belongs to swarm intelligence algorithm [1, 2]. It was proposed by Indian scholar Rao in 2011. The principle of the algorithm is to simulate the teaching phenomenon in the class, which is divided into teaching stage and learning stage. Through the joint action of these two stages, the purpose of improving students is achieved. Compared with Genetic algorithm [3], Particle Swarm Optimization algorithm (PSO) [4], Ant Colony Optimization (ACO) [5], TLBO algorithm has a simple structure, is easy to implement, and has few parameters. It has been successfully applied to many practical optimization problems, such as job shop scheduling problem [6], distribution network reactive power optimization problem [7], and inbound aircraft scheduling and deployment problem [8].
TLBO algorithm has received extensive attention since it was proposed. In order to improve the performance of TLBO algorithm, researchers put forward improvements to the TLBO algorithm in [9–20]. Matej crepinsek studied in [9] whether TLBO is a new star of meta heuristic algorithm, and whether the theory or practice supporting its dominant position is effective. It is proved that although TLBO performs well on unconstrained problems, it has a low success rate for some optimization problems, revealing some shortcomings of TLBO compared with other meta heuristic algorithms. Li Huirong proposed an adaptive teaching factors in [11], improved the iterative equation in the learning stage by using the mutation strategy in differential evolution algorithm, ensuring the diversity of the population. Bi Xiaojun fused mutation strategy of the differential evolution algorithm in learning stage [12]. A hybrid learning strategy is proposed, and added a perturbation strategy in the later stage of the algorithm to ensure the global optimality of the algorithm. Zhai Junchang proposed an improved TLBO algorithm to solve the global optimization problem in view of the disadvantage of premature convergence of TLBO [13]. In the learning phase, according to the adaptability of two learners, a poor student randomly selects an excellent student to learn, or an excellent student learns from the teacher, which improves the performance of the TLBO algorithm. Ma Yunpeng introduced a new population mechanism into the traditional teaching optimization algorithm in [14]. Chen D adopted the variable population size (VTTLBO) in the form of a triangle, which reduced the calculation cost of the original TLBO and extended it to optimize the parameters of the artificial neural network (ANN) [16]. Tuo, SH proposed a hybrid search algorithm, HSTLBO, which combines harmony search (HS) and teaching-learning-based optimization (TLBO), and uses adaptive selection strategy to solve complex optimization problems cooperatively in [17]. Huang Jida proposed an effective hybrid algorithm for continuous optimization problems, expanded the scope of application, modified it, and applied it to constrained optimization problems in [18]. Yu K combined feedback mechanism and differential evolution (DE) in [19], and introduced mutation crossover operation of DE to increase population diversity and prevent premature convergence. The hybrid perturbation mechanism is used to ensure that the algorithm can get rid of the local optimum. Liu Yin improved the performance of basic TLBO by using adaptive teaching factors and learning process integrating differential evolution in [20]. These improvements are roughly divided into two categories, namely algorithm performance and algorithm application improvements. However, these improved TLBO algorithms have slow convergence speed and are easy to fall into local optimum in the late iteration, which leads to premature convergence of the algorithm.
To solve the above problems, this paper proposes an improved teaching-learning-based optimization algorithm based on fusion difference mutation (IDMTLBO). First, the teaching factor T of the basic algorithm is improved, and a linear decreasing adaptive teaching factor is proposed to enhance the local search ability of the algorithm. Secondly, in the teaching stage, it is proposed to learn through the difference between students’ own level and teachers, which improves the accuracy and convergence speed of the algorithm. Finally, differential mutation strategy is applied in the learning stage to make the results approach the optimal solution. Students are divided into excellent students and inferior students according to their learning level. At the same time, two learning objects with different levels are selected to enhance the global search ability. Compared with other improvements, the improvement of TLBO algorithm in this paper not only speeds up the convergence speed and improves the convergence accuracy of the algorithm, but also compared with the improvement made by other scholars, the method in this paper is simpler, more practical and more efficient. Numerical experiments show that IDMTLBO algorithm is superior to TLBO algorithm, DMTLBO algorithm and DSTLBO algorithm in convergence speed and convergence accuracy.
TLBO Algorithm
In the TLBO algorithm, the subjects that students learn represent the dimensions of the population, and the number of middle school students in the class represents the size of the population. Teachers are defined as the students with the best performance in the class, and students’ academic performance is equivalent to the value of fitness. The specific process of the teaching optimization algorithm is described below.
Class initialization
Assuming that the number of students in the class is n and the subject is d, the initial population is generated by formula (1).
Among them, k = 1, 2, ⋯ , n, X k denotes the kth student in the class, rand functions denote the random number between [0, 1], Min X and Max X represent the minimum and maximum values in the solution space X.
In the actual teaching, students learn according to the gap between the average level of the class and the teacher. In the ith iteration, the gap between the kth student in the jth discipline and the teacher is given by Equation (2).
In formula (3),
Teachers ‘teaching is not necessarily able to effectively improve students ’ ability, students can also learn from each other to enhance. In this phase, random selection of two different students k1, k2, Produce new students during the ith iteration as formula (4).
Where f (Xi,k) represents the fitness value for each student. Update if
Adaptive teaching factors
In teaching-learning-based optimization algorithm, teaching factor T is derived from formula T = round (1 + rand), and teaching factor T determines the change of mean value. According to the formula of teaching factor, the value of teaching factor can only be 1 or 2, This means that students either have not learned knowledge from teachers, or have learned all knowledge. But this is often not the case in actual teaching. The knowledge learned by students is generally between 1 and 2. Formula (5) is now used to solve this problem.
In the teacher stage of TLBO algorithm, each student learns according to the difference between teacher and class average. Among them, the gap between the kth student in the jth discipline and the teacher in the ith iteration is given by the formula Differencei,j,k = rand (Xi,j,kBest - T × Meani,j). However, in the actual teaching, students often learn according to their own learning level and the difference between teachers. Therefore, the teacher stage of TLBO algorithm is improved as follows.
Updated according to formula (7).
In formula (7), each student in the class learns according to their own learning level and the difference between teachers, which improves the limitation of learning by middle school students in Equation (2) according to the average level of the class and the difference between teachers, and improves the accuracy and convergence speed of the algorithm.
Differential evolution algorithm (DE) is in a random population, which makes the results approximate to the optimal solution through mutual cooperation and competition among individuals. The algorithm has simple structure and is often used to deal with practical optimization problems.
Students can learn knowledge from teachers and improve themselves through mutual learning. In the basic algorithm, students in the learning stage are only affected by two students, and this stage ignores the influence of teachers on students ’ learning. According to the mutation strategy in the differential evolution algorithm, we improve the equation in the learning stage as follows in [11].
Xi,k1, Xi,k2 in the above equation represents two different students in the ith iteration process. If
If only one student is randomly selected, the student may be poor or even the worst in learning. This choice may make the learning effect poor. Therefore, formula (8) should be improved. If two students are randomly selected in the class, the influence of only one poor student on learning effect can be avoided. The improvement method is as follows.
In this formula, Xi,kBest is a teacher, used to guide students to learn, so that students take teachers as the goal of self - promotion. In the original algorithm, the limitation of
When the students ’ learning level is higher than the average level of the class, the students only learn according to the gap between themselves and teachers; when the students ’ learning level is lower than the average level of the class, the student not only learns according to the gap between themselves and teachers, but also selects two learning objects with different levels. According to the gap between the two different learning objects, the influence of poor students on learning is reduced, and the randomness of rand function is used to maintain the characteristics of TLBO algorithm. In addition, the algorithm of teaching factor is also improved to enhance the global search ability.
In summary, we will introduce the steps of IDMTLBO as follows.
Step 1: Sets the total number of the whole class as n, the population dimension as x, and the maximum number of iterations as Imax, so that the current number of iterations is i = 1.
Step 2: Initializes the class in the entire search space.
Step 3: Teacher stage, according to the formula (5–7) to produce new students, keep good grades.
Step 4: Learning stage, all students in the class and the average level of students in the class to do a comparison, according to (9) learning, get
Step 5: Repeat the teaching process for all students throughout the class.
Simulation experiments and results analysis
Test function
In order to verify the effectiveness of the improved TLBO algorithm, eight different standard test functions are used for simulation experiments. These standard test functions are universal in life, and the global optimal values are 0. The test functions are as follows.
Sphere function: There is a unique global minimum value, which is obtained by summing the squares of the minimum values of d independent variables with the same domain.
Schwefel’s Problem 22 Function: When the independent variable approaches infinity, the function will form a large number of local extreme value regions. The global optimal value is located at the boundary of the definition domain.
Generalized Rastigin function: The position of the minimum value is regular, which is used to detect a situation where the solution is regular, and the algorithm is practical.
Schwefel’s Problem 12 function: The independent variable of this function is epistatic, so its gradient direction will not change along the axis direction, which is difficult to find the best.
Generalized Rosenbrock function: Used to test the performance of optimization algorithm. Its global minimum is located in a narrow parabolic valley. However, although the valley is easy to find, it is difficult to converge to the minimum.
Step function: When the function approaches infinity in the definition domain, different step phenomena will occur at the given interval, and a large number of local extremum will be generated between each step, which makes it difficult to find the optimal solution.
Generalized Griewank function: The function solution changes with the quantity change, and there are a lot of local extremum in the real data distribution of the function, so the detection algorithm has the ability to jump out of the local.
Ackely function: When using gradient method to optimize a multi-dimensional point, there are often multiple directions. This function detects the global convergence rate of an algorithm. When the dimension increases, its direction gradient and direction of advance are various.
Analysis of effect
In order to verify the effectiveness of IDMTLBO algorithm, the original algorithm is expressed as (TLBO) the algorithm with adaptive teaching factor is denoted as (ATLBO), the improved algorithm in the teaching stage is denoted as (MTLBO), and the improved algorithm in the learning stage is denoted as (LTLBO). Set the class number to 50, the population dimension d = 50, the maximum number of iterations Imax = 500. Each test function runs 20 times independently, and the minimum, average and standard deviation of the results are shown in Table 2.
TLBO, ATLBO, MTLBO, LTLBO and IDMTLBO test results
TLBO, ATLBO, MTLBO, LTLBO and IDMTLBO test results
It can be seen from Table 2 that the minimum, average and standard deviation of ATLBO, MTLBO, LTLBO and IDMTLBO running 20 are better than the basic TLBO algorithm. It can be seen that the introduction of adaptive teaching factors and the improvement of teaching and learning stages have improved the performance of the algorithm. By comparing TLBO and MTLBO, it can be concluded that learning according to the gap between students and teachers in the teaching stage has significantly improved students’ learning level, improved the stability of the algorithm, and prevented the algorithm from falling into local optimization prematurely. Compared with TLBO and LTLBO, we can get the improved differential mutation strategy in the learning stage, which further expands the search scope of the space, maintains the diversity of the population, and improves the solution accuracy.
In order to verify the advancement of IDMTLBO algorithm, IDMTLBO algorithm, DMTLBO algorithm and DSTLBO algorithm are run 20 times independently under the conditions of 50 class members, d = 30 of population dimension and Imax = 200 of maximum iteration number. The results are shown in Table 3.
DMTLBO, DSTLBO and IDMTLBO test results
The results of Table 3 are analyzed, and the three algorithms are analyzed from the minimum, average and standard deviation, so as to verify the convergence accuracy, advancement and stability of IDMTLBO algorithm. The global optimal values of the eight standard test functions proposed in Table 1 are 0. Therefore, the closer the experimental results are to 0, the more effective the algorithm is. The smaller the standard deviation is, the smaller the floating range of the operation result is, and the more stable the algorithm is.
Standard test function expression and search space
It can be seen from Table 3 that the minimum and average values of IDMTLBO algorithm are closer to the global optimal value 0, which shows that the performance of the algorithm is greatly improved by adding adaptive teaching factors and the improved differential mutation strategy. From the perspective of standard deviation, the test results of IDMTLBO algorithm are smaller than those of other algorithms, which indicates that the improved differential mutation strategy further expands the search scope of space, effectively maintains the diversity of the population, and improves the accuracy of the solution. Therefore, it can be concluded that the convergence accuracy, progressiveness and stability of this algorithm are obviously superior to the other two algorithms.
Figure 1 shows the evolution curve of DMTLBO, DSTLBO and IDMTLBO algorithms. It can be seen from the figure that IDMTLBO algorithm converges to the global optimal solution quickly, DMTLBO algorithm and DSTLBO algorithm converge slowly, and TLBO algorithm falls into the local optimal solution. Therefore, IDMTLBO algorithm is obviously superior to DMTLBO algorithm and DSTLBO algorithm in terms of both convergence speed and convergence accuracy, and converges to the best in a small number of iterations. This is because students improve the stability of the algorithm according to their learning gap with teachers, and the improved differential mutation strategy effectively maintains the diversity of the population, with strong optimization ability. To sum up, IDMTLBO algorithm has higher optimization precision and stronger optimization ability, and the algorithm has simple structure and short running time. Therefore, the proposed algorithm is progressiveness.

Evolution curve of DMTLBO, DSTLBO and IDMTLBO algorithms.
In view of the shortcomings of TLBO algorithm, IDMTLBO first introduces an adaptive teaching factor, which makes up for the shortcomings of TLBO algorithm teaching factor can only take 1 or 2. Secondly, the learning process in the learning stage is improved to enable students to learn according to the gap between themselves and teachers, and enhance the stability of the algorithm. Finally, in the learning stage, the improved differential variation strategy is integrated, and students are divided into excellent students and inferior students. The excellent students learn according to the gap between themselves and teachers, and the inferior students learn according to the gap between themselves and teachers and two students randomly selected. At the same time, two students are selected to avoid the limitation of selecting only one student as the inferior student, effectively maintaining the diversity of the population, and improving the solution accuracy. Through the simulation experiments on the eight standard test functions in Table 1, it can be seen from the results that the IDMTLBO algorithm proposed in this paper has good convergence performance, and the main advantages of this algorithm are simple structure, easy to understand, few super parameters, and high convergence performance. Therefore, this algorithm has good prospects for development and can be further applied to other practical optimization problems.
Footnotes
Acknowledgments
The authors would like to thank the anonymous reviewers, the Editor-in-chief and the Associate Editor, for providing valuable comments and helpfull suggestions.
