Abstract
Genetic algorithm (GA) is a search algorithm for solving optimization problems and is an important part of evolutionary algorithms (EA). The main purpose of this research is to improve the mathematical modeling of GA, and to explore how to overcome the shortcomings of traditional algorithms under the Schema Theorem. The research results show that the improved GA can be more widely applied to the optimization problems and play an important role in solving practical problems.
Introduction
Genetic Algorithm (GA) is a computational model simulating the natural selection of Darwinian evolution and the biological evolution process of Genetic mechanism. It is a method to search for the optimal solution by simulating the natural evolution process. Its main characteristic is to operate directly on the structural object, and there is no limit of derivation and continuity of function. It has inherent implicit parallelism and better global optimization ability. By using probabilistic optimization method, the optimized search space can be acquired and guided automatically without definite rules, and the search direction can be adjusted adaptively. Genetic algorithm takes all individuals in a population as the object, and USES the randomization technique to search a encoded parameter space efficiently. Among them, selection, crossover and mutation constitute the genetic operation of genetic algorithm. Parameter coding, initial population setting, fitness function design, genetic operation design and control parameter setting constitute the core content of genetic algorithm.
The Schema Theorem is the basic theorem of GA. The principle is that a schema of lower order and has an average fitness higher than the population will also reflect deterministic differences under different schemas, for phenomenon such as prematurity that may occur in GA, mathematical modeling is required to make the algorithm more efficient.
Therefore, this paper will also conduct analysis in genetic operations, under the schema of same order, it analyzes the different forms, explains the mechanism of GA more effectively, so as to provide a mathematical basis for future researches.
Literature review
Li and Gao weakened the conditions needed by the schema theory of genetic algorithm, which made the schema theory further generalized. By studying the deception function of genetic algorithm, the conditions of building module assumption were determined. When it could be correctly recognized that a function was a deceptive function, the corresponding conditions of building block hypothesis could be obtained [1]. Bharathi et al. deduced another equivalent form of the model theorem by using more accurate expression of the average fitness of the population based on a single model of the population, which was verified by simulation experiments [2]. In the application of genetic algorithm, it is not difficult to find that genetic algorithm has a large number of random operations, which shows that genetic algorithm is a random search algorithm. In order to characterize the limit form of genetic algorithm, different Markov chain models have been used by scholars all over the world. Zhu et al. put forward the theory that the population sequence of genetic algorithm conformed to Markov chains could be used as a finite Markov chains to discuss. Some properties of transition probability matrix of finite Markov chains were used to study the convergence of genetic algorithm [3]. In the subsequent research work, French scholar Cerf regarded genetic algorithm as a special model of generalized simulated annealing algorithm on the basis of a series of previous studies on simulated annealing and generalized simulated annealing. Based on this understanding, further research was carried out [4]. In the Vose model, the state of the population is defined as a probability vector, whose length represents the population size. For example, the
In summary, the above research mainly discusses the establishment condition of genetic algorithm, mode comparison and the analysis of advantages and disadvantages among different modes. The local interference of multi-mode search on diversity when genetic algorithm converged too early is analysed, the measures to reduce diversity are put forward, and the scheduling problem is discussed. However, there are few researches on how to overcome the shortcomings of traditional algorithm under schema theorem. Based on the above research status, in the analysis of the genetic operation, in the same order mode, how to analyse its different forms, more effectively explain the mechanism of genetic algorithm, and provide a mathematical basis for future research is discussed.
Method
GA modeling and optimization
The optimization problem is a relatively new branch of mathematics. Due to the wide application of electronic computers, the optimization problem has rapidly developed in the early 1950s, making it a subject that is still very active until now. The optimization problem has a wide range of applications, such as structural optimal design, chemical engineering optimal design, electronic device optimal design, transportation scheme design, oilfield development program, etc., which are widely used in social and economic construction, as well as industrial and agricultural production. According to the principle of survival of the fittest, evolutionary generation produces better and better approximate solutions. In each generation, individuals are selected according to the size of individual fitness in the problem domain, crossover and mutation are conducted by means of genetic operators of natural genetics, resulting in populations representing new sets of solutions. This process will result in the population of the offspring being more adapted to the environment than the previous generation, like natural evolution. After decoding, the optimal individual in the last population can be used as the approximate optimal solution to the problem. The operation flow of the traditional GA is shown in Fig. 1.
Flow of traditional GA.
GA dynamically adjusts the search process by using the crossover probability and the mutation probability, so that the solution approximates the optimal solution. What is GA dealing with is not the parameter itself, but the encoded gene individual, so GA can directly manipulate some structural objects, which makes the application of GA more extensive. GA can process multiple individuals, that is, it can evaluate multiple solutions. This avoids traditional algorithms from falling into local extreme points when searching for multi-peak distribution search space. Therefore, GA has higher global search ability.
The crossover operator is the most important component of the genetic operators. It not only plays a crucial role in the convergence of the genetic operator, but also has a great influence on the convergence speed of GA. The operation of the traditional crossover operator is often blind. The offspring generation formed by the cross-interchange of the parental chromosomes is likely to discard or destroy the excellent genetic schemas possessed by the parental individuals. The standard intersection formula is shown in Eq. (1).
Where G is the total number of evolutionary generations pre-set by this population, and
Generally, for population initialization, GA is completely random. The randomness of the initial population tends to make each individual unevenly distributed in the solution space, so it is easy to lead to unbalanced evolution results in the iterative process, and finally lead to local optimal solutions. To overcome this problem, we use a new method to evenly distribute the initial individuals in the solution space. The specific scheme is shown in Fig. 2.
Population initialization schemas.
According to the Schema Theorem, the essence of GA should be to replace the original schema and form an excellent schema. In the operation of GA, the new schema should keep the early population evolution schema as much as possible so as to maintain the diversity of the population. In the later stages of population evolution, appropriate schemas should be maintained so as to prevent damages, thereby reducing the possibility of prematurity. Formula of the evolutionary algorithm is shown in Eq. (2).
Individuals with high fitness are likely to have excellent schemas, so they can better adapt to the relatively small crossover and mutation probabilities in the evolutionary process. Conversely, individuals with low fitness not only do not contain good schemas, but when they intersect with individuals with good schemas, they may instead destroy good schemas. Therefore, individuals with low fitness are more suitable for smaller crossover probabilities. Therefore, it is possible to increase the probabilities of crossover and mutation among individuals whose fitness level is close to the average to explore potential good schemas. In the formula, gen represents the current evolutionary generation, and max means the maximum evolutionary generation.
The crossover operator has a great influence on the convergence effect of GA. However, traditional crossover operation tends to be blind. Offspring individuals are formed by crossover of male parent and female parent discarding or self-destructing some of the excellent genetic schemas present in their own chromosomes. Therefore, in addition to studying the adjustable crossover and mutation probabilities, a crossover strategy has been proposed based on recent studies. As a result, the excellent genetic schemas existing in the male parent and female parent should be protected as much as possible so that they can be continued in future generations. Its definition is shown in Fig. 3.
Crossover strategy description.
According to the crossover operation definition schema shown in Fig. 4.
Crossover operation definition methods.
The coding similarity between individuals is quite small in the early stage of SCAGA. So, in order to ensure that significant genetic schemas are not destroyed by the evolution of the genome, the scp value used to control the crossover operation should be quite small. On the contrary, in the later stage of SCAGA, the difference between individuals is small, but the value of scp will increase. According to the formula, dynamic control of standard intersection can improve the convergence and convergence speed of the algorithm. The Schema Theorem is an effective method for analyzing GA. It has a good effect in predicting the growth rate in the algorithm process. It can analyze the schemas of crossover and mutation affecting genetic propagation in the algorithm process. In the analysis of the research results, a new evolutionary algorithm SCAGA is proposed based on the improved GA. This algorithm uses dynamic methods to adjust the crossover and mutation probabilities. The solution of GA is usually affected by random initial populations and genetic parameters. The specific improvement is to make SCAGA get better optimization performance and convergence performance. At the same time, a new crossover strategy was proposed to decide whether to take the crossover operation or not. Therefore, it can be concluded that SCAGA has been greatly improved and can effectively solve the problem.
This study analyzed mathematical modeling of improved GA under the Schema Theorem, that is, parallel GA based on small population strategy and adaptive GA based on improved genetic operator and crossover strategy. Aiming at some problems in practical applications, this paper made several improvements to the genetic operators of traditional GA, and proposed to use multiple small populations in parallel operation under the same total amount of individuals, namely the SGPGA. The results of the example prove that SGPGA not only guaranteed the simplicity of calculation, but also improved the effectiveness of traditional GA. In future researches, we can still use the method of adjusting the crossover probability and the mutation probability to optimize the adaptive GA of the evolution process, which has faster convergence and wider applicability than traditional GA.
