Abstract
The grasshopper optimization algorithm (GOA) has received extensive attention from scholars in various real applications in recent years because it has a high local optima avoidance mechanism compared to other meta-heuristic algorithms. However, the small step moves of grasshopper lead to slow convergence. When solving larger-scale optimization problems, this shortcoming needs to be solved. In this paper, an enhanced grasshopper optimization algorithm based on solitarious and gregarious states difference is proposed. The algorithm consists of three stages: the first stage simulates the behavior of solitarious population learning from gregarious population; the second stage merges the learned population into the gregarious population and updates each grasshopper; and the third stage introduces a local operator to the best position of the current generation. Experiments on the benchmark function show that the proposed algorithm is better than the four representative GOAs and other metaheuristic algorithms in more cases. Experiments on the ontology matching problem show that the proposed algorithm outperforms all metaheuristic-based method and beats more the state-of-the-art systems.
Keywords
Introduction
Since the metaheuristic algorithms have higher performance and greater optimization driving force [1], these techniques have been widely used in a variety of applications, such as image segmentation [2], feature selection [3], very large scale integration (VLSI) circuits [4], hypergraph path [5], and data mining [6]. Meta-heuristics can solve problems that cannot be solved by conventional methods, and it aims to provide sufficiently good solutions for optimization problems. Therefore, meta-heuristics are also called high-level heuristics technique or global search technique or nature-inspired optimization algorithms [1]. Over the past decade, many nature-inspired optimization algorithms have been proposed such as particle swarm optimization, whale optimization algorithm, bacterial foraging optimization algorithm, firefly optimization algorithm, bat optimization algorithm and artificial bee colony optimization algorithm.
Grasshopper optimization algorithm (GOA) is a new swarm intelligence-based algorithm. Further, it is a bio-inspired optimization technology by simulating the behaviors of grasshopper swarm foraging [7]. The GOA algorithm is also a population-based stochastic global search technology. Compared with other population-based intelligence algorithms, this algorithm can well balance the exploration and exploitation process through adaptive adjustment of the comfort zone coefficient. In particular, the movement of a grasshopper needs its own position, the current best target position, and the all grasshoppers position. As the result, the algorithm is able to fully explore and exploit the search space. Specifically, GOA has the advantage of high exploration and local optima avoidance due to the high repulsive force between grasshoppers. The attraction forces between the grasshoppers and the adaptive comfort zone prompt GOA to maintain a strong exploitation. However, the small step moves of grasshopper lead to slow convergence speed. When solving larger-scale optimization problems, this shortcoming needs to be resolved. Considering larger research space, we try to improve the convergence rate of GOA through the biological behavior of grasshopper swarm.
In this work, two states of gregarious and solitarious are transplanted into the original GOA [8, 9]. The density information of the population is used to divide the original population into two subpopulations to simulate the gregarious and solitarious behavior. The mechanism accelerates the speed of convergence to the optimal solution.
The remainder of this discussion is organized as follows: Section 2 outlines existing works. The proposed algorithm SGBGOA is described in Section 3. Section 4 implemented a series of detailed experiments. Section 5 summarizes this work and gives a plan for the future.
Related works
Basic GOA
The grasshopper optimization algorithm [7, 22] is one of the most recent metaheuristic algorithms, which simulates the movement behavior of grasshoppers by a mathematically model. The mathematical model is defined as follows:
To solve optimization problem, a mathematical model is defined as follows:
X
i
is the position vector of the i-th grasshopper; c1 is similar to the inertia weight of particle swarm optimization, which balances the exploration and exploitation; δ and θ are the upper and lower bounds of the search range; s is the attraction function; d
ij
is the distance between the i-th grasshopper and the j-th grasshopper;
x j is the position vector of the j-th grasshopper; x i is the position vector of the i-th grasshopper; c2 is an internal parameter, which is the decreasing coefficient to shrink the comfort zone, repulsion zone and attraction zone; The c1 and c2 is defined as follows:
The grasshopper optimization algorithm has attracted widespread attention from scholars due to its powerful search capabilities. In recent years, many researchers have proposed different improved variants from the tuning of parameters, the topology structure of the population and the hybridizing evolutionary strategy. This three categories are discussed below.
For the variants of GOA based on parameter tuning, since this comfort zone coefficient c is a key parameter to maintain a proper balance between exploration and exploitation, the turning of this parameter has important research value. Algamal et al. [10] proposed an improved grasshopper optimization algorithm (PGOA). In PGOA, a parameter turning equation is proposed to improve the exploitation process of the algorithm, which can be quickly reduced in a small number of iterations. Arora et al. [11] employ chaos map to provide the parameters c1 and c 2 (CGOA). In this study, different chaotic maps are embedded into the GOA algorithm to tune the parameters adaptively. Experimental result shows that the Circle map is the most effective method for providing the parameters c1 and c2 at the same time.
For the topological structure of the population, it affects the diversity of the population and the exploration of the GOA. Ewees et al. [12] proposed a grasshopper optimization algorithm based on opposition-based learning (OBLGOA). The opposite learning strategy [23] is used in 50% of the population, and the learned individuals are judged according to the fitness value, and the good individuals are propagated to the next iteration. Bala et al. [13] proposed a grasshopper optimization algorithm with simple attraction and repulsion. The algorithm divides the population into two sub-populations, the best 25% of individuals are selected to the top group, and the remaining 75% of individuals are combined to the bottom group. Subsequently, two individuals are selected randomly from top and bottom group according to a certain probability. Next, the absolute value of the difference between the two individuals is defined as diff. If each gene in diff less than 3, the algorithm performs a repulsion. If each gene in diff greater than 7, the algorithm performs an attraction.
For the variants of GOA based on hybridizing evolutionary strategy. Luo et al. [32] proposed an improved hybrid grasshopper optimization algorithm. The Gaussian mutation and Levy-flight strategy are used to improve the search performance of GOA. In addition, the OBL strategy is also used for all agents in the current population. El-shorbagy et al. [14] proposed a grasshopper optimization algorithm based on hybrid genetic algorithm. The improved algorithm mainly combines exploitation of the GA and exploration of the GOA.
Proposed new algorithm
The grasshopper optimization algorithm has been widely used in artificial intelligence and engineering applications.
In this paper, the two state differences between the solitarious and gregarious behaviors of grasshopper swarm are transplanted into GOA to enhance the performance while improving the convergence rate, called SGBGOA. The motivation and the principle of the SGBGOA are explained in the following two sections respectively.
Motivation for GOA improvement
Since the original grasshopper optimization algorithm uses a small step and slow movement manner, the convergence speed of the GOA is slow when calculating large optimization problems. In order to solve this problem, this work studies the biological behavior of grasshopper swarms.
Topaz et al. [8] claimed that grasshoppers exhibit two phases of mutual conversion: gregarious and solitarious. The individuals with solitarious state are repelled by other grasshoppers, and gregarious grasshoppers are attracted to the conspecifics and formed large aggregations such as marching hopper bands. These two behavioral states strongly depend on the local population density. In sparse food environment, the gregarious grasshoppers will transition to solitarious state. In a dense food environment, the solitarious grasshoppers will transition to gregarious state. Furthermore, some studies have shown that grasshoppers of raising alone have acquired most of the behavioral characteristics of the gregarious state within 4–8 hours of crowding, including the tendency to aggregate [26, 27]. The characteristic encourages the tendency of other grasshoppers to transform into gregarious state. Inspired by these studies, a novel enhanced grasshopper optimization algorithm based on the gregarious and solitarious states difference is proposed. The population of the original grasshopper optimization algorithm is divided into two subpopulations: sparse population and dense population. The sparse population represents the solitarious state. The dense population represents the gregarious state. In the solitarious state, since the solitarious individual can obtain most of the characteristics of the population from the gregarious state, the algorithm determines whether the solitarious individual is attracted or repelled by calculating the repulsive force between the solitarious individual and gregarious individual. It is worth noting that individuals with small repulsive forces, i.e., conspecifics, are easily attracted to the gregarious state. Individuals with high repulsive force, i.e., non-conspecifics, move into the opposite direction, and them are saved in sparse population. In the gregarious state, the learned sparse population are merged with the dense population to form a large aggregation. The details of the algorithm are introduced in the following subsections.
SGBGOA algorithm
Based on the above analysis. Three learning stages are designed: the sparse population learn from dense population, each grasshopper learns from all population, and a local operator that learn from the best target. For the first stage, we propose two learning equations, which are used to learn from the conspecifics and move into the opposite direction for the non-conspecifics. The opposite movement strategy for non-conspecifics is also indicated mutual repulsion. For the second stage, the sparse population learned by the first stage is combined with the dense population, and then each individual employs the update equation to learn from all other individuals. The algorithm updates the dense population in each iteration. The mechanism realize the transition process from the solitarious to gregarious state. Finally, a chemotactic operator is applied to the optimal target position of the current generation to enhance the local search ability of the algorithm. The specific steps are described as follows:
First, the grasshopper population is initialized randomly and the fitness value of each grasshopper is calculated; the grasshopper population is divided into two subpopulations: sparse population and dense population. Specifically, taking the minimization problem as an example, the grasshopper position is sorted in ascending order according to the fitness of the grasshopper. The first 50% of the individuals are selected into the dense population, and the last 50% are selected to the sparse population. Second, sparse population learn from dense population. In order to determine the repulsive force between individual of sparse population and dense population, a repulsive force equation is used [8]. It is defined as follows:
In order to realize the process of sparse population learning from dense population, two equations are proposed to model this process. Specifically, the Equation (8) represent learning from the conspecifics, and the Equation (9) represent learning from the opposite direction of the non-conspecifics. The two equations are defined as follows:
Next, two individuals are selected randomly from the dense population. two repulsive forces are calculated between individual of the sparse population and these two individuals by Equation (7) respectively. The two repulsive forces are denoted as Rep1 and Rep2, respectively. If Rep1 is less than Rep2, the individual of the sparse population learns from the dense individual through Equation (8). Otherwise, it learns from the opposite direction of the dense individual with large repulsive force through Equation (9). After that, if the learned individual is better than the current individual, the position of the individual of the sparse population is updated. This detailed learning process is described in Algorithm 1
The sparse population learned by the first stage are combined with the dense population to form aggregates. The position of each grasshopper in the aggregates is updated by Eq. (4). That is, the position of each grasshopper is updated based on its own current position, the target position, and the position of all other grasshoppers. After that, the fitness values of all grasshoppers are recalculated and sorted in ascending order. The top 50% of individuals are selected to update the dense population.
In order to enhance the local search capability of the SGBGOA, a chemotaxis operator [25] is used to further exploitation feasible solutions near the current best target location. In detail, the chemotaxis operator comes from the bacterial foraging optimization algorithm, which has strong local search capabilities. Chemotaxis operators include tumbling and swimming operators [24]. Swimming represents the pattern of individuals moving in a certain direction. The individual changes the direction of movement by tumbling. If the fitness value is not improved, it moves several steps in the new direction until the maximum number of moving or a stable fitness value is reached. The chemotactic operation of the ith individual is shown as follows:
Algorithm 3 shows the SGBGOA algorithm
In this section, three experimental tasks are implemented to verify the performance of the proposed SGBGOA algorithm. Before the experiment, the twenty-three benchmark functions are described. The parameters of each algorithm are set. Regarding the first task, the proposed algorithm SGBGOA is compared with three representative GOA variants and the other metaheuristic techniques in test functions. The mean (Mean) and standard (Std) deviation are calculated separately for each algorithm. For the second task, the convergence speed of the proposed algorithm is compared with other metaheuristic algorithm on each test function. All algorithms are executed independently on each test function 20 times, and the average is calculated. For the third task, a real-world application, i.e., ontology alignment is tested. Ontology alignment is implemented as an optimization problem by using the proposed SGBGOA, named SGBGOA-OM. Section 4.3.3 studies the maximum number of iterations required for the algorithm to find a high-quality alignment to prove whether the proposed algorithm can speed up the completion of the task. For alignment quality, the results are compared with the original GOA, three representative GOA variants and the other metaheuristic techniques such as CPSO [15]. In order to further verify the effect of the SGBGOA-OM method, several state-of-the-art ontology alignment methods are selected.
Experiment 1: Numerical experiments
Twenty-three benchmark functions [7] are used in this experiment, which include seven unimodal functions and sixteen multimodal functions. Furthermore, the unimodal function has only one extreme point in their domain. In contrast, the multimodal function has multiple extreme points and multiple local optima. Therefore, the feature is most suitable for evaluating the performance of the proposed algorithm, especially the ability to jump out of the local optimum. The detailed information of these functions is described in Table 1.
The benchmark functions
The benchmark functions
The parameters involved in the algorithms include the own parameters and public parameters. In order to make each algorithm have the best performance, the own parameters of each algorithm adopt original paper. The common parameters of each algorithm, i.e., the maximum number of iterations (maxIter) and the size of population (N), are set to the same value. In benchmark function experiment, maxIter = 500 and N = 100. The detailed parameter settings of each algorithm are shown in Table 2.
The parameter setting for all algorithm
The parameter setting for all algorithm
In this experiment, each test function is executed independently by using SGBGOA, OBLGOA, PGOA, CGOA, GOA, Bat Algorithm (BA), and Firefly Algorithm (FA), and the mean and standard deviation are calculated. The results are shown in Table 3. Compared with other GOA algorithms in terms of mean, it can be clearly seen from the results that the proposed SGBGOA algorithm outperforms other GOA algorithms in all functions except F1, F4, F7, F8, F10, F15, and F20. It is worth noting that all algorithms have not achieved the optimal value on the function F8. For functions F14, F17, F16, and F18, the all GOA algorithms found the same optimal value. Concerning unimodal functions (F1-F7), the SGBGOA algorithm obtains better solution on F2, F3, F5 and F6. It performs poorly on F1, F4 and F7 functions compared to the second best OBLGOA. Concerning multimodal functions (F8-F23), the proposed SGBGOA algorithm finds better solutions for all functions except F8, F10 and F20. On the contrary, although the OBLGOA algorithm obtains the same solution on F14, F17, and F18, it obtains a poor solution in solving most multimodal functions. The possible reason is that OBLGOA only uses the OBL learning strategy to change the diversity of the population. Therefore, the exploitation ability of the algorithm has not been improved. As a result, when solving functions with multiple extreme points, the exploration and exploitation process of this algorithm may not be well balanced.
Comparison results of SGBGOA, OBLGOA, PGOA, CGOA, GOA, BA and FA
Comparison results of SGBGOA, OBLGOA, PGOA, CGOA, GOA, BA and FA
Compared with other metaheuristic techniques BA and FA in terms of mean, the proposed SGBGOA achieves the best performance for most of the functions except F2, F9, F20, F21, and F23. For functions F16, F17, F18, F22, these algorithms found the same optimal value. For unimodal functions (F1-F7), The SGBGOA only fails to obtain a good solution on the function F2. Concerning multimodal functions (F8-F23), the proposed SGBGOA does not reach the optimal value on the function F8, F9, F20, F21, and F23.
Based on the above analysis, although the accuracy for a few functions is slightly lower than OBLGOA, the SGBGOA algorithm can obtain better accuracy compared to OBLGOA, PGOA, CGOA and GOA algorithm. The proposed algorithm outperforms other meta-heuristic algorithms in most functions. The main reason is that the proposed algorithm is able to quickly discover the feasible area by adopting the mechanism of the sparse population learning from dense population and to find the approximate optimal solution by using the local chemotaxis operator. Moreover, this proposed algorithm has a good balance for exploration and exploitation.
In this experiment, the convergence process of the SGBGOA algorithm is performed on the 23 functions and compared with based on GOA and other metaheuristic techniques. Figure 1 shows the convergence curves of each algorithm. The abscissa indicates the number of iterations, and the ordinate indicates the fitness. This figure clearly shows that the proposed SGBGOA algorithm speeds up the convergence to the optimal solution in all functions except functions F2,F7,F8, and F9. Improvement of the convergence speed is mainly due to the continuous learning of sparse population from dense population, which reduces the probability of learning from poor individuals. Furthermore, this strategy speeds up the movement to the global optimal solution. This is more conducive to the exploitation process of the local search strategy, which is to avoid premature convergence and fall into the local optimum. On the basis of the above analysis, it can be concluded that the improvement strategy of the SGBGOA algorithm is successful. This strategy can improve the accuracy and convergence speed of the original algorithm.

Convergence curves for all algorithms.
In this subsection, the proposed SGBGOA algorithm is tested in different challenges of large ontology alignment problem. Specifically, a large real-world ontology Anatomy track and two Large Biomedical tracks are used to evaluate the performance of the SGBGOA-OM method. First of all, this ontology alignment problem and ontology alignment are formally defined as an optimization problem. Then the experimental results of the three matching tasks are analyzed by using three evaluation indicators: Precision, Recall and F-measure.
Mathematical model of ontology alignment
Ontology alignment is an important process in data integration [16]. The evolutionary algorithms have proven to be an effective method for ontology alignment [28, 29]. In this experiment, the ontology alignment process is transformed into an optimization problem. Furthermore, a suboptimal alignment is found through the SGBGOA algorithm. For details, please refer to our previous work [31]. The ontology alignment method is named SGBGOA-OM.
The purpose of ontology alignment is to find a correspondences set between the given ontology concepts. This target can be defined as a function f:
The mathematical model of ontology alignment as an optimization problem is defined by the following equations:
where
Parameter configuration
The configuration of the parameter is shown in Table 4.
The parameter setting for all algorithm
The parameter setting for all algorithm
The maximum number of iterations (MaxIter) required for the SGBGOA-OM to find a high-quality alignment is studied to prove whether the proposed algorithm can speed up the completion of the task. Table 5 shows the average results of 10 independent runs at different iteration times for each algorithm on Anatomy track. It can be clearly seen that the proposed SGBGOA-OM method can find a high-quality alignment as F-measure = 0.855 in the case of only MaxIter = 10. Furthermore, the proposed method can quickly converge to the optimal solution, thereby improving the execution efficiency of ontology alignment.
Comparison results with other better algorithms under different iteration. The times are in milliseconds
Comparison results with other better algorithms under different iteration. The times are in milliseconds
Table 6 shows the comparison results with based on GOA algorithms and the other metaheuristic techniques. Each algorithm is independently executed 10 times, and the average value is calculated. In terms of F-measure, the SGBGOA-OM algorithm obtains the best results. Further, the SGBGOA-OM method can find the suboptimal alignment, i.e., the value of the optimal F-measure is 0.867. Since the SGBGOA-OM method does not use background knowledge in the execution process, five ontology alignment methods from OAEI such as POMAP++ [30], SANOM [30], Lily [17], Wiktionary [18], ALIN [19], FCAMap-KG [20] and DOME [21], are selected to compare with the SGBGOA-OM method. As shown in Table 7, the SGBGOA-OM method outperforms all other methods except POMAP++ and SANOM in terms of F-measure. This result shows that the proposed algorithm can find a high-quality alignment with a fast convergence rate.
Comparison results of SGBGOA-OM with other algorithms
Comparison results of SGBGOA-OM with other algorithms
Comparison results of SGBGOA-OM with other systems participated OAEI-2019
Based on the above analysis, it can be concluded that the proposed algorithm is able to achieve better alignment than the other algorithms. It can improve the efficiency of the original GOA in practical applications, especially in large-scale complexproblems.
The FMA-NCI-small task is to match the FMA fragment with 3696 classes and the NCI fragment with 6488 classes.
The FMA-SNOMED-small task is to match the FMA fragment with 10157 classes and the SNOMED fragment with 13412 classes. All results are the average of 5 independent executions of each algorithm. The proposed SGBGOA-OM method is compared with GOA-based algorithms, the other meta-heuristic algorithms and the systems from OAEI1 competition. The results of the FMA-NCI-small task are given in Table 8, in which it can clearly observe that the proposed SGBGOA-OM method outperforms all metaheuristic-based methods, DOME, and AGM in terms of F-measure. Compared with FCAMapKG and LogMapLt, it achieves competitive results. Table 9 shows the comparison results on the FMA-SNOMED-small task. The proposed SGBGOA-OM method obtains the highest quality alignment and is significantly better than other meta-heuristic-based methods and the competition systems fromOAEI 1 .
The results of the alignment on largebio task FMA-NCI small
The results of the alignment on largebio task FMA-NCI small
The results of the alignment on largebio task FMA-SNOMED small
In this paper, a novel grasshopper optimization algorithm based on solitarious and gregarious states difference is proposed to enhance the performance of the original GOA algorithm. In the proposed algorithm, a novel strategy of sparse population learning from dense population is proposed to improve the global search ability of the GOA. A chemotaxis operator is used near the best target location to improve the local search capability of the GOA. The SGBGOA algorithm is executed on 23 benchmark functions. The experimental results show that the proposed algorithm can obtain a better solution than several representative GOAs and the other metaheuristic techniques. Finally, the proposed algorithm is implemented in different challenges of ontology alignment problem. Experimental results demonstrate that the SGBGOA can improve the efficiency and quality in large ontology alignment tasks compared to other algorithms. In the future, we intend to study the performance of SGBGOA in the other application.
