Abstract
We propose a hybrid application of Population Based Ant Colony Optimization that uses a data mining procedure to wisely initialize the pheromone entries. Hybridization of metaheuristics with data mining techniques has been studied by several researchers in recent years. In this line of research, frequent patterns in a number of initial high-quality solutions are extracted to guide the subsequent iterations of an algorithm, which results in an improvement in solution quality and computational time. Our proposal possesses certain differences from and contributions to existing literature. Instead of one single run that incorporates both the main metaheuristic and the data mining module inside, we propose to carry out independent runs and collect elite sets over these trials. Another contribution is the way we use the knowledge gained from the application of the data mining module. The extracted knowledge is used to initialize the memory model in the algorithm rather than to construct new initial solutions. One additional contribution is the use of a path mining algorithm (a specific sequence mining algorithm) rather than Apriori-like association mining algorithms. Computational experiments, conducted both on symmetric Travelling Salesman Problem and symmetric/asymmetric Quadratic Assignment Problem instances, showed that our proposal produces significantly better results, and is more robust than pure applications of population-based ant colony optimization.
Keywords
Introduction
Metaheuristics have found a broad application area in achieving fast and acceptable solutions of combinatorial optimization problems over the past decades. Ant Colony Optimization (ACO) [6, 7] is one of the most popular and largely applied technique that is inspired by food searching behavior of real ants. A colony of artificial ants work independently, but share their knowledge of good quality solution characteristics to achieve high quality solutions in an iterative manner. In every iteration, each of several ants stochastically constructs a feasible solution based on pheromone (trail ants left behind) information and problem specific heuristic information. This solution construction process is generally same among ACO variants. Different ACO applications vary in the way they update the pheromone matrix at the end of an iteration or throughout an iteration. Reader is referred to Dorigo and Stutzle [8] for a broad review of different pheromone update approaches in ACO applications.
Population-based ACO (PACO) [11] carries out pheromone update only if an update occurs in a so-called solution archive which is a population of solutions kept in fixed size while the algorithm runs. Whenever a new solution is appended to the archive, one of the present solutions is chosen to leave the archive according to the update strategy used. The newly added solution adds pheromone, and the leaving solution evaporates the formerly added pheromone of its own. This pheromone update procedure makes the algorithm run much faster than the traditional ACO algorithms. PACO is successfully applied to various optimization problems [3, 4, 13, 23]. Its pure [11] and hybrid [22] Travelling Salesman Problem (TSP) applications also shown to be promising. Furthermore, the algorithm parameters were analyzed in detail when applied to TSP and Quadratic Assignment Problem (QAP) [14, 13] since PACO parameter settings and algorithm behavior are strongly problem dependent.
Metaheuristic applications have been shown to perform much better when they are hybridized with some other heuristics or techniques [2]. Applying local search to PACO results, for example, is a common convention, which combines the strength of PACO in exploiting promising areas in the search space with the strength of local search in exploring a certain promising area. There are innumerable hybrid applications in the literature in every type of combinatorial optimization problem, that make use of two or more metaheuristics (or metaheuristics-heuristics) in conjunction with each other with the hope to create a balance between intensification and diversification aspects of a search process. Hybridization of metaheuristics with data mining (DM), however, is a more recent, and less studied approach in searching for quick and acceptable solutions to combinatorial optimization problems. In this line of research, the main aim is to analyze a set of good quality solutions -collected while the metaheuristic algorithm runs- in order to extract valuable knowledge about the characteristics of high quality areas in solution space and use this knowledge to guide the search algorithm through the promising regions of the solution space. This approach is used to hybridize several metaheuristics and applied to a number of research problems with favorable results in terms of solution quality and computational time.
In this study, we propose a hybrid version of PACO that incorporates a DM module to guide the search process. A number of initial independent PACO trials were carried out, during which an elite set of good quality solutions have been collected. Using DM techniques, frequent patterns in these solutions have been extracted to guide a number of following independent trials. We show that the proposal here produces significantly better results, on average, than pure PACO trials when applied both to several symmetric TSP instances from TSPLIB and to symmetric/asymmetric QAP instances from QAPLIB.
Related work
Ribeiro et al. [17, 16] were the first to propose a hybrid application of GRASP with data mining (DM-GRASP), where, after a certain number of GRASP iterations, they extract patterns from an elite set of solutions to guide the succeeding GRASP iterations. They worked on a Set Packing Problem (SPP) as a case study. A feasible solution to a SPP is represented as a set of items to be included in a packing. A set that have a maximum total cost of items without violating any given restrictions (constraints that define which items cannot appear together in a packing) defines an optimal solution. GRASP iteratively constructs a solution (construction phase) using a greedy function, and applies local search to this initial solution to find a local optimum. Many GRASP iterations are carried out to find a good sub-optimal solution. They used Apriori-like algorithms to mine frequent itemsets (patterns or subset of items) that appear in a significant number of good-quality GRASP-solutions. They carried out more GRASP iterations, later again, where they used these frequent patterns as part of the construction phase of each GRASP iteration. This way, they forced the solutions produced by the construction phase to include some pattern among the frequent patterns. DM-GRASP is later applied to Maximum Diversity Problem [20], Server Replication for Reliable Multicast Problem [19], and the
Other than GRASP, some other metaheuristics were also hybridized with Martins et al. [12] applied DM procedure to a state-of-the-art hybrid heuristic-which combines ideas from multi-start local search, path relinking and tour merging – for the classical
Guerine et al. [9] studied one-commodity pickup-and-delivery TSP, and enabled a direct implementation of Frequent Itemset Mining (FIM) algorithms to mine solutions which are defined as sequence of elements where the order of elements is important. Since FIM algorithms are not suitable for mining solutions defined as ordered items, they use a solution representation where each successive item pair is defined as a new single item, thus re-representing the same solution as a set of unordered items. They also proposed [10] later a multi-DM scheme for the same problem case, where they called DM routine multiple times throughout the running process of the GRASP/VND algorithm.
All the above-mentioned research incorporate DM procedure into the search process mainly as follows. The main algorithm, either GRASP, GA or some other, starts to process and carries out some number of iterations, visiting many different areas in the solution space. During this first running phase of the algorithm, an elite set of relatively good quality solutions are collected. This elite set is analyzed using DM techniques to extract knowledge about the characteristics of superior solutions. These characteristics are in form of patterns that are sets of items representing an incomplete solution. New complete solutions, then, are constructed using both these patterns and the same construction procedure as the main algorithm uses. These new complete solutions act either as initial solutions – in GRASP – or new individuals – in GA – from which the main algorithm continues to process. This way the search process is facilitated and guided through promising regions in the search space.
Our study possesses some differences from the existing research about hybridization of metaheuristics using DM techniques. First, the knowledge gained from applying DM is used to initialize the memory model (pheromone matrix) in PACO rather than to construct new solutions. Reflecting the knowledge on hand into a memory model enables the DM hybridization approach to be applied to a range of other similar metaheuristic techniques. Second, instead of one single run that incorporates both the main metaheuristic and the DM module inside, we propose to carry out independent trials and collect elite sets over these trials. Independent trials of PACO enable the search process to scan a wide range of areas in the solution space, and a subsequent knowledge extraction by DM uncovers the characteristics of the promising regions over those previously visited areas. Lastly, the solution sequences are mined directly using a path mining algorithm, ASIPATH [5], rather than Apriori-like association mining algorithms used up to date in all existing research in this topic.
Another contribution of this work is the idea of knowledge-based pheromone initialization prior to performing a PACO trial. Pheromone initialization in PACO studied only – to our knowledge – by Shi et al. [21]. In their probability initialization approach, as they call it, they initialize the pheromone values based both on how many of the archive solutions include the transition from city
Organization of the paper
The rest of the paper is organized as follows. We describe PACO algorithm in Section 1.1. Our proposal of hybridizing PACO with DM is introduced and detailed in Section 2.2. Section 3.1 includes a representation and discussion of our computational experiments. Concluding remarks are given in Section 4.
Initialization
Population based ant colony optimization
PACO is shown to perform better than the numerous ACO algorithms [11, 21, 23]. In contrast to the other ACO algorithms, it stores a population of solutions. Pheromone update is based on an update in this population. Here, we will describe PACO procedure specifically for the TSP and QAP. An overall representation of PACO algorithm for the TSP is given in Algorithm 1.2. Since for the QAP the main PACO engine is the same as in TSP, with slight differences in definitions as we detail in the following, we do not include a separate algorithmic representation of the algorithms for the QAP.
Solution construction
TSP
In each iteration, each of a predefined number of ants constructs a complete TSP tour, a feasible solution, as given in “Tour Construction” part in Algorithm 1.2. An example of a TSP tour for a
where
where
where
where
The main solution construction mechanism is the same as in TSP, however a feasible solution is a permutation of locations in QAP. One can choose to represent a QAP solution as a permutation of facilities as well. Based on our permutation of locations representation, for a
We refer to Stützle and Dorigo [24] for the heuristic information which is based on flow and distance potentials of facilities and locations, respectively. The heuristic desirability of assigning facility
where
where
A predefined number
There are three strategies to update the solution archive. Age-based strategy makes the oldest solution exit the archive. Quality-based strategy accepts the iteration-best solution only if it is better than the worst-quality solution of the archive. Elitist-based strategy is similar to age-based strategy except that it always keeps the global best solution, the best solution found up to that time during the whole trial, in the archive. New solution replaces the globally best solution if it has a higher quality. Otherwise, new solution is appended to the archive and the oldest solution of the archive leaves.
Pheromone update
Pheromone update in pheromone matrix is carried out in PACO based on the solutions that leave or enter the archive. Here, we will give the pheromone update procedure only for the elitist-based strategy since it is the one that concerns in this study. Whenever a solution replaces the globally best solution, an elitist-update is carried out. Accordingly, new elitist solution adds
where
where
In this section, we describe in detail our hybrid PACO-DM-PACO proposal. An overall representation of PACO-DM-PACO approach for the TSP problem is given in Algorithm 2. For the QAP, the algorithmic skeleton of the PACO-DM-PACO is the same, however there is a significant difference in how we extract knowledge of frequent couples, which we detail in the following subsection.
PACO-DM-PACO procedure consists of two main phases. In the first phase, we propose to run
In other words, instead of carrying out
Data mining procedure-ASIPATH
The elite set of solutions obtained from the first phase is analyzed using ASIPATH path mining algorithm [5]. ASIPATH is a special case of SPADE algorithm introduced in [26]. The main advantage of SPADE algorithm and its variants is to prune frequent itemset search space by discarding infrequent ones efficiently in a parallel search manner. In contrast to frequent itemset mining algorithms, ASIPATH can be used directly to mine solutions that are represented as sequences, where the order of items in the solution is important.
Suppose tour is a permutation representation of a given feasible tour for a TSP instance with
Consider, for example, a TSP instance with 4 cities. All possible TSP tours -that visits each city once and returns to the starting point- are given in permutation representation in Table 1. We started each representation from city 1 arbitrarily. Suppose we collect an elite set consisting of 4 solutions. Then, we choose the first four tours in Table 1, since their costs are the lowest. Suppose, also, we have a sup parameter of 0.50, which means that at least 2 of the tours in the elite set should include a city pair for it to be considered as frequent. When we apply ASIPATH to the elite set of this 4-city sample problem, we obtain
Permutation representation of tours for a 4-city TSP
Permutation representation of tours for a 4-city TSP
Permutation of locations representation for a 5-facility QAP
For the QAP, supppose
Initialization() Tour_Construction() Update_IterEliteSet(eSize) Iteration-Best_solution() Pheromone_update&Archieve_update() }
Initialization
When
Once ASIPATH outputs the frequencies of the city or assignment pairs that meet sup criterion, these frequencies are used to initialize the pheromone matrix for following trials, as shown in “Initialization” part in Algorithm 2. Pheromone is added to entries of the pheromone matrix corresponding to frequent city or assignment pairs. Higher amount of pheromone is added to pairs with higher frequency, as shown in Eq. (10).
where
In this section, we represent the computational study to compare performances of PACO and PACO-DM-PACO trials. We used 33 symmetric TSP and 33 symmetric/asymmetric QAP instances from the well-known TSP and QAP repositories available at
PACO-specific parameters have been analyzed in detail for the TSP and the QAP in the literature [11, 14, 13, 22]. Thus, we did not find it necessary to repeat a detailed parameter analysis here, hence we referred to several studies to determine the constant parameters of PACO. It is common to use
In the first PACO phase of our PACO-DM-PACO proposal, we run 10 independent purely PACO trials. We run each trial of the small, medium and large TSP instances respectively for 100, 1000 and 2000 seconds, while for QAP we stop after 1000 seconds in any instance. We collected an elite set of size 100 in each of these trials, making a total of 1000 elite solutions collected in this first phase. sup parameter in DM module is taken for TSP as low as 0.10 wherever memory restrictions allow. Otherwise, it is taken as 0.20 or 0.25. For QAP, sup is a constant over all instances and has the value 0.05. A high sup parameter would result in low number of frequent city or assignment pairs, particularly in large instances. However, here we would prefer to leave a trace in the memory even if the specific pair occurs in even 100 or 50 of 1000 elite solutions. That way, the algorithm will favor to traverse that edge or make that assignment rather than one that does not appear even in an elite set with such a low sup parameter. During pheromone initialization, we used all pairs that meet the sup parameter. We run 10 independent
Comparison of PACO and PACO-DM-PACO trials for TSP
Comparison of PACO and PACO-DM-PACO trials for TSP
Bold shows the better result. Bold red shows that the difference is statistically significant (Wilcoxon Rank Sum Test,
Comparison of PACO and PACO-DM-PACO trials for QAP
Bold shows the better result. Bold red shows that the difference is statistically significant (Wilcoxon Rank Sum Test,
PACO trials again, in the second phase, using the initialized pheromone matrix. The statistics for the last 10-trial phase is compared with 20 purely PACO trials where no DM module is applied.
The algorithms were implemented in MATLAB 9.3. All the runs were carried out in a computer with Intel Core i5, 2.40 GHz and 8 GB RAM, running under Windows 10 operating system.
The statistics for PACO and PACO-DM-PACO trials for TSP and QAP are given respectively in Tables 3 and 4.
For the TSP, PACO-DM-PACO approach produces solutions with higher quality, in average, than PACO in 23 of 24 small instances, and in 100% of medium and large instances. The difference is statistically significant in 70% of the small instances, in 4 of the total 6 medium instances and in all 3 of the large instances with a 95% confidence (refer to next subsection). We can see from the table that the superiority of the average solution quality increases as instances get larger. This can be observed more clearly in Fig. 1 which is a plot of average cost values obtained in PACO and PACO-DM-PACO trials for each instance. Table 3 indicates, additionally, that the best tour length found in PACO-DM-PACO trials is lower (better) than that of PACO in 76% of all instances.
For the QAP, the performance of our proposal over pure PACO trials is showing a different behaviour among the four instance classes. We can observe that the performance is clearly worse in Class I instances. This is due to the search landscape of that specific class. These instances may not structurally resemble the optimum solution as they get closer to the optimum objective value [25]. In other words, a solution with a lower objective value may not have more locations of items in common with the optimum solution. This search space feature is in contrast with the main idea of our proposal, where we assume the better the solution quality the more similar the solution with the optimum. Classes II and IV, however, have the exact opposite characteristic of the search landscape [25], resulting in an improved solution quality with our PACO-DM-PACO proposal. Our approach produces higher quality results, in average, in 75% of Class II instances and 62.5% of Class IV instances, respectively. The statistical significance of the superiority of our proposal is higher in Class IV instances. PACO-DM-PACO has the highest performance in Class III instances. It produced better results, in average, in all these instances except one, and the difference is statistically significant in 5 of these 7 instances.
A lower standard deviation of the percentage deviations of the trials from the optimal solution indicates that an algorithm is more robust, it produces similar quality results in different trials. Robustness gives an idea about how predictable would the error be, and how much we can rely on the algorithm in a single trial. Tables 3 and 4 demonstrate that in 76% of both TSP and QAP instances the PACO algorithm with DM module produces more robust solutions. Also, we can observe that generally the proposed algorithm gets more robust as the problem size increases.
We use tests of statistical significance to determine whether the superiority of PACO-DM-PACO is statistically significant. We apply Non-parametric Wilcoxon Rank Sum Test to costs of each of 33 instances (a sample table for a TSP instance is given in Table 5) both in TSP and QAP experiments to determine whether the differences between the outputs of the two algorithms are statistically significant.
Plot of average cost values in PACO and DM-PACO trials for each TSP instance.
Wilcoxon Rank Sum Test is used when one wants to determine whether two samples -not necessarily in equal size- come from the same population. This way, it is possible to understand whether the difference between the samples is statistically significant or it is due to random reasons.
The null and alternative hypothesis are constructed as follows:
We used a 95% confidence level (
Another statistical test we use is the non-parametric Wilcoxon Signed Rank test to determine whether PACO-DM-PACO produces better solutions than PACO. It is a test on paired samples that tests population median of the differences of results.
Then, the null hypothesis versus the alternative are as:
Comparing overall Average Cost results given in Table 3 using Wilcoxon Signed Rank test, we obtain a
A representation of samples used in Wilcoxon Rank Sum test
Implementing DM techniques to extract frequent patterns from a set of good quality solutions and using these patterns to guide the search process is an emergent topic in hybridization of metaheuristics in recent years. In the present paper, we implemented a DM module after several independent PACO trials to predict the promising regions of the solution space and used this knowledge in subsequent PACO trials to further exploit these regions. The knowledge from DM module is used to initialize the pheromone entries instead of constructing new initial solutions. One more novelty in our proposal is the use of a path-mining algorithm (a special type of sequence mining algorithm) instead of frequent itemset mining algorithms in the DM process. Several symmetric TSP and symmetric/asymmetric QAP instances are used to compare PACO-DM-PACO trials with pure PACO trials. It is shown that the proposed approach leads to significantly better solution quality, in average. Additionally, the results obtained through integration of a DM module are shown to be more robust.
In the PACO side, the results here can be further improved, as a future study, by applying a local search procedure to solutions ants find in each iteration. A parallel implementation of the algorithm can be carried out as well.
In the DM side, DM module can be applied several times during the trials instead of the single application. Additionally, our approach can be incorporated into many other metaheuristics that use a memory model, and the resulting behavior can be analyzed.
