Abstract
Ensemble pruning is usually used to improve classification ability of an ensemble using less number of classifiers, and it is an NP-hard problem. Existing ensemble pruning approaches always find the optimal sub-ensemble using diversity of classifiers or running heuristic search algorithms separately. Diversity and accuracy of classifiers are widely recognized as two important properties of an ensemble. The increase of the diversity of classifiers must lead to the decrease of the average accuracy of the whole classifiers, and vice versa, so there is a tradeoff between diversity and accuracy of classifiers. Finding the tradeoff is the key to a successful ensemble. Heuristic algorithms have good results when it comes to finding the tradeoff, but it is unfeasible to do an exhaustive search. Hence, we propose a Spread Binary Artificial Fish swarm algorithm combined with a Double-fault measure for Ensemble Pruning (SBAFDEP) using a combination of diversity measures and heuristic algorithms. First, the classifiers in an initial pool are pre-pruned using a double-fault measure, which significantly alleviates the computational complexity of ensemble pruning. Second, the final ensemble is efficiently assembled from the retaining classifiers after pre-pruning using the proposed Spread Binary Artificial Fish Swarm Algorithm (SBAFSA). Simulation and experiment results on 25 UCI datasets show that SBAFDEP performs better than other state-of-the-art pruning approaches. It provides a novel research idea for ensemble pruning.
Introduction
Ensemble learning is a challenging task in pattern recognition [1], machine learning [2], and data mining [3]. There are mainly two steps with respect to constructing a classification prediction model. In the first step, multiple classifiers with a large diversity are generated; in the second step, the final results are achieved by merging the predictions of all the classifiers. Remarkable improvement of an ensemble composed of multiple classifiers in predictive ability in comparison with a single classifier [4, 5], which has been applied in many fields, e. g., face recognition [6], age prediction [7], image processing [8], medical diagnoses [9], bioactive molecule prediction [10], intrusion detection [11], and the like. The increase of rapid growth in the size of data, which brings heavy storage requirements and computational overheads for ensemble learning [4]. Therefore it motivates the appearance of ensemble pruning [4]. Ensemble \nobreak pruning aims at achieving a better predictive ability using an sub-ensemble composed of a part of classifiers extracted from an initial pool of classifiers, which requires less resource. Typically, the final results are attained by aggregating a fraction of classifiers in an initial pool [4, 5].
It is well-accepted that diversity of classifiers is the key to the success of an ensemble [12]. The increase of the diversity among classifiers must lead to the decrease of the average accuracy of the whole classifiers, and vice versa. Hence there is a tradeoff between diversity and the average accuracy of classifiers [13]. The remarkable improvements of an ensemble can be attained when finding the tradeoff [14]. However, finding the tradeoff will bring on large computational requirements, including the training costs, pruning costs, and prediction costs [5].
For an ensemble with M classifiers, there are 2 M - 1 nonempty subsets, and it is an NP-hard problem [4]. Thus, it is unfeasible for most of the ensemble pruning approaches to find the exact solution; we just need to find the near-optimal sub-ensemble [5]. To find the near-optimal sub-ensemble, the literature in the Section 1.1 proposed various ensemble pruning approaches.
Based on the above analysis, it is easy to see that most ensemble pruning approaches search for the final ensemble using diversity measures or heuristic searching algorithms separately. Those pruning approaches based on diversity measures, using different strategies, cannot exactly find the optimal sub-ensemble; Those pruning methods based on heuristic algorithms cannot also exhaustively search for it. To address the said issue, we attempt to search for the final ensemble using a combination of heuristic algorithms and diversity measures. It is the reasons that a part of classifiers with bad performance are pre-pruned using diversity measures [4], which dramatically alleviates the computational complexity of ensemble pruning, and that the sub-ensemble very close to being optimal is efficiently achieved from the retaining classifiers after pre-pruning using heuristic algorithms [15].
In addition, Double-Fault measure (DF) do well on measuring the diversity of classifiers [14, 16], and pre-pruning the classifiers who perform badly. Artificial Fish Swarm Algorithm (AFSA) has some advantages with respect to searching for the optimal solution [17–22], and AFSA can be used as a searching strategy to find the final ensemble extracted from the retaining classifiers after pre-pruning. Therefore, Spread Binary Artificial Fish swarm algorithms combined with Double-fault measure for Ensemble Pruning (SBAFDEP) is proposed. The mission is achieved in two steps: (1) The classifiers in an initial pool of classifiers are pre-pruned using the double-fault measure, which markedly reduces the computational overheads of ensemble pruning. (2) The final ensemble is efficiently attained from the retaining classifiers after pre-pruning using the proposed SBAFSA. The combination of DF and SBAFSA provides a new research technique for ensemble pruning.
Related work
Plenty of researchers studied the pruning techniques, so a lot of effective ensemble pruning methods have been proposed. In general, these algorithms are classified into four categories [5, 23], which are listed as follows.
Ordering-based pruning refers to those methods which first assign a rank to each classifier in an initial pool of classifiers based on a certain diversity measure. Then the diversity measures of the classifiers with over a predefined value are selected to construct the final ensemble. Martınez-Muñoz et al. [4, 25] proposed ordering-based approaches, including Reduce-Error (RE), Complementarity measure (COM), Margin Distance Minimization (MDM), and found that the classifiers are complementary. Experimental results on UCI datasets demonstrate that an ordered ensemble outperforms the original ensemble. Margineantu et al. [26] proposed Kappa pruning, which sorts the classifiers (smallest to largest) according to their Kappa measure, and the classifiers with the smaller Kappa measures are integrated. Lu et al. [27] proposed an ordering-based approach, which achieves the final ensemble by ordering the ensemble members according to their contribution of each classifier in a descending order. Margin-based Ordered AGgregation for ensemble pruning (MOAG) is proposed by Guo et al. [28]. It selects the classifiers with larger margin-based criterion to constitute the final ensemble. Guo et al. [29] proposed a novel metric named margin and diversity-based measure, and the final ensemble is composed of members in a decreasing order based on that measure. Galar et al. [3] proposed a new ordering-based pruning method, which improves the ensemble performance of ensembles composed of a fraction of classifiers for imbalanced datasets, and achieves a good result.
Optimization-based approaches convert the ensemble pruning problem into a combinatorial optimization problem which attempts to find the optimal sub-ensemble that performs the best [5]. Zhou et al. [15] proposed a well-known approach called a Genetic Algorithm based Selective ENsemble (GASEN), random weights are assigned to the classifiers in an initial pool of classifiers, and GA is used to evolve the weights. The classifiers whose weights are above a predefined threshold can be selected to constitute the final ensemble. Zhang et al. [30] proposed a Selective ENsemble algorithm based on the Glowworm Swarm Optimization algorithm (GSOSEN). It uses the GSO as a searching strategy instead of the GA of GASEN.
Rokach et al. [31] proposed Collective Agreement-based ensemble Pruning (CAP), which considers the diversity and the accuracy of classifiers simultaneously. CAP needs to calculate the agreement measure between the member predictions and the true class labels, as well as the agreement measure between any two members. The member which highly agrees with the true class labels and has low inter-agreement with others can be selected to constitute the final ensemble.
Clustering-based approaches employ clustering techniques to find some representative classifiers to construct the final ensemble. It involves two steps: In the first step, the classifiers in an initial pool of classifiers are partitioned into different clusters. The classifiers extracted from the same cluster make similar results, while the classifiers created from different clusters perform in a more diverse manner. There are several clustering techniques which can be used in ensemble pruning, for example: k-means [32], deterministic annealing [33], hierarchical agglomerative clustering [34], etc. In the second step is to make the classifiers in the ensemble perform diversely. To do this we should extract the classifiers from the different clusters. The classifiers at the centroid of each cluster are selected to constitute the pruned ensemble, which was proposed by Bakker and Heskes [33].
This section discusses other pruning approaches, which are the algorithms that do not belong to any one of the aforementioned three categories. Martınez-Muñoz et al. [26] proposed an ensemble pruning approach, which prunes an ensemble using AdaBoost. An et al. [16] proposed the Double-Fault measure based on voting-based ELM (DF-D), which achieves the final ensemble using an one-side confidence interval according to the double-fault measures of the classifiers. An effective Ensemble Pruning algorithm based on Frequent Pattern (EP-FP) was proposed by Zhou et al. [35], which finds the frequent classifiers by calculating a Boolean matrix, then integrating the classifiers. Cavalcanti et al. [36] proposed combining Diversity measures for ensemble Pruning (DivP). On one hand, the different diversity measures are combined using Genetic Algorithm (GA) to constitute the combining diversity measure. On the other hand, the pruned ensemble is achieved using a graph coloring method. Ykhlef et al. [23] proposed ensemble Pruning based on Simple Coalitional Games (SCG-P), which prunes an ensemble using a minimal winning coalition achieved by calculating the Banzeff power factor of each classifier. Dai et al. [37] proposed three new diversity measures for ensemble pruning by considering diversity and accuracy simultaneously, with the final ensemble achieved using a Greedy Ensemble Pruning (GEP) algorithm.
Contributions and outline
The contributions of this work are described as follows: A novel approach for ensemble pruning is proposed using the combination of SBAFSA and DF. The proposed SBAFSA performs well when it comes to the convergence speed and precision. DF can be used to pre-prune the classifiers with bad performance, which significantly alleviates the computational complexity of ensemble pruning. Experimental results on 25 UCI datasets demonstrate the effectiveness and efficiency of the proposed method. It provides a new research idea for ensemble pruning.
This paper’s basic structure is as follows. In Section 2, the basic concept of diversity measures is presented. In Section 3, SBAFSA is proposed. We introduce SBAFDEP in Section 4, and how to use it. Simulation and experiments are expressed in Section 5. Finally, conclusions and future works are given in Section 6.
Diversity measures
Diversity of classifiers is very important for a successful ensemble, but what standard should be used to measure the diversity of the classifiers? How can we make use of the existing diversity measures to design a classifier with a good amount of diversity and a strong generalization performance? The above problems are unresolved issues [16]. Diversity in ensemble systems determines the performance of ensemble learning, so we need an effective diversity measure. Although many diversity measures are used in the literature, there is not a widely accepted diversity measure. No matter which diversity measure we take, solid empirical and theoretical validation should be presented. Five pair-wise diversity measures are listed as follows [36].
The main notations used in this work are summarized as follows:
N: the number of samples;
X: the sample dataset, X = {x1, x2, ⋯ , x N };
Y: the class labels of sample set X, Y = {y1, y2, ⋯ , y N };
M: the size of the classifiers;
F: the classifier set, F = {f1, f2, ⋯ , f M };
f i (x k ): classification result of the sample x k classified by the classifier f i , i = 1, 2, ⋯ , M, k = 1, 2, ⋯ , N;
Contingency table for two classifiers
Contingency table for two classifiers
To calculate diversity measures between two classifiers, the contingency table [36] between f i and f j is presented in Table 1.
In Table 1, a is the number of examples in X correctly classified by both f i and f j ; b is the number of examples correctly classified by f i and incorrectly classified by f j ; c is for the number of examples incorrectly classified by f i and correctly classified by f j ; d is the number of examples incorrectly classified by both classifiers. Thus, a + b + c + d = N.
The correlation coefficient measure (ρ) is formulated using Equation (1).
The Q-statistic measure (Q) is similar to a correlation coefficient measure. It rewritten as Equation (2).
The pair-wise Kappa measure (Kp) was usually employed to analyze the diversity of classifiers. It is presented as Equation (3).
The disagreement measure is expressed as Equation (4).
The double-fault measure is calculated as Equation (5).
Li et al. [17] first proposed Artificial Fish Swarm Algorithm (AFSA) in 2002, which is a novel swarm-intelligence technique, inspired by the natural fish behaviors, including swarming, following, and preying behaviors. It has been applied in many optimization problems, such as combinatorial optimization, function optimization et al. [18, 19]. An initial population of Artificial Fishes (AFs) is produced using a random function, AFSA has the abilities of self-organized, and it does well when it comes to converging on an optimal solution. The global optimal solution recorded in the bulletin board is updated using swarming, following, and preying behaviors at every iteration process. The parameters of AFSA include N (the size of the AF’s population), Visual (a visual scope), Step (the maximal moving step length), and δ (a crowd factor). The swarming, following, and preying behaviors are introduced in detail in Ref. [17, 20]. To solve binary discrete problems, the proposed SBAFSA is presented.
Mapping operation
To make a discrete AFSA simple yet efficient, we can update the Afs’ position by mapping operation in a simple way, and the position is mapped into 0 or 1. Assuming that the current state of AF is X
i
= (xi1, ⋯ , x
ik
, ⋯ , x
in
) (n is the number of the space’s dimension). The mapping operation is modified as follows [38].
Where rand is a random number in (0, 1), and S (x) = 1/ - (1 + exp(- x)) is a sigmoid function.
In the basic AFSA, AFs will rapidly gather together using swarming and following behavior, which may lead to low population diversity and prematurity convergence. To avoid prematurity convergence, spread behavior is introduced into AFSA. As we know, during biological evolution, organisms can move from one habitat to another one for the environmental stress, population density and change of living environment [39]. It cannot only balance and keep the stabilization of eco-environment, but prevent further extinction of the organism. We are inspired by that, SBAFSA is presented.
Assign nutrition value
Where t is the current iteration, N0 is a constant,
Then set a upper limit N
up
and a lower limit N
down
. If
Where Y i is the fitness function of the current AF, ⌊⌋ is a rounded down, Ymax, Ymin, Bmax and Bmin are the optimal and the worst value of the current population, the number of the maximum and the minimum seeds, respectively.
Offspring generated using breeding operation are distributed into the binary solution space, who follow Gaussian distributions. We use the parent AF, its visual as the axis, the variance of Gaussian distributions [40], respectively. The positions of the generated offspring are shown as Equations (9) and (10).
Where λ0 is a constant, tmax is the number of the maximal iteration, X ij is a new generated offspring.
The value of the tangent function decreases with the decrease of the independent variable. It is easy to see that, the greater the value of λ is, the more far away from their parent the generated offspring is. At the later evolution stage of AFSA, value of λ decreases with the decrease of the iteration t, and the generated offspring is getting closer to their parent. All of the above can broaden the searching range of the AFs.
After the spread behavior, there will have a great increase with respect to the population size. To adequately decrease those AFs who perform badly, and a selection behavior is used. In SBAFSA, the generated offspring will compete with their parents. If
SBAFDEP
The proposed approach is presented in this section, which uses the combination of SBAFSA and DF. It involves two steps: the classifiers in an initial pool are pre-pruned using the double-fault measure, and the retaining classifiers after pre-pruning are further pruned using SBAFSA. The proposed SBAFDEP is presented as follows.
Initial pool of classifiers
An initial pool with M classifiers is generated using the bootstrap sampling method in Bagging. As we know, the base classifiers in ensemble pruning must be unstable [36]. The Extreme Learning Machine (ELM) is unstable [16], and it has fast learning speed. Thus, the ELM is suitable as the base classifier in this work.
Pre-pruning based on double-fault measure
For an initial pool of classifiers which contains M classifiers, it has 2 M - 1 nonempty subsets, which is an NP-hard problem. Heuristic algorithms do well when it comes to finding the tradeoff between the average accuracy and the diversity of classifiers. While it is hard for heuristic algorithms to exhaustively search for the optimal solution. We need to pre-prune the classifiers in the pool, which can significantly reduce the computational burdens of ensemble pruning.
Generally speaking, five pair-wise diversity measures work well with respect to measuring the diversity among classifiers, so which one can be used as diversity measure for pre-pruning? We find that the double-fault measure performs better than others via past experiments. We attempt to employ the double-fault measure to pre-prune the classifiers with bad performance. We will discuss the above question in detail in the Section 5.1.
Pruning based on SBAFSA
The fitness function of the selective problem can be formulated as follows [15, 30].
Where A indicates the classification accuracy between the predictive results and the class labels
The outline of SBAFDEP is shown in Fig. 1. First, the diverse classifiers are generated using the bootstrap sampling method, which are used to constitute an initial pool of classifiers. Second, the classifiers are sorted (smallest to largest) according to their double-fault measures, and the first M′ classifiers with smaller double-fault measures are retained. Third, SBAFSA is employed to search for the optimal sub-ensemble from the retaining M′ classifiers. The classification accuracy of sub-ensemble is taken as the objective function, and the bulletin board is used to record the optimal solution by calculating the fitness value of each AF in the initialization population. The optimal solution in the bulletin board can be updated using leaping, swarming, following, preying, spread and selection behaviors in SBAFSA, the sub-ensemble extracted from the retaining classifiers can be efficiently achieved.

The outline of SBAFDEP.
To evaluate the performance of SBAFDEP, we have carried out experiments on 25 UCI classification tasks. The experiments were implemented in Matlab 2017a. The experiments were repeated 30 times independently for a goal of reducing random effects of experiments, and the average results were attained. The parameters of SBAFSA were set as follows: the visual scope is half of the number of the retaining classifiers after pre-pruning, the crowd factor δ = 0.9, Step = 1, B max = 5, Bmin = 1, λ0 = 1.5, tmax = 500, N up = 5, N down = 1.
The performance of SBAFDEP was assessed based on 25 UCI datasets, which are shown in Table 2. Experimental datasets were divided into five folds using k-fold cross validation with k = 5, and three folds for training, one for validation, and one for testing.
Description of UCI datasets used in this work
Description of UCI datasets used in this work
The classifiers in an initial pool were pre-pruned using the double-fault measure, which markedly reduced the number of the classifiers with worse performance and less diversity. However, how many classifiers should be retained using the double-fault measure to constitute the final ensemble? We will discuss the above question as follows.
The average ensemble accuracies decease as the increase of five pair-wise diversity measures. The ensemble accuracy can then be improved by pre-pruning a collection of classifiers with the larger pair-wise diversity measures, which has been proven in literature [12, 14]. Figure 2 shows that the average classification accuracies of random bagging and ordered bagging based on the pair-wise diversity measures with initial pools composed of 100, 200 and 300 classifiers on Column and Spambase datasets. We can find from Fig. 2 that, the curves of ordered ensemble accuracy based on the five pair-wise diversity measures go up first and then go down, and that the double-fault measure outperforms the other measures. In the beginning, there are a few classifiers in the ensemble, so the ensemble accuracy goes up. As the increase of the size of the initial pool, there are many redundant classifiers in the ensemble, which results in the decline of the ensemble accuracy. We can also see from Fig. 2 that, the ensemble accuracy can achieve the maximum before the number of classifiers reaches 25. In other words, the first 25 classifiers with the smaller double-fault measures should be remained. Therefore, we advise that the number of the retaining classifiers is 25, namely, M′ = 25. The retaining classifiers are used for further pruning based on SBAFSA.
Tables 3 and 4 indicate the results of SBAFDEP on 25 datasets, and in comparison to the original ensemble (Bagging). SBAFDEP does well when it comes to pruning the classifiers in the initial pool, and more than 80% of the classifiers in the pool has been pruned using SBAFDEP. The number of classifiers has been significantly reduced after pre-pruning, so that SBAFSA can achieve the final ensemble with a good efficiency. It is easy to see from Tables 3 and 4 that, SBAFDEP performs well when the number of the initial pool of classifiers is set at 200. When the number of classifiers is over 200, the ensemble accuracy can be slightly improved. So it is unnecessary to continue increasing the size of the initial pool. We advise that the size of the initial pool of classifiers is set at 200.

Average ensemble performance of random bagging and ordered bagging based on pair-wise diversity measures on Column and Spambase datasets.
Comparison with Bagging on different pool sizes (50, 100, 150) on the test datasets. SB: SBAFDEP, Bag: Bagging
To further evaluate the performance of SBAFDEP, we compare it with the following approaches: Bagging [41], Kappa [26], AGOB [24], POBE [25], DREP [42], DF-D [16], GASEN [15], GSOSEN [30], MOAG [28], EP-FP [35], RRE [5], DivP [36], SCG-P [23] and SDAcc [37]. Bagging can make the initial pool of classifiers perform diversely by selecting each sample with the same probability. Kappa, AGOB, POBE and MOAG show that the order in the classifiers is of importance. DREP starts the ensemble with just one classifier, and grows the ensemble by adding new classifiers, which makes the pruned ensemble perform in a diverse number of ways, and achieves good results. DF-D selects the classifiers whose double-fault measures belong to the one-side confidence interval. GASEN and GSOSEN employ GA and GSO to search for the optimal sub-ensemble, respectively. RRE achieves the final ensemble composed of the selected classifiers using RE and the discarded classifiers. DivP combines different pair-wise diversity matrices, and achieves the final ensemble using a graph coloring method. SCG-P assesses the diversity contribution of each classifier using the Banzhaf power index, and the final ensemble with the minimal winning coalition extracted from the classifiers in an initial pool can be achieved. SDAcc considers the diversity and the accuracy of the classifiers simultaneously, which makes the pruned ensemble attain a good result.
Comparison with Bagging on different pool sizes (200, 250, 300) on the test datasets. SB: SBAFDEP, Bag: Bagging
Classification accuracy and size of the pruned ensembles achieved by comparative approaches on pool size of 200
To evaluate the performance of SBAFDEP, we compare it with other ensemble pruning approaches with the initial pool size of 200. The results achieved by all the pruning approaches using the initial pool sizes of 200 are reported in Tables 5 and 6. “Win”/“Tie”/ “Loss” indicate that the number of times in which SBAFDEP scores were better/neutral/inferior than the other pruning approaches. We can see from Tables 5 and 6 that, SBAFDEP can achieve better results than the other pruning approaches with less number of classifiers on most validation datasets. The members of the pruned ensembles achieved by SBAFDEP are more than that of DivP, but SBAFDEP performs better than DivP.
Classification accuracy and size of the pruned ensembles achieved by comparative approaches on pool size of 200
Comparison results using t-test method
Remark: H = 1 shows that hypothesis H0 is rejected, i.e., SBAFDEP significantly improves the classification performance of other state-of-the-art pruning approaches at 5% significance level, respectively.
To test whether the difference between the proposed approach and other approaches is significant or not, a t-test method at a significance level of 0.05 was used in this work. If the p-value was less than 0.05, then the hypothesis H0 was rejected, and the difference between the two approaches was significant, and vice versa. The p-values produced in the test are presented in Table 7, which indicates that the proposed method attains a better result with significant differences in all fourteen cases. In addition, the trade-off between the average sizes of the pruned ensemble and the average accuracy is shown in Fig. 3. It is easy to see from Fig. 3 that, SBAFDEP outperforms other pruning approaches.

Relationship between ensemble size and average performance of different approaches.
In the proposed SBAFDEP, SBAFSA is used to search for the optimal sub-ensemble from the retaining classifiers after pre-pruning. Thus, the performance of the proposed SBAFSA needs to be evaluated, and it is compared with the following heuristic algorithms: SbAFSA (Simplified binary Artificial Fish Swarm Algorithm) [19], IBAFSA (Improved Binary Artificial Fish Swarm Algorithm) [21], BAFSA (Binary Artificial Fish Swarm Algorithm) [18], IAFSA (Improved Artificial Fish Swarm Algorithm) [22], AFSA [20], IDGSO [30], and GA [15]. The following experiments using an initial pool of 200 classifiers were implemented on Column and Spambase datasets, which are displayed in Fig. 4. After pre-pruning according to the double-fault measure, the above eight heuristic algorithms were employed to search for the optimal sub-ensemble. These heuristic algorithms were implemented as described in their respective papers.
From Fig. 4, it is easy to see that SBAFSA achieves a faster convergence speed than the other seven approaches, and we can also find that the performance achieved by SBAFSA, first rise up and stay stable. When the number of iterations is over 500, the performance of SBAFSA cannot be significantly improved. We advise that the iterations is set at 500. We can see from Fig. 5.1 that, when the visual scope is 25, SBAFSA performs the best. Thus, we advise that the suitable visual scope is set at half its size of classifiers after pre-pruning. As the increase of the population size, the classification accuracy attained by SBAFSA mounts and levels off, which is shown in Fig. 5.2. We advise that the population size is set at 25.

Relationship between the performance of heuristic algorithms and iterations on the Column and Spambasedatasets.

Relationship between the performance of heuristic algorithms and parameters on the Column dataset.
In this work, we proposed a novel ensemble pruning approach, named spread binary artificial fish swarm algorithm combined with a double-fault measure for ensemble pruning (SBAFDEP). We then search for the final ensemble using SBAFSA after pre-pruning based on a double-fault measure. Heuristic algorithms tend to do well when it comes to finding the tradeoff between the accuracy and the diversity of classifiers. Ensemble pruning is an NP complete problem, but it is hard for heuristic algorithms to do an exhaustive search. On one hand, a double-fault measure can be used to pre-prune the classifiers in a generated pool, which can downsize the pool, and dramatically reduce the computational burdens of ensemble pruning. On the other hand, the final ensemble extracted from the retaining classifiers after pre-pruning is efficiently achieved using the proposed SBAFSA. Therefore, the combination SBAFSA and double-fault measure can tackle the ensemble pruning problem. Experimental comparisons on 25 UCI datasets demonstrate that SBAFDEP outperforms other state-of-the-art pruning approaches, and that the proposed SBAFSA performs better than other binary heuristic algorithms with respect to the convergence speed and precision.
In future work, we will try to employ other diversity measures for pre-pruning, and then search for the final ensemble using heuristic algorithms. We believe that these combinations of diversity measures and heuristic algorithms can generate promising results, which can provide new research ideas for ensemble pruning.
Footnotes
Acknowledgement
This work was supported by the National Nature Science Foundation of China under Grant No. 91546108, No. 71490725, No. 71271071, No. 71301041, and No. 61806068, the National Key Research and Development Plan under Grant No. 2016YFF0202604, the Natural science foundation of Huzhou under Grant No. 2018YZ11, and the Open Research Fund Program of Key Laboratory of Process Optimization and Intelligent Decision-making (Hefei University of Technology), Ministry of Education. The authors would like to thank the reviewers for their comments and suggestions.
