Abstract
In this study, we propose a novel fast learning data-driven method for the design of interval type-2 fuzzy logic system (IT2FLS). In order to accelerate the learning speed, we present a parallel mechanism to generate the antecedents of the IT2FLS and the least square method based learning algorithm to optimize the consequents. Firstly, driven by different sub-data sets, corresponding type-1 fuzzy logic systems (T1FLSs) which have the same initial fuzzy partition (thus the same initial fuzzy rule base) are parallelly obtained through the popular ANFIS method. Then, an ensembling strategy is proposed to form the type-2 fuzzy partition for each input variable through merging corresponding type-1 fuzzy sets (T1FSs) in the type-1 fuzzy partitions of the learned T1FLSs. By this strategy, the antecedents of the IT2FLS are determined and then fixed, however, the consequent parameters still need to be optimized. To achieve both excellent performance and fast training speed, a least square method based learning algorithm is provided for the optimization of the consequent parameters. Finally, three benchmark problems and one real-world application are given, and detailed comparisons with some well performed methods are made. Simulation and comparison results have verified the effectiveness and superiorities of the proposed method.
Introduction
As a popular extension of the conventional fuzzy logic system (type-1 fuzzy logic system: T1FLS), type-2 fuzzy logic system (T2FLS) nowadays is attracting more and more attentions [1–6]. In the T2FLS, type-2 fuzzy sets (T2FSs) including interval T2FSs (IT2FSs) [7, 8] and generalized T2FSs (GT2FSs) [9, 10] are adopted to replace the type-1 fuzzy sets (T1FSs). Compared to T1FSs, T2FSs have more parameters and enhanced ability to handle high levels of uncertainties [1, 3]. Naturally, T2FLSs have shown better performance than T1FLSs in many research fields, especially in the modeling, prediction and control applications. Due to the complexity of the generalized T2FLSs, till now, the most widely used T2FLSs are the interval ones. In the interval T2FLSs (IT2FLSs), IT2FSs which have interval-valued membership grades are utilized, and the computational complexity can be greatly reduced.
Generally, two ways exist in designing satisfactory IT2FLSs. One is by the manual setting while the other is by data-driven methods. In the control applications, IT2FLSs used as controllers are usually designed by the first way [11]. In the prediction and modeling applications, IT2FLSs utilized as predictors or system models are often obtained by data-driven methods. For example, to train the IT2FLSs or interval type-2 fuzzy neural networks, different kinds of data-driven learning algorithms were proposed, such as the meta-cognitive sequential learning algorithm [12], the gradient descent algorithm [13, 14], the least squares algorithms [15, 16], the hybrid algorithm [17–19], the vectorization-optimization-method [20], the multi-objective optimization method [21] and the genetic algorithm [22]. However, among all these studies, IT2FLSs are directly designed without using the popular design methods of T1FLSs. Some other works have tried to generate IT2FLS from T1FLS. In [23, 24], T1FLS was firstly designed, and then transformed to the type-2 case. In such studies, the trained T1FLS is used to initialize the antecedent and consequent parameters of the corresponding IT2FLS, and then such parameters are further optimized by learning algorithms. In this study, we present one different scheme for transforming several learned T1FLSs to one IT2FLS. In the proposed scheme, we firstly divide the whole data set to some sub-data sets, and for each sub-data set, one T1FLS will be generated by the popular ANFIS method [25, 26]. This process will be accomplished in parallel. The obtained different T1FLSs are then ensembled to form the antecedents of the IT2FLS. This is realized by the proposed ensembling strategy which can merge different type-1 fuzzy partitions into one corresponding interval type-2 fuzzy partition for each input variable.
On the other aspect, no matter which kind of method is used to design an IT2FLS, it is a time-consuming task due to its complex input-output mapping and its large number of parameters to be tuned, especially the parameters in the antecedent parts of type-2 fuzzy rules that are usually optimized by the steepest descent methods. One approach to solve this problem is to use the extreme learning method (ELM) [27, 28] which is based on the least square method that has fast learning speed and good performance. However, the ELM for IT2FLS randomly generates the parameters of the antecedent IT2FSs. Hence, this can not guarantee the coverage of the input domain, and the linguistic meanings will be lost. In this study, we utilize the above-mentioned strategy to generate the antecedent IT2FSs to solve such problems. And, the training processes of different T1FLSs for sub-data sets are done in parallel so as to accelerate the learning speed. Although the antecedents of the IT2FLS can be determined by the previous strategy, its consequent parameters still need to be learned to improve the performance. In the consequent parameter learning, a least square method based learning algorithm is presented in order to further fasten the training speed.
The main novelties and contributions of this study are listed as follows: For the sub-data sets which are divided from the training data set, T1FLSs are generated in parallel by ANFIS method. The parallel mechanism can fasten the learning speed of the proposed data-driven method. An ensembling strategy is proposed to form the type-2 fuzzy partition for each input variable through merging corresponding T1FSs in the type-1 fuzzy partitions of the learned T1FLSs. Detailed analysis has shown us how to ensemble different triangular T1FSs to corresponding trapezoidal IT2FS. The consequent parameters of the IT2FLS is optimized by the least square method. As well known, the least square method can achieve both smallest training error and smallest scaled parameters which can assure the approximation and generalization performances of the IT2FLS. Also, the learning speed of the least square method is very fast. Three benchmark problems including the prediction of the Mackey-Glass time series, the identification of one second-order time-varying system, the identification of one nonlinear plant, and one real-world wind speed prediction application are given. Detailed comparisons with some well performed methods, e.g. ANFIS [25, 26], BPNN [29, 30], RBF-AFS [31], DFNN [32], GEBF-OSFNN [33], TSCIT2FNN [34], TSK FNS [35], SANFIS [36] and Simpl-eTS [37] are made. Simulation and comparison results demonstrate that the proposed method not only has better performance but also has greatly reduced learning time.
The rest of this paper is organized as follows. In Section 2, preliminaries on IT2FS and IT2FLS will be briefly given. In Section 3, the proposed data-driven fast learning method will be presented in detail. In Section 4, simulation and comparisons will be made. At last, conclusions will be drawn in Section 5.
Preliminaries
In this section, we will briefly introduce the definition of IT2FS and the inference process of the IT2FLS.
IT2FS
The FS with crisp membership function (MF) is called T1FS while the one with fuzzy MF is named T2FS [1, 3]. Below, we formally introduce the definition of T2FS.
When the fuzzy MF grade degrades to a crisp MF grade, T2FS becomes the well-known T1FS.
Nowadays, due to the complexity of the general T2FS, its special case – IT2FS is widely used as a substitute and has found lots of applications in many research fields. The MF grade of an IT2FS is an interval. One IT2FS can be completely described by its lower MF (LMF) and upper MF (UMF) , i.e.
In this study, triangular T1FSs and trapezoidal IT2FSs as shown in Fig. 1 will be adopted.
As analyzed in many papers on fuzzy logic [1, 40], the multiple-input-multiple-output (MIMO) fuzzy logic system (FLS), including the type-2 case, can be decomposed into several multiple-input-single-output (MISO) FLSs. Hence, in this study, we only consider the IT2FLS with n inputs and one output. It can be readily extended to the multi-output case. The inference structure of an IT2FLS is shown in Fig. 2. Compared with the T1FLS, the IT2FLS has a type-reducer to transmit IT2FSs to T1FSs.
Suppose that, for the sth input variable x s , there are m s IT2FSs to partition the domain of x s . Hence, to be a complete fuzzy rule base, there are type-2 fuzzy rules, one of which can be expressed as
When a crisp input x = (x1, x2, . . . , x
n
) is given to the IT2FLS, with the singleton fuzzifier and type-2 inference process, we can obtain the interval firing strength of Rule (j1, i2, ⋯ , j
n
) as [1, 8]
In order to obtain a crisp output, the output processing including type-reduction and defuzzification is needed in an IT2FLS. There exist several popular type-reduction and defuzzification methods. For simplicity, in this study, we adopt the Begian-Melek-Mendel (BMM) method [41]. With this method, the crisp output of the IT2FLS can be computed as [41]
where α ≥ 0, β ≥ 0 and α + β = 1. Usually, α and β are respectively set to be 0.5.
To design an IT2FLS, the first thing is to generate its fuzzy rules including the antecedent parts and consequent parts. Then, the parameters in the fuzzy rules need to be optimized to achieve satisfactory performance. In this section, we will give a novel learning method to realize such objectives. Here, we firstly list the main steps of the proposed method, and then we will discuss each step in detail in the following subsections.
In this algorithm, we adopt two strategies to guarantee the learning speed. Firstly, the parallel mechanism is utilized to generate the antecedent parts of the IT2FLS. And then, the least square method which has fast learning speed is used to optimize the consequent parameters of the IT2FLS.
There are only two kinds of parameters to be manually determined before the operation of this algorithm. The first kind is the parameter m i – the number of FSs for the input variable x i (i = 1, ⋯ , n). To prevent the fuzzy rule explosion problem, the value of m i should be decided according to the number of input variables. Generally, 2 ≤ m i ≤ 5 is ok. The other one is the parameter K – the number of the sub-data sets. Both the size of the training data and the number of the computer cores will influence the determination of the value K.
Generating initial fuzzy partitions for input variables
Suppose that the training data set is , where and N is the number of the training samples.
From the training data, the input domain of the input variable x
s
can be estimated as
In our study, triangular T1FSs are adopted. The triangular fuzzy set can be expressed as
The initial fuzzy partition of the input variable x
s
is shown in Fig. 3, where the input domain is partitioned uniformly by m
s
triangular fuzzy sets. In other words, the parameters of the triangular fuzzy set can be expressed as
Then, we can generate the initial T1FLS with the following fuzzy rules
In order to improve efficiency, random division will be applied to the whole data set to obtain K sub-data sets. Each sub-data set will be used to optimize the initial T1FLS to generate corresponding trained T1FLS. In this study, ANFIS will be adopted to train the T1FLSs. These T1FLSs begin from the same initial T1FLS whose antecedent T1FSs are determined as discussed in Subsection 3.1. The form of the fuzzy rules in the initial T1FLS is as shown in (13).
From this initial T1FLS, the K sub-data sets are used to train the K T1FLSs in parallel by the ANFIS method in which the antecedent parameters of T1FLSs are learned using the back-propagation algorithm (steepest descent method) while the consequent parameters are optimized through both the back-propagation algorithm and the least square method. For each sub-data set, one trained T1FLS will be generated, i.e. K trained T1FLSs will be obtained finally. Notice that at this stage, the whole data set has been divided. The sizes of the sub-data sets are one-Kth of the size of the whole data set. Also, the learning processes are run in parallel. So, the learning speed will be faster than generating T1FLS using the whole data set.
Suppose that, after being trained by the ANFIS method, the fuzzy rule base of the kth T1FLS is as follows
In this study, we obtain the type-2 fuzzy partition for each variable through ensembling the K type-1 fuzzy partitions in the K learned T1FLSs. Here, trapezoidal IT2FS is adopted to ensemble the triangular T1FSs in the type-1 fuzzy partitions. In detail, a trapezoidal IT2FS in the type-2 fuzzy partition for the sth input variable is the ensemble of the corresponding triangular T1FSs . And, the IT2FS can be depicted by eight parameters where are the parameters of its UMF while are the parameters of its LMF. The detailed ensembling strategy is shown in Fig. 4.
From Fig. 4, we can determine the parameters of the UMF as
The endpoints of the triangular LMF can be computed as
Determining the vertex of the LMF is not straightforward. From Fig. 4, the vertex is the lowest one of the K * K intersecting points of the left and right lines of the K T1FSs. Such K * K intersecting points denoted as can be derived as
As a result, the vertex can then be determined as
From above discussion, the antecedent IT2FSs in the type-2 fuzzy rules in the fuzzy rule base are obtained by the ensembling strategy. However, the parameters (interval weights) in its consequent part still need to be optimized. Below, we will use the least square algorithm to realize this objective.
To begin, the input-output mapping of the IT2FLS in (7) is rewritten in the matrix form as
For the training samples , we need to train the IT2FLS to approximate these samples, i.e.
The training of the consequent parameters of the IT2FLS can be achieved by solving the previous linear equation under the constraint of the minimum norm least square as
The smallest norm least-square solution of this optimization problem can be obtained as
From (34), we can see that the least square method only needs to compute the Moore-Penrose generalized inverse of the training matrix. As a result, its learning speed will be very fast. On the other aspect, from (33), we can observe that the least square method can minimize the prediction or identification errors. Hence, satisfactory optimization results can be expected.
In this section, we will use three benchmark problems and one real-world application to verify the effectiveness and advantages of the proposed method. The three benchmark problems are the chaotic Mackey-Glass time series prediction, one second-order time-varying system identification, and one nonlinear plant identification. And, the real-world application is to predict the wind speed based on historical wind speed time series.
Also, the proposed method for IT2FLS will be compared with some popular and well performed methods, such as the ANFIS, BPNN, RBF-AFS [31], DFNN [32], GEBF-OSFNN [33], TSCIT2FNN [34], and Simpl-eTS [37]. For comparison, we will use the following root mean squared error (RMSE) index to evaluate the performances of different methods
In order to show the fast learning properties of the proposed method, we run the IT2FLS, the ANFIS trained by the whole data set, and the BPNN at the same simulation platform which has four 3.20 GHz Cores and 4 GB RAM. Among the compared methods (IT2FLS, BPNN, ANFIS), only the proposed one adopts the stochastic strategy. Hence, we run the proposed method 100 times in each example and then compute their 95% confidence intervals for both the RMSE and the training time.
In this first example, we consider the chaotic Mackey-Glass time series prediction problem. The IT2FLS, ANFIS and BPNN are applied to the Mackey-Glass time series generated by the time-delay differential equation as follows
In this simulation, we generate the simulation data from the initial conditions x (0) =1.2, x (k) =0 (k < 0). As in [31–34], 1000 input-output data pairs are generated. And, the first 500 are selected for training while the left 500 are for testing. Four inputs [x (t - 24) , x (t - 18) , x (t - 12) , x (t - 6)] are used to predict the current value x (t) by the models.
Prediction results of the proposed method for the testing data are shown in Fig. 5. From this figure, we can observe that the performance of the IT2FLS is very good, and the prediction errors lie in a very small scale. Comparison results of different models are shown in Table 1 where the learning epoch for the IT2FLS is 100(1) which means that the epoch of the ANFIS method for the T1FLSs is set to be 100 and the consequent parameters are determined by the least square method only once. Also, in this and following tables, the performance indices of IT2FLS is the 95% confidence intervals of the corresponding indices in the 100 running times. From the statistical point of view, the IT2FLS performs best compared with the ANFIS, BPNN, RBF-AFS [31], DFNN [32], GEBF-OSFNN [33], TSCIT2FNN [34]. According to the training time, the IT2FLS with the proposed learning method has the fastest learning speed compared to the ANFIS and BPNN.
In this example, we use the proposed method to identify one second order time-varying system which is expressed by the following equation [35]
In this case, we use two variables y (t - 1) and u (t) to predict the output y (t). We generate the training data in the same way as in [35]. In the training, the control input u (t) is randomly generated in [-1, 1] when t = 1, ⋯ , 400 and is given by sin(πt/45) for the rest ones. In the testing, the control input is given as
For the testing data, the identified result and errors of the proposed method are shown in Fig. 6. Again, the performance of the IT2FLS is satisfied. Comparisons with ANFIS, BPNN, T2 TSK FNS and T1 TSK FNS [35] are listed in Table 2. In this example, the proposed method has similar or better performance than the comparison methods. From the statistical point of view, it still spends much less time to obtain better performance compared with the ANFIS and BPNN.
This example shows the identification of one nonlinear plant using the proposed method. The nonlinear plant is depicted as follows [36, 37]
There are three inputs [y (t) , y (t - 1) , u (t)] and one output y (t + 1) in this identification problem. The training and testing data are collected in the same way as in [36, 37]. Totally, 5000 and 200 data pairs are respectively generated in the training and testing.
In this example, the identification results and errors of the IT2FLS are demonstrated in Fig. 7. From this figure, the identified errors are very fine. Comparisons with ANFIS, BPNN, SANFIS [36] and Simpl-eTS [37] are given in Table 3. Again, the performance of the proposed method is the best whereas it spends much less training time.
Accurate prediction of wind speed is very helpful to the maintenance scheduling and dispatch planning [42, 43]. However, accurate wind speed prediction is a challenging task due to the wind’s unstable and random natures.
In this experiment, the wind speed data were downloaded from the website: “http://climate. weather.gc.ca/”. The data were collected every hour between July 1, 2014 and July 31, 2015 in Regina, Saskatchewan, Canada. In the data set, there are 9504 samples. We select the data during July 1, 2014 and June 30, 2015 (totally 8760 samples) for training, and those during July 1, 2015 and July 31, 2015 (totally 744 samples) for testing. And, we use the previous four hours’ wind speeds to predict that of the next hour.
After being trained, the prediction results of the IT2FLS for the testing data are depicted in Fig. 8. From this figure, we can observe that satisfactory performance can be achieved by the proposed method. Comparison results of the IT2FLS, ANFIS and BPNN are listed in Table 4 in detail. From this table, it is obvious that, compared with ANFIS and BPNN, the IT2FLS can give similar performance. However, the learning speed of the proposed method is greatly improved. The proposed method has about 60 times faster learning speed than ANFIS and about 50 times faster learning speed than BPNN.
Conclusion remarks
In this paper, we presented a fast learning approach for the data-driven design of IT2FLSs. In the proposed approach, we provide two strategies to accelerate the learning speed. The first one is for the antecedent IT2FSs generation in which parallel learning mechanism is used. The second one is for the consequent parameters learning which is realized by the least square method. Such method has fast learning speed and can achieve excellent approximation and generalization performances as proven by our four examples.
In the proposed IT2FLS, its antecedent parts are generated by an ensembling strategy. This strategy abandons the steepest descent method or back-propagation algorithm based parameter learning schemes which are widely-used in many existing studies on the design of IT2FLSs. Therefore, the drawback of long training time in the steepest descent methods can be avoided.
The performance and fast learning speed have been verified by three benchmark problems and one real-world application. This tells us that the proposed method may be more suitable for the problems with large-scaled data, such as the traffic flow prediction and the stock market prediction applications.
Based upon the satisfactory approximation and generalization performances of the proposed strategy, it can also be applied to more general nonlinear systems introduced in [44, 45] and more complex industrial processes studied in [46, 47]. This will also be one of our future research directions. On the other aspect, data preprocessing methods including the clustering analysis may be expected to further improve the performance of the proposed strategy. We will also try to explore the clustering analysis based learning strategies in the near future.
Footnotes
Acknowledgments
This work is supported by National Natural Science Foundation of China (61473176, 61273149, 61573225), and the Natural Science Foundation of Shandong Province for Outstanding Young Talents in Provincial Universities (ZR2015JL021).
