Abstract
Fuzzy clustering has been widely applied in T-S fuzzy model identification for nonlinear systems, however, tradition type-1 fuzzy clustering algorithms can’t deal with uncertainties in real world, an improved interval type-2 fuzzy c-regression model (IT2-FCRM) clustering is proposed for T-S fuzzy model identification in this paper. The improved IT2-FCRM adapts a new objective function, which makes the boundary of clustering more clearly and reduces the influence of outliers or noisy data on clustering results. The premise parameters of T-S fuzzy model are upper and lower hyperplanes obtained by improved IT2-FCRM, and the upper and lower hyperplanes are used to build hyper-plane-shaped type-2 Gaussian membership function. Compared with the hyper-sphere-shaped membership function of tradition IT2-FCRM, the hyper-plane-shaped membership function is more coincided with point to plane sample distance described by FCRM clustering. The simulation results of several benchmark problems and a real bed temperature in circulating fluidized bed plant show that the identification algorithm has higher accuracy.
Keywords
Introduction
For a complex controlled object or uncertain system, it is usually very difficult or even impossible to determine the accurate mathematical model. Fuzzy system can approximate nonlinear dynamic system with arbitrary accuracy, it is widely applied in modeling nonlinear system. The fuzzy identification algorithm can establish fuzzy model of nonlinear system using system’s input and output sample data by certain algorithm. T-S fuzzy model is a local linearization model [1], and its premise parts are generally input variables, output variables and/or state variables of the system, and the consequent parts are linear expressions. It is an important method for fuzzy modeling of complex nonlinear multivariable system.
T-S fuzzy model identification includes structure identification and parameter identification. The main task of structure identification is to select input variables, rules number, membership functions and parameter identification is to define parameters of membership functions and linear expressions for consequent parts. In general, structure identification mainly relies on expert knowledge and experience, which is time consuming and inefficient. Fuzzy clustering can classify sample data into several classes according to their similarity degrees without teacher’s guidance or prior knowledge, and it is widely applied to T-S fuzzy model identification in recent years [2–4]. The commonly used clustering algorithms in T-S fuzzy model identification mainly includes fuzzy c-means (FCM), Gustafson-Kessel (G-K), Gath-Geva (G-G), subtractive clustering and so on. The above fuzzy clustering algorithms are hyper-sphere-shaped clustering for the sample data distance is Euclidean distance, which indicates that the input-output data in one clustering belongs to the same hypersphere shaped. However, T-S fuzzy model is described by linear functions, which means that the input-output data in one clustering should be sharing the same functional relationship. Then fuzzy c-regression model (FCRM) clustering [5], whose distance is point to plane, is proposed for T-S fuzzy model identification [6–9]. The premise membership function of T-S fuzzy model identified by FCRM clustering algorithm is bell-shaped, like Gaussian membership function. In order to modeling T-S fuzzy model more accurate, Li proposed a novel hyper-plane-shaped membership function [10, 11], which is more suitable for T-S fuzzy model identification when adapting FCRM clustering algorithm.
For the membership degree of tradition fuzzy sets (type-1 fuzzy sets) is crisp value, and it can’t deal with uncertainties of real applications, like measurement noise of input and output data, different person may define different membership functions, nonlinearity of actuators, sensor drift and so on. With the development of fuzzy sets, type-2 fuzzy sets have been concerned in recent years. The concept of type-2 fuzzy sets was firstly proposed by Zadeh [12]. Type-2 fuzzy sets are defined by primary membership function and secondary membership function. And it contains a type reduction procedure compared with type-1 fuzzy sets. Type reduction is a complex operation and limited the application of type-2 fuzzy sets for a long time. Mendel and his team simplified the secondary membership function of type-2 fuzzy sets to 1, defined interval type-2 fuzz sets [13], which reduced the complexity of type reduction. With the theories development of type-2 fuzzy sets, some efficient type reduction algorithms for general type-2 fuzzy sets have been studied [14]. Based on these efficient type reduction algorithms, type-2 fuzzy logic system has been applied in many fields, such as chaotic time series prediction [15], fuzzy controlling [16], clustering, classification and pattern recognition [17], neuro-fuzzy systems [18], fuzzy model identification [19] and so on. Also, type-2 fuzzy clustering has been applied in T-S fuzzy model identification, such as interval type-2 FCRM (IT2-FCRM) clustering [20, 21], interval type-2 fuzzy c-means [22]. Zou introduced a hyper-plane-shaped membership function in premise parts of T-S fuzzy model when applied IT2-FCRM clustering for T-S fuzzy model identification procedure [23, 24].
In order to overcome the shortcoming of FCM that it didn’t have bounded support or decay rapidly when producing membership functions, Höppner proposed a modified FCM clustering by a new objective function and rewarded more crisp membership degrees [25]. Zhu extended the new objective function and proposed a generalized fuzzy c-means clustering algorithm [26]. In this paper, an improved interval type-2 fuzzy c-regression model clustering is proposed for T-S fuzzy model identification. The main contributions of this paper mainly contains the following twofold. The first contribution is the proposal of an improved interval type-2 fuzzy c-regression model clustering, the improved IT2-FCRM introduces a new objective function in [26]. The new objective function is applied in tradition type-1 fuzzy c-regression model (T1-FCRM) clustering and the converge of this new T1-FCRM will be proofed. Furthermore, this new T1-FCRM will extend to IT2-FCRM by 2 weighting exponents in the membership degree. And the improved IT2-FCRM is applied in T-S fuzzy model primary parameters identification. The secondary contribution is the application of a hyper-plane-shaped membership function, which is more coincidence with the consequent parts of T-S fuzzy model.
The rests of this paper are organized as follows: Section 2 is the definition of type-2 T-S fuzzy model. Section 3 describes general IT2-FCRM clustering algorithm. Section 4 proposes the improved IT2-FCRM clustering algorithm and T-S fuzzy model identification procedure. Section 5 presents the simulation results by the proposed identification algorithm and section 6 draws the conclusions and future works.
Type-2 T-S fuzzy model
T-S fuzzy model is defined by several if-then fuzzy rules, the i-th rule is shown as follow:
and
where i = 1,2, . . . ,c, c is the number of fuzzy rules, x
j
(j = 1,2, . . . ,M) are the fuzzy model inputs, y
i
is the i-th fuzzy rule output,
Type-2 T-S model shares the same forms with type-1, it contains 3 conditions: Premise fuzzy subset
then
Premise fuzzy subset
then
where, Premise fuzzy subset
then
In this paper, A2-C0 type-2 T-S fuzzy model is applied for nonlinear system identification.
Interval type-2 fuzzy c-regression model
For N pairs of sample data (
In (1),
The task of FCRM is to minimize the objective function as (2).
In (2), m is weighting exponent, usually is 2, u
ik
is the degree of input-output data (
d
ik
in (2) is the distance of (
In (4),
The FCRM clustering iterative equations are shown as (5) and (6).
In (6):
IT2-FCRM clustering contains 2 weighting exponents in the objective function, m1 and m2, and there exits 2 objective functions to be minimized as (7).
The upper and lower membership degrees can be represented as (8) and (9).
The average of
Gaussian function is applied in premise membership function for T-S fuzzy model, and the upper and lower Gaussian membership functions are defined as (10) and (11).
In (10):
In (11):
New objective function
An improved fuzzy partitions by modifying the objective function of FCM was proposed in [25], which is shown as (12).
The first part of (12) is the objective function of FCM and weighting exponent m is 2. More generally, Zhu extended this fuzzy partitions and proposed a general objective function as (13) [26].
The degree u ik can be updated by (14).
To ensure
If α= 0, then the new objective function (13) is the same as (2), so the objective function of FCM is a special case of this new objective function.
In this paper, this new objective function is applied in FCRM clustering, and next, the convergence of this new FCRM clustering algorithm will be discussed.
Define,
Firstly,
Proof:
This theorem can be proved by the derivation of uik and calculate the H(
According to (17), H(
Since m > 1 and
Secondary, in order to minimum the function J
L
, the regression function parameters
Similar to Bezdek’s proof in [27], by the above information:
According to (16), if type-1 FCRM extends to interval type-2 FCRM, the upper and lower fuzzy membership degrees can be represented as (20) and (21).
In (20) and (21):
Li proposed a hyper-plane-shaped membership function as (23), which is more suitable to describe fuzzy clustering algorithm whose sample distance is point to plane, like FCRM clustering.
η is a constant that tunes the fuzzy membership grades, and d
ik
(
Based on upper and lower membership degrees, the upper and lower hyperplanes parameters can be calculated by (25) and (26) [23].
In (25):
In (26):
So the corresponding upper and lower hyper-plane-shaped membership functions can be defined as (27) and (28).
In (27):
In (28):
In consequent parameters identification procedure, the membership degree of sample data (
Giving N pairs of input-output sample data ( Set identification parameters: number of clustering c, 2 weighting exponents m1 and m2, adjustment coefficients η and α, maximum iterations r
max
and threshold ɛ. Initialize Update the upper and lower fuzzy degrees Update the upper and lower hyperplanes parameters Update For each training data (
All N pairs of training data can be represented as (31):
Where:
The optimal estimation of
In this section, some nonlinear systems are tested to verify the identification effects of proposed algorithm. The mean square error (MSE) and root mean square error (RMSE) will be used as the performance indexes when compared with other methods, which are defined as:
One-dimensional (1-D) sin function
In this example, a 1-D sin function is defined as follow:
The input of fuzzy model is x, and the fuzzy model is
The sample data points are uniformly sampled between [–40, 0)∪ (0,40], and the identification parameters are c = 4, m1 = 2 m2 = 7, η= 50, α= 0.99. Table 1 lists the performance comparison with other methods and Table 2. shows the premise and consequent identification parameters for one-dimensional sin function.
Performance comparison of different identification methods (One-dimensional sin function)
Performance comparison of different identification methods (One-dimensional sin function)
Identification parameters of One-dimensional sin function
In this example, a double inputs and single output (DISO) nonlinear function is tested, which is shown as follow:
The input of fuzzy model is x1 and x2, and the fuzzy model is
The 50 training data are collected from [29], and the identification parameters are c = 6, m1 = 2, m2 = 7, η= 20, α= 0.99. Figure 1 shows the output curves of T-S fuzzy model and original system and Fig. 2 shows the errors curve.

Fuzzy model and original output curves (DISO function).

Fuzzy model and original output errors curve (DISO function).
Table 3 lists the performance comparison with other methods and Table 4 shows the premise and consequent identification parameters for DISO function.
Performance comparison of different identification methods (DISO function)
Identification parameters of DISO function (c = 6)
In this example, the Mackey-Glass (M-G) chaotic time series are tested. The chaos equation is shown as follow:
The initial value is set to 1.2 from x(0) to x(17). The 1000 pairs chaotic data are selected between t = 124 and t = 1123. The fuzzy model can be expressed as:
The first 800 pairs of data are used to build T-S fuzzy model, the left 200 pairs data are used to test the fuzzy system. And the identification parameters are c = 2, m1 = 1.5, m2 = 7, η= 20, α= 0.99. Figure 3 shows the output curves of T-S fuzzy model and original system and Fig. 4 shows the errors curve.

Fuzzy model and original output curves (M-G chaos data).

Fuzzy model and original output errors curve (M-G chaos data).
Table 5 lists the performance comparison with other methods and Table 6 shows the premise and consequent identification parameters for M-G chaos data.
Performance comparison of different identification methods (M-G chaos data)
Identification parameters of M-G chaos data
In this example, a gas-furnace process contains 296 Box-Jenkins data sets are tested. The input of this actual system is the gas flow rate and the output of this system is carbon dioxide concentration.
The fuzzy model can be expressed as:
The first 148 pairs of data are used to build T-S fuzzy model, the left 148 pairs of data are used to test the fuzzy system. And the identification parameters are c = 2, m1 = 2, m2 = 7, η= 5, α= 0.99. Figure 5 shows the output curves of T-S fuzzy model and original system and Fig. 6 shows the errors curve.

Fuzzy model and original output curves (Box-Jenkins data).

Fuzzy model and original output errors curve (Box-Jenkins data).
Table 7 lists the performance comparison with other methods when the total 296 pairs of data are used for training and Table 8 shows the premise and consequent identification parameters for Box-Jenkins data.
Performance comparison of different identification methods (Box-Jenkins data: 296 training data)
Identification parameters of Box-Jenkins data(c = 4)
Table 9 lists the performance comparison with other methods when the 148 pairs of data are used for training and the left 148 pairs of data are used for testing.
Performance comparison of different identification methods (Box-Jenkins data: 148 training data)
In this example, the bed temperature data of 2×50 MW circulating fluidized bed (CFB) plant was used. The bed temperature of CFB plant is mainly effected by coal feed quantity, primary air flow and secondary air flow. In this case, 1800 pairs of operation data are collected and the sample time is 1 s. The fuzzy model can be expressed as:

Bed temperature and coal feed quantity.

Primary air flow and secondary air flow.
And the identification parameters are c = 2, m1 = 2, m2 = 7, η= 20, α= 0.99. Figure 9 shows the output curves of T-S fuzzy model and original system and Fig. 10 shows the errors curve.

Fuzzy model and original output curves (CFB data).

Fuzzy model and original output errors curve (CFB data).
Table 10 lists the performance comparison with other methods when the total 1400 pairs of data are used for training and Table 11 shows the premise and consequent identification parameters for CFB data.
Performance comparison of different identification methods (CFB data: 1400 training data)
Identification parameters of CFB data
For type-2 fuzzy logic system can handling uncertainties more robust than type-1, in this paper, an improved IT2-FCRM clustering is proposed for T-S fuzzy model identification. The improved IT2-FCRM clustering algorithm introduced a novel membership constraint function and utilized a new objective function, which makes the IT2-FCRM less sensitive to outliers or noisy data and can classify sample data more accurate. As the distance of FCRM is point to plane, compared with the tradition hyper-sphere-shaped membership function, the proposed IT2-FCRM applies a hyper-plane-shaped membership function, which is more suitable to describe consequent linear expressions of T-S fuzzy model.
The premise parameters of T-S fuzzy model are upper and lower hyperplanes executed by proposed IT2-FCRM clustering algorithm. Then the upper and lower hyperplanes will define hyper-plane-shaped type-2 Gaussian membership functions and the consequent parameters of T-S fuzzy model are determined by OLS algorithm. The simulation results show the effectiveness of proposed T-S fuzzy model identification algorithm.
The future works mainly focus the following 2 aspects: There are 4 parameters to determine the identification accuracy of T-S fuzzy model, that are m1, m2, η and α. In this paper, these parameters are trialed, and the next step is to optimize these 4 parameters by some intelligent optimization algorithms. As the development of theories and applications of general type-2 fuzzy sets, another research will focus on general type-2 fuzzy c-regression model clustering applied in T-S fuzzy model identification.
