Abstract
This study attempts to develop a quasi type-2 fuzzy regression model in full quasi type-2 fuzzy environment. To estimate the parameters of the proposed model, first, a weighted distance between quasi type-2 fuzzy numbers is defined based on L1-norm. Then some approximations for multiplication of two quasi type-2 fuzzy numbers (QT2FNs) are introduced. The problem of estimation of the parameters relies on a non-linear optimization problem, which is converted to a linear optimization problem. The method can handel both symmetric and asymmetric data. Two real world examples demonstrate the feasibility and efficiency of the proposed method. The predictive performance of the model is examined by cross-validation, and a similarity measure is used to compare our model with a similar model.
Introduction
Fuzzy set theory is a tool to handle the real world problems in which the information are uncertain and ambiguous. In such situations, a membership function is used which assigns a real number in [0, 1] for a membership degree of an element is a set. In many real-world problems the exact membership degree may not be identified. A fuzzy set with uncertainty in the membership function is called type-2 fuzzy set. Such sets can be used in situations where there are uncertainty in both membership degree and in the shape of the membership function.
Some examples of such situations have been identified in [17], e.g., when a measurement is corrupted with nonstationary noise and the mathematical description of nonstationary is unknown; when features in a pattern recognition application have statistical attributes that are nonstationary and the mathematical descriptions of the nonstationarities are unknown; when membership values are extracted from a group of experts using questionnaries, etc. Studies on Type-2 fuzzy sets briefy summarized as follows.
Karnik and Mendel [14], discussed in set theoretic operations for type-2 sets, properties of membership grades of type-2 sets, and type-2 relations and other compositions, and cartesian products under minimum and product t-norms. Mendel and John [17] defined a new representation theorem of type-2 fuzzy sets and introduced formulas for the union, intersection, and complement of type-2 fuzzy sets by using this new representation. Grzegorzewski [5] introduced some distances between interval-valued fuzzy sets based on Hausdorff metric. Hong and Lee [11] discussed some algebraic properties and a distance measure for interval-valued fuzzy numbers. Mendel and Liu [18] defined a quasi type-2 fuzzy logic system as a restricted special case of a type-2 fuzzy logic system represented by its α-planes.
Research on fuzzy regression analysis began by Celmin’s [2], Diamond [3] and Tanaka et al. [25], and was continued by some authors e.g. Wang and Tsaur [26]; Kao and Chyu [13]; Nasrabadi et al. [21]; Yao and Yu [28]; Hassanpour et al. [7–10]; Kelkinnama and Taheri [15]. Recently, Rabiei et al. [23] proposed a Least-squares approach to regression modeling for interval-valued fuzzy data. The authors formulated a weighted goal programming model to calculate the quasi type-2 fuzzy coefficients of regression, when the input data are crisp and the output data are QT2FNs [12].
When there exist some outliers in the data set, it is usually preferred to use a robust approach. Traditionally, the regression analysis based on the method of least absolutes deviations was used as a robust method with respect to the least squares method in modeling a data set in which there were some outliers [4]. For example, Hassanpour et al. [8] proposed a goal programming approach based on least absolutes deviations to fuzzy regression modeling for triangular fuzzy input-output data. The effect of outliers is also compared with some methods in numerical examples. Their results showed that goal programming approach is less sensitive than the compared methods. In the present paper we introduce a goal programming model to estimate the regression coefficients when the regression coefficients, as well as the independent variables (inputs) and the response variable (output), are quasi type-2 fuzzy numbers. The rest of the paper is organized as follows: Section 2 contains some preliminaries of quasi type-2 fuzzy set theory. Also, a distance between quasi type-2 fuzzy numbers is introduced. In Section 3, we explain our proposed method. In Section 4, two real world data sets are used to illustrate how the proposed method is implemented. The predictive performance of the model is examined by cross-validation in Section 5. Finally, a brief conclusion is given in Section 6.
Preliminaries
Some concepts in type-2 fuzzy set theory are reviewed in this section [6, 17]. Also, a weighted distance between two QT2FNs is introduced.
Type-2 fuzzy sets
A type-2 fuzzy set (T2FS), denoted by , in a crisp set X is characterised by a type-2 membership function , i.e.,
where fx′ (u) : J x → [0, 1] is a function that assigns a secondary grade to each primary grade u for some fixed x. The VS is a type-1 fuzzy set (T1FS) in [0, 1].
The Footprint Of Uncertainty (FOU) is derived from the union of all primary memberships. The FOU is bounded by two membership functions, a lower one, and an upper one, . The FOU can be described in terms of its upper and lower membership functions which themselves are T1FSs:
The principal membership function (PrMF) defined as the union of all the primary memberships having secondary grades equal to 1
An interval type-2 fuzzy set (IT2FS) is defined as a T2FS whose all secondary grades are of unity i.e. for all x, f x (u) =1. An IT2FS can be completely determined by its FOU given by Equation (1).
Let be a T2FS satisfying the following propositions: [6] A1: All the VSs of the T2FS are fuzzy numbers, i.e.
A2: All the VSs of the T2FS are piecewise functions of the same type (e.g. linear).
The first assumption assures that the T2FS contains an FOU and a Pr. This fact is clear since all the VSs are normal which makes it clear that for all the domain values there is at least one primary grade with secondary grade at unity. The second property assures that only a set parameters are needed to define this kind of T2FSs which is directly related to FOU and Pr. These assumptions allow this kind of T2FS be completely determined using its FOU and Pr, just like a T1FS which can be completely determined by its core and support, based on certain assumptions.
The Extension Principle has been used by Zadeh [29] and Mizumoto and Tanaka [19] to derive the intersection and union of T2FSs. Karnik and Mendel [14] provide an in-depth investigation on these operations.
The triangular QT2FN, can also be denoted by its left and right spreads as follows:
is called symmetric if α1 = β1, α = β and α2 = β2. In such a case is denoted by .
An ordinary triangular fuzzy number can be considered as a degenerated QT2FN in which all of the spreads of , and are equal and their secondary membership functions are zero. Also, a real number can be considered as a degenerated QT2FN whose spreads and secondary membership functions are zero. The following formulas for addition of two triangular QT2FNs and multiplication of a triangular QT2FN by a scaler are drawn from extension principle of Hamrawi et al. [6].
For multiplication of two QT2FNs, there is not an analytically exact formula. The multiplication of two T2FNs based on extension principle is not necessarily a T2FN. Several multiplications are defined between type-1 fuzzy numbers, that their extension to T2FNs and using in field of fuzzy regression is needed more research. The following approximation can be considered in a similar way to that of Hassanpour et al. [8].
But, using a similar argument for fuzzy sets, the righthand side relations hold (see [9], Proposition 2.7), and the proof is complete. □
The observed data in our study are assumed to be QT2FN. So, we try to close the membership functions of observed and estimated responses from fuzzy linear regression model by closing their corresponding parameters. Since QT2FNs are completely characterized by the parameters of their FOU and Pr, closing the parameteres of two QTFNs is enough (in fact necessary and sufficient) to close their membership functions, which is the purpose of this paper. To do this, based on the distance defined between two triangular fuzzy numbers by Hassanpour et al. v, we propose the following weighted distance between QT2FNs in which different weights (w i ) are used to show different importance of the parameters [12].
Among advantages of using d w , we can refer to its ease in both theory and application. Furthermore, the special formula of the proposed distance helps us to convert the nonlinear programming model proposed to calculate the regression coefficients, to a linear one. The advantages of this conversion are that solving linear programming (LP) problems is very easy, and their exact solution can be obtained by the Simplex method. However, most of available algorithms for solving nonlinear programming problems yield approximate solutions. In addition, depending on the relative importance of the parameters of quasi type-2 fuzzy numbers, we can assign different values to the weights w i and obtain different solutions. For example, one can set w i = 1 for all i, if the parameters of Pr and FOU have the same importance, and can set w5 = w6 = w7 = 2, w1 = w2 = w3 = w4 = 1 if the importance of Pr is twice the importance of FOU.
and .
.
.
The proposed regression model
Consider a set of QT2F data , in which ) is the value of ith independent variable () and is the corresponding value of dependent variable in the ith case. The purpose of quasi type-2 fuzzy linear regression (QT2FLR) is to fit a quasi type-2 fuzzy linear model to the given QT2F data. This model can be considered as follows:
In model (5), the coefficients are assumed to be QT2FNs. These parameters must be estimated such that for i = 1, …, n, the estimated responses ,
In this paper, the QT2FLR coefficients are supposed to be non-symmetric QT2FN. Also, given inputs ;,;,;,, i = 1, …, n, j = 1, …, p, are supposed to be positive non-symmetric QT2FNs, by a simple translation of all data, if necessary. In addition, suppose that the observed responses are non-symmetric QT2FNs , i = 1, …, n.
To calculate the regression coefficients, it is necessary to evaluate the product of two QT2FNs and in (6). It is clear that the multiplication A ⊗ B in (2.1) depends on the sign of QT2FNs. Therefore, one must formulate different models for different states of the sign of regression coefficients. We attempt to formulate a non-linear programming (NLP) model for estimating the regression coefficients which are independent of the sign of regression coefficients. Another problem with using the multiplication (2.1) is that it is proposed only for two positive and/or negative QT2FNs. So, it cannot be used for QT2FNs whose supports contain both positive and negative real numbers. Therefore, we approximate the multiplication of two QT2FNs by some (in fact by infinite) QT2FNs. Then, the best approximation is chosen among them, to minimize a suitable function.
Note that a real number A has infinite representations in the form of A = A′ - A″, where A′ and A″ are nonnegative real numbers.
Suppose and are two QT2FNs where is unrestricted in sign and is a positive QT2FN. Set in which
By using Proposition 2.2, approximation (3), and addition (3) we have:
Let for j = 1, …, p, where
for k = c, α1, β1, α, β, α2, β2. In fact, n
ik
and p
ik
(k ∈ {c, α1, β1, α, β, α2, β2}) are the negative and positive deviations between the parameters of the ith estimated and observed response, respectively. It can be easily seen that
Similar relations are hold true for the other deviation variables. By using the above deviation variables, the model (10) converts to the following GP model:
It is clear that if for all i, we can set for each i and j. Accordingly, the constraints (14) and (15) will be equivalent, and one of them (in fact n constraints) can be removed. Similarly, if () for all i, we can set () for each i and j. Therefore, the constraints (12) and (13) ((16) and (17)) will be be equivalent, and the constraints (13) ((17)) can be removed, then we obtain a smaller model. In addition, one can remove the constraints (18) and solve the obtained LP model by the simplex method.
Since, this model finds the best approximation for by minimizing the absolute deviations between the papameters of the observed and approximated responces, the problem always has feasible solutions (This is an advantage of goal programming problems).
The algorithm for constructing the QT2FLR model is as follows:
The complexity of simplex method for solving linear programming is exponential. One shortcoming of LP-based approaches is that the LP models which are used for estimating the regression coefficients have a large number of constraints, especially for a large number of observations. So the number of constraints increasing rapidly when the number of observations increases. Of course, with available LP solvers (e.g. MATLAB, LINGO and MPL), the size of LP is not important. Of course, there are other methods such as interior point methods which has polynomial time complexity.
In this section, we introduce two concepts for evaluating the proposed regression model.
Leave-one-out cross validation
To investigate the performance of the model, we apply an index based on the cross validation method [27] to examine the predictive ability of the models. To this end, for i = 1, …, n, the ith observation is left out from the data set, while the remaining observations are used to develop a quasi type-2 fuzzy regression model. Then the obtained model is used to predict the response value of the ith observation (denoted by ). Finally, to compare the ith observed response and the predicted value , we calculate the mean of distances d w between y i and which we call it MDC.
Now, by using the above indices, the relative error of the estimated responses can be calculated as
Goodness of fit of the IT2F regression model
To evaluate the goodness of fit of the interval type-2 fuzzy linear regression (IT2FLR) model, we use a weighted similarity measure between two IT2FNs ([1]), and it is used to compare our model with similar model.
In this section, we provide two applied examples to explain how the proposed method is applicable to derive regression model for quasi type-2 fuzzy observations.
One of the classical problems in soil science is the measurement of physical, chemical, and biological soil properties. The problem results from the difficulty, time and cost of direct measurements. Pedomodels, which have become a popular topic in soil science and environmental research, are predictive functions of certain soil properties based on other easily or cheaply measured properties [22]. Here, two pedomodels including one and two independent variables are studied to develop the relationships between different chemical and physical soil properties by means of quasi type-2 fuzzy regression technique. Based on a study in a part of Silakhor plain (situated in the province of Lorestan, west of Iran), different soil physical and chemical properties were measured using standard procedures. But, due to some impreciseness in experimental environment, the observed data were reported as IT2FNs [24]. Since examples containing quasi type-2 fuzzy data did not exist in previous works, we changed its output data to triangular QT2FNs.
Pedomodel of ESP-SAR
We first wish to provide a relationship between exchangeable sodium percentage (ESP), as the dependent variable, and sodium absorption ratio (SAR), as an independent variable. The exchange sodium percentage, ESP, governs the source/sink phenomenon for ionic constituents, i.e., sodium, as a contaminant in sodic soils, is calculated from the ratio of exchangeable sodium, Na x , to cation exchangeable capacity, CEC. All these soil parameters, measured on soil colloidal surface, are time consuming and costly. In this case, ESP is considered as cost and time variable, therefore the need for less expensive indirect measurement is emphasized.
In this study, for each choice of and , the regression model for the data of Table 1 is as follows:
In the above model, non-symmetric QT2FNs and are cation sodium absorption ratio (SAR) and exchange sodium percentage (ESP), respectively.
According to the proposed method, the regression coefficients are obtained as
The second model provides a relationship between cation exchange capacity (CEC), as a function of two soil variables namely percentage of sand content (SAND) and organic matter content (OM). In the soil, organic matter can enhance the CEC, while the sand content has negative effect on the cation exchange capacity [20]. In this case, CEC is considered as cost and time variable, therefore the need for less expensive indirect measurement is emphasized.
In this study, for each choice of and , j = 1, 2 the regression model for the data of Table 3 is as follows
According to the proposed method, the estimated coefficients are obtained as
In the above model, asymmetric QT2FNs , and are cation exchange capacity (CEC), percentage of sand content (SAND) and organic matter content (OM), respectively.
The above QT2F regression model can be used to predict the CEC of a new case. For example, if for a new case,
In order to evaluate the predictive ability of the above models, the MDC is calculated for each model. The value of the MDC for the ESP-SAR regression was obtained to be 19.9, and the value of MD* is 14.25. Note that the relative error between MDC and MD* is RE = 0.28. The value of the MDC for the CEC-OM-SAND regression was obtained to be 24.55, which is close to the value of MD*, i.e. 20.51. Note that the relative error between MDC and MD* is RE = 0.16.
It is appeared that predictive ability to the MSL-DW model is much better than the CEC-OM-SAND model, and predictive ability to the CEC-SAND-OM model is better than the ESP-SAR model.
Recently, Rabiei et al. [23] used a distance on the space of interval type-2 fuzzy numbers (which determinated using its FOU) and proposed a least-squares method (LS) to obtain coefficients of the proposed model. In soil science examples, if input and output variables are considered to be interval-valued fuzzy numbers, we use the distance introduced in Eq. (19). In the case of interval type-2 fuzzy input-output data, the similarity measures between observed and predicted values in two regression model are shown in Table 2.
In soil science examples, the obtained results from comparison the maen of similarity mesaures, shows that the is less than , therefore, the observed and estimated values in proposed approach is more fit than the least squares model provided by rabiei et al. [24].
In some real systems, especially environmental systems, the relation between variables can be investigated in quasi type-2 fuzzy environment. In this work, we proposed a weighted goal programming approach to estimate the coefficients of quasi type-2 fuzzy linear regression model with quasi type-2 fuzzy input-output data and coefficients. The nonlinear programming model which has been presented to calculate the regression coefficient is converted to a goal programming model and then to a linear programmnig model easily. The advantage of this conversion is that linear programming problems can be solved exactly by available algorithms. Whereas, the available algorithms for solving nonlinear programming problems often give approximate solutions. The applicability of the proposed approach was investigated by using two real data sets in soil science. The predict ability of the model evaluated by cross-validation method. The proposed model is general and can be applied to any field of research. In soil science examples, the obtained results from comparison the maen of similarity mesaures, showed that, in the proposed goal programming approach, the observed and estimated values are more consistent than least squares model provided by rabiei et al.
