Abstract
The uncertainty is an important attribute about data that can arise from different sources including randomness and fuzziness, therefore in uncertain environments, especially, in modeling, planning, decision-making, and control under uncertainty, most data available contain some degree of fuzziness, randomness, or both, and at the same time, some of this data may be anomalous (outliers). In this regard, the new fuzzy regression approaches by creating a functional relationship between response and explanatory variables can provide efficient tools to explanation, prediction and possibly control of randomness, fuzziness, and outliers in the data obtained from uncertain environments.
In the present study, we propose a new two-stage fuzzy linear regression model based on a new interval type-2 (IT2) fuzzy least absolute deviation (FLAD) method so that regression coefficients and dependent variables are trapezoidal IT2 fuzzy numbers and independent variables are crisp. In the first stage, to estimate the IT2 fuzzy regression coefficients and provide an initial model (by original dataset), we introduce two new distance measures for comparison of IT2 fuzzy numbers and propose a novel framework for solving fuzzy mathematical programming problems. In the second stage, we introduce a new procedure to determine the mild and extreme fuzzy outlier cutoffs and apply them to remove the outliers, and then provide the final model based on a clean dataset. Furthermore, to evaluate the performance of the proposed methodology, we introduce and employ suitable goodness of fit indices. Finally, to illustrate the theoretical results of the proposed method and explain how it can be used to derive the regression model with IT2 trapezoidal fuzzy data, as well as compare the performance of the proposed model with some well-known models using training data designed by Tanaka et al. [55], we provide two numerical examples.
Keywords
Introduction
Fuzziness and randomness are two important sources of uncertainty. Randomness describes the uncertainty of event occurrence and is determined through experimentation or over time. Fuzziness describes event ambiguity and measures the degree that an event occurs and nothing to do with time or experimentation, which is mainly caused by human subjective judgment, incomplete knowledge, and noise [1, 35]. In the many mathematical programming problems (LP, NLP, and GP, etc.) and decision-making environments, data available contain fuzziness, randomness, or both, and at the same time, some of this data is outliers (anomalous). On the other hand, analysis of such data is one of the most important challenges for decision-makers and researchers. For this reason, many techniques have been introduced for modeling and analyzing such data. In this regard, classical regression techniques have useful, efficient, and complete tools to analyzing the randomness and outliers in crisp data [11, 32]. In addition, fuzzy regression approaches via combining the statistical techniques with fuzzy logic approaches provide us efficient tools to analyze and management of randomness, fuzziness, and outliers in data [3, 11]. It is worth noting that in the fuzzy regression analysis, the deviations (difference between estimated and observed responses values) are caused by system fuzziness and considered as fuzzy errors while in conventional regression analysis the deviations are caused by observations inconsistency and considered as random errors [9–11].
The fuzzy regression models depending on whether each of its main components (i.e., input-output and parameters) are fuzzy or crisp, fall into one of the three categories as following [12, 13]: Crisp input and fuzzy output with fuzzy parameters, Fuzzy input-output with crisp parameters, Fuzzy input-output with fuzzy parameters
According to [12, 54] the fuzzy regression models from the point of view the techniques used to estimate the parameters, can be broadly classified are classified into four main categories as follows: Fuzzy least absolute deviations method, Fuzzy least squares deviations method, Machine learning technique, Heuristic approaches
In this part of the study, to identify potentially fruitful future directions, we present a comprehensive literature review based on the categories mentioned above.
Fuzzy least absolute deviations approach
Tanaka et al. [55] based on the fuzzy least absolute deviations approach (FLAD) in 1982 first introduced the fuzzy regression method. In this method, the fuzzy regression problem is formulated as a mathematical programming problem in which minimized the entire fuzziness of the model via minimizing the total spread of fuzzy regression coefficients [11]. Simplicity in programming and computations is one of the main advantages of this approach while sensitivity to outliers, and provides too wide ranges in estimates is one of its main disadvantages. This technique has been investigated and improved by numerous authors [9–11, 54]. For instance, Shakouri et al. [50] proposed a fuzzy linear regression model with absolute errors and optimum uncertainty. They expanded the model based on the idea of reducing the distance between observed and predicted response variables. In the same year, [choy and buckly] suggested two methods to obtain the fuzzy least absolute deviation estimators for two common fuzzy linear regression models. Taheri and Kelkinnama [54] introduced a new metric based on absolute errors to develop fuzzy regression analysis. They determined the crisp parameters and fuzzy errors at the same time by minimizing the total of distances between the estimated and observed fuzzy responses. Zeng et al. [59] applied the fuzzy least absolute deviations approach to estimate the fuzzy regression coefficients with fuzzy parameters, fuzzy dependent variable, and crisp independent variables. Li et al. [37] introduced a fuzzy linear regression model based on the fuzzy least absolute error estimator with the trapezoidal fuzzy numbers.
Some other fuzzy regression models have been reported based on the fuzzy mathematical programming approaches. For instance, Hojati et al. [24] proposed a goal programming approach to estimate the parameters of a fuzzy regression model via estimating the predicted band with the help of the endpoints of the input intervals. Hassanpour et al. [21] suggested a goal programming (GP) approach to estimate crisp regression coefficients with fuzzy input-output data that minimizes total absolute errors between central points of estimated and observed output. Rafiei and Ghoreyshi [47] introduced a bi-objective GP approach for a fuzzy regression model with fuzzy input-output and fuzzy parameters in two stages. Hosseinzade et al. [28] presented a weighted GP method to estimate the type-2 fuzzy regression coefficients via minimizes the total absolute errors between the estimated and observed outputs for fuzzy output and crisp inputs. To handle the outlier problem, they proposed an omission approach and examined the behavior of value changes in the objective function when observations are omitted. In another study [29], they introduced a GP approach to estimate the fuzzy regression coefficients when the parameters and input-output variables are quasi type-2 fuzzy numbers.
Fuzzy least-squares deviations approach
The fuzzy least-squares deviations method (FLSD), is based on the ordinary least squares procedure which was initially proposed by Diamond in 1988 [15]. In the FLSD approach, the purpose is to minimize some square error between the estimated and observed responses [12, 13]. From the robustness point of view, in the fuzzy linear/nonlinear regression model when there exist some abnormal values or outliers in the dataset, the fuzzy least square estimator method is not robust and very sensitive to outliers [51, 58] while the least absolute estimator method is robust and less sensitive than least square to outliers. It is worth noting that sometimes even a single observation may dramatically influence the value of the parameter estimates.
In recent years, the FLSD approach has been improved and extended by a large number of researchers. For instance, D’urso and Massari [13] proposed a weighted fuzzy least median square and fuzzy least squares estimation to estimate the fuzzy regression coefficients. They also introduced some theoretical properties and a suitable generalization of the determination coefficient to investigate the goodness of fit the proposed model. Chachi et al. [7] introduced a squared distance based on intervals to estimate the fuzzy regression coefficients when the data available of input-output variables are triangular fuzzy numbers. Rabie et al. [45, 46], proposed a fuzzy least-squares technique to fuzzy regression modeling with interval-valued fuzzy input-output and parameters and showed how to obtain the fuzzy coefficients of the regression model. Mashinchi et al. [39] used a FLSD approach and applied a two-step method to detect the outlier in a dataset first, and then fit a model on the clean dataset. Shafaei Bajestan et al. [49] introduced a fuzzy nonlinear regression problem with IT2 fuzzy input-output based on the fuzzy least-squares method.
Machine learning methods
The generalization capability of fuzzy regression models has been enhanced via combining machine-learning approachs, such as support vector machines, neural networks, and evolutionary algorithms. Support vector machines are supervised learning models with associated learning algorithms that are employed for function estimation problems and pattern recognition [12]. In recent years, this approach improved by several papers. For instance, Hong and Hwang [25] suggested the convex optimization technique of fuzzy multiple linear and non-linear regression approaches using support vector machines, and develop support vector fuzzy regression machines. In another study [26] they introduced an estimation of fuzzy multiple non-linear regression approaches with fuzzy input-output data using a least-squares support vector machine.
The evolutionary algorithms and genetic algorithms are mostly used in optimization problems [12]. Some other fuzzy regression models have been reported based on evolutionary algorithms. For example, Gkountakou and Papadopoulos [20] introduced a linear regression model via an adaptive neuro-fuzzy inference system to estimate the compressive cement strength. Buckley and Hayashi [6] proposed a fuzzy genetic algorithm to solve fuzzy optimization problems, such as fuzzy linear regression analysis. Chan et al. [8] introduced an intelligent fuzzy regression method via an evolutionary algorithm and estimate the parameter of the model based on fuzzy least-squares method. Ezadi and Allahviranloo [17] proposed a numerical solution to fuzzy regression modeling based on z-numbers by the improved neural network.
Heuristic methods
Heuristic methods include some novel approaches, which combine the fuzzy least absolutes; fuzzy least squares, or Machine learning techniques [12]. In this context, Wei and Watada [56] introduced an expected regression technique based on credibility theory with type-2 fuzzy input-output data. Yang and Yin [58] presented a fuzzy varying coefficient regression model with robust analysis after deleting the abnormal data. Shakouri and Nadimi [51] introduced a basic curve by conventional regression to apply the fuzzy number concept and recognize the outlier data, and used the linguistic variables with conventional regression to determining outliers data. Hesamian and Akbari [22] proposed a fuzzy semi-parametric quantile regression methodology with fuzzy predictors, crisp coefficients, and fuzzy responses. In another study [23], they introduced an extension for the classical partial univariate regression model with crisp inputs and fuzzy output. Arefi [3] investigated a new approach to the problem of quantile regression modeling based on the fuzzy response variable and fuzzy parameters. They introduced a loss function between fuzzy numbers and then fit a quantile regression model between the available data based on the proposed loss function. Additionally, there are numerous papers in the framework of robust fuzzy regression analysis, fuzzy cluster-wise regression method, fuzzy regression approach with time series, fuzzy regression based on entropy, fuzzy regression analysis with Monte Carlo methods, fuzzy regression analysis based on bootstrap techniques, etc.
As mention above, in the last three decades, many studies have been performed in the field of fuzzy regression models and these models in terms of fitting accuracy, efficiency, robustness, sensitivity to outliers, and simplicity of calculations have been compared with each other in different ways. However, most of these studies have been done based on type-1 fuzzy sets [12] while type-1 fuzzy sets, despite the acceptable performance, to model data with high linguistic uncertainty, aggregation the knowledge of several experts in decision-making scenarios have not favorable abilities. In this regard, trapezoidal interval type-2 fuzzy numbers using linguistic rating systems have a favorable ability to overcome most of the above-mentioned shortcomings [18, 57].
With the review of the articles published in the context of the fuzzy linear regression and so far as the authors know, there has not been any research on fuzzy linear regression technique based on perfectly normal trapezoidal IT2 fuzzy data. Accordingly, the main objective of this paper is to introduce a fuzzy linear regression model when dependent variable and coefficients are perfectly normal trapezoidal IT2 fuzzy numbers and independent variables are crisp. The novelty and main contributions of this study relative to the other existing fuzzy linear regression approaches, summarized as follows: In this study, we introduce a new method to compute the multiplication and division of trapezoidal IT2 FNs. To provide a new framework for estimating the IT2 fuzzy regression coefficients, we propose a new technique for using the unrestricted in sign IT2 fuzzy variables in crisp/fuzzy mathematical programming problems, and then highlights the differences and similarities between crisp/fuzzy LP, crisp/fuzzy GP, and crisp/fuzzy NLP. We propose a new interval type-2 fuzzy least absolute deviations approach to build an interval type-2 fuzzy regression model. We propose some new distance and similarity measures based on a new metric on the space of trapezoidal IT2 FSs, which are easy to handle and interpret, and reflect the intuitive meaning of distance and similarity between two trapezoidal IT2 FSs. In addition, we prove that the suggested measures satisfy the properties of the axiomatic definition for distance measures. In this study, we explain the process of identification of outliers in the fuzzy dataset, suggest a new procedure to determine the mild and extreme fuzzy outlier cutoffs, and then propose a novel procedure to the parameter estimation of the case-deletion linear regression model in interval type-2 fuzzy environment. In the present study, to evaluate the trapezoidal IT2 fuzzy regression models, we introduce some new performance methods based on new similarity and distance measures. We also propose a new leave-one-out cross-validation approach to investigate the goodness of fit of the proposed model
To do the above, in Section 2, first, the concepts of IT2 fuzzy sets, trapezoidal IT2 fuzzy sets, arithmetic operations between these sets is introduced in detail and then a new class of distance measures to rank and computing the difference between two trapezoidal IT2 fuzzy sets is provided. Next, introduce a new methodology for using the unrestricted in sign IT2 fuzzy variables in fuzzy/crisp mathematical programming approaches, and then highlight the similarities and differences between fuzzy/crisp GP, fuzzy/crisp LP, fuzzy/crisp NL. In Section 3, introduce and investigate a new fuzzy least absolute deviations approach to build a new fuzzy regression model, for crisp input, IT2 fuzzy output, and parameters. To estimate the parameters of the model, the proposed methodology is carried out in five steps. In Section 4, we introduce a new procedure to determine the mild and extreme fuzzy outlier cutoffs and apply them to remove the outliers (through the initial model), then final the model by clean dataset is provided. In addition, to investigate the performance of the proposed model, we provide a new leave-one-out cross-validation methodology then to illustrate the theoretical results of the proposed approach and its performances, employ a numerical example. Finally, we present a comprehensive conclusion in Section 5.
Preliminaries and fundamental concepts
Throughout this paper, we use ⊗, ⊕ , ⊖ and ø to denote the product, addition, subtraction, and division between fuzzy numbers, respectively, and Ω to denote the universe of discourse. In addition, we use R to denote the real numbers,

The left and right spread of trapezoidal T1 FN

The FOU and secondary membership grade of
According to [19, 57] A type-2 fuzzy set (also called a general type-2 fuzzy set) is a collection of infinite type-1 fuzzy sets into a single fuzzy set. The membership grade of each element of a type-2 fuzzy set is a type-1 fuzzy set with support bounded by the interval [0, 1], which provides an additional degree of freedom for handling linguistic uncertainties in the modeling of uncertainty. A type-2 fuzzy set denoted as
An IT2 FS such as
Trapezoidal interval type-2 fuzzy numbers
Trapezoidal IT2 fuzzy numbers are a kind of important fuzzy number, which can express linguistic assessments by transforming into numerical variables objectively.

Trapezoidal IT2 FN

Trapezoidal IT2 FN
For symmetric normalized fuzzy number
The addition operation:
The subtraction operation:
The multiplication with a scalar w:
The multiplication operation:
The division operation:
The n th root:
The power operation:
With no doubt, fuzzy distance measures play a vital role to differentiate between two fuzzy sets or objects and so far, many fuzzy distance measures have been introduced including the Hamming distance, the Euclidean distance, and the Minkowski distance. Based on the existing literature, fuzzy Hamming distance measures have been extensively studied due to their applications in many fields, e.g. regression analysis, risk analysis, data mining, medical diagnosis, signal processing, pattern recognition, decision-making, network comparison, and the like. On the other hand, in the trapezoidal IT2 fuzzy regression analysis, one of the most important aspects of the analysis of randomness and fuzziness is the usage of a convenient distance on the family of trapezoidal IT2 fuzzy sets. In this regard, the Hamming distance is a convenient distance for the family of trapezoidal IT2 fuzzy sets, which can easily provide the intuitive meaning of the difference between the estimated and observed trapezoidal IT2 fuzzy responses in IT2 fuzzy regression analysis [4, 52].
It is obvious that if
To deal with the maximization or minimization of the mathematical programming approaches and compare with fuzzy numbers, some authors propose a ranking method to compute the signed distance from fuzzy number to y-axis [4, 16]. In this part of study, a new signed distance measure to rank the trapezoidal IT2 FNs is defined. The signed distance between
For case w ≥ 0:
For case w < 0 {% }
Free variables in mathematical programming problems
The linear programming formulation restricts the sign of unknown variables to be nonnegative, and the nonnegative condition is a constraint (requirement) in solution procedures of these problems. Free (unrestricted in sign) variables are variables that have no upper or lower bound [41]. For each free variable ɛ
i
, we use the decomposition into the negative and the positive parts as follows:
In the many fuzzy mathematical programming problems, the constraints and/or parameters are included non-probabilistic uncertainty and defined by the knowledge of several experts. To aggregation and convert the different subjective opinions and knowledge of several experts in decision-making scenarios into crisp, the trapezoidal IT2 FSs have favorable abilities. In this regard, several useful linguistic rating systems have been introduced [19, 53]. Figure 2.5 depicted aggregation of the different opinions of several experts in decision-making scenarios.

Aggregation the different opinions of several experts’ in decision-making scenarios.
The general standard form of crisp LP problem is defined as
In addition, according to the equality between trapezoidal IT2 FNs given in Definition 2.4, the fuzzy equality (8.1)
If there exist any
Let
Suppose that
Now, suppose that
In this section, based on the metric suggested in section 2, we will introduce and investigate a new fuzzy least absolute deviations approach to build a new fuzzy regression model, for crisp input and IT2 fuzzy output and parameters. Consider
In the model (10),
In model (11),
To estimate the parameters of proposed model (in an optimal way), the proposed methodology is carried out in the six steps:
By using (1), (2) and, substituting
Using Definition 2.5, and substituting
Using (16), we have:
Using (11) and substituting (17) and (18) in
Minimizing the
By substitution (22–25) in problem (21), the IT2 fuzzy mathematical programming problem (21) can be transformed into the GP problem as following:
If If
According to above relations,
The Theorem 3.2 shows that problem (27) yields the exact regression parameters and estimated outputs if the given trapezoidal IT2 fuzzy input-output data satisfy in a trapezoidal IT2 fuzzy linear model.
The optimal solution of problem (27) contains The observed responses (
We know that
Clearly, they satisfy in the constraints (27.3)–(27.6). Without loss of generality, for simplicity, we assume that
By substituting (28) and (29) in (27.1) and (27.2), we have:
The Equations (30) and (31) show that
From (32) and (33) we conclude that
Consider that there are n sample data in the Ω (the universe of discourse), denoted by
From (34.1) and (34.2), it can be concluded that in a fuzzy linear regression based on FLAD, when increase the number of data and explanatory variables, the number of constraints increases rapidly and this leads to computational complexity in fuzzy regression. Also with removing or adding an explanatory variable, all constraints must be readjusted. Therefore, the number of data and explanatory variables might be limited in fuzzy regression analysis based on FLAD based on LP. Of course, with available linear programming solvers (e.g. MATLAB, LINGO, MPL, Maximal Software), the size of linear programming problems is not important.
Frequently in fuzzy/crisp mathematical programming problems, the data set contains some elements that are outliers [26]. Among the data set, an observation that has a bigger residual value than the majority of data is called an outlier or abnormal data [39, 51]. Outliers may occur due to unwanted errors during the collection, recording, or transcribing of the data. [13, 33] In the dataset may have various types of outliers including Outlier is in the spreads and/or centers of the output variables, Outlier is in the spreads and/or centers of the input variables, Outlier is in the spreads and/or centers of the input-output variables,
Outlier is a very important aspect of data analysis and the existence of outliers in a set of experimental observations may dramatically influence the value of the parameter estimates, therefore detection and elimination of outliers have a significant effect on overall results. Of course, the presence of the outliers in the dataset is not always a disadvantage and drawback. For example, in data mining, the detection of outliers in the dataset is more interesting than the detection of inliers; while in clustering problems, outliers supposed as noise observations that should be deleted to performing a more reliable clustering [39, 51].
The first and third quartile are insensitive to outlying data values and the difference between them (i.e., third quartile first quartile=Q3 - Q1) is known as the interquartile range. The interquartile range (IQR) is often used to find outliers in dataset. In this study we use the interquartile range (IQR) to define the mild and extreme outlier cutoffs as [Q1 - 1.5 × IQR, Q3 + 1.5 × IQR] and [Q1 - 3 × IQR, Q3 + 3 × IQR]], respectively. Therefore the data values that lie below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR are called mild outliers and marked with a circle “o” the data values that lie below Q1 - 3 × IQR or above Q3 + 3 × IQR are called extreme outliers and marked with an asterisk “∗”
Performance measures
To evaluate the fuzzy regression models, several performance measures have been proposed by authors [3, 59]. In this section, we introduce some new performance measures to evaluate the IT2 fuzzy regression models.
Fuzzy similarity measure
The fuzzy similarity measure proposed by [48] is used to determine the similarity of two fuzzy sets [10, 54]. In a fuzzy regression model, a larger value of fuzzy similarity measure indicates better fuzzy model performance.
To determine the similarity degree between the estimated and observed IT2 fuzzy responses, we introduce new mean similarity measures as
where,
Fuzzy distance measure
This measure is used to determine the absolute differences between the estimated and observed fuzzy responses. A smaller mean of fuzzy distances indicates smaller estimation errors, and thus higher prediction performance [10, 48]. A good fuzzy regression model will produce a smaller mean fuzzy absolute error (mean fuzzy distance). To determine the distance between the estimated and observed fuzzy responses, we introduce new mean absolute error (ME) and mean relative erro (MRD) measures as
where
where
Goodness-of-fit criteria
To investigate the goodness of fit of the proposed model, a leave-one-out cross-validation (LOOCV) approach is employed, which uses a single observation from the whole data sets as the validation data set, and the remaining observations as the training data sets. The validation process is repeated until each observation of the whole data sets is used as follows steps [31]:
Numerical examples
In this section, we provide two numerical examples. The first example illustrates the theoretical results of the proposed approach and explain how the proposed method is applicable to derive the regression model for trapezoidal interval type-2 fuzzy data. In the second example, we compare the performance of proposed model with some existing well-known models using the training data designed by Tanaka et al. [55]
Values of input (x
j
) and output the response variable (
)
Values of input (x
j
) and output the response variable (
Values of observed and predicted response variable and the distance between them
To illustrate the theoretical results of the proposed two-stage method and explain how to use it to derive a model with such data, in the first stage, to provide an initial model (before removing outliers from the original dataset), we estimate the IT2 fuzzy regression coefficients (
Figure 4.1 depicts the observed and fitted values of

Plot of the observed and fitted values of
Results of LOOCV approach
In the second stage, in order to identify and remove the outliers, the predicted response values (
After identifying and eliminating the outliers, using clean dataset given in the second and third columns of Table 3, the trapezoidal IT2F regression coefficients are obtained as
The trapezoidal IT2 fuzzy regression model (36) can be applied to predictive of
To investigate the performance of the proposed model, a leave-one-out cross-validation (LOOCV) approach is employed and the results presented in the fifth to seventh columns of Table 3.
Dataset for Example 2
Comparison of fuzzy regression models and their capability indices in Example 2
Since the value of Md(-) = 1.298 is near to the value of Md = 1.224 and R
E
= 0.0604 become smaller, the predictive ability of the proposed model is convenient and it is an optimal model. The obtained results of Table 3 are illustrated in Fig. 4.2. In view of the fact that the trapezoidal IT2 fuzzy number cannot be represented as a crisp in a coordinate system, in Fig. 4.2, we use the line segments to represent the values of

The line segments to represent of
The results of fitting the proposed model and aforementioned models as well as their performances based on the mean similarity degree (MS) between the estimated and observed fuzzy responses and men relative error (MRE) between the estimated and observed fuzzy responses are presented in Table 5. From the fourth column of Table 5, the mean relative error obtained by our model is 0.151, which is less than the mean relative error of Chen-Nien model [10], Diamond model [15], Kim-Bishu model [34], Li model [37], Shafaei Bajestani model [49], Tanaka model [55] and Zeng model [59]. The mean relative error obtained by Hosseinzadeh model [27] is 0.152, which is close to the amount obtained from the proposed model. On the other hand, by referring to the fifth column of Table 5, the mean similarity degree (MS) between the estimated and observed fuzzy responses for the proposed method is 0.891, which clearly larger than the MS obtained from the models mentioned in the second column of Table 5. The results presented in the fourth and fifth columns of Table 5 show that, while the indexes for some models (especially model Hosseinzadeh et al.) are close to that of the proposed model, the proposed model has better performance concerning MS and MRE.
The uncertainty can arise from different sources including randomness and fuzziness, which is related to chance, incomplete information, human subjective judgment, noise, etc. In this context, Fuzzy regression analysis (through creating a functional relationship between response and explanatory variables), provide efficient and effective tools for the explanation, prediction, and possibly control of randomness and fuzziness in the fuzzy dataset. In recent years, many studies in the field of fuzzy regression models have been performed and these models in terms of fitting accuracy, efficiency, robustness, sensitivity to outliers, and simplicity of calculations have been compared with each other in different ways, but most of these studies have been done based on type-1 fuzzy sets. Type-1 fuzzy sets, despite the acceptable performance, to handle the high-level linguistic uncertainty, aggregation the knowledge of several experts in decision-making scenarios have not favorable abilities. Given that trapezoidal IT2 fuzzy numbers using linguistic rating systems have a favorable ability to overcome the shortcomings of type-1 fuzzy sets, we introduced a new fuzzy linear regression model with trapezoidal IT2 fuzzy output, trapezoidal IT2 fuzzy parameters, and crisp inputs based on the fuzzy least absolute estimator. Some features and advantage of the present study are as Some algebraic operators on type-1 fuzzy numbers are extended to perfectly normal trapezoidal IT2 fuzzy numbers then two new distance measures have been introduced. For use of the unrestricted in sign IT2 fuzzy variables in fuzzy/crisp mathematical programming approaches a new approach have been proposed. To estimate the IT2 fuzzy regression coefficients a novel framework is provided. In order to achieve this objective, the similarities and differences between fuzzy/crisp GP, fuzzy/crisp LP, and fuzzy/crisp NLP problems have been highlighted then answered the question how each one can lead to the other. To determine the mild and extreme fuzzy outlier cutoffs, first, a new procedure has been introduced and then it is employed to detect and remove the fuzzy outliers in the original dataset. To evaluate and compare the performance of trapezoidal IT2 fuzzy regression models, some new performance measures have been introduced.
Though the experimental results represent that the suggested algorithm has better efficiency and performance, but the complexity of computation is a potential problem. So, in the future study, we will further study the algorithm optimization of fuzzy least absolute deviation, and provide better solutions for the fuzzy least absolute deviation applications which contain a large amount of data. In the future, we will also introduce new IT2F distance measures based on extended Euclidean distance and a new IT2F least square deviation procedure to estimate the IT2 fuzzy regression coefficients.
Meanwhile, the extension of our suggested procedure to formulate the fuzzy linear regression approach with interval-valued Atanassov’s intuitionistic fuzzy sets based on fuzzy least absolute estimator and/or fuzzy least squares estimator can be potential topics for future work.
