Generalized dynamic attribute reduction based on similarity relation of intuitionistic fuzzy rough set

Abstract

In view of the characteristics with big data, high feature dimension, and dynamic for a large-scale intuitionistic fuzzy information systems, this paper integrates intuitionistic fuzzy rough sets and generalized dynamic sampling theory, proposes a generalized attribute reduction algorithm based on similarity relation of intuitionistic fuzzy rough sets and dynamic reduction. It uses dynamic reduction sampling theory to divide a big data set into small data sets and relative positive domain cardinality instead of dependency degree as decision-making condition, and obtains reduction attributes of big intuitionistic fuzzy decision information systems, and achieves the goal of extracting key features and fault diagnosis. The innovation of this paper is that it integrates generalized dynamic reduction and intuitionistic fuzzy rough set, and solves the problem of big data set which cannot be solved by intuitionistic fuzzy rough set. Taking an actual data as an example, the scientificity, rationality and effectiveness of the algorithm are verified from the aspects of stability, diagnostic accuracy, optimization ability and time complexity. Compared with similar algorithms, the advantages of the proposed algorithm for big data processing are confirmed.

Keywords

Intuitionistic fuzzy rough set similarity relation relative positive domain generalized dynamic reduction large fuzzy decision information system attribute reduction

1 Introduction

Large-scale fuzzy information system has the characteristics of large data size, high feature dimension and heterogeneity, and so on. Traditional feature selection algorithms are inefficient or even impossible for attribute reduction of such information systems. Rough set is an effective way to deal with attribute reduction in discrete information systems [1], but when dealing with attribute reduction in continuous or fuzzy information systems, it is necessary to discretize the attributes firstly, which will lose part of the information. Fuzzy rough set is a generalization and extension of rough set. It has good effect in dealing with attribute reduction of continuous or fuzzy information systems [2], especially intuitionistic fuzzy rough set [3]. However, these methods need to compute the discernibility matrix [4], fuzzy similarity relation matrix [5], or the transitive closure and the closeness matrix [5, 6]. In addition, the computational ability and efficiency are limited using variable precision fuzzy rough sets [7], hybrid fuzzy rough sets [8] or extended fuzzy rough sets [9 –11] for large fuzzy information system. How to solve attribute reduction and feature extraction of large-scale fuzzy information system with large amount of data, high feature dimension, dynamic and heterogeneous characteristics is an important issue in current big data processing.

In order to improve the efficiency of attribute reduction and extracting feature fault parameters and the adaptability of large data processing, based on the existing similarity relation of intuitionistic fuzzy rough set and the attribute reduction algorithm of dynamic reduction theory, multiple performance acceleration is realized at the data and method level [12]. At the data layer, generalized dynamic reduction theory is used to transform large-scale fuzzy information system into a series of small-scale fuzzy information systems. At the method layer, the principles of attribute reduction based on similarity and dissimilarity, relative positive domain and dependence degree of intuitionistic fuzzy rough set are used to maximize the efficiency of attribute reduction.

2 Generalized dynamic reduction theory

2.1 Presentation of dynamic reduction

As one of the important research contents of data mining and data reduction technology, attribute reduction refers to the deletion of redundant attributes and their attribute values in the decision table, but its prerequisite is to maintain the dependency relationship between condition attributes and decision attributes of the original decision table unchanged. At present, the reduction algorithm based on rough set theory can be divided into two kinds according to whether there is heuristic information or not. One is blind method, which does not use any heuristic information to obtain a reduction, but the result is unsatisfactory. The other is heuristic algorithm [13, 14], whose idea is to start with the core of the decision table and regard it as a reduction, then add attributes according to certain heuristic information, that is the importance of attributes, until the reduction of the decision table is obtained. For example, there are attribute reduction algorithms based on mutual information, based on discernibility matrix, and based on Pawlak attribute importance, etc. But the problem of these algorithms is that they can only reduce the small-scale compatible decision information system. When the decision information system has a large size of data, the generalization ability of the decision rules obtained by these algorithms is limited. At the same time, the decision information system contains a lot of noise data, so it is an urgent problem to get stability a reduction.

Aim to solve the above problems, Jan. G. Bazan has proposed dynamic reduction algorithms [15 –17]. These algorithms are sampling large and complex decision information systems many times, and transform the reduction of complex systems into the intersection of several reductions of sub-decision information systems, and obtain more stable reduction. They have good adaptability to variable data sets, and provide an effective way to solve the attribute reduction for large-scale fuzzy information systems. The facts show that the reduction obtained by dynamic reduction has high stability. It has a good performance in big data processing, adaptability to variable data sets, stability of reductions, and anti-noise ability [18].

2.2 Determination of f-family range

–Determination of lower limit for f family

For attribute reduction of incremental data, each additional data are regarded as a sub-table, the original data are taken as a master table, and the parent table can also be regarded as a sub-table set too. Regardless of whether data increasing with time, attribute reduction can use sampling method to obtain sub-tables. If the original table has |U| data, and a new table should select about |U| /|F| data size each time. F is called the F family of decision information system DS. That is, for all decision subsystems P(DS) of DS, there are F ⊆ P (DS) and F ≠ Φ . |F| represents the capacity of the F family. According to the study of the F family capacity in document [19], it can get.

$| F | ⩾ t_{α}^{2} \cdot P_{G} (R) \cdot (1 - P_{G} (R)) / {(Δ MLE (P_{G} (R)))}^{2}$ (1)

Where, P_G (R) is the probability of dynamic reduction R appearing in a sub-table of the original decision table. MLE (P_G (R)) is the maximum likelihood estimation of P_G (R). ΔMLE (P_G (R)) is the maximum acceptable error of MLE (P_G (R)). t_α represents the interval function related to the allowable maximum error, and it must satisfy, $1 - α = \sqrt{2 / π} \int_{- t_{α}}^{t_{α}} exp {(- t / 2)}^{2} dt$ (2)

When P_G (R) =1/4, P_G (R) • (1 - P_G (R)) obtains the maximum value 1/4, and then the minimum value of |F| is obtained, $| F | ⩾ t_{α}^{2} / - (4 {(Δ MLE (P_{G} (R)))}^{2})$ (3)

Where, the value of t_α can be querying through the normal distribution table.

–Determination of upper limit of f family

If too many sub-tables are extracted, the computational complexity will increase and the time complexity of the algorithm will be improved. From Bernonlli’s idea, when the decision sub-tables extracted from the F family is large enough, the continued extraction of sub-tables has little effect on the stability of reduction. Therefore, there must be an upper limit for the capacity of F family. From the stability of reduction, Bazan, et al. [19], had studied the upper limit of F family capacity.

Given the initial decision system S = (U, C ∪ D), where U is an universe, C is a non-null conditional attribute set, D is a non-null decision attribute set, and S′ = (U_S′, C ∪ D) is a full sample decision system. Where, there is U_S′ =∪ { U_B|B ∈ F }, and U_S′ ⊆ U. The similarity index β₁ of decision classification ability between decision sub-system B and full sample decision system S′ is as follows:

$β_{1} = | Pos (U_{B}, C, D) | / | Pos (U_{S^{'}}, C, D) | ⩾ | U_{B} | / | U_{S^{'}} |$ (4)

Similarly, the decision-making ability index β₂ between full sample decision-making system S′ and initial decision-making system S,

$β_{2} = | Pos (U_{S^{'}}, C, D) | / | Pos (U_{S}, C, D) | ⩾ | U_{S^{'}} | / | U_{S} |$ (5)

The stability parameter of F family relative positive domain is,

$\begin{matrix} {SC}_{S}^{Pos} (F, B) \\ = | {B \in F, β_{1} ⩾ | U_{B} | / U_{S^{'}}, β_{2} ⩾ | U_{S^{'}} | / U_{S}} | / | F | \end{matrix}$ (6)

Obviously, ${SC}_{S}^{Pos} (F, B) \in [0, 1]$ . However, when ${SC}_{S}^{Pos} (F, B) \in [0, 0.5]$ , it shows that the extracted subsystems can not meet the requirements of similarity and have no research value. ${SC}_{S}^{Pos} (F, B) = 1$ shows that all the decision-making sub-systems extracted have similar decision-making classification ability with the full sample decision-making system S′, and that all the decision-making sub-systems extracted have similar decision-making classification ability with the initial decision-making system S. ${SC}_{S}^{Pos} (F, B) = 0$ shows that all the decision-making sub-systems extracted do not have similar decision-making classification ability with the full sample decision-making system S′, and that the full sample decision-making system S′ does not have the same decision-making classification ability with the initial decision-making system S, and the similarity decision indexes of all decision sub-tables do not meet the requirements.

If ${SC}_{S}^{Pos} (F, B) > λ$ (λ is a threshold), then there is

$| F | = | {B \in F, β_{1} ⩾ | U_{B} | / U_{S^{'}}, β_{2} ⩾ | U_{S^{'}} | / U_{S}} | / λ$ (7)

When the range of F family capacity is determined, it can replace the capacity cardinality of F family specified by experts in probabilistic extraction method. When the number of F family reaches this range, there is no need to extract sub-tables. Therefore, a dynamic reduction method without prior knowledge is obtained. Document [20] demonstrates that the number of effective sub-tables extracted by this method is less.

–F Sub-table extraction strategy

What kind of sampling strategy adopted to extract the subfamily F is the premise of dynamic reduction. The quality of sampling strategy will also directly affect the accuracy of reduction results. When decision tables contain massive data or are constantly changing, the used sampling strategies are different. When a decision table contains a big data information or the decision table is changing, direct reduction of the initial decision table will produce a great reduction error, so the results of reduction can not be used as a guide for drawing up decision-making rules, otherwise the decision-making will fail because of the instability of data. Therefore, when decision table contains massive data information or the decision table is changing, the decision rules obtained by static reduction can not describe the characteristics of the object to be studied very well, but the initial decision table is processed for extracting the sub-table family, that is F family, as the object of reduction, and then the reductions of the sub-tables are intersected, the relatively stable reduction is finally obtained. The characteristics of the initial decision table can be well described by these reductions that exist in most sub-tables. At present, there are two methods for F family sub-tables extraction: trace extraction and probability sampling [20]. Trace extraction method does not take into account the principle of determining the stability coefficient of the decision sub-table, can not extract representative reductions, and the accuracy of dynamic reductions is low [20]. Based on this, this paper chooses the probability sampling method. After the total number of sub-tables and the number of sub-tables in F family are determined, the decision-making sub-tables are extracted according to the following strategies. In order to make the probability of a reduction of F family sub-tables more than 1/2 and exist simultaneously in reductions of the initial decision table, the number of objects in the decision sub-tables that needs to be randomly extracted is greater than 50% of the original decision table. The steps of the probabilistic sampling method are described as follows:

Algorithm 1 Probability Sampling Method

Step 1, N/5 decision sub-tables are randomly selected from initial decision table which size is 90%,80%,70%,60% and 50% of the size of initial decision table.

Step 2, to compute the reductions of these N decision sub-tables.

Step 3, to select reductions whose stability coefficient is not less than a fixed threshold from these reductions.

Step 4, to intersect these reductions obtained in step 3 and get the final dynamic reduction.

Before sampling, this sampling strategy calculates the minimum number of decision sub-tables that need to be extracted. It takes accuracy as the first consideration factor, and then determines the size of a single decision sub-table as effective as possible. They consider reducing the reduction error.

–F-dynamic reduct

Definition 1 (F-dynamic reduction) For a given decision-making information system DT = (U, C ∪ d), U is an universe, and C is the non-null conditional attribute set, and d is a non-null decision attribute. B = (U_B, C ∪ d) is an arbitrary decision information sub-system of decision information system DT, and where U_B ⊆ U. F is a decision information sub-system family of DT. $DR (DT, F) = RED (DT, d) \cap \dots \underset{B \in F}{\cap} RED (B, d)$ (8)

Where, any element in the system is called F-dynamic reduction of decision information system DT.

The final reduction of F-dynamic reduction is calculated by the intersection of all reductions of the initial decision information system and all reductions of decision information sub-system randomly extracted from it. Therefore, the reduction is the most stable and the generalization ability of the decision rules is the strongest. However, this method is too strict for the reduction of F family dynamic reductions, which is dependent on the initial decision information system. Document [21] uses dynamic reduction method to process UCI data sets. The results show that many data sets can not get F family dynamic reduction. Therefore, it need to generalize the dynamic reduction concept, and introduce (F–λ)-dynamic reduction, improve the ability of F family dynamic reduction.

Definition 2 ((F–λ) -dynamic reduce) For a given decision information system DT = (U, C ∪ d), where U is an universe, and C is the non-null conditional attribute set, and d is a non-null decision attribute. B = (U_B, C ∪ d) is an arbitrary decision information sub-system for decision information system DT, where U_B ⊆ U. F is a decision information sub-system family of decision information system DT.

$\begin{matrix} {DR}_{λ} (DT, F) = {R \in RED (DT, d) : \\ \frac{| {B \in F : R \in RED (B, d)} |}{| F |} ⩾ λ} \end{matrix}$ (9)

Where, any element is called the (F-λ)-dynamic reduction of decision information system DT, and λ ∈ [0, 1] is called the precision coefficient of (F–λ)-dynamic reduction. | { B ∈ F : R ∈ RED (B, d) } |/|F| is called the stability coefficient of (F–λ)-dynamic reduction R relative to F family.

(F–λ)-dynamic reduction has the properties as follows,

If F ={ DT }, then DR (DT, F) = RED (DT, d).

If λ₂ ⩾ λ₁, then DR_{λ
₂} (DT, F) ⊆ DR_{λ
₁} (DT, F).

DR₁ (DT, F) = DR (DT, F).

(F–λ)-dynamic reduction of decision information system is an extension of F family dynamic reduction, and reduces the dependence on initial decision information system in F family dynamic reduction. However, when the data increases, the decision rules generated by the new decision information system will be greatly different from those generated by the original decision information system, which requires further expansion of dynamic reduction to improve the adaptability and generalization ability of reduction.

Definition 3 (F-generalized dynamic reduction) For a given decision information system DT = (U, C ∪ d), U is an universe, and C is the non-null conditional attribute set, and d is a non-null decision attribute. B = (U_B, C ∪ d) is an arbitrary decision information sub-system for decision information system DT, where U_B ⊆ U. F is a decision-making information sub-system family of decision information system DT. $GDR (DT, F) = \underset{B \in F}{\cap} RED (B, d)$ (10)

Where, any element is called F-generalized dynamic reduction of decision information system DT.

When the number of objects and attributes in the decision information system is very large, the complexity and difficulty computing the reduction of the decision system will become immeasurable. F-family generalized dynamic reduction is no longer to reduce the initial decision information system, but to intersect the reductions of all decision sub-systems randomly extracted. In a sense, generalized dynamic reduction is more extensive than dynamic reduction.

Definition 4 ((F–λ)-generalized dynamic reduction) For a given decision information system DT = (U, C ∪ d), U is an universe, and C is the non-null conditional attribute set, and d is a non-null decision attribute. B = (U_B, C ∪ d) is an arbitrary decision information sub-system for decision information system DT, where U_B ⊆ U. F is a decision information sub-system family of decision information system DT.

$\begin{matrix} {GDR}_{λ} (DT, F) \\ = {R \subseteq C : \frac{{B \in F : R \in RED (B, d)}}{| F |} ⩾ λ} \end{matrix}$ (11)

Where, any element is called (F–λ)-generalized dynamic reduction in the decision information system DT. Where λ ∈ [0, 1] is the precision coefficients of (F–λ)-generalized dynamic reduction, and | { B ∈ F : R ∈ RED (B, d) } |/|F| is called the stability coefficients of (F–λ)-generalized dynamic reduction R relative to F family.

(F–λ)-generalized dynamic reduction has the properties as follows,

DR (DT, F) ⊆ GRD (DT, F).

If λ₂ ⩾ λ₁, then GDR_{λ
₂} (DT, F) ⊆ GDR_{λ
₁} (DT, F).

DR₁ (DT, F) ⊆ GDR (DT, F).

If DT ∈ F, then DR (DT, F) = GDR (DT, F).

3 Intuitionistic fuzzy rough set theory

Intuitionistic fuzzy rough set (IFRS) is the integration product of intuitionistic fuzzy set and rough set theory, and takes inclusion degree as a bridge between intuitionistic fuzzy set and rough set, and introduces variable precision concept. Therefore, IFRS has a certain fault-tolerant ability and can effectively deal with mixed data types in information systems, including symbolic data, continuous value data and fuzzy data. When IFRS are used for knowledge acquisition, firstly the intuitionistic fuzzy similarity relation should be established, and can be constructed by similarity measure between objects. Secondly, because the intuitionistic fuzzy relation must satisfy the two-dimensional constraint of the intuitionistic fuzzy set, i.e., the membership degree and the non-membership degree are not negative and the sum in the interval [0,1], and the similarity is a mapping x × y → [0, 1], which can only express the degree of similarity between two elements, that is, membership degree, but it can not express the degree of non-similarity between two elements, i.e., non-membership degree.

Definition 5 (Intuitionistic Fuzzy Information System) Let S = (U, C, V_C, D, V_D, F) is defined as an intuitionistic fuzzy information system. Where U = {x₁, x₂, ⋯ , x_n} represents the set of objects. The conditional attribute set C is composed of symbolic attribute sets C^s, C^r and intuitionistic fuzzy attribute set C^if. The corresponding codomain of conditional attributes is V = V^s ∪ V^r ∪ V^if.V_D is the codomain of decision attribute D. F : U × C ∪ D → V_C ∪ V_D is an information function, which assigns an attribute value to each attribute of each object, such as ∀R ∈ C, F (x_i, R) ∈ V_C.

In intuitionistic fuzzy information system S = (U, C, V_C, D, V_D, F), ∀a ∈ C, if a is a continuous attribute or an intuitionistic fuzzy attribute, then an intuitionistic fuzzy similarity relation R_a can be defined for a, and it can be simplified as R.

Definition 6 (Intuitionistic Fuzzy Decision Table) Let IFIS = (U, A, V, F), then attribute set A is divided into a conditional attribute set C and a decision attribute set D. IFIS = (U, C ∪ D, V, F) is called intuitionistic fuzzy decision table or intuitionistic fuzzy decision information system. When the condition attribute set C is an intuitionistic fuzzy attribute set and the decision attribute set D is an ordinary discrete attribute set, IFIS = (U, C ∪ D, V, F) is called intuitionistic fuzzy conditional information system. When the condition attribute set C is an ordinary discrete attribute set, the decision attribute set D is an intuitionistic fuzzy attribute set, IFIS = (U, C ∪ D, V, F) is called intuitionistic fuzzy objective information system.

Define 7 (Intuitionistic Index) For each fuzzy subset of the attribute set A, π_A (x) =1 - μ_A (x) - γ_A (x) is called the intuitionistic index. It indicates the hesitancy degree of x relative to A. For general fuzzy subsets, the intuitive index is 0. For intuitive fuzzy subsets, the intuitive index is not 0.

Definition 8 Let R∈IFR(U×U), ∀x,y,z∈U, then

If R(x,y) = 1 _L , then R is self-reflexive.

If R(x,y) = R(y,x), then R is symmetrical.

If $R_{\land}^{\lor} \circ_{ρ}^{β} R ⩽_{L} R$ , then R is transitive. ° is a composition relation, ∧ and ∨ are Zadeh operators.λ and ρ are general fuzzy t-module or s-module.

If T (R (x, z) , R (z, y)) ⩽ _LR (x, y), then T is transitive.

If $\sup_{z \in U} T_{M} R (x, z), R (z, y)) ⩽_{L} R (x, y)$ , then R is sup-min transitive.

Definition 9 (Intuitionistic Fuzzy Equivalence Relation and Similarity Relation) Let R∈IFR(U×U). If R satisfies self-reflexivity, transitivity and symmetry, then R is called intuitionistic fuzzy equivalence relation on U. If R satisfies self-reflexivity and symmetry, then R is called intuitionistic fuzzy similarity relation on U.

The properties of R determine the properties of knowledge obtained after the universe U is divided. If R is a general equivalence relation, the knowledge obtained after U/R is a clear equivalence class. If R is an intuitionistic fuzzy equivalence relation, the knowledge obtained after U/R is an intuitionistic fuzzy equivalence class, i.e., each class is an intuitionistic fuzzy set. If R is an intuitionistic fuzzy similarity relation, the knowledge obtained after U/R is an intuitionistic fuzzy similarity class. If R is a general intuitionistic fuzzy relation, then R divides the universe U into several intuitionistic fuzzy sets.

Definition 10 (Dissimilarity Degree d_R = (d_ij) _n×n between Continuous Attributes of Intuitionistic Fuzzy Rough Sets) When attribute a is a continuous value attribute in intuitionistic fuzzy information systems, the dissimilarity degree d_ij between x_i and x_j is d (x_i, x_j) = f (|a (x_i) - a (x_j) |). It is denoted by d (x_i, x_j) = d_ij ∈ [0, 1]. Where, the function f will transform |a (x_i) - a (x_j) | into interval [0, 1]. It satisfies,

f (0) =0, f (∞) =1, f (•) ∈ [0, 1].

x ⩾ y ⇒ f (x) ⩾ f (y).

Definition 11 (Dissimilarity Degree d_R = (d_ij) _n×n between Intuitionistic Fuzzy Attributes of Intuitionistic Fuzzy Rough Sets) When attribute a is an intuitionistic fuzzy attribute in intuitionistic fuzzy information systems, a is described by a set of intuitionistic fuzzy linguistic values. They denoted by IFL = {R₁, R₂, ⋯ , R_h}. Each object value under IFL is corresponding to an intuitionistic fuzzy set on IFL, which is expressed as IF (x_i) = {(μ_{x
_i} (R_k) , (γ_{x
_i} (R_k))/R_k|k = 1, 2, ⋯ , h}. Where, μ_{x
_i} (R_k) is the membership degree, γ_{x
_i} (R_k) is the non-membership degree, and π_{x
_i} (R_k) = 1 - μ_{x
_i} (R_k) - γ_{x
_i} (R_k) is hesitation degree. It is denoted by $τ_{x_{i}} (R_{k}) = μ_{x_{i}} (R_{k}) + \frac{1}{2} π_{x_{i}} (R_{k})$ . The dissimilarity degree between object x_i and object x_j can be measured by dissimilarity degree between intuitionistic fuzzy set IF (x_i) and IF (x_j) corresponding to x_i and x_j. It is measured by,

$\begin{matrix} d (x_{i}, x_{j}) = \frac{1}{3 h} \sum_{k = 1}^{h} (| μ_{x_{i}} (R_{k}) - μ_{x_{j}} (R_{k}) | \\ + | γ_{x_{i}} (R_{k}) - γ_{x_{j}} (R_{k}) | + | π_{x_{i}} (R_{k}) - π_{x_{j}} (R_{k}) | \\ + | τ_{x_{i}} (R_{k}) - τ_{x_{j}} (R_{k}) |) \end{matrix}$ (12)

It is easy to be proved that Equation (12) satisfies the requirement of intuitionistic fuzzy dissimilarity definition [22].

Definition 12 (Similarity Degree s_R = (s_ij) _n×n between Intuitionistic Fuzzy Attributes of Intuitionistic Fuzzy Rough Sets) Similarity degree and dissimilarity degree are two dual concepts. Above, the dissimilarity degree between objects has been obtained. So long as a monotone decreasing function g is defined, the similarity degree can be obtained through the dissimilarity degree. When d (x_i, x_j) ∈ [0, 1], then g (1) ⩽ g (d (x_i, x_j)) ⩽ g (0), that is 0 ⩽ (g (d (x_i, x_j)) - g (1))/(g (0) - g (1)) ⩽1. Thus the similarity degree s (x_i, x_j) between x_i and x_j is obtained as shown in Equation (13). It is easy to be proved that the similarity calculation method Equation (13) satisfies the constraints of intuitionistic fuzzy similarity degree definition. $s (x_{i}, x_{j}) = (g (d (x_{i}, x_{j})) - g (1)) / (g (0) - g (1))$ (13)

The monotone decreasing function g (x) can select 1 - x, e^-x or 1/(x + 1). Where, e.g., g (x) = e^-x can be selected here, so that the similarity degree is shown in Equation (14), $s_{ij} = s (x_{i}, x_{j}) = (e^{- d (x_{i}, x_{j})} - e^{- 1}) / (1 - e^{- 1})$ (14)

Dissimilarity matrix and similarity matrix are fuzzy matrices satisfying self-reflexivity and symmetry.

Intuitionistic fuzzy similarity relations corresponding to continuous attributes or intuitionistic fuzzy attributes is R = (r_ij) _n×n ={ (μ_R (x_i, x_j) , γ_R (x_i, x_j)) |x_i, x_j ∈ U }. The construction problem discussed as below. Firstly, the similarity relation must satisfy the two-dimensional constraints of intuitionistic fuzzy sets. Secondly, when the intuitionistic fuzzy similarity relation by using similarity degree and dissimilarity degree is established, the corresponding relationship between similarity degree and dissimilarity degree should be considered. That is, for ∀x_i, x_j ∈ U, and there is a corresponding relationship between the membership degree μ_R (x_i, x_j) of similarity relation R and similarity degree s (x_i, x_j), and between the non-membership degree γ_R (x_i, x_j) of similarity relation R and dissimilarity d (x_i, x_j). Similarity degree reflects the degree of similarity between x_i and x_j, while dissimilarity degree does not reflect the degree of similarity. Finally, it should be considered the relationship between similarity degree and dissimilarity degree, i.e., 1 - s (x_i, x_j) implies the concept of dissimilarity degree, while 1 - d (x_i, x_j) implies the concept of similarity degree. Based on this, the construction theorems of intuitionistic fuzzy similarity relations are obtained as follow.

Theorem 1. Supposes that U is a non-empty finite universe, ∀x_i, x_j ∈ U, λ ∈ [0, 1], then the binary relation R ={ (μ_R (x_i, x_j) , γ_R (x_i, x_j)) |x_i, x_j ∈ X } on U × U is an intuitionistic fuzzy similarity relation. ${\begin{matrix} μ_{R} (x_{i}, x_{j}) = \frac{s (x_{i}, x_{j}) + 1 - d (x_{i}, x_{j})}{2} \\ γ_{R} (x_{i}, x_{j}) = \frac{d (x_{i}, x_{j}) + λ (1 - s (x_{i}, x_{j}))}{2} \end{matrix}$ (15)

Proof.

–First, the proof proves that binary relation R is intuitionistic fuzzy binary relation. According to Equation (15), it can be obtained that μ_R (x_i, x_j) ∈ [0, 1], γ_R (x_i, x_j) ∈ [0, 1]. Because of λ ∈ [0, 1], therefore $\begin{matrix} μ_{R} (x_{i}, x_{j}) + γ_{R} (x_{i}, x_{j}) \\ = (S (X_{i}, X_{j}) + 1 + λ (1 - S (X_{i}, X_{j}))) / 2 \\ ⩽ (S (X_{i}, X_{j}) + 1 + (1 - S (X_{i}, X_{j}))) / 2 = 1 \end{matrix}$

Therefore, 0 ⩽ μ_R (x_i, x_j) + γ_R (x_i, x_j) ⩽1, R is intuitionistic fuzzy binary relation.

–Second, ∀x_i, x_j ∈ U,

$\begin{matrix} μ_{R} (x_{i}, x_{j}) & = (s (x_{i}, x_{j}) + 1 - d (x_{i}, x_{j})) / 2 \\ = (1 + 1 - 0) / 2 = 1 . \\ γ_{R} (x_{i}, x_{j}) & = (d (x_{i}, x_{j}) + λ (1 - s (x_{i}, x_{j}))) / 2 \\ = (0 + λ (1 - 1)) / 2 = 0 . \end{matrix}$

That is, R is self-reflexivity. $μ_{R} (x_{i}, x_{j}) = μ_{R} (x_{j}, x_{i}) γ_{R} (x_{i}, x_{j}) = γ_{R} (x_{j}, x_{i}) .$

That is, R is symmetry.

So, R is an intuitionistic fuzzy similarity relation.

End.

From theorem 1, it can see that, when λ = 1 the sum of membership and non-membership of intuitionistic fuzzy similarity relation is 1, and then intuitionistic fuzzy similarity relation is transformed into ordinary fuzzy similarity relation. In practical application, λ can be selected according to specific preferences.

In conclusion, the intuitionistic fuzzy similarity relation R = (r_ij) _n×n ={ (μ_R (x_i, x_j) , γ_R (x_i, x_j)) | x_i, x_j ∈ U } corresponding to continuous attributes and intuitionistic fuzzy attributes has been established. ∀x_i ∈ U (x_i) _R represents R similar classes of object x_i, and are intuitionistic fuzzy subsets on universe U. ${(x_{i})}_{R} = (r_{i 1} / x_{1}, r_{i 2} / x_{2}, \dots r_{in} / x_{n})$ (16)

Since the general equivalence relation can be regarded as a special intuitionistic fuzzy relation, it can be treated as the same as the intuitionistic fuzzy similarity relation. For combinations of multiple conditional attributes, their similarity relations and similar classes are defined as follows.

Definition 13 (Composite Similar Classes of Multiple Conditional Attributes) For an intuitionistic fuzzy information system (U, C, V_C, D, V_D, F), ∀a ∈ C, R_a represents the corresponding intuitionistic fuzzy similarity relations for a attribute, a subset of conditional attributes A, A₁, A₂ ⊆ C, and R_A represents the intuitionistic fuzzy similarity relation produced by A, then

$R_{A} = \underset{a \in A}{\cap} R_{a}, (x)_{A} = \underset{a \in A}{\cap} (x)_{a}$ .

R_A₁∪A₂ = R_A
₁ ∩ R_A
₂.

It can be obtained from definition 13, if A₁ ⊆ A₂, then R_{A
₁} ⊇ R_{A
₂}, [x] _{A
₁} ⊇ [x] _{A
₂}. That is, the more attributes there are, the less similar classes there are.

Definition 14 (Relative Positive Domain of Intuitionistic Fuzzy Information System) Let (U, C, V_C, D, V_D, F) be an intuitionistic fuzzy information system, and P ⊆ C, the positive domain P of D is expressed as ${pos}_{P}^{k} (D)$ , ${pos}_{P}^{k} (D) \subseteq U$ . That is, ${pos}_{P}^{k} (D) = \underset{X \in U / D}{\cup} P^{-} X$ (17)

Where, P^-X = {x_i|I ([x_i] _P, X) ₁ ⩾ k}. k is the preset lower approximation threshold, which reflects the fault tolerance of the system. The smaller the threshold k, the stronger the fault tolerance of the system.

Generally, the larger the relative positive domain is, the more perfect the knowledge in the knowledge base is, the more accurate the approximation of the concept is, and the smaller the boundary domain is, and vice versa.

Definition 15 (Attribute Dependency Degree of Intuitionistic Fuzzy Rough Sets) Let P is a subset of non-null conditional attributes in intuitionistic fuzzy information system S = (U, C, V_C, D, V_D, F), R ∈ P. If ${pos}_{P}^{k} (D) = {pos}_{P - {R}}^{k} (D)$ , then R is unnecessary relative to D in P. Otherwise, R is necessary relative to D in P. ∀R ∈ P, if all R are necessary relative to D, then P is called independent relative to D. Otherwise P is called dependent relative to D. The classification ability of intuitionistic fuzzy information systems S = (U, C, V_C, D, V_D, F) can be measured by the approximation (or k) dependency degree of knowledge D relative to knowledge C. It can be expressed as $v_{C}^{k} (D)$ , $v_{C}^{k} (D) \in [0, 1]$ , $v_{C}^{k} (D) = | {pos}_{C} (D) | / | U |$ (18)

Where, |pos_C (D) | represents the cardinality of the relative positive domain pos_C (D). The dependence degree of attribute group represents the inclusion relationship between two attribute groups, that is, when attribute group B depends on attribute group A (denoted as A ⇒ B), if and only if ind (A) ⊆ ind (B).

Definition 16 (Relative Reduction of Intuitionistic Fuzzy Information Systems) Let (U, C, V_C, D, V_D, F) be an intuitionistic fuzzy information system, S ⊆ P ⊆ C. If and only if S is an independent subfamily of P relative to D, and ${pos}_{S}^{k} (D) = {pos}_{P}^{k} (D)$ , then S is called a reduction set of P relative to D. Generally, the reduction set of P relative to D is not unique. The intersection of all reduction sets of P relative to D is called as the core of P relative to D, which is denoted as ${core}_{D}^{k} (P)$ . The minimal dimension reduction in all reduction sets of P relative to D is called as minimal reduction.

In a decision table, different attributes may have different importance. In order to find out the importance of some attributes (or attribute sets), the general method is to remove some attributes from the table, and then examine how classification will change without this attribute. If these attributes are removed, the corresponding classification changes greatly, then the importance of this attribute is high. Conversely, the importance of the attribute is low.

4 Algorithm description

4.1 Algorithm process

The basic idea of dynamic attribute reduction algorithm for large data based on similarity relation of intuitionistic fuzzy attributes is described as follows. Firstly, the continuous attributes of decision tables are intuitionistic fuzzified according to the standardization method in document [23]. Then, the intuitionistic fuzzy decision information system is sampled by dynamic sampling technology, and the intuitionistic fuzzy decision information sub-system family {U₁, U₂, ⋯ , U_n } is obtained. Secondly, for each decision information sub-system U_i, the cardinality of the relative positive domain for each level attribute combination in U_i is taken as the basis of heuristic search, and the attribute with the largest cardinality of the relative positive domain is selected and put into the candidate subset, ... ... . Until the combination of hierarchical attributes satisfies the criterion of the maximizing cardinality of the relative positive domain, and the reduction R_i of the subfamily U_i is obtained. This cycle lasts until all reductions {R₁, R₂, ⋯ R_n } of the families of intuitionistic fuzzy decision information sub-system are computed. Finally, the relative reduction R of intuitionistic fuzzy decision information system is obtained by intersecting these reductions, that is R = R₁ ∩ R₂ ∩ ⋯ ∩ R_n.

Based on the above ideas, the process of generalized dynamic attribute reduction algorithm based on similarity relation of intuitionistic fuzzy rough set is as follow as Fig. 1.

Fig. 1

General dynamic attribute reduction based on fuzzy similarity relation of intuitionistic fuzzy rough set.

4.2 Algorithm description

Generalized dynamic attribute reduction algorithm based on fuzzy similarity relation of intuitionistic fuzzy rough set and is described as follows,

Algorithm 2 Generalized dynamic attribute reduction algorithm based on fuzzy similarity relation of intuitionistic fuzzy rough set

Input: A hybrid information system S = (U, C, V_C, D, V_D, F), threshold k.

Output: An approximate minimum relative reduction R of S = (U, C, V_C, D, V_D, F).

Step 1, to preprocess data and construct fuzzy information system U.

Step 2, to carry out intuitionistic fuzzification for continuous attributes, to obtain the membership degree of each object to each intuitionistic fuzzy attribute, and then obtain the non-membership degree γ (x) using intuitionistic index π (x) and according to the formula γ (x) =1 - μ (x) - π (x). So that the value of each attribute can be expressed by pairs 〈μ (x) , γ (x)〉 and the intuitionistic fuzzy information system $U^{'}$ can be obtained after intuitionistic fuzzification.

Step 3, to determine F family capacity N, that is |F|, of intuitionistic fuzzy information system $U^{'}$ . Where, the determination of stability coefficient A_i of generalized dynamic reduction R relative to F family is very important. Document [21] considers that the system can obtain the most stable generalized reduction when Φ ∈[0.5,1], that is, the sampling coverage of the sub-family must be more than 50%. By using probability sampling method, N1 subtables with coverage 90%,80%,70%,60% and 50% in decision information system are randomly sampled to join F families, where N = 5•N1. Dynamic sampling generates F family of intuitionistic fuzzy decision system $U^{'}$ , i.e. subfamily A₁, A₂, ⋯ , A_c. For each X = {x₁, x₂, ⋯ , x_n}, the step 4 to step 7 of the loop is executed, and the resulting reductions are recorded as x_j = (x_j1, x_j2, ⋯ , x_jm) , j = 1, 2, ⋯ , n.

Step 4, to initialize, R_i =∅, ${pos}_{R_{i}}^{k} (D) = 0$ , C′ = C.

Step 5, according to the value of decision attribute D, the objects in the universe U′ are sorted and the set of equivalent classes {X|X ∈ U/D} is obtained.

Step 6, for all a_i ∈ C′,

Step 6.1, to calculate the intuitionistic fuzzy similarity relation R_{iR_i∪{a_i}} = R_{iR
_i} ∩ R_{i{a_i}} according to Equation (15).

Step 6.2, to calculate the cut set $R_{{iR}_{i} \cup {a_{i}}}^{k}$ of intuitionistic fuzzy similarity relation R_{iR_i∪{a_i}} at the threshold level k.

Step 6.3, to calculate fuzzy similarity classes according to $R_{{iR}_{i} \cup {a_{i}}}^{k}$ .

Step 6.4, to calculate the relative positive domain ${pos}_{R_{i} \cup {a_{i}}}^{k} (D)$ according to Equation (17).

Step 6.5, to calculate the relative positive domain cardinality $| {pos}_{R_{i} \cup {a_{i}}}^{k} (D) |$ .

Step 7, for all a_i ∈ C′, to find the maximum relative positive domain cardinality ${pos}_{max}^{k} = max {| {pos}_{R_{i} \cup {a_{i}}}^{k} (D) | | a_{i} \in C^{'}}$ .

Step 7.1, if there are more than one maximum value of $| {pos}_{R_{i} \cup {a_{i}}}^{k} (D) |$ , then any one conditional attribute a_k with the largest cardinality of relative positive domain is selected as the candidate conditional attribute.

Step 7.2, if ${pos}_{max}^{k} > {pos}_{R_{i}}^{k} (D)$ , then R_i = R_i ∪ {a_k}, pos_{R
_i} (D) = pos_{R_i∪{a_k}} (D), C′ = C - {a_k}, and return to step 6.

Step 7.3, if ${pos}_{max}^{k} ⩽ {pos}_{R_{i}}^{k} (D)$ , the algorithm terminates this loop, the reduction set R_q (q = 1, ⋯ , n) is output.

Step 8, to intersect operation $R = R_{1} \cap R_{2} \cap \dots \cap R_{n}$ , and the reduction R of intuitionistic fuzzy decision information system is obtained.

5 Experimental results and analysis

5.1 Experimental data and parameter setting

In order to verify the principle, the performance of the algorithm is verified by the fault diagnosis case of certain aircraft in ground state. The engine fault diagnosis model can be represented by the following nonlinear equations [24]: $y = f (T_{1} *, N_{1}, N_{2}, Φ pc, P_{M}, B)$ (19)

Where,

y——the output value. It indicates the engine state.

T₁^*——Total inlet temperature.

N₁——Low-pressure rotor speed.

N₂——High-pressure rotor speed.

Φ _pc ——Tail nozzle indication value.

P_M——Lubricating oil pressure.

B——Vibration value of engine crankcast.

Let T = (U, S, R, D) be an engine fault fuzzy information system. Where U = x₁, x₂, ... , x _n is a set of engine fault data, which is abbreviated as universe. R = total inlet temperature, low-pressure rotor speed, high-pressure rotor speed, tail nozzle indication value, lubricating oil pressure, vibration value of engine crankcast is a continuous conditional attribute set, respectively noted as R ={ r₁, r₂, r₃, r₄, r₅, r₆ }. D = engine state y is a discrete decision attribute set. S = R ∪ D is an all attribute set. The intuitionistic fuzzy information system of engine fault is obtained from the data in document [24], as shown in Table 1.

Table 1

An intuitionistic fuzzy information system for engine fault(R_100 %, π_ij = 0.01)

U	r₁	r₂	r₃	r₄	r₅	r₆	D
x ₁	<0.100, 0.890>	<0.838, 0.152>	<0.860, 0.130>	<0.921, 0.069>	<0.737, 0.253>	<0.200, 0.790>	<0,1>
x ₂	<0.375, 0.615>	<1.000,0.000>	<1.000,0.000>	<1.000,0.000>	<1.000,0.000>	<0.200, 0.790>	<0,1>
x ₃	<0.150, 0.840>	<0.980, 0.010>	<0.985, 0.005>	<0.863, 0.127>	<0.705, 0.285>	<0.188, 0.802>	<1,0>
x ₄	<0.625, 0.365>	<0.727, 0.263>	<0.730, 0.260>	<0.933, 0.057>	<0.684, 0.306>	<0.440, 0.550>	<0,1>
x ₅	<0.119, 0.871>	<0.977, 0.013>	<0.987, 0.003>	<0.872, 0.118>	<0.732, 0.258>	<0.125, 0.865>	<1,0>
x ₆	<0.450, 0.540>	<0.788, 0.202>	<0.785, 0.205>	<0.944, 0.046>	<0.737, 0.253>	<0.460, 0.530>	<0,1>
x ₇	<0.350, 0.640>	<0.939, 0.051>	<0.970, 0.020>	<0.899, 0.091>	<0.763, 0.227>	<0.500, 0.490>	<0,1>
x ₈	<1.000,0.000>	<0.904, 0.086>	<0.875, 0.115>	<1.000,0.000>	<0.937, 0.053>	<0.500, 0.490>	<0,1>
x ₉	<0.150, 0.840>	<0.848, 0.142>	<0.865, 0.125>	<0.854, 0.136>	<0.684,0.306>	<1.000,0.000>	<0,1>

The parameters are set as follows that fuzzy index π_ij = 0.01, λ = 0 . 6, intuitionistic fuzzy partition threshold κ = 0 . 6. When the acceptable error ΔMLE(P_G(R)) is 11.11%, α = 9, t_α = 0.703, then |F| ⩽ 10. It is taken that |F| = 10. There are two subfamily tables for each sampling probability.

5.2 Sampling and system reduction

The subfamily is expressed as F - R_{(P_G,χ)}. Where, P_G represents the sampling probability, χ represents the sampling times. Two subfamilies which are respectively containing 90%,80%,70%,60%,50% samples were sampled respectively from original decision information system, and they are noted as F-R_(90 %_,1), F-R_(90 %_,2), F-R_(80 %_,1), F-R_(80 %_,2), F-R_(70 %_,1), F-R_(70 %_,2), F-R_(60 %_,1), F-R_(60 %_,2), F-R_(50 %_,1), F-R_(50 %_,2), and their reductions are noted as R_(90%,1), R_(90%,2), R_(80%,1), R_(80%,2), R_(70%,1), R_(70%,2), R_(60%,1), R_(60%,2), R_(50%,1), R_(50%,2).

According to the algorithm 2, the reduction of the intuitionistic fuzzy information system is the result of intersection operation of sub-family reductions after dynamic sampling. Under the condition that the threshold is (α,β) = (0.6, 0.4), the reduction results of all sub-families are shown in Table 2. The intuitionistic fuzzy rough information system after attribute reduction is shown in Table 3.

Table 2
Results of system and subfamilies reduction

subfamily Reduction Number of attributes in reduction

A¹ [25] B² [26] C³ [27] A [25] B [26] C[27]

F-R_(50 %_,₁₎ {r₁, r₆} {r₁} {r₁, r₆} 2 1 2

F-R_(50 %_,₂₎ {r₁, r₆} {r₁} {r₁, r₆} 2 1 2

F-R_(60 %_,₁₎ {r₁, r₆} {r₁, r₆} {r₁, r₆} 2 2 2

F-R_(60 %_,₂₎ {r₁, r₆} {r₁} {r₁, r₆} 2 1 2

F-R_(70 %_,₁₎ {r₁, r₆} {r₁, r₆} {r₁, r₆} 2 2 2

F-R_(70 %_,₂₎ {r₁, r₆} {r₁} {r₁, r₆} 2 1 2

F-R_(80 %_,₁₎ {r₁, r₆} {r₁, r₆} {r₁, r₆} 2 2 2

F-R_(80 %_,₂₎ {r₁, r₃, r₆} {r₁} {r₁, r₅, r₆} 3 1 3

F-R_(90 %_,₁₎ {r₁, r₆} {r₁, r₆} {r₁, r₅, r₆} 2 2 3

F-R_(90 %_,₂₎ {r₁, r₃, r₆} {r₁, r₆} {r₁, r₅, r₆} 3 2 3

IFRS_(100 %) {r₁, r₃, r₆} {r₁, r₆} {r₁, r₅, r₆} 3 2 3

subfamily	Reduction	Number of attributes in reduction
F-R_(50 %_,₁₎	{r₁, r₆}	{r₁}	{r₁, r₆}	2	1	2
F-R_(50 %_,₂₎	{r₁, r₆}	{r₁}	{r₁, r₆}	2	1	2
F-R_(60 %_,₁₎	{r₁, r₆}	{r₁, r₆}	{r₁, r₆}	2	2	2
F-R_(60 %_,₂₎	{r₁, r₆}	{r₁}	{r₁, r₆}	2	1	2
F-R_(70 %_,₁₎	{r₁, r₆}	{r₁, r₆}	{r₁, r₆}	2	2	2
F-R_(70 %_,₂₎	{r₁, r₆}	{r₁}	{r₁, r₆}	2	1	2
F-R_(80 %_,₁₎	{r₁, r₆}	{r₁, r₆}	{r₁, r₆}	2	2	2
F-R_(80 %_,₂₎	{r₁, r₃, r₆}	{r₁}	{r₁, r₅, r₆}	3	1	3
F-R_(90 %_,₁₎	{r₁, r₆}	{r₁, r₆}	{r₁, r₅, r₆}	2	2	3
F-R_(90 %_,₂₎	{r₁, r₃, r₆}	{r₁, r₆}	{r₁, r₅, r₆}	3	2	3
IFRS_(100 %)	{r₁, r₃, r₆}	{r₁, r₆}	{r₁, r₅, r₆}	3	2	3

¹A: Algorithm in this paper and the algorithm based on the dependence degree. ²B: Algorithm based on discernibility matrix. ³C: Algorithm in document [27].

Table 3

Intuitionistic Fuzzy Rough Information System after Attribute Reduction (π_ij = 0.01, threshold (α,β) = (0.6,0.4))

U	r₁	r₆	D
x ₁	<0.100, 0.890>	<0.200, 0.790>	<0,1>
x ₂	<0.375, 0.615>	<0.200, 0.790>	<0,1>
x ₃	<0.150, 0.840>	<0.188, 0.802>	<1,0>
x ₄	<0.625, 0.365>	<0.440, 0.550>	<0,1>
x ₅	<0.119, 0.871>	<0.125, 0.865>	<1,0>
x ₆	<0.450, 0.540>	<0.460, 0.530>	<0,1>
x ₇	<0.350, 0.640>	<0.500, 0.490>	<0,1>
x ₈	<1.000,0.000>	<0.500, 0.490>	<0,1>
x ₉	<0.150, 0.840>	<1.000,0.000>	<0,1>

The generalized dynamic reduction of whole intuitionistic fuzzy information system is as GDR_IFRS ={ r₁, r₆ }. While the reduction of generalized attribute reduction based on discernibility matrix of intuitionistic fuzzy rough set is GDR_IFRS ={ r₁ }.

5.3 Stability analysis

Rough set method is used to reduce the data in Table 1, and the attribute sets are reduced as {r₁, r₂, r₄, r₆}, {r₁, r₂, r₅, r₆}, {r₂, r₄, r₅, r₆} and {r₁, r₃, r₅, r₆}. Under the condition of 100% sampling probability, the generalized dynamic reduction of intuitionistic fuzzy information system is a subset of the static reduction of rough set. This shows that, compared with rough set reduction, generalized dynamic reduction can get more refined reduction.

The stability analysis of each sub-family reduction is shown in Table 2. The envelope of dynamic reduction is described by the number of attributes in dynamic reduction, as shown in Fig. 2. The envelope analysis shows that the generalized dynamic reduction based on the relative positive domain of intuitionistic fuzzy rough sets is relatively stable. When the sampling coverage is between 50% and 70%, the system reduction reaches the most stable state, as shown in Fig. 2(a). The generalized attribute reduction algorithm based on discernibility matrix of intuitionistic fuzzy rough set achieves the most stable state when the sampling coverage is more than 90%, as shown in Fig. 2(b). From Fig. 2, compared with the envelope range of the generalized dynamic reduction algorithm based on discernibility matrix of intuitionistic fuzzy rough set, the generalized dynamic reduction algorithm based on the relative positive domain of intuitionistic fuzzy rough sets is smaller than that of the generalized dynamic reduction algorithm based on discernibility matrix of intuitionistic fuzzy rough set. It shows that the stability of the generalized dynamic reduction algorithm in this paper is better than that of the algorithm based on discernibility matrix of the intuitionistic fuzzy rough set.

Fig. 2

Variation curves of samples and reduced envelopes.

In the process of dynamic reduction, the cardinality of the relative positive domain keeps increasing and then remains unchanged. There is no case that the number of attributes is increasing while the cardinality, dependence and importance of the relative positive domain is decreasing, to see Fig. 3.

Fig. 3

Variation curves of relative positive domain cardinality during reduction.

5.4 Diagnostic accuracy verification

Rough set method is used to reduce the data of Table 4, the reductions are {r₁, r₂, r₄, r₆}, {r₁, r₂, r₅, r₆}, {r₂, r₄, r₅, r₆}, and {r₁, r₃, r₅, r₆}. Compared with rough set method, it is found that the algorithm proposed in this paper can get smaller reductions, and the number of reduction attribute is less, and the redundant information is less.

Table 4
Diagnostic accuracy compared with other methods

Reduction Method Reduction or attribute combinations System state Diagnostic results Accuracy

Added attribute combinations {r₁, r₃, r₆} 0,0,0 1,0,1 33.3%

{r₂, r₃, r₄, r₆} 0,0,0 0,0,1 66.7%

{r₂, r₃, r₅, r₆} 0,0,0 1,0,1 33.3%

{r₂, r₃, r₆} 0,0,0 1,1,0 33.3%

{r₁, r₅, r₆} 0,0,0 1,0,1 33.3%

Reduction of algorithm in this paper and in document [27] {r₁, r₆} 0,0,0 0,1,0 66.7%

Algorithm based on discernibility matrix [26] {r₁} 0,0,0 0,1,0 66.7%

Reduction of traditional rough set method {r₁, r₂, r₄, r₆} 0,0,0 1,0,1 33.3%

{r₁, r₂, r₅, r₆} 0,0,0 0,0,1 66.7%

{r₂, r₄, r₅, r₆} 0,0,0 0,0,1 66.7%

{r₁, r₃, r₅, r₆} 0,0,0 1,0,1 33.3%

{r₁, r₂, r₃, r₄, r₅, r₆} 0,0,0 0,0,1 66.7%

Reduction Method	Reduction or attribute combinations	System state	Diagnostic results	Accuracy
Added attribute combinations	{r₁, r₃, r₆}	0,0,0	1,0,1	33.3%
	{r₂, r₃, r₄, r₆}	0,0,0	0,0,1	66.7%
	{r₂, r₃, r₅, r₆}	0,0,0	1,0,1	33.3%
	{r₂, r₃, r₆}	0,0,0	1,1,0	33.3%
	{r₁, r₅, r₆}	0,0,0	1,0,1	33.3%
Reduction of algorithm in this paper and in document [27]	{r₁, r₆}	0,0,0	0,1,0	66.7%
Algorithm based on discernibility matrix [26]	{r₁}	0,0,0	0,1,0	66.7%
Reduction of traditional rough set method	{r₁, r₂, r₄, r₆}	0,0,0	1,0,1	33.3%
	{r₁, r₂, r₅, r₆}	0,0,0	0,0,1	66.7%
	{r₂, r₄, r₅, r₆}	0,0,0	0,0,1	66.7%
	{r₁, r₃, r₅, r₆}	0,0,0	1,0,1	33.3%
	{r₁, r₂, r₃, r₄, r₅, r₆}	0,0,0	0,0,1	66.7%

In order to further verify the validity of the extracted feature parameters, the 7∼9 samples are taken as test samples, the rest samples are taken as training samples, the attribute values in the reduction set are taken as BP neural network input, and the engine state is taken as output to compare and calculate. The results are shown in Table 4. In order to verify the scientificity of the algorithm and the validity of the feature parameters, the experiment adds the combination of some attributes excluded from the reduction to investigate the interference of redundant attributes.

The validation results show that, compared with rough set method, the proposed algorithm obtains fewer feature parameters, eliminates the interference of redundant attributes, and maintains the highest diagnostic accuracy of feature parameters obtained by rough set method. Redundant attributes r₂, r₄, r₅ have less interference to diagnostic accuracy, while redundant attribute r₃ has the greatest interference ability, and the combination precision of reduction containing attribute r₃ is very low. The diagnostic accuracies of reductions {r₂, r₃, r₆} and {r₃, r₆} are the same, it indicates that attribute r₂ is redundant. The diagnostic accuracies of reductions {r₂, r₃, r₅, r₆} and {r₂, r₃, r₆} is the same, it indicates that attribute r₅ is redundant.

The diagnostic accuracy of reduction {r₁, r₆} is higher than that of reduction {r₁, r₂, r₄, r₆}, and attribute r₂ is redundant, so the attribute r₄ is not only redundant, but also has greater interference ability. Compared with the diagnostic accuracy of attribute combination {r₂, r₃, r₄, r₆}, the diagnostic accuracy of attribute combination {r₂, r₃, r₅, r₆} is lower. Under the condition that r₂, r₃, r₄ are all redundant attributes, and r₅ is redundant attribute too, so they can be removed from reduction. The diagnostic accuracies of reduction sets {r₁} and {r₁, r₆} are the same, but it can not be explained whether attribute r₆ is redundant. Because all of r₂, r₃, r₄ are redundant attributes, the diagnostic accuracies of attribute sets {r₂, r₃, r₄, r₆} and {r₂, r₄, r₅, r₆} has reached or reached the same level of {r₁} and {r₁, r₆}, and they are the highest. This shows that when the contribution of attribute r₁ to correct diagnosis is 0, the attribute r₆ plays an important role in making correct diagnosis. They are also the key attributes in describing the fuzzy information system and can not be omitted.

In summary, there are still too many redundant attributes in the attribute reduction using rough set method. The generalized attribute reduction method based on discernibility matrix of intuitionistic fuzzy rough set has the problem of over-reduction, and the key attribute r₆ is omitted in the reduction set. However, the algorithm in this paper can get the relatively correct reduction set of intuitionistic fuzzy information system. In this way, fault diagnosis can be realized with fewer characteristic parameters, which greatly reduce the workload and improve the efficiency of fault diagnosis. At the same time, it can also solve the redundancy problem of diagnosis parameters.

5.5 Optimization effect analysis

Known from Table 1 and Table 3, the system needs 144 storage units to store before reduction. While after reduction, the system only needs 72 storage units, which reduces the storage space by 50%.

According to the classification and discrete results of document [28], the traditional rough set reduction algorithm is used to extract the diagnostic rules, if the total set {r₁, r₂, r₃, r₄, r₅, r₆} of conditional attributes as characteristic parameters, then there are 1024 diagnostic rules extracted from the decision table. Using the algorithm proposed in this paper, if the set {r₁, r₆} of reduction attribute is taken as characteristic parameters, then there are 32 diagnostic rules extracted from the decision table. The size of rule base is reduced by 98.88%.

This proves that the generalized dynamic attribute reduction algorithm based on relative positive domain of intuitionistic fuzzy rough set can get fewer rules after extracting the feature parameters. That is, 1024 diagnostic rules need to be stored before, while only 32 rules need to be stored, and the optimization effect is obvious. This greatly reduces the size of rule base, saves a lot of storage space, effectively solves the problem of big knowledge base, and improves the efficiency of diagnosis.

5.6 Time complexity analysis

Assumption S = (U, P ∪ Q, V, f) is an intuitionistic fuzzy knowledge representation system. Where, |U| = n, |P| = m, and the number of sub-families is |F|. In the dynamic sampling stage, each sampling almost needs to traverse the domain of the system. Therefore, the time complexity of the sub-family sampling process is O (mn|F|). In document [26], the maximum time complexity of the algorithm based on the intuitionistic fuzzy discernibility matrix of intuitionistic fuzzy rough set approximates to O (|F|m (m + 1) n²/2). The time complexity of the attribute reduction algorithm based on the dependency degree of intuitionistic fuzzy rough set is approximating to O (|F|m (m + 1) n²/2) [25]. The time complexity of the proposed algorithm in this paper is approximate to O (|F|m (m - 1) n²/2). Compared with other algorithms, the algorithm proposed in this paper has obvious advantages in time consumption and can save 27.69% time in the case in this paper.

5.7 Comprehensive analysis

A fuzzy rough set attribute reduction method based on fuzzy distance measure of similarity relation is proposed in document [27]. This method is similar to the method based on discernibility matrix and attribute dependency, but it is still powerless to solve the problem of large data sets. Therefore, it can only be compared with the proposed method for the small data set reduction results after generalized sampling.

The parameters are selected as follows p = 2, k = 3, δ = 0.001. According to the reduction results in Table 2, the algorithm in this paper obtains the same generalized dynamic reduction GDR_IFRS ={ r₁, r₆ } as the method in document [27]. The difference is that the algorithm proposed in this paper has redundant attribute r₃ in the reduction of subsets, while the algorithm in document [27] have redundant attribute r₅ in the reductions of subsets, and the stability is also very good. This has been described above.

According to the fault diagnosis results in Table 4, the diagnostic accuracy of the algorithm in document [27] is the same as that of the algorithm in this paper. After removing redundant attribute {r₅}, the diagnostic accuracy is also improved.

The time complexity of the algorithm in document [27] is O (mn² + (m² + m)/2). Combining with the sampling time complexity O (mn|F|) of the sub-family, the time complexity of the synthesis algorithm is $O (| F | m^{2} n (n^{2} + \frac{1}{2} m + \frac{1}{2}))$ , which is higher than that of the algorithm proposed in this paper.

6 Conclusion

In view of the characteristics of dynamic, big data and fuzziness in big data sets, this paper innovatively proposes a generalized dynamic attribute reduction algorithm for big data based on dynamic reduction sampling theory and similarity relation of intuitionistic fuzzy rough set. The dynamic reduction theory is innovatively used to solve the problem of big data set and its dynamics, the intuitionistic fuzzy rough attribute reduction algorithm is used to solve the problem of fuzziness, and the cardinality of relative positive domain is innovatively used to replace the dependence degree as the criterion of reduction reasonableness. The stable reduction of decision table is obtained. The main conclusions and innovations are as follows:

Using dynamic reduction sampling theory, after dynamic sampling, we divide a big data set into small data sets. Through real data validation, dynamic sampling can effectively solve the dynamic increment problem in large-scale fuzzy information system, and obtain stable reduction. The algorithm in this paper provides a selection scheme for data mining and knowledge discovery in dynamic large-scale fuzzy information system.

The algorithm in this paper is an attribute reduction method based on relative positive domain cardinality of intuitionistic fuzzy rough set similarity, using relative positive domain cardinality instead of dependency degree as decision-making condition. It reduces the complexity of the algorithm, extracts the attribute combination with the largest relative positive domain cardinality, effectively extracts the key factors affecting the classification decision-making of the system, removes redundancy, improves the interference ability, and accurately describes the system. The obtained fault diagnosis rule base and sample storage space are optimized.

Compared with similar attribute reduction algorithms, such as rough sets, the reduction algorithm based discernibility matrix and the reduction algorithm based dependence degree of intuitionistic fuzzy rough set, the algorithm proposed in this paper can not only overcome the problem of over-rough reduction and over-reduction, but also get more refined and reasonable approximation reduction, and reduce time consumption. It provides an effective and accurate way for big data processing.

Compared with similar attribute reduction algorithms, such as the reduction algorithm based discernibility matrix and the reduction algorithm based fuzzy distance measure, the algorithm proposed in this paper can own small time complexity on the premise of ensuring diagnostic accuracy. More importantly, it has the ability to adapt to big data set reduction.

At present, incremental computing [29], parallel computing [30, 31] have become new hotspots of big data mining and attribute reduction, so the next research directions are:

By putting the algorithm into bigger data sets and parallel computing environment for attribute reduction, the time complexity of the algorithm will be further reduced and the adaptability of the algorithm to big data processing will be further enhanced.

The adaptability of the algorithm to hybrid information system will be studied, and the processing ability of the algorithm to discrete attributes and multi-decision attributes hybrid information system will be enhanced.

Footnotes

Acknowledgments

The author would like to thank those researchers such as LEI Yingjie, KONG Weiwei, who had studied intuitionistic fuzzy rough theories. The author would like to thank Jan. G. Bazan who proposed generalized dynamic reduction theory. This work is supported and inspired in part by a grant from them.

References

Hongyan

and Zuqiang

, New heuristic algorithm for attribute reduction in decision-theoretic rough set, Computer Science 43(6) (2016), 218–222.

Dongmei

, Tao

and Tao

, Continuous attribute reduction algorithm based on interval type-2 fuzzy rough sets, Application Research of Computers 32(5) (2015), 1379–1382.

Jensen

and Shen

, Semantics-preserving dimensionality reduction, Rough and Fuzzy-Rough-Based Approaches, IEEE Trans On Knowledge and Data Engineering 16(12) (2004), 1457–1471.

Zhihe

and Xiaohui

, Improved difference matrix heuristic attribute reduction algorithm, Computer Engineering and Design 37(4) (2016), 1032–1036.

Qing

, Shanlin

and Wenjun

, A novel attributes reduction algorithm of intuitionistic fuzzy-valued information system, Fuzzy Systems and Mathematics 28(4) (2014), 138–143.

Hai

, Attribute reductions in intuitionistic fuzzy decision information systems, Mathematics in Practice and Theory 46(13) (2016), 148–153.

Tingquan

, Chengdong

and Yuetong

, Fuzzy similarity relation based variable precision fuzzy rough sets, CAAI Transactions on Intelligent Systems 7(2) (2012), 148–152.

Tao

, Zenglin

, Fangan

et al., A hybrid feature gene selection method based on fuzzy neighborhood rough set with information entropy, International Journal of Signal Processing and Pattern Recognition 7(6) (2014), 95–110.

Qing

, Shanlin

and Wenjun

, A novel attributes reduction algorithm of intuitionistic fuzzy-valued information system, Fuzzy Systems and Mathematics 28(4) (2014), 138–143.

10.

Jensen

and Mac Parthal'ain

, Nearest neighbour-based fuzzy-rough feature selection, Lecture Notes in Computer Science Volume 8536 (2014), 35–46.

11.

Jensen

, Mac Parthal'ain

and Cornelis

, Feature grouping-based fuzzy-rough feature selection, Proceedings of the IEEE International Conference on Fuzzy Systems(FUZZ-IEEE14) (2014), 1488–1495.

12.

Han

L.X.

, Liew

C.S.

, Hemert

J.V.

et al., A generic parallel processing model for facilitating data mining and integration, Parallel Computing 37 (2011), 157–171.

13.

X.H.

, Knowledge discovery in databases: an attribute-oriented rough set approach, Ph.D. Dissertation, University of Regina, (1995).

14.

X.H.

and Cercone

, Learning in relational database: a rough set approach, International Journal of Computational Intelligence 11(2) (1995), 323–337.

15.

Bazan

, Showron

and Synak

, Dynamic reducts as a tool for extracting laws from decision tables, In: L. Polkowski, A. Skowron, eds. Methodologies for Intelligent System: Proc. 8th International Symposium ISMIS’94, Charlotte, NG, LNAI 869, Spring Verlag, (1994), 346–355.

16.

Bazan

, A compasion of dynamic and non-dynamic rough set methods for extracting laws from decision tables, In: L. Polkowski, A. Skowron eds. Rough sets in Knowledge Discovery: Methodology and Applications, Physica-Verlag, Heidelberg, (1998), 321–365.

17.

Bazan

, Dynamic reducts and statistical inference, In: L. Polkowski, A. Skowron, eds. Proceeding of the Sixth International Conference, Information Processing and Management of University in Knowledge-Based Systems (IMPU’96), Granada, Spain. (1996), 1147–1152.

18.

Jiayang

, Songqiao

and An

, Study for dynamic reduct based on rough set, Mini-Micro Systems 27(11) (2006), 2056–2060.

19.

Jan

and Bazan

, Dynamic reducts and statistical inference, In: Proceedings of the Sixth International Conference, Information Processing and Management of Uncerntainty in Knowledge-Based Systems(IPMU’96), Granada, Spain, (1996), 1147–1152.

20.

Xingqin

, Research on dynamic reduction sampling technology, Ph.D. Dissertation, University of Electronic Science and Technology of China, (2011).

21.

Jensen

, Tuson

and Shen

, Finding rough and fuzzy-rough set reducts with SAT, Information Sciences 255 (2014), 100–120.

22.

Yanli

, Yingjie

and Zhaoyuan

, Construction of intuitionistic fuzzy similarity relation, Computer Applications 28(2) (2008), 311–314.

23.

Zhang

C.P.

, Statistic analysis technique and application, Chongqiang University Press, Chongqiang (1998), 147–154.

24.

Chuanchao

, Research on diagnosis algorithms based on rough set and fuzzy set and its application for aircraft, M.S. thesis, China Civil Aviation University, Tianjing, (2008).

25.

Yingjie

, Yanli

, Weiwei

et al., Intuitionistic fuzzy-rough set theory and application, Science Press, Beijing (2013), 168–186.

26.

Chuanchao

, Large data generalized dynamic fault feature extraction algorithm based on intuitionistic fuzzy-rough set discernibility matrix, Journal of Computers 14(1) (2019), 1–24.

27.

Wang

, et al., Fuzzy rough set-based attribute reduction using distance measures, Knowledge-Based Systems (2018). http://doi.org/10.1016/j.knosys.2018.10.038.

28.

Yunxue

, An algorithm of acquisition for diagnostic parameters of engine fault based on fuzzy-rough sets, Proceedings of 2007 IEEE International conference on Automation and Logistics (2007), 2306–2309.

29.

Lingfang

, Hui

, Chengwen

et al., On incremental NAVE bayesian classification algorithm based on dynamic reduction, Computer Applications and Software 32(3) (2015), 188–191.

30.

Yanqin

, Suping

and Mingyuan

, Attribute reductions based parallel computing using fuzzy-rough sets, Light Industry Science and Technology 203(10) (2015), 72–83.

31.

Suruchi

Ms.

, Nandgaonkar

and Raut

A.B.

, A survey on parallel method for rough set using Map Reduce technique for data mining, International Journal Of Engineering And Computer Science 4(5) (2015), 14160–14163.