Abstract
As a technique for granular computing, rough sets deal with the vagueness and granularity in information systems. Covering-based rough sets are natural extensions of the classical rough sets by relaxing the partitions to coverings and have been applied in many fields. However, many vital issues in covering-based rough sets, including attribute reduction, are NP-hard and therefore the algorithms for addressing them are usually greedy. Hence, it is necessary and helpful to generalize the covering-based rough sets from different viewpoints. In this paper, a new type of covering-based rough sets, named parametric covering-based rough sets, are proposed and some properties and applications of the parametric covering-based rough sets are investigated. First, a concept of inclusion degree is introduced into covering-based rough set theory to explore some properties of the parametric covering approximation space. Second, the parametric covering-based rough sets are established on the basis of inclusion degree. Moreover, some properties of the parametric covering-based rough sets are introduced. Third, it is found that the calculations of corresponding parametric covering-based lower and upper approximations can be converted into the operations of matrices, which makes the calculations convenient. Finally, a simple application of the parametric covering-based rough sets to network security is introduced.
Keywords
Introduction
Rough set theory (RST) was originally proposed by Pawlak [26, 27] in 1982 as a useful mathematical tool for dealing with the vagueness and granularity in information systems and data analysis. This theory can approximately characterize an arbitrary subset of a universe by using two definable subsets called lower and upper approximations [7]. Pawlak’s rough set is built on equivalence relations. However, an equivalence relation imposes restrictions and limitations on many applications [11, 40]. To address this issue, many extensions have been made by replacing equivalence relations with other notions such as reflexive relations [20], similarity relations [32], tolerance relations [31], arbitrary relations [23], coverings [8, 46–49], and fuzzy relations [12, 22]. Meanwhile, different rough set models have not only combined with each other, but also connected with other theories [1, 39]. For instance, Zakowski [43] established covering-based rough set theory by exploiting coverings of universal sets. The study on covering-based rough sets is very necessary and important. Particularly, in recent years, with the fast development of computer science and technology, how to use the effective mathematical tools to address practical problems has become more and more essential.
A further RST innovation has been the development by Ziarko [50] of a variable precision rough sets (VPRS) model, which incorporates probabilistic decision rules. This is an important extension, since as noted by Kattan et al. [19], “In real-world decision making, the patterns of classes often overlap, suggesting that predictor information may be incomplete... This lack of information results in probabilistic decision making, where perfect prediction accuracy is not expected”. VPRS deals with partial classification by introducing a precision parameter β (in the rough set the β value is zero). The β value represents a bound on the conditional probability of a proportion of objects in a condition class, which are classified to the same decision class. Ziarko [50] considered β as a classification error and it is defined to be in the domain [0.0, 0.5). Because the VPRS model has no formal historical background of having empirical evidence to support any particular method of β-reduct selection [2], VPRS-related research studies do not focus in detail on the choice of the precision parameter (β) value. Ziarko [50] proposed the β value to be specified by the decision maker. Beynon [3] proposed two methods of selecting a β-reduct without such a known β value. Beynon [4] proposed the allowable β value range to be an interval, where the quality of classification may be known prior to determining the β value range. The extended VPRS was introduced by Katzberg et al. [10], which allowed asymmetric bounds l and u to be used. The VPRS models the restriction l < 0.5 and u = 1 - l must hold. Beynon [5] introduced the (l, u)-quality graph, which elucidates the associated level of quality of classification, based on the selected l and u values. And another research works for VPRS can be found in [25, 33].
The covering-based rough set model based on variable precision was presented by Zhang and Wang [45] in order to expand rough set application space and to generalize the classical rough set. But there are two basic relational formulas of variable precision covering rough set inclusion
However, the above research works for the VPRS are all based on a parameter β. In this paper, by introducing the concept of inclusion degree, a new type of covering-based rough sets, named parametric covering-based rough sets are defined, which can not only generalize the theory of covering-based rough sets, but extend the applications of covering-based rough sets to diverse fields. Especially, this parameter covering-based rough set model is defined based on two parameters 0 ≤ β < α ≤ 1. On one hand, since there exist two parameters in this parameter covering-based rough set model, it is easy to see that this model is more complex than another VPRS models. On the other hand, the accuracy of this parameter covering-based rough set model is higher than another VPRS models in the practical problems. First, the definition of inclusion degree is proposed and some of its properties are explored. Moreover, through inclusion degree, some problems of covering-based rough sets can be addressed. Second, the parametric covering-based rough sets are proposed by means of inclusion degree and the properties of parametric covering-based rough sets are investigated. Third, it is found that the calculations of corresponding parametric covering-based lower and upper approximations can be represented by the operations of matrices. Finally, an application of parametric covering-based rough sets to network security is given.
The remainder of this paper is organized as follows: In Section 2, some basic concepts and properties of covering-based rough sets are introduced. In Section 3, the parametric covering-based rough sets are proposed by means of inclusion degree. Moreover, characterizations of parametric covering-based rough sets are investigated. In Section 4, the matrix representations of parametric covering-based lower and upper approximations are proposed. In Section 5, an application of the parametric covering-based rough sets is introduced. Section 6 concludes this paper.
Preliminaries
We introduce the definitions of covering and partition at first.
In the following discussion, unless stated to the contrary, the universe of discourse U is considered to be finite and nonempty.
It is clear that a partition of U is certainly a covering of U. So the concept of covering is an extension of the concept of partition. Neighborhood [6, 41] is a concept used widely in covering based rough sets. It is defined as follows.
It is clear that x ∈ N
If y ∈ N
After the concept of neighborhood has been given, we can introduce the concept of neighborhoods.
There is an important property of neighborhoods presented by the following proposition.
By the definition of Cov (
Parametric covering-based rough sets
In this section, we introduce a new concept named inclusion degree into covering-based rough sets. Based on this, the parametric covering-based rough set model is established and characterizations of parametric covering-based rough sets are investigated.
The inclusion degree based on covering approximation space
For any x ∈ U and X ⊆ U, the inclusion degree of x to X with respect to
(1)
are hold, where X, Y ⊆ U and x, y ∈ U.
According to the definition of inclusion degree, it is easy to see the inclusion degree of x to X with respect to
(⇐): If there exist K
i
, K
j
∈
The above proposition shows a necessary and sufficient condition for covering to be a partition from the viewpoint of inclusion degree.
Neighborhood is an important concept in covering-based rough sets. That under what condition neighborhoods form a partition is a meaningful issue induced by this concept. Many scholars have paid attention to this issue and presented some necessary and sufficient conditions. In the following proposition, a new necessary and sufficient condition for neighborhoods to form a partition is presented from the viewpoint of inclusion degree.
(⇐): For any x, y ∈ U, if
The above propositions and examples show some properties and applications of inclusion degree. In the following subsection, the parametric covering-based rough sets are established from the perspective of inclusion degree. Based on this work, characterizations of the parametric covering-based rough sets are investigated.
Characterizations of the parametric covering-based rough sets
We firstly define the parametric covering-based rough sets by using inclusion degree.
Suppose β = 0.4, α = 0.75. Then
For any 0 ≤ β < α ≤ 1,
The above definition shows a new type covering-based rough sets with parameter α and β. In fact, this type of covering-based rough set model focuses on the approximation operations of X ⊆ U in CAS from the viewpoint of quantity. Specifically, this covering-based rough set model is decided by two parameters, i.e., 0 ≤ β < α ≤ 1. As we all known, rough set can approximately characterize an arbitrary subset of a universe by using two definable subsets called lower and upper approximation operators. Furthermore, we study some properties of this type of covering-based rough sets. In the first, the positive region, boundary region and negative region of
If
Like many other types of covering-based rough sets on CAS = (U,
(1)
The above proposition shows some important properties of the parametric covering-based rough sets. In fact, it is clear that positive region will increase with parameter α decrease, negative region will increase with parameter α increase and that boundary region will dwindle for the parametric covering-based rough sets. In the following examples we study another properties of the parametric covering-based rough sets.
In following, we give the roughness and precision of the parametric covering-based rough sets.
According to the roughness and precision of X, bn (X, α, β) =∅ if and only if η
(1) 0 ≤ η
Interdependency of the lower and upper approximation operations
According to the above definitions and propositions, the parametric covering-based rough sets have been established from the viewpoint of inclusion degree. In this subsection, the conditions for two coverings to generate the same parametric covering-based lower and upper approximation operations are studied. For this purpose, we firstly introduce the key concept defined in [46], namely reduct of a covering.
Proposition 3.20 guarantees that after deleting a reducible subset in a covering, it is still a covering, while Proposition 3.21 shows that deleting a reducible subset in a covering will not generate any new reducible elements or make other previously reducible elements irreducible. Consequently, we can compute the reduct of a covering of a domain by deleting all reducible elements. The remainder still consists of a covering of the domain and is irreducible.
By Proposition 3.21 and the definition of the reduct, it is clear that reduct(
The following proposition on the condition under which two coverings generate the same parametric covering-based lower approximation operations.
The above proposition shows a sufficient but not necessary condition under which two coverings generate the same parametric covering-based lower approximation operations. Let us propose an example as follows to expound this proposition.
For parametric covering-based upper approximation operations in the parametric covering-based rough sets, we are still faced with the question of when two coverings will generate the same upper approximation operation. In the following discussions, the condition under which two coverings generate the same parametric covering-based lower approximation operations is explored.
Same as Proposition 3.23, the above proposition shows a sufficient but not necessary condition under which two coverings generate the same parametric covering-based upper approximation operations.
Topological properties of parametric covering-based rough sets
In this subsection, some topological characterizations of the parametric covering-based rough sets will be investigated. For this purpose, some fundamental definitions and results of topology should be introduced.
The members of the topology
For an operator
Based on the above basic definitions, the following propositions can be proposed.
The above proposition shows the parametric covering-based lower approximation operator is an interior operator on U when the parametric α = 1. In the following discussions, we prove that the parametric covering-based upper approximation operator is a closure operator on U under some conditions.
Based on the above discussions, the following proposition is presented.
In this section, we mainly establish the parametric covering-based rough sets model and then investigate some properties of the parametric covering-based rough sets. Especially, the conditions under which two coverings generate the same parametric covering-based approximation operations are proposed. In the following section, the matrix representations of parametric covering-based lower and upper approximation operations will be explored.
Matrix representations of parametric covering-based lower and upper approximation operations
In this section, we will show that the calculations of the parametric covering-based lower and upper approximations of the parametric covering-based rough sets can be performed through operations of matrices, which makes the calculations much more convenient. For this purpose, some basic definitions and remarks are introduced as follows.
The definition of operations between Boolean matrix and Boolean vector is similar to above definition. Here ∧ denotes minimum and ∨ denotes maximum.
Let us illustrate the above definitions by a simple example.
Neighborhood is an important concept in covering-based rough sets. In [17], authors proposed the matrix representation of neighborhood as the following theorem shown.
It is easy to find that M
The above proposition shows a matrix representation of neighborhood of covering. Based on this representation, the inclusion degree can be represented by matrix approach.
In the following discussions, a matrix representation of the parametric covering lower and upper approximations of X about CAS is presented.
In this section, we mainly present the matrix representations of parametric covering-based upper and lower approximations.
An application of the parametric covering-based rough sets to network
In [13], Ge investigated separations in covering approximation spaces and gave some characterizations of these separations and some relations among these separations. As an application of these results, investigations on network security were converted into investigations on separations in covering approximation spaces by taking covering approximation spaces as mathematical models of networks. In this section, we introduce the application of the new type of covering-based rough sets into network security based on [13]. First, some definitions and lemmas should be introduced.
Now, we give a simple application of the new type of covering-based rough sets into network security.
(2)⇒ (4): If
(3)⇔ (4): It is easy to prove by Definitions 3.1, 3.8 and Proposition 3.3.
This completes the proof of this proposition.□
The above proposition shows some sufficient and necessary conditions for a network has S1-security from the viewpoint of the new type of covering-based rough sets. A successfully application of this new type of covering-based rough sets into network is a course examination network in Soochow University [13].
Conditions of Network
In order to prevent cheats in a course examination, it is necessary to guarantee that every student can only acquire his/her own exam questions in the network Establishment of Network Put V is the set of 30 users: V = {x1, x2, …, x30}. Put For each Conversion of Network By Lemma 5.3, we convert the network U is the abstract set of V: U = {x1, x2, …, x30}. Security of Network It is not difficult to check that It is not difficult to compute that
According to the analysis of the above example, it is easy to know that the conclusions are more scientific, reasonable and suitable to the reality when applying the new type of covering-based rough sets to the reality. Therefore, the decision of the reality will be more exact in the practice. Finally, we give a practical example of the parameter covering-based rough set model defined inDefinition 3.8.
Suppose that the symptoms in X = {x1, x2, x5, x6} are all involved in a certain disease A, taking α = 1, β = 0.5 as the critical value, then we can calculate the lower approximation and upper approximation of X as follows.
Since
Conclusions
In this paper, by introducing the concept of inclusion degree, a new type of covering-based rough sets, named parametric covering-based rough sets were proposed, which can not only generalize the theory of covering-based rough sets, but extend the applications of covering-based rough sets to diverse fields. First, the definition of inclusion degree was defined and some of its properties were investigated based on covering approximation space. Moreover, through inclusion degree, some problems of covering-based rough sets can be addressed. Second, the parametric covering-based rough sets were established by means of inclusion degree and the properties of parametric covering-based rough sets were explored. Third, it is found that the calculations of corresponding parametric covering-based lower and upper approximations can be represented by the operations of matrices. Finally, an application of parametric covering-based rough sets to network security was introduced. Though much research has been conducted in this paper, there are still many interesting issues worth studying. Relationships between parametric covering-based rough sets and other types of covering-based rough sets. Dependency of lower and upper approximation operations of the parametric covering-based rough sets. Axiomatic system for approximation operations of the parametric covering-based rough sets. Matroidal structures of the parametric covering-based rough sets.
In further work, we will conduct more specific research alone the lines of the above four issues, especially axiomatic systems and matroidal properties [21, 39] for the parametric covering-based rough sets.
Compliance with ethical standards
Conflict of interest: Author Bin Yang declares that he has no conflict of interest.
Ethical approval: This article does not contain any studies with human participants or animals performed by any of the authors.
Footnotes
Acknowledgments
The authors are greatly thankful to anonymous reviewers for sharing their valuable comments that significantly improved the quality of the paper. This research was supported by the National Natural Science Foundation of China [grant No. 61562079] and the Initial Foundation for Scientific Research of Northwest A & F University [grant No. 2452018054].
