Abstract
For a formal context, we define an equivalence relation on the set of attributes. Through this equivalence relation, we define the lower and upper approximation operators relative to the family of semiconcepts of the formal context. We study on the two operators with the further properties that are interesting and valuable in the theory of rough set. In addition, we research on the lattice properties of all of semiconcepts in a formal context. Using this lattice, we set up two operators, and find their approximation properties in the theory of rough set. The two ways for giving approximations generalize the idea of Pawlak rough set approximations from one universal set to two non-related universal sets. We provide examples to exam the correct of the two approximation ways in this paper.
Introduction
The theory of rough set, proposed by Pawlak [1], is motivated by practical needs in classification, concept formation, and data analysis with insufficient and incomplete information (cf. Pawlak [2]). It is a very useful mathematical tool in classification of a collected data under equivalence relation (see [1 –3]). We interpret the rough set theory as an extension of set theory with two additional unary set-theoretic operators referred to as approximation operators (cf. Pawlak [1]). There are many generalized approximations of the standard Pawlak model (see [4 –9]) and the applied areas of Pawlak model and the generalized models (see [2 , 11]). Some of the generalized approximations are applied to formal concept analysis, e.g. Xu, Yao and Miao [6].
Formal concept analysis, proposed by Wille [12] as a mathematical theory of concepts, is used successfully in the area of knowledge representation and knowledge processing (cf. [13, 14]). With the development of formal concept analysis, Vormbrock and Wille point [15] that formal concept analysis has been formally enriched by introducing the notions of semiconcepts.
Semiconcepts have been first considered in the development of conceptual knowledge systems in Luksch and Wille [16]. After that, there are many results for researching on semiconcepts (cf. [14
, 17–21]). How to describe and define semiconcepts properly is a basic question of the philosophical doctrine of semiconcepts. If we can characterize semiconcepts with a new idea such as rough set which is different from that before, then for the theory semiconcepts, we will make advantage and give a good answer to the above basic question. We observe the fact that the existed applications of rough set in the research of formal concept analysis bring plentiful results. All of these applications are discussed on one universal set or on one universal set up to isomorphism as some existed results of rough set do. However, some information systems need to be described on two non-related sets at the same time. For example, a formal context is consisted by two non-related universal sets: one is the set of objects and another is the set of attributes. In addition, a semiconcept is described in a formal context by two parts: one is a subset of objects and another is a subset of attributes. Additionally, we hope to obtain a good success with rough set to semiconcept since rough set has been successful applied in many kinds of information systems. Up to now, for rough set theory, some informations are described by two sets such as Wong, Wang and Yao [22] generalized the rough set model using two distinct but related universal sets, and Liu [23] discussed some properties and applications of rough set based on two distinct but related universal sets as that in Wong, Wang and Yao [22]. In fact, the correspondent results in [22, 23] are expressed on ‘one’ set since there is a map between the two universal sets. If we use the standard Pawlak model or an existed extension of standard Pawlak model on rough set to describe semiconcept, it is a little difficult since all of these models are defined in one universal set, or two related universal sets. However a semiconcept needs to be expressed by two finite non-related sets such that the two finite sets do not have a map from one set to another one as that defined in Wong, Wang and Yao [22]. Based on this idea, we should solve the extension of the ground set of rough set from one set or two related sets to two non-related sets. We notice that Pawlak [1, 2] and Liu [23] said, one of the main directions of research in rough set theory is naturally the generalization of the Pawlak rough set approximations. Hence, in this paper, with the assistance of Pawlak standard model (see Pawlak [1]) and a lattice model (cf. Yao [4]) in rough set respectively, we will generalize standard Pawlak model and the lattice model to two finite and nonempty sets O and P for a formal context
The rest of this paper is arranged as follows: Section 2 is to review some notations and lemmas what we need later on. In Section 3, for the semiconcepts in a formal context, with respect to an equivalence relation on the attribute set, we can construct the approximation operators with the assistance of the family of equivalence classes. Another content is based on lattice theory to define approximation operators. We prove that any of the two kinds of approximations can characterize the set of semiconcepts, respectively. The last one is Section 4 which concludes this paper and leaves room for our future works.
Some notations and lemmas
This section will review some definitions and properties needed in the sequel. For more details, formal concept analysis is referred to Ganter and Wille [13], semiconcept is seen Vormbrock and Wille [15], and rough set is seen Pawlak [2].
(2)[15] In an arbitrary context
(2) Ganter and Wille [13,p.17] indicate that every formal context can be represented by a cross table. We may easily see that full rows and full columns are always reducible (see Ganter and Wille [13,p.24]); thereby we mean objects g with g′ = P and attributes m with m′ = O, respectively. Therefore, in this paper, we consider the formal contexts with no full rows and no full columns.
(1.1)
(2)[13,p.19] If T is an index set and, for every t ∈ T, A
t
⊆ O is a set of objects, then
Since the family of ⊓-semiconcepts has the dual property for the family of ⊔-semiconcepts, we only consider the set of ⊓-semiconcepts in this paper. Furthermore, we will say a semiconcept instead of a ⊓-semiconcept in what follows. All of ⊓-semiconcepts in a formal context
Let
(1) (X 1, Y 1) ⊆ (X 2, Y 2) : ⇔ X 1 ⊆ X 2 and Y 1 ⊆ Y 2;
(2) (X 1, Y 1) ∪ (X 2, Y 2) : = (X 1 ∪ X 2, Y 1 ∪ Y 2);
(3) (X 1, Y 1) ∩ (X 2, Y 2) : = (X 1 ∩ X 2, Y 1 ∩ Y 2).
Generalized approximation operators
Pawlak’s standard rough set model [1] depends on an equivalence relation on a universal set. Many researchers have developed standard rough set model in many ways [4–7 , 25].
Xu, Yao and Miao [6] discussed the approximations in formal concept analysis with lattice-theoretic operators. But, it deals with the extent lattice and intent lattice respectively. In fact, it still defined on one dimension, that is on one universal set. Hence, all the discussions in [6] are similar to that in [4 , 11] if we consider the dimension in their background sets.
Though there are covering approximation spaces (see Safari and Hooshmandasl [24]) which are a generalization of equivalence-based rough set theories. They still consider the approximation spaces defined on one dimension space. That is to say, these covering approximation spaces generalized Pawlak’s standard model with one dimension space only.
Mao and Kang [25] studied the approximations for concept lattice with the aid of variable precision rough set which is an extension of rough set (see Ziarko [26]). However, it needs to know all of concepts in a formal context, and then to approximate the sets not concepts. In fact, it approximates the sets in the family of objects or the family of attributes respectively. Hence, it also generalized the idea of Pawlak’s standard model only in one dimension space.
In the above, we analyze the approximation operators defined on several typical rough sets respectively. As a result, we find that none of them is defined on two non-related universal sets, though it is important to the theory of approximation in formal concept analysis.
We know from Vormbrock [14] that in a formal context, every concept is a semiconcept and not vice versa. Up to now, there does not find the approximation operators for semiconcepts. All of these existed approximation results in [4–7 , 25] for formal concept analysis come from the thought of Pawlak’s standard model on one universal set. Hence, it is necessary to consider approximation operators for semiconcept with the assistance of Pawlak’s standard model from one universal set to two dimensions space, i.e. two non-related universal sets. In addition, we may be assured that lattice theory is important also to the research of semiconcepts.
All the results provided here are useful for concept lattice since every concept is a semiconcept.
When we consider the approximations on the family of semiconcepts in a formal context
From Pawlak’s standard model for rough set in [1, 2] and the other extension models, we determine that in an information system
(p1) it should give a family of subsets on a background set S defined by
(p2) the lower approximation operator of
(p3) the lower and upper approximation operators can characterize the fundamental sets.
In this paper, we will pay attention to semiconcepts in a formal context
Approximation operators relevant to an equivalence relation
First, we will discuss some special elements in ß(
(1) X = O ⇒ (X, ∅) ∈ B (
(2) (∅ , Y) ∈ ß (
By Definition 2.1(1) and Remark 2.1, we may easily obtain X = O⇒ X′ = ∅. Thus, (X, ∅) ∈ ß (
To prove item (2).
By Definition 2.1 and Remark 2.1, we may easily see ∅′ = P if ∅ ⊆ O. Thus, (∅ , Y) ∈ ß (
Conversely, according to the given Y = P and Remark 2.1, we find that if X′ = P for X ⊆ O, then X =∅. Hence, we obtain the correct of (∅ , Y) ∈ ß (
We will use an example to show the converse of item (1) in Lemma 3.1 not to be true.
We set: Kempinski=a 1, Hilton=a 2, Ibis=a 3 and ISSY=a 4; Guest House=b 1, Hotel=b 2, First Class=b 3 and Luxus=b 4. Then Table 1 is expressed as Table 2.
Using Definition 2.1, we can obtain all of X′ for any X ⊆ O 1 = {a 1, a 2, a 3, a 4} as follows. In fact, we see P 1 = {b 1, b 2, b 3, b 4}.
∅′ = P 1; {a 1} ′ = {b 2, b 3, b 4} , {a 2} ′ = {b 2, b 3}, {a 3} ′ = {b 2} , {a 4} ′ = {b 1}; {a 1, a 2} ′ = {b 2, b 3}, {a 1, a 3} ′ = {b 2} , {a 1, a 4} ′ = ∅ , {a 2, a 3} ′ = {b 2}, {a 2, a 4} ′ =∅ , {a 3, a 4} ′ = ∅; {a 1, a 2, a 3} ′ = {b 2} , {a 1, a 2, a 4} ′ =∅ , {a 2, a 3, a 4} ′ = ∅, {a 1, a 3, a 4} ′ =∅; {a 1, a 2, a 3, a 4} ′ =∅.
All of semiconcepts are (X, X′) for any X ⊆ O 1.
We see {a
2, a
4} ′ =∅. This means ({a
2, a
4} , ∅) ∈ ß (
Next, we define a relation on P for a formal context
b 1 Eb 2 if and only if {b 1} ′ = {b 2} ′.
We may easily prove E to be an equivalence relation on P. The equivalence class containing x for x ∈ P is given by [x] E .
From Definition 3.1, we find that [b]
E
is the set which is indistinguishable with b ∈ P. In the coming discussion, we will find that we only need to pay attention to [b]
E
for any b ∈ P. This hints that whether | [b]
E
|>1 or | [b]
E
|=1 will not affect our discussion. Additionally, for the set of attributes, it is easy to obtain {
E
∣ b ∈ P}. Hence, we omit to show how to obtain [b]
E
for any b ∈ P in a formal context
Second, for the case of X≠ ∅ and Y≠ ∅, we will find the characterization of (X, Y) ∈ ß (
According to Lemma 2.1 and Definitions 2.1 and 3.1, we find
=
Summing up the above, we obtain
We will demonstrate the two operators defined in Definition 3.2 to fulfil the works (p2) and (p3) for (X, Y) ⊆ (O, P) with X≠ ∅ and Y≠ ∅.
(1)
(2)
Next, we will prove item (2) with two steps.
Step 1. If
Step 2. If (X, Y) ∈ ß (
In view of Definition 2.1, we obtain Y = X′. Combining Lemma 3.2, we receive
Third, we will give two approximation operators and discuss the properties relative to (p2) and (p3) for (X, Y) ⊆ (O, P) such that at least one of X and Y is ∅ in the following.
After we analyze Lemmas 3.1 and 3.2, we can provide some definitions as follows for (X, Y) ⊆ (O, P) such that at least one of X and Y is an empty set.
(D1) If X =∅, then we define
(D2) If Y =∅, then we define
We will obtain the following results relative to (D1) and (D2).
(1)
(2)
We easily find
We also easily find
To prove item (2).
When
When (∅ , Y) ∈ ß (
When
When (X, ∅) ∈ ß (
Analyzing the approximations for (∅ , Y) and (X, ∅) where X ⊆ O and Y ⊆ P, we may easily find the following statements.
(i)
(ii)
(iii)
(iv)
Combining the statements (i)-(iv), Theorem 3.1 and Lemma 3.3, we can state that for any (X, Y) ⊆ (O, P),
Next, we will analyze the rationality of
(A1) From Definition 3.2, we see that using equivalence relation, we define
(A2) Considering Lemmas 3.2 and 3.3 and the statements (i)-(iv), for any (X, Y) ⊆ (O, P), we find
(A3) In view of Theorem 3.1 and Lemma 3.3, we can point that
(A4) Based on the above analysis (A1)-(A3), we can express that
We will use an example to express the results in this subsection.
Let X
1 = {a
1, a
2} and Y
1 = {b
4}. From Example 3.1, we find
Furthermore, by Definition 3.2, we attain
Let X
2 = {a
1, a
2, a
3} and Y
2 = {b
2}. Then, there is L (X
2, Y
2) = {({a
1} , [b
2]
E
∩ {b
2}) , ({a
1} , [b
3]
E
∩ {b
2}) , ({a
1} , [b
4]
E
∩ {b
2}) , ({a
2} , [b
2]
E
∩ {b
2}), ({a
2} , [b
3]
E
∩ {b
2}) , ({a
3} , [b
2]
E
∩ {b
2})} = {({a
1} , {b
2}) , ({a
1} , ∅) , ({a
2} , {b
2}) , ({a
2} , ∅) , ({a
3} , {b
2})} and U (X
2, Y
2) = {({a
1} , [b
2]
E
∪ {b
2}) , ({a
1} , [b
3]
E
∪ {b
2}) , ({a
1} , [b
4]
E
∪ {b
2}) , ({a
2} , [b
2]
E
∪ {b
2}) , ({a
2} , [b
3]
E
∪ {b
2}) , ({a
3} , [b
2]
E
∪ {b
2})} = {({a
1} , {b
2}) , ({a
1} , {b
2, b
3}) , ({a
1} , {b
2, b
4}) , ({a
2} , {b
2}) , ({a
2} , {b
2, b
3}) , ({a
3} , {b
2})}. Hence, using Definition 3.2, we obtain
Example 3.2 shows the correct of Theorem 3.1 and Lemma 3.2.
By similar means of the discussion for double Boolean algebras for the set of ⊓-semiconcepts and ⊔-semiconcepts in Vormbrack and Wille [15], we will obtain the following results.
(1) (A 1, B 1) ⊑ (A 2, B 2) : ⇔ A 1 ⊆ A 2 and B 1 ⊇ B 2.
(2) (A 1, B 1) ∧ (A 2, B 2) : = (A 1 ∩ A 2, (A 1 ∩ A 2) ′) , (A 1, B 1) ∨ (A 2, B 2) : = (A 1 ∪ A 2, (A 1 ∪ A 2) ′).
Then, (ß (
First, we will prove that (ß (
We may easily see A 1 ⊆ A 1 and B 1 ⊇ B 1 from A 1 = A 1 and B 1 = B 1. Thus, we can decide (A 1, B 1) ⊑ (A 1, B 1).
If (A 1, B 1) ⊑ (A 2, B 2) and (A 2, B 2) ⊑ (A 1, B 1). Then, we obtain A 1 ⊆ A 2 and B 1 ⊇ B 2 from (A 1, B 1) ⊑ (A 2, B 2), and meanwhile, we attain A 2 ⊆ A 1 and B 2 ⊇ B 1 from (A 2, B 2) ⊑ (A 1, B 1). Summarizing, we obtain A 1 = A 2 and B 1 = B 2. So, (A 1, B 1) = (A 2, B 2) holds.
If (A 1, B 1) ⊑ (A 2, B 2) and (A 2, B 2) ⊑ (A 3, B 3). Then, we obtain A 1 ⊆ A 2 and B 1 ⊇ B 2 from (A 1, B 1) ⊑ (A 2, B 2), and at the same time, we receive A 2 ⊆ A 3 and B 2 ⊇ B 3 from (A 2, B 2) ⊑ (A 3, B 3). Thus, we receive A 1 ⊆ A 2 ⊆ A 3 and B 1 ⊇ B 2 ⊇ B 3. This follows (A 1, B 1) ⊑ (A 3, B 3).
In one word, owing to Ganter and Wille [13,p.2], (ß (
Second, we will prove that (ß (
We may easily know A
1 ∩ A
2 ⊆ A
j
(j = 1, 2). From (A
j
, B
j
) ∈ ß (
Using Lemma 2.1(2) and Definition 2.1, we infer to
Therefore, according to Ganter and Wille [13,p.5], we confirm that (ß (
Vormbrock and Wille [15] provided basic theorem on semiconcept algebra, which is a complete pure double Boolean algebra for the family of ⊓-semiconcepts and ⊔-semiconcepts. That is to say, the two operators of meet and join for the semiconcept algebra in [15] are defined for the set of ⊓-semiconcepts and ⊔-semiconcepts. Hence, it does not confirm that the set of ⊓-semiconcepts is closed for the two operators of meet and join for semiconcept algebra in [15]. Furthermore, Theorem 3.2 is necessary for ß (
Yao [4] discussed the approximation operators for a finite Boolean algebra defined on one dimension. We are known that a finite Boolean algebra is a complete lattice and not vice versa. Theorem 3.2 shows that the lattice property of (ß (
Yao [27] discussed some approximation operators for formal concept analysis. However, it dealt those approximation operators on extension set and intension set, respectively. Thus, we can state that those approximation operators in [27] are defined on one dimension set respectively.
Therefore, it is an essential duty for formal concept analysis to define approximation operators on two dimensions. Meanwhile, the new operators should keep the original characteristic that in one dimension such as that in [4,27, 4,27]. Since every concept is a semiconcept, we will consider the essential duty on the set of semiconcepts in a formal context. Based on these analysis, we can define the approximation operators for semiconcepts lattice (ß (
Let X ⊆ O and Y ⊆ P. We set
The operators
(1) If
(2) If
(3) (∅ , P) ⊆ (X, Y) ⊆ (O, ∅) is not correct. That is, if
(4) If
(5) If (X, Y) ∈ ß (
(6) If
(7) If
By Lemma 3.1, we find (∅ , P) , (O, ∅) ∈ ß (
Let (M, N) ∈ ß (
Additionally, it is easy to see M ⊆ O and N⊇ ∅. This implies (M, N) ⊑ (O, ∅). That is to say, (O, ∅) is the greatest element in (ß (
To prove item (3).
If (∅ , P) ⊆ (X, Y). Then we obtain P ⊆ Y. According to Y ⊆ P, we receive Y = P. This means (X, Y) = (X, P). So, we determine (X, P) notsubseteq (O, ∅) since P≠ ∅ follows Pnotsubseteq∅. Hence, (∅ , P) ⊆ (X, Y) ⊆ (O, ∅) does not hold. Considered this result with items (1) and (2), we confirm
To prove item (4).
From the above item (1) and
If (X, Y) ∈ ß (
Therefore, item (4) is demonstrated.
To prove item (5).
According to (X, Y) ⊆ (X, Y) and (X, Y) ∈ ß (
To prove items (6) and (7).
When
From Lemma 3.4, we can state that for any (X, Y) ⊆ (O, P), if we hope to characterize (X, Y) ∈ ß (
(1)
(2)
(3)
The following example will show that for (X, Y) ⊆ (O, P), if
Let X
1 = {a
1, a
2, a
4} and Y
1 = {b
1, b
4}. Combining Example 3.1, we can obtain
Let X
2 = {a
1} and Y
2 = {b
2, b
4}. From Example 3.1, we see
Combining Lemma 3.4 and Example 3.3, we find that for (X, Y) ∈ (O, P), we only need to pay our attention to the case of
(1)
(2)
Let
Considering Theorem 3.2, Lemma 2.1 with Definition 2.1, we infer to
Combining Theorem 3.2, we obtain
Therefore, we receive the item (1) to be correct.
To prove item (2).
(⇒) If
Considering item (1) and the given, we decide
(⇐) If (X, Y) ∈ ß (
According to Definition 3.3, we obtain
By the definition of
Summing up the above, we can express
Actually, from the proof in Theorem 3.3, we may easily obtain the following result.
Theorem 3.3, Corollary 3.1 and Lemma 3.4 taken together implies that for any (X, Y) ⊆ (O, P),
Let X
3 = {a
1, a
2} and Y
3 = {b
2, b
3}. Then there are
In fact, using Definition 2.1, we obtain (X
3, Y
3) ∈ ß (
Example 3.4 shows the correct of Theorem 3.3 and Corollary 3.1.
Conclusion
In a formal context
(1) the lower approximation of (X, Y)⊆ (X, Y) ⊆ the upper approximation of (X, Y);
(2) the lower approximation of (X, Y) equals to the upper approximation of (X, Y) if and only if (X, Y) ∈ ß (
Therefore, we can state that the approximation methods provided in this paper for a formal context are the generalizations of Pawlak’s correspondent idea from one set to two non-related sets with respect to the family of semiconcepts. We hope that our methods here can give a thinking to generalize rough set from one universal set to two or more than two non-related universal sets in the future. These will be done in our future works.
Footnotes
Acknowledgement
This research was supported by National Natural Science Foundation of China [grant number 61572011] and Natural Science Foundation of Hebei University [grant number A2018201117]. The author is grateful to Professor Y.Y. Yao for giving some discussions to semiconcepts during the author’s visiting at University of Regina in 2017.
