Abstract
Formal concept analysis, originally proposed by Wille, is a mathematical tool to analyse and represent data in the form of complete formal context. However, in situations with incomplete information, one only has partial knowledge about a concept, recently, a common conceptual framework of the notions of interval sets and incomplete formal contexts for representing partially-known concepts were presented. In this study, we examine and reinterpret the existing studies on partially known concepts by means of three-valued logics. By treating an incomplete formal context as a three-valued formal context and considering the one-to-one correspondence between interval sets and three-valued mappings, we investigate the condition under which the four types of partially known concepts can be generated by using three-valued implication operators. Moreover, we also evaluate the role of three-valued logic in characterizing attribute implications. A sufficient and necessary condition for computing the true value of an implication correctly in the sense of Kriple semantics is provided.
Introduction
Formal concept analysis has been introduced by Wille and developed by Ganter and Wille ([1]) as an effective method to data mining and knowledge discovery. It is formulated based on the notion of a formal context, which is a binary relation between a set of objects and a set of properties of attributes. The binary relation induces set-theoretic operators from sets of objects to sets of properties, and from sets of properties to sets of objects, respectively. The fundamental notion of a formal concept is defined as a pair of a set of objects and a set of properties connected by the two set-theoretic operators.
For a formal context in [6], it is a known fact that an object o either have an attribute a or not, such a formal context is also called a complete formal context. However, in real situations, we may only have incomplete information. Concretely, for an object, we know a set of attributes that the object has, we also know another set of attributes that the object does not have, for the rest of attributes, we do not know their states. Such a formal context is called an incomplete formal context in the existing literature ([1, 18]). A difficulty in such a formal context is that the formal concepts cannot be produced exactly by using Wille’s method. We may only have partially-known concepts. Several proposals on partially-known concepts in an incomplete formal context have been made and investigated. For instance, Burmeister and Holzer ([1]) introduced a pair of certain and possible extents and a pair of certain and possible intents of a concept, which are useful in characterizing partially-known concepts. Obiedkov ([14]) used a completion of an incomplete formal context for modeling incomplete formal context. Djouadi et al. ([5]) studied the least and the greatest completions of an incomplete formal context to characterize the family of all possible completions. Li et al. ([11]) introduced the notion of an approximate concept with a pair of sets of attributes as the intent and a set of objects as the extent. In a straightforward way, Li and Wang ([12]) adopted the notion of three-way formal concepts from Qi et al. to an incomplete context. In [10], the authors put forward a method to compress the approximate concept lattice using K-medoids clustering. To describe logical formulas between columns of incomplete contexts, the Kripke-semantics are used for propositional formulas over the set of attributes ([8, 9]). Recently, in [18], Yao introduced a common conceptual (i.e., interpretational) framework of partially-known concepts in incomplete formal contexts by using the notion of interval sets. In [13], Ma et al. introduced the theory of interval sets into the object-oriented concept lattice, and proposed an object-oriented interval-set concept lattice. In [17], the basic ideas underlying fuzzy logic were introduced into the study of threeway formal concept analysis.
Three-valued logics ([4]) are straightforward generalization of Boolean logic based on the most simple bipolar scale
The rest of the paper proceeds as follows: in order to make the paper as self-contained as possible, we recapitulate in Section 2 the definition of concepts, formal contexts and implications in three-valued logics. In Section 3, we present a novel characterization of partially-known concepts by means of three-valued logics. In Section 4, we evaluate the role of three-valued logic in characterizing attribute implications. A sufficient and necessary condition for computing the true value of an implication correctly in the sense of Kriple semantics is provided. And in Section 5, we complete this paper with some concluding remarks.
Preliminaries
Formal context and concept
Let K = (OB, AT, I) be a complete formal context ([6]), for a set of objects O ⊆ OB and a set of attributes A ⊆ AT, by considering the maximal set of attributes shared by all objects in O and the maximal set of objects sharing all attributes in A, we arrive at a pair of derivation operators and their properties, as listed below.
As stated in the introduction part, in real situations, we may only have incomplete information. That is, for an object, we know a set of attributes that the object has, we also know another set of attributes that the object does not have, however, for the rest of attributes, we do not know their states. Such a formal context is called an incomplete formal context in the literature ([1, 18]). In what follows, we use K = (OB, AT, {+ , ? , -} , J) to denote an incomplete context, where the symbol ? means an unknown state.
In an incomplete context K = (OB, AT, {+, ?, -}, J), for a set of objects we can no longer derive a set of attributes shared by these objects by using the derivation operator σ, and for a set of attributes nor can we derive a set of objects that share all these attributes by using the derivation operator τ. Burmeister and Hozer and Obiedkow generalized derivation operators σ and τ into modal-style derivation operators as given in the next definition.
Similarly, corresponding to the derivation operator τ, a pair of lower and upper derivation operators that maps a set of attributes A ⊆ AT to a pair of the sets of objects as follows:
Three-valued logic
Three-valued logics are straightforward generalizations of Boolean logic based on the most simple bipolar scale
In [4], the authors defined a maximal family of sensible conjunctions, in addition to the existing ones.
If x ≤ y then x∗ z ≤ y ∗ z ; If x ≤ y then z∗ x ≤ z ∗ y ; 0 ∗ 0 =0 ∗ 1 =1 ∗ 0 =0 and 1 ∗ 1 =1 .
In the case of implication, the authors give a general definition, which extend Boolean logic and supposes monotonicity.
If x ≤ y then y→ z ≤ x ∗ z ; If x ≤ y then z→ x ≤ z ∗ y ; 0 → 0 =1 → 1 =1 and 1 ∗ 0 =0 .
We can easily derive from Definition 2.5 that
All implications on
in the sense of Definition 2.5
All implications on
In [18], four types of partially-known concepts in an incomplete context were proposed. They are SE-SI concepts (i.e., set extent and set intent), SE-ISI concepts (i.e., set extent and interval set intent), ISE-SI concepts (i.e., interval set extent and set intent) and ISE-ISI concepts (i.e., interval set extent and interval set intent). In what follows, we aim to present a novel characterization of these concepts by means of three-valued logics.
Three-valued logics and SE-ISI concepts and ISE-SI concepts
For an incomplete formal context, the collection of objects has altogether three different states with respect to the set of attributes, the truth values are {0, ? , 1}, or equivalently,
In what follows, we mainly consider the truth ordering, then
(L, ∧ , ∨ , 0, 1) is a complete lattice with the least element 0 and the greatest element 1; (L, ⊗ , 1) is a commutative monoid; (⊗ , →) is an adjoint pair in L, i.e. a ⊗ b ≤ c ⇔ a ≤ b → c, a, b, c ∈ L .
In a residuate lattice L = (L, ∧ , ∨ , ⊗ , → , 0, 1), → is called a residual implication. It is a known fact that there exists altogether 2 residual implications on
Łukasiewicz conjunction on
G
In what follows, we will focus on the following question: if we adopt the residual implication on
According to Definition 3.2, if we employ Łukasiewicz implication operator →, we have
f∗ (a) = ⋀o∈OB (f (o) → I (o, a)) = (f (1) → I (1, a)) ∧ (f (2) → I (2, a)) ∧ (f (3) → I (3, a)) ∧ (f (4) → I (4, a)) ∧ (f (5) → I (5, a)) ∧ (f (6) → I (6, a)) = (1 → 1) ∧ (1 → 1) ∧ (1 → 1) ∧ (0 →
f∗ (b) = ⋀o∈OB (f (o) → I (o, b)) = (f (1) → I (1, b)) ∧ (f (2) → I (2, b)) ∧ (f (3) → I (3, b)) ∧ (f (4) → I (4, b)) ∧ (f (5) → I (5, b)) ∧ (f (6) → I (6, b)) = (1 → 1) ∧ (1 → 0) ∧ (1 → 1) ∧ (0 → 1) ∧ (0 → 1) ∧ (0 → 1) = 0,
f∗ (c) = ⋀o∈OB (f (o) → I (o, c)) = (f (1) → I (1, c)) ∧ (f (2) → I (2, c)) ∧ (f (3) → I (3, c)) ∧ (f (4) → I (4, c)) ∧ (f (5) → I (5, c)) ∧ (f (6) → I (6, c)) = (1 →
similarly, f∗ (d) = f∗ (e) =0 . That is, f∗ is defined as follows:
It can be checked easily that the corresponding interval set is [{a} , {a, c}], as desired.
Since Łukasiewicz implication and G
The following proposition shows that if we use Gaines-Rescher implication, then
We thus conclude from Proposition 3.1-Proposition 3.4 that the collection SE-ISI and ISE-SI concepts can be obtained by employing Łukasiewicz implication (or equivalently, G
However, if we employ Sette and Gaines-Rescher implication in E-I direction, we may obtain a different result.
The above proposition shows that if we employ Soboci
According to Definition 3.2, if we employ G
f∗ (a) = ⋀o∈OB (f (o) → I (o, a)) = (f (1) → I (1, a)) ∧ (f (2) → I (2, a)) ∧ (f (3) → I (3, a)) ∧ (f (4) → I (4, a)) ∧ (f (5) → I (5, a)) ∧ (f (6) → I (6, a)) = (0 → 1) ∧ (0 → 1) ∧ (1 → 1) ∧ (1 →
f∗ (b) = ⋀o∈OB (f (o) → I (o, b)) = (f (1) → I (1, b)) ∧ (f (2) → I (2, b)) ∧ (f (3) → I (3, b)) ∧ (f (4) → I (4, b)) ∧ (f (5) → I (5, b)) ∧ (f (6) → I (6, b)) = (0 → 1) ∧ (0 → 0) ∧ (1 → 1) ∧ (1 → 1) ∧ (0 → 1) ∧ (0 → 1) = 1,
f∗ (c) = ⋀o∈OB (f (o) → I (o, c)) = (f (1) → I (1, c)) ∧ (f (2) → I (2, c)) ∧ (f (3) → I (3, c)) ∧ (f (4) → I (4, c)) ∧ (f (5) → I (5, c)) ∧ (f (6) → I (6, c)) = (0 →
similarly, f∗ (d) =1, f∗ (e) =0 . That is, f∗ is defined as follows:
It can be checked easily that the corresponding interval set is [{b, c} , {a, b, c, d}], as desired.
The above proposition shows that if we employ Sette implication, then O* is equal to the derivation operator in the greatest completion of an incomplete formal context.
We summarize the obtained results in Table 4.
An incomplete formal context (U, AT, {+ , ? , -})
The relationship between 3-valued logic and SE-ISI concepts
For ISE-ISI concepts, both extents and intents are interval sets, they can be equivalently represented by three-valued mappings. In what follows, we will examine the following issue: Whether ISE-ISI concepts can be generated by using the existing implications on
The following proposition provides different ISE-ISI concepts by using all possible implications on
(ii) If we use G
(iii) If we use Sette implication, then for
(iv) If we use Nelson implication, then for
(v) If we use Kleene implication, then for
(vi) If we use Ja
(vii) If we use Soboci
(viii) If we use Gaines-Rescher implication, then for
(ix) If we use Bochvar external implication, then for
Proposition 3.7 tells that if we use different types if implications on
According to Definition 3.4, if
The role of three-valued logic in characterizing attribute implication
Attribute implications
Attribute implications are important tools of conceptual data analysis. Let K = (OB, AT, I) be a complete context. For sets A, B ⊆ AT of attributes, we designate by A → B an attribute implication, where A is called the premise, B is called the conclusion of the implication. We say that this implication holds (is valid) in K, if and only if, for every object o ∈ OB to which all attributes from A apply, all attributes from B apply, too, or equivalently, A ⊆ σ ({o}) implies B ⊆ σ ({o}) .
By treating attributes in M as propositional variables, we can also interpret A → B as a propositional formula ∧a∈A → ∧ b∈B. Such attribute implications contain a lot of information about the data, and the concept lattice of a given complete context K is determined up to isomorphism by the knowledge of a set of attribute implications valid in K and generating the set of all attribute implications valid in K with respect to the ARMSTRONG rules [1].
For incomplete contexts, attribute implications can be generated as follows. Let K = (OB, AT, {+, ?, -}, J) be an incomplete formal context. An attribute implication A → B is certainly valid in K if it holds in any completion of K; an attribute implication A → B is possibly valid in K, if it holds in some completion of K; otherwise, the attribute implication is invalid in K. Usually, we assign the value 1 to those certainly valid attribute implications, the value
Evaluating attribute implications with Kleene logic
In [1], the three-valued Kleene logic is used to evaluate attribute implications. Every pair (o, a) (o ∈ OB, a ∈ AT) can be associated with a propositional variable, whose value is given by J (o, a), that is,
Proposition 4.2 shows a way to evaluate an attribute implication syntactically with respect to any incomplete context, because the implications A → B and A → (B \ A) always contain the same information.
Note that Proposition 4.1 and 4.2 hold for attribute implications with disjoint sets for premise and conclusion. By considering a more general case, we can present the following sufficient and necessary condition for computing the true value of an attribute implication correctly by means of three-valued Kleene logic.
Let A → B be an attribute implication and C = A ∩ B. For o ∈ OB, c ∈ C, a ∈ A \ C, b ∈ B \ C, we call (J (o, a), J (o, b), J (o, c)) a truth vector of o with respect to A → B if J (o, c) & J (o, a) → J (o, c) & J (o, b) = &o ∈OB (& a∈A J (o, a) → &a∈B J (o, a)). Denote
Owing to Proposition 4.1, we will consider the case C≠ ∅ in the following discussion.
Proposition 4.3 holds under the preliminary condition A\ C ≠ ∅ and B\ C ≠ ∅. For the left cases, we have the following results.
A comparative analysis between the computed value and the true value w.r.t. different truth vectors
Truth table corresponding to Proposition 4.4
Truth table corresponding to Proposition 4.5
In this paper, we present a novel characterization of partially-known concepts by means of three-valued logics. Concretely, we examine in detail what type of implications on
Our results are based on the preliminary assumption that the possible attribute values of each object are independent, that is, all completions of the incomplete formal context are possible. In the future study, we will focus on the situation where objects in incomplete contexts are not independent to each other, and present the logical characterization of partially known concepts. Furthermore, it is an interesting topic to extend the present study to incomplete fuzzy formal context and provide the corresponding characterizations by using fuzzy logics.
Footnotes
Appendix
Considering the fact that O is a crisp set (that is, O (o) is either 0 or 1) and the fundamental property a →
L
b = 1 ⇔ a ≤ b of Łukasiewicz implication, we then obtain that O* (a) =1 if and only if for any o ∈ OB, O (o) =1 implies I (o, a) =1, if and only if for any o ∈ O, I (o, a) =1. Combining the definition of
Similarly,
We use f to denote the three-valued set on the universe corresponding to the interval set
Considering that for Łukasiewicz implication →, a → b = 1 if and only if
Now, we will show the proof by considering the necessity part and sufficiency part, respectively.
“Necessity”: Suppose, on the contrary, that
“Sufficiency”: If &o∈OB (& a∈AJ (o, a) → & b∈B
Case 1: &o∈OB (& a∈AJ (o, a) → & b∈BJ (o, b)) =1: We then have from Table 5 that the three-valued Kleene-logic computes the truth value of A → B with respect to K correctly in the sense of our Kriple-semantics.
Case 2: &o∈OB (& a∈AJ (o, a) → & b∈BJ (o, b)) =0: We then have from Table 5 that the three-valued Kleene-logic computes the truth value of A → B with respect to K correctly in the sense of our Kriple-semantics,
Case 3:
