Abstract
A lattice-valued information system is an important model in the field of artificial intelligence and the notion of homomorphisms between lattice-valued information systems is a kind of tools to study data compression in a lattice-valued information system. This paper investigates invariant characterizations of information structures in a lattice-valued information system under homomorphisms based on data compression. Information structures in a lattice-valued information system is first proposed by using set vectors. Then, dependence and independence between information structures in the same lattice-valued information system is characterized by the inclusion degree. Finally, a complex massive lattice-valued information system can be compressed into a relatively small-scale lattice-valued information system by means of homomorphisms and it is proved that some characterizations of information structures in a lattice-valued information system under homomorphisms based on data compression are invariant, that is, some of the same data structures are obtained.
Keywords
Introduction
An information system based on rough set theory was proposed by Pawlak [25, 26]. Most applications of rough set theory, such as rule extraction [2, 38], feature selection [7–9, 24], uncertainty modeling [3, 32], reasoning with uncertainty [40, 41] are related to information systems.
In reality there are always complex and massive data in information systems. How to remove data redundancy while maintaining the same data structures and to discover potentially useful information is important. Thus, we need to consider “Data compression” in information systems. Data compression is a promising technique for storage and transmission of data. When data compression is used, data is transmitted faster, and file storage requires less space. Many aspects of data compression are described in Leweler et al. [19] and Storer [30]. Data compression in information systems is referred to two aspects of operations on data, one is to reduce stored and transferred data volume, the other is to reduce data dimension. Data volume reduction can be viewed as a many-to-one mapping between information systems in mathematics. Data dimension reduction can be explained as attribute reducts of information systems.
Homomorphisms between information systems as a kind of tool to study data compression in information systems was introduced by Grzymala-Busse in [6]. Homomorphisms are very useful for aggregating sets of objects, attributes, and descriptors of the original system. As explained in [27], homomorphisms allow one to translate the information contained in one granular world into the granularity of another granular world and thus provide a communication mechanism for exchanging information with other granular worlds. Li et al. [22] studied invariant characterizations of information systems under homomorphisms. Wang et al. [35–37] investigated generalized information systems and covering information systems under homomorphisms, and pointed out that the notion of homomorphisms between information systems can be regarded as a kind of tools to study the compression of data volume in complex massive databases.
Granular computing, introduced by Zadeh [43], is an effective tool in artificial intelligence [43–46]. Basic notions of granular computing contain information granulation, organization and causation. Zadeh deemed “Granulation involves decomposition of whole into parts, organization involves integration of parts into whole, and causation involves association of causes and effects”. Later, Lin [11, 12] and Yao [41, 42] emphasized the importance of granular computing.
Granules are clumps consisting of objects drawn together by indistinguishability, similarity, and proximity of functionality [12, 23]. Granulation of objects in an information system leads to information granules. A granular structure is a mathematical structure of the family of information granules, where the inner structure of each information granule is visible, and the interactions between information granules are detected by the visible structures [13–15].
In granular computing in information systems, the study of information structure is one of important research directions. An equivalence relation is a special kind of similarities between objects from a data set. Given an information system, an attribute subset determines an equivalence relation on the set of objects. This equivalence relation partitions the set of objects into some disjoint subsets, these subsets are called equivalence classes. If two objects belong to the same equivalence class, then we may say that they cannot be distinguished under this equivalence relation. Thus, every equivalence class can be viewed as an information granule consisting of indistinguishable objects [11, 28]. All information granules constitutes a vector, this vector is called an information structure in this information system induced by this attribute subset. Information structures in an information system are namely granular structures in the meaning of granular computing.
It is known that an unknown target concept can be characterized approximately by existing knowledge structures in a knowledge base, which is one of the strengths of rough set theory. In granular computing in knowledge bases, Qian et al. [28] studied knowledge structures in a knowledge base. Li et al. [20] discussed relationships between knowledge bases and proved that knowledge reductions, coordinate families and necessary relations in a knowledge base are invariant and inverse invariant under homomorphisms. Li et al. [21] studied properties of knowledge structures in a knowledge base by means of condition information amounts, inclusion degrees and lower approximation operators, and gave group, lattice, mapping, and soft characterizations of knowledge structures in a knowledge base. Their results have been shown to be very helpful for knowledge discovery from knowledge bases and significant for establishing a framework of granular computing in knowledge bases [29]. Obviously, knowledge structures in a knowledge base are namely granular structures in the meaning of granular computing.
In an information system, if the domain of every attribute is a lattice according to a decreasing or increasing preference, then this information system is called a lattice-valued information system. The purpose of this paper is to obtain invariant characterizations of information structures in a lattice-valued information system under homomorphisms based on data compression.
The remaining part of this paper is organized as follows. In Section 2, we recall some concepts about binary relations, lattices, lattice-valued information systems and homomorphisms. In Section 3, we describe information structures in a lattice-valued information system through set vectors and define dependence and independence between them. In Sections 4, we give some properties of information structures. In Sections 5, we obtain invariant characterizations of information structures under homomorphisms based on data compression. Sections 6 summarizes this paper.
Preliminaries
In this section, we recall some concepts about binary relations, lattices, lattice-valued information systems and homomorphisms.
Throughout this paper, U and V denotes two non-empty finite sets called the universes, 2 U denotes the family of all subsets of U, |X| denotes the cardinality of X ∈ 2 U . All mappings are assumed to be surjective.
Put
Binary relations and lattices
Recall that R is a binary relation on U whenever R ⊆ U × U. If (x, y) ∈ R, then we denote it by xRy.
Let R be a binary relation on U. Then R is called reflexive, if xRx for any x ∈ U. symmetric, if xRy implies yRx for any x, y ∈ U. anti-symmetric, if xRy and yRx imply x = y for any x, y ∈ U. transitive, if xRy and yRz imply xRz for any x, y, z ∈ U.
Let R be a binary relation on U. Then R is called an equivalence relation on U, if R is reflexive, symmetric and transitive.
In this paper, 2U×U denotes the family of all binary relations on U.
Given R ∈ 2U×U. If R = δ, then R is called a universal relation; if R =▵, then R is called an identity relation.
Let L be a non-empty set. Given R is a binary relation on L. Then R is call a partial order on L, if R is reflexive, anti-symmetric and transitive. Moreover, the pair (L, R) or L is called a partial order set (briefly, a poset).
In this paper, the partial order R on L is denoted by ≤.
If a ≤ b and a ≠ b, then we denote it by a < b.
a ∈ L is called a top element of L, if a ≤ x for any x ∈ L. a ∈ L is called a bottom element of L, if x ≤ a for any x ∈ L.
If the poset L has top elements a1, a2 (resp. bottom elements b1, b2), then a1 = a2 (resp. b1 = b2). This unique top element (resp. bottom element) is denoted by 1 L or 1 (resp. 0 L or 0).
a ∈ L is called a above boundary in S, if a ≤ x for any x ∈ S. a ∈ L is called a under boundary in S, if x ≤ a for any x ∈ S. a = ∨ S, if a is a minimal above boundary in S. a = ∧ S, if a is a maximal under boundary in S.
If S = {a, b}, then we denote ∨S = a ∨ b and ∧S = a ∧ b.
Lattice-valued information systems
If P ⊆ A, then (U, P) is called a subsystem of (U, A).
If (U, A) is an information system and P ⊆ A, then an equivalence relation (or indiscernibility relation) ind (P) can be defined by
Obviously, ind (P) = ⋂ a∈Pind ({a}).
Let L≼ = (U, A) be a lattice-valued information system. If for each attribute a ∈ A, V a = ({1, 2, ⋯, k}, ∨, ∧), then L≼ = (U, A) is a classic information system; If for each attribute a ∈ A, V a = ([0, 1], ∨, ∧), then L≼ = (U, A) is a fuzzy information system; If for each attribute a ∈ A, V a = (2 W , ∪, ∩), then L≼ = (U, A) is a set-valued information system; If for each attribute a ∈ A, V a = (I ([0, 1]), ∨, ∧), then L≼ = (U, A) is a interval value information system.
Given a ⊆ A. Define x ≼ a y ⇔ a (x) ≤ a a (y). Then ≼ a is called the dominance relation on U with respect to the attribute a ([39]). And x ≼ a y means that “y is at least as good as x with respect to the criterion a” and then we may say that y dominates x or x is dominated by y with respect to the attribute a.
A lattice-valued information system (U,A)
A lattice-valued information system (U,A)
According to above expression, we can find that V
a
1
= {1, 2, 3} is a lattice with real numbers, where ≤
a
1
is “≤” between two real numbers. So the dominance relation on U with respect to the attribute a1 can be defined as
V
a
2
= {0.3, 0.6, 0.9} is a lattice with numerical numbers located in the unit interval [0, 1], from which ≤
a
2
is “≤” between two fuzzy elements. And the dominance relation on U with respect to the attribute a2 can be defined as
V
a
3
= {{0}, {0, 1}, {0, 1, 2}} is a lattice with set-valued elements, where ≤
a
3
is “⊆” between two two sets. Then the dominance relation on U with respect to the attribute a3 can be defined as
Given P ⊆ A. Define
Then ≼ P is called the dominance relation on U with respect to the attribute subset P [39]. And x ≼ P y means that “y is at least as good as x with respect to the criterion set P” and then we may say that y dominates x or x is dominated by y with respect to the attribute subset P.
Obviously, ≼ P = ⋂ a∈P ≼ a .
Denote
Then,
Obviously,
≼ P is reflexive and transitive, U/≼ P constitutes a covering of U.
In mathematics, homomorphisms between two information systems can be explained as a mapping. Li et al. [22] studied invariant characterizations of information systems under homomorphisms. Wang et al. [37] pointed out that this homomorphism can be regarded as a kind of tools to study data compression in information system.
The triple h = (h1, h2, h3) is called a homomorphism from (U, A) to (V, B), if for all u ∈ U and a ∈ A,
The above definition may also applies to lattice-valued information systems.
Then the triple h = (h1, h2, h3) is called a homomorphism from (U, A) to (V, B), if for each a ∈ A, h3 ↾
U
a
: U
a
→ Vh2(a) is order-preserving, and for any u ∈ U, a ∈ A,
We write (U, A) ∼ h (V, B).
Some concepts of information structures in a lattice-valued information system
In this section, we propose some concepts of information structures in the same lattice-valued information system.
Let L≼ = (U, A) be a lattice-valued information system. Given P ⊆ A. Then,
If ≼ P is an identity relation on U, then S (P) = ([x1], [x2], ⋯⋯, [x n ]).
Note that
Then
Then
The following definition depicts dependence and independence between information structures in a lattice-valued information system.
a) S (Q) is called to depend on S (P), if for each i, b) S (Q) is called to depend strictly on S (P), if S (P) ⪯ S (Q) and S (P) ≠ S (Q), we write S (P) ≺ S (Q). a) S (Q) is called to depend partially on S (P), if for exists i, b) S (Q) is called to depend partially strictly on S (P), if S (P) ⊑ S (Q) and S (P) ≠ S (Q), we write S (P) ⊏ S (Q). S (Q) is called to be independent of S (P), if for each i,

S(P) and S(Q)
Figure 1 means S (P) ≺ S (Q), Fig. 2 means S (P) ⊏ S (Q) and S (Q)
S (P), Fig. 3 means S (P) ⊏ S (Q) and S (Q) ⊏ S (P), Fig. 4 means S (P) ⋈ S (Q) and S (Q) ⋈ S (P).
Obviously,
S (A) ≺ S (P4) ≺ S (P1) ≺ S (∅), S (A) ≺ S (P5) ≺ S (P3) ≺ S (∅), S (A) ≺ S (P2) ≺ S (∅), S (P4) ≺ S (P2), S (P4) ≺ S (P3), S (P5) ≺ S (P1).
S (P1) ⊏ S (P2), S (P2) ⊏ S (P1); S (P1) ⊏ S (P3), S (P3) ⊏ S (P1); S (P2) ⊏ S (P3), S (P3) ⊏ S (P2); S (P2) ⊏ S (P5), S (P5) ⊏ S (P2); S (P4) ⊏ S (P5), S (P5) ⊏ S (P4) (see Fig. 5).

S(U,A)
S (P) = S (Q) ⇔ ≼
P
= ≼
Q
; S (P)⪯ S (Q) ⇔ ≼
P
⊆ ≼
Q
;
0 ≤ D (S (Q)/S (P)) ≤1; S (P) ⪯ S (Q) implies D (S (Q)/S (P)) =1; S (P) ⊑ S (Q) ⊑ S (L) implies D (S (P)/S (L)) ≤ D (S (P)/S (Q)).
We have
This example illustrates that
The following theorem show that dependence and independence between information structures in a lattice-valued information system can be quantitatively described by the inclusion degree.
S (P) ⪯ S (Q) ⇔ D (S (Q)/S (P)) =1. S (P) ⋈ S (Q) ⇔ D (S (Q)/S (P)) =0. S (P) ⊑ S (Q) ⇔0 < D (S (Q)/S (P)) ≤1.
“⟸”. Put
Then
Then
Thus for each i,
It follows that for each i,
Hence S (P) ⪯ S (Q).
(2) “⇒”. Since S (P) ⋈ S (Q), we have ∀ i,
Thus D (S (Q)/S (P)) =0.
“⟸”. Since D (S (Q)/S (P)) =0, we obtain that for each i,
Then, ∀ i,
(3) This holds by (2). □
Invariant characterizations of information structures in a lattice-valued information system under homomorphisms based on data compression
Homomorphisms between two lattice-valued information systems may be a tool for studying data compression in lattice-valued information systems. Then, a complex massive lattice-valued information system can be compressed into a relatively small-scale information system by means of homomorphisms. Meanwhile, some same data structures in lattice-valued information systems are invariant.
Note that (U, A) ∼
h
(V, B). Then, ∀ a ∈ P,
So ∀ a ∈ P, g a (u) = g a (u*).
Thus u ∈ g P (u*).
“⟸”. Suppose u ∈ g
P
(u*). Then, ∀ a ∈ P, g
a
(u) = g
a
(u*), i.e., ∀ a ∈ P, h3 (a (u)) = (h3 (a (u*)). Since (U, A) ∼
h
(V, B), we have ∀ a ∈ P, h2 (a) (h1 (u)) = h3 (a (u)), h3 (a (u*)) = h2 (a) (h1 (u*). Then
Thus
Hence
Then
Thus
Let y = h1 (x), v = h1 (u). Then,
Note that (U, A) ∼
h
(V, B). Then, ∀ a ∈ A,
So ∀ a ∈ P1, h3 (a (u)) ≤ h2(a)h3 (a (x)).
Since h3 ↾
U
a
is order-preserving, we have ∀ a ∈ P1, a (u) ≤
a
a (x). This implies (u, x) ∈ ≼
P
1
. By hypothesis, (u, x) ∈ ≼
P
2
, i.e., ∀ a ∈ P2, a (u) ≤
a
a (x). Since h3 ↾
U
a
is order-preserving, we have ∀ a ∈ P2,
Then, ∀ a ∈ P2,
This implies v ≼ h2(P2)y. So (v, y) ∈ ≼ h2(P2).
Thus ≼h2(P1) ⊆ ≼ h2(P2).
Conversely, suppose ≼h2(P1) ⊆ ≼ h2(P2), ∀ (u, x) ∈ ≼
P
1
, we have u ≼
P
1
x. Then
Note that (U, A) ∼
h
(V, B). Then, ∀ a ∈ A,
Since h3 ↾ U a is order-preserving, we have ∀ a ∈ P1, h3 (a (u)) ≤ h2(a)h3 (a (x)).
Then, ∀ a ∈ P1, h2 (a) (h1 (u)) ≤ h2(a)h2 (a) (h1 (x)). This implies (h1 (x), h1 (u)) ∈ ≼ h2(P1).
By hypothesis, (h1 (x), h1 (u)) ∈ ≼ h2(P2), i.e., ∀ a ∈ P2, h2 (a) (h1 (u)) ≤ h2(a)h2 (a) (h1 (x)). Then, ∀ a ∈ P2, h3 (a (u)) ≤ h2(a)h3 (a (x)).
Since h3 ↾ U a is order-preserving, we have ∀ a ∈ P2, a (u) ≤ a a (x). This implies u ≼ P 2 x. Then (u, x) ∈ ≼ P 2 .
Thus ≼ P 1 ⊆ ≼ P 2 . □
S (P1) = S (P2) ⇔ S (h2 (P1)) = S (h2 (P2)) ; S (P1) ⪯ S (P2) ⇔ S (h2 (P1)) ⪯ S (h2 (P2)).
(1) This follows from Theorem 4.1 and Proposition 5.3. □
Let v = h1 (u). Then, ∀ a ∈ P1, h2 (a) (h1 (u)) ≤ h2(a)h2 (a) (h1 (x i )).
Note that (U, A) ∼
h
(V, B). Then, ∀ a ∈ A,
So ∀ a ∈ P1, h3 (a (u)) ≤ h2(a)h3 (a (x i )).
Since h3 ↾
U
a
is order-preserving, we have ∀ a ∈ P1, a (u) ≤
a
a (x
i
). This implies u ≼
P
1
x
i
, i.e., (u, x
i
) ∈ ≼
P
1
. Then
Then, ∀ a ∈ P2,
So (v, y
j
) ∈ ≼ h2(P2). This implies
Thus S (h2 (P1)) ⊑ S (h2 (P2)).
Conversely, suppose S (h2 (P1)) ⊑ S (h2 (P2)). Then ∃ j,
Note that (U, A) ∼
h
(V, B). Then, ∀ a ∈ A,
Since h3 ↾ U a is order-preserving, we have ∀ a ∈ P1, h3 (a (u)) ≤ h2(a)h3 (a (x i )).
Then, ∀ a ∈ P1, h2 (a) (h1 (u)) ≤ h2(a)h2 (a) (h1 (x
i
)). This implies (h1 (u), h1 (x
i
)) ∈ ≼ h2(P1). Then
This implies (h1 (u), h1 (x i )) ∈ ≼ h2(P2), i.e., ∀ a ∈ P2, h2 (a) (h1 (u)) ≤ h2(a)h2 (a) (h1 (x i )). Then, ∀ a ∈ P2, h3 (a (u)) ≤ h2(a)h3 (a (x i )).
Since h3 ↾
U
a
is order-preserving, we have ∀ a ∈ P2, a (u) ≤
a
a (x
i
). Then (u, x
i
) ∈ ≼
P
2
. So
Thus S (P1) ⊑ S (P2). □
S (P1)⊏ S (P2) ⇔ S (h2 (P1)) ⊏ S (h2 (P2)) ; S (P1) ⋈ S (P2) ⇔ S (h2 (P1)) ⋈ S (h2 (P2)).
(1) This follows from Theorem 5.7. □
The lattice-valued information system (U,A)
The lattice-valued information system (U,A)
A lattice-valued information system (U,A)
A mapping h1 : U → V is defined as follows:
Define a mapping h2 : A → B as follows:
Put U
a
i
= {a
i
(u
s
) :1 ≤ s ≤ 15}, V
b
j
= {b
j
(v
t
) :1 ≤ t ≤ 6}. Then
A mapping h3 : U* → V* is defined as follows: ∀ w ∈ U
a
h3 ↾
U
a
:
It is easy to verify that for any u ∈ U and a ∈ A,
Thus (U, A) ∼ h (V, B) with h = (h1, h2, h3).
It is clear that the original lattice-valued information system (U, A) is relatively complex and its image lattice-valued information system (V, B) is simpler. Theorem 5.5, Corollary 5.6, Theorem 5.7, Corollary 5.8 illustrate the fact that equality, dependence, partial dependence and independence between information structures in (U, A) and (V, B) are equivalent to each other.
Pick Q1 = {b1}, Q2 = {b2}, Q3 = {b3}, Q4 = {b1, b2}. Q5 = {b1, b3}, Q6 = {b2, b3}.
It is easy to verify that S (B) ≺ S (Q4) ≺ S (Q1) ≺ S (∅), S (B) ≺ S (Q5) ≺ S (Q3) ≺ S (∅), S (B) ≺ S (Q2) ≺ S (∅), S (Q4) ≺ S (Q2), S (Q4) ≺ S (Q3), S (Q5) ≺ S (Q1).
S (Q1) ⊏ S (Q2), S (Q2) ⊏ S (Q1) ; S (Q1) ⊏ S (Q3), S (Q3) ⊏ S (Q1); S (Q2) ⊏ S (Q3), S (Q3) ⊏ S (Q2); S (Q2) ⊏ S (Q5), S (Q5) ⊏ S (Q2); S (Q4) ⊏ S (Q5), S (Q5) ⊏ S (Q4).
Now, equality, dependence, partial dependence and independence between information structures of the image information system (V, B) are obtained. Thus, we may obtain equality, dependence, partial dependence and independence between information structures of the original lattice-valued information system (U, A).
In this paper, information structures in a lattice-valued information system have been considered as set vectors. Dependence and independence between information structures have been depicted. Properties of information structures are given by using the inclusion degree. Invariant characterizations of information structures in a lattice-valued information system under homomorphisms based on data compression have been obtained (see Table 4).
Invariant characterizations of information structures in lattice-valued information systems under homomorphisms based on data compression
These results will be significant for establishing a framework of granular computing in information systems. In the future, we will give some applications for dealing with knowledge discovery in lattice-valued information systems.
Footnotes
Acknowledgments
The authors would like to thank the editors and the anonymous reviewers for their valuable suggestions which have helped immensely in improving the quality of this paper. This work is supported by National Natural Science Foundation of China (11461005), Natural Science Foundation of Guangxi (2016GXNSFAA380045, 2016GXNSFAA380282, 2016GXNSFAA380286), The excellent Youth Foundation of Hunan Provincial Department of Education (16B141).
