Abstract
Formal concept analysis (FCA) become an alternative approach to extract and represent knowledge of real world systems. That knowledge can be obtained from implications rules extracted of concept lattices formed by ordered formal concepts. However, in complex systems the number of formal concepts can be large. To deal with this complexity of the FCA, concept reduction techniques can be applied in order to balance the quality of information, and the computational cost for generating and handling the lattice.
In this paper, we develop a novel approach to represent the behavior of physical processes through qualitative rules based on proper implications (minimum representation of the data) extracted from the reduced concept lattice.
As a case study, the cold rolling process was considered. This process characterize by the strong non-linearity among its parameters. The results show that the qualitative behavior of the rolling process is preserved even when the reduction techniques are applied.
The approach can be used to understand the relationship between process parameters through implication rules under different operating conditions of a process. The paper discusses some generic procedures that can be adopted to apply this approach to other industrial processes.
Introduction
Formal concept analysis (FCA) [81] is a field of mathematics based on concept lattice theory, which has been proposed as an alternative to represent and extract knowledge of real world systems. FCA considers a data set containing instances, variables and incidence relations among them to represent the systems or physical processes. The extracted knowledge can be obtained from implications ou association rules extracted from concept lattices, in order to describe the qualitative behavior of the process. In this way, it is not necessary a more detailed study about the physical, economics, biological, etc. principles that control the properties of systems [40, 24, 71, 56, 88, 87].
In FCA, data set is represented by a binary relation
The great potential of FCA is provided by the concept lattice, a complete lattice formed by the set of organized formal concepts extracted from the formal context, and an implication rules set
Evidencing from the experience for industrial applications, any new approach, technique or method to extract knowledge from data must seek representations based on implication rules to feed an supervisor system or by a symbolic representation of easy understanding especially by a human operator. FCA gathers these characteristics and allows to change the detail level (granularity) of the behavior implication rules
In this work, we develop a novel approach to represent the behavior of physical processes through qualitative rules based on proper implications, extracted from the reduced concept lattice. As a case study, we consider the cold rolling process [60], which has a non-linear behavior on several parameters: input thickness, front and back tension, average yield stress and friction coefficient. It should be noted that these relationships are fundamental to the rolling process [88]. Any change on either of them will cause changes on the rolling load, consequently on the outgoing thickness. The FCA is able to capture these changes and to represent them in a simple and easy manner by extracting the knowledge present in the data set in order to understand the behavior of physical process. The results shown that the relationship between certain variables, such as output thickness and rolling load; and between friction coefficient and rolling load are preserved, even when the reduction techniques are applied.
The motivation for this research is to capture knowledge from reduced conceptual lattice, over physical systems through qualitative rules, to provide the understanding of the process by less-experienced operators. Moreover, it provides information about the analyzed process that can be used in the design of on-line control and supervision systems. Thus, this paper discusses, in a generic form, the procedures to be adopted to apply this work to other industrial processes.
The rest of the paper is organized as follows: Section 2 proposes the use of the FCA as an alternative to represent and extract knowledge of physical processes through reduced conceptual lattice. In Section 3, a case study on the cold rolling process is presented followed by Section 4 with the contributions and the conclusions of this work.
Related work
There are many techniques for concept lattice reduction, each with different characteristics. Dias and Vieira [21] proposed the classification of techniques for concept lattice reduction in three groups: redundant information removal, simplification, and selection.
The techniques of redundant information removal seek to produce a concept lattice isomorphic to the original. In general, they aim to find a formal context with a minimum number of objects or attributes keeping unchanged the structure of the concept lattice [91, 75, 76, 79, 50, 73, 61, 77, 74, 59, 58, 47, 46, 53, 48, 49]. In general, those techniques operate by reducing attributes, and so they are called attribute reduction techniques.
In [80], the authors classify the attributes as absolutely necessary, relatively necessary or absolutely unnecessary. An attribute is considered absolutely necessary if it is present in all minimal sets. It is considered relatively necessary if it is present in at least one minimal set. And finally, if an attribute is not in any minimal set, is considered unnecessary.
In [53], the author shows that the classification of attributes proposed by [80] is independent of: concept lattices, property-oriented concept lattices and object-oriented concept lattices (property-oriented concept lattice and object-oriented concept lattices are originated from rough set theory [57]). As a consequence, only one of those three types of lattices needs to be considered for obtaining the minimal set of attributes.
The techniques of simplification seek an abstraction of the concept lattice, i.e. a high-level overview that preserves only the essential aspects [28, 17, 7, 4, 66, 67, 41, 30, 40]. The technique is one that, from a formal context or a concept lattice, abstracts non-essential differences between concepts, objects or attributes.
Clustering techniques can be applied to sets of objects, attributes or formal concepts. In [17] clustering of objects is performed in order to reduce the dimensionality of concept lattices.
The techniques of selection act by selecting formal concepts, objects or attributes based on a relevance criterion [42, 43, 64, 2, 63, 70, 8, 9, 12, 10, 62, 68, 5, 90]. Some authors propose to use all user background knowledge to create constraints on attributes.
The method proposed by [5] assigns a weight to each attribute in order to express its relevance, and then selects formal concepts considered relevant.
In this work, in order to extract qualitative behavior from reduced concept lattice, we consider the selection techniques.
Formal concept analysis
In the FCA, the data obtained from the operation of a physical process are presented as a formal context, a triple
Example of formal context
Example of formal context
Given a set of objects
The notation
The set of formal concepts can be ordered by the partial order
All line diagrams in this paper were drawn using the Conexp software [84].
Concept lattice originated from the formal context.
The implication rules are a specialty of the notion of association rules, because, in general, the first can be seen as a special case of the second, with 100% confidence. Differently of the association rules, the implication rules support inference axioms, although the associations are more flexible than implications because they permit discovery of approximate dependencies.
In order to describe the qualitative behavior of the process, in this work, we are interested in implications that we are certain that will occur (confidence 100%). With association rules, we do not have certain that given the premise the conclusion will happen. This makes it difficult to understand the behavior of the physical process.
On the other hand, it is known that, to find the association rules, for example, applying the apriori algorithm, the generation of the set of frequent itemsets is computationally expensive. Thus, FCA seeks for implications by means of the formal concepts, which are pairs of objects and attributes. In this case, the size of the search space could reduces and the computation may be more efficient [15].
Given a formal context
.
Given a formal context
An implication
.
If
.
An implication
.
An implication
.
A set of implications
.
A set of implications
.
An implication
From the concept lattice of Fig. 1, one can extract the following implication set
For physical process will be convenient that each implication represents a minimum condition of an operating condition, which would allow an easy understanding of the qualitative behavior of the process. For this, we will require that the complete set of implications
the right hand side of each implication is unitary: if superfluous implications are not allowed: if specializations are not allowed, i.e. left hand sides are minimal: if
A complete set of implications in
.
Let
However, in some cases we must extract such implications from a subset of formal concepts, which forms a given section of a concept lattice, and that section might not form a complete concept lattice [64].3
The sets of objects or attributes of the considered concepts are not a closure systems.
Considering a formal concept
.
It will not have specializations of its implications.
.
The right hand side of its implications will be unitary.
.
It will be consistent with the original derivation operators when applied to only the elements of the concept.
.
It will have only implications with support greater than zero.4
Implications with support equal to zero are not inconsistent, but possible to logically exist. It should be noted that, in applications related to physical processes, the most appropriate proper implications are those with support greater than zero.
.
The set of implications must reflect the lattice formed by the actual set of chosen concepts.
Propositions 1–4 are guaranteed by the proper use of minimum generators [11] as lhs (left hand side), and property 5 is guaranteed by ensuring those properties globally (topological ordering given by the cover relation of the lattice, smallest intentions first). The order in which the set of concepts is explored is important to ensure that the lhs of an implication is as small as possible.
Recently, despite the computational cost, minimal generators have gained prominence in FCA. Its importance is due, mainly, to the fact that they favor the principle of minimum description length (MDL), i.e. the best hypothesis for a given set of data is the one that leads to the best compression of the data [31].
In order to deal with the complexity of a concept lattice, techniques for concept lattice reduction [21] seek a balance between quality of information, computational cost to generate and maintain the lattice and easiness of analysis [5, 53, 63, 47, 43, 70, 90, 49, 52, 16, 83, 6].
In this subsection we present a brief review of reduction techniques. Among the various reduction techniques discussed and classified by Dias and Vieira [21], we selected the techniques stability [43] and minimal generators [5] belonging to the selection class. The stability technique is the main technique of reduction observed in the literature and presents satisfactory results [43, 38, 64]. On the other hand, the selection technique, based on minimum generators, has gained prominence in the FCA. Minimum generators represents, without loss, the knowledge present in the formal concepts. A brief description of the techniques is explained in the following subsections.
.
A technique for concept lattice reduction is one that aims to reduce the concept lattice complexity, both in terms of magnitude and inter-relationships, while maintaining relevant information.
Reduction technique based on the stability index
This technique proposes the stability index, which aims to measure how much the intention of a concept depends on its extension [43]. The stability index
In this paper, the stability index is applied in the selection of formal concepts that represent physical processes. Therefore, the algorithm proposed in [64] was used.
Reduction technique based on minimum generators
Formal concepts can be represented in a compact manner through the minimal generators [29]. Let
The selection of formal concepts from minimum generators was proposed in [5]. For each attribute of the formal context a weight is assigned in order to describe its relevance (
Minimum generators allows the selection of formal concepts that best represent the physical process under analysis.
Extraction and evaluation of qualitative behavior of physical processes using proper implications
For the understanding of physical processes, the representation of qualitative behavior through implications may be appropriate because it is easy to understand for less experienced operators. The set of implications to be extracted corresponds to proper implications. In this paper, an implication
The proper implications must be evaluated in relation to their expressiveness. For this, suppose
if if
The cases
where
The cold rolling process has been highly automated and reached a high standard industrial level. In this process, the objectives are always larger productivity and better quality of the final product in terms of thickness and shape. The literature shows several non-conventional strategies to reach these objectives, as neural networks [87, 44, 54, 14], fuzzy logic [34] and genetic programming [69]. The only condition for the new strategies is to represent correctly the behavior of the process in a quantitative and qualitative way for different operating conditions so that they can be used in the design of on-line control and supervision systems.
The mathematical model of the cold rolling process can be obtained through the study of variables involved in the process, as Alexander’s mathematical model [1] considered one of the most complete. However, the model is known by requesting numerical solution with significant computational effort. A more natural alternative to understand this kind of process is through symbolic representation, expressing the cause-effect relation among the several parameters through qualitative rules [18].
Into classical theories there are several models to calculate the rolling load required for the deformation process. These models, Eq. (2), are non-linear functions of several parameters. It is possible to observe that any change in the input thickness
where
During the rolling process, the cylinders are compressed against the strip by a force transmitted by the back rolls. Since the rolling mill is not perfectly rigid, the output thickness can be expressed by the elastic equation of the rolling mill, Eq. (3).
where
In order to illustrate the behavior of a rolling mill, Fig. 2 shows variations in the nominal operational condition due to the variation in several parameters [25]. Curve (I) corresponds to an increment in the back and/or front tensions of stress. Curve (II) represents a decrement in the input thickness; curve (III) corresponds to the operational condition; curve (IV) represents an increment in the output thickness and curve (V) shows possible increment in the friction coefficient and in the average yield stress or a decrement in the back and/or front tensions.
Operational condition of a rolling mill.
A synthetic database representing the rolling process was provided by the Data Science Research Group (DSRgroup).5
Data for the material and the rolling mill were defined as:
The reference database has ten million records, a high number that makes it difficult to apply the FCA. In [89], statistical techniques was proposed for the construction of representative datasets, maintaining the generalization capacity of the model constructed from the data. Using these approaches, a new equation
This total of samples is sufficient to report the results with 90% confidence [32].
For each variable were selected records that contained the maximum, minimum and near average values; The register representing the nominal operating condition was chose; The remaining elements were selected randomly in order to reach the total of 1691 records.
In this paper, the objective is to represent, through the FCA, all the sample space of the data and to demonstrate the potentialities of the FCA to represent the process qualitatively. The Table 2 displays the maximum and minimum values of each variable for the total set of selected data.
Maximum and minimum values for each variable of the rolling process
Following the previous procedures 10 samples were selected. Among the selected samples, half was used for modeling through the FCA (formal context, concept lattice and extraction of proper implications) and the other half used for validation; i.e., analysis of the representativeness of the implications.
The values of the variables of the rolling process are real numbers, as can be seen in Table 2. However, the FCA works with discrete values. In this case, some alternatives are the use of pattern structures [37], FCA fuzzy [3] or discretization of the continuous values [82]. Among these alternatives, it was chosen to use a discretization process proposed in [88]. An equidistant discretization was performed on the data, applying a total of 15 cut-off conditions in each variable, which resulted in 105 attributes (7 variables divided into 15 intervals each). Table 3 shows the discretization applied in the rolling load
Discretization applied in the rolling load
, friction coefficient
and output thickness
Discretization applied in the rolling load
Applying the discretization process in the 5 samples selected to construct the models, a set of formal contexts
where
An aspect that needs to be highlighted is the one related to the number of intervals (discretization). The number of intervals for each parameter has an influence in the number and quality of the rules obtained. So, the number of intervals can be considered as a regulatory mechanism of the detail level of the rules obtained. However, this regulation is highly dependent on the quality of the data. On the other hand, it is important to observe that the discretization interval (to build the formal context) should not be inferior to the one of the sampling interval considered to collect data of a physics process. This is due to the fact of the extracted rules in this interval cannot be significant.
The formal contexts built from the data samples were named contexts #1 through #5, respectively. An example of implication obtained from the context #1 is
The algorithm proposed in [22] was used to extract proper implications with support greater than zero through the concept lattice. This algorithm is part of the EF-Concept Analysis package [20] in evolution and available at
The Table 4 represents the number of formal concepts, their proper implications, and the representation rate
Summary of the results related to proper implications for the original validation contexts
On the other hand, the rolling process involves non-linear relations that make the analysis difficult. Fortunately, some process behaviors are known as, for example, the relationship between variables:
Rolling load rolling load
Rolling load and output thickness have inverse behavior, i.e., the smaller the output thickness, the greater the rolling load. On the other hand, the higher the friction, the greater the rolling load required in the production process. These relationships are considered fundamental in the rolling process [88].
These behaviors can be observed if we consider all the proper implications
In this case, 2 implications,
The Fig. 3a–e presents, from an easy-to-view temperature graphic, the behavior of the rolling load variables
For example, the position (
Mapping the relationship between rolling load (
Similarly, for the output thickness and rolling load, the Fig. 4a–f analyze the relationship between rolling load
Mapping the relationship between rolling load (
Reductions in the number of formal concepts, quantitative of proper implications and representativeness caused by the stability index for the context #1.
In the next section the same procedure will be applied, after the usage of stability reduction techniques [43] and minimum generators [5]. In this part, the knowledge obtained through the FCA prior to the application of reduction technique will be considered a baseline.
Initially, the selection of concepts is applied through the criterion of the stability index. Figure 5 presents the reductions in the number of formal concepts, the number of proper implications and the representativeness as a function of the stability index for the context #1. First of all, note that different threshold (or cut points) values can be chosen and the optimal values are, obviously, dependent on the data. Here, the threshold were indicated by domain expert. Notice that, different threshold values can be chosen and the optimal values are dependent on the data. In this work, the threshold was indicated by the domain expert, which seeks the threshold more adequate to obtain more representative rules that satisfy the previous knowledge about of the process being analyzed. Using a stability index of 0.1 no formal concept is removed. Between 0.13 and 0.24 the number of formal concepts is reduced to 20918 (approximately 1%). From 0.25 to 0.37 the number of formal concepts is reduced to 11417 (45.7%). The selection of the stability index is dependent on the intended application. A stability index equal to 0.37 was chosen based on observed reductions. Values less than 0.37 show a reduction in the number of formal concepts very low, and greater values provide a reduction of more than 50%. This value will be used in all formal contexts.
In Fig. 5, it can also be observed the number of proper implications and the index of representativeness. Note that certain reductions increase the number of proper implications. When a concept that generates its proper implications is removed, its superior neighbours can generate more specific and more signifi- cant implications. Consequently, the number of proper implications may increase [22]. This increment is the result of the application of the algorithm proposed in [22] to extract proper implications from a subset of formal concepts that do not form a conceptual lattice [64]. Regarding the representativeness (
The Fig. 6a and b present the relationships between rolling load and output thickness and rolling load and friction coefficient for the proper implications of the reduced formal contexts considering the average values. Comparing to the ratios prior to the application of the reduction technique, shown in Figs 3 and 4, it is noted that the behavior of the process was preserved for the considered operating conditions.
Mapping the average relations between parameters of the rolling process through proper implications obtained after the reduction through the stability technique which consider average values.
Next, the concept selection is applied through the analysis of minimum generators. Initially, it is necessary to determine the relevance of attributes; i.e., the relevance of the variables of the rolling process and, consequently, of the attributes
Figure 7 presents the reductions of the number of formal concepts, the number of proper implications and representativeness caused by the minimum relevance for the reduction technique based on the mini-mum generators for the context #1 using the mentioned set of weights. Up to a minimum relevance of approximately 0.1 no reduction is achieved. Using between 0.11 and 0.6 reductions range from 1% to 95%. It is observed that, as with the stability technique (Fig. 5), no large reductions were found when there was a small variation in the minimum relevance. Among the reduction configurations evaluated, a minimum relevance of 0.4 was chosen. This value results in a reduction of 37.69%, 37.73%, 37.66%, 37.39% and 37.89% in the number of formal concepts for contexts #1 through #5, respectively.
Reductions in the number of formal concepts, quantitative of proper implications and representativeness in function of the minimum relevance for the reduction technique based on minimum generators for the context #1.
A 37% reduction in the number of formal concepts resulted in an average of 88,381 proper implications and an average representativeness rate of 98%. Obviously, the mentioned values are lower than those obtained through the FCA without applying the reduction. However, they are better than the results presented by the stability index for a similar reduction in the number of formal concepts.
The Fig. 8a and b present the relationships between rolling load and output thickness and between rolling load and friction coefficient through proper implications obtained after reduction by the minimum generator technique considering the average values. Even before an average reduction of 37% in the number of formal concepts, the previously analyzed behavior of the rolling process was preserved.
Mapping the average relations between parameters of the rolling process through proper implications obtained after reduction by the technique of minimum generators considering the average values.
We propose to use proper implications extracted from reduced concept lattice to represent the behavior of the process being studied in a symbolic and qualitative form. In proper implications set, the right hand side of each implication is unitary, superfluous implications are not allowed, specializations are not allowed, i.e. left hand sides are minimal. This set is appropriate because each implication represents an operating condition of the physical process. In other words, each implication expresses that a minimal conjunction of ranges of values for certain variables imply in a range of values for a particular variable. In the case study, the cold rolling process was considered. The relation between the variables output thickness and rolling load; and between friction coefficient and rolling load were analyzed through the extracted implications from original and reduced concept lattice. It is known that output thickness and rolling load are inversely proportional. On the other hand, friction coefficient and rolling load are proportional. The results demonstrated that even if using an average reduction of 40% in the number of formal concepts, reduction techniques based on minimum generators and stability are able to select the formal concepts that best represent the process. The proper implications generated by the conceptual reduced lattice by such techniques represented, satisfactorily, the analyzed part of the rolling process.
It is important to note that the FCA approach extracts relations among the parameters involved in the process, independently of antecedent and consequent. This means that FCA tries to show the qualitative relations learned from the data. Note also that FCA seeks an implication base from which other implication rules can be deduced.
In this work, formal concept analysis (FCA) has been applied to extract and represent knowledge of physical process. The FCA can be used to understand the qualitative behavior processes without the need of complex mathematical models. From a set of collected data, it is possible to apply the FCA and use the proper implications to bear an intelligent system for on-line control and supervision.
We propose use proper implications extracted from reduced concept lattice to represent the behavior of the process being studied in a symbolic and qualitative form. In proper implications set, the right hand side of each implication is unitary, superfluous implications are not allowed and specializations are not allowed; i.e., left hand sides are minimal. This set is appropriate because each implication represents a condition of operation of the physical process. In other words, each implication expresses that a minimal conjunction of ranges of values for certain variables imply in a range of values for a particular variable.
As case study, the cold rolling process was considered. How the relation between the variables output thickness and rolling load; and between friction coefficient and rolling load are previously knowledge of domain expert, it was possible to validate the extracted implications from original and reduced concept lattice. It is important to note that the previous knowledge is essential to validate our approach. We highlight that this knowledge is based on physical principles, which govern of the process, which can be obtained by means of modeling or simulation with incomplete knowledge as proposed by Qualitative Reasoning theory [39].
The results shown that even if using an average reduction of 40% in the number of formal concepts, reduction techniques based on minimum generators and stability are able to select the formal concepts that best represent the process.
The FCA produces qualitative behavior rules where the dependent parameters of the process can or cannot be the consequent variables of the rules extracted. This can be useful in industrial processes for automation, control and design of supervisory systems, where it is important to consider rules that have the dependent parameters as consequent. This can help the operator to identify which parameters can change the operating condition, such as the controllable variables for a control system.
It is important to note that the FCA approach extracts relations among the parameters involved in the process, independently of antecedent and consequent. This means that the FCA tries to show the qualitative relations learned from the data. Notice also that the FCA looks for a implications base from which other implication rules can be deduced. the FCA produces qualitative behavior rules where the dependent parameters of the process can or cannot be the consequent variables of the rules extracted. This can be useful in industrial processes for automation, control and design of supervisory systems, where it is important to consider rules that have the dependent parameters as consequent. This can help to identify which parameters can change the operational condition, such as the controllable variables for a control system.
As future works we intend to apply the FCA in physical process with different characteristics. Finally, some FCA extension has recently been applied for complex data based on rough set [51, 85, 33, 78]. It is important to highlight the work of [78] which proposes a formalism for the stages of preprocessing and data mining, essential stages of a KDD process. The proposal is an extension of the Pawlak’s Information System (rough set theory) which establishes that knowledge is an ability to classify objects, which intrinsically the techniques, methods and algorithms of data mining seek. In this work we look for the objects (operating conditions) that can characterize a behavior of the physical process under study. We pretend to use these extensions in order to extract knowledge from the industrial process.
Footnotes
Acknowledgments
We would also acknowledge the financial support received from the Foundation for Research Support of Minas Gerais state, FAPEMIG; the National Council for Scientific and Technological Development, CNPq; Coordination for the Improvement of Higher Education Personnel, CAPES.
Nomenclature
