Abstract
In order to improve the accuracy of the state prediction model, a dynamic Bayesian network state prediction model based on the relationship of prediction variables is designed. The prediction model of dynamic Bayesian network structure learning algorithm was improved, integrated into the Gibbs sampling algorithm model prediction, joined the predicted relationship between different factors affecting the node, is given based on the variable relationship between the dynamic Bayesian network structure design, using a moment on the different nodes and state influence factors to predict the probability distribution of the moment state nodes. The experimental results show that the model is simple in structure, more accurate than the traditional learning method of Bayesian network structure, and more practical.
Keywords
Introduction
At present, there are two main methods for learning Bayesian network structure. One is the scoring search [1] learning method, the basic idea is to traverse all possible structures, and then use a standard to measure each structure, and then find the best structure. This method is simple and standard, but it requires the operation of scoring function and the search of structure space to increase the complexity of operation exponentially with the increase of variables. Therefore, it is only suitable for local learning or heuristic learning with certain structure prior knowledge. The other is a learning method based on dependency analysis. The core idea of this method is: firstly, conduct statistical tests on training data sets, especially conditional independence tests, to determine the conditional independence between variables; Then, a directed acyclic graph is constructed by using the conditional independence between variables to cover as much of these conditional independence as possible. In fact, it is to find a measure that can best fit with a given instance data set, the former is called network parameter learning, the latter is called network structure learning. Therefore, structural learning is an important part of Bayesian network learning. In addition, through the variable conditions between independence test and
With the application of Bayesian network in computational biology, engineering and many other fields, the learning method of network structure in the construction of Bayesian model can be improved to play a greater role. In view of the shortcomings of the traditional learning methods, such as long computation time, difficult reasoning and low efficiency, this paper combines the above two algorithms to improve the Bayesian network structure learning algorithm based on the prediction relationship of prediction variables, which can more accurately infer the Bayesian network structure. After the Bayesian network structure is determined, we give the state prediction model, which can fully reflect the dependence between different influencing factors and state nodes at adjacent moments, as well as the dependence between influencing factors and state nodes at the same time, so as to improve the accuracy of the prediction model.
Dynamic Bayesian network
Dynamic Bayesian network is an extension of Bayesian network modeling, and also it is a series timely process. Its set of random variables can evolve over time and is a compressed representation of complex random processes. The widely used Markov chain [2] prediction is a typical application of dynamic Bayesian networks for prediction. A general dynamic Bayesian network has two characteristics:
The topology of the network is the same in each time slice, and the slices are connected by a similar arc; The network at time
Simplified initial network and transfer network.
Figure 1 simplified initial network and transfer network. Assume
Based on the dynamic Bayesian network characteristics 1), the evolution of dynamic Bayesian networks can be simplified to the initial Bayesian network.
In the term of the dynamic Bayesian network characteristics 2), the above formula can be simplified as
Bayesian network structure [3] learning algorithm based on predictive variable prediction relationship has two main parts:
Establishing initial Bayesian network structure based on absolute prediction ability; Adjusting initial Bayesian based on conditional prediction ability Network structure (increasing the missing arc, removing the extra arc, adjusting the direction of the arc).
Discrete influence factor node
Established
Based on the definition of the prediction relationship between the nodes of different influencing factors, we will give a description of the Bayesian structure [4] learning algorithm based on the prediction relationship of predictors. The specific steps are as follows:
Step 1 calculates the predictive power between nodes.
Step 2 determines the initial Bayesian network structure.
then oriented (
then oriented (
then random orientation.
Step 3 increases the missing arc, mincutset (
and
then oriented (
then oriented (
Step 4 removes the extra arc
then delete (arc(
Step 5 adjusts the direction of the arc
then oriented (
then oriented (
The discrete Bayesian network structure learning algorithm based on the predictive ability between variables has the following characteristics:
High learning efficiency and accuracy; Building a Bayesian network structure based on predictive ability to make the learned structure tend to be simple so that this can avoid over-fitting of data; It can process incomplete data, does not require variable ordering, and has anti-noise data function.
The state model [5] can be constructed under the condition that the above-mentioned Bayesian network structure and assumption parameters are known. The model proposed in this paper contains two nodes: state node and influencing factor node; and three kinds of dependencies: the dependence between different influencing factors and state nodes in adjacent time and the influencing factors. The dependency between the state nodes. There is only one state node and multiple influencing factor nodes at the same time. The general idea of the prediction F165 algorithm proposed in this paper is: when the value of the influencing factor is unknown at this moment, the Gibbs sampling prediction algorithm is used to predict the probability distribution of the influencing factors at that moment based on the value of the influencing factor at the previous moment. Then combine the value of the state node at the previous moment to predict the current state.
Gibbs sampling prediction algorithm
The Gibbs sampling prediction algorithm is a simple MCMC (Markov chain-Monte Carlo) method. The MCMC method is an important random method related to statistical physics, which is widely used in Bayesian inference and machine learning. The Gibbs sampling prediction algorithm has the advantages of simplicity and fast calculation speed, and is a local optimization algorithm. The algorithm for finding a model based on the Gibbs sampling prediction algorithm is as follows:
choose randomly
While (Count matrix
For (scan all sequences in sequence).
Select
Sampling update: calculating the fitness of the model
End For
End While.
Finally, the output convergence counting matrix is obtained, and the pseudo-calculation probability matrix is calculated
In order to measure the degree of influence between random events, we propose the concept of support between events, which supports positive values, regardless of the negative effects between events. Regarding the support between events, we give the following definitions:
Based on this, we give the definition of the degree of support of the influencing factor node to the state node: if the values of the
State prediction algorithm
Based on the construction of dynamic Bayesian network [6] and the predictive relationship, this paper uses the Bayesian network inference method and the Gibbs sampling prediction algorithm to predict the state. The steps of the complete state prediction algorithm are as follows:
After obtaining the value of the influencing factor at the last moment, the Gibbs sampling prediction algorithm is used to predict the probability distribution of the influencing factors at that moment; Calculate the support level of the influencing factor node to the state node; Calculating the support of the state node of the previous time to the state node at the current time; Using the support obtained in step 3) to correct the support results obtained in step 2);
End.
The dynamic Bayesian network state prediction model [7] is generated according to the constructed Bayesian network structure and state prediction algorithm based on the variable prediction relationship [8]. The model and the traditional model were used to predict the pond water quality, and then compared with the actual measured pond water quality to obtain the prediction accuracy of the dynamic Bayesian network state prediction model and the traditional model. The measured parameters of pond water quality factors include water temperature, dissolved oxygen, ammonia nitrogen, total phosphorus, total nitrogen, permanganate, etc. According to expert knowledge and consulting related books, the obtained data will be discretized accordingly. After the water quality is divided into five grades, the pre-processed parameter data is input into the prediction model to predict the model accuracy. The test results are shown in Table 1.
Experimental result data
Experimental result data
Experiments show that the dynamic Bayesian network based on predicted link state prediction model is not only simple in structure, and high accuracy, is larger than the traditional model, basic can achieve about 90%, and the typical cases, some cases with missing data, through the experiment proved that the model can deal with missing data, also can prevent data for fitting phenomenon, is one of the better state prediction model, is worthy of application.
This paper proposes a dynamic Bayesian network state prediction model based on prediction relationship. The construction of dynamic Bayesian network model mainly includes network structure learning and probability table construction. In the construction of the network model, firstly, we consider the prediction relationship between the nodes of different influencing factors and establish the Bayesian network structure [9]. On this basis, we not only consider the influence relationship between the state nodes but also introduce the state prediction algorithm is established by the dependence between the influencing factor node and the state node and the different influencing factors at the adjacent time. The proposed dynamic Bayesian network structure based on the prediction relationship can simplify the structure obtained by learning, prevent over-fitting, and has high accuracy. The state prediction algorithm uses the influencing factor node of the previous moment to predict the current state [10]. The probability distribution of the node is corrected by the state node at the previous moment, which greatly improves the accuracy of the state prediction model. In summary, the dynamic Bayesian network state prediction based on prediction relationship combines the advantages of dynamic Bayesian network structure and state prediction model based on prediction relationship, and improves the prediction accuracy of the model.
Footnotes
Acknowledgments
Supported by “the Fundamental Research Funds for the Central Universities” (2242018k30002); National key research and development program of the 13th five-year plan (No. 2018YFF0213601); Supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (Grant No. 18KJB510038); supported by the Funds of Nantong Applied Basic Research Plan (GY12017015).
