Abstract
Transmission lines’ condition monitoring is an important part of smart grid construction. To ensure fast and efficient transmission of data, many mash-based wireless networks devices are adopted to collect status information. Since these nodes are exposed to the natural environment, vulnerable to damage, so it is very necessary to be predicting nodes’ fault. However, these mesh nodes are affected by a variety of complex and time-series factors, and traditional models are difficult to achieve effective failure prediction. To solve this problem, this paper proposes a self-adapting multi-LSTM ensemble regression model for transmission line network’s wireless mesh node failure prediction (MLSTM-FP), through establishes the corresponding relationship between similar time factors and LSTMs, the proposed model can realize multi time series data self-adapting and accurate failure prediction of transmission line network’s wireless mesh nodes, The experimental results show that the proposed method has a good prediction ability than traditional methods.
Introduction
Transmission lines’ condition monitoring is an important part of smart grid construction. To ensure fast and efficient transmission of data, many mash-based wireless networks are deployed [1]. In a real operating environment, the nodes of the mash network may be continuously eroded and destroyed by various natural/human factors, resulting in the failure of the network nodes; therefore, the prediction of transmission line network’s wireless mesh nodes is very important for the grid scientific management [2]. It is obviously impossible to adapt to the inspection of a large number of nodes by means of manual one by one, and it is necessary to introduce an artificial intelligence automated method [3]; using this technique to prediction can grasp the mesh nodes’ life and damage trends and has good research value.
There are a lot of research achievements in the prediction of power equipment status: Through wavelet transform, algorithms can extract some key feature from the existing load data, thereby improving the accuracy of the forecasting [4, 5]. To cope with more parameters generated by wavelet transform, data selection method and multi-model integrated method can be to enhance the stability of forecasting [6]. Based on wave analysis, various models such as neural networks, support vector machine and hybrid-model are introduced into the field to improve the quality of prediction [7, 8]. Due to the complexity characteristics of the short-term load forecasting, it is necessary to explore deeper features from historical data, so in recent years, deep learning technology has received extensive attention in this field. By constructing the input data as feature maps, convolutional neural networks (CNNs) can perform forecasting and obtain better accuracy [9]. Based on the introduction of fuzzy time series, feature maps can contain more descriptive information and further improve the ability of CNN to cope with more complex load date [10, 11]. For the data contains obvious time series characteristics, the memory mechanism of the item contained in the neural network models are very important [13]. Recurrent neural networks (RNN) and long short-term memory networks (LSTM) can be used in load forecasting, and deep character of items and relationship between nods can be obtained from complicated data [16].
Although above researches have achieved good results in prediction of power equipment status, but mesh nodes are affected by a variety of complex and time-series factors, and traditional algorithms are difficult to achieve effective prediction [12]; When factors hidden in data are complicated but corresponding training data’s amount is limited, single times series model will tend to over-fitting with training data, thereby reducing the forecast ability of the model [14, 15]; Therefore, it is necessary to construct an adaptive mechanism of input data, and create sub-models according to different time series to prevent over-fitting and improve the generalization ability of the model. In this paper a self-adapting multi-LSTM ensemble regression model for failure prediction of transmission line network from wireless mesh nodes’ data (MLSTM-FP), through establishes the ensemble relationship between similar time factors and LSTMs, the proposed model can realize multi time series data self-adapting and accurate failure prediction of transmission line network’s wireless mesh nodes, The experimental results show that the proposed method has a good prediction ability than traditional methods.
Methodology
Long Short Term Memory network
Long Short Term Memory networks (LSTMs) are an effective nonlinear recent neural network. Owning to the excellent ability in processing series and nonlinear relationship of data, these network models are widely adopted in time series data analysis application, The LSTM can solve the long-term dependency problem of the traditional time series processing algorithm and achieve higher prediction accuracy. LSTM has the structure shown below.
Structure of the LSTM.
As shown in Fig. 1, the LSTM consists of a number of repeating modules, each module corresponding to the input
where
where
where
Based on this output state, the output of the module is:
Through this process, the process of transferring the memory of the previous node to the next node is realized, and the weight of each part is optimized by the training data to achieve the purpose of sequence data analysis. It can be seen from the formula that each module of LSTM contains two parts, forgetting and memory, which can realize the memory process of input values within a certain sequence range. This feature is very important in the process of sequence data processing. However, because the operation environments of mesh nodes are affected by many factors; so the frequency and amplitude of data corresponding to factors are also vary greatly. The use of a single LSTM is likely to cause confusion in different sequences. When using a larger complex neural network, this problem will cause over-fitting in training data, Therefore, new methods need to be proposed to solve the current problem.
This paper proposes a self-adapting multi-LSTM ensemble regression model for failure prediction of transmission line networks from wireless mesh nodes’ data (MLSTM-FP). The main goal of this method is to use multiple LSTMs to independently process the influencing factors of condition, and then integrate the outputs of these LSTMs to predict the mesh nodes’ condition. The MLSTM-FP model’s structured are shown as below.
Structure of MLSTM-FP.
As shown in above figure, the input of MLSTM-FP model is multi-factor time series data of mesh nodes’ historical data
(1) Self-adapting separator component: It duty is to divide
The weight
The above algorithm can adaptively process the input historical data, and separate whole data set into groups according to the characteristics of each factor.
(2) LSTM group component: After previous component, the MLSTM-FP model create
(3) Time series ensemble component: This component ensemble the output of previous LSTM group component, the detail of algorithm is:
Through DE-alg, the integration of time series data can be realized.
(4) Regression component: In this component a regression LSTM model is constructed, this model corresponding to
Through the above four components, MLSTM-FP can obtain a group of time series factors with different trends and frequencies. These factors are mapped into a unified multi-dimensional sequence data after the three components of model. This sequence data includes the mesh nodes’ key characteristics, and based on these characters, the regression model achieve more accurate and stable predictions.
The overall process of the method is shown as follows.
Overall process of the method.
As can be seen from Fig. 3, the overall process of method has four steps:
Input data: Select typical transmission line network’s wireless mesh nodes, analyze factors that may affect their characters changes and store data. Data preprocessing: First, normalize all the data to map them into [0, 1], create a data with times series grid of 96 sampling points per day, and interpolate all factor data so that they can input and output together; Integrate process of preprocessing obtains a data preprocessing model. Training: Input data for training and obtain the MLSTM-FP model. Forecasting: Input mesh nodes’ factors use preprocessing model and MLSTM-FP model process factors sequentially to obtain prediction result.
Program realization and dataset
This paper realize all the algorithms in Python 3.6, where the deep neural networks is implemented by TensorFlow, the grouping and data preprocessing use Numpy and Scikit-learn extension packages. All algorithms were executed on an Intel Xeon/16G/GTX 1070 computer, and the all experimental results were obtained on this computer. This paper introduces the data of 100 mesh nodes of transmission line network in the of year 2016, 2017 and 2018 in a region of northern China, in which 2016 and 2017 are used as training data and 2018 as test data. There are many factors that influence the status of a mesh node. The input data includes the following contents:
Temperature change data: Daily, hourly, weekly temperature change range, minimum and maximum temperature. Rainfall data: The average rainfall in the area. Light intensity data: The average daily light intensity, the light intensity at two o’clock in the afternoon. Wind speed data: The average wind speed of the area. Equipment usage frequency and date: Equipment data transmission amount, holiday and working day parameters.
The typical characteristics of the above factors are that the features and frequencies are different greatly. Integrate these factors into a single multi-dimension data for time series analysis may cause analysis confusion and thus reduce the quality of prediction, so the method proposed in this paper is needed for effective processing these factors.
In order to verify the effectiveness of the algorithm, the algorithm is compared with LSTM, LSTM
where
Evaluation indicators comparison of 5 algorithms
As can be seen from the Table 1, due to the lack of directly processing sequence data ability, the NN (NFAFA
Overall accuracy comparison of five methods
As can be seen from the Table 2, the accuracy of NN and SVM processing is often low, and the accuracy is significantly improved after the introduction of LSTM, but the accuracy fluctuation is more obvious when there are fewer samples; It has the ability to process sequence data, and the accuracy is improved compared with SVM. The MLSTM-FP proposed in this paper has achieved good results under each sample size. The accuracy trend of the five methods is shown in the Fig. 4.
Accuracy trend of the five methods.
It can be seen from the Fig. 4 that NN and SVM always archive lower accuracy than deep models, this shows that the deep models in the field of failure prediction are superior to traditional shallow models. MLSTM-FP not only achieves the highest detection accuracy among the five algorithms, but also its accuracy changes are relatively stable; especially in the case of small sample size, the accuracy has always been improved; It is proved that the MLSTM-FP algorithm is also superior to other four traditional methods in stability.
Transmission line network’s data is affected by a variety of noise factors. These factors will change with changes in the real application environment. Adapting to these influencing factors has a greater impact on the accuracy of failure prediction. The noise information simulated by the laboratory cannot exhaust all possible factors, so this situation leads to the lower accuracy of the traditional model in practical applications. The ensemble model proposed in this paper perfectly solves this problem: Each sub-model has predictive capabilities, and the solution space received by each sub-model is different, so the response of each model to noise is also different; multiple models making decisions together these models can greatly enhance the anti-noise capability of the overall model, thereby obtaining stable and effective prediction capabilities. It can be seen from Table 2 and Fig. 4 that MLSTM-FP has great advantages in accuracy and stability, especially when samples number is small. At the same time, deep learning models usually require more samples to obtain good results. Using ensemble is a very good strategy adapt with small sample size. The entire model does not over-fit with a small sample size; these results proved the real application value of MLSTM-FP.
Summary
The lifetime of the mash network nodes on the transmission line is affected by many factors. How to deeply extract the features behind the data and make effective forecasting is very important. This paper proposes an MLST-FP model that uses multiple LSTMs to map complex factors to a uniform series data, and then integrate data and perform regression analysis. The experimental results show that the proposed method is superior to the traditional method in prediction accuracy and stability. It has good practical value in the field of energy management and grid fault detections.
Footnotes
Acknowledgments
This work was supported in part by the Scientific and Technological Planning Project of Jilin Province (20180101057JC). A Project Supported by Scientific and Technological Planning Project of Jilin Province (2018C036-1).
