Aiming at the traditional topology identification based on steady-state operation, a topology identification method considering power system transient data is proposed. Firstly, the power system is dynamically modeled. Through theoretical derivation, the feature vectors that can reflect the topology information are extracted, and the topology identification problem is transformed into a sparse vector recovery problem. Based on compressive sensing theory, the orthogonal matching pursuit algorithm is adopted to solve the sparse recovery problem. Since the identification process is bidirectional, there may be some identification conflicts. For this consideration, an optimization strategy is introduced to improve the original algorithm. The influence of each algorithm parameter on the topology identification performance is then studied. By considering the transient process, a large amount of effective identification data was obtained in only a few processes. Finally, a simulation test on the proposed algorithm on the IEEE standard 22-bus power distribution system is conducted. The results show that the improved algorithm has outperformed the traditional algorithm.
With the continuous development of power technology, power transmission management is playing an increasingly important role. A good management of the power distribution system is closely related to the real-time monitoring of its status, while the reasonable monitoring of the power distribution system requires obtaining its network topology structure in advance. However, the topology of power distribution system may change due to the aging of the line, the action of the switch and etc., which brings obstacles to the real-time monitoring of the system. In addition, the topology identification of the power distribution system is of great significance to its fault line identification, power flow calculation, power distribution and stability control [1, 2, 3], which further implies the research value of this subject.
The topology structure of the power distribution system resembles many similarities with complex networks. For example, the movement of nodes follows similar dynamic equation constraints, the coupling state of the network is similar, and the network structures both have sparse characteristics [4, 5]. Therefore, many methods of researching complex networks can be used for reference in the study of power distribution systems. At present, many scholars have conducted related researches on the topology identification of complex networks. There are two most commonly used methods for this type of research. One is based on network synchronization, and the other is based on statistical theory. Guanrong Chen and etc. studied the global synchronization and asymptotic stability of complex dynamical networks. Based on the reference state, they obtained sufficient conditions for global synchronization and stability. On this basis, Shuna Zhang, Xiaoqun Wu and etc., realize the restoration of the complex network topology by adopting a method of constructing external network synchronization [6, 7]. This type of method requires constructing an external network as the same size of the original network, and the external network is completely consistent with the target network through design and drive. Therefore, this kind of method is easier to implement in small-scale networks. But when the network size is too large, the convergence speed of the algorithm will get slow. In addition, if the target network falls into internal self-synchronization during the synchronization process, it will cause the external network to fail to synchronize with the target network [8]. Scholars such as EM Lourenco researched the power grid using a Bayesian method, and developed a Bayesian-based hypothesis testing program. The program can be used by combining standardized Lagrangian multipliers with topology error handling. This method is used to replace the need to run the repeated state estimator for hypothesis evaluation, which reduces the possibility of topology error identification [9]; the method based on Bayesian estimation requires statistical methods to obtain prior information about the network topology, but In many cases, the prior information of the network topology is not available, especially for the power distribution system with constantly changing topology.
In recent years, a topology identification method based on sparse recovery theory has appeared in the field of topology identification research. The method fully utilizes the sparse characteristics of the structure of the power distribution system, and does not require prior information of the network. Also, the convergence speed of the algorithm is very impressive. Babakmehr Mohammad and other scholars combined the concepts of compressed sensing and graph theory, and proposed a smart grid topology identification and monitoring solution [10, 11]. But their research is based on the steady-state operation data of the power distribution system. While in the actual application process, a steady state of the power distribution system may last for a long period of time, which will result in small differences in the data collected during this period. Thus, there is little effective identification data that can be obtained. In other words, it takes a long period of time to obtain sufficient data for topology identification, and it is necessary to ensure that the network topology does not change during the acquisition of these data, which greatly limits the real-time and practicality of this method.
In this context, the data collection process is greatly shortened due to the sampling from transient data and thus the efficiency of topology identification will be notably improved. Apart from promoting topology identification efficiency, utilizing transient data can also raise the identification performance. This is because the transient operation process of the power distribution system will produce data with large differences, that is, valid data. And with the development of phasor measurement unit (PMU) technology, the accuracy and real-time performance of recording data also make it possible to accurately and efficiently record and save these transient data [12]. In addition, the PMU also comes with its own communication function, which can timely transmit the collected effective data back to the data processing center for real-time identification of the topology of the power distribution system. The transient data not only improves the efficiency of topology identification, but also improves the success rate of topology identification to a certain extent.
Problem description and modeling
A power distribution system with generators or loads and transmission lines can be represented by a graph set with nodes and edges, where , represents a set composed of generator nodes, represents a set composed of load nodes; represents a set composed of edges, corresponding to transmission lines of the power distribution system. Because when the entire power distribution system is operating normally, whether it is always running in a steady state or with small fluctuations on the load side, the voltage phase angle difference of all nodes in the network is relatively small. In other words, the value of in the network is relatively small, thus, . Therefore, the DC model can describe the dynamics of the entire system well. The corresponding dynamic equation of each node of the power grid is as follows [13, 14]:
Where represent the inertia coefficient, damping coefficient, injected power, rotor angle, transient reactance and electromotance of the -th generator node, respectively; is the voltage amplitude of the -th node, is the relative phase angle of the -th node that compared to the reference node, is the positive active power consumption of the -th node, and is the frequency power factor of the -th node. denotes the -th entry of matrix , and the definition of matrix is as follows:
Where denotes the susceptance between node and node of the power distribution system. Transform the above formula to get the following equation:
Where . and represent the sampling time point and the maximum number of sampling respectively. is the perception matrix, and is the sparse vector containing network topology information to be estimated. It is worth noting that the element was removed in the researched problem. because it is of no value in obtaining the power grid topology and its modulus value is relatively large compared to other elements, which may affect the algorithm performance. is the Gaussian white noise caused by measurement error. The definition of is as follows:
Where , for convenience of expression, the time scale on the right side of the above equation has been omitted. The derivative of the nodal phase angle adopts first-order forward difference as an approximation, that is, . The time interval of the difference should be neither too large nor too small, otherwise a large approximation error will be introduced. How to determine the appropriate value will be discussed later. The above problem can be transformed into a recovery problem of sparse vector , which is equivalent to solving the following optimization problem:
Where is the vector containing network topology information to be estimated.
If sufficient time scale data can be obtained, the above problem can be settled by a sparse recovery method. In general, data obtained from the power distribution system comes from its steady state or transient state, as shown in Fig. 1. During steady-state operation, the data of all nodes remain basically stable, with only small fluctuations within a certain range. The difference between such data is very small, and the effective data that can be used for topology identification is also very limited. In the process of transient operation, the data generated by each node of the power distribution system is quite different, so there are more effective data that can be used for topology identification. Despite the fact that the power distribution system is under steady-state operation most of the time, occasional load fluctuations on the load side may redistribute the power flow of the power distribution system, and the power distribution system will then enter a transient process, and then gradually return to stability. Assuming that a large load fluctuation suddenly occurs at time , then the power distribution system will immediately enter a transient process, which will produce a series of data with large differences. Effective data that can be used for topology identification are abundant. After identifying and judging that the system enters the transient process during the operation of the power distribution system, the sampling frequency can be appropriately increased to obtain as much efficient data as possible, which not only reduces invalid data, but also improves the efficiency of topology identification. To a certain extent, the identification success rate can also be improved.
Dynamical process of and the valid sample points in power distribution network.
In general, because the data difference is not large in the steady state, the PMU will sample the data at a smaller frequency. Due to the large difference between the time scale data difference before and after the transient process, a threshold can be set in advance. If the difference between the two sampled data exceeds this threshold, it is considered that the system has entered a transient process, and the control center immediately sends instructions to the PMU to increase the sampling frequency. In other words, the sampling frequency is a function related to the difference between the sampled data before and after. Taking the phase angle of the node as the reference judgment value, then,
Where represents the sampling frequency of the PMU of the -th node. represent the sampling frequency of steady-state operation and transient operation, respectively. represents the difference between the sampling and measurement values of the -th node at time and time , and is the judgment threshold set for node in advance.
Improved orthogonal matching pursuit algorithm based on weighted average strategy
After collecting enough effective data of , the above optimization problem can be solved. Traditional methods include least squares method, ridge regression algorithm, etc. [15]. Although these methods can also solve the problem very well in general, they all have one thing in common. That is, the above problem is a sparse vector reconstruction problem, which is not used. For the reconstruction of sparsevectors, the greedy algorithm can effectively reduce the running times of the algorithm. In this paper, the classic orthogonal matching pursuit algorithm is adopted. Compared with the least square method and ridge regression algorithm, this algorithm has faster execution speed and smaller identification error. And because the orthogonal matching pursuit algorithm stops the calculation when it reaches the iterative output threshold. The possibility of algorithm misjudgment is greatly reduced.
Assuming that and can be measured by the PMU and stored in a specific location, then the corresponding matrix and vector can be obtained according to Eq. (4), and then can be estimated using the orthogonal matching pursuit algorithm. The specific execution steps of the orthogonal matching pursuit algorithm are shown in Table 1.
Input: matrix , vector , . Output: Estimated value of the power distribution system topology. When , repeat all the following steps: Step 1: Initialize the residual , the reconstruction signal 0, the index set , the number of iterations , and the counter 0. Step 2: Calculate the projection coefficient (inner product value) of each column of the residual and perception matrix , . Step 3: Find the largest element in and the corresponding position pos. Step 4: Update the index set , and the atomic set . Step 5: Use least squares to find the approximate solution . Step 6: Update the residual . Step 7: Determine whether the iteration meets the stopping condition or . If it is satisfied, it will stop and output , . Otherwise, go to step 1. End: Construct matrix according to estimated vectors .
In each iteration of the orthogonal matching pursuit algorithm, the column most relevant to in will be selected to calculate the corresponding elements in , until the stop condition is reached, the iteration will be stopped and will be output. It can be seen from the logic of the algorithm that for each node, the algorithm detects its connectivity with other nodes. In other words, each edge is estimated twice.
Assuming that and are two already solved column vectors, there will be a and corresponding to them. Ideally, these two elements should be exactly the same, but due to measurement noise and calculation errors, there may be differences in the results of the two calculations, that is, one element is 0 and the other is not 0. Therefore, whether this edge exists or not requires further judgment. In this paper, a weighted average strategy is designed for this concern. From the logic of the algorithm execution, the accuracy of estimation is positively correlated with the number of non-zero elements of the vector, so a vector with fewer non-zero elements has greater “decision power”.
Assuming that the obtained transient data is sufficient, for each node, groups can be obtained to estimate the vector times. The final result obtained for the -th time is denoted as (. For all and , the positive times are respectively denoted as and . For each and , the average sparsity (also known as the average degree of nodes) of the identification results for times is recorded as and respectively. So as long as there is a difference in sign between and , a weight coefficient is introduced as follows:
Where represents the number of edges between nodes and in estimations. As long as this coefficient is not less than an empirical threshold, then it can be considered that there are eventually edges between nodes and . The algorithm flow design of the weighted average strategy is shown in Table 2.
Process of improved orthogonal matching pursuit algorithm with an optimized weighted strategy
Algorithm 2: Improved orthogonal matching pursuit algorithm with an optimized weighted strategy (OMPOW)
Input: Sensing matrix , measurement vector , . Output: Estimated value of the power distribution system topology. Step 1: Get all from Algorithm 1. Step 2: When there is any , calculate according to Eq. (7), if , go to step 3, otherwise go to Step 4. Step 3: Calculate the arithmetic mean of all non-zero and , then record it as , . Step 4: 0. Step 5: , and output .
Numerical simulation and analysis
In this part, this paper verifies the proposed algorithm on MATPOWER’s IEEE standard 22-bus power distribution test system [16]. In the simulation process, the assumption that the test noise of each node data obeys a Gaussian random distribution with zero mean, that is, , , while denotes the noise intensity scaled at an 0.5% interval from 0 to 5%. Unless otherwise specified, the maximum sampling times for simulation is 18, and the default noise level is 1%. The transient reactance vector of one generator node of the IEEE standard 22-bus power distribution system is 0.2. Power and voltage data (such as and ) are obtained through MATPOWER’s power flow calculation. Since the changes of and during the entire transient process are small enough, it can be assumed that they are constants throughout the entire transient process.
The change of the topology recovery success rate at different noise intensity and sampling time intervals is shown in Fig. 2, where the topology recovery success rate is defined as the ratio between the number of edges that are correctly identified and the total number of network edges throughout the whole simulation process. It can be seen from the figure that when the measurement noise is 0, the topology recovery success rate is almost 100% under the condition of small sampling interval, but when the sampling interval is too large (such as ), the success rate of topology identification decreases slightly. Although a larger sampling interval will introduce an error, this error will be greatly reduced by the small proportionality coefficient in Eq. (4), which will eventually make the error small relative to on the right side of the equation. So, it will not have a great impact on the final identification result. On the other hand, a small will greatly amplify the error caused by measurement noise. It can be seen from the figure that when there is no noise, the smaller the sampling interval, the smaller the approximate error of the phase angle derivative function. Therefore, a very small can make the identification success rate 100%. However, as long as a very small noise (such as 0.25%) occurs, it will be amplified many times, leading to a rapid decline in the identification success rate. According to the color block diagram, the identification effect is the best when the sampling time interval is 0.3, so the following simulations all set 0.3.
Topology recovery performance vs. sample interval and noise level over 500 realizations.
The influence of the iteration stop threshold on the topology reconstructure results is also studied in this paper. The graph of the identification success rate changing with the iteration stopping threshold is shown in Fig. 3.
As can be seen from the figure, the smaller the iteration stopping threshold is, the higher the success rate of topology identification will be. However, the smaller iteration termination threshold means that the algorithm takes longer to execute, which will reduce the efficiency of topology identification and increase the calculation cost in the identification process. Therefore, under the premise of ensuring the highest topology identification success rate, selecting a maximum iteration stopping threshold can not only ensure a better identification effect, but also greatly reduce the running time of the algorithm, thereby improving the efficiency of topology identification. According to Fig. 3, the value of is between 0.02 and 0.04. Therefore, in the subsequent simulations, we set to 0.03. See Table 3 for other simulation parameters.
System partial parameter list
Simulation parameters
Values
Motor inertia coefficient
1
Motor damping coefficient
1
Motor frequency power factor
0.02
Threshold coefficient
0.5
Maximum iteration number
22
Topology recovery performance vs. iteration stopping threshold over 500 realizations.
Comparison of susceptance matrix recovery performance in one single test.
Figure 4 shows the network topology color map solved by the orthogonal matching pursuit algorithm and the solution result after adding the designed weighted average strategy. Among them, the deep blue spots indicate that there is no connection between the two nodes. The slight blue dots indicate that there is a connection between the two nodes. And the green spots indicate that there is actually an edge between the two nodes but the algorithm identification result shows that there is no connection (false positive). The slight yellow spots indicate that there is actually no edge between the two nodes but the algorithm identification result shows that there is (false negative). And the yellow spots indicate the negative element of the diagonal of the susceptance matrix. It can be seen from the figure that due to the existence of identification conflicts, the topology matrix directly output is not symmetric, which is obviously not in line with reality. Under the given simulation parameters, the orthogonal matching pursuit algorithm only has false positive results, no false negative result. And after processing with the given weighted average strategy, the false positive results are well eliminated.
The optimized weighted strategy introduced by the algorithm is designed based on the negatively correlated relationship between the identification accuracy rate and the sparsity of the column vector corresponding to each node. As shown in Fig. 5, node 2 requires more sampling times than node 1 when the same identification accuracy is achieved. This is because the sparsity of the estimated vector corresponding to node 2 is basically around 3, while the sparsity of the vector corresponding to node 1 is only around 1. By comparing Fig. 5A and B, it can be found that the new algorithm with the weighted average strategy not only reduces the number of sampling size required to achieve the same identification success rate, but also improves the success rate of topology identification under the same sampling size.
Topology recovery performance vs. sample size for each node over 500 realizations.
Topology recovery performance vs. sample size for the whole grid over 500 realizations.
Topology recovery performance vs. noise level for the whole grid over 500 realizations.
In the simulation process, the maximum sampling size and noise level have a great influence on the result of topology identification. In order to further study their influence on the identification result, a series of simulation tests are carried out in this paper. It can be seen from Fig. 6 that it takes at least 5 measurements to make it possible to restore the topology of the IEEE standard 22-bus distribution system. Under the condition that the topology restoration can be carried out, the restoration effect of the proposed optimization algorithm is better than that of the original orthogonal matching pursuit algorithm. In addition, when the noise level varies from 0 to 5%, the algorithm with weighted average strategy also has higher success rate of topology identification than the original algorithm (see Fig. 7). As mentioned in the previous section, when there is no noise at all, the identification success rate of both algorithms is relatively high. When noise is first introduced, the identification success rate suffers a relatively large drop, and the subsequent change trend is relatively slow. It is because the introduced noise signal is greatly reduced by the small coefficient before . In other words, the algorithm proposed in this paper is robust within the noise level tested. Although the noise level reaches 5%, the proposed algorithm can still guarantee the identification accuracy of more than 90%.
Conclusion
By considering the use of transient operating data in the distribution network, this paper greatly improves the speed of traditional topology reconstructure methods in power networks. In addition, an optimized weighted strategy based on sparse recovery algorithm is introduced to address the conflict issue caused by topology bidirectional identification in power system. Our proposed algorithm can well address topology identification problems in power systems providing the topology remains unchanged in the sampling process. The simulation results based on the standard IEEE standard 22-bus power distribution system in MATPOWER prove the feasibility of the algorithm. By collecting and storing transient data in the PMU, the power network topology can be identified quickly and efficiently, which is of great significance for state estimation as well as line outage detection in practical engineering.
References
1.
AlamM. et al., PMU based line outage identification using comparison of current phasor measurement technique, International Journal of Electrical Power and Energy Systems, 115 (2020), 105501–105501.
2.
SegM., Power flow control capability of the power transistor-assisted Sen transformer and the unified power flow controller: a close comparison, IET Generation, Transmission and Distribution14(15) (2020), 3033–3041.
3.
ShuY.B. and TangY., Analysis and recommendations for the adaptability of China’s power system security and stability relevant standards, CSEE Journal of Power and Energy Systems3(4) (2017), 334–339.
4.
ArianosS. et al., Power grid vulnerability: A complex network approach, Chaos: An Interdisciplinary Journal of Nonlinear Science19(1) (2009), 175.
5.
ChenZ.H. et al., Robustness of interdependent power grids and communication networks: A complex network perspective, IEEE Transactions on Circuits and Systems II: Express Briefs65(1) (2017), 115–119.
6.
LiZ., and ChenG.R., Global synchronization and asymptotic stability of complex dynamical networks, IEEE Transactions on Circuits and Systems II: Express Briefs53(1) (2006), 28–33.
7.
ZhangS.N.WuX.Q.LuJ. et al., Recovering structures of complex dynamical networks based on generalized outer synchronization, IEEE Transactions on Circuits and Systems I: Regular Papers61(11) (2014), 3216–3224.
8.
ChenL.LuJ., and ChiK.T., Synchronization: an obstacle to identification of network topology, IEEE Transactions on Circuits and Systems II: Express Briefs56(4) (2009), 310–314.
9.
LourencoE.M.CostaA.S. and ClementsK.A., Bayesian-based hypothesis testing for topology error identification in generalized state estimation, IEEE Transactions on Power Systems19(2) (2004), 1206–1215.
10.
BabakmehrM. et al., Smart-grid topology identification using sparse recovery, IEEE Transactions on Industry Applications52(5) (2016), 4375–4384.
11.
BabakmehrM. et al., Compressive sensing-based topology identification for smart grids, IEEE Transactions on Industrial Informatics12(2) (2016), 532–543.
12.
DusabimanaE. and YoonS.G., A survey on the micro-phasor measurement unit in distribution networks, Electronics9(2) (2020), 305.
13.
NishikawaT. and MotterA.E., Comparative analysis of existing models for power-grid synchronization, New Journal of Physics17(1) (2015), 015012.
14.
StottB.JardimJ. and AlsacO., DC power flow revisited, IEEE Transactions on Power Systems24(3) (2009), 1290–1300.
15.
TibshiraniR., Regression shrinkage and selection via the lasso: a retrospective, Journal of the Royal Statistical Society: Series B (Statistical Methodology)73(3) (2011), 273–282.
16.
ZimmermanR.D. et al., MATPOWER: Steady-state operations, planning, and analysis tools for power systems research and education, IEEE Transactions on Power Systems26(1) (2010), 12–19.