Abstract
In performance evaluation, the widely used root-mean-square error is easily affected by large error terms and is also an incomprehensive measure. Therefore, the error spectrum as a comprehensive measure was proposed for parameter estimation. However, error spectrum (ES) is a three-dimension plot (among ES, r axis and time t axis) in the whole time horizon in dynamic evaluation system, which is not intuitive and easy to be analyzed. To smooth this, a new dynamic error spectrum (NDES) is proposed in dynamic evaluation system in this paper. Firstly, the NDES is defined for EPE in dynamic systems. Secondly, the computation method is proposed to calculate the NDES. Thirdly, several nice properties of NDES are presented for dynamic system performance evaluation. Finally, the effectiveness of the proposed new dynamic error spectrum is verified by a numerical example.
Keywords
Introduction
Estimation performance evaluation (EPE) has been widely applied in estimation or filtering [1, 2], track fusion [3], and target tracking [4], etc. Usually, EPE is divided into two components: performance evaluation and peformance ranking, where performance evaluation should rely mainly on numerical information, while performance ranking depends on preference information [5]. An extensive literatures are by now available on both performance ranking [6–9], and evaluation [10–12]. Since the existing measures can only reflects one aspect of EPE, the error spectrum was presented in [13]. In EPE, ES can reveal more information because it aggregates several commonly incomprehensive measures. However, ES has some limitations and drawbacks, which attracts lots of research to improve the ES [14–16]. In particular, it is difficult to evaluate the dynamic system performance based on ES, because ES is a three dimensional (3D) plot over the total time span. Accordingly, a dynamic error spectrum (DES) based on the average value of ES was introduced in [16]. Furthermore, we find that DES combines several incomprehensive metrics into a single metric, which reflects the estimation accuracy of an estimator the same as the incomprehensive metric. Therefore, we proposed range error spectrum induced area (RESA) and DES induced area (DESA) to overcome the problem of DES [17, 18]. However, the above proposed dynamic measures (i.e., DES, RESA and DESA) need to combine all the three-dimensional plot (ES curve in dynamic systems) into a single two-dimensional plot. Therefore, it is still not easy to find the better estimator when two ES curves intersect with each other among whole time horizon. Clearly, a worthwhile problem is how to apply ES to dynamic evaluation systems.
So, a new dynamic error spectrum (NDES) is proposed to the dynamic system evaluation, which is different from the DES introduced in [16]. First, the NDES is defined according to the definition of ES. Second, the computation method is proposed to calculate the NDES. Finally, several nice properties of NDES are presented to evaluate dynamic systems.
This paper is organized as follows. In Section 1, the definition of the NDES is proposed according to the definition of ES. Then, the computational method is proposed to calculate the NDES in Section 1. Furthermore, some nice properties of the NDES are presented in Section 1. Finally, a numerical example is provided in Section 1 to illustrate the superiority of the proposed metrics. Section 1 concludes this paper.
Definition of NDES
As discussed in the introduction Section 1, Li proposed the ES to the EPE, which is a three-dimension (3D) plot when applied to dynamic evaluation systems [13, 17]. Although ES can be drawn as 3D plot without difficulties by many software, it is still difficult for it to be analyzed in dynamic evaluation systems. To alleviate this, a new dynamic error spectrum for dynamic evaluation systems is proposed in the following section. First, a new dynamic error spectrum (NDES) is defined for EPE in dynamic systems, which is still an aggregation of many incomprehensive measures. Second, the properties of the NDES are presented according to the original ES. Finally, the volume error spectrum is proposed to calculate the NDES.
Denote
Furthermore, at a time instant t, for an estimation error set
Thus, for r ∈ [- ∞ , + ∞] and t ∈ [T0, T], the NDES is defined as
For a continuous
Similarly, for a discrete
Therefore, we can summarize the definition of the NDES as follows.
Clearly, for any time interval t ∈ [T0, T], the smaller S
Unfortunately, NDES still suffers from the computation problem, because the error distribution is rarely available in dynamic system. Therefore, we will propose a volume error spectrum (VES) to approximate the above NDES in the following part.
Previously, we proposed an area error spectrum (AES), in which the area under ES curve was used [17,18, 17,18]. That is,
For r ∈ [-1, 2], we have
In this paper, we call the right expression of Equation (10) volume error spectrum (VES), which represents the estimation accuracy within the time interval
Furthermore, according to Equation (10), we can obtain the following results.
(a)When t is fixed, VES is transformed to the AES, which represents the concentration of the estimation errors.
(b)when S(r, t) >0, then VES(r, t) >0.
As stated before, we can see that VES is a real number in EPE. So, it can be applied to dynamic evaluation systems.
From Equation (8), we can see that NDES has a series of nice important properties for dynamic performance evaluation.
Clearly, the NDES at t = t0 becomes the ES, which has all the features of the ES.
So, we can see from (b)-(f) that the most useful parameter intervals of the NDES are the interval [-1, 2] of r and the interval [T0, T] of t, respectively, which are applied in this paper.
and
Furthermore, let
Similarly,
Clearly, by Equation (13) and Equation (14), we can conclude that
So, S
Hereafter, a numerical example is designed to illustrate the superiority of the NDES in performance evaluation.
The state estimation models
Suppose that a nonlinear non-Gaussion state estimation model is given by
and the state noise
Furthermore, we used two types of nonlinear estimation algorithms (NEA) [20–23], such as extended Kalman filter (EKF) [24, 25] and unscented Kalman filter (UKF) [26–28] to estimate the above nonlinear Gaussian models, where the noise distributions predictive and filtering were assumed Gaussians, and the linearization of the functions in the process and observation equations was employed in EKF. However, the UKF shows that it is easier to approximate a probability distribution than to approximate an arbitrary nonlinear function or transformation, that is, the UKF is a more suitable method for calculating the statistics of a random variable.
The initialization parameters of the EKF and UKF were summarized in Table 1.
Initialization parameters
Initialization parameters
In the following, the NDES was applied to evaluate the performance of the EKF and UKF. Let T = 50s(m = 50), after one Monte-Carlo run, we obtained a tracking result of the above two types of NEA, respectively. Furthermore, over 500 (M = 500) Monte-Carlo runs and at every time instant t, the average tracking result was utilized to show the performance of the EKF and UKF shown in Fig. 1, which shows that the measurement y matches the state x within the time interval [0, 8s] and [22s, 30s], and on the contrary, the measurement y mismatches the state x within the time interval [9s, 21s] and [31s, 50s].

The average tracking results of the EKF and UKF.
In this paper, the average tracking result was calculated as
Thus, the corresponding estimation errors at time instant t were given by
Substituting Equation (19) into the NDES, the evaluated results were shown in Fig. 2, which shows the evaluated results obtained by the NDES. Figure 3 are the vertical view, the front view and the right view of the NDES, respectively.

The NDES of the EKF and UKF.

t - r plane to the NDES.
As depicted in Fig. 2, it is hard to see which NEA performs better. However, we can see from Fig. 3 that the NDES aggregates several commonly incomprehensive measures such as the root mean square error (RMSE), average Euclidean error (AEE), geometric average error (GAE), and harmonic average error (HAE).
To illustrate the advantages of NDEs, Fig. 4 shows the evaluation results based on the above commonly incomprehensive measures, which are consistent with the results obtained by the NDES evaluation in Fig. 3.
From Figs. 3 and 4, we can see further that the UKF outperforms the EKF within the time intervals [0s, 31s] according to the AEE or the RMSE. On the contrary, the latter is better than the former according to the HAE. So, it is difficult to see which NEA performs better according to the above incomprehensive measures. Fortunately, the NDES can provide a good solution to this problem based on Equation (10). In other words, for [T0, T] = [0, 5] and r = [-1, 2], applying the VES to this case, we obtain

Incomprehensive measures of the EKF and UKF.
That is
Obviously, the calculation accuracy of the UKF is superior to the EKF.
In addition, we can also obtain the computational time of the EKF and the UKF shown in Fig. 5.

The estimation time of the EKF and UKF.
Furthermore, the average computational time is calculated as
Although the computational load of the UKF is larger than the EKF, the UKF still meets the computational requirements.
As stated before, we can conclude that the UKF has obvious advantages in calculation accuracy except the computational load usually meeting the computational requirements [29].
The main contribution of this paper is that a new dynamic error spectrum (NDES) was proposed for dynamic system performance evaluation. Firstly, the NDES was defined according to the definition of ES. Secondly, the computation method was proposed to calculate the NDES. Thirdly, several nice properties of NDES were presented for dynamic system performance evaluation. Finally, simulations show that the NDES is intuitive and reflects the estimation performance directly in dynamic evaluation systems. Furthermore, the NDES can applied the metric to dynamic systems and can be more powerful, and a quick evaluation result were given by using VES in dynamic systems. In future work, we will apply NDES to the evaluation of nonlinear estimation algorithm and dynamic system performance evaluation and ranking.
Footnotes
Acknowledgments
We would like to thank anonymous reviewers for their helpful comments and suggestions. This work was supported in part by the national natural science foundation of China through grant No. 72271243, No. 71801222 and No. 61973253, the national science foundation of shaanxi province of China through grant No. 2018JQ6019, the national postdoctoral program for innovative talents through grant No. BX201700104.
