Abstract
In this paper, a novel method for optimal solution in long-run is proposed to solve the test sequencing problem under unreliable tests that exist widely in real applications. The fault diagnosis process is presented and reformulated as a typical Markov Decision Process model, with the uncertainty of the tests is descripted by false alarm and detection probabilities which are to be the transition probabilities of the model. Moreover, the repeated test is adopted to improve the reliability of fault diagnosis. The cost and information gain are both considered in test chosen to achieve fast diagnosis with minimum cost. An application on the launcher device of a missile is studied in detail, as well as the comparisons in diagnostic performance among the proposed solution and the ones of traditional methods. Both the simulation and results illustrate the validity and feasibility of the proposed method.
Introduction
The problem to optimize test sequencing has drawn wide attention of scholars both at home and abroad, and some effective methods have been proposed. Among them, Johnson firstly proposed the greedy algorithm based on information heuristic to optimize the test selection step by step [1]. Pattipati et al. developed a dynamic planning (DP) algorithm [2, 3], but it brings large amount of calculation. And then they combined the AND/OR graph search method with information heuristic functions [4], of which AO* algorithm is widely applied and studied [5–7] but easy to get stuck in backtracking and searching for a global optimal solution and causes a surge in iterations. The Rollout algorithm is developed [8–10] to solve the optimal sequential test problem with less complexity. A bottom-up algorithm is proposed [11] to build a decision tree which can get the global optimal solution, but this may cause combinational explosion. Recently, intelligent algorithms such as the genetic algorithm [12], discrete binary particle swarm optimization (DPSO) [13], support vector machine (SVM) [14] applied in the optimal test sequencing have been suggested to solve for complex real-world systems.
Although the test sequencing problem can be optimized in a way by the methods above theoretically, they are based on a hypothesis that the tests are always perfect. In fact, in the practical systems of engineering and marketing applications [15, 16], on account of the uncertainty caused by imprecision or interference of electromagnetic, unreliable sensors or components, abrupt environmental disturbances, changing subsystem interconnections and so on [16–19] the available measurement data may be incorrect or imprecise as the tests are unreliable. For this, Tao et al. [20] adopted Markov jump systems (MJSs) to characterize the unideal measurements for sensor failures in reliable control. Zhang et al. [21, 22] suggested balanced truncation approach to simplify the MJSs or systems of a Markovian process. Wang and Zuo [23] applied the fuzzy logic theory to solve the uncertainty information in the safety diagnosis process. Various algorithms for test sequencing with unreliable tests are discussed by Raghavan et al. [24]. The evaluation functions with unreliable tests are developed in Ref. [25] along with the diagnostic strategies maximizing the test cost and diagnostic accuracy respectively. The misdiagnosis cost is also proposed with Rollout algorithm [26] to obtain the optimal diagnostic strategy under unreliable tests. Wei et al. [27] developed a tabu search algorithm and a simulation-based evaluation technique to find high-quality solutions in sequential system diagnosis with imperfect tests. All the methods mentioned above, just simply consider the cost or misdiagnosis cost generated by unreliable tests and they are based on traditional algorithms, with large amount of computation and analysis to get the optimal solutions.
The fault diagnosis under unreliable tests is a typical sequential decision problem with uncertainty in the outcomes of all available tests. As an efficient technique for sequential decision problems in dynamic and uncertain environments, Markov Decision Process (MDP) has widely been applied to model systems with probabilistic and nondeterministic behaviors in hospital management [28], marketplace [29], defense against jamming attacks [30], mobile communications [31] and the source of uncertainty can be directly modeled as probabilistic components in the model. So in this paper, we introduce the MDP to get the optimal test sequencing in fault diagnosis with unreliable tests. Firstly, the mathematical description of test sequencing problem is presented and the uncertainty of unreliable tests is descripted as false alarm probability and detection probability. With the Markov property of classic fault diagnosis process, the diagnosis model or framework based on MDP is outlined. And the repeated application of a test is adopted to improve the reliability of fault diagnosis in the proposed model. The test cost and information gain are both considered with weighted sum method to achieve fast fault diagnosis with minimum cost. By solving the proposed model, the optimal test sequencing in long-run with high-reliability and less computation is obtained. Moreover, comparisons in diagnostic performance among different test sequencings of the proposed method and traditional ones are discussed.
Test sequencing problem description
In this work, we assume that there is only one or no faults occurring in the system during the fault diagnosis process. The test sequencing problem can be described as a quintuple of {F, p, T, C, D}, the details are as follows. F = {f0, f1, …, f
m
} (m ≥ 1) is a finite ambiguity set or group of failure sources before test in a fault diagnosis process, where f0 is the “no-fault” condition and f
i
represents that the i-th failure occurs only. p = {p (f0) , p (f1) , …, p (f
m
)} (m ≥ 1) is a set of priori probabilities of all failure sources. T = {t1, t2, …, t
n
} (n ≥ 1) is a finite set of all available tests with binary outcomes, where t
j
(1 ≤ j ≤ n) can check one or more specific failure sources. C = {c1, c2, …, c
n
} (n ≥ 1) is a set of test cost measured of considering execution time, power and some other economic factors. Each c
j
(1 ≤ j ≤ n) is one-one correspondence to the test t
j
∈ T (1 ≤ j ≤ n). D = (d
ij
) is a binary matrix of dimension m×n which represents the logic relationships between the set of failure sources F and the test set T, where d
ij
= 1 as the failure f
i
can be detected by test t
j
and 0 otherwise. Besides, this matrix can be obtained by reachability analysis using fault tree model, information flow model or multi-signal model [32].
This problem actually is to find the optimal test sequencing with minimum test cost to achieve quickly failure source isolation with the knowledge above. However, the tests are unreliable for the false alarm and missed detection caused by various interferences existed extensively in real-systems. The reliability of each test t
j
can be characterized by detection and false-alarm probability pair (P
dj
, P
fj
), where P
dj
= {t
j
fails | any of failure source monitored by t
j
has failed} and P
dj
= {t
j
fails | none of failure source monitored by t
j
has failed}. Moreover, P
dj
+ P
fj
= 1. Combined the reliability (P
dj
, P
fj
) with the correlation matrix D, we can get the correlation matrix under unreliable tests, denoted as B = (b
ij
), and is given by:
Where b ij represents the probability of test t j fails when the failure f i has occurred. By the way, the false-alarm probability or miss-detection probability can be estimated from the historical statistics data or be obtained from the reliability experiment, which will be my next study.
Since the tests are imperfect, it makes sense for applying the same test multiple times. When the results of repeated tests are inconsistent, this fault diagnosis process should go back to the previous step, which means that the next fault state is to be the previous one. This operation could forcefully improve the reliability of the diagnosis results.
Generally, the fault diagnosis problem can be descripted as a process of decreasing the ambiguity in the system fault state, namely fault inference machine. The final malfunction of a system is gradually determined through inferring with the test outcomes of each step, and the principle of inferring is expressed as
The classic model of a discrete time MDP can be descripted as a five-tuple
S is the state space, a set of states which descript the current situation of the system at each time step. A(i) is the set of available actions at state i as A is the set of all possible actions which could control the system dynamics or state transitions. K is the set of time steps where decisions need to be made. p
ij
(a) denotes the probability of state i transfers to state j of taking an action a, which is probabilistic description of the uncertainty in the results of all possible actions. r(i,a) denotes the instantaneous reward or payoff received at state i of taking action a.
This model successfully shows that the state evolution dynamics of a system is controlled by an agent choosing and executing the action a
k
at each time step k ∈ K, and decision sequencing of this procedure is called policy, written as π. To evaluate a policy, the expected cumulative sum of instantaneous rewards along a MDP trajectory, also called the value function, is developed and defined as
The final target of a MDP is to search for an optimal strategy π*, which provides the best rewards sequencing and maximizes the value function. Generally, the optimal value function can be expressed by
Based on the above analysis, we could get the fault state transitions based on MDP as Fig. 1, according to the inference machine as Equation (2) with the possible outcomes of the chosen tests and repeated tests. In Fig. 1, “t1P” represents that the outcome of test t1 is “passed” as d ij = 1, whereas “t1F” means “failed” and corresponds to d ij = 0.

Fault state transitions under unreliable test.
Consequently, the fault diagnosis model based on MDP under unreliable tests is built. Similarly, the fault state space S = { F1, F2, …, F q } (q ≥ 1) is composed of all the possible fault states that may occur in a system, where F1 ={ f0, f1, …, f m } (m ≥ 0) is the initial ambiguous fault group. Specifically, when the diagnosis object is a Line Replaceable Unit (LRU) and the failure source should be isolated to a Shop Reliable Unit (SRU), {f0, f1, … , f m } contains all the failure sources or modes in the SURs attached to the LRU.
And the action space A is equal to test set T, i.e. A = T = { t1, t2, …, t
n
} (n ≥ 1). The state transition probability of fault state F → F
jp
is closely related to the false alarm and missed detection probability pair (P
dj
, P
fj
) of test t
j
, as mentioned in section 2 of Equation (1), and it can be further described as
Particularly, when t
j
is applied for the second time subsequently, at state F
jp
or F
jf
, the deduction process is expressed as
As known, the goal of fault diagnosis is to realize fast fault detecting and isolating with minimum test cost. So we make the weighed sum of test cost C ={ c1, c2, …, c
n
} and the information gain be the instantaneous reward of executing test at each step, and they are defined as
According to Equation (4), the value function of cumulative reward in fault diagnosis process with initial state F1 can be expanded as
Then the optimal diagnostic policy can be deduced as
By solving Equations (11) and (13) with the classic algorithms, like policy iteration algorithm and value iteration algorithm [33], the optimal test sequencing under unreliable tests can be figured out. In the next section, details of this method will be demonstrated through an application case.
In this work, the optimal test sequencing for the suspension and launcher test on a certain type of missile is obtained using the proposed method. The airborne suspension ejection equipment is an important interface device that realizes the mechanical, electrical, RF and gas connections between the aircraft and missile. After analyzing the signals and test requirements of them, the possible failure sources and available tests are obtained and given in Table 1 specifically, along with the logic relationships among them, namely correlation matrix D, and the probabilities about failure occurrences, false alarm and detection probabilities of the tests. Then the fault diagnosis model based on MDP under unreliable tests could be built smoothly as expressed in Section 3.
The dependency matrix, failure occurrence rates, false alarm and detection probabilities, and test expenses of launcher ignition cable module
The dependency matrix, failure occurrence rates, false alarm and detection probabilities, and test expenses of launcher ignition cable module
1) With the matrix D in Table 1, firstly all the possible fault states of this component can be deduced following the inference process as Fig. 1, and they are enumerated in Table 2. Distinctly, the state space of this system is S ={ F1, F2, …, F24 }, and the act set is A ={ t1, t2, t3, t4, t5 }. Particularly, the terminal states of a fault diagnosis process are confirmed as the state of single failure source can be isolated with no ambiguity finally, in this work, which are F12, F13, F16, F17, F19, F21.
2) All the state transition probabilities can be determined according to Equation (6) with the information in Table 2. Because of the limited space, the probabilities of all state transitions are no longer listed one by one. For t1, there is a transition probability matrix E1 to describe all the transfer relationships and probabilities of the state transitions under it and E1 is given as
All the possible fault states of the launcher device
In section 3, the instantaneous reward of a test action has been defined by Equations (7) ∼ (9). And the rewards of each test and state transition are further obtained with the analysis and results above, which are not listed one by one here for limited space. By the way, the weight coefficient is set as α = 0.5 in this work for cost and efficiency are equally important, or other values as appropriate.
Through the above procedures, this diagnosis model is completely built, and the optimal test sequencing of diagnosing this equipment can be obtained by solving the optimal value function of Equation (11) with all the necessary parameters and values are known. Since there is a complete MDP tool box in MATLAB, the policy iteration algorithm is applied and programmed to get the solution of this problem or Equation (11). The results are shown in Figs. 2 and 3, and the diagnostic performance comparisons on different parameters and different test sequencings are also made by simulations.

The best decisions for all states.

The utility values of the best test sequencings.
In Fig. 2, the subplots of red, purple and blue represent the policies of γ = 0.9, 0.8, 0.7 respectively. Easy to see, there is no difference between test sequencings of discounts 0.9 and 0.8, and the best test sequencings of γ = 0.8, 0.7 only have different test selections on state F5 and F7. As shown in Fig. 3, the results of optimal value function Equation (12) under different initial fault states for discount 0.9, 0.8 and 0.7 are nearly the same with little difference before F11. So the best sequencing may be different for different weigh expectations on the future rewards in test selection. Moreover, the comparison results among different sequencings are given in Figs. 4–6, and γ = 0.8 in later work.

The best test actions under different test sequencings.

Fault state transitions under different test sequencings, where horizontal axis corresponds to the test or time step(k) of diagnosis process and vertical axis represents the possile fault states.

The terminal states occupancy under different test sequencings, where vertical axis is coordinate to the occupancy of all terminal states (%) and horizontal axis corresponds to the number of experiments.
In Fig. 4, the best test selections for all fault states are given under three different optimal test sequencings, which are based on MDP, greedy algorithm and a random test sequencing. And the test actions of some states may be consistent under those three different sequencings. The Monte Carlo experiments with three different test sequencings are conducted to discuss their diagnostic performances.
Basically, the state transitions of different test sequencings for one single experiment are shown in Fig. 5. It’s obvious that the MDP test sequencing and greedy test sequencing end with the terminal state F17 and F12 respectively within 2 steps, and the system may get back to the initial fault state at the 4-th step under the random sequencing as the blue polyline and cost much more to reach a terminal state.
To compare the diagnostic performance, the occupancy or visiting rate of terminal state which indicates the ability of fault isolation is introduced in this experiment. Then we have done 1600 Mote Carlo experiments for these three test sequencings respectively and calculate their average occupancy of terminal states as shown in Fig. 6. In addition, the test step of each experiment is 4 which is adequate for fault diagnosis in this case from Fig. 5. As known in Fig. 6, the occupancy of terminal states is 50% approximately for MDP test sequencing and 45% for greedy test sequencing. That means, the MDP test sequencing is better than the greedy one for about 0.25 step faster, for the random test sequencing, the occupancy rate is so tiny for successful diagnosis. All the results above indicate that the proposed method is scientific and valid to get the optimal test sequencing under unreliable tests, and the obtained solution has better performance in fault diagnosis than traditional ones. The sample size is small in this case, or the advantage could be more obvious.
This work provides a novel intelligent method to generate an optimal test sequencing for fault diagnosis, considering the extensive existence of unreliable tests in real-world applications. The fault diagnosis process is modeled based on classic MDP as a process that the ambiguous fault state transfers to one of less ambiguous and could be identified to be a terminal state at last, with the detection and false alarm probabilities of unreliable tests to be the fault state transition probabilities and two important indexes in fault diagnosis, test cost and the information gain, are adopted to achieve fast diagnosis with minimum cost. By solving this diagnosis model based on MDP under unreliable tests, the optimal test sequencing is obtained consequently. The simulation case has demonstrated the effectiveness and feasibility of the proposed method. The results of comparisons also show that the test sequencing of proposed method has better diagnostic performance than the ones of traditional methods. Besides, how to acquire the precise value of transition probability under unreliable tests is worthy of study.
