Abstract
Reliability assessment of complex engineered systems is challenging as epistemic uncertainty and common cause failure (CCF) are inevitable. The probabilistic common cause failure (PCCF), which characterizes the simultaneous failures of multiple components with distinguished chances, is a generalized model of traditional CCF model. To accurately assess system reliability, it is of great significance to take both the effects of PCCF and the epistemic uncertainty of components’ state probabilities into account. In this paper, an evidential network model is proposed to assess system reliability with interval-valued PCCFs and epistemic uncertainty associated with components’ state probabilities. The procedures of computing the mass distribution of a component suffering from multiple PCCFs are detailed. The inference algorithm in the evidential network is, then, used to calculate the mass distribution of the entire system. The Birnbaum importance measure is also defined to identify the weak components under PCCFs and epistemic uncertainty. A safety instrumented system is exemplified to demonstrate the effectiveness of the proposed evidential network model in terms of coping with PCCFs and epistemic uncertainty. The importance results show that both the epistemic uncertainty associated with components’ state probabilities and PCCFs have impact on components’ importance.
Keywords
Introduction
Reliability modeling and assessment of complex engineering systems have become a challenging task due to many factors such as complex dependency among components in a system, impact from external environments, and epistemic uncertainty associated with parameters of components’ failure/degradation models. Various methods and tools have been investigated to analyze the reliability of complex engineering systems, including the fault tree analysis [1], the Markov chains [2], the stochastic petri nets [3], and the Bayesian networks (BN) [4]. Among them, the BN models have been widely implemented in reliability and safety analysis from a frequentist or subjectivist perspective. The BN was first developed by Boudali and Dugan [5] who demonstrated how fault trees can be implemented by using BNs. As compared to the traditional fault tree analysis, the BN models possess some attractive features in representing the functioning and malfunctioning dependency between the system and its components [6]: (i) Logic gates, such as the “AND” gate and “OR” gate, can only represent a deterministic dependency in-between nodes, whereas the conditional probability tables (CPTs) in BN could represent any arbitrary probabilistic relationship; (ii) The forward and backward inferences in BN enable dynamically updating the marginal probability distributions of all nodes, whereas the fault tree can only assess the failure of top event in a bottom-up fashion; (iii) Fault tree represents its events at the same level with the independency assumption, whereas BN releases such assumption. Dependency, such as the failure dependency among components, cascading failure can also be modeled by BN. Due to its powerful capability of probabilistic reasoning, a suite of investigations have been made to implement the BN in reliability assessment of a diversity of engineering systems [4 , 7].
Nevertheless, in BN models, the probability of a node being in a specific proposition can only be represented by a crisp value. The epistemic uncertainty, which arises from lack of knowledge and data and/or vague judgments from experts, is one of the important challenges in system reliability assessment [8, 10]. Some non-probabilistic methods can be used to represent epistemic uncertainty, such as the evidence theory [9], the fuzzy theory [10, 11], the interval theory [12]. The evidence theory has received considerable attentions in the field of uncertainty reasoning [9], decision making analysis [13], information fusion [14] and so forth. Some attempts have been carried out to incorporate the evidence theory into BN models. Simon et al. [15] first analyzed the reliability of complex engineering systems with epistemic uncertainty associated with components’ state probabilities by using of BN and evidence theory. Zhao et al. [16] investigated the BN model combined the evidential inference algorithm to assess the reliability of power distribution system. Simon and Webber [17] extended the CPT under the evidence theory, namely the conditional belief mass table (CBMT), when the mass functions were conditioned on evidential variables. The evidential network (EN) model was developed by them to infer system reliability bounds. From the fault tree perspective, Yang et al. [18] developed an EN model to manipulate the epistemic uncertainty associated with failure probabilities of basic events. The logic gates, such as “AND” gate, “OR” gate, and “NOR” gate, with imprecise knowledge were converted to the EN model. The CBMTs introduced by Simon and Webber [17] were used to quantify the dependency between the outputs and inputs of the logic gates under epistemic uncertainty. As the EN model is capable of reasoning epistemic uncertainty for complex systems as reported in [17, 18], it is used in the present work to quantify the epistemic uncertainty associated with the parameters of components’ reliability models.
It is worth noting that the foregoing studies on system reliability assessment by EN models were based on a strong premise that nodes in the same level are statistically independent. In general, due to physical interaction and external environments/shocks, components or subsystems in a system may fail simultaneously. Such phenomena is termed as common cause failure (CCF) in the reliability community [19]. As EN model inherits the capability of modeling dependency in the BN model [17], some attempts have been made to incorporate CCF into the EN models. Mi et al. [20] investigated an EN model embedded with the CCF to calculate system reliability. As compared to the CBMT defined in [17], the conditional belief/ plausibility tables were developed to calculate the lower and upper bounds of system reliability. More recently, Mi et al. [21] extended their previous work into multi-state systems, some new nodes were added into the EN model to represent the CCFs. Zuo et al. [23] utilized the CBMT defined by Simon and Webber [17] to infer the mass distributions of child nodes and developed another type of evidential network model embedded with the CCF to assess system reliability. However, it bears noting that the above CCF models all assumed that the occurrence of common cause events will definitely cause the failure of all the affected components. Such an assumption may not be necessarily true in practical cases. Put another way, there are many scenarios in which a common cause event causes the failure of affected components with distinct chances, with the guaranteed failure being just a special case. Such phenomena of probabilistic dependency among components are often called the probabilistic common cause failure (PCCF) [22]. It worth mentioning that reliability assessment of systems under both epistemic uncertainty and PCCFs has not been reported to date. In this paper, the probability that a common cause event may result in the failure of a specific component also contains epistemic uncertainty and is represented as an interval value. The mass distribution of a component suffering from multiple interval-valued PCCFs is, then, calculated by the EN model. The CBMTs defined in Ref. [17] is utilized to facilitate the mass inference in EN model. Therefore, the system reliability can be computed by applying the belief and plausibility functions into the leaf node of the EN model. Moreover, to quantitatively measure the contribution of a component’s failure to a system failure, the Birnbaum importance measure is extended under epistemic uncertainty and PCCFs. A safety instrumented system is exemplified to validate the proposed EN model and the extended Birnbaum importance measure.
The remainder of this paper is rolled out as follows. In Section 2, the fundamentals of the evidence theory are briefly reviewed. The procedures of computing the mass distribution of a component suffering from interval-valued PCCFs are provided in Section 3. The Birnbaum importance measure is extended under the interval-valued PCCFs and epistemic uncertainty in Section 4. The proposed method is implemented to a safety instrumented system in Section 5. Some conclusions are drawn in Section 6.
Fundamentals of the evidence theory
The evidence theory, also known as Dempster-Shafer evidence theory, was initialized by Dempster [24] and developed by Shafer [25] to characterize the epistemic uncertainty. It has widespread applications in the fields of information fusion [14], fault diagnosis [13] and reliability analysis [9 , 27]. The fundamentals of the evidence theory will be briefly reviewed here.
Definition of the frame of discernment (FoD)
In the framework of evidence theory, the frame of discernment (FoD) Ω
X
is defined as an exhaustive set which contains all the propositions of a random variable X, and it can be expressed as:
In the evidence theory, an independent probability-like measurement can be assigned to the power set 2
ΩX
. This probability-like description is called the basic belief assignment (BBA) and can be represented by a mass function m : 2
ΩX
→ [0, 1], mapping from 2
ΩX
to the unit interval [0,1]. Similar to the probability measure, the mass function should satisfy three following axioms:
The mass function m (A) represents the quantity that the truth lies in the proposition A. If m (A) ≠0 satisfies, A is called a focal set. Note that if all the focal sets are singletons, the mass function, then, degenerates to the Bayesian mass.
Calculation of the belief and plausibility functions
Unlike to the probability theory which assigns a crisp value to a proposition B, the evidence theory employs an interval measure [Bel(B), Pl(B)] to quantitatively characterize the belief degree of proposition B. The lower bound, called the belief function, is defined as the total amount of the masses that support proposition B:
The upper bound, namely the plausibility function, is defined as the sum of all the masses of propositions that agree with proposition B, either partially or totally:
By introducing
The epistemic uncertainty associated with the probability of proposition B can be represented by an interval of [Prmin {B} , Prmax {B}], where Prmin {B} and Prmax {B} denote the minimum and maximum probabilities of proposition B, respectively. In the evidence theory, the epistemic uncertainty associated with the probability of proposition B can be quantified by the belief and plausibility functions, and one has Bel (B) ≤ Pr {B} ≤ Pl (B). Hence, the relation between the evidence theory and the interval theory can be readily formulated as:
In this section, due to the lack of knowledge, the probabilistic common cause failure (PCCF) is defined as an interval value, the evidential network with interval-valued PCCF is then constructed to assess system reliability.
Interval-valued probabilistic common cause failure
In the conventional reliability assessment of systems with CCF, it is assumed that the occurrence of a common cause event will result in the guaranteed failure of all the affected components. However, there are many real-world cases in which a common cause event causes the failure of affected components with different chances. The probabilistic dependency among components caused by a common cause event is termed as the probabilistic common cause failure (PCCF) which are to be studied in this paper. However, due to the lack of knowledge and insufficient information, the failure probability of components caused by the common cause events inevitably contains epistemic uncertainty, and it is represented as an interval value.
In addition to the above definitions of PCCF, some general assumptions are itemized as follows:
(1) A system is subject to different common cause events, i.e., CC 1,CC 2,...,CC M and each common cause event may result in different components’ failure. The probability of the occurrence of a CCF CC i (i = 1, 2, . . . , M) can be represented as Pr {CC i }. Several parametric methods can be used to model, such as the α -factor model, β -factor model, and the Multiple Greek letters (MGL) model. In this study, the β -factor model [29] is implemented to quantitatively model the probability of the occurrence of CCFs, and β i represents the probability of the occurrence of the i th CCF, i.e., Pr {CC i } = β i .
(2) Once a common cause event has occurred, it may result in the failure of some components in a probabilistic manner. The affected components by the i th CCF can be represented as a probabilistic common cause group PCCG i . Specifically, PCCG i,l represents component C l is probabilistically affected by CC i . This allows one component belonging to more than one PCCG.
(3) The probability of component C
l
affected by the i th CCF is represented as an interval value [Pr min {PCCG
i,l
} , Pr max {PCCG
i,l
}]. Hence, the mass that CC
i
(i = 1, 2, . . . , M) causes the failure of component C
l
(C
l
∈ PCCG
i
) is:
A mass has to assigned to the FoD of PCCG i,l to represent the quantity of ignorance that has caused, or not, the failure of component C l :
It is obvious that
The evidential network (EN), which was introduced by Simon [17], is a graphic representation of the mass relationships. Similar to the Bayesian networks, a general EN consists of N nodes, {X
1, X
2, . . . , X
n
}, and Z directed arcs between two nodes. Fig.1 illustrates an evidential network with 5 nodes and 4 directed arcs. Each nodes X
i
(i = 1, 2, 3, 4, 5) represents a random variable which manifests all the propositions of X
i
and all its subsets. If a directed arc from X
j
to X
i
exists, X
i
called a “parent” of X
i
, denoted as

An illustration of evidential network.
Without loss of generality, for component C l subject to multiple probabilistic common cause events, PCCG1, PCCG2,..., the contributions to the failure of component C l can be divided into i + 1 parts, i.e., the independent part and the dependent part j (1 ≤ j ≤ i) caused by PCCG j . The failure of component C l can be constructed as an “OR” gate, as shown in Fig. 2, and the corresponding evidential network of the “OR” gate can be constructed as shown in Fig. 3.

The fault tree of component C l with multiple PCCGs.
The steps of computing the mass distribution of nodeC l are as follows:
Let Pr {CCG
k
} (1 ≤ k ≤ 2
i
) denotes the probability of the occurrence of CCG
k
and

The corresponding evidential network of component C l .
where R
l
and R
l,ind
denote the functioning state with and without PCCFs of component C
l
, respectively. “Part 1” in Eq.(12) is the independent mass that component C
l
is functioning, whereas “Part 2” is the probability that some common cause events in the k th common cause group (1 ≤ k ≤ 2
i
) do not occur. “Part 3” is the mass that the occurred common cause events cannot result in the failure of component C
l
. The mass that component C
l
is in failure state can be separated into three items. “Item 1” is the mass that component C
l
fails independently:
“Item 3” is the mass that the state of component C l is unknown but at least one of the occurred common cause events results in the failure of component C l :
where [R
l,ind
, F
l,ind
] denotes the unknown state without PCCFs. “Part 1” in Eqs.(14) and (15) is the mass that at least one of the occurred common cause events result in the failure of component C
l
. Note that “Part 1” in Eqs.(14) and (15) can be computed by the Inclusive-Exclusive principle shown as follows:
Therefore, the total mass that component C
l
fails can be computed by:
The mass that component C l is in the unknown state (functioning or failed) can also be divided into three items. “Item 1” is the mass that state of component C l is unknown meanwhile the PCCGs do not affect component C l :
where [R l , F l ] denotes the unknown state with PCCFs. “Item 2” is the mass that the state of component C l is unknown meanwhile one does not know whether the PCCGs affect component C l or not. For a specific CCG k (1 ≤ k ≤ 2 i ), the mass that one does not know whether the PCCGs affect component C l or not can be computed by one minus the mass that the PCCGs affect component C l and the mass the PCCGs do not affect component C l , and it results in:
“Item 3” is the mass that component C l is functioning but one does not know whether the PCCGs affect component C l or not, and it corresponds to:
Therefore, the total mass that the state of component C
l
is unknown can be computed by:
After the mass distributions of all root nodes were calculated by the above Steps 1 and 2, the evidential inference algorithm, i.e., Eq.(11) can be used to infer the mass distributions of all child nodes. Therefore, the mass distribution of the leaf node, i.e., the system node, can be obtained. Finally, by applying the belief and plausibility functions to the system node, the lower and upper bounds of system reliability can be assessed.
The importance measures focus on investigating how a specific component reliability affects the reliability of a system. The importance ranking results are significant in identifying the weak components so as to provide useful guideline for maintenance and reliability improvement. A suite of importance measures has been proposed from various perspectives on system reliability/failure, such as the Birnbaum importance measure, the Fussell-Vesely importance measure (FV), the risk reduction worth (RRW), and the risk achievement worth (RAW). These importance measures are, however, developed under the conventional probabilistic framework, and they are incapable of tackling the epistemic uncertainty and PCCFs which are simultaneously studied in this work.
In this work, the well-known probabilistic Birnbaum importance measure is extended to the case where both the epistemic uncertainty and interval-valued PCCFs exist. The Birnbaum importance measure quantifies the most critical component to the system reliability, and it is defined as:
To avoid the interval explosion problem of computing Eq.(24), the affined arithmetic (AA) introduced in [30] is implemented to calculate the extended Birnbaum importance measure, and it results in:
A safety instrumented system (SIS), as shown in Fig. 4, is exemplified to examine the effectiveness of the proposed evidential network for complex engineering systems. The SIS is used to evaluate the safety target level for the vessels. In this study, we aim at evaluating the reliability of the SIS when both the epistemic uncertainty associated with parameters of components’ reliability models and the PCCFs exist. The fault tree of the SIS can be found in Ref. [31]. The evidential network can be constructed from the fault tree of the SIS by transforming all the basic events into roots nodes and all the intermediate events into child nodes. The DAG of the evidential network for the SIS is shown in Fig. 5. In Fig. 5, nodes X1X14 are the root nodes, and each logic gate in the fault tree is converted into a child node, i.e., the nodes E1, E3 E14, and D1 D10. The mass distribution of the SIS can be obtained by calculating the masses of node D10. Therefore, the reliability bounds of the SIS can be computed by applying the belief and plausibility functions into the mass distribution of node D10.

The configuration of the safety instrumented system [31].

The evidential network of the SIS without PCCFs [31].
It should be noted that even though the reliability assessment of this SIS under fuzzy set theory has been studied in [31], the common cause failure, which related to the simultaneous failure of components due to the physical interactions and some external shocks, inevitably exist in the SIS. Because the SIS consists of many identical components which are more likely to suffer from the common cause failure (CCF) [21]. Moreover, the occurrence of these CCFs does not necessarily result in the guaranteed failure of components/subsystems in the SIS, but may cause the affected components to fail with interval-valued probability (bounded in [0,1]). In this example, the following interval-valued PCCFs are considered:
(1) The pressure transmitters (PTs) X1, X2 and the flow transmitters (FTs) X4, X5, and X6, can fail simultaneously due to external shocks, i.e., PCCG1 = {X1, X2, X4, X5, X6} . Based on the expert experience, the occurrence probability of this CCF is β 1 = 0.01, and this CCF results in the failure of the PTs, i.e., X1, X2, with an interval-valued probability [0.6, 0.7], as well as the failure of the FTs, i.e., X4, X5, and X6, with an interval-valued probability [0.7, 0.8].
(2) The temperature switches (TSs) X7, X8, and the level switches (LVs) X13, X14, will fail at the same time due to external shocks, i.e.,PCCG2 = {X7, X8, X13, X14} . Based on the expert experience, the occurrence of the CCF β 2 = 0.01 . The occurrence of the CCF results in the failure of both the TSs and LVs with an interval-valued probability [0.7,0.8].
To compute the mass distribution of the SIS, the mass distributions of all the components should be identified first. In this example, as the PTs and FTs are mechanical components, these components are assumed to comply with the Weibull distribution, whereas the remaining components are assumed to obey the exponential distribution as they are electronic components. The model parameter settings for all the components are listed in Table 1 based on the historical data [32].
The failure parameters of the components in the oil system (Time unit: hours)
For the components complying with the exponential distribution, the mass distribution can be calculated by:
where
where

The evidential network of the SIS with PCCFs.
Firstly, without considering the PCCFs, the reliability of the SIS can be assessed by the EN model shown in Fig. 7. The interval-valued reliability results indicate that the epistemic uncertainties associated with the model parameters of components are propagated to the entire system through the EN model.
By taking account of the interval-valued PCCFs in the SIS, another two nodes PCCF1 and PCCF2 are additionally added to the evidential network as shown in Fig. 6, these two nodes manifest the influence of the occurrence of PCCFs on the components, and its mass distribution can be calculated by Eqs.(8) (10). In this example, by setting:
m (PCCG1,X1) = m (PCCG1,X2) =0.6β
1 = 0.006; m (PCCG1,X4) = m (PCCG1,X5) = m (PCCG1,X6) = 0.7β
1 = 0.007;
for node PCCF2. The mass distributions of all the components suffering from interval-valued PCCFs can be computed by Eqs. (12)–(22). The reliability bounds of the SIS with PCCFs can be, then, calculated by the proposed evidential network model, and the result is shown in Fig. 7. As observed in Fig. 7, the interval-valued PCCFs between components lead to a decrement of the reliability bounds as compared with the results without taking account of interval-valued PCCFs. Hence, the interval-valued PCCFs should be properly addressed when conducting the reliability assessment of such complex engineering systems.

The reliability bounds of the SIS with/without PCCFs.
Furthermore, the importance of all the components in SIS can be calculated by Eq. (24). The results are depicted in Fig. 8. As shown in Fig. 8, the Birnbaum importance of each component extends to an interval value. These interval values provide an answer to the influence of epistemic uncertainty associated with model parameters on the importance measures.

The Birnbaum importance of all the components in SIS.
Moreover, the importance of the logic solver starts from a high value at time t=0 and end up with 0 as time goes to infinite. Because the failure of the logic solver at the beginning will directly incur the failure of the entire system. In the same manner, only the logic solver recovering from a worse state to a better state can’t make system work again if the system is very aged. Hence, the importance of the logic solver starts from a high value at the beginning and approaches to zero when time goes to infinite. However, as the failure of other components cannot lead to the failure of the SIS at the beginning, while the recovering of these components cannot make the system work again at a very aged time, hence, the importance of these components begins with zero and ends up with zero. It bears noting that the most critical component to system reliability/failure over time can be identified, i.e., the logic solver. This ranking result is also consistent with the result in Ref. [31] under the fuzzy theory. It indicates that a small variation of the reliability of the logic solver causes a relatively greater change to the reliability of the SIS. This result drives our attentions to the reliability improvement of the logic solver.
The reliability assessment of complex engineering systems is critical as reliability-related decisions can be effectively made. The epistemic uncertainty associated with model parameters and the PCCFs are two important issues in system reliability assessment. In this paper, by taking both the epistemic uncertainty associated with model parameters and the interval-valued PCCFs into account, the reliability assessment based on the EN model was conducted. The epistemic uncertainties associated with the parameters of components’ failure models were quantified by interval values. The mass distribution of each component was then computed under the evidence theory. Then, the evidential inference algorithm was implemented to calculate the mass distribution of the system node. Finally, two functions, namely the belief and plausibility functions, were utilized to compute the lower and upper bounds of system reliability. Furthermore, the Birnbaum importance measure was defined under epistemic uncertainty and PCCFs. A safety instrumented system was exemplified to demonstrate the effectiveness of our proposed evidential network model. The results showed that the epistemic uncertainty of model parameters and the interval-valued PCCFs have impact on the system reliability analysis. Moreover, the most critical components of the SIS were identified by our proposed importance measure. The importance results can further provide guideline to the reliability-related decisions, such as reliability improvement and maintenance planning.
Footnotes
Acknowledgements
The authors greatly acknowledge grant support from the Science Challenge Project under contract number TZ2018007, the National Natural Science Foundation of China under contract numbers 71771039 and 61877009, and the Sichuan Provincial Science and Technology Plan Project under contract number 2018GZ0396.
