Medical Reasoning with Rough-Set Influence Diagrams

Abstract

There are several advantages to evaluating a problem using influence diagram operations. The analyst can use a representation that is natural to the decision maker, since the algorithm executes all of the inference and analysis automatically. The influence diagram solution procedure can also result in significant gains in efficiency. Conditional independence is clearly exhibited in the diagram, so the size of intermediate calculations can be reduced, resulting in considerable reductions in both processing time and memory requirements. However, when imprecise knowledge from data sets is involved in the systems, how to reason from approximate information becomes a main issue in evaluating influence diagrams effectively. This study develops an alternative knowledge model, rough-set influence diagrams (RSID), which combine rough-set decision rules and graphical structures of influence diagrams in medical settings. The proposed RSID provides a comprehensive schema for knowledge representation and decision support.

1. Introduction

There are several important capabilities of a medical reasoning system: diagnosis, prediction, treatment planning, etc. (Kao, 2008; Kao and Li, 2005; Kononenko, 2001; Leibovici et al., 2000; Li and Kao, 2005; Long, 2001). Diagnosis is the process of reconstructing past facts from observed evidence. Prediction is the process of projecting the evidence from hypotheses. Treatment planning is reasoning about the costs and effectiveness of treatments on patients. Usually, medical practice requires various kinds of reasoning to be carried out simultaneously. Hence, the capability for multiple reasoning tasks is critical to the performance of medical decision support systems. Moreover, medical expert systems become more complex when considering the mechanisms of the human body and their mutual interactions with environmental factors.

In the field of medical informatics, the graphical decision models such as Bayesian networks and influence diagrams have been widely used (Jensen, 2001; Kao, 2008; Kao and Li, 2005; Li and Kao, 2005; López-Díaz and Rodríguez-Muñiz, 2007; Neapolian, 2004; Nease and Owens, 1997; Oliver and Smith, 1990; Owens et al., 1997; Pearl, 1997). Influence diagrams are originally proposed as a compact representation of decision trees for symmetric decision scenarios, and are now regarded more as an extension of Bayesian networks (Pearl, 1997). Influence diagrams are a graphical technique for a decision problem under uncertainty (Howard and Matheson, 1981; Nease and Owens, 1997; Tatman and Shachter, 1990), and useful as knowledge representation and decision models (Jensen, 2001; Nease and Owens, 1997; Oliver and Smith, 1990; Shachter, 1986; Tatman and Shachter, 1990). Various methods have been developed for evaluating influence diagrams. In previous studies, the main numerical models of the influence diagrams used to be probability distributions (Howard and Matheson, 1981; Shachter, 1986).

There are several advantages in evaluating a problem using influence diagram operations. The analyst can use the representation that is natural to the decision maker and the algorithm executes the inference and analysis intelligently. The influence diagram solution procedure can also result in significant gains in efficiency. Conditional independence is clearly exhibited in the diagram, so the size of intermediate calculations can be reduced, resulting in considerable reductions in both processing time and memory requirements.

However, when imprecise knowledge from data sets is involved in the systems, how to reason from approximate information becomes a main issue in evaluating influence diagrams effectively. This study proposes an alternative numerical model for the knowledge in influence diagrams with rough sets. Rough set theory was first introduced by Pawlak (1982) as a tool for dealing with risk and impreciseness in decision making. The probabilistic approaches have been previously applied to rough set theory (Pawlak, 2002, 2003; Pawlak et al., 1988; Yao, 2003). Based on an approach recently proposed by Kao (2008), this study intends to propose an alternative numerical model for influence diagrams, which extends influence diagrams into rough-set influence diagrams. In rough-set influence diagrams, each chance node and value node is associated with the dependency, which expresses the uncertain features of the node. This study also develops rough-set-based influence diagrams that combine rough-set decision rules with the graphical structure of the influence diagrams found in medical settings.

The rest of this study is organized as follows. Section 2 illustrates the notations and the framework of influence diagrams. Section 3 describes the concept of rough sets and decision rules. Section 4 introduces how rough-set theory establishes the numerical model and decision support in influence diagrams. Section 5 illustrates the case studies of metastatic cancer and urinary tract infection. Section 6 gives the discussions and concluding remarks.

2. Influence Diagrams

Influence diagrams were originally introduced by Howard and Matheson (1981) as a compact representation of decision models. They are also thought of as an extension of Bayesian networks (Jensen, 2001; Pearl, 1997). An influence diagram is a directed acyclic graph (DAG) with three types of nodes: decision, chance, and value. A decision node, drawn as a rectangle, represents choices available to the decision makers. A chance node, drawn as a circle, represents random variables or uncertain quantities. A value node or utility node, shown as a diamond, represents the utility or the objective to be maximized in terms of expectation.

An influence diagram (ID) can be defined as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}{ \rm ID} = ( V , L , P ) \tag{1}\end{align*} \end{document}

where V denotes the set of nodes, L denotes the set of directed arcs, and P represents the numerical model. The composition of the node set V can be expressed as (2) \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}V = V_{D} \cup V_{R} \cup V_{U} \tag{2}\end{align*} \end{document}

where V_D denotes the decision node sets, V_R represents the sets of chance nodes, and V_U denotes the value nodes to be optimized. This study uses the uppercase letters to represent the nodes and lowercase letters for the value of a variable.

There are two types of directed arcs: conditional arcs (into chance and value nodes) and informational arcs (into decision nodes). The meanings of the directed arcs depend on their destinations. Arcs pointing to value nodes represent approximate or functional dependence. Arcs into decision nodes imply time precedence and are informational; that is, they show which variables will be known to the decision makers before the decision is made. Conventionally, the knowledge of chance nodes is expressed with probability distributions while the outcomes of the value node are projected with a value table.

An influence diagram satisfies the following conditions:

(i) The directed graph has no cycles.

(ii) The value node has no successors.

(iii) There is a directed path that contains all of the decision nodes.

An example of influence diagrams for metastatic cancer and treatment is modified from Cooper (1984) as shown in Figure 1, where V = V_D ∪ V_R ∪ V_U, V_D = {T}, V_R = {B, C, I, M}, and V_U = {Q}.

FIG. 1.

An example of influence diagrams for metastatic cancer and treatment.

Metastatic cancer is a possible cause of a brain tumor and is also an explanation for increased total serum calcium. In Figure 1, there is one decision node T (treatment), which has two alternatives to take: yes or no. Four chance variables are relevant to the biological test and treatment problems: B (brain tumor), C (coma), I (increased total serum calcium), and M (metastatic cancer). The utility function Q (quality-adjusted life expectancy) is to be maximized. The meaning and states of the nodes are summarized in Table 1.

Table 1.

The Descriptions and States of the Nodes in Figure 1

Node	Description	State
B	Brain tumor	B ∈ {0, 1}, 0: absent, 1: present
C	Coma	C ∈ {0, 1}, 0: absent, 1: present
I	Increased total serum calcium	I ∈ {0, 1}, 0: absent, 1: present
M	Metastatic cancer	M ∈ {0, 1}, 0: absent, 1: present
T	Treatment	T ∈ {Yes, No}
Q	Quality-adjusted life expectancy	[0, 150]

In Figure 1, the states of I and B determine the state of C. The states of I and B are conditioned on the manifestation of M. The outcome of M and the decision on T will determine the value of Q, where M provides the information prior to decision making. Usually, the causal relationships and decision rules in influence diagrams are expressed with probability distributions and a value table.

In medical informatics, influence diagrams are widely used knowledge representation and decision models under uncertainty. However, there are some limitations of utilizing the above approaches for solving medical reasoning problems:

(i) All associated probabilities are assumed to be non-fuzzy.

(ii) It is difficult to consider the constraints for the relationships among the nodes in Bayesian networks or influence diagrams in maximizing the utility functions.

(iii) Treatment decisions and diagnostic problems are not considered in one paradigm.

(iv) When imprecise knowledge is involved in medical systems, how to reason from the approximate information efficiently becomes a main issue in evaluating influence diagrams.

Hence, this study proposes an alternative approach for modeling the causal relationships in influence diagrams with rough sets.

3. Rough Sets

3.1. Information system

Rough set theory starts with information represented by a table called an information system (Pawlak, 1991). An information system is a 4-tuple S = (U, A, V_a, f_a), where:

(i) U is the universe, a nonempty finite set of objects.

(ii) \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$A = \{ a_{1} , a_{2} , \ldots , a_{m} \} $$ \end{document} is a nonempty finite set of attributes C ∪ D, where C and D is a finite set of condition and decision attributes, respectively.

(iii) V_a is a domain of the attribute a, each attribute a : U → V_a for a ∈ A.

(iv) f_a : U × A → V_a is the total decision function called the information function such that f (x, a) ∈ V_a for \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$\forall$$ \end{document} a ∈ A, \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$\forall$$ \end{document} x ∈ U.

3.2. Indiscernibility relation

Let S = (U, A, V_a, f_a) be an information system, B ⊆ A and X ⊆ U. With any subset of attributes B ⊆ A, a binary indiscernibility relation, is called B-indiscernibility relation, which is defined by: \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}IND ( B ) = \{ ( x , y ) \in U \times U: a ( x ) = a ( y ) , \forall a \in B \} \tag{3}\end{align*} \end{document}

For any subset X ⊆ U, the lower and upper approximation can be expressed as (4) and (5), respectively: \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\underline{B ( x ) } = \bigcup_{x \in U} \{ B ( x ) : B ( x ) \subseteq X \} \tag{4}\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} {\overline{B ( x )}} = \bigcup_{x \in U} \{ B ( x ) : B ( x ) \cap X \neq \O \} \tag{5}\end{align*} \end{document}

That is, the elements of \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$\underline{B ( x ) }$$ \end{document} are all the elementary objects certainly belonging to X. The elements of \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$\overline{B ( x ) }$$ \end{document} are at least one object belonging to X. With the lower and upper approximation of a set X ⊆ U, the universe can be divided into three regions, the boundary region BND(x), the positive region POS(x), and the negative region NEG(x): \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}BND ( x ) = \overline{B ( x ) } - \underline{B ( x ) } \tag{6}\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}POS ( x ) = \underline{B ( x ) } \tag{7}\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}NEG ( x ) = U - \overline {B ( x ) } \tag{8}\end{align*} \end{document}

If the boundary region of X is an empty set, BND(x) = Ø, then X is a crisp set with respect to N; otherwise, if BND(x) ≠ Ø, then X is a rough (approximate) set with respect to N.

3.3. Decision rules

Let |C| denote the set of all objects from U that have the meaning of C in S where |*| indicates the cardinality of a (finite) set.

If a decision rule in S is C → D, meaning IF C THEN D, where \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$C = \{ c_{1} , c_{2} , \ldots , c_{n} \} $$ \end{document} is the condition attribute, and \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$D = \{ d_{1} , d_{2} , \ldots , d_{m} \} $$ \end{document} is the decision attribute of the decision rule, respectively, then the support of the decision rule C → D in S will be supp(C, D) = card(C ∩ D).

For any C in S its probability is defined by σ(C) = p(|C|). With the decision rule C → D in S the conditional probability is σ (D | C) = p(|D| | |C|).

The strength of the decision rule C → D in S, denoted by σ(C, D), is defined as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\sigma ( C , D ) = \frac { supp ( C , D ) } { card ( U ) } \tag { 9 } \end{align*} \end{document}

The certainty factor of the decision rule C → D in S, denoted by cer(C, D), is defined as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}cer ( C , D ) = \sigma ( D\, \mid\, C ) = \frac { supp ( C , D ) } { card ( C ) } = \frac { \sigma ( C , D ) } { \sigma ( C ) } \tag { 10 } \end{align*} \end{document}

The certainty factor is interpreted as a conditional probability that y belongs to D given y belongs to C. It is easy to see that cer(C, D) ∈ [0, 1]. If cer(C, D) = 1, then the given decision rule is a deterministic or certain decision rule in S. Otherwise, 0 < cer(C, D) <1, the given decision rule is a nondeterministic or uncertain decision rule in S.

The coverage factor of the decision rule C → D in S, denoted by cov(C, D), is defined as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}cov ( C , D ) = \sigma ( C \,\mid \,D ) = \frac { supp ( C , D ) } { card ( D ) } = \frac { \sigma ( C , D ) } { \sigma ( D ) } \tag { 11 } \end{align*} \end{document}

3.4. Probabilistic properties

Let C → D be the decision rule in S, then the following properties (12)–(17) are valid (Pawlak, 2002, 2003; Yao, 2003). Note that (14) and (15) are referring to total probability theorem, (16) and (17) are referring to Bayes' theorem \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\sum_{C^{ \prime} \in C ( x ) }cer ( C^{ \prime} , D ) = 1 \tag{12}\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\sum_{D^{ \prime} \in D ( x ) }cov ( C , D^{ \prime} ) = 1 \tag{13}\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\sigma ( C ) = \sum_{D^{ \prime} \in D ( x ) }cov ( C , D^{ \prime} ) \cdot \sigma ( D^{ \prime} ) = \sum_{D^{ \prime} \in D ( x ) } \sigma ( C , D^{ \prime} ) \tag{14}\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\sigma ( D ) = \sum_{C^{ \prime} \in C ( x ) }cer ( C^{ \prime} , D ) \cdot \sigma ( C^{ \prime} ) = \sum_{C^{ \prime} \in C ( x ) } \sigma ( C^{ \prime} , D ) \tag{15}\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}cer ( C , D ) = \frac { \sigma ( C , D ) } { \sigma ( C ) } = \frac { \sigma ( C , D ) } { \sum_ { D^ { \prime } \in D ( x ) } \sigma ( C , D^ { \prime } ) } = \frac { cov ( C , D ) \cdot \sigma ( D ) } { \sigma ( C ) } \tag { 16 } \end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}cov ( C , D ) = \frac { \sigma ( C , D ) } { \sigma ( D ) } = \frac { \sigma ( C , D ) } { \sum_ { C^ { \prime } \in C ( x ) } \sigma ( C^ { \prime } , D ) } = \frac { cer ( C , D ) \cdot \sigma ( C ) } { \sigma ( D ) } \tag { 17 } \end{align*} \end{document}

4. ROUGH-SET INFLUENCE DIAGRAMS (RSID)

Most studies on influence diagrams (Howard and Matheson, 1981; Nease and Owens, 1997; Pearl, 1997; Shachter, 1986) used to describe the dependency and its associated numerical model with probability distributions. However, when impreciseness is involved in the domain, the decision makers may need more flexible uncertainty measures for analysis. Rough set theory can be an alternative measure when faced with such a problem.

Given the influence diagrams structure and the original data sets, rough set theory provides a basis for extracting the knowledge and expressing the dependency among nodes in the influence diagrams. To represent the ontology, we define that a rough-set influence diagram is a directed acyclic graph RSID = (U, A, f ), where:

(i) U is the universe, a nonempty finite set of objects.

(ii) A is a set of nodes where A ≡ V_D ∪ V_R ∪ V_U ≡ V_D ∪ (C ∪ D) ∪ V_U.

(iii) f is the flow function representing the strength, certainty factor, and coverage factor of the decision rule.

With every branch of (x_i, x_i₊₁) there is a directed arc from node x_i to x_i₊₁ as shown in Figure 2. The certainty, coverage, and strength of a branch (x_i, x_i₊₁) are defined as cer(x_i, x_i₊₁), cov(x_i, x_i₊₁), and σ(x_i, x_i₊₁), respectively.

FIG. 2.

A directed arc from node x_i to x_i₊₁.

A directed path from x₁ to x_n for x₁, x_n ∈ A denoted [x₁, x_n], is a sequence of nodes \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$x_{1} , x_{2} , \ldots , x_{i} , \ldots , x_{n} , 1 \leq i \leq n$$ \end{document} , as shown in Figure 3.

FIG. 3.

A directed path [x₁, x_n] from node x₁ to x_n.

Then, the certainty of a directed path [x₁, x_n] is defined as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}cer [ x_{1} , x_{n} ] = \mathop \prod \limits_{i = 1}^{n - 1} cer ( x_{1} , x_{n} ) \tag{18}\end{align*} \end{document}

The coverage of a directed path [x₁, x_n] is defined as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}cov [ x_{1} , x_{n} ] = \mathop \prod \limits_{i = 1}^{n - 1} cov ( x_{1} , x_{n} ) \tag{19}\end{align*} \end{document}

The strength of a directed path [x₁, x_n] is defined as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\sigma [ x_{1} , x_{n} ] = \begin{cases} \sigma ( x_1 ) cer [ x_1 , x_n ] = \sigma ( x_{1} ) \mathop \prod \limits_{i = 1}^{n - 1} cer ( x_{1} , x_{n} ) \\ \sigma ( x_{n} ) cov [ x_{1} , x_{n} ] = \sigma ( x_{n} ) \mathop \prod \limits_{i = 1}^{n - 1} cov ( x_{1} , x_{n} ) \end{cases} \tag{20}\end{align*} \end{document}

The set of all directed paths from x₁ to x_n denoted 〈x₁, x_n〉 as shown in Figure 4, where [x₁, x_n] ∈ 〈x₁, x_n〉.

FIG. 4.

The set, \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$\langle x_{1} , x_{n} \rangle$$ \end{document} , of all directed paths from node x₁ to x_n.

Then, the certainty of 〈x₁, x_n〉 is \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}cer \langle x_{1} , x_{n} \rangle = \sum_{ [ x_{1} , x_{n} ] \in \langle x_{1} , x_{n} \rangle} cer [ x_{1} , x_{n} ] \tag{21}\end{align*} \end{document}

The coverage of 〈x₁, x_n〉 is \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}cov \langle x_{1} , x_{n} \rangle = \sum_{ [ x_{1} , x_{n} ] \in \langle x_{1} , x_{n} \rangle} cov [ x_{1} , x_{n} ] \tag{22}\end{align*} \end{document}

The strength of 〈x₁, x_n〉 is \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\sigma \langle x_{1} , x_{n} \rangle = \sum \limits_{ [ x_{1} , x_{n} ] \in \langle x_{1} , x_{n} \rangle} \sigma [ x_{1} , x_{n} ] = \begin{cases} \sum \limits_{ [ x_{1} , x_{n} ] \in \langle x_{1} , x_{n} \rangle} \sigma ( x_{1} ) \mathop \prod \limits_{i = 1}^{n - 1} cer ( x_{1} , x_{n} ) \\ \sum \limits_{ [ x_{1} , x_{n} ] \in \langle x_{1} , x_{n} \rangle} \sigma ( x_{n} ) \mathop \prod \limits_{i = 1}^{n - 1} cov ( x_{1} , x_{n} ) \end{cases} \tag{23}\end{align*} \end{document}

The main objective of an influence diagram with a value node is to maximize the expected value (Shachter, 1986; Tatman and Shachter, 1990). To perform this, the following transformations are introduced. Consider the following cases:

Case 1: Removal of a chance node by summing it out of the joint can be accomplished, as shown in Figure 5. This corresponds to \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \begin{split} \sigma ( x_{5} \mid x_{1} , x_{2} , x_{3} ) \leftarrow \sum_{x_{4}} \sigma ( x_{4} , x_{5} \mid x_{1} , x_{2} , x_{3} ) \\ = \sum_{x_{4}} \sigma ( x_{5} \mid x_{2} , x_{3} , x_{4} ) \sigma ( x_{4} \mid x_{1} , x_{2} ) \end{split} \tag{24}\end{align*} \end{document}

FIG. 5.

Removal of a chance node x₄ into x₅ by summation.

Case 2: Arc reversal between two chance nodes, as shown in Figure 6, corresponds to the Bayes theorem and can be performed by the two-step operation as (25) and (26). \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \begin{split} \sigma ( x_{5} \mid x_{1} , x_{2} , x_{3} ) \leftarrow \sum_{x_{4}} \sigma ( x_{4} , x_{5} \mid x_{1} , x_{2} , x_{3} ) \\ = \sum_{x_{4}} \sigma ( x_{5} \mid x_{2} , x_{3} , x_{4} ) \sigma ( x_{4} \mid x_{1} , x_{2} ) \end{split} \tag{25}\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \begin{split} \sigma ( x_ { 4 } \mid x_ { 1 } , x_ { 2 } , x_ { 3 } , x_ { 5 } ) \leftarrow \frac { \sigma ( x_ { 4 } , x_ { 5 } \mid x_ { 1 } , x_ { 2 } , x_ { 3 } ) } { \sigma ( x_ { 5 } \mid x_ { 1 } , x_ { 2 } , x_ { 3 } ) } \\ = \frac { \sigma ( x_ { 5 } \mid x_ { 2 } , x_ { 3 } , x_ { 4 } ) \sigma ( x_ { 4 } \mid x_ { 1 } , x_ { 2 } ) } { \sigma ( x_ { 5 } \mid x_ { 1 } , x_ { 2 } , x_ { 3 } ) } \end{split} \tag { 26 }\end{align*} \end{document}

FIG. 6.

Reversal of an arc from node x₄ to x₅.

Case 3: Removal of a chance node by expectation, as shown in Figure 7, can be accomplished as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}E [ U ( u \mid x_{1} , x_{2} , x_{3} ) ] \leftarrow E [ U_{x_{4}} ( E [ U ( u \mid x_{2} , x_{3} , x_{4} ) ] \mid x_{1} , x_{2} ) ] \tag{27}\end{align*} \end{document}

FIG. 7.

Removal of a chance node x₄ into u by expectation.

Case 4: Removal of a decision node by maximization, as shown in Figure 8, can be accomplished as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}E [ U ( u \mid x_{2} ) ] \leftarrow \max_{d} E [ U ( u \mid d , x_{2} ) ] \tag{28}\end{align*} \end{document}

FIG. 8.

Removal of a decision node d by maximization.

According to the cases above, the alternative numerical model based on rough set can be constructed for influence diagrams.

5. Numerical Examples

Example 1. Consider an influence diagram as shown in Figure. 1. According to (9), and (10), the certainty factors and value functions can be obtained in Table 2.

Table 2.

The Certainty Factors and Value Functions of Figure 1

σ(m = 1) = 0.2

cer(m, i) : σ(i = 1|m = 1) = 0.8

σ(i = 1|m = 0) = 0.2

cer(m, b) : σ(b = 1|m = 1) = 0.7

σ(b = 1|m = 0) = 0.2

cer((b, i), c) : σ(c = 1|b = 1, i = 1) = 0.9

σ(c = 1|b = 1, i = 0) = 0.6

σ(c = 1|b = 0, i = 1) = 0.6

σ(c = 1|b = 0, i = 0) = 0.05

Q = q(m = 0, t = No) = 100

Q = q(m = 1, t = No) = 20

Q = q(m = 0, t = Yes) = 80

Q = q(m = 1, t = Yes) = 70

The objective of this problem is to maximize the expected utilities as follows. \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\max_{t} E [ U ( q \mid m , t ) ] = \max_{t} \sum q ( m , t ) \sigma ( m \mid b , c , i ) \tag{29}\end{align*} \end{document}

where σ(m|b, c, i) is the posterior (belief) function of m, and \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \begin{split} \sigma ( m \mid b , c , i ) = \frac { \sigma ( m , b , c , i ) } { \sigma ( b , c , i ) } \\ = \frac { \sigma ( m ) \sigma ( b \mid m ) \sigma ( i \mid m ) \sigma ( c \mid b , i ) } { \sum \limits_ { m } \sigma ( m ) \sigma ( b \mid m ) \sigma ( i \mid m ) \sigma ( c \mid b , i ) } \end{split} \tag { 30 } \end{align*} \end{document}

Suppose the observation c = 0 then: \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\sigma ( m = 1 \mid c = 0 ) = \frac { { \sum \limits_ { b , c = 0 , i } \sigma ( m = 1 ) \sigma ( b \mid m = 1 ) \sigma ( i \mid m = 1 ) \sigma ( c = 0 \mid b , i ) } } { \sum \limits_ { b , c = 0 , i , m } \sigma ( m ) \sigma ( b \mid m ) \sigma ( i \mid m ) \sigma ( c = 0 \mid b , i ) } \tag { 31 } \end{align*} \end{document}

The numerator and denominator of (31) can be computed as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \sum_{b , c = 0 , i} \sigma ( m = 1 ) \sigma ( b \mid m = 1 ) \sigma ( i \mid m = 1 ) \sigma ( c = 0 \mid b , i ) \\ = \sigma ( m = 1 ) \sigma ( b = 1 \mid m = 1 ) \sigma ( i = 1 \mid m = 1 ) \sigma ( c = 0 \mid b = 1 , i = 1 ) + \\ \sigma ( m = 1 ) \sigma ( b = 1 \mid m = 1 ) \sigma ( i = 0 \mid m = 1 ) \sigma ( c = 0 \mid b = 1 , i = 0 ) + \\ \sigma ( m = 1 ) \sigma ( b = 0 \mid m = 1 ) \sigma ( i = 1 \mid m = 1 ) \sigma ( c = 0 \mid b = 0 , i = 1 ) + \\ \sigma ( m = 1 ) \sigma ( b = 0 \mid m = 1 ) \sigma ( i = 0 \mid m = 1 ) \sigma ( c = 0 \mid b = 0 , i = 0 ) \\ = 0.2 \times 0.7 \times 0.8 \times 0.1 + 0.2 \times 0.7 \times 0.2 \times 0.4 + \\ 0.2 \times 0.3 \times 0.8 \times 0.4 + 0.2 \times 0.3 \times 0.2 \times 0.95 \\ = 0.053\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \sum_{b , c = 0 , i} \sigma ( m = 0 ) \sigma ( b \mid m = 0 ) \sigma ( i \mid m = 0 ) \sigma ( c = 0 \mid b , i ) \\ = \sigma ( m = 0 ) \sigma ( b = 1 \mid m = 0 ) \sigma ( i = 1 \mid m = 0 ) \sigma ( c = 0 \mid b = 1 , i = 1 ) + \\ \sigma ( m = 0 ) \sigma ( b = 1 \mid m = 0 ) \sigma ( i = 0 \mid m = 0 ) \sigma ( c = 0 \mid b = 1 , i = 0 ) + \\ \sigma ( m = 0 ) \sigma ( b = 0 \mid m = 0 ) \sigma ( i = 1 \mid m = 0 ) \sigma ( c = 0 \mid b = 0 , i = 1 ) + \\ \sigma ( m = 0 ) \sigma ( b = 0 \mid m = 0 ) \sigma ( i = 0 \mid m = 0 ) \sigma ( c = 0 \mid b = 0 , i = 0 ) \\ = 0.8 \times 0.2 \times 0.2 \times 0.1 + 0.8 \times 0.2 \times 0.8 \times 0.4 + \\ 0.8 \times 0.8 \times 0.2 \times 0.4 + 0.8 \times 0.8 \times 0.8 \times 0.95 \\ = 0.592\end{align*} \end{document}

Then (31) can be obtained as \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \sigma ( m = 1 \mid c = 0 ) = \frac { \sum_ { b , c = 0 , i } \sigma ( m = 1 ) \sigma ( b \mid m = 1 ) \sigma ( i \mid m = 1 ) \sigma ( c = 0 \mid b , i ) } { \sum_ { b , c = 0 , i , m } \sigma ( m ) \sigma ( b \mid m ) \sigma ( i \mid m ) \sigma ( c = 0 \mid b , i ) } \\ = \frac { 0.053 } { 0.053 + 0.592 } = 0.0822 \end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \sigma ( m = 0 \mid c = 0 ) = \frac { \sum_ { b , c = 0 , i } \sigma ( m = 0 ) \sigma ( b \mid m = 0 ) \sigma ( i \mid m = 0 ) \sigma ( c = 0 \mid b , i ) } { \sum_ { b , c = 0 , i , m } \sigma ( m ) \sigma ( b \mid m ) \sigma ( i \mid m ) \sigma ( c = 0 \mid b , i ) } \\ = \frac { 0.592 } { 0.053 + 0.592 } = 0.9178 \end{align*} \end{document}

Hence \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} & E [ U ( q \mid m , t = { \rm Yes} ) ] \\ & = \sum_{b , i} q ( m = 1 , t = { \rm Yes} ) \sigma ( m = 1 \mid c = 0 ) + \sum_{b , i} q ( m = 0 , t = { \rm Yes} ) \sigma ( m = 0 \mid c = 0 ) \\ & = q ( m = 1 , t = { \rm Yes} ) \sum_{b , i} \sigma ( m = 1 \mid c = 0 ) + q ( m = 0 , t = { \rm Yes} ) \sum_{b , i} \sigma ( m = 0 \mid c = 0 ) \\ & = 70 \times 0.0822 + 80 \times 0.9178 \\ = 79.178\end{align*} \end{document} \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} & E [ U ( q \mid m , t = { \rm No} ) ] \\ & = \sum_{b , i} q ( m = 1 , t = { \rm No} ) \sigma ( m = 1 \mid c = 0 ) + \sum_{b , i} q ( m = 0 , t = { \rm No} ) \sigma ( m = 0 \mid c = 0 ) \\ & = q ( m = 1 , t = { \rm No} ) \sum_{b , i} \sigma ( m = 1 \mid c = 0 ) + q ( m = 0 , t = { \rm No} ) \sum_{b , i} \sigma ( m = 0 \mid c = 0 ) \\ & = 20 \times 0.0822 + 100 \times 0.9178 \\ & = 93.424\end{align*} \end{document}

Hence, the optimal decision to maximize the utilities is \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\max_{t}E [ U ( q \mid m , t ) ] = \max_{t} \{ 79.178 , 93.424 \} = 93.424\end{align*} \end{document}

where the optimal decision is t = No (no treatment).

Example 2. Consider a case of urinary tract infection (UTI) modified from Leibovici et al. (2000), as shown in Figure 9.

FIG. 9.

A case of urinary tract infection (UTI).

There are one decision node V_D = {Tr}, two value nodes V_u = {C, G}, and eighteen chance nodes V_R = {C_o, UL, UTI, P₁, P₂, P₃, P₄, T₁, T₂, T₃, S₁, S₂, S₃, S₄, S₅, S₆, S₇, S₈} relevant to the urinary tract infection and treatments. The descriptions and states of the nodes are listed as below.

1. Tr (antibiotic treatment): A decision node that the treatment will be suitable if it matches the in vitro susceptibility of the pathogens. In this case, we consider one state for no treatment and five antibiotic drugs. There are six alternatives, \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$Tr = \{ tr \} , tr = 0 , 1 , \ldots , 5$$ \end{document} , where tr = 0 means no treatment; \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$tr = i , i = 1 , 2 \ldots , 5$$ \end{document} stands for the ith treatment prescribed.

2. C (costs of antibiotic treatment): A value node associated with costs of antibiotic treatments, tr_i.

3. G (gain in life expectancy): A value node associated with the gain in life expectancy obtained by prescribing an antibiotic drug, which is a function of the coverage of urinary tract infection (Co) and the underlying disorder of the patient (UL).

4. Co (coverage of urinary tract infection): The percent of pathogens of urinary tract infection impressionable to an antibiotic drug. The states of Co are not covered (Co = 0) and covered (Co = 1), Co ∈ {0, 1}.

5. P_i (pathogen): A microorganism capable of causing urinary tract infection. In this case, four pathogens are considered: P₁ (Klebsiella pneumoniae), P₂ (Pseudomonas aeruginosa), P₃ (Escherichia coli), and P₄ (proteus). The states of P_i are not severe (P_i = 0) and severe (P_i = 1), P_i ∈ {0, 1},i = 1, 2, 3, 4.

6. S_i (symptoms of urinary tract infection): The manifestations caused from urinary tract infection. In this case, eight symptoms are considered: S₁ (suprapubic pain), S₂ (frequent micturition), S₃ (flank pain), S₄ (urinary), S₅ (serum albumin), S₆ (fever), S₇ (leucocyturie), and S₈ (hematuria). The states of S_i are absent (S_i = 0) and present \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$( S_i = 1 ) , \ S_i \in \{ 0 , 1 \} , i = 1 , 2 , \ldots , 8$$ \end{document} .

7. T₁ (bacteriological tests): Three bacteriological tests are considered: T₁ (growth of microorganisms in blood), T₂ (growth of microorganisms in urine), and T₃ (nitrite test). The states of T_i are negative (T_i = 0) and positive (T_i = 1), T_i ∈ {0, 1}, i = 1, 2, 3.

8. UL (underlying disorder of the patient): This will be illustrated by an equivalent base years of the remaining life.

9. UTI (urinary tract infection): The states of UTI are not infected (UTI = 0) and infected (UTI = 1), UTI ∈ {0, 1}.

The descriptions and states of the nodes are summarized in Table 3. Consider the evidence that a patient has suffered from frequent micturition (S₂ = 1), flank pain (S₃ = 1), urinary symptoms (S₄ = 1), and leucocyturie (S₇ = 1), but has not fallen into a suprapubic pain (S₁ = 0), serum albumin (S₅ = 0), fever (S₆ = 0), or hematuria (S₈ = 0). The evidence set is {S₁ = 0,S₂ = 1,S₃ = 1,S₄ = 1,S₅ = 0,S₆ = 0,S₇ = 1,S₈ = 0}. We intend to maximize the expected net gain of life expectancy according to the symptoms as below \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*}\max_{tr} Gain = E [ G ( co , ul ) - C ( tr ) ] \tag{32}\end{align*} \end{document}

Table 3.

The Descriptions of the Nodes in Figure 9

Node	Description	State
Tr	Antibiotic treatment	\documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$Tr = \{ tr_{i} \} , tr_{i} \in \{ 0 , 1 \} , i = 0 , 1 \ldots , 5$$ \end{document}
C	Costs of antibiotic treatment
G	Gain in life expectancy
Co	Coverage of UTI	Co ∈ {0, 1}, 0: not covered, 1: covered
P ₁	Klebsiella pneumonia	P₁ ∈ {0, 1}, 0: absent, 1: present
P ₂	Pseudomonas aeruginosa	P₂ ∈ {0, 1}, 0: absent, 1: present
P ₃	Escherichia coli	P₃ ∈ {0, 1}, 0: absent, 1: present
P ₄	Proteus	P₄ ∈ {0, 1}, 0: absent, 1: present
R	Resistance to antibiotic drugs	R ∈ {0, 1}, 0: not resistant, 1: resistant
S ₁	Suprapubic pain	S₁ ∈ {0, 1}, 0: absent, 1: present
S ₂	Frequent micturition	S₂ ∈ {0, 1}, 0: absent, 1: present
S ₃	Flank pain	S₃ ∈ {0, 1}, 0: absent, 1: present
S ₄	Urinary	S₄ ∈ {0, 1}, 0: absent, 1: present
S ₅	Serum albumin	S₅ ∈ {0, 1}, 0: absent, 1: present
S ₆	Fever	S₆ ∈ {0, 1}, 0: absent, 1: present
S ₇	Leucocyturie	S₇ ∈ {0, 1}, 0: absent, 1: present
S ₈	Hematuria	S₈ ∈ {0, 1}, 0: absent, 1: present
T ₁	Growth of microorganisms in blood	T₁ ∈ {0, 1}, 0: negative, 1: positive
T ₂	Growth of microorganisms in urine	T₂ ∈ {0, 1}, 0: negative, 1: positive
T ₃	Nitrite test	T₃ ∈ {0, 1}, 0: negative, 1: positive
UL	Underlying disorder of the patient
UTI	Urinary tract infection	UTI ∈ {0, 1}, 0: not infected, 1: infected

Following the assumption (Kao and Li, 2005) that the underlying disorder and health status can be converted to an equivalent base year; in this case, 35 years base and 1 year gained in life can be regarded as equivalent to $55,000 (given ul) (Leibovici et al., 2000). The gain can be computed as below \documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} \begin{align*} \begin{split} & Gain \\ & = 55 , 000 \times \sigma ( co \mid s_{1} , s_{2} , \ldots , s_{8} ) - C ( tr ) \\ & = 55 , 000 \times \sum_{ \scriptstyle p_{1} , p_{2} , p_{3} , p_{4} , \atop \scriptstyle tr , uti} \sigma ( co \mid p_{1} , p_{2} , p_{3} , p_{4} , tr , uti , s_{1} , s_{2} , \ldots , s_{8} ) - C ( tr ) \end{split} \tag{33}\end{align*} \end{document}

Selected rough sets of this case are shown in Table 4.

Table 4.

Selected Rough Sets for Figure 9

σ(p₁=1)=0.99,σ(p₂=1)=0.97,σ(p₃=1)=0.93,σ(p₄=1)=0.99

σ(s₁=1|uti=1)=0.65,σ(s₁=1|uti=0)=0.01

σ(s₂=1|uti=1)=0.90,σ(s₂=1|uti=0)=0.10

σ(s₃=1|uti=1)=0.60,σ(s₃=1|uti=0)=0.05

σ(s₄=1|uti=1)=0.80,σ(s₄=1|uti=0)=0.05

σ(s₅=1|uti=1)=0.60,σ(s₅=1|uti=0)=0.10

σ(s₆=1|uti=1)=0.70,σ(s₆=1|uti=0)=0.01

σ(s₇=1|uti=1)=0.80,σ(s₇=1|uti=0)=0.05

σ(s₈=1|uti=1)=0.75,σ(s₈=1|uti=0)=0.05

We adopt the stochastic simulation method as shown in Table 5 to obtain the optimal treatment and utility. The results are shown in Table 6. The optimal treatment is tr₃, which brings about $22,820 equivalent net gain.

Table 5.

The Stochastic Simulation Algorithm

for i=1 to simulation_ no do:

Choose the current node X from UnknownNodeList.

If EOF, then X=the first node in UnknownNodeList

and proceed to next iteration.

Compute

\documentclass{aastex}\usepackage{amsbsy}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{bm}\usepackage{mathrsfs}\usepackage{pifont}\usepackage{stmaryrd}\usepackage{textcomp}\usepackage{portland, xspace}\usepackage{amsmath, amsxtra}\pagestyle{empty}\DeclareMathSizes{10}{9}{7}{6}\begin{document} $$ \frac { \sigma ( X = 1 \mid W_x ) } { \sigma ( X = 0 \mid W_x ) } $$ \end{document}

, where W_x is the conditions of all

neighboring nodes (Markov blanket) of X without X itself.

Assign X the random number generator favoring 1 by the ratio

If X=1, then count(X)=count(X)+1.

Assign X the next node in UnknownNodeList.

End. /* of for */

Compute BEL(*) the ratio of the target value of every node * in UnknownNodeList. Then check BEL(*) with test data to validate the accuracy of classification.

Table 6.

The Result of Urinary Tract Infection (UTI)

Treatment	Certainty Co	Gain	Cost	Net gain
tr ₀	0.360	19800	5000	14800
tr ₁	0.770	42350	20000	22350
tr ₂	0.835	45925	25000	20925
tr ₃	0.924	50820	28000	22820
tr ₄	0.795	43725	32000	11725
tr ₅	0.898	49390	32000	9390

6. Conclusions and Future Plans

This study proposes an alternative numerical framework for influence diagrams, rough sets, and develops the rough-set influence diagrams that combine decision rules with the graphical structure in medical settings. Considering the imprecise knowledge, this study formulates the causal relationships and the decision rules among the nodes (attributes) with rough sets from information systems. The proposed knowledge model provides a comprehensive schema for knowledge representation and decision support.

For future studies, there are several potential themes: (1) to integrate analysis with rough sets in various graphical decision models, including Bayesian networks, decision trees, and influence diagrams; (2) hybrid decision analysis with fuzzy sets and rough sets in graphical decision models; and (3) potential applications of intelligent decision support with rough sets, such as supply chain management, business strategic analysis, and biomedicine.

Footnotes

Acknowledgments

The author would like to appreciate the anonymous referees for their careful reading and fruitful comments on the manuscript. Also special thanks to the National Science Council, Taiwan, for financially supporting this research under grant no. NSC 100-2410-H-141-006-MY2 and 102-2410-H-141-012-MY2.

Author Disclosure Statement

The author declares that there is no conflict of interest regarding the publication of this article. No competing financial interests exist.

References

Cooper

G.F.

1984. NESTOR: a computer-based medical diagnostic aid that integrates causal and probabilistic knowledge [PhD dissertation]. Department of Computer Science, Stanford University.

Howard

R.A.

, and Matheson

J.E.

1981. Influence diagrams, 719–762. In Howard

R.A.

, and Matheson

J.E.

, eds. The Principles and Applications of Decision Analysis. Strategic Decisions Group, Menlo Park, CA.

Jensen

F.V.

2001. Bayesian Networks and Decision Graphs. Springer-Verlag, Inc., New York.

Kao

H.Y.

2008. Diagnostic reasoning and medical decision-making with fuzzy influence diagrams. Comput. Meth. Prog. Bio., 90, 9–16.

Kao

H.Y.

, and Li

H.L.

2005. A diagnostic reasoning and optimal treatment model for bacterial infections with fuzzy information. Comput. Meth. Prog. Bio., 77, 23–37.

Kononenko

2001. Machine learning for medical diagnosis: history, state of the art and perspective. Artif. Intell. Med., 23, 89–109.

Leibovici

, Fishman

, Schunheyder

H.C.

, et al. 2000. A causal probabilistic network for optimal treatment of bacterial infections. IEEE Trans. Knowl. Data Eng., 12, 517–528.

H.L.

, and Kao

H.Y.

2005. Constrained abductive reasoning with fuzzy parameters in bayesian networks. Comput. Oper. Res., 32, 87–105.

Long

W.J.

2001. Medical informatics: reasoning methods. Artif. Intell. Med., 23, 71–87.

10.

López-Díaz

, and Rodríguez-Muñiz

L.J.

2007. Influence diagrams with super value nodes involving imprecise information. Eur. J. Oper. Res., 179, 203–219.

11.

Neapolian

R.E.

2004. Learning Bayesian Networks. Pearson Education Inc., Upper Saddle River, NJ.

12.

Nease

R.F.

, and Owens

D.K.

1997. Use of influence diagrams to structure medical decisions. Med. Decis. Making, 17, 263–275.

13.

Oliver

R.M.

, and Smith

J.Q.

1990. Influence Diagrams, Belief Nets and Decision Analysis, John Wiley & Sons, New York.

14.

Owens

D.K.

, Shachter

R.D.

, and Nease

R.F.

1997. Representation and analysis of medical decision problems with influence diagrams. Med. Decis. Making, 17, 251–262.

15.

Pearl

1997. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann Publishers, Inc., Burlington, MA.

16.

Pawlak

1982. Rough sets. Info. J. Info. Comput. Sci., 11, 341–356.

17.

Pawlak

1991. Rough Sets-Theoretical Aspects of Reasoning About Data. Kluwer Academic Publishers, Dordrecht.

18.

Pawlak

2002. Rough sets, decision algorithms and Bayes' theorem. Eur. J. Oper. Res., 136, 181–189.

19.

Pawlak

2003. Probability, truth and flow graph. Electron. Notes Theor. Comput. Sci., 82, 1–9.

20.

Pawlak

, Wong

S.K.M.

, and Ziarko

1988. Rough sets: probabilistic versus deterministic approach. Int. J. Man Mach. Stud., 29, 81–95.

21.

Shachter

R.D.

1986. Evaluating influence diagrams. Oper. Res., 34, 871–882.

22.

Tatman

J.A.

, and Shachter

R.D.

1990. Dynamic programming and influence diagrams. IEEE Trans. Syst. Man Cybernet., 20, 365–379.

23.

Yao

Y.Y.

2003. Probabilistic approaches to rough sets. Expert Syst., 20, 287–297.