Abstract
Medical assisted decision-making plays a key role in providing accurate and reliable medical advice. But in medical decision-making, various uncertainties are often accompanied. The belief rule base (BRB) has a strong nonlinear modeling capability and can handle uncertainties well. However, BRB suffers from combinatorial explosion and tends to influence explainability during the optimization process. Therefore, an interval belief rule base with explainability (IBRB-e) is explored in this paper. Firstly, pre-processing using extreme gradient boosting (XGBoost) is performed to filter out features with lower importance. Secondly, based on the filtered features, explainability criterion is defined. Thirdly, evidence reasoning (ER) rule is chosen as an inference tool, while projection covariance matrix adaptive evolutionary strategy (P-CMA-ES) algorithm with explainability constraints is chosen as an optimization algorithm. Lastly, the validation of the model is performed through a breast cancer case. The experimental results show that IBRB-e has good explainability while maintaining high accuracy.
Introduction
Medical assisted decision-making plays a pivotal role in ensuring patients’ health management and subsequent treatment [1]. However, in actual clinical decision-making, a common challenge often arises, that is, the scarcity of an adequate number of human medical decision-making experts with rich experience [2]. Additionally, with the explosive growth of medical data, especially when dealing with a substantial volume of intricate medical information within a short timeframe, various unpredictable challenges and errors may arise during the medical decision-making process [3]. In this context, accurate medical assisted decision-making methods become crucial to ensure that patients receive the best possible medical care and treatment plans.
In recent years, the introduction of artificial intelligence into the medical field has brought about entirely new methods and avenues for medical assisted decision-making [4]. These primarily encompass the following three methods: data-driven methods, model-driven methods, and hybrid information methods. The data-driven methods depend on extensive training data during modeling, utilizing patterns learned from the data to construct models [5]. For example, Haddi et al. [6] proposed an automatic medical decision-making method based on the relevance vector machine (RVM), and optimized the RVM by configuring various kernel functions and parameters. This approach demonstrated excellent performance in addressing atrial fibrillation issues. Andre et al. [7] used a single convolutional neural network (CNN) to classify skin lesions, which achieved performance comparable to all test experts on the task, proving that CNN is capable of classifying skin cancers with the ability of dermatologists. Huang et al. [8] proposed an intelligent data-driven model using support vector machine (SVM) and random forest (RF), which applied to a diagnostic case of cough-variant asthma (CVA). The results demonstrated that the model outperformed individual SVM and RF models, showcasing improved predictive performance. Nicolas et al. [9] introduced a data-driven approach to diagnose Parkinson’s disease (PD), leveraging vertical Ground Reaction Forces (vGRFs) data obtained during the gait cycle. This method effectively distinguishes PD subjects from those with other neurodegenerative disorders. However, in situations involving a small volume of data or high model complexity, the data-driven methods may run the risk of overfitting. This refers to the tendency of the model to excessively adapt to the training data, resulting in poorer performance when applied to new, unseen data.
The model-driven methods describe and solve problems by constructing mathematical models based on prior knowledge, rules, and theories in the modeling process [10]. For instance, Wu et al. [11] applied a “white-box” modeling method. The molecular subtypes of breast cancer are predicted in a multimodal environment using the BI-RADS function of mammography and MRI. Tolks et al. [12] proposed a classification method for a dynamic model of diabetes, which employed principal component analysis (PCA) and support vector machine (SVM) to identify an appropriate classifier. The classification results demonstrated a strong discriminative capability among these categories. Uzair et al. [13] introduced a novel model-driven deep deterministic learning (MDDDL) approach, and defined the system behavior of MDDDL based on pattern recognition of extracted features and subsequently employed empirical methods to effectively evaluate the performance of MDDDL. However, building and maintaining a model can require significant time, resources, and expertise. Especially when complex problems and large-scale data are involved, the design and adjustment of models can become very complex.
The hybrid information methods combine the above two models by combining multiple different types of data to jointly model the data to obtain more comprehensive information and more accurate predictions. For example, Thosini et al. [14] created a deep fuzzy neural network to manage uncertain genetic data for disease diagnosis. Results on six datasets showcased the superior performance of the novel approach over traditional methods using different gene selection techniques. Matteo et al. [15] developed a hybrid machine deep learning model to classify patients into two outcome categories, the CNN classifier is used as a feature extractor, and experiments proved that the model provided doctors with very reliable clinical decision support. Diogo et al. [16] employed a hybrid modeling approach to patients with chronic obstructive pulmonary disease (COPD). By combining hierarchical clustering and decision tree classification techniques, they generated a classification model with a predictive accuracy of 90.4%, thereby offering patients a reliable and affordable alternative for disease management. The hybrid information method combines the advantages of both approaches and allows for faster modeling of data.
BRB is originally proposed by Yang et al. [17], which is a typical hybrid method [18]. As an evidence-based reasoning knowledge representation, it is essentially an expert system [19]. BRB performs well in dealing with various uncertainties [20]. It deals with uncertainty by synthesizing input information and modeling correlations between antecedent properties and outcomes. In addition, BRB can not only maintain a high accuracy rate under small sample data, but also has a certain degree of explainability [21], which is of great significance for medical assistant decision-making. Therefore, BRB has been widely used in many fields [22, 23, 24, 25, 26, 27, 28, 29, 30], in addition, BRB has been widely used in the medical field, for example, Han et al. [31] proposed a hierarchical BRB disease diagnosis model based on power set (HBRBp), which solves the problem of combination rule explosion and inability to effectively deal with local ignorance. It proves the effectiveness of BRB in the field of medical decision-making.
However, the BRB needs to address two issues in medical assisted decision-making: (1) The Cartesian product is employed by BRB for rule combination, which can lead to the problem of rule explosion when there are too many attributes or reference values are considered [32], thereby reducing the efficiency of the model. (2) While optimization algorithms are employed to enhance the BRB model, it is important to note that these algorithms often come with inherent randomness. The explainability of BRB can potentially be impacted by this randomness, consequently leading to a reduction in overall credibility [33]. Therefore, an IBRB-e medical assisted decision-making model with explainable constraints is proposed. Firstly, the reference values for premise attributes are set in interval form, and a new rule combination method is employed, avoiding the issue of rule explosion. Secondly, ER rule is used as the inference engine, taking into account the weights and reliability of the rules. Finally, the optimization process utilizes the P-CMA-ES optimization algorithm with explainability constraints, enhancing the explainability of the model.
This paper offers the following key contributions: (1) The IBRB-e model is constructed to ensure both the accuracy and explainability of the model. (2) Applying IBRB-e to the medical field to help doctors make more reliable medical decisions.
The rest of this paper is as follows: In Section 2, problems in aiding decision-making by disease are addressed. In Section 3, the construction of IBRB-e is introduced, the step to derive feature importance using XGBoost is described, and the process of modeling, inference, and optimization is described. In Section 4, the effectiveness is illustrated by the example of breast cancer. This paper is summarized in Section 5.
BRB and problem formulation
This section focuses on the problems faced in the modeling process of the IBRB-e medical assisted decision-making model. In Subsection 2.1, the BRB (rule building and reasoning optimization) is briefly introduced. In Subsection 2.2, various problems suffered by the IBRB-e medical assisted decision-making model during the construction process are discussed.
Description of BRB
BRB is modeled as a rule-based model with attributes that allow for the incorporation of qualitative knowledge, and uncertainty can be expressed through belief distribution. The description of
The reasoning process of BRB is roughly divided into three steps: Step 1: calculating the rule matching degree. Step 2: calculating the activation weights of the rules. Step 3: using ER parsing algorithm for rule synthesis, but ER parsing algorithm does not consider rule reliability, which leads to the inability to clearly understand the contribution of each rule to the result. The P-CMA-ES algorithm is the optimization algorithm of BRB, but P-CMA-ES algorithm without explainable constraints is prone to lose the explainability in the optimization process. In addition, BRB uses the form of Cartesian product to construct the belief table, which is prone to explode the combination rules when there are too many premise attributes. The IBRB-e model presented in this paper successfully solves the above problems.
Based on the special needs of the medical field, the application of IBRB-e model in this context requires consideration of the following three important questions:
Question 1: How to solve the combinatorial rule explosion problem that exists in BRB? To solve this problem, the IBRB-e model uses interval addition to reduce rule generation, which avoiding exponential growth of rules.
Question 2: How can the problem of belief rule bases losing explainability during optimization be solved? To solve this problem, this paper adds explainability constraints on the basis of the consistent P-CMA-ES to improve the explainability. The explainability constraints can be described as follows:
where
where
Question 3: How to construct an IBRB-e model for considering rule reliability? This paper establishes a reasonable nonlinear mapping relationship
where
This section mainly describes the modeling process of IBRB-e model. Firstly, in Section 3.1, the XGBoost algorithm and its application in feature importance analysis are described in detail. Secondly, in Section 3.2, the rule construction for the IBRB-e model is explained. Thirdly, the Section 3.3 to Section 3.5 describe the modeling, reasoning, and optimization of the IBRB-e model respectively
Feature importance analysis
In recent years, experts have been exploring techniques for solving problems with high-dimensional datasets [34]. Feature importance analysis plays a very important role in dimensionality reduction [35]. Through this analysis, it can help to determine which features have the most significant impact on the prediction results. By calculating the importance score of the features, the contribution of each feature to the model can be calculated to select and optimize the model.
As an algorithm in the field of machine learning, XGBoost has been widely used in many fields [36]. Therefore, this paper chooses to use XGBoost to analyze the importance of data features. The step of determining feature importance in XGBoost involves training multiple decision trees. During training, feature coverage and gain will be calculated for each decision tree construct. Coverage-based feature importance entails taking the coverage of each feature in the cumulative decision tree and then ranking the features according to their importance. The gain-based feature importance will accumulate the gain of each feature and rank by importance.
Decision tree updates are derived from the objective function given by the formula.
where
According to the literature [37], the explainability standard can be described as follows: (1) The number of activated rules is the same as the premise attribute. It should not be the case that there is only one input but no corresponding spacing rule (2) The division of reference intervals should be more reasonable, and the division of reference intervals should not overlap in principle. (3) The model should have sufficient meaning. To ensure that it can be adapted to the actual applications. (4) The conversion from input to output should take into account the belief of the original sample information. (5) The reasoning process must be explainable and the results must be traceable. (6) The number of setting interval rules should be appropriate to avoid affecting the performance.

Overall structure of IBRB-e.
Based on the BRB and inspired by the paper, this paper proposes IBRB-e model. The reference value is set as an interval in this model, and the interval addition is employed to construct a belief table for addressing the issue of rule explosion in BRB. Furthermore, explainability constraints are introduced in the optimization process, which makes the process more reasonable, and the
where
Generally, the modeling is as follows:
Step 1: Analyzing the problem, that is, determine why the problem arose and the effect of the problem when it is solved. The premise attribute is generally selected as the antecedent of the problem, and the output is selected as the effect after the problem is solved.
Step 2: Relevant reference points and reference values are established. Unlike the BRB, the reference values of this model are set in the form of intervals. In the IBRB-e model, when a data sample of a premise attribute falls in an interval, the rule corresponding to that interval is activated. This means that one rule is activated for each data sample of the premise attribute. Subsequently, all the rules activated by the premise attribute are jointly involved in the inference process of the model. For instance, assuming two premise attributes, each with four reference points, the reference points and reference value Settings for BRB and IBRB-e are shown in Tables 1–4. The four tables show the difference between the reference value settings of IBRB-e and BRB.
Reference value of premise attribute 1 in BRB.
Reference value of premise attribute 1 in BRB.
Reference value of premise attribute 2 in BRB.
Reference value of premise attribute 1 in IBRB-e.
Reference value of premise attribute 2 in IBRB-e.
Step 3: The corresponding belief table is set up. The BRB creates the belief table using a Cartesian product approach, and the method of interval addition and combination is employed by IBRB-e to create the belief rule table [38]. The difference between the generation rules is presented in Figure 2, and the difference in rule generation is illustrated in Figure 3. Specifically, when BRB and IBRB-e in Step 2 create their own belief tables, the generated rules are shown in Tables 5 and 6.
The belief table of BRB.
The belief table of IBRB-e.
From the Tables 5 and 6, when BRB and IBRB-e each have four reference points for a single premise attribute, BRB generates 16 rules, while IBRB-e only generates 8 rules. This effect is more obvious when the reference value is large. Therefore, IBRB-e does not require too many premise properties to maintain model efficiency while improving accuracy.

The distinction between IBRB-e and BRB generation belief rules.

The difference in rule generation.

Reasoning process.
According to Figure 3, the reference points are set for each of the two premise attributes. The reference points for the first premise attribute are P, M, G, and E, and the reference points for the second premise attribute are VL, L, H, and VH, respectively. in the BRB model, since the rule combination is performed by Cartesian product, the number of generated rules is 4*4
The ER parsing algorithm is not utilized by the IBRB-e medical assistant decision-making model. Instead, IBRB-e emphasizes the weight of the evidence and the reliability of the evidence, using ER rule as the inference machine. The process of inference is depicted in Figure 4.
Step 1: In ER rule, evidence is acquired through varying approaches, it may be affected by various interferences during the acquisition process, and the evidence might not be entirely dependable.
where
Step 2: If the weight
where any
Step 3: The
where
Step 4: The collective support degree
where
Step 5: Finally, the final expected utility and output belief distribution is given by:
where
As an optimization algorithm, the P-CMA-ES exhibits the following benefits [39]: (1) Fast convergence: Using parallel computing, multiple solutions can be evaluated at the same time to speed up the convergence speed. (2) Scalability: It is suitable for high-performance computing systems to fully utilize computing resources. (3) Robustness: It can handle complex, nonlinear and multimodal optimization problems. (4) Global optimization does not rely on gradient information and is suitable for problems where gradients are unavailable or expensive. (5) Adaptive step size: By adaptively adjusting the step size, a balance is found between exploration and utilization. (6) Population strategy: Maintaining the population of candidate solutions helps to maintain diversity and avoid falling into local optimal solutions. (7) Wide application: It can be used for optimization problems with continuous, discrete and mixed variables.
Since the parameters of the model may lose meaning during the optimization process, in this paper, a P-CMA-ES with explainability constraints is represented, as shown in Figure 5, which has a certain degree of explainability and holds significant importance within the medical domain. The optimized objective function can be expressed as follows:
where MSE is a loss function, and the specific formula is given by:
where
Step 1: The required parameters is established. The parameters subject to optimization are as follows:
where
Step 2: Data sampling. The update operation of the parameters through the operation can be expressed as follows:
where
Step 3: Incorporating explainability constraints. Adding explainability to ensure that IBRB-e is reasonable, the constraints are as follows:
where
Step 4: Projection. Through projection, the solution vector can conform the explainable conditions added to the optimization, and the equation is given by:
where
Step 5: Updating the parameter. The formula is as follows:
where
Step 6: Calculating the covariance matrix. The specific calculation is as follows:
where
Under generation

The process of P-CMA-ES.
In IBRB-e, it is inevitable that two inputs fall into the same range, thereby activating the same rule. For this situation, setting the range more densely is a very good solution.
The running environment of this experiment is Windows 11 version and the running software is MATLAB R2021b version. The datasets used in the experiment are all opening source datasets. The experimental division of the dataset is done by the leave-out method, where a portion of the data is selected, after which the training and test sets are divided proportionally.
Description of the dataset
The dataset comes from the Wisconsin Breast Cancer Diagnostic Data Set on the UCI website, which is open source and extensively utilized in the domains of machine learning and data mining tasks for breast cancer diagnosis. The data is a class-balanced dataset, and most commonly used for dichotomous problems, to determine whether a tumor is benign or malignant.
The dataset contains the following characteristics (feature values):
Radius: The average distance from the tumor boundary to the tumor centroid.
Texture: The standard deviation of the gray, which measures the degree of gray change in an image.
Perimeter: The length of the perimeter of the tumor. Area: The size of the area of the tumor.
Smoothness: The local length change of the boundary.
Compactness: Square of perimeter divided by area minus 1.
Concavity: The severity of the concave part of the profile.
Concave points: The count of contour points within the depression.
Symmetry: The symmetry of the tumor.
Fractal dimension: A fractal dimension that uses “coastline estimation”.
The decision-making is divided into two types: benign tumors and breast cancer. Benign tumors are evaluated as “0”, and breast cancer tumors are evaluated as “1”. The mean, standard error, and worst values of the above ten features are included in the dataset, resulting in a total of 30 features in the dataset, there are 569 samples for each feature, including 212 samples for malignant tumors and 357 samples for benign tumors. In order to better advance the experimental process, outliers are removed from this dataset and mean interpolation operations are performed on the missing values. After the set-aside method, 96 malignant tumors and 106 benign tumors are selected as test samples, and the remaining samples are used for training.
Analysis of the importance of characteristics to the results
The purpose of feature importance analysis is to improve the applicability of the data [40], and XGBoost is an efficient algorithm commonly used for classification and regression, therefore, the XGBoost is used in this paper as a tool for importance analysis. When performing importance analysis, the importance of the input features to the results is important, which can be performed by calculating feature importance methods: XGBoost provides a way to directly view feature importance, by calling the properties, the importance score of each feature to the prediction can be obtained. This can help quickly understand which features exert the most significant on the prediction. The feature importance analysis is shown in Figure 6.

The degree to which the feature affects the outcome.
Since the influence degree of some features on the results is negligible compared with the features with a high degree of influence, only the first eight features are selected and placed in the above table, and the first two features with the greatest influence are opted as the premise attributes, which greatly reduces the complexity of the model.
(1) Settings of optimization parameters
Optimizing parameter settings holds importance in enhancing algorithm or model performance, boosting efficiency, mitigating overfitting or underfitting, accommodating diverse datasets, conserving resources, and gaining insights into algorithm behavior. By adjusting the parameters, computer algorithms and machine learning models can perform better in real-world applications.
In this paper, a round of experiments is randomly selected, and the parameters in the experiment are presented in Table 7, in Table 7, the number of rules generated is 31, and the reference interval, rule reliability, rule weight, and output belief distribution (malignant versus benign tumors) are described in detail for each rule.
Parameter settings.
Parameter settings.
It can be seen from the table that the reference interval setting of premise attribute 1 of IBRB-e ranges from 50 to 252, with a total of 12 reference values; the reference interval setting of premise attribute 2 ranges from
(2) Rules for the diagnosis of breast cancer diseases are established
After determining the indicators needed for diagnosis and the feature importance of the diagnostic indicators, to avoid the problem of too many input attributes of BRB causing the explosion of combination rules, the IBRB-e model is established, at the same time, based on the existing expert knowledge, the reference values for the premise attributes are selected.
(1) Evaluation indicators
Case analysis usually requires appropriate evaluation indicators to be more reliable, so this paper uses the following indicators to evaluate the case:
(1) Accuracy: This reflects how many of the disease samples are correctly diagnosed. The calculation equation is shown below:
(2) Precision: Precision reflects how many samples of a certain type are correctly diagnosed when they have been diagnosed. The calculation equation is shown below:
(3) Recall: Recall reflects the number of the initial actual samples correctly diagnosed. The calculation equation is shown below:
where TP is the True positives (predicted to be positive case and actually positive case), TN is the True Negatives (predicted negative case and actually negative case), FP is the False Positives (predicted positive case and actually negative case), FN is the False Negatives (predicted negative case and actually positive case). In the experiment, positive cases are benign tumors and negative cases are malignant tumors.
(2) Model output
After the preparatory work is completed, the two selected features can be obtained, namely, the maximum circumference of the tumor and the average concave point. Figures 7 and 8 show the data distribution of perimeter worst and concave point worst in benign and malignant tumors respectively

Data distribution with the worst perimeter.

Data distribution with the mean concave point.

The two most important feature directions.
The distribution of experimental samples for these two features is shown in Figure 9, with blue indicating benign and red indicating malignant. The selected premise attributes are input into the IBRB-e model, and the corresponding fitting results are shown in Figure 10. The accuracy rate of the model can reach 99.5%. As seen from the above figure, the IBRB-e has a good fitting effect for the diagnosis of breast cancer.
(3) Robustness of IBRB-e
To evaluate the robustness of the model, different proportions of samples are randomly selected from 212 malignant tumor samples and 357 benign tumor samples each time for the test set, and the rest for the training set. The accuracy of IBRB-e and BRB under the ratio of training set to test set is 2:8, 3:7, 4:6, 5:5, 6:4, 7:3, 8:2, respectively. The evaluation results are shown in the Table 8.
Accuracy of IBRB-e and BRB under different data ratios.

Result fit plot.
As can be seen from the Table 8, the accuracy rate fluctuated with different sample sizes, but remained generally high and much higher than traditional BRB. It indicates that IBRB-e has certain robustness and maintains good performance when the data sample size changes.
To sum up, IBRB-e performs well in this comparative experiment in terms of average accuracy, highest accuracy, lowest accuracy and robustness. The IBRB-e’s superior capabilities in accuracy, stability and reliability make it a very favorable choice for solving specific problems.
(4) the efficiency of IBRB-e
After proving the robustness of IBRB-e, the efficiency of the model needs to be further verified. The model generates rules by adding intervals, which is much more convenient than the traditional brb by Cartesian product, so the running time is shorter.

Operation efficiency comparison.
Then, an experiment is set up. In order to facilitate recording, the number of reference values of the two premise attributes of the control IBRB-e and BRB model is the same, and then the regular number generated by IBRB-e and BRB is recorded when the number of reference values is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 respectively. As can be seen in subgraph a) of Figure 11, when the number of reference values of premise attribute 1 and premise attribute 2 are both 10, the number of BRB rules has reached 100, which increases the complexity of the model.
Further, the running time of IBRB-e and BRB is also shown in subgraph b) of Figure 11. When the number of reference values of the premise attribute increases continuously (here, the number of reference values of the two premise attributes is also the same), the running time of IBRB-e does not significantly slow down. However, when the number of reference values of the premise attribute 1 and premise attribute 2 are both 10, the running time of IBRB-e does not significantly slow down. The BRB running time has reached 98 seconds, which greatly affects the efficiency of the model, so the optimization effect of IBRB-e in terms of simplification of rules and running time is very obvious.
(5) Belief distribution
Expert knowledge is the key to optimization. IBRB-e uses the improved P-CMA-ES optimization algorithm, and its optimization process involves the gradual absorption of expert knowledge. In contrast, IBRB uses the P-CMA-ES optimization algorithm without adding explainability constraints in the optimization process, and lacks the ability to absorb expert knowledge. Resulting in an overemphasis on accuracy and a decline in explainability. Figure 12 illustrates a comparison of the belief distribution between expert knowledge, IBRB and IBRB-e.
When the optimized belief is close to the expert knowledge, more key features of the expert knowledge are retained. As can be seen from the figure, the IBRB-e model shows better agreement with expert knowledge and effectively describes the actual benign and malignant tumors of breast cancer. The model retains more features of expert knowledge in the optimization process, thus enhancing explainability, on the contrary, IBRB makes many rules that are inconsistent with expert knowledge, resulting in a loss of explainability.

Belief distribution of IBRB-e, IBRB, and expert knowledge.
In addition, in assisted decision problems, there is an intermediate transition phase between different decision classes. At this stage, the attribute similarity between two similar categories is very high, resulting in local ignorance information, which may lead to the objective fact error of the model’s decision. Therefore, in order to effectively represent ignorance information, an intermediate state is introduced, and the belief rule is assigned to three states, namely M (malignant), Mid, and B (Benign). This approach better captures local ignorance and enhances representation.
(1) The comparison accuracy

Comparative experiment fit plot.
Accuracy stability analysis of different algorithms.
Comparison of recall and precision.
In the comparative experiment shown in Table 9 and Figure 13, various machine learning algorithms, including IBRB-e, BRB, decision trees, SVM, KNN, and BP, are assessed for the performance in a specific task. Analyzing the experimental outcomes allows the remarkable performance of the IBRB-e model to stand out:
Highest Accuracy: Exceptional performance is demonstrated by the IBRB-e, achieving the highest accuracy of 99.5%, surpassing the highest accuracy achieved by other models. This underscores the capability of IBRB-e to achieve a significantly high classification accuracy for this task. Average Accuracy: An impressive average accuracy of 98.01% is achieved by IBRB-e, which far exceeded the average accuracy of the other models. The robustness and reliability of IBRB-e are well proven. Lowest Accuracy: Even in terms of the lowest accuracy, the performance of IBRB-e remained outstanding, reaching 96.53%, which is notably higher than the lowest accuracy of other models. This underscores the resilience of IBRB-e, maintaining elevated accuracy even in the face of challenging data instances.
(2) The comparison of precision and recall
In order to evaluate the model more comprehensively. The precision and recall metrics are introduced. The recall rate measures the model’s ability to correctly identify positive cases and therefore provides information about the proportion of cases the model successfully identifies out of all positive cases detected. The experiment is still conducted for ten rounds and two methods, logistic regression as well as artificial neural networks, are added and the results of the experimental evaluation are shown in the Table 10.
As can be seen from the table, IBRB-e has a better performance in Precision as well as Recall, with an average Precision of 99.05% and an average Recall of 97.71%, which indicates that IBRB-e has a more prominent advantage in diagnosing malignant tumors. In addition, Logistic Regression Linear Discriminant Analysis and Artificial Neural Network also have more stable performance Back Propagation and Support Vector Machine have larger differences in Precision and Recall, which may be caused by their poorer effect of predicting malignant tumors and better effect of predicting benign tumors instead. Decision Tree and K Nearest Neighbors perform very generally and do not predict malignancy very accurately.
To verify the scalability of the IBRB-e disease diagnosis method, 10 rounds of experiments are conducted using three publicly available datasets. The three data sets are Obesity risk, Heart Failure prediction, and Diabetes prediction. The experimental results show that IBRB-e performs well on these datasets. Details of the complete experiment can be found in Table 11 of this paper.
Reference value intervals for different disease decisions.
Reference value intervals for different disease decisions.

Extended experimental results of IBRB-e.
Where
Average accuracy of IBRB-e extended experiment.
From Figure 14 and Table 12, it can be seen that the IBRB-e medical decision-making method consistently maintains an accuracy of over 88.75% in scalability experiments. Particularly noteworthy is its robust performance when dealing with the obesity risk dataset, and the average accuracy is 95.7142% for the obesity risk dataset, 91.7417% for the heart failure dataset and 94.0667% for the diabetes dataset, which is closely related to the excellent explainability of the IBRB-e medical decision-making method. Through extended experiments, it can be demonstrated that IBRB-e has potential application value, providing effective clinical decision support for medical decision-makers.
To address the problems of BRB in the field of medical assisted decision making, this paper applies the IBRB-e model, which presents the reference values in the form of intervals, and builds the belief rule table by adding the intervals, thus effectively solving the dilemma of combinatorial rule explosion. In addition, while using ER rules as the reasoning machine, the reliability of the rules is considered and the model is optimized using the P-CMA-ES optimization algorithm with explainability constraints, which improves the accuracy and explainability of the model.
However, the applicability of IBRB-e is limited in the case of uneven sample distribution. In this case, an unreasonable activation of rules may occur. And if there are fewer premise attributes and reference values, then the advantage of the IBRB-e is not obvious. Future work can solve these two shortcomings and refine the model to make it more suitable for real-world clinical applications and provide more reliable and transparent decision support for medical decisions.
Footnotes
Funding statement
This work was supported in part by the Postdoctoral Science Foundation of China under Grant No. 2020M683736, in part by the Teaching reform project of higher education in Heilongjiang Province under Grant No. SJGY20210456, in part by the Natural Science Foundation of Heilongjiang Province of China under Grant No. LH2021F038, in part by the Social Science Foundation of Heilongjiang Province of China under Grant No. 21GLC189, in part by the Foreign Expert Projects in Heilongjiang under Grant No. GZ20220131, in part by the Graduate innovation project of Harbin Normal University under Grant No.HSDSSCX2023-3.
Competing interests
The authors declare no competing interests.
Data availability
The datasets generated and analysed during the current study are available on the following website:
