Applying SVR-PPSO,SVR-SSO,and SVR-BBO to estimate california bearing capacity of stabilized pond ash using admixtures

Abstract

In the present study, three hybrid models include support vector regression-salp swarm optimization (SVR-SSO), support vector regression-biogeography-based (SVR-BBO), and support vector regression-phasor particle swarm optimization (SVR- PPSO) was applied to forecast pond ash’s CBR value modified with lime sludge (LS) and lime (LI). In the developed models, five variables were selected as inputs. It can result that the developed integrated models have R2 bigger than 0.9952. It means the agreement between observed and forecasted values by hybrid models is mainly similar to represent the highest accuracy. In both the training and testing stages, PSO-SVR results from better performance than the BBO-SVR model, with R2, RMSE, MAE, and PI equal to 0.9983, 0.6439, 0.3181, and 0.0081 for training data, and 0.9975, 0.7319, 0.4135, and 0.0141 for testing data, respectively. So, by considering the OBJ index, the OBJ value for PSO-SVR is 12.966, lower than BBO-SVR at 16.9957. Therefore, the PSO-SVR model outperforms another model to estimate the CBR of pond ash modified with LI and LS, consequently being recognized as the proposed model that makes it to be used for practical applications.

Keywords

California bearing ratio phasor particle swarm optimization biogeography-based optimization salp swarm optimization support vector regression

1 Introduction

Thermal power plants produce various ashes, such as fly ash and pond ash [1, 2]. Bottom ash generated from the boiler and fly ash produced from the electrostatic precipitators is blended with water and produced slurry and disposed of in ash ponds [3]. Fly ash type C has a large calcium content level and could react with water [4]. Other ashes are class F with low calcium and great thermal power plant [5]. This type of ash does not have sufficient strength to be used as an independent construction component. This ash’s deficiency could be improved by other cementitious materials like lime and cement, which were evaluated by several studies [6 –9]. The production of acetylene, sugar, and water softening plants is lime sludge as another material. Annually, about 4.5 million tons of lime sludge is produced in India [10], causes disposal, and alters into a worrying environmental matter [11 –13]. Due to this material containing calcium carbonate [14], it can be used in several construction fields, like a substitution of cement [10], in the road [15], and in construction materials [16].

California bearing ratio (CBR) is a superior parameter to appraise the soil subgrade, either flexible or rigid pavements [17]. CBR test is performed on compacted samples of soil in the lab and is performed on the ground [18]. Besides CBR being the time needed, the test conclusion can be untrustable due to the specimen disturbance and restrictions of testing circumstances. Thus, artificial-based techniques can be efficient in the various forecasting procedures [19], particularly CBR [20].

Pavement subgrade soils were modified by different waste marble powders: 1) calcite marble and 2) dolomitic marble powder. California bearing ratio was provided to compute the improvement in capacity. According to the comparison based on the values of CBR, the inclusion of an appropriate amount and kind of marble powder positively affected the bearing capacity of the subgrade soil with no regard to the curing time [21, 22].

Regarding artificial intelligence methods, the neural network (NN) is a human brain simulation and was extensively agreed in various branches to illustrate the connections between inputs and output. Thus, the neural network is an adequately accurate solution for predicting the most geotechnical engineering field. The neural network has been effectively applied in different branches of geotechnical engineering, such as the CS and the frozen sand of Young’s modulus [23, 24], bearing capacity of the pile [25, 26], stability of slope [27 –29], and underground structures [30, 31].

In spite of several NN applications being created in geotechnical engineering, research for predicting the CBR is low. Single and multiple analyses were employed for forecasting the CBR value by a dataset collected from highways of Turkey located in different regions. The conclusion presents that the NN has appropriate results compared to statistical models [32]. The neural network and multiple regression methods were used for forecasting the CBR value of modified soil with admixtures such as lime and quarry [33]. Both proposed models predicted the CBR precisely, while the neural network outperforms multiple regression. Two modeling methods, named random forest and M5P-Tree models, are applied for modeling the soaked CBR value of pond ash. Pond ash was stabilized with the help of additives like lime and lime sludge. Standard statistical parameters estimated the performance of models. Despite both of the model’s performance for predicting CBR is good enough, it is explicit which the random forest technique outperforms the M5P model [34].

For dealing with real-world challenges related to HPC, machine learning techniques have proven to be effective. Noteworthy methods include the artificial neural network (ANN), gene expression programming (GEP), support vector machine (SVM), multi-layer perceptron neural network (MLP), and the multigroup approach for data management. These techniques have been utilized to predict the desired output data in the context of HPC. In recent years, machine learning approaches have demonstrated significant promise in predicting concrete strength. Support Vector Regression (SVR) [35] has risen as an efficient and robust method among these techniques for modeling and predicting complex nonlinear relationships. The successful application of SVR in various fields can be attributed to its capability to effectively deal with nonlinearity, reduce the risk of overfitting, and handle high-dimensional data [36].

Despite SVR demonstrating promising results, the optimization of its parameters is essential to further improve its predictive performance [35]. With this objective in mind, researchers have investigated the incorporation of diverse optimization algorithms to enhance the accuracy and convergence speed of the SVR model. In this paper, biogeography-based optimizer (BBO), phasor particle swarm optimization (PPSO), and salp swarm optimization (SSO) have emerged as effective optimizers in different domains.

Lately, the application of geosynthetic stabilized soil has become popular to construct tolerable pavement structures. The CBR evaluates its resistance. The study’s main aim was to figure out and assess the competency of various techniques, for instance, ANN, M5 model trees, and random forests, to estimate the CBR of modified soil [37].

In the present study, two hybrids, biogeography-based support vector regression (BBO-SVR) and particle swarm optimization-support vector regression (PSO-SVR), were applied to forecast pond ash’s CBR value. To aim goal, the developed models include five inputs named MDD, OMC, L, LS, and CP, and CBR as a target parameter. So as to appraise the precision of the proposed models, five performance criteria (R², RMSE, MAE, PI, and OBJ) were considered.

2 Methodologies

2.1 Dataset explanation

To train the proposed models for predicting the CBR value, a dataset comprising 109 experimental measurements was collected from published documents (refer to Table 1) [38]. Eight different influencing variables to the value of output were taken into account as input parameters. The statistical explanation of the dependent and independent parameters utilized for proposing the integrated models is supplied in Table 2. Fig. 1 presents a multi-Y scatter plot illustrating the connection between the input and output variables, visually depicting their interrelation.

Table 1
Mix designs and CBR results

Number LL PL PI MDD OMC (%) SDA (%) QD (%) OPC (%) CBR (%)

1 31.9 25.3 6.6 1.506 22 12 20 5 43.5

2 44.1 26.8 17.3 1.38 29.5 4 0 2 21.61

3 31.2 24.1 7.1 1.438 25.4 20 10 5 48.36

4 38.9 30 8.9 1.412 26.5 12 10 2 34.85

5 33.5 25.5 8 1.777 24 20 0 2 55.35

6 23.1 17.9 5.2 1.437 25.2 10 0 5 48.25

7 33.1 22.4 10.7 1.495 23.2 4 10 5 34.48

8 38.9 30 8.9 1.412 26.5 12 10 2 36.23

9 33.2 30.3 2.9 1.501 24.7 10 10 5 55.31

10 39.2 30 9.2 1.695 25.5 0 0 8 31.63

11 39.1 30.9 8.2 1.581 22.1 4 20 8 46.23

12 38.2 31.1 7.1 1.438 25.4 20 10 5 42.09

13 41.6 24.7 16.9 1.555 22.1 0 10 5 24.17

14 37.8 26.6 11.2 1.553 23 8 20 2 38.78

15 36.7 27.1 9.6 1.434 26 12 0 8 31.19

16 45.7 37.2 8.5 1.469 25.5 20 20 5 53.78

17 36.2 27.8 8.4 1.461 24 16 10 8 36.37

18 29.2 21.5 7.7 1.512 23.5 20 20 2 40.05

19 33.4 30.4 3.0 1.521 23.6 10 10 5 60.23

20 45.7 37.2 8.5 1.469 25.5 20 20 5 38.36

21 23.1 17.9 5.2 1.437 25.2 10 0 5 44.05

22 33.2 30.3 2.9 1.501 24.7 10 10 5 62.08

23 32.5 24.6 7.9 1.555 22.32 10 10 8 42.55

24 40.6 24.3 16.3 1.528 24 0 10 5 30.58

25 31.9 25.3 6.6 1.418 24.5 16 0 5 43.61

26 39.1 30.9 8.2 1.581 22.1 4 20 8 43.42

27 29.9 24.5 5.4 1.555 22.32 10 20 5 37.64

28 35.3 26.6 8.7 1.527 23 16 20 2 38.22

29 40.3 36.1 4.2 1.401 27.5 20 0 8 51.79

30 42.5 30.3 12.2 1.365 28.76 0 20 8 52.29

31 35.3 26.6 8.7 1.527 23 16 20 2 35.81

32 23.6 21.3 2.3 1.55 20 20 0 8 62.18

33 44.1 26.8 17.3 1.38 29.5 4 0 2 19.7

34 39.1 22.5 16.6 1.402 27.32 0 20 2 23.26

35 36.7 27.1 9.6 1.434 26 12 0 8 31.95

36 42.5 30.3 12.2 1.365 28.76 0 20 8 36.59

37 42.1 29.6 12.5 1.43 20.4 8 0 5 27.41

38 21.2 19.1 2.1 1.508 23.3 20 20 8 46.99

39 45.7 37.2 8.5 1.469 25.5 20 20 5 32.43

40 29.2 21.5 7.7 1.512 23.5 20 20 2 42.77

41 33.2 25.1 8.1 1.388 26.6 20 10 2 34.69

42 37.6 25.7 11.9 1.554 22 10 10 2 35.24

43 37.9 28.7 9.2 1.418 24.5 16 0 5 36.37

44 40.6 24.3 16.3 1.528 24 0 10 5 26.22

45 23.1 17.9 5.2 1.437 25.2 10 0 5 49.73

46 52.1 32.6 19.5 1.422 26.5 0 0 2 23.78

47 33.5 25.5 8 1.777 24 20 0 2 54.18

48 37.8 26.6 11.2 1.553 23 8 20 2 36.55

49 33.9 24.6 9.3 1.526 22.4 8 10 8 26.76

50 38.9 27.3 11.6 1.53 18.9 0 20 8 32.3

51 41.6 24.7 16.9 1.555 22.1 0 10 5 23.14

52 38.9 27.3 11.6 1.53 18.9 0 20 8 34.03

53 52.1 32.6 19.5 1.422 26.5 0 0 2 27.57

54 37.8 26.6 11.2 1.553 23 8 20 2 33.93

55 39.1 30.9 8.2 1.581 22.1 4 20 8 45.95

56 40.3 36.1 4.2 1.401 27.5 20 0 8 38.05

57 36.2 27.8 8.4 1.461 24 16 10 8 38.31

58 23.6 21.3 2.3 1.55 20 20 0 8 65.29

59 37.9 28.7 9.2 1.418 24.5 16 0 5 38.31

60 33.2 25.1 8.1 1.388 26.6 20 10 2 38.93

61 39.1 30.9 8.2 1.581 22.1 4 20 8 45.43

62 33.1 22.4 10.7 1.495 23.2 4 10 5 28.6

63 37.6 25.7 11.9 1.554 22 10 10 2 38.92

64 23.6 21.3 2.3 1.55 20 20 0 8 54.2

65 40.6 25.3 15.3 1.528 24 0 10 5 24.35

66 39.1 22.5 16.6 1.402 27.32 0 20 2 23.49

67 38.9 27.3 11.6 1.53 18.9 0 20 8 33.03

68 23.1 17.9 5.2 1.437 25.2 10 0 5 57.07

69 31.2 24.1 7.1 1.438 25.4 20 10 5 38.12

70 36.2 27.8 8.4 1.461 24 16 10 8 38.23

71 40.3 36.1 4.2 1.401 27.5 20 0 8 48.55

72 29.9 24.5 5.4 1.555 22.32 10 20 5 42.58

73 37.6 25.7 11.9 1.554 22 10 10 2 30.01

74 37.8 26.6 11.2 1.553 23 8 20 2 34.35

75 44.1 26.8 17.3 1.38 29.5 4 0 2 23.7

76 31.2 24.1 7.1 1.438 25.4 20 10 5 49.43

77 39.2 30 9.2 1.695 25.5 0 0 8 29.72

78 37.9 28.7 9.2 1.418 24.5 16 0 5 38.23

79 38.9 30 8.9 1.412 26.5 12 10 2 34.89

80 31.9 25.3 6.6 1.506 22 12 20 5 44.39

81 29.2 21.5 7.7 1.512 23.5 20 20 2 45

82 33.1 22.4 10.7 1.495 23.2 4 10 5 32.58

83 35.3 26.6 8.7 1.527 23 16 20 2 35.73

84 33.2 25.1 8.1 1.388 26.6 20 10 2 39.85

85 37.6 25.7 11.9 1.554 22 10 10 2 42.83

86 33.9 24.6 9.3 1.526 22.4 8 10 8 19.69

87 42.5 30.3 12.2 1.365 28.76 0 20 8 57.58

88 39.1 22.5 16.6 1.402 27.32 0 20 2 25.03

89 42.1 29.6 12.5 1.43 20.4 8 0 5 30.17

90 21.2 19.1 2.1 1.508 23.3 20 20 8 57.95

91 29.9 24.5 5.4 1.555 22.32 10 20 5 42.27

92 33.5 25.5 8 1.777 24 20 0 2 41.66

93 40.3 36.1 4.2 1.401 27.5 20 0 8 55.93

94 33.2 30.3 2.9 1.501 24.7 10 10 5 62.68

95 21.2 19.1 2.1 1.508 23.3 20 20 8 66.75

96 31.9 25.3 6.6 1.506 22 12 20 5 49.22

97 42.1 29.6 12.5 1.43 20.4 8 0 5 36.32

98 35.3 26.6 8.7 1.527 23 16 20 2 39.59

99 36.7 27.1 9.6 1.434 26 12 0 8 35.1

100 42.5 30.3 12.2 1.365 28.76 0 20 8 47.43

101 39.1 22.5 16.6 1.402 27.32 0 20 2 24.29

102 31.9 25.3 6.6 1.506 22 12 20 5 48.2

103 32.5 24.6 7.9 1.555 22.32 10 10 8 41.95

104 29.9 24.5 5.4 1.555 22.32 10 20 5 49.63

105 33.2 30.3 2.9 1.521 23.6 10 10 5 57.87

106 33.1 22.4 10.7 1.495 23.2 4 10 5 33.3

107 42.1 29.6 12.5 1.43 20.4 8 0 5 36.3

108 33.5 25.5 8 1.777 24 20 0 2 44.34

109 38.9 30 8.9 1.412 26.5 12 10 2 33.54

Number	LL	PL	PI	MDD	OMC (%)	SDA (%)	QD (%)	OPC (%)	CBR (%)
1	31.9	25.3	6.6	1.506	22	12	20	5	43.5
2	44.1	26.8	17.3	1.38	29.5	4	0	2	21.61
3	31.2	24.1	7.1	1.438	25.4	20	10	5	48.36
4	38.9	30	8.9	1.412	26.5	12	10	2	34.85
5	33.5	25.5	8	1.777	24	20	0	2	55.35
6	23.1	17.9	5.2	1.437	25.2	10	0	5	48.25
7	33.1	22.4	10.7	1.495	23.2	4	10	5	34.48
8	38.9	30	8.9	1.412	26.5	12	10	2	36.23
9	33.2	30.3	2.9	1.501	24.7	10	10	5	55.31
10	39.2	30	9.2	1.695	25.5	0	0	8	31.63
11	39.1	30.9	8.2	1.581	22.1	4	20	8	46.23
12	38.2	31.1	7.1	1.438	25.4	20	10	5	42.09
13	41.6	24.7	16.9	1.555	22.1	0	10	5	24.17
14	37.8	26.6	11.2	1.553	23	8	20	2	38.78
15	36.7	27.1	9.6	1.434	26	12	0	8	31.19
16	45.7	37.2	8.5	1.469	25.5	20	20	5	53.78
17	36.2	27.8	8.4	1.461	24	16	10	8	36.37
18	29.2	21.5	7.7	1.512	23.5	20	20	2	40.05
19	33.4	30.4	3.0	1.521	23.6	10	10	5	60.23
20	45.7	37.2	8.5	1.469	25.5	20	20	5	38.36
21	23.1	17.9	5.2	1.437	25.2	10	0	5	44.05
22	33.2	30.3	2.9	1.501	24.7	10	10	5	62.08
23	32.5	24.6	7.9	1.555	22.32	10	10	8	42.55
24	40.6	24.3	16.3	1.528	24	0	10	5	30.58
25	31.9	25.3	6.6	1.418	24.5	16	0	5	43.61
26	39.1	30.9	8.2	1.581	22.1	4	20	8	43.42
27	29.9	24.5	5.4	1.555	22.32	10	20	5	37.64
28	35.3	26.6	8.7	1.527	23	16	20	2	38.22
29	40.3	36.1	4.2	1.401	27.5	20	0	8	51.79
30	42.5	30.3	12.2	1.365	28.76	0	20	8	52.29
31	35.3	26.6	8.7	1.527	23	16	20	2	35.81
32	23.6	21.3	2.3	1.55	20	20	0	8	62.18
33	44.1	26.8	17.3	1.38	29.5	4	0	2	19.7
34	39.1	22.5	16.6	1.402	27.32	0	20	2	23.26
35	36.7	27.1	9.6	1.434	26	12	0	8	31.95
36	42.5	30.3	12.2	1.365	28.76	0	20	8	36.59
37	42.1	29.6	12.5	1.43	20.4	8	0	5	27.41
38	21.2	19.1	2.1	1.508	23.3	20	20	8	46.99
39	45.7	37.2	8.5	1.469	25.5	20	20	5	32.43
40	29.2	21.5	7.7	1.512	23.5	20	20	2	42.77
41	33.2	25.1	8.1	1.388	26.6	20	10	2	34.69
42	37.6	25.7	11.9	1.554	22	10	10	2	35.24
43	37.9	28.7	9.2	1.418	24.5	16	0	5	36.37
44	40.6	24.3	16.3	1.528	24	0	10	5	26.22
45	23.1	17.9	5.2	1.437	25.2	10	0	5	49.73
46	52.1	32.6	19.5	1.422	26.5	0	0	2	23.78
47	33.5	25.5	8	1.777	24	20	0	2	54.18
48	37.8	26.6	11.2	1.553	23	8	20	2	36.55
49	33.9	24.6	9.3	1.526	22.4	8	10	8	26.76
50	38.9	27.3	11.6	1.53	18.9	0	20	8	32.3
51	41.6	24.7	16.9	1.555	22.1	0	10	5	23.14
52	38.9	27.3	11.6	1.53	18.9	0	20	8	34.03
53	52.1	32.6	19.5	1.422	26.5	0	0	2	27.57
54	37.8	26.6	11.2	1.553	23	8	20	2	33.93
55	39.1	30.9	8.2	1.581	22.1	4	20	8	45.95
56	40.3	36.1	4.2	1.401	27.5	20	0	8	38.05
57	36.2	27.8	8.4	1.461	24	16	10	8	38.31
58	23.6	21.3	2.3	1.55	20	20	0	8	65.29
59	37.9	28.7	9.2	1.418	24.5	16	0	5	38.31
60	33.2	25.1	8.1	1.388	26.6	20	10	2	38.93
61	39.1	30.9	8.2	1.581	22.1	4	20	8	45.43
62	33.1	22.4	10.7	1.495	23.2	4	10	5	28.6
63	37.6	25.7	11.9	1.554	22	10	10	2	38.92
64	23.6	21.3	2.3	1.55	20	20	0	8	54.2
65	40.6	25.3	15.3	1.528	24	0	10	5	24.35
66	39.1	22.5	16.6	1.402	27.32	0	20	2	23.49
67	38.9	27.3	11.6	1.53	18.9	0	20	8	33.03
68	23.1	17.9	5.2	1.437	25.2	10	0	5	57.07
69	31.2	24.1	7.1	1.438	25.4	20	10	5	38.12
70	36.2	27.8	8.4	1.461	24	16	10	8	38.23
71	40.3	36.1	4.2	1.401	27.5	20	0	8	48.55
72	29.9	24.5	5.4	1.555	22.32	10	20	5	42.58
73	37.6	25.7	11.9	1.554	22	10	10	2	30.01
74	37.8	26.6	11.2	1.553	23	8	20	2	34.35
75	44.1	26.8	17.3	1.38	29.5	4	0	2	23.7
76	31.2	24.1	7.1	1.438	25.4	20	10	5	49.43
77	39.2	30	9.2	1.695	25.5	0	0	8	29.72
78	37.9	28.7	9.2	1.418	24.5	16	0	5	38.23
79	38.9	30	8.9	1.412	26.5	12	10	2	34.89
80	31.9	25.3	6.6	1.506	22	12	20	5	44.39
81	29.2	21.5	7.7	1.512	23.5	20	20	2	45
82	33.1	22.4	10.7	1.495	23.2	4	10	5	32.58
83	35.3	26.6	8.7	1.527	23	16	20	2	35.73
84	33.2	25.1	8.1	1.388	26.6	20	10	2	39.85
85	37.6	25.7	11.9	1.554	22	10	10	2	42.83
86	33.9	24.6	9.3	1.526	22.4	8	10	8	19.69
87	42.5	30.3	12.2	1.365	28.76	0	20	8	57.58
88	39.1	22.5	16.6	1.402	27.32	0	20	2	25.03
89	42.1	29.6	12.5	1.43	20.4	8	0	5	30.17
90	21.2	19.1	2.1	1.508	23.3	20	20	8	57.95
91	29.9	24.5	5.4	1.555	22.32	10	20	5	42.27
92	33.5	25.5	8	1.777	24	20	0	2	41.66
93	40.3	36.1	4.2	1.401	27.5	20	0	8	55.93
94	33.2	30.3	2.9	1.501	24.7	10	10	5	62.68
95	21.2	19.1	2.1	1.508	23.3	20	20	8	66.75
96	31.9	25.3	6.6	1.506	22	12	20	5	49.22
97	42.1	29.6	12.5	1.43	20.4	8	0	5	36.32
98	35.3	26.6	8.7	1.527	23	16	20	2	39.59
99	36.7	27.1	9.6	1.434	26	12	0	8	35.1
100	42.5	30.3	12.2	1.365	28.76	0	20	8	47.43
101	39.1	22.5	16.6	1.402	27.32	0	20	2	24.29
102	31.9	25.3	6.6	1.506	22	12	20	5	48.2
103	32.5	24.6	7.9	1.555	22.32	10	10	8	41.95
104	29.9	24.5	5.4	1.555	22.32	10	20	5	49.63
105	33.2	30.3	2.9	1.521	23.6	10	10	5	57.87
106	33.1	22.4	10.7	1.495	23.2	4	10	5	33.3
107	42.1	29.6	12.5	1.43	20.4	8	0	5	36.3
108	33.5	25.5	8	1.777	24	20	0	2	44.34
109	38.9	30	8.9	1.412	26.5	12	10	2	33.54

Liquid limit (LL), plastic limit (PL), Plasticity Index (PI), optimum moisture content (OMC) and maximum dry density (MDD), Soil Dispersibility Index (SDA), Quality Deficiency (QD), Ordinary Portland Cement (OPC), California Bearing Ratio (CBR).

Table 2

The statistical description of the dependent and independent variables

Data phase	Variable	Min	Max	Average	St. deviation
Train data	LL	21.20	52.10	36.20	6.47
	PL	17.90	37.20	26.82	4.52
	PI	2.10	19.50	9.39	4.31
	MDD	1.37	1.78	1.49	0.08
	OMC	18.90	29.50	24.20	2.47
	SDA (%)	0.00	20.00	10.55	7.42
	QD (%)	0.00	20.00	10.26	8.16
	OPC (%)	2.00	8.00	5.00	2.40
	CBR (%)	19.70	65.29	39.15	10.68
Test data	LL	21.20	42.50	35.03	5.35
	PL	19.10	36.10	26.38	3.73
	PI	2.10	16.60	8.65	3.65
	MDD	1.37	1.78	1.50	0.10
	OMC	20.40	28.76	24.02	2.35
	SDA (%)	0.00	20.00	10.91	6.60
	QD (%)	0.00	20.00	11.52	8.34
	OPC (%)	2.00	8.00	4.82	2.36

Fig. 1

The scatter of input and output variables.

2.2 Support Vector Regression (SVR)

Support vector machines are regression and classification models derived from statistical learning theory. The Support vector machines-based classification methods are based on the generality of the best separation of classes. If the classes are separable –this method selects, from among the unlimited number of linear classifiers, the one that minimizes the generalization error, or at least an upper bound on this error, derived from structural risk minimization. SVR has additive advantages when compared to other regression methods [39, 40]. Considering a training dataset { (x_i, y_i) , I = 1, … n }, where n is the size of the training dataset, (x_i) is the input vector (y_i) is the output vector, respectively.

$f (x) = 〈 m, x 〉 + d$ (1) where mɛR^N and d ɛ R . 〈m, x〉 represents the dot product in space R^N Vector m in Equation (1) specifies the orientation of a discriminant plane, while scalar d determines the distance between the discriminant plane and the origin.

Minimizing the Euclidean norm leads to a smaller value of m, indicating the flatter nature of Equation (1) defined by ∥m ∥ ². Thus, an optimization problem for regression can be written as:

$minimize \frac{1}{2} ∥ m ∥^{2}$ (2)

$subject to {\begin{matrix} y_{i - 〈 m, x_{i} 〉 - d \leq ɛ} \\ 〈 m, x_{i} 〉 + d - y_{i \leq ɛ} \end{matrix}$ (3)

Equation (2) assumes that there exists an expansion that can generate an error smaller than ɛ for all training pairs. However, in real-world scenarios, there may be situations, such as those encountered in classification problems, that do not conform to this assumption.so that’s why, to allow some more error, slack variables ξ, ξ can be optimized and to handle the infeasible constraints of the optimization problem defined in Equation (3), the following approach can be employed [41].

$\begin{matrix} minimise \frac{1}{2} ∥ m ∥^{2} + G \sum_{i = 1}^{n} (ξ_{i +} ξ_{i}^{'}) \\ subject to y_{i -} 〈 m, x_{i} 〉 - d \leq ɛ + ξ_{i} \\ 〈 m, x_{i} 〉 + d - y_{i \leq} \leq ɛ + ξ_{i}^{'} \\ So ξ_{i}, ξ_{i}^{'} \geq 0 for all i = 1, 2, 3 \dots . n . \end{matrix}$ (4)

The constant G > 0 is a parameter of user-defined which determines the trade-off between function flatness. Equation (3) defines the objective primal function in the minimization problem [42]. By transforming the problem into a dual space representation using Lagrangian multipliers, the inequalities can be replaced with a simpler form, allowing the optimization problem to be solved more easily. Equation (5) the lagrangian can be formed by introducing positive multipliers $λ_{i}, λ_{i}^{'}, η_{i}, η_{i}^{'}, i = 1, 2, 3, 4, \dots, n$ and constraint equations multiplying by these multipliers, and finally subtracting the results from the objective function (i . e, ∥ m ∥ ²) in Equation (5). The lagrangian can be written as:

$\begin{matrix} P = & \frac{1}{2} ∥ m ∥^{2} + G \sum_{i = 1}^{n} (ξ_{i} + ξ_{i}^{'}) \\ - \sum_{i = 1}^{n} λ_{i} (ɛ + ξ_{i} - y_{i} + 〈 m, x_{i} 〉 + d) \end{matrix}$ (5)

$\begin{matrix} - \sum_{i = i}^{n} λ_{i}^{'} (ɛ + ξ_{i}^{'} + y_{i} - 〈 m, x_{i} 〉 - d) \\ - \sum_{i = 1}^{n} (η_{i} ξ_{i} + η_{i}^{'} ξ_{i}^{'}) \end{matrix}$ (6)

In Equation (6), the dual variables have to suffice $λ_{i}, λ_{i}^{'}, η_{i}, η_{i}^{'} \geq 0$ . The formalin of the optimization problem involved in the specification of SVR can be sourced in Equation (6) by locating the saddle point of the Lagrange function defined. Equation (6) saddle points can be obtained by partial equate derivative of P concerning ξ_i and $ξ_{i}^{'}, m, d$ , to 0 and get:

$\partial_{m} P = m - \sum_{i = 1}^{n} (λ_{i}^{'} - λ_{i}) . x_{i} = 0$ (7)

$\partial_{d} P = \sum_{i = 1}^{n} (λ_{i}^{'} - λ_{i}) = 0$ (8)

$\partial_{ξ_{i}} P = G - λ_{i} - η_{i} = 0$ (9)

$\partial_{ξ_{i}^{'}} P = G - η_{i}^{'} - λ_{i}^{'} = 0$ (10)

Results of substituting Equations (7)–(10) in Equation (6) in the optimization problem of maximizing:

$\begin{matrix} - \frac{1}{2} \sum_{i = 1}^{n} \sum_{j = 1}^{n} (λ_{i}^{'} - λ_{i}) (λ_{j}^{'} - λ_{j}) (x_{i} . x_{j}) \\ - ɛ \sum_{i = 1}^{n} (λ_{i}^{'} + λ_{i}) + \sum_{i = 1}^{n} y_{i} (λ_{i}^{'} - λ_{i}) \end{matrix}$ (11)

$subject to \sum_{i = 1}^{n} (λ_{i}^{'} - λ_{i}) = 0 and λ_{i}, λ_{i}^{'} \in [0, G]$ (12)

In Equations (9) and (10) dual variables $η_{i}, η_{i}^{'}$ are eliminated by using conditions and can now be written as $λ_{i}^{'} = G - η_{i}^{'} and λ_{i} = G - η_{i}$ , in Equation (13), the prediction problem can now be written as:

$f (X) = \sum_{i = 1}^{n} (λ_{i}^{'} - λ_{i}) 〈 x_{i}, x 〉 + d$ (13)

In Equation (7), the regression problem in feature space can be written by replacing x_j . x_i with Φ (x_j) . Φ (x_i). Thus, in Equation (11), the optimization problem can be written as:

$\begin{matrix} maximize - \frac{1}{2} \sum_{i = 1}^{n} \sum_{j = 1}^{n} (λ_{j}^{'} - λ_{j}) (λ_{i}^{'} - λ_{i}) \\ N (x_{j} . x_{i}) - ɛ \sum_{i = 1}^{n} (λ_{i} + λ_{i}^{'}) + \sum_{i = 1}^{n} y_{i} (λ_{i} - λ_{i}^{'}) \end{matrix}$ (14)

$subject to \sum_{i = 1}^{n} (λ_{i} - λ_{i}^{'}) = 0 and λ_{i}^{'}, λ_{i} \in [0, G]$ (15)

$N (x_{j}, x_{i}) \equiv Φ (x_{j}) . Φ (x_{i})$ (16)

In Equation (8), the function of SVR can now be written as:

$f (x) = \sum_{i = 1}^{n} (λ_{i} - λ_{i}^{'}) N (x, x_{i}) + d$ (17)

Finally, the kernel function. In this optimization problem, it is computed rather than Φ (x) to reduce the computational cost of dealing with the high-dimension feature space [43].

2.3 Phasor Particle Swarm Optimization (PPSO)

2.3.1 The parameter’s setting

By leveraging the advancements in PSO algorithms demonstrated in previous research, it becomes possible to regulate and guide a system or process by employing suitable control methods. To optimize a given problem successfully, the parameters of Particle Swarm Optimization (PSO) need to incorporate various strategies. The primary objective of this study is to improve the convergence capability during the optimization process by enhancing the effectiveness of optimization. To achieve these goals, the PPSO utilizes appropriate and efficient phasor angle functions to generate control parameters for PSO. Each particle is assigned an individual scalar phasor angle to implement a range of strategies effectively. Mathematical functions involving both cosine and sine are employed to model the PSO control parameters based on these phasor angles. As an illustration, it is noted that the ith particle is denoted by $\vec{X_{i}} ∠ θ_{i}$ , where $\vec{X_{i}}$ represents the magnitude vector and θ_i denotes the phasor angle.

Like the PSO - TVAC in [44] and a modern PSO - TVAC [45], the inertia weight values of PPSO is considered zero. The proposed model for particle movement in PPSO is outlined below. However, it is possible to enhance this method by incorporating concepts from other improved PSO algorithms.

$\begin{matrix} V_{i}^{it} = & p (θ_{i}^{it}) \times ({pbest}_{i}^{it} - x_{i}^{it}) + g (θ_{i}^{it}) \\ \times ({Gbest}_{i}^{it} - x_{i}^{it}) \end{matrix}$ (18)

The PPSO algorithm has chosen the following functions after analyzing a large number of $g (θ_{i}^{it})$ and $p (θ_{i}^{it})$ functions.

$p (θ_{i}^{it}) = {| \cos θ_{i}^{it} |}^{2 \times \sin θ_{i}^{it}}$ (19)

$g (θ_{i}^{it}) = {| \sin θ_{i}^{it} |}^{2 \times \cos θ_{i}^{it}}$ (20)

The functions proposed in this study, relying solely on the phasor angles of the particles, exhibit various behaviors, including value reversal, simultaneous increase or decrease of values, reaching large values, and attaining identical values. These behaviors contribute to adaptive search characteristics, balancing local and global searches. As a result, PPSO stands out as an adaptive and non-parametric algorithm capable of avoiding local optima and mitigating premature convergence, a drawback frequently observed in traditional PSO approaches.

2.3.2 Formulation of PSO

In each iteration of the algorithm, the velocity of individual particles is calculated using the following formula.

$\begin{matrix} V_{i}^{it} = & {| \cos θ_{i}^{it} |}^{2 \times \sin θ_{i}^{it}} \times ({pbest}_{i}^{it} - x_{i}^{it}) \\ + {| \sin θ_{i}^{it} |}^{2 \times \cos θ_{i}^{it}} \times ({Gbest}_{i}^{it} - x_{i}^{it}) \end{matrix}$ (21)

Subsequently, the particle’s position is updated utilizing the following equation:

${\vec{x}}_{i}^{it + 1} = {\vec{x}}_{i}^{it} + {\vec{V}}_{i}^{it}$ (22)

Following that, the determination of the Personal Best (P_best) and Global Best (G_best) positions follow a process similar to the conventional PSO algorithm. Afterward, the phasor angles and maximum velocities of the particles undergo an update according to the following procedure:

$\begin{matrix} θ_{i}^{it + 1} = & θ_{i}^{it} + T (θ) \times (2 π) = θ_{i}^{it} \\ + | \cos (θ_{i}^{it}) + \sin (θ_{i}^{it}) | \times (2 π) \end{matrix}$ (23)

$\begin{matrix} V_{i, \max}^{it + 1} = & W (θ) \times (X_{\max} - X_{\min}) \\ = {| \cos θ_{i}^{it} |}^{2} \times (X_{\max} - X_{\min}) \end{matrix}$ (24)

It is worth noting that Equations (18) to (20) and (23) to (24) are based on empirical formulas that were chosen after testing a vast array of functions. The number of functions assessed for this purpose was substantial, and it is not feasible to present all of them here. Figure 2 displays the flowchart of PPSO.

Fig. 2

Flowchart of PPSO.

2.4 Biogeography-Based Optimization (BBO)

This subsection presents an overview of the biogeography-based optimization algorithm. Imagine there is an optimization problem along with a set of candidate solutions. In this context, a favorable solution can be compared to a habitat with a high habitat suitability index (HSI), analogous to a geographical area well-suited for biological species in biogeography. Within the optimization problem, the HSI serves as a measure of the solution’s quality, often referred to as fitness or goodness of the habitat [46]. Conversely, a subpar solution resembles a habitat with a low HSI. High Habitat Suitability Index (HSI) in solutions indicates habitats that support a diverse array of species, whereas low HSI solutions signify habitats with fewer species. In essence, the number of species represented by a solution depends on its corresponding HSI. Solutions with high HSI are more likely to share their characteristics with other solutions, while solutions with low HSI are more inclined to accept shared attributes from other solutions. This novel approach to address general optimization problems is biogeography-based optimization (BBO). Like other evolutionary algorithms, BBO relies on two crucial steps: information sharing, achieved through migration in BBO, and mutation. Migration, as a probabilistic operator, enhances a habitat, denoted as H_i. The migration rates of each habitat are utilized to probabilistically share features among habitats. For every habitat H_i, its immigration rate, K_i, is employed to determine probabilistically whether it should undergo immigration [47]. If immigration is chosen, then the emigrating habitat, H_j, is also selected probabilistically based on the emigration rate, l_j. In this context, migration is defined as the process by which features are exchanged between habitats to improve their overall optimization performance.

$H_{i} (SIV) \leftarrow H_{j} (SIV)$ (25)

In the field of biogeography, an SIV, known as a Suitability Index Variable, plays a crucial role in assessing an island’s livability. Within the context of BBO an SIV represents a characteristic of a solution, analogous to a gene in GAs. Mutation serves as a probabilistic operator responsible for randomly altering an island’s SIV, relying on the pre-existing probability of species count in that habitat [48, 49]. Mutation serves the primary objective of promoting diversity within the population. In the case of low HSI solutions, mutation offers an opportunity to enhance the quality of these solutions. Conversely, for high HSI solutions, mutation can further improve them beyond their current state. The pseudocode for BBO′s migration operator is depicted in Algorithm 1.

Algorithm 1. Pseudo-code for the Migration Operator of biography-based optimization.
fori = 1 to Mdo
forj = 1 to bdo
ifrand < μ_i then
Utilize roulette wheel selection to choose a habitat H_c based on the emigration rate γ:
H_i (SIV_j) ← H_c (SIV_j)
end if
end for
end for

2.5 Salp Swarm Optimization (SSO)

SSO [50] is a novel optimization technique that addresses diverse optimization problems. This method emulates the movement of salps, barrel-shaped plankton belonging to the Salp family, found in the wild. Salps share similarities with jellyfish in terms of their tissue structure, and their movement is primarily driven by the expulsion and contraction of water within their gel-like bodies, which have a high water content. By imitating this natural behavior, the optimization method finds solutions efficiently for various optimization challenges [51, 52]. The marine salp exhibits a herding behavior known as the salp chain, which aids them in inefficient movement and feeding by adapting swiftly to changes in their environment. Drawing inspiration from this behavior, the algorithm has mathematically modeled and applied the Salp chain to test its effectiveness in solving an optimization problem.

SSO initiates by segregating the population into two categories: leaders and followers. The rest of the salps are referred to as followers, while the salp preceding them in the sequence assumes the role of the leader. The salp’s position is determined in an n-dimensional space, representing the search space of the problem, with n denoting the number of variables under consideration. These salps diligently search for food sources, which indicate the direction of the entire herd’s destination. To frequently update the location of the Salp leader, the following formula is employed for this purpose:

$y_{j}^{1} = {\begin{matrix} P_{j} + b_{1} ((u d_{j} - l d_{j}) b_{2} + l d_{j}) b_{3} \leq 0 \\ P_{j} + b_{1} ((u d_{j} - l d_{j}) b_{2} - l d_{j}) b_{3} > 0 \end{matrix}$ (26)

Here $y_{j}^{1}$ is the leader’s position in the j^th dimension, the food source in that dimension is P_j, and the lower and upper bounds are ld and ud_j, respectively. b₂ and b₃ are randomly generated in the range [0, 1] to maintain explore space. The parameter b₁ plays a crucial role in this algorithm as it acts as a key factor in balancing the search stage and the utilization stage. Its calculation is carried out as follows:

$n_{1} = 2 e^{- {(\frac{4 l}{l_{\max}})}^{2}}$ (27)

Here l_max and l show the maximum number of iterations and the current repetition, respectively. Once the leader’s position has been updated, SSA proceeds to update the position of the followers using the following formula:

$y_{j}^{i} = \frac{1}{2} (y_{j}^{i} + y_{j}^{i - 1})$ (28)

$y_{j}^{i}$ is the ith position of follower in the jth dimension, and (i) is higher than 1.

The pseudocode for the SSA algorithm is illustrated in Algorithm 2.

Algorithm 2. Salp Swarm Algorithm (SSA)
Initialize a population X.
repeat
Compute the objective function for each solution x_i.
Update the best salp (solution) (F = X^b).
Update c₁using Equation (27).
fori = 1 : Ndo
ifi = =1 then
Update the position of salp using Equation (26)
else
Update the position of salp using Equation (28)
end if
end for
until (t < t_max)
Return the best solution F.

2.6 Hybridization

In machine learning models, hybridization involves amalgamating various algorithms, techniques, or approaches to form a cohesive, unified model. The purpose of hybridization is to capitalize on the unique strengths of each component, resulting in improved overall performance, accuracy, or efficiency compared to using any single method in isolation. Hybridizing Support Vector Regression (SVR) with different optimization algorithms can potentially lead to improved performance and model robustness. In all three cases, the primary objective is to find the best hyperparameters for the SVR model to achieve better generalization and performance on the given dataset. By leveraging the strengths of each optimization algorithm, the hybridized approach can potentially enhance the model’s ability to handle complex relationships and improve its overall accuracy. However, it is essential to conduct thorough experimentation and fine-tuning to ensure the hybridized model’s optimal configuration. To enhance comprehension of the procedure utilized in this study, Fig. 3 has been provided.

Fig. 3

The process of Hybridization.

2.7 Performance evaluators

Various appraisals were utilized to evaluate the workability of models for forecasting the CBR value. Coefficient of determination (R²), root mean squared error (RMSE), mean absolute percentage error (MAPE), mean absolute error (MAE), performance index (PI), and OBJ (Equations (29)–(33)):

$R^{2} = {(\frac{\sum_{p = 1}^{P} (t_{P} - \bar{t}) (y_{P} - \bar{y})}{\sqrt{[\sum_{p = 1}^{P} {(t_{P} - \bar{t})}^{2}] [\sum_{p = 1}^{P} {(y_{P} - \bar{y})}^{2}]}})}^{2}$ (29)

$RMSE = \sqrt{\frac{1}{P} \sum_{p = 1}^{P} {(y_{p} - t_{p})}^{2}}$ (30)

$MAE = \frac{1}{P} \sum_{p = 1}^{P} | y_{p} - t_{p} |$ (31)

$PI = \frac{1}{| \bar{t} |} \frac{RMSE}{\sqrt{R^{2}} + 1}$ (32)

$\begin{matrix} OBJ = & \frac{n_{train}}{P} * (\frac{RMS E_{train} + MA E_{train}}{R_{train}^{2} + 1}) \\ + \frac{n_{test}}{P} * (\frac{RMS E_{test} + MA E_{test}}{R_{test}^{2} + 1}) \end{matrix}$ (33) where, y_P, t_P, $\bar{t}$ , and $\bar{y}$ represent the predicted values of the Pth pattern, the target values of the Pth pattern, the averages of the target values, and the averages of the predicted values, respectively.

3 Result and discussion

3.1 Convergence curve analysis

Figure 4 illustrates the convergence curve of the proposed machine learning models, also known as a learning curve, indicating how model performance evolves or stabilizes with increasing training data. The y-axis represents the RMSE, while the x-axis displays the number of iterations. Analyzing this curve provides valuable insights into the model’s learning behavior and facilitates data-driven decisions concerning the need for additional training data or whether the model has reached its optimal performance. The initial iterations demonstrate the steepness of the models’ learning, and some overlap exists among the models. However, beyond the 90th iteration, all models attain their optimal results. Remarkably, SVR-SSO stands out with its superior performance relative to the other models, as evident from the results.

Fig. 4

The convergence curve for the presented hybrid models.

3.2 Models comparison

This section presents the results of training the admixture inputs of the CBR using machine learning methods. The classical SVR model achieved acceptable results but with a lack of accuracy. Hybridization techniques were employed to address this, utilizing PPSO, BBO, and SSO to enhance the SVR model’s performance. The proposed hybrid models are denoted as SVR-PPSO, SVR-BBO, and SVR-SSO. The evaluation of these models is carried out in two stages: training and testing. The dataset consists of a total of 109 CBR samples, which are utilized in both stages. More specifically, 70% of the data is allocated for training, while the remaining 30% is used for testing. To ensure unbiased and robust model performance comparisons, various performance evaluators, including R², RMSE, MAE, PI, and OBJ, were utilized. These evaluators provide a comprehensive assessment of the models and enable a thorough evaluation of their effectiveness. In the context of the evaluators, a high R2 value approaching 1 indicates an optimal result, while low values for RMSE, MAR, PI, and OBJ, approaching 0, indicate an optimal outcome. Table 3 presents the mentioned models’ results and their corresponding metrics. A rank score ranging from 1 to 4 has been provided to facilitate a more effective comparison of the models. The highest accuracy result for each metric is assigned rank 4, while the least accurate results are assigned rank 1.

Table 3
The results of created models for predicting CBR

Models SVR SVR-SSO SVR-PPSO SVR-BBO

Network results Training data R² 0.9615 0.9894 0.9785 0.9709

Rank score for R² 1 4 3 2

RMSE 2.172 1.122 1.581 1.825

Rank score for RMSE 1 4 3 2

MAE 1.747 0.673 1.299 1.534

Rank score for MAE 1 4 3 2

PI 0.028 0.014 0.020 0.024

Rank score for PI 1 4 3 2

Testing data R² 0.9540 0.9965 0.9880 0.9842

Rank score for R² 1 4 3 2

RMSE 2.604 0.819 1.303 1.489

Rank score for RMSE 1 4 3 2

MAE 2.1745 0.559 1.090 1.198

Rank score for MAE 1 4 3 2

PI 0.0315 0.010 0.016 0.018

Rank score for PI 1 3 3 2

OBJ 2.621 0.607 1.104 1.216

Total ranking score 8 32 24 16

Rank 1 4 3 2

Models	SVR	SVR-SSO	SVR-PPSO	SVR-BBO
Network results	Training data	R²	0.9615	0.9894	0.9785	0.9709
		Rank score for R²	1	4	3	2
		RMSE	2.172	1.122	1.581	1.825
		Rank score for RMSE	1	4	3	2
		MAE	1.747	0.673	1.299	1.534
		Rank score for MAE	1	4	3	2
		PI	0.028	0.014	0.020	0.024
		Rank score for PI	1	4	3	2
	Testing data	R²	0.9540	0.9965	0.9880	0.9842
		Rank score for R²	1	4	3	2
		RMSE	2.604	0.819	1.303	1.489
		Rank score for RMSE	1	4	3	2
		MAE	2.1745	0.559	1.090	1.198
		Rank score for MAE	1	4	3	2
		PI	0.0315	0.010	0.016	0.018
		Rank score for PI	1	3	3	2
OBJ	2.621	0.607	1.104	1.216
Total ranking score	8	32	24	16
Rank	1	4	3	2

As shown in Table 3, the results clearly indicate the SVR single form’s lack of accuracy compared to the other models, as evidenced by its RMSE value of 2.172 in the training phase and 2.604 in the testing phase. However, when applying optimizers to SVR, a significant improvement of at least 20% in its lowest RMSE is observed, demonstrating the effectiveness of the applied optimizers. The SVR-SSO model achieved the highest accuracy among all the other models, demonstrated by the highest R² value and the lowest error indicators. Notably, the SVR-SSO model’s accuracy further improved during the testing phase compared to its performance during the training phase. These results underscore the model’s effectiveness in predicting CBR values with precision. However, despite the improvement observed in the other two hybrid models compared to the single form of SVR, their accuracy still falls short of the SVR-SSO model. The SVR-SSO model outperforms the SVR-PPSO model by 45% and the SVR-BBO model by 58% in terms of the lowest RMSE.

Figure 5 presents a scatter plot illustrating the correlation between the measured and predicted values of the proposed hybrid models, allowing for an assessment of their accuracy. The plot differentiates the values from the train and test phases using distinct colored squares scattered around a center line that represents an R² value of 1. The y-axis represents the measured values, while the x-axis represents the predicted values. In an ideal scenario, the majority of train and test values are closely aligned with the center line, indicating the accuracy of the models. The proximity of the scattered points to the center line suggests that the predictions are in good agreement with the actual measurements, and the models’ performance is noteworthy. Upon observing the SVR-SSO scatter plot, it becomes evident that the test phase values are almost perfectly aligned with the center line, indicating an excellent correlation between the predicted and measured values. Furthermore, the training phase values show a maximum level of correlation with the center line, signifying the model’s strong performance in accurately predicting both the training and test data points. The proximity of the scattered points to the center line in both phases demonstrates the remarkable accuracy and effectiveness of the SVR-SSO model in this analysis. However, both the SVR-PPSO and SVR-BBO models exhibit similar behavior in their performance, with their data showing an acceptable dispersion around the center line. While they do not achieve the same level of perfect alignment as the SVR-SSO model, the scattered points for both SVR-PPSO and SVR-BBO remain close to the center line. This indicates a reasonable correlation between the predicted and measured values for both training and test phases. Though not as accurate as the SVR-SSO model, the SVR-PPSO and SVR-BBO models still demonstrate commendable performance in their predictions, making them valuable alternatives in this analysis.

Fig. 5

The scatter plot for the correlation between predicted and measured CBR.

The line-symbol plot in Fig. 6 offers a visually detailed representation of the correlation between the measured and predicted values of the hybrid models. The measured values are depicted by a black line, and the alignment of the train and test values with this line indicates the accuracy of the models. By using this plot, it becomes evident how closely the train and test values align with the measured value line, allowing for a quick assessment of each model’s accuracy. The clarity of the plot aids in visualizing the performance of the hybrid models and provides valuable insights into the quality of their predictions.

Fig. 6

The line-symbol plot for the comparison between predicted and measured CBR.

Fig. 7

The error percentage for the column plot of developed models.

To better visualize the models’ performance, Figs. 7 and 8 have been provided to indicate the errors of the models. As evident from the figures, the majority of SVR-SSO data values are below 5 percent error, with a maximum error of 7.74% occurring between sample numbers 30 to 40. Figure 7 reinforces the low error percentage, with the majority of data points having errors close to 0 and distributed below 5% error.

Fig. 8

The scatter interval plot for the error percentage of presented models.

However, SVR-PPSO and SVR-BBO exhibit higher error rates of 10.5% and 11.79%, respectively, both occurring in the training phase. This is evident when observing their error balance, ranging between -10 to 10, and the distribution of errors is more pronounced in Fig. 7. In contrast, the error distribution of SVR-BBO exhibits a notably flatter normal distribution in comparison to alternative models, as discernible from Fig. 8. The distribution of errors in SVR-BBO, as visually represented in Fig. 8, is distinguishably flatter when compared to other models. Conversely, SVR-SSO demonstrates a higher concentration of data points near zero percent, indicative of heightened model precision and a distinctively sharper normal distribution. This observation underscores the superiority of SVR-SSO in terms of predictive accuracy and model performance, as evidenced by the distribution characteristics displayed in Fig. 8. Such nuanced distinctions within the error distributions, as portrayed in Fig. 8, shed light on the efficacy of different optimization strategies when applied to support vector regression.

Figure 9 illustrates a Taylor diagram comparing CBR’s predicted and measured values. The Taylor diagram serves as a valuable tool for evaluating the performance of multiple models in relation to a reference dataset, which, in this case, is the CBR value. In Fig. 9, the correlation coefficient is represented on a scale from -1 to 1, indicating the extent of agreement or disagreement between the model predictions and the observed CBR values. A correlation coefficient of 1 denotes a perfect match, while -1 indicates a perfect inverse relationship. Additionally, the standard deviation is depicted on a scale from 0 to 15, signifying the variability of the model predictions in comparison to the measured CBR values. A lower standard deviation indicates greater consistency and agreement between the models and the reference dataset. In Fig. 9, the green, pink, and blue dots represent the SVR-SSO, SVR-PPSO, and SVR-BBO models, respectively. Upon analyzing the diagram, it becomes evident that the SVR-SSO model is the closest to the reference point, indicating a suitable match with the target. Both SVR-BBO and SVR-PPSO demonstrate a close performance, but upon closer inspection, it can be observed that SVR-PPSO slightly outperforms SVR-BBO.

Fig. 9

The Taylor diagram for the predicted and measured CBR.

4 Conclusion

The present study proposed three hybrid models coupled with the support vector regression (SVR) to estimate the California bearing capacity (CBR) of pond ash modified using admixtures. The proposed models include SVR-Biogeography-Based optimization (SVR-BBO), SVR-Particle Swarm Optimization (SVR_PPSO), and SVR-Salp Swarm optimization (SVR-SSO).

It can result that the developed integrated models have R² bigger than 0.9540. results depict that the agreement between observed and estimated CBR from hybrid algorithms is mainly similar to represent the greatest precision.

The standalone support vector regression (SVR) model exhibited high error rates and a low R² value, indicating insufficient predictive accuracy. During the training phase, the SVR model achieved an R² of 0.9615, RMSE of 2.172, MAE of 1.747, and PI of 0.028. A hybridization technique has been obtained to enhance the SVR’s performance. In both the learning and testing stages, SVR-SSO has better performance than the SVR-PPSO and SVR-BBO model, with R², RMSE, MAE, and PI equal to 0.9894, 1.122, 0.6731, and 0.014 for training data, and 0.9965, 0.819, 0.559, and 0.010 for testing data, respectively. So, by considering the OBJ index, the OBJ value for SVR-SSO is 0.607, lower than SVR-PPSO and SVR-BBO with OBJ values of 1.104 and 1.216, respectively. Therefore, the SVR-SSO model outperforms other models in estimating the CBR of pond ash modified by LI and LS, consequently being recognized as the proposed model.

As it is obvious from the presented figures, in all the proposed hybrid models, the estimated CBR illustrates a remarkable correlation with observed ones, approving developed hybrid models’ potential to estimate the CBR value of pond ash stabilized with LS and LI with great precision. Therefore, both proposed models conclude the lowest error in the CBR forecasting procedure, depicting precise prediction that makes it to be used for practical applications.

References

Esmaeili Falak , Mahzad , Reza Sarkhani Benemaran , Rasoul Seifi , Improvement of the mechanical and durability parameters of construction concrete of the Qotursuyi Spa, Concrete Research 13.2 (2020), 119–134.

Kumar Bera , Ashis , Ambarish Ghosh , Amalendu Ghosh , Compaction characteristics of pond ash, Journal of Materials in Civil Engineering 19.4 (2007), 349–357.

Suthar

, Aggarwal

Analysis of heavy metals in pond ash samples from Haryana, Proceedings of 29th ResearchWorld International Conference. LasVegas, USA, 16th–17th March. 2017.

Parsa Jafar , Stuart

, Munson-McGee , Robert Steiner , Stabilization/solidification of hazardous wastes using fly ash, Journal of Environmental Engineering 122.10 (1996), 935–940.

Ghosh Ambarish , Chillara Subbarao , Tensile strength bearing ratio and slake durability of class F fly ash stabilized with lime and gypsum, Journal of Materials in Civil Engineering 18.1 (2006), 18–27.

Pandian

N.S.

, Fly ash characterization with reference to geotechnical applications, Journal of the Indian Institute of Science 84.6 (2004), 189.

Ghosh Ambarish Environmental and engineering characteristics of stabilized low lime fly ash. Diss. IIT Kharagpur, 1996.

Ghosh Ambarish , Chillara Subbarao , Strength characteristics of class F fly ash modified with lime and gypsum, Journal of Geotechnical and Geoenvironmental Engineering 133.7 (2007), 757–766.

Suthar

and Aggarwal

, Class-F pond ash a potential highway construction material–a review, Indian Highways 43.8 (2015), 23–32.

10.

Sahu Vaishali and Gayathri

, The use of fly ash and lime sludge as partial replacement of cement in mortar, International Journal of Engineering and Technology Innovation 4.1 (2014), 30.

11.

Battaglia

, et al, Reduction of Pb and Zn bioavailable forms in metal polluted soils due to paper mill sludge addition: Effects on Pb and Zn transferability to barley, Bioresource Technology 98.16 (2007), 2993–2999.

12.

Calace

, et al, Metal-contaminated soil remediation by means of paper mill sludges addition: chemical and ecotoxicological evaluation, Environmental Pollution 136.3 (2005), 485–492.

13.

Mahmood Talat , Allan Elliott , A review of secondary sludge reduction technologies for the pulp and paper industry, Water Research 40.11 (2006), 2093–2112.

14.

Medhi

U.J.

, Talukdar

A.K.

and Deka

, Physicochemical characteristics of lime sludge waste of paper mill and its impact on growth and production of rice, Journal of Industrial Pollution Control 21.1 (2005), 51–58.

15.

Talukdar Kumar

, A study of paper mill lime sludge for stabilization of village road sub-base, Int J Emerg Technol Adv Eng 5.2 (2015), 389–393.

16.

Singh

, Garg

Utilization of waste lime sludge as building materials (2008).

17.

Ghosh Ambarish , Compaction characteristics and bearing ratio of pond ash stabilized with lime and phosphogypsum, Journal of Materials in Civil Engineering 22.4 (2010), 343–351.

18.

Day , Robert

Soil testing manual. McGraw-Hill, 2001.

19.

Benemaran Reza Sarkhani , Mahzad Esmaeili-Falak , Optimization of cost and mechanical properties of concrete with admixtures using MARS and PSO, Computers and Concrete, An International Journal 26.4 (2020), 309–316.

20.

Kin Judul

California Bearing Ratio Correlation with Soil Index Properties. Diss. University Technology Malaysia, 2006.

21.

Yorulmaz Aysegul , Osman Sivrikaya , Firdevs Uysal , Evaluation of the bearing capacity of poor subgrade soils stabilized with waste marble powder according to curing time and freeze-thaw cycles, Arabian Journal of Geosciences 14.5 (2021), 1–10.

22.

Masoumi Fariborz , et al, Automatic calibration of the groundwater simulation model with high parameter dimensionality using sequential uncertainty fitting approach, Water Supply 20.8 (2020), 3487–3501.

23.

Sarkhani Benemaran Reza , Mahzad Esmaeili-Falak , Hooshang Katebi , Physical and numerical modelling of pile-stabilised saturated layered slopes, Proceedings of the Institution of Civil Engineers-Geotechnical Engineering (2020), 1–16.

24.

Esmaeili-Falak Mahzad , et al, Predicting triaxial compressive strength and Young’s modulus of frozen sand using artificial intelligence methods, Journal of Cold Regions Engineering 33.3 (2019), 04019007.

25.

Nassr Ali , et al, A new approach to modeling the behavior of frozen soils, Engineering Geology 246 (2018), 82–90.

26.

Das Sarat Kumar , Prabir Kumar Basudhar , Undrained lateral load capacity of piles in clay using artificial neural network, Computers and Geotechnics 33.8 (2006), 454–459.

27.

Park

H.I.

and Cho

C.W.

, Neural network model for predicting the resistance of driven piles, Marine Georesources and Geotechnology 28.4 (2010), 324–344.

28.

Erzin Yusuf , Tulin Cetin , The prediction of the critical factor of safety of homogeneous finite slopes using neural networks and multiple regressions, Computers & Geosciences 51 (2013), 305–313.

29.

Erzin Yusuf , Tulin Cetin , The prediction of the critical factor of safety of homogeneous finite slopes using neural networks and multiple regressions, Computers & Geosciences 51 (2013), 305–313.

30.

Zhao Hong-bo , Slope reliability analysis using a support vector machine, Computers and Geotechnics 35.3 (2008), 459–467.

31.

Shi Jonathan Jingsheng , Reducing prediction error by transforming input data for neural networks, Journal of Computing in Civil Engineering 14.2 (2000), 109–116.

32.

Yoo Chungsik , Joo-Mi Kim , Tunneling performance prediction using an integrated GIS and neural network, Computers and Geotechnics 34.1 (2007), 19–30.

33.

Yildirim

OJESwA Gunaydin , Estimation of California bearing ratio by using soft computing systems, Expert Systems with Applications 38.5 (2011), 6381–6391.

34.

Sabat Akshaya Kumar , Prediction of California bearing ratio of a soil stabilized with lime and quarry dust using artificial neural network, Electronic Journal of Geotechnical Engineering 18 (2013), 3261–3272.

35.

Suthar Manju , Praveen Aggarwal , Modeling CBR value using RF and M5P techniques, Mendel 25(1) (2019).

36.

Raja Muhammad Nouman Amjad , Sanjay Kumar Shukla , Muhammad Umer Arif Khan , An intelligent approach for predicting the strength of geosynthetic-reinforced subgrade soil, International Journal of Pavement Engineering (2021), 1–17.

37.

Xing-Xing Shen , Cao Wei-Wei , Li Kai , Predicting CBR value of stabilized pond ash with lime and lime sludge using multivariate adaptive regression splines, Engineering Research Express 3.4 (2021), 045036.

38.

Simon Dan , Biogeography-based optimization, IEEE transactions on Evolutionary Computation 12.6 (2008), 702–713.

39.

Kennedy James , Russell Eberhart Particle swarm optimization, Proceedings of ICNN'95-international conference on neural networks Vol. 4. IEEE, 1995.

40.

Wang Lipo ed. Support vector machines: theory and applications. Vol. 177. Springer Science & Business Media, 2005.

41.

Vapnik Vladimir The nature of statistical learning theory. Springer science & business media, 1999.

42.

Al-Fugara , A’kif ,et al, Spatial mapping of groundwater springs potentiality using grid search-based and genetic algorithm-based support vector regression, Geocarto International (2020), 1–20.