Abstract
The estimation of compressive strength includes time-consuming, finance-wasting, and laboring approaches to undertaking High-performance concrete (HPC) production. On the other side, a vast volume of concrete consumption in industrial construction requires an optimal mix design with different percentages to reach the highest compressive strength. The present study considered two deep learning approaches to handle compressive strength prediction. The robustness of the deep model was put high through two novel optimization algorithms as a novelty in the research world that played their precise roles in charge of model structure optimization. Also, a dataset containing cement, silica fume, fly ash, the total aggregate amount, the coarse aggregate amount, superplasticizer, water, curing time, and high-performance concrete compressive strength was used to develop models. The results indicate that the AMLP-I and GMLP-I models served the highest prediction accuracy. R2 and RMSE of AMLP-I stood at 0.9895 and 1.7341, respectively, which declared that the AMLP-I model could be presented as the robust model for estimating compressive strength. Generally, using optimization algorithms to boost the capabilities of prediction models by tuning the internal characteristics has increased the reliability of artificial intelligent approaches to substitute the more experimental practices.
Keywords
Introduction
The massive demand for concrete in every industrial construction caused a vast consumption of materials in concrete production. These issues aimed researchers to look for an optimal concrete that maintains the primary properties of normal concrete and improves the concrete’s mechanical properties. Recently, admixtures like silica fume, fly ash, metakaolin, ground granulated blast furnace slag and rice husk ash are at the top of the admixture list used in concrete mixture designs [1–6]. Most of the admixtures are applied to substitute the concrete’s cohesive material, mainly the different types of cement [7, 8].
The similarity of fly ash to the normal Portland cement in terms of particle size and shape and water usage reduction percentage paved the way to consider it a partial replacement of cement in the concrete design. The best form of fly ash substitution is achievable with a superplasticizer, which can help improve concrete’s mechanical and applicable properties like strength, workability, durability, eco-saving, and cost-saving properties [9]. Fly ash is partially replaced with cohesive concrete material to reach the optimal concrete design with an alternative enhancement of properties. Reduction of CO2 emission in using fly ash in a concrete mix design and the reported increment of compressive strength of admixed concrete are mostly feasible [10–12]. As it is reported, the optimal percentage of fly ash substitution with the cohesive material of the concrete is located between 20% to 50%, where the upper percentage range can be increased up to 10% if the compressive strength in the early days of concrete is unimportant [13, 14].
A concrete’s aggregates can be divided into coarse aggregate, fine aggregate, and filler. The main task of filler is to fill the missed atmosphere between the coarse and fine aggregates. The size of silica fume particles helps this admixture handle the roles of filler and pozzolanic action [15, 16]. Also, the pozzolanic reaction of silica fume and its particle size presents a remarkable increment in concrete properties [17–22]. Although the rise of silica fume usage in the concrete mix design leads to a decrement in its workability, adding silica fume has a remarkable impact on the short-term compressive strength rising [23, 24]. As the porosity of the concrete decreases, the compressive strength of the concrete tends to increase more. This engagement of compressive strength is rooted in the proper usage of silica fume coupled with superplasticizer (filler properties of silica fume) [25–27]. A self-consolidating concrete in the sulfuric acid medium admixed with silica fume was implemented in a study. This study replaced about 9% of concrete cohesive material with silica fume. The results demonstrated a considerable increase in the concrete’s CS of with silica fume [28]. Another study also considered the effects of micro-silica on the compressive and flexural strength of cement mortar. This study also declared cement mortar’s compressive and flexural strength enhancement due to micro-silica addition [29].
The compressive strength of concrete is the main indicator of concrete which is the foundation of other properties of concrete. Estimating compressive strength producing samples of concrete with various mix designs is a time-wasting and finance-wasting method normally implemented in different sites and labs. The recent development of artificial intelligence models facilitated the difficulty of compressive strength estimation. [30–32]. Dao et al. developed ANN and ANFIS-based prediction models to estimate the prediction of compressive strength. They employed 210 samples to put forward the precise estimation of geopolymer concrete compressive strength [33]. In another study, Liker and Mustafa provided a prediction model for compressive strength. They used 52 different concrete mixtures with fly ash for this aim, which ended in 180 samples[34]. Ali et al., in their study, computed the mechanical features of concrete containing tensile strength, compressive strength, and flexural strength for roller compacted concrete pavement. They applied classification-based regression models such as random forest and M5 prime models to implement the prediction of mentioned properties [35]. Saridemir also used a multi-layer neural network to predict concrete’s compressive strength, including micro-silica. He employed 195 samples with 33 various mixture designs. This feedforward multi-layer network provided a robust prediction of compressive strength [30]. Moodi et al. (2022) employed machine learning techniques of Radial Basis Function Neural Networks (RBFNN), ANN, and Support vector regression (SVR) frameworks for estimating the compressive strength of constructional samples without using any adjusting tool to optimize the working procedure of main prediction models and the correlation coefficient values could be computed at 0.971, 0.973, and 0.901, respectively [36]. In another research, Chen et al. (2022), with using 324 concrete samples in a dataset including 5 inputs as independent constituents, namely cement, water, coarse aggregates, fine aggregates, and superplasticizers, modeled the mechanical feature of compressive strength with the conventional support vector regression (SVR) model and without any complementing certain approach. The results operated that the estimation reliability and accuracy of SVR were R2 = 0.973, RMSE = 1.595, MAE = 0.312, MAPE = 2.469 [37]. Pazuki et al. utilized an RBF neural network to define the CS. They provided 327 SCC samples including CFFA to generate the estimation model [38]. Behnood et al. [39] developed a hybrid model, coupling ANN, and Grey wolf optimization algorithm to predict the compressive strength of concrete admixed with silica fume. A multi-objective optimization was considered in their study to reach the ANN model with the lowest error. Siddique et al. [40] also used the ANN model to predict the CS of self-compacting concrete, including bottom ash. They studied the effects of input on the compressive strength of various concrete ages. The developed ANN-I and ANN-II models delivered a good prediction of compressive strength.
Based on previous research, most studies have employed the prediction models without using any successor tool to enhance the ability of the main anticipating model. Using the machine learning technique has solely led to setting the arbitrary parameters of models with inefficient desired numbers based on previous research. Alternatively, using optimization algorithms and prediction models separately can enter more errors in the modeling process. Therefore, in the present study, the prediction of CS of an HPC admixed with silica fume and fly ash is goaled utilizing Artificial neural networks (ANN). For this purpose, as the novelty of this paper and in the scientific world, ANN was optimized with an arithmetic optimization algorithm, and a grasshopper optimization algorithm was developed that has not been proposed in any research. Many researchers are using ANN for predictions of specific features of construction materials successfully such as: (Lin and Wu, 2021, Dao et al., 2020, Moradi et al., 2021, Singh et al., 2021, Wu, 2021); however, for GOA and AOA optimization algorithms (Qin et al., 2021, Lv & Peng, 2021, Kandiri et al., 2021, Kaveh & Hamedani, 2022, Agushaka & Ezugwu, 2021, Abualigah et al., 2022, Gürses et al., 2021) with acceptable results [41–51].
Also, a dataset containing cement, coarse aggregate, total aggregate, fly ash, silica fume, superplasticizer, water, curing time, and compressive strength was applied as input and target values. Three ANN models were optimized with an arithmetic algorithm named AMLP-I, AMLP-II, and AMLP-III, and three optimized with grasshopper algorithms named GMLP-I, GMLP-II, and GMLP-III trained and tested by the abovementioned intakes to achieve a precise prediction model of compressive strength. The evaluation of six proposed hybrid models considered several statistical metrics: R2, RMSE, MDAPE, MAPE, and VAF.
Materials and methodology
A dataset containing the amount of cement (C), silica fume, fly ash, total aggregates, coarse aggregates, water, superplasticizer, curing time, and compressive strength are collected from the literature [14, 52]. Different mix designs of concrete for the ages of 28 days to 180 days are considered to estimate the long-term and short-term compressive strength of HPC concrete. The dataset is divided into three parts based on the water to cement ratio of 0.3, 0.4, and 0.5. also, the cement applied in this collection is classified as Portland cement of ASTM Ttype1. Moreover, the fly ash is of low calcium, the same as the type of ASTM class F. the frequency of dataset members’ usage in different mix designs is depicted in Fig. 1. In Table 1, the data set’s main statistical properties are also listed. A preferred data set can be categorized into two types: I- independent parameters, as the constituents composed to produce the concrete HPC samples; II-dependent parameters of concrete compressive strength that are altered by the inputs presented as data type I. Thus, all data will be fed to a hybrid model. To this end, the training phase is done to model get learned with input and output (target) values. In this phase, 30 percent of data is used with the sigmoid function. In the next stage, data will be validated in which the model will be asked to produce output values by entering 15 percent of inputs. Then the target values are enrolled in the model to remodify the weights and biases using input (I) and target (II) data based on the Mean Squared Error (MSE). Therefore, in the final testing phase, the remaining 15 percent of inputs will be entered to model the compressive strength values. However, during the modeling process, the number of neurons in hidden layers is optimized to generate the best conditions for working the developed model at a desirable level to have low cost and network complexity [53, 54]. Figure 2 has shown the process of current research stages and coupling ANN with optimizers.

The histogram of input and target variables: a) C b) CT (day) c) CA/C ratio d) CA/TA ratio e) FA/C ratio f) SF/C ratio g) SP/C ratio (%) h) W/C ratio i) CS value (MPa).
Statistical properties’ explanation for data applied in train and test steps

Schematic view of current research stages.
Artificial neural network structures are inspired by the biological networks shown in the human brain and are classified as computational black-box processes [55, 56]. A biological neural network consists of a large number of neurons that are connected to influence each other in order to provide optimized results and be highly accurate. An ANN structure mostly contains several layers with different numbers of neurons in the layers. Layer one, called the input layer, is not the operational layer and has tasks that receive data. The several neurons in the input layer rely on the dataset’s number. The hidden and output layers are the operational layers of the ANN structure, each neuron in these layers obtains a signal from the previous neuron and sends an output signal to the succeeding neuron after the process is complete. This strategy includes linear and nonlinear mathematical relationships such as:
In Equations (1) and (2), B
j
is the bias of jth neuron, IN
i
is the ith input signal. wi,j is the ith input of jth neuron’s weight. In addition, f is the activation function. The best-known activation function is the hyperbolic tangent-sigmoid function, which is utilized in this work as the hidden layer activation function and is constructed in the following [57–59].
The output layer structure has one neuron and is organized as an operation layer. The activation function for this layer is primarily a pure line.
In addition, a training method is run on the ANN model, and biases and weights are set so that the precision of the model output matches a satisfactory level. This training function in this study is the Levenberg-Marquardt algorithm because of its high performance in compressive strength prediction models [60].
The Arithmetic Optimization Algorithm is a candidate-based algorithm with an algebraic concept that uses arithmetic operators to update and find new positions of a set without calculating their results [61]. Arithmetic is one of the fundamental algorithmic procedures of number theory that begins with the initialization of randomly generated candidate solutions and is a major feature of current mathematics.
The algorithm contains two main sections: exploitation and search. Then developing the initial candidates, need to define and implement the exploration or development search space utilizing the Mathematical Optimizer Accelerator Facility (MOA).
Where Max and Min determine the maximum and minimum values of MOA. The iter is the current iteration and Max iter is the number of maximum iterations.
The exploratory search function is performed by highly distributed values applying arithmetic multiplication (M) and division (D) operators to the exploratory search function. Operators M and D produce high variability and do not help to achieve goals, but applying subtraction (S) and addition (A) arithmetic operators in the utilization stage leads to the best goals.
If r1 > MOA the search step of the algorithm is in progress. Using the M and D operators, he position of the search step is updated utilizing Equation 6.
Where best (c
j
) is the global best position, lb and ub are the lower and upper the explore area’s bound. ɛ shows a little value, and μ sets the control parameter to explore the method and is set to 0.499 in this study. MOP is determined as math optimization probability and is computed in the following.
In Equation 7, α denotes the precision sensitivity parameter used during the iteration and is equal to 5.
In addition, if r1 < MOA the exploit step occurs. In this step, arithmetic operators S and A are applied for the deep dense domain search. This depth-first exploration is modeled in the following:
The schematic flowchart of the hybrid arithmetic base neural network is shown in Fig. 3.

Hybrid AMLP predictive model’s schematic flowchart.
GOA is a swarm-based algorithm that models the behavior of the grasshopper insect to converge on the best solution [62]. The life cycle of a grasshopper includes egg, nymph, and adulthood [63]. In adulthood, a grasshopper exhibit a swarm-based demeanor covering long distances, with two characteristics: long range and rapid movements. The behavior of this grasshopper can be represented by a formula in the following:
Hence, x i determines the position of ith grasshopper, and S i is the parameter that shows the social interaction of grasshoppers.
d
ij
donates the space between ith and jth grasshopper and
In Equation 11, f and l show the strength or length of the force of attraction. Since locusts are wingless, wind direction is the main criterion for movement.
Where G
i
and A
i
are the gravity force and wind advection of ith grasshopper.
ub
d
and lb
d
are the upper and lower boundary,
In the present study, the values of c min and c max are equal to 0.0001 and 1, respectively. In Fig. 4, the flowchart of optimizing the ANN model by the Grasshopper algorithm.

Hybrid GMLP predictive model’s schematic flowchart.
Some statistical measures are believed to assess the robustness of the two new hybrid models (GRBFN and ERBFN).
∘ Coefficient of determination (R2):
∘ Root mean squared error (RMSE):
∘ Mean absolute percentage error (MAPE):
∘ Median of absolute percentage error:
∘ Variance account factor
Where t
n
and p
n
are measured and predicted values, respectively. N shows the samples’ total number.
Compressive strength (CS) estimation is the main issue considered in evaluating concrete properties. The experimental process of CS prediction is done by making a sample for each concrete mixture design. Also, highlighting the effect of admixtures on the CS with the experimental form requires an extended range of time, budget, and laboring. Moreover, the huge volume of concrete consumption needed material, and CO2 emission forced researchers and constructors to apply by-product admixtures in the concrete mix design. The present study employs the concrete mix design of two admixtures, fly ash and silica fume. These admixtures are replaced with the cohesive material of HPC concrete with the presence of a superplasticizer. A different mix of concrete containing fly ash, silica fume, and superplasticizer is tested, and the compressive strength is recorded.
Two hybrid feedforward multi-layer perceptron neural networks (MLP) optimized by arithmetic optimization algorithm and grasshopper optimization algorithm employed to perform the compressive strength prediction of the concrete. In the process of modeling, six predictive models one hidden layer AOA base MLP (AMLP-I), two hidden layers AOA base MLP (AMLP-II), three hidden layers AOA based MLP (AMLP-III), one hidden layer GOA based MLP (GMLP-I), two hidden layers GOA based MLP (GMLP-II), and three hidden layers GOA based MLP (GMLP-III) developed to serve the best prediction of compressive strength. Also, in these models, the number of neurons in each hidden layer structure is optimized by the optimization algorithms, and Levenberg-Marquardt was in charge of training the weights and biases. The activation function of the hidden layers is assumed to be the sigmoid activation function, and the output layer utilizes the pure line as an activation function.
The intake set of data to run the hybrid proposed models included cement, fly ash/cement ratio, silica fume/ cement ratio, superplasticizer/cement ratio, coarse aggregate/cement ratio, total aggregate/cement ratio, water/cement ratio, curing time as input and the compressive strength as target values. The data set is normally rand-permed to have all ages of concrete and all percentage of admixture in both the training and testing step, then the data set is divided into the training and testing phase with the proportion of 70% and 30%, respectively. Table 1 shows that the maximum and minimum data in the training and testing step represent the fair use of all materials ranges in both phases.
Introduced statistical indicators to assess the accuracy of models. Based on these metrics, the workability and performance of the models are evaluated, and the best model that outperforms an acceptable prediction can be selected accurately. The statistical metrics used in this study are R2, RMSE, MAPE, MDAPE, and VAF.
The robustness of the models is validated considering the RMSE values of proposed hybrid models with the model developed by Yin et al. [52], which applied the same data set in their study. The best value of RMSE obtained from the Yin et al. work stood at 4.876, and the RMSE’s value for this paper is 1.7341, which declared the precise accuracy of the proposed hybrid models.
Table 2 shows AMLP and GMLP with one, two, and three hidden layers considering the R2 and RMSE. As it is clear from Table 2, in both models, the best amount of R2 and RMSE is for a model with a hidden layer, namely AMLP-1 and GMLP-I. in both hybrid models, the layered model shows a good convergence in the training phase and testing phase as well. It is noticeable that the AMLP and GMLOP model, the AMLP-II, and GMLP-II present the weakest results compared to other delivered models. Also, for both AMLP-I and GMLP-I, R2 and RMSE are better than the training phase.
Statistical properties of the training and the testing phases of all hybrid models
Statistical properties of the training and the testing phases of all hybrid models
Figure 5 represents the k-fold cross-validation of the AMLP model and GMLP model in the training and the testing step.it is obvious from Fig. 5(a–d) that each model’s best results are for the models with one hidden layer. Also, the figure demonstrates that the worst results are for models with two hidden layers in both models.

The k-fold cross-validation of hybrid AMLP and GMLP models in the train and the test phases: a) AMLP-Train b) AMLP-Test c) GMLP-Train d) GMLP-Test.
As demonstrated in Table 2, the AMLP-I and GMLP-I model performs best in their group. As it is clear from Table 3, the best R2 values are obtained from AMLP-I in the test phase, and the next model with a higher R2 value is the GMLP-I in the testing phase. AMLP-I model could get 0.9770 in the training phase while GMLP-I could obtain 0.9708 with 0.6% lower than the earlier model. However, in the testing phase, AMLP-I was 0.8% more than GMLP-I with the correlation index of R2 with 98.95% and 98.17% for the AMLP-I and GMLP-I alternatively. However, Ayubi rad, in a study, employed the model LSSVR -least square support vector regression- according to the couple simulated annealing (CSA) for finding the non-linear relation between the mechanical compressive strength feature of concrete and eight ingredients as inputs containing cement, the water, the fine aggregates, fly ashes, the superplasticizer, the blast furnace slags, the coarse aggregates, Ages of the sample). According to the R2 index of 92% and RMSE error indicator of 6.17 MPa, train and test results have indicated that the ANN model and CSA-LSSVR have great potential to predict the compressive strength rates [64]. In another study, Gholampour et al. investigated the application of machine learning to estimating concrete constituents. In another study, Dao et al. (2020) utilized ANN to estimate the compressive strength rates of constructional materials. That maximum R2 value was calculated as 0.976 in the training phase, and an R2 in the test phase was computed as 0.972 [44]. The best RMSE values are the least that belong to the AMLP-I model again in the testing phase, and the highest RMSE value is for the GMLP-I model in the training phase. AMLP-I model could be assessed by RMSE at a level of 2.23 MPa in the training phase, while GMLP-I could obtain 2.50, which is 12.1% higher than the earlier model. However, in the testing phase, AMLP-I was 33% more than GMLP-I, with the correlation index of RMSE with 1.73 and 2.30 MPa for the AMLP-I and GMLP-I, respectively.
The output results of the best models in the training and testing steps considering statistical metrics
In contrast with R2, the second stage considering RMSE value is for the training step of the AMLP-I model. The MAPE values caught from the models showed the same trend as R2. Also, from the MAPE index point of view, the AMLP-I model was assessed by MAPE at a level of 2.2818 MPa in the training phase, while GMLP-I could obtain 2.2850 MPa. However, in the testing phase, AMLP-I was 13% more than GMLP-I with the correlation index of R2 with 1.97 and 2.21 MPa for the AMLP-I and GMLP-I, respectively. In the case of MDAPE, GMLP-I achieved the best values in both the training and the testing. The VAF values also went by the R2 value with the same ranking. The best model among four models with delivered results in Table 3 is evaluated through a ranking system. The ranking system showed that the AMLP-I and GMLP-I models caught the first and second stages in the test phase. Moreover, the third and fourth stages are for the AMLP-I and GMLP-I in the training phase.
A comprehensive understanding of models’ behavior is obtainable from the scatter plot drawn for the AMLP models and GMLP models in Fig. 6. The scatter plot proves that the test phase performed better in all models than in the training phase. The GMLP-I and AMLP-I have the best behavior compared with other models. Also, the fitted line for AMLP-I in the test phase depicted the best matches with the Y = X line. GMLP-I with R2 of 0.9708 in the training phase was improved in the testing phase by as much as one percent with the rate of 0.9817. But relative to GMLP-II correlation rate was reduced to 0.9664 with a 0.45% reduction percentage. By increasing the number of hidden layers to three, the R2 index was calculated at 0.9684.

Scatter plot of hybrid AMLP and GMLP models: a) AMLP-I b) AMLP-II c) AMLP-III d) GMLP-I e) GMLP-II f) GMLP-III.
The HPC compressive strength estimation is studied in this work. A data set is including fly ash and silica fume considered to develop the prediction model. Two multi-layer perceptron-based prediction models were optimized by arithmetic, and grasshopper optimization algorithms were applied to perform the compressive strength prediction. The outset of results can be summarized as follow. A dataset containing fly ash and silica fume integrated with a superplasticizer was employed to run the proposed prediction models An arithmetic optimization algorithm optimized two multi-layer perceptron models, and grasshopper optimization algorithms were applied to implement the prediction. The AMLP-I model showed its precise capability to predict the HPC compressive strength. The GMLP-I model also generated a robust prediction with acceptable accuracy in the prediction of compressive strength. The best prediction accuracy was obtained from AMLP-I in the test phase, and the lowest prediction accuracy was obtained from GMLP-I in the training phase. The output of both models showed a robust prediction potential of compressive strength. However, the AMLP-I showed the best results and was better qualified to model the prediction of compressive strength.
