Abstract
Reinforced concrete structural walls (RCSWs) are integral part of buildings and other structures and are used to carry in-plane and out-plane loads. For assessment purposes and to ensure safety and resilience of the structure, the curvature, capacity, and strain demands of RCSWs need to be estimated. Nonlinear numerical models are increasingly used in earthquake engineering design and assessment, where it is critical to develop high fidelity simulation tools to precisely forecast the global and local behavior of RCSWs. Plastic deformations concentrated in small regions, i.e. plastic hinges, and characterized through the concept of plastic hinge length (PHL), are conventionally used to define the inelastic response of RCSWs. In this study machine learning (ML) algorithms were leveraged in evaluating the PHL of RCSWs. A database containing 721 planar and nonplanar RCSW samples were utilized for training and testing ML models. Four different algorithms were employed, namely: XGBoost, CatBoost, Random Forest (RF), and genetic programming. The RF model outperformed other ML counterparts. The results of the best ML model were also compared against empirical equations available in literature and nonlinear regression analysis. The proposed ML model provided much better predictions than the existing relationships. Predictions of most empirical models were found to be extremely over (or under) conservative. The proposed ML model was also presented via an online tool where users can obtain the PHL of any RCSW simply by inputting wall parameters and clicking a run button.
Keywords
Highlights
Developed a machine learning (ML) model to predict the plastic hinge length (PHL) of reinforced concrete structural walls (RCSWs). Four ML models examined: XGBoost, CatBoost, Random Forest (RF), and genetic programming. RF model outperformed other ML models, existing expressions, and regression models. SHapley Additive exPlanation (SHAP) framework was used to identify the significant wall parameters on PHL. A user-friendly web application is presented for the ML model, allowing practitioners to determine the PHL easily.
Introduction
Reinforced concrete structural walls (RCSWs) are particularly advantageous at bearing lateral loads like wind or seismic loads due to their high strength and rigidity (Cao et al., 2021). In regions of high seismic activities, using structural walls to counteract lateral loads is a widely common practice (Barkhordari, 2022; Cao et al., 2022). It is desirable to design a structural wall to behave inelastically during a devastating earthquake (Wang et al., 2021), where it can maintain large deformations without failure (Barkhordari and Tehranizadeh, 2021). The characteristics of the structural system affect the pattern and size of the inelastic deformations inside the structure (Barkhordari, 2022). To effectively assess structural walls, it is essential to follow well-established guidelines that consider a complete understanding of the wall’s behavior for the geometry, materials, and loading (Massone et al., 2021; López et al., 2022).
Although the inelastic curvature will vary significantly as the wall height increases, it is conventional to assume it is uniform above the base (Figure 1) at a height corresponding to the length of the plastic hinge (Nasr et al., 2022). Tall buildings have several floor slabs connecting the walls, causing complicated interactions between walls and floor slabs (Barkhordari and Es-Haghi, 2021). To ensure consistency of the overall deformations at the story level, slender walls exhibit larger flexural deformations around the wall base (Beiraghi and Siahpolo, 2017; Beiraghi, 2018a, 2018b; Barkhordari et al., 2021). Furthermore, a large plastic hinge is more likely to occur at higher levels of tall buildings due to higher-mode effects (Panagiotou and Restrepo, 2009; Beiraghi, 2018c, 2019a, 2019b; Beiraghi and Kheyroddin, 2021). As a result, the assumed value of the plastic hinge length (PHL) has a significant impact on the plastic deformation and associated maximum displacement capacity of RCWs (Pozo et al., 2021). Curvature distribution of shear reinforced concrete wall.
Many studies initially advocated using a PHL equal to half of the wall length (
Applications of various ML algorithms in RCSWs.
The main goal of this study is to fill a gap in the literature by developing a comprehensive and useful ML based model that processes large amounts of data using different ML algorithms with a database of nonplanar and rectangular RCWs t predict PHL for RCSWs. Four different algorithms are considered including XGBoost, CatBoost, RF, and Genetic Programming (GP). The root-mean-squared error, coefficient of determination, mean absolute percentage error, and median absolute error are utilized as assessment metrics to assess the efficiency of ML models. Results showed the RF algorithm outperformed other ML models in terms of assessment metrics. The results of the best model are compared against previous equations developed for PHL calculation. The performance of the proposed model was much better than the existing relationships. Also, the SHapley Additive exPlanation (SHAP) method is employed to describe the outputs of the best model and identify the most notable variables in determining the PHL.
Dataset description
The accuracy and robustness of prediction in ML models is significantly influenced by the inputs. This study takes into account a brand-new database of nonplanar and rectangular RCSWs which is compiled by (Hoult, 2022). The database contains 721 planar and nonplanar reinforced concrete structural walls (RCSW) samples. The PHL of RCSWs is the desired output variable. The input parameters of this database are the ratio of the secondary cracking ( Cross-sections of RCSWs in database compiled by Hoult (2022). Statistical properties of input and output variables.

A correlation matrix (Figure 3) was used to show how the input parameters relate to one another. As can be seen, all input parameters had very low correlations (<0.5). The ratio of shear span ( Correlation matrix of the inputs. Inputs and output correlation.

Description of ML models
Among many algorithms, some have been reported to perform well in previous studies (Barkhordari and Es-Haghi, 2021; Feng et al., 2021; Sarir et al., 2021; Asteris et al., 2022; Liu et al., 2022), have been used in this research. Four types of ML models, namely Extreme Gradient Boosting (XGBoost), RF, Categorical features-support gradient Boosting (CatBoost), and GP, are developed in this study to forecast the PHL of RCSWs. For XGBoost, RF, and CatBoost models, Bayesian optimization is employed to efficiently tune their hypermeters (Abdollahi et al., 2022).
XGBoost model
Chen and Guestrin (2016) created the decision-tree-based ensemble ML technique known as XGBoost. This approach, which is a more sophisticated variant of the gradient boosting algorithm, was created to produce results with both high accuracy and little chance of overfitting (Asteris et al., 2022). Three key components of gradient boosting are a loss component to be minimized, a weak learner to produce estimates, and an additive model to append more weak learners. Regularization is appended to the target function with a loss function as an upgrade from a straightforward gradient boosting approach to get rid of overfitting. The model’s objective function (J) and general formula for the prediction at step t are written as follows:
Hyperparameter of XGBoost model.
The posterior probability serves as a stand-in for an objective function in this way. The surrogate function can be utilized to approximate the objective function and use that information to guide future sample decisions. While sampling, the posterior is carefully used in what is known as the “acquisition” function, for example when acquiring extra samples. The acquisition will improve the conditional probability of positions in the search to create the next instance, whereas the model leverages the knowledge of the objective function to sample the portion of the search space that is most likely to be profitable. The posterior is modified once fresh examples have been added to the dataset and evaluated using the objective function. This procedure is continued until the extrema of the target function are found, a satisfactory outcome is found, or the available resources are depleted (Fu et al., 2022).
Random forest model
Regression trees (RTs) have a very high variation, and a small change in the input data can have a big impact on the output because of the training process’ inherent hierarchy, which causes an error in the top splitting to ripple down to all the splits below (Liu et al., 2022). As a result, overfitting frequently occurs, which lowers the RT model’s capacity for applicability. To address the instability of a standalone RT, RF is offered as a simple ML model. Following the continuous learning approach, RF builds many RT models based on subsamples arbitrarily chosen from the training set using a bagging strategy. Additionally, each RT model’s input feature count is randomized in relation to the total features. Finally, the results from all RT models are integrated to generate an inference based on a particular input by using averaging. In Figure 5, the RF procedure is depicted. The optimized hyperparameters’ values for the tunning process are shown in Table 4. RF model procedure. Hyperparameter of RF model.
CatBoost model
XGBoost is comparable to CatBoost, an updated Gradient Boosting Decision Tree framework (Prokhorenkova et al., 2018). Traditional boosting techniques reduce errors by changing a training set’s weights (Alzubi et al., 2022). One of CatBoost’s main advantages is its capacity to combine many data kinds, like photos, audio, and text features. In contrast to most ML algorithms, which are unable to manage non-numeric values, CatBoost provides an unconventional method of managing categorical data, needing only a minimal amount of categorical feature transformation. The trees are instead generated by CatBoost by enforcing the requirement that all nodes at the same grade test the same predictor under the same conditions (symmetric trees). As a result, an index of a leaf can be determined using bitwise operations. The tree process enables a straightforward fitting strategy, and the tree architecture functions as a regularization to identify the best solution and prevent overfitting. In other words, symmetric trees, also known as balanced trees, in CatBoost refer to trees where the separating criterion holds true for all nodes at the same level of the tree. On the other hand, XGBoost produces asymmetric trees, which means the separation criteria for every node within the same level can vary. Figure 6 displays CatBoost structure versus XGBoost structure. The hyperparameters of CatBoost and their optimized value are listed in Table 5. CatBoost structure versus XGBoost structure. Hyperparameter of CatBoost model.
Genetic programming model
GP, an evolutionary computing algorithm, finds extremely realistic near-optimal expressions for complex systems using heuristic optimization techniques (Naderpour et al., 2022). The following explanations provide a summary of the primary GP evolutionary cycle processes: 1. Initialization: immediately after the analysis starts, GP model initializes a predetermined number of arbitrary candidates. The candidates are sometimes referred to as a model or a solution. Trees expressions are frequently employed to represent GP individuals. Figure 7 depicts a straightforward illustration of a GP model that is similar to equation (5). The population is the group of candidates with which GP begins. 2. Fitness evaluation: following initialization, a preset assessment metric is used to investigate the performance of the produced individuals. The correlation between the predicted values acquired by the candidate and the required real values is measured using the root mean squared error. Through this procedure, the created candidates' predictability quality is assessed. 3. Reproduction: at this point, a series of GP operators are executed to candidates that were probabilistically chosen based on their fitness value. The selection strategy refers to the manner in which this procedure is carried out. The more weight a candidate possesses, the more likely it is that they will be chosen. Some genetic operations are: (1) Crossover: Crossover is the process by which two candidates interchange some of their structural components. Crossover operators come in a variety of forms, from simple to complicated. In the straightforward crossover scenario shown in Figure 8, two candidates swap randomly chosen subtrees from their models; (2) Mutation: This operation is performed on a single individual by replacing a random location in the tree model of the individual with a different randomly created subtree. The rate at which the mutation operator is used is typically lower than the crossover rate. That is done to prevent the mutation operator’s substantial alterations from significantly destroying quality among individuals. The mutation operator, on the other hand, aids in preserving variation over the period of iterations. The newly created populations substitute the old ones after executing genetic operations repeatedly; (3) Elitism: Elitism is a straightforward strategy frequently used to keep the best candidates in the population and pass them unchanged to the following generation. A simple GP candidate. GP crossover procedure.


Evaluation metrics
The root-mean-squared error (RMSE, Equation (6)), coefficient of determination (R2, Equation (7)), mean absolute percentage error (MAPE, Equation (8)), and median absolute error (MAE, Equation (9)) are employed to assess the efficiency of four ML models. A better fit is indicated by a higher R2 value. Lower RMSE, MAPE, and MAE values are preferred and signify better model performance.
Results and discussion
The ML models are created using the chosen dataset and the identified optimal hyper-parameters. Comparison of the predicted and observed PHL for training and testing subsets are used to assess the performance of the models. Figure 9 compares the predicted versus real values for four ML models in the training phase. The diagonal line (y = x) denotes that the estimation is equal to the real value. Performance indices of four models are listed in Table 6 for the training stage. Predicted values versus real values for training phase: (a) XGBoost; (b) RF; (c) CatBoost; (d) GP. Performance indices of four models – training phase.
It can be observed from Figure 9 that all the ML models obtain good outcomes as the dots (predicted values) are very close to the diagonal line except for the GP model. Overall, the CatBoost model achieves better performance than other models with RMSE, R2, MAPE, and MAE values of 96.89 mm, 0.987, 0.108, and 51.324 mm, respectively. The RF model obtains the second rank among the models in the learning phase. In addition, the GP model has the worst performance among all ML models in the training stage. In the training stage, the R2 values of the RF, CatBoost, and XGBoost models are all greater than 0.95, demonstrating the high level of estimation accuracy of these three models in the training stage. The RF model contains a single best quantitative metric in terms of the MAE with the lowest value of 42.57 mm.
The predictions of the models in the testing stage are compared with the experimental results in Figure 10. Also, Table 7 presents the performance indices of four models in the testing stage. By comparing the four quantitative metrics, the RF model has the four quantitative indicators with the best values in the testing stage than other models, namely the MAPE with the lowest value of 0.201, R2 with the highest value of 0.889, MAE with the lowest value of 100.974 mm, and the RMSE with the lowest value of 308.68 mm. The CatBoost model attains the third rank in the testing phase in terms of RMSE, R2, and MAPE. The RF model outperforms other models in the testing phase; therefore, it is selected as the best model. Predicted values versus real values for testing phase: (a) XGBoost; (b) RF; (c) CatBoost; (d) GP. Performance indices of four models – testing phase.
Existing PHL prediction models
Six existing empirical equations for the PHL of the RCSWs are also employed as a comparison to further support the suggested prediction model’s validity and show its efficiency. Bohl and Adebar (2011a) evaluated the pattern of inelastic curvatures that should be utilized to approximate the flexural displacement capacity of RCSWs using a nonlinear finite element model. A model that is able of estimating the equivalent PHL, as expressed in equation (13), was consequently established based on the range of RCSWs with different shear span ratios, wall lengths, and axial compression levels. Priestley et al. (2007) suggested a plastic hinge formula to satisfy serviceability and damage levels. The tension shift brought on by diagonal shear cracks and the strain penetration into the foundation are both taken into consideration by the model as factors of the bar diameter and the ultimate-to-yield strengths ratio of the rebars. Kazaz (2013) developed a model based on the regression analysis considering wall length, shear span ratio, axial load ratio, and the ratio of the reinforcement. The finite element shear wall model and Turkish seismic code regulations were also used to calibrate the model. In order to determine the length of the plastic hinge zone, two distinct approaches were used (11). The first approach used an analysis of the wall-cross sectional strain pattern to determine the ultimate curvature pattern when the outside tension steel achieved a strain of 0.06; while the second technique involved calculating the curvature pattern employing the shell element strains added to the finite element model at the wall’s opposite edges. Hoult (2022) used the same database used in this paper along with conventional regression methods and presented a new relationship for PHL.
The following is a concise summary of the empirical equations:
NZS 3101.1:2006, ASCE/SEI 41-06 (2006)
Where Scatter diagrams of the existing equations results. (a) Eq.10; (b) Eq.11; (c) Eq.12; (d) Eq.13; (e) Eq.14; (f) Eq.15. Performance measures of empirical equations.
Interpretation of best ML model
The majority of ML-based models typically do not produce mathematical expressions that clearly connect the predicted outcome to the input parameters. This study utilizes a state-of-the-art method (SHAP (Lundberg and Lee 2017) method) to describe the outputs of the RF model and identify the most notable variables in determining the PHL. In order to create an understandable model, SHAP employs an additive characteristic assignment method, where an outcome framework is described as the linear sum of input parameters. The explaining model
Figure 12(a) displays the importance of each input variable. It can be observed from Figure 12(a) that the most important input variable is the wall effective height ( SHAP method plots. (a) Relative importance; (b) Summary plot.
Regression models
In this section, the regression method is used to develop the model. Two cases are considered. The first case (C-1) is to consider samples that only have a rectangular section (because there are many of them in the database). The second case (C-2) considers all the data but with four input parameters (wall effective height, the wall length, ALR, and SSP) because based on the results of the SHAP method, the ratio of the secondary cracking has less impact than other parameters. Figure 13 compares the output of the regression models with the experiments. Table 9 presents performance measures of regression models. The performance metrics RMSE, R2, and MAE of 449.83, 0.766, and 247.13 are achieved in C-1 which are 17.12%, 44.8%, and 36.0% better than C-2. In comparison with Table 7, ML models outperform the regression models in terms of RMSE, R2, and MAE. The results show that the RF model could be employed in PHL approximation application with superior performance. Scatter diagrams of the models. (a) case 1 (C-1); (b) case 2 (C-2). Performance measures of regression models.
Web application
The excellent potential of the RF model for PHL calculation of RCSWs has been validated by prior comparisons. However, a significant obstacle to the practical application of ML techniques within routine design work has been the complexity of the processes involved in database development, model training, validation, and so on. To solve these problems, a user-friendly web application in Python has been developed that incorporates the model and its tuned hyperparameters. The web address provided in Appendix A, as seen in Figure 14, can be utilized to view the graphical user interface (GUI) of the RF model. Engineers only need to specify numerical inputs. When the “Run” button is pressed, the inputs are sent to the model that is running on a web server, and a forecast is displayed right away. Anyone can visit it using the provided URL without the need to install any additional software. GUI of the RF model.
Sensitivity analysis
In this section, a sensitivity analysis method (SAM) is conducted to quantify the sensitivity of a model’s state or outcome variations to fluctuations in each random parameter over the full range of the variable. The Sobol’s technique has received a lot of attention due to its simplicity and accuracy (Cacuci and Ionescu-Bujor, 2017; Ionescu-Bujor and Cacuci, 2017). The variance-based method, namely Sobol’s technique, is used as the SAM. This approach analyzes the model from a probabilistic point of view, treating the model’s inputs as an arbitrary vector using a joint probability density function. The model’s stochastic representation is shown as (Cacuci and Ionescu-Bujor, 2017; Ionescu-Bujor and Cacuci, 2017):
Figure 15 displays the results of the sensitivity analysis and shows that the wall length ( Sobol’ Total order results.
Conclusions
Due to their considerable stiffness, strength, and ductility, reinforced concrete structural walls (RCSW) are typically employed in structures to resist seismic loads. In addition, nonlinear numerical models are increasingly being used in earthquake engineering design. Therefore, it is crucial to create a practical model that can forecast the length of the plastic hinge of the RCSWs with acceptable accuracy. This paper assessed the application of some machine learning (ML) models in predicting the plastic hinge length (PHL) of RCSWs. Four ML models, namely Extreme Gradient Boosting (XGBoost), RF, Categorical features-support gradient Boosting (CatBoost), and Genetic Programming (GP), are used in this study. The following conclusions were drawn: 1. The results show the RF model outperformed other ML models with the lowest value of the MAPE, MAE, and RMSE as well as the highest value of R2. In the training phase, the CatBoost model had the best performance but in the testing phase, the RF model outperformed. 2. The most effective ML model was explained using the SHapley Additive exPlanation (SHAP) framework. The outcomes of the SHAP method showed that the effective parameters in the PHL were the ratio of shear span ( 3. The SHAP method revealed that the ratio of the shear span as well as the wall length both positively influence the PHL. However, the shear stress parameter (SSP) and the axial load ratio (ALR) showed a negative influence on the plastic hinge length. 4. Of the empirical equations available in literature for estimating the PHL of RCSWs, Hoult’s model was the most accurate. 5. Comparing the predictions of empirical equations and the RF based ML model, the latter provided much better predictions for the PHL of RCSWs. 6. A multilinear regression analysis was also performed to determine the PHL, using the existing wall database and the significant parameters identified by the SHAP method. The regression models yielded a large scatter in predictions of the PHL and were less precise than the RF ML-based model. 7. A user-friendly web application in Python has been developed that incorporates the model and its tuned hyperparameters. Users can obtain the PHL of any RCSW simply by inputting parameters and clicking a run button.
The presented ML models tend to perform significantly better than existing analytical models, for the database implemented in the study. However, a larger database with more parameters and datapoints per parameter might be needed to fully evaluate the model’s predictability. The model predictions could also be tested for hybrid walls such as concrete walls with stay-in-place fiber reinforced polymer (FRP) skins, and sandwich concrete walls. Future research can also concentrate on reducing the influence of unstructured and potentially abnormal data sources using abnormality detection methods. Novel, creative ML models must also be developed and evaluated. The bagging and bootstrap approach, as well as hybrid neural networks with metaheuristic algorithms, which are often used for creating ML models, can be used for this purpose.
Footnotes
Authors’ contributions
Mohammad Sadegh Barkhordari: formal analysis, investigation, resources, conceptualization, methodology, data curation, and writing—original draft. Akram Jawdhari: conceptualization, methodology, and writing—review. All authors have read and approved the final manuscript.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Appendix
Web application can be found online at: https://msbearthquake-plastichingelengthwalls1-streamphl-vb82yd.streamlitapp.com/
