A review of optimization techniques for cost estimation in software development

Abstract

In software development, cost estimation remains a significant challenge. Despite numerous research efforts, identifying an optimal technique applicable to all situations has proven difficult. The increasing adoption of agile development methods has further complicated accurate cost estimation. Recently, the application of optimization techniques, particularly metaheuristic optimization algorithms, has increased to enhance estimation accuracy and performance. However, there is a lack of systematic literature reviews exploring these optimization techniques in software cost estimation (SCE). This study aims to fill this gap by employing a systematic literature review (SLR) method to select, filter, and analyze relevant literature from 2019 to 2024. The review included 41 journal articles and 11 conference proceedings to evaluate the current development status of optimization methodologies in SCE. The study identified 20 key optimization algorithms with the top 6 most commonly used: Particle Swarm Optimization (PSO), Differential Evolution (DE), Genetic Algorithm (GA), Gray Wolf Optimization (GWO), Whale Optimization Algorithm (WOA), and Bat algorithm (BA). Their key contributions are parameter tuning, feature analysis, and model optimization. While optimization techniques hold promise, their application in cost estimation must carefully consider these challenges, such as premature convergence, sensitivity to initial parameters, etc. Findings also reveal significant limitations for Agile Software Cost Estimation (ASCE), such as the less applicability of traditional cost estimation techniques, lack of cost drivers for agile, more reliance on expertise and experience, and few public datasets. This research provides valuable insights for further exploration of optimization techniques in SCE.

Keywords

Optimization techniques software cost estimation metaheuristic optimization algorithms biology-based algorithms agile software development systematic literature review agile project management hybrid models parameter tuning

1. Introduction

Software cost estimation is a pivotal component in software engineering. It is the cornerstone for software development projects to achieve project goals under resources, time, and quality constraints. Improving cost estimation accuracy continues to be a significant challenge for project managers, as it is impacted by various factors such as project planning, budgeting, resource allocation, risks, and constraints. Over the years, numerous methods have been developed for SCE, generally divided into algorithmic and non-algorithmic approaches. Algorithmic methods employ mathematical models, such as Putnam’s framework and the Constructive Cost Model (COCOMO), which require substantial data and vary in mathematical complexity. Non-algorithmic algorithms rely on expert knowledge, including case-based reasoning (CBR), analogy, regression, expert judgment, etc. Jadhav et al.¹ used an automated text mining framework to analyze software development effort and cost estimation techniques over the past 50 years. The results show that, the most popular estimation techniques include fuzzy logic, artificial neural network (ANN), regression, analogy, COCOMO, optimization, use case point, function point, machine learning, COCOMO II, clustering, and CBR-based approaches. Ch Anwar ul Hassan et al.² advocated the use of more effective nature-inspired methods for cost estimation, and experimentally showed that metaheuristic algorithms perform well in handling optimization problems and obtaining accurate estimates. Vikram Singh et al.³ highlighted risk assessment and mitigation strategies are also crucial for improving the reliability of cost estimates. Identifying potential risks and targeted mitigation measures can improve the accuracy and stability of estimates. Overall, research in SCE has traditionally focused on developing and refining various models to enhance estimation accuracy.

Since the birth of the Agile Manifesto, Agile methods have brought higher development speed, more flexibility and effectiveness to software companies. A case-based empirical study⁴ confirmed the importance of agile in software development and these above factors prompted more organizations to switch from traditional methods to agile practices. The application of agile has also brought great challenges to the estimation of development cost. A survey conducted by Ahmed Shams et al.⁵ showed that traditional estimation techniques were not suitable for agile practices and the appropriate and the research of scientific estimation techniques for agile is becoming increasingly urgent. A comparative analysis of effort estimation in agile and non-agile software projects found, traditional cost estimation techniques focus primarily on activity-based costing and user perspectives and often fall short in Agile Project Management⁶. Fernandez-Diego et al.⁷ studied 73 papers and the results emphasized accuracy remains a major challenge in agile effort and cost estimation. One reason is that the uncertainty of requirements increases the complexity of estimation. Saeed et al.⁸ proposed in an empirical study that agile-based estimation models were still in the early stages of development and needed further improvement in order to be applied in practice. Therefore, it is necessary to analyze and discuss the cost estimation techniques applied to agile development.

Over the past thirty years, software engineering has extensively explored search problems using various swarm intelligence algorithms. An increasing academic focus has recently been on employing optimization algorithms to address cost estimation challenges. Shoran et al.⁹ suggest that integrating advanced technologies such as metaheuristic algorithms, machine learning approaches, deep learning techniques, and hybrid models can significantly advance SCE. Rankovic et al.¹⁰ highlighted a shift in cost estimation practices towards diverse ML techniques and hybrid methodologies that blend parametric and non-parametric models. Mohamedyusf et al.¹¹ noted that metaheuristic algorithms are computationally simple yet powerful, capable of identifying optimal solutions under challenging conditions without restrictive assumptions about the search space. This suggests that metaheuristic algorithms are becoming central to SCE, making it crucial to study and analyze the development trend of optimization techniques in this field, identify application challenges, and explore solutions.

This research examines the current state of optimization methodologies in SCE through a systematic literature review (SLR) from 2019 to August 2024. It aims to identify the most widely used optimization algorithms in SCE and provide valuable insights into how optimization techniques can be effectively applied in software development. At the same time, this study also pays attention to the application status of optimization algorithms in agile cost estimation.

The content of this paper is organized as follows: Section 2 details the methodology and approach used for the systematic literature review. Section 3 overviews the data analysis results and provides a report. Finally, Section 4 discusses the conclusions and recommendations for future research.

2. Methodology

Kitchenham, Dyba, and Jørgensen¹² proposed the concept of evidence-based software engineering (EBSE), advocating for the use of systematic literature reviews (SLRs) as a methodological approach to achieve unbiased aggregation of empirical findings. Contract to expert reviews that rely on occasional literature selection, SLRs employ a methodologically rigorous process to review research outcomes. Beyond aggregating existing evidence on a specific research question, SLRs also facilitate the creation of evidence-based guidelines for practitioners.¹³ Therefore, this study follows the SLR guidelines set forth by Kitchenham¹³ to gather, evaluate, and analyze optimization techniques used in SCE over the last six years, from 2019 to August 2024. This approach ensures a comprehensive and structured review of relevant literature, providing a detailed understanding of the current trends and developments in the field.

2.1. Research issue

This study addresses the following research issues through a comprehensive literature review.

RQ1:
What are the prevalent optimization algorithms utilized within the SCE domain?
RQ2:
What specific challenges within SCE are typically addressed by applying optimization algorithms?
RQ3:
How have optimization algorithms been employed in ASCE over the past six years?
RQ4:
Which estimation algorithms are commonly combined with optimization algorithms in practice?
This investigation will provide a detailed understanding of the current trends and developments in applying optimization techniques in SCE.

Table 1.
Inclusion and exclusion criteria.

Inclusion Criteria

01 The literature has been published in journals or conferences.

02 The research objectives of the literature should cover cost estimation techniques for software development.

03 The study used optimization methods.

04 The language of the selected literature should be English.

05 The literature should be published in the latest 6 years (2019-2024).

Exclusion Criteria

01 The literature is not relevant to the research issues.

02 Only one copy of duplicate literature will be kept.

03 Not written in English

2.2. Inclusion and exclusion criteria

Inclusion Criteria
01	The literature has been published in journals or conferences.
02	The research objectives of the literature should cover cost estimation techniques for software development.
03	The study used optimization methods.
04	The language of the selected literature should be English.
05	The literature should be published in the latest 6 years (2019-2024).
Exclusion Criteria
01	The literature is not relevant to the research issues.
02	Only one copy of duplicate literature will be kept.
03	Not written in English

Specific inclusion and exclusion criteria were employed to identify the necessary and appropriate literature sources following Kitchenham¹³ guidelines and detailed in Table 1.

2.3. Data gathering

Four data sources were employed in the study: IEEE Xplore Digital Library (IEEE), Web of Science(WOS), Google Scholar, and Scopus, as shown in Table 2. These sources were selected for literature selection and gathering because they are widely used in SCE.

Table 2.
The data sources selected.

Data Source URL

IEEE https://ieeexplore.ieee.org/

WOS https://www.webofscience.com/

Google Scholar https://www.googlescholar.com/

Scopus https://www-scopus-com-s.web.bisu.edu.cn/

Data Source	URL
IEEE	https://ieeexplore.ieee.org/
WOS	https://www.webofscience.com/
Google Scholar	https://www.googlescholar.com/
Scopus	https://www-scopus-com-s.web.bisu.edu.cn/

The following formulated search strings were utilized when searching these sources: (“Software cost estimation” and “Optimization algorithm”) or (“Software cost estimation” and “Optimization technique”) or (“Software cost estimation” and “Optimization model”) or (“Software cost prediction” and “Optimization algorithm”) or (“Software cost prediction” and “Optimization technique”) or (“Software cost prediction” and “Optimization model”) or (“Software cost evaluation” and “Optimization algorithm”) or (“Software cost evaluation” and “Optimization technique”) or (“Software cost evaluation” and “Optimization model”) or (“Software cost estimation parameter” and “Optimization algorithm”) or (“Software cost estimation parameter” and “Optimization technique”) or (“Software cost estimation parameter” and “Optimization model”).

2.4. Data screening

The study utilized a tollgate approach to filter the selected literature through the data screening procedure, as illustrated in Figure 1. Tollgate approach is a structured project management and quality control method. The core of tollgate approach is to divide the entire project or process into multiple stages, and set a “checkpoint” (Tollgate) at the end of each stage.¹⁴ Through strict inspection, it is verified whether the goals of the current stage have been achieved and whether the conditions for entering the next stage have been met. This study applied this method to ensure a standardized process for screening data and the quality and consistency of results.

Figure 1.

Data screening procedure.

This procedure comprises six phases. In phase 1, the search string was used to search for literature from four data sources. 884 articles were collected from the IEEE database, 1,250 from the WOS, 150 from Google Scholar, and 278 from the Scopus. In phase 2, papers searched from the four data sources were individually processed using a three-step screening method. In the first step, only journal and conference papers were retained based on their category. In the second step, papers unrelated to SCE topics were excluded. Finally, papers not employing optimization algorithms were removed. This process ensured that the remaining literature was both relevant to the research objectives and of high quality. The remaining were 15(IEEE), 30(WOS), 72(Google Scholar), and 76(Scopus). In phase 3, the four data were merged to obtain 193 papers. Accounting for articles indexed in multiple data sources, the phase 4 checked for duplicates and retaining only one unique version of each paper. This process resulted in a final dataset of 122 articles. Among these, 7 articles related to agile software development, including effort and cost estimations. Since effort estimation and cost estimation are distinct sub-fields, the literature focused on effort estimation from the 122 was removed in stage 5 to ensure the accuracy of the screening. This left 47 relevant articles, but only 2 of them were related to agile, which was not enough to answer RQ 3. Therefore, the 47 articles were combined with the 7 articles related to Agile screened in the fourth stage, and after deleting duplicates, 52 articles were finally obtained.

3. Data analysis and report

4 of the 52 remaining articles are literature reviews, with a proportion of 7.69%, while the other 48 focus on cost estimation techniques, with a proportion of 92.31%, as shown in Figure 2. This indicates that the research focus of this field is on specific technical practices and there are relatively few summary discussions in the literature.

Figure 2.

Distribution of studies (article category).

11 articles are conference papers, representing 21.15% of the total, while the other 41 are journal papers, representing 78.85%, as shown in Figure 3. This suggests that research in this field tends to be published in high-quality journals.

Figure 3.

Distribution of studies (publishment category).

Figure 4 illustrates that the selected literature’s distribution over the past six years (2019 to 2024) varies. Each year from 2019, 2020, and 2023 has about 7 studies. The number of studies in 2021 and 2022 is significantly higher. But studies focusing on agile development consistently average about 2 articles per year, with 3 studies in 2022, above the average, and no studies in 2023 and 2024. From the overall trend, by 2022, the research on metaheuristic algorithms in the SCE field has shown an overall upward trend, and has maintained a high level of attention in the past two years. However, the attention paid to optimization technology in agile has declined in the past two years, and it is needed to further analyze the reasons.

Figure 4.

Distribution of selected studies (year-wise).

The basic information of the selected literature has been analyzed above. Based on the research issues, the following chapter will conduct data analysis and a discussion of the results.

3.1. RQ1: What are the prevalent optimization algorithms utilized within the SCE domain?

The optimization technique includes three primary categories: exact, heuristic, and metaheuristic optimization algorithms. Exact optimization algorithms can find the global optimal solution to the problem. Heuristic optimization algorithms are designed for specific problems and offer practical solutions for complex or large-scale issues, though they do not ensure a globally optimal solution. Metaheuristic optimization algorithms, operating at a more abstract level, oversee the exploration process of heuristic algorithms and can be customized to address a broad spectrum of problems. Metaheuristic optimization algorithms are divided into three main categories: biology-based, physics-based, and other natural phenomena-inspired approaches, as shown in Figure 5. Biology-based optimization algorithms are especially noteworthy, inspired by natural evolutionary mechanisms and biological processes. Within this category, there are two primary subtypes: swarm-based algorithms and evolution-based algorithms. These algorithms are particularly effective for feature weighting and selection tasks.¹⁵ Overall, metaheuristic algorithms developed best in optimization technology and have become one of the research hotspots.

Metaheuristic optimization algorithms have demonstrated strong performance in solving optimization problems across various domains due to their unique characteristics, such as a large search space and randomized selection techniques.¹⁶ Cost estimation problems inherently involve finding the optimal solution within predefined scope and time constraints. It is influenced by various factors, including scale, requirement complexity, team skill levels, etc., which often exhibit nonlinear relationships. These complexities make it challenging to develop a single, logical, and universally applicable estimation model, resulting in difficulties achieving high accuracy.¹⁷ Consequently, researchers in recent years have increasingly utilized metaheuristic algorithms to address the challenges of SCE.

This study identifies 20 primary optimization techniques employed for SCE, as shown in Table 3. The top six most commonly used optimization algorithms are Particle Swarm Optimization (PSO), Differential Evolution (DE), Genetic Algorithm (GA), Gray Wolf Optimization (GWO), Whale Optimization Algorithm (WOA), and Bat algorithm (BA). Among them, PSO is the most frequently used technique with 11 instances, followed by the DE algorithm with 7 uses and GA with 5 uses. This suggests these are currently the most popular optimization methods in SCE.

Figure 5.

The category of optimization algorithms.

Table 3.

Optimization techniques used for sce.

Optimization techniques		Usage type		Incorporate optimization techniques
Particle Swarm Optimization (PSO)	11	independent	9
		mix	2	1) Neighborhood Search (NS) 2) Support Vector Machine (SVM), Random Forest (RF), Linear Regression (LR)
Differential Evolution (DE) algorithm	7	independent	7
Genetic Algorithm (GA)	5	independent	4
		mix	1	1) Environment Adaptation (EA) algorithm
Gray Wolf Algorithm (GWO)	4	independent	3
		mix	1	1) Strawberry Algorithm (SBA), Harmony Search Algorithm (HSA)
Whale Optimization Algorithm (WOA)	3	independent	1
		mix	2	1) Crow Search Algorithm (CSA) 2) Dragonfly Algorithm (DA)
Bat algorithm (BA)	3	independent	1
		mix	2	1) Dolphin algorithm 2) Ant Colony Optimization (ACO)
Flower Pollination Algorithm	2	independent	2
Ant Colony Optimization (ACO)	1	independent	1
Battle Royale Optimization (BRO)	1	mix	1	1) Quantum Ensemble Meta-Regression Technique (QEMRT)
Biogeography-Based Optimization algorithm (BBO)	1	independent	1
Ensemble Duck Traveler Optimization (eDTO)	1	independent	1
Forest-Moth Flame Optimization	1	independent	1
Firefly algorithm	1	independent	1
Antlion Optimization Algorithm (ALO)	1	independent	1
Evolutionary Cost-Sensitive Deep Belief Network (ECS-DBN) model	1	independent	1
Optainet algorithm	1	independent	1
Artificial immune network (aiNet)	1	independent	1
Taguchi method	1	independent	1
Learnable evolution model (LEM)	1	independent	1
Search-based approach	1	independent	1

The research results show that 39 studies use only one independent optimization algorithm, while 9 studies (18.75%) use a combination of multiple optimization algorithms. This indicates that independent usage is more common. Several algorithms are combined with other techniques, demonstrating a trend toward hybrid models. For example, PSO combines Neighborhood Search (NS) and machine learning methods like Support Vector Machine (SVM) and Random Forest.

Many listed algorithms are inspired by natural phenomena or animal behaviours, such as PSO, GWO, BA, WOA, ACO, etc. Some techniques, like the artificial immune network (aiNet), suggest an integration of optimization with machine learning approaches for SCE.

3.2. RQ2: What specific challenges within SCE are typically addressed by applying optimization algorithms?

This section examines the six most commonly used optimization algorithms identified in the study – PSO, DE, GA, GWO, WOA, and BA – and analyzes their contributions when applied to SCE.

3.2.1. Particle swarm optimization

PSO is a global optimization algorithm grounded in swarm intelligence, initially introduced by Eberhart and Kennedy.¹⁸ PSO draws inspiration from artificial life studies, treating individuals in the group as particles in the search space. Each particle traverses the solution space at a defined velocity, progressively aligning itself with its personal historical optimum and the historical optimum of its neighbouring particles to refine potential solutions.¹⁹ PSO employs the principles of “population” and “evolution” to facilitate the search for optimal solutions by capitalizing on the interactions within a group of particles.

The simplicity of PSO is derived from its solid biosocial foundation and the minimal number of required parameters, which contribute to its straightforward implementation. It is highly effective for global searches in nonlinear and multimodal problems and is gaining widespread attention in scientific research and engineering practice.

In this study, PSO is frequently used with other cost estimation techniques. Its primary functions include parameter tuning,^20–23 enhancing particle diversity,²⁴ feature influence analysis,²⁵ and improving search capability and convergence rate,^26,27 among others. Additionally, Venkataiah et al.²⁴ introduced a chaotic linear increasing inertia weight and diversity-enhanced PSO algorithm to optimize model accuracy. PSO is well-suited for continuous optimization problems and it efficiently identifies the global optimal solution within a multi-dimensional solution space.²⁸ Especially, it excels in handling cost estimation for complex projects, enabling rapid optimization of cost estimation model parameters and improving estimation accuracy.²⁹ This is an important reason why PSO is widely used in the field of SCE.

Numerous studies have focused on optimizing and improving PSO to enhance accuracy. The results show that half of the 10 papers use enhanced and combined methods to improve PSO. As shown in Table 4, there are two main approaches to these improvements. The first approach involves enhancing the PSO algorithm itself, leading to variants such as Improved PSO (IPSO),²⁷ Hybrid PSO (HPSO), and Diversity Enhanced PSO (DPSO).¹⁹ The second approach integrates PSO with other models, such as Chaotic Linear Increasing Inertia Weight,²⁴ Neighborhood Search (NS), and SVM²¹

Table 4.
Applications of PSO in SCE.

Case study reference Estimation technique Optimization technique Usage type for PSO Contribution

Maher and Alneamy²⁰ K-nearest neighbours (KNN), Support Vector Regression (SVR), Random Forest, Bayesian Regression, Linear Regression, Multilayer Perceptron (MLP) PSO regular Tuning hyperparameters to improve estimation accuracy

Wani and Quadri²⁷ Functional link Artificial Neural Network Improved particle swarm optimization (IPSO) improve Improve the proposed model byiteratively improving a candidate solution in the training stage

Venkataiah et al.²⁴ Chaotic linear increasing inertia weight and diversity improved PSO algorithm Chaotic linear increasing inertia weight and diversity improved PSO algorithm improve Predict software cost estimation and improve the diversity between particles

Jiang et al.²⁵ Estimating Software Cost with a Weighted Feature Selection and Support Vector Regression with Mixture of Kernels Ensemble Learning Method (WFS-SVR-MK) Hybrid particle swarm optimization algorithm (HPSO) improve Achieve the influence analysis of features and the optimization of the proposed model

Venkataiah et al.²⁶ COCOMO DPSONS-K based on Diversity Enhanced Particle Swarm Optimization (DPSO) and Neighborhood Search (NS) improve Improve its searching ability and convergence rate

Zakaria et al.²¹ COCOMO PSO, PSOSVM, PSORF, Support Vector Machine (SVM), Random Forest (RF), Linear Regression (LR) improve Optimizing the parameters of COCOMO models

Chhabra and Singh³⁰ Intermediate COCOMO, fuzzy logic-based estimation model PSO regular Optimizing the design parameters of fuzzy model

Khandelwal and Sharma²² COCOMO PSO regular Perform tuning of parameters

Kumari³¹ Function point PSO regular Find the Parameters of value adjustment factor in order to optimized the function point

Draz et al.²³ Convolutional Neural Network (CNN) PSO regular Optimize CNN parameters to better fit the model to the data, improve prediction accuracy, and avoid local optimal solutions

Case study reference	Estimation technique	Optimization technique	Usage type for PSO	Contribution
Maher and Alneamy²⁰	K-nearest neighbours (KNN), Support Vector Regression (SVR), Random Forest, Bayesian Regression, Linear Regression, Multilayer Perceptron (MLP)	PSO	regular	Tuning hyperparameters to improve estimation accuracy
Wani and Quadri²⁷	Functional link Artificial Neural Network	Improved particle swarm optimization (IPSO)	improve	Improve the proposed model byiteratively improving a candidate solution in the training stage
Venkataiah et al.²⁴	Chaotic linear increasing inertia weight and diversity improved PSO algorithm	Chaotic linear increasing inertia weight and diversity improved PSO algorithm	improve	Predict software cost estimation and improve the diversity between particles
Jiang et al.²⁵	Estimating Software Cost with a Weighted Feature Selection and Support Vector Regression with Mixture of Kernels Ensemble Learning Method (WFS-SVR-MK)	Hybrid particle swarm optimization algorithm (HPSO)	improve	Achieve the influence analysis of features and the optimization of the proposed model
Venkataiah et al.²⁶	COCOMO	DPSONS-K based on Diversity Enhanced Particle Swarm Optimization (DPSO) and Neighborhood Search (NS)	improve	Improve its searching ability and convergence rate
Zakaria et al.²¹	COCOMO	PSO, PSOSVM, PSORF, Support Vector Machine (SVM), Random Forest (RF), Linear Regression (LR)	improve	Optimizing the parameters of COCOMO models
Chhabra and Singh³⁰	Intermediate COCOMO, fuzzy logic-based estimation model	PSO	regular	Optimizing the design parameters of fuzzy model
Khandelwal and Sharma²²	COCOMO	PSO	regular	Perform tuning of parameters
Kumari³¹	Function point	PSO	regular	Find the Parameters of value adjustment factor in order to optimized the function point
Draz et al.²³	Convolutional Neural Network (CNN)	PSO	regular	Optimize CNN parameters to better fit the model to the data, improve prediction accuracy, and avoid local optimal solutions

3.2.2. Differential evolution

Rainer Storn and Kenneth Price³² proposed the Differential Evolution (DE) algorithm, inspired by evolutionary principles, to address the Chebyshev polynomial problem. The DE algorithm begins by creating a population of potential solutions,³³ derived by combining current solutions in the population using a basic formula. The algorithm then identifies the optimal solution. DE is similar to genetic algorithms in that it uses mutation and crossover techniques on current solutions within the population. DE is an evolutionary algorithm that relies on iterative group processes and performs random searches within continuous space using actual number encoding.³⁴ Its notable features include a straightforward design and effective operational efficiency. As shown in Table 5, DE has been frequently used for feature weighting,³⁵ enhancing diversity and convergence speed,^36–38 and adjusting parameters.^39,40 The DE algorithm is commonly combined with cost estimation techniques, with COCOMO being the most prevalent. Shailendra Pratap Singha et al.³⁶ developed a novel multi-objective differential evolution (MODE) which effectively enhances diversification and accelerates the convergence velocity of the optimal Pareto front, demonstrating notable performance compared to recent multi-objective DE optimization methods. DE is ideal for solving multi-dimensional nonlinear problems, especially in estimating costs for large-scale software projects.³⁶ Through mutation and selection operations, DE optimizes cost estimation under multi-objective constraints, improving the robustness of the model. The results show that most studies have improved DE by enhancing and combining, resulting in the development of a variety of techniques, including multi-objective DE, enhance-based DE, and HABDE, among others.

Table 5.
Applications of DE in SCE.

Case study reference Estimation technique Optimization technique Usage type for DE Contribution

Bardsiri³⁵ Analogy Differential Evolution algorithm regular Weight the features appropriately to propose similar function

Singh et al.³⁸ Multi-objective Differential Evolution (DE) algorithm Multi-objective Differential Evolution (DE) algorithm improve Enhance the diversity and convergence speed

Gouda and Mehta³⁷ COCOMO A syndrome-based Self-adaptive multi-objective Differential Evolution (MOSADE) algorithm improve Ensure a better convergence speed and diversity among the solutions

Gouda and Mehta⁴¹ COCOMO Syndrome-based Self-adaptive DE Algorithm (SABDE) improve Use the mutation strategy for exploitation and exploring according to the search space

Singh³⁹ COCOMO Enhance-based Differential Evolution algorithm (EABMO) improve Tuning the parameters of the model

Singh et al.³⁶ COCOMO Homeostasis adaption based mutation operator Differential Evolution (HABDE) improve Provide diversity in local and global search spaces

Gouda and Mehta⁴⁰ COCOMO the fuzzy C-means clustering algorithm Improved self-adaptive Differential Evolution algorithm improve Optimize the constructive cost model parameters

Case study reference	Estimation technique	Optimization technique	Usage type for DE	Contribution
Bardsiri³⁵	Analogy	Differential Evolution algorithm	regular	Weight the features appropriately to propose similar function
Singh et al.³⁸	Multi-objective Differential Evolution (DE) algorithm	Multi-objective Differential Evolution (DE) algorithm	improve	Enhance the diversity and convergence speed
Gouda and Mehta³⁷	COCOMO	A syndrome-based Self-adaptive multi-objective Differential Evolution (MOSADE) algorithm	improve	Ensure a better convergence speed and diversity among the solutions
Gouda and Mehta⁴¹	COCOMO	Syndrome-based Self-adaptive DE Algorithm (SABDE)	improve	Use the mutation strategy for exploitation and exploring according to the search space
Singh³⁹	COCOMO	Enhance-based Differential Evolution algorithm (EABMO)	improve	Tuning the parameters of the model
Singh et al.³⁶	COCOMO	Homeostasis adaption based mutation operator Differential Evolution (HABDE)	improve	Provide diversity in local and global search spaces
Gouda and Mehta⁴⁰	COCOMO the fuzzy C-means clustering algorithm	Improved self-adaptive Differential Evolution algorithm	improve	Optimize the constructive cost model parameters

3.2.3. Genetic algorithm

Genetic Algorithm (GA) was developed by Holland in 1970, and it is a robust and versatile optimization method based on Darwinian evolution principles, particularly natural selection and survival of the fittest.³² GA represented problem parameters as chromosomes composed of genes and iteratively search for optimal or near-optimal solutions by simulating biological processes like selection, crossover, recombination, and mutation. Natural variation introduced randomness, enabling exploration of new solution spaces and avoiding local optima, while genetic inheritance propagates advantageous traits, guiding the search toward optimal regions.⁴² These mechanisms balance exploration and exploitation, allowing GA to traverse a broader solution space effectively. GA is particularly suitable for large-scale, multidimensional, and nonlinear problems that are difficult to solve with traditional methods. GA can even be applied to systems that lack human knowledge and experience. However, due to its reliance on random search and sensitivity to parameter settings, GA is prone to problems such as slow convergence, easy to fall into local optimality, and unable to guarantee global convergence. They also require high computational complexity.⁴³ As shown in Table 6, GA frequently employed to adjust parameters and enhance the features of estimation models. Various studies have integrated GA with COCOMO^43–45 and COCOMO II⁴⁶ models to adjust or optimize parameters, addressing non-convergence issues and improving estimation accuracy. Amrita Sharma and Neha Chaudhary⁴⁷ contrasted agile and traditional software development methodologies utilizing neural networks (NN) and GA. And only 1 of the 5 papers improved GA by combining it with EA algorithm. The findings suggest that GA’s performance is notably enhanced when integrated with additional algorithms. Based on its powerful global search capability and special advantages for nonlinear and multi-objective optimization problems,⁴³ GA plays an important role in adjusting key parameters in cost estimation models, thereby improving the adaptability and performance of estimation models.

Table 6.
Applications of GA in SCE.

Case study reference Estimation technique Optimization technique Usage type for GA Contribution

Verma and Preet⁴⁴ Intermediate COCOMO model Genetic Algorithm (GA) regular Update the coefficients of intermediate COCOMO

Gandomani et al.⁴⁵ COCOMO Genetic Algorithm (GA), Environment Adaptation (EA) algorithm improve Improve the parameters of the COCOMO model to solve the non-convergence problem

Chhabra and Singh⁴³ COCOMO, fuzzy model Genetic Algorithm regular Further optimize the selection of parameters haracterizing fuzzy sets in proposed model

Sharma and Chaudhary⁴⁷ Machine Learning, Artificial Neural Network (ANN) Genetic Algorithm regular Optimize the parameters of the estimation model

Amarif and Owaydat⁴⁶ COCOMO II Genetic Algorithm regular Tuning parameters to improve accuracy

Case study reference	Estimation technique	Optimization technique	Usage type for GA	Contribution
Verma and Preet⁴⁴	Intermediate COCOMO model	Genetic Algorithm (GA)	regular	Update the coefficients of intermediate COCOMO
Gandomani et al.⁴⁵	COCOMO	Genetic Algorithm (GA), Environment Adaptation (EA) algorithm	improve	Improve the parameters of the COCOMO model to solve the non-convergence problem
Chhabra and Singh⁴³	COCOMO, fuzzy model	Genetic Algorithm	regular	Further optimize the selection of parameters haracterizing fuzzy sets in proposed model
Sharma and Chaudhary⁴⁷	Machine Learning, Artificial Neural Network (ANN)	Genetic Algorithm	regular	Optimize the parameters of the estimation model
Amarif and Owaydat⁴⁶	COCOMO II	Genetic Algorithm	regular	Tuning parameters to improve accuracy

3.2.4 Gray wolf optimization

GWO, proposed by Seyedali Mirjalili et al.,⁴⁸ is a bio-inspired algorithm modelled after gray wolves’ social architecture and trapping patterns. This algorithm emulates the hierarchical social structure and cooperative hunting behaviours of gray wolves, categorizing solutions into Alpha, Beta, Delta, and Omega roles to balance exploration and exploitation. The higher-level Alpha, Beta, Delta wolves focus on finding the best solution, while the lower-level Omega wolves increase diversity to ensure that the algorithm does not converge prematurely.⁴⁹ It mimics the wolves’ strategies of encircling, tracking, and attacking prey to refine solutions, dynamically adjusting search behaviour for effective global optimization. This hierarchical structure helps strike a balance between global and local search, improving the efficiency of the algorithm.⁵⁰

The GWO algorithm’s advantages include ease of execution, fast convergence to solutions, and minimal parameter requirements,⁵¹ contributing to its widespread use across various fields. However, GWO is prone to premature convergence to local optima and can struggle with large-scale or high-dimensional data. In complex optimization problems, dynamic parameter adjustment, hybridization with other algorithms, and population diversification can improve the balance between exploration and exploitation, reducing the risk of premature convergence to local optima in GWO.⁵⁰

As shown in Table 7, GWO is frequently employed to optimize the parameters of estimation models to improve accuracy. Numerous studies have utilized GWO to enhance the performance and precision of COCOMO^2,52,53 and Analogy-Based Estimation.⁵⁴ 2 of the 4 studies improved the performance of GWO by combining it with other optimization techniques. GWO’s simplicity and convergence characteristics make it suitable for multi-objective cost estimation problems. By simulating the hunting behaviour of wolves, GWO optimizes parameter configurations, striking a balance between estimation accuracy and search efficiency in complex projects.

Table 7.
Applications of GWO in SCE.

Case study reference Estimation technique Optimization technique Usage type for GWO Contribution

ul Hassan and Khan² COCOMO Strawberry Algorithm (SBA), Gray Wolf Algorithm (GWO), and Harmony Search Algorithm (HSA) improve Optimize coefficients

Putri et al.⁵² COCOMO II Fuzzy Gaussia enhanced GWO improve Improve COCOMO II’s cost estimation accuracy and the capital drivers’ quantitative values

Putri and Siahaan⁵³ COCOMO II GWO regular Improve the accuracy of COCOMO II

Gandomani et al.⁵⁴ Analogy-Based Estimation GWO regular Define the weight of features in the stage of determining the similarity of projects

Case study reference	Estimation technique	Optimization technique	Usage type for GWO	Contribution
ul Hassan and Khan²	COCOMO	Strawberry Algorithm (SBA), Gray Wolf Algorithm (GWO), and Harmony Search Algorithm (HSA)	improve	Optimize coefficients
Putri et al.⁵²	COCOMO II	Fuzzy Gaussia enhanced GWO	improve	Improve COCOMO II’s cost estimation accuracy and the capital drivers’ quantitative values
Putri and Siahaan⁵³	COCOMO II	GWO	regular	Improve the accuracy of COCOMO II
Gandomani et al.⁵⁴	Analogy-Based Estimation	GWO	regular	Define the weight of features in the stage of determining the similarity of projects

3.2.5. Bat algorithm

Yang invented the BA, a biology-based optimization method inspired by bats’echolocation behaviour.⁵⁵ BA was built on three core principles of echolocation ranging, frequency and loudness modulation, and pulse firing rate control, which were inspired by the echolocation capabilities of small bats. This algorithm simulated the behaviour of approximately 996 species of bats, each of which exhibits unique echolocation abilities, emitting high-intensity ultrasonic pulses and using the reflected echoes to navigate and locate prey. These enable BA to efficiently explore the solution space and improve search accuracy, which has been adapted into an optimization method for engineering problems.

BA has shown effectiveness in various optimization scenarios. It is characterized by its simplicity, stability, and flexibility. This is because BA simulates bats’ ultrasonic echolocation behaviour to locate prey. It adjusts frequency, loudness, and pulse emission rates to control search range and accuracy.⁵⁶ By combining local and global search strategies, it enhances robustness and optimization capability. This algorithm is also simple in structure and easily extendable. However, BA’s performance can be sensitive to parameter settings and may face challenges balancing exploration and exploitation.⁵⁷ As illustrated in Table 8, BA has been used to determine optimal initial weights⁵⁸ and to optimize parameters in cost estimation models.^59,60 It is frequently integrated with COCOMO II and other estimation models to improve optimization performance. Ch Anwarul Hassan⁵⁸ proposed a method that combines Ant Colony Optimization (ACO), Bat Algorithm (BAT), and Hybrid Ant Colony Optimization with Bat Algorithm (HACO-BA) with COCOMO, demonstrating that HACO-BA outperformed other methods. A novel approach combining biology-based algorithms with Deep Neural Networks (DNN) was also introduced to verify the performance of HACO-BA. Experimental results showed that the estimation model combining HACO-BA with DNN performed better than NN in terms of execution time and accuracy,⁵⁸ while NN requires more time in trainning to achieve similar estimation accuracy. Moreover, all 3 papers improved the performance of BA by combining it with other techniques. BA optimizes multi-objective cost estimation problems by simulating the echolocation behaviour of bats, improving the efficiency of parameter optimization. Its combination of local and global search capabilities ensures high performance in scenarios requiring precise estimation.⁶¹

Table 8.
Applications of BA in SCE.

Case study reference Estimation technique Optimization technique Usage type for BA Contribution

ul Hassan et al.⁵⁸ COCOMO II DNN ACO, BAT, HACO-BA improve Give the best values for initial weights to train the network more preciously

Fadhil et al.⁶⁰ COCOMO II Dolphin swarm algorithm and hybrid bat algorithm (DolBat) improve Optimize coefficients of the cost estimation models

Arora et al.⁵⁹ The adaptive neuro fuzzy inference system (ANFIS) The novel Energy-Efficient BAT (EEBAT) technique improve Optimize the parameters of ANFIS

Case study reference	Estimation technique	Optimization technique	Usage type for BA	Contribution
ul Hassan et al.⁵⁸	COCOMO II DNN	ACO, BAT, HACO-BA	improve	Give the best values for initial weights to train the network more preciously
Fadhil et al.⁶⁰	COCOMO II	Dolphin swarm algorithm and hybrid bat algorithm (DolBat)	improve	Optimize coefficients of the cost estimation models
Arora et al.⁵⁹	The adaptive neuro fuzzy inference system (ANFIS)	The novel Energy-Efficient BAT (EEBAT) technique	improve	Optimize the parameters of ANFIS

3.2.6. Whale optimization algorithm

The Whale Optimization Algorithm (WOA) also belongs to the swam-based optimization algorithms. It was developed by Seyedali Mirjalili in 2016 and inspired by the hunting patterns of humpback whales, particularly the feeding mechanism of the bubble net.⁶² This strategy is a unique method whales use to hunt fish by creating spiral-shaped bubbles around their prey.^63,64 WOA is usually introduced to solve complex optimization problems. It is growing and gradually being applied to scenarios in various industries to solve global optimization problems.

As shown in Table 9, WOA is usually used to adjust parameters such as regression coefficients,⁶⁵ weights,⁶⁶ and network parameters⁶⁷ to optimize them and improve the models’ prediction ability. According to the analysis results, the optimization technologies used in combination with WOA include the Kernel Logistics Regression model, Linear Regression model, Radial basis function neural network (RBFN), functional link artificial neural network (FLANN), and Multilayer Perceptron (MLP). The WOA were usually improved by combining with WOA include the Crow Search algorithm (CSA) and Dragonfly Algorithm (DA). WOA enhances the predictive capabilities of cost estimation models by simulating the predatory behaviour of whales to optimize key variables. Its strong ability to avoid local optima makes it well-suited for handling project data with high uncertainty and complexity.⁶⁸

Table 9.
Applications of WOA in SCE.

Case study reference Estimation technique Optimization technique Usage type for WOA Contribution

Ahmad and Bamnote⁶⁵ Kernel Logistics Regression model, Linear Regression model Whale–crow optimization (WCO)algorithm, whale optimization algorithm (WOA), crow search algorithm (CSA) improve Optimal tuning Of the regression coefficients

Kaushik et al.⁶⁶ Radial basis function neural network (RBFN) and functional link artificial neural network (FLANN) Whale optimization algorithm (WOA) regular Optimize the weights of the neural network models

Vanathi et al.⁶⁷ Multilayer Perceptron (MLP) Dragonfly Algorithm (DA), Whale Optimization Algorithm (WOA) improve Optimize network parameters, improve MLP weight and bias settings, and enhance the model’s predictive power

Case study reference	Estimation technique	Optimization technique	Usage type for WOA	Contribution
Ahmad and Bamnote⁶⁵	Kernel Logistics Regression model, Linear Regression model	Whale–crow optimization (WCO)algorithm, whale optimization algorithm (WOA), crow search algorithm (CSA)	improve	Optimal tuning Of the regression coefficients
Kaushik et al.⁶⁶	Radial basis function neural network (RBFN) and functional link artificial neural network (FLANN)	Whale optimization algorithm (WOA)	regular	Optimize the weights of the neural network models
Vanathi et al.⁶⁷	Multilayer Perceptron (MLP)	Dragonfly Algorithm (DA), Whale Optimization Algorithm (WOA)	improve	Optimize network parameters, improve MLP weight and bias settings, and enhance the model’s predictive power

This analysis reveals that each of these algorithms–PSO, DE, GA, GWO, BA, and WOA–has unique strengths and applications in SCE. PSO excels in global searches and is often combined with other techniques for parameter tuning and feature analysis. DE is noted for its simplicity and efficiency, and it is frequently used for feature weighting and enhancing diversity. GA has robust optimization capabilities and is commonly employed for parameter adjustment and model feature enhancement. GWO, inspired by wolf behaviour, offers rapid convergence but can struggle with large-scale data. BA shows promise in determining optimal weights and parameters. WOA is increasingly applied to optimize various model parameters. Additionally, DE and GA perform well for multi-dimensional and nonlinear estimation problems. PSO and WOA are suitable for optimizing complex projects requiring continuous refinement. GWO and BA are more effective for cases with simpler parameter settings. Choosing the right optimization technique should considering characteristics of the algorithms, project complexity and data size. While each algorithm presents specific advantages, they face challenges such as premature convergence or sensitivity to parameter settings. Hence combining multiple optimization algorithms is a recommended approach to enhance their performance. Existing studies have shown that, integrating these algorithms with traditional cost estimation models like COCOMO has shown significant improvements in accuracy and performance, highlighting the potential of hybrid approaches in addressing the complexities of cost estimation problems.

3.3. RQ3: How have optimization algorithms been employed in ASCE over the past six years?

The evolution of optimization techniques in ASCE remains limited. The literature review reveals that only 7 articles have explored optimization methods in this field over the past six years. These articles primarily focus on biology-based approaches, including artificial immune networks (aiNet), Whale Optimization Algorithm (WOA), forest-moth flame optimization, Genetic Algorithm (GA), Bat Algorithm, Antlion Optimization (ALO), and the Evolutionary Cost-Sensitive Deep Belief Network (ECS-DBN), as shown in Table 10. Despite this variety, the scope of optimization research in ASCE is still relatively narrow.

Table 10.
Applications of optimization techniques in ASCE.

Case study reference Estimation technique Optimization technique Contribution

Najm et al.⁶⁹ Support vector machine (SVR) Artificial Immune Network (ai Net) Seek optimal parameters of the SVR-RBF model

Kaushik et al.⁶⁶ Radial basis function neural network (RBFN) and functional link artificial neural network (FLANN) Whale Optimization Algorithm (WOA) Optimize the weights of RBFN and FLANN

Gupta and Mahapatra⁷⁰ Artificial neural network (ANN), deep belief network (DBN), heuristically improved hybrid learning (HIHL) Solution index-based forest-moth flame optimization Weight optimization

Sharma and Chaudhary⁴⁷ machine learning Artificial neural network (ANN) Genetic Algorithm (GA) Optimize the parameters of the estimation model

Arora et al.⁵⁹ The adaptive neurofuzzy inference system (ANFIS) The novel Energy-Efficient BAT (EEBAT) technique Optimize the parameters of ANFIS

Kaushik et al.⁶ Deep Belief Network (DBN) Antlion Optimization Algorithm (ALO) Initialize the weights between the hidden layer of the third RBM stack and theoutput layer as it provides the optimal value

Premalatha and Srikrishna⁷¹ Evolutionary Cost-Sensitive Deep Belief Network (ECS-DBN) model Evolutionary Cost-Sensitive Deep Belief Network (ECS-DBN) model Reduce the overall cost of the training dataset

Case study reference	Estimation technique	Optimization technique	Contribution
Najm et al.⁶⁹	Support vector machine (SVR)	Artificial Immune Network (ai Net)	Seek optimal parameters of the SVR-RBF model
Kaushik et al.⁶⁶	Radial basis function neural network (RBFN) and functional link artificial neural network (FLANN)	Whale Optimization Algorithm (WOA)	Optimize the weights of RBFN and FLANN
Gupta and Mahapatra⁷⁰	Artificial neural network (ANN), deep belief network (DBN), heuristically improved hybrid learning (HIHL)	Solution index-based forest-moth flame optimization	Weight optimization
Sharma and Chaudhary⁴⁷	machine learning Artificial neural network (ANN)	Genetic Algorithm (GA)	Optimize the parameters of the estimation model
Arora et al.⁵⁹	The adaptive neurofuzzy inference system (ANFIS)	The novel Energy-Efficient BAT (EEBAT) technique	Optimize the parameters of ANFIS
Kaushik et al.⁶	Deep Belief Network (DBN)	Antlion Optimization Algorithm (ALO)	Initialize the weights between the hidden layer of the third RBM stack and theoutput layer as it provides the optimal value
Premalatha and Srikrishna⁷¹	Evolutionary Cost-Sensitive Deep Belief Network (ECS-DBN) model	Evolutionary Cost-Sensitive Deep Belief Network (ECS-DBN) model	Reduce the overall cost of the training dataset

Analysis of these 7 articles reveals that 3 focus on optimizing parameters,^47,59,69 3 aim to optimize model weights,^6,66,70 and 1 article addresses reducing the overall cost of the training dataset,⁷¹ as shown in Table 10. Overall, these optimization techniques consistently aim to enhance the precision and efficiency of cost estimation methods for Agile.

This demonstrates a trend towards leveraging advanced computational methods to address the complexities and uncertainties inherent in ASCE. Applying nature-inspired optimization algorithms and neural network-based approaches suggests that researchers are exploring sophisticated, adaptive methods to capture the nuances of Agile project dynamics. This integration of machine learning and optimization techniques represents a promising direction for improving the precision and reliability of cost estimation in Agile.

3.4. RQ4: Which estimation algorithms are commonly combined with optimization algorithms in practice?

Many studies have explored integrating cost estimation methods with optimization techniques to enhance accuracy. The results indicate that the most frequently used cost estimation models include COCOMO, COCOMO II, Artificial Neural Networks (ANN), Regression, Fuzzy Logic-based estimation model, Support Vector Regression (SVR), and Analogy, as depicted in Figure 6. COCOMO is the most commonly referenced, followed by COCOMO II. Combining various estimation models with optimization algorithms is standard practice, as it improves the precision and effectiveness of cost estimation models.

Figure 6.

The cost estimation techniques combined with optimization algorithms.

4. Discuss and conclusions

This comprehensive analysis of 52 papers on SCE reveals a significant trend toward integrating optimization algorithms with traditional cost estimation models. In the past few years, the research on the application of metaheuristic algorithms in the field of SCE has shown a steady upward trend, and the research focus has been on high-quality technical practices published in journals, with fewer literature review studies. The study identifies 20 primary optimization techniques, with PSO, DE, GA, GWO, BA, and WOA emerging as the most frequently used. These biology-based algorithms are often combined with established models like COCOMO and COCOMO II to enhance accuracy and performance. The research shows a preference for using single optimization algorithms (81.25% of studies), though there’s a growing trend towards hybrid models that combine multiple techniques. Each primary algorithm offers unique strengths in parameter tuning, feature analysis, and model optimization. However, they also face challenges such as premature convergence or sensitivity to initial parameters. The application of optimization techniques in ASCE is still in its early stages, with only 7 relevant articles identified in the past 6 years and a rare integration of different optimization techniques. Especially in the last two years, the number of studies on the application of metaheuristic algorithms to ASCE has decreased. Existing studies primarily focus on biology and neural network-based approaches and aim to optimize parameters, weights, or overall model performance. The main reasons for this phenomenon include: 1) Most existing cost estimation techniques are predominantly tailored to traditional development methods, which require well-defined requirements and early planning. However, Agile emphasizes iterative and progressive planning and estimation.⁷² Due to the characteristics of Agile methods, ASCE faces significant challenges, including variability in project requirements, the dynamic nature of agile methodologies, and the complexity of integrating metaheuristic algorithms with existing estimation models. 2) There is also a lack of research on cost drivers specific to agile software development,⁸ with additional cost drivers for agile projects often not considered. These features further complicate accurate cost estimation. 3) ASCE research is more conceptual than model-driven.⁶⁶ Standard agile estimation methods, such as Expert Opinion, Analogy and Disaggregation, Planning Poker, and Use Case Points, heavily rely on the development team’s expertise and experience. 4) Research into ASCE is progressing more slowly than that into non-agile development. Kaushik et al.⁶ attribute this to the limited availability of publicly accessible agile project data, contrasting with the more abundant non-agile data.

5. Recommendations for future research

Based on the above research, future research should address the gaps identified, explore new optimization techniques, and improve the cost estimation accuracy. 1) The trend toward combining multiple biology-based optimization algorithms with cost estimation models will likely continue. Future research may focus on creating more sophisticated hybrid models that leverage the strengths of various techniques while mitigating their individual weaknesses. 2) With the increasing complexity of hybrid models, future research may also focus on improving the precision and interpretability of optimized assessment models. 3) Machine learning will likely become increasingly significant in SCE with the growing use of neural network-based approaches. Future work may explore deep learning techniques and their integration with optimization algorithms for more accurate and adaptive estimation models. 4) Given the currently limited research in optimization techniques for ASCE, efforts must increase to develop and refine optimization algorithms tailored to Agile projects. This study thoroughly analyzes 20 optimization methods in SCE, details their applications, and highlights the critical optimization methods and their contributions. It also underscores the need for further research into optimization methods for ASCE, particularly biology-based algorithms and machine learning techniques, to refine and enhance the precision of ASCE models. This study provides an essential reference for the applications and advancement of optimization technology in the field of SCE. It also lays the foundation for future in-depth research.

Footnotes

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the Fundamental Research Grant Scheme (FRGS): FRGS/1/2022/ICT01/UKM/02/1, Ministry of Higher Education (MoHE), Malaysia.

Conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

ORCID iDs

Xiaoyan Zhao

Zulkefli Mansor

Rozilawati Razali

Xuwei Guo

References

Jadhav

Kaur

Akter

. Evolution of software development effort and cost estimation techniques: five decades study using automated text mining approach. Mathemat Probl Eng 2022; 2022: 1–17. DOI: 10.1155/2022/5782587.

ul Hassan

Khan

. An effective nature inspired approach for the estimation of software development Cost. In: 2021 16th International conference on emerging technologies (ICET), 2021, pp.1–6. DOI: 10.1109/ICET54505.2021.9689832.

Mittal

Varun

. Risk analysis in software cost estimation: a simulation-based approach. TURCOMAT 2021; 12: 2176–2183. DOI: 10.17762/turcomat.v12i6.4822.

Parveen

. Rapid digital transformation using agile methodologies for software development projects. Res J Comput Sci Inf Technol 2021; 5: 54–64. DOI: 10.54692/lgurjcsit.2021.0503218.

Shams

Bohm

Winzer

, et al. App cost estimation: evaluating agile environments. In: 2019 IEEE 21st conference on business informatics (CBI), 2019, pp.383–390. Moscow, Russia: IEEE. DOI: 10.1109/CBI.2019.00050.

Kaushik

Tayal

Yadav

. A comparative analysis on effort estimation for agile and non-agile software projects using DBN-ALO. Arab J Sci Eng 2020; 45: 2605–2618. DOI: 10.1007/s13369-019-04250-6.

Fernandez-Diego

Mendez

Gonzalez-Ladron-De-Guevara

, et al. An update on effort estimation in agile software development: a systematic literature review. IEEE Acc 2020; 8: 166768. DOI: 10.1109/ACCESS.2020.3021664.

Saeed

Khan

Naeem

, et al. An Empirical Investigation on Cost Estimation Challenges in Agile Software Development (ASD) Context. In: 2021 International conference on frontiers of information technology (FIT), 2021, pp.188–193. Islamabad, Pakistan: IEEE, DOI: 10.1109/FIT53504.2021.00043.

Shoran

, et al. Enhancing software cost estimation using COCOMO cost driver features with battle royale optimization and quantum ensemble meta-regression technique. In 2023 14th International conference on computing communication and networking technologies (ICCCNT), 2023, pp.1–6. DOI: 10.1109/ICCCNT56998.2023.10307113.

10.

Rankovic

Ivanovic

, et al. Convergence rate of artificial neural networks for estimation in software development projects. Inf Softw Technol 2021; 138: 106627. DOI: 10.1016/j.infsof.2021.106627.

11.

Mohamedyusf

Sharif

Ghareb

. A new approach for software cost estimation with a hybrid tabu search and invasive weed optimization algorithms. UHD J Sci Technol 2024; 8: DOI: 10.21928/uhdjst.v8n1y2024.pp42-54.

12.

Kitchenham

Dybå

Jørgensen

. Evidence-based software engineering. In: Proceedings - 26th International conference on software engineering, 2004, pp.273–281.

13.

Kitchenham

Brereton

Budgen

, et al. Systematic literature reviews in software engineering – A systematic literature review. Inf Softw Technol 2009; 51: 7–15. DOI: 10.1016/j.infsof.2008.09.009.

14.

Muiño

Akselrad

. Gates to success - ensuring the quality of the planning. In: PMI Global congresses-EMEA, Amsterdam, North Holland, The Netherlands. Newtown Square, 2009. [Online]. Available at: https://www.pmi.org/learning/library/gates-success-tollgate-methodology-6842 (accessed 1 January 2025).

15.

Behera

Sahoo

Pati

. A review on optimization algorithms and application to wind energy integration to grid. Renewa Sustain Energy Rev 2015; 48: 214–227. DOI: 10.1016/j.rser.2015.03.066.

16.

Khan

Jabeen

Ghouzali

, et al. Metaheuristic algorithms in optimizing deep neural network model for software effort estimation. IEEE Access 2021; 9: 60309–60327. DOI: 10.1109/ACCESS.2021.3072380.

17.

Rashid

, et al. Software cost and effort estimation: current approaches and future trends. IEEE Access 2023; 11: 99268–99288. DOI: 10.1109/ACCESS.2023.3312716.

18.

Eberhart

Kennedy

. A new optimizer using particle swarm theory. In: MHS’95. Proceedings of the sixth international symposium on micro machine and Human science, 1995, pp.39–43. DOI: 10.1109/MHS.1995.494215.

19.

Afida

, et al. Particle swarm optimized back propagation neural network for state of health estimation of lithium-ion battery. jkukm 2024; 36: 365–373. DOI: https://doi.org/10.17576/jkukm-2024-36(1)-34.

20.

Maher

Alneamy

. An ensemble model for software development cost estimation. In 2022 5th international seminar on research of information technology and intelligent systems (ISRITI), 2022, pp. 346–350. DOI: 10.1109/ISRITI56927.2022.10052861.

21.

Zakaria

, et al. Optimization of COCOMO model using particle swarm optimization. Int J Adv Intell Inform 2021; 7: 177–187. DOI: https://doi.org/10.26555/ijain.v7i2.583.

22.

Khandelwal

Sharma

. Adaptive and intelligent swarms-based algorithm for software cost estimation. J Mult-valued Log S 2023; 40: 415–432.

23.

Draz

Emam

Azzam

. Software cost estimation predication using a convolutional neural network and particle swarm optimization algorithm. Sci Rep 2024; 14. DOI: https://doi.org/10.1038/s41598-024-63025-8

24.

Vehkataiah

Nagaratna

Mohanty

. Application of chaotic increasing linear inertia weight and diversity improved particle swarm optimization to predict accurate software cost estimation. Int J Electr Electron Res 2022; 10: 154–160. DOI: https://doi.org/10.37391/IJEER.100218

25.

Jiang

Zhou

Zhang

. Estimating software cost with a weighted feature selection and support vector regression With mixture of kernels ensemble learning method. IOP Conf Ser: Earth Environ Sci 2019; 252: 052124. DOI: https://doi.org/10.1088/1755-1315/252/5/052124

26.

Venkataiah

Nagaratna

Mohanty

. Integration of diversity enhancement of particle swarm optimization and neighbourhood search with k radius to predict software cost estimation. Int J Intell Syst Appl Eng 2022; 10: 348–362.

27.

Wani

Quadri

Smk

. An improved particle swarm optimisation-based functional link artificial neural network model for software cost estimation. Int J Swarm Intell 2019; 4: 38–54. DOI: https://doi.org/10.1504/IJSI.2019.097408

28.

Singh

Sharma

Kumar

. An efficient approach for software maintenance effort estimation using particle swarm optimization technique. Int J Recent Technol Eng 2019; 7: 1–6.

29.

Shahpar

Khatibi

Khatibi Bardsiri

. Hybrid PSO-SA approach for feature weighting in analogy-based software project effort estimation. J Artif Intell Data Min 2021; 9: 329–340. DOI: https://doi.org/10.22044/jadm.2021.10119.2152.

30.

Chhabra

Singh

. Optimizing design of fuzzy model for software cost estimation using particle swarm optimization algorithm. Int J Comput Intell Appl 2020; 19: 2050005. DOI: https://doi.org/10.1142/S1469026820500054.

31.

Kumari

. Software development cost estimation methods and particle swarm optimization model. Int J Inf Retr Res 2019; 5: 2349–6010.

32.

Storn

. Differential evolution – a simple and efficient heuristic for global optimization over continuous spaces. Differ Evol 1997; 11: 341–359. DOI: https://doi.org/10.1023/A:1008202821328.

33.

Oudah

Zaher

Hassanetal

. Literature review on differential evolution algorithm. J Univ Shanghai Sci Technol 2021; 23: 1577–1600. DOI: https://doi.org/10.51201/JUSST/21/06471.

34.

Mustafa

HMJ

Ayob

Nazri

AMZ

, et al. An improved adaptive memetic differential evolution optimization algorithms for data clustering problems. PLoS One 2019; 14: e0216906. DOI: https://doi.org/10.1371/journal.pone.0216906.

35.

Bardsiri

. An intelligent model to predict the development time and budget of software projects. Int J Nonlinear Anal Appl 2020; 11: 85–102. DOI: https://doi.org/10.22075/ijnaa.2020.4384

36.

Singh

Mehta

. Differential evolution using homeostasis adaption based mutation operator and its application for software cost estimation. J King Saud U Comp Inf Sci 2021; 33: 740–752. DOI: https://doi.org/10.1016/j.jksuci.2018.05.009.

37.

Gouda

Mehta

. A new evolutionism based self-adaptive multi-objective optimization method to predict software cost estimation. Softw Pract Exper 2022; 52: 1826–1848. DOI: https://doi.org/10.1002/spe.3092..

38.

Singh

Dhiman

Tiwari

, et al. A soft computing based multi-objective optimization approach for automatic prediction of software cost models. Appl Soft Comput 2021; 113: 107981. DOI: https://doi.org/10.1016/j.asoc.2021.107981.

39.

Singh

. Cost estimation model using enhance-based differential evolution algorithm. Iran J Comput Sci 2020; 3: 115–126. DOI: https://doi.org/10.1007/s42044-019-00049-8.

40.

Gouda

Mehta

. Software cost estimation model based on fuzzy c-means and improved self-adaptive differential evolution algorithm. Int J Inf Technol 2022; 14: 2171–2182. DOI: https://doi.org/10.1007/s41870-022-00882-4.

41.

Gouda

Mehta

. A self-adaptive differential evolution using a new adaption based operator for software cost estimation. J Inst Eng Ser B 2023; 104: 23–42. DOI: https://doi.org/10.1007/s40031-022-00801-y.

42.

Albadr

Tiun

Ayob

, et al. Genetic algorithm based on natural selection theory for optimization problems. Symmetry 2020; 12: 1–31. DOI: https://doi.org/10.3390/sym12111758.

43.

Chhabra

Singh

. Optimizing design parameters of fuzzy model based COCOMO using genetic algorithms. Int J Inf Tecnol 2020; 12: 1259–1269. DOI: https://doi.org/10.1007/s41870-019-00325-7.

44.

Verma

Preet

. Calibrating intermediate COCOMO model using genetic algorithm In: 2021 international conference on computing, communication, and intelligent systems (ICCCIS) , 2021, pp.174–179.

45.

Gandomani

Dashti

Nafchi

. Hybrid genetic-environmental adaptation algorithm to improve parameters of cocomo for software cost estimation In: 2022 second international conference on distributed computing and high performance computing (DCHPC), 2022, pp.82–85.

46.

Amarif

Owaydat

. An optimal optimization of software development cost estimation using genetic algorithm In: 2024 IEEE 4th international maghreb meeting of the conference on sciences and techniques of automatic control and computer engineering (MI-STA) , 2024, pp.654–659.

47.

Sharma

Chaudhary

. Analysis of software effort estimation based on story point and lines of code using machine learning. Int J Comput Digit Syst 2022; 12: 131–140. DOI: https://doi.org/10.12785/ijcds/1201012.

48.

Mirjalili

Lewis

Grey wolf optimizer. Adv Eng Softw 2013; 69: 46–61. DOI: https://doi.org/10.1016/j.advengsoft.2013.12.007.

49.

Albadr

MAA

Tirn

Ayob

, et al. Grey wolf optimization-extreme learning machine for automatic spoken language identification. Multimed Tools Appl 2023; 82: 27165–27191. DOI: https://doi.org/10.1007/s11042-023-14473-3.

50.

Ghafar

Khalid

Ali

, et al. Grey wolf optimization technique for hems using day ahead pricing scheme In: Advances on broad-band wireless computing, communication and applications , 2018, pp.25–36.

51.

Liu

As'arry

Hassan

, et al. Review of the grey wolf optimization algorithm: variants and applications. Neural Comput & Applic 2024; 63: 2713–2735.

52.

Putri

Siahaan

Fatichah

. A comparative study on COCOMO II model for cost estimation In: 2023 IEEE 13th international conference on control system, computing and engineering (ICCSCE) , 2023, pp.226–231. IEEE.

53.

Putri

Siahaan

. Improve the accuracy of software project effort and cost estimates in COCOMO II using GWO In: 2021 5th international conference on informatics and computational sciences (ICICoS) , 2021, pp.128–133. IEEE.

54.

Gandomani

Ansaripour

Dashti

. Enhancing analogy-based software cost estimation using Gray Wolf optimization algorithm. Research Square Platform LLC 2024. DOI: https://doi.org/10.21203/rs.3.rs-4406388/v1.

55.

Yang

X-S

. A New Metaheuristic Bat-Inspired Algorithm. Berlin Heidelberg: NINSO, 2010.

56.

Shehab

Abu-Hashem

Shambour

, et al. A comprehensive review of bat inspired algorithm: variants, applications, and hybridization. Arch Computat Methods Eng 2023; 30: 765–797.

57.

Umar

Rashid

. Critical analysis: bat algorithm-based investigation and application on several domains. World J Eng 2021; 18: 606–620. DOI: https://doi.org/10.1108/WJE-10-2020-0495.

58.

ul Hassan

Khan

Irfan

, et al. Optimizing deep learning model for software cost estimation using hybrid meta-heuristic algorithmic approach. Comput Intell Neurosci 2022; 1: 3145956. DOI: https://doi.org/10.1155/2022/3145956.

59.

Arora

Verma

Wozniak

, et al. An efficient ANFIS-EEBAT approach to estimate effort of Scrum projects. Sci Rep 2022; 12: 7974. DOI: https://doi.org/10.1038/s41598-022-11565-2.

60.

Fadhil

Alsarraj

RGH

Altaie

. Software cost estimation based on dolphin algorithm. IEEE Access 2020; 8: 75279–75287. DOI: https://doi.org/10.1109/ACCESS.2020.2988867.

61.

Madugula

Haritha

. A heuristic effort estimation method using BAT algorithm through clustering. Int J Innov Technol Explor Eng 2019; 8: 1536–1541. DOI: https://doi.org/10.35940/ijitee.I8242.078919.

62.

Mirjalili

Lewis

. The whale optimization algorithm. Adv Eng Softw 2016; 95: 51–67. DOI: https://doi.org/10.1016/j.advengsoft.2016.01.008.

63.

Fadhil

Alsarraj

. Exploring the whale optimization algorithm to enhance software project effort estimation. In: 2020 6th international engineering conference “sustainable technology and development" (IEC) , 2020, pp.146–151.

64.

Hassan

Abdullah

Zamli

, et al. Q-learning whale optimization algorithm for test suite generation with constraints support. Neural Comput & Applic 2023; 35: 24069–24090. DOI: https://doi.org/10.1007/s00521-023-09000-2.

65.

Ahmad

Bamnote

. Whale–crow optimization (WCO)-based optimal regression model for software cost estimation. Sadhana 2019; 44: 1–15.

66.

Kaushik

Tayal

Yadav

. The role of neural networks and metaheuristics in agile software development effort estimation. Intl J Inf Technol Proj Manag 2020; 11: 50–71.

67.

Vanathi

Anusha

Ahilan

, et al. Software cost and effort estimation using dragonfly whale optimized multilayer perceptron neural network. Alex Eng J 2024; 103: 30–37.

68.

Hassan

Abdullah

Zamli

, et al. Combinatorial test suites generation strategy utilizing the whale optimization algorithm. IEEE Access 2020; 8: 192288–192303. DOI: https://doi.org/10.1109/ACCESS.2020.3032851.

69.

Najm

Zakrani

Marzak

. An enhanced support vector regression model for agile projects cost estimation. IAES Int J Artif Intell 2022; 11: 265–275. DOI: https://doi.org/10.11591/ijai.v11.i1.pp265-275.

70.

Gupta

Mahapatra

. Automated software effort estimation for agile development system by heuristically improved hybrid learning. Concurr Comput Pract Exper 2022; 34: e7276. DOI: https://doi.org/10.1002/cpe.7267.

71.

Premalatha

Srikrishna

. Effort estimation in agile software development using evolutionary costsensitive deep belief network. IJIES 2019; 12: 261–269. DOI: https://doi.org/10.22266/ijies2019.0430.25.

72.

Vyas

Bohra

Lamba

DCS

, et al. A review on software cost and effort estimation techniques for agile development process. Int J Recent Res Aspects 2018; 5: 1–5.

A review of optimization techniques for cost estimation in software development

Abstract

Keywords

1. Introduction

2. Methodology

2.1. Research issue

2.3. Data gathering

Table 2. The data sources selected. Data Source URL IEEE https://ieeexplore.ieee.org/ WOS https://www.webofscience.com/ Google Scholar https://www.googlescholar.com/ Scopus https://www-scopus-com-s.web.bisu.edu.cn/

3.2.1. Particle swarm optimization

5. Recommendations for future research

Footnotes

Funding

Conflicting interests

ORCID iDs

References

Table 2.
The data sources selected.

Data Source URL

IEEE https://ieeexplore.ieee.org/

WOS https://www.webofscience.com/

Google Scholar https://www.googlescholar.com/

Scopus https://www-scopus-com-s.web.bisu.edu.cn/