Abstract
This study designs a new variant of the capacitated vehicle routing problem (CVRP) under a fuzzy environment. In CVRP, several vehicles start their journey from a central depot to provide services to different cities and finally return to the depot. This paper introduces an additional time beyond the service time at each city to fulfill the pre-ordered demands. The need for this excess service time is to provide the services to new customers who are not enlisted at the start of the process. It is a market enhancement step. The proposed model’s main objective is to find the maximum time-dependent profit by using the optimum number of vehicles in an appropriate route and spending optimum excess service time in each city. The model considers travel time and travel cost as fuzzy numbers. An expected value model (EVM) is formulated using the credibility approach on fuzzy variables. A hybrid meta-heuristic method combining a genetic algorithm (GA) and bacteria foraging optimization algorithm (BFOA) is designed to solve the proposed model. The proposed model is explained with the help of some numerical examples. Sensitivity analyses based on different independent parameters of the algorithms are also conducted.
Keywords
Introduction
In the competitive world of business, one of the most critical issues is to deliver the goods to the customers within a reasonable time and preferably as fast as possible once the order is placed. This issue is a significant operational process, and it has become a natural area of research in the field of supply chain and logistics. An efficient supply and distribution management can reduce a significant drop in the running cost for a company. In this regard, the vehicle routing problem (VRP) is a very well-known problem. In VRP, the location and demands are known for all the cities. In this problem, some vehicles start from a single fixed depot after serving all the customers in an appropriate route by maintaining the vehicle’s capacity constraints and returns to the starting point. It focuses on the minimization of the cost or distance traveled by those sets of vehicles.
This paper proposes a new variant in the domain of capacitated vehicle routing problems. It introduces an additional service time beyond its stay time to fulfill the previously ordered demands. In the real-world scenario, several times, while supplying the products ordered, customers may ask for more quantities of the same or different products to fulfill their recent excess demands. In this regard, we consider the above mentioned additional service time. The proposed model also considers that each city may have several customers. Some are enlisted in the system from the beginning of the process, and some are not. The basic need for this excess service time is to provide the services to new customers who are not enlisted at the start of the process. This is a step to increase the market of any product(s). However, there is no guarantee of fulfilling the excess demand as each vehicle has some fixed capacity, and for each vehicle, there is a maximum time limit. This excess service time in each city will return some extra earning and incur some expenditure. This paper considers a closed CVRP model having a single depot. Now the proposed model is illustrated below.
There is a depot or a central position from where all the vehicles start their journey in the capacitated vehicle routing problem. The proposed model is applicable in a salesman or medical representative problem where the depot denotes the medical house or warehouse locations. Every vehicle or salesperson spends some predetermined period in each city to deliver or to market their products. In this period, they are supposed to have some earnings or returns, as well as some expenses. So the revenues minus the costs to serve a particular city yields the profit in that city. This amount of profit in a city depends on the time a salesman spends there. All the cities must be served with base demands. This quantity is a deterministic term, and it is known earlier before the solution procedure is going to start. All the salesmen or vehicles begin at the same time, and each of them has different routes. All the routes are mutually exclusive, i.e., each city must be served by only one vehicle. All the vehicles return to the depot after the completion of their delivery. After serving the base demand of a city, the vehicle or salesman spends extra time to market or supply more products. This additional quantity of supply is called “excess demand.” The service time or unloading time for the base demand is known earlier, but the excess time is not concretely predictable. The total need, consisting of the base demands and extra demands for all the cities for a particular route, must be less than the vehicle’s capacity. There is an upper limit on the time before which all the conveyances have to return to the depot. The traveling time or cost from one city to another is equal for all the vehicles. Also, all the carriages have the same capacity. A specific part of each conveyance’s capacity is filled up for the base demand, and the remaining portion is used for the excess demand.
The solution methodology of any vehicle routing problem can be categorized in two different ways. One is the exact method, and the other is the meta-heuristic method. The exact methods are suitable for small-scale problems, but they used to take a long time to find the solution. Whereas the meta-heuristic processes can deal with large-scale problems that may not find the best solution, it may lead to a very close solution within very little time compared to the exact method. Here this paper uses a hybrid heuristic method combining the strength of genetic algorithm (GA) and bacteria foraging algorithm (BFO).
The significant contribution of this paper is listed below. A new variant of the capacitated vehicle routing problem is presented here. The system can work fine in an uncertain environment. A fuzzy model is also constructed for uncertain travel time and cost. A hybrid metaheuristic algorithm based on GA and BFO algorithm is designed to solve the proposed model. Sensitivity analyses are performed for all the independent parameters in the model.
The paper is organized using several sections. The introduction of the work is presented in the first section. The literature review is provided in Section 2. Section 3 deals with the motivation of this work. Some preliminaries required to understand the model are described in section 4. Section 5 presents the mathematical model formulation. The proposed GA-BFO based hybrid algorithm is described in Section 6. Section 7 consists of some numerical illustrations of different models. Section 8 deals with the result analysis. Finally, Section 9 concludes the work.
Literature review
The basic model of VRP is also called the capacitated vehicle routing problem (CVRP) [1]. There are several variations of CVRP problems. Based on the dynamicity of introducing new customers and their demand, CVRP problems can be classified into two groups. One of these is the static VRP [2], where the customers, as well as their needs, are fixed. Also, the demands are known before the vehicles start. The dynamic VRP [3] considers a part of the total customers’ demands to be known when the vehicles have already started. Based on a return to the depot after the completion of service, the CVRP problems can be classified into two groups. The closed VRP [4] allows all the vehicles to return to the depot after completing the service. Simultaneously, the open VRP (OVRP) [5] restricts the vehicles to return to the central depot after visiting the customers. This model is generally used when an organization uses the third party logistics for its supply and distribution of goods. In the VRP with time window (VRPTW) [6], the products have to be delivered to each customer within a predefined time window. In literature, some of the articles related to VRP consider the crisp time window; also, few studies in this domain consider fuzzy time window [7].
Based on the number of depots in the problem, VRP problems are classified as either single-depot VRP [8] or multi-depot VRP (MDVRP) [9]. There is another variation of VRP called green VRP [10], where the main area of concern is the reduction in carbon emission from vehicles. In 2014, a research work [11] was published on environmental VRP, focusing on the minimization of CO2 emission due to transportation. Here a hybrid artificial bee colony (ABC) algorithm was used and it performed slightly better than the existing method in the literature. Schyns [12] has presented a new variety of dynamic VRP that considered time windows, partial split delivery, and heterogeneous fleets. This problem was solved using an ant colony optimization (ACO). Hu et al. [13] proposed the advancement of simultaneous pickup and delivery VRP under closed-loop logistics by imposing the dynamic nature, uncertain pickup, and delivery of incompatible goods. In 2016, a heterogeneous VRP [14] with mixed backhaul was solved using label-based ACO by Wu et al. This problem also considers the time window. Another work on minimizing the time-dependent CO2 emission in VRP [15], especially in urban areas, was published in 2016. For solving the problem, a tabu search method was used. Du et al. [16] proposed a problem on multi-depot VRP in 2017. In this study, the authors considered hazardous materials. It was solved using fuzzy bi-level programming. Huang et al. [17] investigated a particular type of VRP that supports path flexibility, which allows selecting any of the direct paths from one city to its neighboring city depending upon several conditions like congestion in the network and the journey. In 2017, a new variety of VRP for electric vehicles was first introduced to overcome the limited driving range. It was solved by proper planning of battery charging stations having nonlinear charging functions [18]. Ng et al. [19] proposed a CVRP having strategies under time-dependent traffic congestion. It was solved using the multiple colonies ABC algorithm. In 2018, a new concept of time window assignment VRP having two-stage stochastic optimization [20] was published. Here, the product dependent time window and the delivery schedule were both considered. The work also includes sensitivity analysis on three different models. In the same year, another study [21] was investigated on VRP having a heterogeneous fixed fleet vehicle. This problem also considers fuel and carbon emissions.
Motivation
There are several works on different types of VRP that have already been published. However, none of the works has dealt with this type of extra service time that can fulfill some other customers’ requirements that may not be known a priory but belong to the same city where the vehicle is waiting for the service. In this paper, we consider extra service time to handle such a scenario. Sometimes, the enlisted customers may demand some more requirements beyond the pre-ordered amount because of several issues like weather effect, sessional, fashion trends, etc. This incremented demand produces more profit for the organization without investing in fuel costs to move vehicles to other cities. Most of the papers on VRP does not consider any time budget for the vehicles. Here we also consider maximum time budget constraints in our proposed model. It motivates us to develop a new model that can implement the above facts. Besides, most of the papers published on VRP are in a crisp environment. However, real-life problems are uncertain in nature. The fuzzy set theory has the flexibility to deal with this kind of uncertainty based on expert judgment, past data, and observations. Here, travel time, travel cost, and capacity constraint parameters are considered fuzzy numbers to implement the above model.
Mathematical preliminaries
This section presents some basic definitions and properties related to fuzzy numbers.
Fuzzy number
A fuzzy number [22] is a normal and convex fuzzy subset of
It is a fuzzy number represented as
If
The necessity of
The relationship between possibility and necessity is given by
The credibility of the fuzzy event
The credibility measure is defined as
The credibility of a triangular fuzzy variable
The expected value of a fuzzy variable
Now for any two bounded fuzzy variable
If
Then for a predefined credibility level β with 0 ⩽ β ⩽ 1
Now for a Trapezoidal fuzzy variable
Subject to
Then by the expected value model [23] the problem can be written as
Subject to
The concept of CVRP was first described by Dantzig et al. [25] in their work on the truck dispatching problem. In general, the CVRP can be stated as follows: Let G = (V, E) be a graph, V ={ j0, j1, j2, …… , j
n
} being the vertex set, where j0 refers to the depot, the customers are indexed j0, j1, j2, …… , j
n
, and E ={ (j
m
, j
n
) : j
m
, j
n
V } is the edge set. Let n be the number of customers, N ={ 1, 2, . . . , n } be the set of customers, and {0, 1, . . . , n} be the set of all customers and the depot, which is identified by 0. Let K = {1, 2, . . . , k} be the set of vehicles. Let d
i
denote the demand of customer iN, L
h
indicate the load of vehicle hK. Furthermore,
Let us define
Let c
ij
be the transport cost or distance between customer i and j. The mathematical model for the capacitated vehicle routing problem is as follows:
Subject to
In the above model, expression (19) represents the objective function, which minimizes the total distance traveled. The inequality (20) refers to the vehicle capacity constraints; that is, certain vehicles must deliver goods to customers within vehicle capacity; (21) represents that every demand node must be served. Equations (22) and (23) ensure that each demand node is served by the same vehicle.
The above model is used for minimization of cost based on distance. A little modification in the model mentioned above can be used for the minimization of time. Let t
ij
be the time needed to reach from customer i to customer j. The objective function, in this case, will be as follows:
But, it has been observed that, though in the cost minimization problem, the best solution incurs at the least amount of cost, it takes more time. Whereas, in the case of the time minimization problem, the best solution occurs at least time but assumes more cost. This multi-objective trade-off is optimized using the weighted average method. The objective function will be as follows:
Here c1 is the cost of traveling for one unit of time. This model yields optimized results in terms of cost awell as time. That is why this model is better to use in the case of profit maximization.
The list of parameters and variables referred to in the model with their description is presented below:
C ij Cost of traveling from i th city to j th city
R i (t) Return or earning for spending t time in the i th city
E i (t) Expenses for spending t time in the i th city
P i (t) Profit on the i th city after spending t time that is equal to the minus of E i (t) from R i (t)
u i Base demand in city i.
v i Excess demand in city i per unit of time
d i Total demand in city i, so d i = (u i + v i t i )
s i Service time to supply the base demand in i th city
t i Extra service time to supply the excess demand in i th city
T i Total service time in i th city, i.e., the addition of S i and t i
t ij Travel time from i th city to j th city
M Total tour time
N Count of all the cities
K Count of all the vehicles or salespeople
C Capacity of each vehicle
α Percentage of the capacity of the vehicle that is filled up for the base demands of the route of each vehicle
C1 Cost of traveling for one unit of time
Now the proposed model for a crisp variable can be formulated as follows,
Equation (28) represents the objective function, i.e., maximization of profit, which is the difference between the profit and expenditure.
Where
Equation (29) gives the profit of visiting the ith city by fulfilling some extra demand. t i is the extra service time needed to fulfill the excess demand. r i (t i ) and e i (t i ) are the return and expenditure respectively during the extra service time t i .
Subject to
Equations (30) and (31) indicate every city must be served by exactly one vehicle.
As
M is the maximum tour limit for each vehicle. The left-hand side of in Equation (33) represents each vehicle’s total tour time, which includes travel time from one city to another in its route, service time, and excess service time at each city. Therefore, in Equation (33) ensures the time budget constraints.
The inequality in (36) shows the excess demand constraint, which ensures that the total demand served by one vehicle must be less than the vehicle capacity. Condition (36) derived from the two restrictions (34) and (35). where (35) refers to excess demand constraint, and the in Equation (16) indicates base demand constraint.
Equations (37) and (38) shows that
Constraints (39) and (40) are used to maintain the maximum vehicle number constraints, i.e., the total number of vehicles that start from the depot and return to the same must not exceed K.
Sub-tour elimination constraint is represented by constraint (41).
Generally, in a real-life scenario, the profit-maximization mentioned above the CVRP problem is not deterministic because there are different uncertain parameters like the traveling cost and traveling time from one city to another city. In this paper, t
ij
, c
ij
and α are taken as TFN (Triangular Fuzzy Number) represented by (
Therefore, the fuzzy model can be written as
Subject to constraints (30) to (33) and (35) to (41) and
Now using the fuzzy expected value of the objective the above fuzzy model can be written as
Subject to constraints (30) to (33) and (35) to (41) and
As the fuzzy variable
For predefined credibility β, the fuzzy constraint can be written as
When we apply the expected value model, the objective (43) changes to (45), and the constraint (44) will be as follows
The purpose of the proposed GA-BFO hybrid algorithm is to maximize the profit by routing the vehicles in the optimized paths concerning the time, and the distance traveled and finding the excess service time to invest in each city. It combines the capability of both GA and BFO algorithms. The proposed model’s solution involves two decisions; first, each vehicle’s route serves all the cities, and second is the extra time to be spent on each city to gain more profit. Therefore, we use GA for finding the route of each vehicle and BFO to find the extra time to stay in each city. This paper uses natural number encoding for the genetic algorithm. Each chromosome is broken into some sub-routes based on the different destination cities’ demands and the vehicles’ capacity. Figure 1 represents the natural number encoding scheme, i.e., the depot is represented by 0, city 1 is represented by 1, and so on.

Encoding rule.
Here, GA finds the best-optimized paths concerning the time and the distance traveled. Each sub-route is passed through the BFO algorithm to get the excess service time spent in each city to get the highest return, excluding the cost. Finally, it concatenates all the extra service times to generate a complete list of excess time for the cities in different routes. This algorithm finds the best chromosome in each generation from the GA part, and then it passes all the sub-routes to the BFO part to get the best optimized excess service time to wait in each city. In this way, for each generation of GA, the BFO executes the same number of times as the number of vehicles used in the best path found through the GA. A flowchart of this proposed algorithm is depicted in Figs. 2 and 3.

Flowchart of the GA part.

Flowchart of the BFO part.
In the literature, several algorithms originate from the concept of biology. The Genetic algorithm is a state-of-the-art algorithm based on evolutionary strategy. The proper implementation of the algorithm helps to get better solutions in each successive iteration and these iterations continue until it converges or reaches the desired accuracy level. In this paper, we use a combination of natural numbers to encode the solution, where each natural number represents a city. Here we use tournament selection with selection pressure four, and for crossover operation, we choose partially mapped crossover (PMX) [27]. The mutation prevents the algorithm from being trapped in local minima. To implement GA, we use inversion mutation [28]. Two essential parameters of GA are crossover and mutation probability. Here we use adaptive crossover and mutation probability [29]. The first part of Fig. 2 shows the flowchart of GA.
The fundamental objective of VRP is to minimize the cost of the distance traveled and the total time of the traversal. Based on this concept, the fitness function used in GA is designed as follows.
The social behaviors of animals like food searching by ants, bird-flocking, and fish-schooling inspire scientists to develop so many meta-heuristic algorithms like ant colony optimization, particle swarm optimization, etc. The BFOA is also an example of such an algorithm. Passino proposed it [30], and it was a new nature-inspired algorithm in this series. This algorithm mimics the foraging behavior of E. Coli bacteria. At the time of foraging, the bacteria used to move with the help of a set of flagella. This movement of bacteria is generally of two types, tumble or swim. Depending on the food value or the health value, the new position the bacteria used to move straight (i.e., called swim) or in a different direction (tumble). These two steps are called chemotaxis movement and the purpose of these movements is to move towards a nutrient gradient and avoid a noxious environment. The problem with this algorithm is that it suffers from local minima, and its convergence rate is prolonged. We use an adaptive search mechanism and comprehensive learning strategy to increase the performance of the original BFO, called adaptive-comprehensive learning bacterial foraging optimization (ACLBFO) [31]. Here the extra service time spent on the cities of a vehicle’s route represents a bacteria, and the total profit in the corresponding cities of a route forms the health value. In the original BFO, a constant chemotaxis step length is used. This paper used an adaptive chemotaxis step length based on a non-linearly decreasing modulation model. The BFO algorithm is subdivided into four parts: chemotaxis, swarming, reproduction, and elimination and dispersal.
The steps of the BFO algorithm are described as follows: First, initialize all the parameters and positions: population (S), Number of chemotactic steps in life (N
c
), Maximum number of continuous swim (N
s
), Number of reproduction steps to be taken (N
re
), Number of elimination dispersal steps to be considered (N
ed
), Probability of elimination dispersal (P
ed
), Chemotaxis step length (C), etc. Evaluate the health value of the bacteria of the initial population. Find the pbest of each bacteria with the present values. For (elimination dispersal loop- N
ed
iterations) For (reproduction loop –N
re
iterations) For (chemotaxis loop- N
c
iterations) Calculate the chemotaxis step size using the Equation (33). Update the position of all the bacteria either by swim or tumble based on the health value and the value of N
s
. Evaluate the health value of all the new bacteria. Update the gbest and the pbest. End For (Chemotaxis loop). Sort bacteria in descending order based on the health values. Replace the last S/2 bacteria with the first S/2 bacteria. End For (Reproduction loop). Do Eliminate and dispersal step with probability P
ed
. End For (elimination dispersal loop). End
The steps of this proposed algorithm are described as follows: Initialize parameters: Population size P
S
Z; Maximum Generation MX
G
N ; Adaptive crossover probability and mutation probability parameters; Number of destinations NOD. Generate a random population using the natural number encoding method. Let Gn
count
= 1. If Gn
count
⩽ MX
G
N, then go to step5, else go to step 24. Decode chromosomes and calculate the corresponding fitness values. Let C = 2. Selection of parents using the Tournament selection method. Calculate p
c
and apply crossover. Calculate p
m
and apply the mutation. Store the offspring to form a new population. C = C + 2. If C ⩽ P
S
Z, then go to step 7, else go to step 13. Search the new population and store the best chromosome. Calculate the number of vehicles TNV required to fulfill the demands of the destination cities present in the best chromosome. Let VC = 1. If VC ⩽ TNV then go to step 17, else go to step 21. Apply the BFO algorithm on the VC
th
sub route. Store the extra service time for each city for VC
th
sub route. VC = VC + 1. Go to step 16. Store the concatenated list of extra service time and profit in BST
S
L. Gn
count
= Gn
count
+ 1 go to step 4. Find the best solution from BST
S
L in respect of profit.
Constraint handling
Constraint handling mechanism is a critical issue of solving constrained optimization problems while using the meta-heuristic algorithms. Most of the constraints, except the restrictions (30) to (36) and (39) to (41), are already explained in the mathematical model section. The others are defined in the literature. Therefore, only the above constraints are needed to be explained about its handling method.
In this paper, we use natural number encoding (a combination of 1 to n, where n is the number of the city in the problem) to represent chromosomes for finding the routes of each vehicle using GA. Constraints (30), (31), and (32) are automatically satisfied due to the encoding scheme used in this paper. We encode a bacterium in BFO with real numbers to optimize the excess service time. The total extra service time for a vehicle is equal to the difference between the predefined maximum allowable time and that vehicle’s tour time. We randomly split the whole excess service time among the cities in the route. Therefore, the constraint (33) is also preserved using the encoding scheme of the bacterium. Restrictions (34), (35), and (36) are the constraints related to vehicle capacity. When we evaluate each chromosome’s fitness, we take care of vehicle capacity constraints by varying the vehicle number. Conditions (39) and (40) are related to the number of maximum allowable vehicles. We make a significant penalty on fitness value if these two constraints are violated. Restraint (41) is used for sub-tour elimination; this is also maintained through the encoding scheme and binary decision variables.
Numerical Illustration
Test data for the crisp model
The proposed model is tested for a different set of cities (n = 10, 19, 28, 37, 46, 55, 64, 73, 82, 91). The traveling cost and traveling time between different cities are shown in Tables 1 and 2. The return and expenditure at the i
th
city for the extra service-time t is calculated using the following expressions.
The cost to travel between different cities (D for depot)
The time to travel between different cities (D for depot)
Values of different constants and BD and EDR
(Note: D for the depot, BD for Base Demand, EDR for Excess Demand Rate).
The proposed fuzzy model is tested for the 10 city problem, including depot. The traveling cost and traveling time between different cities are shown in Tables 4 and 5. The return and expenditure at the i th city for the extra service-time t is the same as the crisp problem. The fuzzy distance presented in Table 4 are generated based on crisp distance using the following formulae d l = d - random number between (1, 2, 3), d m = d and d u = d + random number between (1, 2, 3)
Fuzzy distance between different cities (D for depot)
Fuzzy distance between different cities (D for depot)
Fuzzy Time to travel between different cities (D for depot)
Where (d l , d m , d u ) as fuzzy distance and d as crisp distance.
The fuzzy demand in Table 5 are generated based on crisp demand using the following formulae
Where (Demand l , Demand m , Demand u ) is the fuzzy demand and demand is the crisp demand.
The proposed profit maximization VRP model is solved using four different metaheuristic algorithms, viz. GA, ALO, BFO, and proposed hybrid GA-BFO algorithm. All the algorithms are implemented using C language in the Linux environment running on an Intel Core i5 CPU T6500 of 2.44 GHz. Based on the several trial and error, we choose the algorithms’ optimal parameter settings and are presented in Table 6.
Parameter Settings of the algorithms
Parameter Settings of the algorithms
In the case of classical CVRP, it is observed that if we focus on the cost minimization, it takes a long time to complete the tour, whereas, if the focus is on the travel time minimization, then the cost due to distance increases. Table 7 and Fig. 4 show this trade-off of cost and time for the 10 city problem.
The cost-time trade-off for ten city problem
The cost-time trade-off for ten city problem

The cost - time trade-off for ten city problem.
The proposed model is solved for the above-mentioned input data set using the proposed GA-BFO hybrid algorithm. The results for the ten cities, including the depot, are presented in Table 8, and here the total tour time M is 30. The second column is used to present the derived path for the corresponding run. Here the 0′s in the path are used to indicate the depot, and the number of vehicles used is presented at the end of the path within parenthesis. The third column presents the excess time to wait in each city, corresponding to the path. For example, consider the first run, the path is 0962015730840 (3) and excess time in the cities are 7.80 0.10 6.10 2.20 6.40 0.20 3.20 8.30 5.70. It means the above path includes three vehicles and the paths are (Depot, 9, 6, 2, Depot),) and (Depot, 8, 4, Depot). The excess waiting time in city9 is 7.80, and the excess waiting time in city6 is 0.10. Similarly, the excess waiting time in city4 is 5.70. The fourth column indicates the total earnings for the corresponding path. The fifth column represents the cost involved in the corresponding path. Finally, the sixth column shows the net profit, i.e., the [return (fourthcolumn) - cost (fifthcolumn)]. The average value of the profits of all runs is 2306.41, the standard deviation is 14.4851, and the best profit obtained is 2323.49.
Results for the ten cities including depot when M (total tour time) = 30
After the value 30 for M (total tour time), it is increased step by step up to 80 with the gap of 10, resulting the profit is increasing up to a particular value of M (60), and then it is decreasing as the excess waiting after a specific time is not beneficial anymore. This fact is presented in Table 9, and it is clearly observed in Fig. 5.
Results for the ten cities including depot with varying M (total tour time)

Total tour time vs. Profit for ten city problem (crisp model).
The proposed model and the algorithm are also tested on a different virtual dataset with 19, 28, 37, 46, 55, 64, 73, 82, and 91 cities. The values of cost c ij and time t ij are chosen randomly within a range [t l , t u ] and [c l , c u ] using the time-cost relationship calculated from the data set used in the 10 city problem. In this paper, the value of t l = 0 and t u = 0 are taken into consideration. Now, for the values of a i , b i , c i , e i , BD and EDR are used repetitively, i.e., The a i values for city1, city11, city20, and city 29 are the same; similarly, all the other values are generated. The detailed results for 19 cities and the 37 cities problem with M = 80 are presented in Table 10 as an example.
Results for a different number of cities when M (total tour time) = 80
The result of the proposed hybrid method is compared with the GA, BFO, and ant lion optimization (ALO) algorithm. The comparisons are presented in Table 11.
Comparison of the results for different algorithms
It is clear from Table 11 that the proposed hybrid method yields better results compared to other methods we have considered. Out of 25 independent runs of each algorithm, the best are found for the comparison. The descriptive statistics of the results of 37 cities and 28 city problems are presented in Tables 12 and 13. A non-parametric statistical test, namely the Wilcoxon rank-sum test, is also conducted to compare the proposed algorithm with the other algorithms at a 5% level of significance. The null hypothesis is as follows:
Descriptive statistics of the results of 37 cities problems
Descriptive statistics of the results of 28 cities problems
The detailed results (p-values) are presented in Table 14.
The p- values of the Wilcoxon test for the different instances on a comparison of the proposed algorithm with the other algorithms
The average fitness of each generation using the four different algorithms for “Prob-10 91 city” is depicted in Fig. 6.

Generation vs. average fitness plot for Prob-10 91 city.
Finally, sensitivity analysis is performed on independent parameters, which shows the changes in the main output concerning the changes in different important parameters. It also helps to identify the most critical parameters. Here the sensitivity analysis is conducted with the parameters a i , b i , c i and e i . Here each of the parameters a i , b i , c i and d i are varied from -5% to 5% with a step-length of 2.5%. The sensitivity is presented in Table 15 and depicted in Fig. 7.
Sensitivity Table for different parameters

Sensitivity Graph.
In the case of fuzzy profit maximization CVRP, we observe that it shows a better profit compared to the results of a crisp model. The fuzzy model is solved using the expected value model (EVM), as mentioned earlier in this paper. The results of fuzzy profit maximization CVRP are given below for the different values of credibility level β.
The results show that the expected profit is directly proportional to the credibility level. This is presented in Table 16, and this fact is observed in Fig. 8.
Results for the ten cities including depot with varying credibility level (Value)
Results for the ten cities including depot with varying credibility level (Value)

Credibility level vs. Expected Profit for ten city problem (Fuzzy model).
This paper proposes a VRP with a tour time constraint to maximize profit in both crisp and fuzzy environments and solved using a GA-BFO hybrid metaheuristic algorithm. Here in Table 7 we presented a trade-off between cost and time while serving the customers, and we observe that time and cost are inversely proportional. From Table 8 we found that the algorithm is producing stable results with less variance in different executions. Table 9 presents the solution of 10 city problem for different tour time. We found the profit initially increasing with the increase of tour time but after a certain limit profit started decreasing. Tables 10 and 11 present the results of the proposed model for a different number of cities. From Tables 12 and 13, we observe that the coefficient of variance for the proposed hybrid GA-BFO algorithm is the smallest among the four algorithms, i.e., the proposed algorithm is more consistent than the other algorithms used here. Figure 6 presents a convergence graph of the algorithms used here. We visualize the steady and faster convergence of the proposed algorithm among the used algorithms from this figure. Table 14 gives the p-value matrix of the Wilcoxon’s Signed-rank test. From this table, we observe that in most cases, the proposed hybrid GA-BFO algorithm is statistically significant than the other algorithms. As we have used some independent parameters to construct the proposed model, we have performed the sensitivity analysis to find the impact of the independent parameters. From Table 15 and Fig. 7 we observe that the parameter b i has the most significant impact on the proposed model. The results of the proposed fuzzy model using chance constraints are presented in Table 16. The result of Table 16 is also visualized in Fig. 8; from this result, we observe that the profit is approximately proportional to the credibility level.
Conclusions
This paper has proposed a new variant of vehicle routing problem applied in several realistic scenarios. Here the VRP problem is considered as a profit maximization problem. This problem has also been illustrated with numerical data in a crisp and fuzzy environment. It shows that this problem is more suitable even in an imprecise paradigm, which fits better in the real-life implications since it produces better results in such cases. For the first time, GA-BFO based hybrid meta-heuristic algorithm has been successfully applied to the VRP problem with the extra service-time concept. The model formulation and algorithm is quite simple and general. The proposed model can be used to implement the sales and marketing of different product-based and service-based companies that want to spend more time in different cities to increase the profit. This profit-maximization VRP can also be extended to rough, fuzzy-rough, random, etc. uncertain environments.
This study can establish a perfect direction for future research to handle many profit maximization problems in a real-world scenario like the distribution of bakery products, dairy products, and perishable food products.
