Abstract
This paper investigates the electric vehicle routing problem with time windows and nonlinear charging constraints (EVRPTW-NL), which is more practical due to battery degradation. A hybrid algorithm combining an improved differential evolution and several heuristic (IDE) is proposed to solve this problem, where the weighted sum of the total trip time and customer satisfaction value is minimized. In the proposed algorithm, a special encoding method is presented that considers charging stations features. Then, a battery charging adjustment (BCA) strategy is integrated to decrease the charging time. Furthermore, a novel negative repair strategy is embedded to make the solution feasible. Finally, several instances are generated to examine the effectiveness of the IDE algorithm. The high performance of the IDE algorithm is shown in comparison with two efficient algorithms.
Introduction
During recent years, the vehicle routing problem (VRP) [1] has been researched in many types of applications. The VRP with time windows (VRPTW) is one of the most typical research problems, which aims to uncover the lowest cost routes from a single depot to a set of distributed customers, while satisfying the dispatching time windows constraint for customers and the load capacity for each dispatching vehicle. Many types of heuristics or meta-heuristics have been proposed to solve this type of problem. Solomon [2] considered the heuristic algorithm to solve the VRP, where the insertion heuristic shows excellent performance. Since then, many types of non-population based meta-heuristics have been developed to solve the complex optimization problem, such as the tabu search (TS) algorithm [3], the branch-and-price algorithm [4, 5], local search (LS) algorithm [6–11].
Furthermore, under the global background of energy conservation and environmental protection, electric vehicles (EVs) have gradually become connected with the VRP. Therefore, some researchers try to combine the environmental consciousness with the VRP. Bektas and Gilbert [12] presented the pollution-routing problem (PRP), which involves the cost of the greenhouse emission, the fuel, the total time of travelling and their costs. Erdoğan and Miller-Hooks [13] proposed the green vehicle routing problem (GVRP), which leveraged the alternative fuel stations (AFS) to overcome the limited range. Bruglieri et al. [14] applied the path-based method to efficiently solve the GVRP. However, the EVs cannot serve too many routes unless there are charging stations along the routes [15–19]. Generally, there are two critical issues of the electric vehicle routing problem (EVRP): how to fix the location of charging stations and how to charge the EVs.
To make the EVRP closer to reality, many researchers attempted to design how to choose the location of the charging stations. Adderly et al. [20] presented some measures, such as increasing the number of charging stations in the arterial routings to handle the emergencies of natural disasters. Breunig et al. [21] proposed the two-echelon EVRP that used the dynamic programming operation to acquire the best location of the charging station on a 2-level route. For the question of how to charge the EVs, the literature provided numerous in-depth studies. Desaulniers et al. [5] reported that reducing the charging capacity of the EVs but increasing the charging times may be better than fully charging. Keskin and Bülent [8] developed three different types of charging stations: normal, fast and superfast charging stations. Hõimoja et al. [22] conducted the typical curve that describes the current, terminal voltage and state of charging with time. Most authors considered the charging process as a linear function. However, the charging process is actually a nonlinear function. Pelletier et al. [23] described the nonlinear charging function of the EVs in detail. Some researchers used the piecewise linear function to describe the nonlinear charging function [11, 25]. Compared to the EVRP with linear charging constraints, the nonlinear charging constraint should receive more research focus. Therefore, the EVRP with time windows and nonlinear charging constraints (EVRPTW-NL) is more significance.
In addition, meta-heuristics have been recently presented to solve the realistic optimization problems, such as the artificial immune system algorithm [26], artificial bee colony (ABC) algorithm [27, 28], TS algorithm [29], genetic algorithm (GA) algorithm [30, 31], ACO algorithm [15, 32] and differential evolution (DE) algorithm [33]. Among these meta-heuristics, DE is considered to be a simple, reliable, robust, population-based algorithm that searches for the global optima by utilizing differences between contemporary population members [34]. Many scholars have utilized the DE algorithm to solve optimization problems, such as social learning [35], global numerical optimization problems over continuous space [36], scheduling raw milk transportation [37], pickups and deliveries problem with fuzzy time windows [38], multi-objective traveling salesman problem (TSP) [39, 40], fuzzy demands [41] and simultaneous pickups and deliveries of VRP [42], open VRP with uncertain demands [43], capacitated VRP [44], multi-objective VRP [45], and data driven safe vehicle routing analytics [46], green periodic competitive VRP under uncertainty [47]. According to the literature of the DE algorithm, it has been seen that DE algorithm is an efficient algorithm. There are fewer studies leveraging the DE algorithm for EVRP with time windows, and the EVRPTW-NL is also an NP-hard problem. For the reasons stated above, the aim is to develop a hybrid algorithm combining an improved DE and several heuristic (IDE) algorithm to solve the EVRPTW-NL in this study.
The main contributions of this paper are listed as follows: (1) the EVRPTW-NL is developed in this study; (2) a special encoding method is presented that considers charging stations features; (3) an initialization strategy of charging stations is proposed to guarantee the nonnegative battery level constraint of the EVs; (4) a hybrid algorithm combing an improved DE and several heuristic (IDE) is designed to solve the problem; (5) a battery charging adjust (BCA) strategy is presented to decrease the charging time; and (6) a novel negative repair strategy is embedded to make the solution feasible. The remainder of this study is organized as follows. Section 2 gives the problem description. The canonical algorithm is described in section 3. In section 4, the IDE algorithm is presented in detail. Section 5 presents the experimental results and compares with other algorithms. Finally, conclusions and future studies are presented in section 6.
Problem description
Problem definition
In this section, the characteristics of the EVRPTW-NL can be generalized as follows: There are some homogeneous EVs. The remaining battery capacity of the EV should be nonnegative all the time. The charging process is a nonlinear function.
In a realistic logistic system, a nonlinear function is related to the charging time. The EVs that start from the depot are assumed to be fully battery capacity, and all the charging stations can handle an unlimited number of vehicles simultaneously. It might be profitable to charge overnight at the depot from a practical point of view [48]. Meanwhile, under fierce market competition, attention must be paid to customer satisfaction. If vehicles arrive within the time windows of the customer, it will achieve a higher satisfaction level.
Moreover, the calculation of the customer satisfaction level is divided into two categories. One is based on the number of customers, which depends on the proportion of the number of customers (i.e., providing service in advance or delay) to the total customers. The other task is to use the time window to calculate the customer satisfaction level. If the time window is a hard window, the value of customer satisfaction is 0 or 1. A triangular fuzzy membership function describes the satisfaction level under the fuzzy time windows. Adopting a rating level method is different from other literature [29, 30]. The superiority of this method are as follows: First, the more important the customer, the stricter the time window; Second, a proportion is calculated by the time interval, which represented by the customer satisfaction level, and the level is integrated into the objective. These superiority make this problem more realistic.
It has following assumptions in this paper: Each customer must be served by only one vehicle. Each route considers the maximum trip time limit and does not exceed it. The remaining battery capacity of the EV is nonnegative all the time. The charging process is a nonlinear function. The problem is to minimize the weighted sum of the total trip time and customer satisfaction value.
Problem formulation
The EVRPTW-NL is defined on the graph G = (V, A), where V = {0} ∪ I ∪ F′. The 0 represents the depot, I and F′ are the set of customers and dummy vertices of charging stations, respectively. Let A = {(i, j) |i, j ∈ V, i ≠ j} be the set of arcs connecting vertices of V. Each arc (i, j) ∈ A is associated with a trip time t ij and a battery consumption e ij . In this paper, the battery consumption is equal to the distance. Each customer i ∈ I has a service time s i . In canonical VRPTW, every customer i ∈ I equips a time window [a i , b i ]. In this study, it sets a preferred time window [Eti, Lti] and an allowable time window [EEti, LLti]. Customers cannot visit after their allowable time window [49]. Each charging station corresponds to a piecewise linear approximation of the charging function which modified from Montoya et al. [11]. The piecewise linear approximation of the charging function according to the different charging level which corresponding to the different speeds of charging. There are four level stages in this function which corresponding different charging speed for each stage. The problem is to minimize the total trip time and maximize the customer satisfaction value.
The parameters and decision variables are defined as follows:
In addition, the corresponding formulas are shown as follows:
Objective:
Subject to:
Different weights have different optimization effects on each objective of the multi-objective problems, and the weight sum method can well balance of the optimization between two objectives. In this paper, the weighted sum method is used to solve the problem, which gives the corresponding weight value according to the importance of each objective, and then converts the problem into single objective by optimizing its linear combination [50–52]. The objective function of Equation (1) is to minimize the weighted sum of the total trip time and customer satisfaction value, i.e., minimize the total trip time and maximize the customer satisfaction value. Equation (2) ensures that each customer is served only once. Equation (3) indicates that the number of arrivals and leaves the same vertex is equal. Equation (4) ensures the EV is fully charged departs from the depot, and the remaining battery capacity of the EV maintains between 0 and the maximum battery capacity. Equations (5)–(8) track of battery consumption at each vertex. Equations (5)-(6) calculate the battery consumption between two adjacent customers. Equations (7)-(8) compute the battery consumption between the customer and the charging station. Equation (9) guarantees the EV with the remaining battery level can reach the depot. Equation (10) resets the battery capacity of the EV after charging and the remaining battery capacity of the EV is equal to the battery level after charging. Equation (11) tracks of battery capacity at each charging station and the remaining battery capacity of the EV maintains between 0 and the maximum battery capacity. Equations (12)–(19) define the remaining battery capacity when the EV arrives at (or leaves) the charging station and the charging time. Equations (12)–(15) calculate the charging time corresponding to the remaining battery capacity of the EV when it arrives at the charging station. Equations (12)-(13) denote the interval in which the remaining battery capacity (in percentage) when the EV arrives at the charging station belongs to the nonlinear charging function. Equations (14)-(15) calculate the current time corresponding to the current battery capacity. Equations (16)–(19) calculate the current time corresponding to the battery level after charging for the EV and the approach is the same as above. Equation (20) is the time difference between t′ and t″. Equation (21) is the time window constraints. Equation (22) represents the time relationship between two adjacent customers. Equations (23)-(24) guarantee that the sum of the total trip time and the demand of customers do not exceed the maximum trip time and load capacity of the EV, respectively. Equation (25) states the range of the variable. Equation (26) calculates the customer satisfaction level that discussed in sub-section 4.4 in detail.
As depicted in Fig. 1, there are eight customers and two charging stations. Customer parameters are shown in Table 1. The tolerate time is 0.5 units and the service duration for each customer is 0.5 units. The charging stations contain two types (normal and fast) of charging stations. The piecewise linear function maps the battery capacity q and o to the charging time e to d so as to estimate the time spent at the charging station, and a presents the time difference between e and d. The information of the EV is depicted in Fig. 1 (the maximum load capacity is 20, the maximum battery capacity is 9 units, and the maximum trip time is 40 units). In this example, Route 1 does not require visiting any charging stations. Route 2 recharges at the No.9 charging station, where the EV arrives at the charging station with the zero remaining battery capacity and recharges to o = 5. It used the piecewise linear function to estimate the recharging time. Therefore, Route 2 consumes 16.6 time units with the total of the trip time (15) and the recharging time (1.6). Finally, Route 3 consumes 5 recharging time units. Customer satisfaction value sv i is calculated by the Equation (26) and add it to the problem fitness. Table 1 shows that the sum of customer satisfaction value is 6.6. The weight w is 0.8, and the objective fitness of the small numerical example is 33.56.

Example of the EVRPTW-NL.
Customer parameters
In the canonical DE algorithm, several operators such as mutation, and crossover, should be performed iteratively, and therefore, the exploitation and exploration tasks can be implemented [33]. The framework of the canonical DE algorithm is given as follows.
Coding and initiation population
Each solution is represented by a real number in the canonical DE algorithm for solving the continuous optimization problem. Noted that the elements in each solution must be in the range between L
j
and U
j
, which solves the D-dimensional optimization problems.
Mutation operator
Mutation, as the core operator, plays a crucial role in the search of DE algorithm. The mutation operator utilizes the following formula:
To increase the potential diversity, the crossover operator is executed after the mutation operator. The crossover operator is defined by the following equation:
To obtain the next generation, the estimation and selection operator is given as follows. The better fitness value will be retained.
Repeat steps 3.2 to 3.4 until the iteration number reaches the maximum. It is noticed that the criterion is by means of the greedy selection.
The framework of the proposed IDE algorithm is given as follows. First, assign the customers without considering the battery capacity. The assignment of the vehicles to each customer and sequence for the vehicle route are determined. At this time the customer information, the vehicle information, the time windows, the load capacity constraints are considered (Refer to Equations (2)-(3); Equations (21)–(24)). And then, prerequisites of battery capacity constraint is considered to the solution (Refer to Equations (4)–(11)). The initialization strategy of charging stations is embedded into the routing planning and the description of it is in sub-section 4.3 in detail. When the remaining battery capacity of the EV is negative, it is necessary that insert the charging station to ensure the battery capacity constraint. Traverse all the charging stations at each position to find the suitable charging stations and locations for really inserting. Meanwhile, the battery charging adjust strategy is used to decrease the charging time and the description of it is in sub-section 4.7 in detail. When the EV reached the charging station for charging, according the starting and ending remaining battery capacity of the EV to calculate the charging time (Refer to Equations (12)–(20)). At last, the customer satisfaction level sv i is obtained (Refer to Equation (26)) that discussed in sub-section 4.4 in detail.
Encoding method
In this section, considering the problem constraints, two vectors are utilized to represent a solution, i.e., the vehicle assignment vector and the charging vector. The vehicle assignment vector is used to tell the assignment of the vehicles to each customer and provide sequence for the vehicle routes as well, and the charging vector tells whether a vehicle should be recharged at each charging station. Each vector contains a two-dimensional array. Figure 2(a) gives the illustrative example for the vehicle assignment vector, where the first dimension tells all the vehicles in the system and the second dimension represents the assigned customers for each vehicle. For example, there are three vehicles: the first vehicle serves two customers {1, 4}, and the other two vehicles serve {8, 3} and {5, 7, 2, 6}, respectively.

Encoding representation and decoding.
Figure 2(b) tells whether a vehicle should be recharged at each charging station, where the element is set to the charging station number if the vehicle should be recharged at the corresponding charging station, otherwise, it is set to “–1”. For instance, the first vehicle will not visit any charging stations. The second vehicle will be recharged at the charging station numbered 9, while the last vehicle will be recharged at the charging station numbered 10.
In decoding procedure, first, it can use the charging vector to judge whether a vehicle recharged at the charging station, and then combine the vehicle assignment vector to determine the recharging operation between customers or between customer and depot, and finally find the entire route. In Fig. 2(c), two customers {8, 3} are served by the EV_B, the second sequence of {–1, –1, 9, –1} represents that the EV_B will be recharged at the charging station numbered “9”. It can be observed that the recharging operation between the customers numbered 8 and 3. Therefore, the entire route of the EV_B is expressed as {0, 8, “9”, 3, 0}.
As mentioned above, the operations (like encoding, decoding, fitness evaluation) are illustrated as follows. Figure 3 presents a feasible Gantt chart. Each rectangle represents a customer. The customer parameters of the example (Fig. 1) are shown in Table 1. The vehicle assignment vectors and charging vectors are illustrated in Fig. 2. Without considering the battery capacity, the sequence of the EV_A is {1, 4}, the sequence of the EV_B is {8, 3}, and the sequence of the EV_C is {5, 7, 2, 6} (Fig. 2(a)). Then, prerequisites of battery capacity constraint is considered to the solution, to calculate the remaining battery capacity when the EV arrives at the customer. If the remaining battery capacity is negative, try inserting the charging station for recharging (that discussed in sub-section 4.3). Therefore, the charging vector is illustrated in Fig. 2(b). According to the Gantt chart, the trip time is obtained by the customer information and the charging time, and customer satisfaction level is calculated by the time windows and the time when the EV arrives at the customer. Consequently, the problem objective fitness is obtained by the weighted sum of the total trip time and customer satisfaction value.
The initialization strategy of charging stations
First, to assign the customers without considering the battery capacity, and then consider the prerequisites of battery capacity to the solution. It is necessary to record the battery consumption between customers and between customer and depot. Because some vehicles arrive at the customer, the remaining battery capacity may be negative. Therefore, it is necessary to consider inserting the charging stations for the charging operation. The example of inserting charging stations is illustrated in Fig. 4.

Gantt chart of the example.

The example of inserting charging station.
The number of negative remaining battery capacity represents s n . The detailed steps of initialization strategy of inserting charging stations are given as follows:
To better understand the process of inserting charging stations, it can be described as follows. Before inserting into the charging station, the first step is to get the sum of the negative remaining battery capacity when the vehicle reaches the customer and the current trip time. If the remaining battery capacity is negative, it is necessary to try inserting the charging station for charging. To obtain an optimal solution, try inserting each charging station at each location. After charging, it needs to recalculate the total trip time and the negative remaining battery capacity. If the negative value improves after inserting the charging station, there is still a negative battery capacity. Try considering inserting into other charging stations. After that, it could improve the negative remaining battery capacity. If the initial solution is feasible, to ensure whether the current trip time is improved after charging. Supposing that there is an improvement, the charging station numbered and location will also be updated and reserved.
Finally, after testing all charging stations for every location, select the best charging station to the best location for real insertion. The time complexity of the initialization strategy of charging stations is O (n3).
Recently, more and more people pay attention to the service performance. It can be reflected by customer satisfaction. The common approach to calculate it is considered as a linear function [53] and added it directly to the objective. However, customers usually cannot give an accurate satisfaction value in practice. The service satisfaction is assumed to be a piecewise function [54]. Figure 5 tells the piecewise function.

Service-level function of time windows.
The approaches to calculate the customer satisfaction are as follows (t
i
is time when the EV arrives at customer i): When E
ti
⩽ t
i
⩽ L
ti
, sv
i
= 1. When EE
ti
⩽ t
i
⩽ E
ti
, When L
ti
⩽ t
i
⩽ LL
ti
,
Each customer satisfaction level sv
i
is obtained according to t
i
(time when the EV arrives at customer i) and the time windows of each customer. The tolerate time is the same for every customer in this study, so the customer satisfaction value is converted into a uniform scale. The calculation of customer satisfaction value leverages the relationship of time (i.e., time when the EV arrives at customer i and the time windows of each customer) and the customer satisfaction value is between 0 and 1. For example, assuming that the time window is [Eti, L
ti
] = [20, 40], the tolerance time is 5, i.e., [EEti, LL
ti
] = [15, 45] and t
i
= 43. The customer satisfaction level:
In the canonical DE algorithm, the mutation operator is to realize the individual mutation and generate the neighboring solution. However, it is used for continuous optimization problem. To better deal with this problem, the improved mutation heuristics is described as follows. There are two strategies for the current solution: To generate neighboring solution, randomly chose one EV, and then one customer is randomly chosen from the chosen EV. Then, this customer is randomly removed from the newly chosen EV and inserted another EV to obtain a better solution. The time complexity of the first improved mutation heuristics is O (n2). To generate a neighboring solution, just like the first method, randomly chose one EV is to generate the neighboring solution. Secondly, a random number of customers is chosen from the newly chosen EV. Then, these customers are removed and inserted these into other EVs. Figure 6 illustrates the improved mutation method, where two customers {4, 2} are randomly chosen. Considering the time complexity of second improved heuristic, the time complexity of this approach is O (n3).

Improved mutation process.
The detailed steps are given in the algorithm 3. It can be seen that the second method can generate a solution with more differences from the current one, and therefore can improve the detection abilities of the proposed algorithm.
In the canonical DE algorithm, the crossover operator is leveraged to solve continuous optimization problem. To better deal with this problem, the improved crossover operation is described as below. It randomly generates the newly solution through the crossover probability P c . Figure 7 shows an example of the improved crossover operation. Two solutions are given in this figure: one is the current solution, and the other is the best solution. First, randomly select a vehicle from the current solution, find these customers who are served by this vehicle from the best solution, and then delete the corresponding customers from the best solution. At last, replace these deleted customers from the best solution and insert them into other vehicles to generate the crossover solution.

Crossover process.
The detailed steps of the improved following operation depicts in the algorithm 4. The time complexity of the improved crossover operation is O (n2). To ensure the non-negative battery capacity, it also continues to implement the charging station insertion strategy after the crossover operation.
Note that the current solution is to delete customers from the vehicle with the delete rate D r .
In the current strategy, the EV is fully charged each time, but it still has the remaining battery capacity when it comes back to the depot. In terms of the concept of environmental protection, it is necessary to decrease the charging battery capacity. Therefore, a battery charging adjust (BCA) strategy is proposed in this paper. According to the last customer residual battery capacity, adjusting the charging battery capacity could decrease the total trip time.
The BCA strategy displays as below. First, record whether the charging operator is used for each vehicle. Then, remember the charging stations numbered for the next steps. Finally, recalculate the battery capacity and the charging battery capacity is set to the appropriate value. The time complexity of the BCA strategy is O (n2). It satisfied with the following conditions. If there is only one charging station in the route, ensure that after charging in the current charging station, the vehicle returns to the depot with the zero remaining battery capacity. If there are two or more charging stations in the route, ensure that remaining battery capacity from the current charging station to the next charging station is 0.
Negative repair strategy
After the improved mutation operation or crossover operation, some customers could not satisfy the nonnegative battery capacity constraint. Therefore, the current solution becomes infeasible. Consequently, a negative repair strategy is proposed in this paper. There are two approaches to consider: To generate a feasible solution, delete these customers with negative remaining battery capacity and add an extra vehicle to serve these customers. To generate a feasible solution, except as mentioned in first condition, however, some VRPTW literature usually reduce to minimize the fleet size and the BCA strategy presents in this paper. Adding other vehicles to serve these customers is difficult. To satisfy the condition, increasing the charging battery capacity to make it possible to serve these customers.
The time complexity of the negative repair strategy is O (n2). It can use the figure to depict this process in detail. For example, delete the customer numbered 30 from the vehicle numbered 10. Now, assume insert into the ten vehicles (i.e., vehicles numbered 0 to 9), and the vehicle numbered 7 served these customers, including {1, 2, 5, 7}. There is a charging station numbered 8 between the customer numbered 5 and 7. The charging battery capacity is 100 in the charging process. It can increase 150 to ensure that this vehicle reaches the customer numbered 7 and then serves the customer numbered 30. The sequence can present {1, 2, 5, 7, 30} with the negative repair strategy (as shown in Fig. 8).

Negative repair process.
With the above research, time complexity of the IDE algorithm is O (n3). Framework of the IDE algorithm is shown as below:
Numerical experiments
All experiments are implemented in C++, compiled with Visual Studio 2013, and run on an Intel Core i5 2.4-GHz PC with 4 GB of memory using Windows 10 Operation System.
Experimental instances
The instances of this paper are generated from the Solomon benchmark instances. Clustering method is leveraged to divide the 100 customers of the Solomon instances into a number of categories by calculating the distance between each customer. Then, it designs the location of the charging station for each category by calculating the mean value of the customers’ coordinates which belong to each category. The instances can be found in the website"https://www.researchgate.net/publication/334735357_nonLinearVRPTWInstances". In reality, these midpoints may be more than or less than demand.
It contains 55 instances, named “c101” to “rc207”. Differences with the classical Solomon are listed below. All EVs have a tripe of data, i.e., an energy consumption rate that tells the energy consumption for each unit of travel distance, a fixed capacity that represents the vehicle load capacity. Sequences of charging stations are generated by the clustering method. The tolerance time between the preferred time windows and the allowable time windows is 5.
In this paper, the battery consumption is assumed to be equal to the distance on arc. The maximum battery capacity of the EV is 300.
Experimental parameter
To effectively set the system parameters, preliminary experiment is carried out. The parameters included in the experiment are as follows and shown in Table 2.
The population size Ps, which is the total number of individuals in an experiment. The delete rate Dr, which determines the number of customers deleted from the vehicle in the current solution. The crossover probability Pc, which determines the possibility of crossover-operation.
Parameter level
Parameter level
In the DOE Taguchi method, an orthogonal array L16 (43) is leveraged [55]. The IDE algorithm was independently run 30 times with each parameter combination, and the average fitness value is used as the response variable (RV). Table 3 shows the results of various combinations of parameters. Figure 9 depicted the factor level trend.
Response values

Factor level trend of three parameters.
As shown in Fig. 9, the IDE algorithm achieves better by selecting the following three key parameter levels: (1) P s is the level 2; (2) D r is the level 3; and (3) P c is the level 4. Consequently, the suitable values of P s , D r and P c are set to 100, 0.5 and 0.8, respectively.
In this section, to investigate the advantage of the improved crossover operator in this paper, it conducted a comparison with the classical crossover operator [44], i.e., the classical crossover operator (IDE-I for short), and the improved crossover operator (IDE-II for short). The fitness values obtained after 30 independent runs for 55 extending the VRPTW instances. Table 4 displays the experimental results.
Comparisons of crossover method in Min, Avg and Mag
Comparisons of crossover method in Min, Avg and Mag
It contains the instance name, the experimental results of the IDE-I and the IDE-II in minimum, average and maximum, respectively, and the percentage deviation. The calculation formula is given at (30):
To further test whether the differences from the above table are significant, Fig. 10(a)-(c) conducted detailed comparisons. It presented the means and 95% LSD intervals for fitness with the IDE-I and IDE-II. It is obvious that the IDE-II presents superiority of statistical significance with respect to the IDE-I. This main reason for this results is that by using the improved crossover operator (i.e., the sub-section 4.6 discussed), the newly generated solution can boost seek and exploration capabilities.

Means and 95% LSD intervals for IDE-I and IDE-II.
To further verify and test the validity of the IDE algorithm, the two-phase GA [30] and the improved TS algorithm (ITSA) [29] were selected as the comparison algorithms. The two-phase GA and the ITSA were efficient algorithms to solve a series of VRP. Likewise, the two algorithms can be applied to these instances for further comparison, re-encode the two-phase GA and ITSA aims to solve the EVRPTW-NL.
For experimental credibility, tested three algorithms on the same computer with the same 55 extended instances over 30 independent runs. The minimum and average values of these experiments are shown in the following two tables.
Table 5 shows the comparison experiment results: (1) 37 better solution are obtained by the IDE algorithm, while there is only 1 optimal solution calculated by the two-phase GA and 21 optimal solution calculated by the ITSA. (2) from the last line, the IDE algorithm obtains an average percentage deviation 1.11.
Comparisons of the different heuristics in Min
Comparisons of the different heuristics in Min
Table 6 presents that: (1) the IDE algorithm had 35 better values, while there are only 2 and 19 optimal solutions calculated by the two-phase GA and the ITSA respectively. (2) as shown in the last line, the IDE algorithm is 0.81.
Comparisons of the different heuristics in Avg
The two tables show that the IDE algorithm is significantly superior in minimum and average than the other classical algorithms.
Figure 11(a)-(d) illustrates the two pairs of compared algorithms in minimize and average. The p-value is close to 0, indicating significant differences between the compared algorithms.

Means and 95% intervals for different algorithms.
To further verify the performance of the developed algorithm in solving this problem, randomly selected six instances ‘Inst1, Inst10, Inst18, Inst30, Inst41, Inst49’. Figure 12(a)-(f) reports the convergence curve of these instances, showing the effectiveness of the IDE algorithm.

Convergence curve for instances.
Figure 13 presents the customer service time and charging time Gantt diagram of the instance. For example, the “V11” route is:

Gantt chart for the instance.
As seen from the Equation (1), the weighted sum method is leveraged to calculate the objective fitness and the weight w is to optimize the two objectives, i.e., the total trip time and the customer satisfaction value sv i . The weight w was set to 0.8 in this study. Reasonable weights are important for decision-making. In this section, we set the weight value to {0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9} to test the influence of each weight on the total trip time and customer satisfaction value. Each instance was run independently 10 times for ten instances. Figure 14 illustrates the results of the experiment, where each point represents the difference between the optimal objective value and the average objective value for each instance at the current weight value. It shows that: (1) the weight effects the balance between the total trip time and customer satisfaction value; (2) different weights have different optimization effects on the objective fitness.

Comparison with different weights.
In this study, a hybrid algorithm combing an improved DE and several heuristic (IDE) is proposed for solving the EVRPTW-NL. A special encoding approach is presented that considers the problem features. Then, an initialization strategy of charging stations is proposed to guarantee the nonnegative battery level constraint of the EV. Furthermore, to improve the performance of given solution, a battery charging adjust (BCA) strategy is developed. Moreover, a novel negative repair strategy is utilized to make a feasible solution. Finally, several efficient algorithms are compared with the proposed algorithm to examine the efficiency. The experimental results demonstrate that the proposed IDE algorithm has a good performance.
Future work will focus on the following tasks: (1) adapting other realistic constraints and objectives into the considered problem, such as the uncertain customer’s demand, and the multi-objective optimization algorithms [56–58]; (2) to add other heuristics into the IDE algorithm; (3) considering battery discharging of the EV and other realistic applications, such as the prefabricated systems [59].
Footnotes
Acknowledgments
This research is partially supported by National Science Foundation of China under Grant 61773192, 61803192, and 61773246, Shandong Province Higher Educational Science and Technology Pro-gram (J17KZ005), and Major Program of Shandong Province National Science Foundation (ZR2018ZB0419).
