Abstract
This paper presents one method and one hybrid genetic algorithm for multi-depot open vehicle routing problem with fuzzy time windows (MDOVRPFTW) without maximum time windows. For the method, the degree of customers’ willingness to accept goods (DCWAG) is firstly proposed, it’s one fuzzy vague and determines maximum time windows. Referring to methods to determine fuzzy membership function, the function between DCWAG and the starting service time is constructed. By setting an threshold for DCWAG, the starting service time that the threshold corresponds can be treated as the maximum time window, which meets the actual situation. The goal of the model is to minimize the total cost. For the algorithm, MDOVRPFTW without maximum time windows is an extension of the NP-hard problem, the hybrid genetic algorithm was designed, which is combination of genetic algorithm and Hungarian algorithm. When the hybrid genetic algorithm applied to one pharmaceutical logistics company in Beijing City, China, one optimal scheme is determined. Then the rationality and the stability of solutions by the hybrid genetic algorithm are proved. Finally, sensitivity analyses are performed to investigate the impact of someone factor on DCWAG and some suggestions are proposed.
Keywords
Introduction
The vehicle routing problem (VRP) plays a central role in optimization of distribution networks [1]. Today, there are more diversified requirements of customers for distribution [2]. Technology development leads to improvement about distribution tools’ efficiency [3]. Merger and reorganization of logistics enterprises lead to expansion of distribution scale. The government formulates many requirements in order to promote the logistic industry developing stably [4]. To adapt to development of economic society, many distribution enterprises try to apply some frontier theories to vehicle routing problem, which strengthens connection between theory and industry and broadens researches of VRP. MDOVRPFTW as one extended VRP, will play an important role in the trend of warehousing integration and artificial intelligence.
MDOVRPFTW is combination of multi-depot open vehicle routing problem (MDOVRP) and vehicle routing problem with fuzzy time windows (VRPFTW). MDOVRP means that there are many depots in the distribution system, vehicles can travel from depots to customers and don’t need to return to any depot after completing tasks. There are two traditional solution methods for MDOVRP. One is to divide the distribution area into several independent areas and dispatch vehicles separately, the method simplifies MDOVRP and may not lead to the optimal allocation of global resources. Another is to schedule globally, which can lead to the optimal allocation of global resources, but it’s more complex. About researches of MDOVRP, Wang et al. [5] considered the cost of carbon emissions and designed a hybrid heuristic algorithm. Lim et al. [6] designed a two-stage algorithm. Salhi et al. [7] considered multi-type vehicles. Yu et al. [8] improved ant colony algorithm. Rahimi et al. [9] designed a module heuristic algorithm. Nilay et al. [10] designed a genetic clustering algorithm based on geometric shape to solve the problem, etc.
VRPFTW is special vehicle routing problem with soft time windows (VRPSTW). For VRPSTW, the starting service time can be out of customers’ ideal time windows. There are mainly two ways to constrain the starting service time. One is to set penalty cost for deviating time (the starting service time deviates from the ideal time window) [11, 12]. Another is to set customers’ satisfaction function for deviating time, which is VRPFTW. VRPFTW is mentioned in previous researches [13, 14], its essence is that, customers have subjective feelings for deviating time and the subjective feelings are vague. By introducing fuzzy theory [15], customers’ feelings towards deviating time can be quantified and defined as customers’ satisfaction. Because VRPFTW meets actual distribution situations closely, it’s widely concerned and researched. Jiafu et al. [16] studied non-linear customers’ satisfaction function. Seyed et al. [17] designed multi-objective model to solve dynamic vehicle routing problem (DVRP). Canhong et al. [18] designed an embedded hybrid neighborhood search algorithm. Wang et al. [19] improved solution algorithm of Cheng et al. [14]. Similarly, Jun et al. [20] studied that the charging operation is affected by divers’ feeling for the rate of electricity consuming. Mehdi et al. [21] constructed multi-objective model to solve multi-depot vehicle routing problem with fuzzy time windows (MDVRPFTW).
For above researches about VRPFTW, limited by the existing method to describe customer’s satisfaction, maximum time windows and ideal time windows of customers must be known in advance. However, there are some distribution situations without maximum time windows. Like replenishment problem (RP), supplementary goods aren’t used immediately and customers’ feelings towards maximum time windows are fuzzy, customers don’t set maximum time windows. How to describe customer’s satisfaction for VRPFTW without maximum time windows hasn’t been studied by anybody. This paper is focused on two aspects, one is combination of MDOVRP and VRPFTW, another is VRPFTW without maximum time windows. Contributions are as follows: One method to describe customers’ satisfaction for MDOVRPFTW without maximum time windows was proposed. One hybrid genetic algorithm for MDOVRPFTW without maximum time windows was designed. It is applied to one case and proved valid.
This paper is organized as follows. Section 2 presents the method and the model for MDOVRPFTW without maximum time windows. Section 3 presents the hybrid genetic algorithm. Section 4 presents one case that illustrates the model and algorithm. Some concluding remarks and possible avenues for future work are given in section 5.
The method and the model for MDOVRPFTW without maximum time windows
The existing method to describe customers’ satisfaction in VRPFTW
Ideal time windows and maximum time windows are known. Generally, customers’ satisfaction is expressed as follows [16–18].
j represents customer, SST j represents starting service time of j. [e j , l j ] represents ideal time window of j. [E j , L j ] represents maximum time window of j. L j (SST j ) represents satisfaction of j at SST j . δ represents minimum satisfaction of customers, it’s to prevent to lost customers because of low customers’ satisfaction. For Equations (2a) and (2b), [ET j (δ) , LT j (δ)] represents actual maximum time window of j that δ corresponds. The actual maximum time window is different from the maximum time window.
Maximum time windows are the boundaries between customers’ acceptance and rejection to goods. For MDOVRPFTW without maximum time windows, when the starting service time is out of the ideal time window, customers’ acceptance or rejection to goods is determined by DCWAG. Customers accept goods if DCWAG is high. DCWAG is affected by the starting service time, DCWAG is high if the starting service time is close to the ideal time window. The main steps of the method proposed are as follows.
In this paper, three factors are refined and defined, the importance of goods towards customers, customers’ satisfaction towards starting service time, the coordination relationship between drivers and customers. Three factors are quantified by direct method [22] (one method to determine fuzzy membership function) and questionnaire survey. The rest content is shown in section 2.4 and 2.5.
Notation for MDOVRPFTW without maximum time windows
Parameters and variables in the mathematical model are shown in Table 1.
Parameters and variables in the mathematical model
Parameters and variables in the mathematical model
The importance of goods towards customers is one fuzzy concept, it’s described referring to direct method [22]. For MDOVRPFTW without maximum time windows, generally goods needn’t to be used urgently and the importance of goods towards customers is mainly reflected in price and quantity. More goods customers order, higher the importance of goods towards customers is. Higher the price of good is, higher the importance of goods towards customers is. The importance of goods towards customers is expressed in Equation (3).
A
j
represents the importance of goods towards customer j. Generally

The relationship between the importance of goods towards customers and relative quantity.
When relative quantity is fixed, there’s positive correlation between the importance of goods towards customers and relative quantity. When relative price is large, the importance of goods towards customers varies fast with relative quantity.
Customers’ satisfaction towards starting service time is one fuzzy concept, it’s described referring to direct method. (Customers’ satisfaction towards starting service time is similar to customers’ satisfaction in VRPFTW, but two maximum time windows are different) It’s expressed in Equation (4).
B j represents customer j’s satisfaction towards SST j .
The coordination relationship between drivers and customers is that, when uncertain situations affect terminal delivery of goods between drivers and customers, drivers and customers have different subjective intentions to complete delivery of goods. For example, outgoing drivers are better in completing delivery lately than incoming drivers, tolerant customers are likely to accept late delivery, etc. Questionnaire survey is introduced, the coordination relationship between drivers and customers is described by customers scoring drivers. Higher the scores are, better the coordination relationship between drivers and customers is. κ kj represents the coordination relationship between driver (vehicle) k and customer j, κ kj ∈ [0, 1].
DCWAG is positively correlated with above three factors. Three factors are independent to each other. Three factors are in [0, 1].
DCWAG is one fuzzy concept. The way to describe DCWAG is to set weight coefficients (P1, P2, P3, P1 + P2 + P3 = 1) for three factors (A j , B j , κ kj ). When starting service time of customer j is out of the ideal time window, as shown in Equation (5).
The function between DCWAG and starting service time is expressed in Equation (6).
P kj (SST j ) represents the degree of customer j’s wiliness to accept goods when vehicle k starts serving j at SST j . P kj (SST j ) ∈ [P1 · A j + P3 · κ kj , P1 · A j + P2 + P3 · κ kj ).
The mode for MDOVRPFTW without maximum time windows
For Equations (7a) and (7b), [E kj (ɛ) , L kj (ɛ)] indicates maximum time window of customer j that ɛ corresponds when vehiclek serves customer j. For Equations (8a) and (8b), [ET kj (δ) , LT kj (δ)] indicates actual maximum time window of customer j that δ corresponds when vehicle k serves customer j.
Note: Step 5 in algorithm 2 is specified in algorithm 3.
The objective function is given by Equation (9), which includes two terms: one term is the cost of vehicles dispatched, another term is the cost of routes. Constraint (10) guarantees no distribution of goods between depots. Constraint (11) guarantees that depots are only treated as starting nodes. Constraint (12) is used for eliminating sub-tours in the vehicle routing. Constraint (13) guarantees that number of vehicles used can’t be larger than total number available. Constraints (14a) and (14b) guarantee that each customer must be serviced and once. Constraint (15) ensures that vehicles are not overweight. Constraints (16a) and (16b) are used for calculating cumulative time that vehicles arrive first customers. Constraints (17a) and (17b) are used for calculating cumulative time of adjacent customers i and j covered by vehicle k. Constraint (18) guarantees that the scheme can meet actual maximum time windows of all customers that ɛ corresponds. Constraint (19) indicates the relationship between variables
About VRP, there are genetic algorithm, ant colony algorithm, particle swarm optimization algorithm and tabu search algorithm, etc. [23–25] In this paper, the hybrid genetic algorithm designed follows the framework of genetic algorithm (coding, decoding, selecting, crossing and mutating) and Hungarian algorithm is embedded in it. Meanwhile, some links of the hybrid genetic algorithm were optimized. The main steps are as follows.
Code
Customers, depots and vehicles are respectively represented by real number code. One chromosome is generated by arraying all customers randomly.
Decode and construct initial population
Firstly, add “0” at starting and ending positions of one chromosome. Then randomly select locations and insertk sum - 1“0”. Finally, customers are grouped without violating total number of vehicles. While inserting “0”, that “0” is adjacent to “0” will generate duplicate decoding schemes, as shown in Fig. 2. The corresponding decoding steps are shown in algorithm 1.

The situation with duplicate decoding schemes.
Algorithm 1 can meet constraints (10) — (14). To meet constraints (16), (17) and (18), the main decoding steps are shown in algorithm 2, 3 and 4.
Algorithm 4 can also meet constraint (15). Repeatedly run algorithm 3 to get the initial population.
Tournament selection strategy and elitist preservation strategy are applied to the algorithm. Tournament selection strategy is to choose one with better fitness value as progeny chromosome from any two paternal chromosomes. Because tournament selection strategy possibly retains chromosomes with poor fitness value, elitist preservation strategy is to choose the best chromosome from paternal population instead of the worst chromosome from progeny population.
Cross
Order-based crossover is adopted. As shown in Fig. 3.

Order-based crossover.
Reciprocal exchange operator, insertion operator and inversion operator are adopted. As shown in turn in Fig. 4. Reciprocal exchange operator is to randomly select two genes from one chromosome and exchange positions of two genes, as shown in Fig. 4(a). Insertion operator is to randomly select two genes from one chromosome and put one gene behind anther, as shown in Fig. 4(b). Inversion operator is to randomly select two genes from one chromosome and reverse the order of genes between the two genes, as shown in Fig. 4(c).

The mutation operator.
Background of the example
The pharmaceutical logistics company is composed by 4 depots and 20 rental vehicles. The medicines are delivered to some hospitals or drugstores in Beijing and Hebei province. The medicines that hospitals or drugstores request are generally used to replenish stocks and the medicines in stocks can be used up after 2-3 days without replenishment. Hospitals or drug-stores probably reject or accept the medicines out of ideal time windows (hospitals are open for 24 hours). Relevant parameters of the example are adjusted, some parameters are shown in Table 2, Table 3 and Table 4.
Parameters of depots and customers
Parameters of depots and customers
Parameters of vehicles
Other parameters
Notes: for
Notes: Q represents that medicines are packed in unified boxes and one vehicle can be loaded into 15 boxes at most. k sum ,C1 and C2 are provided by the company. In reality, v and w ij aren’t fixed. Aiming at adopting to the model and the algorithm (meanwhile fixing v and w ij ), we justified the real distances (between customers, between customers and depots) to ensure that the driving time (between customers, between customers and depots) meets to the actual situations. The justified distances aren’t equal to the real distances.
The population and number of iterations of the algorithm are set as 50 and 150, respectively. The crossover and mutation rates are set as 0.8 and 0.15, respectively. All the algorithms are coded in MATLAB and run on a computer with Intel Core i7-700 3.60 GHz and 16 GB RAM operating on Microsoft Windows 10 operation system. When ɛ is 0.6 and δ is 0.2, by running 10 times, the scheme with lowest cost is selected. The data are shown in Table 5, routes of the scheme with lowest cost are shown in Fig. 5.
The scheme with lowest cost
The scheme with lowest cost
Notes: ①, ②, ③ and ④ represent depots.

Routes of the scheme with lowest cost.
The relationship between routes and vehicles is shown in Table 6. “0” represents that the vehicle serving the route can meet actual maximum time windows of all customers in the route, “1” represents the vehicle can’t serve the route.
The relationship between routes and vehicles
Notes: k represents vehicle index,srepresents route index.
The solutions by the hybrid genetic algorithm are compared with the exact solutions. Firstly, 4 examples (A, B, C and D) are set. Example A is aimed at serving front 5 customers in Table 2. Example B is aimed at serving front 10 customers in Table 2. Example C is aimed at serving front 20 customers in Table 2. Example D is aimed at serving all 48 customers in Table 2. When ɛ is 0.9 and δ is 0.2, it’s to run 10 times by the hybrid genetic algorithm for each example, the scheme with lowest cost is selected as the solution by the hybrid genetic algorithm. As follows in Table 7.
Comparison of exact solutions and solutions by the hybrid genetic algorithm
Comparison of exact solutions and solutions by the hybrid genetic algorithm
Notes: Deviation degree is the difference between the solution by the hybrid algorithm and the exact solution divided by the exact solution.
Table 6 shows that when the scale of customers is small, the solutions by the hybrid genetic algorithm are equal to the exact solutions. When the scale of customers is large, the solutions by the hybrid genetic algorithm approach the exact solutions with slight deviation.
When δ is 0.2, it’s to run 10 times respectively for differentɛ. Maximum deviation degree and average deviation degree are obtained. Maximum deviation degree is the difference between maximum cost for 10 times and minimum cost for 10 times divided by minimum cost for 10 times. Average deviation degree is the average of deviation degrees for 10 times. As shown in Table 8.
Stability of solutions by the hybrid genetic algorithm
Stability of solutions by the hybrid genetic algorithm
Table 8 shows that average deviation degrees are within 6.9% and maximum deviation degrees are within 9.6%, which indicates solutions by the hybrid genetic algorithm are close to one small value and fluctuate in small range. To a certain extent, solutions by the hybrid genetic algorithm are stable.
Firstly, 5 examples (A, B, C, D and E) are set. The coordination relationship between drivers and customers in example C is one random value in [0:0.1:1]. That in example A is one value equal to the random value in example C minus 1. That in example B is one value equal to the random value in example C minus 0.5. That in example D is one value equal to the random value in example C plus 0.5, That in example B is one value equal to the random value in example C plus 1 (While values are less than 0, values are 0. While values are more than 1, values are 1.). When δ is 0.2, it’s to run 10 times for each example and the scheme with lowest cost is selected. As shown in Table 9 and Fig. 6.

Influence of the coordination relationship between drivers and customers on cost.
The cost of 5 examples
Notes: The unit is ¥.
Figure 6 reveals that improving the coordination relationship between drivers and customers can reduce distribution cost. When the coordination relationship between drivers and customers ranges from A to E, cost reduces 7.9–9.8%. The suggestion: when maximum time windows of customers are unknown and customers may refuse goods out of ideal time windows, logistics enterprises can adopt piecework wages or award drivers with high turnover, which can improve the desire that drivers complete delivery task and drivers’ attitude to serve customers. Finally logistics enterprises can improve DCWAG and reduce cost.
In this paper, one method to deal with MDOVRPFTW without maximum time windows is presented. By direct method and questionnaire method, the factors affecting DCWAG can be quantified. The function between DCWAG and the starting service time can be constructed. By setting an threshold for DCWAG, maximum time windows meeting actual situations can be determined. Meanwhile, one hybrid genetic algorithm is designed. Applying it to one example proves that the hybrid genetic algorithm is feasible. Some suggestions to improve DCWAG are proposed.
Note that factors affecting DCWAG are different at different situations, which means the factors aren’t fixed. The disadvantage of the proposed method is that the performance to optimize of the hybrid genetic algorithm is insufficient, a relatively better solution can be determined by running many times. The next research is to further improve the hybrid genetic algorithm.
Founding statement
This research was supported by the Emergency Management System Construction Research Special Project of the National Social Science Fund of China (20VYJ024).
