Abstract
Distributed scheduling has attracted much attention in recent years; however, distributed scheduling problem with uncertainty is seldom considered. In this study, fuzzy distributed two-stage hybrid flow shop scheduling problem (FDTHFSP) with sequence-dependent setup time is addressed and a diversified teaching-learning-based optimization (DTLBO) algorithm is applied to optimize fuzzy makespan and total agreement index. In DTLBO, multiple classes are constructed and categorized into two types according to class quality. Different combinations of global search and neighborhood search are used in two kind of classes. A temporary class with multiple teachers is built based on Pareto rank and difference index and evolved in a new way. Computational experiments are conducted and results demonstrate that the main strategies of DTLBO are effective and DTLBO has promising advantages on solving the considered problem.
Keywords
Introduction
Two-stage hybrid flow shop scheduling problem (THFSP) is a special case of hybrid flow shop scheduling and possesses parallel machines at one stage or two stages. It is NP-hard because its simplest version with two machines at one stage and single machine at another stage is proved to be NP-hard [1]. In the past decades, THFSP has been extensively considered in single factory and multiple factories and a number of results have been obtained.
Regarding THFSP in single factory, many methods including exact algorithm, heuristic and meta-heuristic have been applied. Allaoui and Artiba [2] investigated THFSP with availability constraint by using an exact method and several heuristics to minimize makespan. Yang [3] developed some heuristics for some special cases of the problem with dedicated machines. Wang and Liu [4] presented a heuristic based on branch-and-bound for the problem with dedicated machines. Wang et al. [5] proposed a problem-tailored constructive heuristic with local search strategy to solve a bi-criteria energy-efficient THFSP.
Meta-heuristic is a group of various algorithms [6–23] and the main path to solve THFSP. Wang and Liu [6] proposed a genetic algorithm for no-wait THFSP. Tan et al. [7] presented a hybrid decomposition approach with variable neighborhood search for the problem with batch processing machines. Fan et al. [8] provided a mutant firefly algorithm to optimize the simultaneous rate for the arrival and the on-time delivery rate. Except these works, THFSP with batching and scheduling [9], interval processing times [10], assembly [11], renewable resources [12], preventive maintenance [13] and deteriorating effect [14] are studied and other meta-heuristics including tabu search [13, 15], artificial bee colony [16], estimation of distribution algorithm [14] and imperialist competitive algorithm [17] are used.
Production has transferred from single factory to multiple factories with the development of globalisation. Distributed scheduling problems in multiple factories have attracted much attention in recent years. As the extended version of THFSP in multiple factories, distributed two-stage hybrid flow shop scheduling problem (DTHFSP) is also considered. Lei and Wang [24] developed a shuffled frog-leaping algorithm with memeplex grouping for DTHFSP with makespan minimization. Cai et al. [25] investigated FDTHFSP with sequence-dependent setup time (SDST) and presented a collaborative variable search (CVS) to optimize total agreement index and fuzzy makespan simultaneously.
As stated above, the previous works are mainly about THFSP in single factory and the results on DTHFSP are very limited. With the applications of multi-factory production in many manufacturing situations like semi-conductor manufacturing, it is important to pay attention to DTHFSP to quickly respond to rapid changes in market requirements and customer demands in the global market. On the other hand, uncertainty extensively exists in real-life production process, for example, processing time, setup time and due date are not fixed in advance and often described by fuzzy number etc. In the past decade, scheduling problems with uncertainty have been extensively considered in single factory.
Uncertainty cannot neglected intentionally because the obtained schedule is hard to meet the requirements of real-life production after uncertainty is neglected; moreover, the neglecting of uncertainty in multiple factories will lead to greater loss than the neglecting in single factory, so it is necessary to consider distributed scheduling problems with uncertainty such as FDTHFSP.
Fuzzy theory is often used to describe uncertainty and fuzzy scheduling problems have been extensively considered in single factory [26–37]. In recent years, there are some works on fuzzy distributed scheduling in multiple factories. Shao et al. [36] focused on distributed fuzzy blocking flow shop scheduling problem in multiple homogeneous factories and presented two constructive heuristics and two iterated greedy to minimize fuzzy makespan. Zheng et al. [37] handled multi-objective fuzzy distributed hybrid flow shop scheduling problem and proposed a cooperative coevolution algorithm with iterated greedy and estimation of distribution algorithm. Cai et al. [25] solved FDTHFSP by using CVS. Obviously, fuzzy distributed scheduling does not attract sufficient attention and FDTHFSP is seldom investigated, thus, it is necessary to focus on fuzzy distributed scheduling problems and especially on FDTHFSP.
As mentioned above, TLBO is seldom used to solve FDTHFSP. It is is a population-based algorithm inspired by passing on knowledge within a classroom environment, which has simple structure and fewer parameters and is easily understood and implemented [39]. In recent years, it has been successfully used to solve various production scheduling problems in single factory and multiple factories. Lin et al. [40] proposed a multi-objective TLBO with three carbon-footprint-reduction strategies for processing parameter optimization and flow shop scheduling. Li et al. [41] developed discrete TLBO with a modified iterated greedy for flow shop rescheduling problems. Lei et al. [42] presented a teachers’ TLBO for energy-efficient hybrid flow shop scheduling. Xu et al. [43] solved fuzzy flexible job shop scheduling by using an effective TLBO. Shao et al. [44] provided a hybrid discrete TLBO to solve no-idle flow shop scheduling. Mishra and Shrivastava [45] presented a TLBO and a jaya heuristic for permutation flow shop scheduling with cost minimization. Buddala and Mahapatra [46] applied two-stage TLBO for scheduling in flexible job shop with machine breakdown. Lei et al. [47] proposed a cooperated TLBO for distributed two-stage assembly flow shop scheduling. The competitive performances of TLBO are verified and tested in the above works, so TLBO may be an useful path to solve FDTHFSP.
In this study, FDTHFSP with SDST, fuzzy makespan and maximize total agreement index is considered, which is DTHFSP with fuzzy processing data and studied by Cai et al. [25]. A diversified teaching-learning-based optimization (DTLBO) algorithm is presented, in which multiple classes are constructed and categorized into two types according to class quality. Different combinations of global search and neighborhood search are used in two kind of classes. A temporary class with multiple teachers is constructed and evolved in a new way. A number of experiments are conducted on many instances. The computational results show the effectiveness of the main strategies of DTLBO and its promising advantages in solving FDTHFSP with SDST.
The remainder of the paper is organized as follows. Problem description is given in Section 2 followed by operators of fuzzy number. DTLBO for the considered problem is reported in Section 4. Numerical experiments on DTLBO are reported in Section 5 and the conclusions are summarized in the final section and some topics of the future research are provided.
Problem description
FDTHFSP with SDST is described as follows. There are n jobs J1, J2, ⋯ , J
n
processed by F factories located in different sites. Each factory f possesses a two-stage hybrid flow shop, where there are m
f
identical parallel machines at the first stage and a single machine at the second stage. All jobs are available at time zero. Machines M1,S
f
+1, ⋯ , M1,S
f
+m
f
exist at the first stage of factory f, where
Triangular fuzzy number (TFN) is used to depict processing time and setup time.
FDTHFSP has some constraints on jobs and machines: each machine can process at most one operation at a time; no jobs may be processed on more than one machine at a time; operations cannot be interrupted; all machines are available at all times, that is, maintenance, failure and breakdown are not considered.
FDTHFSP consists of three sub-problems, (1) factory assignment used to decide which jobs are allocated into each factory; (2) machine assignment; (3) scheduling. Cai et al. [25] analyzed the relationships among these sub-problems and pointed out that two assignment sub-problems can be combined into machine assignment of the first stage.
The goal of the problem is to optimize the following two objectives under all constraints are met.
Obviously, FDTHFSP is NP-hard because THFSP is the special case of FDTHFSP and NP-hard.
When TFN is used to represent processing data of scheduling problem, three operators are often used to build a schedule, which are addition operator, max operator and ranking operator.
Addition operator is used to compute completion time and defined for TFNs A = (a1, a2, a3) and B = (b1, b2, b3) by
Ranking operator is used to compare fuzzy completion time. To compare A and B, criteria c1, c2 and c3 are defined for TFN A, c1 (A) = (a1 + 2a2 + a3)/4, c2 (A) = a2, c3 (A) = a3 - a1. If c1 (A) > c1 (B), then A > B. If c1 (A) = c1 (B) and c2 (A) > c2 (B), then A > B. If c1 and c2 of A and B are both equal, c3 is used to compare A and B.
Max operator is applied to decide fuzzy beginning time and given by
In TLBO [39], its search process acts on a population considered as a class of learners and has a teacher phase and a learner phase. In teacher phase, a teacher x
teacher
which is the best solution obtained so far passes its knowledge to learners and a solution x is modified by
In learner phase, a learner x increases its knowledge by interaction with its classmate x
j
.
In the above phases, x new is accepted if x new gives a better objective value than x.
Equations (6,7) are hard to directly generate feasible solutions of scheduling problem because of its combinatorial feature. TLBO is often discrete when it is used to solve scheduling problems [41–47]. In this study, DTLBO is also discrete.
In DTLBO, multiple classes are constructed and divided into good classes and common classes and different combinations of global search and neighborhood search are adopted in two kind of classes; a temporary class with multiple teachers is constructed and evolved in a new way, as a result, diversification is implemented.
Cai et al. [25] pointed out that FDTHFSP is composed of scheduling and machine assignment of the first stage and provided a two-string representation, in which a machine assignment string and a random key string are used. In this study, this representation is adopted. For FDTHFSP with n jobs and W parallel machines at stage 1, a solution is denoted by a machine assignment string [M1,h1, M1,h2, ⋯ , M1,h n ] and a random key string [r1, r2, ⋯ , r n ], where machine M1,h i is allocated for job J i , r i is real number of job J i . The second string is for scheduling sub-problem.
The decoding procedure is shown as follows. In each factory f, for each parallel machine M1,q, decide its all allocated jobs according to machine assignment string and obtain a permutation of these jobs by sort these jobs in the ascending order of their r i , then start with the first job of the permutation, process all jobs of the permutation on M1,q sequentially, finally, deal with the processing of each job on M2,f after the processing at stage is finished.
Cai et al. [25] provided an example of FDTHFSP with 10 jobs and 2 factories. m1 = m2 = 2. A possible solution is composed of [M1,1, M1,3, M1,4, M1,1, M1,2, M1,2, M1,1, M1,1, M1,3, M1,4] and [0.12, 0.43, 0.89, 0.77, 0.40, 0.25, 0.89, 0.91, 0.17, 0.36.
An initial population P with N solutions is randomly produced. Then N solutions are compared according to Pareto dominance and a rank
x
is computed for solution x ∈ P.
When multiple classes are used, it is easy to apply different search strategies in different classes, exploration ability of TLBO can be extended and falling local optima can be avoided effectively. In this study, s classes are constructed as follows. Let g = 1, repeat the following steps until all solutions are added into classes: randomly select a solution x ∈ P and remove it from P to class Lg(mod s)+1, g = g + 1, where g (mod s) indicates the remainder of g/ s.
There are s classes L1, L2, ⋯ , L s and these classes have the same size θ, obviously, N = s × θ.
The division of s classes is done as follows. Q i = ∑x∈L i rank x is computed for each class L i , i = 1, 2, ⋯ , s and all classes are sorted in the ascending order of Q i , suppose that Q1 ≤ Q2 ≤ ⋯ ≤ Q s , where Q i denotes the quality of class L i . We define the first s/2 classes as good ones and other classes as common ones.
Algorithm 1 describes the diversified search procedure in multiple classes, where R1, R2 are integer. Solutions of good class are better than those of common class, we set R1 < R2 to assign more neighborhood search to solutions of good class and make them be used fully. Ω is used to store non-dominated solutions by DTLBO. When y is utilized to update Ω, y is included into Ω, all solutions in Ω are compared each other and the dominated ones are removed.
Seven neighborhood structures are used, four of which are for machine assignment and three of which are for scheduling. Neighborhood structure
1:
2:
3:
4: decide a teacher
5:
6: select two-point crossover of machine assignment string or
7: two-point crossover of scheduling string in the same probability
8: perform the chosen crossover between x and its teacher
9: execute NS3 on x
10:
11:
12:
13:
14: determine a teacher
15:
16: perform crossover between x and
17: execute NS1 and NS2 on x sequentially
18:
19:
20:
21:
Three neighborhood searches NS1, NS2, NS3 are constructed by using above neighborhood structures. NS1 is shown below. For solution x, let g = 1, z = 0, repeat the following steps until z = 1 or g > 4: produce a solution
NS3 is described as follows. For solution x, let g = 1, z = 0, repeat the following steps until z = 1 or g > 4: produce a solution
As stated above, the diversified searches in two kind of classes are implemented by the different combination of global search and NS1, NS2, NS3 and the usage of the parameters R1 > R2, as a result, the possibility of falling local optima reduces and the good balance between exploration and exploitation can be made.
1: decide rank x for each solution x ∈ P and sort all solutions in the ascending order of rank x
2: directly add the first α1 × N solutions of P into TCL and choose (α2 - α1) × N solutions with biggest DI x from the next α2 × N solutions of P and include them into TCL
3: determine δ teachers with the smallest rank x for TCL and let other solutions be learners
4:
5: randomly select a teacher, execute two-point crossover of machine assignment string between x and the chosen teacher, obtain a solution y
6:
7: update x and Ω with y
8:
9: perform two-point crossover of scheduling string between x and the chosen teacher and update x and Ω with the obtained y if y ≻ x
10:
11:
12:
13:
14: execute NS4 and NS5 sequentially on x
15: randomly select a member y ∈ Ω, perform a randomly chosen neighborhood structure on y, if y ≻ x, then replace x and update Ω with y when t can be exactly divided by β
16:
17:
Search in temporary class
Algorithm 2 describes the search procedure in temporary class TCL, where δ ≥ 2, β and R3 are integer, α i is real number, i = 1, 2, difference index DI x is computed by
When SDIx,y is computed, job permutation is obtained by sorting all jobs in the ascending order of their r i in scheduling string. We set δ = 3 and β = 10 by experiments.
The diversified searches in s classes L1, L2, ⋯ , L s are implemented independently. In TCL, α1 solutions with the smallest rank x is first chosen and then α2 - α1 solutions with the biggest DI x are decided. The search in TCL is applied to perform the intensified communications among classes to keep high diversity of class and low possibility falling local optima.
Obviously, it is unreasonable to set α2 to be greater than 0.5 because TCL consists of good solutions of population P. We obtain α2 = 0.25 by experiments independently.
NS4 is depicted as follows. For solution x, let g = 5, z = 0, repeat the following steps until z = 1 or g > 8: produce a solution
NS5 is similar with NS1 and described below. For solution x, let g = 1, z = 0, repeat the following steps until z = 1 or g > 5: produce a solution
Temporary class is composed of solution with good quality in convergence and diversity, the reinforced search in TCL can effectively implement the communication among classes and produce new solutions with good quality, as a result, high diversity of population P can be kept.
1: randomly produce initial population P and construct s classes
2:
3: apply classes division
4: perform search in s classes sequentially
5: execute search in temporary class
6:
The search procedure of DTLBO is shown in Algorithm 3. Fig.1 shows flow chart of DTLBO.

Algorithm flowchart.
DTLBO has the following features. s classes are constructed and divided into two types. The diversified searches based on the different combinations of neighborhood search and its parameter are implemented. A temporary class is built by using some solutions with best rank x and DI x and the reinforced search in TCL is applied to perform communications among classes, thus, the diversified searches and the intensified communications are the main strategies of DTLBO. DTLBO has more complicated structure than the general TLBO; however, the above features can result in the great improvements in search efficiency.
Extensive experiments are conducted to test the performance of DTLBO for FDTHFSP with SDST. All experiments are implemented by using Microsoft Visual C++ 2019 and run on 8.0G RAM 2.4GHz CPU PC.
Instance, metrics and comparative algorithms
95 instances provided by Cai et al. [25] are directly adopted for FDTHFSP with SDST and six instances with n = 200 are added. The processing data of these added instances are produced as done in [25]. Table 1 shows information on these instances.
Information on 100 instances
Information on 100 instances
Metrics
Metric
Let Ω* represent the set of non-dominated solutions along the Pareto front and |Ω*| denote the size of Ω*. d (x, y) is defined as the Euclidean distance computed by using vectors of the normalized objectives of x and y, ρ l and IGD are then defined.
Metric ρ
l
indicates the ratio of number of the elements in the set {x∈ Ω
l
|x ∈ Ω* } to |Ω*|. The larger the value of ρ
l
, the more members that Ω
l
provides for Ω*. ρ
l
= 1 indicates that all solutions of Ω* are provided by Ω
l
. ρ
l
can be calculated as follow:
IGD is a quality indicator for evaluating all quality aspects of multi-objective optimization results and obtained by
Three comparative algorithms are applied, which are competitive memetic algorithm (CMA [51]), multi-objective tabu search (MOTS [13]) and CVS [25]. CMA is used to solve multi-objective distributed permutation flow-shop scheduling problem with makespan and total tardiness. After the encoding and decoding of DTLBO are used directly, CMA can directly handle FDTHFSP because the objectives of optimization are similar, search operators for makepan and total tardiness can be used directly without any changes. MOTS is proposed to solve THFSP with preventive maintenance and it can be applied to deal with FDTHFSP after the repair related encoding string is deleted. CVS itself is presented for FDTHFSP with SDST, C max and AI tot .
We also compare DTLBO with TLBO to show the effect of the main strategies of DTLBO. In TLBO, there is only one class, teacher phase and learner phase are implemented as done in common class of Algorithm 1.
The stopping condition is set to be 0.15 × n × K CPU time, where K is the number of stages in hybrid flow shop, in this study, K = 2. It can be found that archive Ω cannot be updated by new solutions when running time of DTLBO is 0.15 × n × K for each instance, so we think DTLBO converge well. it is also found that the above time is also appropriate for all comparative algorithms, so the above condition is adopted. Other parameters of DTLBO are R1, R2, R3, s, N, α1. We conduct Taguchi method [52], which is often used to decide settings of algorithm parameters.
We select instance 71 for parameters tuning. The levels of each parameter are shown in Table 2. There are 27 parameter combinations according to the orthogonal array L27 (36).
Parameters and their levels
Parameters and their levels
DTLBO with each combination runs 10 times independently for the chosen instance. The results of IGD and S/N ratio are shown in Fig.2, in which S/N ratio is defined as -10 log 10 (IGD2). It can be found from Fig. 2 that DTLBO with following combination N = 60, s = 6, R1 = 8, R2 = 4, R3 = 100 and α1 = 0.15 can obtain better results than DTLBO with other combination, so the above settings are adopted.

Results on IGD and S/N ratio.
DTLBO is compared with TLBO, CMA, MOTS and CVS. All parameters except stopping condition of CVS, CMA and MOTS are directly adopted from papers [13, 51]. For TLBO, there are no R1, R3, α i , i = 1, 2, 3 and δ, other parameters are given the same settings as DTLBO. Each algorithm randomly runs 10 times for each instance. Tables 5-8 show the computational results of all algorithms, in which symbols D, CV, CM, M, T indicate DTLBO, CVS, CMA, MOTS and TLBO, respectively. Fig.3 gives box plot of all algorithms and Fig. 4 reveals the distribution of non-dominated solutions of four instances.
The orthogonal array L27 (36)
The orthogonal array L27 (36)
Response table for means
Computational results of all algorithms on
Computational results of all algorithms on
Computational results of all algorithms on metric ρ l
Computational results of all algorithms on IGD
As shown in Tables 5-8 and Fig.3, DTLBO performs better than TLBO and the performance differences between DTLBO and TLBO are notable in statistical sense. All solutions of TLBO are dominated by non-dominated ones of DTLBO on all instances; ρ l of DTLBO is 1 or very close to 1 on more than 70 instances and DTLBO obtains IGD of 0 on at least 80 instances.

Box plot of all algorithms.

Distribution of non-dominated solutions.
The notable performance differences between two TLBOs show that the main strategies such as the diversified search in s classes and the reinforced search in TCL really have positive impact on performances of DTLBO, so the new strategies are reasonable and effective.
It can be found that DTLBO performs better than its three comparative algorithms. DTLBO produces smaller
As stated in section 4.4, the diversified search in s classes and the intensified communications by the search in TCL are two main strategies of DTLBO. These strategies can effectively improve search efficiency by keeping high diversity of population and low possibility falling local optima, as a result, DTLBO can provide promising results, thus, DTLBO is a very competitive method for solving FDTHFSP with SDST.
Distributed scheduling problem with uncertainty is seldom investigated. In this study, FDTHFSP with SDST is studied and a new algorithm named DTLBO is proposed to minimize fuzzy makespan and total agreement index simultaneously. Multiple classes are constructed and categorized into two types according to class quality in DTLBO. Different combinations of global search and neighborhood search are used in two kind of classes. A temporary class with multiple teachers is built based on Pareto rank and difference index and evolved in a new way. Many experiments are conducted and the computational results reveal that DTLBO can solve FDTHFSP with SDST well and provide promising results.
THFSP has very important applications in many real world scenarios such as the electronics, paper and textile industries. FDTHFSP is an extended version of THFSP by considerations on two important managerial things: uncertainty and multiple factories production, and this work provides an effective path to deal with these two things; moreover, DTLBO can be extended to handle other distributed scheduling with uncertainty.
It is important to combine uncertainty and multiple factories production into a scheduling problem and distributed scheduling problem with uncertainty should be considered fully in the near future. We will investigate the application advantages of newly proposed meta-heuristics such as whale optimization algorithm [20] and crow search algorithm [21] etc. and the application possibility of reinforcement learning into meta-heuristic to solve the above problem. Distributed scheduling problem with uncertainty and energy-related objective is also our future topic. We will try to solve the problem by using heuristics and learning-based meta-heuristics.
