Abstract
In order to further improve the enthusiasm of spatial crowdsourcing workers, considering the service quality of workers, different incentive strategies are proposed and tasks are assigned. Firstly, the incentive model is constructed from the unit time revenue of task and online idle time, and the evaluation function of the evaluation model is constructed; Secondly, the task allocation is transformed into a combinatorial optimization problem by delay matching, and an improved glowworm swarm algorithm is proposed to solve the problem by discrete coding, introducing six kinds of mobile modes, adaptive probability matching and infeasible solution processing; Finally, the algorithm is used to solve the task allocation. The experimental results show that compared with the travel cost minimization strategy and random allocation strategy, the positive incentive index of the proposed strategy is improved by 11.79% and 14.60% respectively, and the fair incentive index is improved by 0.83% and 0.22% respectively, which can effectively improve the positive incentive range and incentive fairness of workers.
Introduction
With the continuous development and application of the Internet, smartphones and GPS terminal products, spatial crowdsourcing has emerged and now extensively employed. It has developed more and more mature in our daily life, such as taxi, food take-out, logistics and so on. Spatial crowdsourcing requires one or more workers to respond to the platform within a certain time, and to complete the corresponding tasks at the designated place at the specified time. The platform will allocate the corresponding benefits to the workers after the task is completed [1]. For example, Uber, Didi taxi and Meituan Takeaway, all of them have a high popularity and are widely used in spatial crowdsourcing [2].
At present, the main research fields of spatial crowdsourcing include task allocation, incentive mechanism, quality management and privacy protection [1]. Among them, task allocation is the core of spatial crowdsourcing research, because task allocation is the basis of other research, and ultimately affects the income of workers and users’ sense of experience. The incentive mechanism primarily addresses the incentive to employees and investigates ways to entice additional workers to join the platform while also improving the quality and enthusiasm of their work.
The goals of task allocation generally include maximizing the number of tasks allocated, and minimizing the travel cost of task allocation [3–6]. Location entropy is used to express the number of workers near the location, and prioritize fewer tasks with workers nearby, boosting the number of tasks allocated [3]. Region entropy is used to represent the number of workers in a certain space, which makes less region entropy have higher allocation priority, and the Greedy algorithm is used to solve the problem [4]. A new algorithm of adaptive model cost based on the Greedy algorithm is proposed under the condition of limited budget and a task requiring specific skills [5]. Tasks are randomly assigned to nearest neighbor workers to save travel expenses [6].
Incentive mechanism is generally studied from two aspects: pricing model and incentive model. The pricing model literature is as follows: Dynamic pricing which has been applied in Uber, taking higher price in areas with more passengers to attract workers to participate [7, 8]. A dynamic pricing method based on single threshold [9]. If the number of workers is greater than the threshold, the lower one will be chosen. Markov chain decision-making process is used, which takes into account factors such as driver direction and time, and it is solved with a polynomial-time algorithm [10]. Workers’ reputation is divided into high, medium and low. Those with a good reputation are rewarded well, whereas workers with a bad reputation are not rewarded at all and are gradually phased out [11]. The incentive model literature is as follows, when designers receive a remote task, they should pay the remote subsidies which below a certain threshold to the workers to motivate them [12]. Workers are encouraged to speed up task response by adaptive pricing, and they will get less reward [13]. In an auction pricing model, the workers who participate in the bidding platform shall decide the final allocation according to the quotation [14]. A dual pricing model means that workers and users offer auction prices respectively, the system matches it optimally as expected by both parties [15].
Most of the existing tasks are allocated by using Greedy algorithms, which are locally optimal solutions but do not obtain better solutions globally. Existing incentive-based task allocation typically accounts from pricing and subsidy methods, and most consider task gains rather than real gains per unit time, nor the waiting time of the next task at the end of the task. Although higher pricing can motivate workers, it will reduce the users’ sense of experience. In addition, the auction model takes into account the pricing model with the participation of different market or regional workers, which has low robustness in different regions, it doesn’t have wide applicability in various regions.
Intelligent optimization algorithm is the important algorithm to solve combinatorial optimization problems in recent years, which mainly include particle swarm optimization algorithm (PSO) [16], ant colony algorithm (ACA) [17], artificial bee colony algorithm (ABC) [18], glowworm swarm optimization algorithm (GSO) [19] and so on. The GSO algorithm has the characteristics of few parameters, simple process and easy implementation [19]. From the mechanism of GSO, the attraction between glowworms is inversely proportional to the generate multiple subgroups, and acquire the ability to deal with multiple extreme values. By setting parameters such as the individual perception radius of the algorithm, it can adjust the algorithm form and adapt to the specific problem solving ability [20]. As it has fast operation speed and better optimization ability in multi-dimensional space, so it can get better solutions in different scale datasets and multidimensional data. The glowworm swarm optimization algorithm (GSO) was first proposed in the paper [19], and the firefly algorithm (FA) was first proposed in the paper [21]. The two algorithms have similar principles and slightly different formulas and steps. This paper selects GSO algorithm to solve the problem. But the GSO algorithm converges prematurely and only applies to continuous functions. The discrete GSO algorithm usually improves the performance of the algorithm from coding [20, 23], mobile mode [20, 24], variable step [25, 26] and other aspects.
To sum up, the existing incentive strategies for spatial crowdsourcing task allocation mainly adopt dynamic pricing and auction, etc., and the income index usually used to measure is not considering the income per unit time or online idle time, so it is not the real income. In addition, existing studies lack strategies to transform workers’ service quality into competitive advantages in each task assignment, which is conducive to improving workers’ enthusiasm and service quality. This paper makes a supplement to the existing research, and its main contributions include the following aspects: Constructing a worker incentive strategy based on service quality, designing two incentive strategies from two perspectives of revenue per unit time and online idle time. In each task assignment, it can play a quantitative incentive role according to the level of service quality of workers Two new evaluation functions, positive motivation index and fair motivation index, are constructed to evaluate the scope and fairness of worker motivation. An IGSO (Improved glowworm swarm) algorithm is proposed and experimentally compared with other similar algorithms on different datasets, IGSO algorithm has better convergence and stability. The IGSO-ISSCTA (Improved glowworm swarm algorithm on space crowdsourcing task allocation from incentive strategy) is proposed to solve the task assignment under the worker incentive policy with the improved glowworm swarm algorithm, which transforms the spatial crowdsourcing task assignment into a combinatorial optimization problem through the delayed matching approach. Through doing experiments on different datasets, the evaluation function values and benefit values of task assignment under different incentive strategies are calculated to verify that the strategy in this paper can incentivize workers more extensively and fairly when compared to the travel cost minimization strategy and the random assignment strategy.
Task allocation model based on service quality incentive
Model definition
The research scenario hypothesis model meets the following preconditions: (1) Only online spatial crowdsourcing workers can receive task assignments. (2) The workers who are in the process of completing the task and cannot receive new tasks. (3) The server cannot be changed after task assignment. (4) The server only considers tasks that existed online at the time, and tasks are considered to exist online from the stage when they start to be posted until the deadline. (5) Suppose that the task income is proportional to the distance, see definition 2 for details. The definitions of model related concepts are as follows:
Task revenue per unit time is allocated by task corresponding to unit time, ets uw = r u /t uw . Task allocation revenue takes service quality into consideration, t uw means wasting time for workers to complete corresponding tasks.
Construction of incentive model
(1) Strategy 1 Considering the impact of traffic congestion and unit time benefit of the task
S. t.
The task reward usually has a fixed starting price within the basic mileage in daily life. After exceeding the basic mileage, it generally increases in positive proportion with the increase of distance. For example, the return of Didi express is closely related to mileage. However, each task is affected by the traffic congestion in a different time and space environment, and the actual time consumption is quite different.
Combined with the starting point and endpoint of the task and the corresponding task release time, different tasks correspond to different traffic congestion factors. By retrieving the real-time data of the traffic congestion factor, the time consumed to complete the task can be estimated, so that workers with higher service quality can obtain tasks with higher revenue per unit time as much as possible. In spatial crowdsourcing task allocation, the travel cost of workers arriving at the mission site is usually considered. The optimization objectives of strategy 1 are such as the Equations (3).
The Equation (1) means the maximum value of the total revenue per unit time. The Equation (2) means the minimum value of the total travel cost corresponding to the matching of all tasks and workers in a task allocation.The Equation (3) represents the set of users and corresponding workers, unit time income including service quality score weight, worker income, the calculation formulas of the distance of worker and task, and the value range of relevant variables (introduced in the model definition) in turn. Among them, c w = 0 means that the worker is not online or is completing the task and cannot receive the task assignment, and c w = 1 means that the worker is online and idle so they can accept the task assignment.
(2) Strategy 2 Considering the online idle time at the end of the task
In real life, workers tend to work in areas with a shorter waiting time. Even if the current task returns are higher, but the probability of receiving orders at the end point is low, then workers’ willingness to receive orders will be lower. Within a spatiotemporal unit, the relative number of tasks and workers will determine the average waiting time of tasks. The more tasks and the fewer workers lead to the greater the supply-demand ratio, and the expectation of average waiting time of task allocation will be shorter. According to the current task allocation time and the traffic congestion coefficient, the estimated arrival time can be calculated. At present, some scholars predict the occurrence of future workers and tasks based on the historical data through deep learning [27]. According to the predicted supply-demand ratio, the expected online idle time can be calculated. The online idle time of the end of the task is considered in the current task assignment. The worker who has higher service quality are assigned the task of shorter the online idle time, so they can accept more tasks at the same time. The optimization objectives of strategy 2 are as follows:
s. t.
The Equation (4) means the minimum value of the total online idle time. The Equation (5) means the minimum value of the total travel cost of all tasks and workers in a task assignment.The Equation (6) is the formula for calculating the online idle time with the weight of service quality score and the corresponding supply-demand ratio at the end of a task. The range of related variables are shown in Equation (3).
Two evaluation indexes, the positive incentive index and the fair incentive index, are constructed to evaluate the effect of incentive strategies from the perspectives of coverage and fairness.
(1) Positive incentive index
It is used to measure the coverage of positive incentive workers in the matching result of a task assignment. Specifically, it refers to the result of each assignment. After the descending order of the service quality score of workers, the income of two adjacent tasks is compared to observe whether the workers with higher service quality scores get more income. As shown in Equation (7).
As shown in Table 1, in one task assignment, a total of 6 workers and task combinations are formed. After the assignment, they are ranked in descending order according to the score of service quality. If the workers in order 1, 3 and 4 are greater than the adjacent 2, 4 and 5 respectively, they are counted as three positive incentives, with a total of 5 comparison times. According to Equation (7), the positive incentive index is 0.6.
Calculation example of incentive index
(2) Fair incentive index
Worker quality of service scores is not the same. In order to motivate workers and consider fairness, the improved Gini coefficient is introduced as the evaluation index to measure the fairness of incentive strategies.
Gini coefficient is the coefficient used to measure the inequality of social income distribution in economics developed by economist Gini on the basis of Lorentz curve [28]. The smaller the Gini coefficient g is, the higher the degree of fairness is; The greater the Gini coefficient, the lower the degree of fairness. The area between the Lorenz curve and the absolute equality line of income distribution is called unequal distribution area (area A); The area between the Lorenz curve and the absolute inequality line of income distribution is called the actual distribution area (area B), and the ratio of the two is Gini coefficient G. There are many calculation methods for Gini coefficient, and there is a certain error with the change of M. Equation (8) can obtain the approximate value with higher accuracy [28, 29]. The fair incentive index gins are as follows: Equations (9)
Under the fair incentive strategy, workers with higher service quality scores get a higher proportion of income. After dividing the unit time income and work quality score, the unit time income corresponding to each unit service quality score is obtained, as shown in Equation (9). In Table 1, the corresponding fair incentive index is 0.548 by calculation with z w and 0.642 by calculation with zets uw . Thus, the results calculated with z w are higher than those calculated by zets uw , which shows that the distribution is more unfair.
When spatial crowdsourcing tasks are assigned, workers and tasks appear in real-time at any location in space, with a great deal of randomness. If you do a real-time allocation, you can’t change the allocation after the allocation, making it difficult to achieve high allocation efficiency. The “delayed matching” strategy [30] can be used to improve the allocation efficiency and profitability. The specific idea is that after waiting for a certain length of time stamp, the set of tasks that have not reached the deadline in a spatial unit and the set of online workers are matched with each other to find a better allocation result. If there are m workers and n tasks in a spatiotemporal unit k (if m ⩾ n), each assignment forms a number of combinations as
GSO algorithm and improvement
GSO algorithm
GSO algorithm has fluorescein update, selection of movement direction, position update, decision domain update and other stages. The fluorescein corresponds to the objective function, thus combining the algorithm with the objective optimization, as follows:
Of which, l i (t) is fluorescein, ρ is the volatilization coefficient of fluorescein, γ represent respectively enhancement of fluorescein coefficients, J (x i (t)) represent objective function values; N i (t) is the set formed by glowworms at the t iteration in the decision domain, which are brighter than glowworms, ρ ij (t) represents the probability of the current glowworm x i moving to a brighter glowworm x j ; is the perception radius coefficient, is the threshold for the number of fireflies in neighborhoods, and is the sensing radius.
The discrete glowworm algorithm is improved from the aspects of initial coding, introducing six movement modes, adaptive probability matching and infeasible solution processing to make it be suitable for task allocation applications.
Step 1 Discrete coding and decoding
The initial solution is encoded as follows: Subscripts of each dimension represent the number of users, numbers on each dimension represent worker number, such as [2 5 6] indicates worker 2 service user 1, worker 5 service user 2, worker 6 service user 3, and so on.
Step 2 Six movement strategies
For the purpose of ensuring multivariate optimization of glowworm I, in addition to the algorithm’s moving strategy, a set of six modes of movement were developed, including exchange, two adjacent transformations, inversion, random reservation and reverse learning. According to the Table 2, assuming that the 2nd and 7th dimensions of the original glowworm i move, six different strategies are used to get the corresponding new location. Among them, exchange, left-adjacent transformation and right-adjacent transformation are local moves, inversion, random reservation and reverse learning are large-scale moves.
Examples of six movement strategies
Examples of six movement strategies
Step 3 Adaptive probability matching
In order to select the mobile mode better and improve the optimization ability of the algorithm, the adaptive probability matching strategy is introduced to select the mobile mode [31], the details are shown in Equation (15):
represents the recent performance of the g mobile mode, is adaptation rate, represents the contribution of the nth mobile mode in the t generation. In order to keep the convergence rate of the algorithm, the probability matching is updated once every 10 iterations, the probability of using the moving mode g is as Equation (16):
Step 4 Distance of discrete glowworms
Measure step length through using Hamming distance, suppose that glowworm and glowworm have the same value in the dimension, is recorded as 0, different is recorded as 1, the sum of the values on each dimension of the glowworms is the distance between the two glowworms, see Equations 17 18 for details, of which.
Step 5 Not feasible for solving
As shown in Table 3, glowworm flies toward glowworm, which has a better neighborhood, and converges in dimensions 1 and 4, when dimensions 1 and 5 are both 1 and dimensions 4 and 8 are both 7. The scenario application is that each worker can only serve a certain user at the same time, so this is not a feasible solution. The processing is to keep the 1st and 4th dimensions after the change of glowworm, and regenerate the non-repeating numbers in the duplicated 5th and 8th dimensions.
Moving treatment of infeasible solutions
IGSO-ISSCTA algorithm objective function
The objective functions of strategy 1, strategy 2 and strategy 3 are shown in Equations (20), respectively. Where are the coefficients that let the two parts of the data in the objective function out of a uniform measure are the parameters that reflect the importance of the dual objectives in the objective function. Equation (19) is the optimal maximum value, and Equation (20) is the optimal minimum value.
As shown in Fig. 1, the IGSO-ISSCTA algorithm consists of the ISSCTA model and the IGSO algorithm in the following steps.

IGSO-ISSCTA framework.
Step 1 Traffic data pre-processing includes patio-temporal cell segmentation and normalization of data, which is mapped to the specified spatiotemporal cell;
Step 2 Divide the given space-time into space-time units, and give a unique number to the users and workers in each space-time unit;
Step 3 According to the coding rules, initialize the glowworm population, update the population fluorescein, and calculate the value of the objective function corresponding to different incentive strategies;
Step 4 The glowworm swarm is guided by the learning factor parameters and chooses to move towards the brightest glowworm in the neighborhood or according to six improved adaptive probability matching strategies;
Step 5 Solve the objective function according to the incentive strategy, and continuously update the solution space according to the objective function;
Step 6 Check the new solution after the move, checking and processing the infeasible solution;
Step 7 Compare the objective function value with the bulletin board, and if better, replace the bulletin board value;
Step 8 Check whether the termination conditions are met, and if so, output the task assignment plan to motivate the worker.
Suppose the size of the population is m, number of users is m, and number of workers is m1 (assuming m > n). The number of matches is m, each glowworm (m dimensions) represents one task assignment, and the maximum number of iterations is t. The initialization IGSO population time complexity is O(n), and the utility time complexity for calculating the task assignment strategy for one incentive worker is O(m), so the total task assignment time complexity is O(n*m) in each iteration; The time complexity of calculating the distance between firefly individuals in each iteration is O(m); Although there are a variety of ways to move, choose the most ways to move when each dimension is moved, the maximum time complexity of calculating the individual movement of glowworms in each iteration is O(n*m), IGSO-ISSCTA algorithm time complexity is O(n*m*t). The initialized IGSO algorithm population space complexity is
Experiment and analysis
Experiment and data description
(1) Data description
The experiment was done in Matlab R2021a with PC parameters of system Windows 10, RAM (16 G), CPU Intel(R) Core (TM) i5-10600 (4.1 GHz). IGSO parameters, IGSO-ISSCTA parameters, and datasets as detailed in Table 4. The first two parts of the parameters are the IGSO algorithm parameters, referring to the corresponding literature and parameter experiments, respectively, and the constants are the coefficients in the objective function of the IGSO algorithm, and this paper focuses on worker excitation, so takes a larger value.
Experimental data explanation
Experimental data explanation
There are two data sets, D1 and D2. D1 is the simulated data set, each data in D1 is randomly generated in the range of values, the size of the simulated data set is small; D2 is a synthetic data set, using the cab trajectory of Shanghai on February 20, 2007 provided by literature [32], containing ID, time, latitude and longitude data, etc. 4316 in total; The traffic congestion index is obtained from the dataset [33] of traffic congestion coefficients for May 29, 2021 for each region of Shanghai from Baidu Maps Smart Traffic, with a total of 4208, The other datasets are randomly generated within the definition, and the first 4000 data are taken to synthesize the dataset. IGSO-ISSCTA algorithm contains multiple data. Data of location, traffic congestion coefficient, etc. They are mapped to the corresponding intervals with real data. Speed, service quality score, supply-demand ratio and other data are randomly generated, the starting price and default online idle time are taken as constants.
(2) Experimental description
The experiment is divided into 3 parts. Part 1 experiments validate the effectiveness of the two incentive strategies and compare them with two commonly used allocation strategies. The validation of the effectiveness of strategy introduces the cumulative earnings of two workers in addition to comparing the evaluation index of one allocation to illustrate the effectiveness and fairness of the incentive strategy at macro and micro levels. The second part of the experiment mainly compares the performance of the IGSO algorithm with the other three similar algorithms, including the results of the objective function search on different data sets, the effect of different movement methods and their combination on the performance and parameter experiments. The third part of the experiment involves solving the final result and comparing it to other algorithms in a big data scenario that is relevant to real-world applications.
Strategy 3 [6] targets the total cost of travel of per allocation and Strategy 4 is a randomly selected strategy. Strategy 3 is the most commonly used optimization goal in the current research on spatial crowd-sourcing task allocation, while Strategy 4 is the most commonly used strategy in daily life. Therefore, these two strategies are selected as comparative strategies. In the comparison experiment, the same improved GSO algorithm and data are used to obtain different allocation schemes by using these two objective functions and the objective function of the strategy in this paper. Then the evaluation function is calculated and compared to obtain the incentive results of different strategies for workers.
Validation of strategy 1
As shown in Table 5, from the positive incentive indices obtained on the five datasets on D1, it can be seen that the mean value of the positive incentive indices obtained from strategy 1 is always greater than that of strategies 3 and 4, indicating that strategy 1 can play a positive incentive role to a greater extent, so that more workers with high service quality can be assigned tasks with higher returns per unit of time. The standard deviations corresponding to strategy 1 are smaller than those of strategies 3 and 4, indicating that strategy 1 has better stability of the positive incentive index corresponding to task assignment. On the datasets dateset1, dateset2, dateset4 and dateset5, the optimal values of strategy 1 are better than those of strategy 3 and 4, indicating that strategy 1 can find a larger range of positive incentives for the task assignment scheme.
Calculation results of strategy 1 and comparison strategies on D1
Calculation results of strategy 1 and comparison strategies on D1
From the fair incentive indices obtained on the five datasets, it is clear that overall strategy 1 achieves better results than strategy 3 and 4; On datasets dateset2 dateset5, the mean value of the fairness indices obtained for strategy 1 is smaller than those obtained for strategy 3 and 4, indicating that the task assignment scheme of strategy 1 is fairer and enables workers with higher quality service scores to have a higher probability of achieving the corresponding proportional benefit. On the datasets dateset1, dateset2, dateset4 and dateset5, the optimal value of strategy 1 is better than that of strategy 3 and 4, indicating that strategy 1 can find a more equitable task assignment solution; On dataset1, the mean value of strategy 1 is lower than that of strategy 3 which indicates that strategy 1 does not yield the fairest results in every dataset assignment scheme, which may be due to factors such as closer worker service quality scores and insignificant disparity in unit time gain between tasks. On the dateset1 and dateset5, the mean value of strategy 1 is slightly lower than strategy3 and 4, indicating that the stability is slightly weaker than strategies 3 and 4, and the stability of strategy 1 is stronger than strategy 3 and 4 on dateset2 dateset4, indicating that the stability of the solution is better for strategy 1 in most cases.
As the size of the dataset increases, the quality of the results obtained from strategy 1 is closer than those obtained from strategies 3 and 4 for both positive and fair incentive indices, and the gap in incentive indices keeps decreasing. For example, strategy 1 in dateset1 seeks a significantly higher positive incentive index than strategies 3 and 4, with 30.17% and 43.8% improvement respectively; in dateset2 compared to strategies 3 and 4, with 17.43% and 21.26% improvement respectively, and in dateset5 compared to strategy 3 and 4, only 7.55% and 5.12% improvement respectively. The fair incentive indices derived from the three strategies, except for dateset1, also show a trend of decreasing disparity as the size of the dataset becomes larger.
In summary, strategy 1 is more effective than the positive incentives of the two comparison strategies. Compared with strategy 3 and strategy 4, the positive incentive index of strategy 1 is improved by 11.79% and 14.60% respectively, and the fair incentive index is improved by 0.83% and 0.22% respectively. Strategy 1 task assignment results in a wider range of incentives to participating workers, and the equity incentive index indicates that the task assignment scheme of Strategy 1 is also fair. This shows that it is effective to take worker service quality and unit time benefit as objective function elements of task allocation. But the incentive advantage of strategy 1 over the comparison strategy tends to decrease continuously as the data set increases.
As shown in Table 6, from the positive incentive indices obtained on the five datasets, it can be seen that the mean, optimal values obtained for strategy 2 are always greater than or equal to strategies 3 and 4, indicating that strategy 2 can play an incentive role to a greater extent, allowing more workers with high service quality to be assigned tasks with shorter expected online idle time; On the datasets dateset1∼dateset4, the standard deviation of strategy 2 is smaller than strategies 3 and 4, indicating that strategy 2 is more stable.
Calculation results of strategy 2 and other strategies on D1
Calculation results of strategy 2 and other strategies on D1
From the fair incentive indices obtained on datasets dateset1∼dateset4, it can be seen that overall strategy 2 has a smaller mean value of the fairness indices obtained than strategies 3 and 4, indicating that the task assignment scheme of strategy 2 is fairer. The optimal value of the fairness index found by strategy 2 is smaller than that found by strategies 3 and 4, indicating that strategy 2 can find a more equitable task assignment solution; On the datasets dateset1∼dateset3, the corresponding standard deviations of strategy 2 are smaller than those of strategies 3 and 4, indicating that strategy 2 has a better stability.
As the size of the data set expands, the advantages of strategy 2 continue to shrink compared with other comparison strategies, whether it is a positive incentive index or a fair incentive index. The standard deviation of the positive incentive index of strategy 2 solved on dateset5 is greater than that of strategy 3, and the fair incentive index solved on dateset5 is greater compared to strategy 3, and the optimal value is greater compared to strategy 4, which indicates that the stability and effectiveness of strategy 2 on dateset5 are not good.
Compared with strategy 3 and strategy 4, the positive incentive index of strategy 2 is improved by 9.29% and 12.08% respectively, and the fair incentive index is improved by 0.71% and 0.41% respectively. In summary, the task assignment result of strategy 2 can motivate the participating workers to a greater extent than the two comparison strategies, comparing with the two comparison strategies, so that workers with high service quality scores are more likely to receive the current task with less idle time in the end. And workers with high service quality scores can receive the waiting idle time corresponding to their own high scores, taking into account the fairness. This shows that it is also valid to consider worker service quality and online idle time as objective function elements of task allocation. But the incentive advantage of strategy 2 tends to decrease or even disappear as the data set increases.
Because each worker has different competitors at each assignment time, in order to study the cumulative benefits of strategy incentive from the perspective of the individual, this paper pays attention to the actual cumulative benefits of two special workers, namely, the workers with the highest and lowest service quality scores, w1 and w2. Assuming that the service quality of workers is not affected by the evaluation of recent services, under the premise of constant service quality score, five task assignments are conducted on dataset1∼dataset5 respectively. Each task allocation generates 20 groups of assignment results according to different strategies. The mode of the corresponding tasks matching by the workers is taken as the decision-making scheme of final task allocation, and the calculation results are shown in Table 7.
Cumulative income of workers under different strategies of w1 and w2
Cumulative income of workers under different strategies of w1 and w2
From the average mode of the five strategies of worker w1, the task assignment results of strategy 1 are concentrated, the stability is higher, strategy 3 is the lowest, which conforms to the strong randomness of the random strategy. The cumulative revenue is not consistent with ets uw , the order from high to low is (4,3,2,1), the order of unit time revenue ets uw and unit revenue considering online idle time iets uw is (1,3,4,2), which is mainly affected by the cumulative time. The comparison demonstrates that ets uw and iets uw can more accurately represent the income of the employees. Compared with strategy 3 and 4, strategy 1 have increased 58.64%, 30.98% respectively, with an average increase of 107.34%, 71.20% respectively, indicating that strategy 1 can expand the positive incentive range. The cumulative idle time corresponding to strategy 2 is the minimum, which indicates the effectiveness of strategy 2. However, the income index of strategy 2 corresponding to the current task is low, which indicates that the allocation is carried out according to strategy 2. Although the expected idle waiting time of the task end point is very short, the sum of the current task allocation is ignored.
From the average mode of w2 strategies, strategy 2 is more stable, task allocation results are more concentrated, and strategy 3 is the lowest. The order of cumulative income, ets uw and iets uw is not consistent, and the cumulative income is (2,3,4,1) from high to low, ets uw and iets uw from high to low (2,4,3,1). The comparison shows that ets uw and iets uw can more truly reflect the income of workers. Strategies 1 are compared with strategies 3 and 4, ets uw decreased by 10.17%, 21.31% respectively, iets uw decreased by 10.17%, 21.31% respectively. It reveals that strategy 1 can effectively adversely inspire the employees with poor ratings of service quality. The cumulative online idle time corresponding to strategy 2 is the longest, which indicates the effectiveness of strategy 2, but the corresponding income indicators of strategy 2 are the highest. It indicates that task allocation is conducted according to strategy 2. Although the idle waiting time of task destination is long, it can match the low service quality score, but the sum of tasks that are currently assigned is very high, which can not play the due punishment role.
To sum up, experiments verify that the two strategies are effective. The comparative experiment of the cumulative income of two workers shows that under the pricing background of the positive correlation between task income and distance, strategy 1 is significantly better than strategy 2 in terms of cumulative income and incentive effect. Strategy 2 can reduce online idle time significantly and save time to accomplish more operations, so it is suitable for application scenarios with less waiting time.
The main results of the above experimental calculation shows that the average positive excitation index is effective on five datasets. But relatively speaking, the effectiveness on dataset1 to dataset3 is better than that on dataset4 to dataset5. And the above experimental calculation show that the average fair incentive index is effective on dataset1 to dataset4, and has no significant effect on dataset5. Furthermore, in order to achieve better results, it is suggested that m value is within 40. The specific method is to control the size of m by segmenting the spatiotemporal units when the spatiotemporal data is preprocessed.
Algorithm performance experiment
Performance comparison of algorithm
In this paper, we use the “delay matching” strategy to solve the problem of dynamic task assignment online into static cumulative combination optimization. In recent years, some scholars have solved the problem of combination optimization by improving IDFA [22] and IFA [23]. In addition, some scholars use the improved Greed Random (GR) [34] to calculate the spatial crowd-sourcing task allocation utility. So IDFA, IFA and GR are selected as the comparison algorithms, and the IDFA algorithm and IFA algorithm have the similarity of mechanism and GR has the comparability in application.
As shown in Fig. 2, the performance of the IGSO algorithm and the other three comparison algorithms is tested by repeating 20 experiments on dataset1 to dataset6 and iterating 1000 times. In terms of the final objective function optimization results of the four algorithms, the IGSO algorithm is always better than the other three algorithms, and it is close to IDFA on dataset1, dataset2 and dataset4, which is obviously better than the IFA algorithm and GR algorithm. The results of the IGSO algorithm on dataset3, dataset5 and dataset6 are better than the other three algorithms. From the convergence speed of the four algorithms, within 0 ∼ 20 iterations.

Performance comparison between IGSO and comparison algorithms.
IGSO algorithm is slightly worse than IDFA algorithm on dataset1 and datesat2 and IGSO algorithm is equivalent to IDFA algorithm on dataset3 and dataset4, and they are better than IFA algorithm and GR algorithm; In 20 ∼ 400 iterations, the four algorithms can find better and stable solutions, and IGSO algorithm is slightly or significantly higher than the other three algorithms. To sum up, the IGSO algorithm outperforms the competition in terms of effectiveness and convergence.
Through 20 experiments on dataset1, dataset3 and dataset5, and iterating 1000 times, and then Fig. 3 is obtained by taking the average. In order to compare the performance of six kinds of mobile modes, the learning factor is set to 0. It means that the GSO algorithm is not used to optimize its own process. Among them, reverse learning is just a solution obtained by a reverse learning based on the original solution, and does not have the ability to update the solution repeatedly, therefore the learning factor is set to 0.1.

Performance comparison of six mobile modes.
As shown in Fig. 3, from the final results of optimization, the objective functions of the inversion method on three data sets are the highest, and of the reverse learning method are the lowest. From the convergence rate of optimization, the reverse learning method is obviously weaker than other methods. Within 0 ∼ 100 iterations, the left adjacent transformation and the right adjacent transformation are better in dataset1, the inversion and insertion are better in dataset3, and the inversion and exchange are better in dataset5. Within 100 ∼ 500 iterations, on dataset1, the optimization results of the right adjacent transformation are better in 100 ∼ 200 iterations, and the optimization results of left adjacent transformation are better in 200 ∼ 500 iterations; On dataset3, the optimization results of inversion and left adjacent transform are better in the range of 200 ∼ 500; On dataset5, the optimization result of interchange is better. Within 500 ∼ 1000 iterations, the overall effect of inversion on the three data sets is better. To sum up, in different iteration stages, the performance of the six kinds of mobile modes is different, and each has its own level. On the whole, the inversion mode is better, and the reverse learning mode is worse.
In order to further verify the effectiveness of adaptive probability matching combination, three combination allocation methods are selected for comparison, namely average allocation, local policy priority and global policy priority. As shown in Fig. 4, average distribution means that the utilization rates of the six kinds of mobile modes are the same or close; Local strategy priority means that the first three strategies are more likely to be used, while the last three strategies are less likely to be used, because the first three mobility modes are local dimension mobility, which can be considered as local optimization, while the last three mobility modes are global optimization because many dimensions are moving; Global strategy priority means that the first three strategies are less used and the last three mobile modes are more likely to be used. The use probability ranges of the six mobile modes are shown in Table 8.

Performance comparison of four combinations.
Utilization rate of different mobile mode combinations
The algorithm with different combinations of four mobile modes is repeated 20 times in dataset1, dataset3 and dataset5, with 1000 iterations. From the final optimization results, except for the smaller dataset1, the adaptive probability matching method is slightly larger than the local strategy priority method, and the adaptive probability matching method is significantly better than the other three methods on dataset3 and dataset5; The results show that the local strategy priority is worse than the adaptive probability matching, but better than the average allocation and global strategy priority; Average allocation and global policy priority are different in three datasets. In terms of convergence rate, the local strategy has the highest preferential convergence rate in 0 ∼ 10 iterations, and the other three strategies have the same preferential convergence rate, because the average allocation is used in the initial value of adaptive probability matching; Within 10 ∼ 50 iterations, the adaptive probability matching takes into account the contribution of each mobile mode, so it increases the utilization rate of better mobile mode, so it achieves better results.
The main parameters of the IGSO algorithm include neighborhood thresholds, sensing radius and learning factors. In the IGSO algorithm debugging experiment, when other parameters remain unchanged, dateset3 is the better and larger scale data set of the IGSO-ISSCTA algorithm, so it is more likely to use this number of data sets in practical application, so dataset3 is selected as the parameter experimental data set.
The independent experiment was repeated 20 times, and Fig. 5 was obtained. The results show that the solution of the IGSO algorithm in the objective function fluctuates with the increase of neighborhood threshold, and the maximum solution is obtained when the value is 8; The results show that the solution of the IGSO algorithm in the objective function first increases, then decreases and remains unchanged with the increase of sensing radius, and the performance is the best when the sensing radius is 30; The results show that the solution of the IGSO algorithm in the objective function first increases and then decreases with the learning factor, and the performance is the best when the learning factor is 0.8.

The influence of different parameters on the performance.
D2 is a large data set, including dateset6 ∼ dateset10. According to the discussion of m value, the IGSO-ISSCTA algorithm has a good effect on m ∈ (0, 40). By controlling the time interval and space unit size, the size of space-time unit is controlled to make m ∈ (0, 40). After space-time division, the objective function takes the cumulative value, and the positive incentive index and fair incentive index take the average value. And then Table 9 can be obtained. The objective function value of the IGSO algorithm is always higher than that of comparison algorithm on large-scale data sets, IDFA algorithm and IFA algorithm are lower, and GR algorithm is the lowest. The positive incentive index of strategy 1 is always larger than strategy 3 and strategy 4, and the fair incentive index is always smaller than strategy 3 and strategy 4, which indicates that the scope of positive incentive is larger and the fairness based on service quality allocation is better. Experimental results show that the IGSO-ISSCTA algorithm is still effective in large-scale data sets by dividing space-time units and accumulating calculations.
Calculation results of IGSO-ISSCTA algorithm on D2
Calculation results of IGSO-ISSCTA algorithm on D2
In this paper, the incentive problem in spatial crowd-sourcing task allocation is studied. For the first time, the real benefits of workers are considered from the aspects of task per unit time and online idle time, and a task allocation model based on worker service quality is constructed from these two aspects. In addition, compared with the traditional online Greedy algorithm, this paper designed an algorithm combining “delay matching” with the improved GSO algorithm to calculate the model results. Compared with the travel cost minimization strategy and the randomly selected strategy, the strategy proposed in this paper can motivate workers in a larger range of positive incentives and more equitably. Experimental results on data sets show that the proposed method is effective. However, the effect of the method in this paper is good on small-scale data sets, while the effect decreases on large-scale data sets. At the same time, it should be noted that other Intelligent optimization algorithms such as PSO, ACA and ABC, which can also solve the problem. In the following phase, we will continue to enhance the algorithm and model in order to broaden the spectrum of applications for the approach described in this work, and we will experiment with additional intelligent optimization methods in order to tackle the issue of spatial crowd-sourcing task allocation
Footnotes
Acknowledgments
This work was supported by the Anhui Provincial Natural Science Foundation under Grant No. 1908085QG298, the National Nature Science Foundation of China under Grant No. 91546108, No.71521001, the Fundamental Research Funds for the Central Universities No. JZ2019HGTA0053, No. JZ2019HGBZ0128, and the Open Research Fund Program of Key Laboratory of Process Optimization and Intelligent Decision-making (Hefei University of Technology), Ministry of Education.
