Abstract
To date, the research on agriculture vehicles in general and Agriculture Mobile Robots (AMRs) in particular has focused on a single vehicle (robot) and its agriculture-specific capabilities. Very little work has explored the coordination of fleets of such vehicles in the daily execution of farming tasks. This is especially the case when considering overall fleet performance, its efficiency and scalability in the context of highly automated agriculture vehicles that perform tasks throughout multiple fields potentially owned by different farmers and/or enterprises. The potential impact of automating AMR fleet coordination on commercial agriculture is immense. Major conglomerates with large and heterogeneous fleets of agriculture vehicles could operate on huge land areas without human operators to effect precision farming. In this paper, we propose the Agriculture Fleet Vehicle Routing Problem (AF-VRP) which, to the best of our knowledge, differs from any other version of the Vehicle Routing Problem studied so far. We focus on the dynamic and decentralised version of this problem applicable in environments involving multiple agriculture machinery and farm owners where concepts of fairness and equity must be considered. Such a problem combines three related problems: the dynamic assignment problem, the dynamic 3-index assignment problem and the capacitated arc routing problem. We review the state-of-the-art and categorise solution approaches as centralised, distributed and decentralised, based on the underlining decision-making context. Finally, we discuss open challenges in applying distributed and decentralised coordination approaches to this problem.
Keywords
Introduction
Research in the area of Agriculture Mobile Robots (AMR) [26,27] has primarily focused on single robot systems and challenges that the agriculture environment presents for standard robot tasks, such as: navigation, control, sensing, image processing, platform stability, terrain handling and system integration, including balance of computation between edge and cloud resources. Very little work has explored the domain of efficient and scalable AMR fleet coordination, taking into consideration three distinguishing features of the agriculture domain: (1) the need for the allocation of multiple scarce resources to given tasks simultaneously due to inter-related sets of functional and environmental constraints on vehicle components, farm implements (accessories that survive the task) and raw materials that can be depleted during the execution of the task;1
Note that “raw material” is separate from the energy material (e.g. fuel) required to run the vehicle. See Section 3 for detail.
In practice, agriculture fleets are conventionally coordinated by dividing an area of interest into sectors, each one assigned to a single human controller. Each controller coordinates the fleet’s vehicles, tracks their performance in real time and responds to contingencies in his/her assigned sector. The higher the fleet’s operational costs, the more importance is given to the fleet’s coordination. Planning and scheduling of the tasks assigned to the routes of vehicles (AMRs and tractors) and related vehicle-implement configurations,2
For example, a tractor pulling a tiller – see Section 3 for detail.
Even with multiple AMRs running and coordinating simultaneously with one another on the same farm (as well as shared across multiple farms), fully autonomous farming is still an open challenge. Advances in ICT and agriculture technologies allow for a higher level of autonomy of AMR fleets where integrated decision-making potential at present is unrealised. With the objective to increase the opportunity for autonomy in agriculture fleets, in this paper, we study dynamic and decentralised coordination of agriculture fleets and propose the Agriculture Fleet Vehicle Routing Problem (AF-VRP), motivated as follows.
The classic Capacitated Vehicle Routing Problem (CVRP) traces its origins to 1959, when Dantzig and Ramser introduced the “Truck Dispatching Problem” [14], a generalization of the now famous Traveling Salesman Problem (TSP) [20] – finding the shortest path that travels through n points, all known a priori (i.e. before the journey begins). Briefly stated, the CVRP adds the constraint that the vehicle travelling this shortest path must make a set of deliveries (
Our new variant that we propose in this paper, AF-VRP, possesses some particular characteristics that add to the complexity of the classic CVRP and, when taken together, inspire this new CVRP variant. First, in the AF-VRP, we model the routing of vehicles through a graph composed of nodes and capacitated edges. Contrary to the classic Vehicle Routing Problem, the demand is defined as a task to be performed on an edge and not on a node, similar to the Capacitated Arc Routing Problem (CARP). Thus, the proposed AF-VRP problem can be viewed as an extension of the CARP (e.g. [24]). While nodes represent specific locations, edges represent the path between nodes. Thus tasks like spraying, which take place while travelling, are modelled as occurring on the “arcs” (edges).
In the AF-VRP, each task may require a compatible accessory and payload(s) for execution. The solution involves assigning not only distinct locations (edges) to visit, but also assigning vehicles, vehicle “accessories” (i.e. farm implements) and payload for each vehicle and each task, where accessories can be re-used by other vehicles at a later time for other tasks while payloads are consumed during task execution. There may be multiple tasks on an edge, each one with different accessory and payload requirements. Second, the problem is dynamic, meaning that the tasks’ requirements at each location are probabilistic values that may not be known a priori with certainty. These tasks may be added to or removed from the vehicles’ routes (list of edges to be visited by a vehicle) online, i.e. during the mission (journey). Third, the problem is decentralised, since different entities with possibly conflicting objectives may own either the vehicles in the fleet (system) and/or the demand locations.

Example of mutually connected farming fields in a region of interest. The blue lines indicate permanent transportation network (i.e. roads). The red lines indicate examples of in-field transportation routes (i.e. tractor lines). The white circles represent the depots.
For instance, let us consider a simplified motivating example in which there are three farmers in a region of interest, each one being an exclusive owner of his/her fields that are neighbouring the fields of other farmers, as seen in Fig. 1. Given is a planning time horizon in which each farmer has assigned a set of tasks to be performed at his/her fields (e.g. spraying, irrigation, monitoring, etc.). The cost and duration of each task may vary depending on the weather conditions and other factors in each period of the given time horizon (e.g. early morning, late morning, afternoon, evening, night). In case a farmer does not own (a sufficient quantity of) resources for the execution of his/her tasks, delaying the task execution or not carrying it out at all may result in considerable crop losses. One of the reasons for the lack of resources may be a too short duration of a time window with favourable weather conditions for the execution of a task. If other farmers in a region of interest have the required resources, farmer-to-farmer collaboration will be a viable way to perform the tasks efficiently and effectively. In general, agriculture resources can be divided into implements, tractors and raw materials. The objective of the AF-VRP problem is, for each task, to allocate the adequate vehicle, implement, and raw material configuration and to route the vehicle-implement-raw material combinations through the fields while minimising the overall fleet costs and satisfying resource and task constraints. The AF-VRP solution must take into account the ownership of the tasks and resources and it must provide incentives for collaboration, so that a solution is at least as good as the solution in which an individual farmer chooses not to collaborate.
Considering the individual interests of each farmer in the planning of task allocation and vehicle routing is a necessary and desirable step in the scenarios with small and medium farming enterprises sharing their costly equipment. It is clear that the potential time and cost savings increase as the number of collaborating farmers and the common resources available increase.
The conventional (centralised) vehicle routing problem formulation that does not consider the individual interests of self-concerned and competitive farmers will result in an optimal solution for the system as a whole, which may not be acceptable for competing farmers who focus on their individually optimal solution. Thus, the (decentralised) AF-VRP problem formulation opens new directions in farming business and operative models by searching for the agriculture fleet coordination solution that will enable collaboration of self-concerned and competitive farmers.
Since the dynamic and decentralised AF-VRP is an NP-hard problem, it may be approximated by dispatching vehicles to tasks at each time period without considering future tasks. Then, the Dynamic Assignment Problem (DAP) and the Dynamic 3-index Assignment Problem (D3AP) are of concern. The main question we consider is: How can these technologies improve the efficiency and autonomy of agriculture fleets while decreasing the cost of the fleets and reducing their dependence on humans?
This paper is intended for researchers in combinatorial optimisation and multi-agent systems (MAS), particularly those interested in coordination, to highlight the possibilities of integrating these two fields with a third: the real-world domain of sustainable agriculture. We also address researchers in agriculture by demonstrating the added value of the application of combinatorial optimisation and multi-agent coordination technologies to everyday problems faced in agricultural settings. The content may be relevant for researchers or practitioners who wish to learn more about and/or engage with problems in this applied domain.
The paper is organised as follows. In Section 2, we describe the background and context of farming with agriculture machinery. In Section 3, we introduce the dynamic Agriculture Fleet Vehicle Routing Problem (AF-VRP). Section 4 presents the main features of decentralising multi-agent coordination, which may be applicable in this context. Section 5 reviews related problems: the Assignment Problem (AP) and the 3-index Assignment Problem (3AP), which consider task dispatching at each period without considering future events, and the Capacitated Arc Routing Problem (CARP). Finally, we discuss open issues in finding efficient solution approaches to the AF-VRP problem and conclude the paper with research opportunities in Section 6.
In this section, we describe the context of agriculture fleets and farming tasks while delineating differences in vehicle autonomy. Then, we review the state-of-the-art in relevant agriculture technologies.
Today’s Agriculture Fleets are comprised of traditional non-autonomous vehicles, such as tractors, as well as semi-autonomous vehicles, i.e. Agriculture Mobile Robots (AMRs). Our research is aimed at shifting the current centralised AMR fleet coordination (FC) paradigm towards a distributed and decentralised FC system for autonomous agriculture vehicle fleets, reducing the necessity for human controllers.
The ownership of the agriculture fleet may vary from (a) a completely centralised scenario with only one owner and manager of both the whole fleet and of the fields to be cultivated to (b) the completely decentralised scenario where both agriculture machinery and fields to be cultivated are owned and managed by multiple self-interested (i.e. individually rational) and potentially competing decision makers. For example, a farmer may own a tractor for tilling her field, but may rent a harvesting robot to help pick ripe produce. This distinction is an important factor in coordination because different owners may have different goals and priorities, and the lack of a centralised (common) owner contributes to the need to separate AF-VRPs from other VRPs. The AF-VRP in the latter context must consider fairness and equity concepts to be applicable in the real world.
Tractors are farm vehicles that provide traction powered by slow speed, high torque engines to mechanise agricultural tasks. These tasks include, among others, pulling or pushing of agricultural implements or trailers, tillage, plowing, disking, harrowing and planting. Agricultural implements include: irrigation machinery (e.g. central pivot irrigation systems, pump units and sprinkler systems), soil cultivation implements (e.g. trowels, spike, drag and disk harrows, power harrow parts, plows and tillers), planting machines (e.g. seed drills and planters, broadcast seeders, seed drills, air seeders and spreaders), harvesting machines (e.g. trailers, diggers and pickers). These implements may be towed behind or mounted on the tractor, and the tractor may also provide a source of power for the implement, if it is mechanised. In general, implement mounting, attaching and removal are still not suitable for automation but can be performed by trained human operators in a matter of minutes. This flexibility means that a single farmer or a cooperative involving several farmers can purchase a tractor and a number of attachments (implements) without needing to acquire and maintain multiple different types of specialised farm vehicles individually. This strategy can be especially useful as AMRs, which tend to be relatively expensive, become more capable and widely available.
Skeete [74] categorises tractors based on their autonomy levels as follows: tractors with driver assistance (level 1), semi-automated tractors (level 2 – partial automation), driverless remotely supervised tractors (level 3 – conditional automation), driverless fully autonomous tractors (level 4 – high automation), and complete automation of a tractor fleet (level 5).
At the driver assistance level (1), there is no automated decision-making. Operational decisions are taken by the driver (individual platform steering and route following, while a fleet controller makes tactical decisions about the vehicle(s) (e.g. route and task planning), supervises the performance of the whole fleet and performs tractor-field assignment when necessary. Here, technology provides only route guidance to the driver; while in the case of a semi-automated tractor (level 2), the only task of a driver is to supervise the vehicle and act in case of emergency. Driverless remotely supervised tractors (level 3), on the other hand, operate without the presence of a human inside the tractor itself, but still under supervision of a human controller positioned at a control station or in a manned tractor leading a tractor platoon that guides the driverless tractor onto and between fields. These tractors use vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) communication for receiving driving instructions from a remote human controller.
A driverless fully autonomous tractor or a farming robot (level 4) is capable of independently performing its assigned task while tracking its GPS position, controlling its speed and sensing and avoiding obstacles in front of it. The environment of an assigned task has to be virtually deterministic (the task is defined before it starts and the next state of the environment is determined by the current state and the actions performed, e.g. follow a predetermined route on a field) (e.g. [17]). Any delay in decision-making for the action choice must be as small as possible, preferably instantaneous without hesitation or time-consuming calculations (e.g. changing the steering angle when necessary). Sensor technologies such as infrared, radar and LiDAR3
Light Detection and Ranging technology.
Driverless autonomous tractors usually deploy a radio receiver to receive tasks and commands from the remote command station; then, using control software installed on an on-board computer, the autonomous tractor translates it into vehicle commands such as steering, acceleration, braking, transmission and implement action while analysing real-time sensor data and views from the tractor’s on-board cameras.
In this way, a fully autonomous tractor is not only able to make its own way to the field along mapped on-farm paths, but also can work remotely in an autonomous fashion. This gives the human controller the ability to supervise multiple tractors at once and, since there is no need for a human driver, through multiple controller shifts, one could obtain non-stop (e.g. 24-hour) performance, eliminating driver fatigue and reducing work-related injuries.
Even though the agriculture fleet vehicle routing problem treated in this paper can be applied to every level of vehicle autonomy, in the study of solution approaches, we focus on level 4 in the context of intrinsically autonomous and decentralised highly automated and geographically distributed vehicles (AMRs) with the vision of reaching level 5. At level 4, these vehicles are capable of communicating with each other and possibly with fixed infrastructure sensors on the field and/or with human collaborators. Level 5 is expected to be reached in the future by applying dynamic and online AMR fleet coordination methods and mechanisms that are still today waiting to be developed. These approaches will result in agriculture vehicle fleets capable of completely autonomous coordination and operation without the need for any human supervision (e.g. [71]).
The AMRs today are mostly applied for weed control (e.g. [48]), seeding (e.g. [2,8]), harvesting (e.g. [82,86]), environmental monitoring (e.g. [46,66]) and soil analysis (e.g. [19,84]).
Examples of autonomous modular multi-purpose agricultural robots designed for horticultural tasks such as pruning, weeding, spraying and monitoring include the Thorvald [25] developed by Saga Robotics4
To the best of our knowledge, the problem of routing fleets of agriculture vehicles that we study in this paper – the Agriculture Fleet Vehicle Routing Problem (AF-VRP) – differs significantly from any other known variation of the vehicle routing problem. Therefore, we describe the motivation and background for the AF-VRP, after which we offer a formal description.
Motivation and background
Planning and scheduling of different stages of cultivation for each crop are based on agri-food production goals and agronomic needs. These are typically determined by tables for “technical itineraries”, which describe the entire cycle of crop cultivation processes throughout the year. For example, for the cultivation of maize in Spain, the scheduling of tasks is: deep ploughing in January; stone removal in February and March; harrowing, seeding and fertilisation, fertilising inserting, irrigation system, maintaining herbicide application in April and May; preseeding irrigation, and seeding and soil disinfection in June, etc. [75]. Technical itinerary tables also include scheduling of labourers for each month, required equipment and labour (driver/labourer), yield in hours per hectare of equipment and labour, and raw material (units per hectare).
In this paper, we focus on a scenario where multiple farmers and/or large agriculture conglomerates grow multiple crops with distinct cultivation needs simultaneously while sharing the same agriculture resources (e.g., implements and supply of raw materials such as pesticides, herbicides or chemicals) and a fleet of heterogeneous vehicles (tractors and/or AMRs). Each one of these crops requires tasks that need specific equipment for their implementation, usually a certain type of a tractor and a compatible implement with specific characteristics. For example, for deep ploughing, we need a tractor and a chisel; for stone removing, a tractor and a trailer. Further, some tasks may only be compatible within certain weather conditions while some tasks may be more important than others.
Each task requires a vehicle-implement-raw material combination:
Vehicle (tractor or AMR) parameters include: maintenance and cleaning frequency, operational state, task compatibility and compatibility with implements and related requirements (e.g. power, weight, front power take-off (PTO) used for taking power from a power source, guidance system or not, front loader, specific tires, etc.), driver requirements, fuel autonomy, and type and number of operators needed for operation per each task. Implement parameters usually include: maintenance frequency (maximum time and distance passed in operation between two maintenance activities), operation state (damaged, operating), efficiency level for each task, task compatibility (an implement can perform a subset of tasks), tractor compatibility (it can be installed on a subset of tractors) and potentially implement cleaning (to avoid cross-contamination of diseases across fields). Raw material, such as fertilisers, herbicides, fungicides and growth regulators, are typically applied at specific stages of plant development in quantities and frequencies that can depend dynamically on field conditions. These conditions may vary from one part of the field to another due to differences in crop development, soil characteristics (e.g. inclination, chemical structure, etc.), varying microclimate (e.g. local sun exposure, temperature, humidity), prevalence of pests (e.g. insects) and weeds and plant disease development. Thus, tasks have to be planned locally based on these differences and may vary from one field location to another with given short time weather windows in which they have to be performed. Potentially, they may extend across a 24-hour working day.
Nowadays, the allocation of vehicles (tractors and AMRs), implements, and raw materials to crop tasks is still done by human experts in an ad-hoc manner (e.g. [26,83]).
Description of the AF-VRP
For simplicity and without loss of generality, let us assume that mutually connected farming fields in a region of interest are positioned in the plane
Formally, we define the problem elements as follows. Figure 2 provides an overview of the sets, indices, parameters and decision variables used.

Elements of the formal AF-VRP. In sets, superscripts indicate partitions, and subscripts indices.
We consider a planning time horizon
For each temporary edge
For each task
Let
Both implements and the raw materials are initially stored in depot nodes
Each vehicle is characterised by: (i) its capacity
The vehicles can move from one node to another if and only if there is an edge
Parameters of each implement
For each vehicle, a daily task schedule should be given at the beginning of the planning time horizon in terms of a route (path) to follow (edges to visit) and the plan of tasks to do in the field on each of the edges, as well as the remounting of implements, fuel recharging and raw material (re-)loading (visits to depot nodes), when necessary.
Assuming that all the parameters of this problem are deterministic and known a priori and if we do not consider the ownership issues of the fleet and fields to be cultivated in the agriculture fleet vehicle routing, then the off-line and centralised Agriculture Fleet Vehicle Routing Problem (AF-VRP) consists of determining a feasible schedule of the execution of tasks
However, some tasks depend strongly on the weather conditions that may change during the day, e.g. the quantity of irrigation water, pesticides, fungicides and herbicides, which leads us to consider the AF-VRP as a stochastic problem.
The AF-VRP can be viewed as a tactical decision problem which leads to a solution able to face unpredicted contingencies a priori. Similar to other versions of the vehicle routing problem (see, e.g. [62]), there is a dynamic extension of the AF-VRP, which aims at optimal online reconfiguration of the VIR configurations and their online (re-)allocation to a changing set of tasks in real time to face uncertain weather and/or technical events. In case of an unpredicted event (e.g. vehicle breakdown or an unpredicted weather event), a task may be left only partially completed. Thus, we define a non-binary decision variable
Measuring the frequency of changes and the urgency of a task, the framework proposed by [33] classifies the Dynamic VRP problems into weakly, moderately and strongly dynamic problems based on the value of the effective degree of dynamism. However, this measure does not consider the geographical distribution and the travelling times between tasks [62].
The level of urgency of a task depends on the reaction time, i.e. the difference between the disclosure time of a task
Decentralised AF-VRP
In an intrinsically decentralised context with various individually rational and competitive farmers that use a set of vehicles, implements and raw materials that are owned by one or more individually rational and competitive resource owners, fairness and envy-freeness in the allocation of VIR combinations to tasks have to be considered. Here, we are not only interested in optimising the overall system cost (equation (1)) but we also have to consider the distribution of the costs over both individual farmers and resource owners in the ecosystem. Otherwise, a solution might not be accepted by one or more decision makers. In the latter case, we have to introduce further incentives for these decision makers to behave inline with the wanted system outcome.
Equity. Criteria of equity include fairness and envy-freeness. Envy-freeness is a criterion of fair division. Both vehicle owners and farmers can be modelled as agents. In an envy-free division, every agent feels that its share is at least as good as the share of any other agent, and thus no agent feels envy.
We study the case when the VIR combinations owned by multiple owners are to be allocated to the tasks that are owned by different farmers.
Let us consider the case when farmers are modelled through a set of agents
Let us assume that the preference of each agent
Since the VIR configurations are indivisible, an envy-free allocation may not exist. Deciding whether an envy-free and complete allocation exists is NP complete. Deciding whether an envy-free and Pareto efficient allocation exists is above NP [15]. Thus, we have to resort to approximate heuristic approaches to solve this difficult problem.
Fairness in the AF-VRP in this context depends also on the choice to maximise egalitarian social welfare (i.e. minimise the worst off cost,
Unfortunately, by optimising the system based on the worst-off performance, we deteriorate the system efficiency and thus, the utilitarian welfare. From the overall system efficiency point of view, we can use utilitarian social welfare which sums up the agents’ individual utilities in a given allocation and thus gives us a measure of the overall and average benefit for the system. However, optimising the utilitarian social welfare is not acceptable in the systems whose success is based on self-concerned individually rational agents’ acceptance (see, e.g. [11]). This is because in utilitarian systems, the optimum is paid by (usually a few) worst off agents. The latter, however, might not comply with paying the price of the system optimality (see, e.g. [36]). Nash Welfare optimisation maximises the product of the agents’ utilities and results in both egalitarian and utilitarian welfare maximization (see, e.g. [43]). When applying Nash Welfare optimisation, we obtain (
Decentralising the coordination of agriculture vehicle fleets
The AF-VRP problem considers providers of farming services (i.e. owner(s) of vehicles, implements, and raw materials) and tasks dispersed geographically that may be owned by multiple farm owners and thus all of them may be considered active participants in the agriculture fleet coordination process. Lujak et al. [45] categorise coordination models for vehicle fleets based on their ownership structure and the level of decentralisation. In the following, we adapt this categorisation to the coordination models for agriculture vehicle fleets that can be defined as follows.
A
A
A
We note here the main differences between distributed and decentralised coordination models (see, e.g. [45]). Distributed coordination relies on both local and shared (global) parameters and variables; decentralised coordination only has access to local information. Local parameters and variables are private (known only to the agent who holds them), whereas global parameters and variables are public and shared among two or more agents – potentially among all the agents in the system. If we assume selfish (i.e. individually rational) agents, resource owners can manipulate these parameters and variables or deceive agents in communicating their values to influence the individual decision-making of each one of them and thus obtain the behaviour of the system the resource owner wants. Furthermore, due to the lack of global non-obsolete and truthful information, in general, solution approaches for decentralised coordination concentrate on finding a feasible (admissible) solution without quality of solution guarantees. In contrast with the distributed case most often studied in the operations research field, where the emphasis is on the method’s optimality gap, decentralised coordination methods are mostly approximate heuristics-based methods without quality of solution guarantees but with proven completeness, soundness and termination – hence most appropriate for deployment in real-world, dynamic and messy environments such as agricultural robotics.
Related standard combinatorial optimisation problems and solution approaches
In this section, we review standard combinatorial optimisation problems that provide a baseline for the agriculture fleet vehicle routing problem, as described previously. We concentrate on the dynamic versions of these problems, that is, the case when both task demand and resource availability may change in time.
The most important decisions that must be taken by fleet managers have to do with the problems of assigning agriculture vehicles in general and AMRs in particular to implements and tasks (e.g. [39]) and managing their routes (e.g. [7,9,62,64,65,70,79,80]).
The dynamic AF-VRP considers routing of vehicles over a set of dynamically changing tasks through time. This is generally a computationally very complex problem. We can simplify it by ignoring the time dimension and myopically considering in each period only the tasks that are to be performed in the same period. Thus, we simplify the AF-VRP problem to the problem of allocation (dispatch). This problem focuses on deciding which vehicle should be assigned to each task. Conventionally, vehicles are assigned to tasks based on the First Come, First Served (FCFS) strategy. This strategy creates great discrimination among the tasks, increases transport costs and significantly lowers overall fleet performance. Fleet management significantly improves if the vehicles are dynamically assigned (in real time) depending on the characteristics of each vehicle and task requirements (e.g. [5,10,38,69].
Various mathematical and computational models have been developed for the optimisation of fleet operations to serve customer demands while minimising costs, e.g. [4,16,38,53,55,57]. Many of the problems of fleet management correspond to combinatorial optimisation problems, such as the problem of determining optimal routes, e.g. [10,16,38,43,55,57], that are still very difficult to solve, even in a static context with batch processing of requests and dynamic vehicle assignment problems, e.g. [29,39,60].
In the case of poor fleet performance, a penalty for non-compliance with Service Level Agreements (SLAs) translates into the loss of revenue. For agriculture vehicle fleets, optimal allocation of tasks and well-designed routes to vehicles not only ensure the service level, but also meet the needs of the fleet owner(s) and stakeholders in a cost-effective and efficient manner.
Multi-index assignment problem
The methods for dynamic AMR fleet task assignment and dynamic (re-)routing are relevant in various scenarios, such as, e.g. emergency services (e.g. [5,39,40,69]), taxi, hot meal home delivery and vehicle sharing. After a meticulous analysis of the available solutions, we have identified that the combination of these methods can provide a true differential value in the agriculture vehicle fleets.
The problem of allocation of the vehicles to implements and indivisible tasks may be modelled as a multi-index assignment problem [61] that we re-run in each time period when the constituents of the problem change. Each constituent part of this allocation is characterised by a set of attributes describing its availability and compatibility with the rest of the constituents that influence the cost or profit resulting from such a multi-index allocation. Assume there are n vehicle agents, m tasks and k implements. Here, the emphasis is on one-to-one assignment among the elements in each set. Furthermore, each vehicle agent has a valuation function that maps each implement-task combination to some non-negative value particular to that vehicle agent. These valuations are additive, which means that an agent’s value for a set of task-implement combinations is simply the sum of the values of each combination of this set. Our goal is to compute a one-to-one allocation, i.e. a partitioning of
Reynen et al. [67] present alternate integer programming formulations for the multi-dimensional assignment problem with decomposable costs with an increased number of variables and present solution methods based on Lagrangian Relaxation and massively parallel algorithms. Aiex et al. [1] designed a greedy randomised adaptive search procedure with path relinking (GRASP) for solving axial 3APs. GRASP is a multistart metaheuristic for combinatorial optimisation consisting of a construction procedure based on a greedy randomised algorithm and a local search. A parallel version appeared in [52]. Their computational experiments showed very good results compared with previously proposed heuristics. Huang and Lim [28] proposed a hybrid genetic algorithm for this problem and reported on extensive computational experiments. Li et al. [36] propose a novel convex dual approach to the three-dimensional assignment problem. It is shown that Li et al.’s dual approach is equivalent to the Lagrangian relaxation method in terms of the best value attainable by the two approaches. However, the pure dual representation is not only more elegant, but also makes the theoretical analysis of the algorithm more tractable. An asymptotically optimal approximation algorithm for axial k-index assignment problems was given by Kravtsov [31]. Frieze et al. [21] study random multi-dimensional assignment problems where the costs decompose into the sum of independent random variables. They minimise the total cost and show that with high probability a simple greedy algorithm is a
Assignment problem
The multi-index assignment problem is a higher dimensional version of the standard linear (two-dimensional) assignment problem, i.e. a weighted bipartite matching problem in which the objective is to minimise total cost of assigning n resources to n tasks. The latter is an important subproblem of many NP-hard optimisation problems, e.g. Traveling Salesperson Problem for which both sequential (Hungarian algorithm, the shortest path algorithms and auction algorithms) and parallel implementations of these algorithms are known.
In the case where sets of fixed (one-to-one) vehicle-to-implement combinations are static and given in advance, each such combination can be considered as an agent. Then, the multi-index assignment problem is simplified to the assignment problem focusing on the one agent-one task allocation at the time (e.g. [6,40]).
The dynamic task assignment problem is equivalent to the assignment problem for which several centralised approaches exist, e.g. [53]. One of the best known is the Hungarian method [32]. In [22], Lujak et al. propose a distributed version of the Hungarian Method for multi-robot task allocation where mobile robot agents are required to store all the information locally and there is no available shared memory.
One of the tools for mechanism design of agent systems are auctions, e.g. [3,41,68]. The implementation usually requires solving a combinatorial non-linear optimisation problem, which is in general NP-hard and intractable for complex networks. However, with certain relaxations, the latter can be modelled as a convex optimisation problem [3,56]. Computational optimisation auctions are methods that are similar to the Gauss-Seidel and Jacobii methods, e.g. [3]. This approach is well suited for massive parallelisation of local decision-making based on the information interchanged among multiple processors. It is modular, based on regular interactions, incremental, analysable, and permits incentive engineering. In [41,44], Lujak et al. proposed a modified version of Bertsekas’ auction algorithm for the case of incomplete information exchange and explored the deterioration of the solution quality according to the size of the communication network and proposed strategies to overcome this problem. Responding to the task assignment in the case of the medical emergency assistance of urgent out-of-hospital patients by ambulances, Lujak et al. proposed a distributed algorithm for the simultaneous assignment of ambulances [5,39] and ambulances and hospitals to multiple simultaneous patients in [40], where the authors also proposed an ambulance vehicle Voronoi-based relocation approach. Moreover, in [43], Lujak et al. proposed the route assignment approach that considers fair and envy-free routes and improves the overall efficiency in respect to the user optimum. Here, fair routes are related to the overall route cost that should be as balanced as possible between the vehicles, while envy-freeness is related to individual route costs that should not vary between each other more than some predefined factor.
Through a dynamic vehicle reassignment, we can significantly increase the overall performance of the fleet and lower farming costs. Furthermore, by dynamic routing, the fleet can divide the tasks to perform and each fleet vehicle can then respond in real time to any changes in terrain characteristics by rerouting and while doing so, maintain the region of interest well covered, so as to reach tasks quickly and efficiently. A distributed multi-agent computation model for route guidance under congestion in vehicle traffic considering envy-freeness and fairness was proposed by Lujak et al. in [42,43]. It was shown by simulation experiments that by proposing routes that are envy-free and fair, the user equilibrium traffic assignment solution can be improved towards the system optimum.
Vehicle routing problem
At the tactical level, the problem of routing a fleet of vehicles combined with implements through the execution of farming tasks in the fields may be modelled as a vehicle routing problem (VRP). VRPs are a class of combinatorial optimisation problems that consist of determining sequences of tasks for a fleet of vehicles with limited resources while minimising an objective function that is typically the total completion time or the total cost. VRPs are defined on graphs, and the tasks to be performed are associated with nodes or with arcs. When the tasks, e.g. deliveries, are associated with nodes, the corresponding problems are called node routing problems whereas when the tasks are associated with arcs, these are named arc routing problems.
The basic arc routing problem related to the AF-VRP is the Capacitated Arc Routing Problem (CARP). The CARP aims to determine a minimum cost set of routes that serve a subset of edges with positive demand under capacity constraints. For this problem introduced by Golden and Wong [24], many exact and heuristic algorithms have been proposed and are described in the book of Laporte and Corberan [12] and in the recent annotated bibliography by Mourão and Pinto [51]. The CARP is a simplified version of the AF-VRP where the edges to be traversed correspond to the aisles of the fields and the demand to the quantities of raw materials to be used. However, the AF-VRP includes several additional features that make it challenging to obtain good solutions for instances of the size encountered in practice.
When addressing an arc routing problem, it is essential to consider whether its transformation into a node routing problem presents a particular advantage or not. On the one hand, in arc routing problems, demand is associated with edges, and the graph is frequently sparse, which could represent an advantage that can be exploited in the design of solution algorithms. Lechford and Oukil [35] described an approach where they exploit the sparsity in the identification of promising routes. On the other hand, an arc routing problem can be transformed to a node routing problem (see, e.g. Pearn et al. [58] or Longo et al. [37]) for which many efficient algorithms have been proposed (e.g. [50,54,72,80,81]).
When there is only one vehicle, node routing problems reduce to variants of the classical Traveling Salesperson Problem (TSP). Among them, some are relevant since they include some key features of the AF-VRP. In this paper, we focus on narrow and long aisle farming fields, in which the distance travelled across from aisle to another is negligible compared to the distance travelled along the length of the aisle. This setting is similar to conventional multiple parallel-aisle warehouse systems. The Steiner TSP (STSP) is an extension of the TSP that is suitable for these instances. Given a list of locations, some of which are required, and the distances between them, the goal is to find the shortest possible walk that visits each required location and then returns to the origin. As we are looking for a walk, vertices can be visited more than once, and edges may be traversed more than once. Exact approaches to this problem only exist for warehouses that have at most three cross aisles. For other layout types, various heuristic approaches exist, e.g. [78].
The TSP considers minimising the overall travel time of a salesperson but if we concentrate on minimising the waiting times of the tasks, then we speak about the Travelling Repairman Problem [18]. Luo et al. [47] extend the multiple Travelling Repairman Problem (m-TRP) by considering a limitation on the total distance that a vehicle can travel. The resulting problem is called the Multiple Travelling Repairmen Problem with Distance constraints (MTRPD). The authors design a tailored branch-and-price-and-cut algorithm for this problem proposing a bounded bi-directional label-setting algorithm for the pricing subproblem. The m-TRP has characteristics in common with the problem we have to solve for the management of an agricultural fleet.
Another node routing problem related to the AF-VRP is the Field Service Routing Problem (FSRP). Given a limited number of technicians, the FSRP consists of determining a set of optimal technician routes to serve customer requests, while ensuring that each technician has the required skills for his/her tasks. There is an analogy between the technicians and the vehicles, implements and raw materials we present here. The most relevant variant was introduced by Kovacs et al. [30] where teams of technicians have to be built for some time period to complete most of the tasks. Several other extensions have been considered, including stochastic travel and service time and priorities between the tasks (see, e.g. [7]).
Conclusions and research opportunities
In this paper, we presented and described the agriculture fleet vehicle routing problem (AF-VRP). We presented the nomenclature for sets, indices, parameters and decision variables that are used in the AF-VRP mathematical program that can be developed in future work. Moreover, we discussed its dynamic and decentralised version and ways of simplification by removing the time dimension and focusing on dispatching vehicles in each time period by considering only the tasks that are to be performed at the present time. Finding an efficient solution approach to the AF-VRP problem remains an open challenge.
The AF-VRP is an intrinsically decentralised problem. Even though, as discussed, fleet coordination approaches may be centralised, distributed or decentralised, the focus in this problem is on distributed or decentralised online fleet coordination methods that are today still to be developed.
To simplify the AF-VRP in decentralised environments, we can combine aspects of the assignment problem, 3-index assignment problem and the capacitated arc routing problem. Multiple centralised algorithms have been proposed for each of these individual subproblems assuming perfect information. However, both a computationally efficient mathematical formulation for the dynamic and decentralised agriculture fleet vehicle routing problem and the related solution approach are still open challenges to the best of our knowledge.
The development of distributed MAS-based route guidance for AMR fleets that allows for a completely autonomous AMR fleet is still an open scientific challenge. In addition, the topic of distributed and dynamic multi-task assignment and vehicle routing considering multiple vehicle, operator and farming constraints is still an insufficiently explored field. To the best of our knowledge, distributed and decentralised MAS coordination models and optimisation approaches for vehicle fleet coordination are scarce and have undergone limited real-world testing.
First of all, a decentralised coordination approach is more robust than its centralised counterpart because it is resilient to individual vehicle errors and can rely on the fleet’s intrinsic built-in redundancy. It is scalable since it can operate at a larger scale with multiple large fields at once aggregating vehicle capacity and field throughput across all the fleet’s vehicles. It is open, seamlessly adapting to vehicles entering or leaving the system, and has fewer levels of authority. Finally, it does not suffer from the “single point of failure” problem found in centralised systems. However, distributed open vehicle fleets also have to deal with inter-agent communication and coordination overhead that can sometimes make them slower or more difficult to control than their centralised counterparts.
In the decision-making distribution process, the emphasis of the decomposition of the dynamic and decentralised AF-VRP problem should be on the scalability, local communication and computation constraints of each physical vehicle agent, the structure and topology of the dynamic communication network, and the available communication and processing capacities of the developed cyber-physical MAS. One common goal in this context is an efficient and cost-effective farming service using an agriculture vehicle fleet while considering vehicle autonomy and fairness constraints in work assignment, individual rationality, preferences and constraints – whether they are of operators, farmers or fleet owner(s), as well as farming tasks’ constraints. Quality of solution guarantees play a crucial role underlying sustainable competitive advantage.
The long-term goal of distributing decisions in agriculture vehicle fleets is the development of an open and non-proprietary software platform in the cloud for distributed route guidance and task coordination at large agriculture farms and peer-to-peer sharing of relevant agriculture resources, vehicles and AMRs among farmers. Such a route guidance approach contributes to a more efficient and competitive service in line with the Internet of Robotic Things (e.g. [73]) and Internet of Food Things [59]. Human drivers may also benefit from this technology as they may be motivated to perform better if they feel a sense of autonomy, thus improving the output, task engagement, time-on-task and accuracy. However, behavioural measures should be further studied to understand the triggers of individual effort and motivation.
The indirect benefits of such a distributed and decentralised AMR fleet coordination MAS, among others, should include higher efficiency and benefit in both large and small farms, smaller carbon footprint and reduction in pesticides, and above all, fair participation of fleet owners, AMR operators and farmers, with related rewards and benefits. Decentralised coordination mechanisms will not completely fix sustainable agriculture concerns, but they should facilitate improvements with respect to energy efficiency and resource usage, particularly by enabling precision farming functions, as they are directly related to giving higher autonomy to the fleet of agriculture vehicles while changing the hierarchical and unscalable farming structure to a more efficient and balanced enterprise.
Footnotes
Acknowledgements
This work was partially supported by the “AGRIFLEETS” project ANR-20-CE10-0001 funded by the French National Research Agency (ANR) and by the UK Research and Innovation (UKRI) Research England council’s “Lincoln Agri-Robotics” project as part of the Expanding Excellence in England (E3) Programme.
