Agriculture fleet vehicle routing: A decentralised and dynamic problem

Abstract

To date, the research on agriculture vehicles in general and Agriculture Mobile Robots (AMRs) in particular has focused on a single vehicle (robot) and its agriculture-specific capabilities. Very little work has explored the coordination of fleets of such vehicles in the daily execution of farming tasks. This is especially the case when considering overall fleet performance, its efficiency and scalability in the context of highly automated agriculture vehicles that perform tasks throughout multiple fields potentially owned by different farmers and/or enterprises. The potential impact of automating AMR fleet coordination on commercial agriculture is immense. Major conglomerates with large and heterogeneous fleets of agriculture vehicles could operate on huge land areas without human operators to effect precision farming. In this paper, we propose the Agriculture Fleet Vehicle Routing Problem (AF-VRP) which, to the best of our knowledge, differs from any other version of the Vehicle Routing Problem studied so far. We focus on the dynamic and decentralised version of this problem, applicable in environments involving multiple owners of agriculture machinery and farms, where concepts of fairness and equity must be considered. Such a problem combines three related problems: the dynamic assignment problem, the dynamic 3-index assignment problem and the capacitated arc routing problem. We review the state-of-the-art and categorise solution approaches as centralised, distributed and decentralised, based on the underlying decision-making context. Finally, we discuss open challenges in applying distributed and decentralised coordination approaches to this problem.

Keywords

Agri-robots, autonomous fleet coordination, multi-agent system, vehicle routing problem, capacitated arc routing problem

1. Introduction

Research in the area of Agriculture Mobile Robots (AMR) [26,27] has primarily focused on single robot systems and challenges that the agriculture environment presents for standard robot tasks, such as: navigation, control, sensing, image processing, platform stability, terrain handling and system integration, including balance of computation between edge and cloud resources. Very little work has explored the domain of efficient and scalable AMR fleet coordination, taking into consideration three distinguishing features of the agriculture domain: (1) the need for the allocation of multiple scarce resources to given tasks simultaneously due to inter-related sets of functional and environmental constraints on vehicle components, farm implements (accessories that survive the task) and raw materials that can be depleted during the execution of the task;¹ (2) the need for coordinated planning on two time scales: tactical planning for daily, weekly or seasonal factors (which can be computed a priori) and operational, dynamic management of activities in real time (which must be computed during fleet operations); and (3) the need to account for decentralised ownership of vehicles, where decisions about (1) and (2) may be made independently for different fleet members. Our primary contribution here is in recognising these challenges and recommending strategies to address them.

¹ Note that “raw material” is separate from the energy material (e.g. fuel) required to run the vehicle. See Section 3 for detail.

In practice, agriculture fleets are conventionally coordinated by dividing an area of interest into sectors, each one assigned to a single human controller. Each controller coordinates the fleet’s vehicles, tracks their performance in real time and responds to contingencies in his/her assigned sector. The higher the fleet’s operational costs, the more importance is given to the fleet’s coordination. Planning and scheduling of the tasks assigned to the routes of vehicles (AMRs and tractors) and related vehicle-implement configurations,² drivers and controllers is still left to human planners. Such a segmented myopic view is one source of loss of efficiency for the overall system.

² For example, a tractor pulling a tiller – see Section 3 for detail.

Even with multiple AMRs running and coordinating simultaneously with one another on the same farm (as well as shared across multiple farms), fully autonomous farming is still an open challenge. Advances in ICT and agriculture technologies allow for a higher level of autonomy of AMR fleets where integrated decision-making potential at present is unrealised. With the objective to increase the opportunity for autonomy in agriculture fleets, in this paper, we study dynamic and decentralised coordination of agriculture fleets and propose the Agriculture Fleet Vehicle Routing Problem (AF-VRP), motivated as follows.

The classic Capacitated Vehicle Routing Problem (CVRP) traces its origins to 1959, when Dantzig and Ramser introduced the “Truck Dispatching Problem” [14], a generalization of the now famous Traveling Salesman Problem (TSP) [20] – finding the shortest path that travels through n points, all known a priori (i.e. before the journey begins). Briefly stated, the CVRP adds the constraint that the vehicle travelling this shortest path must make a set of deliveries ( $q_{k}$ ), at each demand point k and the capacity (Q) of the delivery vehicle is smaller than the total deliveries to be made (i.e. $Q ≪ \sum_{k} q_{k}$ ). This constraint makes the problem more interesting because all the deliveries cannot be made on a single journey and so the solution aims to minimise the overall combined distance travelled in order to deliver everything ( $\sum_{k} q_{k}$ ).
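The capacity constraint can be illustrated with a small calculation. The following is a minimal sketch, not part of the original formulation: since $Q < \sum_{k} q_{k}$, at least $\lceil \sum_{k} q_{k} / Q \rceil$ journeys from the depot are needed. The demand values and the capacity are hypothetical, and the function name is ours.

```python
import math


def min_trips_lower_bound(demands, capacity):
    """Lower bound on the number of journeys: ceil(sum of q_k / Q).

    This is only a bound; the true minimum can be larger because
    deliveries cannot always be packed perfectly into journeys.
    """
    if capacity <= 0:
        raise ValueError("capacity Q must be positive")
    if any(q > capacity for q in demands):
        raise ValueError("a single delivery q_k exceeds the capacity Q")
    return math.ceil(sum(demands) / capacity)


# Hypothetical instance: four demand points and one vehicle.
demands = [4, 3, 5, 2]  # q_k for each demand point k (assumed values)
capacity = 6            # Q (assumed value)
trips = min_trips_lower_bound(demands, capacity)
```

In this instance the total demand is 14 units, so a vehicle of capacity 6 needs at least three journeys, which is why the CVRP asks for a set of routes rather than a single tour.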

The variant we propose in this paper, the AF-VRP, possesses some particular characteristics that add to the complexity of the classic CVRP and, when taken together, motivate a new CVRP variant. First, in the AF-VRP, we model the routing of vehicles through a graph composed of nodes and capacitated edges. Contrary to the classic Vehicle Routing Problem, the demand is defined as a task to be performed on an edge and not on a node, similar to the Capacitated Arc Routing Problem (CARP). Thus, the proposed AF-VRP problem can be viewed as an extension of the CARP (e.g. [24]). While nodes represent specific locations, edges represent the paths between nodes. Thus, tasks like spraying, which take place while travelling, are modelled as occurring on the “arcs” (edges).

In the AF-VRP, each task may require a compatible accessory and payload(s) for execution. The solution involves assigning not only distinct locations (edges) to visit, but also assigning vehicles, vehicle “accessories” (i.e. farm implements) and payload for each vehicle and each task, where accessories can be re-used by other vehicles at a later time for other tasks while payloads are consumed during task execution. There may be multiple tasks on an edge, each one with different accessory and payload requirements. Second, the problem is dynamic, meaning that the tasks’ requirements at each location are probabilistic values that may not be known a priori with certainty. These tasks may be added to or removed from the vehicles’ routes (list of edges to be visited by a vehicle) online, i.e. during the mission (journey). Third, the problem is decentralised, since different entities with possibly conflicting objectives may own either the vehicles in the fleet (system) and/or the demand locations.

Fig. 1.

Example of mutually connected farming fields in a region of interest. The blue lines indicate permanent transportation network (i.e. roads). The red lines indicate examples of in-field transportation routes (i.e. tractor lines). The white circles represent the depots.

For instance, let us consider a simplified motivating example in which there are three farmers in a region of interest, each one being an exclusive owner of his/her fields that are neighbouring the fields of other farmers, as seen in Fig. 1. Given is a planning time horizon in which each farmer has assigned a set of tasks to be performed at his/her fields (e.g. spraying, irrigation, monitoring, etc.). The cost and duration of each task may vary depending on the weather conditions and other factors in each period of the given time horizon (e.g. early morning, late morning, afternoon, evening, night). In case a farmer does not own (a sufficient quantity of) resources for the execution of his/her tasks, delaying the task execution or not carrying it out at all may result in considerable crop losses. One of the reasons for the lack of resources may be a too short duration of a time window with favourable weather conditions for the execution of a task. If other farmers in a region of interest have the required resources, farmer-to-farmer collaboration will be a viable way to perform the tasks efficiently and effectively. In general, agriculture resources can be divided into implements, tractors and raw materials. The objective of the AF-VRP problem is, for each task, to allocate the adequate vehicle, implement, and raw material configuration and to route the vehicle-implement-raw material combinations through the fields while minimising the overall fleet costs and satisfying resource and task constraints. The AF-VRP solution must take into account the ownership of the tasks and resources and it must provide incentives for collaboration, so that a solution is at least as good as the solution in which an individual farmer chooses not to collaborate.

Considering the individual interests of each farmer in the planning of task allocation and vehicle routing is a necessary and desirable step in the scenarios with small and medium farming enterprises sharing their costly equipment. It is clear that the potential time and cost savings increase as the number of collaborating farmers and the common resources available increase.

The conventional (centralised) vehicle routing problem formulation that does not consider the individual interests of self-concerned and competitive farmers will result in an optimal solution for the system as a whole, which may not be acceptable for competing farmers who focus on their individually optimal solution. Thus, the (decentralised) AF-VRP problem formulation opens new directions in farming business and operative models by searching for the agriculture fleet coordination solution that will enable collaboration of self-concerned and competitive farmers.

Since the dynamic and decentralised AF-VRP is an NP-hard problem, it may be approximated by dispatching vehicles to tasks at each time period without considering future tasks. Then, the Dynamic Assignment Problem (DAP) and the Dynamic 3-index Assignment Problem (D3AP) are of concern. The main question we consider is: How can these technologies improve the efficiency and autonomy of agriculture fleets while decreasing the cost of the fleets and reducing their dependence on humans?
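The myopic approximation mentioned above, dispatching vehicles to the tasks of the current period only, amounts to solving one assignment problem per period. The following is a minimal sketch under our own assumptions: the cost values are hypothetical, each vehicle takes at most one task, and exhaustive enumeration stands in for a proper assignment algorithm (e.g. the Hungarian method), so it is only suitable for very small instances.

```python
from itertools import permutations


def myopic_dispatch(cost):
    """Assign each current task to a distinct vehicle at minimum total cost.

    cost[v][k] is the (assumed) cost of vehicle v performing task k in
    the current period. Future tasks are ignored, which is what makes
    the dispatch myopic. Brute force over all assignments.
    """
    n_vehicles = len(cost)
    n_tasks = len(cost[0]) if cost else 0
    if n_tasks > n_vehicles:
        raise ValueError("more tasks than vehicles in this period")
    best_total, best = float("inf"), ()
    # vehicles[k] is the vehicle chosen for task k.
    for vehicles in permutations(range(n_vehicles), n_tasks):
        total = sum(cost[v][k] for k, v in enumerate(vehicles))
        if total < best_total:
            best_total, best = total, vehicles
    return {k: v for k, v in enumerate(best)}, best_total


# Hypothetical period with three vehicles (rows) and two tasks (columns).
cost = [
    [4, 2],
    [3, 7],
    [6, 5],
]
assignment, total = myopic_dispatch(cost)
```

Here task 0 goes to vehicle 1 and task 1 to vehicle 0. Repeating this dispatch period by period gives a feasible but generally suboptimal plan, since a vehicle committed now may be missing for a more valuable task disclosed later.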

This paper is intended for researchers in combinatorial optimisation and multi-agent systems (MAS), particularly those interested in coordination, to highlight the possibilities of integrating these two fields with a third: the real-world domain of sustainable agriculture. We also address researchers in agriculture by demonstrating the added value of the application of combinatorial optimisation and multi-agent coordination technologies to everyday problems faced in agricultural settings. The content may be relevant for researchers or practitioners who wish to learn more about and/or engage with problems in this applied domain.

The paper is organised as follows. In Section 2, we describe the background and context of farming with agriculture machinery. In Section 3, we introduce the dynamic Agriculture Fleet Vehicle Routing Problem (AF-VRP). Section 4 presents the main features of decentralising multi-agent coordination, which may be applicable in this context. Section 5 reviews related problems: the Assignment Problem (AP) and the 3-index Assignment Problem (3AP), which consider task dispatching at each period without considering future events, and the Capacitated Arc Routing Problem (CARP). Finally, we discuss open issues in finding efficient solution approaches to the AF-VRP problem and conclude the paper with research opportunities in Section 6.

2. Agriculture fleets

In this section, we describe the context of agriculture fleets and farming tasks while delineating differences in vehicle autonomy. Then, we review the state-of-the-art in relevant agriculture technologies.

Today’s Agriculture Fleets are comprised of traditional non-autonomous vehicles, such as tractors, as well as semi-autonomous vehicles, i.e. Agriculture Mobile Robots (AMRs). Our research is aimed at shifting the current centralised AMR fleet coordination (FC) paradigm towards a distributed and decentralised FC system for autonomous agriculture vehicle fleets, reducing the necessity for human controllers.

The ownership of the agriculture fleet may vary from (a) a completely centralised scenario with only one owner and manager of both the whole fleet and of the fields to be cultivated to (b) the completely decentralised scenario where both agriculture machinery and fields to be cultivated are owned and managed by multiple self-interested (i.e. individually rational) and potentially competing decision makers. For example, a farmer may own a tractor for tilling her field, but may rent a harvesting robot to help pick ripe produce. This distinction is an important factor in coordination because different owners may have different goals and priorities, and the lack of a centralised (common) owner contributes to the need to separate AF-VRPs from other VRPs. The AF-VRP in the latter context must consider fairness and equity concepts to be applicable in the real world.

Tractors are farm vehicles that provide traction powered by low-speed, high-torque engines to mechanise agricultural tasks. These tasks include, among others, pulling or pushing of agricultural implements or trailers, tillage, plowing, disking, harrowing and planting. Agricultural implements include: irrigation machinery (e.g. central pivot irrigation systems, pump units and sprinkler systems), soil cultivation implements (e.g. trowels, spike, drag and disk harrows, power harrow parts, plows and tillers), planting machines (e.g. seed drills and planters, broadcast seeders, air seeders and spreaders) and harvesting machines (e.g. trailers, diggers and pickers). These implements may be towed behind or mounted on the tractor, and the tractor may also provide a source of power for the implement, if it is mechanised. In general, implement mounting, attaching and removal are still not suitable for automation but can be performed by trained human operators in a matter of minutes. This flexibility means that a single farmer or a cooperative involving several farmers can purchase a tractor and a number of attachments (implements) without needing to acquire and maintain multiple different types of specialised farm vehicles individually. This strategy can be especially useful as AMRs, which tend to be relatively expensive, become more capable and widely available.

Skeete [74] categorises tractors based on their autonomy levels as follows: tractors with driver assistance (level 1), semi-automated tractors (level 2 – partial automation), driverless remotely supervised tractors (level 3 – conditional automation), driverless fully autonomous tractors (level 4 – high automation), and complete automation of a tractor fleet (level 5).

At the driver assistance level (1), there is no automated decision-making. Operational decisions are taken by the driver (individual platform steering and route following), while a fleet controller makes tactical decisions about the vehicle(s) (e.g. route and task planning), supervises the performance of the whole fleet and performs tractor-field assignment when necessary. Here, technology provides only route guidance to the driver, while in the case of a semi-automated tractor (level 2), the only task of a driver is to supervise the vehicle and act in case of emergency. Driverless remotely supervised tractors (level 3), on the other hand, operate without the presence of a human inside the tractor itself, but still under supervision of a human controller positioned at a control station or in a manned tractor leading a tractor platoon that guides the driverless tractor onto and between fields. These tractors use vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) communication for receiving driving instructions from a remote human controller.

A driverless fully autonomous tractor or a farming robot (level 4) is capable of independently performing its assigned task while tracking its GPS position, controlling its speed and sensing and avoiding obstacles in front of it. The environment of an assigned task has to be virtually deterministic (the task is defined before it starts and the next state of the environment is determined by the current state and the actions performed, e.g. follow a predetermined route on a field) (e.g. [17]). Any delay in decision-making for the action choice must be as small as possible, preferably instantaneous without hesitation or time-consuming calculations (e.g. changing the steering angle when necessary). Sensor technologies such as infrared, radar and LiDAR³ improve safety by detecting unforeseen obstacles (such as people, animals, vehicles or other large objects) while usually reactive behaviours are used for responding to them rapidly. Currently, the majority of fully autonomous tractors navigate using lasers that bounce signals off several mobile transponders located around the field.

³ Light Detection and Ranging technology.

Driverless autonomous tractors usually deploy a radio receiver to receive tasks and commands from the remote command station; then, using control software installed on an on-board computer, the autonomous tractor translates it into vehicle commands such as steering, acceleration, braking, transmission and implement action while analysing real-time sensor data and views from the tractor’s on-board cameras.

In this way, a fully autonomous tractor is not only able to make its own way to the field along mapped on-farm paths, but also can work remotely in an autonomous fashion. This gives the human controller the ability to supervise multiple tractors at once and, since there is no need for a human driver, through multiple controller shifts, one could obtain non-stop (e.g. 24-hour) performance, eliminating driver fatigue and reducing work-related injuries.

Even though the agriculture fleet vehicle routing problem treated in this paper can be applied to every level of vehicle autonomy, in the study of solution approaches, we focus on level 4 in the context of intrinsically autonomous, decentralised, highly automated and geographically distributed vehicles (AMRs) with the vision of reaching level 5. At level 4, these vehicles are capable of communicating with each other and possibly with fixed infrastructure sensors on the field and/or with human collaborators. Level 5 is expected to be reached in the future by applying dynamic and online AMR fleet coordination methods and mechanisms that have yet to be developed. These approaches will result in agriculture vehicle fleets capable of completely autonomous coordination and operation without the need for any human supervision (e.g. [71]).

The AMRs today are mostly applied for weed control (e.g. [48]), seeding (e.g. [2,8]), harvesting (e.g. [82,86]), environmental monitoring (e.g. [46,66]) and soil analysis (e.g. [19,84]).

Examples of autonomous modular multi-purpose agricultural robots designed for horticultural tasks such as pruning, weeding, spraying and monitoring include the Thorvald [25] developed by Saga Robotics⁴ and Tom [76] developed by Small Robot Company,⁵ as well as harvesting robots developed by Dogtooth Technologies Ltd.⁶ Examples of weeding robots include Oz [48] made by Naio Technologies⁷ and MARS (mobile agricultural robot swarms),⁸ made of small robots with low individual intelligence that are equipped only with a minimum set of sensors and coordinated by a centralised algorithm, OptiVisor, aiming to optimise plant-specific precision agriculture and, due to their light weight, resulting in minimum soil compaction and energy consumption [8]. The drivers of traditional tractors may also work in tandem with and supervise the activities of the autonomous tractors and robots.

⁴ https://sagarobotics.com/

⁵ https://www.smallrobotcompany.com/

⁶ https://dogtooth.tech/robotic-harvesting/

⁷ https://www.naio-technologies.com/en/agricultural-equipment/weeding-robot-oz/

⁸ http://echord.eu/mars/index.php.html, http://echord.eu/public/wp-content/uploads/2018/01/Final-Report-MARS.pdf

3. Agriculture fleet vehicle routing problem

To the best of our knowledge, the problem of routing fleets of agriculture vehicles that we study in this paper – the Agriculture Fleet Vehicle Routing Problem (AF-VRP) – differs significantly from any other known variation of the vehicle routing problem. Therefore, we describe the motivation and background for the AF-VRP, after which we offer a formal description.

3.1. Motivation and background

Planning and scheduling of different stages of cultivation for each crop are based on agri-food production goals and agronomic needs. These are typically determined by tables for “technical itineraries”, which describe the entire cycle of crop cultivation processes throughout the year. For example, for the cultivation of maize in Spain, the scheduling of tasks is: deep ploughing in January; stone removal in February and March; harrowing, seeding and fertilisation, fertilising inserting, irrigation system, maintaining herbicide application in April and May; preseeding irrigation, and seeding and soil disinfection in June, etc. [75]. Technical itinerary tables also include scheduling of labourers for each month, required equipment and labour (driver/labourer), yield in hours per hectare of equipment and labour, and raw material (units per hectare).

In this paper, we focus on a scenario where multiple farmers and/or large agriculture conglomerates grow multiple crops with distinct cultivation needs simultaneously while sharing the same agriculture resources (e.g., implements and supply of raw materials such as pesticides, herbicides or chemicals) and a fleet of heterogeneous vehicles (tractors and/or AMRs). Each one of these crops requires tasks that need specific equipment for their implementation, usually a certain type of a tractor and a compatible implement with specific characteristics. For example, for deep ploughing, we need a tractor and a chisel; for stone removing, a tractor and a trailer. Further, some tasks may only be compatible within certain weather conditions while some tasks may be more important than others.

Each task requires a vehicle-implement-raw material combination:

Vehicle (tractor or AMR) parameters include: maintenance and cleaning frequency, operational state, task compatibility and compatibility with implements and related requirements (e.g. power, weight, front power take-off (PTO) used for taking power from a power source, guidance system or not, front loader, specific tires, etc.), driver requirements, fuel autonomy, and type and number of operators needed for operation per each task.

Implement parameters usually include: maintenance frequency (maximum time and distance passed in operation between two maintenance activities), operation state (damaged, operating), efficiency level for each task, task compatibility (an implement can perform a subset of tasks), tractor compatibility (it can be installed on a subset of tractors) and potentially implement cleaning (to avoid cross-contamination of diseases across fields).

Raw materials, such as fertilisers, herbicides, fungicides and growth regulators, are typically applied at specific stages of plant development in quantities and frequencies that can depend dynamically on field conditions. These conditions may vary from one part of the field to another due to differences in crop development, soil characteristics (e.g. inclination, chemical structure, etc.), varying microclimate (e.g. local sun exposure, temperature, humidity), prevalence of pests (e.g. insects) and weeds and plant disease development. Thus, tasks have to be planned locally based on these differences and may vary from one field location to another, with short weather windows in which they have to be performed. Potentially, they may extend across a 24-hour working day.

Nowadays, the allocation of vehicles (tractors and AMRs), implements, and raw materials to crop tasks is still done by human experts in an ad-hoc manner (e.g. [26,83]).

3.2. Description of the AF-VRP

For simplicity and without loss of generality, let us assume that mutually connected farming fields in a region of interest are positioned in the plane $E = [0, ℓ]^{2} \subset \mathbb{R}^{2}$ of side length $ℓ > 0$ (Fig. 1). We also assume that there is a permanent transport network in the region of interest through which each field can be reached and a temporary transport network in each field composed of narrow and long “aisles” (tractor lines) whose structure and topology are a function of the crop that is grown in the field (Fig. 1). In the latter network, the distance travelled crossing from one aisle to another is negligible compared to the distance travelled lengthwise, along an aisle. This setting is similar to conventional multiple parallel aisle warehouse systems.

Formally, we define the problem elements as follows. Figure 2 provides an overview of the sets, indices, parameters and decision variables used.

Fig. 2.

Elements of the formal AF-VRP. In sets, superscripts indicate partitions, and subscripts indices.

We consider a planning time horizon $T$ made of $| T |$ time periods. The transportation network is represented by an undirected weighted graph $G = (N, E)$ , where $N = \{n_{1}, n_{2}, \dots, n_{| N |}\}$ is a set of $| N |$ nodes and $E \subseteq \{(n_{i}, n_{j}) : n_{i}, n_{j} \in N, i \neq j\}$ is a set of edges. Both nodes and edges can be of two kinds: uncapacitated permanent nodes $N^{ρ} \subseteq N$ and uncapacitated permanent edges $E^{ρ} \subseteq E$ and temporary nodes $N^{τ} \subseteq N$ and temporary edges $E^{τ} \subseteq E$ , the latter two both with unitary capacity, where $N^{ρ} \cap N^{τ} = \emptyset$ and $N^{ρ} \cup N^{τ} = N$ and $E^{ρ} \cap E^{τ} = \emptyset$ and $E^{ρ} \cup E^{τ} = E$ . The topology of each subgraph representing a field with temporary nodes $n \in N^{τ}$ and temporary edges $e \in E^{τ}$ (Fig. 1) depends on the cultivated crop and the strength of the ground it represents. To be able to support the weight of a vehicle (an AMR or a tractor) with its respective implement and raw material, the ground should be strong enough. This strength depends on soil humidity, which increases after rain. Thus, cost matrix $C = (c_{v e t})$ (e.g. in terms of travel time) is defined for each vehicle $v \in V$ and each edge $e \in E$ at each period $t \in T$ . It is a function of (i) the cost of fuel consumption for vehicle v, (ii) the cultivated crop and (iii) weather conditions at edge e at time period $t \in T$ .
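The network definition can be made concrete with a small data sketch. The following is a minimal illustration under our own assumptions, not a prescribed implementation: node and vehicle names are hypothetical, undirected edges are stored as frozensets of their two end nodes, and unit capacity is interpreted as at most one vehicle per temporary edge per period.

```python
def is_partition(whole, permanent, temporary):
    """Check that the two subsets are disjoint and together cover the set."""
    return permanent.isdisjoint(temporary) and (permanent | temporary) == whole


def respects_unit_capacity(occupancy, temporary_edges):
    """Check that no temporary edge hosts two vehicles in the same period.

    occupancy is an iterable of (vehicle, edge, period) triples.
    Permanent edges are uncapacitated and therefore never checked.
    """
    vehicles_on = {}
    for vehicle, edge, period in occupancy:
        if edge in temporary_edges:
            vehicles_on.setdefault((edge, period), set()).add(vehicle)
    return all(len(vs) <= 1 for vs in vehicles_on.values())


# Hypothetical network: one road and two tractor lines in a field.
nodes = {"depot", "gate", "row_a", "row_b"}
permanent_nodes = {"depot", "gate"}
temporary_nodes = {"row_a", "row_b"}

road = frozenset({"depot", "gate"})
lane_1 = frozenset({"gate", "row_a"})
lane_2 = frozenset({"row_a", "row_b"})
edges = {road, lane_1, lane_2}
permanent_edges = {road}
temporary_edges = {lane_1, lane_2}

network_ok = (is_partition(nodes, permanent_nodes, temporary_nodes)
              and is_partition(edges, permanent_edges, temporary_edges))
```

A period-dependent cost $c_{v e t}$ could be stored in the same style, for example as a dictionary keyed by (vehicle, edge, period).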

For each temporary edge $e \in E^{τ}$ , given is a set of tasks to perform $K_{e}$ , where $K_{e} \subseteq K$ , and $K$ is the set of all tasks to perform in graph $G$ . Each task $k \in K_{e}$ is a request for a specific vehicle-implement-raw material (VIR) configuration to traverse temporary edge $e \in E^{τ}$ .

For each task $k \in K_{e}$ associated with edge $e \in E^{τ}$ , we define a cost of the task $c_{k e t}$ and a related required quantity $q_{r t k}$ of raw material $r \in R$ , where $R$ is a set of raw materials, in units per edge, depending on time period $t \in T$ . We may assume that the cost of respective raw material r for task $k \in K$ at edge $e \in E$ in time $t \in T$ is included in the task’s cost $c_{k e t}$ . For simplicity, we assume that for each time period $t \in T$ , this quantity is estimated depending on a weather forecast. Moreover, tasks $k \in K_{e}$ for each edge $e \in E^{τ}$ may have interdependencies, i.e. a relative order of execution, but they do not have to be necessarily performed in consecutive time periods.

Let $V$ be a set (fleet) of vehicles $v \in V$ that are initially positioned in a set of depot nodes $N^{D} \subseteq N$ . For simplicity, and without loss of generality, we assume that agriculture vehicles travel with constant velocity and their itinerary cannot include driving in reverse. Let $I$ be a set of implements $i \in I$ to be matched with (installed on) vehicles $v \in V$ on a one-to-one basis to perform a task $k \in K$ by using raw material $r \in R$ . Let $ξ_{v i t}$ be a binary decision variable that is equal to 1 if implement $i \in I$ is mounted on vehicle v at time $t \in T$ ; and is equal to zero otherwise.

Both implements and the raw materials are initially stored in depot nodes $n \in N^{D}$ . For simplicity, in each time period $t \in T$ , we assume that one and only one task $k \in K$ can be assigned to a VIR configuration.

Each vehicle is characterised by: (i) its capacity $L_{v}$ for carrying raw material (it can carry multiple raw materials as long as their overall quantity does not surpass the vehicle’s capacity $L_{v}$ ); (ii) its implement compatibility $ζ_{v i}$ , where $ζ_{v i} = 1$ if vehicle $v \in V$ is compatible with implement $i \in I$ and $ζ_{v i} = 0$ otherwise (we assume that implements can only be changed at a depot node $n \in N^{D}$ ); (iii) its task compatibility $γ_{v k}$ , where $γ_{v k} = 1$ if vehicle $v \in V$ can perform task $k \in K$ and $γ_{v k} = 0$ otherwise; and (iv) its fuel autonomy $L_{v}$ in terms of the number of time periods it can run before it must return to any of the depot nodes $n \in N^{D}$ .

The vehicles can move from one node to another if and only if there is an edge $e \in E$ connecting the two nodes. Contrary to the classic Vehicle Routing Problem, there is no limit on the number of visits to each task by a VIR combination. When all the assigned tasks have been completed, the vehicle should return to one of the depot nodes $n \in N^{D}$ .

Parameters of each implement $i \in I$ include: (i) maintenance and cleaning frequency $L^{i}$ , i.e. maximum number of periods passed in operation between two maintenance activities – the number of time periods it can run before it must return to any of the depot nodes $n \in N^{D}$ ; and (ii) task compatibility $ϵ_{i k}$ , where $ϵ_{i k} = 1$ if implement $i \in I$ is compatible with task $k \in K$ and $ϵ_{i k} = 0$ otherwise. To avoid disease transmission, an implement on a VIR configuration should be cleaned when changing from one field to another. This implies visiting a depot node $n \in N^{D}$ in a route between two fields.
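The compatibility parameters of vehicles and implements combine into a simple feasibility test for a VIR configuration. The following is a minimal sketch under our own assumptions: the binary parameters $ζ$ (vehicle-implement), $γ$ (vehicle-task) and $ϵ$ (implement-task) are stored as dictionaries in which a missing entry means incompatible, and all identifiers and numeric values are hypothetical.

```python
def vir_is_feasible(vehicle, implement, task, loads,
                    zeta, gamma, epsilon, capacity):
    """Return True if the VIR configuration may perform the task.

    loads maps each carried raw material to its quantity; the total
    must not exceed the vehicle's carrying capacity.
    """
    compatible = (
        zeta.get((vehicle, implement), 0) == 1     # vehicle-implement
        and gamma.get((vehicle, task), 0) == 1     # vehicle-task
        and epsilon.get((implement, task), 0) == 1  # implement-task
    )
    within_capacity = sum(loads.values()) <= capacity[vehicle]
    return compatible and within_capacity


# Hypothetical data: one tractor, two implements, one spraying task.
zeta = {("tractor_1", "sprayer"): 1, ("tractor_1", "chisel"): 1}
gamma = {("tractor_1", "spray_field_a"): 1}
epsilon = {("sprayer", "spray_field_a"): 1}
capacity = {"tractor_1": 500.0}  # assumed units of raw material

ok = vir_is_feasible("tractor_1", "sprayer", "spray_field_a",
                     {"herbicide": 300.0}, zeta, gamma, epsilon, capacity)
wrong_implement = vir_is_feasible("tractor_1", "chisel", "spray_field_a",
                                  {"herbicide": 300.0},
                                  zeta, gamma, epsilon, capacity)
```

The sketch deliberately leaves out the time-dependent limits (fuel autonomy and maintenance frequency), which constrain routes rather than single configurations.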

For each vehicle, a daily task schedule should be given at the beginning of the planning time horizon in terms of a route (path) to follow (edges to visit) and the plan of tasks to do in the field on each of the edges, as well as the remounting of implements, fuel recharging and raw material (re-)loading (visits to depot nodes), when necessary.

Assuming that all the parameters of this problem are deterministic and known a priori and if we do not consider the ownership issues of the fleet and fields to be cultivated in the agriculture fleet vehicle routing, then the off-line and centralised Agriculture Fleet Vehicle Routing Problem (AF-VRP) consists of determining a feasible schedule of the execution of tasks $k \in K$ by compatible VIR combinations that minimises the following general cost objective function composed of the costs of the vehicles’ routes and performed tasks:

$$ z = \min \sum_{v \in V} \sum_{e \in E} \sum_{t \in T} \Big( c_{v e t} x_{v e t} + \sum_{k \in K} c_{k e t} y_{v k e t} \Big) \qquad (1) $$

while respecting the tasks’, vehicles’, implements’, and raw materials’ constraints. This can be done for tactical planning a priori, i.e. before the beginning of the first period of the planning time horizon $T$ . In the centralised and off-line version of the AF-VRP problem, the plan of the best VIR combinations for task $k \in K_{e}$ is identified at the global level for all edges $e \in E$ . The plan includes for each vehicle $v \in V$ the route over the transport network edges $e \in E$ through each period of a given time horizon $t \in T$ and allocation of implement $i \in I$ and raw material $r \in R$ at time $t \in T$ at node $n \in N^{D}$ considering the edges’ capacities and respective tasks, and individual task, implement and vehicle constraints.
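To illustrate how objective (1) is evaluated for a candidate plan, the following is a minimal sketch. It rests on an assumption that is not stated in the text above: $x_{v e t}$ and $y_{v k e t}$ are read as binary indicators, equal to 1 when vehicle v traverses edge e in period t and when vehicle v performs task k on edge e in period t, respectively. All identifiers and cost values are hypothetical.

```python
def plan_cost(c_route, c_task, x, y):
    """Evaluate the cost of a plan as in objective (1).

    c_route[(v, e, t)]: routing cost of vehicle v on edge e in period t.
    c_task[(k, e, t)]:  cost of task k on edge e in period t.
    x[(v, e, t)], y[(v, k, e, t)]: assumed binary decision values;
    entries that are absent or zero contribute nothing.
    """
    route_cost = sum(c_route[(v, e, t)]
                     for (v, e, t), used in x.items() if used)
    task_cost = sum(c_task[(k, e, t)]
                    for (v, k, e, t), done in y.items() if done)
    return route_cost + task_cost


# Hypothetical plan: one vehicle drives two edges and sprays on the second.
c_route = {("v1", "e1", 0): 2.0, ("v1", "e2", 1): 3.0}
c_task = {("spray", "e2", 1): 5.0}
x = {("v1", "e1", 0): 1, ("v1", "e2", 1): 1}
y = {("v1", "spray", "e2", 1): 1}
z = plan_cost(c_route, c_task, x, y)
```

The function only scores a given plan; finding the plan that minimises this value subject to the task, vehicle, implement and raw material constraints is the actual optimisation problem.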

However, some tasks depend strongly on the weather conditions that may change during the day, e.g. the quantity of irrigation water, pesticides, fungicides and herbicides, which leads us to consider the AF-VRP as a stochastic problem.

3.3. Dynamic AF-VRP

The AF-VRP can be viewed as a tactical decision problem which leads to a solution able to face unpredicted contingencies a priori. Similar to other versions of the vehicle routing problem (see, e.g. [62]), there is a dynamic extension of the AF-VRP, which aims at optimal online reconfiguration of the VIR configurations and their online (re-)allocation to a changing set of tasks in real time to face uncertain weather and/or technical events. In case of an unpredicted event (e.g. vehicle breakdown or an unpredicted weather event), a task may be left only partially completed. Thus, we define a non-binary decision variable $u_{v k t e}$ representing the portion of task $k \in K$ performed by vehicle $v \in V$ at time $t \in T$ on edge $e \in E$ , where $0 ⩽ u_{v k t e} ⩽ 1$ .

Measuring the frequency of changes and the urgency of a task, the framework proposed by [33] classifies the Dynamic VRP problems into weakly, moderately and strongly dynamic problems based on the value of the effective degree of dynamism. However, this measure does not consider the geographical distribution and the travelling times between tasks [62].

The level of urgency of a task depends on the reaction time, i.e. the difference between the disclosure time of a task $t_{k}$ and the end of the corresponding time horizon $| T |$ , which is proportionally related to the quality of solution obtained when introducing the task into vehicles’ routes (see, e.g. [34,62]). The computation time of a proposed solution approach is crucial for real-time effective and efficient agriculture vehicle fleet performance.
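The two quantities discussed above can be sketched in a few lines. The reaction time follows the definition given in the text. The degree-of-dynamism formula is not stated in this paper; the form below (disclosure times of the dynamic tasks, normalised by the horizon length and averaged over all tasks) is our reading of the measure used in the dynamic VRP literature and should be checked against [33]. All function names and numbers are illustrative.

```python
def reaction_time(disclosure_time, horizon_end):
    """Time left between the disclosure of a task and the end of the horizon."""
    return horizon_end - disclosure_time


def effective_degree_of_dynamism(dynamic_disclosure_times, n_total, horizon_end):
    """Assumed form: (1 / n_total) * sum(t_k / |T|) over the dynamic tasks.

    Tasks known before the horizon starts contribute zero to the sum,
    so only the disclosure times of dynamic tasks are passed in.
    """
    if n_total == 0:
        return 0.0
    return sum(t / horizon_end for t in dynamic_disclosure_times) / n_total


# Hypothetical day: horizon of 10 periods, 3 tasks known in advance,
# 3 tasks disclosed during the day at periods 2, 5 and 8.
horizon_end = 10
disclosed = [2, 5, 8]
reactions = [reaction_time(t, horizon_end) for t in disclosed]
edod = effective_degree_of_dynamism(disclosed, n_total=6, horizon_end=horizon_end)
print(reactions, edod)
```

A task disclosed late leaves a short reaction time and pushes the measure up, which matches the intuition that late disclosures are harder to insert into existing routes.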

3.4. Decentralised AF-VRP

In an intrinsically decentralised context with various individually rational and competitive farmers that use a set of vehicles, implements and raw materials that are owned by one or more individually rational and competitive resource owners, fairness and envy-freeness in the allocation of VIR combinations to tasks have to be considered. Here, we are not only interested in optimising the overall system cost (equation (1)) but we also have to consider the distribution of the costs over both individual farmers and resource owners in the ecosystem. Otherwise, a solution might not be accepted by one or more decision makers. In the latter case, we have to introduce further incentives for these decision makers to behave in line with the desired system outcome.

Equity. Criteria of equity include fairness and envy-freeness. Envy-freeness is a criterion of fair division. Both vehicle owners and farmers can be modelled as agents. In an envy-free division, every agent feels that its share is at least as good as the share of any other agent, and thus no agent feels envy.

We study the case when the VIR combinations owned by multiple owners are to be allocated to the tasks that are owned by different farmers.

Let us consider the case when farmers are modelled through a set of agents $a \in A$ . Note that a task $k$ is a request for a specific VIR configuration to traverse temporary edge $e \in E^{τ}$ (owned by some agent $a \in A$ ). A VIR configuration may be allocated to every agent $a$ in each time period $t \in T$ . Each agent $a$ has a subjective preference relation $⪰_{a t}$ over different possible VIR configurations over time based on its individual and private cost dynamics.

Let us assume that the preference of each agent $a \in A$ is represented by a cost function $z_{a}$ . Also let $X_{A T}$ be the allocation of VIR configurations to agents $a \in A$ over time horizon $T$ . An allocation $X_{A T}$ is called envy-free if for all $a_{i}, a_{j} \in A$ , $z_{a_{i}} (X_{a_{i} T}) ⩽ z_{a_{i}} (X_{a_{j} T})$ . We say that an agent $a_{i}$ envies another agent $a_{j}$ if $a_{i}$ prefers the VIR configurations allocated to $a_{j}$ throughout time horizon $T$ over its own VIR configurations allocated throughout $T$ , i.e. if $z_{a_{i}} (X_{a_{i} T}) > z_{a_{i}} (X_{a_{j} T})$ . A division is called envy-free if no agent envies another agent.
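The envy-freeness condition can be checked directly from its definition. The sketch below assumes, for illustration only, that each agent's cost $z_{a}$ is additive over the VIR configurations in a bundle and is given by a private cost table; the agent names, configuration labels and numbers are hypothetical.

```python
def envy_pairs(cost, allocation):
    """Return every ordered pair (a_i, a_j) such that a_i envies a_j.

    cost[a] is a callable giving z_a for any bundle of VIR configurations;
    allocation[a] is the bundle assigned to agent a over the whole horizon.
    """
    return [
        (ai, aj)
        for ai in allocation
        for aj in allocation
        if ai != aj and cost[ai](allocation[ai]) > cost[ai](allocation[aj])
    ]


def is_envy_free(cost, allocation):
    return not envy_pairs(cost, allocation)


def additive_cost(table):
    """Illustrative assumption: the cost of a bundle is the sum of per-item costs."""
    return lambda bundle: sum(table[item] for item in bundle)


# Two farmer agents with private per-configuration costs (made-up values).
cost = {
    "A": additive_cost({"small": 5, "large": 3}),
    "B": additive_cost({"small": 2, "large": 4}),
}
good = {"A": ("large",), "B": ("small",)}
swapped = {"A": ("small",), "B": ("large",)}

print(is_envy_free(cost, good))
print(envy_pairs(cost, swapped))
```

Note that each comparison uses only the envying agent's own cost function, so the check never compares costs across agents.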

Since the VIR configurations are indivisible, an envy-free allocation may not exist. Deciding whether an envy-free and complete allocation exists is NP-complete. Deciding whether an envy-free and Pareto efficient allocation exists is above NP [15]. Thus, we have to resort to approximate heuristic approaches to solve this difficult problem.

Fairness in the AF-VRP in this context depends also on the choice to maximise egalitarian social welfare (i.e. minimise the worst-off cost, $\min \max_{a \in A} z_{a}$ ), to maximise the utilitarian social welfare (i.e. minimise the overall cost, $\min \sum_{a \in A} z_{a}$ ) or to maximise the elitist social welfare (i.e. minimise the best-off cost, $\min \min_{a \in A} z_{a}$ ) in the decentralised AF-VRP.

Unfortunately, by optimising the system based on the worst-off performance, we deteriorate the system efficiency and thus, the utilitarian welfare. From the overall system efficiency point of view, we can use utilitarian social welfare which sums up the agents’ individual utilities in a given allocation and thus gives us a measure of the overall and average benefit for the system. However, optimising the utilitarian social welfare is not acceptable in the systems whose success is based on self-concerned individually rational agents’ acceptance (see, e.g. [11]). This is because in utilitarian systems, the optimum is paid by (usually a few) worst-off agents. The latter, however, might not comply with paying the price of the system optimality (see, e.g. [36]). Nash Welfare optimisation maximises the product of the agents’ utilities and results in both egalitarian and utilitarian welfare maximisation (see, e.g. [43]). When applying Nash Welfare optimisation, we obtain $\min \prod_{a \in A} z_{a}$ . The resulting objective function is non-linear and we can linearise it by introducing a log operator similar to [43].
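The cost-based criteria above can be compared on a small example. The sketch below is illustrative only: the two cost profiles are invented, the function names are ours, and the log form assumes strictly positive costs. It shows that the criteria may rank the same pair of allocations differently and that taking logarithms turns the product into a sum without changing the ranking.

```python
import math


def egalitarian(costs):
    """Worst-off cost, max over agents of z_a (to be minimised)."""
    return max(costs.values())


def utilitarian(costs):
    """Overall cost, sum over agents of z_a (to be minimised)."""
    return sum(costs.values())


def elitist(costs):
    """Best-off cost, min over agents of z_a (to be minimised)."""
    return min(costs.values())


def product_cost(costs):
    """Product of the agents' costs, as in the Nash-type objective of the text."""
    return math.prod(costs.values())


def log_product_cost(costs):
    """Sum of log-costs; requires z_a > 0 and preserves the product ranking."""
    return sum(math.log(z) for z in costs.values())


# Two hypothetical allocations and the per-farmer costs they induce.
x1 = {"A": 2, "B": 8, "C": 2}
x2 = {"A": 4, "B": 5, "C": 4}

for name, fn in [("egalitarian", egalitarian), ("utilitarian", utilitarian),
                 ("elitist", elitist), ("product", product_cost)]:
    better = "x1" if fn(x1) < fn(x2) else "x2"
    print(f"{name}: x1={fn(x1)}, x2={fn(x2)}, lower cost: {better}")
```

In this instance the egalitarian criterion prefers `x2`, whereas the utilitarian, elitist and product criteria prefer `x1`, so the choice of welfare measure changes which allocation is selected.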

4. Decentralising the coordination of agriculture vehicle fleets

The AF-VRP problem considers providers of farming services (i.e. owner(s) of vehicles, implements, and raw materials) and tasks dispersed geographically that may be owned by multiple farm owners and thus all of them may be considered active participants in the agriculture fleet coordination process. Lujak et al. [45] categorise coordination models for vehicle fleets based on their ownership structure and the level of decentralisation. In the following, we adapt this categorisation to the coordination models for agriculture vehicle fleets that can be defined as follows.

A centralised coordination model is where the AF-VRP problem is solved in a single block by only one decision-maker (e.g. a single entity) having total control over and complete information about the vehicle fleet and tasks to be executed in the region of interest.

A distributed coordination model is where fields are owned by multiple farmers with a single vehicle fleet owner, where the global AF-VRP problem is decomposed such that each farmer is represented by an autonomous decision maker (agent) that may solve its own subproblem only using its own local decision variables and parameters. The allocation of a limited number of agriculture machinery (global constraints) is achieved through the interaction between competing farmer agents and a vehicle fleet owner (a single autonomous agent) having all the fleet information available. Farmer agents in competition with one another for farming resources are not willing to disclose their complete information to one another but will share a part of it if it facilitates achieving their local objectives. The vehicle fleet owner agent here is responsible for achieving globally efficient resource allocation by interacting with farmer agents usually through an auction. The problem decomposition here is done to gain computational efficiency since farmer agents can compute their bids in parallel. However, the resource allocation decisions are still made by a single decision maker (vehicle fleet owner) with the requirement on synchronous bidding of farmer agents (e.g. [22,23,85]).

A decentralised coordination model, which further distributes the model, is where there are multiple resource owner (vehicle) agents, multiple competing farmer agents (one agent per farmer) requesting the farming service and asynchrony in decision-making. Each farmer and resource (vehicles, implements, and raw materials) owner agent has access only to its local information, with no global information available. Farmer agents are responsible for the execution of a set of (possibly overlapping) field tasks with private cost values. The objective for the subset of tasks belonging to an individual farmer agent is to perform them at minimal individual cost, which is reflected in specific task constraints. The cost of an individual task here is less important than the overall cost for a competitive farmer agent. A set of tasks belonging to each farmer agent competes with the sets of tasks of other farmers for the allocation of fleet machinery held by multiple resource owners. Similarly, if the vehicles are owned by multiple fleet owners, then each vehicle agent should coordinate the allocation of its tasks with other vehicle agents of the fleet such that the overall operational costs of the fleet owner in performing the allocated tasks are minimised. The vehicle agent here must collaborate with other vehicles of its vehicle owner and compete with the others. The vehicles must negotiate resource allocation by running localised algorithms while exchanging relevant (possibly obsolete) information. Localised algorithms make the achievement of a desired global objective easier through simple local interactions of vehicle agents with their environment and other vehicle and farmer agents, with no need for a central decision maker. The decisions specifying these interactions emerge from local information. Fairness and envy-freeness in resource allocation here play a major role. 
As in the distributed model, competitive farmer and vehicle owner agents are not willing to disclose their complete information but will share a part of it if it facilitates achieving their individual local objectives. Resource allocation through time is achieved here by means of a decentralised protocol.

We note here the main differences between distributed and decentralised coordination models (see, e.g. [45]). Distributed coordination relies on both local and shared (global) parameters and variables; decentralised coordination only has access to local information. Local parameters and variables are private (known only to the agent who holds them), whereas global parameters and variables are public and shared among two or more agents – potentially among all the agents in the system. If we assume selfish (i.e. individually rational) agents, resource owners can manipulate these parameters and variables or deceive agents in communicating their values to influence the individual decision-making of each one of them and thus obtain the behaviour of the system the resource owner wants. Furthermore, due to the lack of global non-obsolete and truthful information, in general, solution approaches for decentralised coordination concentrate on finding a feasible (admissible) solution without quality of solution guarantees. In contrast with the distributed case most often studied in the operations research field, where the emphasis is on the method’s optimality gap, decentralised coordination methods are mostly approximate heuristics-based methods without quality of solution guarantees but with proven completeness, soundness and termination – hence most appropriate for deployment in real-world, dynamic and messy environments such as agricultural robotics.

5. Related standard combinatorial optimisation problems and solution approaches

In this section, we review standard combinatorial optimisation problems that provide a baseline for the agriculture fleet vehicle routing problem, as described previously. We concentrate on the dynamic versions of these problems, that is, the case when both task demand and resource availability may change in time.

The most important decisions that must be taken by fleet managers have to do with the problems of assigning agriculture vehicles in general and AMRs in particular to implements and tasks (e.g. [39]) and managing their routes (e.g. [7,9,62,64,65,70,79,80]).

The dynamic AF-VRP considers routing of vehicles over a set of dynamically changing tasks through time. This is generally a computationally very complex problem. We can simplify it by ignoring the time dimension and myopically considering in each period only the tasks that are to be performed in the same period. Thus, we simplify the AF-VRP problem to the problem of allocation (dispatch). This problem focuses on deciding which vehicle should be assigned to each task. Conventionally, vehicles are assigned to tasks based on the First Come, First Served (FCFS) strategy. This strategy creates great discrimination among the tasks, increases transport costs and significantly lowers overall fleet performance. Fleet management significantly improves if the vehicles are dynamically assigned (in real time) depending on the characteristics of each vehicle and task requirements (e.g. [5,10,38,69]).
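The gap between FCFS dispatch and a cost-aware assignment can be seen on a two-task instance. The sketch below makes two assumptions that are not in the text: FCFS is modelled as each task, in arrival order, taking the cheapest vehicle that is still free, and the cost-aware dispatch is found by brute force over all permutations, which is only viable for a handful of vehicles (a polynomial-time assignment algorithm would be used in practice). The cost matrix is invented.

```python
from itertools import permutations


def fcfs_dispatch(cost):
    """Assumed FCFS rule: tasks are served in arrival order and each takes
    the cheapest vehicle that is still free. cost[task][vehicle]."""
    free = list(range(len(cost[0])))
    assignment = []
    for row in cost:
        vehicle = min(free, key=lambda j: row[j])
        free.remove(vehicle)
        assignment.append(vehicle)
    total = sum(cost[t][v] for t, v in enumerate(assignment))
    return assignment, total


def best_dispatch(cost):
    """Minimum-cost one-to-one dispatch by exhaustive search (small n only)."""
    n = len(cost)
    best = min(permutations(range(n)),
               key=lambda p: sum(cost[t][p[t]] for t in range(n)))
    return list(best), sum(cost[t][best[t]] for t in range(n))


# Hypothetical travel costs: rows are tasks in arrival order, columns are vehicles.
cost = [
    [1, 2],
    [2, 10],
]
print(fcfs_dispatch(cost))
print(best_dispatch(cost))
```

Here the first task grabs the vehicle that the second task needs, so FCFS pays 11 while the cost-aware dispatch pays 4.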

Various mathematical and computational models have been developed for the optimisation of fleet operations to serve customer demands while minimising costs, e.g. [4,16,38,53,55,57]. Many of the problems of fleet management correspond to combinatorial optimisation problems, such as the problem of determining optimal routes, e.g. [10,16,38,43,55,57], that are still very difficult to solve, even in a static context with batch processing of requests and dynamic vehicle assignment problems, e.g. [29,39,60].

In the case of poor fleet performance, a penalty for non-compliance with Service Level Agreements (SLAs) translates into the loss of revenue. For agriculture vehicle fleets, optimal allocation of tasks and well-designed routes to vehicles not only ensure the service level, but also meet the needs of the fleet owner(s) and stakeholders in a cost-effective and efficient manner.

5.1. Multi-index assignment problem

The methods for dynamic AMR fleet task assignment and dynamic (re-)routing are relevant in various scenarios, such as, e.g. emergency services (e.g. [5,39,40,69]), taxi, hot meal home delivery and vehicle sharing. After a meticulous analysis of the available solutions, we have identified that the combination of these methods can provide a true differential value in the agriculture vehicle fleets.

The problem of allocation of the vehicles to implements and indivisible tasks may be modelled as a multi-index assignment problem [61] that we re-run in each time period when the constituents of the problem change. Each constituent part of this allocation is characterised by a set of attributes describing its availability and compatibility with the rest of the constituents that influence the cost or profit resulting from such a multi-index allocation. Assume there are n vehicle agents, m tasks and k implements. Here, the emphasis is on one-to-one assignment among the elements in each set. Furthermore, each vehicle agent has a valuation function that maps each implement-task combination to some non-negative value particular to that vehicle agent. These valuations are additive, which means that an agent’s value for a set of task-implement combinations is simply the sum of the values of each combination of this set. Our goal is to compute a one-to-one allocation, i.e. a partitioning of $| K |$ tasks, $| I |$ implements and $| V |$ vehicle agents, of minimum overall cost. The mathematical formulation of such a problem leads to axial k-index assignment problems [63] and in the case of three indices (vehicles, implements and tasks), to the axial 3-index Assignment Problem (axial 3AP), which is an NP-hard binary programming problem for which the only scalable and efficient solution approach is based on (meta-)heuristics (e.g. [77]). Moreover, no polynomial-time algorithm can achieve a constant performance ratio for this problem unless P = NP [13]. Crama and Spieksma designed approximation algorithms that yield a feasible solution whose value is not worse than 3/2 of the optimal value when the overall assignment cost is a decomposable sum of the costs of all three set pairs [13].
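To make the structure of the axial 3AP concrete, the sketch below solves a tiny instance by exhaustive search. It assumes equal numbers of vehicles, implements and tasks and a full cost array `cost[v][i][k]`; both the function name and the data are hypothetical. The search enumerates $(n!)^{2}$ candidate solutions, so it is a reference for very small instances only and not a practical method.

```python
from itertools import permutations


def solve_axial_3ap(cost):
    """Exhaustive search for a minimum-cost set of n disjoint
    (vehicle, implement, task) triples; cost[v][i][k] is the triple cost."""
    n = len(cost)
    best_value, best_triples = float("inf"), None
    # Vehicles are kept in index order; implements and tasks are permuted.
    for implements in permutations(range(n)):
        for tasks in permutations(range(n)):
            value = sum(cost[v][implements[v]][tasks[v]] for v in range(n))
            if value < best_value:
                best_value = value
                best_triples = [(v, implements[v], tasks[v]) for v in range(n)]
    return best_value, best_triples


# Made-up 2 x 2 x 2 instance.
cost = [
    [[5, 1], [2, 6]],   # vehicle 0: rows are implements, columns are tasks
    [[3, 7], [4, 8]],   # vehicle 1
]
value, triples = solve_axial_3ap(cost)
print(value, triples)
```

Each vehicle, implement and task appears in exactly one triple, which is the one-to-one requirement described above.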

Reynen et al. [67] present alternate integer programming formulations for the multi-dimensional assignment problem with decomposable costs with an increased number of variables and present solution methods based on Lagrangian Relaxation and massively parallel algorithms. Aiex et al. [1] designed a greedy randomised adaptive search procedure with path relinking (GRASP) for solving axial 3APs. GRASP is a multistart metaheuristic for combinatorial optimisation consisting of a construction procedure based on a greedy randomised algorithm and a local search. A parallel version appeared in [52]. Their computational experiments showed very good results compared with previously proposed heuristics. Huang and Lim [28] proposed a hybrid genetic algorithm for this problem and reported on extensive computational experiments. Li et al. [36] propose a novel convex dual approach to the three-dimensional assignment problem. It is shown that Li et al.’s dual approach is equivalent to the Lagrangian relaxation method in terms of the best value attainable by the two approaches. However, the pure dual representation is not only more elegant, but also makes the theoretical analysis of the algorithm more tractable. An asymptotically optimal approximation algorithm for axial k-index assignment problems was given by Kravtsov [31]. Frieze et al. [21] study random multi-dimensional assignment problems where the costs decompose into the sum of independent random variables. They minimise the total cost and show that with high probability a simple greedy algorithm is a $(3 + o(1))$-approximation. An adaptive algorithm that extends the basic greedy-type algorithmic schemes using transition to a probabilistic setup based on variables randomisation for solving the axial 3-Index AP was also proposed [49]. Here, the minimisation of an objective function is replaced by the minimisation of its expectation.

5.2. Assignment problem

The multi-index assignment problem is a higher dimensional version of the standard linear (two-dimensional) assignment problem, i.e. a weighted bipartite matching problem in which the objective is to minimise the total cost of assigning n resources to n tasks. The latter is an important subproblem of many NP-hard optimisation problems, e.g. the Travelling Salesperson Problem, and both sequential (the Hungarian algorithm, shortest path algorithms and auction algorithms) and parallel implementations of algorithms for it are known.

In the case where sets of fixed (one-to-one) vehicle-to-implement combinations are static and given in advance, each such combination can be considered as an agent. Then, the multi-index assignment problem is simplified to the assignment problem focusing on the one agent-one task allocation at the time (e.g. [6,40]).

The dynamic task assignment problem is equivalent to the assignment problem for which several centralised approaches exist, e.g. [53]. One of the best known is the Hungarian method [32]. In [22], Lujak et al. propose a distributed version of the Hungarian Method for multi-robot task allocation where mobile robot agents are required to store all the information locally and there is no available shared memory.

Auctions are one of the tools for mechanism design of agent systems, e.g. [3,41,68]. The implementation usually requires solving a combinatorial non-linear optimisation problem, which is in general NP-hard and intractable for complex networks. However, with certain relaxations, the latter can be modelled as a convex optimisation problem [3,56]. Computational optimisation auctions are methods that are similar to the Gauss-Seidel and Jacobi methods, e.g. [3]. This approach is well suited for massive parallelisation of local decision-making based on the information interchanged among multiple processors. It is modular, based on regular interactions, incremental, analysable, and permits incentive engineering. In [41,44], Lujak et al. proposed a modified version of Bertsekas’ auction algorithm for the case of incomplete information exchange and explored the deterioration of the solution quality according to the size of the communication network and proposed strategies to overcome this problem. Responding to the task assignment in the case of the medical emergency assistance of urgent out-of-hospital patients by ambulances, Lujak et al. proposed a distributed algorithm for the simultaneous assignment of ambulances [5,39] and ambulances and hospitals to multiple simultaneous patients in [40], where the authors also proposed an ambulance vehicle Voronoi-based relocation approach. Moreover, in [43], Lujak et al. proposed the route assignment approach that considers fair and envy-free routes and improves the overall efficiency with respect to the user optimum. Here, fair routes are related to the overall route cost that should be as balanced as possible between the vehicles, while envy-freeness is related to individual route costs that should not vary between each other more than some predefined factor.
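For readers unfamiliar with auction algorithms, the following is a minimal sketch of a forward auction for the square assignment problem, written for cost minimisation and processing one bidder at a time (the Gauss-Seidel flavour). It is a textbook-style illustration, not the modified algorithm of [41,44]; it assumes integer costs, complete information and a bidding increment `eps` below $1/n$, under which the final assignment is optimal. All names and data are hypothetical.

```python
def auction_assignment(cost, eps=None):
    """Forward auction for an n x n assignment problem with integer costs.

    cost[i][j] is the cost of agent i performing task j. Returns the list
    `assigned` (assigned[i] = task of agent i) and the total cost.
    """
    n = len(cost)
    if eps is None:
        eps = 1.0 / (n + 1)          # eps < 1/n gives optimality for integer costs
    prices = [0.0] * n               # current price of each task
    owner = [None] * n               # owner[j] = agent currently holding task j
    assigned = [None] * n            # assigned[i] = task currently held by agent i
    unassigned = list(range(n))

    while unassigned:
        i = unassigned.pop()
        # Net value of every task for bidder i (negative cost minus price).
        values = [-cost[i][j] - prices[j] for j in range(n)]
        best = max(range(n), key=values.__getitem__)
        second = max((values[j] for j in range(n) if j != best),
                     default=values[best])
        # Raise the price by the bidder's margin over its second choice, plus eps.
        prices[best] += values[best] - second + eps
        if owner[best] is not None:  # the previous holder is outbid
            assigned[owner[best]] = None
            unassigned.append(owner[best])
        owner[best] = i
        assigned[i] = best

    return assigned, sum(cost[i][assigned[i]] for i in range(n))


# Made-up integer cost matrix: rows are vehicle agents, columns are tasks.
cost = [
    [4, 1, 3],
    [2, 0, 5],
    [3, 2, 2],
]
assigned, total = auction_assignment(cost)
print(assigned, total)
```

Each bid needs only the bidder's own cost row and the current task prices, which is what makes the scheme attractive for parallel and distributed implementations.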

Through a dynamic vehicle reassignment, we can significantly increase the overall performance of the fleet and lower farming costs. Furthermore, by dynamic routing, the fleet can divide the tasks to perform and each fleet vehicle can then respond in real time to any changes in terrain characteristics by rerouting and while doing so, maintain the region of interest well covered, so as to reach tasks quickly and efficiently. A distributed multi-agent computation model for route guidance under congestion in vehicle traffic considering envy-freeness and fairness was proposed by Lujak et al. in [42,43]. It was shown by simulation experiments that by proposing routes that are envy-free and fair, the user equilibrium traffic assignment solution can be improved towards the system optimum.

5.3. Vehicle routing problem

At the tactical level, the problem of routing a fleet of vehicles combined with implements through the execution of farming tasks in the fields may be modelled as a vehicle routing problem (VRP). VRPs are a class of combinatorial optimisation problems that consist of determining sequences of tasks for a fleet of vehicles with limited resources while minimising an objective function that is typically the total completion time or the total cost. VRPs are defined on graphs, and the tasks to be performed are associated with nodes or with arcs. When the tasks, e.g. deliveries, are associated with nodes, the corresponding problems are called node routing problems whereas when the tasks are associated with arcs, these are named arc routing problems.

The basic arc routing problem related to the AF-VRP is the Capacitated Arc Routing Problem (CARP). The CARP aims to determine a minimum cost set of routes that serve a subset of edges with positive demand under capacity constraints. For this problem introduced by Golden and Wong [24], many exact and heuristic algorithms have been proposed and are described in the book of Laporte and Corberan [12] and in the recent annotated bibliography by Mourão and Pinto [51]. The CARP is a simplified version of the AF-VRP where the edges to be traversed correspond to the aisles of the fields and the demand to the quantities of raw materials to be used. However, the AF-VRP includes several additional features that make it challenging to obtain good solutions for instances of the size encountered in practice.

When addressing an arc routing problem, it is essential to consider whether its transformation into a node routing problem presents a particular advantage or not. On the one hand, in arc routing problems, demand is associated with edges, and the graph is frequently sparse, which could represent an advantage that can be exploited in the design of solution algorithms. Letchford and Oukil [35] described an approach where they exploit the sparsity in the identification of promising routes. On the other hand, an arc routing problem can be transformed to a node routing problem (see, e.g. Pearn et al. [58] or Longo et al. [37]) for which many efficient algorithms have been proposed (e.g. [50,54,72,80,81]).

When there is only one vehicle, node routing problems reduce to variants of the classical Travelling Salesperson Problem (TSP). Among them, some are relevant since they include some key features of the AF-VRP. In this paper, we focus on narrow and long aisle farming fields, in which the distance travelled across from one aisle to another is negligible compared to the distance travelled along the length of the aisle. This setting is similar to conventional multiple parallel-aisle warehouse systems. The Steiner TSP (STSP) is an extension of the TSP that is suitable for these instances. Given a list of locations, some of which are required, and the distances between them, the goal is to find the shortest possible walk that visits each required location and then returns to the origin. As we are looking for a walk, vertices can be visited more than once, and edges may be traversed more than once. Exact approaches to this problem only exist for warehouses that have at most three cross aisles. For other layout types, various heuristic approaches exist, e.g. [78].

The TSP considers minimising the overall travel time of a salesperson but if we concentrate on minimising the waiting times of the tasks, then we speak about the Travelling Repairman Problem [18]. Luo et al. [47] extend the multiple Travelling Repairman Problem (m-TRP) by considering a limitation on the total distance that a vehicle can travel. The resulting problem is called the Multiple Travelling Repairmen Problem with Distance constraints (MTRPD). The authors design a tailored branch-and-price-and-cut algorithm for this problem proposing a bounded bi-directional label-setting algorithm for the pricing subproblem. The m-TRP has characteristics in common with the problem we have to solve for the management of an agricultural fleet.
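The difference between the two objectives can be illustrated on a three-node instance. In the sketch below, the travel-time matrix is made up (a depot at the origin of a line, one task one unit to the right and one task two units to the left), and the function names are ours. The waiting time of a task is taken as the vehicle's arrival time at it, with zero service time assumed.

```python
def tour_length(order, travel_time, depot=0):
    """TSP-style objective: total travel time of the closed tour."""
    stops = [depot, *order, depot]
    return sum(travel_time[a][b] for a, b in zip(stops, stops[1:]))


def total_waiting_time(order, travel_time, depot=0):
    """Repairman-style objective: sum of the arrival times at the tasks."""
    clock, total, here = 0, 0, depot
    for task in order:
        clock += travel_time[here][task]
        total += clock
        here = task
    return total


# Node 0 is the depot; nodes 1 and 2 are tasks (hypothetical travel times).
travel_time = [
    [0, 1, 2],
    [1, 0, 3],
    [2, 3, 0],
]
for order in ([1, 2], [2, 1]):
    print(order, tour_length(order, travel_time),
          total_waiting_time(order, travel_time))
```

Both visiting orders give the same tour length of 6, yet serving the nearer task first lowers the total waiting time from 7 to 5, so a route that is optimal for one objective need not be optimal for the other.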

Another node routing problem related to the AF-VRP is the Field Service Routing Problem (FSRP). Given a limited number of technicians, the FSRP consists of determining a set of optimal technician routes to serve customer requests, while ensuring that each technician has the required skills for his/her tasks. There is an analogy between the technicians and the vehicles, implements and raw materials we present here. The most relevant variant was introduced by Kovacs et al. [30] where teams of technicians have to be built for some time period to complete most of the tasks. Several other extensions have been considered, including stochastic travel and service time and priorities between the tasks (see, e.g. [7]).

6. Conclusions and research opportunities

In this paper, we presented and described the agriculture fleet vehicle routing problem (AF-VRP). We presented the nomenclature for sets, indices, parameters and decision variables that are used in the AF-VRP mathematical program that can be developed in future work. Moreover, we discussed its dynamic and decentralised version and ways of simplification by removing the time dimension and focusing on dispatching vehicles in each time period by considering only the tasks that are to be performed at the present time. Finding an efficient solution approach to the AF-VRP problem remains an open challenge.

The AF-VRP is an intrinsically decentralised problem. Even though, as discussed, fleet coordination approaches may be centralised, distributed or decentralised, the focus in this problem is on distributed or decentralised online fleet coordination methods that are today still to be developed.

To simplify the AF-VRP in decentralised environments, we can combine aspects of the assignment problem, 3-index assignment problem and the capacitated arc routing problem. Multiple centralised algorithms have been proposed for each of these individual subproblems assuming perfect information. However, both a computationally efficient mathematical formulation for the dynamic and decentralised agriculture fleet vehicle routing problem and the related solution approach are still open challenges to the best of our knowledge.

The development of distributed MAS-based route guidance for AMR fleets that allows for a completely autonomous AMR fleet is still an open scientific challenge. In addition, the topic of distributed and dynamic multi-task assignment and vehicle routing considering multiple vehicle, operator and farming constraints is still an insufficiently explored field. To the best of our knowledge, distributed and decentralised MAS coordination models and optimisation approaches for vehicle fleet coordination are scarce and have undergone limited real-world testing.

First of all, a decentralised coordination approach is more robust than its centralised counterpart because it is resilient to individual vehicle errors and can rely on the fleet’s intrinsic built-in redundancy. It is scalable since it can operate at a larger scale with multiple large fields at once aggregating vehicle capacity and field throughput across all the fleet’s vehicles. It is open, seamlessly adapting to vehicles entering or leaving the system, and has fewer levels of authority. Finally, it does not suffer from the “single point of failure” problem found in centralised systems. However, distributed open vehicle fleets also have to deal with inter-agent communication and coordination overhead that can sometimes make them slower or more difficult to control than their centralised counterparts.

In the decision-making distribution process, the emphasis of the decomposition of the dynamic and decentralised AF-VRP problem should be on the scalability, local communication and computation constraints of each physical vehicle agent, the structure and topology of the dynamic communication network, and the available communication and processing capacities of the developed cyber-physical MAS. One common goal in this context is an efficient and cost-effective farming service using an agriculture vehicle fleet while considering vehicle autonomy and fairness constraints in work assignment, individual rationality, preferences and constraints – whether they are of operators, farmers or fleet owner(s), as well as farming tasks’ constraints. Quality of solution guarantees play a crucial role underlying sustainable competitive advantage.

The long-term goal of distributing decisions in agriculture vehicle fleets is the development of an open and non-proprietary software platform in the cloud for distributed route guidance and task coordination at large agriculture farms and peer-to-peer sharing of relevant agriculture resources, vehicles and AMRs among farmers. Such a route guidance approach contributes to a more efficient and competitive service in line with the Internet of Robotic Things (e.g. [73]) and Internet of Food Things [59]. Human drivers may also benefit from this technology as they may be motivated to perform better if they feel a sense of autonomy, thus improving the output, task engagement, time-on-task and accuracy. However, behavioural measures should be further studied to understand the triggers of individual effort and motivation.

The indirect benefits of such a distributed and decentralised AMR fleet coordination MAS, among others, should include higher efficiency and benefit in both large and small farms, smaller carbon footprint and reduction in pesticides, and above all, fair participation of fleet owners, AMR operators and farmers, with related rewards and benefits. Decentralised coordination mechanisms will not completely fix sustainable agriculture concerns, but they should facilitate improvements with respect to energy efficiency and resource usage, particularly by enabling precision farming functions, as they are directly related to giving higher autonomy to the fleet of agriculture vehicles while changing the hierarchical and unscalable farming structure to a more efficient and balanced enterprise.

Footnotes

Acknowledgements

This work was partially supported by the “AGRIFLEETS” project ANR-20-CE10-0001 funded by the French National Research Agency (ANR) and by the UK Research and Innovation (UKRI) Research England council’s “Lincoln Agri-Robotics” project as part of the Expanding Excellence in England (E3) Programme.

References

1. R.M. Aiex, M.G. Resende, P.M. Pardalos and Toraldo, GRASP with path relinking for three-index assignment, INFORMS Journal on Computing 17(2) (2005), 224–247. doi:10.1287/ijoc.1030.0059.

2. A.A. Ali, Zohaib and S.A. Mehdi, An autonomous seeder for maize crop, in: Proceedings of the 2019 5th International Conference on Robotics and Artificial Intelligence, 2019, pp. 42–47. doi:10.1145/3373724.3373737.

3. D.P. Bertsekas, Auction algorithms for network flow problems: A tutorial introduction, Computational Optimization and Applications 1(1) (1992), 7–66. doi:10.1007/BF00247653.

4. Bielli, Bielli and Rossi, Trends in models and algorithms for fleet management, Procedia-Social and Behavioral Sciences 20 (2011), 4–18. doi:10.1016/j.sbspro.2011.08.004.

5. Billhardt, Fernández, Lemus, Lujak, Osman, Ossowski and Sierra, Dynamic coordination in fleet management systems: Toward smart cyber fleets, IEEE Intelligent Systems 29(3) (2014), 70–76. doi:10.1109/MIS.2014.41.

6. Billhardt, Lujak, Sánchez-Brunete, Fernández and Ossowski, Dynamic coordination of ambulances for emergency medical assistance services, Knowledge-Based Systems 70 (2014), 268–280. doi:10.1016/j.knosys.2014.07.006.

7. Binart, Dejax, Gendreau and Semet, A 2-stage method for a field service routing problem with stochastic travel and service times, Computers & Operations Research 65 (2016), 64–75. doi:10.1016/j.cor.2015.07.001.

8. Blender, Buchner, Fernandez, Pichlmaier and Schlegel, Managing a mobile agricultural robot swarm for a seeding task, in: IECON 2016-42nd Annual Conference of the IEEE Industrial Electronics Society, IEEE, 2016, pp. 6879–6886. doi:10.1109/IECON.2016.7793638.

9. Brotcorne, Laporte and Semet, Ambulance location and relocation models, European Journal of Operational Research 147(3) (2003), 451–463. doi:10.1016/S0377-2217(02)00364-8.

10. B.K.-S. Cheung, Choy, C.-L. Li, Shi and Tang, Dynamic routing model and solution methods for fleet management with mobile technologies, International Journal of Production Economics 113(2) (2008), 694–705. doi:10.1016/j.ijpe.2007.10.018.

11. Chevaleyre, P.E. Dunne, Endriss, Lang, Lemaitre, Maudet, Padget, Phelps, J.A. Rodriguez-Aguilar and Sousa, Issues in multiagent resource allocation, Informatica 30(1) (2006), 03505596.

12. Á. Corberán and Laporte, Arc Routing: Problems, Methods, and Applications, SIAM, 2015.

13. Crama and F.C. Spieksma, Approximation algorithms for three-dimensional assignment problems with triangle inequalities, European Journal of Operational Research 60(3) (1992), 273–279. doi:10.1016/0377-2217(92)90078-N.

14. G.B. Dantzig and J.H. Ramser, The truck dispatching problem, Management Science 6(1) (1959), 80–91. doi:10.1287/mnsc.6.1.80.

15. de Keijzer, Bouveret, Klos and Zhang, On the complexity of efficiency and envy-freeness in fair division of indivisible goods with additive preferences, in: International Conference on Algorithmic Decision Theory, Springer, 2009, pp. 98–110.

16. Desrosiers, Dumas, M.M. Solomon and Soumis, Time constrained routing and scheduling, Handbooks in Operations Research and Management Science 8 (1995), 35–139. doi:10.1016/S0927-0507(05)80106-9.

17. Emmi, Gonzalez-de Soto, Pajares and Gonzalez-de Santos, New trends in robotics for agriculture: Integration and assessment of a real fleet of robots, The Scientific World Journal 2014 (2014).

18. I.O. Ezzine, Semet and Chabchoub, New formulations for the traveling repairman problem, in: Proceedings of the 8th International Conference of Modeling and Simulation, Citeseer, 2010, pp. 1889–1894.

19. Finegan, Jaffe, Leon, Lytle, Morgan, Greene, Meyer, Brinkman, De Wekker, Yochum et al., Development of an autonomous agricultural vehicle to measure soil respiration, in: 2019 Systems and Information Engineering Design Symposium (SIEDS), IEEE, 2019, pp. 1–6.

20. M.M. Flood, The traveling-salesman problem, Operations Research 4(1) (1956), 61–75. doi:10.1287/opre.4.1.61.

21. Frieze, Pegden and Tkocz, On random multi-dimensional assignment problems, 2019, arXiv preprint, arXiv:1901.07167.

22. Giordani, Lujak and Martinelli, A distributed algorithm for the multi-robot task allocation problem, in: International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, Springer, 2010, pp. 721–730.

23. Giordani, Lujak and Martinelli, A distributed multi-agent production planning and scheduling framework for mobile robots, Computers & Industrial Engineering 64(1) (2013), 19–30. doi:10.1016/j.cie.2012.09.004.

24. B.L. Golden and R.T. Wong, Capacitated arc routing problems, Networks 11(3) (1981), 305–315. doi:10.1002/net.3230110308.

25. Grimstad and P.J. From, The Thorvald II agricultural robotic system, Robotics 6(4) (2017), 24. doi:10.3390/robotics6040024.

26. S.S.H. Hajjaj and K.S.M. Sahari, Review of research in the area of agriculture mobile robots, in: The 8th International Conference on Robotic, Vision, Signal Processing & Power Applications, Springer, 2014, pp. 107–117. doi:10.1007/978-981-4585-42-2_13.

27. S.S.H. Hajjaj and K.S.M. Sahari, Review of agriculture robotics: Practicality and feasibility, in: 2016 IEEE International Symposium on Robotics and Intelligent Sensors (IRIS), 2016, pp. 194–198. doi:10.1109/IRIS.2016.8066090.

28. Huang and Lim, A hybrid genetic algorithm for three-index assignment problem, in: The 2003 Congress on Evolutionary Computation, 2003. CEC’03, Vol. 4, IEEE, 2003, pp. 2762–2768. doi:10.1109/CEC.2003.1299438.

29. Ibri, Nourelfath and Drias, A multi-agent approach for integrated emergency vehicle dispatching and covering problem, Engineering Applications of Artificial Intelligence 25(3) (2012), 554–565. doi:10.1016/j.engappai.2011.10.003.

30. A.A. Kovacs, S.N. Parragh, K.F. Doerner and R.F. Hartl, Adaptive large neighborhood search for service technician routing and scheduling problems, Journal of Scheduling 15(5) (2012), 579–600. doi:10.1007/s10951-011-0246-9.

31. Kravtsov, Polynomial algorithms for finding the asymptotically optimum plan of the multiindex axial assignment problem, Cybernetics and Systems Analysis 41(6) (2005), 940–944. doi:10.1007/s10559-006-0031-5.

32. H.W. Kuhn, The Hungarian method for the assignment problem, Naval Research Logistics Quarterly 2(1–2) (1955), 83–97. doi:10.1002/nav.3800020109.

33. Larsen, O.B. Madsen and M.M. Solomon, Classification of dynamic vehicle routing systems, in: Dynamic Fleet Management, Springer, 2007, pp. 19–40. doi:10.1007/978-0-387-71722-7_2.

34. Larsen, O.B. Madsen and M.M. Solomon, Recent developments in dynamic vehicle routing systems, in: The Vehicle Routing Problem: Latest Advances and New Challenges, Springer, 2008, pp. 199–218. doi:10.1007/978-0-387-77778-8_9.

35. A.N. Letchford and Oukil, Exploiting sparsity in pricing routines for the capacitated arc routing problem, Computers & Operations Research 36(7) (2009), 2320–2327. doi:10.1016/j.cor.2008.09.008.

36. Li, Tharmarasa, Brown, Kirubarajan and K.R. Pattipati, A novel convex dual approach to three-dimensional assignment problem: Theoretical analysis, Computational Optimization and Applications 74(2) (2019), 481–516. doi:10.1007/s10589-019-00113-w.

37. Longo, M.P. De Aragao and Uchoa, Solving capacitated arc routing problems using a transformation to the CVRP, Computers & Operations Research 33(6) (2006), 1823–1837. doi:10.1016/j.cor.2004.11.020.

38. Lorini, J.-Y. Potvin and Zufferey, Online vehicle routing and scheduling with dynamic travel times, Computers & Operations Research 38(7) (2011), 1086–1090. doi:10.1016/j.cor.2010.10.019.

39. Lujak and Billhardt, Coordinating emergency medical assistance, in: Agreement Technologies, Springer, 2013, pp. 597–609. doi:10.1007/978-94-007-5583-3_35.

40. Lujak, Billhardt and Ossowski, Distributed coordination of emergency medical service for angioplasty patients, Annals of Mathematics and Artificial Intelligence 78(1) (2016), 73–100. doi:10.1007/s10472-016-9507-9.

41. Lujak and Giordani, On the communication range in auction-based multi-agent target assignment, in: Self-Organizing Systems, Bettstetter and Gershenson, eds, LNCS, Vol. 6557, Springer, 2011, pp. 32–43. doi:10.1007/978-3-642-19167-1_4.

42. Lujak, Giordani and Ossowski, Fair route guidance: Bridging system and user optimization, in: 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), IEEE, 2014, pp. 1415–1422. doi:10.1109/ITSC.2014.6957885.

43. Lujak, Giordani and Ossowski, Route guidance: Bridging system and user optimization in traffic assignment, Neurocomputing 151 (2015), 449–460. doi:10.1016/j.neucom.2014.08.071.

44. Lujak, Giordani and Ossowski, Value of incomplete information in mobile target allocation, in: German Conference on Multiagent System Technologies, Springer, 2011, pp. 89–100. doi:10.1007/978-3-642-24603-6_10.

45. Lujak, Omicini, Ossowski and Giordani, Decentralizing coordination in open vehicle fleets for scalable and dynamic task allocation, Complexity 2020 (2020), 1–21.

46. Luo, Nam, Kantor and Sycara, Distributed environmental modeling and adaptive sampling for multi-robot sensor coverage, in: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, International Foundation for Autonomous Agents and Multiagent Systems, 2019, pp. 1488–1496.

47. Luo, Qin and Lim, Branch-and-price-and-cut for the multiple traveling repairman problem with distance constraints, European Journal of Operational Research 234(1) (2014), 49–60. doi:10.1016/j.ejor.2013.09.014.

48. F.B. Malavazi, Guyonneau, J.-B. Fasquel, Lagrange and Mercier, Lidar-only based navigation algorithm for an autonomous agricultural robot, Computers and Electronics in Agriculture 154 (2018), 71–79. doi:10.1016/j.compag.2018.08.034.

49. S.N. Medvedev and O.A. Medvedeva, An adaptive algorithm for solving the axial three-index assignment problem, Automation and Remote Control 80(4) (2019), 718–732. doi:10.1134/S000511791904009X.

50. M.A. Mohammed, M.K. Abd Ghani, R.I. Hamed, S.A. Mostafa, M.S. Ahmad and D.A. Ibrahim, Solving vehicle routing problem by using improved genetic algorithm for optimal solution, Journal of Computational Science 21 (2017), 255–262. doi:10.1016/j.jocs.2017.04.003.

51. M.C. Mourão and L.S. Pinto, An updated annotated bibliography on arc routing problems, Networks 70(3) (2017), 144–194. doi:10.1002/net.21762.

52. Murphey, Pardalos and Pitsoulis, A parallel GRASP for the data association multidimensional assignment problem, in: Parallel Processing of Discrete Problems, Springer, 1999, pp. 159–179. doi:10.1007/978-1-4612-1492-2_7.

53. Nair and Miller-Hooks, Fleet management for vehicle sharing operations, Transportation Science 45(4) (2011), 524–540. doi:10.1287/trsc.1100.0347.

54. Nazari, Oroojlooy, Snyder and Takác, Reinforcement learning for solving the vehicle routing problem, in: Advances in Neural Information Processing Systems, 2018, pp. 9839–9849.

55. Ombuki, B.J. Ross and Hanshar, Multi-objective genetic algorithms for vehicle routing problem with time windows, Applied Intelligence 24(1) (2006), 17–30. doi:10.1007/s10489-006-6926-z.

56. D.P. Palomar and Chiang, A tutorial on decomposition methods for network utility maximization, Selected Areas in Communications, IEEE Journal on 24(8) (2006), 1439–1451. doi:10.1109/JSAC.2006.879350.

57. S.N. Parragh, K.F. Doerner and R.F. Hartl, A survey on pickup and delivery problems, Journal für Betriebswirtschaft 58(1) (2008), 21–51. doi:10.1007/s11301-008-0033-7.

58. W.-L. Pearn, Assad and B.L. Golden, Transforming arc routing into node routing problems, Computers & Operations Research 14(4) (1987), 285–288. doi:10.1016/0305-0548(87)90065-7.

59. Pearson, Frey, Maull, Parr and A. Zisman, Internet of Food Things Network Plus: Event report series, Technical Report IoFT-Event-Series:Report001, University of Lincoln, UK, 2018.

60. D.W. Pentico, Assignment problems: A golden anniversary survey, European Journal of Operational Research 176(2) (2007), 774–793. doi:10.1016/j.ejor.2005.09.014.

61. W.P. Pierskalla, Letters to the editor – the multidimensional assignment problem, Operations Research 16(2) (1968), 422–431. doi:10.1287/opre.16.2.422.

62. Pillac, Gendreau, Guéret and A.L. Medaglia, A review of dynamic vehicle routing problems, European Journal of Operational Research 225(1) (2013), 1–11. doi:10.1016/j.ejor.2012.08.015.

63. A.B. Poore, Multidimensional assignment formulation of data association problems arising from multitarget and multisensor tracking, Computational Optimization and Applications 3(1) (1994), 27–57. doi:10.1007/BF01299390.

64. H.N. Psaraftis, Dynamic vehicle routing problems, Vehicle Routing: Methods and Studies 16 (1988), 223–248.

65. H.N. Psaraftis, Wen and C.A. Kontovas, Dynamic vehicle routing problems: Three decades and counting, Networks 67(1) (2016), 3–31. doi:10.1002/net.21628.

66. A.A. Ravankar, Kobayashi and Emaru, Autonomous navigation of ground robot for vineyard monitoring, in: The Proc. of JSME Annual Conf. on Robotics and Mechatronics (Robomec) 2019, The Japan Society of Mechanical Engineers, 2019, pp. 1A1–E05.

67. Reynen, Vadrevu, Nagi and LeGrand, Large-scale multi-dimensional assignment: Problem formulations and GPU accelerated solutions, in: 2019 22th International Conference on Information Fusion (FUSION), IEEE, 2019, pp. 1–8.

68. Schneider, Balas, A.T. Özgelen, E.I. Sklar and Parsons, An empirical evaluation of auction-based task allocation in multi-robot teams, in: Proceedings of the 13th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Paris, France, 2014.

69. Schneider, Poulton, Drake, Smith, Roussos, Parsons and E.I. Sklar, The Application of Market-based Multi-Robot Task Allocation to Ambulance Dispatch, 2020, ArXiv, abs/2003.05550.

70. F.M.F. Semet, Les problèmes de gestion de flotte en temps réel, INFOR: Information Systems and Operational Research 44(4) (2006), 299–330.

71. R.R. Shamshiri, Weltzien, I.A. Hameed, I.J. Yule, T.E. Grift, S.K. Balasundram, Pitonakova, Ahmad and Chowdhary, Research and development in agricultural robotics: A perspective of digital farming, Chinese Society of Agricultural Engineering (2018).

72. M.A.L. Silva, S.R. de Souza, M.J.F. Souza and A.L.C. Bazzan, A reinforcement learning-based multi-agent framework applied for solving routing and scheduling problems, Expert Systems with Applications 131 (2019), 148–171. doi:10.1016/j.eswa.2019.04.056.

73. Simoens, Dragone and Saffiotti, The Internet of robotic things: A review of concept, added value and applications, International Journal of Advanced Robotic Systems 15(1) (2018), 1729881418759424. doi:10.1177/1729881418759424.

74. J.-P. Skeete, Level 5 autonomy: The new face of disruption in road transport, Technological Forecasting and Social Change 134 (2018), 22–34. doi:10.1016/j.techfore.2018.05.003.

75. M.L. Sørensen, Agricultural Water Management Research Trends, Nova Publishers, 2008.

76. Spencer, Meet Tom, Dick and Harry – machinery & implements, Farmer’s Weekly 58 (2019), 19015.

77. F.C. Spieksma, Multi index assignment problems: Complexity, approximation, applications, in: Nonlinear Assignment Problems, Springer, 2000, pp. 1–12.

78. Theys, Bräysy, Dullaert and Raa, Using a TSP heuristic for routing order pickers in warehouses, European Journal of Operational Research 200(3) (2010), 755–763. doi:10.1016/j.ejor.2009.01.036.

79. Toth and Vigo, The Vehicle Routing Problem, SIAM, 2002.

80. Toth and Vigo, Vehicle Routing: Problems, Methods, and Applications, SIAM, 2014.

81. Vidal, T.G. Crainic, Gendreau and Prins, Heuristics for multi-attribute vehicle routing problems: A survey and synthesis, European Journal of Operational Research 231(1) (2013), 1–21. doi:10.1016/j.ejor.2013.02.053.

82. Xiong, Ge, Grimstad and P.J. From, An autonomous strawberry-harvesting robot: Design, development, integration, and field evaluation, Journal of Field Robotics (2019).

83. Yaghoubi, N.A. Akbarzadeh, S.S. Bazargani, S.S. Bazargani, Bamizan and M.I. Asl, Autonomous robots for agricultural tasks and farm assignment and future trends in agro robots, International Journal of Mechanical and Mechatronics Engineering 13(3) (2013), 1–6.

84. X.-T. Yan, Bianco, Niu, Palazzetti, Henry, Li, Tubby, Kisdi, Irshad, Sanders et al., The agrirover: A reinvented mechatronic platform from space robotics for precision farming, in: Reinventing Mechatronics, Springer, 2020, pp. 55–73. doi:10.1007/978-3-030-29131-0_5.

85. M.M. Zavlanos, Spesivtsev and G.J. Pappas, A distributed auction algorithm for the assignment problem, in: 2008 47th IEEE Conference on Decision and Control, IEEE, 2008, pp. 1212–1217. doi:10.1109/CDC.2008.4739098.

86. Zhao, Gong, Huang and Liu, A review of key techniques of vision-based control for harvesting robot, Computers and Electronics in Agriculture 127 (2016), 311–323. doi:10.1016/j.compag.2016.06.022.

¹ Note that “raw material” is separate from the energy material (e.g. fuel) required to run the vehicle. See Section 3 for detail.

³ Light Detection and Ranging technology.