Abstract
Significant population growth and ongoing socioeconomic development have increased reliance on irrigated agriculture and agricultural intensification. However, accurately predicting crop water demand is problematic since it is affected by several factors such as weather, soil, and water properties. Many studies have shown that a hybrid irrigation system based on two irrigation strategies (i.e., evapotranspiration and soil-based irrigation) can provide a credible and reliable irrigation system. The latter can also alert farmers and other experts to phenomena such as noise, erroneous sensor signals, numerous correlated input and target variables, and incomplete or missing data, especially when the two irrigation strategies produce inconsistent results. Hence, we propose Multi-Target soil moisture and evapotranspiration prediction (MTR-SMET) for estimating soil moisture and evapotranspiration. These predictions are then used to compute water needs based on Food and Agriculture Organization (FAO) and soil-based methods. Besides, we propose an explainable MTR-SMET (xMTR-SMET) that explains the ML-based irrigation to the farmers/users using several explainable AI to provide simple visual explanations for the given predictions. It is the first attempt that explains and offers meaningful insights into the output of a machine learning-based irrigation approach. The conducted experiments showed that the proposed MTR-SMET model achieves low error rates (i.e., MSE = 0.00015, RMSE = 0.0039, MAE = 0.002) and high
Introduction
In recent decades, significant population expansion, combined with ongoing socio-economic development, has increased reliance on irrigated agriculture and agricultural intensification to meet rising food demand [17]. As a result, a substantial strain on water supplies has emerged in recent years, particularly in water-stressed arid regions [42]. Climate change increases this strain and poses future water governance issues due to rising air temperatures and altering drought and rainfall patterns. According to FAO [15], owing to projected climate change, approximately 51% of people in arid countries would live in highly water-stressed areas by 2050. Irrigated agriculture and agricultural intensification can impact the quantity of water resources [17]. Several studies [2,6,29,35] have revealed that inefficient irrigation water usage, improper soil management, and unsuitable crop type choices are all causing a continuous decline in the quantity of water resources. However, forecasting water demand precisely is a highly complex task since it depends on many factors such as weather,soil, and water properties [17].
Nowadays, the latest sensors and weather stations allow the collection of a large amount of historical data that was unavailable in the past. According to [16], big data and artificial intelligence are the best tools for handling this amount of data, thanks to their capability to extract useful information and give additional value and usefulness to systems installed in the field. For this reason, many machine learning and IoT-based studies are proposed in the field of smart irrigation. According to [17], the existing irrigation methods are categorized into four types: evapotranspiration and water balance (ET-WB), soil moisture measurement, plant growth parameters, and process-based agricultural models. To give adequate irrigation, all of these categories need to be properly operated. An effective irrigation method should strike a balance between practicability, accuracy, and universalizability.
The first category is based on the evapotranspiration to forecast water demand. In fact, the two phenomena of evaporation and transpiration are combined under the umbrella term “evapotranspiration” (ET). The phenomenon of water transitioning from a liquid state to a vapor state is known as evaporation. Some examples are the surface of the soil, a pond or lake, or the surface of water. Whereas, the transpiration is the process of water being evaporated by trees through their stomata. The amount of water a tree needs to reach maturity is called the tree’s water requirement. For a tree to create one kilogram of dry matter, 400–1000 kg of water are needed [10]. The ET-WB based methods are often used in the field of irrigation, thanks to the availability of weather data since soil data is not necessary for this category. Generally, this type of method follows the guidelines provided by FAO-56 [4]. However, this method is not very accurate when used in site-specific circumstances, particularly when the error of daily soil moisture prediction accumulates [40]. As for soil moisture-based methods, they require sensors that measure the soil water status, which is a paramount task in modulating the water requirements of the crops. The main variables that should be captured in real-time are soil water status and elevation. Also, monitoring other variables, such as hydrodynamic soil factors or water drainage, might increase the chances of the irrigation predicted by the models being properly used by the plants [23]. The main challenge with the soil moisture-based methods is that their accuracy depends on the values returned by the sensors. If the sensors fail to transmit correct signals – due to sensing malfunctions or field noises like animals or rainfall – the scheduling may fail. Hence, one of the major problems of smart irrigation and wireless sensor networks is the sudden failure of physical nodes. Sometimes it is impossible to determine whether the physical nodes are transmitting accurate sensor values or if they are degrading with time. Physical nodes that are not functioning correctly can only be found manually, which is a time-consuming task. Any real-time system is severely impacted by the faulty sensor node [34].
Combining climate and soil variables has the potential to properly manage irrigation in a more efficient way than other traditional approaches. The study in [41] demonstrated that precision irrigation should consider both soil moisture and evapotranspiration since soil moisture is connected to atmospheric moisture through direct evaporation and vegetation transpiration. They also illustrated that soil moisture is affected by evapotranspiration and thus it always moves upward in the surface soil. When the evapotranspiration is large, the amount of water transport increases, and it suggests that the use of soil moisture fluctuation signals can be used to reflect evapotranspiration. Hence, these two parameters should be jointly and correlatively predicted to optimally schedule the daily water needs.
A hybrid irrigation system that relies on two irrigation strategies: ET-based irrigation and soil-based irrigation, could provide a credible and reliable irrigation system and also alert farmers and other experts from being subject to phenomena like noise, erroneous sensor signals, numerous correlated input and target variables, and incomplete or missing data [34] especially when the two irrigation strategies give inconsistent outputs.
According to this, we propose a Multi-Target Regression (MTR) model, a well-established technique in the fields of applied statistics and machine learning [8], to estimate both ET and soil moisture, denoted as VWC (Volumetric Water Content). To the best of our knowledge, this is the first attempt in the field of smart irrigation that uses MTR to estimate both parameters. Indeed, our objective in this context is to analyse and compare multi-target regression models as a new approach to estimating ET and VWC, and to select subsequently the best regression that will be used to calculate the quantity of water needed for each using the two irrigation strategies.
However, machine learning-based irrigation is always considered “black-box” and uninterpretable by humans. Indeed, the farmers/users could not understand the irrigation decision because of the inconsistency between the two irrigation strategies. In this regard, one of the most critical requirements in the majority of fields for understanding the motivations behind any particular choice is explainable AI [13]. It eliminates the outdated notion of machine learning as a “black box” and helps users comprehend the underlying causes of the discovered solution [33]. In the field of smart agriculture, a recent study [36] highlighted the necessity of using explainable artificial intelligence to monitor crop growth and disease control, policy standardization, and other technological and deployment advancements. To sum up, the main contributions of this study are:
A Multi-Target soil moisture and evapotranspiration prediction (MTR-SMET) for estimating both the soil moisture and evapotranspiration. MTR-SMET helps detecting inconsistent outputs due to erroneous sensor signals. The use of two well known irrigation methods in the agriculture field to calculate water needs based on the MTR-SMET to provide an accurate and robust daily amount of water for irrigation. An explainable MTR-SMET (xMTR-SMET) that explains the ML-based irrigation to the farmers/users using several explainable AI to provide simple visual explanations for the given predictions. It should be noted that this is the first attempt that explains and gives meaningful insights into the output of a machine learning-based irrigation approach.
Related work
We present in this section some relevant works that discuss machine learning-based approaches in the field of irrigation management. In particular, we present both evapotranspiration based irrigation scheduling and soil moisture prediction approaches.
Evapotranspiration based irrigation scheduling
The evapotranspiration referred as
In 2011, Jahanbani and El-Shafie [21] employed an artificial neural network (ANN) for predicting monthly
To predict
Han et al. [19] evaluated the capability of coupling a Bat algorithm with the XGBoost method (i.e., the BAXGB model) for estimating monthly
Aghelpour and Norooz-Valashedi [3] evaluated the performances of stochastic models, including autoregressive moving average (ARMA), and autoregressive integrated moving average (ARIMA) and machine learning models, including least square support vector machine (LSSVM), adaptive neuro-fuzzy inference system (ANFIS), and generalized regression neural network (GRNN), in predicting
Soil moisture prediction
The advantage of soil moisture-based methods is the ease of practice and automation with some commercially available systems. Major drawbacks of the soil moisture sensor–based scheduling method are the spatial soil moisture heterogeneity, errors in sensor installation, the difficulty in the representation of the entire root zone, the need for sensor calibration, and the inaccuracy of measurements when gravel exists [34]. However, ET-based irrigation methods strongly rely on the estimation of local climatic data. ET-based irrigation methods also depend on local-specific crop coefficients and may be inaccurate. Cumulative errors may occur with calculated ET-based scheduling approaches, so field-based measurements are generally needed to correct or reset ET-based irrigation.
Comparisons of related literature. ea: actual vapor pressure, sh: sunshine hours, T: air temperature, Tmean: mean air temperature, Tmax: maximum air temperature, Tmin: minimum air temperature, U: wind speed, Umean: mean daily wind speed, RH: relative humidity, Rs: solar radiation, RHmin: minimum relative humidity, RHmax: maximum relative humidity, RHmean: mean relative humidity, SD: sunshine duration, DPT: dew point temperature, RD: Root Depth (cm), R: rainfall, ST: soil temperature, VWC: soil moisture,
: evapotranspiration
Comparisons of related literature. ea: actual vapor pressure, sh: sunshine hours, T: air temperature, Tmean: mean air temperature, Tmax: maximum air temperature, Tmin: minimum air temperature, U: wind speed, Umean: mean daily wind speed, RH: relative humidity, Rs: solar radiation, RHmin: minimum relative humidity, RHmax: maximum relative humidity, RHmean: mean relative humidity, SD: sunshine duration, DPT: dew point temperature, RD: Root Depth (cm), R: rainfall, ST: soil temperature, VWC: soil moisture,
Gu et al. [18] proposed a method for irrigation scheduling based on neural network (NN) model. As shown in Table 1, the NN model was trained using weather data, root depth and soil moisture of seven soil layers. The latter was predicted using the Root Zone Water Quality Model (RZWQM2) which is a comprehensive simulation model designed to predict the hydrologic response, including potential for ground water contamination, of alternative crop-management systems [34]. The output of the NN model was the soil moisture of the root zone and the seven layers. Thereafter, the predicted root zone soil moisture was used in the irrigation scheduling method to compute the irrigation quantity based on the field capacity.
Chatterjee et al. [11] proposed a hybrid modified flower pollination algorithm supported ANN (NN-MFPA) model to predict the soil moisture quantity. The MFPA is used to find the optimal weight vector for the ANN by using soil temperature, air temperature, and relative humidity as input features recorded hourly. The NN-MFPA model was compared with PSO (Particle Swarm Optimization) supported ANN, Cuckoo Search (CS) supported ANN, and multilayer perceptron feed-forward network (MLP-FFN).
The authors in [39] developed a guideline for soil moisture prediction using machine learning. In order to accomplish this, they tested several algorithms: LightGBM, linear regression, decision tree, random forest, multilayer perceptron, LSTM, and StemGNN. As input, they considered air temperature, solar radiation, relative humidity, wind speed, precipitation and soil moisture. As a goal, they predicted the lowest and highest soil moisture levels. The different ML algorithms are trained and tested in a real case analysis comprehending eight crop types in twelve fields from four farms distributed over diverse climatic scenarios in Brazil. The findings revealed that soil moisture prediction reaches its maximum performance when using only past soil moisture, a context-aware index, and the precipitation. Moreover, experiments find out that LightGBM outperforms other ML algorithms.
In order to predict evapotranspiration or soil moisture of the root zone for irrigation management, the studies discussed above proposed ML-based models, particularly regression methods, in the field of irrigation management. These models can run complex mathematical algorithms in real-time at low cost [17], to predict evapotranspiration or soil moisture of the root zone for the purpose of irrigation management. Each study considered different features related to weather data, soil data, and crop. They used different supervised algorithms to predict either ET or VWC in different locations. Although the relevance of these studies in the field of smart irrigation, the main limitations and the ignored challenges are presented as follow:
The majority of studies are limited to predicting the parameters (ET or VWC) that interfere in the irrigation process without completing this later by proposing decision support systems for the purpose of real-time operation based on the predicted parameters. Only the work of Gu et al. [18] that proposed NN-based irrigation scheduling method based on soil moisture parameters. In this regard, we offer a comprehensive system that begins with the prediction of the required parameters, the quantity of water required, and ends with the explanation of the irrigation decision. The second limitation of the existing studies is that the performance is based on the accuracy value returned by the sensors. Our proposed approach is based on two irrigation strategies by predicting both ET and VWC and using two irrigation methods. This aims to provide an optimal water requirement quantity for the crop and also to verify the proper functioning of the sensors to alert farmers/users if an inconsistency is detected. Another challenge neglected in the existing studies is the complex decisions by ML-based systems since they are considered as black-box predictions that cannot readily be explained to farmers. This could decrease the transparency and reliability of the irrigation system [36]. As a result, while we recognize the need of involving farmers and other users in the irrigation management process, we provide an explainable AI to supplement the ET and VWC predictions with information on the irrigation decision explanation.
Material and dataset description
Multi-target regression models & time series analysis
Multi-Target Regression (MTR), also known as multivariate or multi-output regression, is related to the problems with multiple continuous outputs [8] by considering not only the relationships between features and targets but also the relationships between target variables [8].
The most common categorization of MTR methods is divided into Problem Transformation (PT), Algorithm Adaptation (AA) and Ensemble methods. PT methods transform the Multi-Target Regression problem into a series of more specific single-target problems for each target. Among PT methods, Binary Relevance (BR) is the most frequently used method [32], in which the target variables are estimated independently, and any potential inter-correlation between them is ignored. In contrast, the AA methods are based on single regression models to handle multi-target problems directly by exploiting the potential correlation between the multiple targets. To do so, existing single-target algorithms are adjusted to support the multi-target problems. The most popular models on which MTR are built include Predictive Clustering Trees (PCTs) [7], ADABOOST.MR and BP-MLL [26].
The ensemble methods for multi-target learning are developed on top of the common PT or AA methods. For instance, the regressor chain (RC) method is an extension of BR that aims to consider interdependencies among targets. RC was proposed by Spyromitros-Xioufis et al. [37] based on the idea of chaining single-target models. In this method, the single-target regressors are linked along a chain (given by a random permutation of the set of target variables). The feature space of each link in the chain is augmented with the values of previous targets in the chain. The main concern with the RC method is that it is sensitive to the selected chain order [8]. As a solution, Spyromitros-xioufis et al. [37] proposed the Ensemble of Regressor Chains (ERC) method that combines several chains. This method has had great success in the MTR field and is widely used in many domains [8]. The ERC algorithm uses L Regressor Chains
Moreover, the present study supports also time series analysis. A time series is a sequence of historical measurements of an observable variable at equal time intervals. Time series analysis is a cutting-edge scientific issue since it is able to quantitatively describe the relationship between a sequence of observations. Time series analysis has emerged as a paramount technique in the irrigation field for presenting sequential irrigation due to the recent tremendous increase in interest in the scientific understanding of climate change. According to this, the present study aims to combine the different regression algorithms with a sliding window time series analysis technique [20].
Explainable artificial intelligence tools
Explainable Artificial Intelligence (XAI) is a developing topic of study thanks to the growing use of artificial intelligence and machine learning and the current reliance on these technologies. XAI refers to systems that try to explain how a black-box AI model produces its outcomes. XAI methods can be divided into two broad categories: model-based and post-hoc methods [13]. Model-based methods reflect how the ML model produces its outputs. This type of XAI refers to applying interpretation methods during model training, which may impact their overall performance. The main challenge for these methods is developing a trade-off between model accuracy and explainability [27]. As for the post-hoc XAI methods, they approximate the behavior of a black-box by extracting relationships between feature values and the predictions [27]. In particular, they use information from an ML model to identify the features most responsible for an outcome for a given input. The post-hoc explanatory refers to applying interpretation methods after the model training. Post-hoc XAI methods can be further divided into global and local methods. Global explanations try to figure out how a black-box model works as a whole, while local explanations try to find correlations between the values of a record’s features and its results. Moreover, the output of post-hoc XAI methods varies among text, visual explanations, and feature relevance/feature importance. The current work focuses on post-hoc explanation, specifically visual explanations, by relying on two well-known tools used in the literature [13]: LIME [33] and SHAP [25].
Local Interpretable Model-agnostic Explanations (LIME)
Local Interpretable Model-agnostic Explanations are a tool for comprehending and interpreting the underlying machine learning model while remaining model-neutral. LIME approximates the machine learning model with an understandable model [33]. It is carried out locally since complex machine learning models may be simpler to comprehend and approximate broadly. Users will be able to understand and analyze the model thanks to the explanations offered by LIME.
Shapley additive explanations (SHAP)
The SHAP is a game theory-based XAI method that aims to compute Shapely values to explain the contribution of each feature to the prediction [25]. SHAP is based on coalitional game theory to identify how well each group (or coalition) of agents can do for itself. To compute the shapely values, SHAP makes only some feature values present, and some are not. The main purpose is to identify the contribution of each feature to the prediction.
Lundberg and Lee [25] developed a Python package that can calculate the SHAP for various technologies, including LightGBM, XGBoost, GBoost, CatBoost Scikit-learn, and tree models. This implementation is considered by enormous researchers that rely on SHAP to interpret different AI models [13].
The main difference between SHAP and LIME is that LIME uses the distance between vectors as the influence, whereas SHAP uses the Shapley value as the influence.
Moreover, Lundberg et al. [24] proposed an extension of SHAP, namely the SHAP tree explainer. The latter suggests that the exact evaluation of SHAP values can be done in polynomial time exclusively for tree-based models (including RF but not models like SVM and NNs) by exploiting the information stored in the tree structure. Note that the SHAP tree explainer does not assume feature independence since feature interactions are already captured in the underlying trees.
Data from COSMOS-UK
The COSMOS-UK project consists of a soil moisture monitoring station that collects long-term soil moisture measurements at about 50 sites [12]. Figure 1 plots the site locations. An innovative cosmic ray neutron sensor (CRNS) instrument is used to generate soil moisture observations for each site over an area of up to 120 000

Map of COSMOS-UK site locations [12].
List of variables considered in the study
The proposed approach encompasses four modules that are illustrated in Fig. 2 and explained in the following sub sections.

The proposed approach.
Data pre-processing aims to convert raw data into a meaningful and clear format for machine learning. Data pre-processing is a paramount task to make accurate predictions when raw data is collected from particular sources. Because the raw dataset is missing some values that could significantly affect how well machine learning models work, we first interpolated those values. Interpolation consists of calculating the missing values for an observation using its preceding values. This interpolation technique’s sequential character corresponds to the temporal nature of time-series data. Furthermore, to compute daily averages of the feature values, this module also resamples the dataset. Moreover, we also applied normalization, which scales all data in the range of [0,1]. Secondly, the sliding window technique is applied to support time series data. According to Hota’s definition [20], sliding window is a short-term estimate of the time series data’s actual value. The window and segment sizes go bigger until we get the closest value with the least amount of error. In this study, sliding windows contain overlapping data. For instance, considering an interval size equal to 2, as depicted in the example of Fig. 3, to predict the evapotranspiration of the day

Example of sliding window of ET prediction with window size equal to 2.
After sliding window pre-processing, the dataset can train the MTR-SMET model. The soil and climatic variables and the n previous values of ET and VWC are considered input variables (n is the window size). Notably, the multi-target model takes the form
Random forest of predictive clustering trees (RF-PCT) [22] is an ensemble method for multi-target learning that uses PCTs trees [22] as base regressors. To ensure the diversity among the base regressors, the bagging is used beside the feature set changing during learning. Notably, the most important feature is chosen from a random subset of the input features at each node in the decision trees. The best subset of features is selected among the overall features x by the Breiman function [9]
MTR-SMET irrigation module
The aim of this module is to determine daily irrigation quantity based on the evapotranspiration and soil moisture parameters. This study seeks to determine the daily amount of water, d, required for irrigation.
For this purpose, we use two irrigation methods: Evapotranspiration based method and soil moisture based approach. Below are brief description of the these two approaches.
Evapotranspiration based irrigation
According to [10], the water needs, and the actual evapotranspiration (
Soil moisture based irrigation
The aim of soil moistre-based irrigation is to obtain the irrigation quantity for the day d by considering the soil moisture. For doing so, we use in the first stage, the trained multi-target model to predict the root zone soil moisture
xMTR-SMET based explanation module
To assist farmer for making a precise decision, the two water quantities of the day d,
As for the global explanation explains how the model makes decisions based on the input features and their impact to the target. Thus, global explanations aims to help the users to understand the distribution of the target results in relation to the set of features. In our study, the global explanation aims to plot the impact of meteorological data such as Wind speed and temperature as well the impact of previous
Experiments
Three series of experiments are carried out in the subsections that follow. The first evaluates the performance of the MTR-SMET, while the second assesses, based on experts, the MTR-SMET-based irrigation scheduling. The third experiment presents various global and local explanations based on the xMTR-SMET model.
MTR-SMET assessment
To better illustrate the performance of the MTR-SMET model, we compare the RF-PCT ensemble MTR method with the RC ensemble MTR method and PCTs MTR method. Moreover, we compare the MTR methods with single target methods (RF and XGB) that separately predict VWC and ET by building a model for each target in terms of performance.
The dataset was divided into training and test data to train and test the different models. Because we are dealing with time-series data, the split was performed sequentially to preserve the data’s temporal dynamics. Consequently, we considered Balruderry data (from the COSMOS dataset) ranging from 2014 to 2017 as a training set and data from 2018 and 2019 as a test set. To find the best parameters in the training phase, the performance of all regression methods in predicting
To evaluate the different prediction models, we rely on performance metrics. The model performance evaluation is established by plotting observed versus predicted values [30]. The metrics used for this purpose are MSE, RMSE, MAE, and
The MSE average values with the standard deviation of the time window MTR prediction models
The MSE average values with the standard deviation of the time window MTR prediction models
The RMSE average values with the standard deviation of the time window MTR prediction models
The MAE average values with the standard deviation of the time window MTR prediction models
The
Prediction performance of the MTR models on the test dataset
Moreover, we assess the ability of the MTR-SMET model, notably the RF-PCT model, to generate VWC and ET predictions for unseen observations. In particular, the MTR-SMET model and the other MTR models, trained with data ranging from 2014 to 2017, were also applied in predicting VWC and ET for samples from 2018 and 2019. Table 7 plots the prediction performance of the MTR models when tested on the test dataset. Table 7 shows that RF-PCT model generates accurate predictions for the test data. This highlights the robustness of the RF-PCT model for generating predictions. Furthermore, we observe that all MTR models achieve acceptable prediction accuracy for test data compared to train data. Hence, this implies the considerable impact of sliding window, that is applied for the test data, on the prediction performance.
The objective of this series of experiments is to analyze the results of the irrigation module. For this purpose, we rely on an agronomist from the Precimed project [31], who is regarded as an expert. This agronomist assigned an irrigation quantity (

Box plot of the difference between the three irrigation types (
Comparison between the irrigation methods in terms of water need quantity (mm) across the growing season (days)
On the other hand, to examine the capability of the MTR-SMET-based irrigation method in detecting sensor failure, we disrupted the observation of the day 2019-07-03 by changing the climatic data values. The results of this disruption are highlighted (red color) in Table 8 where the

SHAP values for the ET prediction.
Global explanation
This series of experiments aims to illustrate how the explanation module gives a global view of how the features affect the estimation of the two targets. In this regard, we used the SHAP summary plot (Figs 5 and 6) to identify the magnitude and direction of each attribute’s impact on a global scale. As presented in Figs 5 and 6, the feature names are shown on the y-axis in decreasing order of significance, and the SHAP values for each input predictor are listed on the x-axis. Higher values of the feature are represented by red points, while blue points represent lower ones.
Figure 5 plots the impacts of the distribution of input features on the estimation of the ET value based on the MTR-SMET model. As for Table 9, it presents SHAP Feature Importance (SFI) generated from the absolute average of Shapley values per feature across the data for the ET prediction. The results indicate that increased feature solar radiation (Rs) leads to higher SHAP values for the ET estimation (SFI = 0.584). Indeed, solar radiation has the most significant influence on ET prediction. Figure 5 and Table 9 also show the importance of other inputs, such as relative humidity (SFI = 0.212), and soil temperature (SFI = 0.074). Moreover, the previous ET, ET-1 and ET-2 have an important impact on the daily evapotranspiration prediction with SFI equal to 0.102 and 0.081, respectively. However, the last moisture soil predictions VWC-1, VWC-2, and VWC-3 have less influence on the ET prediction.

SHAP values for the VWC prediction.
SHAP feature importance
Let us now concentrate on the explanation of the global VWC prediction. Figure 5 and Table 9 show the impacts of the distribution of input features on the estimation of the VWC value. Unlike the ET prediction, the VWC prediction is influenced by the previous evapotranspiration. In particular, ET-1 greatly impacts the VWC prediction (SFI = 0.318). The most significant features that impact the VWC prediction are VWC-1 (SFI = 5.311) and VWC-3 (SFI = 0.654).

xMTR-SMET based application interface: input.
This experiment aims to demonstrate the capability of xMTR-SMET to explain a given prediction with a local explanation of either ET or VWC. For this purpose, a realistic scenario is presented to show how xMTR-SMET could support the farmer in deciding on crop irrigation.
To get the irrigation values (
On the other hand, the VWC-based irrigation is estimated to be 2.239 mm based on the predicted VWC value, which is equal to 28.42. As presented in the interface, this predicted value is explained by the negative impact of VWC-1 since its value is less than or equal to 28.66

xMTR-SMET-based application interface: output.
This study investigated the viability of a multi-target regression model for estimating evapotranspiration (ET) and soil moisture (VWC). Therefore, these two characteristics must be jointly and correlatively forecasted in order to optimally arrange daily water requirements. Several studies like [11,21,28] are relevant to the field of smart irrigation but suffers from three main limitations: (1) They can only predict the parameters (ET or VWC) that interfere with the irrigation process. (2) Their performance is based on how accurate the sensors are. (3) The complex decisions made by ML-based systems are hard to explain to farmers because they are considered “black box” predictions. This could make the irrigation system less clear and less reliable. To overcome these limitations, we propose Multi-Target soil moisture and evapotranspiration prediction (MTR-SMET) for estimating soil moisture and evapotranspiration. This is intended to offer an appropriate water demand quantity for the crop, as well as test the sensors’ functionality to warn farmers/users if an inconsistency is observed. In this context, our goal is to analyze and contrast multi-target regression models as a novel method of estimating ET and, and then choose the best regression to be used to determine the amount of water required for each using the two irrigation strategies of soil-based irrigation and ET-based irrigation. However, machine learning-based irrigation is always regarded as a “black box” that people cannot understand. Because of the inconsistency between the two irrigation strategies, farmers/users were unable to grasp the irrigation decision. Explainable AI is one of the most important requirements for understanding the rationale behind any particular choice in this regard. It dispels the view of machine learning as a “black box” and assists users in understanding the root reasons of the identified solution. We present an explainable MTR-SMET (xMTR-SMET) that explains ML-based irrigation to farmers/users utilizing several explainable AI to provide simple visual explanations for the provided predictions. This is the first attempt to explain and provide relevant insights into the result of a machine learning-based irrigation technique. The conducted experiments showed that the proposed MTR-SMET model achieves low error rates (i.e., MSE = 0.00015, RMSE = 0.0039, MAE = 0.002) and high
Footnotes
Acknowledgements
The authors gratefully acknowledge the General Secretariat for Research and Technology of the Ministry of Development and Investments of Tunisia under the PRIMA Programme. PRIMA is an Art. 185 initiative supported and co-funded under Horizon 2020, the European Union’s Programme for Research and Innovation. (project application number: 155331/I4/19.09.18).
Conflict of interest
None to report.
