There exist numerous decision support tools of widely ranging complexity for crop management, introduced to
farmers with varying degrees of success (Gautron et al.,2022).
Machine learning (ML) and more generally artificial intelligence (AI) can help address sequential decision
making under uncertainty. In particular, the bandit algorithm paradigm (Lattimore and Szepesv´ari,2020)
considers a decision-maker, called agent, who repeatedly faces a choice between contending actions, and has to
iteratively improve its decisions with trials. The canonical bandit problem originates from clinical trials with
sequential drug allocation (Thompson,1933). At each time step, the agent chooses one action (i.e., one drug
for a patient) amongst a set of possible actions. Each action provides a reward (i.e.; tumor cell reduction after
taking the drug), drawn from a corresponding unknown reward distribution (i.e., the distribution of tumor cell
reduction for the drug). The optimal action has the reward distribution with the highest mean reward (i.e.,
the highest mean tumor cell reduction). The objective of the agent is to sequentially choose actions such that
the expected sum of rewards is maximized. Maximizing the total expected rewards is equivalent to minimizing
the regret, which is a measure of the total losses that occur with sub-optimal actions (Robbins,1952).
Iteratively, the agent refines his next decision based on all previous results. To know how a given action
performs, a sufficient number of (possibly poor) rewards is required: this is the exploration phase. To
maximize the expected sum of rewards, the previous actions that provided good results so far must be selected
more frequently; this is the exploitation phase. Bandit algorithms aim at finding the right balance between
exploration and exploitation. This exploration-exploitation dilemma is a reality for farmers when implementing
crop management. Farmers typically want to minimize overall crop yield losses and typically explore the
performance of promising new crop management practices on small test plots (Cerf and Meynard,2006;Evans
et al.,2017). They avoid potentially large crop yield losses from new management by managing a gradual
transition between the current management and the promising new one(s), based on the results they obtain on
the small test plots.
The objective of this paper is to develop a novel strategy to identify best crop management. We set as
baseline an “intuitive strategy” which consists in identifying the best crop management through multi-year
field trials in which a set of crop management practices is tested in an equiproportional way. We compare this
“intuitive strategy” to a novel crop management identification strategy, based on a bandit algorithm. This
novel identification strategy aims to minimize farmers’ yield losses occurring during the identification process,
compared to the intuitive strategy. Thus, we test the hypothesis that bandit algorithm can help farmers to
better identify the best crop management for their context, while further minimizing crop yield losses related
to sub-optimal choices in new crop management.
Our case study considers the rainfed maize production in southern Mali, and we compare the performance
of both crop management identification strategies based on maize growth simulations using a calibrated crop
model in order to mimic real-world performance of crop management. The novel identification strategy does,
however, not depend on model simulations, and ultimately aims at being applied in real field conditions. As
for crop management, we focus on nitrogen fertilization. Tailoring nitrogen fertilizer recommendations to
farmers’ contexts is known to be challenging. Indigenous soil nitrogen supply, depending to a large extent
on past-season events, is not accurately known to farmers, whilst in-season nitrogen mineralization depends
largely on weather events(Morris et al.,2018), themselves uncertain. Crop nitrogen requirements, such as with
maize, are related to specific crop growth stages (Hanway,1963) and excessive mineral nitrogen supply can
induce nitrate leaching, especially in wet conditions (Meisinger and Delgado,2002). Therefore, there are a
priori no upfront optimal nitrogen fertilizer practices.
2 Methods
2.1 Virtual crop management identification problem
In our virtual crop management identification problem, a population or ensemble of farmers joined a
participatory experiment to identify the best nitrogen fertilizer practices for maize production in their region,
Koutiala in southern Mali. A total population of 500 farmers was considered. The distribution of soil types of
the fields associated with the group of farmers was representative of the region (Table 1). A total population of
500 farmers was considered. Each farmer belonged to a cohort that corresponded to an ensemble of farmers
growing maize on the same soil type. For each cohort, we wanted to identify the best nitrogen fertilizer practice
from a set of recommended practices (see Table 3and Section 2.1.1 for the performance metrics we considered).
2