surrogate variables have been explored and proposed in the literature, with rainfall and river
discharge (hereafter flow) or water level commonly found as useful covariates (e.g. Tsai &
Yen (2017), Leigh et al. (2019a)). Heavy rainfall affects turbidity via erosion and subsequent
runoff, thereby increasing sediment loads in streamflow, which itself increases together with
water level and acts to re-suspend sediments. Hence, sudden increase in turbidity often follows
periods of high rainfall and sudden rise in water level (Leigh et al. 2019a). However, turbidity
may also increase when flow and water level decline leading to an increased concentration of
particles (Iglesias et al. 2014), although this phenomenon typically occurs more slowly than
the increase in turbidity associated with sudden, fresh inputs of water.
Seasonal patterns in temperature may also be reflective of seasonal patterns in rainfall,
and thus flow and water level, such that interannual variation in air and water temperature
may help to explain variation in turbidity. Iglesias et al. (2014), for example, found that
water temperature was the most important variable for estimating river turbidity in northern
Spain. Water temperature may also prove to be an informative covariate of turbidity given that
suspended particles absorb more heat than water molecules when exposed to solar radiation
(Paaijmans & Takken 2008). As such we may expect that temperature correlates somewhat
with turbidity, particularly during the day. This further suggests that sensors capable of
recording underwater illuminance (the amount of light shining onto a surface, measured
in lux) may help identify highly turbid waters given that poor water clarity will decrease
light penetration. However, underwater illuminance will also be affected by the amount of
solar irradiation and incident-light available, as dependent on, for example, time of day and
year, cloud cover and shading from vegetation or other structures above the stream. As
such, the relationship between light-associated variables (e.g. illuminance, irradiance, solar
exposure) and turbidity, despite its potential utility, may not be as straight forward as that
between rainfall, water level and turbidity. To our knowledge, the predictive ability of such
light-associated variables as potential covariates of turbidity is yet to be explored beyond
their potential use in determining light-attenuation (Droujko & Molnar 2022), but is worth
investigating given, for example, that low-cost 𝑖𝑛 −𝑠𝑖𝑡𝑢 sensors that can measure both water
temperature and illuminance are currently available (e.g. HOBO MX2202 data loggers;
https://www.onsetcomp.com/).
Our first objective in this study was to assess the potential of variables that can be mea-
sured using low-cost 𝑖𝑛-𝑠𝑖𝑡𝑢 sensors or via readily available open-access databases (including
rainfall, water level, temperature, and illuminance or radiant exposure, as available), either
alone or in combination, to act as surrogates for the prediction of turbidity in rivers. We
aim to achieve this by developing and comparing the ability of different models (dynamic
regression, long short-term memory, and generalised additive models) to forecast the one-step
ahead turbidity of Merri Creek, southeast Australia, using the aforementioned variables as
covariates. Our second objective was to develop and implement the novel use of a ensemble
machine learning method (meta-model) to select the most accurate model for turbidity predic-
tion at any one time-step. The establishment of an accurate model has the benefit of providing
water management and monitoring agencies a cost-effective tool for accurately predicting
turbidity using surrogates. The use of surrogates may also serve to detect technical-anomalies
within sensor-produced data via validation with model-produced data, enabling agencies to
3