Probabilistic Inverse Modeling An Application in Hydrology Somya SharmaRahul GhoshArvind RenganathanXiang Li Snigdhansu ChatterjeeJohn NieberChristopher DuyVipin Kumar

2025-04-26 0 0 843.18KB 9 页 10玖币
侵权投诉
Probabilistic Inverse Modeling: An Application in Hydrology
Somya SharmaRahul GhoshArvind RenganathanXiang Li
Snigdhansu ChatterjeeJohn NieberChristopher Duffy+Vipin Kumar
Abstract
Rapid advancement in inverse modeling methods have
brought into light their susceptibility to imperfect data. The
astounding success of these methods has made it impera-
tive to obtain more explainable and trustworthy estimates
from these models. In hydrology, basin characteristics can
be noisy or missing, impacting streamflow prediction. For
solving inverse problems in such applications, ensuring ex-
plainability is pivotal for tackling issues relating to data
bias and large search space. We propose a probabilistic in-
verse model framework that can reconstruct robust hydrol-
ogy basin characteristics from dynamic input weather driver
and streamflow response data. We address two aspects of
building more explainable inverse models, uncertainty esti-
mation and robustness. This can help improve the trust of
water managers, handling of noisy data and reduce costs.
We propose uncertainty based learning method that offers
6% improvement in R2for streamflow prediction (forward
modeling) from inverse model inferred basin characteristic
estimates, 17% reduction in uncertainty (40% in presence of
noise) and 4% higher coverage rate for basin characteristics.
1 Introduction
Researchers in scientific communities study engineered
or natural systems and their responses to external
drivers. In hydrology, streamflow prediction is one cru-
cial research problem for understanding hydrology cy-
cles, flood mapping, water supply management, and
other operational decisions. For a given entity (river-
basin/catchment), the response (streamflow) is gov-
erned by external drivers (meteorological data) and
complex physical processes specific to each entity (basin
characteristics). Process-based models are commonly
used to study streamflow in river basins (for example,
Soil & Water Assessment Tool). However, these hy-
drological models are constrained by assumptions, con-
tain many parameters that need calibration and incur
enormous computation cost. In addition, these mod-
els are often calibrated on every specific catchment and
thus can require specific fine-tuning for each basin. As
promising alternatives, machine learning (ML) models
are increasingly being used [30] (Figure 1 shows the di-
University of Minnesota - Twin Cities. {sharm636, ghosh128,
renga016, lixx5000, chatt019, nieber, kumar001}@umn.edu, +
Pennsylvania State University {cxd11}@psu.edu
Forward Model
Figure 1: Forward model using weather drivers xt
iand
river basin characteristics zt
ito estimate streamflow yt
i
[16]
agrammatic representation of this data-driven forward
model). In our study, an entity’s response to external
drivers depends on its inherent properties (called entity
characteristics). For example, for the same amount of
precipitation (external driver), two river basins (enti-
ties) will have very different streamflow (response) val-
ues depending on their land-cover type (entity char-
acteristic) [38]. Disregarding these inherent proper-
ties of entities can lead to sub-optimal model per-
formance. Knowledge-guided self-supervised learning
(KGSSL) [16] has been proposed to extract these en-
tity characteristics using the input drivers and output-
response data.
Developing such entity-aware inverse models re-
quires addressing several challenges. Often, the mea-
sured characteristics are only surrogate variables for the
actual entity characteristics, leading to inconsistencies
and high uncertainty. Uncertainty can arise due to sev-
eral reasons, such as measurement error, missing data,
and temporal changes in characteristics. Moreover, in
real-world applications these characteristics may be es-
sential in modeling the driver-response relation. How-
ever, they may be completely unknown, not well un-
derstood, or not present in the available set of entity
characteristics. A principled method of managing this
uncertainty due to imperfect data can contribute in im-
proving trust of data-driven decision making from these
methods.
In this paper, we introduce uncertainty quantifica-
tion in learning representations of static characteristics.
Such a framework can help quantify the effect of multi-
ple sources of uncertainty that introduce bias and er-
ror in decision-making. For instance, Equifinality of
hydrological modeling (different model representation
result in same model results) is a widely known phe-
nomenon affecting the adoption of hydrology models in
Copyright ©20XX by SIAM
Unauthorized reproduction of this article is prohibited
arXiv:2210.06213v1 [cs.LG] 12 Oct 2022
Figure 2: KGSSL for representation learning. The forward model learns streamflow (y) as functional
approximation of weather drivers (x) and river basin static attributes (z). KGSSL leverages an inverse modeling
framework for learning robust static attribute estimates (ˆz). The robust estimates aid in improving prediction
performance of the forward model.
practice [20]. Uncertainty in model structure and input
data are also widespread. In real world applications,
studying these can help improve trust of water man-
agers, improve hydrological process understanding, re-
duce costs and make predictions more explainable and
robust [36]. To achieve this, we propose a probabilis-
tic inverse model for simultaneously learning represen-
tations of static characteristics and quantifying uncer-
tainty in these predictions. As a consequence, we ana-
lyze the framework’s reconstruction capabilities and its
susceptibility to adversarial perturbations - our model
is able to maintain same level of robustness as KGSSL.
We also propose an uncertainty based learning (UBL)
method to reduce epistemic uncertainty (uncertainty in
predictions due to imperfect model and imperfect data)
in our reconstructions. We show that it results in reduc-
ing the temporal artifacts in static characteristic pre-
dictions by 17% and also reduces the epistemic uncer-
tainty by 36%. We also demonstrate the improvement
in streamflow prediction (in the forward model) using
these reconstructed static characteristics (6% increase
in test R2). We provide model performance for recon-
struction and forward modeling and compare it against
the baselines, KGSSL [16] and CT-LSTM [30], state-
of-the-art for streamflow prediction. Since, we use a
probabilistic model for estimating static characteristics,
we obtain a posterior distribution instead of point es-
timates. This enables us to compute coverage rate of
how often the observed values lie within the bounds of
the inferred static characteristics’ posterior prediction
distribution. In practice, this can help water managers
and public understand if we can reliably obtain a close
enough prediction, even if we are not always accurate
- analysis that can not be done with the deterministic
inverse model. UBL offers a 4% increase in coverage
rate.
2 Related Work
Robustness: In several large-scale applications in areas
like computer vision and natural language processing,
presence of even small, imperceptible adversarial per-
turbation can exacerbate model performance [10,48,51].
While, several of the robustness studies focus on the ef-
fect of different noises [32, 57], many other studies fo-
cus on methods to mitigate the adverse effects of these
perturbations [35, 42, 50]. Through objective modifica-
tion [11,24,26] or propagation of input-output relation-
ship constraints [9,21], deep learning architectures have
been modified to improve robustness. In real-world data
sets, natural variations (like blurring) have been stud-
ied [41, 47]. Issues like adversarial transferability [22]
and data brittleness issues (robustness issues due to
overfitting [40]) bring to light the limitations of mod-
ern machine learning methods. More recent studies also
look at Bayesian deep learning models for their robust-
ness properties [4, 5, 43].
Inverse Problems: In physical sciences [7,27,39,55],
several recent advances have focused on solving inverse
problems. Unlike standard inversion methods in math-
ematics, that rely on non-linear optimization for cal-
culating inverse of a forward model, recent machine
learning methods allow us to learn the inverse mapping
from datasets. This makes it imperative to mitigate
any representation error and data biases before solving
the inverse problem [2]. Further, within these vast ar-
ray of methodologies, selection of the right method is
crucial - since, searching for an inverse mapping may
be difficult due to the large search space. Bayesian op-
timization for searching and iterative gradient descent
Copyright ©20XX by SIAM
Unauthorized reproduction of this article is prohibited
摘要:

ProbabilisticInverseModeling:AnApplicationinHydrologySomyaSharma*RahulGhoshArvindRenganathanXiangLiSnigdhansuChatterjeeJohnNieberChristopherDu y+VipinKumarAbstractRapidadvancementininversemodelingmethodshavebroughtintolighttheirsusceptibilitytoimperfectdata.Theastoundingsuccessofthesemethodsha...

展开>> 收起<<
Probabilistic Inverse Modeling An Application in Hydrology Somya SharmaRahul GhoshArvind RenganathanXiang Li Snigdhansu ChatterjeeJohn NieberChristopher DuyVipin Kumar.pdf

共9页,预览2页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:9 页 大小:843.18KB 格式:PDF 时间:2025-04-26

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 9
客服
关注