A Contextual Bandit Approach for Value-oriented
Prediction Interval Forecasting
Yufan Zhang, Honglin Wen, Member, IEEE, and Qiuwei Wu, Senior Member, IEEE
Abstract—Prediction interval (PI) is an effective tool to quan-
tify uncertainty and usually serves as an input to downstream
robust optimization. Traditional approaches focus on improving
the quality of PI in the view of statistical scores and assume the
improvement in quality will lead to a higher value in the power
systems operation. However, such an assumption cannot always
hold in practice. In this paper, we propose a value-oriented PI
forecasting approach, which aims at reducing operational costs
in downstream operations. For that, it is required to issue PIs
with the guidance of operational costs in robust optimization,
which is addressed within the contextual bandit framework here.
Concretely, the agent is used to select the optimal quantile
proportion, while the environment reveals the costs in operations
as rewards to the agent. As such, the agent can learn the policy
of quantile proportion selection for minimizing the operational
cost. The numerical study regarding a two-timescale operation
of a virtual power plant verifies the superiority of the proposed
approach in terms of operational value. And it is especially
evident in the context of extensive penetration of wind power.
Keywords: Prediction interval; forecast value; decision-
making; uncertainty
I. INTRODUCTION
The ongoing decarbonization effort in the energy sector
places a particular emphasis on renewable energy sources
(RESs). Albeit enjoying the merits of clean and non-emission,
the stochastic nature of RESs poses a great challenge to
power systems operation and electricity markets, as the power
generation of RESs cannot be scheduled at will. This drives the
need of forecasting RES generation at future times to support
power system operation [1], such as power dispatch, trading
[2], and reserve procurement.
Forecasts can be communicated in various forms[3], includ-
ing single-valued points[4], densities[5], [6], and prediction
regions[7], [8]. Among them, prediction regions provide a
summary of the probability distribution of random variables.
For univariate forecasting, a prediction region is communi-
cated as a prediction interval (PI), which is specified by
two bounds and the nominal coverage probability (NCP)
(1−β)×100% that specifies the probability that the realization
falls in. PI has a wide range of applications in nowadays
power industry. For instance, PI serves as an input to robust
optimization for quantifying the wind uncertainty, determining
the reserve quantities [9] and wind power offering in the
day-ahead market [10], where the NCP is commonly chosen
Yufan Zhang is with the Department of Electrical and Computer Engineer-
ing, University of California San Diego, San Diego, California 92161, US.
Honglin Wen is with Department of Electrical Engineering, Shanghai Jiao
Tong University, Shanghai 200240, China.
Qiuwei Wu is with Tsinghua-Berkeley Shenzhen Institute,Tsinghua Shen-
zhen International Graduate School, Tsinghua University, Shenzhen 518055,
China.
Corresponding author: Qiuwei Wu (e-mail: qiuwu@sz.tsinghua.edu.cn).
between 90% and 95%. Also, based on the estimated PI,
the concept of uncertainty budget is leveraged to reduce the
conserveness of robust optimization in storage control [11],
unit commitment [12], and microgrid dispatch [13], which
is beneficial to reducing the operational cost in the robust
optimization.
PI is always desired to have good reliability and sharpness,
which means that the interval width needs to be minimized in
the constraint of some NCP. In recent decades, non-parametric
approaches have been preferred by the forecasting community,
which mainly develops quantile regression (QR) models to
issue a pair of quantiles as a PI. Machine learning models, such
as recurrent neural network [14], ridge regression [15], and
neural basis expansion model [16] have been combined with
QR, with the loss function of the pinball loss, which shows
superiority thanks to the strong learning ability of machine
learning models. Usually, the quantiles in a PI are statistically
symmetric with respect to the median, i.e., qβ/2,q1−β/2, which
is therefore referred to as the central PI (CPI) in literature.
However, the probability distribution of the RESs power
output is generally skewed [10], [17], thereby the width of
CPIs is often unnecessarily wide [18]. For this reason, optimal
PI (OPI) forecasting approaches arise, which optimize over
the bounds with the objective of improving statistical quality,
such as minimizing the Winkler score. A thread of studies
select the probability proportion according to the contextual
information, instead of setting it as a predetermined constant
like in the CPI approaches. Ref. [19] learned the policy of
proportion selection seeking to minimize the Winkler score. In
another thread of studies, the forecast model outputs the two
bounds directly without specifying a probability proportion to
it. As the quality metrics such as Winkler score are generally
non-differentiable, the main difficulty lies in how to design a
surrogate loss function [20] or a proper optimization technique
to estimate the forecast model parameters. Ref. [21] formu-
lated a multi-objective problem and optimized the parameters
of the extreme learning machine (ELM) by the particle swarm
optimization. In [18], the parameters estimation for the ELM
was formulated as a mixed-integer linear programming (MILP)
problem, which was solved by off-the-shelf solvers.
Although the aforementioned PI forecasting approaches
have contributed to improving forecasting quality in the view
of statistics, they have overlooked the value of forecasts in the
downstream power system operation. The idea of using value
for evaluating the goodness of forecasting can be dated back to
[22], where value is defined as the economic/operational gain
from leveraging forecasts at decision-making stages. Take a
robust optimization problem as an example (such as robust
power dispatch); the input PI will definitely impact the opera-
tional cost. Indeed, it has been shown that the improvement in
arXiv:2210.04152v2 [eess.SY] 13 Feb 2023