
SPECTRANET:MULTIVARIATE FORECASTING AND IM-
PUTATION UNDER DISTRIBUTION SHIFTS AND MISSING
DATA
Cristian Challu ∗
School of Computer Science
Carnegie Mellon University
cchallu@andrew.cmu.edu
Peihong Jiang
AWS AI Labs
jpeihong@amazon.com
Ying Nian Wu
AWS AI Labs
wunyin@amazon.com
Laurent Callot
AWS AI Labs
lcallot@amazon.com
ABSTRACT
In this work, we tackle two widespread challenges in real applications for time-
series forecasting that have been largely understudied: distribution shifts and miss-
ing data. We propose SpectraNet, a novel multivariate time-series forecast-
ing model that dynamically infers a latent space spectral decomposition to cap-
ture current temporal dynamics and correlations on the recent observed history.
A Convolution Neural Network maps the learned representation by sequentially
mixing its components and refining the output. Our proposed approach can simul-
taneously produce forecasts and interpolate past observations and can, therefore,
greatly simplify production systems by unifying imputation and forecasting tasks
into a single model. SpectraNetachieves SoTA performance simultaneously
on both tasks on five benchmark datasets, compared to forecasting and imputa-
tion models, with up to 92% fewer parameters and comparable training times. On
settings with up to 80% missing data, SpectraNethas average performance im-
provements of almost 50% over the second-best alternative. Our code is available
at https://github.com/cchallu/spectranet.
1 INTRODUCTION
Multivariate time-series forecasting is an essential task in a wide range of domains. Forecasts are
a key input to optimize the production and distribution of goods (B¨
ose et al., 2017), predict health-
care patient outcomes (Chen et al., 2015), plan electricity production (Olivares et al., 2022), build
financial portfolios (Emerson et al., 2019), among other examples. Due to its high potential benefits,
researchers have dedicated many efforts to improving the capabilities of forecasting models, with
breakthroughs in model architectures and performance (Benidis et al., 2022).
The main focus of research in multivariate forecasting has been on accuracy and scalability, to
which the present paper contributes. In addition, we identify two widespread challenges for real
applications which have been largely understudied: distribution shifts and missing data.
We refer to distribution shifts as changes in the time-series behavior. In particular, we focus on
discrepancies in distribution between the train and test data, which can considerably degrade the
accuracy (Kuznetsov & Mohri, 2014; Du et al., 2021). This has become an increasing problem in
recent years with the COVID-19 pandemic, which disrupted all aspects of human activities. Missing
values is a generalized problem in applications. Some common causes include faulty sensors, the
impossibility of gathering data, corruption, and misplacement of information. As we demonstrate in
our experiments, these challenges hinder the performance of current state-of-the-art (SoTA) models,
limiting their use and potential benefits in applications where these problems are predominant.
∗Work completed during internship at Amazon AWS AI Labs.
1
arXiv:2210.12515v2 [cs.LG] 25 Oct 2022