incomplete data. Fortunately, traffic speeds bounded in the road network are highly correlated in both space and
time, and therefore pose a rich degree of redundancy (or low-rankness in a matrix or tensor theory’s term (Tan
et al.,2013)) that undetected or erroneous speeds of some road links are recoverable by virtue of the correlation.
In other words, sparse sensors are able to achieve a "collaborative perception" of traffic speeds of the undetected
road links, when their intrinsic multi-dimensional correlations are effectively exploited (see Fig. 1).
Recent years have witnessed a vast array of studies investigating the problem of estimating traffic state on
undetected road links (Meng et al.,2017;Zhang et al.,2020;Wu et al.,2021a;Lei et al.,2022). This problem is often
cast into spatiotemporal traffic kriging (Wu et al.,2021a) in the literature. Unlike the traffic forecasting problem
that aims at predicting future state of a road link given its historical state, spatiotemporal traffic kriging attempts
to estimate the traffic state of an undetected road link without any historical state of that link; it instead needs
to utilize measurements of other detected road links to complete the "spatial prediction." Kriging is therefore
regarded as a special missing data imputation problem. How to effectively utilize the correlation between undetected
road links and other sparsely detected roads is the core research question of this paper. Upon close inspection, we
identify three-fold specific challenges:
First, correlation between speeds of the detected road links and speeds of the undetected ones can be arbitrary
unless reasonable prior assumptions are made. Low-rankness is a widely-adopted general prior in literature to
reflect linear correlation of traffic data over space and time (Tan et al.,2013;Asif et al.,2016;Chen et al.,2021a).
However, low-rankness is not directly applicable to our kriging problem in which data are fully missing in some
locations, as in this case, we can fill different sets of values to the missing data without changing the rank and the
missing values therefore remain undetermined under the low-rankness assumption. Conceptually, low-rankness
is not sufficient when imputing a space-time matrix with entire row or column missing. Additional regularization
terms should be carefully considered to complement the low-rankness assumption (Bahadori et al.,2014;Yokota
et al.,2018;Yamamoto et al.,2022).
Second, in addition to general correlation assumptions, exploiting the semantic spatiotemporal correlation of
speeds is an necessary add-on. Since speed manifests microscopic traffic that is time-varying, dynamic, periodic,
and spatially interdependent, in the meanwhile, spatial correlation is bounded on a graph instead of on a 2D
plane, spatial correlation should consider the graph features (e.g., network topology) as well as traffic flow loaded
on that graph (e.g., moving directions of traffic flow) (Li et al.,2017), exploiting spatio-temporal correlations is
by no means trivial. Moreover, as a graph signal, traffic data features inherent time series characteristics such
as continuity, periodicity, and stationary, providing another prior for ones to achieve time-varying recovery.
Tailoring an effective kriging model to capture the multi-modal correlations of spatiotemporal traffic data is still
an open question yet to be resolved.
Third, scalability of the kriging approach onto the network level is desired as it allows exploiting correlation
of a wider scope of speeds over space with added accuracy. Moreover, when new measurements are collected,
a kriging model with fast parametric update is preferable for an online estimation.To this end, computational
efficiency is the key. However, classical Gaussian process (GP) based kriging doesn’t show scalability due to
its computational complexity, while existing scalable tensor based imputation methods (Lu et al.,2019b;Chen
et al.,2021a) are mainly designed for general data imputation problem, and are not applicable to the kriging
task. Although Bayesian Gaussian kriging methods achieve flexible hyper-parameter tuning for spatiotemporal
kriging (Lei et al.,2022;Chen et al.,2022b), its high consumption of time and memory in parametric learning of
the Gaussian kernel inhibits itself from being scaled to the network level. Custom scalable solution algorithms
need to be designed to maximize the kriging performance.
To tackle the challenges and realize a network-wide kriging of missing speed data for the undetected road
links, we propose a Laplacian enhanced tensor completion (LETC) based kriging model to exploit multi-modal
spatiotemporal correlations of traffic speeds, where three kinds of carefully chosen speed correlation – temporal
continuity,temporal periodicity and spatial proximity – are simultaneously captured by assigning three forms of
graph Laplacian. By integrating graph Laplacian into tensor completion model either explicitly or implicitly,
LETC can leverage both global low-rankness property and local dependency informed by physical constraints at
the same time. The framework contributes to the current literature in four aspects as follows:
1.
We customize a new temporal graph Fourier transform (TGFT) based on temporal periodic graph Laplacian
to encode the temporal periodicity of speeds. The TGFT complements the low-rankness assumption and
enables parallelization of the tensor completion by a series of daily matrix in graph spectral domain.
2.
We propose a novel diffusion graph regularization (DGR) to exploit spatial dependency on directed graphs.
By modeling as random walkers on graphs, DGR considers the directed nature of traffic flows and closely
connects with the graph diffusion process. Moreover, DGR features explicit physical explanations and
encourages a low-pass propagation which is beneficial for reconstructing spatiotemporal traffic data.
3.
We develop a general formulation of temporal consistency regularization (GTCR) to impose smoothness
constraints on time slots within a kernel size. GTCR considers the causality of time series in a model-free
manner and can directly convert into some previously developed temporal regularization terms.
4.
To scale up the kriging on large-scale network, we speed up our model by randomized tensor singular
value decomposition and conjugate gradient method. Experiments on million-level traffic datasets not only
2