
1
Non-Parametric and Regularized Dynamical
Wasserstein Barycenters for Sequential Observations
Kevin C. Cheng∗,IEEE Student Member, Eric L. Miller∗IEEE Fellow,
Michael C. Hughes†, Shuchin Aeron∗IEEE Senior Member
Abstract
We consider probabilistic models for sequential observations which exhibit gradual transitions among a finite number of states.
We are particularly motivated by applications such as human activity analysis where observed accelerometer time series contains
segments representing distinct activities, which we call pure states, as well as periods characterized by continuous transition
among these pure states. To capture this transitory behavior, the dynamical Wasserstein barycenter (DWB) model of [1] associates
with each pure state a data-generating distribution and models the continuous transitions among these states as a Wasserstein
barycenter of these distributions with dynamically evolving weights. Focusing on the univariate case where Wasserstein distances
and barycenters can be computed in closed form, we extend [1] specifically relaxing the parameterization of the pure states as
Gaussian distributions. We highlight issues related to the uniqueness in identifying the model parameters as well as uncertainties
induced when estimating a dynamically evolving distribution from a limited number of samples. To ameliorate non-uniqueness,
we introduce regularization that imposes temporal smoothness on the dynamics of the barycentric weights. A quantile-based
approximation of the pure state distributions yields a finite dimensional estimation problem which we numerically solve using
cyclic descent alternating between updates to the pure-state quantile functions and the barycentric weights. We demonstrate the
utility of the proposed algorithm in segmenting both simulated and real world human activity time series.
Index Terms
Wasserstein barycenter, displacement interpolation, dynamical model, sequential data, time series analysis, sliding window,
non-parametric, quantile function, human activity analysis.
I. INTRODUCTION
We consider a probabilistic model for sequentially observed data where the observation at each point in time depends on a
dynamically evolving latent state. We are particularly motivated by systems that continuously move among a set of canonical
behaviors, which we call pure states. Over some periods, the system may reside entirely in one of the pure states while over
other periods, the system is transitioning among these pure states in a temporally smooth manner. There are many applications
where such a model is appropriate including climate modeling [2], sleep analysis [3], simulating physical systems [4], as well
as characterizing human activity from video [5] or wearable-derived accelerometry [6] data. Using the last case as an example,
there will be periods when the individual will be engaged in a well-defined activity such as standing or running. During these
intervals, the data can be modeled as drawn from a probability distribution specific to that canonical state. Given the high
sampling rates of modern sensors, there also may be intervals where multiple consecutive observations reflect the gradual
transition between or among pure states. Over these periods the distribution of the data is given by a suitable combination of
the pure state distributions. Therefore, one possible model for these types of systems consists of three components: a set of
distributions containing the data-generating distribution for each pure state, a continuously evolving latent state which captures
the transition dynamics of the system as it moves among these pure states, and a means of interpolating among these pure
state distributions to characterize the data distribution in the transition regions.
These types of systems pose some unique considerations that are not sufficiently addressed by prior work in time series
modeling. The two most common methods for modeling latent state systems are continuous and discrete state-space models.
Continuous state-space models [7], [8], [9] have no natural way to identify those pure states in which the system may persist
for periods of time. In discrete state-space models such as hidden Markov models, [10], [11], [12], the dynamics are captured
by a temporally varying state vector whose elements represent the probability that the system resides in each of a countable
number of discrete (or in our terminology, pure) states. For these models, the data-generating distribution associated with this
∗Tufts University, Dept. of Electrical and Computer Engineering
†Tufts University, Dept. of Computer Science
This research was sponsored by the U.S. Army DEVCOM Soldier Center under the Measuring and Advancing Soldier Tactical Readiness and Effectiveness
program and Cooperative Agreement Number W911QY-19-2-0003. We also acknowledge support from the U.S. National Science Foundation under award
HDR-1934553 for the Tufts T-TRIPODS Institute. Shuchin Aeron is supported in part by NSF CCF:1553075, NSF RAISE 1931978, NSF ERC planning
1937057, and AFOSR FA9550-18-1-0465. Michael C. Hughes is supported in part by NSF IIS-1908617. Eric L. Miller is supported in part by NSF grants
1934553, 1935555, 1931978, and 1937057.
Code repository: https://github.com/kevin-c-cheng/DWB Nonparametric
©2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including
reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or
reuse of any copyrighted component of this work in other works. DOI: 10.1109/TSP.2023.3303616
arXiv:2210.01918v3 [cs.LG] 21 Sep 2023