Empirical Evaluation of Data Augmentations for Biobehavioral Time Series Data
with Deep Learning
Huiyuan Yang, Han Yu and Akane Sano
Department of Electrical Computer Engineering
Rice University, Houston TX 77005, USA
{hy48, hy29, Akane.Sano}@rice.edu
Abstract
Deep learning has performed remarkably well on many
tasks recently. However, the superior performance of deep
models relies heavily on the availability of a large number
of training data, which limits the wide adaptation of deep
models on various clinical and affective computing tasks,
as the labeled data are usually very limited. As an effec-
tive technique to increase the data variability and thus train
deep models with better generalization, data augmentation
(DA) is a critical step for the success of deep learning mod-
els on biobehavioral time series data. However, the effec-
tiveness of various DAs for different datasets with differ-
ent tasks and deep models is understudied for biobehav-
ioral time series data. In this paper, we first systematically
review eight basic DA methods for biobehavioral time se-
ries data, and evaluate the effects on seven datasets with
three backbones. Next, we explore adapting more recent DA
techniques (i.e., automatic augmentation, random augmen-
tation) to biobehavioral time series data by designing a new
policy architecture applicable to time series data. Last, we
try to answer the question of why a DA is effective (or not)
by first summarizing two desired attributes for augmenta-
tions (challenging and faithful), and then utilizing two met-
rics to quantitatively measure the corresponding attributes,
which can guide us in the search for more effective DA for
biobehavioral time series data by designing more challeng-
ing but still faithful transformations. Our code and results
are available at Link.
1. Introduction
Deep learning performs remarkably well in many fields,
including computer vision (CV), natural language process-
ing (NLP), and recently time series-related tasks [6,10,
32]. Those successful applications increasingly inspire re-
searchers to embrace deep learning for solving issues in
human-centered applications that use physiological and be-
havioral time series data. However, the superior perfor-
mance of deep models relies heavily on the availability of a
large number of training data, but unfortunately, many hu-
man centered applications (i.e., healthcare tasks) usually do
not have enough labeled samples, which may limit the wide
adaptation of deep models to various computing tasks.
As an effective technique to increase the data variability
and thus train deep models with better generalization, data
augmentation (DA) is a critical step for the successful appli-
cations of deep learning models. While DA can yield con-
siderable performance improvements, they do require do-
main knowledge and are task- and domain-dependent. For
example, image rotation, a likely class-preserving behavior,
is designed to rotate the input by some number of degrees.
The image’s class can still be recognized by humans, thus
allowing the model to generalize in a way humans expect
it to generalize. However, such an effective random angle-
based rotation operation may not be applicable to other do-
mains, i.e., wearable data. In addition, searching for the
most effective DA methods for a new dataset is very time-
consuming, and this motivated the proposal of several auto-
matic DA search algorithms [4,5,14–16].
The existing DA literature mainly focuses on computer
vision, but its application to other domains, i.e, biobehav-
ioral time series data, is understudied. A few works inves-
tigated the effectiveness of basic DA methods for time se-
ries and wearable data [1,11,30,32]. However, those works
only investigated the very basic DAs, leaving the more re-
cent DA techniques (i.e., automatic DA) unexplored. More
importantly, it is still an open question of why a DA method
works, and how to quantify its effectiveness. Therefore,
in this paper, we first systematically review various basic
DA methods for biobehavioral time series data, evaluating
the effects on different datasets with varied backbones and
tasks. Next, we validate the effectiveness of adapting more
recent DA techniques (i.e., automatic DA) to biobehavioral
time series data. Following the DADA [14], we designed a
different policy architecture where the operations are differ-
entiable with respect to different time series DA methods.
arXiv:2210.06701v1 [cs.LG] 13 Oct 2022