Using Entropy Measures for Monitoring the
Evolution of Activity Patterns
Yushan Huang
Dyson School of Design Engineering
Imperial College London
&
Care Research and Technology Centre
The UK Dementia Research Institute
London, UK
yushan.huang21@imperial.ac.uk
Hamed Haddadi
Dyson School of Design Engineering
Imperial College London
&
Care Research and Technology Centre
The UK Dementia Research Institute
London, UK
h.haddadi@imperial.ac.uk
Yuchen Zhao
Dyson School of Design Engineering
Imperial College London
&
Care Research and Technology Centre
The UK Dementia Research Institute
London, UK
yuchen.zhao19@imperial.ac.uk
Payam Barnaghi
Department of Brain Sciences
Imperial College London
&
Care Research and Technology Centre
The UK Dementia Research Institute
London, UK
p.barnaghi@imperial.ac.uk
Abstract—In this work, we apply information theory inspired
methods to quantify changes in daily activity patterns. We use
in-home movement monitoring data and show how they can
help indicate the occurrence of healthcare-related events. Three
different types of entropy measures namely Shannon’s entropy,
entropy rates for Markov chains, and entropy production rate
have been utilised. The measures are evaluated on a large-
scale in-home monitoring dataset that has been collected within
our dementia care clinical study. The study uses Internet of
Things (IoT) enabled solutions for continuous monitoring of in-
home activity, sleep, and physiology to develop care and early
intervention solutions to support people living with dementia
(PLWD) in their own homes. Our main goal is to show the
applicability of the entropy measures to time-series activity data
analysis and to use the extracted measures as new engineered
features that can be fed into inference and analysis models.
The results of our experiments show that in most cases the
combination of these measures can indicate the occurrence of
healthcare-related events. We also find that different participants
with the same events may have different measures based on one
entropy measure. So using a combination of these measures in
an inference model will be more effective than any of the single
measures.
Keywords—entropy, healthcare, feature engineering, IoT
I. INTRODUCTION
Finding patterns in activity data collected by IoT has been
applied to various research fields, including object tracking [1],
intrusion detection [2], and healthcare [3]. Existing research
mainly focuses on using machine learning (ML) models and
algorithms to learn and analyse patterns from raw data [4].
Although such methods can find the direct combination of
raw data points that can indicate interesting events, they are
not able to utilise statistical and useful information about
the data, such as the distribution of data points and the
uncertainty in such distributions. Determining the distribution
and uncertainty will introduce more useful information into
ML models and thus can potentially contribute to building
accurate prediction and inference models.
In this paper, we propose three measures that are constructed
based on entropy. Our goal is to provide measures to enhance
the ability of processing models to quantify the changes in
data patterns and improve the outcome of predictive and
analytical machine learning models. We empirically evaluated
these measures on the data that we have collected in an in-
home healthcare monitoring IoT platform (illustrated in Fig. 1)
to support PLWD. Our preliminary results indicate that, in
most cases, combining these new measures into data analysis
pipelines can suggest occurrences of certain healthcare-related
events. These measures show different suitability on different
participants’ data. So these measures as engineered features in
modern ML methods can improve the outcomes of inference
and predictive models.
II. SYSTEM AND DATA
We have developed a digital platform, called Minder, to
integrate in-home IoT sensors to collect physiological data,
sleep data, environmental data, and activity data in a privacy-
aware and secure manner. A list of the IoT devices in our
platform is shown in Table I. The dataset used in this study
includes 9,370 person-day activity data collected between
arXiv:2210.01736v2 [cs.LG] 5 Oct 2022