2 Vétil at al.
of shape collections. In particular, atlas models [16] learn geometrical distribu-
tions in terms of an “average” representative shape and associated variability,
generalizing the Euclidean mean-variance analysis. In medical imaging, learning
atlases from healthy examples allows for the definition of normative models for
anatomical structures or organs, such as brain MRIs or subcortical regions seg-
mented from neuroimaging data [10,22], thus providing a natural framework for
the detection of abnormal anatomies.
In practice, leveraging an atlas model to compute the likelihood of a given
shape to belong to the underlying distribution either requires to identify land-
marks [6], or to solve a registration problem [3]. To circumvent the computational
cost of this shape embedding operation, the authors in [21] proposed to train an
encoder network to predict registration parameters from image pairs. In [7, 14],
the authors built on this idea and used the variational autoencoder (VAE) of [13]
to learn the embedding space jointly with the atlas model, instead of relying on
pre-determined parametrization strategies. However, the structure of the decod-
ing network remained constrained by hyperparameter-rich topological assump-
tions, enforced via costly smoothing and numerical integration operators from a
computational point of view.
Alternative approaches proposed to drop topological hypotheses by relying
on variations of the AE or its variational counterparts [13] to learn normative
models that are subsequently used to perform Anomaly Detection (AD). These
methods compress and reconstruct images of healthy subjects to capture a nor-
mative model of organs [1,23]. Yet, they are usually applied on the raw imaging
data, thus they entail the risk of extracting features related to the intensity dis-
tribution of a dataset which are not necessarily specific to the organ anatomy.
Therefore, regularization constraints [1, 4] are introduced to improve the detec-
tion performances compared to the vanilla AE. To further reduce the overfitting
risk, these methods artificially increase the dataset size by working on 2D slices.
Given this context, we propose a VAE-based method to learn a normative
model of organ shape that can subsequently be used to detect anomalies, thus
bridging the gap between SSM and AD models. Although SSM methods with
explicit modeling constraints proved effective to learn relevant shape spaces from
relatively small collections of high-dimensional data, we propose to further re-
duce the set of underlying hypotheses and leave the decoding network uncon-
strained in its architecture. With the objective to learn normative shape models
from collections of healthy organs, we argue that sufficiently large databases of
relevant medical images can be constructed by pooling together different data
sources, see [8] for instance. To reduce the risk of overfitting and focus on the
anatomy of organs, the VAE is learned from 3D binary segmentation masks
and is coupled with a shape-preserving data augmentation strategy consisting of
translations, rotations and scalings. An approach to study and visualize group
differences is also proposed.
Section 2 details the proposed method, which is then illustrated on a pancreas
shape problem in Section 3. Section 4 discusses the results and concludes.