
the transformation of EEG data automatically through model training [
17
,
18
,
19
]. In addition to
the fast growth of DL-based EEG decoders, geometric learning (GL) approaches, mostly based on
Riemannian geometry (RG), have been adopted in the field of BCI [
20
]. RG is a type of non-Euclidean
geometry that has a different interpretation of Euclid’s fifth postulate (i.e. parallel postulate) [
21
].
In GL, geodesic between points on the manifold is a critical feature for classification tasks in BCI.
The power and spatial distribution of a segment of multi-channel EEG signals can be coded into
a covariance matrix that is symmetric positive definite (SPD) in general. The use of Riemannian
geometry allows mapping of EEG data directly onto a Riemannian manifold where Riemannian
metrics are insensitive to outliers and noise [
22
,
20
]. RG can also avoid swelling effect [
23
], which is
a common issue when employing Euclidean metric. Furthermore, metrics on Riemannian manifold
have several types of invariance properties [
24
,
22
], which make the model have higher generalization
capability to complex EEG signals. In 2010, Barachant et al. [
25
] proposed Minimum Distance
to Mean (MDM) that maps target EEG data onto the SPD manifold to find the nearest class center.
Later on, they developed TSLDA [
26
] that projects data from the manifold to a specific tangent space
where Euclidean classifiers are applicable. RG-based classification for EEG decoding has shown
extra robustness as the relationship between data samples can be stably preserved, leading to success
in recent data competitions in the BCI field such as ’DecMEG2014’1and the ’BCI challenge’2.
The nascent field of geometric deep learning (GDL) [
27
] has expanded by emerging techniques
to generalize the use of deep neural networks to non-Euclidean structures, such as graphs and
manifolds. Efforts have been made to transitioning useful operations from Euclidean to Riemannian
spaces, including convolution [
28
,
29
,
30
], activation function [
28
,
29
], batch normalization [
31
,
32
],
that facilitate the ongoing development of GDL tools. SPDNet [
28
] is a Riemannian network
for non-linear SPD-based learning on Riemannian manifolds using bi-linear mapping that mimics
Euclidean convolution for visual classification tasks. ManifoldNet [
29
] offers high performance in
medical image classification with manifold autoencoder. [
33
] characterizes 3D movement via the
manifold polar coordinate with a geodesic CNN. [
27
] performs convolution on the manifold as a
generalization of local graph or manifold pseudo-coordinate for vertex classification on graph and
shape correspondence task. In contrast of the vast develop of GDL in many other scientific fields,
only few studies focus on decoding EEG data with a merge use of GL and DL. [
34
] proposed a
network architecture that integrates fusion of Euclidean-based module and manifold-based module
with multiple LSTM and attention structures to extract spatiotemporal information of EEG. [
35
]
proposes a Riemannian-embedding-banks method that separates the entire embeddings into multiple
sub-problems for learning spatial patterns of MI EEG signals based on the features extracted from
the SPDNet. [
36
] combines federated learning and transfer learning on Riemannian manifold using
the spatial information of EEG. [
37
] proposes deep optimal transport on the manifold to minimize
the cost of domain adaptation from the source domain to the target domain. [
38
] extracts multi-view
representations of EEG. These studies have established cornerstones toward the field of future GDL
for EEG decoding, but the increment of performance is yet marginal. Most of the above-mentioned
techniques can not map the temporal information of EEG onto the manifold, or still rely on Euclidean
tools to handle EEG features. We herein propose a manifold attention network, a novel GDL
framework, which maps EEG features on a Riemannian SPD manifold where the spatiotemporal
EEG patterns are fully characterized. The main contributions of the present study are the following:
• a manifold attention network proposed for decoding general types of EEG data.
•
a lightweight, interpretable, and efficient GDL framework that is capable of capturing
spatiotemporal EEG features across Euclidean and Riemannian spaces.
•
an empirical validation of our proposed model demonstrating its generalizable superiority
over leading DL approaches in EEG decoding.
•
neuroscientific insights interpreted by the model that not only echo prior knowledge but also
offer a new look into the dynamical brain.
This article is organized as follows: we first brief the essential background of RG and manifold
attention mechanism; next, we leverage the proposed MAtt architecture with details of model design;
we then validate our proposed model experimentally; lastly, we interpret our proposed model with
neuroscientific insights. Our source code is released in https://github.com/CECNL/MAtt.
1DecMEG2014: https://www.kaggle.com/competitions/decoding-the-human-brain/leaderboard
2BCI challenge: https://www.kaggle.com/c/inria-bci-challenge
2