
LaundroGraph: Self-Supervised Graph Representation Learning
for Anti-Money Laundering
Mário Cardoso
mario.cardoso@feedzai.com
Feedzai
Pedro Saleiro
pedro.saleiro@feedzai.com
Feedzai
Pedro Bizarro
pedro.bizarro@feedzai.com
Feedzai
ABSTRACT
Anti-money laundering (AML) regulations mandate nancial insti-
tutions to deploy AML systems based on a set of rules that, when
triggered, form the basis of a suspicious alert to be assessed by hu-
man analysts. Reviewing these cases is a cumbersome and complex
task that requires analysts to navigate a large network of nancial
interactions to validate suspicious movements. Furthermore, these
systems have very high false positive rates (estimated to be over
95%). The scarcity of labels hinders the use of alternative systems
based on supervised learning, reducing their applicability in real-
world applications. In this work we present LaundroGraph, a novel
self-supervised graph representation learning approach to encode
banking customers and nancial transactions into meaningful rep-
resentations. These representations are used to provide insights
to assist the AML reviewing process, such as identifying anoma-
lous movements for a given customer. LaundroGraph represents
the underlying network of nancial interactions as a customer-
transaction bipartite graph and trains a graph neural network on a
fully self-supervised link prediction task. We empirically demon-
strate that our approach outperforms other strong baselines on
self-supervised link prediction using a real-world dataset, improv-
ing the best non-graph baseline by
12
p.p. of AUC. The goal is to
increase the eciency of the reviewing process by supplying these
AI-powered insights to the analysts upon review. To the best of our
knowledge, this is the rst fully self-supervised system within the
context of AML detection.
CCS CONCEPTS
•Computing methodologies →Anomaly detection
;
Neural
networks;Learning latent representations.
KEYWORDS
anti-money laundering, self-supervision, graph neural networks
ACM Reference Format:
Mário Cardoso, Pedro Saleiro, and Pedro Bizarro. 2022. LaundroGraph: Self-
Supervised Graph Representation Learning for Anti-Money Laundering. In
3rd ACM International Conference on AI in Finance (ICAIF ’22), November
2–4, 2022, New York, NY, USA. ACM, New York, NY, USA, 9 pages. https:
//doi.org/10.1145/3533271.3561727
Permission to make digital or hard copies of all or part of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for prot or commercial advantage and that copies bear this notice and the full citation
on the rst page. Copyrights for components of this work owned by others than the
author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or
republish, to post on servers or to redistribute to lists, requires prior specic permission
and/or a fee. Request permissions from permissions@acm.org.
ICAIF ’22, November 2–4, 2022, New York, NY, USA
©2022 Copyright held by the owner/author(s). Publication rights licensed to ACM.
ACM ISBN 978-1-4503-9376-8/22/10... $15.00
https://doi.org/10.1145/3533271.3561727
1 INTRODUCTION
Money laundering is a criminal activity concerned with concealing
the origin of funds obtained through illegal means such as terrorism
nancing, drug tracking or corruption, appearing legitimate until
a thorough analysis is performed. An estimated
€
1.7 to
€
4 trillions
(2% - 5% of global GDP) are estimated to be laundered annually [13].
To adhere to the AML regulations, nancial institutions employ
compliance experts that investigate suspicious activities alerted,
usually, through a rule-based system. These triggered rules are the
starting point of a process that can take several days to complete,
culminating in a decision of agging as suspicious activity or not.
When the former is identied, a suspicious activity report must be
led and delivered to a regulatory institution that proceeds with
due action. Non-compliance in reporting money laundering can
lead nancial institutions and their employees to face civil and
criminal penalties, such as heavy nes or prison time.
In Anti-Money Laundering (AML) reviewing, analysts investi-
gate alerts centered on an entity (e.g., bank accounts or customers),
comprised of a bulk of transactions that triggered one or more
rules in order to understand if any suspicious activity was involved.
Navigating the network of interactions sprawling from a complex
alert and keeping track of the ows of money, often times through
entities not directly connected to the one being investigated, is
a challenging and cumbersome task. To facilitate this procedure,
analysts resort to understanding the data through aggregations
of meaningful categories, such as grouping by entities interacted
with (known as counterparts) or amounts, as well as relying on
their past experience and prior knowledge of the customer under
review. Throughout the review process, there is a continuous eort
to lter the large bulk of transactions into a smaller set of abnormal
interactions that can be used to justify suspicious activity. There
are some challenges with the current reviewing process, namely:
1) New analysts lack the context more experienced analysts might
have, requiring an additional eort to familiarize themselves with
re-occurring customers. Similarly, additional eort is required to
contextualize new customers entering the system; 2) It is challeng-
ing to navigate the bulk of transactions and decide which move-
ments are particularly suspicious, and resorting to a macro-view of
the interactions can lead to missing the ne-grained details of each
transaction.
To mitigate the aforementioned challenges, in this work we
present
LaundroGraph
, a novel fully self-supervised approach
leveraging Graph Neural Networks (GNNs) to encode represen-
tations of customers and transactions within the context of AML
reviewing. We propose to represent the network of nancial in-
teractions as a directed bipartite customer-transaction graph
1
,
1
Other networks were considered but this was simultaneously the best performing
and most exible approach
arXiv:2210.14360v1 [cs.LG] 25 Oct 2022