
SIGIR ’23, July 23–27, 2023, Taipei, Taiwan Zhongxiang Sun, Jun Xu, Xiao Zhang, Zhenhua Dong, & Ji-Rong Wen
Article 163:
[The crime of Bribery]
Company and enterprise
work personnel, who, in the
course of economic contacts,
receive personal kick-backs
and commissions in various
forms in violation of state
rules …….
Article 246:
[The crime of Libel]
Those openly insulting
others using force or other
methods or those fabricating
stories to slander others, if
the case is serious, are to be
sentenced to three years in
prison ......
Case A:
Fact Description: The defendant, Zhou **,slandered Li ** for taking
bribes out of revenge which made a huge negative impact on Li ** ‘s life
and work ……
Cited law article: PRC Criminal Law, Article 246.
Case B:
Fact Description: The defendant, Hua ** took advantage of his position to
take bribes, making a profit of more than 1,000,000 YUAN during the
period ......
Cited law article: PRC Criminal Law, Article 163.
Case C:
Fact Description: The defendant, Le **, for the purpose of profit,
created chat groups to attract gambling participants to join, gambling
activities using the games on Xingyue Mahjong APP ……
Cited law article: PRC Criminal Law, Article 303.
Case D:
Fact Description: The defendant , Xu ** and Xu ** opened a gambling
shop in the form of "small village" in Xu ** convenience store in Wenling
City, and called on gamblers to gamble in the shop ……
Cited law article: PRC Criminal Law, Article 303.
Label: mismatch
Label: match
Article 303:
[The crime of Gambling]
Whoever, for the purpose of
reaping profits, assembles a
crow to engage in gambling,
opens a gambling house, or
makes an occupation of
gambling is to be sentenced
to not more than three
years …...
PRC Criminal Law
Legal case pair
Figure 1: Left: two pairs of legal cases; Right: three cited law
articles in the legal cases. (translated from Chinese)
Analysis shows that the law articles are also important in legal
case matching. Figure 1 shows the snapshots of two real legal case
pairs
2
. Contents of the cited law articles are listed in the right part of
the gure. In the rst legal case pair, Case A and Case B share a large
number of words in their fact descriptions. However, the judges’
decisions are: Case A is libel crime (PRC Criminal Law, Article
246) while Case B is bribery crime (Article 163). The associated law
articles are helpful in identifying the key information (highlighted)
in the two cases [
14
]. By comparing the key information in the two
cases, experts annotate the matching label as “mismatch” though
they have relatively high semantic text similarity (measured by
Lawformer [
41
]). In the second example of Figure 1, Case C and
Case D have relatively lower semantic text similarity than the
previous pair. However, both of them are judged as the gambling
crime (Article 303). The law article helps to identify similar key
information (highlighted) in these two legal cases. So the expert-
annotated matching label is “match”.
Usually, the key constitutive elements and the key circumstances
provide important signals for the matching of two legal cases [
19
].
The key constitutive elements are highly summative texts written
in light of some law articles. The key circumstances, on the other
hand, are detailed fact descriptions and are usually very dierent
from case to case. They are not directly related to any law articles.
Therefore, it is possible that law articles can help the matching
model to identify and decompose these key information.
From the causal sense, the matching of two legal cases is aected
by the mediation eect from the law articles and the direct eect
from the key circumstances part of legal cases. More specically,
the key constitutive elements in the legal cases mediate the law
articles’ eect on the matching decision (i.e., the mediation eect).
In contrast, the key circumstances directly aect the matching
decision (i.e., the direct eect). As a result, the embedding of a legal
case actually consists of two parts: the law-related part, which is
the mediator of the mediation eect, and the law-unrelated part,
which has direct eect. These two parts reect dierent association
2Crawled from http://faxin.cn and translated to English.
mechanisms between the legal cases and the matching decisions. It
is necessary to identify and treat them dierently.
To address the issue, this paper proposes a causal representa-
tion learning framework tailored for legal case matching, called
Law-Match. Specically, Law-Match considers the legal cases as
treatment and the corresponding law articles as instrument variables
(IVs) [
2
,
9
,
33
,
38
]. In the matching phase, after getting the embed-
dings of the legal cases (i.e., treatments) and the related law articles
(i.e., IVs), Law-Match rst uses the IVs to regress the treatments,
resulting in the tted vector (law-related part) and the residuals
(law-unrelated part). These two parts have dierent eects on the
nal matching. Law-Match then combines them into a newly recon-
structed treatment vector with the attention mechanism. Finally, the
reconstructed treatment is fed to the underlying matching model
for making the nal matching prediction. In the training phase,
an alternative optimization procedure is developed to learn the
parameters in the IV regression and matching models.
We summarize the major contributions of the paper as follows:
(1) We analyze the essential role of law articles in legal case
matching from a causal view: the matching decisions are aected
by the mediation eect of the law articles and the direct eect of
the key circumstances in the legal cases.
(2) We propose a novel model-agnostic causal learning frame-
work which introduces the law articles into the process of legal case
matching in a theoretically sound way. IV regression is adopted
to decompose the mediation eect and direct eect from the legal
case embeddings by considering law articles as IVs and legal cases
as the treatments.
(3) We conducted extensive experiments on three public datasets.
Experimental results demonstrated that Law-Match could signi-
cantly improve the underlying models’ performance and outper-
form the baselines, verifying the importance of the law articles in
legal case matching.
2 RELATED WORK
2.1 Legal case matching
Conventionally, legal case matching can be addressed with manual
knowledge engineering (KE) [
6
]. The methods include the Boolean
search technology and manual classication [
11
]. With the devel-
opment of NLP, deep learning has been adapted to realize seman-
tic level matching of legal cases. According to [
8
], these studies
can be categorized as network-based and text-based methods. The
network-based methods are tailored for common law and use the ci-
tations of dierent cases to construct a Precedent Citation Network
(PCNet). For example, [
17
] use PCNet-based Jaccard similarity to
infer the paired legal cases’ similarity. Bhattacharya et al
. [8]
use
Node2vec to map the nodes of the graph to a vector space and then
compute the legal cases’ cosine similarity. See also [7, 21].
The text-based methods compute the semantic similarity be-
tween legal cases. Shao et al
. [29]
utilize BERT to capture the
semantic relationships at the paragraph level and then infer the
relevance between two cases by aggregating the paragraph-level
interactions. Xiao et al
. [41]
release the longformer-based [
5
] pre-
trained language model to get a better representation of long legal
documents. Yu et al
. [46]
propose a three-stage explainable legal
case matching model. Law articles have shown their eects on a