Note that existing works generally assume the entities in the few-shot tasks (support + query set)
belong to the background KG. However, in real world cases, the goal of few-shot KG completion is
to simulate learning of novel relations that may involve new entities not exist yet on the KG. Thus,
in this paper we also consider a more challenging inductive setting where entities in the few-shot
tasks do not belong to the entity set
E
, but new triplets about these unseen entities can be added at
test time.
4 Connection Subgraph Reasoner
In this section, we first discuss our main motivation from the inductive reasoning perspective and
present the general framework based on it. Then we introduce both learning-free and learning-based
implementations of this framework.
4.1 Inductive Reasoning
Inductive Reasoning refers to the reasoning process of synthesizing a general principle from past
observations, and then using this general principle to make predictions about future events Hunter
[1998]. Few-shot link prediction task can be seen as an inductive reasoning task with background
knowledge.
The key motivation of our work is eliminative induction, one of the principled methods used to
reach inductive conclusions. Specifically, we consider the scientific hypothesis method: eliminating
hypotheses inconsistent with observations. In the context of few-shot link prediction task, we
explicitly try to find a hypothesis consistent with all examples in the support set, then test whether the
the query is consistent with this hypothesis.
To illustrate a simple case of this, we use the
example where the support triplets are (
chop
,
,
kitchen
), (
read
,
,
library
), and query triplet is (
sleep
,
,
?
). From a background KG
(e.g.ConceptNet), we can know a lot of knowledge in forms of triplets about these entities, such as
(
kitchen
,
is part of
,
a house
) and (
read
,
is done by
,
human
) etc. We essentially want to
find an induction hypothesis that explains how chop is related to kitchen in the same way that read is
related to library. In other words, we want to find the shared connection pattern over the background
KG that connects both two pairs of entities. In this case, we can observe that there is a simple shared
2 hop connection path that connects both pairs:
{(chop,can be done with,knife),(knife,is located at,kitchen)}(1)
{(read,can be done with,book),(book,is located at,library)}(2)
The abstracted inductive hypothesis consistent with both examples in the support set is then
∃Z, (hc,can be done with, Z)∧(Z, is located at, tc) =⇒(hc, , tc).(3)
This hypothesis can then be used to deduce that (
sleep
,
,
bedroom
) has a high score, since we know
{(sleep,can be done with,bed), (bed,is located at,bedroom)} from the background KG.
More generally, the shared connection pattern can be graph structured instead of a two-hop path,
which then form a connection subgraph between the two end entities instead of a connection path
(Figure 1). Here we define the connection subgraph: Let
G0= (E0,R0,T0)
be any subgraph of the
background KG
G
, (i.e.,
E0⊆ E
,
R0⊆ R
and
T0⊆ T
) that satisfies the following requirement
for a given pair of nodes
(hc, tc)
on the KG. (1)
hc∈ E0
and
tc∈ E0
; (2) there is no disconnected
component. We define the connection subgraph
GC
of
(hc, tc)
to be any such
G0
where we further
ignore the node identity. The key insight is that we should only consider the relation structure patterns
and abstract away the node identity in order to construct a hypothesis.
Then a hypothesis like Eq. 3 can be represented as a connection subgraph
GC
by interpreting
each clause as an edge. And such a hypothesis is consistent with a support/query triplet (
h
,
r
,
t
) if
GC
is a connection subgraph of
h, t
. In terms of the
example, the triplet (
sleep
,
,
bedroom
) is consistent with the hypothesis Eq. 3 because the connection subgraph form of the
hypothesis
(hc,can be done with, Z)∧(Z, is located at, tc)
is a connection subgraph be-
tween sleep and bedroom, with
hc, Z, tc
corresponding to
sleep
,
bed
and
bedroom
respectively.
Given a pair of node, we further call the KG subgraph with node identity an evidence that wit-
nesses why the hypothesis is consistent with a connection subgraph of
h, t;
. Note the key difference
4