
assume access to a presumed trusted knowledge
base such as Wikipedia (Thorne et al.,2018), scien-
tific publications (Wadden et al.,2020), or search
engine results (Augenstein et al.,2019). In this pa-
per, we focus on trusted evidence-based verification
approaches which can deal with the truth changing
over time (Schuster et al.,2019). More importantly,
they are the most representative of professional fact
verification. Effectively debunking misinformation
requires stating the corrected fact and explaining
the myth’s fallacy (Lewandowsky et al.,2020), both
of which require trusted evidence.
Global counter-evidence assumption in FCNLP.
In FCNLP, evidence retrieval-based approaches
assume that the semantic content of a claim is
sufficient to find relevant (counter-) evidence in a
trusted knowledge base (Thorne et al.,2018;Jiang
et al.,2020;Wadden et al.,2020;Aly et al.,2021).
This becomes problematic for misinformation that
requires the source guarantee to refute the claim.
By nature, in this case, the claim and evidence
content are distinct and not entailing. Content can-
not assert that two different narratives describe the
same protests (e.g., Claim 5 in Table 1), or that
a non-entailing fact (squalene is harvested from
sharks) serves as a basis for the false claim (e.g.,
Figure 1). The consequence is a circular reasoning
problem: Knowing that a claim is false is a precon-
dition to establishing the source guarantee, which
in turn is needed to refute the claim. To escape
this cycle, one must (a) provide the source guar-
antee by other means than content (e.g., context),
or (b) find evidence that refutes the claim without
the source guarantee (global counter-evidence). By
relying only on the content of the claim, FCNLP
cannot provide the source guarantee and is limited
to global counter-evidence, which only accounts
for 20% of misinformation claims analyzed in the
previous section.
Current FCNLP fails to provide source guaran-
tees.
We note that providing the source guarantee
goes beyond entity disambiguation, as required in
FEVER (Thorne et al.,2018). The self-contained
context within claims in FEVER is typically suffi-
cient to disambiguate named entities if required.
6
After disambiguation, the retrieved evidence serves
as global counter-evidence.
6
In the claim “Poseidon grossed $181,674,817 at the world-
wide box office on a budget of $160 million” it is clear that
“Poseidon” refers to the film, not an ancient god. (FEVER)
Recent approaches further add context snippets
from Wikipedia (Sathe et al.,2020) or dialogues
(Gupta et al.,2022) to resolve ambiguities and can-
not provide the source guarantee to break the circu-
lar reasoning problem. These snippets differ from
the context used by professional fact-checkers who
often need to trace claims and their sources across
different platforms. Recently, Thorne et al. (2021)
annotate more realistic claims w.r.t. multiple evi-
dence passages. They found supporting and refut-
ing passages for the same claim, which prevents
the prediction of an overall verdict. Some works
collect evidence for the respective claims by identi-
fying scenarios where the claimant’s source is nat-
urally provided: such as a strictly moderated forum
(Saakyan et al.,2021), scientific publications (Wad-
den et al.,2020), or Wikipedia references (Sathe
et al.,2020). However, such source evidence is
only collected for true claims. Adhering to the
global counter-evidence assumptions of previous
work, false claims in these works are generated
artificially and do not reflect real-world misinfor-
mation.
3.3 Human and NLP Comparison
Our analysis (Table 2) finds fact-checkers only
refuted 26% of false claims with global counter-
evidence. In all other cases, fact-checkers relied on
source guarantees (LCE, NCS) or asserted that no
supporting evidence exists (NEA). The verification
strategy is not evident given the claim alone but
dependent on existing evidence. The claim that
“President Barack Obama’s policies have forced
many parts of the country to experience rolling
blackouts” is refuted via global counter-evidence
(that rolling blackouts had natural causes). The
claim that “90% of rural women and 55% of all
women are illiterate in Morocco” seems verifiable
via official statistics. Yet, no comparable statistics
exist and the claim is refuted due to relying on a
decade-old USAID request report.
We further analyze claims refuted via global
counter-evidence, that FCNLP, in theory, can re-
fute. Some claims only require shallow reasoning
as directly contradicting evidence naturally exists:
A transcript of an interview in which Ron DeSantis
was asked about the coronavirus can easily refute
the claim “Ron DeSantis was never asked about
coronavirus”. Another case is when information
about the claim’s veracity already exists, e.g., be-
cause those affected by the myth already corrected