
System Summary Document
Gold Kevin Pietersen was sacked by England 14 months ago after Ashes defeat. Batsman scored
170 on his county cricket return for Surrey last week. Pietersen wants to make a sensational
return to the England side this year. But Andrew Flintoff thinks time is running out for him to
resurrect career. (ROUGE-1, ROUGE-2, ROUGE-L, METEOR, BERTScore)
Andrew Flintoff fears Kevin Pietersen is
’running out of time’ to resurrect his England
career. The dual Ashes-winning all-rounder
is less convinced, however, about Pietersen’s
prospects of forcing his way back into Test
contention. Kevin Pietersen scored 170 for
Surrey in The Parks as he bids to earn a recall
to the England squad... ... Flintoff senses he
no longer has age on his side. Pietersen has
not featured for England since he was
unceremoniously sacked 14 months ago. ... ...
Flintoff said ... ’If he’d started the season last
year with Surrey, and scored run after run and
put himself in the position... whereas now I
think he’s looking at the Ashes ... ... you get
the sense everyone within the England set-up
wants him as captain,’ he said.’ ... The former
England star is hoping to win back his Test
place with a return to red ball cricket. ... ...
’this stands up as a competition.’
Good (Liu
and Liu,
2021)
Kevin pietersen scored 170 for surrey against mccu oxford. Former england star andrew flintoff
fears pietersen is ’running out of time’ to resurrect his england career. Pietersen has been
surplus to requirements since being sacked 14 months ago. Flintoff sees a bright future for
’probably the premier tournament’ in this country. (55.45, 18.18, 41.58, 40.03, 85.56)
Broken Andrew Flintoff fears Kevin Pietersen is running out of time to resurrect his England ca-
reer Flintoff. Pietersen scored 170 for Surrey in The. Former England star Andrew. bats-
man has been . since being sacked 14 months ago after. three in the. the Ashes and he s.
>(56.84,21.51,44.21,47.26, 85.95)
A dot .(0, 0, 0, 0, 88.47)
Scrambled
code
\x03\x18$\x18...\x03$\x03|...\x0f\x01<<$$\x04...\x0e \x04#
$...\x0f\x0f\x0f...\x0e...\x0f...\x0f\x0f$\x0f \x04\x0f\x0f
(many tokens omitted) (0, 0, 0, 0, 87.00)
Scrambled
code +
broken
\x03\x18$\x18...\x03$\x03|...\x0f\x01<<$$\x04...\x0e \x04#
$...\x0f\x0f\x0f...\x0e...\x0f...\x0f\x0f$\x0f \x04\x0f\x0f...
Andrew Flintoff fears Kevin Pietersen is running out of time to resurrect his England career
Flintoff. Pietersen scored 170 for Surrey in The. Former England star Andrew. batsman has
been . since being sacked 14 months ago after. three in the. the Ashes and he s. (many tokens
omitted) (56.84,21.51,44.21,47.26, 87.00)
Table 1: We created non-summarizing systems, each of which produces bad text when processing any docu-
ment. Broken sentences get higher lexical scores; non-alphanumeric characters outperform good summaries on
BERTScore. Concatenating two strings produces equally bad text, but scores high on both. The example is from
CNN/DailyMail (for visualization, document is abridged to keep content most consistent with the corresponding
gold summary).
•
We are the first to treat automatic summariza-
tion scoring as an NLU regression task and
perform evasion attacks.
•
We are the first to perform a universal,tar-
geted attack on NLP regression models.
•
Our evasion attacks support that it is not dif-
ficult to deceive the three most popular auto-
matic scoring systems simultaneously.
•
The proposed attacks can be directly applied
to test emerging scoring systems.
2 Related Work
2.1 Evasion Attacks in NLP
In an evasion attack, the attacker modifies the input
data so that the NLP model incorrectly identifies
the input. The most widely studied evasion at-
tack is the adversarial attack, in which insignificant
changes are made to the input to make "adversar-
ial examples" that greatly affect the model’s out-
put (Szegedy et al.,2014). There are other types of
evasion attacks, and evasion attacks can be classi-
fied from at least three perspectives. (1) Targeted
evasion attacks and untargeted evasion attacks (Cao
and Gong,2017). The former is intended for the
model to predict a specific wrong output for that ex-
ample. The latter is designed to mislead the model
to predict any incorrect output. (2) Universal at-
tacks and input-dependent attacks (Wallace et al.,
2019;Song et al.,2021). The former, also known
as an "input-agnostic" attack, is a "unique model
analysis tool". They are more threatening and ex-
pose more general input-output patterns learned by
the model. The opposite is often referred to as an
input-dependent attack, and is sometimes referred
to as a local or typical attack. (3) Black-box attacks
and white-box attacks. The difference is whether
the attacker has access to the detailed computation
of the victim model. The former does not, and the
latter does. Often, targeted, universal, black-box
attacks are more challenging. Evasion attacks have
been used to expose vulnerabilities in sentiment
analysis, natural language inference (NLI), auto-
matic short answer grading (ASAG), and natural
language generation (NLG) (Alzantot et al.,2018;
Wallace et al.,2019;Song et al.,2021;Filighera
et al.,2020,2022;Zang et al.,2020;Behjati et al.,
2019).
2.2 Universal Triggers in Attacks on
Classification
A prefix can be a universal trigger. When a prefix
is added to any input, it can cause the classifier to
misclassify sentiment, textual entailment (Wallace
et al.,2019), or if a short answer is correct (Fil-
ighera et al.,2020). These are usually untargeted
attacks in a white-box setting
2
, where the gradients
of neural models are computed during the trigger
2
When the number of categories is small, the line between
targeted and non-targeted attacks is blurred, especially when
there are only two categories.