
SentBS: Sentence-level Beam Search for Controllable Summarization
Chenhui Shen ∗1,2 Liying Cheng ∗1,3 Lidong Bing††1Yang You2Luo Si1
1DAMO Academy, Alibaba Group 2National University of Singapore
3Singapore University of Technology and Design
{chenhui.shen, liying.cheng}@alibaba-inc.com
{l.bing, luo.si}@alibaba-inc.com youy@comp.nus.edu.sg
Abstract
A wide range of control perspectives have
been explored in controllable text genera-
tion. Structure-controlled summarization is re-
cently proposed as a useful and interesting re-
search direction. However, current structure-
controlling methods have limited effectiveness
in enforcing the desired structure. To address
this limitation, we propose a sentence-level
beam search generation method (SentBS),
where evaluation is conducted throughout the
generation process to select suitable sentences
for subsequent generations. We experiment
with different combinations of decoding meth-
ods to be used as sub-components by SentBS
and evaluate results on the structure-controlled
dataset MReD. Experiments show that all ex-
plored combinations for SentBS can improve
the agreement between the generated text and
the desired structure, with the best method sig-
nificantly reducing the structural discrepancies
suffered by the existing model, by approxi-
mately 68%. 1
1 Introduction
Controllable text generation is receiving increas-
ing attention due to its wide range of applications.
Depending on the use cases, the controllable gen-
eration tasks may focus on a wide range of control
perspectives, such as entities (Narayan et al.,2022;
Fan et al.,2018a), aspects (Hayashi et al.,2021),
and keywords (Wang et al.,2021;He et al.,2020).
Recently, Shen et al. (2022) propose a sentence-
level labeled meta-review dataset, MReD, for the
controllable summarization task from a new control
perspective that focuses on controlling the struc-
ture of the output summary. The input consists of
several reviews on the same research paper, and a
∗
Chenhui and Liying are under the Joint PhD Program
between Alibaba and their corresponding universities.
†† Corresponding author.
1
Our code and data are available at
https://github.com
/Shen-Chenhui/SentBS.
control sequence specifying the desired summary
structure. For instance, with a control sequence of
“abstract | strength | decision”, the generated output
should be composed of a sentence that summarizes
the contents of the paper, followed by a sentence
discussing the strengths, then the last sentence giv-
ing the final decision.
Previous controllable summarization models are
commonly fine-tuned on pre-trained transformer
architectures (Vaswani et al.,2017) such as BART
(Lewis et al.,2020) and Pegasus (Zhang et al.,
2020a), with the control signals merged into the
text input or prompts (Shen et al.,2022;Narayan
et al.,2022;He et al.,2020;Keskar et al.,2019;
Fan et al.,2018a). Previous works mainly focus
on improving the summary’s similarity with the
gold reference, leaving room for further improve-
ment on the controllability. In particular, the best-
performing model on the recently released MReD
dataset still generates around 29% of the sentences
that do not follow the control structure
2
, which is
far from satisfactory.
In this paper, we explore how to enhance the
structure-controllability in summarization. Specif-
ically, we notice the following possible pitfalls in
the existing summarization models. First, those
models usually treat generation as a standalone
process, which continuously generates the tokens
solely based on the logits predictions, without stop-
ping to reconsider whether the generated sequences
satisfy the control signals. Moreover, autoregres-
sive models can suffer from error propagation in
generation due to self-attention (Vaswani et al.,
2017). Therefore, if the previous sequences are not
well-controlled, subsequent generations may devi-
ate further from the desired output. Motivated by
this, we propose the
Sent
ence-level
B
eam
S
earch
(SentBS) method to address the identified issues.
2
For instance, the generated sentence may discuss the
weakness of the paper even though it corresponds to a
“strength” control label.
arXiv:2210.14502v3 [cs.CL] 24 Feb 2023