
CHAE: Fine-Grained Controllable Story Generation with Characters,
Actions and Emotions
Xinpeng Wang1, Han Jiang1, Zhihua Wei1∗
, Shanlin Zhou2
1Department of Computer Science and Technology, Tongji University, Shanghai, China
2School of Computer Science and Technology, Shanghai University of Electric Power, Shanghai, China
{wangxinpeng, 2230780, zhihua_wei}@tongji.edu.cn
zhoushanlin@mail.shiep.edu.cn
Abstract
Story generation has emerged as an interest-
ing yet challenging NLP task in recent years.
Some existing studies aim at generating flu-
ent and coherent stories from keywords and
outlines; while others attempt to control the
global features of the story, such as emotion,
style and topic. However, these works focus
on coarse-grained control on the story, neglect-
ing control on the details of the story, which
is also crucial for the task. To fill the gap,
this paper proposes a model for fine-grained
control on the story, which allows the gen-
eration of customized stories with characters,
corresponding actions and emotions arbitrar-
ily assigned. Extensive experimental results
on both automatic and human manual eval-
uations show the superiority of our method.
It has strong controllability to generate sto-
ries according to the fine-grained personalized
guidance, unveiling the effectiveness of our
methodology. Our code is available at https:
//github.com/victorup/CHAE.
1 Introduction
Story generation, one of emergent tasks in the field
of natural language generation, requires following
sentences given the beginning of the story. For hu-
man beings, it is believed that storytelling requires
strong logical thinking ability and organizational
competence, and for machines it may be even more
intractable. Nonetheless, works on story generation
can help machines communicate with humans and
drive improvements in natural language processing
(Alabdulkarim et al.,2021).
At present, most works on story generation fo-
cus on the coherence of the story generated ac-
cording to keywords, outlines and commonsense
knowledge (Yao et al.,2019;Guan et al.,2019;
Rashkin et al.,2020;Guan et al.,2020;Ji et al.,
2020). Some other works aim at generating sto-
ries controlled by overall emotion, style, and topic
∗Corresponding author
(Keskar et al.,2019;Xu et al.,2020;Brahman and
Chaturvedi,2020;Kong et al.,2021). However, in
reality, people often expect more detailed designs
catering to their needs rather than a simple theme or
topic in the generated story. For example, a novel
with more complete elements, i.e., plot, character,
theme, viewpoint, symbol, and setting is usually
preferred to those made up out of thin air.
Taking the control in story generation as the cut-
ting point, GPT-2 (Radford et al.,2019) can fulfill
the story according to the beginning, but the pro-
cess of generation cannot be controlled by people,
resulting in unlogical outputs that lack practical-
ity. CTRL (Keskar et al.,2019) can specify the
generation of articles with different styles through
some style words, but such control stays at the
coarse-grained level, and makes a relatively weak
influence. CoCon (Chan et al.,2020) introduces
natural language to guide text generation. Fang
et al. (2021) propose a new task that guides para-
graph generation through a given sequence of out-
line events. However, the above two studies just
explicitly add some contents to the generated sen-
tences, which is similar to forming sentences with
given phrases, not using the input as a condition
guide for the generative models. SoCP presented by
Xu et al. (2020) can generate stories under change-
able psychological state control, while it does not
govern the detailed contents of the story.
In this paper, we consider more fine-grained
control on story generation, and propose a model,
CHAE
for fine-grained controllable story gen-
eration, allowing the generation of stories with
customized
CH
aracters, and their
A
ctions and
E
motions. Characters are the core of the story.
Their actions drive the story along, and their emo-
tions make the story lively and interesting. Con-
sequently, we take the characters along with their
actions and emotions as the control conditions. It is
a challenge that our model needs to control multiple
characters with their actions and emotions respec-
arXiv:2210.05221v1 [cs.CL] 11 Oct 2022