with MBTI labels of four dimensions. In compar-
ison, the only existing related dataset (Flekova
and Gurevych,2015) contains only 298 charac-
ters and focuses on a single dimension. Our
dataset is proved challenging — on this binary
classification task, none of the baselines achieve
higher than 60% macro-F1.
•
We develop a movie script parser to automati-
cally process a script to a structured form with
the verbal character dialogues and the non-verbal
scene descriptions illustrating backgrounds. Hu-
man study shows that our parser is more accurate
compared to previous rule-based tools.
•
We propose an extension to BERT classifier (De-
vlin et al.,2018) to handle the long and multi-
view (verbal and non-verbal) inputs. Our model
improves 2-3% over the baselines. This shows
the potential of exploiting both verbal and non-
verbal narratives of characters, which is consis-
tent with psychological theory (McCroskey and
Richmond,1996;Richmond et al.,2008); and
suggests directions of future model design.
2 Related Work
Character-Centric Narrative Understanding
There have been existing studies on character-
centric narrative understanding. While many of
them (Massey et al.,2015;Srivastava et al.,2016;
Brahman et al.,2021) work on summaries of sto-
ries or summaries of characters. Their scopes thus
have a different assessment purpose from ours, and
have the challenge on understanding long narrative
inputs greatly reduced.
For works that use long narratives, most of them
study the inter-character relationship (Elson et al.,
2010;Elsner,2012;Elangovan and Eisenstein,
2015;Iyyer et al.,2016;Chaturvedi et al.,2016,
2017;Kim and Klinger,2019). Inter-character re-
lationship is also related to social network theories.
Various of relationships have been considered in
these studies, while most of them rely on unsuper-
vised learning and do not provide labeled data for
a direct automatic evaluation. TVSHOWGUESS
explored multiple perspectives of persona using
long narratives but the task format is different from
us (Sang et al.,2022b).
Finally, there is work on fundamental NLP anno-
tating techniques over books and screenplays, such
as named entity recognition (Bamman et al.,2019),
coreference resolution (Chen and Choi,2016),
event-centric extraction (Xu et al.,2022a), and
entity-centric natural language modeling (Clark
et al.,2018) which is different from narrative un-
derstanding. Their techniques can be helpful to our
task but the scope of their research is different from
character-centric comprehension.
Latent Persona Induction
Besides (Flekova
and Gurevych,2015) that is similar to our work
in terms of the focus on personality classification,
there is another line of related work on latent per-
sona induction (Bamman et al.,2013). The work
learns a topic model over character behaviors from
books, and each latent topic corresponds to an in-
duced persona. The induced persona vectors can
be then applied to potential applications as a type
of character representation.
From the perspective of practicality, our work
and (Bamman et al.,2013) have their own strengths.
From our motivation of story comprehension as-
sessment, the difference is whether we provide a
direct evaluation of the character understanding or
evaluate it in down-streaming tasks – similar to the
aforementioned relationship detection work, it is
also difficult to provide an automatic and objective
evaluation for the task of (Bamman et al.,2013).
The advantage of our task is that it supports direct
automatic evaluation by itself, without the need for
further downstream tasks; and it can be also used to
evaluate the methods for the task of (Bamman et al.,
2013). Moreover, compared to a direct evaluation,
the performance on a down-streaming task can be
affected by other factors other than persona so a
good performance on downstream tasks may not
come directly caused by a good persona represen-
tation. The cons of our task is that it is limited to
the personality types that have human annotations.
3 Background of MBTI
Personality is a “stable and measurable” individ-
ual characteristic (Vinciarelli and Mohammadi,
2014) which can “distinguish internal properties
of the person from overt behaviors” (Matthews
et al.,2003). Understanding the personalities of
the characters is essential for grasping the story’s
greater message. The Myers–Briggs Type Indicator
(MBTI) (Myers,1962) and the Big-5 Personality
are two of the most popular personality scales. We
used MBTI as the annotation criteria since despite
some validity controversy in self-report measure-
ment, research shows that a person’s friend can
accurately judge his/her MBTI personality (Cohen
et al.,1981). In our narrative comprehension sce-