on a conflictual–cooperative scale to the action
categories defined by the Conflict and Media-
tion Event Observations (CAMEO) event coding
scheme (Schrodt,2012). CAMEO specifies
204
low-level event types which are summarized into
20
high-level action categories. The Goldstein
scale ranks “use unconventional mass violence”
and “fight” as the most conflictual of the 20 high-
level action categories (
−10.0
) and “provide aid”
(+7.0) as the most cooperative; see Tab. 1.
Despite its usage, the Goldstein scale has many
well-known shortcomings (King and Lowe,2003;
Schrodt,2019). In particular, it applies only to
action categories, and does not account for any
contextual information of a given event, like which
actors are involved, or how many fatalities resulted,
among other bits of context that should contribute
to the perception of an event’s “intensity”.
This paper takes a latent-variable based
approach to measuring conflict intensity. We
introduce a probabilistic generative model that
assumes each observed event
n
is associated with
a latent intensity class
zn
. A novel aspect of this
model is that it imposes an ordering on the classes,
such that higher values of
zn
denote higher levels
of intensity. The ordinal nature of
zn
is induced
from naturally ordered aspects of the data (e.g.,
casualty counts) where higher values naturally
indicate higher intensity. The model effectively
learns to interpolate the ordered (i.e., cardinal
or ordinal) elements of the data while inferring
correlation structure with the non-ordered (e.g.,
categorical) elements of the data (e.g., actor types).
We start with a discussion of the Goldstein scale
and introduce a political event dataset annotated
with Goldstein values in §2and §3. Then, we
propose our model with an ordinal latent variable
in §4. We evaluate the performance of the model
intrinsically (§5) and extrinsically (§6) and find that
it improves over measures based on the original
Goldstein scale or heuristics based on the raw data.
2 Limitations of the Goldstein Scale
The Goldstein scale is a widely-used measure of the
conflictual versus cooperative nature of interactions
between countries (Goldstein,1992). The scale
was created by a panel of international relations
experts who ranked descriptions of interactions. It
was initially created to score action categories in
the WEIS event ontology (McClelland,1984) and
was later adapted to CAMEO (Schrodt,2012).
The Goldstein scale applies only to the action
category of an event (e.g., “fight” or “trade”). Thus,
two different “fight” events, which might involve
two different pairs of actors, occur at different
times, or differ dramatically with respect to the
number of associated fatalities, will still be as-
signed the same Goldstein value. The Goldstein
scale is thus a poor measure of a conflict’s per-
ceived “intensity”, as it ignores much of the infor-
mation that contributes to that perception. Recent
work in conflict studies, for instance, operational-
izes “intensity” primarily using casualty counts
(Chaudoin et al.,2017;Zhong et al.,2023), which
the Goldstein scale ignores entirely.
In Tab. 1, we show the empirical distribution
of assigned Goldstein values alongside the empir-
ical distribution of casualty counts in a dataset of
conflict events. The Goldstein scale is very coarse-
grained; while it ostensibly ranges between
−10.0
and
+10.0
, only a small number of discrete values
ever occur, with many action categories assigned
the same value. For the purpose of measuring con-
flict intensity, a finer-grained and more contextual
scale is desirable.
3 Conflict Event Data
This paper considers the publicly available Non-
violent and Violent Campaigns and Outcomes
(NAVCO) data collection (Chenoweth et al.,2018),
specifically, the latest release NAVCO 3.0 from
November 2019 which comprises
N= 112,089
events between December 1990 and December
2012. An exemplary event description is “On
19 May 2012, soldiers injured two civilians in
Afghanistan”. Each part of this description has
been parsed by human coders into standardized,
structural features. We color-code the features that
correspond to the semantic roles subject,predicate,
quantifier,object, which are the focus of our mod-
eling approach. Each data point
n
thus consists
of a four-element tuple
{sn,pn,qn,on}
. We note
that events are further coded for their location (in
this case, Afghanistan) and time (19 May 2012),
among other bits of contextual information. Let us
discuss each feature in more detail:
Subject
sn
.NAVCO contains columns termed
“actor3”, “actor6” and “actor9” which code for the
subject (or agent) of a given action. The actor types
are defined by the CAMEO actor codebook. We
first merge the higher-level categories “actor3” and
“actor6”, resulting in
33
different actor types, and
2