
form new practices and interventions that seek to improve
the political news media ecosystem. Quotatives, in this con-
text, are a powerful instrument to measure bias. Studying
how journalists deviate from the standard usage of quota-
tives – e.g., “say” and “tell” (The Associated Press 2020) –
allows researchers to quantitatively assess adherence to jour-
nalistic objectivity (Lee 2017) and reveal biases in journal-
istic coverage of politics (Gidengil and Everitt 2003).
Present work. This paper analyzes quotatives to study ob-
jectivity and media bias in political journalism. We ask:
•RQ1 How has the usage of nonobjective quotatives
evolved in U.S. political journalism?
•RQ2 How do news outlets use nonobjective quotatives
when covering politicians of different parties?
To answer these research questions, we developed a method-
ology to extract quotatives from a large-scale news corpus.
We then performed a comprehensive study on how (and
which) quotatives are used in direct quotes from U.S. politi-
cians between 2013 and 2020, leveraging a large dataset of
quotes from English-language media linked with relevant
speaker metadata (Vaucher et al. 2021) and enriched with
the political leanings of different U.S. outlets. By counting
the usage of nonobjective quotatives like “shout” or “assert,”
we analyze how U.S. news outlets of different political incli-
nations adhere to basic journalistic objectivity principles and
how this adherence has evolved. Further, analyzing how out-
lets of different political inclinations use quotatives to talk
about politicians of different parties, we study the evolution
of quotative bias in news outlets.
Summary of findings. We find that the usage of nonob-
jective quotatives varies across different outlet categories.
Overall, the more ideologically extreme an outlet is, the
more nonobjective quotatives it uses. However, we also find
that centrist outlets are experiencing a significant increase
in the usage of nonobjective quotatives over the last years
(about 0.6 percentage points, or 20% in relative percentage),
suggesting that they may be “catching up” to the more bi-
ased outlets, which are not experiencing such significant in-
creases (RQ1). We also find evidence of “quotative bias,”
i.e., outlets tend to use nonobjective quotatives, especially
when referring to politicians of opposing ideology. For in-
stance, left and right-leaning outlets use nonobjective quota-
tives up to 2% more often when referring to Republican and
Democrat politicians, respectively (RQ2). Last, we find that
this quotative bias is increasing at a swift pace, increasing as
much as 0.5 percentage points per year in absolute percent-
age, or 25% in relative percentage, for left-leaning outlets,
suggesting a rapid increase in polarization (RQ1 and RQ2).
Implications. Our findings indicate a decline in journalis-
tic objectivity in U.S. political news, particularly from cen-
trist outlets. This suggests that centrist outlets may play a
role in the increasingly less respectful and fact-based debate
around politics (Doherty et al. 2019). Further, we also find
evidence of an increasing quotative bias, which could further
erode trust in the media (Brenan 2022).
2 Background and Related Work
When a quote occurs in the news, three elements are typi-
cally involved: the source, i.e., the speaker who uttered the
quote (underlined in the examples); the quoted content itself
(in italic); and the quotative that introduces the quote (also
known as a cue, reporting verb, speech verb, or attribution
verb; in bold). Quotes can be classified as either direct, in-
direct, mixed, or pure (Cappelen and Lepore 1997), where
only the former three types are typically of concern to jour-
nalists. We give an example of a direct quote in the introduc-
tion and of mixed and indirect quotes below.
Indirect quote: Sen. Ron Wyden of Oregon, the chairman of
the Senate Finance Committee, indicated that in 2019, about
100 to 125 corporations reported financial statement income
greater than 1B USD.
Mixed quote: Catsimatidis said he’d serve for 99 cents “be-
cause I’m a grocer.”
In direct and mixed quotes, a pair of quotation marks
are used, and we can infer that the speaker uttered the
quoted words, whereas indirect quotes may paraphrase the
speaker’s words. Therefore, journalists have the most free-
dom in word choice in indirect quotes, as they can, to some
extent, rewrite what the speaker said. In contrast, in direct
quotes, journalistic norms require them to report the quoted
words verbatim (Harry 2014).
In our work, we focus on direct quotes for the fol-
lowing reasons: Quotebank does not contain indirect
quotes (Vaucher et al. 2021); automatically extracting the
quotative in mixed quotes is technically challenging and,
in some instances, impossible as there is no quotative, e.g.,
John will not help as he has “done more for this house than
all of us combined”.
Measuring media bias. Previous work has studied me-
dia biases: how journalists’ and editors’ personal opinions,
beliefs, and financial incentives shape what is considered
newsworthy (McCombs and Shaw 1972) and how issues are
covered (Iyengar 1994). Scholars argue that partisan media
bias can harm democracy by distorting citizens’ political
knowledge and increasing polarization (Bernhardt, Krasa,
and Polborn 2008; Boudana 2011; McNair 2017). Thus,
measuring media bias is the first step to improving our in-
formation ecosystem (Watts, Rothschild, and Mobius 2021).
Early studies in media bias required extensive manual an-
notation. For instance, Kobre (1953) studied how the press
in Florida covered the U.S. 1952 presidential campaign by
coding the number of inches of text given to each party,
the position of pictures, etc., across hundreds of newspa-
pers. However, in recent times and with the digitization of
news, various methods have been developed to automat-
ically measure media bias (Hamborg, Donnay, and Gipp
2019). Some of these methods are audience based, mea-
suring how segregated news consumers are across outlets,
e.g., Zhou, Resnick, and Mei (2011) use votes on Digg, a
social news aggregator, to classify political articles. Others
are content based, quantifying media bias by analyzing pub-
lished content directly. For instance, Gentzkow and Shapiro
(2010) measured bias using the frequency at which outlets
reproduce partisan phrases in congressional speeches.