2
ically address the role of local migration as a driving
factor in cholera transmission. Our goal is to build a
meta-population model with a minimal number of as-
sumptions which accounts for migration, and understand
if the two observers agree in their predictions. The bridg-
ing and common constraints on both sources of data will
be achieved using yet another source of independent data
such as high-fidelity estimation of migration flows.
Our modeling choices are aimed at simplicity and
driven by the overall goal of checking the consistency be-
tween these data sources. For instance, we deliberately
take an agnostic stance on the open questions related to
the role of the environment or details of bacterial dynam-
ics: while several previous studies explicitly included the
environmental compartment [17–21] leading to a larger
number of model parameters, we choose to model cholera
dynamics as an effective transmission process which in-
cludes a periodic functional dependence on the season-
ality, similarly to the approach of [22]. To account for
discreteness in observed cases, we propose a novel sam-
pling model which relates the continuous model with the
discrete observed case counts.
Most of key model parameters will be directly inferred
from case counts and migration data. Further, a sub-
set of most important parameters such as transmission
amplitudes and fraction of asymptomatic infections are
independently inferred from the genomic sequence data,
and compared to models inferred from other data sources
within their uncertainties. In particular, we don’t make
any a priori quantitative assumptions on the fraction
of asymptomatic infections, in previous studies ranging
from 1% to more than 90% of the population [23–25].
Instead, we keep this important model parameter free,
infer its values from data under different settings, and
discuss the sensitivity of this parameter to various mod-
eling assumptions. We also provide a series of careful
sensitivity studies that study the stability of the inferred
parameters related to all of our modeling assumptions.
In this paper, we present initial evidence that phylody-
namic methods can be used to study cholera outbreaks
at a regional level and that they produce parameter esti-
mates that are consistent with established methods. Our
approach provides a common methodology for an early
analysis of the model viability in the context of joint in-
ference from different data sources. Given the comple-
mentary view offered by independent data sources, we
anticipate that the analysis presented in this paper will
find a widespread use in building joint hybrid epidemi-
ological and genetic models which could help verify the
main modeling assumptions.
RESULTS
Integrated data from case counts, genomics,
and transportation data. Cholera was first reported
in Argentina in 1992, and subsequent cholera cases were
reported until 1998 [26–30]. Out of the total 4,281 cases
reported, over 3,500 Vibrio cholerae isolates were stored
at INEI-ANLIS “Dr. Carlos G. Malbr´an”, the national
reference laboratory for Argentina, and a representa-
tive sub-sample of 532 of these isolates were previously
whole-genome sequenced [16], see Supplementary Mate-
rials, section A for more details. We sought to deter-
mine if there was agreement between epidemiological and
genomic data. First, we pre-processed the data set by
removing cities with insufficient genomic samples (less
than 40 sequences). This left us with three target cities:
Tartagal, San Ram´on de la Nueva Or´an (both in Salta
province) and San Salvador de Jujuy (in Jujuy province)
located within in the Northwest of Argentina (see Fig.
1). Initial reports in 1992 indicated that cholera was
first introduced into Argentina via this region from Bo-
livia, leading to a large outbreak from 1992-1993 [31].
In addition to the epidemiological and sequence data,
we used publicly available data on domestic travel to es-
timate the movement of population between these three
cities during the study period. Focusing on the two pri-
mary means of transportation, flights and buses, we were
able to estimate the typical number of people travelling
daily between the selected cities (for details see Materials
and Methods and Supplementary Materials, section B).
Modeling assumptions. We modeled the cholera
transmission dynamics using a system of ordinary dif-
ferential equations (ODEs) where the population is split
into compartments representing individuals in different
states of infection. Typical cholera models are a sys-
tem of ODEs representing a modification of the clas-
sical Susceptible-Infected-Recovered (SIR) type model
[32] with varying degrees of complexity (see [17–22, 33]).
Here, we present a new, simple ODE cholera model that
significantly advances estimation of key epidemiological
parameters in two ways: (i) we focus on a minimalist
representation which allows us to reliably infer model pa-
rameters from a limited amount of data while introducing
the least amount of assumptions; and (ii) we use a meta-
population structure to leverage the spatial knowledge
on reported cases and travel patterns that can represent
the major spreading mechanism. We also do not con-
sider re-infection in our models of localized outbreaks, as
protective immunity against cholera has been estimated
to last at least 3 years [2]. Prior to formally introduc-
ing our dynamic model, we discuss the main modeling
assumptions behind our approach.
Many cholera models in the literature include an envi-
ronmental compartment [17–21]. Such an environmental
compartment is typically introduced to explicitly model
the transmission of infection through a water source, and
additionally describes the evolution of bacteria in a water
source with a temperature-dependent dynamics. From
the fitting perspective, an environmental component may
have a benefit to help the multi-year epidemic outbreak
(see the span of observed cases in the Supplementary Ma-
terials, Fig. S1) survive the period of cool temperatures
when number of cases drop significantly, and re-occur
when the temperature rises. During our initial model