2
are not known, but estimated, such that small domain models provide an opportunity to en-
hance the quality of both point and variance estimates through their joint estimation since
they are typically correlated.
Bayesian models for continuous data point and variance estimates are easily designed such
that the mean of the marginal likelihood for the estimated variances represents a denoised
“true” variance. The true variance, in turn, is set to be the “generating” variance for the noisy
point estimate in its likelihood centered around the estimated true mean value (Sugasawa
et al. 2017); for example, suppose vdrepresents the estimated sampling variance for domain
d∈(1,...,N)associated with continuous response, yd. In the case of unknown, latent true
domain variance, σ2
d, one may impose a likelihood, vd|σ2
d
ind
∼f(σ2
d)with mean σ2
d. One
typically chooses the conditional likelihood, yd|θd, σ2
d
ind
∼ N(θd, σ2
d), where N(·)denotes
the normal distribution. We see that the variance in the conditional likelihood for continuous
ydis readily and naturally set to equal the latent true variance, σ2
d. This connection between
the point and variance estimates where the true modeled variance is set as the generating
variance of the noisy point estimate ties together the likelihoods for the point and variance
estimates in a single model framework.
We are not aware of a (small area) model for count data in the small estimation literature
where the estimated sampling variance is modeled jointly with the direct point estimate such
that the generating variance of the direct point estimate is set equal to the mean of estimated
sampling variance (where we interpret the mean as the latent “true" domain variance). In their
recent comprehensive review article of small estimation methods, Sugasawa & Kubokawa
(2020) note the possibility to model the point estimate with a non-normal distribution and
more broadly discuss the use of generalized linear models. They do not, however, explicate
a count data model that incorporates estimated variances. Similarly, Rao & Molina (2015)
discuss over-dispersed Poisson models for count data, but none that incorporate estimated
domain variances. Tzavidis, Nikos and Ranalli, M Giovanna and Salvati, Nicola and Dreassi,
Emanuela and Chambers, Ray (2015) develop a Poisson small area model for count data, but
assume the domain variance is equal to the mean of the Poisson likelihood for the domain
point estimate. So, they do not input domain variances, at all.
The literature does, however, provide a recent example where Bradley et al. (2016) con-
struct a joint model for geographically-indexed point and variance estimates under a count
data response. They define a Poisson likelihood such that the model conditional variance
(of the point estimate) is defined as Var (yd|xd) = exp (λd)for count data response, yd,
associated to domain d∈(1,...,N); where xdis a set of covariates and λdis the log-
mean parameter. By contrast, under a normal likelihood with mean λdand variance ϕd
for logarithm, log(vd), of true variance vdof yd, the associated mean is E(vd|xd) = σ2
d=
exp λd+σ2
d/26=Var (yd|xd). Although Bradley et al. (2016) utilize a random effect by
specifying a likelihood for the log-variance, this construction does not produce a true variance
that is equal to the generating variance of the count data response. All to say, the literature
is more limited for small area models for count data that incorporate estimated variances and
there are no implementations to our knowledge that ensure the σ2
d=Var (yd|xd). Perhaps
the reason for the limited literature focused on count data models for small area estimation is
that for many datasets domain level counts are sufficiently large to approximate with a con-
tinuous data distribution, though we have mentioned some data examples above with count
data variables that express low counts for some domains.
This paper, by contrast, constructs a joint model for a count data point estimate ydand
its estimated variance vdwhere the modeled true variance σ2
d(the mean of the conditional
likelihood for vd) is set equal to the variance Var(yd|xd)of the point estimate likelihood.
We extend a multiplicative random effect in our model specification for the point estimate
as suggested by Zhou et al. (2012) for non-survey data where there is no associated variance