MARGINAL INDEPENDENCE STRUCTURES UNDERLYING BAYESIAN NETWORKS 3
contain all UEC-representatives with a specified set of “source nodes” and pairwise
intersections and unions of their neighbors. A quadratic and square-free reduced
Gr¨obner basis for this toric ideal is identified (Theorem 3.2.9). By the Fundamental
Theorem of Markov Bases [
6
,
28
], the Gr¨obner basis gives a set of moves for exploring
the UEC-representatives within a fiber. In Section 4, we extend these moves via
additional binomial operations to a set of moves that completely connects the space
of all UEC-representatives on
n
nodes (Theorem 4.3.1). The resulting connectivity
theorem yields a method for exploring the space of UEC-representatives in the
language of binomials, which is applied in Section 6.1 to estimate the marginal
independence structure of a DAG model. This section uses classic results on Gr¨obner
bases and toric ideals. It makes use of the results derived in Section 2.
Complexity reduction. Using the algebraic methods developed in Sections 3
and 4, we obtain a search algorithm over the space of UEC-representatives on
n
nodes in the language of polynomials. However, the polynomials used in this search
are computationally inefficient when implemented directly. To reduce the complexity,
in Section 5.1, we introduce the DAG-reduction of a UEC-representative and prove
that the algorithm can be rephrased in terms of DAG-reductions so as to reduce
complexity. This reduction in complexity makes feasible an implementation of the
identified search method. To do this, it is shown that there exists DAGs in a given
UEC that are maximal in the UEC with respect to edge inclusion. We observe
that these maximal DAGs form a MEC contained within the UEC. It follows that
every UEC can be identified with a unique MEC of DAGs. The completed partially
directed acyclic graph (CPDAG) of this MEC is characterized, and then used to
produce the DAG-reduction of the UEC, which is more computationally efficient
than the UEC-representative in terms of both time and space complexity. The
results in this section are accessible to readers with knowledge of graphical models.
MCMC estimation of the marginal independence structure of a DAG
model. In Section 6the DAG reduction search in Section 5is implemented in
the form of a Markov Chain Monte Carlo method, called GrUES (Gr¨obner-Based
Unconditional Equivalence Search).
GrUES
can completely explore the space of
UEC-representatives, thereby making possible the identification of an optimal UEC-
representative for the data.
GrUES
also yields an estimate of the posterior distribution
of the UEC-representatives, allowing the user to quantify the uncertainty in the
estimated marginal independence structure.
In subsection 6.2, we apply
GrUES
to synthetic data generated from random linear
Gaussian DAG models to evaluate its performance empirically. It is benchmarked
against pairwise marginal independence testing, with performance evaluated for
varying numbers of nodes, graph sparsity and choices of prior, including a noninfor-
mative prior as well as a prior that allows the user to incorporate beliefs about the
number of source nodes in the data-generating causal system.
We observe that for relatively sparse or relatively dense models,
GrUES
successfully
identifies the marginal independence structure of the data-generating model at a
rate higher than that achieved via simple independence tests. Highest Posterior
Density (HPD) credible sets are also estimated that give relatively fine estimates
of the true UDG. These results suggest that
GrUES
provides an effective method
for the estimation of the marginal independence structure of a DAG model, while
allowing for the flexibility of incorporating prior knowledge about the causal system.