Tester, 2011], wearable sensors [Haleem et al., 2021, Sharma et al., 2021, Neethirajan, 2017],
metabolomics for physiological measurements [Jin et al., 2020, Freimer and Sabatti, 2003],
and computer vision for morphometrics [L¨urig et al., 2021] are now within reach. How-
ever, incorporating these high-dimensional phenotype data into genetic analyses remains a
challenge.
Multivariate studies of genetic variation focusing on relatively small sets of traits p20
(small p) have transformed our understanding of how genetic variation is distributed across
phenotypes, and how this affects evolutionary outcomes. While almost all individual traits
studied have been shown to have genetic variation [Lynch et al., 1998], and responses to
artificial selection are often rapid and of large magnitude [Hill and Kirkpatrick, 2010], mul-
tivariate studies of the genetic variance-covariance matrix (G) show that this genetic varia-
tion is distributed unevenly across multivariate phenotypes [Kirkpatrick, 2009, Sztepanacz
and Houle, 2019]. The concentration of genetic variance onto fewer multivariate trait com-
binations than the number of phenotypes measured is biologically caused by the pleiotropic
effects of alleles on multiple traits which leads to their genetic covariance [Lande, 1980].
Multivariate trait combinations with high genetic variation form a genetic subspace where
traits are predicted to have high evolvability. Evolution is predicted to occur more quickly
along these multivariate trait combinations than in any individual trait [Agrawal and Stinch-
combe, 2009], contributing to divergence among populations (eg. [McGlothlin et al., 2022,
Schluter, 1996]), species (eg. [Innocenti and Chenoweth, 2013, B´egin and Roff, 2004]), and
sexes (eg. Gosden and Chenoweth [2014]). The set of orthogonal multivariate trait com-
binations with low genetic variation, form a complementary subspace which is termed the
nearly-null genetic subspace [Gomulkiewicz and Houle, 2009, Gaydos et al., 2013]. This
subspace putatively represents important evolutionary constraints in natural populations,
where phenotypic evolution is expected to be constrained [Gomulkiewicz and Houle, 2009],
occur slowly [Kirkpatrick, 2009] or stochastically [Hine et al., 2014].
Estimating the size of the nearly-null genetic subspace is an avenue to quantify both
genetic constraints that may lead to evolutionary limits, and the extent of pleiotropy under-
lying organisms, which determines their genetic dimensionality. Past studies have typically
found that genetic variance is restricted to less than half of the phenotype space with
most multivariate trait combinations having no detectable genetic variation [Blows and
McGuigan, 2015] . One notable exception to this pattern is Drosophila wing shape, which
as been shown to have genetic variation in all multivariate wing shape traits (ie. a full rank
G) [Mezey and Houle, 2005, Sztepanacz and Blows, 2015, Sztepanacz and Houle, 2019,
Houle and Meyer, 2015]. These multivariate studies have typically dealt with small sets
of functionally related traits such as wing [Mezey and Houle, 2005], skeletal [Garcia et al.,
2014] or phyto [Walsh and Lynch, 2018] morphology, or traits that have a strong physio-
logical [Caruso et al., 2005], or biochemical relationships [Sztepanacz and Rundle, 2012].
Therefore, genetic covariance among these traits may be relatively high compared to the
range of traits that are possible to study with phenomics. Whether inferences for the size
of the nearly null subspaces found in these small pstudies can be extrapolated to phenomes
is an open biological question.
Confounded with the biological phenomena that nearly-null genetic subspaces represent,
are the statistical phenomena that arise from their estimation. Even in low pstudies, esti-
mating Gand its eigenvalues is a challenge. The standard approach is to fit a multivariate
mixed model in REML to estimate Gas an unstructured covariance matrix. These mod-
els commonly fail to converge with only a small number of traits, and run-times are long
because of the quadratic growth ∼p2/2 in the number of parameters to estimate with the
2