Human Population Genetics and Genomics ISSN 2770-5005
Human Population Genetics and Genomics 2025;5(4):0009 | https://doi.org/10.47248/hpgg2505040009
Original Research Open Access
The Quantitative Genetics of Human Disease: 3B Interactions—Non-Additivity and Missing Heritability
Kiana Jodeiry
1,2
,
Andrew J. Bass
2,3,†
,
Michael P. Epstein
2,3
,
David J. Cutler
2,3
Correspondence: David J. Cutler
Academic Editor(s): Carina Schlebusch, Lounès Chikhi
Received: Oct 30, 2024 | Accepted: Dec 19, 2025 | Published: Dec 30, 2025
© 2025 by the author(s). This is an Open Access article distributed under the Creative Commons License Attribution 4.0 International (CC BY 4.0) license, which permits unrestricted use, distribution, and reproduction in any medium or format, provided the original work is correctly credited.
Cite this article: Jodeiry K, Bass AJ, Epstein MP, Cutler DJ. The Quantitative Genetics of Human Disease: 3B Interactions—Non-Additivity and Missing Heritability. Hum Popul Genet Genom. 2025;5(4):0009. https://doi.org/10.47248/hpgg2505040009
The third section of an anticipated four paper series distinguishes two different forms of genetic interactions. The first paper of our discussion on genetic interactions described interactions arising from correlation between genotypic and/or environmental states. In this, the second paper, we describe interactions arising from non-additivity between uncorrelated factors (epistasis). We also discuss in detail the concept of "missing heritability.” While this phrase is sometimes used to mean what might more precisely be called "still unidentified heritable factors," this phrase also describes the observation that heritability when studied in close relatives almost always produces estimates significantly larger than when studied in distant relatives. While still unidentified heritable factors can be discovered via whole genome sequencing in ever increasing sample sizes, differing estimates of heritability from close versus distant relatives implies the existence of some form of interaction. Several types of interaction could explain this phenomenon. We conclude by focusing on a particular form of interaction that has been widely ignored, interactions caused by non-additivity arising from the cis-regulation of gene expression. By exploring varying patterns of two-locus haplotypic effects, we show that the existence of two or more variants each influencing the expression of the same gene can give rise to substantial non-additive interactions, and that those interactions can be particularly large when the variants are rare. Additive-by-additive genetic interactions induced by gene regulation have the potential to fully explain the observation of missing heritability.
Keywordsquantitative genetics, human disease, genetic interactions, missing heritability, gene regulation
The phenomenon known as “missing heritability" is most simply described as estimates of heritability differ between differing techniques used to make the estimate, where estimates from close relatives are almost uniformly larger than any other technique used to make that estimate. We will argue that this pattern is an indication that some form of interaction must exist, and we pay particular attention to higher-order additive-by-additive interactions caused by non-additivity between variants involved in the regulation of the same gene’s expression.
In general, non-additivity means that the average phenotype given a combination of alleles at two or more different loci is not the sum of the average phenotypes given each allele individually. Genetic risk for complex traits will often act via proximate changes in gene expression, structure, or function (i.e., genetically regulated gene expression or protein sequence), and complex trait-associated SNPs are significantly more likely to be expression quantitative trait loci than minor allele frequency-matched SNPs [1–3]. Multiple genetic variants located in regulatory elements such as enhancers and transcription factor binding sites (e.g., eQTLs) can act in concert to modulate the transcriptional activities of target genes that are nearby in a process known as cis-regulation [4]. We demonstrate that it is the cis-regulation of gene expression by multiple loci that naturally induces higher-order additive-by-additive interactions, which could contribute a substantial fraction of the "missing heritability” of complex traits.
To a significant extent, higher-order interactions have been widely ignored in genetic studies as it has not been clear under what circumstances (what models) we should expect higher-order interactions to exist. We describe models of cis-gene regulation that predict the existence of higher-order interactions and develop analytical bounds for the possible size of those interactions as a function of single-site properties (allele frequencies and additive/main effects). We explore several relationships among two-locus haplotypic effects representing cis-regulatory network patterns that include loss of function alleles, general gene regulation models where loci can act as enhancers or repressors, and regulatory mechanisms at the level of topologically associating domains (TADs) in the context of linkage disequilibrium and without LD. For these models, we derive the maximum possible higher-order additive-by-additive interaction variance and demonstrate that additive-by-additive interaction variance has the potential to be largest when the minor alleles at the loci are rare. Our aim is to provide a coherent explanation of one of the most basic questions in human genetics today: "Why do modern SNP-based estimators of heritability suggest that additive variance is much lower than familial studies predict?”
We begin with a reminder [5] of our Kempthorne [6] inspired definition of "effect.” The effect of being in any particular state of some factor, or any collection of states, is defined to be the average phenotype of individuals who are in that state or collection of states. The effect of an A1 allele is the average phenotype of individuals who possess the A1 allele. The effect of the combination of the A1 allele and environmental state e1 is the average phenotype of individuals who have the combination of the A1 allele along with environmental state e1. An interaction is said to exist whenever the effect of a combination of states is not equal to the sum of the individual state effects. It is helpful to distinguish two forms of interaction. Interactions can arise from correlation in the state of two factors, i.e., the chance of having allele A1 might be correlated with the chance of being in environmental state e1 – these were discussed in the previous paper in this series [7]. Alternatively, the state of factors might be independent, but the effect of two or more factor states is different from the sum of the individual state effects: the independent factors are "non-additive" – these are the interactions discussed in this paper. Interaction due to non-additivity of independent states is often called epistasis, but this term is far from consistently applied or used. Although asserted several times previously, here we show for the first time that non-additivity of independent factors only increases total variance.
In attempt to be as general as possible, imagine two factors, call them X and Y, that contribute to phenotype. These factors might be genetic or environmental with a discrete or continuous number of different states that they can attain. Let x and y be possible states of X and Y. Let ξX=x be the effect of factor X being in state x, and ξY=y be the effect of factor Y being in state y. Let
Associate with these effects random variables zX, zY, kX,Y, and dX,Y = kX,Y − (zX + zY) whose values are determined by the effects of being in the states of X and Y. Thus, if factor X is in state x and factor Y is in state y, then zX = ξX=x, zY = ξY=y, kX,Y = κXx,Yy, and
The key insight is that when the states of X and Y are independent, dX,Y is orthogonal to both zX and zY, and therefore the total variance (Var[kX,Y]) will be the sum of the individual variance components (Var[kX,Y] = Var[zX] + Var[zY] + Var[dX,Y]). To see this explicitly,
because of the independence of X and Y. To better understand why this is, note that
Because X and Y are independent, we have swapped the orders of integration to remove Y from E[zXkX,Y] and X from E[zYkX,Y]. Altogether this establishes the "simple” intuition that all quantitative geneticists have and which is taught to students. When the states of factors are independent, any interaction (non-additivity) causes an increase in total variance, and that variance is quantified by Var[dX,Y], which is the average squared deviation (
Focus now on factor X which can take on multiple states, x. Assume that the states of X are uncorrelated with the state of any other factor. Any non-additive interaction involving X causes differing variance in phenotype between the states of X. In general, if there are f other factors Yk, 1 ≤ k ≤ f which interact non-additively with factor X, and yk is a possible state for factor Yk, then
where the continuing sums are for all the higher order deviations from additivity. Intuitively, this is telling us that when residual variance in phenotype differs between differing states of the same factor, then there exists interactions involving that factor. When states of factors are uncorrelated with anything else, if there is significant variation in the variance in the phenotype given the states, then this factor must interact with at least one other thing. The variance in variance in phenotype is the sum of the variances of the average squared deviations from additivity between this factor and its interacting partners. In this manner, it is possible to test for "latent” interactions by testing for differences in the residual phenotypic variance between states of a factor (see Figure 1). If the states of a factor differ in their residual phenotypic variance then that factor interacts with something. Its interacting partner may be unidentified (i.e., latent), but such a partner(s) must exist. Therefore SNPs involved in non-additive interactions can be directly discovered by examining the variance in residual phenotypic variance given the genotype. Variance-based testing procedures, which do not require the interacting variable(s) to be identified, prioritize genetic variants associated with the variance of the phenotype (vQTLs) by detecting unequal residual phenotypic variance among genotype categories at a specific SNP (i.e., heteroskedasticity) [8–10]. Genome-wide screening procedures that identify a reduced set of variants with potential interaction effects can circumvent some of the challenges inherent in attempting to find interactions among tens of millions of factors [11].
Figure 1 Differing residual phenotypic variance between genotypic states due to an interacting variable. On the x-axis are the states of a factor (e.g., factor X), in this case genotypes at a locus. A and B are the states of another factor (e.g., factor Y).
In the deviation regression model (DRM) [10], vQTLs are identified by a linear regression performed between individuals’ numeric minor allele count at each SNP (the independent variable) and the absolute difference between an individual’s phenotypic value and the population phenotype medians within each genotype (the dependent variable) after covariate adjustment. The effect size and p-value for the minor allele count in the regression are used as proxies for the variance effect size and significance of association with phenotypic variance. These vQTLs are then screened for potentially strong interactions associated with the same phenotype by applying distinct linear models for each vQTL-interacting partner (e.g., environmental factor) pair containing a single interaction term [10].
In addition to heteroskedasticity, latent genetic interactions also induce a differential covariance pattern between traits known as a covariance QTL (covQTL) or correlation QTL. Latent Interaction Testing (LIT) [8] tests for loci with latent interaction effects by assessing whether the individual-specific trait variances and covariances differ by genotype, harnessing differential covariance patterns between traits to increase power. Similar to variance-based testing procedures, the interacting partner does not need to be specified and can represent any type of factor (i.e., genotype at another locus, age, sex, environmental variable). LIT tests the null hypothesis of no latent interaction by comparing the genotypes to the individual-specific variance and covariance estimates for the trait residuals (after regressing out additive genetic effects, measured covariance, and principal components to adjust for population structure) [8]. The test statistic employed by LIT measures the overlap between the adjusted genotype matrix and a matrix of the squared trait residuals (individual-specific trait variances) and cross products of the residuals (individual-specific trait covariances) through a kernel-based distance covariance framework.
POIROT (Parent-of-Origin Inference using Robust Omnibus Test) [9] tests for parent-of-origin effects (POE, effect of an allele on a trait depends on whether it was transmitted from the mother or the father) in genetic studies of unrelated individuals that consider multiple quantitative phenotypes. The authors demonstrate that a pleiotropic POE variant induces differences in the variance of POE traits between heterozygotes and homozygotes as well as in their corresponding covariances. POIROT tests for equality of phenotypic covariance matrices between heterozygous and homozygous groups, with a post hoc test that distinguishes variants with POE effects from variants demonstrating more general gene–gene/gene–environment effects (which also induce patterns of trait variance/covariance that differ by genotype) [9]. The variance of each homozygous group is the same after phenotype centering for a POE, in contrast to a general interaction effect in which trait variance/covariances differ between the two homozygous groups. Thus, the post hoc test assesses the null hypothesis of trait variance/covariances are the same between the two homozygous groups.
One potential form of non-additivity is dominance within a locus, and dominance linkage disequilibrium score regression (d-LDSC) extends LDSC to incorporate within-locus non-additive effects on a site-by-site basis, using a unique recoding of allele counts that allows testing for a dominance deviation directly as the “dominance encoding” [12]. Under Hardy-Weinberg equilibrium (HWE), the standardized dominance encoding maps allele counts from [0, 1, 2] to [–q/(1 – q), 1, – (1 – q)/q] for each variant, where q is the minor allele frequency of the variant. This encoding imposes orthogonality on the additive and dominance components of genetic variance, thus the association test can be run in parallel rather than jointly. To estimate SNP-based dominance heritability, the d-LDSC model regresses the χ2 statistics of dominance GWASs on dominance LD scores [12].
The term missing heritability is used in multiple related ways. Sometimes the term is used to describe the fact that the sum of the estimated variances explained by all SNPs individually does not approach estimates of VA (e.g., total additive variance estimated by a linear mixed model or in family-based studies). Essentially, the variance explained by a polygenic risk score (PRS) is generally only a fraction of the estimated total additive variance. At other times the term is used to reflect the fact that heritability estimated from nearly "unrelated” individuals via LD-score regression or a linear mixed model is usually substantially less than heritability estimated from the resemblance between close relatives. Failure of PRSs to fully account for VA is easily explained by insufficient power, but differences between estimates of heritability in close versus distant relatives is almost surely strong evidence for the existence of some substantial form of interaction.
PRSs broadly attempt to provide a quantitative measure of an individual’s total genetic risk burden of disease over all susceptibility variants identified by genome-wide association studies [13]. PRSs are most commonly calculated as a weighted sum of the number of risk alleles carried by an individual, with risk alleles and their weights defined by the associated loci and their measured effect sizes from GWAS - see the second paper in this series for a more detailed discussion of the calculation and interpretation of PRSs [14]. The total variance explained by the PRS, VPRS is given by
where the sum is taken over all sites included in the estimate v, pv and qv are the allele frequencies, and β̃v is the LD-independent effect (on the liability scale for a dichotomous trait) of substituting the rare allele for the common one at locus v. VPRS is generally vastly smaller than the total estimated additive variance, VA. For instance, recent twin studies have estimated the heritability of Alzheimer’s disease to be 71% [15]. In contrast, a PRS based on loci reported by a large GWAS explained 19% of the variation in Alzheimer’s disease on the liability scale [16]. Twin studies have estimated the heritability of schizophrenia to be as high as 79% [17]. A PRS explained only 9.2% of the phenotypic variation in schizophrenia on the liability scale [18]. Twin studies have found that heritable influences account for 46% of the variance in post-traumatic stress disorder [19]. PRS based on a GWAS meta-analysis of PTSD with over 1.2 million participants explained 6.6% of the variation in PTSD [20]. The heritability of many disorders and diseases as estimated by family-based studies is high, but only a small part of it has so far been explained by identified variants.
Thus, if "missing heritability” is defined in this relatively narrow way, the likely answer is that it is only missing for technical reasons related to study design and insufficient statistical power. In other words, at current sample sizes, studies are not only underpowered to detect most of the SNPs contributing to the trait, they are also often underpowered to accurately estimate the effects at those SNPs that are correctly detected. However, for many traits, significant fractions of family-based estimates of heritability still remain unexplained even as sample sizes pass beyond the millions. Visscher et al. (2006) estimated the heritability of height to be 80% from empirical genome-wide identity-by-descent sharing (the observed proportion of the genome that two relatives share IBD estimated from DNA markers) in a sample of 3,375 full-sibling pairs [21]. The most recent GWAS meta-analysis of height from the the Genetic Investigation of ANthropometric Traits (GIANT) Consortium was conducted in a sample of 5.4 million individuals and identified 12,111 significantly associated SNPs [22]. A polygenic score based on these genome-wide significant SNPs explained 40% of the variance in height – only half of the heritability from family-based studies, even with such a vast sample size [22]. Thus, it remains plausible that the additive contributions of individual loci estimated from GWAS alone will never explain all of the familial resemblance for many traits, even if hundreds of thousands of loci are considered [23].
Despite this, if we view "missing heritability" in this narrow sense, it is still likely even a study such as this one conducted by the GIANT Consortium [22] was still underpowered. For instance, if a trait with 50% heritability (VA = 0.5) were contributed to by 200, 000 SNPs each explaining an equal fraction of the variance (equal
The solution of increasing sample sizes will only work, even in principle, when combined with whole genome sequencing (WGS). If only common sites are included in the analysis, but actual effect SNPs are not included because they have minor allele frequencies smaller than any typed SNP, VPRS will always be smaller than VA. In the second paper in this series [14], we demonstrated how genotyping the "wrong" variant (rather than the "real" causal variant it is in LD with) biases effect size estimates and the variance explained by polygenic risk scores. Call βv the estimated effect at site v and β̃w the "true” effect at site w. Assume these sites are in LD, i.e., Dv,w ≠ 0 and qv > qw, and therefore pv < pw. If site v is typed and it has no real effect, then
We thus arrive at the conclusion that if untyped rare sites are contributing to phenotype and a PRS includes only common markers, then the PRS will necessarily explain less than the total additive variance.
If there are multiple untyped rare sites all linked to the same common marker, this phenomena could be dramatically worse. In the above, we ignored the sign of Dv,w. More precisely –qvqw ≤ D ≤ pvqw, and we presented the "positive” side of the inequality. Since pv > qv, this is the "better” scenario, i.e., the scenario where the variance explained by site v is closer to that explained by w. If the signs are reversed, the problem is worse. If there are multiple sites in LD with v, the situation can be arbitrarily bad if the sign of D varies. If there are sites wk that are all rare and in LD with typed site v and the sign of
which can be arbitrarily close to zero. In theory it is possible for there to be no common site with a detectable effect in a region when all the contributing sites are rare and the sign of LD is changing back and forth. In practice, this doomsday scenario can be partially mitigated by a sufficiently deep set of common markers. Any rare site in LD with a common site is likely to be in LD with many common sites so that each common site may be affected by slightly differing combinations of rare sites. While switching signs may cause the effects to "cancel” at some common site, it seems unrealistic for this to happen at all sites in a region where each site is being influenced by slightly differing combinations of rare neighbors. That VPRS will be at least slightly less than VA seems completely inevitable, though. We thus conclude that the variance explained by a PRS will be less than the total additive variance unless all the sites were typed (WGS), and even with WGS, exceptionally large sample sizes will be required if the number of contributing sites is large.
Several approaches have been developed that utilize genome-wide variation instead of only statistically significant variation to examine heritability of complex traits [24–27]. LD-score regression (LDSC) can be thought of as an attempt to solve the problem of underestimating additive effects at common sites when the "real" causative sites are rare [24], and studies underpowered. SNPs in LD blocks with causal SNPs have high χ2 test statistics, and more LD yields higher χ2’s on average. In a GWAS, the deviation of observed χ2 test statistic for a SNP from its expected value under the null hypothesis of no association is a function of LD between the target SNP and underlying causal variants. Therefore, in LDSC, the slope from regression of the observed χ2 test statistic (from GWAS) on SNPs’ LD scores (from reference data) is used to estimate the additive genetic variance attributable to the causal variants in LD with common SNPs present in GWAS summary statistics (summary results of GWAS, estimated SNP effects and their standard errors for hundreds of thousands of SNPs analyzed in a study). This approach requires only summary-level data from GWAS because LD scores can be estimated in a reference population, for example, HapMap 3 SNPs in the 1000 Genomes Project [28].
Under a polygenic model, in which effect sizes for variants are drawn independently from a distribution with variance proportional to 1/pq (where q is the minor allele frequency), the expected χ2 statistic of SNP v is
where N is the sample size and M is the number of SNPs, such that (
At some level of understanding, we can view LDSC as a way of averaging effects over a region. Recalling that for a randomly picked site v with no effect on phenotype, but that is in LD with other sites w which do contribute,
In this context we see the average effect squared (β2) of a randomly picked SNP is approximately the average effect squared of SNPs contributing to the phenotype, multiplied by the r2 measure of LD between them. At this level we can view LDSC as an approximation of the average effect of neighboring SNPs in a region. On a deeper level, we can view LDSC with the understanding that the covariance in allelic state is simultaneously a measure of LD and a measure of the relatedness among those individuals [7]. Keeping that insight at the forefront of our imagination, it now seems natural that the "average” LD in a region is a direct measure of the average relatedness of the individuals used to estimate the LD. One could therefore calculate the average LD in a region, use that as an estimate of the average relatedness in an association study, and obtain a direct estimate of the additive variance contributed by a region. To understand this more precisely, we will now take a closer look at the LD score of variant v, lv [24]. LDSC models phenotypes as generated from the equation
where
an estimate of the relatedness of individuals i and j over the M SNPs considered, with
This is an estimate of the correlation in genotypic state between locus v and w, where the estimate is taken over the N individuals under study. Thus LDvw is an estimate of the "usual” LD measure r. Under the model considered in LDSC, the standard least squares estimator of
A standard test for association for SNP v is a χ2 test with one degree of freedom,
We thus arrive at the fundamental intuition of LD-score regression. The test statistic at each SNP can be written as function of the relatedness of the individuals involved in the study, R, averaged over SNPs in the region. It can also be written as function of LD among those SNPs (LD) averaged over individuals in the study. If one were to replace the LD observed in the study with LD observed among other individuals, it is very much a replacement of the relatedness of individuals under study with the relatedness among the individuals used to estimate the LD. We therefore imagine that LD-score regression will provide a robust estimate of the additive variance in phenotype in the regions tested whenever the LD among the study participants is sufficiently similar to the LD among the individuals used to create the LD score statistic. In the simplest expression of this intuition, we have replaced the observed relatedness of individuals under study with the average relatedness of random individuals drawn from the population.
While LDSC utilizes summary-level data and regression, genome-based restricted maximum likelihood estimation (GREML) estimates genetic variance using individual-level genome-wide data in a linear mixed model framework. One of the most popular software packages for estimating SNP-heritability using genome-wide data from unrelated individuals is Genome-wide Complex Trait Analysis (GCTA), which uses a GREML method [25,26]. The fundamental idea behind such approaches developed for individual-level genome-wide data is to estimate the realized genetic relationship among individuals by using genome-wide variants, and then use this relationship matrix to estimate the additive genetic variance.
In contrast to single-SNP association analysis, GCTA fits the effects of all genotyped SNPs as random effects via a linear mixed model. In this design, phenotype P can be represented in equation form
where
where
While R is usually constructed from genotypes at common markers, it is nonetheless an estimate of the average relatedness between every pair of individuals, where that average is taken over the SNPs used to construct it. This estimate is often taken at over ~ 500,000 loci around the genome (the number of common SNPs typically on commercial genotyping arrays) and thus should be viewed as an extremely precise estimate of identity-by-descent (IBD) sharing between individuals i and j. This estimate is not everywhere in the genome, but it is on average every ~ 6,000 bases across the 22 autosomes. On average, common variants tend to be in high LD with loci within 10,000 bases and rare variants may have high LD over hundreds of kilobases – in essence, 6,000 bases is much smaller than the distance over which LD extends. While there may be areas of poor coverage where IBD sharing might be greater or less than the estimate of average relatedness, and some imprecision at regions of increased or decreased allele sharing might matter, in general it seems likely that this will be a highly precise estimate of relatedness, r.
A commonly noted limitation of the GCTA approach is that it relies heavily on LD between assayed and causal variants [29]. This criticism states that genetic relatedness between a pair of individuals based on genotyped variants may not reflect genetic relatedness based on ungenotyped causal variants. If ungenotyped causal variants are in strong LD as compared to genotyped variants, as is the case when causal variants are rare, then heritability estimated using genotyped variants will be underestimated. There is some theoretical underpinning for the belief that at least a small fraction of the “missing heritability” in for complex traits could be due the effects of such rare alleles.
Yang et al. (2015) start with whole genome sequencing data in 3,642 unrelated individuals. They then simulate a complex trait with 80% heritability by sampling SNPs from the real data (17.6 million variants) and assigning effect sizes to those "causal" variants under four scenarios: random (roughly a 50/50 mixture of common and rare causal variants), more common (MAF > 0.01) causal variants, more rare causal variants, and more rare causal variants that also have a lower mean LD score than average [30]. Yang et al. (2015) show that when causal SNPs are picked at random from all sequence variants in the genome, the genetic relatedness matrix estimated from only common SNPs is an excellent estimate of r at all SNPs contributing to disease and there is no "missing heritability" at all – in other words, single-component GREML gives an average estimate of heritability of 80% with very little error. On the other hand, when disproportionately more rare variants are causal variants, r is overestimated and as a result heritability is very slightly underestimated by a few percent on average. If these rare causal variants are also in regions of the genome with unusually low LD, using common SNPs to estimate r gives up to a 10-15% underestimate of heritability (i.e., heritability estimated to be 65-70% instead of the true 80% value). Even if most of the causal variants happen to be rare and in regions of unusually low LD, this leads to at most a 10-15% underestimate – significantly less than the magnitude of the "missing heritability" for many phenotypes [30].
To overcome this limitation, later stratified variations of GREML have been developed to adjust for the influence of minor allele frequency (MAF) and local LD (individual LD scores) on the estimated SNP-heritability, known as the GREML-MAF Stratified (GREML-MS) and GREML-LD and MAF Stratified (GREML-LDMS) approaches, respectively [30]. These multi-component stratified GREML methods fit multiple GRMs created by a subset of SNPs stratified by either minor allele frequency (MAF) bins alone or both LD and MAF bins simultaneously in the linear mixed model. In simulations, these approaches appear to provide unbiased estimates of the total additive variance, regardless of the LD and MAF model used to simulate the trait [30].
If missing heritability is due only to issues of power or ungenotyped rare causal sites, absent interactions, estimates of heritability from LDSC and GREML should converge to family-based estimates. While there may be technical biases in LDSC caused by substituting the "average” relatedness of individuals for the relatedness among individuals under study, and somewhat different biases in GREML from using only a subset of all possible SNPs to estimate relatedness, these biases are likely to be small, and often solvable by careful LD matching for LDSC, and extensions of GREML to include SNP subsets with relatedness that might differ from average.
Yet, these approaches yield estimates of heritability that tend to be substantially lower than those from family studies. A pattern of heritability estimated by families-based studies being significantly larger than that estimated in linear mixed model approaches (GREML), which in turn are slightly larger than that estimated from LDSC, is often seen in large datasets.
Missing heritability is the norm for nearly all human traits studied, even traits with exceptionally large sample sizes. Consider the phenotype of height as an example, with a family-based estimate of heritability from genome-wide IBD sharing between sibling pairs of 80% [21]. Srivastava et al. (2023) estimated the SNP-based heritability (
Estimates of heritability from close relatives (i.e., family-based methods) are usually significantly larger than estimates from distant relatives/unrelated individuals (i.e., molecular genetic methods such as LDSC and GREML). For over 100 years, geneticists have been using the correlation between relatives to estimate heritability in a practice so familiar that we often forget that it usually involves simplifying assumptions. If we define the term heritability to mean the proportion of phenotypic variance due to additive effects, estimates of heritability from family studies can be biased upward by interactions between genes (epistasis), by genotype-by-environment interactions, and by unrecognized correlations in environments among relatives [32]. In particular, for a pair of relatives with phenotypes Pi and Pj [5]
In this formulation, VA is usually called the "additive variance” and we define heritability to be h2 = VA/VP where VP is the total phenotypic variance. VAA, VAD, VDD, and VAAA are usually called the additive-by-additive, additive-by-dominance, dominance-by-dominance, and three-way additive-by-additive variances, respectively [5,6]. These components each reflect a non-additive genetic contribution to phenotypic variance arising from interactions between alleles or genotypes at different loci, that are often collectively called "higher-order genetic interactions." Non-additive in this context means that the average phenotype given a multi-locus combination of genotypes/alleles is not just the sum of the average phenotypes given each genotype or allele individually. In the above equation, VGE reflects the total contribution of gene-environment interactions to phenotypic variance, with cij as the fraction of those interactions shared between individuals i and j.
For parent(P) / offspring(O) comparisons (a typical comparison in which to estimate heritability),
In practice, SNP based techniques used to estimate heritability from distantly related individuals typically found in a GWAS study will be far less inflated by higher-order genetic interactions. In principle, the inflation is present, but the values of r are such that the inflation may be negligible. For the “unrelated individuals” typically found in these kind of studies, r2 is nearly 0, and r ~ O(0.01), so that r2 ~ O(0.0001). Thus, the inflation from this approach is dramatically lower than from familial studies. In particular, if higher-order interactions contribute a significant fraction of the total genetic variance we expect heritability estimated by families to be significantly larger than that estimated in linear mixed model approaches or LDSC, as is often seen in large datasets [33].
It can be suggested that rather than higher-order interactions, other sources of non-additivity (e.g., dominance, assortative mating, gene-environment interactions) inflate estimates of heritability from family-based studies. If VGE is inflating estimates of heritability from family-based studies, then estimates of VA from GREML should be comparable to estimates of heritability from family-based paradigms that attempt to control for gene-environment interactions (with a much smaller shared environment contribution to similarity among relatives), such as adoption studies. Consider alcohol use disorder (AUD) as an example. Estimates of the heritability of AUD from twin studies are 51% with a 95% CI of [0.45-0.56], while estimates from adoption studies are 36% with a 95% confidence interval of [0.22-0.50] [34]. Linear mixed model estimates of the heritability of AUD from GREML are 13% with a 95% CI of [0.124-0.136] [35]. In essence, when family-based paradigms try their best to control for gene-environment interactions, heritability estimates from close relatives are still considerably larger than estimates from GREML in distant relatives. A heritability estimate of 36% with a lower bound of 22% from adoption studies of AUD is still about twice as large as the 13% estimate from distant relatives using GREML. Comparisons of twin and adoption studies for many other traits (e.g., see [36] for antisocial behavior) show a similar pattern: heritability estimates from adoption studies are modestly smaller than estimates from twin studies and while there is evidence for gene-environment interactions, they typically explain 5-10% of phenotypic variance - not enough to fully account for the difference in heritability estimates from close versus distant relatives. The data is far more consistent with gene-environment interactions being a smaller contributor to heritability inflation than higher-order genetic interactions.
While most seem to think there is little evidence for higher order interactions, and it is certainly true that data is very rarely interpreted as supporting the existence of higher order interactions, as it turns out, there is enormous evidence that is fully consistent with their existence. That data is simply seldom described that way.
In addition to stratified variants of GREML, another approach was developed to estimate SNP-heritability in individuals with close or extended relationships [27,37]. The GREML-KIN method models phenotype with a linear mixed model in samples that contain a mixture of close relatives, more distant relatives, and "unrelated individuals" (6th degree relatives and beyond). The model in GREML-KIN is
In this model,
Now we will demonstrate that
Now, let us consider the random effects model for three different types of pairs of individuals: distant or unrelated individuals, close relatives (5th degree and closer), and full siblings. To simplify our discussion, assume
Thus, in individuals more distantly related than second cousins,
Since r < 0.025, this can be thought of as being a very good approximation for VA. Thus, in individuals with
where both the R and K matrices are identical. The phenotypic covariance between such individuals is given by
Thus, in individuals that are 5th degree relatives and closer (not including full siblings),
For full siblings,
where both R and K are identical. The phenotypic covariance between full siblings is
Thus, in individuals who are full siblings,
Imagine the linear mixed model works perfectly (variance is partitioned perfectly by GREML), then
Thus, this "extra” variance component,
There is considerable evidence that a substantial fraction of all genetic variation is due to these additive-by-additive interactions. Xia et al. (2016) [37] partitioned phenotypic variance using GREML-KIN for complex anthropomorphic and cardiometabolic traits (see Table 1).
Table 1 Results of variance components analyses using GREML-KIN from final reduced models selected by Xia et al. (2016) and Hill et al. (2018). Standard errors of variance components in parenthesis.
The amount of variance that is explained by shared familial environment (
Comparing family-based estimates of heritability to the sum of the genetic variance components from GREML-KIN estimated by Xia et al. (2016) and Hill et al. (2018) demonstrates that very little heritability is "missing" at all once additive-by-additive interactions are accounted for [37,41]. As Yang et al. (2015), showed even if most of the causal variants happen to be rare and in regions of unusually low LD, this leads to at most a 10-15% underestimate of heritability [30] - significantly less than the magnitude of
A genetic interaction exists when the joint genetic effect of two loci on phenotype is different from the sum of the individual locus effects. If an allele
We imagine our gene, call it G, occurs on an autosomal, diploid locus. We imagine that the two alleles of G, G1 and G2, are individually regulated, such that there are four possible states for this locus: State 0: Both G1 and G2 are silenced. State 1: G1 is expressed, but G2 is silenced. State 2: G2 is expressed, but G1 is silenced. State 3: Both G1 and G2 are expressed. To further simplify this model, we imagine that the amount of transcript produced in state 0 is 0. The amount in State 1 and 2 is x, and the amount in State 3 is 2x. Thus, each allele, when transcribed, produces an equal amount of transcript. Whether or not the allele is transcribed is determined by binding of other proteins at cis-regulatory elements. In the simplest model, expression happens if binding occurs at any cis-regulatory element, and no transcription if all regulatory elements are unbound. To make this a model of genetic variation, we imagine that SNPs are within the cis-regulatory elements. One of the two SNP nucleotides allows binding to its element (and thereby “turns on” cis expression of that allele), while the other nucleotide inhibits binding, and thus eliminates cis-expression, unless some other element is bound. We fully recognize that biology is far more complex and subtle than this model, but even this simplistic view of gene regulation can induce substantial higher order interactions. We give a few key results to reveal the extent to which higher-order interactions can be the largest component of genetic variance in this system.
For this model of gene regulation, we present results in Table 2 where the gene has one cis-regulatory element or two cis-regulatory elements. Each element has a SNP, with one nucleotide allowing binding (and thereby turning on cis-expression), and the other nucleotide preventing binding. For simplicity, we assume the nucleotide preventing binding is at the same frequency in each element, and is at frequency 0.02 (relatively rare, but tested in most large GWAS studies). As Table 2 shows, under this model of gene regulation, if there is only one regulatory element, all variation is additive. If there are two regulatory elements, almost all the variation is additive-by-additive interaction. Thus, a simplistic model of cis-regulation of gene expression gives rise to substantial interaction variance whenever there is genetic variation in two different cis-elements. The converse of this model is one where there is no variation in the cis-regulatory elements, but variation in the protein binding partners, such that some of the time proteins are available to bind (and thereby induce expression at this locus), but at other times they are not. Combine this with an idea that differing proteins might bind to differing cis-regulatory elements, and the exact same quantitative results can ensue, i.e., a substantial fraction of the variance might be additive-by-additive (if a small number of different proteins regulate each locus), or even higher-order interaction variance if substantially more than two contribute to regulation.
In this numerical example, the additive-by-additive component will be approximately 25 times larger than sum of the individual SNP effects. We could imagine the two SNPs might show relatively uninteresting levels of association individually, with say p = 0.25 and χ2 = 1.32 at both loci. However, a test for the additive-by-additive contribution of the two loci together would have an expected E[χ2] = 64.9 with p ≈ 10–15, a value likely to reach an experiment-wide significance threshold even if all pairs of SNPs genome-wide are tested with Bonferroni correction. Of course, since the combination of both minor alleles will rarely be observed (1/(0.02)2 = 2, 500) such a study would require tens of thousands of samples to be well powered.
Far from being an abstraction that "could exist,” higher-order genetic interactions arise as a consequence of the cis-regulation of genes by nearby variants. In fact, we believe cis-regulation of genes is likely to account for some, and potentially a substantial fraction of “missing heritability” of complex traits. While Mendelian, or single-gene diseases, are largely caused by protein-coding changes, complex traits have been shown to be mainly driven by noncoding variants that presumably affect gene regulation. Indeed, many studies have shown that significant disease-associated SNPs are highly enriched in regions of active chromatin such as promoters and enhancers [44]. Cis-regulation of genetic transcription involves a proximal promoter as well as more distal regulatory elements (e.g., repressors, silencers, enhancers, insulators), and variation within any of these regulatory elements can lead to transcriptional changes to the target gene. If variants are in multiple regulatory elements, their effects are likely to be non-additive. Disrupting one enhancer might have limited effect, but disrupting all enhancers could have a major effect on expression of the target gene. It is difficult to imagine a model where the effects would be strictly additive, where the effect of having a variant in two cis-regulatory elements is exactly the sum of their individual effects.
On a broader scale, expanded views of complex traits such as Boyle et al. (2017)’s omnigenic model posits that information flows from regulatory variants, for example by affecting chromatin activity, to cis-regulation of nearby genes and ultimately to affect the activity of other “unlinked” genes [44]. In this way, cis-acting expression quantitative trait loci (eQTLs, or loci that explain a fraction of the variance in a gene expression phenotype) may in turn affect mRNA or protein levels of other “unlinked” genes via regulatory networks. In other words, a cis-eQTL can also be a trans-acting eQTL elsewhere in the genome, in addition to affecting other functions such as post-translational modification or subcellular localization of gene products [44]. Long-range cis-regulatory interactions could also be mediated by CCCTC-binding factor (CTCF), a highly conserved transcription factor. Variation in CTCF binding sites could change chromatin configuration at a large scale, dramatically altering transcription potential for many genes. Roughly 15% of CTCF binding sites are within the boundaries of topologically associating domains (TADs), and these CTCF binding sites play the role of an insulator to demarcate rostral-caudal transcriptional boundaries [45]. Chromatin domains with a high frequency of molecular interactions between different loci form TADs that are relatively isolated from neighboring regions of DNA, facilitating interactions between cis-regulatory elements and target genes. CTCF-cohesion complex forms a loop structure and an insulated enhancer can only act on genes within the formed loop. Mutation in CTCF-binding sites may thus prevent the proper loop structure from forming, and genes that are originally outside the loop are now affected by the enhancer (e.g., expression is highly upregulated compared to wild-type). Such mutations can have large scale impacts, causing increased inter-TAD interactions and reduced intra-TAD interactions [46]. When two or more variants affect the same TAD, additive-by-additive, or even higher-order interactions, are virtually unavoidable.
Not withstanding the toy example of cis-gene regulation described above (Figure 2), one is often tempted to assume that higher-order variance terms will smaller than lower order terms. Under many circumstances this can be true, but when the interaction involves one or more rare alleles, the higher-order interaction effect can be considerably larger than any of the main effects. To see this, we derive several general results for the additive-by-additive interaction variance,
To start, consider a single locus v in Hardy-Weinberg equilibrium with two alleles
Now consider a second locus w, with properties and quantities of the same form as those of locus v.
Imagine these two sites v and w reside on the same chromosome and both influence phenotype. These two sites can form four possible haplotypes (
Let H be a randomly picked haplotype, and
We begin by deriving expressions for all four haplotypic effects in terms of the β’s at each site, βv and βw, and δv,w.
The additive-by-additive variance is calculated as the variance of the additive-by-additive deviations across all four haplotypes. Note that additive-by-additive deviations for the other three haplotypes can each be written in terms of allele frequencies and the additive-by-additive deviation of the rare-rare haplotype (e.g.,
As we previously derived [5], in the context of no LD between the two loci under consideration, the single-locus β’s can be written as
From this, we derive
Quantities denoted by a hat are their values in a real population with linkage disequilibrium between the loci under consideration. In the presence of LD, where D is the standard measure of haplotypic covariance (
In Equation 108, the effect size at locus v, measured as a β, is equal to what the effect size would be at locus v absent LD, plus the effect at locus w absent LD, weighted by the haplotypic covariance between the two loci, divided by the allelic variance at locus v (see [5] for derivation). In the context of D ≥ 0, both single-locus additive variances become larger as the β’s at each site become larger, and the additive-by-additive variance becomes larger. The total additive variance becomes larger more rapidly than the additive-by-additive variance in the context of positive LD between two loci. Thus, the ratio of
We derive an analytic bound on the maximum possible additive-by-additive interaction variance contributed by any pair of sites in a framework independent of linkage disequilibrium and any other correlations in allelic state (e.g., departures from HWE) [5,7]. Assuming parental exchangability, cis-only, and no dominance in this system (
Without loss of generality, assume the size of haplotypic effects are oriented such that
§ Bound 1:
If
For δv,w ≥ 0 and βw ≤ 0,
If
For δv,w ≤ 0 and βw ≥ 0,
§ Bound 2:
If
For δv,w ≥ 0 and βv ≤ 0,
If
For δv,w ≤ 0 and βv ≥ 0,
§ Bound 3:
Recall that haplotypic effects are oriented such that
This is only an upper bound on δv,w when βv + βw is negative.
§ Bound 4:
If
For δv,w ≥ 0 and βv ≥ 0,
If
For δv,w ≤ 0 and βv ≤ 0,
§ Bound 5:
If
For δv,w ≥ 0 and βw ≥ 0,
If
For δv,w ≤ 0 and βw ≤ 0,
§ Bound 6:
If
If
This bound represents a complicated function of βv, βw, and the relationship between the allele frequencies at both sites.
Below is a summary of the situations in which each of the four easily interpretable bounds apply. Bound 1,
For the twelve possible patterns of haplotypic effect sizes (given
Table 3 Applicable bounds on
Note that Cases 3, 4, 11, and 12 represent pathological scenarios in that they are highly biologically implausible. To see this intuitively, imagine locus v and locus w reside in two different cis-regulatory elements for the same target gene. For Cases 3 and 4, consider the example where
Our aim is to develop a bound that applies to all possible non-pathological relationships among haplotypic effects. To do so, we now will consider the cases which only have an applicable bound in terms of either
§ Case 7:
Given that
§ Case 8:
Given that
As such, across all possible orderings of haplotypic effects, the maximum possible additive-by-additive interaction variance in our LD-independent framework is given by
When both sites have rare minor alleles,
Since allele frequencies and
In the two papers that comprise the third part of this series, we distinguish interactions arising from correlations in state between factors from those arising from non-additivity of independent factors. Genetic interactions in the form of correlation in allelic state – departures from Hardy-Weinberg equilibrium, linkage disequilibrium, population structure, and relatedness – are known to occur commonly and are often accounted for in genetic studies [7]. In contrast, non-additive interactions between uncorrelated markers at different genetic loci (epistasis) is seldom considered in any serious way in most genetic studies.
We have demonstrated that non-additivity among independent factors does not change individual effect sizes, but does increase the total variance. Interactions due to non-additivity of uncorrelated factors cause differences in the residual variance in phenotype between differing states of the same factor as well as differential covariance patterns between phenotypes. A variety of tests for "latent” interactions capitalize on this phenomenon by first detecting unequal residual phenotypic variance (vQTLs) and/or differing covariance patterns between traits (covQTLs) among genotype categories at a specific SNP and then prioritizing the identified vQTLs/covQTLs for interaction testing (e.g., [8–10]).
Correlation in genotypic state between two individuals is a measure of the relatedness of those individuals [7]. It is through this connection that we understand GREML [25–27] and LD-score regression [24] approaches to estimating heritability. GREML uses the correlation in genotypic state to estimate relatedness and thereby heritability. In LD-score regression, relatedness among the individuals used to estimate genetic effects is replaced by an estimate of relatedness among other samples used to estimate LD.
The problem of missing heritability (differences between estimates of heritability in close versus distant relatives) for complex traits is contextualized as evidence for the existence of some form of interaction(s). Fundamentally, since heritability almost always appears to be larger when estimated from close relatives than when estimated from distant ones, we must conclude that some form of interaction exists that is captured in the close relatives but absent from distant ones. Unaccounted for correlations in state are usually the form of interaction investigators ascribe to causing missing heritability. These correlations in state might be between genotype and an unobserved environmental factor, or between the phenotype of parents (assortative mating, more often a manifestation of population structure as mate choice is influenced by geography), or between the genotype of typed markers and unobserved (likely rare) alleles contributing to phenotype. For example, correlation in mate choice based on phenotype can induce correlation in genotypic state, which can alter estimate of the variance components obtained from family-based studies [47]. While all these forms of correlation in state could and likely do contribute to a fraction of the missing heritability, we argue that even when these correlations in state are well accounted for, there remains significant missing heritability.
One particular form of interaction which is rarely considered by investigators is variance induced by departures from additivity of uncorrelated genotypes: additive-by-additive variance. We considered evidence for higher-order additive-by-additive interactions from both an empirical standpoint (linear mixed models in samples of individuals with varying degrees of relatedness) [27,37] and a biological perspective (cis-gene regulation). While data from linear mixed models that estimate heritability in samples that contain a mixture of individuals with close or extended relatedness such as GREML-KIN [27,37] are rarely interpreted as supporting the existence of higher-order interactions, it is only because the authors assume these interactions do not exist. If we do not assume a priori that such interactions do not exist, these studies can be viewed as providing substantial evidence for the existence of rampant additive-by-additive interactions. By exploring several models of cis-gene regulation, we derive bounds on the maximum possible pairwise additive-by-additive interaction variance in terms of the individual site allele frequencies and additive effects. We demonstrate that the additive-by-additive interaction variance can be considerably larger than the main effects when minor alleles are rare, given that this maximum is inversely proportional to minor allele frequency for loci with rare alleles.
Classical population genetic theory fundamentally assumes primarily additive genetic variance. Given the pervasive evidence for multiple cis-regulatory elements acting in concert to regulate individual target genes as well as enrichment of disease-associated SNPs in non-coding regions of active chromatin [44], it is entirely possible that much of the genetic variance for complex traits could be higher-order non-additive interactions. Gibson (1996) used a mathematical model of cis-acting variants impacting gene expression during Drosophila embryogenesis to demonstrate that regulatory pathways naturally lead to epistatic and pleiotropic effects. By altering the number of binding sites for an activator protein in the hunchback promoter, Gibson demonstrated that the sharpest transcriptional responses were produced by cooperative DNA-binding likely to involve at least five or six binding sites for activator proteins [48]. In addition, genetic variance affecting the first switch in a pathway (e.g., activation of hunchback) will tend to affect multiple switches at the second level (e.g., targets of hunchback protein) unequally. This quantitative derivation demonstrates the potential for non-additive interactions between loci within binding sides of a promoter as well as between regulatory molecules, highlighting that epistasis is a natural consequence of transcriptional regulation [48]. Yengo and colleagues demonstrated that apparent linkage peaks (i.e., sharing within families of genomic regions) can be due to the combined effects of multiple loci that contribute additively or non-additively [49]. Recently, the LDSC approach has been extended to capture pairwise interactions between nearby variants, in addition to additive effects, using GWAS summary statistics [50]. The proportion of genetic variance that appears to be explained by such interactions ranged from 2 – 25% across 25 traits in the UK Biobank, with a mean of 17.6% (σ = 5.7%).
Although most hits in GWAS of complex traits implicate non-coding genetic variants with putative regulatory effects, a large fraction of noncoding GWAS hits do not colocalize with known expression quantitative trait loci (eQTLs). Mostafavi et al. (2023) demonstrate that GWAS and eQTL studies are systematically biased toward different types of variants, with selection shaping essential differences in what these assays detect [51]. GWAS hits are variants that have measurable effects on trait or disease and are thus biased toward functionally important genes that are selectively constrained. eQTLs with large effects on constrained genes are purged by selection, thus variants with relatively large effects on gene regulation tend to be clustered near the transcription start site of genes that are typically depleted of most functional annotations where selective constraint is low [51]. These observations provide an explanation for why most GWAS hits are indeed cis-eQTLs, but many have not yet been discovered in eQTL mapping. Approaching the question of "why is there any genetic variation at all for traits that appear to be strictly deleterious?", Hemani et al. (2013) build a series of two site interaction models under several selection regimes. They show that selection is both less effective at eliminating interaction variation from the population than it is in eliminating strictly additive effects, and that the resulting population may have patterns of correlation between interacting factors that could cause significant over-estimation of the total additive effects [52].
It is likely that the common assumption that additive-by-additive interactions are non-existent derives from a relatively small number of model system breeding experiments, where differences in covariances across pedigrees can be used to estimate higher-order interactions. Such analyses begin with the assumption that the probability of sharing the additive-by-additive interaction at loci v and w is the probability that an allele is shared at locus v times the probability that an allele is shared locus w, and those probabilities are independent and both equal r. Effectively this analysis assumes free recombination between loci v and w, and that the covariance in phenotype between pairs of individuals (who are not full siblings) is the familiar
However, if these sites are physically near each other, because they are both cis-regulators of the same gene, within megabases or less, there will be very little chance for recombination between those sites in the few generations captured in a pedigree. As such, when there is no recombination between nearby sites, the probability of sharing alleles at both loci is the same as the probability of sharing an allele at one, which is r. If, as proposed here, most or all of the additive-by-additive interaction variance is induced by cis-regulation of genes by neighboring sites, the covariance between any pair of moderately close relatives (close cousins or nearer) is better modeled as rVa + rVaa + rVaaa + ... +, because there will be little to no recombination between the interacting sites. If the additive-by-additive component is driven by the cis-regulation of genes, the difference between half-siblings (Va/4 + Vaa/4 + Vaaa/4 + … +) and twice cousin covariance (2 · [Va/8 + Vaa/8 + Vaaa/8 + … +]) should be zero. In our view, cis-regulation of genes induces additive-by-additive interactions between neighboring sites regulating the same gene, and this effect can be considerably larger than the additive variance, when the interacting sites are rare. Classical breeding studies that estimated additive-by-additive variance have disproven the existence of any significant component of unlinked interacting sites, but are entirely consistent with the existence of neighboring sites which interact.
Because any pair of neighboring sites will have the same IBD pattern among close relatives, classical breeding studies among close relatives are effectively unable to detect interactions caused by cis-regulation. In other words, without free recombination between sites, the additive effects and additive-by-additive interactions are fundamentally indistinguishable in small pedigrees. Conversely, methods to estimate heritability in samples of distant relatives (i.e., GREML) struggle to estimate interaction components because r2 is so low for "unrelated" individuals. There are, however, some experimental designed capable of generating a range of intermediate levels of sharing between pairs of sites to allow for estimation of higher-order interactions. Young and Durbin utilize an experimental yeast cross to estimate approximately 10% of the phenotypic variation on average across 46 traits may be due to two-way additive-by-additive interactions, and up to 14% on average from higher-order additive-by-additive interactions [54].
It is well-understood that unmodeled non-linearity, representing some combination of non-additive effects and correlation in state between genetic or environmental factors, contributes to the disparity in heritability estimates between family-based studies, GREML, and PRS [55,56]. Modern GREML-KIN studies have provided strong evidence for the existence of a substantial amount of interaction variance, but those interactions have been attributed, we think falsely, to unaccounted for correlation in state, despite the fact that the simplest interpretation is that they are direct estimates of the additive-by-additive component of variation induced by cis-acting gene regulation. Non-additive interactions deriving from the cis-regulation of gene expression have the potential to account for much of the missing heritability of complex traits. A first step towards actively searching for such interactions is to not assume them away. It is plausible that a significant fraction of the missing heritability is discoverable by systematic searching for additive-by-additive interactions.
Not applicable.
Not applicable.
Not applicable.
This study was supported by NIH Grants RF1 AG071170 and U01 DK134191.
David J. Cutler is a member of the Editorial Board of the journal Human Population Genetics and Genomics. The author was not involved in the journal’s review of or decisions related to this manuscript. The authors have declared that no other competing interests exist.
All authors participated in the derivation, writing, and editing of this work.
This work has benefited from many helpful suggestions by Greg Gibson, and long conversations with Loic Yengo.
| 1. | GTEx Consortium. Genetic effects on gene expression across human tissues. Nature. 2017 Oct;550:204-213. [Google Scholar] [CrossRef] |
| 2. | Nicolae DL, Gamazon E, Zhang W, Duan S, Dolan ME, Cox NJ. Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet. 2010;6:e1000888. [Google Scholar] [CrossRef] |
| 3. | Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BWJH, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48:245-252. [Google Scholar] [CrossRef] |
| 4. | Li B, Ritchie MD. From GWAS to Gene: Transcriptome-Wide Association Studies and Other Methods to Functionally Understand GWAS Discoveries. Front Genet. 2021;12:713230. [Google Scholar] [CrossRef] |
| 5. | Cutler DJ, Jodeiry K, Bass AJ, Epstein MP. The Quantitative Genetics of Human Disease: 1 Foundations. Hum Popul Genet Genom. 2023;3:0007. [Google Scholar] [CrossRef] |
| 6. | Kempthorne O. The Theoretical Values of Correlations between Relatives in Random Mating Populations. Genetics. 1955;40:153-167. [Google Scholar] [CrossRef] |
| 7. | Jodeiry K, Bass AJ, Epstein MP, Cutler DJ. The Quantitative Genetics of Human Disease: 3a Interactions—Correlation in State. Hum Popul Genet Genom. 2025. Forthcoming. [Google Scholar] |
| 8. | Bass AJ, Bian S, Wingo AP, Wingo TS, Cutler DJ, Epstein MP. Identifying latent genetic interactions in genome-wide association studies using multiple traits. Genome Med. 2024 Apr;16. [Google Scholar] [CrossRef] |
| 9. | Head ST, Leslie EJ, Cutler DJ, Epstein MP. POIROT: a powerful test for parent-of-origin effects in unrelated samples leveraging multiple phenotypes. Bioinformatics. 2023 Apr;39(4):btad199. [Google Scholar] [CrossRef] |
| 10. | Marderstein AR, Davenport ER, Kulm S, Van Hout CV, Elemento O, Clark AG. Leveraging phenotypic variability to identify genetic interactions in human phenotypes. Am J Hum Genet. 2021 Jan 7;108(1):49-67. [Google Scholar] [CrossRef] |
| 11. | Aschard H. A perspective on interaction effects in genetic association studies. Genet Epidemiol. 2016 Dec;40(8):678-688. [Google Scholar] [CrossRef] |
| 12. | Palmer DS, Zhou W, Abbott L, Wigdor EM, Baya N, Churchhouse C, et al. Analysis of genetic dominance in the UK Biobank. Science. 2023 Mar 31;379(6639):1341-1348. [Google Scholar] [CrossRef] |
| 13. | Chatterjee N, Shi J, García-Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nat Rev Genet. 2016 Jul;17(7):392-406. [Google Scholar] [CrossRef] |
| 14. | Cutler DJ, Jodeiry K, Bass AJ, Epstein MP. The Quantitative Genetics of Human Disease: 2 Polygenic Risk Scores. Hum Popul Genet Genom. 2024;4(3):0008. [Google Scholar] [CrossRef] |
| 15. | Karlsson IK, Escott-Price V, Gatz M, Hardy J, Pedersen NL, Shoai M, et al. Measuring heritable contributions to Alzheimer’s disease: polygenic risk score analysis with twins. Brain Commun. 2022 Jan 4;4(1):fcab308. [Google Scholar] [CrossRef] |
| 16. | Leonenko G, Baker E, Stevenson-Hoare J, Sierksma A, Fiers M, Williams J, et al. Identifying individuals with high risk of Alzheimer’s disease using polygenic risk scores. Nature Commun. 2021 Jul;12:4506. [Google Scholar] [CrossRef] |
| 17. | Hilker R, Helenius D, Fagerlund B, Skytthe A, Christensen K, Werge TM, et al. Heritability of Schizophrenia and Schizophrenia Spectrum Based on the Nationwide Danish Twin Register. Biol Psychiatry. 2018;83:492-498. [Google Scholar] [CrossRef] |
| 18. | Calafato MS, Thygesen JH, Ranlund S, Zartaloudi E, Cahn W, Crespo-Facorro B, et al. Use of schizophrenia and bipolar disorder polygenic risk scores to identify psychotic disorders. Br J Psychiatry. 2018 Sep;213(3):535-541. [Google Scholar] [CrossRef] |
| 19. | Sartor CE. Common Heritable Contributions to Low-Risk Trauma, High-Risk Trauma, Posttraumatic Stress Disorder, and Major Depression. Arch Gen Psychiatry. 2012 Mar;69(3):293-299. [Google Scholar] [CrossRef] |
| 20. | Nievergelt CM, Maihofer AX, Atkinson EG, Chen CY, Choi KW, Coleman JRI, et al. Genome-wide association analyses identify 95 risk loci and provide insights into the neurobiology of post-traumatic stress disorder. Nat Genet. 2024;56:792-808. [Google Scholar] [CrossRef] |
| 21. | Visscher PM, Medland SE, Ferreira MAR, Morley KI, Zhu G, Cornes BK, et al. Assumption-Free Estimation of Heritability from Genome-Wide Identity-by-Descent Sharing between Full Siblings. PLoS Genet. 2006 Mar;2(3):e41. [Google Scholar] [CrossRef] |
| 22. | Yengo L, Vedantam S, Marouli E, Sidorenko J, Bartell E, Sakaue S, et al. A saturated map of common genetic variants associated with human height. Nature. 2022;610:704-712. [Google Scholar] [CrossRef] |
| 23. | Lopez-Cortegano E, Caballero A. Inferring the Nature of Missing Heritability in Human Traits Using Data from the GWAS Catalog. Genetics. 2019 May;212:891-904. [Google Scholar] [CrossRef] |
| 24. | Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J, Schizophrenia Working Group of the Psychiatric Genomics Consortium, et al. LD Score regression distinguishes confounding from polygenicity in genomewide association studies. Nat Genet. 2015;47:291-295. [Google Scholar] [CrossRef] |
| 25. | Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR, et al. Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010 Jul;42(7):565-569. [Google Scholar] [CrossRef] |
| 26. | Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: A Tool for Genome-wide Complex Trait Analysis. Am J Hum Genet. 2011 Jan 7;88(1):76-82. [Google Scholar] [CrossRef] |
| 27. | Zaitlen N, Kraft P, Patterson N, Pasaniuc B, Bhatia G, Pollack S, et al. Using Extended Genealogy to Estimate Components of Heritability for 23 Quantitative and Dichotomous Traits. PLoS Genet. 2013 May;9(5):e1003520. [Google Scholar] [CrossRef] |
| 28. | The 1000 Genomes Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012 Oct;491:56-65. [Google Scholar] [CrossRef] |
| 29. | Srivastava AK, Williams SM, Zhang G. Heritability Estimation Approaches Utilizing Genome-Wide Data. Curr Protoc. 2023 Apr;3(4):e734. [Google Scholar] [CrossRef] |
| 30. | Yang J, Bakshi A, Zhu Z, Hemani G, Vinkhuyzen AAE, Lee SH, et al. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat Genet. 2015;47:1114-11120. [Google Scholar] [CrossRef] |
| 31. | Sabatti C, Service SK, Hartikainen AL, Pouta A, Ripatti S, Brodsky J, et al. Genome-wide association analysis of metabolic traits in a birth cohort from a founder population. Nat Genet. 2009;41:35-46. [Google Scholar] [CrossRef] |
| 32. | Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: Genetic interactions create phantom heritability. Proc Natl Acad Sci U S A. 2012 Jan 24;109(4):1193-1198. [Google Scholar] [CrossRef] |
| 33. | Ni G, Moser G, Schizophrenia Working Group of the Psychiatric Genomics Consortium, Wray NR, Lee SH. Estimation of Genetic Correlation via Linkage Disequilibrium Score Regression and Genomic Restricted Maximum Likelihood. Am J Hum Genet. 2018;102:1185-1194. [Google Scholar] [CrossRef] |
| 34. | Verhulst B, Neale MC, Kendler KS. The heritability of alcohol use disorders: a meta-analysis of twin and adoption studies. Psychol Med. 2015 Apr;45(5):1061-1072. [Google Scholar] [CrossRef] |
| 35. | Clarke TK, Adams MJ, Davies G, Howard DM, Hall LS, Padmanabhan S, et al. Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N=112117). Mol Psychiatry. 2017 Oct;22(10):1376-1384. [Google Scholar] [CrossRef] |
| 36. | Rhee SH, Hewitt JK, Young SE, Corley RP, Crowley TJ, Stallings MC. Genetic and Environmental Influences on Substance Initiation, Use, and Problem Use in Adolescents. Arch Gen Psychiatry. 2003 Dec;60(12):1256-1264. [Google Scholar] [CrossRef] |
| 37. | Xia C, Amador C, Huffman J, Trochet H, Campbell A, Porteous D, et al. Pedigree- and SNP-Associated Genetics and Recent Environment are the Major Contributors to Anthropometric and Cardiometabolic Trait Variation. PLoS Genet. 2016 Feb 2;12(2):e1005804. [Google Scholar] [CrossRef] |
| 38. | Silventoinen K, Sammalisto S, Perola M, Boomsma DI, Cornes BK, Davis C, et al. Heritability of adult body height: a comparative study of twin cohorts in eight countries. Twin Res. 2003;6:399-408. [Google Scholar] [CrossRef] |
| 39. | Schousboe K, Visscher PM, Erbas B, Kyvik KO, Hopper JL, Henriksen JE, et al. Twin study of genetic and environmental influences on adult body size, shape, and composition. Int J Obes Relat Metab Disord. 2004 Jan;28(1):39-48. [Google Scholar] [CrossRef] |
| 40. | Goode EL, Cherny SS, Christian JC, Jarvik GP, de Andrade M. Heritability of Longitudinal Measures of Body Mass Index and Lipid and Lipoprotein Levels in Aging Twins. Twin Res Hum Genet. 2007 Oct;10(5):703-711. [Google Scholar] [CrossRef] |
| 41. | Hill WD, Arslan RC, Xia C, Luciano M, Amador C, Navarro P, et al. Genomic analysis of family data reveals additional genetic effects on intelligence and personality. Mol Psychiatry. 2018 Dec;23(12):2347-2362. [Google Scholar] [CrossRef] |
| 42. | Marioni RE, Davies G, Hayward C, Liewald D, Kerr SM, Campbell A, et al. Molecular genetic contributions to socioeconomic status and intelligence. Intelligence. 2014 May;44:26-32. [Google Scholar] [CrossRef] |
| 43. | Vukasović T, Bratko D. Heritability of personality: A meta-analysis of behavior genetic studies. Psychol Bull. 2015;141:769-85. [Google Scholar] [CrossRef] |
| 44. | Boyle EA, Li YI, Pritchard JK. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017 Jun;169:1177-1186. [Google Scholar] [CrossRef] |
| 45. | Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012 Apr;485:376-380. [Google Scholar] [CrossRef] |
| 46. | Zuin J, Dixon JR, van der Reijden MIJA, Ye Z, Kolovos P, Brouwer RWW, et al. Cohesin and CTCF differentially affect chromatin architecture and gene expression in human cells. Proc Natl Acad Sci U S A. 2014 Jan 21;111(3):996-1001. [Google Scholar] [CrossRef] |
| 47. | Tan T, Jayashankar H, Guan J, Nehzati S, Mir M, Bennett M, et al. Family-GWAS reveals effects of environment and mating on genetic associations. medRxiv [Preprint]. 2025 Oct 14;2024.10.01.24314703. [Google Scholar] [CrossRef] |
| 48. | Gibson G. Epistasis and Pleiotropy as Natural Properties of Transcriptional Regulation. Theor Popul Biol. 1996 Feb;49(1):58-89. [Google Scholar] [CrossRef] |
| 49. | Sidorenko J, Couvy-Duchesne B, Kemper KE, Moen GH, Bhatta L, Åsvold BO, et al. Genetic architecture reconciles linkage and association studies of complex traits. Nat Genet. 2024 Oct;56:2352-2360. [Google Scholar] [CrossRef] |
| 50. | Pattillo Smith S, Darnell G, Udwin D, Stamp J, Harpak A, Ramachandran S, et al. Discovering non-additive heritability using additive GWAS summary statistics. Elife. 2024 Jun 24;13:e90459. [Google Scholar] [CrossRef] |
| 51. | Mostafavi H, Spence JP, Naqvi S, Pritchard JK. Systematic differences in discovery of genetic effects on gene expression and complex traits. Nat Genet. 2023 Nov;55(11):1866-1875. [Google Scholar] [CrossRef] |
| 52. | Hemani G, Knott S, Haley C. An Evolutionary Perspective on Epistasis and the Missing Heritability. PLoS Genet. 2013 Feb;9(2):e1003295. [Google Scholar] [CrossRef] |
| 53. | Lee AJ, Henderson CR. Components in genetic variance in milk yield. J Dairy Sci. 1969 Jun;52(6):780-789. [Google Scholar] [CrossRef] |
| 54. | Young AI, Durbin R. Estimation of epistatic variance components and heritability in founder populations and crosses. Genetics. 2014 Dec;198(4):1405-1416. [Google Scholar] [CrossRef] |
| 55. | Eftedal NH, Eilertsen EM, Sunde HF, Kleppestø TH, Ystrom E, Czajkowski NO. Beyond additive genetic effects: Explaining family resemblance in school performance across millions of pairs of Norwegian relatives. Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2419627122. [Google Scholar] [CrossRef] |
| 56. | Hunter MD, McKee KL, Turkheimer E. Simulated nonlinear genetic and environmental dynamics of complex traits. Dev Psychopathol. 2023 May;35(2):662-677. [Google Scholar] [CrossRef] |
Copyright © 2025 Pivot Science Publications Corp. - unless otherwise stated | Terms and Conditions | Privacy Policy