Dominance is common in mammals and is associated with trans-acting gene expression and alternative splicing

Cui, Leilei; Yang, Bin; Xiao, Shijun; Gao, Jun; Baud, Amelie; Graham, Delyth; McBride, Martin; Dominiczak, Anna; Schafer, Sebastian; Aumatell, Regina Lopez; Mont, Carme; Teruel, Albert Fernandez; Hübner, Norbert; Flint, Jonathan; Mott, Richard; Huang, Lusheng

doi:10.1186/s13059-023-03060-2

Dominance is common in mammals and is associated with trans-acting gene expression and alternative splicing

Research
Open access
Published: 29 September 2023

Volume 24, article number 215, (2023)
Cite this article

Download PDF

You have full access to this open access article

Genome Biology Aims and scope Submit manuscript

Dominance is common in mammals and is associated with trans-acting gene expression and alternative splicing

Download PDF

Leilei Cui^1,2,3,4^na1,
Bin Yang¹^na1,
Shijun Xiao¹,
Jun Gao¹,
Amelie Baud⁵,
Delyth Graham⁶,
Martin McBride⁶,
Anna Dominiczak⁶,
Sebastian Schafer⁷,
Regina Lopez Aumatell⁸,
Carme Mont⁹,
Albert Fernandez Teruel¹⁰,
Norbert Hübner^11,12,13,
Jonathan Flint¹⁴,
Richard Mott ORCID: orcid.org/0000-0002-1022-9330² &
…
Lusheng Huang¹

2807 Accesses
1 Citation
8 Altmetric
Explore all metrics

Abstract

Background

Dominance and other non-additive genetic effects arise from the interaction between alleles, and historically these phenomena play a major role in quantitative genetics. However, most genome-wide association studies (GWAS) assume alleles act additively.

Results

We systematically investigate both dominance—here representing any non-additive within-locus interaction—and additivity across 574 physiological and gene expression traits in three mammalian stocks: F2 intercross pigs, rat heterogeneous stock, and mice heterogeneous stock. Dominance accounts for about one quarter of heritable variance across all physiological traits in all species. Hematological and immunological traits exhibit the highest dominance variance, possibly reflecting balancing selection in response to pathogens. Although most quantitative trait loci (QTLs) are detectable as additive QTLs, we identify 154, 64, and 62 novel dominance QTLs in pigs, rats, and mice respectively that are undetectable as additive QTLs. Similarly, even though most cis-acting expression QTLs are additive, gene expression exhibits a large fraction of dominance variance, and trans-acting eQTLs are enriched for dominance. Genes causal for dominance physiological QTLs are less likely to be physically linked to their QTLs but instead act via trans-acting dominance eQTLs. In addition, thousands of eQTLs are associated with alternatively spliced isoforms with complex additive and dominant architectures in heterogeneous stock rats, suggesting a possible mechanism for dominance.

Conclusions

Although heritability is predominantly additive, many mammalian genetic effects are dominant and likely arise through distinct mechanisms. It is therefore advantageous to consider both additive and dominance effects in GWAS to improve power and uncover causality.

Analyses of allele-specific gene expression in highly divergent mouse crosses identifies pervasive allelic imbalance

Article 02 March 2015

Systematic differences in discovery of genetic effects on gene expression and complex traits

Article 19 October 2023

The statistical practice of the GTEx Project: from single to multiple tissues

Article 06 August 2020

Background

Dominance arises from non-additive interactions between different alleles within a locus. The pathways that cause dominance still remain to be clarified, despite intense scrutiny [1,2,3]; suggested explanations include haplo-sufficiency (when a single working copy of one gene is sufficient for normal function) [4, 5], antimorphs (when the mutant product of one gene interacts and interferes with the normal product) [6,7,8], hypomorphs (when one allele has a partial or complete loss-of-function) [9], and antagonistic pleiotropy (when one allele is beneficial to some traits while deleterious to others) [10,11,12].

In the quantitative genetics model proposed by Fisher in 1916 [13,14,15], the genetic variance, or heritability, of a quantitative trait is partitioned into additive, dominant, and epistatic components, each of which are aggregates of many smaller contributions within and between multiple causal loci. It follows that understanding dominance at the phenotypic level depends on understanding dominance within each causal locus, and on clarifying the causal molecular pathways.

A given biallelic locus is additive if the phenotypic effect of the heterozygote is the mean of that of the two homozygotes. In this study, we define dominance to mean any non-additive within-locus interaction, classified as partial (PD), complete (CD), or over-dominance (OD), according to whether the phenotypic effect of the heterozygote lies within the range spanned by the homozygotes but is unequal to their average, or is equal to one of the homozygote effects, or is outside their range [16,17,18] (Additional file 1: Figure S1.1). Any possible additive or dominance relationship between a trait and a biallelic locus can be modeled by a combination of additive and complete dominance effects, and the presence of dominance in this wider non-additive sense is therefore testable by comparing the fit of a purely additive model to a model with both additive and dominance effects. Computational methods to detect dominance in genome-wide association studies (GWAS) have been developed by our group [19] and others [20,21,22,23,24,25,26,27].

Although most quantitative genetics studies assume additivity, dominance effects—where investigated—have been observed in GWAS, heritability estimation, genomic selection, and prediction. Crosses between inbred strains often reveal the closely related phenomenon of heterosis. Dominance quantitative trait loci (QTLs) have been mapped in animals (cattle [28,29,30,31,32,33,34], pig [35,36,37,38], sheep [39], chicken [40, 41]), plants (maize [42,43,44,45], wheat [46], rice [47], sunflower [48], Arabidopsis [49], Primulina [50]) and in a few studies in humans [51,52,53,54,55,56]. In cattle, where dominance effects have been investigated most intensively, recessive QTLs are known for lactation, growth, and developmental traits [31, 33]. Similarly in pigs, dominance QTLs are associated with the number and weight of piglets born [38], number of teats [37], meat quality [36], and growth traits [35]. In plants, dominance QTLs are associated with disease resistance (shoot fly in maize [43] and stripe rust resistance of wheat [46]) and growth (leaf orientation in maize [45], flowering time in Arabidopsis [49] and sunflower hybrids [48], and hybrid male sterility of Primulina [50]). Consistent with these observations, a large fraction of dominance heritability frequently occurs in cattle (yearling weight [57], growth [58], milk production [59], and reproduction [60]) and pigs (sow longevity [61], daily gain [62, 63], number of teats [63], backfat [63, 64], and growth [64]).

In contrast with these findings, studies in humans have generally reported that both dominance variance components are small [65,66,67,68] and dominance-associated loci [54, 56, 69] are relatively rare. One potential explanation might be the prevalence of low-frequency alleles in human and other large random mating populations. In contrast, experimental and artificially bred populations exhibit limited haplotype diversity and higher allele frequencies. The power to detect dominance QTLs and to predict dominance phenotypes depends critically on the frequency of the rarer of the two homozygote genotypes and is consequently attenuated at lower allele frequencies, as shown in a recent study of recessive human disease [55].

There are also practical reasons why dominance is often ignored. First, modelling dominance requires extra degrees of freedom in fixed effects models, potentially reducing the power to detect purely additive effects. Second, if one is to model dominance effects using the mixed model framework, it follows that both additive and dominance variance components should be included in genetic background effects, which is computationally challenging. For these reasons, most GWAS in humans only consider additive variance components. However, where dominance heritability is large, this shortcut is potentially unsound and might reduce the power to detect genetic associations.

Another important reason to consider dominance is that understanding the relationship between dominance at the level of gene expression and at the level of physiological phenotype may be key to establishing causal mechanisms. In yeast [70, 71], plants [72], flies [73], fish [74], and mice [75] dominance gene expression is associated with trans-acting effects and with structural variations such as translocations that silence or otherwise modulate the expression [76]. In contrast, most cis-acting expression QTLs (eQTLs) are additive. These phenomena suggest how dominance might arise at the molecular level, but their prevalence in mammals is under-explored.

In this study, we systematically investigate dominance across physiological and gene expression traits in three mammalian species, namely pigs [77], rats [78], and mice [79]. These populations were chosen because of the wealth of genotypes and phenotypes available combined with gene expression measured on large subsets of the same animals. Within each population, we analyze their dominance and additive genetic architectures through variance decomposition, QTL and eQTL mapping. Additionally, in the rats, we use RNA-seq data to relate dominance to the expression of alternative isoforms, revealing a novel potential mechanism for dominance.

Results

Population characteristics

We integrated and analyzed published multi-phenotype and multi-omics data from three mammalian stocks: (1) F2 intercross pigs (hereafter F2 pigs) [77], containing 1005 progeny derived from 2 White Duroc boars mated with 17 Erhualian sows, with 253 complex traits measured related to growth, fatness, meat quality, and blood. In addition, we analyzed their digital gene expression data of liver and muscle [80]; (2) heterogeneous stock rats (HS rats) [78], encompassing 1407 individuals descended from eight inbred founder strains (ACI/N, BN/SsN, BUF/N, F344/N, M520/N, MR/N, WKY/N and WN/N), in which 220 physiological traits and previously unpublished RNA-seq data of amygdala and heart samples were measured; (3) heterogeneous stock mice [79] (HS mice), comprising 2002 individuals descended from eight inbred founder strains (A/J, AKR/J, BALBc/J, CBA/J, C3H/HeJ, C57BL/6 J, DBA/2 J, and LP/J), with measurements of 125 physiological traits and microarray gene expression data of hippocampus, liver, and lung [81]. The chromosomes of the HS rats and mice are fine-grained mosaics of their respective inbred founder strains. The F2 pigs are not descended from inbred founders because individuals from the same pig breed are not genetically identical. The populations and datasets used, including the traits mapped in each population are summarized in Additional file 2: Table S1 and Table S2.

Each phenotype was normalized, and the effects of covariates removed, as described previously [77,78,79]. All subsequent analyses used these normalized residuals. In each population, we removed SNPs with minor allele frequency (MAF) < 0.05, missing rate > 0.1 or if the rarest genotype occurred in fewer than 10 individuals. The numbers of SNPs passing these quality control steps was respectively 39,298 (pig), 244,786 (rat), and 9142 (mouse).

Dominance accounts for about a quarter of genetic variance of organismal traits

We used these SNP sets to construct additive and dominance genetic relationship matrices (GRMs) and performed quantitative genetic analysis. We dissected the contributions of dominance to the 584 organismal traits measured across the three populations, by simultaneously estimating both additive \(\left({V}_{a}\right)\) and dominance \(\left({V}_{d}\right)\) genetic variance components from the GRMs in each trait using GCTA [65]. Each phenotype was first adjusted to remove covariates and scaled to have unit variance, so these components also represent heritabilities (Additional file 3: Table S2). The relationships between \({V}_{a}\) and \({V}_{d}\) in each population are shown as scatter plots (Fig. 1a–c) and bar plots (Additional file 1: Fig. S1.2 a-c). Across 425 traits with nonzero dominance variance (\({V}_{d}>0.05\)), \({V}_{a}/{V}_{d}\approx 3\), i.e., dominance accounts for about one quarter of the genetic variance. In F2 pigs, \({\overline{V} }_{a}\):\({\overline{V} }_{d}=0.33:0.11\) across \(n=163\) traits. HS rats exhibit slightly higher average heritabilities (\({\overline{V} }_{a}:{\overline{V} }_{d}=0.48\):\(0.24\), \(n=187\)), while in the HS mice they are lower (\({\overline{V} }_{a}: {\overline{V} }_{d}=0.22\):\(0.08\), \(n=75\)).

In general, additive genetic effects explain more phenotypic variance than dominance effects across all three populations, in line with previous studies [77,78,79]. However, many traits have important dominance contributions. In pigs, rats and mice, respectively, there are 52, 143, and 15 traits (21.6%, 69.8%, 12.2%) where \({V}_{d}>0.15\), and 27, 8, and 13 (11.3%, 3.9%, 10.6%) where \({V}_{d}>{V}_{a}\) (Additional file 3: Table S2 and Additional file 1: Fig. S1.2). In F2 pigs, fatness, growth, and hematology-related traits exhibit the largest dominance effects (Fig. 1a, g), while in rats and mice, dominance is most noticeable in immunology, hematology, and serum biochemistry-related traits (Fig. 1b,c, h–j). We show below that dominance and over-dominance QTLs are more often seen in those traits with higher dominance variance components.

Mapping dominance QTLs improves GWAS sensitivity

We used ADDO [19] to perform a mixed model GWAS for each organismal trait, modelling both additive and dominance fixed effects at each focal SNP and including additive, dominance, and environmental variance components simultaneously to model background effects. This Add-Dom (or “AD”) Model exhibits better calibration of P-values than a model solely including additive effects (named the Add or “A-Model”) [19]. At each SNP, we compared the AD Model to either a null model with neither additive nor dominance QTL or to a model with additive SNP effect only, but fitting both AD variance components. We applied two Bonferroni P-value significance thresholds, (i) approximate 5% genome-wide significance (0.05/N_SNP) and (ii) suggestive significance (1/N_SNP, corresponding to \(-{\mathrm{log}}_{10}P=4.5, 4.8, 3.9\) in pigs, rats and mice respectively).

We adapted a long-standing definition—the degree of dominance [16, 18]—to classify QTLs. We first computed the absolute values of the ratios of the T-statistics of the additive (\({t}_{Add}\)) and dominance (\({t}_{Dom}\)) effect estimates at each QTL, and then applied the following classification thresholds; additive (A-QTL): 0–0.2; partial-dominance (PD-QTL): 0.2–0.8; complete-dominance (CD-QTL): 0.8–1.2; over-dominance (OD-QTL): > 1.2. We recognize that since the ratio \(\left|{t}_{Dom}/{t}_{Add}\right|\) is continuous, the classification boundaries are arbitrary. These thresholds are shown graphically in Fig. 2 and Additional file 1: Fig. S2.1. For a clearer visualization—but which does not change the classifications—we plot the log transformed ratios \({\mathrm{log}}_{2}\left|{t}_{Dom}/{t}_{Add}\right|\) in Fig. 2a–c. Un-transformed plots are shown in Additional file 1: Fig. S2.1 a-f.

Using suggestive genome-wide significance thresholds, the AD Model detected 352 QTLs for 182 F2 pig traits, 179 QTLs for 119 HS rat traits, and 116 QTLs for 73 HS mice traits (Fig. 2d–f and Additional file 4: Table S3), of which 137, 87, and 31 QTLs were genome-wide significant, with average logP-thresholds of 5.8, 6.1, and 5.2 for pig, rat, and mouse, respectively. We report results for the top SNP at each QTL in Additional file 5: Table S4. The \({\mathrm{log}}_{2}\left|{t}_{Dom}/{t}_{Add}\right|\) ratios for each QTL, categorized by population and class of phenotype, are plotted in Fig. 2a–c, and the genome-wide distributions of QTLs shown as porcupine plots in Additional file 1: Fig. S2.1 g-i. Similar proportions of QTL types occur in each population, and dominance QTLs are common throughout (Fig. 2d–f). On average, 16% of suggestive QTLs are complete-dominant and 28.3% over-dominant.

The AD Model has consistently greater power to detect QTLs (Additional file 1: Fig. S2.1 d-f) compared to the A Model (Additional file 1: Fig. S2.1 a-c), especially for CD and OD QTLs. Among all suggestive QTLs, 44.3% are detected by the AD Model but absent from the A Model (Fig. 2h–j, 43.8%, 35.8%, 53.4% for pig, rat, and mouse). These comprise 100 (98%) OD QTLs and 39 (70.9%) CD QTLs in F2 pigs, with similar counts of 15 (57.7%) and 35 (87.5%) in HS rats, and 16 (76.2%) and 34 (87.2%) in HS mice. In addition, the AD Model improved the − log₁₀(P) values of 119 (33.8%) pig QTLs by more than 4 units compared to the A model, and similarly in for 52 (29.1%) rat QTLs and 40 (34.5%) mouse QTLs (Additional file 5: Table S4). Most newly detected or improved QTLs relate to hematology and immunology traits, consistent with the variance decomposition results.

We also investigated the power of the “D Model” which mapped purely dominant QTLs (Methods). Among suggestive SNPs detected by the AD, A, or D Models (Additional file 1: Fig. S2.2 a-c), there was a large increase in − log₁₀(P) values of significant SNPs (either uniquely or concurrently) detected by AD compared to both A (Additional file 1: Fig. S2.2 d-f) and D Models (Additional file 1: Fig. S2.2 g-i). Thus, the AD model is uniformly more powerful than either simpler model and improves the detection and resolution of QTLs.

We show representative examples of six QTLs that are either significantly improved or only detectable by the AD model, for F2 pigs (Fig. 3a, b), HS rats (Fig. 3c, d), and HS mice (Fig. 3e, f). The dominance classifications (represented by colors in Fig. 3) of SNPs in linkage disequilibrium are generally similar. Further examples for F2 Pigs are shown in Additional file 1: Fig. S3.1 (meat quality traits), and for HS rats in Additional file 1: Fig. S3.2 (immune cell traits), and for HS mice in Additional file 1: Fig. S3.3 (immune cell traits). In the latter case, two dominance QTLs related to mouse T cell traits each localize to a potential causal gene Bat3, which is over 20 units of − log₁₀(P) values more significant than the QTL found by the A-model in the neighboring gene Myo1f. Conditional QTL mapping analysis shows that Bat3 and Myo1f are associated with nearby but unlinked SNPs (Additional file 1: Fig. S3.4) and therefore represent independent effects.

For each trait, we aggregated the variances explained by the peak SNPs at all independent QTLs, partitioned into additive (\({V}_{a\_QTL}\)) and dominance (\({V}_{d\_QTL}\)) contributions (Fig. 1d–f), and compared them with the genome-wide variance components computed using GCTA (Fig. 1a–c). Each population showed the expected missing heritability, where less variance was explained by QTLs than by all SNPs.

We identified pleiotropic dominance QTL hotspots (18 in pig, 9 in rat, and 9 in mouse; Additional file 6: Table S5-1 to S5-3). In F2 pigs, chr7: 34.8 Mb-35.1 Mb is associated with many growth-related traits (e.g., ear weight, bone length, skin thickness, and carcass length), and the over-dominant hotspot chr11: 16.3–66.8 Mb is associated with many pork color traits, serum glucose level, and hematocrit. Similarly, in HS rats, the hotspot at chr9: 3.78–4.67 Mb—distinct from the rat major histocompatibility complex (MHC) locus—is associated with immunology traits. In HS mice, a hotspot in chr9: 72–111 Mb is associated with hematological traits while the MHC hotspot chr17: 33.7–37.2 Mb is associated with immunological traits.

Among traits measured on more than one species, there are six groups of potentially homologous QTLs across all three species (Additional file 7: Table S6), namely (i) ratio of CD4 + cells to CD8 + cells of rat and mouse, (ii) heart weight of pig and rat, (iii) body growth or body weight of three species, (iv) glucose tolerance of rat and mouse, (v) various serum biochemistry-related traits of rat and mouse (e.g., HDL, LDL, cholesterol, urea), and (vi) hematology traits in all three species (e.g., HCT, HGB, MCH, MCHC, MCV, PCT, RDW). The most significant examples are for MCV (Additional file 1: Fig. S4.1) and MCH (Additional file 1: Fig. S4.2). Many other examples replicate between two species (Additional file 1: Fig. S4.3). For example, within the syntenic rat and mouse MHC regions we observe dominance for many immunological traits. Dominance loci in pigs do not appear to be syntenic with those in rodents, as no F2 pig immune system traits were available for analysis. We annotated the peak SNPs of each QTL using the Variant Effect Predictor (VEP) tool [82], to predict functional consequences (Additional file 5: Table S4).

Gene expression is strongly influenced by dominance effects

We next investigated the impact of dominance on gene expression. We evaluated seven tissues across the three populations: F2 pigs (liver and muscle), HS rats (amygdala and heart), and HS mice (hippocampus, liver, and lung), via variance decomposition with GCTA and eQTL mapping with ADDO. For rat RNA-seq data, we made separate analyses for gene and isoform expression, where a gene’s expression level is defined as the sum of all its constituent isoform levels.

The heritability of most expression traits was lower than for physiological traits, but surprisingly a larger fraction of the variance was accounted for by dominance, even though, as we describe below, most cis-eQTLs are additive. Figure 4a shows the averaged relative proportions of gene expression variances across species and tissues in comparison with the physiological phenotypes shown in Fig. 1d–f. In pigs, additive variance components \({V}_{a}\) were generally larger than dominance components \({V}_{d}\). Interestingly, the reverse is the case in rats and mice. Although the per-gene standard error of each estimated variance component is large, t-tests of the mean differences between \({V}_{a}\) and \({V}_{d}\) across genes are significant (P-value < 10⁻⁴).

To eliminate instabilities when both variance components are very small, we repeated the analysis restricted to genes where both \({V}_{a}>0.05\) and \({V}_{d}>0.05\) (each expression trait was first standardized to have unit variance). We observed \({V}_{d}>{V}_{a}\) in 33.6% pig, 67.8% rat, and 56.3% mouse genes (Additional file 8: Table S7). Numerous genes also exhibited very high dominance (\({V}_{d}>0.15\)), including 405 liver and 295 muscle genes in pig; 1651 amygdala and 9491 heart genes in rat; 887 hippocampus, 2540 liver and 2482 lung genes in mouse.

We mapped thousands of eQTLs (Additional file 9: Table S8-1, Additional file 10: Table S9-1 to S9-9 and Fig. 4b–e). We applied thresholds ranging from lax (− log₁₀(P) = 5.8, 6.3, 5.1 for pig, rat and mouse) to stringent (− log₁₀(P) = 8.5) to characterize the variable impact of dominance on cis and trans-eQTLs. The proportions of dominant eQTLs at different thresholds are presented in Additional file 9: Table S8-2. At lax thresholds, many trans-eQTLs are dominant; 42.3%, 81.1%, and 71.1% in pig, rat, and mouse. At more stringent thresholds—and therefore among fewer eQTLs—dominant trans-eQTLs are less common but still abundant (5.7, 61.5, and 44.7% respectively).

Trans-acting enrichment among dominance eQTLs

We cross-tabulated additivity vs dominance against trans vs cis-eQTLs using Fisher exact tests. To simplify results, we grouped additive and partial-dominant eQTLs as “generalized additive eQTLs” (G-Add eQTL) and complete- and over-dominant eQTLs as “generalized dominance eQTLs” (G-Dom eQTL). Overall, and consistent with other studies, most cis-eQTLs are additive while trans-eQTLs are enriched for dominance effects, although it is not the case that most trans-eQTLs are dominant (Additional file 11: Table S10-1 and S10-2). Across a range of thresholds (from (− log₁₀(P) = 5.5 to 8.5), there are statistically significant enrichments. For example, at threshold 5.5 all the P-values across all seven tissues are < 0.0001.

The genomic distributions of isoform-level HS rat eQTLs are shown in Fig. 5, and similar results for gene-level eQTLs in rats, pigs, and mice are in Additional file 1: Fig. S5.1-S5.3. We show the positions of eQTLs SNPs vs their associated isoforms (Fig. 5a–j), filtered by dominance type in HS rat amygdala and heart. There is a strong diagonal band of G-Add cis-eQTLs whereas G-Dom eQTLs are evenly distributed, not-withstanding the presence of several vertical hotspots. The phenomenon is most noticeable using lax thresholds (− log₁₀(P) = 4.7) where eQTLs are more numerous, but it persists at more stringent thresholds (Fig. 5k, Additional file 11: Table S10-1 and S10-2). Overall, at lax thresholds, 94.8% (amygdala) and 94.7% (heart) of isoform cis-eQTLs are G-Add. In contrast, 99.3 and 99.2% dominant isoform eQTLs are trans-acting.

eQTL hotspots are enriched for dominance effects

Trans-eQTL hotspots (i.e., where many trans-eQTL confidence intervals overlap) are ubiquitous and strongly enriched for dominance effects. For example, among suggestive significant pig eQTLs, there are 11,596 and 12,157 overlapping eQTLs that localize to just 594 and 606 separate regions in liver and muscle respectively (overlaps between eQTLs employed the 2-LOD drop method to define eQTL confidence intervals). Hotspots are summarized in Additional file 12: Table S11-1 to S11-9, where we report only high-significance eQTLs (− log₁₀(P) > 10) but which tag most of the hotspots (2333 liver and 2088 muscle pig eQTLs from 537 and 479 hotspots). We detected a representative dominance gene-level hotspot at chr10: 85–86 Mb in HS rat heart (Figs. 6 and 7). This hotspot is complex: it has two cis-eQTLs for the transcription factors (a) Tbx21 (over-dominant, Fig. 6) and (e) Nfe2l1 (additive, Fig. 7) that link to six trans-eQTLs. The correlation between Tbx21 and Nfe2l1 expression levels is low (only − 0.11 and − 0.16 at gene and isoform level, respectively) suggesting these genes act independently. Interestingly, scatter plots of the expression of the genes underlying these trans and cis-eQTLs (Figs. 6b–d and 7b-d) suggest each cis-eQTL is associated with distinct trans-eQTLs (Klrb1, Znf683, Cdh17 with Tbx21, Gzmb, Cd160, F1lnm2, with Nfe2l1). Additional file 1: Fig. S6.1 shows the corresponding transcript-level hotspot and Additional file 1: Fig. S6.3-S6.6 show further examples.

In addition, we detected hundreds of colocalized dominance eQTLs that regulate the same gene in different tissues within each population (tissue-consistent eQTLs, tc-eQTLs) in contrast to others that regulate different genes in different tissues (tissue-specific eQTLs, ts-eQTLs), tabulated in Additional file 13: Table S12-1 to S12-6 and summarized in Additional file 13: Table S12-7. Many hotspot-related eQTL are also present in these lists. Representative examples of cross and within-tissue eQTLs are shown in Additional file 1: Fig. S6.7.

Dominance enrichment among genes with multiple isoforms

HS rat gene expression was measured by RNA-seq, which made it possible to distinguish expression levels of alternative isoforms, and to investigate the dominance enrichment of isoform-based eQTLs. Using the Rn4 reference annotations of 34,721 transcripts in 24,688 genes, 74.5% of genes express only one known isoform, while 22.2% (5489 genes) express two or three isoforms (Additional file 14: Table S13-2). In amygdala, we detected a higher proportion of G-Dom eQTLs among genes with multiple isoforms (ratio of G-Dom: G-Add eQTLs = 3.23, Additional file 14: Table S13-1) compared to genes with only one isoform (G-Dom: G-Add = 2.16). Chi-squared tests of enrichment were significant in both amygdala (P = 7.9 × 10⁻¹¹) and heart (P = 5.1 × 10⁻³). Additional file 14: Tables S13-3 to S13-8 list the QTL positions for each gene with their corresponding isoforms, including 521 amygdala and 828 heart gene-based eQTLs with two or three isoforms. Additional file 14: Tables S13-9 and S13-10 show instances of antagonistic and synergistic isoform expression in amygdala and heart.

Different isoforms of the same gene are frequently associated with different SNPs; across the 4086 rat genes with exactly two isoforms, 1257 amygdala and 1785 heart genes contain isoform-based eQTLs, but only 34 amygdala and 94 heart genes share eQTLs for both isoforms. The relative expression of different isoforms for the same gene can be either antagonistic (Fig. 8 and Additional file 1: Fig. S7.1) or synergistic (Additional file 1: Fig. S7.2). We show examples from rat amygdala of antagonistic isoforms (Foxj2, Fig. 8b–c; Atp5g2, Additional file 1: Fig S7 h-i). In both cases, overall gene expression is not heritable while each constituent isoform is under strong genetic control. Scatter plots of corresponding isoform levels (color-coded by genotype) illustrate the antagonistic effects of a SNP on different isoform from the same gene (Fig. 8d–f, j–l). We show two further antagonistic examples for Rpl14 and LFI44 in Additional file 1: Fig. S7.1 replicated in both rat amygdala and heart. Four examples of synergistic isoform pairs are shown in Additional file 1: Fig. S7.2, namely Crot and Slc39a12 in amygdala and Sppl2a and Rt1-m6-2 in heart.

Remarkably 3942 amygdala and 5659 heart genes have no gene-level eQTL but exhibit transcript-level eQTLs, and this trend increases in genes with more isoforms; across gene sets with 1/2/3 isoforms we observed respectively 13.1, 25.9, and 33.6% cases in amygdala and 20.2, 34.1, and 39.8% in heart. G-Dom eQTLs explain 82.6 and 78.8% of these cases in amygdala and heart respectively. Overall, we observed statistically significant enrichments of dominance effects in genes with multiple isoforms (P = 7.9 × 10⁻¹¹ in amygdala and P = 0.0051 in heart; Additional file 14: Table S13-1), thereby suggesting the expression of multiple isoforms is one route by which dominance gene expression might arise.

Discussion

Understanding the prevalence, causes, and consequences of dominance not only improves power to detect associations but also clarifies the genetic architecture of complex traits [49, 53, 83]. In this study, across a range of mammals, and phenotype classes, we have shown that dominance is common; about a quarter of the heritability of diverse physiological traits is attributable to dominance, and there are non-additive effects of various types at over half of the QTLs we mapped.

There is also an enrichment of trans-acting effects among dominance eQTLs, which has been previously reported in yeast [70, 71], plant [72], fly [73], fish [74], and mouse [75]. Further, there is enrichment for dominance among genes that express multiple isoforms. To our knowledge, this is a novel finding, although the complex genetic architecture of isoform control has also been reported in humans [84].

Dominance analysis methodologies, such as that implemented in the ADDO toolkit [19] used here, are generally applicable to the selective breeding of animals [85, 86] and plants [87, 88] and to the study of certain human diseases. For example, many Mendelian blood disorders are dominant or recessive [89]. Although dominance studies of human traits have discovered few novel associations—partly due to the ubiquity of rare alleles—there some exceptions, e.g., for age-related [53] and eye diseases [52] and for blood corpuscle measurements [56].

We suggest there are three advantages to modelling both additive and dominance genetic effects as done here. First, the strategy detects more genetic associations (Additional file 1: Fig. S2.1 and Fig. S2.2). We mapped 44.3% more associations than additive modelling alone, despite the burden of fitting additional parameters. Most of these novel QTLs are complete (68%) or over (90.9%) dominant (Additional file 1: Fig. 2). This effect is greatest in immunological and hematological traits, where some QTLs have negligible additive signal.

Second, it reveals insights into the genetic architecture of complex traits. For example, the excess of partial-dominant and over-dominant QTLs compared to complete-dominant QTLs suggests that heterosis may be caused by polygenic heterozygote advantage (the over-dominance hypothesis) rather than being driven by the superiority of a few dominant alleles over deleterious recessive alleles (the dominance hypothesis). Heterosis—which is closely related to dominance [90, 91]—is of great importance in animal improvement and was the motivation behind breeding the F2 pig population used here.

Third, integrated modelling of dominance across organismal and gene and isoform expression traits may suggest causal mechanisms. We conjecture that physiological dominance in some cases arises from the complex genetic control of alternative isoforms, as evidenced by many instances where the relationship between phenotype and genotype appears to be mediated via the expression of a particular isoform rather than by overall gene expression (Fig. 8 and Fig. S7.1; Additional file 14: Table S13). We show examples of potential causal links between gene expression and physiological traits, based on co-localisation of dominance eQTLs and QTLs, in each species (Additional file 1: Fig. S8.1 to Fig. S8.3).

More generally, what mechanisms convert genotypes into additive or dominant phenotypes? Assuming a phenotype only depends on a given genotype via expression of a gene—i.e., it does not “see” the underlying genotype—must additive/dominance gene expression necessarily cause additive/dominance phenotypes?

Considering first additive gene expression, the resulting phenotype could be either additive—if that component of the phenotype variation attributable to the expression of the gene in question is proportional to that gene’s expression level—or potentially complete-dominant if the relationship is nonlinear, for example modelling a saturation or thresholding effect. On the other hand, if gene expression is dominant in any of the senses considered here, then there is a loss of information which makes it hard or impossible to invert the nonlinear dominance relationship between genotype and expression necessary to recover additive dependence of phenotype on genotype. For example, if the gene expression exhibits complete dominance, then there is no longer any distinction between the heterozygote and one of the homozygotes.

Thus, we argue that additive gene expression eQTLs could produce either additive or dominant physiological QTLs but that dominance eQTLs should only produce dominance physiological QTLs. This seems to be an important distinction, and moreover should to all types of dominance expression, including isoform. While we cannot prove causality in the examples presented in this study, prioritizing genes by dominance allows us to exclude, for example, dominance eQTLs underlying additive physiological QTLs, which are less likely to be causal. We have shown dominance eQTLs are abundant, and hence there are many opportunities for dominance physiological QTLs to arise.

Related to this question of mechanism, we observed a clear linkage between cis vs trans-eQTLs and additive vs dominant eQTLs. Specifically, most dominant eQTLs are trans-acting, and most cis-eQTLs are additive. A potential explanation is that cis-acting causal variants tend to lie in the regulatory elements of target genes, and the degree of binding is controlled by local sequence variation, thereby causing additive changes in transcriptional levels. In contrast, causal trans-acting variants are likely to be in or near distant transcription factors (TFs) that regulate the target gene. This could lead to non-additive relationships between TF concentrations and gene expression [92]. If the two chromosomes compete for a limited supply of the TF, then non-additive expression may emerge [93]. In contrast, where a trans-eQTL behaves additively, it may control the expression of a TF which binds to both chromosomes with equal efficiency, so that expression of the target gene is proportional to the amount of TF produced, e.g., nuclear factor-κB [94]. Antagonistic pleiotropy might also explain over-dominant trans-acting eQTLs, where heterozygotes express more transcriptional outputs compared to either homozygote [95]. Additionally, long-range physical interactions between promotors and enhancers [96], and the silencing effects of some trans-eQTLs [76] could also produce transcriptional nonlinearity.

Conclusions

In summary, incorporating dominance into quantitative genetic analysis uncovers important, complex, and interesting biology. We only have space here to highlight a few examples of dominance phenomena, but many more can be found in the supplementary tables and figures. Dominance analysis requires only a slightly different workflow compared to additive analysis. Hence, even though the greater part of trait heritability is typically additive, there are few if any disadvantages to searching for dominance and we recommend its routine use.

Methods

Data processing of genotypes, physiological and expression traits in F2 pigs

The F2 pigs were established by crossing Chinese Erhualian and White Duroc pigs in 1998, as described [97]. Genotypes of 1005 F2 pigs were measured using the porcine SNP60 Beadchip (Illumina) at 62,613 SNP sites, which were filtered by minor allele frequency (MAF) < 0.05, missing rate > 0.1, and minimum frequency of the rarest genotype at each locus > 10, to leave 39,298 SNPs for downstream analysis. A total of 253 complex traits were measured for growth, fatness, meat quality, basal hematology, and serum biochemistry. For each trait, we controlled for outliers by removing values more than 5 s.d. from the mean. Gene expression from 493 liver and 583 muscle samples of the same pigs was measured using digital gene expression (DGE), processed into transcript per million (TPM) values for 15,684 and 17,822 transcripts in liver and muscle respectively. The TPM values were quantile normalized and also adjusted for sex, batch, and the first ten principal components of expression data.

Data processing of genotypes, organismal, and expressional traits in HS rats

The HS rats were descended from eight inbred strains (ACI/N, BN/SsN, BUF/N, F344/N, M520/N, MR/N, WKY/N, and WN/N) [98], by rotational breeding over many generations, such that each HS rat chromosome is a mosaic of the founder genomes.

In total, the genotypes of 1407 individuals [78] were measured by a custom Affymetrix array for 257,868 SNPs, as well as a comprehensive measurement of 220 complex traits, including various complex traits related to psychology, basal hematology, basal immunology, and serum biochemistry. Transcriptome RNA sequencing (RNA-seq) for 205 amygdala and 192 heart previously unpublished samples from the same animals were analyzed in this study. We used different pipelines to quantify gene and transcript expression levels, using the same reference (Rattus norvegicus genome, Ensembl RGSC3.4) for consistency with the version used for the array genotypes. For the gene expression levels, we first aligned clean reads to the rat RGSC3.4 reference genome using STAR (v.2.5.3a), and removed duplicated reads by Picard (v.2.5.0). Next, we estimated the raw read counts of each gene using featureCounts (v.1.5.2), and normalized the counts using the Trimmed Mean of M-value (TMM) method, implemented in edgeR (v.3.20.9). For the transcript levels, we used Kallisto (v.0.43.1) to estimate the transcript per million (TPM) of all transcripts in rat RGSC3.4 genome, followed by the quantile normal transformation before GWAS analysis.

Data processing of genotypes, organismal, and expressional traits in HS mice

The HS mice originated from eight inbred progenitors (A/J, AKR/J, BALBc/J, CBA/J, C3H/HeJ, C57BL/6 J, DBA/2 J, and LP/J) [99] with a similar design as for HS rats. In total, the genotypes of 2002 individuals were measured by a custom Illumina assay for 10,168 SNPs [79]. In total, 125 traits were measured as described in [100], including basal immunological, basal hematology, and models of human disease related to anxiety, asthma, diabetes, and obesity. Gene expression data (371 hippocampus samples, 227 liver samples, and 197 lung samples) were also measured using Illumina microarray-based assays [81].

Estimation of additive and dominance variance components

We estimated additive and dominance variance components and heritabilities using GCTA [65] in a standard workflow: We first corrected the raw phenotypes regressing out covariates using lm() function in R and then standardized the residuals. We generated \({{\varvec{K}}}_{{\varvec{a}}},\boldsymbol{ }{{\varvec{K}}}_{{\varvec{d}}}\) the additive and dominant genetic relationship matrices (GRMs) of all individuals by GCTA. Pairs of individuals with absolute additive genetic correlation > 0.7 were randomly downsampled to single individuals, and the GRMs rebuilt using the remaining individuals. Finally, we calculated the variance components for additive and dominance effects using GCTA.

Detection and classification of QTLs and eQTLs

All QTL and eQTL mapping was done using the ADDO [19] toolkit. We fitted three mixed models, namely the Add-Dom (AD), Add (A), and Dom (D) models to detect, classify, and compare additive and dominance QTLs. Mixed models correct for unequal relatedness between individuals and to avoid false positive QTL calls. In brief, we model the phenotypic variance covariance matrix \({\varvec{V}}={{\varvec{K}}}_{{\varvec{a}}}{\sigma }_{a}^{2}+{{\varvec{K}}}_{{\varvec{d}}}{\sigma }_{d}^{2}+{\varvec{I}}{\sigma }_{e}^{2}\) where \({\sigma }_{a}^{2},{\sigma }_{d}^{2},{\sigma }_{e}^{2}\) are the additive, dominant, and environmental variance components estimated by GCTA. We multiply the phenotype vector and fixed effects design matrix by the matrix \({{\varvec{V}}}^{-0.5}\) to convert the mixed model to ordinary least squares with iid errors. To test association as a specific SNP with genotypes AA/AB/BB, we consider the following linear model fixed effect design matrices:

(1)
“Add (A) Model”: The three genotypes are recoded as 0/1/2 within each locus, i.e., as additive genotype dosages, in order to model additive genetic effects. The design matrix is thus a column of ones (for the intercept) and a column of genotype dosages. Except that the variance matrix incorporates dominance effects, the A model is equivalent to the usual additive model used in mixed model GWAS.
(2)
“Dom (D) Model”: Genotypes AA/AB/BB are coded as heterozygote dosages 0/1/0, to detect loci where the effect of heterozygote AB is from the mean effect of two homozygotes AA and BB. The design matrix is a column of ones and a column of heterozygote dosages.
(3)
“Add-Dom (AD) Model”: Genotypes are coded as two columns 0/1/2 (additive dosage) and 0/1/0 (heterozygote dosage). The design matrix is a column of ones and both of these columns, and which can model any type of dominance effect by suitable choice of the regression coefficients \({\beta }_{a},{\beta }_{d}\) corresponding to the additive and dominance columns respectively. We computed the T-statistics \({t}_{Add},{t}_{Dom}\) for these coefficients by dividing each by their estimated standard errors.
(4)
“AvsAD Model”: We used ANOVA to compare the A Model and AD Model, to detect loci with significant non-additive effect. Statistical significance was reported as the negative base 10 log p-value of the ANOVA comparison of the models.

We used two p-value thresholds to report significant QTLs: (1) a suggestive threshold (1/N_SNP), where N_SNP is the number of SNPs. Under the null hypothesis where no SNPs are associated, one false positive is expected per genome scan; (2) whole genome-wide significance (0.05/N_SNP), in order to control for false positives caused by multiple tests, and where a false positive should occur once in every 20 GWAS. The width of each QTL was determined using 2 point LOD drop from the peak SNP at the QTL.

We classified each QTL into a dominance type (A: additive/PD: partial-dominant/CD: complete-dominant/OD: over-dominant), based on the log ratio of T-statistics from the Add-Dom Model, i.e., \(r={\mathrm{log}}_{2}|{t}_{Dom}/{t}_{Add}|\), then all the significant QTLs could be classified into four groups using the rules A: r < 0.2, PD: 0.2 < r < 0.8, CD: 0.8 < r < 1.2 or OD: r > 1.2. For greater clarity we plotted ratios as log₂(r).

eQTLs were classified as cis-acting if the eQTL localized to the same chromosome with its target gene and the minimum of left and right boundary distances between “Peak SNP” and “gene physical range” < 2 Mb; otherwise, it was classified as trans-acting.

Availability of data and materials

The rat RNA-seq data have been deposited at ENA Biostudies with the accession number PRJEB60349 [101] and PRJEB60407 [102]. All other data used in this study are from previously published studies (pig [81, 103], rat [78], mouse [79, 81, 104]) and available as described.

All the codes used for variance decomposition, QTL mapping, RNA-seq, eQTL mapping, and variant annotation as well as the respective quality control of raw phenotype and genotype, results summary, comparison, and visualization scripts are available under the GPL-3.0 license on Github [105] and on Zenodo [106]. The authors declare all scripts and software used in this paper are mentioned in the Methods section.

References

Wright S. Physiological and Evolutionary Theories of Dominance. Am Nat. 1934;68:24–53.
Google Scholar
Kacser H, Burns JA. The molecular basis of dominance. Genetics. 1981;97:639–66.
CAS PubMed Central PubMed Google Scholar
Wilkie AO. The molecular basis of genetic dominance. J Med Genet. 1994;31:89–98.
CAS PubMed Central PubMed Google Scholar
Qian W, Zhang J. Gene dosage and gene duplicability. Genetics. 2008;179:2319–24.
PubMed Central PubMed Google Scholar
Huber CD, Durvasula A, Hancock AM, Lohmueller KE. Gene expression drives the evolution of dominance. Nat Commun. 2018;9:2750.
PubMed Central PubMed Google Scholar
Sijacic P, Wang W, Liu Z. Recessive antimorphic alleles overcome functionally redundant loci to reveal TSO1 function in Arabidopsis flowers and meristems. PLoS Genet. 2011;7: e1002352.
CAS PubMed Central PubMed Google Scholar
Boettcher S, Miller PG, Sharma R, McConkey M, Leventhal M, Krivtsov AV, Giacomelli AO, Wong W, Kim J, Chao S, et al. A dominant-negative effect drives selection of TP53 missense mutations in myeloid malignancies. Science. 2019;365:599–604.
CAS PubMed Central PubMed Google Scholar
Fertuzinhos S, Legue E, Li D, Liem KF Jr. A dominant tubulin mutation causes cerebellar neurodegeneration in a genetic model of tubulinopathy. Sci Adv. 2022;8:eabf7262.
CAS PubMed Central PubMed Google Scholar
Qi H, Zhang H, Zhao Y, Chen C, Long JJ, Chung WK, Guan Y, Shen Y. MVP predicts the pathogenicity of missense variants by deep learning. Nat Commun. 2021;12:510.
CAS PubMed Central PubMed Google Scholar
Carter AJR, Nguyen AQ. Antagonistic pleiotropy as a widespread mechanism for the maintenance of polymorphic disease alleles. BMC Med Genet. 2011;12:160.
CAS PubMed Central PubMed Google Scholar
LaFountain AM, Chen W, Sun W, Chen S, Frank HA, Ding B, Yuan YW: Molecular Basis of Overdominance at a Flower Color Locus. G3 (Bethesda) 2017;7:3947–3954.
Merot C, Llaurens V, Normandeau E, Bernatchez L, Wellenreuther M. Balancing selection via life-history trade-offs maintains an inversion polymorphism in a seaweed fly. Nat Commun. 2020;11:670.
CAS PubMed Central PubMed Google Scholar
Fisher RA. The correlation between relatives on the supposition of mendelian inheritance. Trans R Soc Edinb. 1918;53:399–433.
Google Scholar
Fisher RA. The causes of human variability. Eugen Rev. 1919;10:213–20.
CAS PubMed Central PubMed Google Scholar
Visscher PM, Goddard ME: From R.A. Fisher’s,. Paper to GWAS a Century Later. Genetics. 1918;2019(211):1125–30.
Google Scholar
Edwards MD, Stuber CW, Wendel JF. Molecular-marker-facilitated investigations of quantitative-trait loci in maize. I. Numbers, genomic distribution and types of gene action. Genetics. 1987;116:113–25.
CAS PubMed Central PubMed Google Scholar
Edwards MD, Helentjaris T, Wright S, Stuber CW: Molecular-marker-facilitated investigations of quantitative trait loci in maize : 4. Analysis based on genome saturation with isozyme and restriction fragment length polymorphism markers. Theor Appl Genet 1992, 83:765–774.
Li L, Lu K, Chen Z, Mu T, Hu Z, Li X. Dominance, overdominance and epistasis condition the heterosis in two heterotic rice hybrids. Genetics. 2008;180:1725–42.
PubMed Central PubMed Google Scholar
Cui L, Yang B, Pontikos N, Mott R, Huang L. ADDO: a comprehensive toolkit to detect, classify and visualize additive and non-additive quantitative trait loci. Bioinformatics. 2020;36:1517–21.
CAS PubMed Google Scholar
Bi W, Kang G, Pounds SB. Statistical selection of biological models for genome-wide association analyses. Methods. 2018;145:67–75.
CAS PubMed Google Scholar
Hall MA, Wallace J, Lucas AM, Bradford Y, Verma SS, Muller-Myhsok B, Passero K, Zhou J, McGuigan J, Jiang B, et al. Novel EDGE encoding method enhances ability to identify genetic interactions. PLoS Genet. 2021;17: e1009534.
CAS PubMed Central PubMed Google Scholar
Li M, Zhang YW, Zhang ZC, Xiang Y, Liu MH, Zhou YH, Zuo JF, Zhang HQ, Chen Y, Zhang YM. A compressed variance component mixed model for detecting QTNs and QTN-by-environment and QTN-by-QTN interactions in genome-wide association studies. Mol Plant. 2022;15:630–50.
CAS PubMed Google Scholar
Gilmour AR, Thompson R, Cullis BR. Average information REML: An efficient algorithm for variance parameter estimation in linear mixed models. Biometrics. 1995;51:1440–50.
Google Scholar
Zhang F-T, Zhu Z-H, Tong X-R, Zhu Z-X, Qi T, Zhu J. Mixed Linear Model Approaches of Association Mapping for Complex Traits Based on Omics Variants. Sci Rep. 2015;5:10298.
CAS PubMed Central PubMed Google Scholar
Wellmann R, Bennewitz J. Bayesian models with dominance effects for genomic evaluation of quantitative traits. Genet Res (Camb). 2012;94:21–37.
PubMed Google Scholar
Marchini J, Howie B, Myers S, McVean G, Donnelly P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet. 2007;39:906–13.
CAS PubMed Google Scholar
Gonzalez JR, Armengol L, Sole X, Guino E, Mercader JM, Estivill X, Moreno V. SNPassoc: an R package to perform whole genome association studies. Bioinformatics. 2007;23:644–5.
PubMed Google Scholar
Li Y, Gao Y, Kim YS, Iqbal A, Kim JJ. A whole genome association study to detect additive and dominant single nucleotide polymorphisms for growth and carcass traits in Korean native cattle. Hanwoo Asian-Australas J Anim Sci. 2017;30:8–19.
CAS PubMed Google Scholar
Hiltpold M, Niu G, Kadri NK, Crysnanto D, Fang ZH, Spengeler M, Schmitz-Hsu F, Fuerst C, Schwarzenbacher H, Seefried FR, et al. Activation of cryptic splicing in bovine WDR19 is associated with reduced semen quality and male fertility. PLoS Genet. 2020;16: e1008804.
CAS PubMed Central PubMed Google Scholar
Doekes HP, Bijma P, Veerkamp RF, de Jong G, Wientjes YCJ, Windig JJ. Inbreeding depression across the genome of Dutch Holstein Friesian dairy cattle. Genet Sel Evol. 2020;52:64.
CAS PubMed Central PubMed Google Scholar
Reynolds EGM, Neeley C, Lopdell TJ, Keehan M, Dittmer K, Harland CS, Couldrey C, Johnson TJJ, Tiplady K, Worth G, et al. Non-additive association analysis using proxy phenotypes identifies novel cattle syndromes. Nat Genet. 2021;53:949–54.
CAS PubMed Google Scholar
Nagai R, Kinukawa M, Watanabe T, Ogino A, Kurogi K, Adachi K, Satoh M, Uemoto Y. Genome-wide detection of non-additive quantitative trait loci for semen production traits in beef and dairy bulls. Animal. 2022;16: 100472.
CAS PubMed Google Scholar
Reynolds EGM, Lopdell T, Wang Y, Tiplady KM, Harland CS, Johnson TJJ, Neeley C, Carnie K, Sherlock RG, Couldrey C, et al. Non-additive QTL mapping of lactation traits in 124,000 cattle reveals novel recessive loci. Genet Sel Evol. 2022;54:5.
CAS PubMed Central PubMed Google Scholar
Jiang J, Ma L, Prakapenka D, VanRaden PM, Cole JB, Da Y: A Large-Scale Genome-Wide Association Study in U.S. Holstein Cattle. Front Genet 2019, 10:412.
Yang W, Wu J, Yu J, Zheng X, Kang H, Wang Z, Zhang S, Zhou L, Liu J. A genome-wide association study reveals additive and dominance effects on growth and fatness traits in large white pigs. Anim Genet. 2021;52:749–53.
CAS PubMed Google Scholar
Stratz P, Schmid M, Wellmann R, Preuss S, Blaj I, Tetens J, Thaller G, Bennewitz J. Linkage disequilibrium pattern and genome-wide association mapping for meat traits in multiple porcine F2 crosses. Anim Genet. 2018;49:403–12.
CAS PubMed Google Scholar
Lopes MS, Bastiaansen JW, Harlizius B, Knol EF, Bovenhuis H. A genome-wide association study reveals dominance effects on number of teats in pigs. PLoS ONE. 2014;9: e105867.
PubMed Central PubMed Google Scholar
Coster A, Madsen O, Heuven HC, Dibbits B, Groenen MA, van Arendonk JA, Bovenhuis H. The imprinted gene DIO3 is a candidate gene for litter size in pigs. PLoS ONE. 2012;7: e31825.
CAS PubMed Central PubMed Google Scholar
Estrada-Reyes ZM, Rae DO, Mateescu RG. Genome-wide scan reveals important additive and non-additive genetic effects associated with resistance to Haemonchus contortus in Florida Native sheep. Int J Parasitol. 2021;51:535–43.
CAS PubMed Google Scholar
Tarsani E, Kranis A, Maniatis G, Avendano S, Hager-Theodorides AL, Kominakis A. Deciphering the mode of action and position of genetic variants impacting on egg number in broiler breeders. BMC Genom. 2020;21:512.
CAS Google Scholar
Marchesi JAP, Ono RK, Cantao ME, Ibelli AMG, Peixoto JO, Moreira GCM, Godoy TF, Coutinho LL, Munari DP, Ledur MC. Exploring the genetic architecture of feed efficiency traits in chickens. Sci Rep. 2021;11:4622.
CAS PubMed Central PubMed Google Scholar
Edwards MD, Stuber CW. Wendel JF: Molecular-Marker-Facilitated Investigations of Quantitative-Trait Loci in Maize.I. Numbers, Genomic Distribution and Types of Gene Action. Genetics. 1987;116:113–25.
CAS PubMed Central PubMed Google Scholar
Vikal Y, Kaur A, Jindal J, Kaur K, Pathak D, Garg T, Singh A, Singh P, Yadav I. Identification of genomic regions associated with shoot fly resistance in maize and their syntenic relationships in the sorghum genome. PLoS ONE. 2020;15: e0234335.
CAS PubMed Central PubMed Google Scholar
Galli G, Alves FC, Morosini JS, Fritsche-Neto R. On the usefulness of parental lines GWAS for predicting low heritability traits in tropical maize hybrids. PLoS ONE. 2020;15: e0228724.
CAS PubMed Central PubMed Google Scholar
Monir MM, Zhu J. Dominance and Epistasis Interactions Revealed as Important Variants for Leaf Traits of Maize NAM Population. Front Plant Sci. 2018;9:627.
PubMed Central PubMed Google Scholar
Beukert U, Liu G, Thorwarth P, Boeven PHG, Longin CFH, Zhao Y, Ganal M, Serfling A, Ordon F, Reif JC. The potential of hybrid breeding to enhance leaf rust and stripe rust resistance in wheat. Theor Appl Genet. 2020;133:2171–81.
CAS PubMed Central PubMed Google Scholar
Li L, Zheng X, Wang J, Zhang X, He X, Xiong L, Song S, Su J, Diao Y, Yuan Z, et al. Joint analysis of phenotype-effect-generation identifies loci associated with grain quality traits in rice hybrids. Nat Commun. 2023;14:3930.
CAS PubMed Central PubMed Google Scholar
Bonnafous F, Fievet G, Blanchet N, Boniface MC, Carrere S, Gouzy J, Legrand L, Marage G, Bret-Mestries E, Munos S, et al. Comparison of GWAS models to identify non-additive genetic control of flowering time in sunflower hybrids. Theor Appl Genet. 2018;131:319–32.
CAS PubMed Google Scholar
Seymour DK, Chae E, Grimm DG, Martin Pizarro C, Habring-Muller A, Vasseur F, Rakitsch B, Borgwardt KM, Koenig D, Weigel D. Genetic architecture of nonadditive inheritance in Arabidopsis thaliana hybrids. Proc Natl Acad Sci U S A. 2016;113:E7317–26.
CAS PubMed Central PubMed Google Scholar
Feng C, Yi H, Yang L, Kang M. The genetic basis of hybrid male sterility in sympatric Primulina species. BMC Evol Biol. 2020;20:49.
CAS PubMed Central PubMed Google Scholar
Powell JE, Henders AK, McRae AF, Kim J, Hemani G, Martin NG, Dermitzakis ET, Gibson G, Montgomery GW, Visscher PM. Congruence of additive and non-additive effects on gene expression estimated from pedigree and SNP data. PLoS Genet. 2013;9: e1003502.
CAS PubMed Central PubMed Google Scholar
Pozarickij A, Williams C, Guggenheim JA, and the UKBE, Vision C: Non-additive (dominance) effects of genetic variants associated with refractive error and myopia. Mol Genet Genomics 2020, 295:843–853.
Guindo-Martínez M, Amela R, Bonàs-Guarch S, Puiggròs M, Salvoro C, Miguel-Escalada I, Carey CE, Cole JB, Rüeger S, Atkinson E, et al. The impact of non-additive genetic associations on age-related complex diseases. Nat Commun. 2021;12:2436.
PubMed Central PubMed Google Scholar
Okbay A, Wu Y, Wang N, Jayashankar H, Bennett M, Nehzati SM, Sidorenko J, Kweon H, Goldman G, Gjorgjieva T, et al: Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals. Nat Genet 2022.
Heyne HO, Karjalainen J, Karczewski KJ, Lemmela SM, Zhou W. FinnGen, Havulinna AS, Kurki M, Rehm HL, Palotie A, Daly MJ: Mono- and biallelic variant effects on disease at biobank scale. Nature. 2023;613:519–25.
CAS PubMed Central PubMed Google Scholar
Palmer DS, Zhou W, Abbott L, Wigdor EM, Baya N, Churchhouse C, Seed C, Poterba T, King D, Kanai M, et al. Analysis of genetic dominance in the UK Biobank. Science. 2023;379:1341–8.
CAS PubMed Central PubMed Google Scholar
Raidan FSS, Porto-Neto LR, Li Y, Lehnert SA, Vitezica ZG, Reverter A. Evaluation of nonadditive effects in yearling weight of tropical beef cattle. J Anim Sci. 2018;96:4028–34.
PubMed Central PubMed Google Scholar
Akanno EC, Abo-Ismail MK, Chen L, Crowley JJ, Wang Z, Li C, Basarab JA, MacNeil MD, Plastow GS. Modeling heterotic effects in beef cattle using genome-wide SNP-marker genotypes. J Anim Sci. 2018;96:830–45.
PubMed Central PubMed Google Scholar
Ertl J, Legarra A, Vitezica ZG, Varona L, Edel C, Emmerling R, Gotz KU. Genomic analysis of dominance effects on milk production and conformation traits in Fleckvieh cattle. Genet Sel Evol. 2014;46:40.
PubMed Central PubMed Google Scholar
Alves K, Brito LF, Baes CF, Sargolzaei M, Robinson JAB, Schenkel FS. Estimation of additive and non-additive genetic effects for fertility and reproduction traits in North American Holstein cattle using genomic information. J Anim Breed Genet. 2020;137:316–30.
CAS PubMed Google Scholar
Serenius T, Stalder KJ, Puonti M. Impact of dominance effects on sow longevity. J Anim Breed Genet. 2006;123:355–61.
CAS PubMed Google Scholar
Su G, Christensen OF, Ostersen T, Henryon M, Lund MS. Estimating additive and non-additive genetic variances and predicting genetic merits using genome-wide dense single nucleotide polymorphism markers. PLoS ONE. 2012;7: e45293.
CAS PubMed Central PubMed Google Scholar
Lopes MS, Bastiaansen JW, Janss L, Knol EF, Bovenhuis H: Estimation of Additive, Dominance, and Imprinting Genetic Variance Using Genomic Data. G3 (Bethesda) 2015, 5:2629–2637.
Costa EV, Diniz DB, Veroneze R, Resende MD, Azevedo CF, Guimaraes SE, Silva FF, Lopes PS. Estimating additive and dominance variances for complex traits in pigs combining genomic and pedigree information. Genet Mol Res. 2015;14:6303–11.
CAS PubMed Google Scholar
Zhu Z, Bakshi A, Vinkhuyzen AA, Hemani G, Lee SH, Nolte IM, van Vliet-Ostaptchouk JV, Snieder H, LifeLines Cohort S, Esko T, et al. Dominance genetic variation contributes little to the missing heritability for human complex traits. Am J Hum Genet. 2015;96:377–85.
CAS PubMed Central PubMed Google Scholar
Hivert V, Sidorenko J, Rohart F, Goddard ME, Yang J, Wray NR, Yengo L, Visscher PM. Estimation of non-additive genetic variance in human complex traits from a large sample of unrelated individuals. Am J Hum Genet. 2021;108:786–98.
CAS PubMed Central PubMed Google Scholar
Pazokitoroudi A, Chiu AM, Burch KS, Pasaniuc B, Sankararaman S. Quantifying the contribution of dominance deviation effects to complex trait variation in biobank-scale data. Am J Hum Genet. 2021;108:799–808.
CAS PubMed Central PubMed Google Scholar
Hill WG, Goddard ME, Visscher PM. Data and theory point to mainly additive genetic variance for complex traits. PLoS Genet. 2008;4: e1000008.
PubMed Central PubMed Google Scholar
Guindo-Martinez M, Amela R, Bonas-Guarch S, Puiggros M, Salvoro C, Miguel-Escalada I, Carey CE, Cole JB, Rueger S, Atkinson E, et al. The impact of non-additive genetic associations on age-related complex diseases. Nat Commun. 2021;12:2436.
CAS PubMed Central PubMed Google Scholar
Gruber JD, Vogel K, Kalay G, Wittkopp PJ. Contrasting properties of gene-specific regulatory, coding, and copy number mutations in Saccharomyces cerevisiae: frequency, effects, and dominance. PLoS Genet. 2012;8: e1002497.
CAS PubMed Central PubMed Google Scholar
Schaefke B, Emerson J, Wang T-Y. Lu M-YJ, Hsieh L-C, Li W-H: Inheritance of gene expression level and selective constraints on trans-and cis-regulatory changes in yeast. Mol Biol Evol. 2013;30:2121–33.
CAS PubMed Google Scholar
Zhang X, Cal AJ, Borevitz JO. Genetic architecture of regulatory variation in Arabidopsis thaliana. Genome Res. 2011;21:725–33.
CAS PubMed Central PubMed Google Scholar
Meiklejohn CD, Coolon JD, Hartl DL, Wittkopp PJ. The roles of cis-and trans-regulation in the evolution of regulatory incompatibilities and sexually dimorphic gene expression. Genome Res. 2014;24:84–95.
CAS PubMed Central PubMed Google Scholar
Pritchard VL, Viitaniemi HM, McCairns RS, Merilä J, Nikinmaa M, Primmer CR, Leder EH. Regulatory architecture of gene expression variation in the threespine stickleback Gasterosteus aculeatus. G3: Genes, Genomes, Genetics. 2017;G3(7):165–78.
Google Scholar
Wong ES, Schmitt BM, Kazachenka A, Thybert D, Redmond A, Connor F, Rayner TF, Feig C, Ferguson-Smith AC, Marioni JC. Interplay of cis and trans mechanisms driving transcription factor binding and gene expression evolution. Nat Commun. 2017;8:1–13.
Google Scholar
Kaisaki PJ, Otto GW, Argoud K, Collins SC, Wallis RH, Wilder SP, Yau ACY, Hue C, Calderari S, Bihoreau MT, et al: Transcriptome Profiling in Rat Inbred Strains and Experimental Cross Reveals Discrepant Genetic Architecture of Genome-Wide Gene Expression. G3 (Bethesda) 2016;6:3671–3683.
Cui L, Zhang J, Ma J, Guo Y, Li L, Xiao S, Ren J, Yang B, Huang L. Sexually dimorphic genetic architecture of complex traits in a large-scale F2 cross in pigs. Genet Sel Evol. 2014;46:76.
PubMed Central PubMed Google Scholar
Rat Genome S, Mapping C, Baud A, Hermsen R, Guryev V, Stridh P, Graham D, McBride MW, Foroud T, Calderari S, et al. Combined sequence-based and genetic mapping analysis of complex traits in outbred rats. Nat Genet. 2013;45:767–75.
Google Scholar
Valdar W, Solberg LC, Gauguier D, Burnett S, Klenerman P, Cookson WO, Taylor MS, Rawlins JN, Mott R, Flint J. Genome-wide genetic association of complex traits in heterogeneous stock mice. Nat Genet. 2006;38:879–87.
CAS PubMed Google Scholar
Zhang J, Cui L, Ma J, Chen C, Yang B, Huang L. Transcriptome analyses reveal genes and pathways associated with fatty acid composition traits in pigs. Anim Genet. 2017;48:645–52.
CAS PubMed Google Scholar
Huang GJ, Shifman S, Valdar W, Johannesson M, Yalcin B, Taylor MS, Taylor JM, Mott R, Flint J. High resolution mapping of expression QTLs in heterogeneous stock mice in multiple tissues. Genome Res. 2009;19:1133–40.
CAS PubMed Central PubMed Google Scholar
McLaren W, Pritchard B, Rios D, Chen Y, Flicek P, Cunningham F. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics. 2010;26:2069–70.
CAS PubMed Central PubMed Google Scholar
Matsui T, Mullis MN, Roy KR, Hale JJ, Schell R, Levy SF, Ehrenreich IM. The interplay of additivity, dominance, and epistasis on fitness in a diploid yeast cross. Nat Commun. 2022;13:1463.
CAS PubMed Central PubMed Google Scholar
Garrido-Martin D, Borsari B, Calvo M, Reverter F, Guigo R. Identification and analysis of splicing quantitative trait loci across multiple tissues in the human genome. Nat Commun. 2021;12:727.
CAS PubMed Central PubMed Google Scholar
Liu T, Luo C, Ma J, Wang Y, Shu D, Qu H, Su G. Including dominance effects in the prediction model through locus-specific weights on heterozygous genotypes can greatly improve genomic predictive abilities. Heredity. 2022;128:154–8.
CAS PubMed Central PubMed Google Scholar
Esfandyari H, Bijma P, Henryon M, Christensen OF, Sorensen AC. Genomic prediction of crossbred performance based on purebred Landrace and Yorkshire data using a dominance model. Genet Sel Evol. 2016;48:40.
PubMed Central PubMed Google Scholar
Wilson S, Zheng C, Maliepaard C, Mulder HA, Visser RGF, van der Burgt A, van Eeuwijk F. Understanding the Effectiveness of Genomic Prediction in Tetraploid Potato. Front Plant Sci. 2021;12: 672417.
PubMed Central PubMed Google Scholar
Ramstein GP, Larsson SJ, Cook JP, Edwards JW, Ersoz ES, Flint-Garcia S, Gardner CA, Holland JB, Lorenz AJ, McMullen MD, et al. Dominance effects and functional enrichments improve prediction of agronomic traits in hybrid maize. Genetics. 2020;215:215–30.
PubMed Central PubMed Google Scholar
Vuckovic D, Bao EL, Akbari P, Lareau CA, Mousas A, Jiang T, Chen MH, Raffield LM, Tardaguila M, Huffman JE, et al. The polygenic and monogenic basis of blood traits and diseases. Cell. 2020;182(1214–1231): e1211.
Google Scholar
Xiao Y, Jiang S, Cheng Q, Wang X, Yan J, Zhang R, Qiao F, Ma C, Luo J, Li W, et al. The genetic mechanism of heterosis utilization in maize improvement. Genome Biol. 2021;22:148.
CAS PubMed Central PubMed Google Scholar
Krieger U, Lippman ZB, Zamir D. The flowering gene SINGLE FLOWER TRUSS drives heterosis for yield in tomato. Nat Genet. 2010;42:459–63.
CAS PubMed Google Scholar
Spitz F, Furlong EE. Transcription factors: from enhancer binding to developmental control. Nat Rev Genet. 2012;13:613–26.
CAS PubMed Google Scholar
Porter AH, Johnson NA, Tulchinsky AY. A new mechanism for mendelian dominance in regulatory genetic pathways: competitive binding by transcription factors. Genetics. 2017;205:101–12.
PubMed Google Scholar
Giorgetti L, Siggers T, Tiana G, Caprara G, Notarbartolo S, Corona T, Pasparakis M, Milani P, Bulyk ML, Natoli G. Noncooperative interactions between transcription factors and clustered DNA binding sites enable graded transcriptional responses to environmental inputs. Mol Cell. 2010;37:418–28.
CAS PubMed Google Scholar
Qian W, Ma D, Xiao C, Wang Z, Zhang J. The genomic landscape and evolutionary resolution of antagonistic pleiotropy in yeast. Cell Rep. 2012;2:1399–410.
CAS PubMed Central PubMed Google Scholar
Zuin J, Roth G, Zhan Y, Cramard J, Redolfi J, Piskadlo E, Mach P, Kryzhanovska M, Tihanyi G, Kohler H, et al. Nonlinear control of transcription through enhancer-promoter interactions. Nature. 2022;604:571–7.
CAS PubMed Central PubMed Google Scholar
Guo Y, Mao H, Ren J, Yan X, Duan Y, Yang G, Ren D, Zhang Z, Yang B, Ouyang J, et al. A linkage map of the porcine genome from a large-scale White Duroc x Erhualian resource population and evaluation of factors affecting recombination rates. Anim Genet. 2009;40:47–52.
PubMed Google Scholar
Hansen C, Spuhler K. Development of the National Institutes of Health genetically heterogeneous rat stock. Alcohol Clin Exp Res. 1984;8:477–9.
CAS PubMed Google Scholar
Clearn GE, Wilson J, Meredith WM: The use of isogenic and heterogenic mouse stocks in behavioral research. P. 3–22. In g. Lindzey & d. D. Thiessen (ed.), Contrib. To behavior-genetic. 1970.
Solberg LC, Valdar W, Gauguier D, Nunez G, Taylor A, Burnett S, Arboledas-Hita C, Hernandez-Pliego P, Davidson S, Burns P, et al. A protocol for high-throughput phenotyping, suitable for quantitative trait analysis in mice. Mamm Genome. 2006;17:129–46.
PubMed Google Scholar
Baud A, Graham D, McBride M, Dominiczak A, Schafer S, Lopez Aumatell R, Mont C, Fernandez Teruel A, Hübner N, Flint J, Mott R: RNA-seq from heart tissue of 192 heterogeneous stock rats, accession E-MTAB-12701, ENA Biostudies. 2023. https://www.ebi.ac.uk/ena/browser/view/PRJEB60349.
Baud A, Graham D, McBride M, Dominiczak A, Schafer S, Lopez Aumatell R, Mont C, Fernandez Teruel A, Hübner N, Flint J, Mott R: RNA-seq from amygdala tissue of 205 heterogeneous stock rats accession E-MTAB-12693, ENA Biostudies. 2023. https://www.ebi.ac.uk/ena/browser/view/PRJEB60407.
Chen C, Yang B, Zeng Z, Yang H, Liu C, Ren J, Huang L. Genetic dissection of blood lipid traits by integrating genome-wide association study and gene expression profiling in a porcine model. BMC Genomics. 2013;14:848.
PubMed Central PubMed Google Scholar
Mott R, Valdar W, Solberg Woods L, Gauguier D, Burnett-Stuart S, Klenerman P, Cookson WO, Taylor MS, Rawlins N, Flint J, Huang GJ, Shifman S, Johannesson M, Yalcin B, Taylor JM: Genotypes, Phenotypes and Gene Expression data from 2000 Heterogeneous Stock Mice. UCL Figshare; 2023. https://doi.org/10.5522/04/24114471.
Cui L: Deposited Code and Scripts, GitHub. 2023. https://github.com/LeileiCui/Dominance_3Stocks.
Cui L: Deposited Code and Scripts, Zenodo. 2023. https://zenodo.org/record/8329264.

Download references

Acknowledgements

We acknowledge Dr. Na Cai, Dr. Michael Scott, Prof. Dallas Swallow, Prof. Adam S. Wilkins, and Prof. David Curtis for their helpful advice and suggestions. The authors also want to thank the editor and reviewers to their practical comments, which contributed to improving the manuscript.

Review history

The review history is available as Additional file 15.

Peer review information

Wenjing She was the primary editor of this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.

Funding

This work was supported by the UKRI Biotechnology and Biological Sciences Research Council (UKRI-BBSRC) (BB/S017372/1 and BB/R01356X/1). We also were supported by funding from the China Scholarship Council (No.201508360093) and the Scientific Research Training Program of Nanchang University (2022).

Author information

Leilei Cui and Bin Yang contributed equally to this work.

Authors and Affiliations

National Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, People’s Republic of China
Leilei Cui, Bin Yang, Shijun Xiao, Jun Gao & Lusheng Huang
UCL Genetics Institute, University College London, London, WC1E 6BT, UK
Leilei Cui & Richard Mott
Human Aging Research Institute and School of Life Science, Nanchang University, and Jiangxi Key Laboratory of Human Aging, Jiangxi, China
Leilei Cui
School of Life Sciences, Nanchang University, Nanchang, China
Leilei Cui
Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
Amelie Baud
BHF Glasgow Cardiovascular Research Centre, University of Glasgow, Glasgow, G12 8TA, UK
Delyth Graham, Martin McBride & Anna Dominiczak
Cardiovascular and Metabolic Disorders Program, Duke-National University of Singapore Medical School, Singapore, Singapore
Sebastian Schafer
Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona, Spain
Regina Lopez Aumatell
Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
Carme Mont
Departamento de Psiquiatría y Medicina Legal, Universitat Autonoma de Barcelona, Barcelona, Spain
Albert Fernandez Teruel
Genetics and Genomics of Cardiovascular Diseases Research Group, Max Delbrück Center (MDC) for Molecular Medicine in the Helmholtz Association, Berlin, Germany
Norbert Hübner
DZHK (German Center for Cardiovascular Research) Partner Site Berlin, Berlin, Germany
Norbert Hübner
Charité Universitätsmedizin Berlin, Berlin, Germany
Norbert Hübner
Department of Psychiatry and Behavioral Sciences, Brain Research Institute, University of California, Los Angeles, CA, USA
Jonathan Flint

Authors

Leilei Cui
View author publications
You can also search for this author in PubMed Google Scholar
Bin Yang
View author publications
You can also search for this author in PubMed Google Scholar
Shijun Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Jun Gao
View author publications
You can also search for this author in PubMed Google Scholar
Amelie Baud
View author publications
You can also search for this author in PubMed Google Scholar
Delyth Graham
View author publications
You can also search for this author in PubMed Google Scholar
Martin McBride
View author publications
You can also search for this author in PubMed Google Scholar
Anna Dominiczak
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Schafer
View author publications
You can also search for this author in PubMed Google Scholar
Regina Lopez Aumatell
View author publications
You can also search for this author in PubMed Google Scholar
Carme Mont
View author publications
You can also search for this author in PubMed Google Scholar
Albert Fernandez Teruel
View author publications
You can also search for this author in PubMed Google Scholar
Norbert Hübner
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Flint
View author publications
You can also search for this author in PubMed Google Scholar
Richard Mott
View author publications
You can also search for this author in PubMed Google Scholar
Lusheng Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

LH, RM, and YB conceived and supervised the project. LC performed the analyses and produces the figures. LC and MR wrote the manuscript. LH, MR, and YB contributed to the results interpretation and manuscript revision. LH, SX, JG, and YB contributed to the pig data used in study. MR, JF, DG, MM, AD, SS, RLA, CM, and AFT contributed the rat data used in the study. MR, JF, and AB contributed the mouse data used in the study. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Richard Mott or Lusheng Huang.

Ethics declarations

Ethics approval and consent to participte

All the data used in this study was collected in three earlier studies, each of which was approved by the corresponding local animal ethical committee, as described for F2 pigs [77], HS rats [78], and HS mice [79].

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Fig. S1.1.

Examples of dominance classifications visualized with different coordinate systems for QTL category comparison. Fig. S1.2. Additive (blue bars, ) and Dominance (red bars, ) Variance Components of all traits across three populations. Fig. S2.1. QTLs detected by AD model. Fig. S2.2. Comparison of QTL detection among A, D and AD models. Fig. S3.1. Over-dominant QTLs in F2 pigs. Fig. S3.2. Over-dominant HS rat QTLs. Fig. S3.3. Over-dominant QTLs implicate a novel causal gene regulating the proportions of CD4+ and CD8+ T cells in HS mice, compared to additive QTL modelling. Fig. S3.4. Conditional GWAS of the proportion of CD4+ cells in CD3+ cells in HS mice. Fig. S4.1. Homologous QTLs for mean corpuscular volume (MCV). Fig. S4.2. Homologous QTLs for mean corpuscular hemoglobin (MCV). Fig. S4.3. Homologous QTLs. Fig. S5.1. Trans-acting enrichment among dominant eQTLs in liver and muscle in F2 pigs. Fig. S5.2. Trans-acting enrichment among dominant eQTLs of amygdala and heart tissues in HS rats. Fig. S5.3. Trans-acting enrichment among dominant eQTLs of hippocampus, liver and lung tissues in HS mice. Fig. S6.1. Dominant cis and trans transcript eQTLs at the hotspot chr10:85Mb-86Mb in HS rat heart. Fig. S6.2. Interplay of immunology-related QTLs in HS rats. Fig. S6.3. Dominant eQTLs detected by AD model from two trans-acting hotspots (a1-a7, chr3: 147Mb-149Mb; c1-c7, chr7: 107Mb-108Mb) in HS rat heart. Fig. S6.4. Dominant eQTLs from a trans-acting hotspot in HS mice hippocampus. Fig. S6-5. Dominant eQTLs from a trans-acting hotspot located at chromosome 7 in HS mice lung. Fig. S6.6. Dominant eQTLs from a trans-acting hotspot located at chromosome 9 in HS mice lung. Fig. S6.7. Tissue-conserved and tissue-specific dominant eQTLs. Fig. S7.1. Transcript-specific antagonistic dominant eQTLs of RPL14 and LFI44 in HS rat amygdala and heart. Fig. S7.2. Transcript-specific synergistic eQTLs in HS rats.Fig. S8.1. A pleiotropic over-dominant pig QTL for kidney weight and small intestinal length colocalized with MTCH1 eQTL in F2 pigs. Fig. S8.2. Dominant QTLs for three rat hematology traits colocalized with cis/trans eQTLs in rat amygdala and heart (chr19: 53 Mb-55 Mb). Fig. S8.3. An over-dominant QTL for mouse glucose slope colocalized with eQTLs in mouse amygdala, liver and lung.

Additional file 2: Table S1-1.

Summary of datasets in F2 pigs, HS rats and HS mice, together with phenotype and genotype quality control methods applied in QTL/eQTL mapping; Table S1-2. Trait classes and total numbers.

Additional file 3: Table S2.

Phenotypic variances with standard errors, due to additive and dominance effects (calculated by genome-wide SNP heritability or from accumulated significant QTLs) of 242 physical traits in F2 pigs (Table S2-1), 206 physical traits of HS rats (Table S2-2) and 124 physical traits of HS mice (Table S2-3), together with average values across each trait class.

Additional file 4: Table S3.

Counts and proportions of significant QTLs under dominance types within each population, and the corresponding missing rate compared to the additive GWAS, under suggestive thresholds and whole genome thresholds.

Additional file 5: Table S4.

Descriptive statistics, effect classification and consequence annotation of all significant QTL detected by AD Model in F2 pigs (Table S4-1), HS rats (Table S4-2) and HS mice (Table S4-3). (Rows with -log₁₀(P)values of ADvsA Model > 5 are in red, rows where -log₁₀(P)values of AD Model > 6 are in blue.).

Additional file 6: Table S5.

Summary of novel pleiotropic QTLs detected by AD model within each population. (Different shared QTL regions are separated by graey background and the rows with red numbers indicate novel QTLs where -log₁₀(P)values of ADvsA Model > 5).

Additional file 7: Table S6.

Summary of homologous QTLs of common physical traits across three stocks. (Different trait groups are separated by frame and QTL rows of different stocks are colored differently).

Additional file 8: Table S7.

Gene expression heritabilities and standard errors explained by additive and dominance effects estimated by genome-wide SNPs within each tissue of F2 pigs (Table S7-1), HS rats (Table S7-2) and HS mice (Table S7-3), summary of average additive and dominance variance components of each tissue across three populations (Table S7-4).

Additional file 9: Table S8.

Counts and proportions of significant eQTLs classified by dominance effect type within each tissue and each population under different significant thresholds (Table S8-1); Proportions of novel eQTLs detected by AD model compared to A model in cis/trans groups (Table S8-2).

Additional file 10: Table S9.

Classification of eQTLs detected by AD Model (Table S9-1, F2 pigs liver tissue; Table S9-2, F2 pigs muscle tissue; Table S9-3, HS rats amygdala genes expression; Table S9-4, HS rats heart genes expression; Table S9-5, amygdala transcripts expression; Table S9-6, heart transcripts expression; Table S9-7, HS mice hippocampus gene expression; Table S9-8, HS mice liver gene expression; Table S9-9, HS mice lung gene expression). Rows with -log₁₀(P)values of ADvsA Model > 6 are in red, remaining rows with -log₁₀(P)values of AD Model> 10 are in blue.

Additional file 11: Table S10.

Counts of significant cis-eQTLs and trans-eQTLs under four different effect types within each tissue of three populations under different significant thresholds, together with the correlation test between different genetic modes and different regulatory types (Table S10-1, A/PD/CD/OD vs cis/trans; Table S10-2, A & PD/CD & OD vs cis/trans, using Add-eQTL represents A- & PD-eQTL and Dom-eQTL represents CD- & OD-eQTL).

Additional file 12: Table S11.

Summary of pleiotropic eQTLs (Table S11-1, F2 pigs liver genes expression; Table S11-2, F2 pigs muscle genes expression; Table S11-3, HS rats amygdala genes expression; Table S11-4, HS rats heart genes expression; Table S11-5, amygdala transcripts expression; Table S11-6, heart transcripts expression; Table S11-7, HS mice hippocampus gene expression; Table S11-8, HS mice liver gene expression; Table S11-9, HS mice lung gene expression). Lines in bold indicate cis-eQTLs.

Additional file 13: Table S12.

Summary of eQTLs shared across tissues within each population (Table S12-1, liver-muscle in F2 pigs; Table S12-2, amygdala-heart gene expression levels in HS rats; Table S12-3, amygdala-heart transcript expression levels in HS rats; Table S12-4, hippocampus-liver in HS mice; Table S12-5, hippocampus-lung in HS mice; Table S12-6, liver-lung in HS mice). All rows are ordered by (1) whether from the same gene (2) the -log₁₀(P)values of ADvsA Model. Table S12-7, counts of shared eQTLs between different tissues within each specie.

Additional file 14: Table S13.

Counts of significant eQTLs classified by transcript compositions (one transcript or multi transcripts) and by dominance type, in HS rats heart and amygdala, together with the correlation test between dominance type (A/PD/CD/OD) and transcript compositions.

Additional file 15.

Review history.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Cui, L., Yang, B., Xiao, S. et al. Dominance is common in mammals and is associated with trans-acting gene expression and alternative splicing. Genome Biol 24, 215 (2023). https://doi.org/10.1186/s13059-023-03060-2

Download citation

Received: 31 March 2023
Accepted: 18 September 2023
Published: 29 September 2023
DOI: https://doi.org/10.1186/s13059-023-03060-2

Dominance is common in mammals and is associated with trans-acting gene expression and alternative splicing

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Background

Results

Population characteristics

Dominance accounts for about a quarter of genetic variance of organismal traits

Mapping dominance QTLs improves GWAS sensitivity

Gene expression is strongly influenced by dominance effects

Trans-acting enrichment among dominance eQTLs

eQTL hotspots are enriched for dominance effects

Dominance enrichment among genes with multiple isoforms

Discussion

Conclusions

Methods

Data processing of genotypes, physiological and expression traits in F2 pigs

Data processing of genotypes, organismal, and expressional traits in HS rats

Data processing of genotypes, organismal, and expressional traits in HS mice

Estimation of additive and dominance variance components

Detection and classification of QTLs and eQTLs

Availability of data and materials

References

Acknowledgements

Review history

Peer review information

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participte

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation