Exploiting horizontal pleiotropy to search for causal pathways within a Mendelian randomization framework

Cho, Yoonsu; Haycock, Philip C.; Sanderson, Eleanor; Gaunt, Tom R.; Zheng, Jie; Morris, Andrew P.; Davey Smith, George; Hemani, Gibran

doi:10.1038/s41467-020-14452-4

Exploiting horizontal pleiotropy to search for causal pathways within a Mendelian randomization framework

Article
Open access
Published: 21 February 2020

Volume 11, article number 1010, (2020)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

Exploiting horizontal pleiotropy to search for causal pathways within a Mendelian randomization framework

Download PDF

10k Accesses
49 Citations
6 Altmetric
Explore all metrics

Abstract

In Mendelian randomization (MR) analysis, variants that exert horizontal pleiotropy are typically treated as a nuisance. However, they could be valuable in identifying alternative pathways to the traits under investigation. Here, we develop MR-TRYX, a framework that exploits horizontal pleiotropy to discover putative risk factors for disease. We begin by detecting outliers in a single exposure–outcome MR analysis, hypothesising they are due to horizontal pleiotropy. We search across hundreds of complete GWAS summary datasets to systematically identify other (candidate) traits that associate with the outliers. We develop a multi-trait pleiotropy model of the heterogeneity in the exposure–outcome analysis due to pathways through candidate traits. Through detailed investigation of several causal relationships, many pleiotropic pathways are uncovered with already established causal effects, validating the approach, but also alternative putative causal pathways. Adjustment for pleiotropic pathways reduces the heterogeneity across the analyses.

Non-linear Mendelian randomization: detection of biases using negative controls with a focus on BMI, Vitamin D and LDL cholesterol

Article Open access 25 May 2024

Unraveling the propensity of various genetic disorders and syndromes in the Koraga, an aboriginal tribe from southern India

Article 22 June 2024

The impact of consanguinity on human health and disease with an emphasis on rare diseases

Article Open access 07 December 2022

Introduction

Mendelian randomisation (MR) is now widely used to infer the causal influence of one trait (the exposure) on another (the outcome)^1,2. It generally uses genetic instruments for an exposure, obtained from genome-wide association studies (GWAS). If the instruments are valid, in that they are unconfounded and influence the outcome only through the exposure (vertical pleiotropy), then they will each provide an independent, unbiased estimate of the causal effect of the exposure on the outcome³. Meta-analysing these estimates can provide a more precise estimate of the effect of the exposure on the outcome^4,5. If, however, some of the instruments are invalid, particularly because they additionally influence the outcome through pathways that bypass the exposure (horizontal pleiotropy)³, then the effect estimate is liable to be biased. To date, MR method development has viewed horizontal pleiotropy as a nuisance that needs to be factored out of the analysis^6,7,8,9. Departing from this viewpoint, here we exploit horizontal pleiotropy as an opportunity to identify alternative traits that putatively influence the outcome. We also explore how this knowledge can improve the original exposure–outcome estimates.

A crucial feature of MR is that it can be performed using only GWAS summary data, where the effect estimate can be obtained solely from the association results of the instrumental single-nucleotide polymorphisms (SNPs) on the exposure and on the outcome⁵. This means that causal inference between two traits can be made even if they have never been measured together in the same sample of individuals. Complete GWAS summary results have now been collected from thousands of complex trait and common diseases¹⁰, meaning that one can search the database for candidate traits that might be influenced by SNPs exhibiting possible pleiotropic effects (outliers). In turn, the causal influence of each of those candidate traits on the outcome can be estimated using MR by identifying their instruments (which are independent of the original outlier). Should any of these candidate traits putatively influence the outcome then this goes some way towards explaining the horizontal pleiotropic effect that was exhibited by the outlier SNP in the initial exposure–outcome analyses.

Several methods exist for identifying outliers in MR, each likely to be sensitive to different patterns of horizontal pleiotropy. Cook’s distance can be used to measure the influence of a particular SNP on the combined estimate from all SNPs¹¹, identifying SNPs with large influences as outliers. Steiger filtering removes those SNPs that do not explain substantially more of the variance in the exposure trait than in the outcome, attempting to guard against using SNPs as instruments that are likely to be associated with the outcome through a pathway other than the exposure¹². Finally, meta-analysis tools can be used to evaluate if a particular SNP contributes disproportionately to the heterogeneity between the estimates obtained from the set of instruments, and this has been adapted recently to detect outliers in MR analysis^13,14,15. A potential limitation of heterogeneity-based outlier removal is that this practice could constitute a form of cherry picking^9,16. While outlier removal can certainly improve power by reducing noise in estimation, it could also potentially induce higher type 1 error rates, which we go on to explore through simulations.

Recent large-scale MR scans have indicated that horizontal pleiotropy is widespread based on systematic analysis of heterogeneity^14,17. This suggests that many SNPs used as instruments are likely to associate with other traits, which in turn might associate with the original outcome of interest—hence giving rise to heterogeneity. As such we have an opportunity to identify previously unreported pathways by making use of outlying instruments in an MR analysis. Equipped with automated MR analysis software¹⁰, outlier detection methods and a database of complete GWAS summary datasets, we develop MR-TRYX (from the phrase ‘TReasure Your eXceptions’¹⁸, popularised by William Bateson, an early pioneer in genetics). This framework is designed to identify putative causal factors when performing a simple exposure–outcome analysis.

In this paper, we describe how MR-TRYX can be implemented in MR analyses and how to interpret its results. A wide range of simulations is performed to show how knowledge of horizontal pleiotropic pathways can be used to improve the power and reliability of the original exposure–outcome association analysis. Our simulations also show that outlier removal methods can induce bias or increase type 1 error rates, but adjustment for detected pleiotropic pathway can improve estimates by reducing heterogeneity without sacrificing study power. We apply MR-TRYX to four exemplar analyses to demonstrate its potential utility, showing that horizontal pleiotropic pathways can be used to discover putative causal factors for an outcome of interest.

Results

Overview of MR-TRYX

Figure 1 shows an overview of the approach. MR-TRYX is applied to an exposure–outcome analysis in a two-sample MR framework and it has two objectives. The first is to use outliers in the original exposure–outcome analysis to identify putative factors that influence the outcome independently of the exposure. The second is to re-estimate the original exposure–outcome association by adjusting outlier SNPs for the horizontal pleiotropic pathways that might arise through the putative associations. This outlier-adjustment method should be treated as a new approach to be used in conjunction with other methods that already exist in the MR sensitivity analysis toolkit. We provide extensive discussion on the context, advantages and potential pitfalls that come with trying to use a data-driven approach to adjust for horizontal pleiotropy at the end of the paper.

**Fig. 1: Conceptual framework of the study.**

Adjustment of pleiotropic pathways improves MR performance

We performed a wide range of simulations (Fig. 2, Supplementary Data 2) to evaluate how a variety of methods designed to deal with pleiotropy fare under a set of different scenarios that violate the exclusion restriction principle. Perhaps the most striking result from these simulations is that no method is always reliable, and several methods have similar overall reliability while performing very differently from each other between specific scenarios. Across 47 simulation scenarios, adjusting for detected outliers using the MR-TRYX framework had the highest average rank, and simply performing inverse-variance weighted (IVW) random effects was most often the best performing method, whereas removing detected outliers had the lowest average rank. We note that generally we do not know which of the scenarios are of relevance for any particular empirical analysis and so the metric used to evaluate performance here reflects the methods that are most generally performant. We found that as the proportion of instruments exhibiting pleiotropic effects increased, all methods typically worsened in their performance though there were notable examples in which increasingly widespread pleiotropy does not have an adverse effect. For example, widespread balanced horizontal pleiotropy or mediated pleiotropy does not have a drastic adverse influence on IVW, and MVMR and outlier adjustment is relatively impervious to confounding pleiotropy.

**Fig. 2: Simulations comparing methods across different scenarios.**

It is an obvious conceptual disadvantage in these simulations for IVW and outlier removal, which use only the exposure and outcome data, when compared against MVMR and MR-TRYX which draw on information from other sources. However, we note that the MR-TRYX adjustment approach depends on detecting candidate traits that explain the pleiotropic effect and if the relevant candidate traits are not available, there is no adjustment and the method becomes identical to random effects IVW which generally performs better than outlier removal. We also note that if we use association with candidate traits to determine whether or not to remove an outlier, then improvements can be made over simple outlier removal. What we observe here is intuitive because the potential drawback of outlier removal is that the outliers could be the only valid instruments, or false discovery rates increase due to overly precise confidence intervals. Thus, adding an extra barrier to the removal of outliers can mitigate these problems.

Multivariable MR targets a different estimand than univariable MR—it is estimating the direct effect rather than the total effect of the exposure on the outcome. This strategy performs generally well across the range of simulations except in the case when the candidate trait is a mediator of the x–y association in which case there is a strongly attenuated direct effect. The problem here is that it is hard for MVMR to distinguish between a model where the exposure’s influence on the outcome is mediated by a candidate trait (the exposure is causal), vs. where the exposure’s apparent effect on the outcome is simply due to pleiotropy through the candidate trait (the exposure is not causal)¹⁹. Here, MVMR performs worse than other methods when the candidate trait is a mediator, as MVMR estimates the direct effect of x on y adjusting for the entirety of x's signal on y. Adjusting for outliers escapes this problem to some extent because it only adjusts some proportion of the instruments for x that are most likely to be pleiotropic, allowing some signal of x on y to persist due to the unadjusted variants.

Empirical MR-TRYX analyses

To examine the performance of MR-TRYX analysis, we tested four independent exposure–outcome hypotheses: (i) systolic blood pressure (SBP) and coronary heart disease (CHD); (ii) urate and CHD; (iii) sleep duration and schizophrenia; and (iv) education level (years of schooling) and body mass index (BMI). For each analysis we: (a) obtain MR estimates of the exposure–outcome causal relationship and detect outlier instruments; (b) identify putative causal influences (candidate traits) on the outcome trait based on their associations with outlier variants (Table 1, Supplementary Data 1); (c) adjust the original SNP–outcome estimates for the putative influences operating through the candidate traits (Table 2); and (d) compare the changes in heterogeneity in the MR estimates of the adjusted SNP–outcome effects to standard outlier removal methods.

Table 1 Candidate traits associated with both exposure and outcome.

Full size table

Table 2 Results of empirical analyses with different IV estimators derived from various MR methods.

Full size table

Example 1: Systolic blood pressure and coronary heart disease: Blood pressure is a well-established risk factor for CHD. Random effects IVW estimates indicated that higher SBP is causally associated with higher risk of CHD (odds ratio [OR] per 1 SD: 1.76; 95% CI: 1.47, 2.10). While there was substantial heterogeneity in this estimate (Q = 682.7 on 157 SNPs, p = 5.74 × 10⁻⁶⁷), the estimates from MR-Egger, weighted median and weighted mode methods were consistent (Table 2). Seven of the 157 SNPs were detected as strong outliers based on Q statistics. We identified 69 candidate traits that were associated with these outliers (p < 5 × 10⁻⁸). We manually removed redundant traits and traits that are similar to the exposure and the outcome (e.g. hypertension). Among the remaining candidate traits, 15 were putatively causal for CHD (Fig. 3a). After we applied LASSO regression, six traits remained (Table 1): anthropometric measures (e.g. height), lipid levels (e.g. cholesterol level) and self-reported ibuprofen use were among the candidate traits that associated with CHD, which were all uncovered due to two outliers (rs3184504 near SH2B3 and rs9349279 near PHACTR).

**Fig. 3: Causal associations between candidate exposures and hypothesised outcome.**

We next adjusted the exposure–outcome association for the detected pleiotropic pathways and obtained an adjusted IVW estimate. The total heterogeneity, based on adjusting only these two of 157 SNP effects, was reduced by 17% (Q = 567.6). The effect estimate remained consistent with the original estimate, as did the IVW estimates when removing all outliers, or just outliers known to associate with candidate traits that associated with the outcome (Fig. 4a). However, the width of the confidence interval was substantially larger (including the null) after removing outliers known to associate with candidate traits (1 OR per SD: 1.80; 95% CI: 0.56, 5.79).

**Fig. 4: Exposure–outcome association adjusting the SNP effects on the candidate traits.**

Example 2: Urate and coronary heart disease: Here we show an example with mixed findings from previous studies. The influence of circulating urate levels on risk of coronary heart disease has been under debate. Several MR studies have investigated the inflated effect of urate on CHD, which appeared to be influenced by pleiotropy^20,21 . We re-estimated the associations here using a range of MR methods. As has been previously reported the estimate from IVW suggested a weak association between urate and the risk of CHD using all variants (OR per 1 SD: 1.08; 95% CI: 1.00, 1.17), while there was a large intercept in the MR-Egger analysis (intercept = 1.02; 95% CI: 1.00, 1.03) with a much-attenuated causal effect estimate (Table 2). The median and mode-based estimates were also consistent with the MR-Egger estimate, indicating weak support for urate having a causal influence on CHD. Here, three variants were detected as outliers, which associated with 61 candidate traits (p < 5 × 10⁻⁸). Among those outliers, rs653178 and rs642803 were associated with 14 traits that had conditionally independent influences on the outcome (Fig. 3b), including anthropometric measures (e.g. hip circumference), cholesterol levels, diagnosis of thyroid disease and smoking status.

Removing the outliers in the IVW analysis led to a more precise (though slightly attenuated) estimate of the influence of higher urate levels on CHD risk (OR per 1 SD: 1.05; 95% CI: 1.01, 1.10 and OR per 1 SD: 1.06; 95% CIs: 1.06, 1.12, respectively, Table 2). The adjustment model indicated an attenuated IVW estimate in comparison to the ‘raw’ approach, with confidence intervals spanning the null (OR per 1 SD: 1.07; 95% CI: 0.99, 1.16) while the degree of heterogeneity was reduced by half by accounting for the pleiotropic pathways through two outlier SNPs. The adjusted scatter plot showed that outliers moved towards the fitted line after controlling for the SNP effect on the candidate traits (Fig. 4b). The results in this analysis suggest that it is unlikely that urate has a strong causal influence on CHD. Here, outlier removal appears to strengthen evidence that may lead to a wrong conclusion.

Example 3: Sleep duration and schizophrenia: previous studies have shown that sleep disorder is associated with schizophrenia²². However, none of them confirmed the causality between sleep disorder and schizophrenia. We observed weak evidence for any association between sleep duration and schizophrenia (OR per 1 SD: 1.18; 95% CIs: 0.57, 2.45), but there was substantial heterogeneity when all SNPs were used (Q = 204.8; p = 6.9 × 10⁻²⁶). Six outlier instruments were detected, which associated with 46 candidate traits (p < 5 × 10⁻⁸). Among those outliers, the SNPs rs7764984 (near HIST1H2BJ) and rs13107325 (near SLC39A8) were associated with three traits that putatively influenced the outcome: self-reported coeliac disease, body composition (impedance of leg) and memory function (Fig. 4c).

We re-estimated the original association accounting for the detected outliers. The degree of heterogeneity was reduced by 74% (Q = 54.1) when removing all six outliers and by 46% (Q = 147.7) when adjusting for the two SNP effects that had putative pleiotropic pathways. Both methods of outlier removal and adjustment provide similar estimates in terms of direction, while the magnitude of estimates differed. After removing outliers, MR-Egger causal estimates were substantially larger (OR per 1 SD = 2.43; 95% CI: 0.49, 12.16 and OR per 1 SD = 2.36; 95% CI: 0.25, 21.96, respectively) than those from the method using all variants. IVW causal estimates from the adjustment method were virtually identical with the original estimates, with narrower CIs (OR per 1 SD = 1.18; 95% CI: 0.63, 2.20). While all methods indicate that sleep duration is unlikely to be a major causal risk factor for schizophrenia, pursuing outliers in the analysis provided putative indications that coeliac disease and memory function may be risk factors for schizophrenia (Fig. 4d).

Example 4: Years of schooling and body mass index: The association of education and health outcome is well established in social science²³. Higher socioeconomic position is generally thought to lead to a lower risk of obesity in high-income countries^24,25. We used 59 independent genetic instruments²⁶ to estimate the influence of years of schooling on BMI²⁷ (Table 2). All MR methods indicated that years of schooling has a causal beneficial effect on BMI (e.g. IVW beta: −0.27; 95% CI: −0.39, −0.16), except the estimate from MR Egger which had a very imprecise estimate (beta: 0.01; 95% CI: −0.67, 0.70), but the degree of heterogeneity was large (Q = 211.9 on 59 SNPs; p = 2.20 × 10⁻⁸). Three outliers (rs6882046 near LINC00461, rs4800490 near NPC1, rs8049439 near ATXN2L) were identified as contributors to heterogeneity, and they showed associations (p < 5 × 10⁻⁸) with 48 candidate traits. Among those candidate traits, two were associated with BMI (Fig. 3b): alcohol intake frequency (which associated with all three outliers) and usual walking pace.

We next re-estimated the influence of years of schooling on BMI by accounting for outliers. Adjusting the outliers for candidate trait pathways such as alcohol intake and usual walking pace reduced heterogeneity by 15% and had a small reduction in the confidence intervals while the point estimate remained consistent (Table 1). By contrast, there was a 48% reduction in heterogeneity when removing outliers. Point estimates remained largely consistent across all outlier removal methods. However, we note that Fig. 4b shows that one of the outliers (rs4800490, near gene NPC1) on the scatter plot moved away from the fitted line after adjusting for the pleiotropic pathway, indicating that if this outlier is due to a pleiotropic pathway we have estimated its indirect effect inaccurately or partially (e.g. where GWAS summary statistics are not available to identify other influential pleiotropic pathways).

Discussion

The problem of instrumental variables being invalid due to horizontal pleiotropy has received much attention in MR analysis. Detecting and excluding such invalid instruments, based on whether they appear to be outliers in the analysis, is now a common strategy that exists in various forms^7,8,14,15,28. We have shown here that outlier removal could, in some circumstances, compound rather than reduce bias, and misses an opportunity to better understand the traits under study. We developed the MR-TRYX framework, which utilises the MR-Base database¹⁰ of GWAS summary data to identify potential explanations for outlying SNP instruments, and to improve estimates by accounting for the pleiotropic pathways that give rise to them. We have also demonstrated the use and interpretation of MR-TRYX in four sets of empirical analyses.

To be effective, MR-TRYX depends upon the performance of three methodological components: (i) detecting instruments that exhibit horizontal pleiotropy; (ii) identifying the candidate traits on the alternative pathways from the variant to the outcome; and (iii) adequately estimating the effects of the candidate traits on the outcome. Each of these components present difficult problems, but they are all modular and build upon existing methods and resources, and the MR-TRYX framework will naturally improve as those methods and resources themselves improve. We will now discuss the consequences of underperformance of each of these components on the TRYX analysis.

First it is important to notice that a major motivation for development of MR is that observational associations are often deemed unreliable because it is impossible to prove that there is no residual or unmeasured confounding biasing the effect estimate. But somewhat ironically, we find ourselves in a situation now where horizontal pleiotropy poses a similar challenge, in that proving that it is either absent or perfectly balanced is impossible. While several ‘pleiotropy-robust’ methods attempt to model out pleiotropic effects by assuming a particular model of genetic architecture, another strategy is to adjust for horizontal pleiotropy, by including in the same model the genetic effects on one or more traits that are hypothesised to mediate the horizontal pleiotropic pathways (e.g. MVMR²⁹). The adjustment approach depends upon those pathways being identified, which leaves it in a similar predicament to observational associations in that we cannot easily prove that all biasing pathways have been included in the model. The MR-TRYX approach falls within this category also, but we note that as fewer and fewer of the biasing pathways are identified and available to the adjustment model, the adjusted estimate will tend towards the IVW random effects estimate, which our simulations indicate can have good performance compared to, e.g., outlier removal methods. So, while clearly not a panacea for causal inference analysis, it is a valuable method within the MR toolkit, and its efficacy has been demonstrated. There is also an important contrast between outlier adjustment and multivariable MR in that the formulation of the latter is to estimate the direct effect of each exposure conditional on the others, whereas the former is to obtain an unbiased estimate of the total effect. MVMR may fail to distinguish between a pleiotropic model where the exposure (X) does not influence the outcome (Y) but has instruments that associate with another trait (A) which does influence Y, vs. a causal model in which trait A mediates the causal effect of X on Y. In both situations X will be deemed to be non-causal, despite it being indirectly causal in the latter case. This issue is discussed in detail elsewhere¹⁹. Here, outlier adjustment improves on the matter because MVMR will nullify all instruments for the exposure after adjusting for the mediator, leading to the exposure being dropped. When only the outlier variants are adjusted, the risk of erroneously removing the entire exposure signal is replaced by the lesser risk of incorrectly nullifying the effects of the outliers only. This will introduce heterogeneity and slight bias but is unlikely to remove the exposure’s entire signal.

The classification of an outlier in MR analysis can be based on the statistical estimates of how a SNP being included as an instrument is due to being reverse causal (Steiger filtering)^12,17, the extent to which a single SNP disproportionately influences the overall result (e.g. Cook’s distance), or most commonly the extent to which an SNP contributes to heterogeneity (e.g. Cochran’s Q statistic, MR-PRESSO, and implicitly in median- and mode-based estimators)^7,8,14,15. The philosophy of the latter two approaches is that proving horizontal pleiotropy is impossible, but that it should lead to outliers⁹. While a useful approximation, these approaches have two main limitations. First, determining whether a SNP is an outlier depends on the use of arbitrary thresholds, and this entails a trade-off between specificity and sensitivity. Second, if most variants are pleiotropic, then it is possible that the outlier SNPs are the valid instruments. Such a scenario can arise for complex traits such as gene expression or protein levels that have a few large effects and many small effects. For example, for C-reactive protein (CRP) levels, the SNP in the CRP gene region is likely the only valid instrument in some analyses³⁰. In this context, bias due to horizontal pleiotropy cannot be avoided by selection of instruments since this approach may generate more bias³¹. This is supported by our simulation which demonstrates that in the presence of extensive pleiotropy removing outliers increased FDR and bias.

MR-TRYX should, in principle, avoid the problem of outlier removal because instead of removing outliers in their entirety, it attempts to eliminate the component of the SNP–outcome effect that is due to horizontal pleiotropy. Hence, we avoid implicitly cherry picking from among the SNPs to be used in the analysis, and if we have low sensitivity (i.e. a more relaxed threshold for outlier detection) it does not mean that there will be an unnecessary loss of power in the overall analysis. Previous work has adjusted for the effect of pleiotropic phenotypes, but they treated pleiotropic phenotypes as exogenous variables that are not associated with the causal pathways of interest³². In MR-TRYX, candidate traits are treated as endogenous variables to account for the effect of the traits on the original association. Moreover, our method is applicable in the two-sample context, whereas the previous method requires individual level data. The problem of outlier detection which remains in MR-TRYX could be sidestepped by applying the adjustment approach to all SNPs irrespective of their contributions to heterogeneity.

Upon identification of potentially pleiotropic SNPs, MR-TRYX can only account for these if the pathways through which pleiotropy is acting can be identified. Detecting the pathways depends on the density and coverage of the human phenome available for the analysis. We use the MR-Base database of GWAS summary results, which comprises several hundred independent traits (we selected 605 traits from UK Biobank and 342 other complex traits and diseases obtained from previous GWA studies). While being the largest available resource, it is certainly not covering the whole human phenome. Therefore, even if a pleiotropic variant is detected correctly, it may not be possible to adjust it away if the phenotype associated with the variant cannot be identified. In the empirical analyses, often fewer than half of the candidate traits were inferred to be associated with the outcome. Yet, as we illustrated, MR-TRYX allows for an informative analysis that could routinely be applied in MR analyses. Broadening phenotype coverage is an on-going pursuit that will continually improve MR-TRYX analysis³³. It is also important to note that in estimating the adjusted effect, the SNP–outcome standard error is liable to increase, which is one avenue through which heterogeneity is reduced as its outlying contribution will be down-weighted in the subsequent IVW analysis. We used radial MR plots to illustrate this explicitly in Fig. 4.

MR-TRYX is an automated framework, and this comes with several limitations in addition to those discussed already. First, our LASSO extension to multivariable MR is used to automate the selection of exposures that will be used for adjustment. A shrinkage step of LASSO may increase the SNP–exposure effect heterogeneity, which is necessary to assess the power of multivariable MR³⁴. Multivariable MR is adept at establishing conditionally independent exposures but the reason that some exposures have attenuated effects in comparison to their total effects could be because (a) their total effects were biased by pleiotropy or (b) they are mediated by the exposures that are included in the model. Interpretations of (a) and (b) are very different, because in the case of mediation the exposure is a causal factor for the outcome. Second, we were primarily using the multivariable approach for practical purposes to avoid having multiple highly related exposures taken forward to the adjustment step (e.g. multiple different measures of body composition such as body weight and BMI). This approach worked effectively, although a problem remains unsolved in automating the removal of traits that are similar to the outcome. For example, if a trait similar to the outcome CHD associates with an outlier and is included in the multivariable analysis of multiple exposures against CHD, then all the other putative exposures will be dropped from the model. In the analyses presented we manually removed traits that came up as candidate pleiotropic pathways but were, in fact, synonymous with or closely related to the outcome. Third, we note that heterogeneity does not necessarily arise only because of pleiotropy, for example the non-collapsibility of odds ratios will introduce heterogeneity automatically which cannot be adjusted away through the TRYX approach. Many other mechanisms exist that can lead to bias in MR, as has been described in detail elsewhere. Fourth, SNPs can appear to be outliers not through being pleiotropic, but through other mechanisms, such as population stratification (association of alleles with phenotypes being confounded by ancestral population), canalisation (developmental compensation to a genetic change)^2,35, or the influence on phenotype being changeable across the life course³⁶. Fifth, since MR-TRYX uses the resource from MR-Base, it is recommended that the user acknowledge the limitation and restriction of MR-Base¹⁰. For example, the population should be the same for the exposure (or the candidate traits) and the outcome traits to avoid mis-estimation of the magnitude of the association. Also, sample overlap should be recognised between the GWAS studies for the SNP–exposure and SNP–outcome association to prevent effect estimates being biased³⁷. Users should consider modifying their analyses when the limitations indicated above are avoidable. Sixth, in the case of a binary outcome, there may be parametric restrictions on the conditional causal odds ratio in our multivariable MR model where the exposure effect is linear in the exposure on the log odds ratio scale³⁸. However, the two-stage estimator with a logistic second-stage model still yields a valid test of the causal null hypothesis³⁸. Finally, it is necessary for the effects through the identified pleiotropic pathways to be accurately estimated. This is a recursive problem—MR-TRYX adjusts the SNP–outcome effects based on the pleiotropic effect through the outlier SNP, but it does this by introducing more SNPs into the analysis that instrument the candidate traits. These new SNPs may themselves exhibit pleiotropic effects that could lead to bias in the estimates of the candidate traits on the outcome, requiring a second round of TRYX-style candidate trait searches, and so on. In the example of education level and BMI, adjustment for the pleiotropic pathway failed to substantially reduce the degree of heterogeneity. Further developments could involve recursively analysing alternative pathways. For example, Steiger filtering could be applied at all stages of MR estimation to attempt to automatically remove reverse causal instruments or those that arise due to confounding pleiotropy¹⁷.

In this study, we demonstrated the use of MR-TRYX through four examples of identifying putative pathways. In the first empirical example (SBP on CHD), we illustrated the validity of MR-TRYX to detect the traits that possibly influence the disease outcome. Apart from SBP, MR-TRYX also detected well-established risk factors for CHD including adiposity, cholesterol levels, and standing height. An interesting finding from this example is that headache-related traits (e.g. experience of pain due to headache and self-reported status of ibuprofen intake) were identified as candidate traits, which may influence the original association. In support of the putative finding for self-reported ibuprofen use associating with CHD, we also found that pain experienced in the last month (headache) and self-reported migraine were associated with lower risk of CHD (OR per 1 SD: 0.33; 95% CI: 0.12, 0.89 and beta = 0.02; 95% CI: 0.0004, 0.65, respectively). A previous study reported shared genetic risk between headache (migraine) and CHD, suggesting a potential role of migraine in vascular mechanisms³⁹. An alternative mechanism that could give rise to this association is that the effect of pain on lower CHD risk is mediated through the use of medications such as aspirin that have known protective effects on CHD.

The example of urate and CHD demonstrated the benefit of the adjustment method showing that the noise due to pleiotropy was substantially reduced after correcting for the effect of candidate traits. The presence of hypothyroidism and self-reported levothyroxine sodium intake status were identified as putative risk factors for risk of CHD, which is consistent with previous clinical trials: thyroid dysfunction is associated with overall coronary risk⁴⁰, which can be reversed by levothyroxine therapy⁴¹. In the education–BMI example, we showed that increased alcohol intake and slower usual walking pace may influence obesity. These identified traits have been reported as possible risk factors for higher BMI and obesity^42,43. Additionally, the example of sleep duration and risk of schizophrenia suggested coeliac disease and body composition as putative risk factors for schizophrenia. A number of observational studies suggested that schizophrenia is linked with body composition⁴⁴ and coeliac disease⁴⁵. MR of binary exposures is often difficult to interpret because the instrument effects are on liability to disease, not the presence or absence of the disease. Hence, the association between coeliac disease and schizophrenia may be better interpreted as an indication of shared disease aetiology. Nevertheless, this is a valuable finding since the causal effect of those putative risk factors on risk of schizophrenia has not been investigated using an MR approach. Therefore, our example illustrates how outliers can be used to identify alternative pathways, opening the door for hypothesis-free MR approaches and a network-based approach to disease.

In conclusion, we have introduced a framework to deal with the bias from horizontal pleiotropy, and to identify putative risk factors for outcomes in a more directed manner than typical hypothesis-free analyses, by exploiting outliers. Heterogeneity is widespread across MR analyses and so we are tapping into a potential new reservoir of information for understanding the aetiology of disease. The strategy is a departure from previous ones dealing with pleiotropy—enlarging the problem by searching across all traits for a better understanding of a specific exposure–outcome hypothesis can be fruitful.

Methods

Outlier detection

Several outlier detection methods now exist that are based on the contribution of each SNP to overall heterogeneity in an IVW meta-analysis⁴⁶. In order to estimate heterogeneity accurately, it is important to appropriately weight the contribution of each SNP to the overall estimate. We used the approach implemented in the RadialMR R package (https://github.com/WSpiller/RadialMR) to detect outliers. Full details are provided elsewhere¹⁵, but briefly, we used the so-called ‘modified 2^nd order weighting’ approach to estimate total Cochran’s Q statistic as a measure of heterogeneity, as well as the individual contributions of each SNP, q_i¹⁵. This has been shown to be comparable to the simulation-based approach in MR-PRESSO, providing a well-calibrated test statistic for outlier status whilst being computationally more efficient^14,47. The probability of a SNP being an outlier is calculated based on q_i being chi-square distributed with one degree of freedom. For demonstration purposes we adopted a p value threshold that was Bonferroni corrected for the number of SNPs tested in analysis (p < 0.05/number of SNPs). We are not, however, suggesting that this arbitrary threshold will necessarily be optimal for identifying outliers, and users can apply other approaches or thresholds through the MR-TRYX software.

Candidate trait detection

Traits associated with the detected outliers could causally influence the outcome. MR-TRYX searches the MR-Base database to identify the traits that have associations with the detected outliers. By default, we limit the search to traits for which the GWAS results registered at MR-Base have more than 500,000 SNPs and sample sizes exceeding 5000. Traits that have an association with outlier SNPs at genome-wide p value threshold (p < 5 × 10⁻⁸; in keeping with traditional GWAS thresholds used for instrument selection) are regarded as potential risk factors for the outcome and defined as candidate traits. Each candidate trait is tested for its influence on the original exposure (X) and outcome (Y) traits (Fig. 1) using the IVW random effects model. We take forward putative associations based on false discovery rate (FDR) < 0.05, where the null hypothesis is true, but we note that the use of arbitrary thresholds is problematic^48,49, and we use them here to make high dimensional investigations more manageable.

Assessing effect of the candidate traits on the outcome

Once candidate traits are detected, we can identify instruments specifically for the candidate traits and model how the exposure and candidate traits together associate with the outcome. This involves the following process, which we go on to describe in full detail below:

1.
Identify instruments for the candidate traits.
2.
Estimate the influence of the candidate traits on y conditioning on x using multivariable MR.

Suppose we have g₀, g_x1,…,g_xE instruments for the exposure x where g₀ is an outlier in the x–y MR analysis due to an association with candidate trait P, and where E indicates the number of genetic variants for the exposure. Also, P has g₀, g_P1,…,g_PM genetic instruments, where M is the number of genetic variants for P. To obtain the estimate of (py) uncontaminated by shared genetic effects between P and x (Fig. 1a), we perform multivariable MR analysis³⁴. We generate a combined list of instruments for both x and P and clump them to obtain a set of independent SNPs. The original outlier is removed from amongst these SNPs. We then obtain the genetic effects of each of these SNPs on the exposure (gx), candidate trait (gp), and outcome (gy). Finally, we estimate the causal influence of P on y conditioning on x by regressing (gy) ~ (gx) + (gp) weighted by the inverse of the variance of the (gy) estimates. The whole process is automated within the TwoSampleMR R package which connects to the MR-Base database.

In the case of an outlier SNP associating with many candidate traits we first apply a modified form of multivariable MR, involving LASSO regression of (gy) ~ (gx) + (gp_i)+…+(gp_p) and use cross-validation to obtain the shrinkage parameter that minimises the mean squared error. We retain only the candidate traits that are putatively associated with the outcome and have non-zero effects after shrinkage. Then we apply remaining traits in a multivariable model with x against the outcome, as described above³⁴. We perform the LASSO step because many traits in the MR-Base database have considerable overlap and redundancy, and the statistical power of multivariable analysis depends on the heterogeneity between the genetic effects on the exposure variables³⁴. Using LASSO therefore automates the removal of redundant traits (Supplementary Fig. 1, Supplementary Tables 2 and 3). We then obtain estimates of (py) that are conditionally independent of x and jointly estimated using all remaining P traits by combining them in a multivariable analysis on the outcome y. A detailed discussion of dealing with multiple candidate traits per outlier SNP is presented in Supplementary Note 1.

Adjusting causal estimates for candidate-trait associations

An illustration of how outliers arise in MR analyses is shown in Fig. 1. If a SNP g has some influence on exposure x, and x has some influence on outcome y, the SNP effect on y is expected to be (gy) = (gx)(xy), where (gx) is the SNP effect on x and (xy) is the causal effect of x on y. Any substantive difference between (gy) and (gx)(xy) could be due to an additional influence on y arising from the SNP’s effect through an alternative pathway.

If a SNP influences a ‘candidate trait’, P, which in turn influences the outcome (or the exposure and the outcome), then the SNP’s influence on the exposure and the outcome will be a combination of its direct effects through x and indirect effects through P³⁴. If we have estimates of how the candidate trait influences the outcome, then we can adjust the original SNP–outcome estimate to the effect that it would have exhibited had it not been influencing the candidate trait. In other words, we can obtain an adjusted SNP–outcome effect conditional on the ‘candidate-trait–exposure’ and ‘candidate-trait–outcome’ effects. If the SNP influences P independent candidate traits (as selected from the LASSO step), then the expected effect of the SNP on y is

$$\left( {gy} \right) = \left( {gx} \right)\widehat {\left( {xy} \right)} + \mathop {\sum}\limits_{i = 1}^P {\left( {gp_i} \right)\widehat {\left( {p_iy} \right)}}.$$

(1)

Hence, the effect of the SNP on the outcome adjusted for alternative pathways p₁,…, p_p is

$$\left( {gy} \right) ^\ast = \widehat {\left( {gy} \right)} - \mathop {\sum}\limits_{i = 1}^P {\left( {gp_i} \right)\left( {p_iy} \right)}.$$

(2)

We use parametric bootstraps to estimate the standard error of the ${\left( {gy} \right)}^\ast$ estimate, where 1000 resamples of (gy), (gp), and (py) are obtained based on their respective standard errors and the standard deviation of the resultant $\widehat {\left( {gy} \right)}^\ast$ estimate represents its standard error. Finally, an adjusted effect estimate of (xy) due to SNP g is obtained through the Wald ratio.

$$\widehat {\left( {xy} \right)} = \frac{{\left( {gy} \right) ^\ast }}{{(gx)}}.$$

(3)

Occasionally it might be possible that a candidate trait P is a redundant trait for y, for example if the outcome is coronary heart disease, the outliers might detect traits such as ‘medication for heart disease’ as a potential candidate trait. It would make no sense to attempt to adjust the SNP–outcome association for a trait that is essentially the same as the outcome, it would just nullify the association. We have not yet developed an automated method to remove such traits, but we recommend manually checking any traits that are selected for automated outlier adjustment.

Simulations

IVW effect estimates are liable to be biased when at least some of the instrumenting SNPs exhibit horizontal pleiotropy, and those SNPs tend to contribute disproportionately towards the heterogeneity in the effect estimate. We conducted simulations to evaluate how different methods perform at estimating the causal effect of x on y under different circumstances. The simulations are principally designed to evaluate the potential value of adjusting outliers for putative explanatory pathways. Other aspects of the MR-TRYX framework, for example, dealing with redundant traits in the GWAS database are dealt with separately (Supplementary Note 1). In all circumstances there are 30 independent genetic effects on x (Gx), and x either has no direct influence on y, or has a direct effect of 0.1 on y. For all simulations, we used 10,000 individuals, and repeated each circumstance 1000 times. We summarised each scenario in two ways: (a) We estimated the proportion of simulations that gave a biased estimate of the causal effect of x on y (b_xy). For each simulation we calculated the probability of the effect estimate being substantially different from the true simulated effect based on whether the true effect fell outside the 95% confidence interval of the estimate. Then for the set of 1000 simulations, we calculated the proportion of estimates that were ‘unbiased’. (b) We summarised the power and FDR by estimating the area under the receiver operator curve, characterising the sensitivity and specificity of each method at determining whether the true causal effect estimate was null or non-null. Each simulation is conducted by first simulating data to satisfy the parameters described below. We then search for instruments for x across all simulated genetic variants and retain those that are significant after Bonferroni correction, and applying the summary data-based methods based on the genetic associations for the instruments on x and y. All genetic variants are simulated to be Hardy Weinberg equilibrium with an allele frequency of 0.5.

We investigated three scenarios that could give rise to invalid instruments (Fig. 2).

In the confounding pleiotropy scenario, there are instruments detected for x that primarily influence a confounder variable (e.g. u₁ that influences both x and y). Therefore, the term ‘confounding pleiotropy’ indicates that the instrument’s horizontal pleiotropic effect arises because it primarily influences a confounder of x and y. See Fig. 2 (column 1) for a DAG describing the model. The confounder u₁ has a set of independent genetic influences, G_u1, which may be detected as instruments for x.

$$u_1 = \mathop {\sum }\limits_j^{m_{u1}} G_{u1,j}b_{gu1,j} + e_{u1},$$

(4)

$$x = u_1b_{u1x} + \mathop {\sum }\limits_j^{m_x} G_{x,j}b_{gx,j} + e_x,$$

(5)

$$y = u_1b_{u1y} + xb_{xy} + e_y.$$

(6)

Parameters: b_gu1,j values are sampled for each SNP m_u1 from a normal distribution such that they explain 60% of the variance in u₁. The value of b_u1x is chosen such that u₁ explains 60% of the variance in x and 40% of the variance in y. The values of b_gx,_j are sampled from a normal distribution for each of m_x SNPs such that they explain 20% of the variance in x. The causal effect b_xy is set to either 0, or some value such that x explains 10% of the variance in y. Values for e_u1,e_x, and e_y are sampled from normal distributions with mean 0 and variances that are scaled to satisfy the variances of all other parameters described for the model. Different sets of simulations are run with different proportions of invalid instruments by simulating different numbers of genetic variants directly influencing u₁ or x:

$$m_{u1} \in \{ 5,10,15,20,30\} ;$$

$$m_x \in \{ 5,10,15,20,25,30\}.$$

In the case of horizontal pleiotropy, at least some of the instruments for x have an independent effect on y that is mediated through some other pathway that does not include x. In these simulations, the pleiotropic influence of each instrument, G_x,i, is mediated by a different trait, u_2,i

$$x = \mathop {\sum }\limits_j^{m_x} G_{x,j}b_{gx,j} + e_x,$$

(7)

$$u_{2,i} = \mathop {\sum }\limits_j^{m_{u2,i}} G_{u2,i,j}b_{gu2,i,j} + G_{x,i}b_{plei,i} + e_{u2,i},$$

(8)

$$y = \mathop {\sum }\limits_i^{m_{plei}} u_{2,i}b_{u2,i,y} + xb_{xy} + e_y.$$

(9)

Parameters: Some number $m_{\mathrm{{plei}}} \in \{ 5,10,15,20,25,30\}$ of 30 G_x instruments for x are selected to have pleiotropic effects, such that they influence y each mediated by an independent trait u_2,i which itself has its own set of 30 direct genetic influences G_u2,i. The b_gu2,i,j values for the genetic effects on u_2,i are sampled from a normal distribution such that they explain 20% of the variance in u_2,i. Each pleiotropic G_x,i instrument has an influence on u_2,i that explains 20% of its variance (b_plei,i). The influence of each u_2,i on y is such that b_u2,i,y is normally distributed with mean 0 and variance 0.4. The outcome y is also influenced by x where the causal effect b_xy is set to either 0, or some value such that x explains 10% of the variance in y. Values for e_u2,i, e_x, and e_y are sampled from normal distributions with mean 0 and variances that are scaled to satisfy the variances of all other parameters described for the model.

Mediation pleiotropy is treated as in ‘confounding pleiotropy’, except the pleiotropic relationships arise due to a trait that is mediating the path from x to y, rather than confounding it (Fig. 2, column 3). Specifically, the influence of x on y is at least partially mediated by another trait u₃, and at least some of the instruments for x have an independent pleiotropic influence on u₃.

$$x = \mathop {\sum }\limits_j^{m_x} G_{x,j}b_{gx,j} + e_x,$$

(10)

$$u_3 = \mathop {\sum }\limits_j^{m_{u3}} G_{u3}b_{gu3,j} + \mathop {\sum }\limits_i^{m_{\mathrm{{plei}}}} G_{x,i}b_{\mathrm{{plei}},i} + xb_{x,u3} + e_{u3},$$

(11)

$$y = u_3b_{u3,y} + xb_{xy} + e_y.$$

(12)

Parameters: Some number $m_{\mathrm{{plei}}} \in \{ 5,10,15,20,25,30\}$ of 30 Gx instruments for x are selected to have pleiotropic effects, such that they influence u₃ which itself mediates an effect from x to y, and has its own set of 30 direct genetic influences G_u3. The b_gu3,j values for the genetic effects on u₃ are sampled from a normal distribution such that they explain 20% of the variance in u₃. Each pleiotropic Gx,i instrument has an influence on u₃ such that b_plei,i are sampled from a normal distribution explaining 20% of the variance in u3 in total. The indirect influence of x on y is generated such that x explains 30% of the variance in u₃, and u₃ explains 40% of the variance of y. The outcome y may also be influenced directly by x where the causal effect b_xy is set to either 0, or some value such that x explains 10% of the variance in y. Values for e_u3, e_x, and e_y are sampled from normal distributions with mean 0 and variances that are scaled to satisfy the variances of all other parameters described for the model.

In these simulations we ask: if we can identify the pathway through which an outlier SNP has a horizontal pleiotropic effect, can adjustment for that pathway improve the original exposure–outcome analysis? We assess the performance of the following methods for each simulation.

(1)
Raw, where all detected instruments are used in a standard IVW random effects analysis.
(2)
Adjusted SNP–outcome effects
1. (a)
  Where outlier SNPs are tested for association with all candidate traits and adjusted for the effect of the candidate trait on the outcome using MR-TRYX.
2. (b)
  Where attempts are made to adjust all detected instruments regardless of outlier status.
(3)
Removed instruments
1. (a)
  Where all detected outliers are removed.
2. (b)
  Where only outliers that are found to influence a candidate trait are removed.
(4)
Multivariable MR (MVMR)
1. (a)
  Where the traits selected to be included in the model are the candidate traits associated with outliers.
2. (b)
  Where the traits selected to be included in the model are the candidate traits associated with any of the detected instruments regardless of outlier status.

Empirical analyses

As applied examples, we chose two robust findings and two controversial findings that are potentially biased due to pleiotropy: (i) systolic blood pressure (SBP) and coronary heart disease (CHD); (ii) urate and CHD; (iii) sleep duration and schizophrenia; and (iv) education level (years of schooling) and body mass index (BMI). Those examples were chosen based on previous findings^20,22,50,51 to illustrate how pleiotropic variants can be used to identify other pathways and adjusted to estimate the causal effect of the original exposure on the outcome independent of pleiotropic bias.

Summary statistics (β-coefficients and SEs) for the associations of the SNPs with each exposure were obtained from the publicly available GWAS database (Supplementary Table 1). Selected SNPs were harmonised for the analysis, excluding palindromic SNPs and pruning for linkage disequilibrium (r² < 0.001). We primarily used the two-sample MR IVW method to obtain causal estimates between exposures and outcomes allowing each SNP to have a different mean effect (random effects model). A number of sensitivity analyses were applied to evaluate the consistency of causal effect estimates under different models of pleiotropy among the SNPs, including the MR-Egger⁶, weighted median, and weighted mode approaches^7,8.

Outliers were detected among the instruments for each exposure (p < 0.05/the number of SNPs). We searched the MR-Base database to identify the candidate traits that are associated with outliers (p < 5 × 10⁻⁸). We then performed multivariable MR analysis to test which candidate trait can explain the heterogeneity in the original exposure–outcome association. To perform multivariable MR, more SNPs that instrument the candidate traits were introduced into the analysis.

Subsequently we re-estimated the association of the original exposure and the original outcome using different sets of instruments: (a) all SNPs (corresponding to the raw method in our simulation), (b) outliers adjusted, (c) all outlier removed, and (c) candidate outliers removed.

All analyses were conducted with the TwoSampleMR package (https://github.com/MRCIEU/TwoSampleMR) and the MR-TRYX package (https://github.com/explodecomputer/tryx) in R statistical software (ver 3.4.1). Detailed information are provided in Supplementary Note 1 and the scripts used for the simulations and empirical analyses can be found here https://github.com/explodecomputer/tryx-analysis.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data that support the findings of this study are available from IEU GWAS database (https://gwas.mrcieu.ac.uk/).

Code availability

A copy of the code used in this analysis is available at https://github.com/explodecomputer/tryx and https://github.com/explodecomputer/tryx-analysis.

References

Holmes, M. V., Ala-Korpela, M. & Davey Smith, G. Mendelian randomization in cardiometabolic disease: challenges in evaluating causality. Nat. Rev. Cardiol. 14, 577–590 (2017).
Article CAS PubMed PubMed Central Google Scholar
Davey Smith, G. & Ebrahim, S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 32, 1–22 (2003).
Article Google Scholar
Davey Smith, G. & Hemani, G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum. Mol. Genet. 23, R89–R98 (2014).
Article CAS PubMed PubMed Central Google Scholar
Johnson, T. & Uk, S. Efficient Calculation for Multi-SNP Genetic Risk Scores. Technical Report (The Comprehensive R Archive Network, 2013).
Pierce, B. L. & Burgess, S. Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am. J. Epidemiol. 178, 1177–1184 (2013).
Article PubMed PubMed Central Google Scholar
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 44, 512–525 (2015).
Article PubMed PubMed Central Google Scholar
Bowden, J., Davey Smith, G., Haycock, P. C. & Burgess, S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 40, 304–314 (2016).
Article PubMed PubMed Central Google Scholar
Hartwig, F. P., Davey Smith, G. & Bowden, J. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int. J. Epidemiol. 46, 1985–1998 (2017).
Article PubMed PubMed Central Google Scholar
Hemani, G., Bowden, J. & Davey Smith, G. Evaluating the potential role of pleiotropy in Mendelian randomization studies. Hum. Mol. Genet 27, R195–R208 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018).
Article PubMed PubMed Central Google Scholar
Corbin, L. J. et al. BMI as a modifiable risk factor for type 2 diabetes: refining and understanding causal estimates using Mendelian randomization. Diabetes 65, 3002–3007 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hemani, G., Tilling, K. & Davey Smith, G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet. 13, e1007081 (2017).
Article PubMed PubMed Central CAS Google Scholar
Bowden, J. et al. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat. Med. 36, 1783–1802 (2017).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Verbanck, M., Chen, C. Y., Neale, B. & Do, R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat. Genet. 50, 693–698 (2018).
Article CAS PubMed PubMed Central Google Scholar
Bowden, J. et al. Improving the visualization, interpretation and analysis of two-sample summary data Mendelian randomization via the Radial plot and Radial regression. Int. J. Epidemiol. 47, 1264–1278 (2018).
Article PubMed PubMed Central Google Scholar
Bakker, M. & Wicherts, J. M. Outlier removal, sum scores, and the inflation of the Type I error rate in independent samples t tests: the power of alternatives and recommendations. Psychol. Methods 19, 409–427 (2014).
Article PubMed Google Scholar
Hemani, G. et al. Automating Mendelian randomization through machine learning to construct a putative causal map of the human phenome. bioRxiv, 173682. Preprint at https://www.biorxiv.org/content/10.1101/173682v2 (2017).
Bateson, W. The Methods and Scope of Genetics (Cambridge University Press, 2014).
Anderson, E. L. et al. Education, intelligence and Alzheimer’s disease: evidence from a multivariable two-sample Mendelian randomization study. Int. J. Epidemiol. dyz280, https://doi.org/10.1093/ije/dyz280 (2018).
White, J. et al. Plasma urate concentration and risk of coronary heart disease: a Mendelian randomisation analysis. Lancet Diabetes Endocrinol. 4, 327–336 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kleber, M. E. et al. Uric acid and cardiovascular events: a Mendelian randomization study. J. Am. Soc. Nephrol. 26, 2831–2838 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kaskie, R. E., Graziano, B. & Ferrarelli, F. Schizophrenia and sleep disorders: links, risks, and management challenges. Nat. Sci. Sleep. 9, 227–239 (2017).
Article PubMed PubMed Central Google Scholar
Strom, D., Dudovitz, R., Guerrero, L. R. & Wong, M. D. The link between education and health: it is not what you know, but whom you know. J. Gen. Intern. Med. 30, S277–S278 (2015).
Article Google Scholar
Bockerman, P. et al. Does higher education protect against obesity? Evidence using Mendelian randomization. Prev. Med. 101, 195–198 (2017).
Article PubMed Google Scholar
Cohen, A. K., Rai, M., Rehkopf, D. H. & Abrams, B. Educational attainment and obesity: a systematic review. Obes. Rev. 14, 989–1005 (2013).
Article CAS PubMed PubMed Central Google Scholar
Okbay, A. et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533, 539–542 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zhu, Z. H. et al. Causal associations between risk factors and common diseases inferred from GWAS summary data. Nat. Commun. 9, 224 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Burgess, S. & Thompson, S. G. Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am. J. Epidemiol. 181, 251–260 (2015).
Article PubMed PubMed Central Google Scholar
Hartwig, F. P., Borges, M. C., Horta, B. L., Bowden, J. & Davey Smith, G. Inflammatory biomarkers and risk of schizophrenia: A 2-Sample Mendelian Randomization Study. JAMA Psychiatry 74, 1226–1233 (2017).
Article PubMed PubMed Central Google Scholar
Burgess, S. & Thompson, S. G. CRP CHD Genetics Collaboration Genetics Collaboration. Avoiding bias from weak instruments in Mendelian randomization studies. Int. J. Epidemiol. 40, 755–764 (2011).
Jiang, L. et al. Constrained instruments and their application to Mendelian randomization with pleiotropy. Genet. Epidemiol. 43, 373–401 (2019).
Article PubMed PubMed Central Google Scholar
Visscher, P. M. et al. 10 years of GWAS discovery: biology, function, and translation. Am. J. Hum. Genet. 101, 5–22 (2017).
Article CAS PubMed PubMed Central Google Scholar
Sanderson, E., Davey Smith G., Windmeijer, F. & Bowden, J. An examination of multivariable Mendelian randomization in the single-sample and two-sample summary data settings. Int. J. Epidemiol. 48, 713–727 (2018).
Davey Smith, G. & Ebrahim, S. Mendelian randomization: prospects, potentials, and limitations. Int. J. Epidemiol. 33, 30–42 (2004).
Article Google Scholar
Tan, Q. et al. Analyzing age-specific genetic effects on human extreme age survival in cohort-based longitudinal studies. Eur. J. Hum. Genet. 21, 451–454 (2013).
Article CAS PubMed Google Scholar
Burgess, S., Davies, N. M. & Thompson, S. G. Bias due to participant overlap in two-sample Mendelian randomization. Genet. Epidemiol. 40, 597–608 (2016).
Article PubMed PubMed Central Google Scholar
Vansteelandt, S., Bowden, J., Babanezhad, M. & Goetghebeur, E. On instrumental variables estimation of causal odds ratios. Stat. Sci. 26, 403–422 (2011).
Article MathSciNet MATH Google Scholar
Winsvold, B. S. et al. Shared genetic risk between migraine and coronary artery disease: a genome-wide analysis of common variants. PLoS ONE 12, e0185663 (2017).
Article PubMed PubMed Central CAS Google Scholar
Rodondi, N. et al. Subclinical hypothyroidism and the risk of coronary heart disease and mortality. JAMA 304, 1365–1374 (2010).
Article CAS PubMed PubMed Central Google Scholar
Fadeyev, V. V. et al. Levothyroxine replacement therapy in patients with subclinical hypothyroidism and coronary artery disease. Endocr. Pract. 12, 5–17 (2006).
Article PubMed Google Scholar
Cho, Y. et al. Alcohol intake and cardiovascular risk factors: a Mendelian randomisation study. Sci. Rep. 5, 18422 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Williams, P. T. & Thompson, P. D. The relationship of walking intensity to total and cause-specific mortality. Results from the National Walkers' Health Study. PLoS ONE 8, e81098 (2013).
Article ADS PubMed PubMed Central CAS Google Scholar
Sugawara, N. et al. Body composition in patients with schizophrenia: comparison with healthy controls. Ann. Gen. Psychiatry 11, 11 (2012).
Article PubMed PubMed Central Google Scholar
Ludvigsson, J. F., Osby, U., Ekbom, A. & Montgomery, S. M. Coeliac disease and risk of schizophrenia and other psychosis: a general population cohort study. Scand. J. Gastroenterol. 42, 179–185 (2007).
Article PubMed Google Scholar
Burgess, S., Butterworth, A. & Thompson, S. G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 37, 658–665 (2013).
Article PubMed PubMed Central Google Scholar
Bowden, J., Hemani, G. & Davey Smith, G. Detecting individual and global horizontal pleiotropy in Mendelian randomization: a job for the humble heterogeneity statistic? Am. J. Epidemiol. 187, 2681–2685 (2018).
PubMed PubMed Central Google Scholar
Sterne, J. A. & Davey Smith, G. Sifting the evidence-what's wrong with significance tests? Phys. Ther. 81, 1464–1469 (2001).
Article PubMed Google Scholar
Wasserstein, R. L. & Lazar, N. A. The ASA's statement on p-values: context, process, and purpose. Am. Stat. 70, 129–131 (2016).
Article MathSciNet Google Scholar
Bennett, D. A. & Holmes, M. V. Mendelian randomisation in cardiovascular research: an introduction for clinicians. Heart 103, 1400–1407 (2017).
Article CAS PubMed PubMed Central Google Scholar
Tyrrell, J. et al. Height, body mass index, and socioeconomic status: mendelian randomisation study in UK Biobank. BMJ 352, i582 (2016).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study was supported by the UK Medical Research Council (MC_UU_00011/1; MC_UU_00011/4), which founds Integrative Epidemiology Unit at the University of Bristol where Y.C., P.C.H., E.S., T.R.G., J.Z., G.D.S. and G.H. work. G.H. was supported by the Wellcome Trust and Royal Society [208806/Z/17/Z].

Author information

Authors and Affiliations

MRC Integrative Epidemiology Unit, Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, BS8 2BN, UK
Yoonsu Cho, Philip C. Haycock, Eleanor Sanderson, Tom R. Gaunt, Jie Zheng, George Davey Smith & Gibran Hemani
Department of Biostatistics, University of Liverpool, Liverpool, L69 3GL, UK
Andrew P. Morris
Division of Musculoskeletal and Dermatological Sciences, University of Manchester, Manchester, M13 9NT, UK
Andrew P. Morris

Authors

Yoonsu Cho
View author publications
You can also search for this author in PubMed Google Scholar
Philip C. Haycock
View author publications
You can also search for this author in PubMed Google Scholar
Eleanor Sanderson
View author publications
You can also search for this author in PubMed Google Scholar
Tom R. Gaunt
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Andrew P. Morris
View author publications
You can also search for this author in PubMed Google Scholar
George Davey Smith
View author publications
You can also search for this author in PubMed Google Scholar
Gibran Hemani
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.C., G.D.S. and G.H. conceived the study and developed the statistical analysis plan. Y.C. and G.H. developed the model and methods. Y.C., G.D.S. and G.H. prepared the first draft of manuscript. Y.C., P.C.H., E.S., T.R.G., J.Z., A.P.M., G.D.S., and G.H. contributed to the writing of the manuscript. All authors reviewed and agreed on the manuscript.

Corresponding author

Correspondence to Gibran Hemani.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Hans van Kippersluis, Kostas Tsilidis and Marie Verbanck for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Data 1

Supplementary Data 2

Peer Review File

Reporting Summary

Description of Additional Supplementary Files

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cho, Y., Haycock, P.C., Sanderson, E. et al. Exploiting horizontal pleiotropy to search for causal pathways within a Mendelian randomization framework. Nat Commun 11, 1010 (2020). https://doi.org/10.1038/s41467-020-14452-4

Download citation

Received: 26 November 2018
Accepted: 10 December 2019
Published: 21 February 2020
DOI: https://doi.org/10.1038/s41467-020-14452-4
Springer Nature Limited

This article is cited by

The causality between CD8+NKT cells and CD16−CD56 on NK cells with hepatocellular carcinoma: a Mendelian randomization study
- Zhengmei Lu
- Xiaowei Chai
- Shibo Li
Infectious Agents and Cancer (2024)
Association of glucose-lowering drug target and risk of gastrointestinal cancer: a mendelian randomization study
- Yi Yang
- Bo Chen
- Yi Wang
Cell & Bioscience (2024)
Obesity, Glycemic Traits, Lifestyle Factors, and Risk of Facial Aging: A Mendelian Randomization Study in 423,999 Participants
- Xuan-jun Liu
- Muhammad Tipu Sultan
- Guang-shuai Li
Aesthetic Plastic Surgery (2024)
Effects of genetically predicted posttraumatic stress disorder on autoimmune phenotypes
- Adam X. Maihofer
- Andrew Ratanatharathorn
- Caroline M. Nievergelt
Translational Psychiatry (2024)
Causal associations between liver traits and Colorectal cancer: a Mendelian randomization study
- Ying Ni
- Wenkai Wang
- Yun Jiang
BMC Medical Genomics (2023)

Exploiting horizontal pleiotropy to search for causal pathways within a Mendelian randomization framework

Abstract

Similar content being viewed by others

Introduction

Results

Overview of MR-TRYX

Adjustment of pleiotropic pathways improves MR performance

Empirical MR-TRYX analyses

Discussion

Methods

Outlier detection

Candidate trait detection

Assessing effect of the candidate traits on the outcome

Adjusting causal estimates for candidate-trait associations

Simulations

Empirical analyses

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation