Falsification of the instrumental variable conditions in Mendelian randomization studies in the UK Biobank

Guo, Kelly; Diemer, Elizabeth W.; Labrecque, Jeremy A.; Swanson, Sonja A.

doi:10.1007/s10654-023-01003-6

Falsification of the instrumental variable conditions in Mendelian randomization studies in the UK Biobank

METHODS
Open access
Published: 31 May 2023

Volume 38, pages 921–927, (2023)
Cite this article

Download PDF

You have full access to this open access article

European Journal of Epidemiology Aims and scope Submit manuscript

Falsification of the instrumental variable conditions in Mendelian randomization studies in the UK Biobank

Download PDF

Kelly Guo ORCID: orcid.org/0000-0002-7474-7663¹,
Elizabeth W. Diemer^2,3,4,
Jeremy A. Labrecque¹ &
…
Sonja A. Swanson^1,3,4,5

3554 Accesses
4 Citations
2 Altmetric
Explore all metrics

Abstract

Mendelian randomization (MR) is an increasingly popular approach to estimating causal effects. Although the assumptions underlying MR cannot be verified, they imply certain constraints, the instrumental inequalities, which can be used to falsify the MR conditions. However, the instrumental inequalities are rarely applied in MR. We aimed to explore whether the instrumental inequalities could detect violations of the MR conditions in case studies analyzing the effect of commonly studied exposures on coronary artery disease risk.

Using 1077 single nucleotide polymorphisms (SNPs), we applied the instrumental inequalities to MR models for the effects of vitamin D concentration, alcohol consumption, C-reactive protein (CRP), triglycerides, high-density lipoprotein (HDL) cholesterol, and low-density lipoprotein (LDL) cholesterol on coronary artery disease in the UK Biobank. For their relevant exposure, we applied the instrumental inequalities to MR models proposing each SNP as an instrument individually, and to MR models proposing unweighted allele scores as an instrument. We did not identify any violations of the MR assumptions when proposing each SNP as an instrument individually. When proposing allele scores as instruments, we detected violations of the MR assumptions for 5 of 6 exposures.

Within our setting, this suggests the instrumental inequalities can be useful for identifying violations of the MR conditions when proposing multiple SNPs as instruments, but may be less useful in determining which SNPs are not instruments. This work demonstrates how incorporating the instrumental inequalities into MR analyses can help researchers to identify and mitigate potential bias.

Using the global randomization test as a Mendelian randomization falsification test for the exclusion restriction assumption

Article Open access 29 February 2024

Mendelian randomization in cardiometabolic disease: challenges in evaluating causality

Article 01 June 2017

Non-linear Mendelian randomization: detection of biases using negative controls with a focus on BMI, Vitamin D and LDL cholesterol

Article Open access 25 May 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Mendelian randomization (MR) has become a popular approach for estimating causal effects and is increasingly popular as new large genetic databases become available [1, 2]. Like any causal inference approach using observational data, MR requires causal assumptions. In brief, these methods require that genetic variants are (i) associated with the exposure of interest, (ii) only cause the outcome via the exposure, and (iii) share no causes with the outcome. We will refer to these conditions collectively as the instrumental conditions. Notably, these conditions are sufficient only for sharp null testing and bounding; additional assumptions are necessary for point estimation [3].

Although instrumental conditions (ii) and (iii) are not verifiable, there are methods that can be used to falsify them. In particular, the instrumental conditions imply a set of mathematical constraints known as the instrumental inequalities, which, if violated, imply that the observed data distribution in a particular dataset is inconsistent with the instrumental conditions [4,5,6,7]. Investigators can thus use these instrumental inequalities as one approach for falsifying the instrumental conditions. Diemer et al. 2020 [8] presented a description of how to use the instrumental inequalities as well as an application of the instrumental inequalities in the setting of MR studies with multiple genetic variants proposed as instruments. A practical question remains: to what extent do the instrumental inequalities detect violations in analyses of commonly studied exposures or commonly used databases in MR? To investigate this, we applied the instrumental inequalities to a series of MR analyses of the effect of vitamin D concentration, alcohol consumption, C-reactive protein (CRP), triglycerides, high-density lipoprotein (HDL) cholesterol and low-density lipoprotein (LDL) cholesterol on coronary artery disease risk in the UK Biobank. These specific exposures were chosen as examples that are commonly studied and have been studied using MR in the UK Biobank, as well as to explore a range of exposure types (e.g., biomarkers and behaviors; categorical and continuous measures).

Methods

Study design

The UK Biobank is a large prospective study of 502,648 adults, aged 40–69, recruited from across the United Kingdom between 2006 and 2010. Details of the cohort, including recruitment, assessment procedures, and quality control have been described in detail elsewhere [9, 10]. The UK Biobank received ethical approval from the National Health Service North West Multi-centre Research Ethics Committee (Research Ethics Committee reference: 11/NW/0382) and all participants provided written, informed consent. For each exposure under study, we restricted the eligible population to individuals with complete data on that exposure, the outcome of coronary artery disease, and that exposure’s proposed genetic instruments, resulting in sample sizes between 424,978 and 486,195 individuals (see Supplementary Information: S4). While this complete case analysis may result in selection bias [11], it is consistent with some common approaches used in MR studies within the UK Biobank.

Exposures

We selected 6 exposures whose relationships to cardiovascular disease have been previously studied using MR: vitamin D concentration, alcohol consumption, CRP, triglycerides, HDL-cholesterol, and LDL-cholesterol [12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27].

Vitamin D, triglyceride, CRP, HDL-, and LDL-cholesterol levels were measured in blood samples collected at either the initial assessment visit or a repeat assessment visit conducted between 2012 and 2013. Details of biomarker measurements and assay performance in UK Biobank have been described in detail elsewhere [28]. Briefly, vitamin D concentration was assessed based on total 25-hydroxyvitamin D (25(OH)D) levels measured using the Diasorin Liason, a chemiluminescent immunoassay. CRP levels were measured using an immunoturbidimetric assay in a Beckman Coulter analyzer (AU5800 Analyzer, Beckman Coulter, CA). Triglyceride concentrations were measured using an enzymatic analysis on said Beckman Coulter analyzer. HDL-cholesterol levels were measured by enzyme inhibition analysis, and LDL-cholesterol levels were measured using enzymatic protective selection analysis on said Beckman Coulter analyzer. Because the instrumental inequalities can only be used with categorical exposures [4, 5], all these exposures were categorized into deciles. Frequency of alcohol consumption was assessed based on self-report questionnaire. Participants were asked “About how often do you drink alcohol?” with response options “Never”, “Special occasions only”, “1 to 3 times a month”, “Once or twice a week”, “3 to 4 times a week”, or “Daily or almost daily”. If participants felt the answer varied, they were instructed to give an average over the past year. This exposure was categorized using these response categories.

Outcome

Participant electronic health records, including International Classification of Disease (ICD-10) diagnosis codes and Office of Population and Censuses Surveys (OPCS-4) procedure codes, have been integrated into UK Biobank [29]. Additionally, patients were asked to report diagnoses of cardiovascular disease using questionnaires, which were subsequently checked during a verbal interview with a trained nurse. Participants were considered to have coronary artery disease if they had experienced angina pectoris, acute or subsequent myocardial infarction or other acute or chronic ischemic heart disease (ICD-10 I20X, I21X, I123X, I24X, I25.5, I25.6, I25.8, I25.9) or if they previously underwent coronary procedure (OPCS-4 K40, K41, K43, K46, K49, K75, K45, K50.1-3) (see Supplementary Information: S2 and S3).

Genetic variants

In order to identify genetic variants that had previously been used in MR studies of these specific exposures within the UK Biobank, we conducted a systematic review of PubMed and the UK Biobank archive using the search term “Mendelian random*”, and each of the six exposures. Studies were eligible for inclusion in the review if they explicitly reported using an MR approach, studied either vitamin D, alcohol use, triglyceride levels, CRP, LDL-, or HDL-cholesterol as an exposure, and conducted the analysis using a UK Biobank sample. This resulted in 30 articles, of which 12 were rejected based on full text review. After review, 9 articles on vitamin D concentration, 3 articles on alcohol use, 2 articles on CRP, and 4 articles on lipoproteins (LDL-cholesterol, HDL-cholesterol, or triglycerides) met the criteria and were included in the review (see Supplementary Information: S1).

From these articles, we proposed single nucleotide polymorphisms (SNPs) as instruments for one of the six exposures if they had been proposed as instruments in at least one previous study. SNPs were not included if previous studies indicated that they were associated with another phenotype on a possibly pleiotropic pathway. In total, we proposed 1077 SNPs as instruments, including 15 SNPs as instruments for vitamin D concentration, 28 SNPs as instruments for alcohol consumption, 528 SNP as an instrument for CRP, 22 SNPs as instruments for triglyceride levels, 82 SNPs as instruments for HDL-cholesterol, and 402 SNPs as instruments for LDL-cholesterol (see Supplementary Information for an overview of the proposed instruments). We also constructed an unweighted categorical allele score for each exposure.

Analysis

The properties and use of the instrumental inequalities have been described in detail in publications by Pearl, 1995 [4], Glymour et al., 2012 [30] and Diemer et al., 2020 [8]. Intuitively, by using an instrumental variable model, investigators are assuming that the observed data and unobserved counterfactuals from a population follow certain constraints (i.e., a particular ‘shape’). One way for investigators to check whether the instrumental variable assumptions hold would be to check whether the observed data fits within the ‘shape’ assumed by the instrumental variable model. The instrumental inequalities are simply a means of conducting this check, by evaluating whether a particular dataset conforms to these constraints.

More specifically, the instrumental inequalities are a set of mathematical equations derived from the instrumental conditions by Pearl, 1995 [4]:

$$\mathop {{\rm{max}}}\limits_x {\sum _y}\mathop {{\rm{max}}}\limits_z P\left( {x,y|z} \right) \le 1$$

where Z is the proposed instrument (or proposed instrument set), X is the exposure, and Y is the outcome. These equations evaluate whether proportions of certain combinations of the proposed instruments, exposure, and outcome sum to equal or less than one. If one or more of these inequalities do not hold, meaning that the sum of the observed probabilities is greater than one, it implies that at least one of the instrumental conditions is violated within the dataset. However, if the instrumental inequalities do hold, it does not constitute evidence for or against the instrumental conditions. Such a result does not imply that the instrumental conditions are satisfied within the specific dataset, only that we are unable to prove the instrumental conditions are violated in that dataset. Thus, the instrumental inequalities can be used to falsify (but not verify) an MR model. Moreover, the instrumental inequalities cannot show why the instrumental conditions are violated, only that they are violated. For MR studies, such violations can result from a number of different bias structures, including pleiotropy, selection bias, population stratification, and assortative mating [8, 31]. One key limitation of the instrumental inequalities is that they cannot be applied to continuous exposures [5]. While it is computationally easy to resolve this by discretizing the continuous exposures of interest, the instrumental conditions can then be violated by the measurement error induced by this categorization (sometimes known as coarsening) [31]. A simplified example demonstrating the application of the instrumental inequalities and the corresponding R code can be found in the Supplemental Information (see Supplemental Information: S6 and S11).

When multiple SNPs are proposed as instruments in MR, we can also apply the instrumental inequalities to sets of SNPs jointly [8]. For each exposure-outcome combination, we applied the instrumental inequalities to models proposing each SNP as an instrument individually and to a model proposing unweighted allele score deciles as an instrument, using R code developed in Diemer et al., 2020 [8] (see Supplementary Information: S12). As a sensitivity analysis to understand how results are impacted by residual population stratification, we also calculated the instrumental inequalities using inverse probability weights to adjust for 10 principal components (see Supplementary Information: S7). All analyses were conducted in R version 3.2.6 [32].

Comparison to other falsification strategies: MR-Egger and MR-PRESSO

We also compared the results of the instrumental inequalities to the results of a MR-Egger intercept test and a MR Pleiotropy RESidual Sum and Outlier (PRESSO) global test. Both are commonly used falsification strategies in MR [33,34,35]. The MR-Egger method conducts an inverse variance weighted linear regression of the instrument-outcome association on the instrument-exposure association. If a test of the estimated intercept of this regression rejects the null hypothesis that the intercept is zero, then one often concludes that the instrumental conditions do not hold. The MR-PRESSO global test conducts multiple inverse variance weighed regressions to compute a residual sum of squares for each instrument in a set of proposed instruments, omitting one candidate instrument in each computation. It evaluates whether any of the proposed instruments are driving a difference between the total residual sum of squares and the simulated expected residual sum of squares. If the total residual sum of squares is inconsistent with what is expected by chance, the test rejects the null hypothesis of no horizontal pleiotropy; suggesting that the instrumental conditions are violated.

Unlike the instrumental inequalities, the MR-Egger and MR-PRESSO tests do not require us to coarsen continuous exposures. However, in addition to the instrumental conditions, they also evaluate whether linearity and homogeneity assumptions are satisfied [33,34,35,36]. A comparison of these falsification strategies’ assumptions, interpretation, and practical considerations are reviewed in Table 1. To compare the instrumental inequalities to the MR-Egger intercept test and MR-PRESSO, we applied both falsification methods to models proposing SNPs jointly as instruments for vitamin D concentration, alcohol consumption, CRP, triglyceride levels, HDL-cholesterol, and LDL-cholesterol.

Table 1 Overview of key assumptions, interpretations, and practical considerations of the instrumental inequalities, MR-Egger intercept test and MR-PRESSO global test

Full size table

Results

The study population in our analytic subpopulation consisted of participants with a median age of 58 years (ranging from 38 to 73 years) who were primarily of white, British ethnicity. The proportion of women varied between 53.5% and 54.2% across samples. A more detailed overview of the demographic and exposure characteristics of the study population can be found in the Supplementary Information (see Supplementary Information: S5).

The instrumental inequalities held for all 1077 SNPs proposed as instruments when considering each SNP individually (Table 2 and Supplementary Tables 2–7). The instrumental inequalities also held when allele scores were proposed as instruments for vitamin D. However, the instrumental inequalities were violated when proposing allele scores as instruments for alcohol consumption, CRP, triglycerides, HDL-cholesterol and LDL-cholesterol indicating that the instrumental conditions were violated for those models. Results were generally consistent when inverse probability weighted for 10 principal components (Supplementary Tables 8–13).

Table 2 Summary of results of instrumental inequalities applied to Mendelian randomization models studying the effects of vitamin D concentration, alcohol consumption, C-reactive protein, triglycerides, HDL-cholesterol, and LDL-cholesterol concentrations on coronary artery disease

Full size table

The instrumental inequalities were violated when proposing SNPs jointly as instruments for vitamin D, alcohol consumption, CRP, triglycerides and HDL-cholesterol and LDL-cholesterol levels (see Supplementary Information: S10). The MR-Egger intercept test showed a significant non-zero intercept when proposing SNPs jointly as instruments for CRP (intercept = 2.971 × 10^-4, p < 0.001), triglyceride (intercept = 8.931 × 10^-4, p < 0.001) and HDL-cholesterol levels (intercept = -3.474 × 10^-4, p = 0.018), but not for vitamin D (intercept = 3.588 × 10^-5, p = 0.914), alcohol consumption (intercept = 3.427 × 10^-4, p = 0.081) and LDL-cholesterol levels (intercept = 5.560 × 10^-5, p = 0.233). The MR-PRESSO global test was statistically significant for alcohol consumption, CRP, triglycerides, HDL-cholesterol, LDL-cholesterol, but not for vitamin D.

Discussion

In our investigation of 6 exposures and an accompanying 1077 SNPs used in prior MR studies in UK Biobank, we detected no violations of the instrumental conditions when considering each SNP individually as a proposed instrument. Violations of the instrumental conditions were detected for some allele scores proposed as instruments.

These findings suggest that the instrumental inequalities may be helpful in detecting violations of the instrumental conditions for sets of SNPs proposed as instruments, but, per prior conventional wisdom, may not detect which specific SNP or SNPs are not instruments. Moreover, these findings do not explain why the allele scores are not instruments: we do not know if violations of the instrumental conditions are due to selection bias, pleiotropy or due to another structural violation. It also remains unknown if the violation is a result of one or multiple SNPs marginally violating the instrumental conditions. Additionally, these findings do not tell us whether the violation is study-specific or could indicate a structural violation with that allele score being an instrument for that exposure-outcome in another study setting [8]. It is worth reiterating that detecting no violations when considering each SNP individually should only be interpreted as a failure to falsify, and not as support for the validity. It is possible that the instrumental conditions are violated for all SNP, exposure, and outcome combinations in this study, but the instrumental inequalities detected only some of these violations. Our results also need to be interpreted in light of the imposed categories of the continuous exposures [8].

Notably, the results of the instrumental inequalities, MR-Egger and MR-PRESSO diverged to some extent for several of the exposures of interest. In MR models for the effect of CRP, triglycerides, and HDL-cholesterol on coronary artery disease, violations of the instrumental conditions were detected by the instrumental inequalities, MR-Egger, and by MR-PRESSO when proposing all SNPs jointly as instruments. Violations were also detected by both MR-PRESSO and the instrumental inequalities when proposing all SNPs jointly as instruments for alcohol consumption and LDL-cholesterol, though MR-Egger did not detect any violations of the instrumental conditions for these exposures. MR-Egger and MR-PRESSO also did not detect any violations for vitamin D and LDL-cholesterol, though the instrumental inequalities were violated for both when proposing all SNPs jointly as instruments, and also when proposing an allele score as an instrument for LDL-cholesterol.

These differences suggest that these falsification strategies can be complementary to one another, as there are a number of possible reasons why the results of these three methods may diverge. First, some research has indicated that MR-Egger may produce biased results when applied to single-sample applications [37], and the MR-Egger intercept test may often be underpowered to detect violations of the MR assumptions [38]. Second, as discussed earlier, the inequalities require categorization of the exposure, meaning violations of the inequalities may indicate bias resulting from said categorization, while the MR-Egger intercept test and MR-PRESSO global test require additional linearity and homogeneity assumptions that are not guaranteed to hold in this setting. In addition, previous work has suggested that, as the number of proposed joint instruments grows and the number of individuals within strata of the proposed joint instrument decreases, the instrumental inequalities may become increasingly sensitive to random violations of the MR conditions [8]. Overall, our findings suggest that further work is needed to evaluate the relative sensitivity of the instrumental inequalities, MR-Egger intercept test and MR-PRESSO global test to different forms of bias, but that these three methods appear to provide information about the validity of the MR assumptions in applied studies.

MR studies rely on strong, unverifiable assumptions, but investigators have an arsenal of tools for falsifying these assumptions and attempting to mitigate violations, along with robust methods that leverage alternative assumptions as a means to relax some others [34, 39]. The instrumental inequalities are an easily implementable technique, which, if integrated into this MR toolbox, could help to identify violations of the instrumental conditions in common MR settings with multiple proposed instruments, including biases that may be difficult to identify through other means.

Data Availability

The data are available from the UK Biobank to individuals who apply for and receive permission from the appropriate UK Biobank study committees.

References

Davies NM, Holmes MV, Davey Smith G. Reading mendelian randomisation studies: a guide, glossary, and checklist for clinicians. BMJ. 2018;362:k601. https://doi.org/10.1136/bmj.k601.
Article PubMed PubMed Central Google Scholar
Johansson A, Marroni F, Hayward C, et al. Linkage and genome-wide association analysis of obesity-related phenotypes: association of weight with the MGAT1 gene. Obes (Silver Spring). 2010;18(4):803–8. https://doi.org/10.1038/oby.2009.359.
Article CAS Google Scholar
Hernan MA, Robins JM. Instruments for causal inference: an epidemiologist’s dream? Epidemiology. 2006;17(4):360–72. https://doi.org/10.1097/01.ede.0000222409.00878.37.
Article PubMed Google Scholar
Pearl J. On the testability of causal models with latent and instrumental variables. Proceedings of the Eleventh conference on Uncertainty in artificial intelligence. Montréal, Qué, Canada: Morgan Kaufmann Publishers Inc.; 1995. p. 435–43.
Bonet B. Instrumentality tests revisited. Proceedings of the Seventeenth conference on Uncertainty in artificial intelligence. Seattle, Washington: Morgan Kaufmann Publishers Inc.; 2001. p. 48–55.
Balke A, Pearl J. Bounds on Treatment Effects from studies with imperfect compliance. J Am Stat Assoc. 1997;92(439):1171–6. https://doi.org/10.1080/01621459.1997.10474074.
Article Google Scholar
Richardson T, Robins J. Analysis of the Binary Instrumental Variable Model. 2010.
Diemer EW, Labrecque J, Tiemeier H, Swanson SA. Application of the Instrumental Inequalities to a mendelian randomization study with multiple proposed Instruments. Epidemiology. 2020;31(1):65–74. https://doi.org/10.1097/EDE.0000000000001126.
Article PubMed Google Scholar
Sudlow C, Gallacher J, Allen N, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779. https://doi.org/10.1371/journal.pmed.1001779.
Article PubMed PubMed Central Google Scholar
Bycroft C, Freeman C, Petkova D, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562(7726):203–9. https://doi.org/10.1038/s41586-018-0579-z.
Article CAS PubMed PubMed Central Google Scholar
Swanson SA. A practical guide to Selection Bias in Instrumental variable analyses. Epidemiology. 2019;30(3):345–9. https://doi.org/10.1097/EDE.0000000000000973.
Article PubMed Google Scholar
He Y, Timofeeva M, Farrington SM, et al. Exploring causality in the association between circulating 25-hydroxyvitamin D and colorectal cancer risk: a large mendelian randomisation study. BMC Med. 2018;16(1):142. https://doi.org/10.1186/s12916-018-1119-2.
Article CAS PubMed PubMed Central Google Scholar
Maddock J, Zhou A, Cavadino A, et al. Vitamin D and cognitive function: a mendelian randomisation study. Sci Rep. 2017;7(1):13230. https://doi.org/10.1038/s41598-017-13189-3.
Article CAS PubMed PubMed Central Google Scholar
Lund-Nielsen J, Vedel-Krogh S, Kobylecki CJ, Brynskov J, Afzal S, Nordestgaard BG. Vitamin D and inflammatory bowel disease: mendelian randomization analyses in the Copenhagen Studies and UK Biobank. J Clin Endocrinol Metab. 2018;103(9):3267–77. https://doi.org/10.1210/jc.2018-00250.
Article PubMed Google Scholar
Manousaki D, Paternoster L, Standl M, et al. Vitamin D levels and susceptibility to asthma, elevated immunoglobulin E levels, and atopic dermatitis: a mendelian randomization study. PLoS Med. 2017;14(5):e1002294. https://doi.org/10.1371/journal.pmed.1002294.
Article CAS PubMed PubMed Central Google Scholar
Larsson SC, Melhus H, Michaelsson K. Circulating serum 25-Hydroxyvitamin D levels and bone Mineral density: mendelian randomization study. J Bone Miner Res. 2018;33(5):840–4. https://doi.org/10.1002/jbmr.3389.
Article CAS PubMed Google Scholar
Dudding T, Johansson M, Thomas SJ, Brennan P, Martin RM, Timpson NJ. Assessing the causal association between 25-hydroxyvitamin D and the risk of oral and oropharyngeal cancer using mendelian randomization. Int J Cancer. 2018;143(5):1029–36. https://doi.org/10.1002/ijc.31377.
Article CAS PubMed PubMed Central Google Scholar
Thompson WD, Tyrrell J, Borges MC, et al. Association of maternal circulating 25(OH)D and calcium with birth weight: a mendelian randomisation analysis. PLoS Med. 2019;16(6):e1002828. https://doi.org/10.1371/journal.pmed.1002828.
Article PubMed PubMed Central Google Scholar
Havdahl A, Mitchell R, Paternoster L, Davey Smith G. Investigating causality in the association between vitamin D status and self-reported tiredness. Sci Rep. 2019;9(1):2880. https://doi.org/10.1038/s41598-019-39359-z.
Article CAS PubMed PubMed Central Google Scholar
Bae SC, Lee YH. Alcohol intake and risk of rheumatoid arthritis: a mendelian randomization study. Z Rheumatol. 2019;78(8):791–6. https://doi.org/10.1007/s00393-018-0537-z.
Article CAS PubMed Google Scholar
Guo R, Wu L, Fu Q. Is there causal relationship of Smoking and Alcohol Consumption with Bone Mineral density? A mendelian randomization study. Calcif Tissue Int. 2018;103(5):546–53. https://doi.org/10.1007/s00223-018-0452-y.
Article CAS PubMed Google Scholar
Beasley M, Freidin MB, Basu N, Williams FMK, Macfarlane GJ. What is the effect of alcohol consumption on the risk of chronic widespread pain? A mendelian randomisation study using UK Biobank. Pain. 2019;160(2):501–7. https://doi.org/10.1097/j.pain.0000000000001426.
Article CAS PubMed Google Scholar
Trinder M, Walley KR, Boyd JH, Brunham LR. Causal inference for genetically determined levels of high-density lipoprotein cholesterol and risk of Infectious Disease. Arterioscler Thromb Vasc Biol. 2020;40(1):267–78. https://doi.org/10.1161/ATVBAHA.119.313381.
Article CAS PubMed Google Scholar
Hwang LD, Lawlor DA, Freathy RM, Evans DM, Warrington NM. Using a two-sample mendelian randomization design to investigate a possible causal effect of maternal lipid concentrations on offspring birth weight. Int J Epidemiol. 2019;48(5):1457–67. https://doi.org/10.1093/ije/dyz160.
Article PubMed PubMed Central Google Scholar
Wang Q, Wang Y, Lehto K, Pedersen NL, Williams DM, Hagg S. Genetically-predicted life-long lowering of low-density lipoprotein cholesterol is associated with decreased frailty: a mendelian randomization study in UK biobank. EBioMedicine. 2019;45:487–94. https://doi.org/10.1016/j.ebiom.2019.07.007.
Article PubMed PubMed Central Google Scholar
Funck-Brentano T, Nethander M, Moverare-Skrtic S, Richette P, Ohlsson C. Causal factors for knee, hip, and Hand Osteoarthritis: a mendelian randomization study in the UK Biobank. Arthritis Rheumatol. 2019;71(10):1634–41. https://doi.org/10.1002/art.40928.
Article CAS PubMed PubMed Central Google Scholar
Han X, Ong JS, An J, Hewitt AW, Gharahkhani P, MacGregor S. Using mendelian randomization to evaluate the causal relationship between serum C-reactive protein levels and age-related macular degeneration. Eur J Epidemiol. 2020;35(2):139–46. https://doi.org/10.1007/s10654-019-00598-z.
Article CAS PubMed Google Scholar
Fry D, Almond R, Moffat S, Gordon M, Singh P. UK Biobank Biomarker Project: Companion Document to accompany serum Biomarker Data. UK Biobank Document Showcase; 2019.
Organization WH. ICD-10: international statistical classification of diseases and related health problems : tenth revision. 2nd ed. ed. Geneva: World Health Organization; 2004.
Google Scholar
Glymour MM, Tchetgen Tchetgen EJ, Robins JM. Credible mendelian randomization studies: approaches for evaluating the instrumental variable assumptions. Am J Epidemiol. 2012;175(4):332–9. https://doi.org/10.1093/aje/kwr323.
Article PubMed PubMed Central Google Scholar
VanderWeele TJ, Tchetgen Tchetgen EJ, Cornelis M, Kraft P. Methodological challenges in mendelian randomization. Epidemiology. 2014;25(3):427–35. https://doi.org/10.1097/EDE.0000000000000081.
Article PubMed PubMed Central Google Scholar
Team RC. R: A language and environment for statistical computing. 2013.
Burgess S, Thompson SG. Interpreting findings from mendelian randomization using the MR-Egger method. Eur J Epidemiol. 2017;32(5):377–89. https://doi.org/10.1007/s10654-017-0255-x.
Article PubMed PubMed Central Google Scholar
Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44(2):512–25. https://doi.org/10.1093/ije/dyv080.
Article PubMed PubMed Central Google Scholar
Verbanck M, Chen CY, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from mendelian randomization between complex traits and diseases. Nat Genet. 2018;50(5):693–8. https://doi.org/10.1038/s41588-018-0099-7.
Article CAS PubMed PubMed Central Google Scholar
Labrecque J, Swanson SA. Understanding the Assumptions underlying Instrumental variable analyses: a brief review of falsification strategies and related tools. Curr Epidemiol Rep. 2018;5(3):214–20. https://doi.org/10.1007/s40471-018-0152-1.
Article PubMed PubMed Central Google Scholar
Minelli C, Del Greco MF, van der Plaat DA, Bowden J, Sheehan NA, Thompson J. The use of two-sample methods for mendelian randomization analyses on single large datasets. Int J Epidemiol. 2021;50(5):1651–9. https://doi.org/10.1093/ije/dyab084.
Article PubMed PubMed Central Google Scholar
Slob EAW, Burgess S. A comparison of robust mendelian randomization methods using summary data. Genet Epidemiol. 2020;44(4):313–29. https://doi.org/10.1002/gepi.22295.
Article PubMed PubMed Central Google Scholar
Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent estimation in mendelian randomization with some Invalid Instruments using a weighted median estimator. Genet Epidemiol. 2016;40(4):304–14. https://doi.org/10.1002/gepi.21965.
Article PubMed PubMed Central Google Scholar
Swanson SA, Hernan MA, Miller M, Robins JM, Richardson TS. Partial identification of the average treatment effect using Instrumental variables: review of methods for Binary Instruments, Treatments, and outcomes. J Am Stat Assoc. 2018;113(522):933–47. https://doi.org/10.1080/01621459.2018.1434530.
Article CAS PubMed PubMed Central Google Scholar

Download references

Funding

Dr. S.A. Swanson was supported by a NWO/ZonMw Veni grant (91617066). E.W. Diemer is supported by an innovation program under the Marie Sklodowska-Curie grant agreement number 721567.

Author information

Authors and Affiliations

Department of Epidemiology, Erasmus MC, Rotterdam, the Netherlands
Kelly Guo, Jeremy A. Labrecque & Sonja A. Swanson
Department of Child and Adolescent Psychiatry, Erasmus MC, Rotterdam, the Netherlands
Elizabeth W. Diemer
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, USA
Elizabeth W. Diemer & Sonja A. Swanson
CAUSALab, Harvard T.H. Chan School of Public Health, Boston, USA
Elizabeth W. Diemer & Sonja A. Swanson
Department of Epidemiology, University of Pittsburgh School of Public Health, Pittsburgh, USA
Sonja A. Swanson

Authors

Kelly Guo
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth W. Diemer
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy A. Labrecque
View author publications
You can also search for this author in PubMed Google Scholar
Sonja A. Swanson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kelly Guo.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Guo, K., Diemer, E.W., Labrecque, J.A. et al. Falsification of the instrumental variable conditions in Mendelian randomization studies in the UK Biobank. Eur J Epidemiol 38, 921–927 (2023). https://doi.org/10.1007/s10654-023-01003-6

Download citation

Received: 12 October 2021
Accepted: 03 April 2023
Published: 31 May 2023
Issue Date: September 2023
DOI: https://doi.org/10.1007/s10654-023-01003-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Falsification of the instrumental variable conditions in Mendelian randomization studies in the UK Biobank

Abstract

Similar content being viewed by others

Using the global randomization test as a Mendelian randomization falsification test for the exclusion restriction assumption

Mendelian randomization in cardiometabolic disease: challenges in evaluating causality

Non-linear Mendelian randomization: detection of biases using negative controls with a focus on BMI, Vitamin D and LDL cholesterol

Introduction

Methods

Study design

Exposures

Outcome

Genetic variants

Analysis

Comparison to other falsification strategies: MR-Egger and MR-PRESSO

Results

Discussion

Data Availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Falsification of the instrumental variable conditions in Mendelian randomization studies in the UK Biobank

Abstract

Similar content being viewed by others

Using the global randomization test as a Mendelian randomization falsification test for the exclusion restriction assumption

Mendelian randomization in cardiometabolic disease: challenges in evaluating causality

Non-linear Mendelian randomization: detection of biases using negative controls with a focus on BMI, Vitamin D and LDL cholesterol

Introduction

Methods

Study design

Exposures

Outcome

Genetic variants

Analysis

Comparison to other falsification strategies: MR-Egger and MR-PRESSO

Results

Discussion

Data Availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation