Skip to main content

Age at menarche and lung function: a Mendelian randomization study


A trend towards earlier menarche in women has been associated with childhood factors (e.g. obesity) and hypothesised environmental exposures (e.g. endocrine disruptors present in household products). Observational evidence has shown detrimental effects of early menarche on various health outcomes including adult lung function, but these might represent spurious associations due to confounding. To address this we used Mendelian randomization where genetic variants are used as proxies for age at menarche, since genetic associations are not affected by classical confounding. We estimated the effects of age at menarche on forced vital capacity (FVC), a proxy for restrictive lung impairment, and ratio of forced expiratory volume in one second to FVC (FEV1/FVC), a measure of airway obstruction, in both adulthood and adolescence. We derived SNP-age at menarche association estimates for 122 variants from a published genome-wide meta-analysis (N = 182,416), with SNP-lung function estimates obtained by meta-analysing three studies of adult women (N = 46,944) and two of adolescent girls (N = 3025). We investigated the impact of departures from the assumption of no pleiotropy through sensitivity analyses. In adult women, in line with previous evidence, we found an effect on restrictive lung impairment with a 24.8 mL increase in FVC per year increase in age at menarche (95% CI 1.8–47.9; p = 0.035); evidence was stronger after excluding potential pleiotropic variants (43.6 mL; 17.2–69.9; p = 0.001). In adolescent girls we found an opposite effect (−56.5 mL; −108.3 to −4.7; p = 0.033), suggesting that the detrimental effect in adulthood may be preceded by a short-term post-pubertal benefit. Our secondary analyses showing results in the same direction in men and boys, in whom age at menarche SNPs have also shown association with sexual development, suggest a role for pubertal timing in general rather than menarche specifically. We found no effect on airway obstruction (FEV1/FVC).


The timing of sexual development in women has shown a secular trend with a shift towards an earlier age over the years [1], and this has been related to childhood life-style and social factors, including diet and obesity, psychological stress and deprivation, as well as environmental exposures, including endocrine disruptors found in many household products [2]. Menarche, defined as the date of the first day of the first menstrual bleeding, is preceded by a complex hormonal cascade and signals the initiation of the menstrual cycle in adolescent girls. Earlier age at menarche has been described as a risk factor for a number of adverse health outcomes, including obesity [3], type 2 diabetes [4], cardio-metabolic traits [5], cardiovascular morbidity and mortality [6], as well as breast [7] and ovarian [8] cancers. Understanding the effects of the age at menarche offers insight into the pathophysiology of related diseases. In particular this can highlight the potential effects of early and late exposure to sex hormones on health outcomes in women, as well as help explain gender differences in the risks of common diseases [9,10,11].

Timing of menarche has been considered an important factor in relation to respiratory health [12]. Lung function is an important predictor of both respiratory disease and overall health. After cessation of lung growth by the early twenties, there is a plateau in lung function followed by gradual age-related decline. There are two common patterns of lung function impairment, obstruction and restriction, and these have a different impact on morbidity and mortality [13]. Obstruction, measured as a low ratio of the forced expiratory volume in one second (FEV1) to the forced vital capacity (FVC), represents an objective marker of chronic obstructive pulmonary disease, a major and growing cause of morbidity, disability and death worldwide. Restriction, defined as low total lung capacity (measured by plethysmography) but commonly approximated by low FVC in population-based studies, is a predictor of all-cause mortality even in the absence of chronic respiratory conditions. There has been increasing interest in understanding gender-related risk factors for lung function impairment, particularly in relation to hormonal influences and sexual development. Substantial evidence shows that lung function is influenced by sex hormones in women [14] for whom low lung function has been associated with irregular menstruation, menopausal transition, and both natural and surgical menopause, with report of improvement in lung function in postmenopausal women receiving hormone replacement therapy [12, 15].

An observational study of 2873 women aged 27–57 years investigated the association of early menarche with adult lung function and found a lower FVC and FEV1, but not FEV1/FVC, in women with early menarche [16]. This work took account of important potential confounders, including age, height, body mass index (BMI), smoking, education and birth order, as well as secular trends (age at menarche has changed over the years and lung function has also changed due to changes in height). However, the effect of residual confounding by unmeasured, or poorly recalled, early life and childhood factors cannot be ruled out. For example, age at menarche is influenced by childhood nutritional status [17], which in turn might influence lung function in adult life; another example is birth weight, which has been associated with both age at menarche [18] and lung function [19]. Such limitation is typical of observational studies, where confounding can make it hard to distinguish between causal effects and spurious associations.

Mendelian randomization (MR) can help assess the causality of an observed association by using genetic variants as proxies, or “instrumental variables”, for the exposure of interest [20, 21]. Genetic associations are not typically affected by confounding or reverse causation because genes are randomly allocated at the time of conception. Provided the underlying assumptions are satisfied [22], the demonstration that genetic variants known to modify age at menarche also modify lung function provides indirect evidence of a causal effect of age at menarche. The MR technique has been rapidly growing in popularity because of these advantages over the classical epidemiological approach, with MR studies having previously investigated age at menarche as a risk factor for depression [23], and lung function as an outcome in relation to C-reactive protein [24].

In this study we used MR to investigate the lifetime effect of age at menarche on lung function in adolescent girls and adult women, using 122 single nucleotide polymorphisms (SNPs) associated with age at menarche. We considered the effects in both adulthood and adolescence, since we hypothesised that they could differ due to a different role of menarche in lung growth, maximal lung function attained, and lung function decline.


SNP-age at menarche association estimates

We derived SNP-age at menarche association estimates from a published genome-wide association (GWA) meta-analysis of 57 studies on 182,416 women of European descent, which identified 122 independent SNPs at 106 genomic loci (p value < 5 × 10−8) [25]. Overall, the 122 SNPs explained 2.7% of the variability of age at menarche in the population. Age at menarche was based on self-reporting and analysed as a continuous variable, with study-specific analyses adjusted for birth year, to account for the secular trends in menarche timing, and genomic control, to account for population stratification [25]. We assessed instrument strength for the 122 SNPs, a function of magnitude and precision of their genetic effect, using the F statistic [26].

SNP-lung function association estimates

Three studies were used to estimate the association of the 122 SNPs with lung function in adult women: European Community Respiratory Health Survey (ECRHS) [27], Northern Finland Birth Cohort of 1966 (NFBC 1966) [28], and UK Biobank [29] (Table 1). ECRHS is a European prospective cohort study designed to identify risk factors for respiratory health [27]. The study started in 1992 (ECRHS I), with follow-up performed twice (ECRHS II and III) over the following 20 years. Here we include 1069 women aged 27–57 recruited at random from population-based sampling frames in 14 centres, and who had lung function measured in ECRHS II. Genetic associations analyses for the 122 SNPs with FVC and FEV1/FVC were adjusted for age, age2, height, centre and ancestry principal components. NFBC1966 is a birth-cohort study performed in the Finnish provinces of Oulu and Lapland. Pregnant women with expected date of delivery in 1966 were recruited and their offspring followed up [28]. Among the offspring, we include 2680 women with spirometry data at age 31. Analyses were adjusted for height and ancestry principal components. UK Biobank is a prospective study across 22 assessment centres, aimed at identifying causes of chronic disease in middle and old age [29]. We include 43,195 women of European ancestry aged 40–69 recruited in 2006–2010, who had GWA and lung function data. Analyses were adjusted for age, age2, height, and ancestry principal components, as well as smoking pack-years because part of the sample was ascertained by smoking status [30].

Table 1 Characteristics of the study populations included for the SNP-lung function associations

We used two studies to estimate the association of the 122 SNPs with lung function in adolescent girls: Avon Longitudinal Study of Parents and Children (ALSPAC) [31] and Northern Finland Birth Cohort of 1986 (NFBC 1986) [32] (Table 1). ALSPAC is a birth-cohort study that initially enrolled 14,541 pregnant women in Bristol, United Kingdom, in 1990–1992 [31]. We include 1234 of their daughters, aged around 16, with GWA and spirometry data. Analyses were adjusted for height; ancestry principal components were not included because there was no evidence of population stratification in the study. NFBC 1986 is a Finnish birth-cohort study that followed up 9432 live births to mothers in Oulu and Lapland. A total of 6642 adolescents aged 16 participated in the clinical examination in 2001–2002, and here we include 1791 with available GWA and lung function data [32]. Analyses were adjusted for height and ancestry principal components. For both ALSPAC and NFBC 1986, robust standard errors were used for the analyses of FEV1/FVC to account for deviation from normality of the regression residuals.

Spirometry methods for all studies are reported in Supplementary Table 1. For adults and adolescents, estimates of the association with FVC and FEV1/FVC for each SNP were pooled across studies using fixed-effect inverse-variance weighted meta-analysis.

Mendelian randomization estimates

We used a two-sample MR approach for summary data with multiple instruments, where the estimate of the causal effect is obtained as the inverse-variance weighted combination of individual MR estimates across instruments, using fixed-effect meta-analysis [33]. Individual MR estimates for the 122 SNPs were derived using the Wald estimator (ratio of SNP-lung function estimate over SNP-age at menarche estimate), with standard error derived using the delta method [34].

Investigation of pleiotropy

A fundamental assumption of MR is the absence of pleiotropy, i.e. the genetic instruments modify lung function only through age at menarche and no other independent pathways [22]. We tested for statistical evidence of pleiotropy by using between-instrument heterogeneity as a proxy; in the absence of pleiotropy, all variants are valid instruments and their MR estimates will vary only by chance (no heterogeneity) [35]. We defined evidence of pleiotropy as an I2 > 25%, where I2 describes the percentage of total variation in MR estimates due to heterogeneity rather than chance [35], or a statistically significant heterogeneity Cochran Q test (p < 0.05). If pleiotropy was detected, we performed a series of sensitivity analyses to address it.

Using the PhenoScanner, a curated database of publicly available GWA findings created to inform MR studies (available at [36], we first checked for previous associations of the 122 SNPs (and highly correlated SNPs; linkage disequilibrium r2 > 0.8) with any phenotype other than age at menarche, limiting our search to associations that had been identified at a significance level of 5 × 10−8 (Supplementary Table 2). For the exclusion of possible pleiotropic SNPs in our sensitivity analyses, we then only considered effects on phenotypes which can be related to lung function. We adjusted for height all SNP-lung function analyses and therefore height could not directly induce pleiotropy. However, height SNPs could be associated with other markers of somatic growth, potentially exerting pleiotropic effects on lung function other than through height. To test this, we excluded SNPs previously associated with height. We then additionally excluded SNPs previously associated with obesity and related traits (weight, BMI, waist circumference), fasting insulin and type 2 diabetes, and birth weight, since these might also influence lung function. These exclusions were used in sensitivity analyses if there was statistical evidence of pleiotropy.

If statistically significant heterogeneity remained after the exclusion of these SNPs, we performed two further sensitivity analyses: a meta-analysis of MR estimates using a random-effects model instead of the fixed-effect model used in the main analysis [37], and MR-Egger regression [38]. The random-effects model allows for random pleiotropic effects across SNPs, while MR-Egger regression provides unbiased results if pleiotropic effects are not random (i.e. do not cancel). Both analyses assume that the magnitude of the pleiotropic effects is independent of the magnitude of the corresponding SNP-age at menarche effects. As these approaches are less powerful than the fixed-effect meta-analysis, particularly the MR-Egger regression [37], they were only used as further sensitivity analyses when between-instrument heterogeneity was still present after excluding possible pleiotropic SNPs.

In order to understand whether the observed effects of age at menarche on FVC were due to factors specific to menarche rather than puberty in general, we tested the association of our 122 SNPs with lung function in men. Many of the age at menarche SNPs that we used as instruments have also been shown to regulate male pubertal timing as measured by Tanner stage [25]. Finding evidence of an association in men would indicate that the underlying mechanism is related to general timing of puberty as opposed to a female-specific effect. SNP–FVC associations in adolescent boys (N = 3421) and adult men (N = 40,687) were estimated from the same studies used for women (Supplementary Table 6). For each SNP, association estimates were pooled across studies using fixed-effect inverse-variance weighted meta-analysis. The individual SNP–FVC estimates were then meta-analysed (fixed-effect model) to provide an overall effect of the 122 SNPs on FVC, which is equivalent to performing an unweighted allele score analysis with all SNPs.

All analyses were performed using Stata 14 (StataCorp LP)


Imputed genotype data for the 122 SNPs were available for all studies, with the exception of one SNP (rs10423674) in NFBC 1986. The quality of imputation was very good for all SNPs across all studies (imputation INFO or R2 parameters ≥ 0.8), except for one SNP in two studies (rs17233066, R2 of 0.4 in ECRHS II and ALSPAC). The SNPs identified were strong instruments for age of menarche. F statistics ranged from 21 to 441 across variants (Supplementary Table 3), well over the threshold of F > 10 usually recommended as a test for weak instruments in MR analyses [39].

Individual estimates of the per-allele effects on age at menarche and lung function (FVC and FEV1/FVC) for each SNP are provided in Supplementary Tables 3 and 4, respectively. MR estimates for the causal effect of age at menarche on lung function obtained separately from each SNP are presented in Supplementary Table 5, while the combined MR estimates across the 122 SNPs are reported in Table 2.

Table 2 MR estimates for the causal effect of age at menarche on lung function in adults and adolescents, obtained by fixed-effect meta-analysis of SNP-specific MR estimates across the 122 SNPs

The MR estimate for age at menarche and FVC in adult women showed a statistically significant increase of 24.8 mL per year increase in age at menarche (95% confidence interval 1.8–47.9; p = 0.035), while we found no effect for FEV1/FVC (Table 2). In the MR analysis for FVC, a between-instrument I2 of 45% (95% CI 31–55%; p < 0.001) suggested the presence of pleiotropy, and we repeated the analysis after excluding SNPs with potentially pleiotropic effects. Out of the 122 SNPs, 34 had been previously associated with phenotypes other than age at menarche, the large majority of which were height and obesity-related traits (Supplementary Table 2). The first sensitivity analysis excluding 14 SNPs associated with height (Model 1 in Table 3) showed a larger and more highly statistically significant MR estimate (43.6 mL; 17.2–69.9; p = 0.001). The second sensitivity analysis, where we additionally excluded 13 SNPs associated with other traits potentially related to lung function, showed very similar results (Model 2 in Table 3). Since some residual between-instrument heterogeneity remained in both sensitivity analyses (Table 3), we performed a random-effects meta-analysis of the MR estimates as well as MR-Egger regression. For the random-effects meta-analysis, results were similar to those in Table 3, with an MR estimate of 40.7 mL (8.4–72.9; p = 0.013) for Model 1, and 40.3 mL (5.7–74.8; p = 0.022) for Model 2. MR-Egger regression, which suffers from low statistical power [37], showed results in the same direction but with much larger confidence intervals and loss of statistical significance (Model 1: 95.7 mL, −37.0 to 228.4, p = 0.156; Model 2: 84.8 mL, −52.2 to 221.8, p = 0.222).

Table 3 Sensitivity analyses for the MR of age at menarche and FVC in adult women

In adolescents, the MR analysis for FVC showed a statistically significant decrease of 56.5 mL per year increase in age at menarche (95% CI −108.3 to −4.7; p = 0.033), with no evidence of pleiotropy across the 122 SNPs (Table 2). As with adults, there appeared to be no causal effect of age at menarche on FEV1/FVC.

The results of the secondary analyses in men were very consistent with those in women, with a statistically significant positive association of the 122 SNPs with FVC in adult men (p = 0.013) and a statistically significant negative association in adolescent boys (p = 0.007).


Our study shows a causal effect of age at menarche on lung function using Mendelian randomization, a technique which draws on the biological principle that genes are randomly allocated at conception to provide evidence not affected by classical confounding. We found an effect of age at menarche on restrictive lung impairment (FVC), with no evidence of an effect on airway obstruction (FEV1/FVC). In particular, we find that early menarche increases FVC in adolescence but decreases it in adulthood. The findings for adult women confirm previous observational evidence suggesting a decrease in FVC of 123 mL (95% CI 27–220; p = 0.01) associated with early menarche (menarche ≤10 years vs. menarche at 13), but no association with FEV1/FVC (p = 0.77) [16].

The finding of a beneficial effect of earlier age at menarche on FVC in adolescence, as opposed to the detrimental effect in adulthood, is interesting and has a plausible explanation, illustrated graphically in Fig. 1. Lung development tends to plateau following menarche [40], and therefore earlier initiation of menstruation may lead to premature completion of lung development and lower maximally attained lung function. Given the relative stability of lung function over time (a phenomenon known as “tracking”) [41], this would translate to lower FVC in adulthood. Our secondary analysis suggests that the same happens in males, where lung development has also been shown to plateau at puberty [40]. The beneficial effect of earlier menarche on FVC in adolescent girls may be explained by the prominent truncal (as opposed to limb) growth and increased thoracic muscle strength which occur in puberty and which contribute to higher lung volumes [40]. It could also be related to the direct effect of early exposure to sex hormones, for example oestrogens, in adolescent girls, as these have been shown to affect lung function in humans [12] and animal models [42]. Our secondary analysis suggesting a similar effect in boys support the hypothesis of a mechanism related to factors associated with early pubertal timing in general rather than specifically through female sex hormones. However, it is also possible that the mechanisms differ in men and women, for example through sex hormones in girls and thoracic growth and muscle strength in boys (the latter being more pronounced in boys [40]). The complex hormonal and physiological shifts that occur in women during menarche [12] make it difficult to pin-point the precise mechanisms underlying our findings, and further research is needed to explore them.

Fig. 1
figure 1

Graphical representation of a possible explanation for the discrepancy in FVC findings for adult women and adolescent girls. Earlier menarche may have current benefits to the lung function in adolescents, but may also lead to premature completion of lung development with attainment of a lower maximal lung function in adult life

Our study suggests that if these same adolescents were assessed in early adult life (once they have reached maximal lung function), those with early menarche would have comparatively lower FVC. Large-scale studies of lung function with longitudinal data across the lifespan will allow to test this hypothesis. To date few child cohorts have lung function assessments in adolescence and at ages associated with lung function plateau, but ongoing consortium-based initiatives, such as STELAR [43] and MEDALL [44], will be able to provide the relevant information.

Our findings offer insight into plausible pathophysiological mechanisms underlying the effects of early menarche on lung function impairment. Previous work has shown an association of early menarche with greater risk and severity of asthma [16, 45, 46], while we did not find an effect of age at menarche on airway obstruction in either adults or adolescents. This might be explained by an association of early menarche with bronchial hyperactivity through immunological and inflammatory effects [12, 47], which would manifest as short-term reversible airway obstruction not captured by the FEV1/FVC ratio from single assessments in population-based studies. Our study also highlights the importance of evaluating different lung function parameters. Many epidemiological studies on lung function have focused on low FEV1, which may arise from either obstructive or restrictive lung impariment. We found that early menarche only affects FVC, a proxy for total lung capacity and characteristic of restrictive lung impariment, which is a predictor of morbidity (including cardiovascular morbidity) and mortality even in the absence of chronic respiratory conditions [13].

All 122 SNPs used in our MR study were “strong” instruments, with strength reflecting not only the magnitude of the genetic effects on age at menarche but also the precision of their estimates. This is important since the use of “weak” instruments can bias the MR estimate [39], with such bias resulting in an attenuation of the causal effect in the context of a two-sample MR analysis like ours [48]. To improve precision, we used the results from the GWA discovery rather than replication analysis from Perry et al. [25], as the former was over 20 times larger (182,416 vs. 8689). These estimates might have been affected by the upward bias typical of the discovery stage (“winner’s curse”) [49], but this is likely to be very limited in our study given the strong p values. Moreover, any resulting overestimation of the SNP-age at menarche association would have pulled the MR estimate towards the null, leading to underestimation of the true causal effect rather than to a false positive result.

Like any other instrumental variable approach, MR tends to suffer from limited statistical power when the effect of the instruments on the exposure is relatively small, as typically happens with common genetic variants [50]. Despite this, we were able to identify statistically significant effects of age at menarche on FVC, although the confidence intervals of our MR estimates are large, particularly for adolescents where the sample size for the genetic associations with lung function is much smaller.

MR is not affected by classical confounding encountered in observational studies, and yet there is a form of confounding specific to MR, pleiotropy, whereby the genetic instrument modifies lung function through secondary phenotypes other than age at menarche [22]. Heterogeneity in the MR estimates obtained from the individual instruments can be used as a proxy for pleiotropy [35]. In our study there was evidence of heterogeneity in the MR estimates for the analysis of FVC in adults; the exclusion of SNPs with possible pleiotropic effects, in particular SNPs previously associated with height, showed consistent and much stronger results than the main analysis, thus demonstrating robustness of our findings. Height is strongly influenced by genetic and environmental factors regulating growth and development, and is also a strong predictor of FVC. In order to clearly disentangle the effect of the age at menarche SNPs on FVC from any possible effect on height, we adjusted all our SNP-lung function analyses for height. Indeed, it is FVC standardised for height which is clinically of interest, and FVC is often expressed as a percentage of a normal reference value (percent predicted) based on the individual’s height as well as sex and age. The fact that removal of SNPs previously associated with height reduced the pleiotropy and made the result stronger in our secondary analysis in adult women supports our hypothesis that these SNPs could be associated with other markers of somatic growth, exerting pleiotropic effects on lung function other than through height. Robustness of our finding for FVC in women was also confirmed by a further analysis to account for residual pleiotropic effects using a random-effects meta-analysis, while MR-Egger regression resulted in a loss of statistical significance likely explained by its low statistical power [37]. Interestingly, there was no statistical evidence of pleiotropy for FVC in adolescents, as shown by an I2 of 2% for the between-instrument heterogeneity (95% CI 0–21%; p = 0.42). A possible explanation for this is that some of our genetic instruments may have effects on secondary phenotypes related to lung function that only become apparent during adult life.

A potential weakness of our study may arise from the presence of gene-environment interactions [51], since our MR analyses assume no interactions for the SNP-age at menarche and SNP-lung function relationships. For example, the data used in our study for the analysis of adults and adolescents cover different “cohorts” of women, potentially exposed to different environmental exposures. “Cohort effects” are known to affect both age at menarche and lung function. The MR approach is not susceptible to confounding from environmental exposures, including those inducing cohort effects, but they might bias the results if they interacted with the genetic variants used as instrumental variables [52]. Although the practical relevance of gene-environment interactions for our MR analyses of age at menarche and lung function is not clear, it remains a theoretical possibility.

Finally, our MR estimate of the effect of age at menarche on FVC needs to be interpreted as a population-averaged causal effect rather than the effect for an individual and are based on the assumption of a linear relationship. Parametric and non-parametric methods to address non-linearity in MR have been proposed, including stratification of the exposure to estimate localized average causal effects (LACE) [53, 54], although they typically require individual-level data.

In conclusion, our study provides evidence of a causal effect of early sexual development in women on lung function later in life, with our secondary findings in men suggesting a role for pubertal timing in general rather than menarche specifically. This, together with evidence of detrimental effects on other adverse health outcomes, including cardiometabolic outcomes and cancer [3,4,5,6,7,8], has public health implications given that factors predisposing to early sexual development in women could be targeted at a population level to contrast the secular trend towards earlier puberty. These include a number of established childhood life-style and social factors, such as diet and obesity, psychological stress and deprivation, as well as hypothesised environmental exposures, such as endocrine disrupting chemicals (EDCs) found in many household products [2]. Of these, childhood obesity is the most worrisome given the current childhood obesity epidemic. EDCs may prove to be a substantial concern due to their widespread presence and the potential persistence of their effects on menarche for generations without further exposure (transgenerational inherited effects) [55], although these effects remain controversial. Our study also illustrates the value of the MR approach, which exploits increasingly available genetic data from large datasets, as a tool to investigate causal effects of childhood events on adult health, an area of epidemiological research which is particularly problematic due to the presence of confounding factors very difficult to measure and partly unknown.


  1. Adams Hillard PJ. Menstruation in adolescents: what’s normal, what’s not. Ann N Y Acad Sci. 2008;1135:29–35. doi:10.1196/annals.1429.022.

    Article  PubMed  Google Scholar 

  2. Fisher MM, Eugster EA. What is in our environment that effects puberty? Reprod Toxicol. 2014;44:7–14. doi:10.1016/j.reprotox.2013.03.012.

    CAS  Article  PubMed  Google Scholar 

  3. Pierce MB, Leon DA. Age at menarche and adult BMI in the Aberdeen children of the 1950s cohort study. Am J Clin Nutr. 2005;82(4):733–9.

    CAS  PubMed  Google Scholar 

  4. He C, Zhang C, Hunter DJ, et al. Age at menarche and risk of type 2 diabetes: results from 2 large prospective cohort studies. Am J Epidemiol. 2010;171(3):334–44. doi:10.1093/aje/kwp372.

    Article  PubMed  Google Scholar 

  5. Remsberg KE, Demerath EW, Schubert CM, Chumlea WC, Sun SS, Siervogel RM. Early menarche and the development of cardiovascular disease risk factors in adolescent girls: the fels longitudinal study. J Clin Endocrinol metab. 2005;90(5):2718–24. doi:10.1210/jc.2004-1991.

    CAS  Article  PubMed  Google Scholar 

  6. Lakshman R, Forouhi NG, Sharp SJ, et al. Early age at menarche associated with cardiovascular disease and mortality. J Clin Endocrinol Metab. 2009;94(12):4953–60. doi:10.1210/jc.2009-1789.

    CAS  Article  PubMed  Google Scholar 

  7. Hsieh CC, Trichopoulos D, Katsouyanni K, Yuasa S. Age at menarche, age at menopause, height and obesity as risk factors for breast cancer: associations and interactions in an international case-control study. Int J Cancer. 1990;46(5):796–800.

    CAS  Article  PubMed  Google Scholar 

  8. Gong TT, Wu QJ, Vogtmann E, Lin B, Wang YL. Age at menarche and risk of ovarian cancer: a meta-analysis of epidemiological studies. Int J Cancer. 2013;132(12):2894–900. doi:10.1002/ijc.27952.

    CAS  Article  PubMed  Google Scholar 

  9. Global Burden of Disease Study C. Global, regional, and national incidence, prevalence, and years lived with disability for 301 acute and chronic diseases and injuries in 188 countries, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013. Lancet. 2015;386(9995):743–800. doi:10.1016/S0140-6736(15)60692-4.

    Article  Google Scholar 

  10. Mendelsohn ME, Karas RH. Molecular and cellular basis of cardiovascular gender differences. Science. 2005;308(5728):1583–7. doi:10.1126/science.1112062.

    CAS  Article  PubMed  Google Scholar 

  11. Carey MA, Card JW, Voltz JW, et al. It’s all about sex: gender, lung development and lung disease. Trends Endocrinol Metab. 2007;18(8):308–13. doi:10.1016/j.tem.2007.08.003.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  12. Macsali F, Svanes C, Bjorge L, Omenaas ER, Gomez RF. Respiratory health in women: from menarche to menopause. Exp Rev Respir Med. 2012;6(2):187–200. doi:10.1586/ers.12.15.

    CAS  Article  Google Scholar 

  13. Burney PG, Hooper R. Forced vital capacity, airway obstruction and survival in a general population sample from the USA. Thorax. 2011;66(1):49–54. doi:10.1136/thx.2010.147041.

    CAS  Article  PubMed  Google Scholar 

  14. Becklake MR, Kauffmann F. Gender differences in airway behaviour over the human life span. Thorax. 1999;54(12):1119–38.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  15. Amaral AF, Strachan DP, Gomez Real F, Burney PG, Jarvis DL. Lower lung function associates with cessation of menstruation: UK Biobank data. Eur Respir J. 2016;48(5):1288–97. doi:10.1183/13993003.00412-2016.

    Article  PubMed  Google Scholar 

  16. Macsali F, Real FG, Plana E, et al. Early age at menarche, lung function, and adult asthma. Am J Respir Crit Care Med. 2011;183(1):8–14. doi:10.1164/rccm.200912-1886OC.

    Article  PubMed  Google Scholar 

  17. Yermachenko A, Dvornyk V. Nongenetic determinants of age at menarche: a systematic review. Biomed Res Int. 2014;2014:371583. doi:10.1155/2014/371583.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Adair LS. Size at birth predicts age at menarche. Pediatrics. 2001;107(4):E59.

    CAS  Article  PubMed  Google Scholar 

  19. Barker DJ, Godfrey KM, Fall C, Osmond C, Winter PD, Shaheen SO. Relation of birth weight and childhood respiratory infection to adult lung function and death from chronic obstructive airways disease. BMJ. 1991;303(6804):671–5.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  20. Davey Smith G, Ebrahim S. What can Mendelian randomisation tell us about modifiable behavioural and environmental exposures? BMJ. 2005;330(7499):1076–9. doi:10.1136/bmj.330.7499.1076.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Smith GD, Ebrahim S. Mendelian randomization: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol. 2003;32(1):1–22.

    Article  PubMed  Google Scholar 

  22. Sheehan NA, Didelez V, Burton PR, Tobin MD. Mendelian randomisation and causal inference in observational epidemiology. PLoS Med. 2008;5(8):e177. doi:10.1371/journal.pmed.0050177.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Sequeira ME, Lewis SJ, Bonilla C, Davey Smith G, Joinson C. Association of timing of menarche with depressive symptoms and depression in adolescence: Mendelian randomisation study. Br J Psychiatry. 2016;. doi:10.1192/bjp.bp.115.168617.

    PubMed  Google Scholar 

  24. Bolton CE, Schumacher W, Cockcroft JR, et al. The CRP genotype, serum levels and lung function in men: the Caerphilly Prospective Study. Clin Sci (Lond). 2011;120(8):347–55. doi:10.1042/CS20100504.

    Article  Google Scholar 

  25. Perry JR, Day F, Elks CE, et al. Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche. Nature. 2014;514(7520):92–7. doi:10.1038/nature13545.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  26. Palmer TM, Lawlor DA, Harbord RM, et al. Using multiple genetic variants as instrumental variables for modifiable risk factors. Stat Methods Med Res. 2012;21(3):223–42. doi:10.1177/0962280210394459.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Burney PG, Luczynska C, Chinn S, Jarvis D. The European Community Respiratory Health Survey. Eur Respir J. 1994;7(5):954–60.

    CAS  Article  PubMed  Google Scholar 

  28. Rantakallio P. The longitudinal study of the northern Finland birth cohort of 1966. Paediatr Perinat Epidemiol. 1988;2(1):59–88.

    CAS  Article  PubMed  Google Scholar 

  29. Sudlow C, Gallacher J, Allen N, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779. doi:10.1371/journal.pmed.1001779.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Wain LV, Shrine N, Miller S, et al. Novel insights into the genetics of smoking behaviour, lung function, and chronic obstructive pulmonary disease (UK BiLEVE): a genetic association study in UK Biobank. Lancet Respir Med. 2015;3(10):769–81. doi:10.1016/S2213-2600(15)00283-0.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Boyd A, Golding J, Macleod J, et al. Cohort Profile: the ‘children of the 90s’—the index offspring of the Avon Longitudinal Study of Parents and Children. Int J Epidemiol. 2013;42(1):111–27. doi:10.1093/ije/dys064.

    Article  PubMed  Google Scholar 

  32. Jaaskelainen A, Schwab U, Kolehmainen M, et al. Meal frequencies modify the effect of common genetic variants on body mass index in adolescents of the northern Finland birth cohort 1986. PLoS ONE. 2013;8(9):e73802. doi:10.1371/journal.pone.0073802.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  33. Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658–65. doi:10.1002/gepi.21758.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Thompson JR, Minelli C, Del Greco MF. Mendelian randomization using public data from genetic consortia. Int J Biostat. 2016;. doi:10.1515/ijb-2015-0074.

    PubMed  Google Scholar 

  35. Del Greco MF, Minelli C, Sheehan NA, Thompson JR. Detecting pleiotropy in Mendelian randomisation studies with summary data and a continuous outcome. Stat Med. 2015;34(21):2926–40. doi:10.1002/sim.6522.

    Article  Google Scholar 

  36. Staley JR, Blackshaw J, Kamat MA, et al. PhenoScanner: a database of human genotype-phenotype associations. Bioinformatics. 2016;32(20):3207–9. doi:10.1093/bioinformatics/btw373.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  37. Bowden J, Del Greco MF, Minelli C, Davey Smith G, Sheehan N, Thompson J. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat Med. 2017;36:1783–1802. doi:10.1002/sim.7221.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44(2):512–25. doi:10.1093/ije/dyv080.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Lawlor DA, Harbord RM, Sterne JA, Timpson N, Davey SG. Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Stat Med. 2008;27(8):1133–63. doi:10.1002/sim.3034.

    Article  PubMed  Google Scholar 

  40. Neve V, Girard F, Flahault A, Boule M. Lung and thorax development during adolescence: relationship with pubertal status. Eur Respir J. 2002;20(5):1292–8.

    CAS  Article  PubMed  Google Scholar 

  41. Twisk JW, Staal BJ, Brinkman MN, Kemper HC, van Mechelen W. Tracking of lung function parameters and the longitudinal relationship with lifestyle. Eur Respir J. 1998;12(3):627–34.

    CAS  Article  PubMed  Google Scholar 

  42. Massaro D, Massaro GD. Estrogen regulates pulmonary alveolar formation, loss, and regeneration in mice. Am J Physiol Lung Cell Mol Physiol. 2004;287(6):L1154–9. doi:10.1152/ajplung.00228.2004.

    CAS  Article  PubMed  Google Scholar 

  43. Custovic A, Ainsworth J, Arshad H, Bishop C, Buchan I, Cullinan P, et al. The Study Team for Early Life Asthma Research (STELAR) consortium ‘Asthma e-lab’: team science bringing data, methods and investigators together. Thorax 2015;70(8):799–801.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Bousquet J, Anto J, Auffray C, Akdis M, Cambon-Thomsen A, Keil T, et al. MeDALL (Mechanisms of the Development of ALLergy): an integrated approach from phenotypes to systems medicine. Allergy 2011;66:596–604.

    CAS  Article  PubMed  Google Scholar 

  45. Salam MT, Wenten M, Gilliland FD. Endogenous and exogenous sex steroid hormones and asthma and wheeze in young women. J Allergy Clin Immunol. 2006;117(5):1001–7. doi:10.1016/j.jaci.2006.02.004.

    CAS  Article  PubMed  Google Scholar 

  46. Varraso R, Siroux V, Maccario J, Pin I, Kauffmann F. Asthma severity is associated with body mass index and early menarche in women. Am J Res Criti Care Med. 2005;171(4):334–9. doi:10.1164/rccm.200405-674OC.

    Article  Google Scholar 

  47. Haggerty CL, Ness RB, Kelsey S, Waterer GW. The impact of estrogen and progesterone on asthma. Ann Allergy Asthma Immunol. 2003;90(3):284–91. doi:10.1016/S1081-1206(10)61794-2.

    CAS  Article  PubMed  Google Scholar 

  48. Pierce BL, Burgess S. Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am J Epidemiol. 2013;178(7):1177–84. doi:10.1093/aje/kwt084.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Ioannidis JP, Ntzani EE, Trikalinos TA, Contopoulos-Ioannidis DG. Replication validity of genetic association studies. Nat Genet. 2001;29(3):306–9. doi:10.1038/ng749.

    CAS  Article  PubMed  Google Scholar 

  50. Pierce BL, Ahsan H, Vanderweele TJ. Power and instrument strength requirements for Mendelian randomization studies using multiple genetic variants. Int J Epidemiol. 2011;40(3):740–52. doi:10.1093/ije/dyq151.

    Article  PubMed  Google Scholar 

  51. Dudbridge F, Fletcher O. Gene-environment dependence creates spurious gene-environment interaction. Am J Hum Genet. 2014;95(3):301–7. doi:10.1016/j.ajhg.2014.07.014.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  52. Brennan P. Commentary: Mendelian randomization and gene-environment interaction. Int J Epidemiol. 2004;33(1):17–21. doi:10.1093/ije/dyh033.

    Article  PubMed  Google Scholar 

  53. Burgess S, Davies NM, Thompson SG. Instrumental variable analysis with a nonlinear exposure-outcome relationship. Epidemiology. 2014;25(6):877–85. doi:10.1097/ede.0000000000000161.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Silverwood RJ, Holmes MV, Dale CE, et al. Testing for non-linear causal effects using a binary genotype in a Mendelian randomization study: application to alcohol and cardiovascular traits. Int J Epidemiol. 2014;43(6):1781–90. doi:10.1093/ije/dyu187.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Zama AM, Uzumcu M. Epigenetic effects of endocrine-disrupting chemicals on female reproduction: an ovarian perspective. Front Neuroendocrinol. 2010;31(4):420–39. doi:10.1016/j.yfrne.2010.06.003.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

Download references


UK Biobank: This research has been conducted using the UK Biobank Resource under Application Number 648, and we thank the participants, field workers, and data managers for their time and cooperation. The project used the ALICE and SPECTRE High Performance Computing Facilities at the University of Leicester. NFBC studies: We thank the late Professor Paula Rantakallio for the launch of the studies, Ms Outi Tornwall and Ms Minttu Jussila for the DNA biobanking, and the late Academian of Science Leena Peltonen for her contribution. ECRHS study: We thank the participants, field workers and researchers who have participated in the ECRHS study for their time and cooperation. ALSPAC study: We are extremely grateful to all the families who took part, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses.


This project has received funding from the European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 633212. IPH was supported by an MRC programme Grant (G1000861). RG was supported by the UK Medical Research Council (Grant Ref: G0902125). The work of MDT, IPH and LVW was funded by a Medical Research Council (MRC) strategic award (MC_PC_12010). MDT was supported by MRC fellowships G0501942 and G0902313. This article presents independent research funded partially by the National Institute for Health Research (NIHR). The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health. NFBC1966 study: NFBC1966 received financial support from the Academy of Finland (Project Grants 104781, 120315, 129269, 1114194, 24300796, Center of Excellence in Complex Disease Genetics and SALVE), University Hospital Oulu, Biocenter, University of Oulu, Finland (75617), NHLBI Grant 5R01HL087679-02 through the STAMPEED program (1RL1MH083268-01), NIH/NIMH (5R01MH63706:02), ENGAGE Project and Grant Agreement HEALTH-F4-2007-201413, EU FP7 EurHEALTHAgeing 277849, the Medical Research Council, UK (G0500539, G0600705, G1002319, PrevMetSyn/SALVE) and the MRC, Centenary Early Career Award. The program is currently being funded by the H2020-633595 DynaHEALTH action and academy of Finland EGEA-project. The DNA extractions, sample quality controls, biobank upkeeping and aliquotting were performed in the National Public Health Institute, Biomedicum Helsinki, Finland and were supported financially by the Academy of Finland and Biocentrum Helsinki. ECRHS study: This work was supported by a contract from the European Commission (018996), Fondo de Investigación Sanitaria (91/0016-060-05/E, 92/0319, 93/0393, 97/0035-01, 99/0034-01 and 99/0034-02), Hospital General de Albacete, Hospital General Ramón Jiménez, Consejería de Sanidad del Principado de Asturias, CIRIT (1997SGR 00079, 1999SGR 00241), and Servicio Andaluz de Salud, SEPAR, Public Health Service (R01 HL62633-01), RCESP (C03/09), Red RESPIRA (C03/011), Basque Health Department, Swiss National Science Foundation, Swiss Federal Office for Education and Science, Swiss National Accident Insurance Fund (SUVA), GSF-National Research Centre for Environment and Health, Deutsche Forschungsgemeinschaft (DFG) (FR 1526/1-1, MA 711/4-1), Programme Hospitalier de Recherche CliniqueDRC de Grenoble 2000 No. 2610, Ministry of Health, Direction de la Recherche Clinique, Ministere de l’Emploi et de la Solidarite, Direction Generale de la Sante, CHU de Grenoble, Comite des Maladies Respiratoires de l’Isere, UCB-Pharma (France), Aventis (France), Glaxo France, Estonian Science Foundation, and AsthmaUK (formerly known as National Asthma Campaign UK). ALSPAC study: GWAS data was generated by Sample Logistics and Genotyping Facilities at the Wellcome Trust Sanger Institute and LabCorp (Laboratory Corportation of America) using support from 23andMe. The UK Medical Research Council and the Wellcome Trust (Grant Ref: 102215/2/13/2) and the University of Bristol provide core support for ALSPAC study.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Cosetta Minelli.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Table 1

Spirometry methods for all studies on lung function included (PDF 186 kb)

Supplementary Table 2

Evidence of association of the 122 SNPs (and SNPs highly correlated with them, LD r2 > 0.8) with secondary phenotypes. Information retrieved from the PhenoScanner (36) (available at: (PDF 325 kb)

Supplementary Table 3

Estimates of the SNP-age at menarche association (GX) for all 122 SNPs, from Perry et al. EA: effect allele; EAF: effect allele frequency; R2: proportion of the variance of age at menarche explained by the SNP, calculated as follows: R2 = [2 × EAF × (1–EAF) × β2]/varX, where β is the estimated genetic effect on age at menarche and varX is the variance of age at menarche; F: F statistic, a function of the magnitude and precision of the genetic effect calculated as: F = R2(N − 2)/(1 − R2), where N is the sample size of the SNP-age of menarche association (N = 182, 416); GX: per-allele genetic effect on age at menarche (years); GX SE: standard error of GX; p: p value of GX (PDF 401 kb)

Supplementary Table 4

Estimates of the SNP-lung function association (GY) for all 122 SNPs, for adult women (ECRHS, NFBC 1966 and UK Biobank studies) and adolescent girls (ALSPAC and NFBC 1986 studies). For one SNP, rs10423674, data were missing from NFBC 1986. EA: effect allele; GY: per-allele genetic effect on FVC (ml) or FEV1/FVC (%); GY SE: standard error of GY (PDF 407 kb)

Supplementary Table 5

Estimates of the causal effect of age at menarche on lung function for all 122 SNPs, for adult women (ECRHS, NFBC 1966 and UK Biobank studies) and adolescent girls (ALSPAC and NFBC 1986 studies). EA: effect allele; Beta: estimate of the effect of one year increase in age at menarche on FVC (ml) or FEV1/FVC (%); Beta SE: standard error of beta (PDF 499 kb)

Supplementary Table 6

Characteristics of the study populations included for the SNP-lung function associations in adult men and adolescent boys. Values reported are mean (standard deviation) (PDF 312 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Gill, D., Sheehan, N.A., Wielscher, M. et al. Age at menarche and lung function: a Mendelian randomization study. Eur J Epidemiol 32, 701–710 (2017).

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI:


  • Mendelian randomization
  • Menarche
  • Puberty
  • Lung function
  • FVC
  • FEV1/FVC