Evaluation and comparison of nine growth and development-based measures of pubertal timing

Elhakeem, Ahmed; Frysz, Monika; Goncalves Soares, Ana; Bell, Joshua A.; Cole, Tim J.; Heron, Jon; Howe, Laura D.; Sebert, Sylvain; Tilling, Kate; Timpson, Nicholas J.; Lawlor, Deborah A.

doi:10.1038/s43856-024-00580-1

Evaluation and comparison of nine growth and development-based measures of pubertal timing

Article
Open access
Published: 07 August 2024

Volume 4, article number 159, (2024)
Cite this article

Download PDF

You have full access to this open access article

Communications Medicine

Evaluation and comparison of nine growth and development-based measures of pubertal timing

Download PDF

200 Accesses
3 Altmetric
Explore all metrics

Abstract

Background

Pubertal timing is heritable, varies between individuals, and has implications for life-course health. There are many different indicators of pubertal timing, and how they relate to each other is unclear. Our aim was to quantitatively compare nine indicators of pubertal timing.

Methods

We used data from questionnaires and height, weight, and bone measurements from ages 7–17 y in a population-based cohort of 4267 females and 4251 males to compare nine growth and development-based indicators of pubertal timing. We summarise age of each indicator, their phenotypic and genetic correlations, and how they relate to established genetic risk score (GRS) for puberty timing, and phenotypic childhood body composition measures.

Results

We show that pubic hair in males (mean: 12.6 y) and breasts in females (11.5 y) are early indicators of puberty, and voice breaking (14.2 y) and menarche (12.7 y) are late indicators however, there is substantial variation between individuals in pubertal age. All indicators show evidence of positive phenotypic intercorrelations (e.g., r = 0.49: male genitalia and pubic hair ages), and positive genetic intercorrelations. An age at menarche GRS positively associates with all other pubertal age indicators (e.g., difference in female age at peak height velocity per SD higher GRS: 0.24 y, 95%CI: 0.21 to 0.26), as does an age at voice breaking GRS (e.g., difference in age at male axillary hair: 0.11 y, 0.07 to 0.15). Higher childhood fat mass and lean mass associated with earlier puberty timing.

Conclusions

Our findings provide insights into the measurements of the timing of pubertal growth and development and illustrate value of various pubertal timing indicators in life-course research.

Plain language summary

Age of puberty varies between individuals and can affect a person’s future health. We obtained information from 8500 British children as they progressed through puberty. We compared nine measures of pubertal timing. We found that the appearance of pubic hair in boys and breasts in girls are early indicators of puberty, and that voice change and onset of menstruation are late indicators. However, there was also substantial variability between individuals in age of puberty. All puberty measures were correlated with each other and related to an individual’s adult body mass index, as well as to their childhood muscle and fat mass. Our findings are useful information for health care workers and researchers who are interested in assessing and studying puberty.

Changes in Pubertal Timing: Past Views, Recast Issues

The Association Between Puberty Timing and Body Mass Index in a Longitudinal Setting: The Contribution of Genetic Factors

Article Open access 05 April 2022

Associations between infant growth and pubertal onset timing in a multiethnic prospective cohort of girls

Article Open access 31 March 2022

Introduction

Puberty is a milestone in human development that involves rapid transformations in anatomy, physiology, and behaviour. Its central feature is neuroendocrine transformation of processes regulating reproductive physiology via a reactivation of the hypothalamic-pituitary-gonadal (HPG) axis, leading to the onset of adult reproductive capacity^1,2. Reactivation of the HPG axis produces numerous observable downstream consequences including production of gonadal steroids, a pubertal growth spurt, development of secondary sexual characteristics, onset of menstruation in females, and appearance of facial hair and voice change in males^2,3. The sequence in which the observable changes appear is thought to mirror elevation of steroid levels, with all changes occurring earlier in females than males².

There is substantial variation in the age of puberty between children^4,5, which is attributable to genetic as well as non-genetic factors, such as nutrition^6,7,8,9. Understanding the determinants of variation in pubertal timing between individuals is important given its relation to reproductive capability and social and health implications, including the risk of some cancers^{8,9,10,11,12,13,14,15,16}. Within an individual, there is variation in the timing of different maturation processes (e.g., skeletal, and sexual maturation), and in the timing of related structures within a maturational process (e.g., within the sexual maturation process, pubic hair and genitalia can have different levels of maturity)¹⁷. Various approaches and indicators have been used by studies to measure puberty timing^{17,18,19,20,21}, which have included self/parent-reported age at menarche and voice change in females and males, respectively, and longitudinally modelled age at peak height velocity in both. As no single measure can capture all maturational processes during puberty, detailed, systematic analysis of anthropometric and developmental measures of puberty timing, including in both sexes, can help reveal their value for life course research, and identify which measure is best for exploring the causes and consequences of pubertal timing, and what might be done to mitigate those effects.

The aim of this study was to evaluate and compare multiple measures of pubertal timing. We used a UK birth cohort—the Avon Longitudinal Study of Parents and Children (ALSPAC)^22,23,24—where offspring have been prospectively assessed since birth with extensive biomedical data collections that included repeated assessments of height, weight, and bone in research clinics, and repeated assessments of pubertal development. Importantly, assessments began at age 7 years, i.e., before onset of puberty in most children. We derive nine anthropometric and development-based measures of pubertal age in >8500 females and males and describe the timing and chronological sequence of pubertal growth and development, the phenotypic and genetic correlations between measures of pubertal age, and how each pubertal age measure relates to genetic risk scores (GRSs) for pubertal timing and adiposity, and phenotypic measurements of childhood body composition. We identify early and late indicators of puberty and find that all pubertal age measures are interrelated. We show that pubertal age measures are related to GRSs for pubertal timing and adiposity and to phenotypic measurements of childhood fat mass and lean mass.

Methods

This study was conducted using data from the ALSPAC cohort. A pre-specified analysis plan for this study is available at https://osf.io/3qndg/²⁵.

Cohort description

ALSPAC is a multigenerational prospective birth cohort study that recruited pregnant women residing within the catchment area of three National Health Service authorities in southwest England with an expected date of delivery between April 1991 and December 1992^22,23,24. The initial number of pregnancies enrolled was 14,541. Of these initial pregnancies, there was a total of 14,676 fetuses, resulting in 14,062 live births and 13,988 children who were alive at 1 year of age. When children were ~7 years old, an attempt was made to bolster the initial sample with eligible new cases. Total sample size for analyses using data collected after age 7 years was 15,447 pregnancies, and 15,658 offspring. Of these 14,901 were alive at 1 year of age. Detailed data have been collected from offspring and parents by questionnaires, data extraction from medical records, data linkage to health records, and dedicated clinic assessments.

ALSPAC participants provided written informed consent for all measurements. Parents gave informed consent for children aged under 18 years and the children were also invited to give assent, with no measurements were taken from the children if they refused. Ethical approval for the ALSPAC study was obtained from the ALSPAC Law and Ethics Committee and the Local Research Ethics Committees (Bristol and Weston Health Authority, Southmead Health Authority, Frenchay Health Authority, United Bristol Healthcare Trust, North Bristol Trust, Weston Area Health Trust, Central & South Bristol Research Ethics Committee, North Somerset Research Ethics Committee, National Research Ethics Service Committee South West). Consent for biological samples has been collected in accordance with the Human Tissue Act (2004). Details of all available data can be found in the ALSPAC study website which includes a fully searchable data dictionary and variable search tool (http://www.bristol.ac.uk/alspac/researchers/our-data/).

Puberty data collection from research clinics and questionnaires

Data used to derive indicator-based pubertal ages were collected prospectively using nine repeated research clinic assessments and nine puberty-specific questionnaires. Figure 1 summarises the observed data from these clinic assessments and questionnaires, and Supplementary Table 1 and Supplementary Table 2 provide more information.

**Fig. 1: Longitudinal pubertal growth and development data that were used to derive nine indicator-based measures of pubertal age.**

All participants were invited to attend nine repeated research clinic examinations from ages 7–17 years where their height (in cm) and weight (in kg) were measured. In five of the clinics (ages 9–17 years), all participants underwent whole-body Dual-energy X-ray Absorptiometry (DXA) scans from which total-body (less head) bone mineral content (BMC; in grams) was extracted. Exact age in months at attending each research clinic assessment was recorded.

Questionnaires on pubertal development (the ‘Growing and Changing Questionnaire’) were mailed to all participants on nine occasions from ages 8 to 17 years. Questionnaires could be answered by either the parent or guardian, child, or a combination; over 70% of the first five questionnaires were completed with help from a parent or guardian whereas the last four were mostly completed by the child alone (Supplementary Table 3). Each questionnaire collected data on the five Tanner stages of pubic hair, breasts (girls), and genitalia (boys) development using line drawings representing each stage with accompanying description (Supplementary Note). Each questionnaire collected data on onset of menstruation in girls, and all except the first questionnaire collected data on change in voice (boys). The last seven questionnaires (ages 10–17 years) gathered data on the development of axillary hair. Exact age in months at completing each puberty questionnaire was recorded.

Genotyping and imputation

Children were genotyped using the Illumina HumanHap550 quad chip genotyping platform (Illumina) by 23andMe subcontracting the Wellcome Trust Sanger Institute (Cambridge, UK) and the Laboratory Corporation of America (Burlington, NC, USA). Raw genome-wide data were subjected to standard quality control methods. Individuals were excluded based on sex mismatches, minimal or excessive heterozygosity, disproportionate missingness (>3%), and insufficient sample replication (identity by descent (IBD) < 0.8). Individuals of non-European ancestry were removed because source GWAS (described below) for puberty measures were conducted primarily in European populations. Single nucleotide polymorphisms (SNPs) with minor allele frequency <1%, call rate <95%, or evidence for violations of Hardy-Weinberg equilibrium (P < 5 × 10⁻⁷) were removed. Cryptic relatedness was measured as proportion of IBD > 0.1. Related individuals that passed quality control thresholds were retained in subsequent phasing and imputation.

In total, 9115 children and 500,527 SNPs passed quality control filters. Of these, 477,482 SNP genotypes in common between the sample of ALSPAC children and mothers were combined for imputation to the Haplotype Reference Consortium (HRCr1.1, 2016) panel. SNPs with genotype missingness >1% (11,396 SNPs) were removed prior to imputation. A further 321 subjects were removed due to ID mismatches. HRC panel was phased using ShapeIt (v2.r644) which utilizes relatedness during phasing, and imputation was performed using the Michigan imputation server. This resulted in 8237 children with genotype data after exclusion of related subjects using cryptic relatedness measures described previously.

GRSs for female and male pubertal timing, and adulthood and childhood BMI

Four separate GRSs were created using genome-wide significant SNPs from four European ancestry GWAS meta-analyses on reported age at menarche⁸ and age at voice breaking⁹, and measured BMI in adulthood (mostly middle-aged adults)²⁶ and childhood (age range from 3 to 10 years)²⁷. Scores were calculated using 351 SNPs associated with age at menarche⁸, 73 SNPs associated with age at voice breaking⁹, 95 SNPs associated with adulthood BMI²⁶ (2/97 SNPs were not available in ALSPAC), and 15 SNPs associated with childhood BMI²⁷. The scores were constructed by multiplying the number of effect alleles (or probability of effect alleles if imputed) at each SNP (0, 1, or 2) by its weighting, summing them, and dividing by the total number of SNPs used, and reflect the average per-SNP effect on their respective trait (age at menarche, age at voice breaking, adulthood BMI, or childhood BMI). All scores were standardised (to mean=0 and SD = 1) prior to analysis (Supplemental Fig. 1).

Childhood body composition measurements and confounders

Pre-pubertal fat mass index (total body fat mass divided by height squared) and lean mass index (total body lean mass divided by height squared), both in units of kg/m², were derived from DXA scans performed at mean age 9.9 years and were used to examine associations of childhood body composition with pubertal timing. DXA scans were performed using a Lunar Prodigy scanner (Lunar Radiation Corp) and were analysed according to the manufacturer’s standard scanning software and positioning protocols. Scans were reanalysed as necessary to ensure optimal placement of borders between adjacent subregions, and scans with anomalies were excluded. Exact age in months when scan was performed was recorded. Fat mass and lean mass indices were standardised (to mean = 0 and SD = 1) prior to examining association with the derived indicator-based pubertal ages.

Maternal education, maternal early pregnancy BMI and early pregnancy smoking, maternal age at birth, parity, and child’s diet were identified as factors that could plausibly influence both child body composition and pubertal timing and were selected to be included as confounder adjustment when examining associations of childhood fat mass and lean mass indices with pubertal timing. Maternal confounders were reported using questionnaires during pregnancy (maternal BMI was calculated from reported height and weight). Child’s diet was based on daily energy intake (in kilojoules per day) and derived from food frequency questionnaires completed by the parent when the child was aged 7 years. Confounders were reported in questionnaires during pregnancy for maternal factors.

Statistics and reproducibility

Nine indicator-based pubertal timing (i.e., age) measures were derived: two in females only (age at menarche and age in Tanner breast stage 3), two in males only (age at voice breaking and age in Tanner genitalia stage 3), and five in both females and males (age at peak BMC, height, and weight velocity, age in Tanner pubic hair stage 3, and age at axillary hair).

Estimated pubertal ages were analysed in months for all measures and presented in years to aid interpretation. All analyses were restricted to White ethnicity individuals (>95% of all participants) to enable consistency across phenotypic and genetic analyses. Analyses were performed in R version 4.02 (R Project for Statistical Computing).

Age at menarche was calculated as the first reported age at onset of menstruation. Pubertal age for all other measures was derived using the SITAR (Super Imposition by Translation And Rotation) method of growth curve analysis^28,29. SITAR is a shape invariant nonlinear mixed effects model that fits a single (mean) natural spline growth curve in the study sample and tailors it (using random effects) to define how individual growth curves differ from the mean curve. SITAR usually has up to three random effects that describe the size, timing, and intensity of individual growth relative to the mean growth curve. Size adjusts for differences in growth and geometrically reflects up or down shifts in the mean curve, timing adjusts for differences in the timing of peak growth and geometrically reflects left to right shifts in the mean curve, and intensity adjusts for the duration of the growth spurt and geometrically corresponds to shrinking or stretching of the age scale (which rotates the mean curve)²⁸. A recent addition to the SITAR software allows a fourth ‘post-growth’ random effect to be fitted which extends SITAR to model variability in the adult slope of the growth curve to allow post-pubertal growth rate to vary between individuals³⁰.

Height was modelled using the standard SITAR approach with three random effects. Weight and BMC were modelled using SITAR with all four random effects to allow for variation in growth post-puberty. Tanner stages for pubic hair, breast, and genitalia development, and voice breaking, and axillary hair were modelled using SITAR with up to two random effects for timing and intensity³¹. This reduced SITAR model (i.e., without the size random effect) was used as all individuals are measured on the same 5-point scale (or 3 for voice breaking, and 2 for axillary hair), and so their position on the scale at any particular time depends purely on their developmental age at that time, taking into account their timing and intensity effects.

SITAR models were fitted separately in males and females with at least one outcome measurement. The best fitting models were identified by comparing models with 2 to 5 knots (placed at quantiles of the age distribution) in the mean spline curve and inspecting the fitted mean curves and the Bayesian information criterion (BIC) values for each model (Supplementary Table 4, Supplementary Figs. 2 and 3). Covariances for the random effects were modelled (Supplementary Table 5). Indicator-based pubertal age was estimated using the timing random effect from each SITAR model and represent age at peak growth velocity for height, weight and BMC, age in Tanner stage 3 of pubic hair, breast (females only) and genitalia (males only) development, and age at voice breaking (males only) and axillary hair appearance. Because regression modelling can allow for measurement error, inconsistent responses (i.e., reporting a developmental stage that was lower than that reported in a previous questionnaire) were included in the analysis, except for inconsistent responses in voice breaking which were removing prior to modelling due to convergence issues. Lastly, we did a sensitivity analysis to examine the effect of outliers on anthropometric pubertal age estimates by refitting SITAR models for height, weight, and BMC and re-estimating ages at peak velocity after removing conventional putative outliers (+/−5 SD).

The timing of pubertal indicators was summarised by calculating the mean age, and variation between individuals around the average age was summarised by calculating SD. Bivariate scatterplots and pairwise phenotypic Pearson correlations were used to examine interrelationships between pubertal age measures.

Linkage disequilibrium score regression (LDSR) was used to estimate genetic correlations between pubertal age measures, both within and between sex, using full GWAS summary statistics³². Summary data were obtained from a published GWAS for age at menarche⁸ (n = 252,000) and were generated in ALSPAC (coded in years) for all other measures (including voice breaking because full summary data were not available from the GWAS on age at voice breaking⁹). In ALSPAC, linear regression was used to run GWAS in BOLT-LMM³³ (without adjustment for principal components as all participants were from a small geographically defined region, with 96% of parents reporting they were White British). A reference map from BOLT-LMM was used to interpolate genetic map coordinates from each SNP physical (base pair) position. Reference LD scores from BOLT-LMM appropriate for the analysis of European-ancestry samples were used to calibrate BOLT-LMM. LD scores were matched to SNPs by base pair coordinate. GWAS was performed separately for male and female pubertal age measures, and results for shared measures (i.e., height, weight, BMC, pubic hair, and axillary hair) were meta-analysed using GWAMA³⁴. ALSPAC GWAS sample sizes ranged from 3109 (age at peak BMC velocity in males) to 6782 (age at peak height velocity in females and males combined).

To evaluate the usefulness of our nine derived pubertal age measures in respect to the strength of their associations with genetic predisposition to pubertal timing and BMI, we used separate univariable linear regression models to examine associations of four standardised GRSs that were constructed from published genome-wide significant SNPs for female and male pubertal timing^8,9 and adulthood and childhood BMI^26,27 with each pubertal age measure.

Effect of pre-pubertal body composition in terms of DXA-derived fat mass and lean mass indices (at age 10 years) on pubertal age measures was examined in separate multivariable linear regression models adjusted for exact age at measurement of fat mass and lean mass, and confounders (maternal age at birth, maternal education, parity, maternal early pregnancy BMI, maternal pregnancy smoking, and childhood dietary intake). DXA measures recorded after the age of puberty were removed. Fat mass and lean mass indices were coded in age- and sex-specific SD units (mean = 0 and SD = 1).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Results

Indicator-based pubertal age was estimated for up to 4267 females and 4251 males who had completed at least one of up to nine repeated research clinic assessments where height, weight and BMC were recorded; or at least one of up to nine repeated puberty questionnaires where menarche, Tanner stages, and axillary hair and voice breaking status were reported. When compared with those included in estimation of pubertal age, those excluded due to missing data on all clinic and questionnaires assessments had younger maternal age at birth, lower maternal education, higher prevalence of maternal pregnancy smoking, mothers who were likely to have had previous pregnancies resulting in live birth, similar maternal pre-pregnancy BMI, and somewhat higher childhood energy intake (Supplementary Table 6).

Timing of pubertal growth and development

Mean age of pubertal indicators in females varied across measures from 11.5 years for age in Tanner breast stage 3 to 12.7 years for age at menarche (average of 1.2 years from mean age of earliest to latest measure), and in males from 12.6 years for age in Tanner pubic hair stage 3 to 14.2 years for age at voice break (average of 1.6 years from mean age of earliest to latest measure) (Fig. 2). The largest gap between the mean ages of consecutive measures was 0.3 years for females (from Tanner pubic hair stage 3 to axillary hair, and from peak BMC velocity to menarche) and 0.7 years for males (from Tanner genitalia stage 3 to axillary hair). Mean age of pubertal indicators was younger in females than males for all five measures common to both sexes, e.g., 11.8 years versus 13.5 years for age at peak height velocity (Fig. 2).

There was considerable variability between individuals in the timing of pubertal indicators, e.g., in females, the SD around mean ages ranged from 0.8 (peak height velocity) to 1.2 years (menarche), and in males, from 0.7 years (peak BMC velocity) to 1.2 years (Tanner genitalia stage 3 and peak weight velocity) (Fig. 2). Moreover, age in Tanner breast stage 3 occurred first in 38.5% of females and age at menarche was last in 41.8% and likewise, age in Tanner pubic hair stage 3 occurred first in 36.4% of males and age at voice break was last in 45.8% of males. Removing outliers had minimal impact on the estimated age at peak height, weight, and BMC velocity (the number (%) of observations removed in females and males were 99 (0.4%) and 633 (2.4%) for height, 119 (0.4%) and 238 (0.9%) for weight, and 6 (0.0004%) and 42 (0.4%) for BMC. Following removal of outliers, mean (and SD) ages at peak height, weight, and BMC velocity, respectively, were 11.5 (1.2), 11.8 (1.2), and 12.5 (1.0) years in females and 13.4 (1.1), 13.8 (1.3), and 13.2 (1.3) years in males.

Phenotypic correlations between indicator-based pubertal age measures

Pair-wise phenotypic (Pearson) correlation analyses identified positive, generally moderate strength, correlations between all pubertal age measures, with mainly stronger correlations in females (Fig. 3) than males (Fig. 4). In females, correlations ranged from 0.28 (between age at axillary hair and age at peak weight velocity) to 0.76 (age at menarche and age at peak height velocity). In males, correlations were from 0.19 (between age in Tanner genitalia stage 3 and age at peak weight velocity) to 0.77 (age at peak height velocity and peak BMC velocity).

**Fig. 3: Phenotypic correlations between indicator-based measures of pubertal age in females.**

**Fig. 4: Phenotypic correlations between indicator-based measures of pubertal age in males.**

Genetic correlations between indicator-based pubertal age measures

LDSR revealed mostly moderate to high genetic correlations between measures of pubertal age (Supplementary Data 1). This included genetic correlations between measures within each sex (for example, genetic correlation between age in Tanner pubic hair stage 3 and age at axillary hair in females was 0.87, P = 0.002), and between sex: both within measures (for example, genetic correlation between females and males for age at peak height velocity was 0.64, P = 0.05) and across different measures (for example, genetic correlation between age at menarche in females and age at peak BMC velocity in males was 0.78, P = 0.007).

Associations of GRS’s with indicator-based pubertal age measures

Higher GRSs, which were associated with older ages of female and male puberty, were both associated with older age of all derived pubertal age measures, and higher GRSs, which were associated with higher adulthood and childhood BMI, were both associated with younger age of all derived pubertal age measures, except for age in Tanner genitalia stage 3 (Fig. 5). The associations of pubertal timing GRSs with pubertal age measures were generally stronger for the female puberty timing GRS in females and were similar in magnitude for both scores in males. Associations of adulthood and childhood BMI GRSs were similar in magnitude for both scores in both females and males (Fig. 5).

**Fig. 5: Association between genetic risk scores and indicator-based measures of pubertal age.**

Association of childhood body composition with indicator-based pubertal age measures

Higher childhood fat mass index and lean mass index were both associated with younger age of puberty measures in females and males. The only exception was for age in Tanner genitalia stage 3 in males, where higher fat mass index was associated with older age (Fig. 6). Associations of fat mass and lean mass with measures of pubertal age were mostly similar in magnitude in females and were stronger for fat mass in males. Association with younger age at peak weight velocity were noticeably stronger for fat mass than lean mass in both sexes (Fig. 6).

**Fig. 6: Association of childhood fat mass and lean mass indices with indicator-based measures of pubertal age.**

Discussion

We used repeated assessments from a population-based cohort to examine and compare nine growth and development-based measures of pubertal timing. We found that, on average, breast development, appearance of pubic hair, and genitalia development were relatively early indicators of pubertal stage, while peak bone accrual, menarche, and voice breaking were later indicators. However, there was considerable variability between individuals in the timing of pubertal indicators. All pubertal age measures were interrelated, as demonstrated by positive phenotypic and genetic correlations. GRSs from large-scale GWAS’s on the ages at menarche and voice breaking were positively associated with all other pubertal age measures, and GRS’s for adulthood and childhood BMI were inversely associated with the pubertal age measures. Pre-pubertal fat mass and lean mass were inversely associated with all pubertal age measures, the only exception was a positive association between fat mass and genitalia stage in males.

To the best of our knowledge, ours is the first study to examine this collection of pubertal age measures. Our pubertal age estimates are consistent with studies that examined some of these measures. These include a study from the Danish National Birth Cohort (DNBC) on 14,000 participants with repeated data on the six developmental (but no anthropometric) measures which found that breast, genitalia, and pubic hair stages were early indicators of pubertal stage, with menarche and voice breaking being late indicators³⁵. Our results agree with findings from the Edinburgh Longitudinal Growth Study (ELGS) where height, menarche, and clinical examinations of development stages were taken every half-year until 20 years in 74 females and 103 males³¹, and with a cross-sectional study of 703 Norwegian females aged 6-16 years that showed mean age in Tanner Stage 3 of breast and pubic hair development was younger than menarche³⁶. Also consistent with our estimates are findings from study of 105 twin pairs showing that mean age of peak velocity for height was slightly younger than for weight³⁷, and evidence from the US Bone Mineral Density in Childhood Study (BMDCS) that peak velocity occurred earlier for height than BMC³⁸. Our observation of considerable variability in pubertal age between individuals, across all nine measures, is consistent with previous literature^4,5.

Our findings of positive phenotypic and genetic correlations between the pubertal age indicators are also consistent with studies that included some of these measures. For example, positive phenotypic correlations were found between measures in ELGS (r: 0.62 to 0.82 in males and r: 0.80 to 0.92 in females)³¹, between voice breaking, axillary hair, and pubertal stages (r: 0.40 to 0.62) in a study of 730 Danish males³⁹, and between age of peak height and BMC velocity in BMDCS³⁸. Like our LDSR results, Hollis et al.⁹ reported a moderate genome-wide genetic correlation between age at voice breaking and age at menarche. Also in line with our findings are reports of moderate to high genetic correlations (but with wide 95% CIs) between menarche and Tanner breast and pubic hair stage in 184 twin pairs⁴⁰, and between Tanner breast and pubic hair stage, and genitalia and pubic hair stage in 112 twin pairs⁴¹. Our study improves on these by including a larger sample size and examining genetic correlations across more measures.

We found that GRS’s for childhood and adulthood BMI were both inversely associated with puberty timing measures, which is consistent with Mendelian randomization studies on age at menarche^12,13,42 and voice breaking³⁹. Our finding of inverse associations between childhood fat mass and puberty timing is consistent with previous observations^39,43,44. Our study adds to previous studies by comparing associations across nine measures of pubertal timing, showing that this association is substantially stronger for peak weight velocity than for other pubertal measures, and that childhood lean mass index is also inversely associated with the timing of pubertal indicators.

The age sequence of the different puberty measures is broadly consistent with the underlying molecular and hormonal changes driving appearance of these changes^2,3,45. The substantial variability in pubertal timing between individuals may reflect between-individual differences in complex genetic and environmental factors (including exposures from early life onwards) contributing to puberty⁵. The positive phenotypic and genetic correlations between pubertal age measures suggest that they all might capture the same process and have a shared heritable contribution (from common genetic variation)⁶. Our finding of positive genetic correlations between males and females, which were generally lower than those within sex, point to both similar genetic factors driving pubertal timing in each sex as well sex-specific genetic effects on pubertal timing^6,46.

GRSs from published genome-wide significant SNPs for age at menarche and voice breaking associated positively with all other pubertal age measures. While replication in independent cohorts is needed, if confirmed, this (and the positive phenotypic and genetic correlations between measures) suggests our suite of nine measures could all be used as measures of pubertal age when assessing the determinants and effects of pubertal timing, and can facilitate research in cohorts with repeated assessments, e.g., to assess whether associations with risk factors or outcomes are comparable across all measures or if they are specific to certain growth/development measures (and sex).

We found that GRS’s for childhood and adulthood BMI were inversely associated with most pubertal age measures, which suggests that children with higher adiposity are more likely to experience earlier puberty⁴², possibly through adiposity-related hormonal perturbations⁴⁷ and those with earlier puberty may be more likely to have higher adiposity in adulthood, possibly due to shared genetic contributions to childhood adiposity^13,48. Our finding that both higher childhood fat mass and lean mass were associated with earlier puberty supports a role for higher childhood body size beyond solely adiposity in earlier pubertal timing. In contrast to the other pubertal indicators, higher childhood fat mass was associated with older (rather than younger) Tanner genitalia stage, and childhood BMI GRS was not associated with genitalia stage, which could both be due to Tanner staging being more challenging to implement in overweight or obese children²⁰, or because the more adipose children were more likely to exaggerate their development⁴⁹.

Data on developmental measures were collected by questionnaire using parent/self-reporting which might result in larger measurement errors compared with growth measures, and these differences in measurement error might have biased observed differences in pubertal ages¹⁷. Assessment of Tanner stages was supported by pictorial depictions and accompanying explanations of each Tanner stage which might have mitigated against this⁴⁹. Furthermore, studies that have used clinical assessments (i.e., observation by trained clinicians or research staff) rather than self-report have reported similar results to ours^31,39. Axillary hair was collected as a dichotomous response, which could result in an imprecise estimate of pubertal age. Only five repeated measures of BMC were available for deriving the age of peak BMC velocity which may have led to imprecise estimation⁵⁰. GWAS sample sizes were small in ALSPAC which can lead to unstable LDSR genetic correlation estimates. While analyses of pre-pubertal body composition were adjusted for measured confounders, we cannot rule out bias from residual or unmeasured confounding. ALSPAC participants were White Europeans and results might not generalise to other ethnic groups. Other pubertal age measures such as age of first ejaculation, and skeletal bone age were not available, and could have provided further information on pubertal timing.

In summary, findings from this prospective population-based cohort study of males and females supported all nine growth and development-based pubertal age measures as consistent measures of age at puberty, by providing evidence that they are measuring the same biological process. Choice of measure(s) to use in studies with plans for data collection is influenced by various factors, including research questions, available resources together with competing demands for other types of data to be collected, participant burden, and acceptability of data collection methods. For instance, studies comparing pubertal timing between males and females could focus on the measures available in both sexes, such as height, weight, and BMC, as well as pubic or axillary hair. Collecting longitudinal growth and development data can be challenging due to limited funding and research resources. Cohort studies that collect repeated data prospectively are research resources, often available to the global research community rather than funded to address a limited set of research questions. Thus, repeated height or weight data collections, which are likely to be relevant to many areas of study might become the basis for assessing pubertal age. However, there would be scientific value in other studies measuring as many of the measures we present so our finding might be replicated in independent studies. Further, availability of multiple measures would allow the comparison of risk factors and outcome across pubertal measures. Finally, the correlations presented may be useful for harmonising measures across studies (e.g., meta-analysis).

Data availability

Researchers interested in accessing ALSPAC data used in this study will need to submit a research proposal (https://proposals.epi.bristol.ac.uk/) for consideration by the ALSPAC Executive Committee (managed access). The ALSPAC Executive Committee encourage and facilitate data sharing with all ‘bona fide’ researchers. A bona fide researcher is defined as being a person with professional expertise to conduct bona fide research; and who has a formal affiliation with a bona fide research organisation that requires compliance with appropriate research governance and management systems. The ALSPAC data are not publicly available because the Executive Committee needs to check that the applicant is a bone fide researcher and that the proposed research is in the public interest. Source data underlying the graphs and charts presented in Figs. 1–6 can be found in Supplementary Data 2. All other data are available from the corresponding author on reasonable request.

Code availability

Statistical code (and analysis plan) used for this paper can be found in the Open Science Framework website at https://osf.io/3qndg/²⁵.

References

Patton, G. C. & Viner, R. Pubertal transitions in health. Lancet 369, 1130–1139 (2007).
Article PubMed Google Scholar
Ellison, P. T. & Reiches, M. W. In Human Growth and Development (second edition) (Cameron, N. & Bogin, B.) 81–107 (Academic Press, 2012).
Abreu, A. P. & Kaiser, U. B. Pubertal development and regulation. Lancet Diab. Endocrinol. 4, 254–264 (2016).
Article Google Scholar
Marshall, W. A. & Tanner, J. M. Variations in the pattern of pubertal changes in boys. Arch. Dis. Child. 45, 13–23 (1970).
Article CAS PubMed PubMed Central Google Scholar
Parent, A. S. et al. The timing of normal puberty and the age limits of sexual precocity: variations around the world, secular trends, and changes after migration. Endocr. Rev. 24, 668–693 (2003).
Article PubMed Google Scholar
Cousminer, D. L., Widen, E. & Palmert, M. R. The genetics of pubertal timing in the general population: recent advances and evidence for sex-specificity. Curr. Opin. Endocrinol. Diab. Obes. 23, 57–65 (2016).
Article Google Scholar
Lam, B. Y. H. et al. MC3R links nutritional state to childhood growth and the timing of puberty. Nature 599, 436–441 (2021).
Article CAS PubMed PubMed Central Google Scholar
Day, F. R. et al. Genomic analyses identify hundreds of variants associated with age at menarche and support a role for puberty timing in cancer risk. Nat. Genet. 49, 834–841 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hollis, B. et al. Genomic analysis of male puberty timing highlights shared genetic basis with hair colour and lifespan. Nat. Commun. 11, 1536 (2020).
Article CAS PubMed PubMed Central Google Scholar
Golub, M. S. et al. Public health implications of altered puberty timing. Pediatrics 121, S218–S230 (2008).
Article PubMed Google Scholar
Graber, J. A. Pubertal timing and the development of psychopathology in adolescence and beyond. Horm. Behav. 64, 262–269 (2013).
Article PubMed Google Scholar
Gill, D. et al. Age at menarche and adult body mass index: a Mendelian randomization study. Int. J. Obes. 42, 1574–1581 (2018).
Bell, J. A. et al. Influence of puberty timing on adiposity and cardiometabolic traits: a Mendelian randomisation study. PLoS Med. 15, e1002641 (2018).
Article PubMed PubMed Central Google Scholar
Minelli, C. et al. Age at puberty and risk of asthma: a Mendelian randomisation study. PLoS Med. 15, e1002634 (2018).
Article PubMed PubMed Central Google Scholar
Elhakeem, A., Frysz, M., Tilling, K., Tobias, J. H. & Lawlor, D. A. Association between age at puberty and bone accrual from 10 to 25 years of age. JAMA Netw. Open 2, e198918 (2019).
Article PubMed PubMed Central Google Scholar
Zhang, Q., Greenbaum, J., Zhang, W. D., Sun, C. Q. & Deng, H. W. Age at menarche and osteoporosis: a Mendelian randomization study. Bone 117, 91–97 (2018).
Article PubMed PubMed Central Google Scholar
Cameron, N. In Human Growth and Development 2nd edn (eds. Cameron, N. & Bogin, B.) 515–535 (Academic Press, 2012).
Tanner, J. M. Growth at Adolescence, 2nd edn (Springfield, 1962).
Rockett, J. C., Lynch, C. D. & Buck, G. M. Biomarkers for assessing reproductive development and health: Part 1—Pubertal development. Environ. Health Perspect. 112, 105–112 (2004).
Article CAS PubMed PubMed Central Google Scholar
Walker, I. V., Smith, C. R., Davies, J. H., Inskip, H. M. & Baird, J. Methods for determining pubertal status in research studies: literature review and opinions of experts and adolescents. J. Dev. Orig. Health Dis. 11, 168–187 (2020).
Article CAS PubMed Google Scholar
Dorn, L. D. & Biro, F. M. Puberty and its measurement: a decade in review. J. Res. Adolesc. 21, 180–195 (2011).
Article Google Scholar
Fraser, A. et al. Cohort profile: the Avon Longitudinal Study of Parents and Children: ALSPAC mothers cohort. Int. J. Epidemiol. 42, 97–110 (2013).
Article PubMed Google Scholar
Boyd, A. et al. Cohort Profile: the ‘children of the 90s’—the index offspring of the Avon Longitudinal Study of Parents and Children. Int. J. Epidemiol. 42, 111–127 (2013).
Article PubMed Google Scholar
Northstone, K. et al. The Avon Longitudinal Study of Parents and Children (ALSPAC): an update on the enrolled sample of index children in 2019. Wellcome Open Res. 4, 51 (2019).
Article PubMed PubMed Central Google Scholar
Elhakeem, A. Comprehensive assessment of growth and development-based measures of puberty timing. Open Science Framework. https://doi.org/10.17605/OSF.IO/3QNDG (2023).
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
Article CAS PubMed PubMed Central Google Scholar
Felix, J. F. et al. Genome-wide association analysis identifies three new susceptibility loci for childhood body mass index. Hum. Mol. Genet. 25, 389–403 (2016).
Article CAS PubMed Google Scholar
Cole, T. J., Donaldson, M. D. & Ben-Shlomo, Y. SITAR-a useful instrument for growth curve analysis. Int. J. Epidemiol. 39, 1558–1566 (2010).
Article PubMed PubMed Central Google Scholar
Elhakeem, A. et al. Using linear and natural cubic splines, SITAR, and latent trajectory models to characterise nonlinear longitudinal growth trajectories in cohort studies. BMC Med. Res. Methodol. 22, 68 (2022).
Article PubMed PubMed Central Google Scholar
Cole, T. J. sitar: Super Imposition by Translation and Rotation growth curve analysis. R package version 1.4.0. https://CRAN.R-project.org/package=sitar (2023).
Cole, T. J., Pan, H. & Butler, G. E. A mixed effects model to estimate timing and intensity of pubertal growth from height and secondary sexual characteristics. Ann. Hum. Biol. 41, 76–83 (2014).
Article CAS PubMed Google Scholar
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Article CAS PubMed PubMed Central Google Scholar
Loh, P.-R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 47, 284–290 (2015).
Article CAS PubMed PubMed Central Google Scholar
Mägi, R. & Morris, A. P. GWAMA: software for genome-wide association meta-analysis. BMC Bioinforma. 11, 288 (2010).
Article Google Scholar
Brix, N. et al. Timing of puberty in boys and girls: a population-based study. Paediatr. Perinat. Epidemiol. 33, 70–78 (2019).
Article PubMed Google Scholar
Bruserud, I. S. et al. References for ultrasound staging of breast maturation, tanner breast staging, pubic hair, and menarche in Norwegian girls. J. Clin. Endocrinol. Metab. 105, 1599–1607 (2020).
Article PubMed PubMed Central Google Scholar
Geithner, C. A. et al. Growth in peak aerobic power during adolescence. Med. Sci. Sports Exerc. 36, 1616–1624 (2004).
Article PubMed Google Scholar
McCormack, S. E. et al. Association between linear growth and bone accrual in a diverse cohort of children and adolescents. JAMA Pediatr. 171, e171769 (2017).
Article PubMed PubMed Central Google Scholar
Busch, A. S. et al. Voice break in boys-temporal relations with other pubertal milestones and likely causal effects of BMI. Hum. Reprod. 34, 1514–1522 (2019).
Article CAS PubMed PubMed Central Google Scholar
van den Berg, S. M. et al. Individual differences in puberty onset in girls: Bayesian estimation of heritabilities and genetic correlations. Behav. Genet. 36, 261–270 (2006).
Article PubMed Google Scholar
Koenis, M. M. et al. Longitudinal study of hormonal and physical development in young twins. J. Clin. Endocrinol. Metab. 98, E518–E527 (2013).
Article CAS PubMed Google Scholar
Mumby, H. S. et al. Mendelian randomisation study of childhood BMI and early menarche. J. Obes. 2011, 180729 (2011).
Article PubMed PubMed Central Google Scholar
Aris, I. M. et al. Analysis of early-life growth and age at pubertal onset in US children. JAMA Netw. Open 5, e2146873–e2146873 (2022).
Article PubMed PubMed Central Google Scholar
O’Keeffe, L. M., Frysz, M., Bell, J. A., Howe, L. D. & Fraser, A. Puberty timing and adiposity change across childhood and adolescence: disentangling cause and consequence. Hum. Reprod. 35, 2784–2792 (2020).
Article PubMed PubMed Central Google Scholar
Argente, J. et al. Molecular basis of normal and pathological puberty: from basic mechanisms to clinical implications. Lancet Diabetes Endocrinol. 11, 203–216 (2023).
Busch, A. S., Hagen, C. P. & Juul, A. Heritability of pubertal timing: detailed evaluation of specific milestones in healthy boys and girls. Eur. J. Endocrinol. 183, 13–20 (2020).
Article CAS PubMed Google Scholar
Burt Solorzano, C. M. & McCartney, C. R. Obesity and the pubertal transition in girls and boys. Reproduction 140, 399–410 (2010).
Article PubMed PubMed Central Google Scholar
Silventoinen, K., Jelenkovic, A., Palviainen, T., Dunkel, L. & Kaprio, J. The association between puberty timing and body mass index in a longitudinal setting: the contribution of genetic factors. Behav. Genet 52, 186–194 (2022).
Article PubMed PubMed Central Google Scholar
Campisi, S. C. et al. Can we rely on adolescents to self-assess puberty stage? A systematic review and meta-analysis. J. Clin. Endocrinol. Metab. 105, 2846–2856 (2020).
Cole, T. J. Optimal design for longitudinal studies to estimate pubertal height growth in individuals. Ann. Hum. Biol. 45, 314–320 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists, and nurses. ALSPAC data were collected and managed using REDCap (Research Electronic Data Capture) electronic data capture tools hosted at the University of Bristol. GWAS data was generated by Sample Logistics and Genotyping Facilities at Wellcome Sanger Institute and LabCorp (Laboratory Corporation of America) using support from 23andMe. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 874739 (LongITools). A.E. and D.A.L. receive part of their salary from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 101021566 (ART-HEALTH). A.E., M.F., A.G.S., J.A.B., J.H., L.D.H., K.T., N.J.T. and D.A.L. work in a Unit that receives funds from the University of Bristol and UK Medical Research Council (MC_UU_00032/05 and MC_UU_00032/02). D.A.L. is a National Institute of Health Research Senior Investigator (NF-0616-10102) and is also supported by a British Hear Foundation Chair (CH/F/20/90003). The UK Medical Research Council and Wellcome (Grant ref: 217065/Z/19/Z), and the University of Bristol provide core support for ALSPAC. A comprehensive list of grants funding is available on the ALSPAC website (http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf). The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication. A.E. had full access to all the data in the study and takes responsibility for the integrity of the data and accuracy of the data analysis.

Author information

Authors and Affiliations

MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
Ahmed Elhakeem, Monika Frysz, Ana Goncalves Soares, Joshua A. Bell, Jon Heron, Laura D. Howe, Kate Tilling, Nicholas J. Timpson & Deborah A. Lawlor
Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
Ahmed Elhakeem, Ana Goncalves Soares, Joshua A. Bell, Jon Heron, Laura D. Howe, Kate Tilling, Nicholas J. Timpson & Deborah A. Lawlor
Musculoskeletal Research Unit, Translational Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
Monika Frysz
UCL Great Ormond Street Institute of Child Health, London, UK
Tim J. Cole
Research Unit of Population Health, University of Oulu, Oulu, Finland
Sylvain Sebert
NIHR Bristol Biomedical Research Centre, Bristol, UK
Deborah A. Lawlor

Authors

Ahmed Elhakeem
View author publications
You can also search for this author in PubMed Google Scholar
Monika Frysz
View author publications
You can also search for this author in PubMed Google Scholar
Ana Goncalves Soares
View author publications
You can also search for this author in PubMed Google Scholar
Joshua A. Bell
View author publications
You can also search for this author in PubMed Google Scholar
Tim J. Cole
View author publications
You can also search for this author in PubMed Google Scholar
Jon Heron
View author publications
You can also search for this author in PubMed Google Scholar
Laura D. Howe
View author publications
You can also search for this author in PubMed Google Scholar
Sylvain Sebert
View author publications
You can also search for this author in PubMed Google Scholar
Kate Tilling
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas J. Timpson
View author publications
You can also search for this author in PubMed Google Scholar
Deborah A. Lawlor
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.E. developed the idea for the paper with initial input from D.A.L. and further input from all authors. A.E. developed the analysis plan with input from all authors. A.E. did the majority of the statistical analysis. M.F. did the LDSR genetic correlation analysis. A.G.S. calculated the GRS for age at voice break. J.A.B. calculated the GRS for age at menarche, adulthood BMI, and childhood BMI. T.J.C., J.H. and K.T. provided advice on fitting mixed effects models to estimate age of puberty. A.E. wrote the first draft of the manuscript. M.F., A.G.S., J.A.B., T.J.C., J.H., L.D.H., S.S., K.T., N.J.T. and D.A.L. provided feedback on the draft and approved the final version for submission.

Corresponding author

Correspondence to Ahmed Elhakeem.

Ethics declarations

Competing interests

The authors declare the following competing interests: D.A.L. reported grants from national and international government and charity funders, Roche Diagnostics, and Medtronic Ltd for work unrelated to this publication. The other authors declare no competing interests.

Peer review

Peer review information

Communications Medicine thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information.

Description of Additional Supplementary Files

Supplementary Data 1.

Supplementary Data 2.

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Elhakeem, A., Frysz, M., Goncalves Soares, A. et al. Evaluation and comparison of nine growth and development-based measures of pubertal timing. Commun Med 4, 159 (2024). https://doi.org/10.1038/s43856-024-00580-1

Download citation

Received: 04 July 2023
Accepted: 25 July 2024
Published: 07 August 2024
DOI: https://doi.org/10.1038/s43856-024-00580-1
Springer Nature Limited

Evaluation and comparison of nine growth and development-based measures of pubertal timing

Abstract

Background

Methods

Results

Conclusions

Plain language summary

Similar content being viewed by others

Introduction

Methods

Cohort description

Puberty data collection from research clinics and questionnaires

Genotyping and imputation

GRSs for female and male pubertal timing, and adulthood and childhood BMI

Childhood body composition measurements and confounders

Statistics and reproducibility

Reporting summary

Results

Timing of pubertal growth and development

Phenotypic correlations between indicator-based pubertal age measures

Genetic correlations between indicator-based pubertal age measures

Associations of GRS’s with indicator-based pubertal age measures

Association of childhood body composition with indicator-based pubertal age measures

Discussion

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation