Introduction

Retinopathy of prematurity (ROP) is a major complication of preterm birth that can lead to varying degrees of visual impairment and even blindness [1, 2]. ROP is a multifactorial disease involving both intrinsic and environmental factors. Among the former, prematurity is the most important, with the rate of ROP being inversely proportional to infants' gestational age (GA) [1, 2]. Environmental factors include exposure to oxygen and/or oxidative stress, which play a key role in the pathogenesis of ROP. In addition, many other factors, such as genetic predisposition, perinatal infection/inflammation, or pre- and postnatal malnutrition, may also contribute to ROP [1, 2].

There is increasing evidence that sex differences exist in both the physiology and pathology of the eye [3,4,5,6]. In fact, sex hormones are locally produced in ocular tissues, and their receptors are present throughout the eye, including the retina [3, 4]. In addition, it is generally accepted that ROP is included in what is known as the "male disadvantage of prematurity" [7]. This concept is based on the notion that both mortality and morbidity associated with prematurity are higher in boys than in girls. However, sex differences in ROP have not been the subject of in-depth study until very recently.

In 2021, we performed a meta-analysis in which we included 41 studies that reported sex differences in various outcomes of prematurity, such as ROP, bronchopulmonary dysplasia, necrotizing enterocolitis, or intraventricular hemorrhage [7]. We found that severe ROP was more common in boys than in girls, but there were no sex differences for any ROP [7]. Very recently, Bahmani et al. conducted a meta-analysis of risk factors for any ROP and found no sex differences when they pooled the 121 studies that reported these data [8]. In addition, Hoyek et al. performed a meta-analysis of the male-to-female ratio in preterm infants requiring treatment for ROP [9]. They included 316 studies and found that a higher percentage of boys than girls were screened and treated for ROP [9]. These findings confirm the existence of a male disadvantage in the incidence of severe ROP. However, there are still questions to be answered.

Our aim in the present meta-analysis is to investigate sex differences not only in severe ROP but also in any ROP. In addition, we used subgroup analysis and meta-regression to determine whether the male disadvantage in ROP is homogeneous across geographic or sociodemographic areas and whether it has changed over time and with the subsequent evolution of neonatology. In addition to classical frequentist statistics, we used a Bayesian approach. By quantifying evidence on a continuous scale, Bayesian statistics allow more nuanced instead of all-or-none (significant vs. nonsignificant) conclusions [10, 11]. Furthermore, Bayesian meta-analysis can differentiate whether there is evidence for the alternative hypothesis (H1), for the null hypothesis (H0), or whether the data are inconclusive [10,11,12]. Therefore, Bayesian analysis can help distinguish between evidence of absence and absence of evidence.

Methods

The methodology of this study is based on that of earlier studies by our group on sex differences and risk factors for outcomes of prematurity [7, 13,14,15,16]. The study was performed and reported according to the preferred reporting items for systematic reviews and meta-analyses and meta-analysis of observational studies in epidemiology guidelines. The review protocol was registered in the PROSPERO international register of systematic reviews (CRD42018095509). The research question was “Do preterm boys have a higher risk of developing ROP than preterm girls?”.

Sources and search strategy

A comprehensive literature search was undertaken using the PubMed and Embase databases. The search strategy is detailed in Supplementary Table 1. No language limit was applied. The literature search was updated up to September 2022. Narrative reviews, systematic reviews, case reports, letters, editorials, and commentaries were excluded but were read to identify potential additional studies. Additional strategies to identify studies included manual review of reference lists from key articles that fulfilled our eligibility criteria, use of the “related articles” feature in PubMed, and use of the “cited by” tool in Web of Science and Google Scholar.

Study selection and definitions

Studies were included if they had a prospective or retrospective cohort design, examined preterm infants (GA < 37 weeks) and reported primary data that could be used to measure the association between infant sex and rate of ROP. Studies that exclusively included late preterm infants (GA ≥ 34 weeks) or combined preterm and term infants were excluded. To identify relevant studies, two reviewers (HTM and GS) independently screened the results of the searches and applied inclusion criteria using a structured form. Discrepancies were resolved by two other reviewers (CG and VE).

Severe ROP was defined as prethreshold disease type 1 according to the Early Treatment for Retinopathy of Prematurity criteria (zone I, any stage ROP with plus disease; zone I, stage 3 ROP with or without plus disease; or zone II, stage 2 or 3 ROP with plus disease) [17] or as any ROP requiring treatment with photocryotherapy, laser photocoagulation or intravitreal injection of anti-vascular endothelial growth factor (VEGF) agents. Data on any ROP were also collected. If a study reported on any ROP and severe ROP, the non-severe ROP data were estimated by subtracting the severe ROP group from the any ROP group.

Data extraction and assessment of study quality

Three investigators (HTM, GS and AMF) extracted data on the study design, demographics, and rate of ROP. A second group of investigators (GS and CG) checked the data extraction for completeness and accuracy. Methodological quality was assessed using the Newcastle‒Ottawa scale (NOS) for cohort studies [18]. This scale assigns a maximum of 9 points (4 for selection, 2 for comparability, and 3 for outcome). NOS scores ≥ 7 were considered high-quality studies (low risk of bias), and scores of 5 to 6 denoted moderate quality (moderate risk of bias) [18].

Statistical analysis

Frequentist meta-analysis

Studies were combined and analyzed using Comprehensive meta-analysis V3.0 software (Biostat Inc., Englewood, NJ, USA) [19]. The risk ratio (RR) with 95% confidence interval (CI) was calculated for each individual study. Due to anticipated heterogeneity, summary statistics were calculated with a random-effects model. This model accounts for variability between studies as well as within studies [19, 20].

Statistical heterogeneity was assessed by Cochran’s Q statistic and by the I2 statistic. I2 was interpreted on the basis of Higgins and Thompson criteria, where 25%, 50%, and 75% correspond to low, moderate, and high heterogeneity, respectively [21]. Potential sources of heterogeneity were assessed through subgroup analysis and/or random effects (method of moments) univariate meta-regression analysis [22] as previously described [7, 13,14,15]. The potential sources of heterogeneity analyzed were exclusive inclusion of extremely preterm infants (GA ≤ 28 weeks), median year of cohort inclusion, median GA of the cohort, geographic location (continent or subcontinent), and sociodemographic index (SDI) quintile. The SDI is a composite measure of developmental status as it is associated with health outcomes, calculated as the geometric mean of the following three indicators: total rate of fertility, log income per capita, and mean years of education among those 15 years or older. SDI values are scaled from 0 (highest fertility, lowest income, and lowest education) to 1 (highest income, highest education, and lowest fertility) [23]. For comparisons across SDI quintiles, each country was assigned to a single quintile according to its SDI in 2019.

We used Egger’s regression test and funnel plots to assess publication bias. Duval and Tweedie's trim and fill method was used to adjust effect sizes in cases in which there was evidence of publication bias [24]. A probability value of less than 0.05 (0.10 for heterogeneity) was considered statistically significant.

Bayesian model-averaged meta-analysis

The results were further supplemented by a Bayesian model-averaged meta-analysis (BMA) [10, 11]. BMA employs Bayes factors (BFs) and Bayesian model averaging to evaluate the likelihood of the data under the combination of models assuming the presence vs. the absence of the meta-analytic effect and heterogeneity [10, 11]. The BF10 is the ratio of the probability of the data under the alternative hypothesis (H1) over the probability of the data under the null hypothesis (H0). The BF10 was interpreted using the evidence categories suggested by Lee and Wagenmakers [25]. The evidence in favor of H1 (BF10 > 1) was categorized as weak/inconclusive (1 < BF10 < 3), moderate (3 < BF10 < 10), strong (10 < BF10 < 30), very strong (30 < BF10 < 100), and extreme (BF10 > 100). The evidence in favor of H0 (BF10 < 1) was categorized as weak/inconclusive (1/3 < BF10 < 1), moderate (1/10 < BF10 < 1/3), strong (1/30 < BF10 < 1/10), very strong (1/100 < BF10 < 1/30), and extreme (BF10 < 1/100). Consequently, BMA allows us to distinguish the absence of evidence from the evidence of absence [12]. The BFrf is the ratio of the probability of the data under the random effects model over the probability of the data under the fixed effect model. The BFrf was categorized in a similar way to that described for the BF10, considering values above 1 as evidence in favor of random effects (i.e., presence of heterogeneity) and values below 1 as evidence in favor of fixed effects (i.e., absence of heterogeneity). We used the empirical prior distributions based on the Cochrane Database of Systematics Reviews for logRR, logRR = Student’s t (µ = 0, σ = 0.32, ν = 3), tau = inverse-gamma (k = 1.51, θ = 0.23) [10, 11]. We performed BMA in JASP [26], which utilizes the metaBMA R package [27].

Results

Description of studies and quality assessment

The flow diagram of the search process is shown in Supplementary Fig. 1. Of 2320 potentially relevant studies, 205 (867,409 infants) were included. Their characteristics are summarized in Supplementary Table 2. The quality score of each study according to the NOS is depicted in Supplementary Table 2. All studies received a score of six or higher, indicating a low to moderate risk of bias.

Main meta-analyses

Frequentist meta-analysis did not show a significant association between sex and any ROP (144 studies, RR = 1.00, 95% CI = 0.96–1.03) (Fig. 1a) but did show a positive association between male sex and the risk of severe ROP (113 studies, RR = 1.14, 95% CI = 1.07–1.22) (Fig. 2a) and a negative association between male sex and the risk of non-severe ROP (52 studies, RR = 0.92, 95% CI = 0.87–0.97) (Fig. 3a).

Fig. 1
figure 1

Any retinopathy of prematurity. a Summary of frequentist meta-analyses on the association between infant sex and any ROP. RR > 1 indicates a higher risk of ROP in males; b summary of Bayes factor (BF)10 values calculated through Bayesian model averaged meta-analysis. The BF10 is the ratio of the probability of the data under the alternative hypothesis (H1) over the probability of the data under the null hypothesis (H0). The BF10 was not calculated for Oceania because there was only one study from this continent. ROP retinopathy of prematurity, RR risk ratio, CI confidence interval, SDI sociodemographic index, GA gestational age

Fig. 2
figure 2

Severe retinopathy of prematurity. a Summary of frequentist meta-analyses on the association between infant sex and severe ROP. RR > 1 indicates a higher risk of ROP in males; b summary of Bayes factor (BF)10 values calculated through Bayesian model averaged meta-analysis. The BF10 is the ratio of the probability of the data under the alternative hypothesis (H1) over the probability of the data under the null hypothesis (H0). ROP retinopathy of prematurity, RR risk ratio, CI confidence interval, SDI sociodemographic index, GA gestational age

Fig. 3
figure 3

Non-severe retinopathy of prematurity. a Summary of frequentist meta-analyses on the association between infant sex and non-severe ROP. RR > 1 indicates a higher risk of ROP in males; b summary of Bayes factor (BF)10 values calculated through Bayesian model averaged meta-analysis. The BF10 is the ratio of the probability of the data under the alternative hypothesis (H1) over the probability of the data under the null hypothesis (H0). The BF10 was not calculated for Oceania because there was only one study from this continent. ROP retinopathy of prematurity, RR risk ratio, CI confidence interval, SDI sociodemographic index, GA gestational age

BMA showed that the evidence in favor of H0 (i.e., lack of association between infant sex and ROP) was strong for any ROP (BF10 = 0.05) (Table 1, Fig. 1b), whereas the evidence in favor of H1 (i.e., infant sex is associated with ROP) was extreme for severe ROP (BF10 = 71,174) (Table 2, Fig. 2b) and weak for non-severe ROP (BF10 = 2.42) (Table 3, Fig. 3b).

Table 1 Bayesian model averaged meta-analysis of the association between infant sex and any retinopathy of prematurity
Table 2 Bayesian model averaged meta-analysis of the association between infant sex and severe retinopathy of prematurity
Table 3 Bayesian model averaged meta-analysis of the association between infant sex and non-severe retinopathy of prematurity

The heterogeneity of the three meta-analyses was very high: any ROP: I2 = 73.2%, BFrf > 106 (extreme evidence for random effects); severe ROP: I2 = 81.4%, BFrf > 106 (extreme evidence for random effects); non-severe ROP: I2 = 57.4%, BFrf > 105 (extreme evidence for random effects). Detailed data on the heterogeneity of BMA analyses are depicted in Supplementary Tables 3, 4, and 5. Visual inspection of the funnel plot, Duval and Tweedie's trim and fill procedure (number of imputed studies was zero), and Egger's test of the intercept did not indicate significant publication bias in the meta-analyses of severe and non-severe ROP (Supplementary Fig. 2). For the meta-analysis on any ROP, Duvall and Tweedie's trim and fill procedure suggested that 36 studies were missing and that the adjusted RR was 0.94 (95% CI = 0.91–0.97).

Subgroup analysis and meta-regression

Subgroup analysis based on the exclusive inclusion of extremely preterm infants (GA ≤ 28 weeks) did not reveal any substantial differences for any ROP (Fig. 1, Table 1), severe ROP (Fig. 2, Table 2), or non-severe ROP (Fig. 3, Table 3).

Subgroup analysis based on the geographic location of the cohorts showed a frequentist positive association between male sex and any ROP in the cohorts from Central-Southeast Asia (RR = 1.17, 95% CI = 1.04–1.31), and the BMA showed that the evidence in favor of H1 was moderate for this association (BF10 = 4.1) (Fig. 1, Table 1). For Oceania (one study), there was a significantly higher risk of any ROP for female infants than for male infants (Fig. 1a). In all other geographic locations, no evidence of an association between sex and any ROP was found. The frequentist positive association between male sex and severe ROP was significant for North America, Europe, Oceania, and Western Asia but not for the other (sub)continents (Fig. 2). BMA showed that the evidence in favor of H1 (association between male sex and severe ROP) was extreme for North America, strong for Europe and Oceania, and moderate for Western Asia (Fig. 2, Table 2). In contrast, the evidence in favor of H0 (lack of association between sex and severe ROP) was moderate for Central–Southeast Asia and weak for Africa, Eastern Asia and Latin America (Fig. 2, Table 2). The negative association between male sex and non-severe ROP was observed only in the frequentist meta-analysis pooling all the studies but was not significant for any (sub)continent (Fig. 3a). BMA showed that the evidence for H1 (association between infant sex and non-severe ROP) was weak for Central-Southeast Asia and Europe, and the evidence for H0 was moderate for Eastern Asia and weak for Africa, Western Asia, North America, and Latin America (Fig. 3, Table 3).

Subgroup analysis based on SDI showed a significant positive association between any ROP and male sex in the middle-low to low SDI subgroup (RR = 1.22, 95% CI = 1.04–1.43), and BMA showed that the evidence in favor of this association was moderate (BF10 = 4.8) (Fig. 1, Table 1). The frequentist positive association between male sex and severe ROP was significant for the cohorts from the high and high-middle SDI countries (Fig. 2). BMA showed that the evidence in favor of H1 (association between infant sex and severe ROP) was extreme for the high SDI cohorts (BF10 = 71,174) and strong (BF10 = 21.2) for the high-middle SDI cohorts (Fig. 2, Table 2). In contrast, the evidence in favor of H0 (lack of association between infant sex and severe ROP) was moderate for the middle SDI cohorts (BF10 = 0.28) (Fig. 2, Table 2). Regarding non-severe ROP, an association with sex was observed in the frequentist meta-analysis of the high, high-middle, and middle to low SDI subgroups (Fig. 3). BMA showed that the evidence in favor of H1 (association non-severe ROP and sex) was moderate for the high-middle SDI cohorts (BF10 = 4.5), whereas the evidence in favor of H0 was moderate for the middle SDI cohorts (BF10 = 0.28).

Meta-regression showed that the effect size of the association between sex and ROP remained stable over time. That is, it does not correlate with the median year of birth of the cohort (Supplementary Table 6, Supplementary Fig. 3). This lack of correlation was maintained after the exclusion of studies with a time span of more than 5 years (Supplementary Table 6, Supplementary Fig. 3). The meta-regression could also not demonstrate a correlation between the mean/median GA of the cohort and the effect size of the association between sex and ROP (Supplementary Table 6).

Discussion

To our knowledge, this is the most comprehensive meta-analysis to date on sex differences in the development of ROP. Our data confirm the previous observations that the more severe forms of ROP are more common in preterm boys than in preterm girls [9]. However, when the study focused on any ROP, we did not observe sex differences. Therefore, our data suggest that male sex might be associated with a greater ROP progression rate. In addition, subgroup analysis showed wide variation in the association between sex and ROP according to geographic location and sociodemographic level of the countries. Finally, meta-regression showed that sex differences in ROP have remained stable over the years.

In the presence of an immature antioxidant system, hyperoxia and oxidative stress play a key role in ROP pathogenesis [1, 28]. Therefore, sex differences in ROP incidence may be related to sex differences in the retinal response to oxygen. Retinal vascular development occurs while the fetus is in the uterus in a relatively hypoxic environment. When an infant is delivered very prematurely, retinal development must continue in a hyperoxic environment, even in room air [1]. In addition, preterm infants frequently need supplementary oxygen. In the first phase, hyperoxia induces the suppression of angiogenic factors, such as VEGF, leading to cessation of vascularization and loss of normal vessels. This results in hypoxia in the avascular retina, which induces an increase in vascular growth factors, leading to the formation of abnormal vessels in phase 2 of ROP (from 30 to 32 weeks postmenstrual age to term) [1].

The antioxidant defense system develops more rapidly in female fetuses than in males, and this difference, particularly in the glutathione pathway, is often suggested as a key factor in the male disadvantage of prematurity [29, 30]. Moreover, the increased female resistance to oxidative stress may even be present at the cellular level. For example, mitochondria from female cells generate fewer superoxide radicals than those from males [31], and cell cultures of female endothelial cells from umbilical veins [32] or fetal pulmonary arteries [33] are more resistant to hyperoxia-induced damage than male cells. This enhanced female antioxidant capacity may confer greater protection against ROP during the hyperoxic phase and is a potential explanation for the sex differences we found. Accordingly, it is noteworthy that in randomized controlled trials comparing a lower oxygen saturation target (85–89%) with a higher saturation target (91%–95%), lower saturation levels protected against severe ROP only in preterm boys but not in girls [34].

It should be noted that ROP is a multifactorial condition, and therefore, many of the potential pre- and postnatal risk factors may differ by sex. Fetal sex is an important influence on prenatal development, not only because of genetic differences between males and females but also because of differences in sex hormones [35,36,37]. The levels of sex hormones diverge very early between male and female embryos/fetuses [35, 36], and this difference is maintained in early postnatal life [38]. This differential concentration of sex hormones may result in variations in the growth and development of almost all organs and systems [29, 39, 40], including the retina [41]. Within these sex-specific developmental trajectories, that of the lung is particularly relevant [29, 39, 40]. Slower lung maturation among male fetuses may result in a more challenging clinical condition at birth, which is of great importance for the subsequent morbidity and mortality of very preterm infants [29, 39, 40]. In our previous meta-analysis, we observed that male sex in very preterm infants was associated with higher rates of intubation at birth, respiratory distress syndrome treated with surfactant and/or mechanical ventilation, pneumothorax, sepsis, and necrotizing enterocolitis [7]. These differences in clinical course and complications may have a significant impact on the progression of ROP, as male preterm infants are more often exposed to inflammation, higher oxygen levels, and oxidative stress.

An important limitation of our meta-analysis is the high degree of heterogeneity. When we examined this heterogeneity by subgroup analysis, we found large variations in sex differences in ROP according to geographical location and sociodemographic level. The global epidemiological trend of ROP over the last decades has been variable, depending on the socioeconomic and demographic conditions of the countries [42,43,44]. In high-income countries, ROP mainly affects extremely preterm infants, and ROP screening is not routinely recommended for infants born after 30 weeks’ GA and with birth weights greater than 1500 g [45, 46]. In low- and middle-income countries, the incidence of preterm birth is increasing, while the mortality rate of preterm babies is decreasing as neonatal care units are improving in number and capability. However, primary prevention of ROP through close oxygen monitoring and titration can be challenging due to limited human and material resources [42,43,44, 46]. Therefore, low- and middle-income nations are currently experiencing epidemic levels of ROP not only in extremely premature infants but also in larger, more mature infants [42,43,44, 46]. Our data suggest that sex differences in ROP vary between these two epidemiological patterns of the disease.

It should be noted that most of the cohorts were from countries with high economic and educational levels but low fertility rates (i.e., high and high-middle SDI quintiles). The results from these countries account for the extreme evidence we found in favor of male disadvantage in severe ROP. However, there is a marked difference with the results from countries with medium and low SDIs. For example, there was no evidence of sex differences in severe ROP in the lower SDI groups. However, a male disadvantage was observed for any ROP, which was not present in the meta-analysis of high and medium SDI countries.

Finally, it has been suggested that advances in perinatal medicine, which have led to reductions in mortality and improvements in short-term outcomes for the most vulnerable preterm infants, may have had a greater impact on boys than on girls [47]. It is, therefore, possible that the male disadvantage of prematurity diminishes over the years. This seems to be the case with mortality but not with other complications of prematurity [7]. Our meta-analysis included cohorts from the last 40 years, and meta-regression confirmed that sex differences in ROP have remained stable over time.

There is a growing awareness among clinicians and researchers that many of the diseases they treat and study are characterized by sex/gender differences in epidemiology, pathophysiology, clinical manifestations, disease progression, and response to treatment [3,4,5,6, 30]. The present meta-analysis confirms that the risk of developing severe ROP is higher in preterm boys than in girls. Visual impairment is an important sequela of prematurity that affects both the anterior and posterior visual pathways [48]. Some conditions leading to posterior visual pathway dysfunction, such as intraventricular hemorrhage, posthemorrhagic ventricular dilation, or periventricular leukomalacia, are also included in the male disadvantage of prematurity [7]. This raises the risk of visual dysfunction in extremely premature males. An increased risk of visual impairment may play a role in extending the male disadvantage of prematurity beyond the first years of life, as evidenced by the greater risk of neurodevelopmental impairment in ex-preterm boys compared with ex-preterm girls [49, 50].