Background

Assisted reproductive technology (ART) is a group of medical methods for treating the infertile human in which both male and female gametes are used outside the body to achieve pregnancy [1]. To date, approximately 5 million babies are born worldwide via ART [2]. Although ART may help infertile couples, its use has increased concerns associated with pregnancy-related complications and adverse consequences [3]. It has been suggested that obstetric outcomes in gestation after ART are poor when compared with those pregnancies spontaneously conceived [4]. Moreover, evidence from meta-analyses [4,5,6,7,8] has revealed that singleton pregnancies after ART are at higher risk of adverse consequences than those conceived naturally. One of the outcomes followed by ART is gestational diabetes mellitus (GDM) and is known as one of the most common complications in pregnancy [9, 10]. GDM is defined as “carbohydrate intolerance of variable severity with onset or first recognition during pregnancy” [11]. GDM is a worldwide public health problem and complicates about 7% of all pregnancies [12, 13]. The cause and pathogenesis of GDM is both multifunctional and complex [14]. GDM is prone to causing a woman and her baby a wide range of complications during pregnancy and in later life [15, 16]. women with GDM are more likely to develop metabolic syndrome in the future, including type 2 diabetes [17]. Therefore, it is important to realize the risk factors of GDM such as family history of diabetes, obesity, high parity, advanced maternal age, previous adverse pregnancy, non-white race, history of a baby with birth weight > 3800 g, and hypothyroidism [12, 18].

In addition, studies have indicated that ART pregnancies are related to an increased risk of GDM [19,20,21,22]. Another study in Australia reported those who underwent ART are more prone to experience GDM compared to those who conceived spontaneously [23]. However, it was shown in another study that the rate of GDM was lower in women who conceived under intracytoplasmic sperm injection (ICSI) compared to those of spontaneously, in vitro fertilization (IVF) or simple ART [24]. Finally, we conducted a meta-analysis to provide an up-to-date survey of pregnancies resulting from ART and the increased risk of GDM between 1997 and 2017. We aimed to investigate the higher risk of GDM in pregnancies following ART and compare them to those of spontaneous conceptions.

Material and methods

Search strategy

This systematic review adheres to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist [25]. We searched the electronic databases ISI Web of Knowledge, MEDLINE/PubMed, Scopus, and Embase through May 2017, for studies investigating the relationship between ART and GDM. The search terms used were presented in Table 1. Reference lists from all identified studies were also searched for any relevant articles. Two authors (MM and AA) evaluated the studies, and discrepancies were resolved by discussion.

Table 1 Search strategy for MEDLINE (MeSH, Medical Subject Headings)

Inclusion and exclusion criteria

We included published studies that examined the relationship between the use of ART and the risk of GDM. No restriction criteria were imposed with regard to the size or type of the studied population, nor to the type of ART treatment. The following study types were excluded from the analyses: (a) non-English articles; (b) animal studies; (c) repeated or overlapping studies; (d) reviews, meta-analyses, case reports, editorials, and letters-to-the-editor articles; and (e) unpublished studies.

Outcome and exposure

The exposure variable was all types of ART treatment. Our outcome was GDM, defined as “carbohydrate intolerance of variable severity with onset or first recognition during pregnancy” [11].

Data extraction and quality assessment

Two reviewers (MM and AA) independently abstracted the following data from all eligible articles: first author’s name; year of publication; location; study period; design; sample size; type of ART; and study findings. Discrepancies were resolved by discussion between two reviewers.

Quality assessment of included studies was performed independently by two reviewers using the Newcastle–Ottawa Scale (NOS) [26]. The NOS assesses the methodological quality of the observational studies according to three domains: (a) selection of study groups; (b) comparability of groups; and (c) ascertainment of exposure and outcomes. Total scores range from 0 (lowest quality) to 9 (highest quality).

Statistical analysis

Data were analyzed using STATA version 13.0 (Stata Corp, College Station, TX, USA). The pooled relative risk (RR) was calculated with its 95% confidence interval (CI) to assess the strength of the association between the use of ART and GDM risk. To assess between study heterogeneity, both the Cochran Q test and the I2 statistic (the percentage of total variation across studies attributable to heterogeneity beyond chance) were calculated [27]. I2 values of 25, 50, and 75% were used as evidence of low, moderate, and high heterogeneity, respectively [27]. Subgroup analysis was performed to detect factors that may explain heterogeneity in outcome between each study. Publication bias was assessed using visual inspection of a funnel plot, Egger’s test, and Begg’s test [28, 29]. In all statistical tests, results with P < 0.05 were deemed statistically significant, except for the Cochran Q test where P < 0.10 was used.

Results

Study selection

The steps of the study selection are displayed in Fig. 1. A total of 950 related published articles were retrieved by using a search strategy in four international databases (638 from Scopus, 91 from PubMed, 62 from ISI Web of Knowledge, and 159 from Embase) and also seven records were identified from Google Scholar and reference lists of final included papers in the meta-analysis. In this study, 829 papers remained after removing duplicate papers using EndNote software. After title and abstract screening, 278 relevant articles were recognized as eligible and they were considered for additional full-text screening. After excluding 230 non-eligible studies, finally, 48 studies (four case-control studies, three cross-sectional studies, and 41 cohort studies) were included in this meta-analysis.

Fig. 1
figure 1

Flow diagram of the literature search for studies included in meta-analysis

Study characteristics

The study characteristics of the included studies are summarized in Table 2. In total, we included 48 studies published from 1987 to 2017. Observational studies (i.e., cross-sectional, case control and cohort studies) were included in the meta-analysis, whereas non-English studies and studies without relevant data or partial data were excluded. Sample size in the ART group ranged from 31 to 21,615 cases and in the non-ART group it ranged from 20 to 595,168 cases. Of the 48 studies, 19 were conducted in Asia, 17 in Europe, and 12 in America. Fourteen studies were published before 2011 and 34 studies were published from 2011 to 2017.

Table 2 Characteristics of the primary studies included in the meta-analysis

Quantitative data synthesis

In the present study, 91,487 ART cases (with 6819 cases of GDM) and 2,525,234 non-ART cases (with 113,505 cases of GDM) were included in the analysis. RRs and their 95% CIs were calculated using the Mantel–Haenszel method and, because of significant heterogeneity between studies, random effect models were also used. The relationship of ART and the risk of GDM was estimated using 48 included primary studies. The summary estimate of RR in this meta-analysis suggested that ART significantly was associated with higher risk of GDM (pooled RR = 1.51, 95% CI = 1.18–1.93, P = 0.001); that is, the risk of GDM in the ART group is 1.51 times compared to that in the non-ART group (Fig. 2 and Table 3).

Fig. 2
figure 2

Forest plot showing the risk of GDM following ART

Table 3 Summary of meta-analysis results and subgroups analysis

Heterogeneity analysis

To check the heterogeneity between studies, chi-square test, I-squared, and Tau-squared were conducted. Chi-square analysis revealed that there was a significant heterogeneity between primary studies (P < 0.001, I2 = 98.6%); consequently, to pool the effect sizes in this study, a random effect model was used. To find the source of heterogeneity between studies, subgroup analyses were performed on the basis of study design, study region, and study period (Table 3). Even after the aforementioned subgroup analyses, heterogeneity across the studies did not diminish successfully in all subgroups; for that reason, some estimations of pooled RR were measured by the random effects model and only pooled RR for case control studies and the papers that were published between 1987 and 2010 were estimated by a mixed-effect model (Figs. 3, 4 and 5).

Fig. 3
figure 3

Forest plot showing the risk of GDM following ART on the basis of study design

Fig. 4
figure 4

Forest plot showing the risk of GDM following ART on the basis of time period

Fig. 5
figure 5

Forest plot showing the risk of GDM following ART on the basis of region

Risk of publication bias

Graphical (funnel plot) and statistical tools (Begg’s and Egger’s test) were done to test the existence of publication bias in the studies. The results of the symmetrical funnel plot (Fig. 6), Egger’s test (P = 0.331), and Begg’s test (P = 0.810) suggested that there was no significant publication bias in this study.

Fig. 6
figure 6

Funnel plot to assess the presences of publication bias

Discussion

The current study aimed to assess the impact of ART on GDM using a systematic review of related articles. This meta-analysis included 344,021 cases, in which 91,487 cases used ART to achieve pregnancy. Statistical approaches were determined based on the heterogeneity of the included studies and publication bias was checked. Several subgroups were defined based on the study design, time period, and region.

The results from this meta-analysis revealed that GDM is strongly affected by the use of ART. The relative risk of GDM was significant regarding the use of ART. Regarding the magnitude of the RR, the results from different study designs were in accordance. However, the included cross-sectional studies did not report a significant pooled RR in contrast to cohort and case-control studies and this might be due to the lower number of cross-sectional studies. Moreover, the impact of ART on GDM did not differ in two distinct periods of time (2010 as the cut-off point). In contrast to America, consistent results were found in two regions of Asia and Europe. The pooled RR resulting from American studies showed a higher risk of GDM among those in the non-ART group.

The ART has been defined as treatments including in vitro handling of oocytes and sperm, and embryos, in which establishing pregnancy is the goal [76]. There have been many debates on the efficacy and safety of using ART regarding its increasing trend of use across most countries [77, 78]. It has been shown that ART is responsible for a high number of adverse pregnancy-related complications and obstetric outcomes such as polyhydramnios, low and very low infant birth weight, pregnancy-induced hypertension, pre-eclampsia, perinatal mortality, preterm and very preterm birth, placenta previa, antepartum hemorrhage, multiple pregnancy congenital malformation, higher risk of ectopic pregnancy, lower odds of vaginal delivery, postpartum hemorrhage, oligohydramnios, small for gestational age, and placental abruption [36, 79,80,81,82,83]. As mentioned, using ART was associated with GDM, which is diabetes diagnosed during pregnancy. Pregnancy may cause insulin resistance and hyperinsulinemia and can be followed by diabetes. GDM is defined as glucose intolerance with the first recognition during pregnancy and usually progresses in the second trimester [84]. GDM is associated with a large number of risk factors, such as elevated pre-pregnancy body mass index, older maternal age, history of GDM, diabetes among family members, polycystic ovary syndrome (PCOS), pre-existing hypertension, weight gain during pregnancy, smoking, ART, and higher parity [85,86,87]. The adverse effect of ART on GDM is discussed by several studies; however, the mechanism has not been well clarified [48, 52]. Several hypotheses are introduced in which GDM is influenced by the use of ART, including the etiology of infertility, the drugs used in the treatment procedure, the hormonal levels, and metabolic and vascular factors [19, 52]. However, it has been revealed that maternal age is the most effective factor on GDM [88]. Wang et al. have discussed the association between GDM and ART through impaired glucose tolerance in comparison to those of spontaneous conceptions. Moreover, they have exposed that for singleton mothers, GDM was more common among cases that underwent ART. However, the risk increases for singleton mothers younger than 40 [48]. Double embryo transfer has been introduced as a significant factor for multiple gestational pregnancy, which is followed by an elevated risk of GDM [89, 90]. Vitthala et al. assessed the risk of monozygotic twins after ART using a systematic review and they revealed that in comparison to cleavage embryo transfer, GDM is more affected by blastocyst transfer [91]. Hammoud et al. addressed the scientific question of whether it is important to diagnose GDM by screening or symptoms. They showed that GDM is strongly related to large-for-gestational-age births [92] and Sazonova et al. showed that babies after embryo transfer have a higher large for gestational age compared to fresh embryo transfer [93]. Pre-existing hypertension is associated with GDM [87] and this might be due to higher rates of ART mothers being of high maternal age [94]. Sibai and Ross assessed the pathophysiology and long-term consequences of hypertension in GDM. They demonstrated that mothers of twins are at a higher risk of GDM in contrast to those of singletons [90]. Risk of GDM among women with PCOS was assessed by Toulis et al. in a systematic review. They showed an increased likelihood of developing GDM among women with PCOS compared with general cases [95].

The current meta-analysis revealed a significant heterogeneity among the pooled studies, the cohort and cross-sectional studies, the studies conducted during 2011–2017, and the three regions of Asia, Europe, and America. Several statistical tools are available to check the heterogeneity of included studies in a meta-analysis and its selection mechanism depends on several factors such as sample size, the frequency of included studies, etc. The two common tests for heterogeneity (chi-square and the I2 value) can result in controversial conclusions regarding the number of included studies and the magnitude of the relative risks [96]. There might be many reasons for the presence of heterogeneity in the results, such as different cultural and ethnic conditions and diversity in the amount of regions’ development.

The present systematic review has several limitations that should be noted. First, the most important limitation for this study as for other meta-analysis studies is the lack of data for subgroup analysis based on type of pregnancy (singleton versus twin pregnancy), type of ART, or for data analysis controlling for known confounders. Second, there were no data on the relationship between ART and GDM for large regions such as Africa and Latin America, thus the generalizability of the results may be limited. Third, this study included only English papers.

In sum, the findings of the present systematic review and meta-analysis indicate that the use of ART is associated with a 1.51-fold increase in GDM. Women need to be counselled carefully before undergoing ART treatment about the possibility and risk of GDM.