Introduction

Gestational diabetes mellitus is an important cause of maternal and perinatal complications including preeclampsia, gestational hypertension, caesarean section, macrosomia, and stillbirths [1]. Gestational diabetes mellitus is a glucose intolerance discovered for the first time in pregnancy and is by the National Institute for Health and Care Excellence (NICE) defined as a fasting plasma glucose of 5.6 mmol/L or higher or a 2-h plasma glucose level of 7.8 mmol/L or higher [2]. However, there is still no international agreement about the definition of gestational diabetes mellitus [3]. The prevalence of gestational diabetes mellitus is increasing worldwide [4] parallel to the increase in prevalence of overweight and obesity among pregnant women. Overweight and obesity is the strongest risk factor for gestational diabetes mellitus with 2–3 and 5–6 fold increases in the relative risk (RR) compared to normal weight women [5, 6]. Some evidence suggests an increased risk even within the high-normal range of body mass index compared to the low-normal range [7, 8], similar to what is observed for type 2 diabetes [9]. Gestational diabetes and type 2 diabetes have many pathophysiological features in common.

Although physical activity has been established as a protective factor for type 2 diabetes [10], the data regarding physical activity and gestational diabetes mellitus are less extensive and less convincing [1137]. Several [16, 19, 21, 2327, 29, 3440], but not all studies [1115, 17, 18, 20, 22, 28, 30], have reported inverse associations between higher physical activity and gestational diabetes mellitus risk, however, even among the studies that did report inverse associations the strength of the associations have varied considerably with reductions in the relative risk ranging from 10–30 [2426, 37] up to 50–90 % [16, 19, 21, 23, 27, 3436, 38]. It is not clear whether the variability in the results could be due to differences in the ranges and amounts of physical activity between studies, or if it varies by subtypes or intensity of physical activity, or whether it is the total amount of physical activity that is the most important factor. A previous meta-analysis of case–control and cohort studies reported an inverse association between high versus low physical activity and gestational diabetes mellitus [41], but no dose–response analyses were conducted. Two more recent meta-analyses [42, 43], which only included randomized trials came to opposite conclusions to whether physical activity reduces gestational diabetes risk with one showing no association [42] and another showing an inverse association [43], but none of the published meta-analyses reported whether the amount of physical activity was related to the outcome. Some studies have reported dose-dependent inverse associations between physical activity and gestational diabetes mellitus risk [23, 24, 31, 35, 37], however, other studies suggested that most of the benefit observed was when increasing physical activity level from a low level to a moderate level [25, 27, 34]. Clarifying whether there is a linear dose–response relationship or whether there are threshold levels of activity could be important to provide more detailed recommendations for the physical activity level needed for women to reduce the risk of gestational diabetes mellitus, and could also provide crucial information for the planning of future large-scale randomized trials of physical activity for prevention of the disease.

Several additional studies have been published since the previous meta-analyses [1214, 16, 1922, 2833] and for this reason we conducted an updated systematic review and meta-analysis of physical activity and gestational diabetes mellitus with a particular aim of clarifying whether there is a dose–response relationship between increasing physical activity level and lower risk of gestational diabetes mellitus. We also summarized studies that have been published on physical activity and abnormal glucose tolerance (elevated glucose levels in the non-diabetic range) [22, 25, 33, 39, 44, 45].

Methods

Search strategy

The PubMed, Embase and Ovid databases were initially searched up to December 10th 2014 for cohort studies and randomized trials of physical activity and gestational diabetes mellitus risk and the searches were later updated to August 5th 2015. We used the following search terms: (physical activity OR exercise OR sports OR walking OR biking OR running OR fitness OR “exercise test” OR inactivity OR sedentary OR “risk factor” OR “risk factors”) AND (“gestational diabetes” OR “gestational diabetes mellitus”) AND (“case–control” OR retrospective OR cohort OR cohorts OR prospective OR longitudinal OR “follow-up” OR “cross-sectional” OR trial). We also searched the reference lists of previous reviews on the subject [4143] and of the studies included in the analysis for any further studies.

Study selection

To be included, the study had to be a randomized controlled trial, or a cohort study, and to investigate the association between physical activity and risk of gestational diabetes mellitus or abnormal glucose tolerance. Estimates of the relative risk (hazard ratio, risk ratio, odds ratio) had to be available with the 95 % confidence intervals, and for the dose–response analysis, a quantitative measure of activity level for 3 or more categories of activity and the total number of cases and person-years or participants had to be available in the publication. When multiple publications were available from the same study we used the study with the most detailed analyses of physical activity and the largest number of participants. We identified 26 studies that were included in total [1137, 39, 44, 45], 23 studies that could be included in the analysis [1137] of gestational diabetes mellitus and six studies that could be included in the analysis of abnormal glucose tolerance [22, 25, 33, 39, 44, 45]. A list of the excluded studies and reasons for exclusion is found in Supplementary Table 1. The search was conducted by DA and study selection was conducted by DA and AS.

Data extraction

We extracted the following data from each study: The first author’s last name, publication year, country where the study was conducted, follow-up period, sample size, age, number of cases, exposure, physical activity level, RRs and 95 % CIs, and variables adjusted for in the analysis. Data extractions were done by DA and checked for accuracy by AS.

Study quality assessment

The quality of the studies included was assessed using the Newcastle–Ottawa scale [46] for cohort studies and the Cochrane Collaboration’s tool for assessing risk of bias in randomised trials [47]. The Newcastle–Ottawa scale assesses the study quality based on the selection (representativeness of the exposed cohort, selection of the non-exposed cohort, ascertainment of exposure, demonstration that the outcome of interest was not present at the start of the study), comparability (adjustment for confounding factors), and the outcome (outcome assessment, long enough follow-up, adequacy of follow-up of cohorts). The randomized trials were assessed for risk of bias based on random sequence generation, allocation concealment, blinding of participants and personal, blinding of outcome assessment, incomplete outcome data, selective reporting, and other biases. Subgroup analyses by study quality scores or risk of bias were conducted separately for the observational studies and the randomized trials because of the different scales for the two study designs. For the observational studies we grouped studies with 0–3, 4–6, and 7–9 points to indicate low, medium and high quality studies, while the randomized trials studies were grouped according to whether they were at high, low or unclear risk of bias in the subgroup analyses.

Statistical methods

We used random effects models to calculate summary RRs and 95 % CIs for the highest versus the lowest level of physical activity and for the dose–response analysis [48]. The average of the natural logarithm of the RRs was estimated and the RR from each study was weighted by the inverse of its variance and then un-weighted by a variance component which corresponds to the amount of heterogeneity in the analysis. A two-tailed p < 0.05 was considered statistically significant.

We used the method described by Greenland and Longnecker [49] for the dose–response analysis and computed study-specific slopes (linear trends) and 95 % CIs from the natural logs of the RRs and CIs across categories of physical activity. The method requires that the distribution of cases and person-years or non-cases and the RRs with the variance estimates for at least three quantitative exposure categories are known. We estimated the distribution of cases or person-years in studies that did not report these, but reported the total number of cases/person-years, as described previously [50, 51]. The median or mean physical activity level in each category was assigned to the corresponding relative risk for each study. For studies that reported physical activity by ranges of activity we estimated the midpoint for each category by calculating the average of the lower and upper bound. When the highest or lowest category was open-ended we assumed the open-ended interval length to be the same as the adjacent interval. For one study which only provided a continuous estimate of physical activity per 100 kcal of energy expenditure we re-calculated the odds ratio so it corresponded to an increment equal to the highest compared to the lowest quartile so it could be included in the high versus low analysis [26], and the same was done for another study [30]. For the dose–response analysis we conducted separate analyses for studies reporting results in metabolic equivalent task (MET)-hours and hours/week. The MET is an index of the intensity of physical activity and is defined as the caloric expenditure per kilogram of body weight per hour of activity, divided by the equivalent per hour at rest [52]. One MET is equal to the energy cost of a person during quiet sitting, walking slowly has a MET value of 2 and jogging and bicycling have MET values of 7–8. MET-hours are the number of hours spent in each activity multiplied with the MET value of that activity. For one study [37] we converted frequency of physical activity/week to hours/week by assigning a dose of 45 min per session [10, 53]. We examined a potential nonlinear dose–response relationship between physical activity and gestational diabetes mellitus by using restricted cubic splines with three knots at 10, 50 and 90 % percentiles of the distribution which was combined using multivariate meta-analysis [54, 55]. A likelihood ratio test was used to assess the difference between the nonlinear and linear models to test for nonlinearity [56]. Although formal dose–response analyses were not possible for the randomized controlled trials we fitted a linear regression of the RR estimates against the total number of hours/week the interventions lasted.

Heterogeneity between studies was assessed by the Q test and I2 [57]. I2 is the amount of total variation that is explained by between study variation. I2 values of approximately 25, 50 and 75 % are considered to indicate low, moderate and high heterogeneity, respectively. To investigate sources of heterogeneity subgroup analyses were conducted according to study design, geographic location, number of cases and adjustment for confounding factors. Meta-regression analyses were used to test for heterogeneity between subgroups.

Publication bias was assessed with Egger’s test [58] and the results were considered to indicate publication bias when p < 0.10. We conducted sensitivity analyses excluding one study at a time to ensure that the results were not simply due to one large study or a study with an extreme result. The statistical analyses were conducted using Stata, version 13.0 software (StataCorp, College Station, TX, USA). The following Stata commands were used: metan7 (high versus low analyses and dose–response analyses), glst (dose–response analyses), metareg (meta-regression analyses), metabias6 (Egger’s test), metafunnel (funnel plot), metaninf (influence or sensitivity analyses).

Results

Out of the 7616 records which were identified by the literature search, 7501 were excluded based on the title and abstract (one author screened all references), and 115 full text articles were assessed in detail (in duplicate) as they reported on physical activity or risk factors and gestational diabetes mellitus in the title/abstract (Fig. 1). Out of these articles, 26 publications (25 studies) were finally included in the meta-analysis, including twelve randomized trials [1122] and eleven cohort studies [2333] that could be included in the meta-analysis of physical activity and gestational diabetes mellitus (Tables 1, 2, 3; Fig. 1), and one randomized trial [22] and five cohort studies [25, 33, 39, 44, 45] that were included in the meta-analysis of physical activity and risk of abnormal glucose tolerance (Table 4; Fig. 1).

Fig. 1
figure 1

Flow-chart of study selection

Table 1 Randomized controlled trials of physical activity and gestational diabetes mellitus risk
Table 2 Prospective cohort studies of physical activity and gestational diabetes mellitus risk
Table 3 Cohort studies and randomized trials of physical activity and impaired or abnormal glucose tolerance
Table 4 Subgroup analyses of leisure-time physical activity and gestational diabetes mellitus risk, high versus low analysis

Total physical activity before pregnancy

Four cohort studies [26, 27, 32, 33] were included in the analysis of total physical activity (sum of leisure-time, household and occupational activity) before pregnancy and gestational diabetes mellitus and included 293 cases and 4607 participants. The summary RR for high versus low activity was 0.62 (95 % CI 0.41–0.94, I2 = 0 %, pheterogeneity = 0.88) (Fig. 2a). Because of differences in the way the results were reported it was not possible to conduct dose–response analyses of total physical activity.

Fig. 2
figure 2

Total physical activity before and during pregnancy and gestational diabetes, high versus low comparison. a Total physical activity before pregnancy and gestational diabetes mellitus, high versus low analysis. b Total physical activity during pregnancy and gestational diabetes mellitus, high versus low analysis

Total physical activity during pregnancy

Three cohort studies [27, 32, 33] were included in the analysis of total physical activity during pregnancy and gestational diabetes mellitus and included 244 cases and 3996 participants. The summary RR for high versus low activity was 0.66 (95 % CI 0.36–1.21, I2 = 0 %, pheterogeneity = 0.90) (Fig. 2b). Because of differences in the way the results were reported it was not possible to conduct dose–response analyses of total physical activity.

Leisure-time physical activity before pregnancy

Eight cohort studies [2325, 2730, 33] were included in the analysis of pre-pregnancy physical activity and the risk of gestational diabetes mellitus and included 2401 cases and 32,592 participants. The summary RR for high versus low pre-pregnancy physical activity was 0.78 (95 % CI 0.61–1.00, I2 = 47 %, pheterogeneity = 0.07) (Fig. 3). There was no evidence of publication bias with Egger’s test, p = 0.87. In the dose–response analysis of MET-hours/week the summary RR was 0.84 (95 % CI 0.59–1.21, I2 = 80.9 %, pheterogeneity = 0.001) per 20 MET-hours/week [23, 24, 28] (Fig. 4a) and there was no evidence of nonlinearity, p = 0.31 (Fig. 4b; Supplementary Table 2). In the dose–response analysis of hours/week the summary RR was 0.70 (95 % CI 0.49–1.01, I2 = 72.6 %, pheterogeneity = 0.03) per 5 h/week [23, 25, 31] (Fig. 4c). There was evidence of nonlinearity, pnonlinearity = 0.005, with a steeper inverse association at the lower levels of physical activity, but further reductions in risk were observed with higher levels of activity (Fig. 4d; Supplementary Table 2).

Fig. 3
figure 3

Leisure-time physical activity before pregnancy and gestational diabetes mellitus, highest versus lowest comparison

Fig. 4
figure 4

Leisure-time physical activity before pregnancy and gestational diabetes mellitus, linear (per 20 MET-hours/week and per 5 h/week) and nonlinear dose–response analysis. a Leisure-time physical activity before pregnancy and gestational diabetes mellitus, per 20 MET-hours/week. b Leisure-time physical activity before pregnancy and gestational diabetes mellitus, nonlinear dose–response, MET-hours/week. c Leisure-time physical activity before pregnancy and gestational diabetes mellitus, per 5 h/week. d Leisure-time physical activity before pregnancy and gestational diabetes mellitus, nonlinear dose–response, h/week

Leisure-time physical activity during pregnancy

Twelve randomized trials [1122] and five cohort studies [23, 25, 27, 30, 33] were included in the analysis of early pregnancy physical activity and the risk of gestational diabetes mellitus and included 900 cases and 9804 participants. The summary RR for high versus low physical activity in early pregnancy was 0.80 (95 % CI 0.64–1.00, passociation = 0.046), with low heterogeneity, I2 = 17 %, pheterogeneity = 0.26 (Fig. 5). The summary RR was 0.69 (95 % CI 0.50–0.96, I2 = 30.2 %, pheterogeneity = 0.15) for the randomized trials and 0.97 (95 % CI 0.73–1.28, I2 = 0 %, pheterogeneity = 0.80) for the cohort studies. There was evidence of publication bias with Egger’s test, p = 0.007. We also repeated the analysis using mid-pregnancy physical activity data instead of early pregnancy data for two studies [27, 33] which provided both, and the results were slightly strengthened, summary RR = 0.75 (95 % CI 0.59–0.95, I2 = 27.7 %, pheterogeneity = 0.14). The summary RR per 5 h/week of physical activity was 0.98 (95 % CI 0.87–1.09, I2 = 0 %, pheterogeneity = 0.59) (Fig. 6c) [23, 25, 34], but there was evidence of a nonlinear inverse association, pnonlinearity = 0.008, with no further reduction in risk from approximately 7–8 h/week (Fig. 6d; Supplementary Table 2). Among the randomized trials we fitted a linear regression of the relative risks against the approximate total number of hours/week of the interventions, and although not statistically significant, p = 0.24, there was some indication of greater reductions in risk with a larger number of hours of activity (Supplementary Fig. 1). When the randomized trials were stratified by duration of activity among the studies for which we could estimate the approximate number of hours of activity per week the interventions lasted, the summary RR was 0.80 (95 % CI 0.37–1.71, n = 3) for studies with 1–2 h of activity per week, and 0.64 (0.44–0.93, n = 9) for studies with >2 h/week of activity, and 0.66 (95 % CI 0.44–1.01, n = 7) for >2–3 h/week, and 0.48 (95 % CI 0.18–1.27, n = 2) for >3 h/week.

Fig. 5
figure 5

Leisure-time physical activity during early pregnancy and gestational diabetes mellitus, high versus low comparison

Fig. 6
figure 6

Leisure-time physical activity during early pregnancy and gestational diabetes mellitus, linear (per 5 h/week) and nonlinear dose–response analysis. a Leisure-time physical activity during pregnancy and gestational diabetes mellitus, per 5 h/week. b Leisure-time physical activity during pregnancy and gestational diabetes mellitus, nonlinear dose–response, h/week

Combined pre-pregnancy and early pregnancy physical activity

Two cohort studies [23, 25] investigated the association between combined physical activity before and during pregnancy and risk of gestational diabetes mellitus. The summary RR was 0.60 (95 % CI 0.30–1.23, I2 = 19.1 %, pheterogeneity = 0.27) for physical activity before pregnancy only, 1.01 (95 % CI 0.49–2.07, I2 = 0 %, pheterogeneity = 0.33) for physical activity during pregnancy only, and 0.41 (95 % CI 0.23–0.73, I2 = 0 %, pheterogeneity = 0.45) for physical activity both before and during pregnancy (Fig. 7).

Fig. 7
figure 7

Leisure-time physical activity before and during pregnancy and gestational diabetes mellitus, joint associations

Walking

Two cohort studies [24, 25] were included in the analysis of walking before pregnancy and gestational diabetes mellitus and two cohort studies [25, 30] were included in the analysis of walking during pregnancy and gestational diabetes mellitus. The summary RR was 0.66 (95 % CI 0.48–0.91, I2 = 0 %, pheterogeneity = 0.97) for walking before pregnancy (Fig. 8a). The summary RR was 0.80 (95 % CI 0.66–0.97, I2 = 0 %, pheterogeneity = 0.59) for walking during pregnancy (Fig. 8b).

Fig. 8
figure 8

Walking and vigorous physical activity before and during pregnancy and gestational diabetes, high versus low comparison. a Walking before pregnancy and gestational diabetes mellitus, high versus low analysis. b Walking during pregnancy and gestational diabetes mellitus, high versus low analysis. c Vigorous physical activity before pregnancy and gestational diabetes mellitus, high versus low analysis. d Vigorous physical activity during pregnancy and gestational diabetes mellitus, high versus low analysis

Intensity of physical activity

Three cohort studies [24, 25, 33] investigated the association between vigorous physical activity before pregnancy and gestational diabetes mellitus, while two cohort studies [25, 33] investigated vigorous physical activity in early pregnancy and gestational diabetes mellitus. The summary RR was 0.76 (95 % CI 0.66–0.88, I2 = 0 %, pheterogeneity = 0.45) (Fig. 8c) for vigorous physical activity before pregnancy and 0.95 (95 % CI 0.55–1.63, I2 = 0 %, pheterogeneity = 0.78) (Fig. 8d) for vigorous physical activity in early pregnancy.

Occupational physical activity and household physical activity

Two cohort studies [27, 33] were included in the analysis of occupational physical activity before and during pregnancy. The summary RR for high versus low occupational physical activity was 1.90 (95 % CI 0.97–3.74, I2 = 0 %, pheterogeneity = 0.79) for activity before pregnancy (Supplementary Fig. 2a) and 0.78 (95 % CI 0.21–2.93, I2 = 78 %, pheterogeneity = 0.03) for activity during pregnancy (Supplementary Fig. 2b). Two cohort studies [27, 33] were included in the analysis of household physical activity before and during pregnancy. The summary RR for high versus low household physical activity was 0.36 (95 % CI 0.12–1.08, I2 = 61.9 %, pheterogeneity = 0.11) for activity before pregnancy (Supplementary Fig. 3a) and 1.22 (95 % CI 0.53–2.81, I2 = 23.6 %, pheterogeneity = 0.25) for activity during pregnancy (Supplementary Fig. 3b).

Physical activity and abnormal glucose tolerance

We conducted supplementary analyses of one randomized trial [22] and five cohort studies [25, 33, 39, 44, 45] which reported on physical activity and abnormal glucose tolerance as an outcome and the summary RRs were 0.81 (95 % CI 0.55–1.17, I2 = 34 % pheteroeneity = 0.21) for pre-pregnancy physical activity [25, 33, 39, 44] (Supplementary Fig. 4a) and 0.77 (95 % CI 0.63–0.95, I2 = 0 % pheteroeneity = 0.76) for physical activity during pregnancy [22, 25, 33, 44, 45], respectively (Supplementary Fig. 4b).

Subgroup, sensitivity, and meta-regression analyses

In subgroup and meta-regression analyses we found no significant heterogeneity between subgroups when studies were stratified by study design, geographic location, number of cases, study quality (cohort studies) or risk of bias (randomized trials) (Table 4). Further subgroup analyses by whether studies had adjusted for confounding factors did not reveal significant heterogeneity between most strata, although associations were not always statistically significant. We also conducted sensitivity analyses excluding one study at a time in the analyses of leisure-time physical activity, and although the summary relative risks did not vary substantially exclusion of some studies made the borderline significant associations statistically significant (Supplementary Figures 5, 6).

Mean (median) study quality scores were 7.0 (6.5) for cohort studies of leisure-time physical activity before pregnancy and 7.0 (7.0) for cohort studies of leisure-time physical activity during pregnancy. Of the 12 randomized trials 6 were deemed to be of high risk of bias, 1 of low risk of bias and 5 of unclear risk of bias.

We conducted further analyses of three studies on physical activity before pregnancy [23, 24, 27] and three studies on physical activity during pregnancy [23, 24, 27] and gestational diabetes mellitus which provided risk estimates adjusted and not adjusted for BMI, to clarify whether part of the association might be explained by reduced body fatness. The summary RR for pre-pregnancy physical activity was 0.63 (95 % CI 0.23–1.71) without BMI adjustment and 0.72 (95 % CI 0.30–1.76) with BMI adjustment, while for physical activity during pregnancy it was 0.49 (95 % CI 0.29–0.83) without BMI adjustment and 0.56 (95 % CI 0.33–0.95) with BMI adjustment.

Discussion

In this meta-analysis higher leisure-time physical activity before and during pregnancy was associated with a marginally significant 22 % reduction and 20 % reduction in the relative risk of gestational diabetes mellitus, respectively. Higher total physical activity before pregnancy was associated with a 36 % reduction in the relative risk of gestational diabetes, while the association for total physical activity during pregnancy was in the direction of reduced risk, but was not statistically significant, possibly due to few studies. Walking before and during pregnancy and vigorous activity before pregnancy were also inversely associated with gestational diabetes, but occupational and household physical activity were not associated with risk, although these results were based on few studies.

When stratified by study design the association between leisure-time physical activity during pregnancy and gestational diabetes was significant in randomized trials, but not significant in cohort studies. As the studies differed with regard to the level of physical activity level between studies it is difficult to base physical activity recommendations on the results from the high versus low analyses, and therefore we also conducted linear and nonlinear dose–response analyses. In the nonlinear dose–response analysis there was a 12 % reduction in the relative risk of gestational diabetes mellitus for 30 MET-hours of pre-pregnancy physical activity per week compared to no activity, and a 30 % reduction in risk for 7 h of pre-pregnancy activity per week compared to no activity, and for physical activity during pregnancy there was a 37 % reduction in the relative risk for 7 h of activity per week compared to no activity. In analyses of combined pre-pregnancy and early pregnancy physical activity there was a suggestive 40 % reduction in risk among women who were physically active before pregnancy, but no association among women who were active only during early pregnancy, while there was a 59 % reduction in relative risk for women who were physically active before and during pregnancy compared to women who were inactive in both periods. An interesting finding of the present meta-analysis is that some of the associations were stronger for pre-pregnancy physical activity than for physical activity during pregnancy, which is similar to the findings in our meta-analysis on physical activity and preeclampsia [53]. This is not unreasonable as the time available to intervene, and the degree of physical activity that is achievable is more limited in pregnancy. In addition, the physiologic insulin-resistance in pregnancy could attenuate the effects of physical activity during pregnancy. However, as this was not entirely consistently observed across the various physical activity exposures, further studies with both prepregnancy and early pregnancy physical activity measures are needed for further clarification.

The results from this meta-analysis provide further support for the hypothesis that physical activity decreases the risk of gestational diabetes mellitus and are consistent with two previous meta-analyses which also found inverse association [41, 43], but not with a third [42]. However, in contrast to the previous meta-analyses we further quantified the association between physical activity and gestational diabetes mellitus risk by conducting linear and nonlinear dose–response analyses and conducted more detailed analyses of different domains of activity. Such analyses are important to guide recommendations to pregnant women with regard to the amount and types of physical activity that may reduce risk, as well as to inform future physical activity interventions that aim to reduce gestational diabetes risk.

Our meta-analysis may have some limitations that could have affected the results. It is possible that the observed inverse association between physical activity and risk of gestational diabetes mellitus risk could be due to unmeasured or residual confounding. Higher physical activity is associated with other healthy behaviors including lower prevalence of overweight and obesity and healthier diets with higher dietary fiber intake, and lower red and processed meat intake. However, many of the studies included in this meta-analysis adjusted for confounding factors such as age, BMI, and energy intake and the associations persisted in subgroup analyses by stratification by adjustment for these confounding variables. We found no evidence of heterogeneity between these subgroups with meta-regression analyses. Any further studies should adjust for more dietary confounding factors. There was moderate heterogeneity among studies of leisure-time activity before pregnancy (I2 = 47 %), but when stratified by number of cases there was no heterogeneity among studies with 200 or more cases (I2 = 0 %), although there was no between subgroup heterogeneity with meta-regression analyses. Among studies of leisure-time physical activity during pregnancy there was low heterogeneity (I2 = 17 %). For some other subtypes of activity there was high heterogeneity, but there were not enough studies to conduct subgroup and meta-regression analyses, or to test for publication bias and conduct other sensitivity analyses for the subtypes of physical activity. Not all the studies included in the high versus low analysis could be included in the dose–response analyses because results either were reported using a different underlying measure than others or because there was only a dichotomized categorization of physical activity. Any further studies should report the results for 3–4 or more categories of activity and use a measure that allows for combination with other studies, preferably in hours/week and/or MET-hours/week. In addition, some of the randomized trials may have had a dose of physical activity that was too low (1–2 h/week) to observe an association, particularly because of the possibility of contamination of the control group. In subgroup analyses of the randomized trials there was some suggestion of a stronger association among studies with an estimated duration of >2–3 and >3 h/week of activity than among those with an estimated duration of 1–2 h/week. In addition to the activity level being too low, the compliance with the exercise interventions may have been poor in some studies, which may have attenuated any associations. A challenge for future intervention studies will be to increase both the duration of the activity and the compliance with the interventions. In addition, much larger studies are needed as few of the randomized trials found statistically significant reductions in risk individually. As a meta-analysis of published studies publication bias may also have affected the results. There was evidence of publication bias in the analysis of leisure-time physical activity during pregnancy and risk of gestational diabetes, thus it is possible that this could have led to an exaggerated summary estimate.

Gestational diabetes and type 2 diabetes share many pathophysiological features and our previous findings of an inverse association between physical activity and type 2 diabetes [10] support the current results on gestational diabetes. Interestingly the strength and the shape of the dose–response relationships observed between leisure-time physical activity in relation to gestational diabetes and type 2 diabetes is similar with an approximate 20–30 % reduction in the relative risk observed with 5–7 h/week compared to no activity, and with a steeper reduction in the risk at lower levels of activity. Most likely several of the same mechanistic pathways are involved in explaining these results for gestational diabetes mellitus as for type 2 diabetes. Physical activity reduces adiposity [59] and has been associated with lower gestational weight gain [6062], which is strongly related to increased risk of gestational diabetes mellitus [63]. Overweight and obesity increases inflammation, flux of free fatty acids, and may thereby lead to insulin resistance [64], which in turn increases the endogenous glucose production in the liver, while physical activity may counteract some of these adverse effects [25, 65]. In this analysis, we found that associations were 14–25 % weaker when adjusted for BMI compared with when not adjusted for BMI, suggesting that approximately 14–25 % of the association may be explained by reduced adiposity, and this is comparable with our previous meta-analysis on physical activity and type 2 diabetes where associations were 20–30 % weaker when adjusted for BMI compared with analyses not adjusted for BMI [10]. However, a clinically significant reduction remained after adjustment for BMI, suggesting that other mechanisms may be implicated. We also found a significant inverse association between pre-pregnancy physical activity and abnormal glucose tolerance. Physical activity increases glucose uptake and glycogen synthesis through increased glucose transport by the GLUT4 glucose transporters and increased activity of glycogen synthase [65]. Exercise increases the secretion of interleukin-6 (IL-6) from muscle cells, which has anti-inflammatory effects through inhibition of TNF-α and IL-1β, and reduces TNF-induced insulin resistance [65]. Physical activity has been associated with lower levels of total cholesterol, triglycerides, leptin, and improved glycemic control and reduced insulin resistance in pregnant women [25, 6668]. The biological mechanism explaining the potential nonlinear association observed between physical activity and gestational diabetes mellitus is not clear and needs further study. In previous meta-analyses we have also observed similar nonlinear associations between physical activity and preeclampsia [53] and type 2 diabetes [10], with steeper reductions in risk at lower levels of activity, however, for all these three conditions reductions in risk have been observed up to between 5 and 7 h of activity per week. Given the similarities in the underlying pathophysiological features of preeclampsia, type 2 diabetes and gestational diabetes mellitus (e.g. insulin resistance, obesity), it is possible that some of the underlying mechanisms that may be common for all three conditions also partly could explain the nonlinearity. However, we can also not rule out the possibility that the nonlinearity partly could be due to few data points at higher levels of physical activity.

Our meta-analysis also has several strengths. Because we only included cohort studies and randomized trials, recall bias is not an issue and there is less potential for selection bias. We conducted dose–response analyses to investigate whether specific levels of physical activity were associated with gestational diabetes mellitus risk and found evidence of a dose–response relationship. Because of the increased sample size we had higher statistical power to detect an association than any individual study, however, most of the included studies had a moderate or small sample size. The quality of the cohort studies were in general high (mean scores of 7 out of 9 points), however, half of the randomized trials were at high risk of bias, and most of the remaining were at unclear risk of bias. Although, there was no heterogeneity by study quality scores or the risk of bias assessment when stratified, any future studies should improve the conduct and reporting of the results to provide better epidemiological evidence on physical activity and gestational diabetes mellitus.

Further large cohort studies and intervention trials are needed to conclusively establish the association between physical activity and specific types and intensities of physical activity and gestational diabetes, and for updated dose–response analyses it would be good if future studies could report results on a common scale, for example in hours/week and/or MET-hours/week. Any further intervention trials should aim to use a high enough dose or frequency of physical activity to be able to observe an effect (e.g. at least 2–3 h or more per week) and including multiple arms with different levels of physical activity might provide firm conclusions with regard to the dose–response relationship.

In conclusion, our results suggest that higher physical activity is associated with reduced risk of gestational diabetes mellitus. Any additional studies should assess the association between specific subtypes, amounts and intensities of physical activity and risk of gestational diabetes mellitus, adjust for more confounding factors and improve the reporting of the data.