Background

Hyperglycaemia during pregnancy is a growing problem globally and is associated with several long- and short-term adverse outcomes for the mother and offspring [1,2,3,4,5]. In particular, maternal hyperglycaemia induces foetal hyperinsulinemia with enhanced foetal growth and increased risk of foetuses being born large for gestational age (LGA). The rate of macrosomia, neonatal hypoglycaemia and caesarean delivery increase linearly with increasing levels of maternal hyperglycaemia [6]. Women with GDM are also at higher risk of gestational hypertension and preeclampsia. In the long term, metabolic diseases such as Type 2 diabetes mellitus (T2DM), cardiovascular disease and obesity are more frequent among women with prior GDM [2, 7,8,9] . In offspring GDM has been shown to be associated with adiposity and risk for prediabetes [3, 4, 10, 11]. Treatment of GDM involves dietary and physical activity advice, blood glucose monitoring, and where necessary metformin or/and insulin therapy. GDM management has been shown to reduce maternal and perinatal morbidity [12,13,14].

Internationally, the prevalence of GDM varies from 1 to 28% depending on the ethnic composition and prevalence of T2DM of the background population, local GDM screening strategies and diagnostic criteria [15,16,17]. This variation in screening and diagnostic approaches has made global comparisons of GDM prevalence and outcomes problematic. In order to progress towards a universal standard approach to GDM diagnosis, the World Health Organisation (WHO) recommended in 2013 that a 2-h 75 g oral glucose tolerance test (OGTT) should be used, with three time points blood testing (fasting, 1- and 2-h) [1]. These 2013 WHO criteria define GDM as ≥5.1, ≥10.0 and/or ≥ 8.5 mmol/L fasting, 1-h and/or 2-h thresholds. These cut-off values are based on a ≥ 75% adjusted excess risk of adverse neonatal outcomes (e.g. large for gestational age (LGA), foetal hyperinsulinemia), based on data from the Hyperglycaemia and Adverse Pregnancy Outcomes (HAPO) study involving 25,505 women from nine countries [18]. There has been debate over the merits of these new criteria as the number of women diagnosed with GDM would be expected to increase by 15–30%, raising concerns over their cost and clinical effectiveness [19,20,21,22]. While the thresholds are based upon calculated excess risk, the decision to base the diagnostic criteria on ≥75% excess risk was based upon consensus. Some countries (e.g. Canada, Norway) have reduced GDM numbers by using thresholds based upon a ≥ 100% excess risk of adverse neonatal outcomes (5.3- and/or 10.6- and/or 9.0 mmol/L, respectively) [23, 24]. Others have not based their criteria on HAPO data at all (e.g. England, New Zealand).

Older Swedish GDM criteria are based on varying cut-off values. If a fasting threshold was used (not used in one region), then ≥7.0 mmol/L was considered GDM. The 2 h criteria ranged from 9.0–11.1 mmol/L, using either capillary or venous samples. Using these criteria, 1–3% of the 115, 000 births were complicated by GDM annually. In June 2015, following a review of the available evidence, the Swedish National Board of Health and Welfare (SNBHW) recommended a move to the 2013 WHO diagnostic criteria, using venous sampling. The SNBHW made no recommendations in relation to the screening (e.g. universal vs risk factor).

With the current variation in GDM screening/diagnostic practice across Sweden [25], and the debate over the criteria, there was a recognition that the transition to the recommended new guidelines could be either by an ad hoc, or planned and structured way, to minimise clinical variation. National registers in Sweden offer a possibility to assess the impact of introducing the new GDM criteria on pregnancy outcomes and long-term health for both mother and child. A stepped wedge cluster randomised controlled trial (SW-CRCT) was the most realistic approach to evaluate this change on a national level [26,27,28,29,30]. A SW-CRCT involves randomly allocated times for clusters to introduce an intervention, allowing participants before and after any change to serve as control and intervention groups respectively. Reasons for choosing this study design were:

  1. 1.

    The intervention is a change in one clinical routine in a population, making it unrealistic to randomise by individual.

  2. 2.

    All sites were to end with adopting the national guidelines as promptly as possible, which would have been delayed using a simple case control approach.

  3. 3.

    Roll out in one go, across multiple centres was unrealistic, making a parallel cluster randomised controlled trial unrealistic. Steady adoption, site by site, was seen as the best way to make the change in the most organised manner.

This study will provide evidence to help address some of the clinical controversies over GDM diagnosis. Our aim is to test whether there is a reduction in adverse neonatal and maternal outcomes following the implementation of the new GDM criteria, and to evaluate the health economic impact on a population level.

Our specific objectives are:

  1. 1.

    To compare the rates of LGA and other adverse neonatal and maternal pregnancy outcomes before and after the change in GDM diagnostic criteria.

  2. 2.

    To compare the health care costs before and after the change and assess the net costs/savings.

  3. 3.

    To create prospective cohorts to compare the long-term outcomes in mothers and offspring exposed to the old and new diagnostic approaches.

Methods

Study design

The Changing Diagnostic Criteria for Gestational diabetes (CDC4G) in Sweden study is a national prospective, unblinded, SW-CRCT of the switch from the former Swedish diagnostic criteria to the WHO 2013 criteria for GDM i.e. to the 3 point OGTT with fasting plasma glucose, 1-h and/or 2-h diagnostic thresholds of ≥5.1, ≥10.0, ≥8.5 mmol/L, respectively (Fig. 1). Each participating centre constitutes one cluster, in which the patients continue to undergo screening for GDM following their usual approach, see Table 1. The time of transition to the new criteria is randomised and subsequently rolled out until all 11 clusters (i.e. centres) implement the new GDM regimes during 2018, see Fig. 2. The transition necessitated the introduction of a fasting glucose sample in one region, and a 1-h sample across all 11 regions. This means that a total population evaluation of the change is required, as many women with GDM diagnosed by the new criteria, will not be identified during the control period (e.g. those without the 1-h test). A sub-analysis among those with untreated and treated GDM by the WHO 2013 is also planned.

Fig. 1
figure 1

Time of preparation, set up, control and intervention in the CDC4G study during 2017, 2018 and 2019. Q = quartile, 3 months period

Table 1 List of included regions and methods for diagnosing GDM
Fig. 2
figure 2

CONSORT 2010 flow diagram of the stepped wedge cluster randomised CDC4G - trial

Sites have previously varied in how samples were taken, with some using capillary and some venous sampling. As venous sampling was used in the HAPO study, and is the approach recommended by the SNBHW and the WHO, all study centres using capillary sampling for diagnosing GDM changed to venous during the national preparation period (September–December 2017). GDM clinical management has also varied across the country and new guidelines for GDM treatment and obstetric surveillance were introduced during the national preparation period, prior to the commencement of the trial. It was agreed that no local policy should change over the trial period.

The trial was performed within the Swedish Network for Clinical Studies within Obstetrics and Gynaecology (SNAKS; www.sfog.se).

Participants

All regions in Sweden were invited to participate over the 12-month study period. Eleven (65–67,000 birth/year) out of 21 regions in Sweden agreed to participate. All women within the participating regions (including within both primary and secondary care) across Sweden are included in the study, unless they opt out from the national register. Women with pre-existing diabetes are excluded. This approach, to use de-identified data from the national registers, without individual consent, was approved by the Uppsala Regional Ethics committee on behalf of all Research Committees across Sweden. The national SPR uses an opt out approach, with less than 100 opts out yearly.

Outcomes

The primary outcome is LGA (birth weight above the 90th percentile for gestational age and sex using centile tables) based on Swedish reference population (twins, major malformations and intrauterine deaths excluded from reference population). This is the only common pregnancy outcome shown to be substantially and consistently reduced by treating GDM [32].

Secondary outcomes for neonate and mother are listed in Table 2.

Table 2 Outcomes, sampling and measurements

A healthcare cost-utility analysis will be undertaken.

Recruitment

Women will be recruited de facto by being under the care of a participating health service. Women always have an option to decline testing and, if GDM is diagnosed, to decline treatment. Informed consent will not be requested beyond the routine invitation to opt out of the SPR and the option at any time to refuse any aspect of management.

Randomisation

As the expected number of births varies across the participating centres, a stratified randomisation by centre size was conducted using two strata. The first strata was the two largest populated study centres (Stockholm and Gothenburg) which was randomised to change GDM criteria in June or August of 2018. The second strata with the nine remaining centres was changing, one centre per month, in a randomised order from February to July and September to November of 2018. The randomisation allocation was performed using computer-generated, random allocation sequences using SPSS version 22 (Armonk, NY:IBM Corp) by the study statistician at Clinical Epidemiology and Biostatistics, Region Örebro County. The randomisation was concealed from the participating centre and the list is stored in a safe at Örebro University Hospital.

The study statistician provided information on the randomisation to the study coordinator who in turn informed the relevant centre through the local PI, 2 months prior to when the centre being randomised to using the new GDM criteria. Staff at all sites were blinded until informed by their local PI of their start date to change from the control period (old GDM criteria) to the intervention period (new GDM criteria).

Monitoring of adhesion to the study protocol

Every month each centre reported their monitoring according to a checklist on the website (ref according to guidelines) to make sure that the guidelines for the CDC4G study (e.g. GDM treatment, blood sampling methods, obstetric surveillance, switch to the new criteria) were followed every step of the study. In addition, the number of women with GDM diagnosed every month during the study period is registered. Severe possible adverse outcomes such as (severe maternal hypoglycaemia (low blood glucose levels that requires assistance from another person to treat) and lactic acidosis in metformin treated women) is reported each month. Interim analysis is performed by the study coordinator every month to identify any safety or protocol breaches. The study data and safety monitoring board (DSMB) is available to determine whether any safety issues warranted termination of the trial.

Procedures

A 3-month national preparation period was organized between October to December 2017 (Fig. 2), for all sites to consolidate their local approach to screening, changing diagnostic testing to venous sampling, modifying and consolidating the agreed approach to GDM treatment and the agreed approach to obstetric management.

The trial started on the 1st of January 2018 with 1 month of baseline data collection when no randomisation occurred. Subsequently, at periodic time points called “steps”, sites changed to the new GDM criteria in a randomised order over a 10 months period. In December 2018 all sites used the new criteria. The last birth is expected in August 2019.

Booking screening

Booking screening varied between the centres. In some centres, high risk women underwent an OGTT, other centres used universal random blood glucose screening to identify those for referral to OGTT, see Table 1. The OGTT thresholds used for overt diabetes in pregnancy are listed in Table 1.

The oral glucose tolerance test (OGTT)

Throughout the study period, a 2-h 75 g OGTT was undertaken following booking screening, at 24–29 weeks gestation or if there was a clinical indication (e.g. polyhydramnios, accelerated growth or high random blood glucose).

Venous sampling were used with glucose measurements performed either by the local laboratory or with point of care testing (Table 1). All the laboratories are accredited according to SS-EN-ISO/IEC 15189. As part of the accreditation the laboratories participate in inter-laboratory comparison schemes either from Equalis (https://www.equalis.se/en/) or in schemes organised by the local hospital organisation.

GDM management guidelines

GDM management (treatment and obstetrical surveillance) was the same in all centres before and after the switch. All women received a GDM information leaflet and written dietary advice in Swedish and translated into five other languages (English, Pashto, Somali, Arabic and Dari). All women diagnosed with GDM were offered lifestyle advice from a registered dietitian. Further general GDM information was provided by the midwife if lifestyle treated, and the diabetes team if treated with insulin or metformin. A local diabetes nurse or trained midwife taught women on how to undertake self-measurement of plasma glucose (SMPG). Different meters were used, but all were capable of uploading data through DIASEND® (Glooko AB, Göteborg, Sweden) which is considered to be part of the local clinical records. SMPG was recommended to occur four times per day: Fasting and 1 h after breakfast, lunch and dinner among those who are lifestyle treated and seven times per day among those treated with insulin and/or metformin (before and 1 h after meals and at night before bed).

Target values for capillary plasma glucose were < 5.3 mmol/L fasting (before breakfast), < 6.0 mmol/L before other meals, < 8.0 mmol/L 1 h after meals, and < 7.0 mmol/L before bedtime. Those with three or more values above target during 1 week were commenced on pharmacological treatment or intensified treatment if already on metformin/insulin. GDM management (treatment and obstetrical surveillance) was the same in all centres before and after the switch.

Pharmacological treatment

Metformin is accepted for treatment of GDM by the Swedish National Board of Health (https://www.socialstyrelsen.se/globalassets/sharepoint-dokument/artikelkatalog/nationella-riktlinjer/2015-4-12.pdf) and it is used mainly for overweight patients (BMI ≥ 25 kg/m2), especially where fasting glucose and basal glucose levels were elevated during the day. Written and oral information was provided regarding adverse effects and that metformin crosses the placenta. Start dose was 500 mg once daily, with increasing one tablet every third day until target glucose level was achieved.

Insulin is recommended when metformin is not expected to bring hyperglycaemia rapidly under control, or when considered inappropriate for clinical reasons or declined by the patient. If fasting blood glucose is above the target, intermediate acting insulin (NPH) is the first line of choice. Long acting analogue insulins are considered if the blood glucose targets are not reached. Rapid acting insulin analogues are used to manage elevated postprandial glucose levels. Evaluation of glucose values and titration of insulin dose are recommended twice a week initially, after which titration is performed once a week. Changes in insulin doses occurred if three or four values are above the target value over a 1-week period. Starting doses are 4–8 units depending on the glycaemia and other clinical factors (e.g. BMI).

Obstetric surveillance

All women with GDM receive written information about health and exercise and continue with the usual obstetric/midwifery care including additional intervention when indicated. Women with diagnosed GDM and lifestyle management alone received expectant care as per normal pregnancy management with foetal weight estimation at week 38 for pre-delivery assessment. For women receiving metformin or insulin treatment, additional ultrasound assessments for foetal weight estimation was performed at least 2 times during pregnancy: at weeks 28–32 and latest at 38 weeks. Induction occurs if not delivered before 40 + 6 for medically treated GDM women, and for diet treated GDM induction at the latest 42 + 0.

Data collection

Data will be collected through the SPR, a national quality registry [33]. An additional electronic case record form is created with manually entered data not otherwise collected in the SPR (e.g. neonatal blood glucose and repeat OGTT values). Follow up studies will use the National Diabetes Register (NDR) and other Swedish health care registers (SNQ, Prescribed Drug register, Statistics Sweden). Glucose monitoring and target adherence were collected through the use of DIASEND®.

First data set extraction is planned to be completed during 2020. The project coordinator and steering group will have access to the complete dataset when available.

Statistics and data analysis

Sample size and power calculation

The sample size estimation was performed by the study statistician for a stepped wedge cluster randomised design using STATA release 14 [34]. With 11 centres (clusters) participating and an intra cluster correlation (ICC) of 0.0026 a minimum sample size of 47,916 pregnant women (23,958 before change and 23,958 after change of the new GDM criteria) have 90% statistical power with a 5% significance level to detect an absolute reduction in LGA by 1.5% on a population level (from the existing 10 to 8.5%). ICC was estimated from the variation in LGA incidence in year 2012 between participating centres which varied between 7.7 and 13.3% (0.077–0.133). Assuming that the variation in LGA incidence between centres follows a normal distribution, the variation in LGA incidence were estimated as mean 0.1067 and standard deviation (SD) 0.0164. The coefficient of variation (CV) is 0.154 (SD divided by the mean) and the CV can be expressed as an ICC.

The total estimated number of inclusions (=births) during the inclusion period for the participating centres is approximately 65,000. It is not possible to predict the exact number of births during the study due to the differences in sizes of the centres. Some incompleteness (estimated 10%) of data on maternal characteristics (mainly BMI) is expected. The calculation indicates that there will be sufficient power for the main outcome measure (LGA) even if 1–2 centres would drop out.

The proposed statistical analysis methods for stepped wedge designs will be used including Generalized Estimating Equation (GEE) models or mixed effect logistic models to evaluate the primary outcome, LGA. Mixed effect logistic models will include centres as random effects, which is the cluster and the randomisation unit in this design, and the intervention and time periods as fixed effects, or time as a random effect [35]. Adjustment for time is to compensate for calendar time changes over time in LGA incidences that are not due to the intervention changing from the old to new GDM criteria, but because of potential temporal trends in LGA incidence.

As the randomisation is on the cluster level and not on the participant level, we will adjust for important prognostic predictors for LGA, such as chronic hypertension, mother’s age, BMI, country of mother’s birth and the mother’s smoking behaviour during pregnancy. Stratification with interaction tests by prognostic factors will be used to further investigate if the associations with LGA between the two diagnostic criteria are heterogeneous and the new diagnostic criteria have an even larger impact in some subgroups according to risk factors.

Because of the stepped wedge study design, some recommended design-specific analyses will be conducted such as stratification by time period to evaluate if the intervention association with outcomes are homogenous during the study time period, or heterogeneity within centres between the two interventions. The secondary adverse neonatal and maternal outcomes will be analysed using the same type of methods and considerations as for the primary outcome. A sub-analysis will be conducted comparing those with fasting glucose 5.1–6.9 mmol/L and/or 2-h glucose 8.5–8.9 mmol/L before and after the switch in criteria.

Health economic evaluation

In a first health economic analysis, increased intervention costs for new diagnostic criteria will be set against reduced healthcare costs during pregnancy and in connection with childbirth. This cost analysis consists of 1) estimation of standard cost of treatment of GDM, 2) mother’s healthcare utilization during pregnancy and the month after delivery and, 3) the infant’s health care utilization the month after birth. Data are retrieved from the SPR. Cost of health utilization is estimated based on the Swedish national cost per Patient database.

In a second analysis, the mother’s quality of life during pregnancy and after birth will also be included. It will be expressed in quality adjusted life years (QALY) based on a single item of self-rated health transformed to preference-based values using a Swedish experience based preference valuation [36]. Self-rated health data during pregnancy and after birth is retrieved from the SPR where it is already collected.

In a third analysis, long-term consequences will be modelled. Differences in incidence of LGA will be modelled on future health for the child and cost implications based on best available evidence. In the same way, the mother’s future health impact from intervention can be modelled based on medical risk markers. Cost effectiveness will be expressed in cost per QALY.

Discussion

To the best of our knowledge, the CDC4G study is the first national SW-CRCT to evaluate the implementation of the 2013 WHO criteria for GDM. There are a number of non-randomised retrospective studies comparing outcomes before and after the change in criteria. Small studies in Spain and Taipei have shown improved pregnancy outcomes switching from the American Diabetes Association (ADA) criteria to the IADPSG criteria on a population basis [37,38,39]. A non-randomised retrospective study in the USA found no benefit in outcomes [40] but substantial changes in clinical and obstetric care occurred at the same time [41]. The few randomised trials of treating GDM [12,13,14] have all studied different settings and diagnostic criteria, and a randomised study of the effect on outcomes by treatment on a population level does not exist. Sweden has a public health care system with almost 100% clinical attendance and public health registries with high coverage. With the availability of Swedish registers and possibility to link registers by the personal identification number, this setting provides a unique possibility to evaluate the impacts of the 2013 WHO criteria on pregnancy outcomes and long-term health for both mother and child.

The IADPSG recommend universal screening [18], however, in Sweden there has been a diverse approach to screening as shown in Table 1. Screening criteria at each centre did not change through the trial and should not impact on the validity of the SW-RCT. Since universal screening with both fasting and 2-h values has not been the routine; the existing screening approaches does reduce the number of women who will have received treatment for GDM, in spite of some hyperglycaemia. Furthermore, in practice, many midwives find it difficult to adhere fully to the plethora of screening criteria. Therefore, any benefit (or harms) from GDM treatment will be less than would have occurred with universal screening.

SW-CRCT: strengths and limitations

The stepped wedge randomised controlled design is particularly beneficial when the intervention is likely to do more good than harm and is mainly used to evaluate routine implementation, as is the case for the CDC4G trial. The design is practical where it is not possible to deliver the intervention to all the participants at the same time for logistical, practical or financial reasons. The stepped wedge design is also appropriate for cost-effectiveness analysis of an intervention on a population basis (i.e. when changes across a whole population/service occur), also making it suitable for the CD4G study.

Disadvantages of the stepped wedge design are difficulties in blinding the participants and those delivering the intervention since the step from control to intervention is evident for both. It is also important to note that the stepped wedge design is likely to lead to a longer trial duration than a traditional parallel design, particularly where effectiveness is measured immediately after implementation. Since the implementation of the new criteria is expected to do more good than harm and is going to continue after the study period there are no real disadvantages from the participant’s point of view. Step-wedge trials can be impacted upon by wider secular trends that can differ between the beginning and end of the study period. In CDC4G, the design has included an alignment, and run in, of all significant clinical policies before the trial commenced, and a clear directive not to introduce any service changes during the study period. A major limitation in cluster randomised trials is the need for larger numbers to overcome the ICC.

Ethics

Random allocation is performed at the cluster, not the individual level, and prior informed consent to randomisation is not feasible and will not be requested. This is justifiable on the basis that this is a SNBHW policy directive and hence management of GDM is changing as a result of the new national recommendation. It would be unethical not to evaluate the impact of the move, and the best approach for this evaluation is by conducting an RCT. Women always have the right under Swedish Law to change clinic and refuse any aspect of care.

As there is an a priori acceptance that the intervention will do more good than harm [42], rather than belief of equipoise, it may be unethical to withhold the intervention from a proportion of the participants, or to withdraw the intervention as would occur in a cross-over design.

Trial status

The study preparation period started during 2017 and data collection 1st of January 2018. The first centre was randomised to the 2013 WHO criteria 1st of February 2018 and the last 1st of November 2018. The study is ongoing during 2018 and 2019, with first data collection during 2020.