Background

Metformin is an old, low cost, oral antidiabetic drug used to treat type 2 diabetes mellitus, gestational diabetes mellitus (GDM) and polycystic ovary syndrome (PCOS). Metformin passes the placental barrier, thereby potentially affecting the foetus [1]. A recent large exploratory case-control study on metformin-use in first trimester of pregnancy found no evidence of teratogenicity [2].

During the last four decades, prescription of metformin in pregnancy has increased, reflecting the increased prevalence of overweight, obesity and GDM [3]. Women with PCOS have increased risk of pregnancy complications, such as GDM, preterm delivery, preeclampsia, hypertension, and small for gestational age babies [4,5,6,7,8,9]. Our research group has recently reported on the reduced incidence of late miscarriage and preterm deliveries in an individual patient data meta-analysis of three metformin vs. placebo-controlled RCTs: the Pilot study, the PregMet study and the PregMet 2 study [10,11,12]. Interestingly, in all three studies, metformin exposed offspring had a larger head, both at gestational age 32 weeks and at birth [13]. Follow-up of the children from the PregMet study also showed that metformin-exposed children at 4 and 8 years og age, had higher BMI, waist circumference, waist-to-height ratio and prevalence of obesity than those exposed to placebo [14, 15].

Recent research suggests higher risk of developmental delay and neurodevelopmental disorders among children born to mothers with PCOS, due to potential exposure to a hyperandrogen environment and insulin resistance in utero [16,17,18]. Bell et al. found that maternal PCOS status was associated with developmental delay in offspring followed up through 3 years of age [16]. In a population-based study of register data in Sweden, Kosidou et al. found an increased risk of autism spectrum disorder and attention deficit/hyperactivity disorder (ADHD) in children of mothers diagnosed with PCOS, supporting the hypothesis of potential negative neurodevelopmental effects of a hyperandrogen environment in utero [17, 18].

In a follow-up of a Finnish RCT on pregnant women with GDM, Ijäs et al. reported no difference in motor, social and language development at 18 months of age between metformin- and insulin-exposed offspring [19]. Two studies have examined neurodevelopmental outcome in 2-year-old children born to mothers with GDM treated with metformin or insulin during pregnancy, reporting no difference between the two groups [20, 21]. In the present study, the primary aim was to explore the cognitive function of children born to mothers with PCOS, exposed to metformin vs. placebo in utero. The secondary aim of the study was to investigate associations between maternal androgen levels in pregnancy and subsequent cognitive function of children.

Methods

The present study (CogMet study) is a follow-up study of children whose mothers participated in “the Pilot study” and “the PregMet study”. Both were randomized, placebo-controlled, double-blinded studies, assessing the potential of metformin to reduce pregnancy complications in women with PCOS.

The pilot study

Between 2000 and 2003, 40 women aged 18–40 with PCOS and with singleton pregnancies were included between 5 and 12 weeks of gestation at the Trondheim University Hospital. Eighteen were randomized to metformin (1700 mg daily) and 22 to placebo throughout pregnancy [10].

The PregMet study

Between 2005 and 2009, 257 pregnant women aged 18–45 years with PCOS according to the Rotterdam criteria [22] were included with 274 singleton pregnancies at 5–12 weeks of gestation, at 11 study centers in Norway. Seventeen women participated twice. Participants were randomized to metformin (2000 mg daily) or placebo throughout pregnancy, and 80% had a self-reported intake of > 85% of the tablets [11].

The CogMet study

In all, 292 children were eligible, and parents were invited to allow their children to participate in a follow-up with cognitive testing; 37 children from the Pilot study and 255 children from the PregMet study (Fig. 1). A total of 93 children were tested between April 2014 to June 2016, 52 in the metformin group, and 41 in the placebo group. The staff collecting the data and performing the cognitive testing were blinded to original randomization allocation. Information on gender, age and socio-economic status was obtained by standardized interviewer-administered questionnaires. Weight was measured with a digital weighing scale. Height was measured with a stadiometer (Seca, Germany). Head circumference was measured with a measuring tape over the most prominent part of the occiput, and just above the supraorbital ridge.

Fig. 1
figure 1

Flow-chart of inclusion and randomization to either metformin or placebo in the baseline studies (PregMet study and Pilot study), and the present study (Cogmet study)

Cognitive assessment

Intelligence quotient (IQ) is a measure of cognitive function. Intellectual disability is considered to be a score on standardized intelligence tests that is more than two standard deviations below the population mean in combination with reported difficulties in daily life functioning, according the Diagnostic and Statistical Manual of Mental Disorders (DSM-V) [23]. This equals an IQ-score below 70. An IQ score within the normal range can be defined as a score within one standard deviation (SD) of the population mean (100), between 85 and 115 points. However, scores just below the normal range have been described as Borderline Intellectual Function in the DSM (IQ scores between approximately 70–85) and could also affect the individual’s ability to function in daily life [24].

The cognitive assessment was performed by three students specializing in clinical psychology, under the supervision of a clinical psychologist (HFØ) trained in cognitive testing. Testing took place during a single session with a fixed order of tasks. Age-appropriate Wechsler tests of cognitive ability were used for assessing full scale IQ (FIQ), IQ indices and subtest scores. The Wechsler tests are considered the gold standard in assessing intelligence, where different versions exist for preschool and school aged children. The Wechsler Preschool and Primary Scale of Intelligence version III (WPPSI-III) was applied for children under the age of six, and the Wechsler Intelligence Scale for Children version IV (WISC-IV) was applied for children above the age of six.

The WISC-IV consists of 15 subtests divided into scales that provide Verbal IQ and Performance IQ, respectively, in addition to FIQ. The verbal subtests are further divided into two indices: Verbal Comprehension and Working Memory. The performance scale tasks are divided into the Perceptual Organization Index and the Processing Speed Index. WPPSI-III consists of 14 subtests that are divided into the same scales and indices as the WISC-IV. The WPPSI-III and the WISC-IV group means are identical in the overlapping age-bands, and their respective FIQ scores are highly inter-correlated (i.e. r = 0.89) [25].

Androgen analyses

In the original RCTs maternal blood was drawn at the time of study inclusion, and in gestational weeks 19, 32 and 36. Dehydroepiandrosterone sulphate (DHEAS), sex hormone binding globulin (SHBG), and testosterone in maternal venous serum were analyzed. The DHEAS and SHBG levels were analyzed in single measurements by enzyme-linked immunosorbent assay techniques with the reagents and calibrators supplied by the manufacturer (DRG Instruments GmbH). Testosterone and androstenedione were analyzed by liquid chromatography-tandem mass spectrometry (LC-MS/MS). In the analysis, plasma samples were extracted by supported liquid extraction and the eluate evaporated and reconstituted before analysis on LC-MS/MS. The intra-assay and inter-assay coefficients of variation (CV) were 3.6 and 4.4% for DHEAS, and 2.9 and 11.4% for testosterone, respectively. The intra-assay CV was 6.6% for SHBG, and the inter-assay CV was 5.5% in fetal samples (lower values) and 24.1% in maternal samples. The free testosterone index (FTI) was calculated as (testosterone/SHBG) × 100 [26].

Statistical analysis

Data entry, management and analyses were performed at the Department of Clinical and Molecular Medicine at the Norwegian University of Science and Technology on IBM SPSS Statistics version 22.0 (IBM, SPSS inc USA, Chicago IL). Analyses were conducted using IBM SPSS Statistics version 25 (IBM, SPSS inc USA, Chicago IL). Differences between the groups were compared by independent t-test for continuous variables or the chi-square test and Fisher’s exact test for categorical variables. Linear regression analyses were used to compare continuous variables adjusted for parental education. Normality of residuals was confirmed by visual inspection of qq-plots. For calculation of anthropometric z-scores at birth, Niklasson’s standard values from a large Swedish population were applied [27]. The standard is gestational age adjusted and is based on singleton fetuses without chromosome abnormalities or major birth defects. At follow-up, z-scores for length/height, weight, and head circumference were computed from Norwegian growth references. These are built on anthropometric measurements of live-born children between 37 and 42 weeks of gestation, without congenital anomalies or diseases that could affect growth, and whose parents were from Norway or northern Europe [28]. Correlation coefficients ranging 0–0.29 were considered as small, 0.3–0.59 as moderate, and 0.6–1 as high. A significance level of 0.05 was chosen, and 95% confidence intervals (CI) are reported where relevant.

Results

The response rate for the follow-up was 32% (93 out of 292). There was no difference between participants and non-participants regarding maternal baseline data, pregnancy outcomes and neonatal data (Additional file 1: Table S1). Maternal baseline data at original study entrance in early pregnancy, and pregnancy outcomes were comparable between the metformin and placebo groups (Table 1). There were no significant between-group differences in neonatal data, except larger head circumference and head circumference z-score of the newborns in the metformin group (Table 1). There was no difference in the prevalence of asthma, eczema, heart conditions, bowel disease, constipation, diarrhea, abnormal growth, recurrent infections or hospital admissions between the metformin vs. placebo exposed children. Parental education and socioeconomic status at follow-up were comparable between groups (Table 2). Mean age at follow-up was 7.7 years in both study groups (age range 5–14 years). All anthropometric measures, including Tanner stage development of the children at follow-up, were also comparable between groups (Table 2).

Table 1 Maternal baseline characteristics, pregnancy outcome and neonatal data
Table 2 Socioeconomic status and offspring characteristics at follow-up

The mean FIQ in the metformin and placebo groups were 100.0 (SD 13.2) and 100.9 (SD 10.1) respectively, which correspond to the average FIQ score in the background population. There was no statistically significant difference in mean FIQ scores between metformin and placebo exposed children at the follow-up (Table 3). Further, there was no statistically significant between-group difference in scores on the subscales: verbal comprehension, working memory, perceptual organization or processing speed (Table 3). These results did not change after adjustment for maternal/parental educational level (Table 4). None of the children had FIQ < 70 corresponding to intellectual disability. Ten children had FIQ < 85 corresponding to borderline intellectual function, 9 in the metformin and 1 in the placebo group, (p = 0.039). There were no significant differences regarding gender, prematurity, birth weight, head circumference or maternal baseline data between participants with borderline intellectual function and participants with normal intellectual function (data not shown). Baseline characteristics regarding maternal and neonatal data of children with borderline intellectual function are shown in Table 5.

Table 3 WPPSI-III/WISC-IV sum scores for metformin versus placebo, unadjusted
Table 4 Linear model of prediction of WPPSI-II/WISC-IV scores. Adjusted for parents’ educational level
Table 5 Maternal and neonatal data, children with full-scale IQ < 85

Correlation between androgen levels at different time points in pregnancy and IQ-scores of children at follow-up, are shown in Table 6. There was a significant positive correlation between FIQ and maternal FTI measured at gestational week 19 (p = 0.041), and between androstenedione measured at gestational week 36 (p = 0.049). We found no correlation between maternal testosterone, androstenedione and FTI at any time of pregnancy in any subdomain scores of WISC-IV/WPPSI (Table 6). However, all correlation coefficients were small.

Table 6 Correlation between maternal androgen levels in pregnancy and child cognitive function measured by WPPSI-III/WISC-IV sum scores

Discussion

The main finding of the present study is that metformin exposure in utero did not seem to affect mean cognitive function of children born to women with PCOS. To the best of our knowledge, this is the first follow-up of cognitive function of children exposed to metformin in utero in a placebo-controlled RCT.

The study question arose from the surprising finding of significantly larger newborn head circumference in the two RCTs, the Pilot and the PregMet studies [10, 13], which were reproduced in a third larger RCT, the PregMet2 study [12]. This led to the question of whether the larger head circumference in the metformin-exposed offspring translated to altered cognitive function in the offspring. On a group level, larger head circumference at birth is associated with better cognitive function [29, 30]. However, also autistic spectrum disorders associate positively with large head size [31]. In the general population the measures of cognitive function follow a normal distribution. The prevalence of intellectual disability (FIQ < 70) is about 1%, and intellectual disability is found to be associated with several disorders, such as autism spectrum disorders, ADHD, conduct disorders, epilepsy and sensory and motor impairments [32]. It is well established that cognitive function is dependent on both genetic and environmental factors [33, 34]. Among these are: sociocultural and family environment, education, nutrition, early stress, exposure to toxins, drugs or alcohol prenatally or in childhood, and perinatal complications [33, 34]. The potential identification of metformin as a factor that could alter cognitive function in prenatally exposed offspring is of great importance. While impaired cognitive function could represent social, behavioral and educational challenges for the individual and costs for society, good cognitive function is associated with beneficial outcome for the individual [32, 35]. Intelligence is considered a relatively stable measure throughout life [36]. In a Danish cohort-study, Flensborg-Madsen and colleagues found that birth weight and head circumference at birth was positively associated with intelligence in adulthood, and persisted into midlife [37].

At birth, metformin-exposed offspring in the CogMet study had significantly larger head circumference z-scores (Table 1). At the present follow-up, the difference in HC z-score was no longer significant (Table 2), and there was no statistically significant difference in mean IQ-scores of children in the two study groups (Table 3). However, exploring children with FIQ-score < 85, we found that there were more metformin-exposed children with borderline intellectual function compared to unexposed children. This is a surprising finding that should be interpreted with great caution due to small study size. We have no mechanistic/cellular biologic explanations for our findings and can only speculate. Metformin passes the blood-brain barrier and has been shown in experimental studies to have a neuroprotective effect in cerebral damage such as oxidative stress, blood-brain barrier disruption and inflammation [38]. Metformin has a mild and specific inhibitory effect on mitochondrial respiratory chain complex activity, switching the cell from anabolic to catabolic state [39]. Theoretically, this might affect the developing fetal brain.

Based on previous results of increased head circumference of metformin exposed newborns [13], and the results of Smithers et al. and Gale et al. of positive associations between head size and cognitive function [29, 30], we would have expected an opposite result. However, the total prevalence of individuals with FIQ < 85 in the study population, does not exceed the expected number based on an assumption of normal distribution of IQ scores. The FIQ scores had a significant positive correlation to maternal education in the metformin group.

Nonetheless, these findings point to the importance of investigating cognitive function in children exposed to metformin in utero. The secondary aim of the present study was to investigate associations between maternal androgen levels and cognitive function in children. Previous research indicates that a hyperandrogenic environment in utero might be associated with impaired development and neurodevelopmental disorders such as ADHD and autism spectrum disorder [16,17,18]. We found statistically significant, but small, positive correlations of FIQ and maternal FTI at gestational week 19, and androstenedione at gestational week 36. However, these were the only statistically significant correlations among a large number of tests, and should therefore be interpreted with caution. We did not find convincing evidence for an association between hyperandrogen environment in utero and cognitive function in children.

Strengths and limitations

The well characterized cohort of women with PCOS and the randomized, placebo-controlled study design is a strength of the study. This is the first long-term follow-up on exposure of metformin versus placebo in utero of children including cognitive assessment, thereby an important contribution to the present base of knowledge regarding long-term consequences of metformin use in pregnancy. The main limitation of the present study is the relatively low number of participants, and a low response rate of only 32% of those invited. However, those who participated in the CogMet study did not differ at baseline from those who were invited, but chose not to participate (Additional file 1: Table S1). Particularly, there is a possibility that some children with low-to-borderline intellectual function were lost to follow-up due to the total burden of challenges these children and their families experience.

Conclusion

Previously reported larger head circumference at birth in metformin-exposed children did not translate into altered mean cognitive function at a follow-up at 7.7 years (mean age). However, more of the metformin exposed children had borderline cognitive function, which is an unexpected and unexplained finding. We found no convincing evidence of an association between a high maternal androgen level and altered offspring cognitive function. Our findings are important contributions to the current knowledge of long-term consequences of metformin use during pregnancy. More research on larger study populations is needed.