Introduction

Developmental delay is defined as delays in the areas of speech and language, motor, social, and cognitive development [1]. The incidence of developmental delay has increased dramatically in recent decades [2, 3]. Although the estimated prevalence of developmental delay is generally 5–15% in pediatric populations [2,3,4], reported rates vary depending on the socioeconomic characteristics of the study population, case definition, and age range [5].

Excess maternal weight gain increases the risk of obstetric complications, such as gestational diabetes, hypertensive disorder of pregnancy (HDP), eclampsia, caesarean delivery, and macrosomia [6]. On the other hand, insufficient maternal gestational weight gain (GWG) and low GWG rate have been associated with adverse birth outcomes, including pre-term birth and small for gestational age (SGA) [7, 8]. The Institute of Medicine (IOM; now known as the National Academy of Medicine) developed GWG guidelines in 1990 and later updated them in 2009 [9]. The IOM guidelines incorporate the World Health Organization (WHO) categories of maternal body mass index (BMI) and recommend lower GWG for obese women. Japan has not formally adopted the IOM guidelines, having instead developed an original set of rules for pregnancy weight management owing to limited ethnic diversity (Supplemental table S1) [10]. The Japanese guidelines are stricter for weight gain primarily to reduce obstetric complications. One large limitation of the guidelines, however, is that they lack validation from a large national study. An emerging problem in Japan is the increase in underweight pregnant women [11, 12]. Among Japanese pregnant women registered in the Japan Society of Obstetrics and Gynecology registry system, the prevalence of underweight pre-pregnancy BMI was 18.2%, versus 5.3% in the USA [12, 13]. Such a condition has been associated with an augmented risk of pre-term birth and SGA [7, 8] and possibly delayed offspring development. In Japanese women, underweight may be a larger issue than obesity.

Recent reports on the longer-term risks of maternal obesity have suggested a relationship with developmental delay in early childhood, and several epidemiologic studies have found associations between maternal obesity and various neurodevelopmental outcomes [14, 15]. In contrast, there is little evidence on the early childhood effects of maternal underweight, with none on whether excess or insufficient GWG increases the risk of offspring developmental delay. We therefore conducted a large birth cohort study with the specific objective of examining the impact of maternal GWG on early neurodevelopment.

Materials and methods

Study design, population, and settings

The data used in this study were obtained from the Japan Environment and Children’s Study (JECS), an ongoing cohort study that began in January 2011 to determine the effect of environmental factors on children’s health. The target number of enrolled pregnant women was 100,000. Partners were also recruited, although their participation was not mandatory. In the JECS, pregnant women were recruited between January 2011 and March 2014. The eligibility criteria for participants were as follows: (1) residing in the study area at the time of recruitment, (2) expected delivery after August 1, 2011, and (3) capable of comprehending the Japanese language and completing the self-administered structured questionnaire in Japanese. This study was registered in the UMIN Clinical Trials Registry (number UMIN000030786). Details of the JECS project have been described previously [16,17,18]. The JECS protocol was reviewed and approved by the Institutional Review Board on Epidemiological Studies of the Ministry of the Environment (ethical number 100910001) as well as by the ethics committees of all participating institutions. The JECS was conducted in accordance with the Helsinki declaration and other nationally valid regulations and guidelines. Written informed consent was obtained from each participant.

The present study was based on the “jecs-an-20180131” dataset released in March 2018 containing information on 98,255 mothers who had a singleton live birth, including 50,563 with the fathers’ registration. Specifically, we focused on questionnaire data regarding developmental screening as self-described by mothers when their child was 12 months old. The screening tool was the Ages and Stages Questionnaire, third edition (ASQ-3) [19]. Maternal medical information, additional pregnancy details, and medical history were collected from subject medical record transcriptions for adoption as other covariates.

Data collection

Information on socioeconomic status, smoking habit of the mother and partner, and maternal alcohol consumption during pregnancy was collected during the second/third trimester of pregnancy (T2) by means of self-reported questionnaires. Details on a parental history of neurodevelopmental disorders, epilepsy, and mental disease were also collected from T2 questionnaires as described by the mother and partner. Maternal anthropometric data before and during pregnancy, complications and medication during pregnancy related to HDP, diabetes mellitus/gestational diabetes mellitus (DM/GDM), and neonatal information was gathered from medical records. Pre-pregnancy BMI was calculated according to WHO standards as body weight (kg)/height (m)2 and categorized as underweight (BMI < 18.5), normal weight (BMI 18.5–24.9), overweight (BMI 25.0–29.9), and obese (BMI ≥ 30).

Outcomes

The main outcomes of interest were ASQ-3 domain scores at the age of 12 months. The ASQ-3 is a parent-reported comprehensive first-level developmental screening tool for children aged 1–66 months with 30 items in five domains: communication, gross motor, fine motor, problem-solving, and personal–social skills. Each item describes a skill, ability, or behavior to which the parent responds “yes” (10 points), “sometimes” (5 points), or “not yet” (0 points). Parents sometimes omit items when they are unsure of how to respond or because they have concerns about their child’s performance of the item. ASQ-3 scores were not calculated if there were three or more omitted items in a given domain. In the case of one or two omitted items, an adjusted total domain score was calculated by adding the averaged item score either once for one omission or twice for two omissions. The score calculated for each domain was categorized as normal development (above cutoff) or referral zone (below two standard deviations). The manual for the original ASQ recommends that a child be considered as screen positive if his/her score falls below the referral cutoff in any one of the five domains [19].

Participants with established risk factors of developmental delay, such as neonatal asphyxia, and physical abnormality at birth, including infection, respiratory distress, congenital abnormality, hearing disability, and chromosomal abnormalities, were excluded to investigate the effects of maternal GWG on neurodevelopment in infants without obvious underlying disease during the neonatal period (Fig. 1).

Fig. 1
figure 1

Case selection flowchart

Exposure

GWG in this study was subdivided as below, within, or above the reference values of the 2009 IOM guidelines widely used throughout the world. The IOM guideline ranges for total GWG based on pre-pregnancy BMI are as follows: 12.7–18.1 kg for underweight women, 11.3–15.9 kg for women of normal weight, 6.8–11.3 kg for overweight women, and 5.0–9.1 kg for obese women (Supplemental table S1).

Covariates

The covariates in our models were selected a priori based on previous literature and biologic plausibility [20,21,22,23,24]. We estimated the effects of GWG after adjusting for demographic data including maternal age, pre-pregnancy BMI, parental smoking habit, maternal drinking habit, maternal highest level of education, annual household income, parental history of neurodevelopmental disorders, epilepsy, and mental disease, as well as obstetric and medical variables such as parity, means of pregnancy (including spontaneous pregnancy and assisted reproductive techniques, such as ovulation induction and artificial insemination or in vitro fertilization), use of folic acid supplements, complications during pregnancy (including DM/GDM, HDP, and intrauterine growth restriction), means of delivery, birth weight, gender, method of feeding, and neonatal jaundice in the newborn period requiring treatment such as phototherapy and exchange transfusion. Parental medical history of neurodevelopmental disorders included attention deficit hyperactivity disorder, learning disability, autism, Asperger’s syndrome, pervasive developmental disorder, and others. Parental history of mental disease included depression, schizophrenia, and anxiety disorder. Intrauterine growth restriction was defined as estimated fetal weight less than −1.5 standard deviations of standard weight based on gestational age in Japan.

Statistical analysis

Distribution normality was confirmed by the Kolmogorov–Smirnov test. Data are expressed as the mean ± standard deviation or the median (interquartile range) depending on whether they are normally distributed or not. Possible differences in maternal age, pre-pregnancy BMI, GWG, gestational age, and birth weight between subjects with normal development and developmental delay were assessed by the unpaired t-test or the Mann–Whitney U test based on the presence or absence of normal distribution, respectively. We also categorized continuous and ordinal variables, such as maternal age (< 35 or ≥ 35 years), pre-pregnancy BMI, GWG (below, within, or above), annual household income (< 4,000,000, 4,000,000–7,999,999, or ≥ 8,000,000 JPY), gestational age (< 37 or ≥ 37 weeks), and birth weight (< 1500, 1500–2499, or ≥ 2500 g). Fisher’s exact tests or chi-square tests were performed to compare covariates between groups stratified by category as well as by the presence of developmental delay. Additionally, differences in the scores of each domain among the three GWG groups were assessed by one-way repeated measures of analysis of variance (ANOVA) followed by post hoc (Bonferroni) testing. We employed multiple logistic regression models to investigate developmental delay at 1 year as the dependent variable in association with maternal GWG. Infants below and above the cutoff for each domain were categorized as “delayed” and “normal,” respectively. GWG was subdivided as below, within (reference), or above IOM guidelines. The models were adopted to calculate adjusted odds ratios (ORs) and their 95% confidence intervals (CIs) controlling covariates, as described above. Spearman’s rank correlation coefficient was used to check for multicollinearity of covariates. The variable of gestational age was excluded from the covariates because it was multicollinear with birth weight. Hosmer–Lemeshow testing was used to assess the goodness of fit of the models. We also analyzed the subjects without registered fathers to evaluate for possible selection bias.

All statistical analyses were performed using SPSS statistical software version 27 (SPSS Inc., Chicago, IL). All tests were two-tailed, and P-values of less than 0.05 were considered to indicate statistical significance.

Results

A total of 30,694 mothers with singleton live births and partners who completed the JECS questionnaire were available for analysis (Fig. 1). According to the pre-pregnancy BMI categories, the prevalence of underweight, normal weight, overweight, and obese mothers was 15.4% (4730), 74.2% (22,761), 8.1% (2485), and 2.3% (718), respectively. The prevalence of mothers below, within, and above the IOM-based GWG guidelines was 60.4% (18,527), 32.1% (9850), and 7.5% (2317), respectively. There were 10,943 participants (35.7%) who were outliers in at least one ASQ-3 domain (Table 1 and Supplemental table S2).

Table 1 Demographic characteristics of participants with or without developmental abnormality

Table 1 and Supplemental table S2 summarize the participants’ characteristics and offspring outcomes for developmental delay. There were significant differences in the rates of the GWG groups. We observed significant differences between the normal development and developmental delay groups for demographic categories including maternal age, maternal educational level, annual household income, parental smoking status, and maternal history of epilepsy (Table 1). Significant differences were also seen in such perinatal categories as parity, means of pregnancy of current birth, maternal use of folic acid supplements, HDP, mode of delivery, gestational age, birth weight, gender, method of feeding, and neonatal jaundice (Supplemental table S2).

ASQ-3 domain classifications and proportions of a risk of developmental delay at 12 months according to maternal GWG are shown in Table 2 and Fig. 2. Chi-square analysis revealed significant differences in the prevalence of developmental delay in the communication, gross motor, fine motor, problem-solving, and personal–social domains among maternal GWG groups. ANOVA showed that the scores for every ASQ-3 domain were significantly lower in the GWG below guidelines group than in the GWG within and above guidelines groups.

Table 2 ASQ-3 domain scores and proportions at risk of delay according to maternal gestational weight gain
Fig. 2
figure 2

Comparison of ASQ-3 scale scores according to maternal gestational weight gain. *P < 0.001, P < 0.01, P < 0.05

The regression models for all domains demonstrated good fit in Hosmer–Lemeshow testing. In multivariate logistic regression analysis after adjustment for covariates, compared with ideal GWG, GWG below guidelines was significantly associated with a higher incidence of developmental delay in the communication (OR 1.21, 95% CI 1.09–1.34), gross motor (OR 1.14, 95% CI 1.05–1.24), fine motor (OR 1.13, 95% CI 1.04–1.24), problem-solving (OR 1.09, 95% CI 1.01–1.18), and personal–social (OR 1.15, 95% CI 1.07–1.24) domains (Table 3). For every 2.3 kg (5 lb) of GWG, the risk of abnormalities was reduced by 4–9% in each domain of ASQ-3 (communication, OR 0.91 [95% CI 0.88–0.94]; gross motor, OR 0.96 [95% CI 0.94–0.98]; fine motor, OR 0.94 [95% CI 0.91–0.96]; problem-solving, OR 0.95 [0.93–0.97]; personal–social, OR 0.94 [0.92–0.96]) (Table 3).

Table 3 Odds ratios and 95% confidence intervals for the association between gestational weight gain (GWG) categories and developmental delay in ASQ-3 domains

Across BMI categories, GWG below guidelines tended to associate with a higher risk of developmental delay (i.e., OR > 1.0) in ASQ-3 screening than did GWG within guidelines (Fig. 3A). In contrast, GWG above guidelines often tended to associate with a lower risk of developmental delay across domains as compared with GWG within guidelines (Fig. 3B).

Fig. 3
figure 3

Odds ratios (ORs) for the association between maternal gestational weight gain (GWG) below and above guidelines with developmental delay in ASQ-3 domains according to the pre-pregnancy body mass index (BMI) categories. ORs are shown for the association between GWG below (A) and above (B) guidelines with developmental delay in ASQ-3 domains. The reference group is mothers with the recommended weight gain in each category of pre-pregnancy BMI. These ORs were adjusted for maternal age, pre-pregnancy BMI, parental smoking habit and maternal drinking habit during pregnancy, maternal highest level of education, annual household income, parental history of developmental disorders, epilepsy, mental disease, means of pregnancy, use of folic acid supplements, complications during pregnancy (including DM/GDM and HDP), intrauterine growth restriction, gender, birth weight, method of feeding, and neonatal jaundice

Lastly, we analyzed the 24,823 subjects without registered fathers. Supplemental table S2 shows the characteristics of the normal development and developmental delay groups. We observed a significant difference in the proportion of GWG categories between the groups similar to that in the main analysis (Supplemental table S3). Multivariate regression analysis also revealed significant associations between GWG below guidelines and the incidence of developmental delay in all five domains. For every 2.3 kg (5 lb) of GWG, the risk of abnormalities was reduced by 5–11% in each domain of ASQ-3 (Supplemental table S4).

Discussion

We herein describe the first large-scale nationwide birth cohort study in Japan to clarify the impact of insufficient maternal weight gain during pregnancy on offspring neurodevelopment at 12 months. Across pre-pregnancy BMI categories, the association was particularly significant in mothers with a lower pre-pregnancy BMI.

In this Japanese nationwide birth cohort study, the prevalence rate of screen positive measured by ASQ-3 at 12 months of age for communication, gross motor, fine motor, problem-solving, and personal–social domains was 6.8%, 13.5%, 9.7%, 15.0%, and 16.5%, respectively. However, the prevalence of developmental delay can differ according to demographic status and underlying disease [4, 5, 20,21,22]. Several perinatal risk factors of developmental delay have been reported, including pre-term birth, perinatal maternal mental health, and maternal educational level [20,21,22]. Relationships between maternal obesity during pregnancy and poor pregnancy results have also been described [6, 7, 14, 15]. The number of underweight pregnant women in Japan is on the rise [11, 12, 25, 26], possibly since the ideal body shape of young women is becoming thinner [27] in addition to strict GWG management to facilitate delivery without obstetric complications [28]. The obesity classification and GWG recommendations used in Japan differ considerably from those prescribed by the IOM (Supplemental table S1) [10]. Several recent Japanese studies showed that underweight women carried a higher risk of adverse birth outcomes, such as pre-term birth and SGA [25, 26, 29]. However, they did not assess subsequent neurodevelopment in infants of underweight women. A Swedish cohort study investigating the association between maternal GWG and risk of offspring autism spectrum disorder (ASD) supported our findings, whereby an elevated risk of ASD was observed for both insufficient and excess GWG [30]. They also suggested that maternal undernutrition during pregnancy contributed to the risk developmental abnormality.

It is uncertain why insufficient GWG may cause neurodevelopmental disorders. One reason is that malnutrition may restrict fetal brain growth. In Japan, total calorie intake among pregnant women was far below nationally recommended levels [31, 32]. Maternal dietary quality is of critical importance since specific nutrients are required during sensitive or critical periods of fetal development [33]. Folic acid has been recognized as necessary for neural tube development [34]. Iron is the most common nutrient deficiency during pregnancy and is necessary for myelination and the development of the frontal cortex and basal ganglia [35]. The studies on Japanese pregnant women mentioned above reported that the proportions of carbohydrates and lipids in total calories were respectively lower and higher than those required by pregnant women [31, 32]. Sussman et al. suggested that prenatal exposure to a carbohydrate-restricted diet, such as recently popular ketogenic diet programs, influenced not only offspring neuroanatomy such as brain structure and volumetric change [36], but also behavioral alterations that included reduced susceptibility to anxiety and depression and elevated hyperactivity in adult mouse offspring [36, 37]. Indeed, optimal diet and weight gain guidance for underweight women of child-bearing age appear critical.

It is important to determine whether neurodevelopmental evaluations at 12 months are clinically valid for subsequent diagnosis. In one study longitudinally comparing child ASQ-3 domain screening results based on cutoff scores, the vast majority (88.9–96.7%) received the same categorization results at 9, 18, and 24 months of age [38]. Other studies have provided evidence on the concurrent validity of the ASQ-3 and the clinical diagnosis of developmental delay, as well as on the reliability of the ASQ-3 in a multiethnic population [39,40,41]. However, the number of children who were screen positive (i.e., failed at least one of the five domains) in this study was high at 35.7%. This rate varies among age, developmental area, and country at 13–48% [38,39,40,41,42]. One report that evaluated the validity of the Japanese translation of ASQ-3 suggested an alternative deficit criterion of failure in at least two domains [43]. In the present investigation, the majority of screen-positive children had a failure in one domain, which could have been an overestimate; to verify this, the study cohort will be followed until the age of 13 years.

A strength of this investigation was that not only maternal, but also paternal history of neurodevelopmental problems was adjusted for as covariates. Genetic influences could be larger than those of a shared environment on the incidence of neurodevelopmental disorders [23, 24]. Since selection bias might have been produced by excluding the subjects without father registration, we also analyzed the group without father registration to assess this possibility. GWG below guidelines was significantly associated with a higher incidence of developmental delay than in the main analysis, although paternal medical history was not adjusted as a covariate in this subpopulation (Supplemental table S3).

This study has several limitations. First, the data regarding developmental scores as measured by ASQ-3 were collected from parental self-reported questionnaires and therefore subjective. Second, as data on abnormalities were evaluated at 12 months of age, no neurodevelopmental disorders diagnosed afterwards were included. Third, the large attrition rate of either unpaired participants or those not completing the ASQ-3 questionnaire may have constituted selection bias; we cannot conclusively rule out the possibility of underreporting the incidence of developmental disorders. Fourth, the parental histories of neurodevelopmental disorders, epilepsy, and mental disease were also collected from self-reported questionnaires. Therefore, these answers might not have conformed to diagnostic criteria or ICD coding. Finally, the participants of this study contained a large group of underweight mothers, which was representative of the Japanese population [12]. Therefore, although the analysis of obesity and/or excessive GWG may have been inadequate, this study provides valuable and unique research that is impossible in other countries.

Despite the above limitations, this is the first investigation using a large dataset from a Japanese nationwide birth cohort study to examine the independent influence of insufficient maternal GWG on offspring’s neurodevelopment that controlled for confounders identified by previous reports including birth weight. This study indicates a need to reconsider the optimal BMI and GWG for women desiring pregnancy not only in Japan, but also in other developed countries.