Background

Birth weight is a primary measurement and significant indicator to ensure the optimal growth, survival, and future well-being of new-borns. Deviation from normal delivery weights (2500–3999 g), such as low birth weight (LBW) (< 2500 g) and macrosomia (> 4000 g) could lead to some negative consequences on neonatal health [1,2,3]. While macrosomia may cause neonatal and maternal morbidity [4], LBW is well-documented to be one of the most contributing factors to the neonatal mortality [1]. LBW is defined as weight less than 2500 g at birth regardless of gestational age (GA) and can be caused by preterm birth or intrauterine growth restriction [5]. In this paper, LBW includes both preterm and term new-borns of appropriate for GA.

Routine and reliable estimates of foetal weight at a given GA throughout pregnancy are vital. These estimates could create evidence-based track records/analysis to assist medical practitioners to detect the signs of potential LBW during pregnancy and provide the appropriate interventions. Although a wide range of simple and advanced multivariate weight prediction models based on clinical and ultrasonic measurements has been developed, most are only based on maternal or foetal factors [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25]. Less is known about the combinations of these characteristics to estimate foetal weight during pregnancy despite the fact that birth weight is significantly associated with characteristics of both mother and foetus [1, 26].

Several models based on combined maternal and neonatal characteristics have been developed and reviewed, these existing models were mostly developed based on the information available at delivery time [27, 28]. In most developing countries, the availability of foetal biometric measurements during pregnancy is low, particularly in rural areas due to limited access to ultrasound machines and skilled personnel [29]. Westerway et al. (2000), Loughna et al. (2009), and Papageorghiou et al. (2014) have used a large number of ultrasonic measurements to develop formulas that estimate foetal biometric characteristics at a given GA [30,31,32]. These formulas then could be used to fill the foetal database gaps during pregnancy when ultrasound facility is absent.

The present research develops foetal weight prediction models based on combined maternal and estimated foetal biometric characteristics to estimate foetal weight at any given GA. The proposed models can be simply implemented in low-resource primary health care centres where ultrasound machines and trained ultra-sonographers are not always available. The predicted foetal weight will assist in the development of foetal growth charts for Indonesia. No such charts currently exist for the Indonesian population.

Methods

Study design and setting

A quantitative and analytic study based on a retrospective pregnancy cohort analysis was carried out. Unidentified secondary quantitative data were collected and analysed to (1) assess the adequacy of the existing ultrasonic models in estimating foetal biometric characteristics, (2) develop new foetal weight prediction models based on both maternal and estimated foetal biometric characteristics, (3) assess the accuracy of the proposed models in predicting foetal weight between 35 and 41 weeks of GA, and (4) carry out a comparison study between the proposed and commonly used models. The study was conducted in a primary health care centre in South Kalimantan province, Indonesia. The locality was selected because it is one of the five provinces with the highest neonatal mortality in the country [33,34,35].

Conceptual framework

Figure 1 shows the framework used in this study, along with the selected possible predictors of foetal weight estimation.

Fig. 1
figure 1

Conceptual framework for factors influencing foetal weight estimation between 32 and 41 weeks of pregnancy

Data source

Study data were sourced primarily from a paper-based pregnancy register of pregnant women who received antenatal care (ANC) services and gave birth in the selected primary health care centre from January 2013 to August 2015. Prior to delivery, GA, fundal height (FH), foetal head engagement/foetal station (FS), and recorded foetal weight estimation (EFWr) at a given GA were measured and recorded by the assigned midwives. At delivery time, actual birth weight (ABW), neonatal head circumference (HC), and neonatal abdominal circumference (AC) were also measured and recorded.

Data management

Data was recorded in Microsoft Excel and the statistical analyses were performed using Minitab version 17 and R. The ordinary least square (OLS) and robust regression (the weighted likelihood estimation) were carried out by using lm function and wle.lm function, respectively in R [36,37,38,39].

Statistical analysis

The adequacy assessment of existing ultrasonic models to estimate foetal biometric characteristics during pregnancy

The existing ultrasonic formulas to estimate foetal HC and foetal AC which were developed based on the Australian foetal biometry data (measured between 11 and 41 weeks), the UK foetal biometry data (measured between 13 and 42 weeks), and the international foetal biometry data (measured between 14 and 42 weeks or until birth) [30,31,32] (provided in Additional file 1: Table S1) were applied to estimate foetal HC and foetal AC at a given GA for Indonesian foetus (n = 127). A reliability analysis using intraclass correlation coefficient (ICC) [40, 41] was performed to assess the consistency of the ultrasonic formulas for Indonesian population. The obtained ICC values (provided in Additional file 2: Table S2) were computed by single-rating, consistency, and two-way random effects models for the foetal biometrics with three raters (different ultrasonic formulas) across 127 subjects (pregnant women). Interclass (Pearson) correlation coefficient was also analysed to assess whether there is a significant relationship between the predicted foetal biometrics and the neonatal measurements recorded at delivery time (provided in Additional file 3: Table S3).

The development of new foetal weight prediction models based on combinations of maternal and estimated foetal biometric characteristics

Bernoulli distribution with the event probability (p) of 70% was used to randomly divide our data into two sets: model development (training) data (n = 89) and model efficacy assessment (testing) data (n = 38).

Based on the training data set, simple and multivariate linear regressions were used to develop our proposed models. The best subset selection methodology together with correlation coefficient (r), coefficient of determination (R2), standard deviation (S), Mallows Cp, and variance inflation factor (VIF) were deployed to identify the most suitable subset of predictors. Analysis of variance (ANOVA) together with t-test statistics was used to simultaneously and partially confirm the significance of predictors’ contribution in the regression models. Diagnostic tests of residuals were used to confirm the validity of the regression models.

Since our aim is to investigate whether a combination of maternal and foetal factors could improve foetal weight prediction accuracy, we have utilised the most commonly recommended formulas of ultrasonic foetal measurement standards (based on GA) to predict the measurements of foetal biometrics in our local population. This prediction is one way to fill in the foetal database gaps during pregnancy in the absence of ultrasound. The estimates of these two most significant characteristics of foetal biometry, such as HC and AC were then combined with maternal FH to develop the prediction models. The idea of this combination was to evaluate whether it could improve the prediction accuracy of foetal weight.

Our delivery date in our data ranged from 32 to 41 weeks. The ultrasonic formulas were deployed to estimate foetal HC and foetal AC at the given GA for each individual patient and used to estimate the delivery weight. Therefore, the mean time between the last measurements of FH and GA as well as the last estimates of foetal HC and foetal AC and birth was assumed to be 0 days.

The efficacy assessment of the proposed models

The testing data set was used to validate and assess the efficacy of the proposed models. The potential bias due to growth between the last measurements and birth of the developed models for estimating foetal weight was assessed by calculating the mean prediction error [the average of the differences between the ith actual values of birth weight (ABWi) and the ith predicted values of foetal weight based on the proposed models (\( {EFW}_{p_i} \))]=\( \sum \limits_{i=1}^n\frac{\left({ABW}_i-{EFW}_{p_i}\right)}{n} \). The mean absolute percentage prediction error or \( MAPE=\sum \limits_{i=1}^n\frac{\left|\left(\frac{ABW_i-{EFW}_{p_i}}{ABW_i}\times 100\right)\right|}{n} \) was also calculated to represent the dispersion of the errors [42]. In addition, the median absolute percentage prediction error or \( MEDAPE= Median\ \left|\left(\frac{ABW_i-{EFW}_{p_i}}{ABW_i}\times 100\right)\right| \) was measured and used for assessing the efficacy of the models. The later measurement is more resistant to outlier distortion (due to the presence of extreme deviations) than the mean; therefore, deploying MEDAPE would eliminate the false interpretation of forecast accuracy [43].

The efficacy of the proposed models was also assessed by the number of estimates within 10% of ABW. A two independent sample t-test was used to decide if there is a significant difference between the observed or actual values of birth weight (ABW), EFWr, and estimated foetal weights based on the proposed models (EFWp). Multiple comparisons were carried out between our proposed models, eleven existing clinical models, and six existing ultrasonic models to select the most effective models for estimating foetal weights at a given GA.

Results

Out of 146 women who received ANC services and gave birth in the selected primary health care centre, 127 (87%) women met the study criteria (Fig. 2). These women delivered live singletons with normal delivery weights between 32 and 41 weeks of GA. We excluded 19 (13%) women due to incomplete information on the required characteristics listed in Fig. 2, such as no records of GA, FH, and FS (n = 3 ), GA > 41 weeks (n = 2), been referred to hospitals due to pregnancy complications (n = 6), and abnormal birth weight babies (n = 8).

Fig. 2
figure 2

Flowchart of recruitment of participants through the study

General information on the study population

Descriptive statistics on baseline characteristics of mother and new-born of the study population (n = 127) are presented in Table 1. Overall, the pregnant women were well-nourished (arm circumference = 25.5 cm) and had normal haemoglobin level (11.6 g/dl) and body mass index (24.4 kg/m2). The median age, height, weight, and FH for women were 28 years (range 16–44 years), 156 cm (range 148–176 cm), 60 kg (range 44–83 kg), and 32 cm (range 27–36 cm), respectively. The outcomes of pregnancy were in a normal average of GA (38 weeks), delivery weight (3252.8 g), birth length (50.2 cm), neonatal HC (33.5 cm), and neonatal AC (34.5 cm).

Table 1 Maternal and neonatal baseline characteristics of study population (n = 127)

The reliability assessment of existing ultrasonic formulas in estimating foetal biometrics

This section presents the results of reliability analysis among three existing ultrasonic formulas [30,31,32] listed in Additional file 1: Table S1 in predicting foetal biometrics when ultrasound facilities are not accessible.

The intraclass and interclass correlation coefficient analyses are presented in Additional files 2 and 3: Table S2 and S3, respectively. The results presented in Additional file 2: Table S2 indicate that all three formulas have excellent reliability/consistency in predicting foetal HC and foetal AC at a given GA (the obtained ICC values are 0.957 and 0.996, respectively). Therefore, either of the existing formulas can be deployed in our study population.

Additional file 3: Table S3 shows that the estimated ultrasonic HC has a significant relationship with the neonatal HC (p-value < 0.0005) based on the existing models although the relationship was weak (0.191 < r < 0.212). Meanwhile, there is no significant correlation between the estimated ultrasonic AC and the neonatal AC (0.076 < r < 0.078, p-value > 0.05). However, since the Australian standard formulas produced slightly higher interclass correlation coefficients (between the estimates of foetal biometrics and the neonatal measurements) and more estimates falling within 10% of the neonatal measurements; therefore, the ultrasonic formulas based on the Australian population will be deployed to fill the foetal database gaps and assist the development of our proposed models.

Correlation analysis

Prior to developing models, correlations between the potential predictors of foetal weight estimation based on 127 data were investigated. The correlation analysis is presented in Additional file 4: Table S4.

Additional file 4: Table S4 shows that maternal FH has a significant correlation with the EFWr and the ABW (r = 0.952, p-value < 0.0005 and r = 0.795, p-value < 0.0005, respectively). Unlike FH, GA has no significant correlation with the EFWr and ABW.

Optimal models based on the best subset selection algorithm

Deploying the best subset selection algorithm, we have summarised the models developed based on the EFWr (provided in Additional file 5: Table S5). These models were based on one, two, and three independent variables. The table also lists their corresponding R2, Mallows Cp, S, and VIF statistics.

Additional file 5: Table S5 shows that the first model incorporated only one predictor: FH. The second, third, and fourth models incorporated two predictors: FH and GA, FH and estimated foetal HC, and FH and estimated foetal AC, respectively. The last model was developed based on three predictors: FH, estimated foetal HC, and estimated foetal AC.

Overall, the developed models had equal capability in predicting foetal weight estimation (coefficient of determination between 88.3 and 88.8%). However, using Mallows Cp index and S, we concluded that Models (3) and (4) were the best fit models with the least predicting errors. Models based only on estimated foetal HC or estimated foetal AC was excluded from the analysis due to the insignificant R2. Model (5) was excluded due to the presence of severe multicollinearity (VIF > 193) (provided in Additional file 5: Table S5).

Table 2 presents the coefficients of the predictors for the chosen models together with the corresponding p-values of ANOVA, t-test statistics, and diagnostics of residuals.

Table 2 Predictor analysis of the proposed models

Table 2 shows that for each individual model, the p-value corresponding to independent predictors is significant. Since our sample size is large, statistically significant non-normality of residuals was accepted. However, the authors have deployed robust regression to find the best fit models. Unfortunately, the best fit models proposed by robust regression had slightly larger prediction errors than those selected through the best subset models. Therefore, our further analysis is carried out using the OLS regression models presented in Table 2.

The accuracy comparison of the proposed and existing models

The two most commonly used models in Indonesia for estimating delivery weight are the Johnson-Toshach and the Risanto models. Both models estimate foetal weight based on FH. However, the Johnson-Toshach formula, which is nationally well-recognised, requires additional information on the status of the FS [44].

As listed in Table 2, the first model recommended through the best selection algorithm was Model (1) which is also developed based on FH only. Therefore, the authors carried out further comparisons between the proposed Model (1) and the widely used the Johnson-Toshach [14, 15] and the Risanto models [22, 23] as well as other existing models for estimating foetal weight based only on FH (the Niswander, the modified Niswander, the Mhaskar, the Gayatri-Afiyanti, the Buchmann-Tlale, the Santjaka-Handayani, the Mongelli-Gardosi, and the Yiheyis [16,17,18, 22,23,24,25, 45]). We also compared Models (2), (3), and (4) with the existing models based on ultrasonic measurements of foetal biometrics, such as foetal HC and foetal AC (the Jordaan, the Weiner, the Hadlock 1984, and the Stirnemann [10, 42, 46, 47]). Details for the proposed and existing models are presented in Additional file 6: Table S6.

The prediction accuracy of the proposed (Models (1), (2), (3), and (4) in Table 2) and the existing models were assessed using the testing data set. The predicting errors were calculated as the mean prediction error (the average of the differences between ABWi and \( {EFW}_{p_i} \)), the MAPE, and the MEDAPE. The results are presented in Table 3.

Table 3 Accuracy comparisons between the proposed and existing models

Table 3 shows that the mean prediction errors recorded for the proposed models are significantly smaller (between − 0.2 and 2.4 g) than those recorded for other existing models. Similarly, the MAPEs and MEDAPEs recorded for the proposed models are significantly smaller (between 5.0 and 5.1% and between 4.1 and 4.2%, respectively) than those recorded for other existing models. Therefore, we concluded that our four proposed models were capable to predict estimated foetal weight with less errors compare with the existing models between 35 and 41 weeks of pregnancy. The visualisation of these multiple comparisons can be seen in Fig. 3.

Fig. 3
figure 3

MEDAPEs comparison between the proposed and commonly used models ordered by GA

Furthermore, a two independent sample t-test (provided in Additional file 7: Table S7) was also used to investigate if there is a significant difference between the observed and estimated foetal weights based on the proposed models. The results show that there is no significant difference between the observed and estimated foetal weights based on the proposed models (p-value > 0.05).

Assessing the prediction accuracy based on proportion falling between 10% of actual values

Table 3 presents the prediction ability of the proposed models, 11 existing clinical models (based on FH only), and 6 existing ultrasonic models (based on ultrasonic measurements of foetal HC and foetal AC as well as based on GA only). The table provides the total number of predictions falling within 10% of ABW.

Table 3 shows that 92% of the predicted values produced by our proposed Model (1) fall within the 10% of ABW compared with 89% for the Johnson-Toshach model and 84% of the Risanto models. However, Model (1) only uses FH to predict foetal weight, while the Johnson-Toshach model requires information on FH as well as FS. Therefore, we recommend that Model (1) be used instead of the Johnson-Toshach model.

Model (1) (based on FH only) is equally capable to estimate foetal weight as Models (2) and (4). These results imply that the inclusion of GA (which is not a biometric measurement of foetus) and estimated foetal AC do not have an impact on estimated foetal weight accuracy. Our results are in agreement with the previous study conducted by Huber (2014) [48].

Comparing the accuracy of Model (3) (based on FH and estimated foetal HC) and Model (4) (based on FH and estimated foetal AC) with the Hadlock 1984 model (based on ultrasonic measurements of foetal HC and foetal AC) [10], we concluded that both proposed Models (3) and (4) were significantly more capable in predicting foetal weight than the Hadlock model. Table 3 shows that the proportion of predicted birth weights falling within the 10% of ABW for Models (3) and (4) are more than double the proportion based on the Hadlock model.

Discussion

Our study highlights that the use of combined maternal and estimated foetal biometric characteristics can provide a reliable estimate of delivery weights between 35 and 41 weeks of GA. This result confirms the previous study that shows a significant association between birth weight and characteristics of mother and foetus [1, 26].

Both clinical and estimates of ultrasonic predictors are used in our proposed models. Maternal FH measurement was selected as one of the clinical predictors as it is one of the most recommended and accessible predictors to estimate foetal weight and monitor foetal growth during pregnancy [3, 23, 49, 50]. Although the clinical approach using FH screening had reportedly low sensitivity for detecting intergrowth and birth weight abnormalities (ranged 16–45%) [51, 52], it is a simple and inexpensive clinical activity [29, 53], especially true in rural areas where ultrasound machines and skilled personnel are not always available. The utility of FH remains an important first level screening tool, widely used during routine ANC in both high and low income settings [29] even though it had high false-negative rates for small for GA [53].

In ultrasonic settings, foetal biometric characteristics monitored during pregnancy include HC, biparietal diameter (BPD), occipitofrontal diameter (OFD), AC, and femur length (FL). These characteristics are routinely measured by ultrasound every 5 weeks after the first initial dating scan (between 8 and 14 weeks’ gestation). The standard ranges for ultrasonic measurements are (14–18), (19–23), (24–28), (29–33), (34–38), and (39–42) weeks [54] or at least once every trimester of pregnancy, i.e. between weeks 10–14 (first trimester), 20–24 (second trimester), and 30–32 (third trimester) [55].

Assessment of foetal biometric characteristics during ANC is vital to ensuring normal foetal size and safe delivery. In the absence of ultrasound facility, particularly in low-resource primary health care settings, the measurements of these characteristics are not always accessible. Therefore, a reliable prediction of these characteristics during pregnancy would be a proxy of foetal biometrics and vitally improve the quality of ANC services in monitoring foetal inter growth assessment which currently remain low due to the database gaps [56,57,58,59].

Several ultrasonic formulas to estimate the foetal characteristics at different GA have been developed [30,31,32]. The foetal HC and foetal AC are widely recognised as the most influential predictors for predicting foetal weight [10, 11, 46, 60, 61]. Our results show that the best fit formulas to estimate these foetal characteristics at a given GA in our population were based on the Australian population [30].

To the best of our knowledge, in the majority of Indonesian primary health care centres where ultrasound facility is not accessible, none of the existing ultrasonic formulas were adopted to estimate foetal HC and foetal AC. Therefore, the formulas potentially can be deployed to fill in the database gaps on the inter growth process of foetus during pregnancy. Consequently, early informed intervention could be initiated to prevent abnormal growth and delivery weights.

Several techniques have been available to reduce collinearity, such as centering, multiplying variables by various constants (scaling), the use of orthogonal polynomials, and other transformations [62]. Currently, the use of automated machine learning, such as Genetic Algorithm rather than a conventional fractional polynomial approach has also been applied to model multiple biometric variables of foetus that are highly correlated [54].

In this paper, we used the best subset selection algorithm to prevent the inclusion of highly correlated variables and select the best subset of predictors to be included in the models. It has been emphasized that a formula for estimating foetal weight should be simple and straightforward to be used by doctors and midwives and be easily understood by patients [63]. This would improve the quality of communication, information, and education as part of routine ANC service in low-resource primary health care centres.

Based on our comparison analysis, the proposed Models (1), (2), (3), and (4) produced the least mean prediction errors (between − 0.2 and − 2.4 g), the MAPEs (between 5.01 and 5.10%), and the MEDAPEs (between 4.10 and 4.22%). The mean percentage prediction error (MPE) steadily tended towards zero as the time interval between the last scan and birth decreased [42]. Our MPEs were ranged between − 0.1 and − 0.3% in those born within 0 day (n = 38) which are lower than the previous research [42] reported by − 0.8% in those born within 1 day (n = 198).

Our proposed models were unbiased for predicting weight between 35 and 41 weeks of GA. In the group born within 0 day of the last measurements, the MAPEs were ranged between 5.0 and 5.10% with 89–92% of predicted weights falling within 10% of the true birth weights which are smaller than those reported in previous study [42]. This was particularly for Model (1) which was simply developed based on FH only.

The comparison between the proposed Model (1) and the widely used Johnson-Toshach model shows that Model (1) (developed based on the Indonesian data) was more accurate in predicting the estimated foetal weight than the Johnson-Toshach model (developed based on the United States data). Furthermore, the Johnson-Toshach model requires the knowledge of FS. The results presented in Table 3 also shows that the inclusion of FS in the model has not reduced the prediction errors in foetal weight estimations yet raise a subjectivity issue unless there is a standard protocol to determine FS with less error [20]. Therefore, we recommend the proposed Model (1) be deployed in Indonesia and other countries with similar health systems and challenges for weight prediction.

Our comparison study confirms that the proposed Models (3) (based on FH and estimates of foetal HC) and (4) (based on FH and estimates of foetal AC) perform better than the ultrasonic models: the Jordaan, the Weiner II, the Hadlocks, the Stirnemann, and the Sotiriadis models. The incorporation of estimated foetal HC or estimated foetal AC has increased R2 slightly (provided in Additional file 5: Table S5), but it did not improve the predicting accuracy (Table 3). However, access to these values will enable the practitioners to monitor foetal growth during pregnancy where advanced equipment, such as ultrasound, is not always available. Consequently, detecting foetal growth abnormality, such as small for GA, prematurity, intrauterine growth retardation, and LBW during pregnancy will be possible.

Strengths and limitations

Our retrospective study has investigated the utilisation of some commonly used foetal weight prediction models in Indonesia. Particularly, the combination between maternal and estimated foetal biometric characteristics was proposed. The aim of this combination was whether it could improve the prediction accuracy of foetal weight at any given GA in the absence of ultrasound machines and trained ultra-sonographers.

The retrospective cohort study was undertaken to provide baseline data on the selected primary health care centre. It is possible that women have used different health services than that reviewed in this study. Although this may result in underestimation in data records, it is unlikely to impact on the validity of the analyses. This study also encountered limitations associated with the accuracy of the information recorded on the manual pregnancy register or inaccurate data transfer to the electronic database. However, monitoring and controlling the process of data transfer was conducted to reduce potential error. Further study should be conducted to assess the efficacy of the proposed models using prospective data [64].

The proposed prediction models are linear regressions. However, the authors have investigated non-linear models. The non-linear models did not improve the estimation accuracy. Therefore, complex models do not guarantee significant improvement in the prediction accuracy. Furthermore, due to the fact that the objective of the study is to provide simple yet reliable foetal weight estimating models for low-resource areas, we are recommending the proposed models. We believe that the findings can be applied in other low-resource settings to improve ANC services.

Conclusion

This research has developed models to predict the estimated foetal weight at varying gestational age where ultrasound facilities do not exist. Since birth weight is one of the most important indicators of neonatal survival, a reliable estimate of foetal weight at different stages of pregnancy would facilitate the intervention plan for medical practitioners to prevent the risk of abnormal delivery weights. Further, the models will lead to the development of foetal inter growth charts, which are currently unavailable in the Indonesian primary health care systems.