Background

Birthweight is one of the important indicators used to assess the health of an infant at birth. Low birthweight has often been defined as weights less than 2500 grams with birthweights less than 1500 grams classed as very low birthweight. These definitions however do not take into account gestational age. It is important to be able to differentiate between babies who are light because they are premature and those who are small-for-gestational age since the latter may have different health problems to the former. They may be growth restricted and have an increased risk of other complications such as perinatal asphyxia, symptomatic hypoglycaemia, congenital malformations, chronic intra-uterine infection and pulmonary haemorrhage [1]. Large-for-gestational age babies also have related health problems. Identification of small or large for gestational age babies is important for the management of the individual pregnancy and neonate. It is also a valuable aid to epidemiological studies where the aim is to identify risk factors or to assess the management of pregnancies [2].

Small- or large-for-gestational age babies may be identified using centile charts of birthweight by gestational age. Centile reference charts are used to monitor clinical measurements on individuals in the context of population values. Raw centiles can be calculated from appropriate data but the perturbations in these curves are unlikely to reflect the pattern of underlying growth at the population level. It is therefore reasonable to use statistical methods to derive a series of smoothed curves showing how the centiles of a measurement, in this case birthweight, change when plotted against time, in this case gestational age.

In Scotland there have been three sizeable studies resulting in the production of centile charts each based on data collected in Aberdeen [35]. The charts from the most recent of these studies, using data from 17,927 singleton births occurring between 1975–1980, were extensively used as a standard throughout Scotland until relatively recently.

The Information Services Division (ISD) of the Scottish Health Service use and publish birthweight centile charts[6]. ISD collects data on all maternity patients admitted to Scottish hospitals on an SMR02 form. The charts are based on 894,066 live births occurring between 1975 and 1989 and are the most recent published in Scotland.

Increases in birthweight since the formation of these standards have been observed for Scotland [7] England and Wales [8], the United States [9] and Canada [10] There have also been changes in the methods used to calculate gestational age [11]. These changes suggest that the centile charts in present use may now be inaccurate. Therefore, we aimed to produce updated charts using more recent data from 1998–2003.

Methods

Data on singleton births occurring between 1980 and 2003 were obtained from ISD's SMR02 (maternity) data collection system. This includes information on the birthweight, gestational age and sex of the infant. The parity of the mother is also recorded. Gestational age at birth was reported in completed weeks and is a clinician's estimate of gestation at birth based on an ultrasound dating scan and date of last menstrual period.

In order to adequately represent the population of all singleton births, the only exclusions made were lethal congenital anomalies and obvious outliers which included any birthweights less than 250 grams. Outliers were identified using Tukey's methodology [12]. This calculates the interquartile range and identifies as outliers any values more than twice the interquartile range below the first quartile or above the third quartile. This method assumes a symmetric distribution which is not the case for birthweights at most gestational ages. However the values of L obtained in each of the groups for each suggests only a slight degree of skewness at most gestational ages. The number of birthweights omitted as possible outliers was small and inspection of the omitted birthweights suggests that most of these could be explained by transcription errors.

The mean birthweight of all singletons born in each year between 1980–2003 was calculated. This confirmed the reported increase in birthweight over this period. This increase is marked over the period from 1980–1997 but appears to level off from 1998 onwards and therefore the most recent years for which complete data were available, namely 2002 and 2003, were used as a basis from which to construct new centile charts. For births occurring at gestations between 31 and 42 weeks a two year period gives sufficient data, however for the extremes of gestational age the data was supplemented by births from 1998 to 2001.

Centiles were calculated using the LMS method [13] which uses the Box-Cox power transformation to obtain normally distributed data within each group. This involves estimating three sets of values for each gestational age group, namely, L the power transformation used to achieve normality, M the median birthweight and S the coefficient of variation of the data. L, M and S are estimated for each gestational age and then smoothed curves are fitted using cubic splines to these to give L(t), M(t) and S(t) where t is the gestational age. The extent of the smoothing is expressed in terms of the degrees of freedom used for the fit. The 100αth centile for the appropriate sex and parity group is then given by

C100α (t) = M(t) [1+L(t)S(t)Z α ]1/L(t)

where Z α is the α % point of the normal distribution.

For a particular infant, with birthweight y, a z-score can be calculated using the formula

z = [ y M ( t ) ] L ( t ) 1 L ( t ) S ( t ) MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaeeOEaONaeyypa0tcfa4aaSaaaeaadaWadaqaamaalaaabaGaeeyEaKhabaGaeeyta00aaeWaaeaacqqG0baDaiaawIcacaGLPaaaaaaacaGLBbGaayzxaaWaaWbaaeqabaGaeeitaW0aaeWaaeaacqqG0baDaiaawIcacaGLPaaaaaGaeyOeI0IaeGymaedabaGaeeitaW0aaeWaaeaacqqG0baDaiaawIcacaGLPaaacqqGtbWudaqadaqaaiabbsha0bGaayjkaiaawMcaaaaaaaa@4526@

Four sets of charts were constructed defined by the sex of the baby, male or female, and the parity of the mother, nulliparous or multiparous. Centiles were calculated using the software LMS ChartMaker. Other analysis was carried out using SAS, version 9.1

Results

The mean birthweight for each of the years from 1980 to 2003 is shown in Figure 1 and confirms the previously reported increase in birthweight.

Figure 1
figure 1

Mean birthweight by year.

There were 98,904 records of singleton births occurring in 2002 and 2003. These were supplemented by information on 1,883 singleton births from 1998–2001 for gestational ages of 30 weeks or less or 43 weeks. Excluding lethal congenital anomalies and omitting outliers gave a total of 100,133 records. Applying Tukey's method resulted in 0.4% of the observations being omitted as outliers. Figures 2a and 2b show plots of birthweight against gestational age with and without the outliers for the subgroup girls, parity 1 or more. Table 1 gives the numbers of births used in constructing the centiles, and the percentage of outliers omitted together with the overall mean birthweight and standard deviation based on the data for 2002–2003 only.

Figure 2
figure 2

a: Birthweight by gestational age for girls, parity 1 or more. b: Birthweight by gestational age for girls, parity 1 or more with outliers removed.

Table 1 Summary of Data by Sex and Parity Groupings

Tables 2, 3, 4, 5 give the centiles for the groups: boys parity 0, boys parity 1 or more, girls parity 0 and girls parity 1 or more respectively. The tables also give the number of births used and the fitted values of L. M and S for each gestational age for each group, as well as the degrees of freedom used in fitting the cubic splines.

Table 2 Centiles for Boys, Nulliparous
Table 3 Centiles for Boys, Multiparous
Table 4 Centiles for Girls, Nulliparous
Table 5 Centiles for Girls, Multiparous

The z-scores resulting from the LMS models fitted should be normally distributed within each grouping. This was verified by obtaining normal probability plots of the z-scores overall and for each gestational age. The plot for girls, parity 1 or more is given in Figure 3. Table 6 gives the observed percentage of z-scores by centiles groupings for the same group.

Figure 3
figure 3

Q-Q plot of z-scores for girls, parity 1 or more.

Table 6 Percentage of observations (observed and expected) within centile bands

In order to assess the goodness of fit of the models, the new centiles were plotted against the observed centiles for each group. Figure 4 shows this plot for the 3rd, 50th and 97th centiles for girls, parity 1 or more. Figure 5 shows the 3rd, 10th, 25th, 50th, 75th, 90th and 97th centiles superimposed on the actual birthweights for the same groups. For comparison Figure 6 shows the new centiles compared to the currently used ISD centiles again for the 3rd, 50th and 97th centiles

Figure 4
figure 4

New centiles vs observed centiles for girls, parity 1 or more. 3rd, 50th and 97th centiles. Solid line: new centiles. Dashed line: observed centiles.

Figure 5
figure 5

New centiles with birthweights for girls, parity 1 or more. 3rd, 10th, 25th, 50th, 75th, 90th, 97th centiles.

Figure 6
figure 6

New centiles vs ISD centiles for girls, parity 1 or more. 3rd, 50th and 97th centiles. Solid line: new centiles. Dashed line: ISD centiles.

Discussion

Centile charts of birthweight for gestational age are a valuable tool in many epidemiological studies as well as providing important information to clinicians as to which babies may be at higher risk of neonatal or postnatal morbidity [14]. It is therefore essential that the charts used are representative of the population to which they are applied. A number of standards are available based on births occurring in various European countries; mostly using data from the 1980s and the 1990s [15].

There are clear differences between the centiles calculated here from recent data and those in current use in Scotland which are based on data from 1975–1989. For term babies the median birthweight in all sex and parity groupings is shown to be higher than it was previously. This increase in birthweight is also reflected in the other centiles. For babies born at very low gestational ages the median birthweight is now less than it was, possibly reflecting the increased survival rate in pre-term births [16]. The centiles for lower gestational ages are also much closer together than in the existing charts. One possible explanation for such a marked difference at lower gestations in particular may be poor estimation of gestational age, particularly in the 1970s, as was found in data for England and Wales analysed by Milner and Richards in 1974 [17].

In recent years a number of centiles charts [18] have been constructed using the method developed by Gardosi [19]. This method aims to give a fetal weight standard and requires only data for term births from the population of interest. Whilst it is desirable in principle to look at fetal weights the assumptions which are being made with this method cannot be substantiated with reference to our data which consists only of actual birthweights. It is therefore not possible to assess the goodness of fit of the centiles calculated in this way.

As well as modelling the median birthweight the LMS method also models the coefficient of variation S and the power L which is used to transform the birthweights to achieve normality. Within each of the sex and parity groupings it is seen that the coefficient of variation decreases with increased gestational age showing that the birthweights are more variable at lower gestational ages. This contrasts with the assumption used in Gardosi's methodology for fetal weights that the coefficient of variation is constant.

It is important in constructing charts of this type to test the adequacy of the model fitted both with reference to the raw data used to construct the charts and to the assumptions on which the model relies [20, 21].

Comparing the new centiles to the empirical centiles suggests that the LMS method is a reasonable fit to the data. It can be shown that, in general, the standard errors of empirical centiles are larger than those for the centiles calculated using the normal distribution. The latter method is therefore more efficient. This is only true if the assumption of normality is reasonable which is not the case for birthweight and therefore some transformation of the data is required. A value of 1 for L indicates no transformation required with a value less than 1 adjusting for positive skewness and a value greater than 1 for negative skewness. For each sex and parity grouping the values of L suggest that the birthweights are negatively skewed for low gestational ages and positively skewed for higher gestational ages. The values of L suggest the extent of skewness at each gestational age is not high.

Normal probability plots of the z-scores for each grouping and for each gestational age within each grouping show that the LMS method has largely succeeded in achieving normality. There is some suggestion in the plot of heavier tails however the percentages in the tails are close to what is expected.

An important question in constructing centile charts of any data is which cases to include in the calculations and which to omit. Many previous studies into centile charts have used live births only because of the difficulty of accurately assessing the gestational age of stillbirths. The argument in the past has been that a baby which is stillborn may have died some time before delivery and therefore the weight may not be a true reflection of the gestational age at which delivery occurs. This is not often the case now. Fetal death is almost always recognized very quickly, and most women prefer to be delivered as soon as possible once it is realized that this has happened. This was argued by Tin[16] looking at the problems of estimating centiles for babies born before 32 weeks gestational age, In this paper it was suggested that not all stillbirths should be excluded, arguing that by doing so centiles at gestational ages less than 28 weeks have been largely overestimated.

For babies born within ten weeks of term the difference in centiles including and excluding stillbirths are negligible because the numbers of stillbirths are relatively small. Omitting stillbirths at low gestations of 24–27 weeks gestation causes bias in the centiles possibly because very small babies at any specified gestation are much more likely to be treated as "effectively" stillborn than larger babies of the same gestation when pregnancy ends as soon as this.

Information on ethnicity is poorly recorded on the SMR02 forms therefore no attempt was made to produce separate centiles for different ethnic groups. From the 2001 census it is known that the minority ethnic population was just over 100,000 in that year which is 2% of the total population of Scotland. The percentage is similar for women of child bearing age. Ethnicity is not therefore a major consideration for the Scottish data.

Other studies have followed the convention of excluding babies with major congenital malformations [22] and this has been used in this study. However with such a large data set the exclusion has made little difference to the centiles. Other studies [1, 23] have also identified outliers at each gestational age using the criterion outlined by Tukey. From visual inspection of the charts with and without the outliers, it is clear that the points identified in this study are most likely to be due to transcription errors. The excluded points do not therefore raise any concerns about the accuracy with which gestational age is measured.

Other factors are known to have a significant effect on birthweight and a number of customised charts have been developed in recent years. It can be desirable to take into account physiological factors such as the height of the mother which contribute to the natural variation in birthweights but not potential risk factors such as whether or not the mother smokes. The distinction between the two types of factor may not always be clear cut however. For example height and weight of the mother may in part be determined by risk factors such as social deprivation or nutrition. There is therefore an important role in epidemiological studies into adverse perinatal outcomes for charts such as the ones described here which will allow both the effect of infant's size and the size of the mother to be separated.

Conclusion

The differences shown between the new centiles and the current published centiles confirm the need to have centiles appropriate for the population for which the charts are to be used. Use of inappropriate centiles may result either in small-for-dates babies not being identified or too many babies being flagged as small-for-dates. After consistent increases in mean birthweight from 1980 until the mid 1990s, mean birthweight has stabilised over recent years making the new charts appropriate for current use. It is however important that the distribution of birthweight continues to be monitored on a regular basis.