Background

Preterm birth (PTB) is one of the main causes of death among children under 5 years of age [1]. PTB is also associated with various complications throughout the lives of survivors as the frequency and severity of adverse outcomes increase as gestational age decreases [2, 3]. In addition, PTB has important economic and social repercussions [4]. The overall rate of PTB has increased from 2000 to 2014 worldwide, and it was estimated at 10.6% in 2014 (14.84 million live births) [5]. In this same year, the proportion of preterm births was 11.2% in Brazil. This figure places the country among the ten countries in the world with the highest rates of PTB [5].

PTB is a syndrome resulting from a complex list of underlying causes and factors including sociodemographic, psychosocial, nutritional, behavioral, and biological factors [6]. Higher rates of PTB are observed in low- and middle-income countries (LMICs) [5], in which socioeconomic disparities have been commonly associated with PTB [7, 8]. Among the risk factors associated with preterm birth is inadequate prenatal care [9]. Adequate prenatal care allows for the diagnosis and treatment of pregnancy complications and a reduction in behavioral risk factors associated with prematurity [10].

Despite this knowledge, whether these risk factors differ in the context of prior preterm birth history is less understood. Also, only one study has distinguished between risk factors for premature birth between women with a previous PTB compared with women without a previous PTB [11]. However, this study was hospital-based from a high-income country with a limited sample size [11]. Using data from the Center for Data and Knowledge Integration for Health (CIDACS) Birth Cohort, we aimed to (1) explore whether the risk factors for PTB in a second pregnancy are different in women whose first pregnancy was delivered at term (≥37 weeks of gestation) or preterm (<37 weeks of gestation) and (2) assess how changes in the number of prenatal visits between pregnancies were associated with PTB in the context of prior PTB.

Methods

Study design and population

This population-based cohort study used data from the CIDACS Birth Cohort. This cohort was created by linking data from Brazil’s National Live Birth System (SINASC) and the 100 Million Brazilian Cohort baseline from Jan 1, 2001, to Dec 31, 2015. This study adhered to the RECORD (Reporting of studies Conducted using Observational Routinely-collected Data) statement.

The SINASC gathers information on birth notifications across the country, including information about the mother, pregnancy, newborn, and gestational age at birth, which allows the estimation of the prematurity rate for the country. The live birth declaration forms that provide information to SINASC adopt the last menstrual period (LMP) as the standard method for estimating the gestational age in weeks. Results of physical examinations and other methods are alternatively accepted [12]. The 100 Million Brazilian Cohort is primarily built from Cadastro Único (CadUnico) which covers the poorest half of the Brazilian population (families with monthly income equal to or below three minimum wages ~750 USD) [13]. The CIDACS Birth Cohort is composed of 24,695,617 live births. Children included in the cohort were generally born from younger, unmarried, and less educated mothers and are more likely to be born via vaginal delivery than children in the general Brazilian population [14]. In this study, we identified successive pregnancies using the unique maternal identifier and the newborn’s date of birth.

The linkage process used CIDACS-RL (Record Linkage), a novel record linkage tool developed to link large-scale administrative datasets at CIDACS [15]. The linkage was based on the similarity index using common attributes (mother’s name, maternal age at birth, maternal date of birth, and the municipality of residence of the mother at the time of delivery) between the databases. The linkage process is described in detail elsewhere [16].

In this study, we included live births of multiparous women, aged 14–49 years, who entered the CIDACS Birth Cohort as nulliparous women. We excluded (a) all multiple births and live births with congenital anomalies as these conditions are known to be strongly associated with premature birth [2, 6], (b) live births weighing < 500 g and gestational age < 22 weeks [17,18,19], (c) those with a birth date prior to the mother’s entry date into the cohort, and (d) live births with missing information on gestational age and for at least one live birth at the first or second pregnancy (Fig. 1).

Fig. 1
figure 1

Study population flow diagram 

Study variables

The study outcome was the occurrence of PTB in the second pregnancy. Incident PTB was defined as a live birth with a gestational age less than 37 weeks preceded by a term delivery. Recurrent PTB was defined as a live birth with a gestational age less than 37 weeks preceded by a previous PTB.

The following covariates were considered in the analyses: (i) sociodemographic characteristics of the mother: residence area (urban; rural), household overcrowding (no: ≤2 inhabitants per room; yes: >2 inhabitants per room), self-declared race/ethnicity (white, black/mixed-race, or indigenous), level of education (<8 years: 8–11 years; ≥12 years of schooling), marital status (married: married or in a stable relationship; unmarried: single, divorced or widowed); (ii) prenatal assistance: the number of visits (none; 1 to 3 visits; ≥ 4 visits); (iii) maternal, delivery, and newborn-related variables: maternal age (14–19; 20–34; 35–49 years), type of delivery (vaginal or cesarean), and sex of the newborn (female or male), and interbirth interval (<12 months, 12 to 24; ≥24 months). Household overcrowding was calculated by dividing the number of individuals living in the house and the number of rooms. The interbirth interval was estimated (in months) by the difference between the second child's birth date and the previous child’s birth date.

Statistical analysis

Socioeconomic, maternal, and birth characteristics were summarized using frequency distributions. We used longitudinal transition models with multivariate logistic regression to investigate the risk factors for PTB in a second pregnancy, which is different in women whose first pregnancy was delivered at term (incidence) or preterm (recurrence). Thus, a multiplicative interaction between each incidence and recurrence of PTB was included in the model to assess the risk factors’ chances for incidence versus recurrence in the second pregnancy. Transition models, also called dynamic models or Markov models, are relevant when any response in a sequence of repeated measures is modeled conditionally under one or more sets of past or previous measurements [20]. The specific statistical model for each objective is presented in a footnote for Tables 2 and 3. The transition model used here is a concise form proposed by Diggle et al. [21] to express recurrence and incidence in a single expression as predictors of the previous PTB in the first pregnancy as well as the interactions. The interaction test only identifies risk factors that are different between incidence and recurrence (α=0.05). However, the interaction test does not explicitly test whether the risk factor is significant for the incident and recurrent PTB, and thus the odds ratio (OR) and 95% confidence intervals (95% CI) for each risk factor were reported separately according to the first pregnancy PTB status. All models were adjusted for the complete set of risk factors considered to be relevant and plausible in the literature on preterm birth [2, 6, 11, 22,23,24,25]. Individuals with missing observations in any of the variables were excluded from the multiple models.

As preventive health behaviors can change between the first and second pregnancies, we assessed the chances of the incident and recurrent PTB according to changes in the number of prenatal visits. We created a four-category variable considering the performance of ≥ 4 prenatal visits in both pregnancies: yes-yes (4 or more appointments in the first and second pregnancy, reference category), yes-no (4 or more appointments in the first and fewer than 4 appointments in the second pregnancy), no-yes (fewer than 4 appointments in the first and 4 or more appointments in the second pregnancy), or no-no (fewer than 4 appointments in the first and second pregnancy). The analyses were adjusted by the same variables mentioned above, except for the number of prenatal visits analyzed as the primary exposure variable.

All analyses were performed using Stata version 15.1 (Stata Corporation, College Station, USA).

Supplementary analysis

Due to changes in gestational age information in the SINASC registry [26], we conducted complementary analyses with live births born between 2012 and 2015 to verify the consistency of the results of our initial analyses (Additional file 1).

We applied a cross-validation technique using multivariate logistic regression to assess the reliability of our model using. We used a 70% training sample and validated the model with a 30% sample. Receiver operating characteristic (ROC) curves were generated and sensitivity and specificity were calculated for the optimal cut-off based on the ROC curve. The results were compared with those obtained in the general model (Additional file 1).

Results

During the study period, there were 6,576,079 live births from multiparous women in the CIDACS Birth Cohort. After applying the exclusion criteria, we obtained 3,528,050 (53.7%) live births from 1,764,025 multiparous women. Of this total, 129,772 births (7.4%) were from women who had a PTB in the first pregnancy. At the second pregnancy, 139,139 (7.9%) were PTB, of which 115,777 (7,1%) occurred among 1,634,253 women with a prior term birth (i.e., incident preterm birth), and 23,362 (18%) among the 129,972 women with a previous preterm birth (i.e., recurrent preterm birth) (Fig. 1). Compared to term births, premature births were less frequent among mothers living in rural areas. PTB was more frequent among mothers living in crowded households, single (single, divorced, or widowed), who did fewer than 4 prenatal visits, those with a short interbirth interval, young mothers, and had had a vaginal delivery (Table 1).

Table 1 Mother’s sociodemographic characteristics, prenatal care assistance, type of delivery and live birth at second pregnancy by preterm birth status at the first birth, 2001–2015 (n=1,764,025)

We first investigated whether the factors associated with PTB in a second pregnancy differed between women whose first pregnancy was delivered at term or preterm (Table 2). We identified different risk factors (Pdifference <0.05) between incident and recurrent PTB: household overcrowding (ORincident 1.09, 95% CI 1.07–1.10; ORrecurrent 1.01, 95% CI 0.98–1.04), maternal race/ethnicity [(Black/mixed—ORincident 1.04, 95% CI 1.03–1.06; ORrecurrent 0.95, 95% CI 0.92–0.98) and (indigenous—ORincident 1.34, 95% CI 1.24–1.44; ORrecurrent 1.14; 95% CI 0.96–1.35)], single marital status (ORincident 0.85, 95% CI 0.84–0.86; ORrecurrent 0.90, 95% CI 0.87–0.93), reduced number of prenatal visits [(1 to 3 visits—ORincident 2.44, 95% CI 2.40–2.49; ORrecurrent 2.24; 95% CI 2.14–2.33) and (no visit—ORincident 2.56, 95% CI 2.47–2.66; ORrecurrent 2.16; 95% CI 1.98–2.36)], maternal age at birth between 14 and 19 years (ORincident 1.16, 95% CI 1.14–1.18; ORrecurrent 1.02, 95% CI 0.98–1.06), and cesarean delivery (ORincident 1.09, 95% CI 1.08–1.11; ORrecurrent 0.96, 95% CI 0.92–0.99). For these risk factors, point estimates were higher in magnitude for incident PTB compared to recurrent PTB. Inverse results were observed for short interbirth interval [(12 to 23 months—ORincident 1.04, 95% CI 1.02–1.06; ORrecurrent 1.22, 95% CI 1.17–1.26) and (<12 months—ORincident 1.89, 95% CI 1.80–1.98; ORrecurrent 2.58, 95% CI 2.38–2.79)] and maternal age between 35 and 49 years (ORincident 1.42, 95% CI 1.38–1.47; ORrecurrent 1.45, 95% CI 1.33–1.58). The univariate analyses are presented in Additional file 1: Table S1.

Table 2 Frequency and factors associated with incident and recurrent preterm birth in the second pregnancy, 2001–2015 (n = 1,764,025)

We also explored how changes in the number of prenatal visits were associated with the incidence and recurrence of PTB according to the prior delivery history. We observed that live births from mothers with more than four prenatal visits in the first pregnancy who did not have four prenatal visits in the second pregnancy were at increased risk of PTB, mainly if they did not have a previous preterm birth (ORincident 2.57, 95% CI 2.52–2.62; ORrecurrent 2.25, 95% CI 2.14–2.37). The chance of incident PTB was 14% higher in the live births from mothers without at least four prenatal visits in the first pregnancy who had this number of visits in the second pregnancy (OR 1.14, 95% CI 1.11–1.18); however, there was no significant increased chance among those recurrent PTB and the reference group. When mothers had an inadequate number of prenatal visits (fewer than four prenatal visits) in both pregnancies, there was a similar increased risk of incident PTB (OR 2.22, 95% CI 2.14–2.30) and recurrent PTB (OR 2.20, 95% CI 2.06–2.38 PTB (Table 3).

Table 3 Incident and recurrent preterm birth in the second pregnancy according to change in the number of prenatal visits between the first and the second pregnancies, 2001–2015 (n = 1,764,025)

The analyses restricted to births after 2012 mostly confirmed the findings described above (Additional file 1: Tables S2 and S3). The factors associated with incident and recurrence of preterm birth using the cross-validation technique were similar to those observed in the general model (Additional file 1: Table S4). For incident PTB, we observed accuracy of 50.78% and 50.86% in the training and validation analyses, respectively. For recurrent PTB, we observed accuracy of 52.05% and 51.92% in the training and validation analyses, respectively. The sensitivity and specificity of the models can be seen in Additional file 1: Fig. S1 and S2.

Discussion

In this study involving 3.5 million live births, we observed different risk factors between incident and recurrent PTB for household overcrowding, maternal race/ethnicity, marital status, number of prenatal visits, maternal age, and delivery mode; association estimates were stronger in magnitude for incident PTB than recurrent PTB. On the other hand, the magnitudes of association for interbirth interval and maternal age were higher for recurrent PTB. The chances of incident and recurrent PTB increased in live births from mothers with fewer than four prenatal visits in the second or in both pregnancies. Four prenatal visits or more in the second pregnancy minimizes the chance of incident PTB.

To our knowledge, risk factors for incident versus recurrent PTB have been reported in a single study [11]. Our findings that a wide range of risk factors was observed for the incident PTB compared to recurrent PTB are consistent with the results reported by Grantz et al. (2014) [11]. Nonetheless, in our study, many risk factors persisted for recurrent PTB after accounting for the excess risk associated with prior PTB history, unlike those reported by the authors above. The mechanisms that explain these results are not fully understood but can be attributed to higher baseline risk among women who have had a previous preterm birth [11]. Another explanation may be related to “index event bias,” which is possible when studying recurrent events. An analysis conditioned to a previous event can induce inverse associations between risk factors (known and unknown). This type of bias can lead to misleading findings and potentially harmful conclusions [27]. We adjusted our analyses for multiple risk factors in an attempt to minimize the index event bias. However, the ability to adjust analyses for all risk factors is a limitation in epidemiological studies in general, and not just in this particular case.

Our results found an increased chance of incident PTB with household overcrowding, Black/mixed and indigenous race/ethnicity, younger ages at birth, and cesarean delivery but not for recurrent PTB. Overcrowding is a marker of poverty and social deprivation [28] and may be associated with PTB since the number of people in a family can influence per capita income, family expenditure, access to food, and other essential services. Crowding can also trigger stress factors on health and well-being, increasing exposure to risk factors for preterm birth [29]. For Black/mixed women, exposure to psychosocial stressors (poverty, homelessness, living in dangerous neighborhoods, domestic violence, experience of discrimination or racism) and risk behaviors associated with stress can favor an increased risk of preterm birth [30]. The association between indigenous race/ethnicity and PTB can be related to worse social and health outcomes of this population when compared to the general population [31]. Concerning the increased risk of PTB among adolescent mothers, some of the explanations proposed are biological immaturity and competition for nutrients between the fetus and the pregnant adolescent [32]. Also, the association between cesarean delivery and preterm birth can be related to the expansion of obstetric interventions aimed at reducing maternal and fetal complications [33].

We observed that a lower number of prenatal visits more than doubled the risk of incident and recurrent PTB. This was notably the strongest factor observed to be associated with PTB incidence. These results corroborate those of several other studies that have reported an increased risk of recurrent PTB among women who make fewer prenatal visits [24, 34]. These findings reinforce the importance of prenatal care for the identification of women at high risk for preterm birth [10]. In our study, we also found lower chances of incident and recurrent PTB among live births from single women (single, widowed, or divorced), with higher protective effect for incident PTB. It is possible that bias in filling out this variable occurred, resulting in the underreporting of women in a stable relationship [35].

We also identified increased chances of incident and recurrent PTB among live births among mothers with short inter-birth intervals and in births from mothers of advanced maternal age, however, unlike the other factors, the estimates were higher for recurrent PTB. Other studies have also identified an increased risk of PTB incident [11] and recurrent [22,23,24,25] among women with a short pregnancy interval. This association may be related to maternal nutritional depletion, folate depletion, cervical insufficiency, and infections [36]. Short birth intervals are more common in women from LMICs, where lower socioeconomic status, less education, high fertility rates, and the mother’s age are often associated with short birth intervals [37]. Richer and better-educated women are better off and have access to health services, as well as to information on the use of contraceptive methods and supplies of them, expanding their intervals between deliveries [37]. Preterm birth among women of advanced age may be associated with the increase in clinical complications as age increases, such as arterial hypertension and diabetes mellitus [38].

We also explored changes in the number of prenatal visits between pregnancies and we found that live births from mothers who had a low number of prenatal visits in the first pregnancy and who made four or more prenatal visits in the second pregnancy had less chance of incident PTB, but not recurrent PTB. This finding can be partly explained by the index event bias already discussed in our work and also by Smits et al. [27]. Furthermore, the adoption of ≥4 antenatal visits in the second pregnancy may be affected by the status of the first delivery. The fact is that the first birth status can influence both the follow-up of ≥ 4 prenatal visits during the second pregnancy and the situation of the second birth.

We also observed that the chances of incident and recurrent PTB increased when mothers had fewer than four prenatal visits in the second and both pregnancies. It is known that adequate prenatal care can lead to the adoption of and timely access to preventive measures and effective interventions to reduce biological, social, and behavioral risk factors associated with prematurity [10] in current and subsequent pregnancies. The prevention of premature births occurs through reducing risk behaviors, the identification and treatment of sexually transmitted diseases and other infections, and malnutrition identification and nutritional advice, including supplementation with multiple nutrients [10]. Besides, other health services, such as family planning, favor adequate spacing between pregnancies and reduce the risk of prematurity in subsequent pregnancies [39]. We highlight that, despite the expansion of prenatal care coverage in Brazil in recent decades, there are still regional and social inequalities in access to adequate prenatal care which impact the high level of PTB in our country [40]. Our results reinforce the importance of expanding the access and quality of primary care, especially primary care and access to prenatal care to women in the reproductive phase in order to achieve a reduction in premature births, especially in a current pregnancy.

Strengths and limitations

This is the first study to assess the factors associated with the incidence and recurrence of PTB in a poor population in a middle-income country with great social and health inequalities. The linkage with the national live birth information system allowed us to track women individually through successive pregnancies. Also, the large number of cases allowed us to simultaneously investigate the factors associated with the incidence and recurrence of PTB, and to carry out our analysis considering the changes in the number of prenatal consultations between pregnancies.

However, this study has some limitations. The use of secondary data, which was not designed primarily for research purposes, may be subject to some limitations related to missing, underestimation, and potential misclassification. The proportion of preterm births recorded in SINASC may be subject to underreporting related to errors in the gestational age [41]. Until 2010, the gestational age at birth was collected over wide intervals of gestational weeks. As of 2011, SINASC started to collect gestational age as a continuous variable; however, the estimation method changed to be mainly based on the LMP [26]. The LMP is the method recommended by WHO, due to its wide accessibility and low cost [42]. Nonetheless, this method can be sometimes not much accurate due to circumstances such as individual variations in the length of the menstrual cycle, especially memory biases [43]. However, these are probably non-differential errors and are unlikely to introduce bias in the measure of association, although the absolute measures of risk may be underestimated. In addition, the proportion of missing data in the cohort could be a limitation for the generalization of our findings. Residual confounding is also possible because important variables for determining PTB, such as maternal comorbidities (e.g., obesity, diabetes, and hypertension), risk behaviors (e.g., smoking, alcohol, or drug use during pregnancy), and access and quality of health services were not available. Also, we were not able to classify the preterm birth subtypes (spontaneous or with medical indication) because of the lack of information in our dataset. The lack of genetic information is another limiting factor; it is known that genetic variants are associated with the duration of pregnancy and the risk of PTB [44]. These limitations may be affecting the differences observed between incident vs. recurrent PTB in this study. In addition, the association between changes in the number of prenatal care visits and PTB according to previous PTB history may present confounding bias. Our findings must be interpreted with caution because the absence of an association between a risk factor and PTB recurrence should not be a reason to dismiss the factor as a potential focus for preventive action. Finally, this study was conducted among the poorest population of a middle-income country with a history of great social and health inequalities which might limit the generalizability of these findings.

Conclusions

Our study suggests that the risk profiles for PTB in the second pregnancy differ according to women’s first pregnancy PTB status. Household overcrowding, Black/mixed and indigenous race/ethnicity, younger ages at birth, and cesarean delivery were associated with incident PTB but not for recurrent PTB. Incident and recurrent PTB risk factors included single marital status, fewer prenatal visits, short interbirth interval, and advanced maternal age. We found that four or more prenatal visits in the second pregnancy reduce the chance of incident and recurrent PTB. These findings can help to identify women with a high vulnerability to preterm births in a subsequent pregnancy. Furthermore, it can contribute to the development of intervention strategies aimed at reducing this problem and contribute to reducing children’s risk of death.