Background

Resulting in the death of one million children per year, preterm birthis the leading cause of both neonatal and under-five mortality worldwide [1]. Preterm birth, defined as birth before 37 weeks of gestation, is a complex syndrome characterized by numerous etiologic pathways inclusive of both maternal and fetal contributing factors. The cause of a preterm birth may be the result of one independent factor or the interplay of multiple factors [2]. Biologic indicators causally linked to preterm birth include extrauterine infection, chorioamnionitis, maternal chronic illness (such as diabetes, hypertension previous to pregnancy, epilepsy, etc.), multiparous pregnancy, intrauterine growth restriction (IUGR), fetal sepsis, fetal anaemia, fetal distress, fetal anomaly, placental abnormalities, antepartum stillbirth, and pre-eclampsia/eclampsia [3].

In 2012, the Global Alliance to Prevent Prematurity and Stillbirth (GAPPS) released a classification system for preterm birth based on “clinical phenotypes” to consider multiple causal factors that may lead to a preterm delivery. Their preterm birth phenotypes were defined as “one or more significant conditions related to the mother, the foetus, and/or the placenta that are present before initiation of parturition” [4]. The GAPPS framework distinguishes between a risk factor and a clinical phenotype by stating that the relationship between preterm delivery and a clinical phenotype must be causal, not simply correlative, meaning factors such as age, socioeconomic status, smoking, and high stress levels are not considered phenotypes. This system allows for the consideration of all biologic factors that may contribute to a preterm birth and recognizes the interplay of multiple conditions present in one pregnancy. It is a wider lens through which to analyse preterm birth trends in a given area, and ultimately inform future intervention priorities [5].

In 2015, Barros et al. applied the GAPPS system to preterm births from the INTERGROWTH-21stdataset and through a 2-step cluster analysis identified twelve preterm birth phenotypic clusters and their prevalence across eight countries [3]. The study populations, however, were women in urban areas, with adequate antenatal care and early ultrasound data, reflecting a population with generally lower preterm birth rates. Our analysis sought to adapt and apply the Barros et. al phenotyping framework to a cohort of women from a rural, low-resource facility in Migori, Kenya using only routine data sources. After adapting the framework, we evaluated the most common preterm birth phenotypes and determined which were most strongly associated with newborn morbidity and mortality.

Methods

Study design and setting

This study was nested in the University of California, San Francisco (UCSF) East Africa Preterm Birth Initiative’s (PTBi-EA) intrapartum quality improvement cluster-randomized control trial [6]. It was a single centre, observational, retrospective chart review of all eligible preterm births from November 2015 – March 2017 at Migori County Referral Hospital (MCRH). The institutional review boards of UCSF (Study ID# 16–19,162) and the Kenya Medical Research Institute (KEMRI) (Study ID# 0034/321) reviewed and approved this study and all methods were carried out in accordance with relevant guidelines and regulations.

Migori County is in the southwestern region of Kenya, on the border with Tanzania. MCRH is a level IV government referral hospital with approximately 4,000 births per year and receives most of the region’s complicated maternal and neonatal cases.

Adapting the Barros framework

The Barros et al. definitions were adapted to allow for missed diagnoses, limited data availability, and variations in the completeness and accuracy of data in a low-resource setting. From the available data, we were able to adapt nine of the twelve phenotyping conditions used in Barros et al. HELLP syndrome, IUGR suspicion, and early pregnancy bleeding were unable to be identified in this setting due to limited laboratory data, low and late antenatal care (ANC) engagement, inconsistent recording of ANC data, and variable gestational age (GA) accuracy. We added fetal anaemia as a phenotype, due to the high prevalence of this condition in our study population which was not included in Barros et al. as a phenotype but was listed as a fetal condition with a causal relationship to preterm birth. Finally, sickle cell anemia and other documented severe anemia not related to the presence of malaria infection were added to the phenotype “severe maternal condition” due to the high prevalence of these conditions in our sample population. Table 1 explains the adapted definitions in detail.

Table 1 Phenotypic conditions with diagnostic criteria adapted to a low-income setting

Data collection

Due to a lack of early pregnancy ultrasounds, inconsistent recording of GA data (i.e., 32 weeks in one area of the chart and 34 weeks in another), and unlikely GA and birth weight combinations (i.e. 4000 g and 28 weeks), GA accuracy was determined to be limited in this setting. As such, eligibility criteria were based on a combination of listed GA and birthweight, including: 1) all babies with a birth weight less than 2500 g, and 2) and babies with a birth weight between 2500 to 3000 g only if the GA recorded in the MR was < 37 weeks. As babies greater than 3000 g were most likely term and those less than 2500 g were most likely preterm, recorded GA was used only for babies who fell within the more uncertain 2500-3000 g range. These definitions are consistent with the eligibility criteria defined by the PTBi-EA parent study and are explained in greater detail elsewhere [7, 8].

Eligible births were identified in the MCRH maternity register (MR), a Ministry of Health logbook where maternity demographic and outcomes data are hand-recorded for each birth by the attending healthcare provider. The inpatient number associated with each eligible MR record was recorded, and the corresponding Maternity Unit inpatient record (IPR) was retrieved from the on-site health records office. If an IPR could not be found, the birth was excluded from the analysis.

Data were extracted from the MR and IPR by PTBi-EA research staff and entered into an Open Data Kit (ODK) tool. The variables extracted were predetermined in consultation with obstetricians and paediatricians from UCSF and KEMRI and modelled on the Barros et al. phenotypic cluster diagnostic criteria. If data between the MR and IPR contradicted one another, preference was given to the IPR.

Data analysis

The clinical data for each birth were reviewed and classified into one or more of the phenotypes by two independent researchers (LM and CS). For each phenotype, births were categorized as having either a single condition or multiple conditions. Extrauterine infection and severe maternal condition were further disaggregated to report the breakdown of infections and conditions. The frequency of each phenotype within the study population was calculated, as well as the frequency of corresponding conditions within each phenotypic subgroup.

To evaluate outcomes associated with each phenotype, the mean birthweights, GAs, parity, maternal age, and antenatal care (ANC) visits were calculated, as well as the proportion and odds ratios of babies who were transferred to the special care newborn unit (NBU), or died before hospital discharge (pre-discharge mortality, PDM). All data were analysed in Microsoft Excel (version 16.16.3) and RStudio (Version 1.0.136).

Ethical considerations

All data were extracted from routine data sources collected from patients by healthcare providers. No personally identifiable data were collected, and all data were stored on encrypted computers and servers. Permission to access the data was sought and granted by the leadership of MCRH.

Results

From November 2015 to May 2017 there were 5,641 births at MCRH of which 621 (11.0%) met the eligibility criteria. Of eligible births, 234 (37.7%) IPRs could not be traced, resulting in a dataset of 387 (Fig. 1).

Fig. 1
figure 1

Preterm birth phenotyping eligibility flow chart at Migori County Referral Hospital

Of eligible women with documented socio-demographic information, the majority were 20 – 26 years old, married, without formal employment, and with a primary school level of education. Women received an average of 3 ANC visits and had an average of 1.4 deliveries prior to the index pregnancy (Table 2).

Table 2 Demographic information of maternity unit patients at Migori County Referral Hospital

Table 3 summarizes the preterm birth phenotypes of the cohort. The largest group was none (no phenotype was identified) at 41.1%, followed by extrauterine infection (25.1%), and antepartum stillbirth (16.7%). Extrauterine infection represented cases of HIV (75.3%), urinary tract infections (UTI) (24.7%), malaria (4.1%), syphilis (3.1%), and general infection (3.1%). Severe maternal condition was ranked fourth (15.6%) and included diagnoses of anaemia (69.5%), chronic respiratory distress (22.0%), chronic hypertension prior to pregnancy (5.1%), diabetes (3.4%), epilepsy (3.4%), and sickle cell disease (1.7%). Corresponding conditions showed trends between various phenotyping groups. Of cases with fetal anaemia, 80.0% also had a severe maternal condition. Of cases with chorioamnionitis, 60.0% resulted in an antepartum stillbirth. Of severe maternal conditions cases, 40.0% also had an extrauterine infection.

Table 3 Phenotypic distribution of eligible babies at Migori County Referral Hospital

Figure 2 explores diagnostic data listed in the MR and IPR. Recorded vital signs ranged from 35.4% of women evaluated for respiratory rate to 61.5% of women evaluated for blood pressure. For blood work, 69.8% of women had a documented HIV test, 80.1% had a documented venereal disease research laboratory (VDRL) test, and 1.8% had a documented malaria test. For urinary symptoms, 6.5% of women had a documented urine test, 16.0% of women a documented urine frequency and/or pain while urinating, and 15.2% were documented as having been evaluated for flank pain.

Fig. 2
figure 2

Patient chart diagnostics flow chart. Abbreviations: HIV (human immunodeficient virus), VDRL (venereal disease research laboratory test), IPTp (intermittent preventative treatemnts in pregnancy)

Table 4 summarizes neonatal outcomes. Babies born with foetal distress had both the largest and the oldest babies with a mean birthweight of 2535.7 g and a mean GA of 34.7 weeks. Cases with antepartum stillbirth had both the smallest and the youngest babies with a mean birthweight of 1783.6 g and a mean GA of 31.5 weeks. For outcomes, foetal anaemia cases were the most likely to experience a transfer from the maternity ward to the NBU (OR 5.1, 95% CI 0.8, 30.9), whereas foetal distress cases were the least likely to transfer (OR 0.5, 95% CI 0.1, 2.5). Foetal anomaly had the highest odds of resulting in a PDM (OR 3.9, 95% CI 0.8, 19.2) whereas foetal distress and foetal anaemia had zero recorded PDMs.

Table 4 Neonatal outcomes of babies in preterm birth phenotyping categories

Discussion

Adapting the Barros et al. preterm birth phenotyping framework allowed us to characterize preterm birth patterns at a county referral hospital. The use of routine data sources allowed for a novel retrospective analysis of an existing dataset, requiring less time and fewer resources than a prospective study. In our analysis of MCRH, we determined that extrauterine infection (most commonly HIV), antepartum stillbirth, and severe maternal conditions (most commonly anaemia) as the leading identifiable phenotypes of preterm birth. These findings are consistent with a recent systematic review of determinants of preterm birth in East Africa, can inform future prevention interventions in the region, and demonstrate the feasibility and utility of an adapted phenotyping framework applicable to lower-resource clinical settings in which the availability of clinical data is often limited [9].

The MCRH preterm rate of 11% is comparable to the burden across high-income and low-income settings with the recent Born Too Soon 2023 report showing a global preterm birth estimate of 9.9% [10]. With rates above 10% considered a high preterm birth burden, the 11% MCRH rate reflects the need for targeted prevention efforts.

The high percentage of unclassified preterm births (41%) was in line with the global trends and other phenotyping studies [3, 11]. In some settings this can reflect caregiver-initiated preterm labor due to less severe conditions or for iatrogenic reasons, however at MCRH this gap likely represents data entry gaps and errors or missed diagnoses. Plausible preterm birth diagnoses such as cervical or placental insufficiency were challenging to diagnose in this setting due to a lack of comprehensive obstetrical history and no availability of transvaginal ultrasound, dopplers, and other diagnostic tools.

Women in Migori County are at high risk for infection given HIV and malaria endemicity, lack of adequate sanitation in most households, and limited general healthcare accessibility [12]. Despite the WHO recommendation that all pregnant women in malaria endemic areas receive intermittent preventive treatment during pregnancy (IPTp), few charts included IPTp on patients’ medication lists, and few malaria tests were reported at the time of delivery [13]. This may be partially attributable to data documentation concerns, but still likely represents an area to target improvements. Low uptake of IPTp is consistent with other studies in East Africa and should be a high priority in the region [14, 15].

For women living with HIV in our dataset, viral load and antiretroviral therapy (ART) regimen information were unavailable, so it is impossible to know if active infection or the ART regimens themselves, as recent studies suggest, contributed to early onset of labor [16,17,18,19,20,21]. Women in Migori living with HIV and pregnant should receive careful prenatal monitoring, and oral preexposure prophylaxis (PrEP) should be considered for HIV negative women considered high risk for infection [13]. The relationship between ARTs and preterm birth also warrants further study.

Sexually transmitted and reproductive tract infections, including UTIs, are known to be highly correlated with preterm birth [22, 23]. Of women in our dataset, 6% had a UTI at the time of birth, and UTIs made up 25% of extrauterine infections. While UTI was not included in the Barros et. al phenotyping classification unless it had advanced to pyelonephritis, only 15% of patients in our cohort were evaluated for flank pain and only 6% received a urinalysis. We therefore included all UTIs to both account for potential missed diagnoses and to highlight the potential risk of UTIs even if they have not progressed to pyelonephritis [22, 24,25,26,27,28]. A recent study in Uganda concluded that the IPTp sulfadioxide was less effective in preventing malaria infection than an artemisinin-based regimen, but more effective at preventing preterm birth due to the broad-spectrum antimicrobial properties of sulfadioxine that likely treated persistent STIs and UTIs [29]. High rates of persistent UTIs have been document across various low-resource settings, implying the need for increased testing and treatment [25, 27].

The high incidence of maternal anaemia in our dataset may have multiple explanations. For one, anaemia is often associated with malaria or other parasitic infections and may indicate a higher incidence of malaria than reported [30,31,32]. For another, late pregnancy anaemia has been shown to be related to HIV seropositivity, including in a study conducted with pregnant women in western Kenya [33]. Finally, while maternal nutrition was not indicated in the charts, high rates of anaemia likely suggest a prevalence of iron deficiency and the need for nutritional counselling and assistance in the antenatal period [34, 35].

Implications for clinical practice

The preterm birth phenotypes identified suggest the need for greater infection management in the antepartum and intrapartum periods for women in Migori County particularly malaria, UTIs, STIs, and HIV. Extrauterine infection was not only the largest identifiable phenotype but was the largest corresponding phenotype with both antepartum stillbirth and fetal distress. High rates of anaemia also suggest a need for increased routine monitoring and nutritional counselling and assistance.

These recommendations imply increased frequency and quality of ANC visits. In our MCRH dataset the mean number of ANC visits was only 3 compared to the WHO recommendation that all pregnant women receive at least 8 ANC visits [13]. It can be challenging for women to come to the clinic for ANC for myriad reasons, therefore options such as mobile clinics, in-home or community-based ANC, and incentive programs for women should be explored [13, 36,37,38,39]. Research has shown that distance from the health facility and an unwillingness to visit the facility alone are major factors limiting ANC visits [40].

Implications for administrative practice

From a data perspective, enhanced linkage opportunities between maternity files and files from other departments (such as antenatal, newborn, and HIV wards) would give maternity healthcare workers critical information needed in the intrapartum period and could allow for a more comprehensive clinical picture to be documented in the chart. This would be most efficiently accomplished through the introduction of electronic medical record systems, but also through more consistent patient identification numerical systems in the interim.

Limitations

Concessions were made in diagnostic precision to allow this model to be applied to births from facilities with limited infrastructure and data accuracy and completion concerns. As we were not diagnosing individuals for treatment but rather assigning phenotypes to look for facility trends our threshold for assigning a phenotype to a preterm birth was lowered. In a region in which necessary confirmatory tests are often missing and documentation in the medical record may be more limited, this approach allowed for greater inclusion of likely cases. With GA quality concerns, an adapted definition of preterm birth was used which may have under or over counted preterm babies (but was consistent with the definition used in the larger PTBi-EA study). Additionally, retrieving patient files for all eligible mothers was not possible due to poor filing systems. Only including births in which the IPR could be traced may have led to a level of selection bias, in that patients with complications may have been more likely to have proper documentation than those without. It is important to note, however, that the filing system at MCRH, is stronger than many others in the region; the high rates of IPRs that could be traced is a strength of this study. Finally, being unable to diagnose HELLP syndrome, IUGR, and early pregnancy bleeding is certainly a limitation. However, as the Barros et. al dataset reported no instances of HELLP syndrome and only a few cases of early pregnancy bleeding, we feel that the exclusion of IUGR was the biggest limitation in our analysis.

Conclusions

Adapting the Barros et. al phenotyping framework allowed for an evaluation of preterm birth trends at MCRH that can inform future clinical and intervention strategies. This analysis highlighted the need for maternal extrauterine infection prevention and management in the antenatal and intrapartum periods for women in Migori. Even in regions where data is limited, our investigation demonstrates that adaptations can be made to the preterm phenotyping framework to accommodate a range in data accuracy and completeness. Indeed, areas with the highest preterm birth rates tend to also have the lowest availability of quality data, reinforcing the importance of adapting valuable methods to meet the needs of highly impacted communities.