Background

Preterm birth (before 37 weeks gestation) is estimated to impact 10% of U.S. births annually with resultant potential for developmental and long-term adverse health consequences [1,2,3,4]. The estimated overall cost of preterm birth in the U.S. is approximately $26.2 billion per year [5]. Preterm birth is a complex phenotype with no single known mechanism or therapeutic strategy. Causes of preterm birth have remained largely unknown [5] and therefore, in most instances, not amenable to effective interventions or prevention.

Several studies have identified important environmental risk factors for preterm birth including prenatal exposure to air pollution [6, 7], contaminated water [8,9,10,11,12], pesticides [13,14,15], traffic density (i.e. counts of motor vehicles within a given radius) [16], air toxins [17], and persistent organic pollutants [18]. In general, these studies have been relatively small and usually contaminants have been examined in isolation. Disparities in preterm birth have also been shown to exist by socioeconomic status (SES) wherein those with lower SES experience higher rates of preterm birth and other adverse pregnancy outcomes [19, 20].

Studies have also shown that exposures to pollutants differ by race/ethnicity and SES [21]. In previous work, we demonstrated that there are racial/ethnic disparities in exposure to air pollution during pregnancy [22]. Woodruff et al. found that Hispanic, African American and Asian/Pacific Islander mothers in the U.S. experienced higher mean levels of air pollution and were more than twice as likely to live in the most polluted counties compared with non-Hispanic white mothers after controlling for maternal risk factors, region and educational status [22]. Pregnant women who are exposed to multiple environmental chemicals and multiple psychosocial stressors such as neighborhood SES are at greater risk of adverse birth outcomes [23, 24]. The cumulative impacts and potential interactions between elevated exposures to chemical and psychosocial stressors have been referred to as a form of “double jeopardy” [25]. In other words, not only are such women at increased risk due to more cumulative risk factors, but the combination of risk factors is compounding the risk in a multiplicative rather than additive way. In a previous study, we found interactive effects of air pollution and SES that contribute to risk of preterm birth in the San Joaquin Valley of California [26].

Fresno County, in the San Joaquin Valley of California (CA), is an area of known environmental pollution burden [27] and a high prevalence of preterm birth (12.1% compared to 9.6% in CA in 2012). Additionally, Fresno County is characterized by diverse race/ethnicity and SES with a majority of the population being of non-white race and of lower SES, which may impact adverse health effects in conjunction with environmental exposure. Our study examines the association between multiple environmental, medical and social factors and preterm birth in Fresno County, CA from 2009 to 2012. Few studies have addressed how these factors may compound one another to contribute to preterm birth. The interaction of environmental, medical and social stressors may be critical in elucidating disparities in preterm birth. Furthermore, uncovering such compounding effects may focus policy and intervention efforts at reducing pollution burden in the most vulnerable communities.

Methods

Study population

Birth outcome and maternal demographic information were collected from a linked hospital discharge birth cohort database maintained by the CA Office of Statewide Health Planning and Development (OSHPD) that includes linked information from the State of CA vital records and hospital discharge records (comorbidities were identified from codes in the form of ICD-9-CM diagnoses). From this linked dataset, the study includes race/ethnicity, infant sex, maternal age at delivery, years of education, participation in the Women, Infants, and Children (WIC) food and nutrition service (a Federally-funded supplemental program), payer for delivery costs (i.e., heath insurance status), place of mother’s birth, body mass index (BMI) calculated from maternal height and pre-pregnancy weight, preexisting diabetes (ICD-9 code 250 and 648.0), gestational diabetes (648.8), preexisting hypertension (642.0, 642.1, 642.2, 642.7), gestational hypertension (642.3), preeclampsia/eclampsia (642.4, 642.4, 642.6), infection (646.5, 646.6, 647), anemia (648.2), mental illness (648.4), reported smoking, reported drug abuse, reported alcohol dependence, trimester when prenatal care began, parity, previous preterm birth, previous cesarean section, inter-pregnancy interval, premature rupture of membranes (658.1), mode of delivery (cesarean or vaginal), birth weight, birth date and gestational age at delivery (best obstetric estimate).

The sample was restricted to live-born singleton births with known birth date, birth weight between three standard deviations of mean by week of gestation [28] and gestational age between 20 and 44 weeks with complete information including census tract or zip code and births between 2009 and 2012 in Fresno County, CA.

Methods and protocols for the study were approved by the Committee for the Protection of Human Subjects within the Health and Human Services Agency of the State of California.

CalEnviroScreen

We used the California Communities Environmental Health Screening Tool (CalEnviroScreen 2.0, released in 2014) to estimate environmental exposures for each census tract in Fresno County [29]. The CalEnviroScreen was developed by CA’s Environmental Protection Agency’s (CalEPA) Office of Environmental Health Hazard Assessment to evaluate the cumulative existence of multiple pollutants and stressors in communities [30]. CalEnviroScreen is used to identify communities disproportionately burdened by cumulative impacts and identify disadvantaged communities for allocation of cap and trade funds generated under the Global Warming Solutions Act of 2006 [31]. CalEnviroScreen combines multiple sets of data on pollutants and stressors within a census tract into an overall index, which can be used to screen for places with the highest cumulative burdens (https://oehha.ca.gov/calenviroscreen).

CalEnviroScreen 2.0 consists of 19 environmental and population indicators in total, which are aggregated into a final, relative CalEnviroScreen Score (Table 1, Fig. 1). The CalEnviroScreen Score is made up of two key categories and four components of census tract-level indicators: Pollution Burden – Exposures score and Environmental Effects; and Population Characteristics – Sensitive Populations and Socioeconomic Factors (Fig. 1). Exposures score indicators include measures of pollutant sources, releases and environmental concentrations. Environmental Effects indicators are measures of threats to the environment and degraded ecosystems caused by pollution. In calculating the average Pollution Burden, the Environmental Effects indicators are weighted by half because CalEPA considers the Exposures score indicators to be more direct measures of exposures to pollution (e.g., air pollution monitoring). These indicators likely contribute more to a person’s total pollution burden than the impact of living near contaminated land or water, where the exposure is less immediate. Indicators of Sensitive Populations and Socioeconomic Factors include both biological traits (e.g., age and health conditions of tract residents) and factors related to tract-level SES (e.g., poverty and education) that can increase susceptibility to the adverse health impacts of pollutants. These together form the Population Characteristics score. The Pollution Burden and Population Characteristics scores are then multiplied together to arrive at a final relative CalEnviroScreen score ranging from 0 to 100. The indicators are ranked into percentiles, which allows them to be compared across the state. The indicator percentiles and component scores are also useful to evaluate and understand the key drivers of vulnerability in a community. The methodology and rationale for each specific indicator is described in detail in the CalEnviroScreen 2.0 report [31]. In addition, the individual drinking water contaminants are shown in Table 1. We used the Socioeconomic Factors score from the CalEnviroScreen, which includes the following variables derived from the US Census American Community Survey: educational attainment, linguistic isolation (households where no one over 14 years of age speaks English very well), poverty and unemployment.

Table 1 Description of pollution indicators in CalEnviroScreen 2.0
Fig. 1
figure 1

Components of the CalEnviroScreen 2.0

We merged the OSHPD birth records with CalEnviroScreen 2.0 data by 2010 census tract. When birth records contained 2000 census tracts, we used the relationship files for 2000 to 2010 census tracts to create area-weighted values for the CalEnviroScreen variables [32]. If a census tract identifier for a birth record was missing or invalid, zip codes were used as surrogate and similar area-weighted adjustments were made using zip code to census tract relationship files (N = 1879; 3.5%).

Statistical analyses

Our primary outcome was preterm birth was defined as birth at less than 37 weeks gestation. We examined 24 exposure variables, which included the following scores and indicators from the CalEnviroScreen: Pollution Burden Score; Exposures score (component of Pollution Burden); Environmental Effects (component of Pollution Burden); 11 indicators (6 Exposures and 5 Environmental Effects); and 10 subcategories of the drinking water indicator (Fig. 1, Table 1). Each exposure variable was examined separately and classified dichotomously (split at the median) and by quintiles. We calculated Pearson correlation coefficients between the each of the indicators and scores from the CalEnviroScreen.

We examined several sets of covariates and their relationships to preterm birth and exposure indicators, which included socioeconomic variables (maternal education, payer for delivery), demographic characteristics (race/ethnicity, maternal age, maternal country of birth), obstetrical-related variables (diabetes, hypertension, smoking/alcohol/drug use during pregnancy, BMI, parity), and, among multiparous women, previous caesarean section, previous preterm birth, and inter-pregnancy interval from the previous live birth to the estimated conception of the index pregnancy. Inter-pregnancy interval was calculated from previous live birth (month and year) as reported in linked records and estimated as months to conception of the index pregnancy. Given that the day of previous live birth was not available, the middle of the month was used for calculation purposes [33]. We explored the association between the covariates and both the outcome (preterm birth) and exposure (above median levels of Pollution Burden).

We used logistic regression to evaluate the association between each indicator and preterm birth (< 37 weeks) and early preterm birth (< 34 weeks), comparing each of the higher 4 quintiles to the lowest to allow for non-monotonic relationships across the pollution distribution. We ran three sets of models: crude, adjusted with a priori variables and a stepwise selection. The covariates determined a priori included maternal education, age, race/ethnicity, and payer of delivery costs. The stepwise procedure included a forward and backward algorithm to estimate the association between environmental factors with preterm birth that allowed inclusion of covariates listed above that had p < 0.05 in crude risk calculations.

To explore the hypothesis that there is a double jeopardy when populations are vulnerable to both social and environmental stressors, we examined SES and race/ethnicity as potential modifiers in the relationship between environmental contaminants and preterm birth. We stratified analyses to examine the relationships between pollution and preterm birth by high and low SES of the census tract the woman lived in. The low SES group consisted of census tracts with below median levels of poverty, education, unemployment and linguistic isolation (Fig. 1, Table 1). We also stratified the analyses by broad race/ethnicity groups: White/non-Hispanic, non-White/non-Hispanic and Hispanic. These stratified analyses compared above versus below median levels of exposure in Fresno County and risk of preterm birth including early preterm birth.

In sensitivity analyses, we explored several alternative analytic decisions. We evaluated the pollutants continuously, both in individual models and a combined model with social factors. We chose more specific phenotypes of preterm birth including early preterm birth (< 34 weeks) and spontaneous preterm (i.e. premature labor or premature rupture of membranes) to restrict to preterm births that were not the consequence of a known cause or indication. We evaluated the raw scores of the exposure indicators (as opposed to the percentiles). Additionally, we mapped preterm birth prevalence across the county to visually observe the geographic variability.

Results

Population characteristics

After applying our exclusion criteria, our final study population included 53,843 births (Fig. 2). Our study population was highly diverse in both race/ethnicity and SES and pollution burden was higher in non-White and low SES areas (Table 2). We did not present cells with less than 16 women (for privacy purposes) nor calculate odds ratios with any cell less than 5. Our population in Fresno County was majority Hispanic (60%), followed by non-Hispanic white (19.7%), Asian (10.5%), and African American (5.8%). One quarter of mothers were born in Mexico (24.5%). More than 30% of the mothers had less than high school education and more than two-thirds of the mothers’ delivery costs were paid by Medi-Cal (California’s Medicaid). The prevalence of preterm birth (< 37 weeks), early preterm birth (< 34 weeks) and spontaneous preterm birth (< 37 weeks and premature rupture of membranes or preterm labor) were 8.5%, 2.1% and 7%, respectively.

Fig. 2
figure 2

Flow Chart of Our Study Population of Births in Fresno County, California

Table 2 Population characteristics in Fresno County, 2009–2012 (N = 53,843)

Correlations were moderate between diesel PM, ozone and traffic, ranging from 0.53 to 0.79 (Additional file 1: Appendix 1a). Nitrate and TCE were also moderately correlated (0.62; Additional file 1: Appendix 1b). Summary statistics of each of the indicators by preterm birth status is presented in Table 3. Although many are similar between the two groups, the Exposures score, PM2.5, Diesel PM, Toxic Release, Traffic, Drinking Water Score, Cadmium, Nitrate, Uranium, Solid Waste and Pollution Burden Score were all higher among preterm births.

Table 3 Descriptive statistics of environmental indicators by preterm birth status in Fresno County, 2009–2012 (N = 53,843)

Associations between the covariates and preterm birth included hypertension with pre-eclampsia, drug or alcohol abuse and previous preterm birth as maternal factors strongly associated with preterm birth (data not shown). Additionally, Hispanic, African-American and Asian mothers were more likely to have preterm birth compared to white mothers. Mothers with Medi-Cal payer status had higher risk of preterm birth. Additional risk factors for preterm birth included underweight BMI, diabetes, hypertension without pre-eclampsia, infection, anemia, mental illness, previous cesarean delivery, and short (< 6 months) or long (> 59 months) inter-pregnancy interval. Conversely, mothers that participated in WIC were less likely to deliver preterm.

Association between environmental pollutants and preterm birth

We found that the mothers in the highest quintile of Exposures score were two times as likely to have preterm birth (< 37 weeks), compared to the lowest quintile in the a priori variable adjustment regardless of different statistical adjustment settings (crude and stepwise adjustment, not shown). We also found the highest three quintiles of Pollution Burden score had statistically higher odds of preterm birth (Table 4).

Table 4 Crude and adjusted odds ratio of preterm birth across quintiles of CalEnviroScreen indicators and scores in Fresno County, 2009–2012 (N = 53,843)

We found the highest quintile of drinking water contaminants was associated with higher odds of preterm birth (Table 4), especially spontaneous preterm birth (data not shown). Specifically, uranium concentrations in drinking water was associated with preterm birth and trichloroethylene (TCE) was associated with early preterm birth. Trihalomethanes (THM) concentrations were inversely associated with preterm birth.

The Exposures score, diesel PM and drinking water contaminants were more strongly associated with increased risk of early preterm birth in the low socioeconomic areas compared to the high socioeconomic areas (Table 5). Similar increases were also observed for early preterm birth among the low SES areas compared to high SES areas (Additional file 1: Appendix 3).

Table 5 Crude and adjusted* odds ratio of preterm birth comparing above versus below the median of environmental exposure stratified by census tract-level socioeconomic status (SES) in Fresno County, 2009–2012 (N = 53,843)

We also found the association between diesel PM and preterm birth was slightly higher among non-white and non-Hispanic women, particularly for early preterm birth after adjusting for age, education and payment for delivery costs (Table 6).

Table 6 Crude and adjusted odds ratio of preterm birth comparing above versus below the median of environmental exposure stratified by race/ethnicity in Fresno County, 2009–2012 (N = 53,843)

Sensitivity analyses

In logistic regression models of preterm birth (< 37 weeks gestation) examining one indicator at a time continuously, two pollutant measures were statistically associated with preterm birth: interquartile range increases in PM2.5 and Pollution Burden Score were associated with 6% increases in odds of preterm birth after adjustment for education, payer of delivery, maternal age and race/ethnicity. Diesel PM, traffic density and Trichloroethylene concentration (in drinking water) were associated with 26.3% increased odds of early preterm birth (26%, 10%, 16%, respectively, Additional file 1: Appendix 2). The associations were consistent between toxic releases and preterm across all race/ethnicity groups, but highest for white, non-Hispanic early preterm births (Additional file 1: Appendix 4). Pesticides were found to be inversely associated with early preterm birth (data not shown).

When all individual environmental indicators and social factors were included in the same model, PM2.5 and unemployment, maternal age > 34, Medi-Cal payer of delivery and African-American race were associated with preterm birth (data not shown). Results examining raw scores were comparable to those of the percentiles.

Discussion

Overall, the current study found small but consistent associations between pollution exposure and preterm birth in Fresno County. Although many of the individual pollutants were not associated with preterm birth, the cumulative scores were consistently associated with preterm birth, including the Exposures score, drinking water contaminants and Pollution Burden score. Novel exposures, such as the toxic releases from facilities, were identified as a potential contributor to preterm birth in Fresno County. There was an exposure-response of increased risk of preterm birth across quintiles of Pollution Burden scores. Furthermore, the relationship between pollution and preterm birth was stronger among areas with lower SES.

Some risk factors of preterm birth, such as hypertension, have large associations though only affect a small portion of the population. The associations found with pollution were smaller, but may affect a larger portion of births across the population. Pollution may be exacerbating diseases and health issues that lead to preterm birth (e.g., hypertension) [34], or operating directly through toxic exposures (through a variety of possible mechanisms) [35].

The results did not differ considerably when restricted to spontaneous preterm birth. In some cases, results were stronger among the more severe early preterm birth (less than 34 weeks). The drinking water contaminant, THM, was associated with a decrease in preterm birth; however, it can be inversely correlated with other contaminants because it is a disinfection by-product commonly found in metropolitan areas.

Our findings add to the literature on environmental risk factors and preterm birth. For example, in previous studies in CA, we found small but consistent effects of air pollution on risk of preterm birth using air pollution measurements at the geocoded residence [26, 36, 37]. Along with the current study, two additional studies found stronger associations between air pollutants and preterm birth for early preterm birth [26, 37]. Additionally, an interaction was also observed between air pollution and neighborhood SES using three U.S. Census indicators at the block group level (unemployment, poverty, income from public assistance) [26] in our previous study in the Central Valley of California. Compared to previous studies of air pollution with more precise exposure assessment, our current results are likely underestimated owing to non-differential exposure misclassification. The trade-off of the potential measurement errors is the ability to combine multiple exposures and examine cumulative pollution effects.

Consistent with previous work on environmental justice, we observed higher pollution burden among those who were non-White and of lower education and income. Additionally, we found stronger, though not statistically different, associations between some environmental indicators and preterm birth in low SES areas. This is consistent with the concept of ‘double jeopardy’ of environmental and socioeconomic stressors [24]. Further work in this area comparing the entire state of California may be more suitable to demonstrate this occurrence. Overall, there were not considerable differences in the association between pollution and preterm birth between racial/ethnic groups.

Notably, WIC participation, which was associated with high pollution burden and requires low SES, was protective against preterm birth. This is an example of a program that may be having a positive effect on reducing preterm birth in Fresno county. The addition of similar programs, which provide access to supplemental foods, healthcare referrals and nutritional education for pregnant women, may further reduce preterm birth in low-income areas.

Despite the large inclusion of the population, our study did have several limitations. One limitation is the imprecise exposure assessment both geographically and temporally. In some cases, the linkage between the birth records and the census tract were not available and this may have resulted in bias, given changes in census tracts are often a result of population growth. The exposure assessment was at the census tract level and the years were pooled for most data sources. Additionally, the CalEnviroScreen was designed as a screening level tool and does not include specific pollutants or chemical exposures that may be affecting this study population. We examined many indicators of pollution that included nested summary measures, which led to many comparisons. Although we did not adjust for multiple comparisons, we present these results as exploratory. Some women may have had two or possibly more births during this time period (2009–2012); however, we were unable to link them and control for these correlated events. Lastly, we assumed that mothers lived constantly throughout their pregnancy in the maternal residence recorded in the birth certificate without relocating from other regions and did not account for time activity patterns or time spent in other geographical areas.

The CalEnviroScreen is a unique tool devised to identify areas of high pollution burden and vulnerable populations and has the benefit of informing epidemiologic studies. Strengths of this study include our ability to include a large set of pollution indicators both individually and cumulatively across a broad geographic area. Additionally, we were able to include all singleton births in Fresno County with detailed demographic and medical information from medical discharge records. Further, our results find a stronger association with the Exposures score, which makes sense as this score consists of monitoring data that is likely to be more representative of actual exposures in the population.

Conclusion

Our study provides an initial investigation of the CalEnviroScreen as an epidemiologic tool to help elucidate a host of environmental and social factors that contribute to preterm birth. As a screening tool designed to discern communities that assume disproportionate environmental burdens in California, the CalEnviroScreen provides data for environmental justice research. Future studies could expand to the entire state of California and aim to include additional sources of data such as biomonitoring and genomics that could confirm exposure levels and identify pathways by which environmental pollutants contribute to preterm birth.