Background

Recent epidemiologic studies have documented increased health risks from exposure to atmospheric particulate matter (PM) (Anderson et al. 2012; Beelen et al. 2014; Pelucchi et al. 2009; Shah et al. 2013; Hart et al. 2015; Heinrich et al. 2013; Weuve et al. 2012; Stieb et al. 2012; Rich et al. 2018; Golan et al. 2018) for a variety of health outcomes, including non-accidental mortality, cardiovascular disease mortality, lung cancer, neurocognitive functioning, and effects on reproduction. Studies using proxy measures for exposure to traffic-related air pollution have generally reported health effects larger than those for the mass concentration of PM < 2.5 μm (PM2.5) or PM < 10 μm (PM10) in aerodynamic diameter (Brugge et al. 2013; Gehring et al. 2006; Hoffmann et al. 2007; Puett et al. 2011; Puett et al. 2014; Schikowski et al. 2005). Several of these studies used simple exposure metrics for traffic-related air pollution based on distance to nearest road, or summaries of road geography within buffers of fixed radii (Eckel et al. 2011; Medina-Ramon et al. 2008). More complex approaches have been applied within small geographic areas, typically to one urban area or municipality (Gryparis et al. 2007; Maynard et al. 2007). One such approach accounted for wind direction using a geographic information system (GIS) to calculate the traffic density in locations upwind of receptors (Arain et al. 2007). Traffic-related air pollutant levels can vary on small (i.e., micro (0–100 m), middle (100–500 m), and neighborhood (500–4 km)) spatial scales immediately near roadways (Jerrett et al. 2005; Zhou and Levy 2007). Though the above complex approaches likely described local gradients in traffic-related air pollutants better than simpler methods such as distance to road terms, few direct, robust, and long-term multi-city comparisons are available. Uncertainty remains about whether and to what extent noise may be a confounder in analyses of health effects of traffic-related PM exposure, which underscores the need for accurate and highly spatially resolved estimation of traffic-related PM levels so these effects can be better disentangled in future epidemiologic analyses.

Gaussian plume dispersion models have been widely used to model point sources (Jerrett et al. 2005) and have been adapted to area and line sources (Benson 1992). Wilton et al. (2010) used such a model, CALRoads-View, to describe small scale gradients in nitrogen oxide (NOx) and nitrogen dioxide (NO2) levels for a 2-week period in 2006 in greater Los Angeles and for a 2-week period in 2005 in Seattle. They found that including the output from CALRoads-View improved the prediction capability of city-specific spatio-temporal models of NOx and NO2 levels compared to traditional land-use regression models.

The objective of the present study was to describe the development of and evaluate the performance of a line-source Gaussian plume dispersion modeling approach that estimates daily 24-h average traffic-related PM concentrations near roadways which can be applied at any location in the conterminous US from 1999 to 2011. Also, we developed and validated a spatio-temporal model of PM2.5 elemental carbon (EC) using 24-h average measurements of this component of PM2.5, PM2.5EC, together with several meteorological and GIS-based spatial covariates (including traffic-related PM levels from our line-source dispersion modeling approach). These models will be used to provide estimates of exposure in epidemiologic analyses. We chose to present results for PM2.5 EC because it has both traffic-related local sources and larger-scale regional gradients (Brochu et al. 2011) and also had high predictive accuracy (spatial cross-validation (CV) R2 = 0.784 (described later)); PM components Zn and Cu were also evaluated but had lower predictive accuracy (spatial CV R2 = 0.691 and 0.501, respectively). We chose not to use NO2/NOx as our exposure metric due to our interest in PM health effects and also to the complexity of NO-NO2-O3 chemistry in areas very near roadways, though we may address this in future work. Our spatio-temporal model predicts 24-h average PM2.5 EC levels across the conterminous US with high spatial resolution from 1999 to 2011, and thus can provide acute (24-h), sub-acute, or chronic exposure estimates that account for traffic intensity, prevailing wind direction, and dispersion characteristics based on local meteorology.

Methods

To approximate micro-, middle-, and neighborhood-scale primary traffic-related PM levels, we applied a line-source Gaussian plume dispersion model, Atmospheric Dispersion Modeling System (ADMS)-Roads v2.3 software (CERC, Cambridge, UK), to a hypothetical grid (hereafter referred to as a kernel) with a 2-m cell size under varying meteorological regimes. We calculated traffic-related PM levels at each grid point in the kernel for each combination of ADMS-Roads inputs, and then mapped receptor locations onto the appropriate kernel, with rotation for wind direction if appropriate, to estimate traffic-related PM levels. We chose ADMS-Roads as opposed to other line-source models such as CALRoads-View because of ADMS-Roads’ more advanced treatment of atmospheric stability as characterized by continuous Monin-Obhukov length (LMO) and planetary boundary layer height (inputs which are directly available from the Modern Era Retrospective-analysis for Research and Applications (MERRA; http://disc.sci.gsfc.nasa.gov/daac-bin/FTPSubset.pl; see below), resulting in a continuous and non-overlapping metric of atmospheric stability, as opposed to conventional stability classes. Two hypothetical 5-m road segments, facing each other to eliminate vehicle direction effects, were placed at the origin of each kernel. The domain of each kernel was 1 km wide by 2.05 km long (2 km downwind of source and 0.05 km upwind). The ADMS-Roads modeling approach requires several spatially and/or temporally varying inputs in order to produce estimates of near-road traffic-related PM concentrations (referred to as ADMS-Roads output): (1) traffic intensity for each of four road classes (A1, A2, A3, and A4/A6), (2) daily prevailing wind direction, and (3) several parameters which affect pollutant dispersion: wind speed, surface roughness, sensible heat flux, planetary boundary layer height, road width, vehicle speed, and absolute value of minimum LMO. An example kernel is shown in Supplemental Material Fig. S1. The derivation and estimation of these spatially varying or spatio-temporally varying input parameters is described in the following sections. Later we discuss the evaluation and validation of ADMS-Roads output using spatio-temporal models of 24-h average PM2.5 elemental carbon (EC) levels.

Availability of data and material

Restrictions apply to the availability of the data used and analyzed in this study, some of which were used under license for the current study and so are not publicly available.

Meteorological inputs to ADMS-Roads

Meteorological data on wind speed and direction (as u- and v- vector wind components at ~10 m above ground), sensible heat flux, and planetary boundary layer height were obtained from the MERRA project. These data are available hourly on a grid across the continental US with an approximately 55-km (depending on latitude) cell size; the native grid is 0.5° latitude/longitude. Hourly gridded data were assigned local time and date based on the time zone of each grid point’s location and then averaged by day (for wind direction, we averaged u- and v- vector components separately, then calculated the direction of the resulting vector) from 1999 to 2011. To allow spatial prediction between the 55-km grid points, the daily averages of each parameter were used to create spatial smoothing generalized additive models (GAMs) of the form:

$$ {y}_{\mathrm{i}\mathrm{t}}={\alpha}_{\mathrm{t}}+g\left({s}_{\mathrm{i}}\right)+{\varepsilon}_{\mathrm{i}\mathrm{t}};{\varepsilon}_{\mathrm{i}\mathrm{t}}\sim \mathcal{N}\left(0,{\sigma_t}^2\right) $$
(1)

where yit is the value of the meteorological parameter at location i = 1…I on day t = 1…T, αt is an intercept representing the adjusted mean on a given day, g(si) is a two-dimensional penalized thin-plate smoothing spline, with basis dimension k = I ∗ 0.9 chosen to allow much flexibility in the function. To reduce the potential for over-fitting, a multiplier of 1.4 (using the gamma argument to gam()), as recommended by Wood (2006), was used. These daily models were fit separately for each of seven geographic regions (with an overlap of 400 km with adjacent regions) of the conterminous US (Supplemental Material Fig. S3), yielding seven spatial smoothing GAMs per day to be used for space-time prediction of each daily meteorological parameter at any location in the domain of the conterminous US. The distribution of each of these meteorological parameters was divided into several categories, listed in Table 1, with details of this classification provided in the Supplemental Material.

Table 1 Categories of the meteorological, traffic intensity, and other input parameters to ADMS-Roads to produce the kernels

Location-specific surface roughness and absolute value of minimum LMO values were derived, in accordance with guidance in the ADMS-Roads software, based on categorization of urban land use data from the 2001 US Geological Survey (USGS) National Land Cover Dataset (USGS 2012; Table 1). The original 30-m cell size data were summarized in ArcGIS v.10 (Environmental Systems Research Institute, Redlands, CA) using a moving window with a 1-km radius to calculate the percentage of low, medium, and high-intensity developed land use. Categories of these values are shown in Table 1.

Because we found it had no substantive effect on ADMS-Roads output in the range applicable over the conterminous US, air temperature was assumed to equal 15° C for all ADMS-Roads output.

Traffic intensity inputs to ADMS-Roads

We obtained data on annual-average daily traffic counts (hereafter referred to as traffic counts) from Geographic Data Technology, Inc. (Lebanon, NH) Dynamap Traffic Counts v4.2. We first spatially joined the traffic count points (those that reported in or after 1990) to the ESRI StreetMap Pro 2007 network of road segments to obtain the US Census Feature Class Code (US Census Bureau 2013) road class (hereafter referred to as road class) of each road segment. Details regarding the processing of these data can be found in the Supplemental Material. From the processed traffic intensity data, we created four spatial smoothing GAMs: one for each road class (A1, A2, A3, and A4/A6; results shown in Supplemental Material Fig. S4 panels A1b, A2b, A3b, and A4b, respectively). These spatial smoothing GAMs were of the same form as in Eq. 1 and were validated by calculating the squared Pearson correlation between predicted road-class-specific traffic intensities and those measured in an external validation data set from the US Bureau of Transportation Statistics (BTS) 2011 National Transportation Atlas Database Automated Traffic Recorders (ATR; US BTS 2013) and Weigh in Motion (WIM) (US BTS 2013) data sets. Details regarding the development of the spatial smoothing GAMs of traffic intensity and the processing of the external traffic count validation data can be found in the Supplemental Material. For comparison purposes, we also calculated the kernel density of the traffic count data using a neighborhood of 100 km and the ArcGIS “Kernel Density” tool (Silverman 1986). We compared these values to the combined ATR and WIM data by road class using linear regression models.

Because detailed geographic data on road widths and vehicle speeds across the conterminous US were not available, representative road width and vehicle speed values were assumed to be constant based on road class (Table 1).

To provide road source location data, StreetMap Pro 2007 street line segments were converted to a series of points spaced 10 m apart using the Xtools Pro tool “Convert Features to Points” (Data East, Russia; examples in Fig. 1 and Supplemental Material Fig. S2). The emissions factors we used in ADMS-Roads can be found in the Supplemental Material. We note that our use of a smooth regression spline in the spatio-temporal model of 24-h average PM2.5 EC (hereafter referred to as “spatio-temporal PM2.5 EC model”) described below provides an effective scaling of this emission factor to measured PM2.5 EC levels. Thus, our modeling process eliminates discrepancies between European and US vehicle fleets and between assumed and actual emission rates.

Fig. 1
figure 1

Roadways displayed by US Census feature class code and 10-m street points nearby to a PM2.5 speciation monitor near Linden, New Jersey

Obtaining ADMS-Roads output

A separate kernel containing ADMS-Roads output was generated for each combination of meteorological and traffic intensity categories (Table 1; details in Supporting Material). Briefly, to obtain ADMS-Roads output, we first selected the kernel that best matched the dispersion and traffic intensity conditions, then identified road source points within 2 km of each prediction location. Together with meteorological and traffic intensity inputs predicted from spatial smoothing GAMs, the ADMS-Roads output concentrations were interpolated from the appropriate kernel, for both “best-estimate” levels which depended on prevailing wind direction and for “worst-case downwind” levels ignoring wind direction. A detailed description of the steps is provided in the Supplemental Material.

Other meteorological and GIS-based spatial covariates

Data on meteorological parameters air temperature, total precipitation, and total snowfall were obtained from MERRA and processed as described above. Data on daily total rainfall were derived by subtraction of total snowfall flux from total precipitation flux for each day. Though these parameters were not used as inputs to ADMS-Roads, daily spatially smoothed meteorological values were evaluated as spatio-temporal covariates in spatio-temporal models of 24-h average PM2.5 EC. Details on other spatial covariates including county-level population density, elevation (USGS 2013), and distance to road covariates can be found in the Supplemental Material.

Validation of ADMS-Roads output using spatio-temporal models of 24-h average PM2.5 EC

Statistical models

Spatio-temporal generalized additive mixed models (GAMMs) of 24-h average PM2.5 EC levels were evaluated and a final prediction model developed with the joint aims of (1) validating and calibrating the ADMS-Roads output to a traffic-related PM2.5 component, and (2) predicting 24-h average PM2.5 EC levels at unmeasured locations anywhere in the conterminous US. There were 219,300 24-h site-average PM2.5 EC measurements from 1999 to 2011 at 361 unique monitoring locations. These measurements were available from the Interagency Monitoring of Protected Visual Environments (IMPROVE) network (parameter code 88321; geographic coordinates for two sites were modified, see Supplemental Material for details) and the US Environmental Protection Agency’s Chemical Speciation Network (parameter code 88380) on every-third day after August 2001, and alternating every-third and fourth day from January 1, 1999, to that date. 89.7% of available measurements were from the IMPROVE network. PM2.5 EC measurements were approximately log-normally distributed, and had geometric mean and geometric standard deviation of 154 ng m−3 and 10, respectively (PM2.5 EC measurements ranged from 9.4 to 10,398.8 ng m−3). A map showing the locations of the monitors in the conterminous US is shown in Supplemental Fig. S3.

The spatio-temporal GAMM of 24-h average PM2.5 EC levels was similar to that presented in Yanosky et al. (2014), except there monthly averages were used; the model had the following generic form:

$$ {\displaystyle \begin{array}{l}{y}_{\mathrm{i},\mathrm{t}}=\alpha +{\alpha}_{\mathrm{t}}+{\sum}_{\mathrm{q}}{d}_{\mathrm{q}}\left({X}_{\mathrm{i},\mathrm{q}}\right)+{\sum}_{\mathrm{p}}{f}_{\mathrm{p}}\left({Z}_{\mathrm{i},\mathrm{t},\mathrm{p}}\right)+{g}_{\mathrm{t}}\left({s}_{\mathrm{i}}\right)+g\left({s}_{\mathrm{i}}\right)+{b}_{\mathrm{i}}+{e}_{\mathrm{i},\mathrm{t}};\\ {}{b}_{\mathrm{i}}\sim N\left(0,{\sigma^2}_{\mathrm{b}}\right);{e}_{\mathrm{i},\mathrm{t}}\sim N\left(0,{\sigma^2}_{\mathrm{e}\ \mathrm{t}}\right)\end{array}} $$
(3)

where yi, t is the natural-log transformed 24-h average PM2.5 EC (in ln(ng m−3)) for i = 1…I sites (I = 361) and t = 1…T 24-h time periods (T = 1555), and si is the projected spatial coordinate pair for the ith location. Xi, q are GIS-based time-invariant spatial covariates for q = 1…Q, Zi, t, p are spatio-temporal covariates for p = 1…P, and αt is a day-specific intercept that represents the adjusted mean across all sites on a given day. dqare one-dimensional penalized spline smooth functions of Q GIS-based time-invariant spatial covariates (basis dimension of seven for each); fp are one-dimensional penalized spline smooth functions of P spatio-temporal meteorological or other covariates (basis dimension of five for each). gt(si) accounts for residual spatial variability in the 24-h average values, and g(si) accounts for time-invariant spatial variability across the conterminous US, with both terms specified as spatial bivariate thin-plate penalized splines with basis dimension values: kt = It ∗ 0.9 and k = (I − Q) ∗ 0.9, respectively. The site-specific random effect bi represents unexplained site-specific variability, thus our characterization of the model as a GAMM.

As in previous work, we used a two-stage modeling approach to fit the above model (Eq. 3) (Yanosky et al. 2014). In the first stage (Eq. 4), we estimated site-specific intercepts (ui) adjusting for spatio-temporal covariates and residual spatial variability in the 24-h average values (N = 219,300). The spatial terms assumed residual spatial variation was stationarity and isotropic across the domain of the conterminous US.

The first-stage model equation was:

$$ {\displaystyle \begin{array}{l}{y}_{\mathrm{i},\mathrm{t}}={u}_{\mathrm{i}}+{\alpha}_{\mathrm{t}}+{\sum}_{\mathrm{p}}{f}_{\mathrm{p}}\left({Z}_{\mathrm{i},\mathrm{t},\mathrm{p}}\right)+{g}_{\mathrm{t}}\left({s}_{\mathrm{i}}\right)+{e}_{\mathrm{i},\mathrm{t}};\\ {}{e}_{\mathrm{i},\mathrm{t}}\sim N\left(0,{\sigma^2}_{\mathrm{e}\ \mathrm{t}}\right)\end{array}} $$
(4)

and was fit iteratively in a back-fitting arrangement with ui + αt + ∑pfp(Zi, t, p) estimated jointly and gt(si) estimated separately by day (on days where sufficient (> 12 values day−1) data were available). Variability in the measured concentrations is parsed between the covariates and the residual spatial terms. For the spatial models in the first stage, a multiplier of 1.4 (using the gamma argument to gam()) was used to avoid over-fitting (Wood 2006). All individual fits within the back-fitting were preformed using the gam()function in the mgcv package (Wood 2006) of R (R Development Core Team 2009).

In the second stage, we fit a spatial model to the \( {\widehat{u}}_{\mathrm{i}} \) terms obtained from the first-stage model. This model included terms for GIS-based time-invariant spatial covariates and residual time-invariant spatial variability. The second stage (Eq. 5) was:

$$ {\displaystyle \begin{array}{l}{\widehat{u}}_{\mathrm{i}}=\alpha +{\sum}_{\mathrm{q}}{d}_{\mathrm{q}}\left({X}_{\mathrm{i},\mathrm{q}}\right)+g\left({s}_{\mathrm{i}}\right)+{b}_{\mathrm{i}};\\ {}{b}_{\mathrm{i}}\sim N\left(0,{\sigma^2}_{\mathrm{b}}\right)\end{array}} $$
(5)

where \( {\widehat{u}}_{\mathrm{i}} \) is an estimated site-specific intercept that represents the adjusted long-term mean at each location (n = 361) and the other terms are as above. The second stage was also fit using the gam()function in the mgcv package of R.

Model predictions

We obtained model predictions by generating the covariates at locations of interest for each, and then transforming to the native scale by exponentiation. To avoid extrapolation, covariates at prediction locations beyond their range among the monitoring locations were set to the appropriate minimum or maximum among the monitoring locations across 1999–2011.

We generated estimates of uncertainty in model predictions (i.e., standard errors) on the natural-log scale using methods described previously (Paciorek et al. 2009; Yanosky et al. 2008). 95% prediction intervals on the natural-log scale were calculated and exponentiated to assess prediction interval coverage.

Model validation

10-fold out-of-sample CV was used to evaluate model predictive accuracy and thereby inform covariate selection. Monitoring sites were selected at random and assigned exclusively to one of 10 sets. Since the covariate selection process involved fitting multiple candidate models to the same data, set 10 was reserved (i.e., not used for model fitting) to assess whether the covariate selection process contributed to over-fitting (Draper and Krnjacic 2006). Data from sets one to nine were removed from the data set sequentially, with the model fit to the remaining data. Model predictions were then generated and compared to the left-out observations.

The predictive accuracy of the spatio-temporal PM2.5 EC model levels was determined from the squared Pearson correlation between the left-out observations and model predictions (CV R2), with both back-transformed to the native rather than the natural-log scale. Weekly, monthly, and annual averages were calculated after averaging left-out observations and model predictions to the corresponding averaging time. Spatial CV R2 values were calculated similarly using the means of the 24-h average values across the entire time period from 1999 to 2011. Prediction errors were calculated by subtracting left-out observations from the model predictions. Bias in model predictions was determined using the normalized mean bias factor (NMBF) (Shaocai Yu, personal communication) and the slope from major-axis linear regression (Legendre 2011) of the left-out observations against the model predictions (both on the natural-log scale). The precision of model predictions was obtained by taking the mean of the absolute value of the prediction errors (CVMAE) and using the normalized mean error factor (NMEF) (Shaocai Yu, personal communication). Formulas for the NMBF and NMEF are provided in the Supplemental Material. Bias and precision values from CV were evaluated overall, and by US Census Division, tertiles of urban land use, season, monitoring network, and monitoring objective. Two thousand seventy-three extreme values of PM2.5 EC above the 99th percentile of 1850 ng m−3 were excluded as outliers prior to calculating CV statistics.

Model development and covariate selection

To determine the best parsimonious prediction model, we fit spatio-temporal models of 24-h average PM2.5 EC levels with alternate versions of ADMS-Roads output (A1, A2, A3 and A4/A6 vs. A1–A3 only; “best-estimate” vs. “worst-case downwind”), with and without various meteorological parameters, and without ADMS-Roads output but instead with GIS-based spatial covariates for distance to nearest road by road class. The initial set of meteorological covariates was based on our earlier work (Yanosky et al. 2014). We compared the spatial CV R2 among these models, and also among those with and without GIS-based spatial covariates for county-level population density, urban land use, and elevation in the second-stage model. Covariates which were not statistically significant (p > 0.05) using the Wald tests (Wood 2006) were removed. We also evaluated whether the direction of the smooth functions met a priori expectations; however, no covariates were removed for this reason.

Results

Sensitivity of ADMS-roads output to changes in input parameters

Wind direction, which determines whether the receptor point is downwind of the source, was found to be the most influential ADMS-Roads input parameter. The next most influential input parameters, evaluated at an arbitrary point 100-m downwind of the source, were, in order of decreasing influence: wind speed, surface roughness, sensible heat flux, planetary boundary layer height, absolute value of minimum LMO, and air temperature. These findings were consistent with guidance in the ADMS-Roads User Guide.

Results of smoothing traffic intensity

The spatial smoothing GAMs of traffic intensity fit the Dynamap traffic count data reasonably well, as evidenced by model R2 values of 0.834, 0.638, 0.533, and 0.590 for A1–A4 roads, respectively (Table 2).

Table 2 Results of the external validation of the spatial prediction models of Dynamap traffic counts using ATR and WIM station traffic count data from the US DOT Bureau of Transportation Statistics’ National Transportation Atlas Database 2011A

Spatial smoothing GAMs showed generally similar spatial trends across road classes, each showing increased traffic intensity in major urban areas. However, slightly different spatial patterns were evident (Supplemental Material Fig. S4), particularly with regard to the extent of influence of major cities in surrounding suburban areas. Additional details regarding spatial trends in traffic intensity and performance of the two spatial averaging methods can be found in the Supplemental Material.

Validation of ADMS-Roads traffic intensity input data

Validation R2 values from linear regression of the combined ATR and WIM traffic counts using the spatial smoothing GAM predictions of the Dynamap traffic count data were 0.524, 0.492, and 0.305 for road classes A1–A3, respectively (Table 2). We also regressed results from the kernel density within 100-km (displayed in Supplemental Material Fig. S4 panels A1c, A2c, A3c, and A4c) approach against the combined ATR and WIM traffic count data; validation R2 values were lower as compared to the spatial smoothing GAM approach at 0.316, 0.313, and 0.189, for road classes A1–A3, respectively (Table 2). In all six regression models discussed above, each of the predicted traffic intensity terms was highly statistically significant (p < 0.001) when regressed against the combined ATR and WIM traffic count data for the corresponding road class.

Validation of ADMS-Roads output using spatio-temporal models of 24-h average PM2.5 EC

Predictive performance of spatio-temporal models of 24-h average PM2.5 EC

Results from our model comparisons showed that the spatio-temporal model that included ADMS-Roads output summed across only A1–A3 roads exhibited slightly higher predictive accuracy than one that included ADMS-Roads output summed across A1, A2, A3, and A4/A6 roads (Table 3; spatial CV R2 = 0.784 vs. 0.753, respectively). This model also performed better than the simpler model that used distance to nearest road covariates for A1–A3 roads (Table 3; spatial CV R2 = 0.753). Among only “near-major-road” monitors (those with an A1, A2, or A3 road within 300 m, which represented about 47% of all monitors), these differences were more evident (CV R2 of annual averages of 0.762 for ADMS-Roads output summed across A1–A3 roads vs. 0.689 for ADMS-Roads output summed across A1, A2, A3, and A4/A6 vs. 0.663 for distance to road covariates).

Table 3 Cross-validation statistics for the spatio-temporal prediction model (top row) and two alternate models (lower two rows) of 24-h average PM2.5 EC levels in the conterminous US from 1999 to 2011

Results also showed that the spatio-temporal model that included “best-estimate” ADMS-Roads output performed only slightly better than one with “worst-case downwind” ADMS-Roads output when comparing long-term averages (spatial CV R2 = 0. 784 vs. 0.775, respectively). In contrast, on the 24-h average level, the model with “worst-case downwind” ADMS-Roads output performed well and even slightly better than the “best-estimate” ADMS-Roads output (24-h average CV R2 = 0.539 for “worst-case downwind” vs. 0. 532 for “best-estimate”). Despite the influence of daily prevailing wind direction on ADMS-Roads output, these models were quite comparable with respect to predictive accuracy of PM2.5 EC levels, suggesting that our 24-h prevailing wind direction data from MERRA is only crudely capturing the influence of wind direction; modeling hourly wind direction may provide improvement but is currently impractical at the national scale.

Based on the results above, the final prediction model used “best-estimate” ADMS-Roads output from only road classes A1–A3. However, the model with “worst-case downwind” ADMS-Roads output would provide an acceptable alternative to the “best-estimate” model, especially for analyses focused on shorter averaging time periods. Also, we note that excluding A4 roads results in a large gain in computational efficiency.

Though predictive accuracy was moderate for PM2.5 EC at the 24-h average level (Table 3; 24-h average CV R2 = 0.532), it improved substantially for longer averaging times (CV R2 = 0.613, 0.679, 0.707, 0.795, and 0.784 for weekly, two-week, and monthly, annual-average, and spatial averages, respectively. Predictive accuracy was generally consistent across US Census Divisions (Supplemental Material Table S1). It was also reasonably consistent across tertiles of urban land use, across the four seasons, and across monitoring networks. Across monitoring objectives, predictive accuracy was slightly lower at “source-related” sites (categories defined by the US EPA; “source-related” refers to a monitor located nearby to a large point source; CV R2 of annual averages = 0.398). 95% prediction interval coverage was 0.969, as compared to a nominal value of 0.95, indicating that model standard errors though reasonably well scaled, may have been slightly underestimated.

Smooth function of ADMS-Roads output in the spatio-temporal PM2.5 EC model

In Fig. 2, the smooth function for ADMS-Roads output summed across A1–A3 roads (shown on the x-axis) increases monotonically and nearly linearly from zero to moderate levels around 500 ng m−3, and then begins to flatten. Once the ADMS-Roads output exceeds 1000 ng m−3, PM2.5 EC levels again rise, but with a shallower slope.

Fig. 2
figure 2

Plot of the smooth function of ADMS-Roads output (summed across road classes A1–A3) from the spatio-temporal PM2.5 EC model levels. Data on both axes have been transformed from the natural log to the native scale. The x-axis shows the distribution of ADMS-Roads output (summed across road classes A1–A3), at the set of monitors used for model fitting, in red

Smooth functions of other covariates in the spatio-temporal PM2.5 EC model

PM2.5 EC levels increased linearly with county-level population density (1.0° of freedom (df)). For urban land use, PM2.5 EC levels increased monotonically but not linearly (4.0 df), increasing steadily from 0 to about 40% urban land use, less steeply from 40 to about 80%, and more steeply from about 80 to 100%. For elevation, PM2.5 EC levels decreased nearly linearly from 0 to about 2000 m, and less steeply beyond. PM2.5 EC levels decreased nearly linearly with increasing wind speed, increased in an S-shaped pattern with increasing air temperature, and decreased non-linearly with increasing total rainfall. Plots of each of these smooth functions are shown in the Supplemental Material Fig. S5. Also, details regarding the df used by the spatial terms in the spatio-temporal PM2.5 EC model can be found in the Supplemental Material.

Spatial patterns in spatio-temporal PM2.5 EC model-predicted levels

Figure 3 shows predicted 24-h average PM2.5 EC levels for 2 days in 2008 and averaged across that year. Predicted PM2.5 EC levels are clearly increased near roadways in each plot.

Fig. 3
figure 3

a Predicted 24-h average PM2.5 EC levels (ng m−3), in the same domain as Fig. 1, for August 4, 2008. b Same as A but showing 24-h average levels for August 10, 2008. c Same as A but showing annual-average levels for 2008. 5th to 95th percentiles are shown for each. The prevailing wind direction from smoothed MERRA data is shown with black arrows in a and b

Figure 4 shows predicted 24-h average PM2.5 EC levels on a 6-km grid across the conterminous US for August 4 and 10, 2008, and averaged across 2008. Additional details regarding these figures and the statistical assumptions of our spatio-temporal PM2.5 EC model can be found in the Supplemental Material.

Fig. 4
figure 4

a Predicted 24-h average PM2.5 EC levels (in ng m−3) on a 6-km grid in the conterminous US (219,948 cell values each day) for August 4, 2008. b Same as a but showing 24-h average levels for August 10, 2008. c Same as a but showing annual-average levels for 2008. White points are PM2.5 EC monitors. 5th to 95th percentiles are shown for each

Discussion

Comparing the two methods of spatial averaging traffic counts, we found that spatial smoothing performed better than the kernel density approach in an external validation using automated traffic recorder data. This is likely due to the different assumptions regarding sparsity of data in space and the manner in which each are affected by extreme values.

For the spatio-temporal PM2.5 EC model, the model R2 of 0.679 indicates the model was able to explain much of the variability in measured levels. Local influences that may not be fully described by the spatio-temporal model (i.e., those not described by model covariates for traffic-related PM levels, urbanization, elevation, and local meteorology) include local wildland fires, residential wood burning, and industrial sources, all sources of EC. The spatio-temporal PM2.5 EC model does well in representing spatial trends, and especially so for averaging times longer than 24-h (CV R2 for monthly and annual averages = 0.707 and 0.795, respectively; spatial CV R2 = 0.784).

The smooth function for ADMS-Roads output (summed across road classes A1–A3) showed a non-linear but monotonically increasing relationship between that covariate and 24-h average PM2.5 EC levels, highlighting the need to use penalized spline smoothing. By including PM2.5 EC monitoring data from all sites in the conterminous US in the same spatio-temporal model, we capture the largest possible range in the ADMS-Roads output covariate.

The similarity in CV results from the spatio-temporal models with alternative representations of traffic, including those with distance to road terms instead of ADMS-Roads model output and even among only “near-major-road” monitors, suggests that micro-scale spatial gradients are a relatively small proportion of PM2.5 EC mass (albeit an arguably important one for health effects analyses). Also, spatial error induced by smoothing sparse traffic count data or imprecision in other ADMS-Roads model inputs may contribute to similarity among CV R2 values. Specifically, including the effect of wind direction from the 10-m road source points to the receptor locations improved prediction accuracy as assessed by the spatial CV R2 only slightly as compared to the “worst-case downwind” ADMS-Roads output, suggesting that MERRA 24-h prevailing wind direction data (one value per day) are of only limited use in describing primary traffic-related PM levels. Though it would represent a substantial increase in computational demand, hourly wind data, which would reflect changes in wind direction throughout the day, would likely be more realistic and may further increase predictive accuracy.

In comparison to previous studies, our methodology is similar to that used in the London Air Pollution Toolkit (Kelly et al. 2011) and in Wilton et al. (2010). However, our methodology is novel in (1) its scope of application, in that it is the first, to our knowledge, to apply a line-source Gaussian plume model to estimate traffic-related PM levels in a consistent manner across the conterminous US, and (2) its use of spatially smoothed road-class-specific traffic counts. In comparison to previous studies, the spatial distribution of long-term mean predicted PM2.5 EC in Fig. 4 is broadly similar to that presented in Bergen et al. (2013). Also, consistent with results in Wilton et al. (2010), we found that including ADMS-Roads output on smaller, lesser trafficked A4/A6 roads did not improve predictive accuracy of the spatio-temporal PM2.5 EC model.

One advantage of our spatio-temporal model is it avoids spatial discontinuities in predicted levels, such as those caused by moving-windows in Akita et al. (2012) and the use of regional terms in Yanosky et al. (2014), by including data from the conterminous US in one spatio-temporal model and by using spatially smooth covariates, with the exception of ADMS-Roads output. Though the ADMS-Roads output does contain spatial discontinuities where the traffic intensity changes categories, these were quite small (one is visible in the bottom right of each panel of Fig. 3; this is an extreme example due to the very high traffic intensity in the area shown). However, we do assume stationarity across the domain. In contrast, the moving-window approach in Akita et al. (2012) does not require this assumption.

Maintaining computational feasibility was a key challenge in the development of our traffic-related PM modeling approach. Even the spatial join to identify the 10-m road source points within 2 km of each PM2.5 monitoring location was computationally demanding, requiring specialized multiprocessing software code to be implemented. Categorization of the several ADMS-Roads inputs was necessary to make computation possible, and to provide a balance of complexity and computational feasibility.

Our models will be used to provide estimates of exposure to traffic-related PM and to PM2.5 EC in epidemiologic analyses. Despite advantages over simpler exposure metrics, our traffic-related PM modeling approach has several limitations. One is that annual-average traffic counts do not explain all of the daily variability in traffic intensity; another is that it relies on spatially averaged traffic counts rather than data on each specific road segment. Finely time-resolved (e.g., daily) traffic counts for each road segment in the road network are not currently available. To some extent, this is addressed by performing spatial smoothing separately for each road class. However, traffic count data are spatially incomplete in that not every segment has a traffic count, so error from spatial interpolation is unavoidable. Other spatial averaging methods are also vulnerable to this problem, such as when imputing the mean traffic count based on county boundaries. We note that though our traffic intensity inputs vary smoothly in space, the resulting ADMS-Roads output is not smooth in space due to our application of kernels to the 10-m road source points; therefore, micro-scale detail, down to the resolution of a few meters, is retained. Another limitation is that the approach does not account for bridges, street canyons, tunnels, or other elevation-related effects or for differences in emission factors between light-duty and heavy-duty vehicles, the latter because traffic counts segregated by light- and heavy-duty vehicles were unavailable. Finally, meteorological data were averaged to the daily level and, as discussed above for wind direction, do not reflect hourly variation in meteorological parameters. Other minor limitations include small discontinuities induced by categorization of the input parameters and use of assumed road widths and vehicle speeds. However, the inputs were finely divided into a large number of categories to minimize spatial discontinuities, and assumed road widths and vehicle speeds are likely most influential within 10–20 m of the road centerline, an area where relatively few residences exist. Finally, despite the simplifying assumptions discussed above, the approach presents significant computational challenges with respect to processing time, memory, and disk space and requires parallel computing in order to be implemented on large numbers of locations.

Our spatio-temporal PM2.5 EC model also has several limitations. One is that the data set does not contain multiple monitors within small areas with monitors near and far from major road, data that could help distinguish micro-scale effects, including that of wind direction, from larger spatial scale regional effects. Also, the model makes several statistical assumptions (discussed in the Supplemental Material), such as stationary and isotropic spatial variation, additive covariate effects, and independent and normally distributed model residuals with mean zero and constant variance. Finally, a limitation of all models of exposure to combustion-produced PM2.5 is the combination of continually evolving engine modifications, after-treatment technologies, and fuel blends. The adoption of diesel particulate filters for post-2007 heavy-duty compression ignition engines used in on-road (highway) trucks and busses and reduced sulfur content in diesel fuel have and will continue to reduce particulate emissions from new vehicles.

This work emphasizes the need for spatially accurate, consistently measured, nationwide, and time-resolved information on traffic intensity for each road segment (preferably distinguishing between light-and heavy-duty vehicles). Also, the siting of additional monitoring sites near existing PM2.5 speciation monitors but in locations nearer to large roadways, including those with “population exposure” and “background” monitoring objectives, would be valuable for future development of predictive models of traffic-related air pollutant levels.

Conclusions

Our approach provides estimates of primary traffic-related PM levels based on line-source Gaussian plume dispersion model output. The methodology can provide high spatial resolution (interpolated from a 2-m grid) estimates of traffic-related PM (both “best-estimate” values that incorporate wind direction and “worst-case downwind” values) across the conterminous US from 1999 to 2011 which account for (1) spatially varying traffic intensity (specific to each road class) and (2) spatio-temporal covariates describing daily local meteorology. In contrast, distance to road terms, often used to represent near-road traffic-related air pollution levels, do not account for differences in traffic intensity (or at least do not do so within a given road class) or daily local meteorology. Our results demonstrate that ADMS-Roads output describes near-road primary traffic-related PM levels with greater detail and predictive accuracy as compared to distance to road terms in a spatio-temporal PM2.5 EC model; this model had moderate prediction accuracy at the 24-h average level (CV R2 = 0.532), but better performance when longer-term averages were evaluated (CV R2 of annual averages = 0.795 and spatial CV R2 = 0.784). Caution is warranted when using ADMS-Roads output as a separate exposure metric because of the non-linear shape of the smooth function in the spatio-temporal PM2.5 EC model levels (Fig. 2). The spatio-temporal model effectively calibrates the ADMS-Roads output to measured PM2.5 EC levels; predictions from it represent an improved exposure metric for epidemiologic analyses of PM as compared to distance to road terms.