The WEB study included women who were aged 35–79 and residents of Erie and Niagara Counties in western New York State including the cities of Buffalo and Niagara Falls. Included were women with no previous history of cancer other than non-melanoma skin cancer. Details of the study have been described elsewhere [18, 19]. Briefly, cases had incident, primary, histologically-confirmed breast cancer. Controls were randomly selected from the residents of Erie and Niagara Counties and frequency-matched to cases on age and race. Those between the ages of 35 and 64 were selected from the New York State Department of Motor Vehicle list. Those 65 and over were identified through Health Care Finance Administration records. Among those determined eligible, response rates for the study were 72% (cases) and 63% (controls). All participants provided written informed consent, and the study protocol was approved by the Institutional Review Boards of the University at Buffalo and of all the participating hospitals.
Due to differences in risk factors for breast cancer by menopausal status, all analyses were stratified by menopausal status. In our study, a woman was defined as postmenopausal if her menses had ceased permanently and naturally. Also considered postmenopausal were women on hormone replacement therapy who were over age 55, women who had had a bilateral oophorectomy, women who had had a hysterectomy and were older than 50, women whose menses had ceased permanently due to radiation or other medical treatment and were older than 55, and women over 55 whose menses had not ceased permanently.
Data collection
In-person interviews were used to collect data on potential breast cancer risk factors. Included were race, age, education, body mass index (BMI = weight (kg)/height (m)2), smoking history, usual diet during the period 12–24 months prior to interview, medical history, reproductive history, family history of breast cancer, previous benign breast disease, and occupational history.
Self-reported lifetime residential histories were also collected. Participants listed each of their residences for their entire lives, providing the addresses and the time periods when they lived at those addresses. For the addresses with incomplete information, an extensive search of available records, including the Polk Directory, was conducted to find as much missing data as possible.
Addresses were geocoded using ArcView 3.2 (ESRI, Inc., Redlands, CA), with GDT/Dynamap 2000 (GDT, Inc., Lebanon, NH) as the reference theme. ZP4 (Semaphore Co., Aptos, CA) software was used to correct and update the zip code for each address before the geocoding process. Geocoding was limited to participants’ addresses in Erie and Niagara Counties. Our previous validation study showed good positional accuracy of the geocoded addresses in comparison to a global positioning system unit used as the gold standard to measure the latitude and longitude of the locations [20].
Exposure assessment
A geographic traffic exposure model, the Buffalo version of a model developed for the Long-Island Breast Cancer Study Project, was used to estimate historical residential exposure to traffic emission. The model estimated PAH exposure using benzo[a]pyrene (BaP) as a surrogate for total PAH exposure. This emissions and meteorological dispersion model, along with its validation and calibration, has been described in detail previously [21, 22]. The data used for validation and calibration included PAH measurements carried out on a subset of study subjects, e.g., soil and carpet BaP concentrations, and PAH-DNA adducts in study subjects’ blood, as well as measurements of carbon monoxide (CO) at an U.S. Environmental Protection Agency monitoring station. The authors found that, in three out of four of these comparisons, the model successfully predicted the relevant measurement [21, 22].
For use of the model in the Erie and Niagara Counties region, Long-Island data were replaced with region-specific meteorological and traffic data. Meteorological data were obtained from the National Climatic Data Center. The numbers of vehicles on roads in the two counties were obtained from the Greater Buffalo-Niagara Regional Transportation Council (GBNRTC) for the years from 1971 to 2002, and from the New York State Department of Transportation (NYSDOT) for the years from 1960 to 1975.
The traffic exposure model entails the choice of a scale factor corresponding to higher emissions at intersections, where vehicles are accelerating and decelerating [21]. This factor was obtained by calibrating the model to Erie and Niagara Counties regional air pollution data (carbon monoxide) collected by the U.S. Environmental Protection Agency. Carbon monoxide air concentration is highly correlated with PAH air concentration in cities (R
2 = 0.5–0.8) [21].
With the region specific adjustments in place, the meteorological dispersion model was applied to estimate traffic emissions, particularly PAHs, emitted along the 54,494 road segments in the two study counties, producing exposure estimates for each residence for the participants specific to the time period of interest. Emission data per road segment were derived from historical data obtained for tailpipe emissions (Beyea J, Hatch M, Stellman SD, Gammon MD, unpublished data) and for number of vehicles on roads. The model produces relative rather than absolute estimates of exposure because the former are less sensitive to uncertainties in model parameters and because the model cannot really distinguish between traffic PAH pollutants and co-pollutants.
Statistical methods
Of the total of 1,170 cases and 2,116 controls in the WEB study, 1,068 cases and 1,944 controls provided information on lifetime residential history. There were a total of 20,862 individual addresses, among which 15,969 (77%) were within Erie and Niagara Counties. This study was limited to those residences within Erie and Niagara Counties, with adequate information for geocoding and with consistent information on the year the participant moved in and out of the residential location. For these studies of traffic emissions, we were limited to historical data beginning in 1960; therefore, we limited our analysis here to residences in which study participants lived during or after 1960. Analyses were done separately for each of the time windows, i.e., at menarche, at the time of a woman’s first birth, and 20 and 10 years prior to interview. We were not able to examine risk in relation to exposure at the time of the woman’s own birth, because of these limitations in the availability of historical data.
To describe the distribution of the studied variables, means and standard deviations (SDs) were calculated for the continuous variables for cases and controls, and t-tests were used to compare means. χ2 tests were used for comparisons of categorical variables. Since the distribution of traffic emissions was skewed, all these values were natural log transformed. These exposure estimates were categorized into quartiles based on the distribution of traffic emissions among controls. The cutoffs of the traffic emissions varied based on menopausal status and time windows examined, i.e., premenopausal women at menarche analysis (7.65, 8.36, and 8.84), premenopausal women at first birth analysis (6.42, 7.41, 8.16), postmenopausal women at first birth analysis (7.57, 8.35, 8.76), premenopausal women at 20 years prior analysis (7.03, 7.73, 8.11), postmenopausal women at 20 years prior analysis (6.94, 7.72, 8.14), premenopausal women at 10 years prior analysis (5.40, 6.15, 6.75), and postmenopausal women at 10 years prior analysis (5.49, 6.29, 6.82). Unconditional logistic regression was used to calculate odds ratios (OR) and 95% confidence intervals (CI). To test for linear trend, we also examined a model with traffic emissions entered as a continuous variable. Breast cancer risk factors were adjusted for in the model, including the matching factors of age and race. Also included were education, BMI, age at menarche, age at menopause (for postmenopausal women only), age at first birth, number of births, family history of breast cancer, and previous benign breast disease. In addition to matching variables, i.e., age and race, a reduced model including education, age at first birth and year at interview was determined by removing covariates that did not alter the OR by more than 10%. To test potential effect modification, analyses stratified by smoking status, and estrogen receptor (ER) and progesterone receptor (PR) status were also performed.