Air Quality, Atmosphere & Health

, Volume 10, Issue 4, pp 411–420 | Cite as

Ecological bias in environmental health studies: the problem of aggregation of multiple data sources

  • Rakefet Shafran-Nathan
  • Ilan Levy
  • Noam Levin
  • David M. Broday


Ecological bias may result from interactions between variables that are characterized by different spatial and temporal scales. Such an ecological bias, also known as aggregation bias or cross-level-bias, may occur as a result of using coarse environmental information about stressors together with fine (i.e., individual) information on health outcomes. This study examines the assumption that distinct within-area variability of spatial patterns of the risk metrics and confounders may result from artifacts of the aggregation of the underlying data layers, and that this may affect the statistical relationships between them. In particular, we demonstrate the importance of carefully linking information layers with distinct spatial resolutions and show that environmental epidemiology studies are prone to exposure misclassification as a result of statistically linking distinctly averaged spatial data (e.g., exposure metrics, confounders, health indices). Since area-level confounders and exposure metrics, as any other spatial phenomena, have characteristic spatiotemporal scales, it is naively expected that the highest spatial variability of both the SES ranking (confounder) and the NOx concentrations (risk metric) will be obtained when using the finest spatial resolution. However, the highest statistical relationship among the data layers was not obtained at the finest scale. In general, our results suggest that assessments of air quality impacts on health require data at comparable spatial resolutions, since use of data layers of distinct spatial resolutions may alter (mostly weaken) the estimated relationships between environmental stressors and health outcomes.


Data aggregation Socioeconomic status (SES) NOx Modifiable areal unit problem (MAUP) Ecological fallacy 



This research was supported by the Technion Center of Excellence in Environmental Sciences and Environmental Health (TCEEH). We thank Yuval, Y. Etzion, and B. Fishbain for fruitful discussions.


  1. Amster ED, Haim M, Dubnov J, Broday DM (2014) Contribution of nitrogen oxide and sulfur dioxide exposure from power plant emissions on respiratory symptom and disease prevalence. Environ Pollut 186:20–28CrossRefGoogle Scholar
  2. Apparicio P, Abdelmajid M, Riva M, Shearmur R (2008) Comparing alternative approaches to measuring the geographical accessibility of urban health services: distance types and aggregation-error issues. Int J Health Geogr 7:7CrossRefGoogle Scholar
  3. Beelen R, Hoek G, Vienneau D, Eeftens M, Dimakopoulou K, Pedeli X, Eriksen KT (2013) Development of NO2 and NOx land use regression models for estimating air pollution exposure in 36 study areas in Europe—the ESCAPE project. Atmos Environ 72:10–23CrossRefGoogle Scholar
  4. Bell ML, Ebisu K (2012) Environmental inequality in exposures to airborne particulate matter components in the United States. Environ Health Perspect 120:1699–1704Google Scholar
  5. Cyril S, Oldroyd JC, Renzaho A (2013) Urbanisation, urbanicity, and health: a systematic review of the reliability and validity of urbanicity scales. BMC Public Health 13:513CrossRefGoogle Scholar
  6. Diez Roux AV, Mair C (2010) Neighborhoods and health. Ann N Y Acad Sci 1186:125–145CrossRefGoogle Scholar
  7. Dormann FC, McPherson MJ, Araújo BM, Bivand R, Bolliger J, Carl G, Davies GR, Hirzel A, Jetz W, Daniel Kissling W, Kühn I, Ohlemüller R, Peres-Neto RP, Reineking B, Schröder B, Schurr MF, Wilson R (2007) Methods to account for spatial autocorrelation in the analysis of species distributional data: a review. Ecography 30:609–628CrossRefGoogle Scholar
  8. Duncan D (1995) Multiple range and multiple F tests. Biometrics 11:1–42CrossRefGoogle Scholar
  9. Eitan O, Yuval, Barchana M, Dubnov J, Linn S, Carmel Y, Broday DM (2010) Spatial analysis of air pollution and cancer incidence rates in Haifa Bay, Israel. Sci Total Environ 408:4429–4439CrossRefGoogle Scholar
  10. Fekete A, Damm M, Birkmann J (2010) Scales as a challenge for vulnerability assessment. Nat Hazards 55:729–747CrossRefGoogle Scholar
  11. Flowerdew R, Manley DJ, Sabel CE (2008) Neighborhood effects on health: does it matter where you draw the boundaries? Soc Sci Med 66:1241–1255CrossRefGoogle Scholar
  12. Goodman A, Wilkinson P, Stafford M, Tonne C (2011) Characterising socio-economic inequalities in exposure to air pollution: a comparison of socio-economic markers and scales of measurement. Health and Place 17:767–774CrossRefGoogle Scholar
  13. Greenland S, Morgenstern H (1989) Ecological bias, confounding, and effect modification. Int J Epidemiol 18:269–274, Erratum in Int J Epidemiol 1991 20(3):824CrossRefGoogle Scholar
  14. Gryparis A, Paciorek CJ, Zeka A, Schwartz J, Coull BA (2009) Measurement error caused by spatial misalignment in environmental epidemiology. Biostatistics 10:258–274CrossRefGoogle Scholar
  15. Hoskins AJ, Bush A, Gilmore J, Harwood T, Hudson LN, Ware C, Williams KJ, Ferrier S (2016) Downscaling land‐use data to provide global estimates of five land‐use classes. Ecology Evolution 6:3040–3055CrossRefGoogle Scholar
  16. Idrovo AJ (2011) Three criteria for ecological fallacy. Environ Health Perspect 119:A332CrossRefGoogle Scholar
  17. Jerrett M, Gale S, Kontgis C (2010) Spatial modeling in environmental and public health research. Int J Environ Res Public Health 7:1302–1329CrossRefGoogle Scholar
  18. Levy I, Levin N, Yuval, Schwartz JD, Kark JD (2015) Back-extrapolating a land use regression model for estimating past exposures to traffic-related air pollution. Environ Sci Technol 49:3603–3610CrossRefGoogle Scholar
  19. Lovasi GS, Moudon AV, Smith NL, Lumley T, Larson EB, Sohn DW, Siscovick DS, Psaty BM (2008) Evaluating options for measurement of neighborhood socioeconomic context: evidence from a myocardial infarction case–control study. Health Place 14:453–467CrossRefGoogle Scholar
  20. Maantay J (2007) Asthma and air pollution in the Bronx: methodological and data considerations in using GIS for environmental justice and health research. Health Place 13:32–56CrossRefGoogle Scholar
  21. Myers V, Broday DM, Steinberg DM, Yuval, Drory Y, Gerber Y (2013) Exposure to particulate air pollution and long-term incidence of frailty after myocardial infarction. Ann Epidemiol 23:395–400CrossRefGoogle Scholar
  22. Openshaw S (1994) The modifiable area unit problem. Concepts Tech Mod Geogr 38:1–41Google Scholar
  23. Padilla CM, Deguen S, Lalloue B, Blanchard O, Beaugard C, Troude F, Navier DZ, Vieira VM (2013) Cluster analysis of social and environment inequalities of infant mortality. A spatial study in small areas revealed by local disease mapping in France. Sci Total Environ 454–455:433–441CrossRefGoogle Scholar
  24. Parenteau M-P, Sawada MC (2011) The modifiable areal unit problem (MAUP) in the relationship between exposure to NO2 and respiratory health. Int J Health Geogr 10:58CrossRefGoogle Scholar
  25. Poortinga W, Dunstan FD, Fone DL (2008) Neighbourhood deprivation and self-rated health: the role of perceptions of the neighbourhood and of housing problems. Health Place 14:562–575CrossRefGoogle Scholar
  26. Ryan PH, LeMasters GK, Levin L, Burkle J, Biswas P, Hu S, Grinshpun S, Reponen T (2008) A land-use regression model for estimating microenvironmental diesel exposure given multiple addresses from birth through childhood. Sci Total Environ 404:139–147CrossRefGoogle Scholar
  27. Shaddick G, Lee D, Wakefield J (2013) Ecological bias in studies of the short-term effects of air pollution on health. Int J Appl Earth Obs Geoinf 22:65–74CrossRefGoogle Scholar
  28. Sheppard L, Burnett RT, Szpiro AA, Kim S-Y, Jerrett M, Pope CA, Brunekreef B (2012) Confounding and exposure measurement error in air pollution epidemiology. Air Qual Atmosphere Health 5:203–216CrossRefGoogle Scholar
  29. Stafford M, Duke-Williams O, Shelton N (2008) Small area inequalities in health: are we underestimating them? Soc Sci Med 67:891–899CrossRefGoogle Scholar
  30. Wakefield J (2008) Ecologic Studies Revisited. Annu Rev Public Health 29:75–90CrossRefGoogle Scholar
  31. Wakefield J, Shaddick G (2006) Health-exposure modeling and the ecological fallacy. Biostatistics 7:438–455CrossRefGoogle Scholar
  32. Yuval, Broday DM (2006) High-resolution spatial patterns of long-term mean concentrations of air pollutants in Haifa Bay area. Atmos Environ 40(20):3653–3664.CrossRefGoogle Scholar
  33. Zou B, Wilson JG, Zhan FB, Zeng Y (2009) Air pollution exposure assessment methods utilized in epidemiological studies. J Environ Monit 11:475CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media Dordrecht 2016

Authors and Affiliations

  • Rakefet Shafran-Nathan
    • 1
  • Ilan Levy
    • 1
  • Noam Levin
    • 2
  • David M. Broday
    • 1
  1. 1.Environmental, Water and Agricultural Engineering, Faculty of Civil and Environmental EngineeringTechnion, Israel Institute of TechnologyHaifaIsrael
  2. 2.Department of Geography, Faculty of Social SciencesThe Hebrew University of JerusalemJerusalemIsrael

Personalised recommendations