Air Pollution and Population Exposure

Venter, Zander S; Chowdhury, Sourangsu

doi:10.1007/978-3-031-26588-4_35

21k Accesses

Abstract

After high blood pressure and smoking, air pollution is the third-largest risk factor for death globally (Murray et al. in Lancet 396:1223–1249, 2020). Air pollution can therefore be described as a global “pandemic” that should arguably be monitored and addressed with the same intensity with which the COVID-19 pandemic has been. Remote sensing and cloud computing technologies allow us to do so.

You have full access to this open access chapter, Download chapter PDF

FormalPara Overview

After high blood pressure and smoking, air pollution is the third-largest risk factor for death globally (Murray et al. 2020). Air pollution can therefore be described as a global “pandemic” that should arguably be monitored and addressed with the same intensity with which the COVID-19 pandemic has been. Remote sensing and cloud computing technologies allow us to do so.

The purpose of this chapter is to explore and analyze gridded air pollution data from Sentinel-5P in the context of changes brought about by COVID-19 lockdowns. Practical components will include analyzing changes in nitrogen dioxide (NO₂) concentrations over time and quantifying population-weighted NO₂ concentrations for selected administrative units.

FormalPara Learning Outcomes

Understanding Sentinel-5P data.
Quantifying changes in air pollutant concentrations over time.
Generating a split-panel map to compare two time epochs.
Calculating population-weighted air pollutant concentrations.

Helps if you know how to

Import images and image collections, filter, and visualize (Part I).
Create a graph using ui.Chart (Chap. 4).
Perform basic image analysis: select bands, compute indices, create masks (Part II).
Use ee.Reducer functions to summarize pixels over an area (Chaps. 8 and 9).
Write a function and map it over an ImageCollection (Chap. 12).
Mask cloud, cloud shadow, snow/ice, and other undesired pixels (Chap. 15).
Design user interfaces for an Earth Engine App (Chap. 30).

1 Introduction to Theory

Air pollution can be generally defined as any chemical, physical, or biological agent that alters the natural composition of the atmosphere. Pollutants that are of primary concern for public health include particulate matter with diameter less than 2.5 μm (PM_2.5), carbon monoxide (CO), ozone (O₃), NO₂, and sulfur dioxide (SO₂). Globally, chronic exposure to air pollution results in greater loss of life than HIV/AIDS, malaria, and tuberculosis combined, and more than an order of magnitude more deaths than all forms of violence (Lelieveld et al. 2020). Exposure to PM_2.5 and O₃ is estimated to result in ~ 4.7 million excess deaths annually across the globe (Murray et al. 2020), although these estimates range between 3 and 10 million excess deaths per year, based on the disease categories considered and the exposure–response function used (Burnett et al. 2018; Chowdhury et al. 2022). Exposure to NO₂ may result in 4 million new pediatric asthma cases annually (Achakulwisut et al. 2019).

Knowledge about the global distribution of these air pollutants and their sources has improved over the last decade, with the expansion of networks of ground-based monitors in many countries, the evolution of satellite products, and the advancement of complex atmospheric chemistry models. Studies have found that more than 70% of the global health burden from air pollution is attributable to anthropogenic emissions (Chowdhury et al. 2022; Lelieveld et al. 2019). The main anthropogenic sources of air pollution are industries, motor vehicles, power generation, agricultural activities, and household combustion, while non-anthropogenic sources include desert dust, biogenic emissions, forest fires, and even volcanoes. The reduction in transport and industrial activity during the COVID-19 lockdowns significantly reduced global air pollution levels, thereby highlighting the significance of anthropogenic emissions (Venter et al. 2020). In fact, it is estimated that the decline in air pollution during the first five months of 2020 resulted in 49,900 avoided deaths and 89,000 fewer pediatric asthma emergency room visits (Venter et al. 2021).

Despite the recent growth in monitoring networks, the air in most regions of Earth is insufficiently monitored, limiting air quality management. Given the paucity of ground-based monitoring, alternative monitoring approaches such as satellite remote sensing are gaining popularity and becoming more accurate (e.g., Griffin et al. 2019). Over the past few decades, we have had increasing access to a range of satellite sensors that monitor the contents of Earth’s atmosphere. However, it is important to note that satellites measure pollutant concentrations in the troposphere and stratosphere, which extend for many kilometers above the Earth’s surface. As a result, satellite measurements are not necessarily representative of the concentrations humans are exposed to on the ground, and consequently, relying on satellite data alone for human health applications is not advised. However, more sophisticated methods combine information from satellite remote sensing data, complex atmospheric chemistry models, and ground-based monitors to provide ground-level concentrations of pollutants with high confidence (Dey et al. 2020, Donkelar et al. 2021).

2 Practicum

2.1 Section 1: Data Importing and Cleaning

There is a range of satellite-based datasets on air pollution to choose from in the Earth Engine Data Catalog. The main datasets relevant to air pollution include the Moderate Resolution Imaging Spectroradiometer and Advanced Very-High-Resolution Radiometer for monitoring aerosol optical depth (a proxy for PM_2.5); the Total Ozone Mapping Spectrometer Ozone Monitoring Instrument for monitoring O₃; and more recently the TROPOspheric Monitoring Instrument (TROPOMI) on board the Sentinel-5 Precursor (Sentinel-5P), which monitors a range of air pollutants. We will use Sentinel-5P in this practicum, but the methods covered here are easily transferable to the datasets mentioned above.

Now, let’s load the satellite data for this practicum. If you search “tropomi” in the Earth Engine Data Catalog, you will see a range of datasets from Sentinel-5P, which can all be of value in quantifying air quality (Fig. 35.1).

A catalog for the search results matching tropomi exhibits 5 results for sentinel 5 P O F F L. — **Fig. 35.1**

Although Sentinel-5 was launched in October 2017, the data available for analysis in Earth Engine are from July 2018 onward. TROPOMI, the sensor on board the satellite, is a spectrometer sensing ultraviolet, visible, near-infrared, and shortwave infrared wavelengths to monitor NO₂, O₃, aerosol, methane (CH₄), formaldehyde, CO, and SO₂ in the atmosphere. The swath width of TROPOMI is approximately 2600 km on the ground, resulting in a global daily coverage with a spatial resolution of 7 × 7 km. All of the Sentinel-5P datasets, except CH₄, have two versions: Near Real-Time (NRTI) and Offline (OFFL); CH₄ is available as OFFL only. The NRTI assets cover a smaller area than the OFFL assets but appear more quickly after acquisition. The OFFL assets have a delayed availability, but each asset contains data from an entire orbit and is arguably easier to work with for retrospective analyses. We will use the OFFL NO₂ product in this practicum.

First we need to define an area of interest. Wuhan is infamous for being the epicenter of the COVID-19 pandemic and witnessed severe lockdowns. In the next section of this practicum, we will test to see if we can detect a reduction in NO₂ during the early 2020 lockdowns in the surrounding province, Hubei. To start, in the code below, we import a global dataset of administrative boundaries and filter them for intersection with an ee.Geometry.Point object, which appears under the Imports section at the top of your script. This geometry has to be drawn with the drawing tool and can be moved to a new location to rerun the analysis for that administrative boundary.

After centering the Map on Hubei Province, we will import a population dataset, which is necessary for calculating population-weighted exposures in Sect. 3 of this practicum. We will use the Gridded Population of the World dataset for 2020, which includes a total population count per ~ 1 × 1 km grid (Fig. 35.2).

A google satellite map for the Hubei province indicates the population density. The map has a darker region surrounded by a bright region with higher population counts. — **Fig. 35.2**

A 29-line of script exhibits steps from importing a global dataset of administrative units level 1 to adding it to the map to see the population distribution.

Question 1. There are two other datasets of gridded population in the Earth Engine Data Catalog, namely WorldPop and Global Human Settlement Layers. Use the search bar to find them and add them to the map to compare them with the Gridded Population of the World dataset. Which one looks more realistic in your opinion, and why?

Now it is time to import the NO₂ data. As with most optical satellite data, there can be things in the atmosphere that contaminate the signal from the object or chemical you want to measure. Clouds are a common issue for land surface reflectance products (Chap. 15), and they are also an issue when trying to measure air pollutant concentrations. In the code below, we create a function to mask out pixels with a cloud fraction above 0.3 (i.e., 30% cloud cover). You can test different masking thresholds to see what suits your use case best. After masking out cloudy pixels, we create a median composite from images during March 2021. It is important to note that we are working with the band that gives measurements for the tropospheric vertical column of NO₂ and not the stratospheric or total vertical column. The troposphere is the closest we can get to ground-level measurements with Sentinel-5P. The median image for March 2021 should look like the map shown in Fig. 35.3.

A satellite map with a heat map exhibits tropospheric N O 2 concentrations over Hubei province. — **Fig. 35.3**

A 32-line script exhibits steps from importing the sentinel 5 P N O 2 offline product to visualizing the median N O 2.

Code Checkpoint A14a. The book’s repository contains a script that shows what your code should look like at this point.

2.2 Section 2: Quantifying and Visualizing Changes

Next we will test to see if we can visualize a change in NO₂ concentrations during the 2020 COVID-19 lockdowns. We will compare the median NO₂ concentration during March 2020 (during which Hubei Province was in lockdown) with the median value during March 2019.

Weather can significantly affect air pollutant concentrations (e.g., wind causing long-range transport of smoke), and therefore differences between 2020 and 2019 could be an artifact of differences in weather. By comparing the same month in different years, we partly control for the effects of seasonal weather patterns, but not completely. If you would like to control for weather effects more thoroughly, see Venter et al. (2020) for details. In the code below, we calculate and visualize median composite images for March 2019 and March 2020. The visualization makes use of Earth Engine’s comprehensive library of user-interface widgets (see Chap. 30 for more details). Specifically, we use the ui.SplitPanel widget to compare the two median composites side by side (Fig. 35.4). This widget can be set to have a wiping effect where maps are overlaid on top of one another, or a side-by-side comparison.

A satellite map with a heat map has 2 panels. The left panel is for baseline 2019 and the right panel is for lockdown 2020. The map exhibit data for tropospheric N O 2 concentrations over Hubei province. — **Fig. 35.4**

A 34-line of script exhibits steps from defining a lockdown N O 2 median composite to making a function to add a label with fancy styling.

Question 2. Comparing the two maps in the split-panel map, do you find a reduction in NO₂ concentrations during the lockdown? Where is the change in NO₂ concentrations most significant?

Question 3. How are changes in NO₂ concentrations related to population density? To help answer this question, you can (1) create a difference image by subtracting the no2Lockdown image from the no2Baseline image, (2) create a new ui.Map.Layer for the difference image and the population image created in Sect. 35.1, and (3) add these to the left or right map. Hint: You can change the opacity of the NO₂ layers to aid interpretability.

Exploring the differences in NO₂ concentrations as ee.Image objects can be visually informative, but quantifying the changes for specific regions requires further work. In the code below, we calculate the mean NO₂ concentrations for Hubei Province by applying a reduceRegion function to each image in the March 2019 and March 2020 collections. The resulting time series are visualized in the chart shown in Fig. 35.5.

A graph for the baseline versus lockdown N O 2 for the study region by D O Y calculates the D O Y time series for mean N O 2 during March 2019 baseline and 2020 lockdown. The peak for baseline is around 70.3 for D O Y and in lockdown, the peak is around 80.5. — **Fig. 35.5**

A 24-line of script exhibits steps from creating the baseline map layer, adding it to the left map, and adding the label to reset the map interface with the split panel widget.

Code Checkpoint A14b. The book’s repository contains a script that shows what your code should look like at this point.

2.3 Section 3: Calculating Population-Weighted Concentrations

In Sect. 35.2, we used the ee.Reducer.mean reducer in the reduceRegion function to get the average NO₂ concentration over Hubei Province. However, when aggregating pollutant concentrations to define population exposure, we need a different approach. Imagine there was a large concentration of NO₂ in a rural area in the east of Hubei Province where very few people live. If we simply calculated the average of all pixels, this rural NO₂ anomaly would skew our representation of population exposure. Using the population number dataset imported in Sect. 35.1, we can calculate the population-weighted exposure ($Exp$) aggregated across $n$ pixels in the area of interest (in this case, Hubei Province) using Eq. A1.4.1 below, where $C_{i}$ is the NO₂ concentration and $P_{i}$ is the subpopulation in pixel $i$.

$$ {\text{Exp }} = \mathop \sum \limits_{i}^{n} \frac{{P_{i} }}{{\mathop \sum \nolimits_{i}^{n} \left( P \right)}} \cdot C_{i} $$

(35.1)

In the code below, we map a function to calculate population-weighted exposure over all the images in the NO₂ ImageCollection. Remember that in Sect. 35.1 we masked out pixels from images that had a cloud cover value greater than 30%. Therefore, an important step in this function is to calculate the percentage of available Sentinel-5P pixels within Hubei Province per image. We need to decide what percentage pixel coverage is enough to calculate a representative average for the province. Here we choose 25% for illustrative purposes, but depending on your research question, you may want to calculate averages only when you have 100% coverage by/from Sentinel-5P that is free of clouds. The contrast between the simple average and population-weighted average is shown in Fig. 35.6. The difference may appear small in this case, but when aggregating over larger areas with greater variation in population density, population-weighted averages can be very different from simple averages.

A graph for the raw versus population-weighted N O 2 for the study region with time series for man N O 2 and the pop-weighted N O 2. The data exhibits peaks around March 20, 2020, for both. — **Fig. 35.6**

A 20-line of script exhibits steps from creating a function to get the mean N O 2 for the study region to return a feature with N O 2 concentration and day-of-year properties.

A 19-line of script exhibits steps from getting the concentrations for a baseline and lockdown collection to printing it to the console.

Finally, although we can plot this data in Earth Engine, it is often easier to process with other statistical software, such as R or Python. So, to conclude, let us code for exporting time series of population-weighted averages for more than one area of interest (in this case, administrative units). In the code below, we map the function over two regions and then export the resulting table as a CSV file to Google Drive.

A 33-line of script exhibits steps from defining the spatial resolution of the population data to summing the e x p over the region.

Code Checkpoint A14c. The book’s repository contains a script that shows what your code should look like at this point.

3 Synthesis

In this practicum, we focused on a particular pollutant (NO₂), region (Hubei), and time period (March 2019 and March 2020). To reinforce your comprehension and understanding, consider the following assignments.

Assignment 1. How would you run this analysis for a different pollutant? Try substituting the NO₂ collection with the Sentinel-5P NRTI SO₂ collection. Hint: The main emission source for SO₂ is electricity generation, for which coal is the most significant fuel. Use this information to inform your selection of a location and time period so that you can detect interesting changes.

Assignment 2. How would you run this analysis for a different geographic area? Try deleting the ee.Geometry.Point at the top of your script and using the Geometry Tools to digitize your own point on which to focus the analysis. If you are running the latter part of the script, you can also change the list of named administrative units. Hint: Add the adminUnits object from Sect. 35.1 of the code to the map. You can use the Inspector tab to click on polygons and get the name of the administrative unit under the ‘ADM1_NAME’ property.

Assignment 3. Finally, try changing the dates in the script so that you are comparing two different time periods. Remember that the Sentinel-5P data are available from July 2018 onward; defining dates before this will cause the script to throw an error.

4 Conclusion

In this chapter, we covered the basics of importing Sentinel-5P air pollution data, comparing changes over time, and calculating population-weighted averages for spatial units. Satellite detection of air pollutants is an important tool for monitoring air quality from local to global scales, but ground-station measurements and atmospheric modeling are often necessary to draw conclusions about human health risk. The fusion of ground-level and satellite data with advanced machine learning models to map and forecast air pollution is a growing research field with important societal applications (e.g., https://www.iqair.com/). Earth Engine is a well-suited and currently underutilized resource to advance this field.

References

Achakulwisut P, Brauer M, Hystad P, Anenberg SC (2019) Global, national, and urban burdens of paediatric asthma incidence attributable to ambient NO₂ pollution: estimates from global datasets. Lancet Planet Heal 3:e166–e178. https://doi.org/10.1016/S2542-5196(19)30046-4
Article Google Scholar
Benedetti A, Morcrette J-J, Boucher O, et al (2009) Aerosol analysis and forecast in the European centre for medium-range weather forecasts integrated forecast system: 2. Data assimilation. J Geophys Res Atmos 114https://doi.org/10.1029/2008JD011235
Burnett R, Chen H, Szyszkowicz M et al (2018) Global estimates of mortality associated with long-term exposure to outdoor fine particulate matter. Proc Natl Acad Sci USA 115:9592–9597. https://doi.org/10.1073/pnas.1803222115
Article CAS Google Scholar
Chowdhury S, Pozzer A, Haines A et al (2022) Global health burden of ambient PM_2.5 and the contribution of anthropogenic black carbon and organic aerosols. Environ Int 159:107020. https://doi.org/10.1016/j.envint.2021.107020
Dey S, Purohit B, Balyan P et al (2020) A satellite-based high-resolution (1-km) ambient PM_2.5 database for India over two decades (2000–2019): applications for air quality management. Remote Sens 12:1–22. https://doi.org/10.3390/rs12233872
Article Google Scholar
Griffin D, Zhao X, McLinden CA et al (2019) High-resolution mapping of nitrogen dioxide with TROPOMI: first results and validation over the Canadian oil sands. Geophys Res Lett 46:1049–1060. https://doi.org/10.1029/2018GL081095
Article CAS Google Scholar
Lelieveld J, Klingmüller K, Pozzer A et al (2019) Effects of fossil fuel and total anthropogenic emission removal on public health and climate. Proc Natl Acad Sci USA 116:7192–7197. https://doi.org/10.1073/pnas.1819989116
Article CAS Google Scholar
Lelieveld J, Pozzer A, Pöschl U et al (2020) Loss of life expectancy from air pollution compared to other risk factors: a worldwide perspective. Cardiovasc Res 116:1910–1917. https://doi.org/10.1093/cvr/cvaa025
Article CAS Google Scholar
Murray CJL, Aravkin AY, Zheng P et al (2020) Global burden of 87 risk factors in 204 countries and territories, 1990–2019: a systematic analysis for the global burden of disease study 2019. Lancet 396:1223–1249.https://doi.org/10.1016/S0140-6736(20)30752-2
Van Donkelaar A, Hammer MS, Bindle L et al (2021) Monthly global estimates of fine particulate matter and their uncertainty. Environ Sci Technol 55:15287–15300. https://doi.org/10.1021/acs.est.1c05309
Article CAS Google Scholar
Venter ZS, Aunan K, Chowdhury S, Lelieveld J (2020) COVID-19 lockdowns cause global air pollution declines. Proc Natl Acad Sci USA 117:18984–18990. https://doi.org/10.1073/pnas.2006853117
Article CAS Google Scholar
Venter ZS, Aunan K, Chowdhury S, Lelieveld J (2021) Air pollution declines during COVID-19 lockdowns mitigate the global health burden. Environ Res 192:110403. https://doi.org/10.1016/j.envres.2020.110403
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Norwegian Institute for Nature Research, Trondheim, Norway
Zander S Venter
Center for International Climate and Environmental Research (CICERO), University of Oslo, 0349, Oslo, Norway
Sourangsu Chowdhury

Authors

Zander S Venter
View author publications
You can also search for this author in PubMed Google Scholar
Sourangsu Chowdhury
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zander S Venter .

Editor information

Editors and Affiliations

Department of Natural Resource Sciences, McGill University, Sainte-Anne-De-Bellevue, QC, Canada
Jeffrey A. Cardille
Department of Natural Resource Sciences, McGill University, Ste. Anne de Bellevue, QC, Canada
Morgan A. Crowley
University of San Francisco, San Francisco, CA, USA
David Saah
Google LLC, Mountain View, CA, USA
Nicholas E. Clinton

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Venter, Z.S., Chowdhury, S. (2024). Air Pollution and Population Exposure. In: Cardille, J.A., Crowley, M.A., Saah, D., Clinton, N.E. (eds) Cloud-Based Remote Sensing with Google Earth Engine. Springer, Cham. https://doi.org/10.1007/978-3-031-26588-4_35

Download citation

DOI: https://doi.org/10.1007/978-3-031-26588-4_35
Published: 02 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-26587-7
Online ISBN: 978-3-031-26588-4
eBook Packages: Chemistry and Materials ScienceChemistry and Material Science (R0)

Publish with us

Policies and ethics