Neighborhood Emission Mapping Operation (NEMO): A 1-km anthropogenic emission dataset in the United States

Ma, Siqi; Tong, Daniel Q.

doi:10.1038/s41597-022-01790-9

Neighborhood Emission Mapping Operation (NEMO): A 1-km anthropogenic emission dataset in the United States

Data Descriptor
Open access
Published: 09 November 2022

Volume 9, article number 680, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Data

Neighborhood Emission Mapping Operation (NEMO): A 1-km anthropogenic emission dataset in the United States

Download PDF

3133 Accesses
6 Citations
Explore all metrics

Abstract

We present an unprecedented effort to map anthropogenic emissions of air pollutants at 1 km spatial resolution in the contiguous United States (CONUS). This new dataset, Neighborhood Emission Mapping Operation (NEMO), is produced at hourly intervals based on the United States Environmental Protection Agency (US EPA) National Emission Inventories 2017. Fine-scale spatial allocation was achieved through distributing the emission sources using 108 spatial surrogates, factors representing the portion of a source in each 1 km grid. Gaseous and particulate pollutants are speciated into model species for the Carbon Bond 6 chemical mechanism. All sources are grouped in 9 sectors and stored in NetCDF format for air quality models, and in shapefile format for GIS users and air quality managers. This dataset shows good consistency with the USEPA benchmark dataset, with a monthly difference in emissions less than 0.03% for any sector. NEMO provides the first 1 km mapping of air pollution over the CONUS, enabling new applications such as fine-scale air quality modeling, air pollution exposure assessment, and environmental justice studies.

Measurement(s)	anthropogenic emission in United States
Technology Type(s)	Census survey and computer models

A High-Resolution National Emission Inventory and Dispersion Modelling—Is Population Density a Sufficient Proxy Variable?

Informing urban climate planning with high resolution data: the Hestia fossil fuel CO2 emissions for Baltimore, Maryland

Article Open access 14 October 2020

Multi-level policies for air quality: implications of national and sub-national emission reductions on population exposure

Article Open access 06 September 2018

Background & Summary

Emission, or the release of gases and particles from the Earth’s surface into the atmosphere, is the starting point of many Earth system processes responsible for some of the greatest environmental challenges today, such as air pollution, acid deposition, and climate change^1,2,3. The World Health Organization (WHO) estimates that exposure to ambient air pollution has been associated with 7 million premature deaths per annum, making it the single largest environmental risk today⁴. In the United States, over one third of the population lives in areas not attaining the health-based National Ambient Air Quality Standards (NAAQS) for ozone (O₃) and/or fine particulate matter (PM_2.5)⁵. Air quality and public health managers have an important task to protect public health by alerting the population when forecasts predict the exceedance of the NAAQS, which critically depends on the accurate prediction of the timing, location, and severity of unhealthy air quality episodes^6,7.

Air quality models used for forecasting and policy studies rely on detailed mapping of emission sources to predict spatio-temporal variations of air pollution. Air pollutants can be directly emitted (primary) or formed in the atmosphere through chemical/physical processes (secondary)⁸. While PM_2.5 can have both primary and secondary origins, O₃ is mostly formed through photochemical reactions in the troposphere^9,10. Consequently, the collocation of emitted pollutants and their precursors, that affects the chemical transformations, localized dispersion and deposition, is a major factor controlling the variability of concerned atmospheric constituents^11,12,13. Better spatially resolved emission data allows further improvement of atmospheric composition prediction for air quality early warning and management^14,15,16. Chemical transport models can provide the knowledge of vertical atmospheric constituents as priori for the retrievals of satellite products^17,18,19,20. As technology advances, the instruments are able to observe at higher spatial resolution, requiring a priori information at a finer resolution as well. Similarly, fine resolution emission and concentration data can provide new insight into population exposure to air pollution^21,22,23.

Numerous approaches have been utilized to map greenhouse gases and air pollutants at high spatial resolution. A global CO₂ emission dataset with 1 km resolution was developed by using the satellite observed nighttime lights²⁴, and a global ammonia emission inventory with 0.1° resolutions was created with updated emission factors and data products²⁵. Within the US, the DAtabase of Road Transportation Emissions (DARTE) provides annual emissions of on-road CO₂ over the contiguous United State (CONUS) at 1 km resolution based on the roadway-level and emission factors²⁶. The Vulcan v3.0 CO₂ emissions data was generated at 1 km and hourly resolution which includes various anthropogenic source sectors²⁷. Additionally, a sub-neighborhood (~100 m) surface NO₂ dataset over the CONUS was presented using land-use regression (LUR) models along with observational and modeling data^28,29. These datasets usually focus on a single emission species, and are often inadequate for some applications, such as air quality modeling. A high-resolution emission dataset that contains all co-emitted major air pollutants is desirable to support various applications in air quality modeling, public health, and environmental management.

In this study, we present a new high‐resolution anthropogenic emission dataset, called the Neighborhood Emission Mapping Operation (NEMO), that maps all major sources in the CONUS. It includes the emissions from nine sectors, accounting for 854 individual source types, based on the 2017 National Emissions Inventory (NEI). The emission data are mapped at 1 km spatial gridding and at hourly intervals. These air pollutants are further split into chemical species consistent with the Carbon Bond 6 (CB6) chemical mechanism, so that the data can be used to drive air quality models such as the Community Multiscale Air Quality (CMAQ) model³⁰ and the Weather Research and Forecast with chemistry (WRF-chem) model. The data are available in the NetCDF format, and annual data are also provided in the shapefile format for VOCs, NO_x, CO, SO₂, NH_3, and PM_2.5. In addition, a web-based data portal has been set up to provide online emission data services for interested users. This, to our knowledge, is the first effort to map all major air pollutants at 1 km resolution for the entire CONUS. The dataset, along with the data access, is expected to enable new applications such as fine-scale air quality modeling, air pollution exposure assessment, and environmental justice studies.

Methods

The anthropogenic emissions in this dataset are generated based on the 2017 National Emissions Inventory (NEI2017) from the US Environmental Protection Agency (US EPA). Since the NEI only provides aggregate emissions for each county, four steps were taken to generate the high-resolution emission dataset, including 1) spatial allocation; 2) chemical speciation; 3) temporal allocation, and 4) merging. These four steps are implemented by the Sparse Matrix Operator Kernel Emissions (SMOKE) model³¹, and the configuration files (usually called profiles) used for the allocation and speciation in SMOKE can be generated with various tools or provided by the ancillary datasets from US EPA. The information of the emission inventory and other data and tools is described as follows.

Base emission inventories

The NEI2017 (version 2017gb) compiled by US EPA is used to develop the high-resolution emission dataset. There are hundreds of individual emission sources in NEIs, which are grouped into nine emission sectors, including six nonpoint sectors, two mobile sectors, and the point sector (Table 1). Each emission source is identified by a unique source classification code (SCC). Except the point source, all sources are provided at county level. For each county, the NEI lists the annual amounts of emitted air pollutants, including fine and coarse particulate matter (PM_2.5 and PM₁₀), nitrogen oxides (NO_x), carbon monoxide (CO), sulfur dioxide (SO₂), ammonia (NH₃), and volatile organic compounds (VOCs). Point sources are represented as the individual facilities (energy, industrial, and manufacturing facilities), usually at specific latitude/longitude coordinates, rather than as county or tribal aggregates. In NEI2017, all point sectors are treated as elevated sources, so in this dataset we only consider the airports sector, which has surface level of emissions and can be processed into two-dimension gridded files. The Motor Vehicle Emissions Simulator (MOVES) version 2014b generates county-level emission factors from on-road mobile sources, which include monthly county-level emissions from motorized vehicles that are normally operated on public roadways. In addition, emissions from nonroad sources, such as nonroad engines and equipment, construction equipment, and agricultural engines, are also calculated by the nonroad component of the EPA’s MOVES model (MOVES-Nonroad). For the estimated emission records, quality assurance (QA) has been implemented and reviewed by EPA and state, local, and tribal agencies. Detailed information about the emission inventory is provided in the NEI2017 Technical Support Document (TSD)³² and all the inventory files, as well as the emission processing platform, can be downloaded from EPA FTP³³.

Table 1 Overview of NEI2017 inventory used for the emission processing.

Full size table

Chemical speciation

Some of the pollutants (namely NO_x, VOCs, PM_2.5 and PM₁₀) in the emission inventory cannot be directly used by chemical transport models, unless distributed into model species of a specific chemical mechanism. The model species can be individual chemical compounds (explicit species) or groups of species (lumped species). In the NEI2017, we use the Carbon Bond 6 (CB6) chemical mechanism³⁴ to split gaseous pollutants (NOx and VOCs), and the Aerosol 7 (AERO7) aerosol mechanism³⁵ to split particulate pollutants (PM_2.5 and PM₁₀) into required model species.

Chemical speciation of the pollutants is achieved using detailed chemical profiles that allocate an aggregate pollutant to required model species. For VOCs, the speciation profiles generally have two types, “CRITERIA” and “INTEGRATE”. “CRITERIA” means all model species are speciated from the total VOC emissions in NEI. This VOC speciation approach is applied to point sources and several area sources that are not included in the Hazardous Air Pollution (HAP) inventory. The other VOC speciation approach, called Integration or “INTEGRATE”, is used for onroad, offroad and some area source sectors. This approach aims to integrate two NEIs, NEI2017 and HAP NEI, for select VOC HAPs. For these HAPs, the HAP NEI is generally considered a better data source than speciated VOC in NEI2017. Five VOC HAPs, including naphthalene (NAPH), benzene (BENZ), acetaldehyde (ALD2), formaldehyde (FORM) and methanol (MEOH) collectively called NBAFM, are explicitly represented in the CB6 chemical mechanism. The “INTEGRATE” profiles are used to subtract NBAFM from the total VOC during the speciation processes to avoid double counting emissions. For instance, in the airports sector, the NEI2017 provides the total VOC emission named as “VOC” and no integration is needed for the chemical speciation. All the model species are speciated from the “VOC” in the NEI2017. In contrast, the onroad and offroad emission inventory provides specific emissions for HAP species (i.e., NBAFM) and the VOC emissions that exclude those species. Therefore, these HAP will be removed from the criteria VOC mass, and the profiles are generated by removing the specified HAP species from the “CRITERIA” profiles, and then renormalizing. Detailed information of the use of HAP along with NEI VOC, called “HAP-CAP integration”, and the integration status for each emission sector can be found in the Table 3-4 of the TSD for the 2016 NEI Collaborative³⁶.

The speciation profiles for most emission sectors can be created by the Speciation Tools³⁷ on the basis of SPECIATE database³⁸ which is developed and maintained by the Office of Research and Development (ORD) of US EPA. The only exception is that the speciation profiles of the mobile sources (on-road and non-road sectors, other than for California) are generated by the Motor Vehicle Emissions Simulator (MOVES)³⁹. Similar to the VOC, the speciation information of PM is also supported by the SPECIATE and can be generated using the Speciation Tools and MOVES. For NO_x, the speciation is based on a NO₂ weight factors, speciating total NO_x into NO, NO₂, and/or HONO. The speciate profiles for different emission sources and locations are differentiated by the SCC and county/state, managed through a cross-reference file that links SCC for each county/state to a specific speciation profile. In NEI2017, the speciation profiles for the CB6 mechanism are already prepared by EPA, which are created based on the SPECIATE5.0 database³³.

Temporal allocation

NEI provides annual totals but models require the information of finer temporal variations (monthly, weekly, daily and hourly). Distributing aggregated emissions to a finer (hourly) temporal resolution to meet the model requirement is realized by the temporal allocation process. For the source sectors with annual emission records (Table 1), three temporal allocation profiles (annual-to-month, month-to-day, and diurnal) are applied. For the sectors with monthly emission records, the annual-to-month allocation will not be used. The temporal allocations are also based on the profile files which are obtained in several ways. The temporal profiles of most sectors are created based on the operational data from different agencies/industries, such as the Federal Aviation Administration (FAA) operations and performance data for airports sector and Association American Railroads (AAR) Rail Traffic data for rail sector. For some sectors, the temporal variations of the emissions are also controlled by meteorological conditions. Therefore, the meteorology-based temporal profiles are developed using a tool called “gentpro” using the weather data. These weather-adjusted profiles are applied to three sectors: anthropogenic fugitive dust, residential wood combustion, and agriculture. The temporal allocation of on-road sources is based on a combination of traditional temporal profiles and the influence of meteorology. The on-road inventory used in this study is in the Flat File 2010 (FF10) format processed from the MOVES outputs; therefore, the temporal profiles for this format are derived from MOVES and supported in the platform³³. The temporal profiles for each source and county/state are assigned using a cross-reference file that links Federal Information Processing System (FIPS) code/SCC/pollutant to different monthly/weekly/diurnal temporal profiles.

Spatial distribution

A major challenge to develop a neighborhood level emission dataset is how to spatially distribute the county-level emission aggregate from NEI into locations at finer scale. In this study, county-level emissions from nonpoint and mobile sources are spread among the grid cells intersecting the county by using spatial distribution profiles (namely spatial surrogates). A spatial surrogate ratio is a value greater than zero and less than or equal to unity that specifies the fraction of the emissions in an area (usually a county) that should be allocated to a particular model grid cell (a 1 km² square in this case). As the area of a given county may fall into several grid cells, spatial surrogates need to be used to indicate the fraction of the county’s emissions assigned to each grid cell. These surrogates are created based on geographic information systems (GIS) shapefiles which include the geographic information, such as population/housing, roadways, and land cover (Supplementary Table 1) which act as weight factors when calculating different types of surrogate ratios. A spatial surrogate ratio file includes the grid description, surrogate code, FIPS, column/row number of the model grid, and spatial surrogate ratio (spatial factor).

In this study, the spatial surrogates for the 1 km × 1 km grids were generated using a surrogate generating tool Spatial Allocator (SA) coupled with the PostgreSQL database management system. The SA, developed by the University of North Carolina Community Modeling and Analysis System (CMAS), is a suite of tools to create input files for weather and air quality models. More specifically, the surrogate tools of SA were used to create a large set of spatial surrogates, and to merge and gap-fill these surrogates when necessary. The source code and scripts, as well as detail documentation of the SA tools can be downloaded from the CMAS center⁴⁰. The procedures can be summarized in five steps: (1) Install the Spatial Allocator⁴¹ along with PostgreSQL software, and collect shapefile data from the EPA⁴² or commercial vendors; (2) Activate PostgreSQL server, create a database and load the shapefile data into database; (3) Generate a table representing the modeling grid in the database; (4) Generate surrogate files using SA tools; (5) Gap-filling, normalization, and quality assurance. For the contiguous United States (CONUS), a total of 108 spatial surrogates were prepared, including 12 U.S. census-based surrogates, 24 transportation surrogates (roadways, railways, bus terminals and idling), 17 landcover surrogates, 20 surrogates for building footprints, 23 surrogates that describe oil and gas well production, 6 surrogates for shipping and ports, and 6 for other industrial and commercial activities like refineries and tank farms, airports, golf courses, mines, and timber. The surrogate information and relevant shapefile data used for our dataset are provided in Supplementary Table 1.

Generating 1 km emission dataset

With the base emission inventories, chemical speciation, temporal profiles and spatial surrogate ratios, we generate the 1 km emission dataset using the SMOKE model version 4.7 for all nine anthropogenic emission sectors. This process takes four steps. First, the chemical profiles are used to speciate NO_x, VOCs, PM_2.5 and PM₁₀ into required chemical species for each source/location. Next, all emission records are distributed to 1-hour intervals from the 2017 annual or monthly total emissions using SCC‐specific temporal profiles. Third, the spatial surrogate ratios are used to distribute county-level emissions into 1 km × 1 km grids. Finally, all gridding, speciation, and temporal matrices are combined to create model-ready emission data at 1 km horizontal resolution and hourly intervals in the netCDF format.

For each of the emission sectors, the above processes are repeated, so that the combined datasets are generated for each sector. The gridded emission will be stored by sectors and can be merged using a SMOKE tool (mrggrid) as needed, depending on the needs of the model simulation. The flow chart in Fig. 1 depicts the steps for generating the emission data and Table 1 shows the emission sectors that this dataset includes. For the all-sector merged emission data, we also convert the data into the Shapefile format, so that users may be able to visualize the data along with other maps (such as highways and street maps).

Data Records

Table 2 summarizes the information of the generated 1 km emission dataset. This emission dataset is stored in two formats: NetCDF for modeling and analysis, and Shapefile for use with GIS software. Both formats have the same emission sectors with 1 km² resolution. The NetCDF format contains hourly, monthly, and annual data while the Shapefiles only include annual emissions. Additionally, the NetCDF provides the model species for CB6 mechanism in the hourly and monthly data files while the shapefiles include integrated species like VOCs, NO_x, PM_2.5, and three inorganic gases, SO₂, CO, and NH₃. Figures 2 and 3 shows the example of the annual emission distributions of VOC and PM_2.5, along with their frequency diagrams, as well as the diurnal variations and the proportions of each speciated model species from VOC and PM_2.5. The datafiles of monthly and annual emissions that are available on figshare⁴³, while the hourly emission data are stored on our data server at George Mason University⁴⁴ because of the large file sizes.

Table 2 Information of NEMO emission dataset.

Full size table

Technical Validation

Comparison with the EPA benchmark dataset

Here we compared the NEMO dataset against the 12 km × 12 km emission, generated using the spatial surrogates provided by US EPA in NEI2017, a benchmark dataset widely used in research and regulatory modeling. Figure 4 depicts the monthly emissions over CONUS of the NEMO dataset and the differences with those of 12 km. We found that the 1 km × 1 km emissions of each variable are almost identical to those of the benchmark dataset, although slightly lower (<−0.02%) than the latter. The differences between 1 km and 12 km datasets are more significant during summertime when the monthly emissions are higher than in other seasons (Fig. 4a). Figure 4 also shows the percentage differences of particulate matter are usually higher than those of the gases and the largest difference appears in black carbon (PEC) of July with a value of −0.02%. The sector-specific emissions in Fig. 4c,d show that most variables in the nonroad sector and particulate matters in the anthropogenic fugitive dust sector have larger underestimations. The difference in the emissions of other sectors are between 0.001% and 0.01%. In general, our dataset is consistent with the benchmark emissions.

NO_x emissions over five large cities

Next, we compare the NO_x emissions over five metropolitan areas to that of the benchmark dataset. NO_x is a key precursor to tropospheric ozone and particulate nitrate. Figure 5 shows the annual emissions of NO_x from 12 km and 1 km dataset. We overlay the emission map with geographical information including roads, airports, ferries, and main cities as a measure to validate the accuracy of the spatial allocation. The results show that the NEMO dataset can capture high emissions in urban areas that follow the benchmark pattern. The 1 km distribution can also reflect the fine features of emissions over highways and other major roads. At airports, ultra-high NO_x emissions are shown at corresponding locations. In addition, the 1 km distributions create much clearer coast-pattern emissions over cities like New York City and Los Angeles compared to the benchmark. These results show that the spatial distribution of the 1 km emission dataset is more consistent with the geographical features in the real world. The increase of resolution (144 times finer than the benchmark) in comparison to the 12 km product provides the desirable information to map air pollutant emissions at neighborhood level.

Usage Notes

The NEMO data are available in the NetCDF format at hourly, monthly and annual intervals. The shapefile format of NEMO is only available for the annual aggregated emissions, although finer temporal resolution can be generated from the NetCDF files. Each hourly emission file includes 5397 columns, 3177 rows, 35 gas species, 20 aerosol species, and 25 time steps which needs a longer time for processing. We recommend using double precision for data analysis and processing. For convenience, we also provide a web-based data portal⁴⁵ to prepare anthropogenic emissions within the CONUS domain according to the user’s requirements.

Code availability

Code used for calculating monthly and annual emission is written in Fortran and available from Zenodo⁴⁶. The Spatial Allocator version 4.4 and SMOKE version 4.7 are used for data processing which can be obtained from CMAS webpage⁴⁰.

References

Grennfelt, P. et al. Acid rain and air pollution: 50 years of progress in environmental science and policy. Ambio 49, 849–864 (2020).
Article CAS PubMed Google Scholar
Reis, S. et al. From acid rain to climate change. Science (80-.). 338, 1153–1154 (2012).
Article ADS CAS Google Scholar
Carmichael, G. R. et al. Changing trends in sulfur emissions in Asia: implications for acid deposition, air pollution, and climate. (2002).
World Health Organization. World health statistics 2022: monitoring health for the SDGs, sustainable development goals. (2022).
U.S. EPA. Summary Nonattainment Area Population Exposure Report. https://www3.epa.gov/airquality/greenbook/popexp.html (2022).
Oliveri Conti, G., Heibati, B., Kloog, I., Fiore, M. & Ferrante, M. A review of AirQ Models and their applications for forecasting the air pollution health outcomes. Environ. Sci. Pollut. Res. 24, 6426–6445 (2017).
Article Google Scholar
Tong, D. & Tang, Y. Advancing Air Quality Forecasting to Protect Human Health. https://pubs.awma.org/flip/EM-Oct-2018/tong.pdf (2018).
Sitaras, I. E. & Siskos, P. A. The role of primary and secondary air pollutants in atmospheric pollution: Athens urban area as a case study. Environ. Chem. Lett. 6, 59–69 (2008).
Article CAS Google Scholar
National Research Council. Rethinking the ozone problem in urban and regional air pollution. National Academies Press (1992).
Gelencsér, A. et al. Source apportionment of PM_2.5 organic aerosol over Europe: Primary/secondary, natural/anthropogenic, and fossil/biogenic origin. J. Geophys. Res. Atmos. 112 (2007).
Singh, H. B. et al. Pollution influences on atmospheric composition and chemistry at high northern latitudes: Boreal and California forest fire emissions. Atmos. Environ. 44, 4553–4564 (2010).
Article ADS CAS Google Scholar
Naik, V. et al. Impact of preindustrial to present‐day changes in short‐lived pollutant emissions on atmospheric composition and climate forcing. J. Geophys. Res. Atmos. 118, 8086–8110 (2013).
Article ADS CAS Google Scholar
Blanchard, C. L., Shaw, S. L., Edgerton, E. S. & Schwab, J. J. Emission influences on air pollutant concentrations in New York state: II. PM2. 5 organic and elemental carbon constituents. Atmos. Environ. X 3, 100039 (2019).
CAS Google Scholar
Dommen, J. et al. High-resolution emission inventory of the Lombardy region: Development and comparison with measurements. Atmos. Environ. 37, 4149–4161 (2003).
Article ADS CAS Google Scholar
Zhou, Y. et al. Development of a high-resolution emission inventory and its evaluation and application through air quality modeling for Jiangsu Province, China. Atmos. Chem. Phys. 17, 211–233 (2017).
Article ADS CAS Google Scholar
Liu, X., Yan, F., Hua, H. & Yuan, Z. Identifying hotspots based on high-resolution emission inventory of volatile organic compounds: A case study in China. J. Environ. Manage. 288, 112419 (2021).
Article CAS PubMed Google Scholar
Holloway, T. et al. Satellite monitoring for air quality and health. Annu. Rev. Biomed. Data Sci. 417–447 (2021).
Lamsal, L. N. et al. Ground‐level nitrogen dioxide concentrations inferred from the satellite‐borne Ozone Monitoring Instrument. J. Geophys. Res. Atmos. 113 (2008).
Wang, J., Xu, X., Spurr, R., Wang, Y. & Drury, E. Improved algorithm for MODIS satellite retrievals of aerosol optical thickness over land in dusty atmosphere: Implications for air quality monitoring in China. Remote Sens. Environ. 114, 2575–2583 (2010).
Article ADS Google Scholar
Choi, S. et al. Assessment of NO 2 observations during DISCOVER-AQ and KORUS-AQ field campaigns. Atmos. Meas. Tech. 13, 2523–2546 (2020).
Article CAS Google Scholar
Wang, R. et al. Exposure to ambient black carbon derived from a unique inventory and high-resolution model. Proc. Natl. Acad. Sci. 111, 2459–2463 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Cai, B. et al. China high resolution emission database (CHRED) with point emission sources, gridded emission data, and supplementary socioeconomic data. Resour. Conserv. Recycl. 129, 232–239 (2018).
Article Google Scholar
Huang, K. et al. Estimating daily PM_2.5 concentrations in New York City at the neighborhood-scale: Implications for integrating non-regulatory measurements. Sci. Total Environ. 697, 134094 (2019).
Article ADS CAS PubMed Google Scholar
Oda, T. & Maksyutov, S. A very high-resolution (1 km× 1 km) global fossil fuel CO₂ emission inventory derived using a point source database and satellite observations of nighttime lights. Atmos. Chem. Phys. 11, 543–556 (2011).
Article ADS CAS Google Scholar
Meng, W. et al. Improvement of a global high-resolution ammonia emission inventory for combustion and industrial sources with new data from the residential and transportation sectors. Environ. Sci. Technol. 51, 2821–2829 (2017).
Article ADS CAS PubMed Google Scholar
Gately, C., Hutyra, L. R. & Wing, I. S. DARTE annual on-road CO₂ emissions on a 1-km grid, conterminous USA, V2, 1980–2017. ORNL DAAC (2019).
Gurney, K. R. et al. The Vulcan version 3.0 high‐resolution fossil fuel CO₂ emissions for the United States. J. Geophys. Res. Atmos. 125, e2020JD032974 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Novotny, E. V., Bechle, M. J., Millet, D. B. & Marshall, J. D. National satellite-based land-use regression: NO2 in the United States. Environ. Sci. Technol. 45, 4407–4414 (2011).
Article ADS CAS PubMed Google Scholar
Bechle, M. J., Millet, D. B. & Marshall, J. D. National spatiotemporal exposure surface for NO₂: monthly scaling of a satellite-derived land-use regression, 2000–2010. Environ. Sci. Technol. 49, 12297–12305 (2015).
Article ADS CAS PubMed Google Scholar
Byun, D. & Schere, K. L. Review of the governing equations, computational algorithms, and other components of the Models-3 Community Multiscale Air Quality (CMAQ) modeling system. (2006).
Houyoux, M., Vukovich, J., Brandmeyer, J. E., Seppanen, C. & Holland, A. Sparse matrix operator kernel emissions modeling system-SMOKE User manual. Prep. by MCNC-North Carolina Supercomput. Center, Environ. Programs, Res. Triangle Park. NC (2000).
U.S. EPA. 2017 National Emissions Inventory: January 2021 Updated Release, Technical Support Document. https://www.epa.gov/sites/default/files/2021-02/documents/nei2017_tsd_full_jan2021.pdf (2021).
U.S. EPA. 2017 National Emissions Inventory (NEI) Data. https://gaftp.epa.gov/air/emismod/2017 (2021).
Yarwood, G., Whitten, G., Jung, J., Heo J. & Allen, D. Final Report Development, Evaluation and Testing of Version 6 of the Carbon Bond Chemical Mechanism (CB6). Work Order No. 582-7-84005-FY10-26 (Texas Commission on Environmental Quality, 2010).
Pye, H. O. T. Overview of CMAQ–AERO7. https://github.com/USEPA/CMAQ/blob/5.3/DOCS/Release_Notes/aero7_overview.md (2021).
Eyth, A., Vukovich, J. & Farkas, C. Technical Support Document (TSD) Preparation of Emissions Inventories for the 2016v1 North American Emissions Modeling Platform. https://www.epa.gov/sites/default/files/2021-03/documents/preparation_of_emissions_inventories_for_2016v1_north_american_emissions_modeling_platform_tsd.pdf (2021).
Shah, T., Shi, Y., Beardsley, R. & Yarwood, G. Speciation Tool User’s Guide Version 5.0. https://www.cmascenter.org/speciation_tool/documentation/5.0/Ramboll_sptool_users_guide_V5.pdf (2020).
Simon, H. et al. The development and uses of EPA’s SPECIATE database. Atmos. Pollut. Res. 1, 196–206 (2010).
Article CAS Google Scholar
U.S. EPA. Exhaust emission rates for heavy-duty on-road vehicles in MOVES2014. https://cfpub.epa.gov/si/si_public_file_download.cfm?p_download_id=525695 (2015).
CMAS. CMAS Software. CMAS Download Center https://www.cmascenter.org/download.cfm (2022).
CMAS. Spatial Allocator User’s Manual. https://github.com/CMASCenter/Spatial-Allocator/blob/master/docs/User_Manual/README.md (2018).
U. S. EPA. Spatial Surrogates Shapefiles Data. https://gaftp.epa.gov/air/emismod/2016/alpha/spatial_surrogates/shapefiles (2021).
Ma, S. & Tong, D. Neighborhood Emission Mapping Operation (NEMO): A 1-km anthropogenic emission dataset in the United States, figshare, https://doi.org/10.6084/m9.figshare.c.6141735 (2022).
Ma, S. & Tong, D. Neighborhood Emission Mapping Operation (NEMO): A 1-km anthropogenic emission dataset in the United States, Data server at George Mason University, http://air.csiss.gmu.edu/aq/NEMO (2022).
Ma, S. & Tong, D. Emission data portal for NEMO. Data server at George Mason University, www.emissionnow.org (2022).
Ma, S. & Tong, D. Neighborhood Emission Mapping Operation (NEMO): A 1-km anthropogenic emission dataset in the United States. Zenodo https://doi.org/10.5281/zenodo.7076321 (2022).

Download references

Acknowledgements

This work is partially supported by NOAA Climate Program Office and NASA Health and Air Quality Program. We thank the US EPA for providing the base emissions inventories and ancillary files used in this study. The authors are grateful to Bok Haeng Baek, Dongmei Yang, and Charles Chang for insightful discussion and assistance with preparing the data/tools and the website.

Author information

Authors and Affiliations

Department of Atmospheric, Oceanic and Earth Sciences, George Mason University, Fairfax, VA, 22030, USA
Siqi Ma & Daniel Q. Tong

Authors

Siqi Ma
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Q. Tong
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.M. collected the data, generated the dataset, and conducted the analysis. D.T. conceptualized the study, guided the research, and helped with emission modeling and analysis. Both S.M. and D.T. wrote and revised the manuscript.

Corresponding authors

Correspondence to Siqi Ma or Daniel Q. Tong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

SUPPLEMENTARY INFORMATION

Supplementary Information Table 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ma, S., Tong, D.Q. Neighborhood Emission Mapping Operation (NEMO): A 1-km anthropogenic emission dataset in the United States. Sci Data 9, 680 (2022). https://doi.org/10.1038/s41597-022-01790-9

Download citation

Received: 12 August 2022
Accepted: 18 October 2022
Published: 09 November 2022
DOI: https://doi.org/10.1038/s41597-022-01790-9
Springer Nature Limited

This article is cited by

Healthy Cities, A comprehensive dataset for environmental determinants of health in England cities
- Zhenyu Han
- Tong Xia
- Yong Li
Scientific Data (2023)

Neighborhood Emission Mapping Operation (NEMO): A 1-km anthropogenic emission dataset in the United States

Abstract

Similar content being viewed by others

A High-Resolution National Emission Inventory and Dispersion Modelling—Is Population Density a Sufficient Proxy Variable?

Informing urban climate planning with high resolution data: the Hestia fossil fuel CO2 emissions for Baltimore, Maryland

Multi-level policies for air quality: implications of national and sub-national emission reductions on population exposure

Background & Summary