1 Background

In the research of identifying the impact of climate change on the industrial economic system, the core step is to overlay climate data and industrial economic data with the same spatiotemporal resolution and perform spatial analysis (Zhao et al. 2017). However, the risk assessment of the industrial economic system is hampered by the lack of spatialized datasets of global and Chinese industrial economic system output value, especially under future climate scenarios, because it is difficult to accurately identify the output value of secondary and tertiary industries by conventional remote sensing methods. The existing spatial data of industrial economic system output value are mostly at the provincial, city, and county levels, with administrative areas as the smallest spatial units, which cannot represent the difference and spatial distribution of industrial output value within a province or a city. Therefore, it is difficult to carry out overlay analysis with gridded climate data in risk assessment. Although there are some spatialized data on the output value of a particular industry from research (Dong et al. 2016), in general, there is a lack of large-scale, high-resolution, and comprehensive spatial data of industrial output value.

Currently, the research methods of mapping economic data can be divided into three categories: spatial interpolation model (Tobler 1979), multi-source data fusion model (Li et al. 2018), and remote sensing inverse model (Wang et al. 2018). Compared with other spatial models, the nighttime light remote sensing data inversion model is characterized by simple implementation and high precision and is expected to solve the problem of large-scale data localization. However, the model is mainly used to analyze gross domestic product (GDP) and population data. Based on the existing applications, this study developed a method to spatialize the industrial output on a global scale, and the random forest algorithm in machine learning was used to map the industrial value added under different climate change scenarios in the future.

The definition of industrial value added is based on the World Bank’s statistical standards. The industrial value added covers mining, manufacturing, construction, electricity, water, and gas sectors. Industrial value added statistics data are from the World Bank, and the data are in current U.S. dollars.

2 Method

The method for mapping industrial value added for future climate scenarios includes the following steps: (1) Mapping the current industrial value added; (2) Simulating the spatial boundary of future industrial value added; and (3) Estimating the future industrial value added. Figure 1 shows the technical flowchart for mapping industrial value added.

Fig. 1
figure 1

Technical flowchart of mapping global industrial value added

2.1 Mapping the Current Industrial Value Added

Based on the 2010 global vegetation index data (Enhanced Vegetation Index, EVI, from MODIS) and the 2010 nighttime light remote sensing data (from DMSP/OLS), the adjustment nighttime light index (ANLTI) was constructed to preprocess the light saturation and overflow phenomena of nighttime light data, and the best light data in the world was obtained. The calculation formula is as follows (Zhuo et al. 2015):

$$ {\text{ANLTI}} = \frac{2}{{1 - {\text{NT}}_{\text{n}} + {\text{EVI}}}} \times {\text{NT}} $$
(1)

ANLTI is the EVI adjusted nighttime light index, NTn is the normalized nighttime light value, and NT is the original nighttime light value.

A regression model of the industrial value added was constructed using the adjusted nighttime light index (ANLTI) and industrial value added statistical data of countries from the World Bank. The data of global industrial value added in 2010 is generated using the following formula:

$$ I = \frac{{I_{i} \times {\text{ANLTI}}}}{{{\text{ANLTI}}_{i} }} $$
(2)

I is the industrial value added of each pixel. Ii is the total industrial added value of the country in which the pixel is located, and ANLTIi is the total EVI adjusted nighttime light index of the country in which the pixel is located.

In the global scope, statistical industrial value added in 178 provincial (state) regions were randomly selected for the correlation test, and the correlation coefficient is 0.93. Taking statistical data as the true values, the average accuracy of industrial value added in the 178 regions is 80.14% (Xue et al. 2018).

2.2 Simulating the Spatial Boundary of Future Industrial Value Added

Future industrial value added changes can be approximated by starting with industrial land use change. According to the principle of logistic–cellular automata (CA)–Markov simulation, the global land use data from ESA (European Space Agency, https://maps.elie.ucl.ac.be/CCI/viewer/index.php) in 2010 and 2015 were used to simulate the land use change in 2030 and 2050. The urban land was extracted as the spatial boundary of the future industrial added value. In the selection of driving factors, it is necessary to comprehensively describe the impact of different driving factors on land use change, and consider the research scale. Elevation, slope, population, GDP, and distance from river and road are selected as driving factors of land use. The data are from NOAA (National Oceanic and Atmospheric Administration, https://www.noaa.gov/), SEDA (Socioeconomic Data and Applications Center, https://sedac.ciesin.columbia.edu/), Global Risk Data Platform (https://preview.grid.unep.ch/), and Natural Earth (https://www.naturalearthdata.com/). The accuracy of the simulated land use in 2015 is verified by using the global land use data in 2015. The global average accuracy is 91.89%.

The logistic–CA–Markov model is mainly used in the simulation of land use change by combining the characteristics of the logistic regression model, the CA model, and the Markov model (Jamal et al. 2013). The logistic regression model can analyze the relationship between land use types and driving forces. The CA model can effectively simulate the spatial changes of the land use system and the Markov model can predict the quantitative changes of land use types so as to simulate land use changes more comprehensively and accurately.

2.3 Estimating Future Industrial Value Added

For the spatialization of industrial added value under different climate change scenarios in the future, appropriate factors of industrial added value change need to be selected. In order to determine the influencing factors of spatial change of industrial value added, land use, population density, and accessibility of the study area should be comprehensively considered. In addition, the distribution of rivers and lakes, and topographic features such as elevation and slope should be considered. Due to the many influencing factors, the ordinary regression model is difficult to comprehensively and accurately reflect the spatial distribution characteristics of the industrial added value under different climate change scenarios. Therefore, this study is based on machine learning, combined with the random forest model to build a spatial model of industrial added value under different climate change scenarios and the model is as follows (Xue and Song 2020):

$$ F = \left( {T,P,{\text{GDP}},{\text{Land}},{\text{Slope}},{\text{Pop}} \ldots } \right) $$
(3)

where F represents simulation results under different climate change scenarios, T represents average annual temperature, P represents average annual precipitation, GDP represents the gross domestic product, Land represents the spatial boundary of future industrial value added, and Pop represents the density of population.

The driving factors used to build the random forest model include industrial value added in 2010, the spatial distribution of urban land, elevation, slope, distance from rivers, distance from roads, distance from railroads, distance from residential settlements, and air temperature and precipitation (representing different climate change scenarios). When mapping the future industrial added value under different climate change scenarios, these driving factors are constant except for temperature and precipitation.

Based on the comprehensive consideration of the fitting speed and accuracy of the model, the parameters of the random forest model were tested. Finally, we built a total of 100 decision trees. In each decision tree, 90% of the samples were randomly selected to build the sample model, and the remaining 10% were used as test data. The simulation results show that the sample accuracy was 0.94 and the test sample accuracy was 0.81. The overall sample accuracy was relatively high, which can well explain the influence of various factors on the industrial value added, so it can be used for the simulation and prediction of industrial added value.

Through the statistics of the proportion of industrial value added in GDP (from the World Bank) and the GDP data under the Shared Socioeconomic Pathways (SSPs), the industrial added value of each country under different SSP scenarios in the future was obtained. According to the estimated proportion of industrial value added of each country and the proportion of industrial value added of each country under different SSP scenarios in the future, the regression model was constructed. Finally, the distribution of industrial added value under SSP1, SSP2, and SSP3 scenarios in 2030 and 2050 was obtained.

3 Results

The industrial value added under different future climate change scenarios was calculated by continent. Then we derived the statistical value of industrial value added of each continent under SSP1, SSP2, and SSP3 scenarios in 2010, 2030, and 2050 (Fig. 2). Overall, in the future, the industrial value added will increase obviously. Compared with other regions, Asia has the largest industrial value added, followed by North America, Europe, Africa, and South America, and Oceania has the smallest industrial value added. The industrial value added of SSP1 was the biggest, followed by SSP2, and the smallest was SSP3.

Fig. 2
figure 2

Industrial value added under the Shared Socioeconomic Pathway (SSP) 1, SSP2, and SSP3 scenarios in 2030 and 2050

4 Maps

figure a
figure b
figure c
figure d
figure e
figure f
figure g