1 Introduction

Environmental degradation has become an ongoing concern across China, especially haze events (Ministry of Ecology & Environment of the People’s Republic of China, 2019). According to the 2019 World Air Quality Report released by IQ Air, 48 Chinese cities featured among the top 100 most polluted cities in the world, which was frequently due to fine particulate matter with an aerodynamic diameter ≤ 2.5 µm (PM2.5).Footnote 1Long-term exposure to high PM2.5 concentrations is a potential health hazard (Guan et al., 2016). Additionally, air quality deterioration results in huge medical costs and negatively impacts tourism, which seriously restricts regional economic development (Becken et al., 2017; Hao et al., 2018; Yang et al., 2019). It has been estimated that the economic cost of environmental degradation has reached approximately 1–8% of the GDP in China (World Bank, 2007; Heck & Hirschberg, 2011). Thus, alleviating haze pollution has become one of the top priorities for sustainable economic development. To formulate targeted policies for haze control, it is necessary to investigate the differences in PM2.5 pollution and its long-term trends among regions.

Current studies mainly focus on the variations of PM2.5 concentrations over time, the heterogeneity among regions, and their influencing factors. Cities, provinces, and urban agglomerations with severe PM2.5 pollutions have been comprehensively explored, such as Beijing, Hefei, and Bohai Rim (Mi et al., 2019; Wang & Fang, 2016; Yan et al., 2018; Zhang et al., 2020; Zou et al., 2016). Specifically, in terms of spatial variations, PM2.5 concentrations have illustrated both heterogeneity and autocorrelation at both the national and regional levels (He et al., 2019; Yang et al., 2018). The highest PM2.5 pollution areas are mainly located in the northern plains and central regions, especially the Beijing–Tianjin–Hebei urban agglomeration (Fan et al., 2020; Ma et al., 2016). Meanwhile, PM2.5 concentrations in urban areas are significantly higher than those in rural areas (Lin et al., 2020). Moreover, PM2.5 concentrations are frequently positively correlated among neighboring areas, indicating spatial clustering characteristics in the form of “high-high” and “low-low” agglomeration centers (Shen et al., 2019).

In terms of temporal trends, PM2.5 concentration demonstrates seasonal, weekly, and diurnal variations. Pollution is much worse in winter than in summer (Fan et al., 2020; Ma et al., 2016); January and December are the worst months of the year for PM2.5 pollution, while July and August are the best (Song et al., 2017). Simultaneously, due to vehicular traffic, the concentration of PM2.5 is the highest on Monday, followed by the weekend, and is the lowest on Wednesday (Huang et al., 2015). During the day, the highest PM2.5 concentrations occur from 8 to 12 am, and the lowest occur from 3 to 5 pm (Shen et al., 2019).

In addition to spatial and temporal variations, scholars have investigated the factors that influence the regional PM2.5 pollution. Studies have shown that there are mainly two types of factors. The first are meteorological factors, such as temperature, air pressure, and wind speed (He et al., 2019; Tai et al., 2010); the second are socio-economic factors, such as land use, economic development, population growth, and urbanization (Han et al., 2018; Lin et al., 2020; Xu, Chen, et al., 2019; Xu, Miao, et al., 2019).

Although the above studies have performed comprehensive investigations, there are still some inadequacies. First, most relevant studies compared haze pollution among regions by directly comparing the level differences in PM2.5 concentrations during a set period. However, comparisons in this way are not only susceptible to occasional extreme meteorological conditions (Luo et al., 2019), but may also ignore change trends, making it difficult to determine convergence. For example, although the PM2.5 concentration of Longnan was always higher than that of Zhangzhou from 1998 to 2016, the continuous improvement of air quality in Longnan and the continuous increase in PM2.5 pollution in Zhangzhou led to their gradual convergence. However, if only their mean PM2.5 concentrations during this period are compared, it is difficult to find such narrowing gap. In contrast, a dynamic, long-term convergence analysis would depict it. Second, the current studies were mostly based on monitoring data from fixed observation points in cities (Song et al., 2017; Shen et al., 2019; Fan et al., 2020; Jiang et al., 2020). However, because air quality monitoring stations were not fully covered in China until 2013, the available data are temporally limited, which prevented any long-term or dynamic calculation, including convergence analysis. Notably, long-term convergence analysis of PM2.5 concentrations, especially the spatial distribution of convergence clubs, can provide a new perspective for understanding the current haze pollution. Third, previous research on club convergence usually artificially divided samples into several subgroups according to specific information, such as geographical locations or institutions; however, these artificial divisions may cause biased results. This is because this approach may divide samples that should belong to the same convergence club into different subgroups or fail to detect potential convergence clubs (Bai et al., 2021). Fortunately, this bias can be avoided by using the data-driven algorithm developed by Phillips and Sul (2007). Recent studies have applied this algorithm to various issues, such as carbon emissions (Apergis & Payne, 2017) and energy efficiency (Parker & Liddle, 2017). However, there are no studies that use this method to rigorously investigate the convergence of PM2.5 concentrations so far. Fourthly, although analyses of PM2.5 pollution in typical Chinese cities, provinces, or urban agglomerations are common, few studies have involved all cities over a long period, which may lead to bias due to sample size and data fineness, especially when exploring the factors that influence PM2.5 pollution.

For these reasons, this study conducted convergence analysis to explore the dynamic evolution of PM2.5 concentrations for 342 prefecture-level cities in China from 1998 to 2016. To this end, we first conducted the log t regression test proposed by Phillips and Sul (2007), followed by a club clustering algorithm, to empirically investigate the overall convergence and club convergence of the PM2.5 concentrations. Then, we analyzed the spatial distribution patterns and driving forces of the convergence clubs using the spatial autocorrelation test and Ordered Probit regression.

This study contributes to the literature in the following ways. First, we first explored the convergence patterns and its distribution characteristics of PM2.5 concentrations for Chinese cities, describing the trends in the differences in PM2.5 pollution among cities from a dynamic perspective. Second, by applying the nonlinear regression-based test and club clustering algorithm, the obtained convergence patterns were more reliable and robust. This is because this method allows for inter-city heterogeneity; it does not set specific assumptions about the stationarity properties of the data and avoids possible biased results that may be caused by artificial division (Du, 2017). Third, we found that PM2.5 concentrations in Chinese cities showed club convergence characteristics and were the first to investigate the factors characterizing the observed clustering patterns. Fourthly, compared with the ground-observed data in previous studies, satellite-derived data from the Socio-economic Data and Applications Center (SEDAC) are generally more accurate and have a longer time span (Van Donkelaar, 2015). Thus, it allowed us to explore the evolution of PM2.5 concentrations in Chinese cities over a longer period, which produced a more complete picture of how haze pollution evolved during China’s urbanization process.

The remainder of this paper proceeds as follows. In Sect. 2, the methods and data sources are introduced in detail. Then, in Sect. 3, the empirical results of the PM2.5 convergence tests for the full panel and within groups are described. In Sects. 4 and 5, the reasons for the observed PM2.5 convergence club classification are explored, including their geographical distribution patterns and the influencing factors contributing to the patterns. The last section concludes the paper and proposes targeted policies.

2 Method and data

2.1 Method

The methods of log t regression test and a club clustering algorithm proposed by Phillips and Sul (2007) were used to explore the convergence of PM2.5 concentrations in Chinese cities. In general, the log t regression method tests for convergence by checking whether the cross-sectional variance of the idiosyncratic part of the interest variable eventually converges to zero. Compared with classical convergence tests, this method has numerous advantages. First, because it also works under the possibility of transitional heterogeneity or even transitional divergence, long-term common fluctuation will always be captured even in the absence of cointegration (Panopoulou & Pantelidis, 2009). Second, this methodology does not set specific assumptions about the smoothness trends or stochastic non-stationarity of the data (Du, 2017). Third, as mentioned above, by repeatedly conducting the log t regression test for nearly all the combinations of cities in a specific order, its identification of club convergence not only avoids potentially biased results caused by artificial division, but also can detect the subgroup with the most obvious convergence characteristics.

2.1.1 log t regression test

According to the convergence test method proposed by Phillips and Sul (2007), the PM2.5 concentration of city \(i\) during year \(t\), as denoted by \({x}_{it}\), can be decomposed as follows:

$${x}_{it}={\delta }_{it}{\mu }_{t}$$
(1)

where \({\mu }_{t}\) represents the common trend section of the data that describes some stochastic or deterministic trending behaviors, and \({\delta }_{it}\) measures the evolution of idiosyncratic elements over time. In Eq. (1), a long-run equilibrium in \({x}_{it}\) is equivalent to the convergent form of \({\delta }_{it}\), namely \(\underset{t\to \infty }{\mathrm{lim}}{\delta }_{it}=\delta\) for any \(i\). To eliminate the common factor \({\mu }_{t}\), \({x}_{it}\) is scaled down to the \({h}_{it}\).

$${h}_{it}=\frac{{x}_{it}}{\frac{1}{N}{\sum }_{i=1}^{N}{x}_{it}}=\frac{{\delta }_{it}}{\frac{1}{N}{\sum }_{i=1}^{N}{\delta }_{it}}$$
(2)

Thus, if \({\delta }_{it}\) converges to \(\delta\), the cross-sectional variance of \({h}_{it}\), that is \({\eta }_{t}=\frac{1}{N}{\left({h}_{it}-1\right)}^{2}\), will eventually converge to zero. Using this property, a log t regression can be constructed as follows:

$$\mathrm{log}\left(\frac{{\eta }_{1}}{{\eta }_{t}}\right)-2\mathrm{log}\left[\mathrm{log}\left(t+1\right)\right]=\widehat{a}+\widehat{b}\mathrm{log}\left(t\right)+\widehat{{\varepsilon }_{t}}$$
(3)

Under some regularity assumptions, the robust t statistic for the coefficient \(\widehat{b}\) is constructed as follows:

$${t}_{\widehat{b}}=\frac{\widehat{b}-b}{{s}_{b}}$$
(4)

where \({s}_{b}\) is the long-run heteroskedasticity and autocorrelation consistent standard error. For the one-sided test at the 5% level, the null hypothesis of convergence for the whole sample should be accepted if \({t}_{\widehat{b}}>-1.65\); otherwise, it is rejected.

2.1.2 Club clustering algorithm

Phillips and Sul (2007) proposed a data-driven algorithm for identifying clusters of samples with homogeneous convergence behaviors. First, a core group that consists of the cities with the highest t statistic is determined. Adding one city at a time to the core group, the log t regression test is repeatedly performed to decide which cities could be included in the core group. This updated core group is the first convergent subgroup. Then, the log t regression test is performed among the remaining cities screened above. If the convergence cannot be rejected, this subgroup forms the second convergence club; otherwise, the steps above are repeated to detect smaller convergent subgroups.Footnote 2

Later, Schnurbus et al. (2017) improved the original algorithm in terms of club merging. To produce the minimum number of convergence clubs, the joint log t test is performed for the adjacent pairs generated above, and they are merged if the convergence hypothesis holds. This step is iterated until the resulting clubs can no longer be merged.

2.1.3 The ordered probit model

The obtained convergence clubs are successively assigned positive integer values of 1 to J, where J is the total number of convergence clubs, according to rule that the higher the PM2.5 concentration mean is among the club members, the higher the assignment will be. In this case, it is assumed that:

$$\mathrm{P}\left({y}_{i}=j|{{\varvec{X}}}_{i}\right)=\mathrm{P}({r}_{j-1}<{{\varvec{X}}}_{i}^{^{\prime}}\beta +\varepsilon_{i} <{r}_{j}\left|{{\varvec{X}}}_{i}\right)=\Phi \left({r}_{j}-{{\varvec{X}}}_{i}^{^{\prime}}\beta \right)-\Phi \left({r}_{j-1}-{{\varvec{X}}}_{i}^{^{\prime}}\beta \right)$$
(5)

where \({y}_{i}\) is the convergence club type to which the city i belongs, the \({{\varvec{X}}}_{i}\) vectors are the factors that may affect the PM2.5 concentration, \({r}_{j}\) is the cut point, and \({\varepsilon }_{i}\) is the disturbance term. Using the assumption that \({\varepsilon }_{i}\) is \(N(\mathrm{0,1})\), the sample likelihood function and maximum likelihood estimation value can be obtained.

2.2 Data sources and processing

The PM2.5 concentration data in this study were derived from the global annual average PM2.5 grids database obtained by the SEDAC at Columbia University.Footnote 3 These data were obtained from the aerosol optical depth instruments of several satellites, were gridded at a 0.01° × 0.01° resolution, and had dust and sea-salt removed. By matching this data to maps of China, this study determined the annual average PM2.5 concentrations of 342 Chinese prefecture-level cities from 1998 to 2016, which is a longer data set than those of other studies (Ye et al., 2018; Zhang et al., 2016).

Gridded satellite-derived data are generally more accurate and reliable tha2017n ground-observed data from urban monitoring stations, which are used in most studies (Song et al., ; Shen et al., 2019; Fan et al., 2020; Jiang et al., 2020). As non-point source data, gridded data can better capture spatial variability from a regional perspective (Lin et al., 2018). Moreover, because global monitoring can be performed from a fixed satellite, possible biases attributed to differences in the designs or operations among regional monitoring stations are avoided (Van Donkelaar, 2015). In contrast, owing to local weather, factory emissions, or population agglomeration, ground-based PM2.5 observations exhibit fluctuations that may lead to deviation during the retrieval and simulation by relatively sparse monitoring (Chen et al., 2006).

In terms of the influences on convergence clubs, this study considered that PM2.5 pollution is affected by both meteorological and socio-economic factors (He et al., 2019; Xu et al., 2020; Yun et al., 2019) to select ten possible variables, referring to the methods of Tai et al. (2010) and Xu et al. (2020). In addition, because the explained variable is the convergence club type, which is cross-sectional data, the ten explanatory variables were selected as the mean values from 2006 to 2016.Footnote 4Detailed descriptions of the data are provided in Table 1Footnote 5.

Table 1 Descriptions of the influencing factors

To remove the influence of price, the actual completed amount of real estate development investment was adjusted to the real value according to its corresponding city’s fixed asset investment price index for each year. Data for the above variables were obtained from the National Meteorological Science Data Center (http://data.cma.cn/) and the China City Statistics Yearbook (2006–2016).

3 Convergence test

3.1 Results of the log t test for the full panel

We first conducted a log t regression test to detect the overall convergence of PM2.5 concentrations for 342 Chinese cities.Footnote 6The results are shown in Table 2.

Table 2 Results of the log t test of full panel convergence

Table 2 shows that the coefficient for the entire sample set test was negative, and the corresponding t statistic was -81.9438, which is far less than the critical value of -1.65. Thus, the null hypothesis of convergence for the full panel should be rejected at the 5% significance level, which indicates that the cross-sectional dispersion of PM2.5 did not tend to shrink over time.

Based on the results, not only were there significant differences in PM2.5 concentrations among Chinese cities, but overall, the differences did not narrow during 1998 to 2016. Therefore, the variation in PM2.5 concentrations across cities was not a contingency, but a long-term feature. According to Eq. (1), such differences could not be explained by common factors, but came from the heterogeneous factors of different cities.

3.2 Results on the club clustering

Although the data did not support full panel convergence, cities with similar economic development patterns or meteorological conditions might converge within the groups. To examine this, the empirical club clustering algorithm was used to further identify the club convergence and divide all the samples into respective clubs.

The results initially divided the 342 cities into 20 convergence clubs and one divergent group. Further, by conducting the iterative club merging algorithm proposed by Schnurbus et al. (2017), the initial 20 clubs were merged into 11 final convergence clubs. The list of cities they include are shown in Table 3.

Table 3 List of cities in each convergence club

As can be seen from Table 3, the cities converged to 11 different steady states, indicating that the PM2.5 concentrations across Chinese cities generally diverged, but followed convergence patterns within 11 subgroups.The existence of club convergence means that the differences in PM2.5 concentrations among the cities within the same club gradually shrank or even disappeared over time. In other words, PM2.5 pollution in these cities converged and showed the same trend. For example, the PM2.5 concentration in Handan in 1998 was 21.22 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\), while that of Cangzhou was 46.25 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\). Although the initial PM2.5 pollution in Handan was significantly better, it increased faster, narrowing the difference. In 2012, the difference has narrowed to less than 12 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\) and has remained roughly the same since then. Because of this narrowing gap, Handan and Cangzhou were both divided into the same club.

3.3 Statistical characteristics of the convergence clubs

Although the PM2.5 pollutions of the cities in the same club showed similar trends, the PM2.5 pollutions of the cities belonging to different clubs showed various long-term patterns. In other words, different convergence clubs exhibited significant differences. We first classified and named each convergence club according to their mean PM2.5 concentrations and growth rates and then conducted a comparative analysis. Table 4 shows the descriptive statistics of the 11 convergence clubs, where the mean and annual growth rates are calculated as the average of its member cities.

Table 4 Statistical characteristics of the convergence clubs

According to the Chinese Ambient Air Quality Standards (GB 3095–2012) as well as the relative values of the 11 clubs, we took 30 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\), 15 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\), 10 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\), 7 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\), and 5 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\) as the critical points to divide them into six different concentration levels, namely high, relatively high, medium, relatively low, low, and very low. Similarly, we took 4%, 3%, 2%, and 1.5% as the critical points to divide the clubs into five different growth rate levels, namely high, relatively high, medium, relatively low, and low. Consequently, Club 1 to Club 11 were successively named as shown in the column (6) of Table 4, and the column (7) is the abbreviation. Actually, the critical points for naming clubs were ultimately unimportant because the following analyses were based on the relative values of the clubs, rather than the absolute values.

There were some findings from comparing. First, in terms of the differences among these clubs, the difference between the two with the highest and lowest mean concentrations was approximately 60 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\), and the highest growth rate was more than three times that of the lowest. This indicates that the common trends display very distinctive patterns among convergence clubs. The significant differences between each convergent steady state also suggest that there are some key roles in characterizing the PM2.5 convergence. In addition, all convergence clubs displayed an upward trend.

Second, the number of cities in the four clubs with high concentration levels accounted for more than 50%, which means that more than half of the cities in China were converging to clubs with high PM2.5 concentration from 1998 to 2016. Therefore, PM2.5 pollution remains a concern.

Finally, considering the relationship between the mean and annual growth rate, although the mean PM2.5 concentrations decreased successively from Club H-H to Club VL-H, the growth rates showed a trend of increase and then decrease. Specifically, the annual growth rates of the top two clubs with the highest PM2.5 concentrations, Club H-H and Club H-RH, were both above 3%; thus, the PM2.5 pollutions of the cities in these two clubs have rapidly worsened from 1998 to 2016. As the climate did not change much during that short period, this rapid increase was likely attributed to social and economic factors. Moreover, the annual growth rates of the two clubs with the lowest PM2.5 concentrations, Club L-RH and Club VL-H, were also above 3%, indicating that low PM2.5 concentrations are not always accompanied by low growth rates. Although their current PM2.5 concentrations were favorable, their high growth rates may result in a worsening air quality over time, if effective measures are not taken.

4 Analysis of the spatial distribution of convergence clubs

The above results confirm that the PM2.5 concentrations in Chinese cities showed club convergence, and the 342 cities were divided into 11 convergence clubs and one divergent group. This means that cities belonging to the same club had become increasingly similar over time in terms of haze pollution, and changes in their PM2.5 concentrations could be explained by common factors. It is important to determine what these common factors are and explore the reasons behind the current club divisions. We first examined the connections among cities from the perspective of geographical distribution.

4.1 Geographic distribution of the convergence clubs

Figure 1 shows the spatial distribution of the 11 convergence clubs and one divergent group; it shows that the cities belonging to the same club tend to be geographically clustered. Specifically, members of the four clubs with high PM2.5 concentrations (Club H-H, Club H-RH, Club H-M, Club H-L) were mainly located in the central and northern parts of East China, the southern part of North China, the central part of Northeast China, and throughout Central China, especially the areas between the middle and lower Yellow River and Yangtze River. Members of the two clubs with low or very low PM2.5 concentrations (Club L-RH and Club VL-H) were mainly in Qinghai–Tibet Region and the central part of Northwest China. Therefore, from the perspective of long-term convergence, the PM2.5 pollution in Chinese cities showed a marked spatial distribution pattern in which the east was more serious than the west and the north was more serious than the south, which is consistent with the conclusions of previous studies (Fan et al., 2020; Ma et al., 2016; Wang et al., 2017). At a province level, Shandong was the province with the most members of Club H-H. By contrast, all cities in Tibet belonged to Club L-RH or Club VL-H, and their average annual PM2.5 concentrations were all below 4 \(\mathrm{\mu g}/{\mathrm{m}}^{3}\). Qamdo prefecture was the cleanest city in China.

Fig. 1
figure 1

The distribution of convergence clubs and the divergent group

4.2 Spatial correlation analysis

To further investigate the distribution characteristics of the convergence clubs, an autocorrelation test was conducted on their spatial dependence using the GeoDa software (Center for Spatial Data Science, University of Chicago, Chicago). Specifically, the global Moran’s I index of the convergence clubs was first calculated. Then, to intuitively show the main types of local spatial agglomeration patterns, the Moran scatter plot for all cities was drawn (Fig. 2). Finally, to clearly display their specific geographic locations, a map of local indicators of spatial association was drawn (Fig. 3).

Fig. 2
figure 2

Moran scatter plot

Fig. 3
figure 3

Local indicators of spatial association map

By constructing a spatial adjacency matrix, the obtained global Moran’s I value of the convergence club types in all cities was 0.879, and the P value was 0.001, which indicates a positive correlation at the 1% significance level. In other words, the geographic distribution of the PM2.5 convergence clubs was not random, but showed significant agglomeration and spatial dependence.

As shown in Fig. 2, almost all cities were distributed in the first or third quadrants, indicating that the club convergence mainly presented high-high and low-low clusters with a few high-low and low-high outliers. Figure 3 shows that, except for Central China, Southern China, and the western part of Eastern China, the remaining regions evidenced spatial dependence and spatial spillover effects. Specifically, the low-low cluster types were mainly distributed in Southwest China and Northwest China, while the high-high cluster types were mainly distributed in Northeast China, Northern China, and the Yangtze River Delta. The increasing interconnection of economic activities between cities might be the reason of spatial autocorrelation, especially in developed areas. In fact, due to the increasingly frequent movement of labor and the radiating effects of economic activities among regions, classifying cities by their functional urban areas rather than by their administrative boundaries has become widely used (Ma & Long, 2020).

4.3 Regional distribution difference

Despite the global agglomeration and spatial dependence of the convergence clubs, when comparing the pollution differences among cities in each province, regional heterogeneity emerged. Table 5 displays the clubs of the cities in each province.

Table 5 The clubs of the cities in each province

Table 5 shows that more than 75% of the cities in Jiangsu, Fujian, Tibet, and Hunan provinces fell into the same club, while cities in Sichuan, Yunnan, Guangdong, Heilongjiang, and Guizhou provinces were almost evenly distributed among different clubs. This indicates that in some provinces, PM2.5 pollutions among cities were relatively similar or gradually converging and thus, can be explained by same factors. However, in other provinces, PM2.5 concentrations varied greatly among cities, and different cities converged to different steady states. The possible reasons for this phenomenon are as follows: First, due to different geomorphic features, the degree of differences of meteorological conditions in constituent cities vary greatly across provinces. Since meteorological conditions are an important factor affecting PM2.5 (Chen et al., 2020), it may have caused the different dispersion of clubs among province. For example, the natural environment of Guizhou Province is complex and diverse, including different landforms such as plateaus, mountain basins, and river terraces, which results in significantly different meteorological conditions and then different PM2.5 among its constituent cities despite a close location. By contrast, the meteorological conditions of 13 cities in Jiangsu Province are relatively similar because its topography is mainly plain. Thus, pollutions among these cities did not differ due to local climate. Second, since the dispersions of economic development vary among provinces and economic development is an important factor affecting PM2.5 pollution (Xu & Lin, 2016; Xu, Miao, et al., 2019), the different dispersion of club may be explained by the different dispersion of economic development.

In addition, there were some singularities where the PM2.5 concentrations or growth rates were significantly different from the surrounding cities. As shown in Fig. 1, this included cities with significantly more severe pollution than the surrounding areas, such as Xiamen, Shihezi, Changchun, Guiyang, Weinan, Yuncheng, Chengdu, and Ziyang. It also included cities with better air quality than the adjoining areas, such as Enshi Tujia and Miao autonomous prefecture, Yangjiang and Zhanjiang. Similar to the above analysis, the possible reason for this may also be due to the differences in meteorological and economic factors.

4.4 Further analysis

Many studies have similarly examined the geographical distribution characteristics of PM2.5 pollution, including by spatial autocorrelation test (Zhang et al., 2016; Ye et al., 2018; Xu, Chen, et al., 2019). However, although the research methods were similar, the subjects and perspectives were completely different. The subject of previous studies was regional PM2.5 concentrations, and thus, the spatial distribution pattern exhibited the regions with similar PM2.5 from a static perspective. In contrast, the subject of this section was the convergence clubs to which the cities converged, which depicted the geographical distribution characteristics of cities with similar PM2.5 change trends from a dynamic perspective.

Due to different research subjects, there are some new findings regarding to the geographical distribution characteristics of PM2.5 pollution. First, although cities in the same urban agglomeration are located adjacently and have close economic ties, they might still be divided into different clubs. For example, as shown in Fig. 1, not all cities in the Beijing-Tianjin-Hebei urban agglomeration converged to the same club, but were scattered over different clubs with differences in PM2.5 concentrations or growth rates. The same was also true for the Pearl River Delta and Yangtze River Delta urban agglomerations. This indicates that the PM2.5 differences among these closely connected cities were increasing, and they showed different patterns of long-term variation. Therefore, in addition to the studies on the differences among typical urban agglomerations (Lin et al., 2018; Shen et al., 2019), the exploration of differences in PM2.5 pollution within each agglomeration should not be ignored.

Second, cross-regional convergence was a common occurrence. For example, Changchun and Jinan, which are more than 1,000 km apart, both belonged to the Club H-H. This not only indicates that there are other factors affecting the convergence of PM2.5 besides geographical location, but also that there will indeed be a bias if samples are divided in advance according to geographical location when identifying convergence clubs.

5 Analysis of the influencing factors of club convergence

The spatial distribution of convergence clubs showed a significant clustering characteristic of same type, indicating the role of geographical location of a city in determining its convergence club. However, geographical location reflects the cause, but it is not the direct determining factor. Further, to reveal the reasons and propose targeted solutions for pollution, we explored its diverse determinants.

To obtain an intuitive understanding before the formal regression analysis, we first carried out a comparative analysis of the characteristic variables of 11 clubs. Table 6 shows the mean values of each club for the 10 variables listed in Table 1. Due to variance heterogeneity, the Welch test is applied for one-way Analysis of Variance and the corresponding F statistic is showed in the last row.

Table 6 The mean values of each club for the factors affecting the club convergence

Table 6 shows that the F statistics of more than half variables are significant at the 1% level, such as tempe, humid, and popula, which indicates that there are indeed significant differences in these variables among 11 clubs. Thus, these may be the possible factors characterizing the observed clustering patterns. Additionally, Table 6 also reveals some trends. For example, the population density (popula) of clubs with high PM2.5 concentrations is significantly higher than that of clubs with low concentrations, and there is an apparent trend that the higher the PM2.5 concentration of the club, that is from Club VL-H to Club H-H, the higher the population density.

Although the comparative analysis provides the intuitions, the correlation among different variables may bias the result. In addition, the question of interest is not only the difference among the clubs in these factors, but also their specific relations with the club type, that is, how changes in these variables affect the club to which the city converges. Therefore, we further examined the driving factors that characterize the clubs based on normative regression analysis. As mentioned above, the explained variable was set as the convergence club type. Specifically, Club H-H to Club VL-H in Table 4 were successively assigned as 11 to 1, that is, the higher the PM2.5 concentration of the club, the higher the assigned value. Based on the Ordered Probit model with a robust standard error, Table 7 lists the estimated results, in which column (2) is based on 276 prefecture-level cities.Footnote 7 As a robustness test, column (3) is based on the samples remaining after removing the municipalities and provincial capitals, and columns (4)–(7) show the average marginal effect of the four high PM2.5 concentrations club type.

Table 7 Regression results for factors affecting the club convergence

Column (2) of Table 7 shows that the coefficient of the relative humidity is significantly positive at the 10% level. This indicates that the higher the humidity is in a city, the more likely the city is to converge to a club with a higher PM2.5 concentration, which is consistent with the findings of Zhang et al. (2020). The response of PM2.5 to humidity is mainly due to the complex characteristics of aerosol components under different meteorological conditions. One possible reason is that higher humidity makes the soluble particles in PM2.5 absorb water and expand more easily, thus increasing the size and mass concentration of PM2.5 (Chen et al., 2017).

The coefficient of the industrial structure is significantly positive at the 5% level, indicating that cities with more developed industries are more likely to converge to a club with a higher PM2.5 concentration. If the development and application of clean technology are limited, extensive industrial development with a high energy input and high pollution emissions usually causes a conflict between the environment and development (Zhang et al., 2013).

The coefficient of population density is significantly positive at the 5% level, demonstrating that cities with a denser population also tend to converge to a club with a higher PM2.5 concentration. This is because continuous population growth greatly raises energy consumption. Without significantly improved consumption structures, fossil fuels are still the main energy source in some areas. Their burning, especially inferior coal, produces many gaseous pollutants that easily convert to secondary aerosol species, which are main components of PM2.5 (Wang et al., 2014).

The coefficient of the proportion of construction land is significantly positive at the 5% level, indicating that the more rapidly construction expands, the more likely the city is to converge to a club with a higher PM2.5 concentration. This relation can be attributed to the impact of urban construction design and its thermodynamic properties. Studies have found that inappropriate building layouts may block airflow and result in urban heat islands, making it difficult for ground-level aerosols to diffuse outward, thus exacerbating PM2.5 pollution (Dobrovolny & Krahula, 2015; Hang et al., 2012).

However, the coefficient of the public transportation utilization rate is significantly negative at the 10% level, indicating that cities with more frequent public transportation use tend to converge to a club with a lower PM2.5 concentration. Emissions released by vehicles are an important source of PM2.5 pollution in urban areas (Yang et al., 2011); thus, cities with a high per capita car ownership usually have more severe PM2.5 pollution (Xu & Lin, 2016; Zhang et al., 2018). In contrast, public transportation, as an alternative to private transportation, can significantly reduce PM2.5 pollution by decreasing vehicle exhaust.

The coefficient of environmental regulation is also significantly negative at the 1% level, which indicates that the greater the intensity of environmental regulation, the more likely a city is to converge to a club with a lower PM2.5 concentration. Thus, from the perspective of long-term convergence, government actions can effectively alleviate urban PM2.5 pollution, which is consistent with the findings of Zhou et al. (2019). In fact, environmental regulation not only improves regional air quality directly through environmental governance, but also affecting micro-enterprise decisions. It can influence the emission and pollution control decisions of enterprises through incentives such as taxes and subsidies or direct central supervision (Laplante & Rilstone, 1996; Liu et al., 2021; Shapiro & Walker, 2018), forcing polluting enterprises to relocate or close (Zeng & Zhao, 2009). Moreover, environmental regulation can also encourage enterprises to innovate energy-efficient and environmentally friendly technology, encourage the use of clean products, and improve environmental quality; this is known as the Porter hypothesis (Porter, 1991).

In addition, the coefficients of temperature, electricity consumption, R&D intensity, real estate investment and construction intensity are not significant, indicating that these variables have no significant impact on the club convergence.

Furthermore, the coefficients showed in column (3) are similar to those in column (2), which indicates that the above conclusions are robust and not affected by the economic and policy conditions of municipalities or provincial capitals. In addition, columns (4)–(7) shows that the average marginal effects of 10 variables on the probability of converging to the Club H-H, Club H-RH or Club H-M are consistent with the above discussion. By contrast, there is no variable that has a significant marginal effect on the probability of converging to the club H-L. This indicates that higher humidity, higher secondary industry proportions, higher population density, and more rapid construction land expansion not only increase the likelihood that a city would converge to a club with a higher PM2.5 concentration, which is the result from columns (2) and (3), but more precisely, increase the likelihood that a city would converge to the Club H-H, Club H-RH or Club H-M, the top three clubs with the highest PM2.5 concentrations.

When considering the correlation among the explanatory variables studied, the continuous advancement of urbanization has made the increasing trends in the urban population density and construction land ratio inevitable. The significant positive coefficient of the industrial structure also indicates that, for most cities in China, industrial production still extensively consumes resources. This indicates that there is a conflict between environmental protection and economic development. The key to resolving this issue and realizing coordinated improvement of the two should rely on upgrading industry through knowledgeable input (Zhao et al., 2021). In addition, reasonable urban planning, combined with effective transportation facilities, can also serve as important supplementary means to alleviate this issue. However, the key to realizing these changes is the implementation and enforcement of governmental environmental regulations.

6 Conclusions and policy implications

By performing the log t regression test, club clustering algorithm, spatial autocorrelation test, and Ordered Probit regression, this study performed an empirical convergence analysis of PM2.5 concentrations for 342 Chinese cities, which not only identified its overall divergence and club convergence but also explored the spatial distribution pattern as well as its influencing factors.

The main conclusions are as follows: (1) There was no evidence of full panel convergence, but the results showed club convergence. Eleven convergence clubs and one divergent group were identified. (2) The geographical distribution of the convergence clubs showed significant spatial interdependencies. (3) Higher humidity, higher secondary industry proportions, higher population density, and more rapid construction land expansion increased the likelihood that a city would converge to a club with a higher PM2.5 concentration. However, a city with more frequent use of public transportation and a greater intensity of environmental regulation tended to converge to a club with a lower concentration.

Based on these conclusions, we propose following two policy implications. First, “one-size-fits-all” environmental policies should be avoided and replaced by a set of city-specific plans. This is because the PM2.5 pollution in Chinese cities converged to different clubs over time. This was true even among geographically adjacent cities in the same urban agglomeration with a close economic linkage. Second, because higher utilization of public transport increased the likelihood that a city would converge to a lower pollution club, efforts should be made to build a convenient and effective urban transit systems and encourage residents to travel by it. Conversely, because excessive urban expansion influenced cities to converge to a club with a higher pollution, it is critical to plan urban layouts reasonably. The distribution, height, and density of buildings should be designed in consideration of the local meteorological conditions, especially the wind speed and direction, to improve the city’s capacity of dispersing ground-level pollutants. Active research and development of new technologies, such as the Internet of Things, may provide a way (Bai et al., 2020; Gonzalez-Zamar et al., 2020). As the conclusion shows, since government actions can effectively alleviate urban PM2.5 pollution, it is necessary to give full play to the positive role of the regulation in environmental protection. To achieve energy efficiency and emissions reduction, regulations should force industrial upgrading, change the energy consumption structure, and commit to building environmentally friendly cities.

Although this study reveals the convergence patterns of PM2.5 in Chinese cities and its main influencing factors, which provides recommendations for the haze control, there are two deficiencies. First, because this study mainly focuses on the convergence characteristics of PM2.5 and is limited by space, we do not further investigate the specific mechanism of each factor affecting the current convergence patterns. In the future research, we will conduct more detailed analyses in this regard. Second, this study mainly focuses on the convergence characteristics of PM2.5 in Chinese cities, and thus, the political and institutional influencing factors cannot be examined in depth. Our future research will extend the dataset to include other countries and cities to further explore more potential influencing factors.