Background

Many epidemiological studies have reported that exposure to air pollution is associated with an increased risk for cardiovascular and respiratory diseases [1,2,3,4,5], even at concentrations less than the current health-based guidelines [6,7,8]. The Global Burden of Disease study identified air pollution as a leading cause of global disease burden, especially in developing countries [9, 10]. Lelieveld reported that ambient air pollution leads to more than 3 million premature deaths globally each year, and that China had the most premature deaths (1.36 million) [11].

As a result of rapid industrialization and urbanization in the past two decades, China is experiencing one of its worst air pollution situations. In the first quarter of 2013, China experienced extremely severe and persistent haze pollution, affecting an area > 1.3 million km2 and approximately 800 million individuals [12]. The annual average particulate matter < 2.5 μm in aerodynamic diameter (PM2.5) and particulate matter < 10 μm in aerodynamic diameter (PM10) concentrations were 141 μg/m3 and 303 μg/m3, respectively [13]. Shijiazhuang has been listed as the second-worst polluted city, with record-breaking daily average concentrations on January 12, 2013, of 771 μg/m3of fine particles (PM2.5) and 800 μg/m3 of inhalable particles (PM10). However, only a limited number of studies have investigated the health effects of such levels of air pollution.

It has been established that children are vulnerable to the effects of air pollution [14, 15]. Evidence suggests that ambient air pollution has the potential to increase the severity of respiratory diseases, particularly in children. Nhung [16] observed that all ambient air pollutants (PM2.5, PM10, PM1, SO2, NO2, NOx, O3, and CO) were positively associated with pneumonia hospitalizations in children. Statistically significant associations were observed for most pollutants, except for O3 and SO2. Moreover, stronger associations were observed in infants than in older children [16]. Another study found that four pollutants (PM2.5, PM10, NO2, and SO2) were significantly associated with hospital visits for acute upper and lower respiratory infections. A time-series analyses from Shanghai (China) found that an increase of 2.49 μg/m3 in black carbon was associated with a 7% (95% CI: 5–8%) increase in asthma admission [17]. Moreover, contrary to the study by Nhung, stronger associations were observed among older children [17]. Another study from China also found stronger associations in older children [18]. Despite mounting literature suggesting that air pollution may be associated with respiratory disease in children, information regarding the association remains limited. It is important, therefore, to determine the reasons for these inconsistent data, and to study the exact respiratory effects of air pollution on children, particularly in severely polluted cities.

In the present study, we conducted a time-series study to investigate the association between five ambient air pollutants (PM2.5, PM10, sulphur dioxide [SO2], nitrogen dioxide [NO2] and Ozone [O3]) and child respiratory outpatients in children in Shijiazhuang, China.

Methods

Shijiazhuang, the capital of Hebei province, comprises eight urban and suburban districts, with a total area of 2206 km2 and a population of 4.55 million at the end of 2013. The study area was limited to the traditional four urban districts (469 km2). Approximately 2.19 million permanent residents include 0.31 million children (< 15 years of age) residing in these four districts in 2015.

Hospital outpatient data

The Children’s Hospital of Hebei Province is the sole paediatric hospital in Shijiazhuang. Daily hospital outpatient visit data from January 1, 2013 to December 31, 2015 were collected from a database located at this hospital. All disease diagnoses were completed by computer coders. To validate health data, duplicate records were deleted and International Classification of Diseases, 10th Revision (ICD-10) codes were re-matched as reported in the authors’ previous research using MySQL server (version 5.6.26) [19]. The data cleaning strategy is described in Fig. 1. When there was a difference between the newly matched and original codes, this record would be picked up and discussed by a doctor’s team. Some errors, such as the wrong word, acronym or non-standard name, in the disease diagnosis would be changed to standard names and subsequently re-matched to an accurate ICD-10 code.

Fig. 1
figure 1

Data cleaning flow diagram

The respiratory outpatients’ data (ICD-10 codes J00-J99) were selected and targeted in the database. Patients residing outside of the four urban districts and those >14 years of age were excluded from the analysis. Outpatient visits caused by infection, suppuration, or ulceration were also excluded from this study. Finally, hospital respiratory outpatients’ visits (ICD-10 codes J00-J99, excluding pathogenic infections, abscess, suppuration, gangrenous and ulcerative diseases) and six specific or classified diseases (acute upper respiratory infections, ICD-10 codes J02-J06; pneumonia, J18; other acute lower respiratory infection, J20-J22; other diseases of upper respiratory tract, J30-J39; chronic lower respiratory diseases, J40-J47; and other respiratory diseases, J60-J99) were identified as health outcomes.

Air pollution and meteorological data

Daily air pollution data, including PM2.5, PM10, SO2, NO2 and O3, were obtained from the website of China’s National Urban Air Quality Real Time Publishing Platform (http://106.37.208.233:20035/). The platform is administered by China’s Ministry of Environmental Protection. Hourly concentrations of each pollutant were measured from seven fixed site stations distributed in the four urban districts. These stations are mandated to be located away from major roads, industrial sources, buildings, and residential sources of emission from the burning of coal, oil or waste. This ensures that monitoring results reflect the urban air pollution level in the city rather than local sources of traffic or industrial combustion. The methods were based on the tapered element oscillating microbalance, ultraviolet fluorescence, chemiluminescence, ultraviolet fluorescence were used to measure PM (PM2.5 and PM10), SO2, NO2, and O3, respectively. For PM2.5, PM10, SO2, and NO2, daily concentrations were represented 24 h averages, and the O3 concentration was the maximal 8 h average from all valid monitoring sites in this study.

Daily mean temperature and humidity data were retrieved to adjust the effects of weather on hospital outpatients. Meteorological data were measured at a fixed site station and obtained from the Meteorological Bureau of Shijiazhuang.

Statistical analysis

Time-series analysis is a regular analytic method to explore the acute effects of air pollution based on the daily aggregate date, and can control for both time-invariant and time-varying confounders by design [20].

The statistical analysis used a generalized additive model (GAM) to analyse the data. Because daily hospital visits typically followed an over-dispersed Poisson distribution, quasi-Poisson regression was used in the GAM [21]. Several covariates, including natural splines, were introduced to control for their potential confounding effects. First, a natural cubic regression smoothing function of calendar time with 7 degrees of freedom (df) per year excluded unmeasured long-term and seasonal trends longer than two months [20]. Second, a natural smooth functions of the mean temperature (6 df) and relative humidity (3 df) controlled for the nonlinear confounding effects of weather conditions [20]. Third, indicator variables were implemented for “day of the week” and public holidays. Briefly, the following log-linear GAM was fit to obtain the estimated pollution log-relative rate β in the selected city:

$$ logE\left({Y}_t\right)={\beta Z}_t+ DOW+ ns\left( time, df\right)+ ns\left( temperature, 6\right)+ ns\left( humidity, 3\right)+ intercept, $$

in which E(Yt) represents the expected number of respiratory disease outpatients at day t; β represents the log-related rate of respiratory diseases associated with a unit increase of air pollutants; Zt represents the pollutant concentrations at day t; DOW is a dummy variable for day of the week; And ns indicates the natural cubic regression smooth function [22].

After establishing the basic model, single-pollutant models were initially used and introduced, a priori, in turn each air pollutant concentration on the concurrent day (lag0). To verify the stability of the model, three sensitivity analyses were conducted. First, alternative df were selected with 4–10 per year for the smoothness of time trends. Second, given that the health effects of ambient air pollutants could last for multiple days, more single lag days were used (lag1, lag2, lag3, lag4, lag5, lag6, and lag7) and moving average exposure of multiple days (lag01, lag02, lag03, lag04, lag05, lag06, and lag07). Third, two-pollutant models were built to examine the stability of the effect estimates after adjustment for co-pollutants. Co-pollutants with a correlation coefficient < 0.7 would be added to the two-pollutant model.

Both the total respiratory outpatients with non-pathogenic disease and cause-specific respiratory outpatients were assessed. Because behaviour patterns and common diseases may be different in children of different ages, all of these outpatients were stratified by age (0–3, 4–6, and 7–14 years). Because both air pollution levels and the incidence of respiratory disease events are known to vary by season, the analysis was stratified by cool season (November to March), hot season (June to August) and transition season (April, May, September and October), and reduced the df per year to 3, 2, and 3 respectively. The statistical significance of the differences between the effect estimates of the strata of a potential effect modifier (e.g., the difference between age or season) was tested by calculating the 95% confidence interval (CI) as\( \left({\widehat{Q}}_1-{\widehat{Q}}_2\right)\pm 1.96\sqrt{{\left(\mathrm{S}{\widehat{\mathrm{E}}}_1\right)}^2+{\left(\mathrm{S}{\widehat{\mathrm{E}}}_2\right)}^2} \), in which \( {\widehat{Q}}_1 \) and \( {\widehat{Q}}_2 \) are the estimates for two categories, and \( \mathrm{S}{\widehat{\mathrm{E}}}_1 \) and \( \mathrm{S}{\widehat{\mathrm{E}}}_2 \) are their respective SEs [23]. Regardless of significance, modification of effect by a factor ≥ 2 was considered to be important and worthy of attention [23].

The statistical tests were two-sided, and effects with p < 0.05 were considered to be statistically significant. All statistical models were constructed using R software version 3.2.1 (R Foundation for Statistical Computing, Vienna, Austria) using the MGCV package. The effects are expressed as the percentage of change and 95% CI in daily hospital child respiratory outpatient visits per 10 μg/m3 increase in pollutant concentrations.

Results

Data description

A total of 3,541,692 total hospital outpatient records were retrieved for the period 2013 to 2015, from the Children’s Hospital of Hebei Province. A total of 1,400,199 records remained after deleting duplicate data, the records of cases residing outside of Shijiazhuang, and cases>14 years of age. After ICD-10 code re-matching, approximately 355,833 (25.4%) recodes were mismatched. Finally, 551,678 records of hospital outpatients caused by non-pathogen respiratory diseases were extracted. The percentages of total non-pathogen respiratory hospital outpatients according to age group were 72.2% for 0–3, 18.5% for 4–6 and 9.3% for 7–14 years of age, respectively. Acute upper respiratory infections (ICD-10 codes J00-J06) accounted for 37.4% of the total of non-pathogen respiratory diseases. Other acute lower respiratory infections (ICD-10 codes J20-J22) accounted for 36.7%, while pneumonia (ICD-10 codes J18) accounted for 12.1%. Other diseases of upper respiratory tract (ICD-10 codes J30-J39) accounted for 10.2%, chronic lower respiratory diseases (ICD-10 codes J40-J47) accounted for 3.0%, and other respiratory diseases (ICD-10 codes J60-J99) accounted for 0.7%.

During the study period, there were no missing value days for air pollutant measurements, meteorological variables, or health data. According to the results of the Shapiro-Wilk test, all of these data were skewed (i.e., non-normally distributed); therefore, median and quartile values were used to describe their distribution. Descriptive statistics from this study are summarized in Table 1. There was serious air pollution in Shijiazhuang, especially from PM2.5 and PM10, and on most days, these two pollutant concentrations exceeded the National Ambient Air Quality Standards (24 h average standards for PM2.5 is 75 μg/m3, PM10 is 150 μg/m3, SO2 is 150 μg/m3, NO2 is 80 μg/m3, and O3-8h is 160 μg/m3). The highest daily average concentrations were 10.3 and 5.6 times the limit values, respectively, confirming that the main air pollutants in the selected city are, in fact, PM2.5 and PM10. The minimal, mean, and maximal daily average temperature and relative humidity were − 7.7 °C, 14.5 °C, 34.7 °C and 11.5%, 57%, 98%, respectively, reflecting the warm temperate continental monsoon climate in Shijiazhuang.

Table 1 Summary statistics of daily air pollutants, weather conditions, and children hospital outpatients caused by respiratory diseases (N = 551,678) in Shijiazhuang from 2013 to 2015

As shown in Fig. 2, daily air pollution concentrations (except for O3) and total respiratory outpatients were highest in the cool season and lowest in the hot season. The interquartile range of PM2.5, PM10, SO2 and NO2 concentrations in the cool season (158, 272.7, 117.1 and 43.8, respectively) were significantly higher than in the hot season (65.2, 110.8, 22.6 and 19.6, respectively).

Fig. 2
figure 2

Box plots of five air pollutans in the cool, transition and warm season. Boxes indicate the interquartile range (25th percentile – 75th percentile); lines within boxes indicate medians; whiskers below boxes represent minimum values; whiskers and dots above boxes indicate maximum values

Generally, there were strong correlations among PM2.5, PM10, SO2 and NO2 pollutants with the Spearman correlation coefficients, ranging from 0.50 to 0.75. PM2.5, PM10, SO2 and NO2 concentrations were negatively or weakly correlated with temperature and relative humidity. Maximal 8 h mean O3 concentrarion was negatively correlated with PM2.5, PM10, SO2 and NO2 (Spearman correlation coefficients ranged from − 0.31 to − 0.50), weakly correlated with relative humidity, and strongly correlated with temperature (r = 0.82, p < 0.05).

In the whole-season analysis, SO2, NO2 and PM2.5 were significantly associated with increased total respiratory outpatient visits. An increase of 10 μg/m3 in two-day average concentrations of SO2, NO2 and PM2.5 corresponded to a 0.33% (95% CI: 0.10–0.56), 0.66% (95% CI: 0.30–1.03), and 0.13% (95% CI: 0.02–0.24) increase in total respiratory outpatient visits (Table 2). The associations between O3, PM10 and total respiratory outpatients were positive but non-significant. For cause-specific diseases, positive associations were observed except for correlations between O3 and chronic lower respiratory diseases (ICD-10 codes J40-J47) and other respiratory diseases (ICD-10 codes J60-J99).

Table 2 Percent change (95% CI) in children hospital outpatients caused by total and cause-specific respiratory diseases per 10 μg/m3 increase in concentrations of five air pollutants in Shijiazhuang, China, 2013–2015

Effects by season

The effect estimates of ambient air pollution on total respiratory outpatients showed significant differences among three seasons. Effect estimates of all five pollutants were significant in the transition season, and non-significant in both the cool and hot seasons, except for SO2 in the hot season. NO2, PM2.5, PM10 and O3 exhibited highest effects in the transition season. The magnitudes of SO2-associated increase were approximately 2 times higher in the hot season than in the transition season. Significant differences were observed for SO2 and O3 between the cool season and the hot, transition season, for PM10 in the transition season and the cool and warm season, for NO2 in the cool season and the transition season.

Effects by age

The percent increase in associations between air pollutants and total respiratory hospital outpatients varied by age group. For NO2, PM2.5 and PM10 pollutants, the older the child was, the higher the effect estimates. O3 had significant influence on children 4–6 years of age, then on those 7–14 years, and the smallest on those 0–3 years of age. There were no significant effects of air pollutants on children 0–3 years of age, except for SO2. Meanwhile, significant effects were observed in children 4–6 and 7–14 years of age, except for SO2 and O3, respectively. Three pollutants (NO2, PM10 and O3) present significant differences between the 0–3 years of age group and the 4–6/7–14 years group, while differences in the other two pollutants were non-significant among the three groups (Table 3).

Table 3 Percent change (95% CI) in children hospital outpatients caused by respiratory diseases per 10 μg/m3 increase in concentrations of five air pollutants stratified by season and age in Shijiazhuang, China, 2013–2015

Sensitivity results

The results of sensitivity analyses, adjusted for different df are shown in Fig. 3. The effect estimates remained stable. The results demonstrated that the acute effects of air pollution did not change substantially with the adjustment of smoothness of time using alternative df from 4 to 10 per year.

Fig. 3
figure 3

Percent increase of hospital outpatient visits with 10 μg/m3 increase of NO2, O3, PM10, PM2.5 and SO2 due to respiratory disease classified by degrees of freedom per year

The results from the single-lag day (lag0-lag7) and cumulative exposure models (lag01-lag07) for the percent increase in children respiratory outpatients per 10 μg/m3 increase in pollutants are shown in Fig. 4. Statistically significant results were observed at lag 0, 1 and 01–07 day for NO2. Lag 0 and 01 day for PM2.5. Lag 0 and 01–07 day for SO2. respectively. For all five pollutants, the effects on cumulative days were higher than single-lag days. According to previous studies, lag0 day or lag01 air pollution was most closely correlated with child hospital outpatient visits. Therefore, a two-day average (lag01) exposure model was used for modifying effects analyses.

Fig. 4
figure 4

Percent increase of hospital outpatient visits with 10 μg/m3 increase of NO2, O3, PM10, PM2.5 and SO2 due to respiratory disease in different lag days

The results of the two-pollutant models using exposure at lag 01 are provided in Table 4. The magnitudes of all five pollutants were stable. Effect estimates of NO2, SO2 and PM2.5 pollutants remained statistically significant when adjusting for co-pollutants.

Table 4 Percent change (mean and 95% confidence intervals) of daily total respiratory outpatients associated with10ug/m3 increase of pollutant concentrations in single and two-pollutant models

Discussion

Although the associations between ambient air pollution and daily hospital child outpatient visits have been well described in developed countries, studies in developing countries, especially in severely pollution haze Chinese cities, remain limited. The present study demonstrated that season and age may modify the health effects of air pollution in Shijiazhuang. Unlike other study results, the association between air pollution and daily children outpatient visits was generally more evident in the transition season than the hot or cool seasons.

The effect estimates of our results are lower than reported in previous studies [16, 24,25,26]. There are several potential reasons for this heterogeneity. First, the lower estimates may reflect the fact that Shijiazhuang’s air pollution was significantly more severe than in developed countries and other developing cities in China, which may reflect the shape of the concentration-response curves where there may be a flattening (saturation) at the higher end [27]. Second, the chemical components of PM pollution are very important to their effects on health, which may partially explain the reason for different effects among cities [21]. Third, the varying magnitude of misclassification of clinical diagnosis, as well as other factors such as statistical models and population characteristics, may explain the differences between our results and previous studies [28].

For the first time, the present study observed that the association between air pollution and daily child respiratory hospital outpatient visits in the transition season is significantly more sensitive than in hot or cool seasons. The concentrations of SO2, NO2, PM2.5 and PM10 were higher in the cool season, medium in the transition season, and lower in the hot season (Fig. 2). Associations between ambient air pollution and daily total non-pathogen respiratory outpatient visits were strongest during the transition season: the effect estimates were 2–6 times higher than in all seasons. The pattern of exposure to ambient air pollution in children may change from season to season [29]. Because of low temperatures, high air pollutant concentrations, and the use of central heating systems in the winter, residents generally stay indoors and close their windows. Similarly, high temperatures and the widespread use of air conditioning forces individuals to enter rooms and close windows. Thus, the exposure dose may be reduced in the cool or hot seasons. One study reported that the indoor /outdoor ratio of air pollutant concentration in Beijing (China) is 0.5 and 0.7 in the cool and hot seasons, respectively (data not shown). In contrast, the climate is more pleasant in the transition season; children’s outdoor activities and time with open windows in homes would be increased; therefore, exposure to ambient air pollution would likely be higher.

Previous studies have reported that the health effects of air pollution on infants and young children may be greater than in adults [30,31,32]. However, we found a very interesting phenomenon in our study: the effect estimate increases with age in children. Considering the differences in activity range and air pollution patterns among children of different age groups, our results may be easier to understand. Children 0–3 years of age need adult supervision and their activities are mainly indoor; consequently, their exposure to ambient air pollution is the least. Kindergarten (4–6 years) can offer a wide range of free activities, meanwhile the children are compliant and follow teachers’ recommendations to stay indoors when the air quality is inadequate. Children 7–14 years of age engage in the highest activities but have a weak awareness of self-protection, therefore, exposure dose may increase with age. Although the exact air pollution exposure dose to children of different ages remains unclear, this phenomenon provides new insights into research investigating the adverse health effects of air pollution on children, and warrants careful future study.

Another important finding from our study was that pollutant states may contribute to seasonal differences. Effect estimates of gaseous pollutants (SO2, NO2 and O3) with total respiratory outpatient visits in the hot season were higher than those in the cool season, while the PM pollutants (PM2.5 and PM10) had lower estimates in the hot season than that in the cool season. That may be attributed to the constituents of the complex mix of PM2.5 and PM10, which may vary by season. The exact PM2.5 compositional difference in different seasons is currently under investigation.

The varying magnitude of misclassifications of clinical diagnosis and ICD-10 codes may have introduced bias. To mitigate this bias, one ICD-10 code rematch and validate mechanism was introduced in our research. While these diagnoses made by physicians may not be accurate and completely consistent. Further studies are needed to validate these diagnoses.

Our research had limitations. First, we collected only three years’ of data for the association analysis between air pollution and children respiratory outpatient visits; the GAM model, therefore, may have some instability [33]. Second, as in many previous time-series studies, we used available ambient monitoring data to assess the children’s exposure to air pollutants. As a result, several issues may have arisen, given that ambient monitoring results differ from a child’s exposure level to air pollutants [33, 34]. Data for the assessment of weather conditions was retrieved entirely from one monitoring station. Measurement error may have substantial implications for interpreting epidemiological studies on air pollution. Third, we evaluated the association of five air pollutants with seven different hospital outpatient outcomes. In addition, moderate-to-high correlation between PM pollution and gaseous pollutants in a selected city limited our ability to separate the independent effect for each pollutant.

Conclusions

Our findings suggest that ambient air pollutants were associated with child respiratory outpatient visits, especially for pneumonia (ICD-10 code J18), other acute lower respiratory infections (ICD-10 code J20-J21), and other diseases of upper respiratory tract (ICD-10 code J30-J39). Furthermore, our results suggest that the effect estimates in the transition season were stronger than in cold or hot seasons, and that the estimates increase with age in children. To protect the health of children, local authorities should take more measures to control air pollutant emissions.