Introduction

Industrialization, technical growth, and overpopulation in urban areas of Kuwait have resulted in increased air pollution (Abdul-Wahab et al. 2000), and toxic air pollutants in close proximity to populated areas can have adverse health effects. Ambient ozone is the primary constituent of smog and is a pollutant of concern in industrialized countries, as it can lead to chronic respiratory infection and lung inflammation, aggravate asthma, impair lung defense mechanisms, and reduce the immunity of the human body (Tilton 1989). In the environment, ozone may result in acute foliar injuries, reduce agricultural yield and biomass production, and shift the competitive advantages of plant species in mixed populations (Lefohn et al. 1994). In this respect, surface ozone has become a serious problem in the urban areas of Kuwait, and if it occurs in sufficient concentration, it could threaten both human health and the environment (Jallad and Jallad, 2010).

Ground-level ozone is formed by chemical reactions between nitrogen oxides (NOx) and volatile organic compounds (VOC) in the presence of heat and sunlight. It is difficult to exactly define the formation and destruction mechanism of ozone. This is because ozone is an extremely reactive pollutant and can be scavenged by its precursors (Dimitriades 1989). As a result, the area of air pollution forecasting through empirical methods has gained importance with the availability of sufficient data. Earlier forecasting models were based on simple empirical data correlations, but the availability of a large amount of information has resulted in development of complex air pollution simulations for forecasting (Telenta et al. 1995).

Management of public warning strategies for ozone levels in densely populated areas requires accurate forecasts of ambient levels. Although ozone prediction models exist or have been proposed for several cities (Robeson and Steyn, 1990; Elsom 1996; Noordijk 1994; Yi and Prybutok 1996), they have not been assessed in realistic conditions. The purpose of this work is to explore the possibility of forecasting daily maximum ozone impacts in urban areas of Kuwait, where most measurements are taken at surface stations. The final goal is to produce a quantitative tool to help authorities monitor ozone pollution, which has become a public health issue in many cities worldwide.

Background

Kuwait is a principality state in the northeastern corner of the Arabian Peninsula with an estimated population of 3.5 million. It is a low lying country, with the highest point being 306 meters above sea-level. The annual rainfall across Kuwait varies from 75 to 150 millimeters, and the country has a desert climate—hot and dry (Federal Research Division 1993).

Kuwait is a major exporter of crude oil (ranked 4th in OPEC’s list), with plans to increase production to 3.2 million barrels per day in 2013 (Arab times 2012). Such characteristics make Kuwait a country with air pollution associated with petroleum, petrochemical, and other industrial pollutants (Arab times 2012; Organization of the Petroleum Exporting Countries 2009).

Recently, there has been an increase in air pollution in urban areas of Kuwait. This is the result of rapid industrialization, technical growth, and overpopulation. In the past, it has been observed that toxic air pollutants in close proximity to populated areas can have adverse health effects. Ozone is one such pollutant, and if it occurs in sufficient concentration, it can become a serious problem in urban areas of Kuwait. Jallad and Jallad (2010) observed that ozone pollution levels in the Salmiya residential district exceeded ambient air quality standards during specific times of the year. Therefore, accurate forecasting of surface ozone is required, as it can help with successful implementation of public warning strategies during episodic days in Kuwait.

Most of the air pollution studies carried out in Kuwait during the past two decades were aimed at air pollutant patterns, dispersion, and photochemical mechanisms. Similarly, studies that analyzed ozone levels were focused on comparing ozone levels with international standard limits, assessing health effects of ozone pollution, understanding diurnal behavior of ozone, and studying the seasonal trends in ozone levels (Ettouney et al. 2009a; Ettouney et al.2009b).

There have been very few studies focused on developing a robust forecasting system which can be used to develop a public warning system. Most of the forecasting systems that have been developed to predict the ambient ozone concentration in Kuwait use meteorological data and precursor concentrations (Ettouney et al. 2009a; Ettouney et al. 2009b; Ettouney et al. 2009c).

Abdul-Wahab et al. (1996) used stepwise multiple regression modeling to predict ozone levels from precursor concentrations and meteorological conditions during daylight hours in the Shuaiba Industrial Area (SIA) of Kuwait. Al-Alawi et al. (2008) applied principal component regression and artificial neural networks to predict ozone concentration in Kuwait’s lower atmosphere using the data on seven environmental pollutant concentrations (CH4, NMHC, CO, CO2, NO, NO2, and SO2) and five meteorological variables (wind speed, wind direction, air temperature, relative humidity, and solar radiation). In their study, Elkamel et al. (2001) used an artificial neural network model to predict ozone concentrations as a function of meteorological conditions and precursor concentrations in the SIA.

Similarly, Abdul-Wahab and Al-Alawi (2002) used neural networks for ozone modeling in the lower atmosphere as a function of meteorological conditions and various air quality parameters in the Khaldiya residential district of Kuwait.

These statistical models are based on semi-empirical statistical relations among available data and measurements. They do not necessarily reveal any relation between cause and effect. They simply attempt to determine the underlying relationship between sets of input data (predictors) and targets (predictands).

However, the complex and sometimes nonlinear relationships of multiple variables can make these statistical models awkward and complicated. Therefore, it is expected that they will underperform when used to model the relationship between ozone and the other variables that are extremely nonlinear. In addition, over the last decade, artificial intelligence (AI) based techniques have been proposed as alternatives to traditional statistical ones for forecasting urban air pollution. Two of the most reliable and feasible AI techniques that have been used for urban air pollution forecasting are neural networks and fuzzy logic. Neural networks are simple mathematical models representing a brain-like system. Although considered as one of the most popular AI methods, neural networks also have inherent drawbacks that impede their practical applications. Scalability, testing, verification, and integration of neural network models into forecasting system are some of the major concerns today. Sometimes, neural network systems become unstable when they are applied to bigger problems. Compared to neural networks, fuzzy logic offers better insights into the forecasting model, and they can form a multivalued logic to deal with reasoning that is approximate rather than precise (Karlaftis and Vlahogianni 2011).

Prediction of ozone levels using a theoretical method (i.e. detailed atmospheric diffusion model) is difficult and empirical analysis is required to develop a forecasting system. A well-evaluated ozone forecast model can raise the possibility of successful ozone control strategy. In addition, forecasting the daily maximum ozone concentrations can help avoid and reduce ozone-related injuries and damages. This research is significant, as it is the first study comparing the ozone forecasting systems across different locations in Kuwait.

Research methodology

Dataset

The study used hourly air pollutants data from January 2007 through September 2011 gathered by the Environmental Monitoring Information System of Kuwait (eMISK, working under the Environment Public Authority of Kuwait). The data was collected from three different locations in Kuwait (see Figure 1): Al-Jahra (Tower A), Al-Mansouriya (Tower B), and Al-Rigga (Tower C). The initial three years of data was used to train the forecasting systems, and data from the remaining years was divided into subgroups to validate the forecasting models. The validation data was divided into subsets based on the availability of the data and seasonal variation.

Figure 1
figure 1

The geographical location of air quality monitoring stations in Kuwait.

Geographical location and a history of high ozone episodes were considered in selecting target sites. Tower A is located in Al-Jahra, the capital of the Al-Jahra Governorate of Kuwait and the surrounding agriculturally based Al-Jahra District. Tower B is located in one of the suburbs of the Al-Asimah Governorate, which houses most of Kuwait’s financial and business center. Tower C is located in the Al-Ahmadi Governorate, which forms an important part of the Kuwaiti economy, as several of Kuwait’s oil refineries are located there.

Model structure

Seinfeld (1986) proposed the atmospheric diffusion equation, from which the theoretical model (Pryor and Steyn 1995; Jorquera, and Acuna 1998) used for this study was developed:

dC dt = V . C + . K . C + Q x , y , z , t R C k , T , t + L x , y , z , t
(1)

where C(x, y, z, t) is the concentration, V = (u, v, w) is the wind vector, K = diag(K x , K y , K z ) is the eddy diffusivity, Q is the emission rate, R is the net-generation term (balance of chemical production and destruction) and L stands for physical removal processes such as wet and dry deposition. The mathematical expression in equation (1) represents the mass conservation principle and is the most general equation obeyed by any atmospheric pollutant.

In the case of ozone, the mass conservation expression can be simplified, because the emission term is not there and the deposition terms can be ignored. The ozone concentration can be assumed to be uniform in the mixed layer, as long as we take a moving air mass under the conditions of a strong convection.

Under such conditions, the column of air satisfies the balance:

dC dt = R C k , T , t
(2)

Where {C k } stands for a detailed photochemical mechanism, which includes all the relevant species participating in ozone generation and destruction. These concentrations depend upon the spatial coordinates and time. Assessing the importance of VOC and NOx in ozone photochemistry is not easy, as there are no previously measured VOC mixing ratios for Kuwait. Thus, to take a simpler approach, a stochastic model is constructed from equation (2). Integrating equation (2) from early morning rush hour up to the time when the ozone concentration is highest leads to the following equation:

O 3 max O 3 min = t o t f R C k , T , t dt
(3)

Temperature is steadily increasing during this time interval (Figure 2), so using q = dT/dt, the above expression can be written in temperature terms as:

O 3 max O 3 min = T min T max R C k , T dT q
(4)
Figure 2
figure 2

Evolution of typical photochemical pollution at different locations in Kuwait.

Taking the difference between two consecutive days results in:

O 3 , t + 1 max O 3 , t max = O 3 , y + 1 min O 3 , t min = T max , t T max , t + 1 R C k , T dT q
(5)

Using the mean value theorem for integrals on the right hand side of equation (5) and dropping the “max” superscript for daily maximum ozone and temperature levels, the theoretical model reduces to:

O 3 , t + 1 = α O 3 , t + β T t + 1 + γ T t + δ
(6)

Here, parameter α is positive and of order one, β and γ are of about the same magnitude (but differ in sign), and δ has a small value. The planetary boundary layer is well mixed, and the surface measurements are representative of the whole convective mixed layer, when the daily maximum ozone concentration is reached. Furthermore, the temperature recorded at the time when ozone is highest is strongly correlated with the highest temperature recorded that day, so in passing from equations (5) to (6) the interpretation of Tmax is accordingly modified. Robeson and Steyn (1990) developed an equation similar to equation (6) and used variance analysis to find the best model fit among several slight variations of the equation.

Time series forecasting modeling

Presently three types of stochastic models to forecast air pollutants have been developed using the time-series approach (McCollister and Wilson 1975):

  1. (a)

    Models based on time series of a single pollutant: AR, ARMA, ARIMA models.

  2. (b)

    Multivariable time series, where meteorological explanatory variables have been added on: ARX, ARMAX models.

  3. (c)

    Forecasting models based on a combination of classical and non-parametric methods.

The above citations make it clear that when the aim is on air pollution forecasting, any forecast model ought to be better than the pure persistence forecast (i.e. forecasting for tomorrow what occurred today).

Thus, in order to have an accurate forecast, some persistence and some exogenous, meteorological variables need to be included. Of all the meteorological variables, air temperature has the strongest correlation with ozone concentrations for two reasons: (1) high air temperatures are an excellent indication of environmental conditions conducive to O3 production and accumulation (i.e. anti-cyclonic conditions with associated clear skies and light winds), and (2) the rate ‘constants’ of photochemical reactions are highly temperature dependent (Pryor and Steyn 1995). As a result, air temperature is a reasonable surrogate for the combined effects of wind speed, wind direction, inversion height, and photochemical reaction rate. As discussed earlier, the simple linear equation (6), which considers only surface air temperature as the exogenous variable, will be used. In all cases the actual maximum temperature (Tt+1) was used in place of the forecast value (that is, an ex post parameter fit was performed). The errors associated with forecasting Tt+1 will be assessed ex ante by simulation. Equation (6) has an ARX (1 2 0) structure (Ljung 1987), and its parameters were estimated using the ARX procedure available in MATLAB’s System Identification Toolbox (Ljung 1991).

Fuzzy modeling

A fuzzy model is a collection of rules, derived from the original data, which are combined to produce a single output for a given input. A multiple input, single output (MISO) model structure is given by the following set of rules:

R i : if X is A i then y is B i i = 1 , , M
(7)

Where X = (X1, X2, …, Xp) is a given observation, A i = A 1 i , A 2 i , , A p i is a premise of the fuzzy model; the points Ai are a basis for the space of the input variables. The consequences of the model are denoted by Bi; they represent an intensity level associated with the dependent variable y and are the basis of the output space. Equation (7) states that, if a given observation X (for instance, meteorological and air quality measurements) can be associated with a known pattern Ai (for instance, some class of meteorological condition), then a rule specific to Ai will give an estimate of the associated output y (for example, ground-level ozone concentration). By using a fuzzy clustering classification process, the set of points Ai can be discriminated and then a fuzzy model constructed. A simple choice was proposed by Takagi and Sugeno (1985); they suggested linear rules of the form:

R i : if X is A i then Y i = C o i + j = 1 j = p c j i x j . i = 1 , , M
(8)

Then the parameters {ci} can be estimated using ordinary least squares, by minimizing the difference between the observed output and the output of the fuzzy model, given by:

Y = i = 1 M w i Y i i = 1 M w i
(9)

In order to develop fuzzy models, using fuzzy clustering techniques for parameter identification, the model structure given by equation (6) was used. In fuzzy modeling language, the sets (O3, t; Tt+1; Tt) and (O3, t+1) correspond to the premises and consequences, respectively. The fuzzy C-means algorithm (Rousseeuw et al. 1996; Johanyak, and Kovacs 2011; Sugeno and Yasukawa 1993) was applied to affect the partition of the original data set into M fuzzy clusters; the parameters were estimated using MATLAB’s Fuzzy Logic Toolbox.

Model performance

The performance of the two forecasting models was evaluated using two statistical indices: the root-mean-square error (RMSE) and the index of agreement (IA), defined as:

RMSE = 1 N i = 1 N o i p i 2 IA = 1 i = 1 N o i p i 2 i = 1 N o i + p i 2
(10)

Here oi and pi are observed and forecasted ozone maximum values on day i, N is the number of days in the test set, p i = p i o m and o i = o i o m , with om, the average observed ozone maximum. The index of agreement is a dimensionless index bound between 0 (showing no agreement at all) and 1 (showing perfect agreement of the time series).

Results and discussions

Evolution of photochemical pollution

The evolution of typical photochemical pollution at different locations in Kuwait is summarized in Figure 2. It was observed that the ozone levels at different locations started rising in the morning (around 7:00 AM), reached maximum levels during the afternoon (11:00 AM – 4:00 PM), and then started falling again in the evening. Similarly, the monthly ozone levels at different locations in Kuwait are summarized in Figure 3 through Figure 4. It was observed that during summers (June to August), the average ozone levels are higher compared to the average ozone levels during the rest of the year. Thus, it can be concluded that there is a strong association between the ambient temperature and the surface ozone levels.

Figure 3
figure 3

Monthly ozone levels in the Al-Jahra area.

Figure 4
figure 4

Monthly ozone levels in the Al-Rigga area.

Empirical findings (time series modeling)

The results in Table 1 shows the parameter estimates for the time series model—presented in equation (6)—developed for the period 2007 through 2009. There is a great amount of fluctuation in the daily maximum temperature during the year; therefore, the data subsets (based on seasonal duration and missing data points) were used individually in the model development process. There are four data subsets used for each of the locations covered under this study. An additional data subset for model validation was also used. It can be seen from the parameter estimates in Table 2 that the fitted values support the model structure derived in equation (6). Additionally, it can be seen that the fitted parameters associated with each location show great amounts of variation during different time periods.

Table 1 Parameter estimates in equation ( 6 ) using time series analysis
Table 2 Parameter estimates in model (4) using fuzzy model

Empirical findings (fuzzy modeling)

The parameters estimated for Towers A, B, and C using fuzzy model are summarized in Table 2. For the fuzzy model, a different number of rules were found for each of the data subsets. Each of these rules corresponds to a linear model valid for a specific cluster obtained from the original data by means of a fuzzy classification. The parameters of these rules are similar to the ones estimated for the time-series models. The predicted output results from a nonlinear combination of these rules.

Comparison of models

The results in Table 3 compare the forecasting for the three towers: Al-Jahra, Al-Mansouriya, and Al-Rigga, using the pure persistence model, the linear time series model, and the fuzzy model. Clearly, the root mean square error shows that the proposed fuzzy forecasting model and time series forecasting model are a significant improvement over the pure persistence forecast (O3,t+1 = O3,t), for all data subsets. Regarding the observed episodes, the three models achieve a high percentage of correct forecasts, between 66 and 95%. As shown in Figures 3 and 5, the overall quality of the forecast results supports the model structure proposed in equation (6) and the most relevant input variables have been identified. The most significant performance difference among these three forecasts occurs on the number of false positives; fuzzy models are consistently lower, and therefore afford a more reliable forecast. It is worth noticing that the fuzzy forecast performs better in this sense, so it is less sensitive to changes in the parameter values. Nonetheless, all models ought to be calibrated once a year to take into account trends in ozone ground levels and to keep the forecast accurate.

Table 3 Statistical comparison of forecasting models
Figure 5
figure 5

Monthly ozone levels in the Al-Mansouriya area.

Conclusion

The structure proposed in equation (6) was tested and validated by comparing the outcomes of two different modeling schemes, namely linear time series modeling and fuzzy modeling. These two forecasting systems showed a significant improvement over the pure persistence forecast. The impact of temperature on daily maximum ozone levels was confirmed by deducing the model from basic principles.

Furthermore, the accuracy of the results and the performance of the two models suggest that temperature explains a large proportion of variance in the daily maximum ozone levels. However, the accuracy of the models in forecasting very high ozone concentrations is very low. Ad hoc corrections may improve local forecasts, but such models will likely be site specific. Among the different forecasting systems, the fuzzy system showed better performance in terms of lower numbers of false positives for all validation datasets. Thus, fuzzy modeling can be used to develop an effective environmental warning system (EWS) in Kuwait.