1 Introduction

The Metropolitan Area of São Paulo (MASP) is one of the largest urban agglomerations in South America and is considered to be the most important socioeconomic region in the southeast of Brazil. The area of the MASP extends over 7945 km2, and its estimated population of 21.5 million citizens is distributed over 39 conurbated municipalities, including São Paulo City, which is the most densely built and populated city (over 8000 inhabitants km2) in the country (EMPLASA 2017).

The MASP lies close to the Tropic of Capricorn (23° 33′ S, 46° 39′ W), 80 km from the Atlantic Ocean, on a plateau at an altitude between 700 and 1000 m above mean sea level (MSL). Its climate is classified as humid subtropical (Cwa according to the Köppen scale) and characterized as having humid summer and dry winter seasons (Peel et al. 2007). The local weather is strongly influenced by sea and land breezes, by topographic features, and by interactions with urban heat island (UHI) effects (Vemado et al. 2016).

A meteorological phenomenon that is especially relevant for cities is fog, as incidences of fog directly affect ground, air, and sea transport by impairing the visibility and thus raising the likelihood of accidents (Vasconcellos et al. 2018). Fog is a meteorological phenomenon defined by conditions of horizontal visibility below 1 km on the ground as caused by hydrometeors (WMO 2017). Fog, which is a cloud with ground contact, typically has a liquid water content of 0.1–0.2 g m−3 and droplet sizes around 10-μm diameters. In comparison to elevated clouds, fog is usually much more concentrated with respect to air pollutants, because particulate and gaseous emissions from the surface have a more direct impact on fog chemistry than on the chemistry of higher clouds (Seinfeld and Pandis 2006).

Moreover, fog droplets interact directly with the incoming solar radiation, on the one hand through reflection, which increases the albedo, and on the other hand through absorption, which results in heating and increased latent heat fluxes within the fog layer (Boucher et al. 2013). Therefore, fog impacts the local climate. For example, Hartmann et al. (2013) state that the decline in the frequency of fog events in European cities has led to an increase in direct solar radiation at the ground and may, thus, be partially responsible for increased local warming effects.

In the time period between 1933 and 2008, the frequency of fog in the MASP was observed to decrease (Gonçalves et al. 2008). Thus, the goal of the current study is to prove whether the observed trend continued from 2008 through 2018, and, more importantly, determine what caused the observed overall trend. Gonçalves et al. (2008) suggested that regional warming led to the observed decrease of fog. Similarly, Da Rocha et al. (2015) also argue that regional processes account for the occurrence of fog in the MASP, but they suspect radiative rather than advective processes. However, other studies have suggested that both global warming and an increase in air quality have been driving observed decreases in fog at various sites worldwide, including the MASP (Vautard et al. 2009; Klemm and Lin 2016). Therefore, for the MASP, we will analyze the trends of fog in conjunction with the trends of meteorological and air quality parameters.

The main source of air pollutants in the MASP is street-traffic tailpipe emissions, caused by the city’s over 7 million vehicles. Other sources are aviation, wood burning emissions from restaurants, construction and demolition, recovery and maintenance of outdoor road surfaces, domestic waste burning, as well as advection of pollutants from distant industrial activities, from biomass burning in peripheral urban areas, and from distant fires in the São Paulo State, Central Brazil, and the Amazon rainforest (Kumar et al. 2016). The Environmental Agency of São Paulo (Companhia Ambiental do Estado de São Paulo - CETESB) has monitored the MASP’s air quality and traffic emissions since 1998, and in the last decade, several studies have concluded that although the vehicle fleet has increased by more than 100%, most vehicular emission levels have decreased due to a series of implemented laws and regulations for tailpipe emissions and fuel blends (CETESB 2016; Silveira et al. 2015).

We will apply statistical modeling to develop an overview of whether fog formation is influenced by the air pollutants SO2, NOx, and PM10 (acting as potential cloud condensation nuclei) as well as by the daily minimum temperature (Tmin), relative humidity (rH), and atmospheric pressure (aP). We make use of air quality data retrieved by CETESB and meteorological data provided from the Institute for Astronomy and Geophysics from the University of São Paulo (IAG/USP).

2 Material and Methods

We employed two different datasets covering two different research periods. Dataset_1 covers the 86-year period from 1933 until 2018. It contains the annual mean air temperature (AMT) and the annual fog occurrence (AFO). Both the air temperature and the fog occurrence were continuously measured, and the respective data were extracted from the database of the IAG/USP.

The second dataset (Dataset_2) covers the 21-year period from 1998 through 2018. The period for Dataset_2 is much shorter than that of Dataset_1, but it contains, in addition to AMT and AFO, air quality data (SO2, NOx, and PM10) as routinely monitored by CETESB and further meteorological data (rH, aP) as provided by the IAG/USP.

2.1 Meteorological Data

The meteorological data (air temperature, relative humidity, atmospheric pressure, fog occurrence) were measured by the IAG/USP at their site on the Água Funda Park, 800 m above MSL and at 23°39′ 05.2″ S, 46° 37′ 23.4″ W. The area itself (almost 750 ha) and its surroundings, especially the vegetation, did not change much since the 1930’s. Dataset_1 was used to compare trends of AMT with trends of AFO in order to provide insight into the development of the two parameters over a very long time period. A similar study by Gonçalves et al. (2008) on the same dataset, though for a shorter period, documented an increase of AMT and a synchronous decrease of AFO.

For the analysis of Dataset_2, we also employed the dew point temperature to calculate the relative humidity. Overall, the daily minimum air temperature (Tmin), the daily mean relative humidity (rH), and the daily mean atmospheric pressure (aP) were used to analyze potential correlations with fog occurrence and air quality parameters. Daily data on fog occurrence (FO) was available as a binary-coded variable. The variable describes whether, on a given day, a fog occurrence was sighted or not (yes = 1; no = 0).

2.2 Air Quality Data

Data on air quality from 1998 until 2018 was downloaded directly from the São Paulo State Government’s (CETESB) website and added to Dataset_2. The dataset is quality checked according to the new state air quality standards established by State Decree No. 59.113 from 2013-04-23 (CETESB 2016). Based on hourly data, we calculated the daily medians of the concentrations of SO2, NOx, and PM10, as measured by 26 automatically monitoring stations in the MASP (Table 1). These stations are scattered through the most densely populated areas of the MASP, strategically installed in areas of heavy traffic and close to public places such as schools, hospitals, and parks, respectively (Fig. 1). We trust that the median of these stations is the best estimate of the overall air pollution in the MASP at a given time. To use either one single station or a group of stations would introduce issues such as missing data, gap filling, and potential over-representation of specific station types over others.

Fig. 1
figure 1

Section of Southern Brazil with the metropolitan area of São Paulo (MASP, bold white boundary line). CETESB stations are indicated with their numbers (Table 1). The blue dot with “M” shows the IAG/USP monitoring site. Sources: Esri, DigitalGlobe, GeoEye, Earthstar geographics, CNES/Airbus DS, USDA, USGS, AeroGRID, IGN, IBGE, CETESB, DataGeo and the GID User Community

Table 1 CETESB stations in the metropolitan area of São Paulo, monitored pollutants, and launch dates

The monitoring of the parameters started mostly in 1998, and there is no inconsistency in the dataset. Specifically, there is no gap in the air pollutants’ time series, because there was no malfunction of all stations at the same time and the median could be computed for each time step. The PM2.5 concentration was not used in this analysis because there are very large gaps (up to over 5 years) in the data set.

2.3 Data Analysis

Analysis of Dataset_1 is descriptive in nature, as we investigated temporal trends of AMT and AFO from 1933 through 2018 over the MASP. One aim of our study was to determine whether the corresponding trends of the two parameters continued or changed since 2008.

The analysis of Dataset_2 was focused on studying the importance of meteorological and air quality parameters as drivers of fog formation in the MASP. We employed binary logistic regressions, in which the emergence of a dependent, binary variable is linked with several predictor variables. The dichotomous parameter FO was defined as the dependent variable. The meteorological parameters (Tmin, rH, aP) along with the air quality parameters (SO2, NOx, PM10) were metrically scaled and processed as centered, standardized, independent variables acting as predictors in our model. We employed the “Enter” mode as our analysis method, meaning that all independent variables are analyzed equally, without discarding variables that exceed the significance level (p = 0.05).

2.4 Logistic Regression

The equation of the multivariate binary logistic regression is generally defined as follows:

$$ P(Y)=\frac{e^{B_0+{B}_1\ast {X}_1+{B}_2\ast {X}_2+{B}_n\ast {X}_n}}{1+{e}^{B_0+{B}_1\ast {X}_1+{B}_2\ast {X}_2+{B}_n\ast {X}_n}} $$
(1)

with P being the probability with values between 0 and 1. A statistical model can be formulated using Eq. 1, the predictors Xn (first column in Table 2), and the variables Bn (second column in Table 2) to predict the probability of fog occurrence. For the statistical significance (p value) of the predictors (column “Sig.” in Table 2), the critical value was defined as p ≥ 0.05.

Table 2 Result of the logistic regression

3 Results and Discussion

3.1 Results Dataset_1 (1933–2018)

Figure 2 shows the time series of the annual mean air temperature (AMT) and the annual fog occurrence (AFO) in the period from 1933 through 2018. It clearly depicts an increase of the AMT and a decrease of the AFO over this period. The lowest AMT (16.8 °C) was registered in 1933. Since then, a rather homogeneous increase of AMT was observed. Exceptionally high temperatures were recorded in 2002 (20.4 °C) and in 2015 (20.3 °C). The linear trend of AMT over the entire time period is dAMT/dt = 0.0356 °C year−1; the coefficient of determination is R2 = 0.794.

Fig. 2
figure 2

Annual mean temperature (black) and annual fog occurrence (blue) in the MASP from 1933 through 2018. Dotted lines show linear trends

The AFO time series are much less homogeneous than that of AMT. For example, there is a period of rather high AFO (> 158) from 1973 through 1979 with its local maximum of 207 in 1978. Nevertheless, an overall decrease of AFO is evident since its first registration in 1933. The presumption of a linear trend for the entire timeline yields a yearly decrease of dAFO/dt = −1.159 year−1 (R2 = 0.4356). The 95% confidence interval of the slope dAFO/dt is between − 1.446 year−1 and − 0.873 year−1, the overall decrease of fog occurrence thus convincing.

The results of Dataset_1 exhibit a continuation of the trends previously described by Gonçalves et al. (2008). While the AMT continues its upward trend, the AFO continues to drop. The reasons for both trends in Dataset_1 are most likely related to global climate change and regional land use change, especially deforestation and urbanization. The almost tenfold growth of the population since the 1930s (Brazilian Institute of Geography and Statistics (IBGE) 2011) was accompanied by an increase in impervious surface area in the MASP (Freitas et al. 2007). This led to a change of the runoff regime and to a change of the meso- and microclimates in the MASP, specifically to the formation of urban heat islands (UHI) (Vemado et al. 2016). In addition, anomalies in the sea surface temperature (SST) of the Atlantic Ocean have been mentioned as potential drivers of the increase of AMT and the decrease of AFO (Gonçalves et al. 2008). Given the positive trend of the AMT in the MASP, it will likely continue to increase, and AFO trends will continue to decrease.

Yet, long-term countermeasures have been proposed. For example, planting suitable trees in the streets may lead to ground shading, cultivating green areas will enhance evapotranspiration, and installing solar panels on facades will help absorb radiation to convert it into electric energy (Johansson et al. 2013). In general, it appears that that the local governments have set their focus on climate change mitigation measures. The municipality of São Paulo was the first city of Brazil to establish a “Climate Law” to comply with the goals of the UNFCCC (Prefeitura da Cidade de São Paulo 2012). This law was sharpened in 2018, setting measures to reduce greenhouse gas emissions from the public transport sector as well as from public waste management (Prefeitura da Cidade de São Paulo 2018). In the third quarter of 2019, the council of São Paulo ratified the “Plan of Climatic Action”, the main strategy of which is to operate the city in line with the commitments of in the Paris Agreement (Prefeitura da Cidade de São Paulo 2020). Considering the socioeconomic influence of the municipality of São Paulo in the MASP, surrounding municipalities will likely follow its example.

3.2 Results Dataset_2 (1998–2018)

Figure 3 shows the annual median concentrations of the air pollutants PM10, NOx, and SO2 between 1998 and 2018 in the MASP. The median concentrations exhibit a pronounced decreasing trend. The highest median of SO2 occurred in 2000 at about 42 μg m−3, while the lowest (5.2 μg m−3) was recorded from 2015 through 2018; the negative linear trend of SO2 is dSO2/dt = −1.980 μg m−3 year−1 (R2 = 0.8927).

Fig. 3
figure 3

Annual median concentrations of PM10, NOx, and SO2 in the MASP for the period 1998–2018

Regarding NOx, its highest median concentration (59 μg m−3) was registered in 2001 and its lowest concentration (19 μg m−3) in 2018; the negative linear trend is dNOx/dt = −1.812 μg m−3 year−1 (R2 = 0.856). The highest median concentration of PM10 (43 μg m−3) was recorded in 1998, whereas the lowest concentration (22 μg m−3) occurred in 2017; its negative linear trend is dPM10/dt = −0.957 μg m−3 year−1 (R2 = 0.808). Sulfur dioxide shows the strongest negative trend, followed by the trends of NOx and PM10.

According to CETESB, since industrial growth was limited the main source of atmospheric pollutants in the MASP has been vehicle emissions (78% of NOx, 43% of SOX, and 40% of PM) (Andrade et al. 2015; Kumar et al. 2016; Nogueira et al. 2015; Vara-Vela et al. 2016). Although the vehicular fleet and the fuels in use have increased by more than 100% within the last decade, and although it is expected to further grow by about 6% year−1, the air quality has improved following the implementation of a series of emission control measures enforced by the local and federal governments since the 1980s (Carvalho et al. 2015; Pérez-Martínez et al. 2015; Silveira et al. 2015). The Program for the Control of Vehicular Emission (PROCONVE-L) was introduced in 1986 for light-duty vehicles (LDV). Further on, PROCONVE-P was enacted in 1990 for heavy-duty vehicles (HDV). After the first phase of PROCONVE-L (from 1988 through 1991), a total emission reduction of about 70% was realized, even though the emission standards for HDV only became legally effective in 1994. In 2002, a new law (Resolution CONAMA 315/2002 from the Brazilian Environmental Council) established more strict tailpipe emission standards (Lucon and Goldemberg 2010), followed by PROMOT in 2003, which was another important piece of legislation with the objectives of controlling motorcycles’ emissions and complementing the PROCONVE program (MMA 2013). Those laws and other conditions in Brazil stimulated vehicle manufacturers and the Brazilian market to introduce advanced, cleaner vehicles and fuels, which led to a considerable improvement of air quality (Noronha et al. 2016).

3.2.1 SO2

The atmospheric concentration of SO2 decreased mainly due to a reduction of the sulfur content of vehicular and industrial fuels (Silveira et al. 2015). Regulations to enforce the reduction of sulfur content in fuels were first implemented in the late 1970s, first affecting industrial sites and later on also vehicle emissions (Kumar et al. 2016).

The sulfur content in diesel fuel is a special case in Brazil. Before the introduction of the CONAMA 315-Resolution in 2002, diesel fuel blends contained between 500 and 2000 mg kg−1 of sulfur. When the resolution came into effect in 2003, the national oil company Petrobras, which is the main producer of fossil fuels in the country, did not immediately supply the market with the mandatory S50-diesel fuel (= diesel with 50 mg kg−1 of sulfur). They argued that they only had to follow a prior regulation by the Agência Nacional do Petróleo (ANP), the Brazilian National Oil Agency. Subsequently, the Federal State of São Paulo filed a lawsuit against Petrobras and ANP and eventually enforced them to provide S50-diesel to the Brazilian market starting in 2003 (Lucon and Goldemberg 2010). This process could explain the abrupt decrease of the SO2 concentration in air after 2004 (Fig. 3).

However, as HDVs remained a major source of SO2 and other pollutants, the local government took further action in 2008 and prohibited HDV traffic in the central sections of the MASP (Kumar et al. 2016). As the outcome was, again, considered to be unsatisfactory, a new phase of PROCONVE was implemented, so that starting in 2012, S10-diesel has to be provided to the market (Lucon and Goldemberg 2010).

3.2.2 NOx

The NOx-emissions also decreased due to the control policies described above and due to improvements of vehicle technology. In the first phase of PROCONVE-L (L-1), between the years 1988 and 1991, technological innovations (i.e., the introduction of three-way catalytic converters) led to the reduction of exhaust gases, which diminished NOx emissions. Later, the fourth phase of PROCONVE-L (L-4, from 2005 till 2008) focused on reducing ozone precursor compounds, hydrocarbons and NOx through new technologies in vehicles (i.e., optimized combustion chamber, electronic and increased pressure fuel injection). In the fifth phase (L-5 from 2009 through 2013), the emission policies were further sharpened. As a result, the NOx emissions were reduced to 48% and 42% for LDVs and HDVs, respectively. However, regarding vehicular emissions from HDVs, PROCONVE-P aided by the reduction of NOx emissions especially in its fifth and sixth phases (P-5, from 2003 to 2008 and P-6, from 2009 to 2011) (Nogueira et al. 2015).

3.2.3 PM10

Important contributors to the PM10 mass concentration are primary, natural aerosol particles such as dust and sea salt. Further, over 40% of the respective particle mass is secondary material which is formed in atmospheric processes from precursor gases. These gases are primarily hydrocarbons that originate largely from vehicle emissions (Vara-Vela et al. 2016). Another traffic-related source is not related to engine exhaust: Resuspension of material from dry roads and abrasion of brakes, tires, and clutches introduces nonexhaust traffic-related particles into the ambient air that contribute to the PM10 concentration (Grigoratos and Martini 2014). The respective emissions are estimated to be of the same magnitude as that of tailpipe emissions. In addition, human food preparation also contributes to the generation of PM10. The MASP hosts a large number of restaurants (mostly pizzerias operating on wood stoves or steakhouses using charcoal ovens), the emissions of which are not under any emission controls (Kumar et al. 2016). A similar situation applies to home cooking and machines used for construction. Although a few respective case studies do exist, the role of anthropogenic emissions, i.e., cooking, construction and airport emissions, to the PM10 concentration and other pollutant levels in the MASP should not be underestimated.

3.2.4 Minimum Temperature, Relative Humidity, and Atmospheric Pressure

Figure 4 shows the annual medians of the daily minimum temperature (Tmin) and the annual mean of the relative humidity (rH) in the respective period. The lowest annual median of Tmin (15.2 °C) was registered in the year 1999, while the maximum (17.8 °C) occurred in 2015. Assuming a linear trend of Tmin since 1998, it’s slope is positive (dTmin/dt = 0.0084 °C year−1, R2 = 0,0069). The lowest annual median (71%) of rH was in the year 2007, the highest (85%) was recorded in 1999. Since 1998, rH decreases on average. Accepting a linear negative trend for rH, its slope is drH/dt = −0.1395% year−1, R2 = 0.054. A negative trend of aP (not shown in detail) has also been detected (daP/dt = − 0.2331 hPa year−1, R2 = 0.2775). The annual medians do exhibit large fluctuations over the years, and they do not correlate with the decrease of AFO.

Fig. 4
figure 4

Annual medians of daily minimum temperature (red line) and daily mean relative humidity (purple line) in the MASP for the period 1998–2018

With respect to our findings from descriptive statistics of Dataset_2, we summarize that the concentrations of PM10, NOx, and SO2 decreased between 1998 and 2018, while Tmin increased. Note that the relative humidityalso decreased. To study these results and their potential impact on AFO in more detail, we performed the statistical analysis of Dataset_2 via logistic regression.

3.3 Results of the Logistic Regression

We performed the logistic regression on SPSS and chose the “Enter” method for further analysis. First, the significance of our regression model is calculated by the Omnibus Test of Model Coefficients. It is equivalent to the chi-square test and provides a statement about the explanatory power of the model. The model is, according to the chi-square test (chi-square = 747.31; degrees of freedom = 6; significance is p = 0.000) significant and, therefore, useful for further analysis. The goodness of fit of the model was calculated on the basis of Nagelkerke’s pseudo R2. If a full model predicts the outcome perfectly, then the Nagelkerke’s pseudo R2 = 1 (UCLA - Statistical Consulting Group n.d., last access: 2019-11-23). In our model R2Nagelkerke = 0.164, which indicates that the model explains only 16.4% of the observed outcome. Further, the model’s relevance is estimated using the effect size f according to Cohen (1992):

$$ f=\sqrt{\frac{R_{\kern0.5em \mathrm{Nagelkerke}}^2}{1-{R}_{\kern0.5em \mathrm{Nagelkerke}}^2}} $$
(2)

Corresponding to Cohen’s classification system, our model has a strong effect (f = 0.44) (Cohen 2009).

Table 2 shows the results of the binary logistic regression. Here, the predictors (Xn) are in the first column, B’s are the generated coefficients for the predictors and for the constants in the model (Eq. 1), S.E. values in the third column are the standard errors around the coefficients (B), and Sig. in the fourth column is the statistical significance of each coefficient.

The Sig. values of the independent variables, Tmin, rH, and PM10 area equal zero (p = 0), meaning that these parameters are significant predictors of fog occurrence (FO = 1). For SO2, the significance of 0.003 does not exceed the critical value (defined as p = 0.05), meaning that its influence is considered as significant as well, but at a lower level compared to that of the former variables.

The significance levels of aP (p = 0.483 > 0.05) and NOx (p = 0.678 > 0.05) exceed the critical value. Their influence on fog formation is very likely of random character. Thus, they are not implemented into the regression Eq. 3 (Bittmann 2015; Zürich 2018). Substituting the Bn values and the Xn values in Eq. 1 with their respective coefficients and significant predictors (Table 2), we arrive at the following equation:

$$ P\ \left(\mathrm{Fog}=1\right)=\frac{{\mathrm{e}}^{-1.628-0.382\ast {\mathrm{T}}_{\mathrm{min}}+0.587\ast \mathrm{rH}+0.828\ast {\mathrm{PM}}_{10}-0.117\ast {\mathrm{SO}}_2}}{1+{\mathrm{e}}^{-1.628-0.382\ast {\mathrm{T}}_{\mathrm{min}}+0.587\ast \mathrm{rH}+0.828\ast {\mathrm{PM}}_{10}-0.117\ast {\mathrm{SO}}_2}} $$
(3)

The logistic regression model was operated with daily data. Most of the model results (Eq. 3) are conceivable with general meteorological and climatological reasoning. For example, it is obvious that a day with high air humidity rH is more likely to be (or become) a foggy day than a drier day. Correspondingly, the driver Tmin exhibits a negative coefficient in Eq. 3, meaning that a relatively low daily minimum air temperature fosters the formation of fog. The roles of these meteorological variables as drivers of fog formation is unequivocal, fog cannot form in an air mass that is warm and dry.

The roles of the air pollution parameters are less straightforward when analyzing their influence on fog on a daily basis than it was for the annual averages (section 3.1). For example, a high concentration of SO2 leads to the formation of large amounts of cloud condensation nuclei (CCN) after gas-phase oxidation of SO2 to H2SO4. If rH raises, these CCN facilitate the formation of a large number of small fog droplets that effectively scatter light and lead to a strong reduction of the horizontal visibility. In that sense, high SO2 should be correlated with high FO. However, Eq. 3 shows a contradictory result, i.e., a negative regression between SO2 and FO (B = − 0.117; p = 0.003). There are two possible reasonings to explain these findings. First, once fog is formed in conjunction with SO2 as described above, the liquid phase is a further effective sink for SO2. It will be completely scavenged from the gas phase and further oxidized to H2SO4. The remaining very small concentrations of gaseous SO2 in the gas phase cannot be detected any more with conventional analyzers, so that the correlation between SO2 and FO is disguised. Second, the data base of our analysis is, both spatially and temporally, not perfectly synchronous. While the FO data are collected at one single location, the SO2 data are averages over the entire MASP. Further, fog may have been detected on a day for only a short time period, while the SO2 data are averaged over 24-h periods. Such disalignment may lead to statistical misinterpretation.

The PM10 concentration shows a positive regression with FO (Eq. 3). This result is reasonable because PM10 can be regarded as a proxy for CCN. Although this last statement should be made cautiously in the context of fog formation, because it is not the mass concentration of aerosol particles that defines their efficacy to act as CCN rather than their number concentration, hygroscopicity, and sizes, the effect of PM10 on FO is, statistically speaking, rather strong, even on a daily basis (coefficient B = 0.828).

The role of NOx is somewhat between that of SO2 and PM10. Nitrogen oxides are, under certain conditions, precursors for CCN and thus for the formation of fog, although their efficacy is much lower than that of SO2. They cannot be regarded as a proxy for FO as PM10 can. Overall, their contribution is not significant (B = − 0.018; p = 0.678).

Our regression model states that aP has no significant contribution to fog formation. We did not further study if the general weather situation, i.e., the presence of high-pressure or low-pressure systems, plays any role in the formation of radiation fog or advected fog. The general picture suggests that the atmospheric pressure is not a significant driver for the overall occurrence of fog also on a yearly basis. Overall, the regression model suggests that high concentrations of particulate matter (PM10) along with the low presence of high relative humidity (rH) and minimal temperatures (Tmin) have a strong influence on fog formation mechanisms over the MASP. Statistical uncertainties remain. For instance, both goodness of fit (R2Nagelkerke = 0.164) and effect size (f = 0.44) demonstrate the limitation of the explanatory power of the model. Therefore, the interpretation of the results should be exercised with caution.

4 Conclusion

Most of the results of our overview trend analysis for assessing the occurrence of fog in the MASP under the influence of meteorological drivers and air pollution are highly consistent with scientific understanding of fog formation: The decrease of AFO over decades is associated with increasing temperatures and decreasing relative humidity. Ongoing urbanization and the regional imprint of global change are, thus, rather obvious contributors to the observed trend of AFO. This is evident from visual inspection of the longer historical dataset (since 1933) and the shorter dataset (since 1998) as well as from statistical analysis of the latter set.

Further drivers are increasing air quality in the MASP. The annual trends of air quality parameters (Fig. 3) and their synchronicity with AFO (Fig. 2) are striking. The statistical analysis of the shorter data set with a logistic regression partly supports these observations. PM10 is the strongest driver of AFO, followed by rH and Tmin. This indicates that both warming and increasing air quality lead to the decrease of AFO in the MASP.

For PM10, its dominant role is conceivable as it is a proxy for CCN, which are effective drivers for reduced visibility and fog formation whenever rH is very high (Maurer et al. 2019). Statistically, NOx is not a significant contributor to the AFO trend. This result is not surprising, because nitrogen oxides are only indirectly associated with the formation of PM and CCN. Note, however, that the NOx decreases together with AFO.

Due to its chemical properties, we expected that increasing levels of SO2 would contribute positively to fog formation. Figure 3 shows a strong decrease of the yearly SO2 concentration, which occurs together with the decrease of AFO (Fig. 2), suggesting a positive correlation between these two parameters. The logistic model results, however, do not support this result, even suggests an opposite relation. At this point, it is not unequivocally clear why the logistic regression does not confirm this picture. We speculate that the SO2 concentration is, in the MASP, more closely related to the formation of haze situations (visibility 1–5 km) with lower humidity associated with radiative processes, while situations of even lower visibility (< 1 km) are mostly associated with humidity advected from the Atlantic Ocean (Da Rocha et al. 2015). Also, there may be a temporal and spatial mismatch between the observation of fog (one location, often one point in time during a day) and the SO2 data (daily medians over the entire MASP). The role of SO2 is, thus, not entirely comprehensible. We speculate that an analysis based on data with better spatial and temporal allocation would change the picture considerably.

The atmospheric pressure (aP) does not contribute significantly to fog formation. Likely, a more detailed analysis of fog occurrence during the various seasons of the year would lead to a different result, because the weather systems in winter lead to a higher fog frequency than those in summer. It has been shown for other regions that changing circulation patterns may lead to changing patterns of fog occurrence (e.g., LaDochy and Witiw 2012).

It is not expected that the drivers we examined can fully explain the formation (or nonformation) of fog. The explanatory power of our model is only 16%. Further note that, strictly speaking, most parameters analyzed in this and other studies are nothing more than proxies of potential drivers of fog formation. Only for the meteorological parameter rH, a direct and causal effect on fog is expected, because fog occurs only at high-enough rH. However, such high rH is rather difficult to measure accurately so that these data are often not reliable. Therefore, the air temperature is an attractive proxy: With increasing temperature and all other conditions being constant, rH decreases. In a similar fashion, SO2 or NOx are not at all direct drivers of fog. Only their oxidation and subsequent formation of secondary aerosol particles drives the formation of cloud condensation nuclei (CCN). A larger number of CCN leads, with all other conditions being constant, to a larger number of yet smaller fog droplets. Klemm and Lin (2016) showed that this effect leads to a reduction of the horizontal visibility and thus enforces fog. The PM10 concentration, which is measured in São Paulo and many other places, is the only available proxy for CCN in long-term time series. However, the correlation between PM10 and CCN is in many cases poor and therefore the PM10 concentration is a rather nonspecific proxy for CCN. Given these limitations we are impressed by the strong role PM10 plays in driving the formation of fog in the MASP. We conclude that the model performance (goodness of fit R2Nagelkerke = 0.164, effect size f = 0.44) is acceptable to provide an overview of the parameters driving fog trends in the MASP.

More detailed analyses in the future should include the seasonality of fog and meteorological conditions. This may help to elucidate the role of regional weather systems and their dynamics in the context of how climate change drives fog occurrence in the MASP. Further, nonlinear models should be employed. Hůnová et al. (2018, 2020) found for their studies in the Czech Republic that rH, SO2, NOx, air temperature and seasonality were explanatory variables for fog trends. However, they also found that the number of stations analyzed and the lengths of the observation periods played important roles, just like the seasonality. Maurer et al. (2019) found for datasets from Taiwan that the influence of PM10 on fog was stronger when the PM10 concentration was lower, while rH was more important in more polluted air masses.

Future datasets should also include the quantification of the horizontal visibility rather than just the dichotomous parameter of presence/absence of fog during a given day. Also, it should be considered to use exclusively hourly air quality data measured at the same times when fog was observed. We also suggest that intensive campaigns should be performed to produce more detailed data on air chemistry, the chemical composition of CCN and fog, and their particle size distributions in order to quantify how local and regional air pollution influence the formation and density of fog.