A Feasibility Study for Determining the Sensible Heat Flux to and from Small Green Roofs

Eddy covariance is an established technique for registering sensible heat fluxes. However, this method is less adequate for smaller surfaces that cannot match the associated footprint. As an alternative technique, a recently constructed acoustic anemometer (Ly-ATOM) is tested, which operates horizontally at an extension of circa 1 m and a data-acquisition frequency of 1 Hz. The Ly-ATOM device reproduces both the acoustic virtual temperature and the horizontal wind components registered by a three-dimensional sonic anemometer. As this Ly-ATOM device can be applied much closer to the ground compared with a sonic anemometer, the size of the related source area is significantly reduced (by a factor of 25). Two methods are used to retrieve the sensible heat flux from variance characteristics of temperature and the horizontal wind components recorded by the Ly-ATOM device: combining the flux-variance-similarity and alternative-flux-variance methods for use in unstable and stable stratification, respectively, yields good results for the sonic measurements. Therefore, these methods can be applied to the Ly-ATOM device as well. In investigating the sensitivity to detect modified surface characteristics, specifically increased evapotranspiration and decreased surface albedo, the Ly-ATOM device proves to be superior to the sonic anemometer which is more vertically removed from the surface of interest.


Introduction
In recent years, sustainable development, especially of cities, has received increasing attention. Besides their potential for aesthetic improvement, green roofs are also considered to hold multiple environmental benefits (Berardi et al. 2014).
One of the benefits attributed to green roofs is the attenuation of urban heat stress. However, few studies evaluate the potential of green roofs to reduce the temperature at street level, with B Niels Wollschläger niels.wollschlaeger@ufz.de 1 Helmholtz Centre for Environmental Research, Leipzig, Germany 2 University of Leipzig, Leipzig, Germany most using mesoscale models on an urban scale (Santamouris 2014). Overall, the results of such simulation studies regarding the cooling potential and temporal occurrence vary greatly (Rosenzweig et al. 2006;Chen et al. 2009;Ng et al. 2012). In this context, Morakinyo et al. (2017) found a distinct influence of the local climate and building height on this cooling potential. These inconsistencies in simulations on the urban scale are also accompanied by the still uncertain effect of green roofs on local micrometeorology. Green roofs are capable of reducing surface temperatures during summer months (Teemusk and Mander 2009;Morau et al. 2012;Bevilacqua et al. 2017), however, results concerning the cooling potential of green roofs on the ambient air temperature vary in terms of magnitude and temporal occurrence. MacIvor et al. (2016) performed an experimental analysis above multiple small green roof modules with variations in substrate type, irrigation schedule, and plant community in Toronto. There, Sedum was particulary promoted for its cooling potential compared to other species (meadow mix of 15 grasses and scattered wildflowers) of up to 1.5 K. Blanusa et al. (2013) compared different green roof species regarding leaf surface cooling, cooling of the substrate beneath the canopy layer, and air temperature in Reading, U.K. However, temperature differences between these species were only detected for the hottest summer days at a height of 0.3 m. Stachys was found to outperform Sedum. Heusinger and Weber (2015) compared air temperature measurements at a height of 0.5 m of an extensive green roof to a bitumen roof and determined an air cooling effect of 0.7 K in Braunschweig, Germany. Speak et al. (2013) detected a decrease in monthly median air temperature of 1.06 K above an intensive green roof compared with a concrete roof at a height of 0.3 m in Manchester, U.K. Air temperature over the green roof was lower than over the concrete roof throughout the diurnal cycle, while higher daytime temperatures were found for a roof area with less maintenance resulting in damaged vegetation.
Green roofs with relatively abundant soil moisture have a comparable effect in reducing surface and near-surface temperatures to cool roofs with an albedo value of 0.7 (Li et al. 2014). A study in Utrecht, Netherlands by Solcerova et al. (2016) compared temperatures at a height of 0.15 m across several green roof plots to a white gravel roof. There, a night-time cooling and daytime warming effect by the green roofs was observed, which resulted in a slight net warming of the air by the green roofs.
Overall, generalizations regarding green roof thermal performance are difficult since differences in air temperature are quite small and there are a variety of green roofs as well as reference roof designs. Jim and Peng (2012) showed a strong influence of the measurement height and background weather patterns on the green roof thermal performance. Furthermore, the thermal performance is influenced by the irrigation scenario and moisture availability (Heusinger et al. 2018;Zhang et al. 2020). This makes comparisons between studies even more difficult.
For a physically comprehensive assessment of the thermal effects of different types of green roofs, the surface energy balance should be considered (see, e.g., Foken 2017) Solar radiation reaches the surface (SW ↓ flux density) whereby, depending on the surface albedo α, shortwave radiation is reflected back into the atmosphere (SW ↑ flux density). Furthermore, the surface emits longwave radiation (LW ↑ flux density), according to the Stefan-Boltzmann law, but longwave radiation reaches the surface (LW ↓ flux density), too. These four components can be summarized as the net radiation Q * . Usually, at noon, the net radiation is positive, whereby the surface gains more energy than it loses. This energy excess is transported away from the surface by several heat fluxes. The sensible heat flux Q H causes heating of the lowermost hundreds of metres of the atmosphere and originates from the temperature difference between the surface and overlying air layers. The second turbulent heat flux is the latent heat flux Q E , which describes the water exchange processes (including evaporation, transpiration from plants, and condensation) between the surface and atmosphere. The ground heat flux Q G quantifies the temperature exchange between surface and soil by heat conduction. Finally, the residual term in the surface energy balance Q R covers undetected flux parts like advection terms, experimental limitations, or heat storage in both soil and roughness elements (Foken 2017). Experimental studies dealing with the evaluation of heat fluxes of green roofs are rare, which can be attributed to the difficulties of appropriate measurement techniques. Considering surfaces with rather restricted spatial dimensions, Q G and Q E can be determined via the use of ground-heat-flux sensors, and lysimeters, respectively. However, the eddy-covariance (EC) method, which is the standard method, requires a three-dimensional acoustic anemometer (sonic) to determine the sensible heat flux Q H . The sonic requires a minimum height for applying the EC method, wherefore measurements may be impossible to perform inside the respective internal boundary layer (IBL) of the desired surface (Aubinet et al. 2012). The IBL describes a significantly developed disturbance layer near the surface, and causes several challenges for proper micrometeorological measurements. It originates from horizontal advection over discontinuities of the surface properties (Foken 2008). Due to the IBL, the exchanges of both mass and energy are also affected by the neighbouring area since the wind profile remains in balance with the previous surface (Pires et al. 2015). Therefore, measurements above the height of the IBL should be avoided; correct measurements are taken inside the IBL (Fig. 1). According to Foken (2008), in most cases the parametrization for the height of the IBL δ in m can be described by δ = 0.3 √ s d (Raabe 1983), where s d denotes the run length (m) to the next windward surface type. In order to perform valid EC measurements above a green roof, Heusinger and Weber (2016) used a huge green roof with a size of 8600 m 2 in Berlin, Germany as the study area. However, comparisons to similar green roof systems, including differences in plant choice, structure, etc. are difficult to perform since the second green roof may experience different meteorological conditions. Therefore, only generalized statements compared to conventional urban surfaces using the Bowen ratio β = Q H /Q E could be made. Klein and Coffman (2015) compared EC measurements, which were conducted in Norman, Oklahoma, USA, above a conventional and a green roof with a size of 116.1 m 2 . Although a reduction in the magnitude of Q H over the green roof was observed for days with generally high Q H values, measurements within the corresponding IBL could not be guaranteed, leading to inaccuracies.
In the studies of Takebayashi and Moriyama (2012) and Heusinger and Weber (2015), multiple surfaces were investigated with respect to their impact on the urban climate by considering the sensible heat fluxes with the help of the Juerges equation, which is based on the difference between surface and air temperatures (Juerges 1924). While this method can be performed for surfaces with small dimensions, however, the surface temperature distribution of green roofs is not homogeneous, leading to inaccuracies in sensible heat flux estimations (Ayata et al. 2011).
Here, it is investigated whether valid acoustic measurements can be performed much closer to the ground compared to a standard sonic anemometer on a horizontal plane. In particular, the feasibility to detect sensible heat fluxes, based on temperature and wind fluctuations, with a small source area and inside the respective IBLs is examined.

Derivation of Temperature and Velocity Based on Acoustic Measurements
In a still atmosphere, acoustic wavefronts propagate at the Laplace speed of sound with γ d = 1.4 as the ratio of specific heat capacities at constant pressure c P and constant volume c V , and the specific gas constant for dry air R d = 287.05 m 2 s −2 K −1 . In order to allow the use of the parameters for dry air, the acoustic virtual temperature T av = T (1 + 0.513 q) is used in (2), which takes the specific humidity q and temperature T into account. As a good approximation, the influence of the specific humidity may be neglected, since its contribution to the speed of sound is much lower compared with the air temperature (Barth and Raabe 2011). The effective sound speed c eff depends on the travel times of acoustic signals τ , which propagate along a ray path of length d. Besides the Laplace speed of sound, the effective sound speed c eff consists of a flow-dependent portion v r , which denotes the velocity component in the direction of the sound propagation The temperature and the velocity component along a certain path can be separated by acoustic measurements in opposite directions and by applying the reciprocal method. Therefore, the Laplace speed of sound, which is temperature dependent, can be obtained by eliminating the dependence on the velocity component via adding the effective sound speeds in the forward direction (c eff,1 ) and backward direction (c eff,2 ) (Barth et al. 2013) by Afterwards, the acoustic virtual temperature can be derived from the definition of the Laplace speed of sound (2). In order to obtain the velocity component v r , the influence of the temperature needs to be eliminated by subtracting the effective sound speeds measured in the forward and backward directions by Hence, acoustic travel-time measurements enable the detection of the line-averaged temperature and the velocity components along a sound ray path.

Eddy-Covariance Method
The EC method is the only micrometeorological method that allows direct measurements of sensible and latent heat fluxes (Järvi et al. 2018). It is based on the assumption of horizontal homogeneity and that all transport processes of heat and moisture between the atmosphere and the ground-proximate layer take place only through turbulent mixing (Foken 2008;Zhao et al. 2010).
With the help of the covariance of temperature and the vertical velocity component w T , the sensible heat flux can be calculated as where ρ represents the air density, and c P = 1005 J kg −1 K −1 .
For the determination of the heat fluxes with the EC method, small fluctuations need to be measured. Therefore, the quantities of interest must be measured with high temporal resolution, which is usually realized via the use of a sonic anemometer. According to Aubinet et al. (2012), an upwind fetch of the order of 100 times the measurement height is recommended.

Flux-Variance-Similarity Method
Another method for estimating the sensible heat fluxes is the flux-variance-similarity (FVS) method, which is based on Monin-Obukhov similarity theory (MOST). For an unstable atmospheric surface layer over a homogeneous and flat surface, the dimensionless (normalized) standard deviation σ of a scalar quantity X can be expressed as a function of the Monin-Obukhov stability parameter ζ (Tillman 1972;Cava et al. 2008 where X * = w X /u * is a scalar scale defined by the turbulent flux of the scalar w X and the friction velocity u * . The Monin-Obukhov stability parameter ζ = z/L is given as the ratio of the measurement height z and the Obukhov length L, which is defined as L = −T u 3 * /κ g w T with the von Kármán constant κ = 0.4 and the acceleration due to gravity g. The following characteristics of the function f X (−ζ ) (7) can be assumed (Tillman 1972;Wesson et al. 2001;Cava et al. 2008) 1. Neutral case: −ζ converges to zero and f X (−ζ ) approaches a constant. 2. Free convection limit: −ζ → ∞ and σ x /X * should become independent of u * In the case of the sensible heat flux, the variable X * is replaced by the characteristic temperature scale (T * = −w T /u * ). Taking both criteria into account, (7) yields with the similarity constants C 1 and C 2 . Inserting the definitions of both T * and Q H (6) leads to Substituting for u * from the definition of L and rearranging gives the following expression for the sensible heat flux, which is valid for the entire unstable range (Tillman 1972) In the case of free-convection conditions, (8) can be simplified according to Wyngaard et al. (1971) and Cava et al. (2008) as which yields the following expression for the sensible heat flux Thus, for free convection conditions, the FVS method requires only high-frequency (2-10 Hz) measurements of air temperature at a single height (Ullah et al. 2019). However, the FVS method applies only to unstable atmospheric stratification (−ζ >0) and is not able to predict negative sensible heat fluxes since σ T ≥ 0. Thus, most studies when applying the FVS method, like Zhao et al. (2010), Wang et al. (2013), and Ullah et al. (2019), did not consider Q H during night-time stable conditions.

Alternative-Flux-Variance Method
Based on the MOST relationships for temperature and the streamwise velocity component, an alternative-flux-variance (AFV) method for the estimation of the sensible heat flux Q H-AFV during stable conditions (ζ > 0) is (De Bruin and Hartogensis 2005) in which the exponent m describes the influence of the magnitude of ζ for stable stratification.
Using the definition of T * , rearranging (14), and inserting (13) leads to According to Wesson et al. (2001) and Babić et al. (2016), an exponent of m ≈ 0 for stable stratification can be assumed. Thus, the sensible heat flux can be calculated for stable conditions as

Measurement Site
A measurement campaign in the garden of the Leipzig Institute for Meteorology (LIM), which is close to the city centre, was carried out from 23 June 2019 to 1 July 2019. The data analysis was performed in UTC, which is 2 h earlier than local time (Central European Summer Time). An acoustic ultrasonic anemometer (USA-1, METEK, Elmshorn, Germany), which is shortened to sonic here, was used to determine the three velocity components and the temperature at a height of 2.3 m and at a sampling frequency of 10 Hz.
For the detection of micrometeorological quantities and the representation of the energy fluxes between the surface and the atmosphere, four acoustic sensors at a height of 0.25 m were equidistantly attached on a metallic ring. The distance between the two opposing sensors, each consisting of a combination of transmitter and receiver-type ultrasonic sensors (A-18P20, EKULIT, Ostfildern, Germany), was about 1.3 m . This device, which is depicted in Fig. 2, is called Ly-ATOM in the following, since it was originally designed to enable acoustic tomography measurements above lysimeters (Raabe et al. 2012). Initially, the Ly-ATOM set-up was placed on a concrete surface. On 25 June 2019 at 0700 UTC, a rolled sod was laid, and the surface was modified by irrigation and a dark foil (see Table 1). It was ensured that the acoustic sensors of the Ly-ATOM device were located inside a possible IBL in order to detect the properties of the targeted surface (see Fig. 1). The measurements were supplemented by a net radiometer (CNR4, KIPP& Zonen, Delft, the Netherlands), which was placed next to the Ly-ATOM.

Raw-Data Processing
The 10-Hz raw data of the sonic acoustic virtual temperatures and velocity components were processed, including double coordinate rotation and a spike detection algorithm (outside ±6 standard deviations) via an in-house program written in Fortran 90/95. The Ly-ATOM device provided sound transit times with a temporal resolution of 1 Hz, which were processed with the statistical programming language R. Acoustic virtual temperatures and the horizontal velocity components were calculated as described in Sect. 2.1 for the sensors, which are facing each other and passing the centre of the ring. The Ly-ATOM set-up includes an autonomous calibration, which yields the distances between all sensor heads (see Fig. 2). These preassigned distances between the sensors of the Ly-ATOM device were improved by comparison with the sonic, which serves as the reference device for a rather high-wind-speed period in the case of temperature and a calm period in the case of the velocity components. Again, a coordinate rotation was performed in which the rotation around the y-axis could be omitted since no vertical velocity component was detected, and outliers were removed using a spike detection algorithm. A block-averaging time period of 10 min was selected for both devices in order to enable direct comparisons.

Estimation of Flux-Variance-Similarity Constants
Knowledge regarding the FVS relationship f(−ζ ) (8) is required to apply the FVS method. The constants C 1 and C 2 were determined using a Bayesian parameter estimation approach with 10,000 iteration steps. For both C 1 and C 2 , log-normal distributions were assumed as prior distributions with a geometric mean of 1 and very small precision of 0.00001. Data points in the near-neutral range (−ζ > 0.02) and outliers were removed. The FVS constants were estimated with 95% confidence as C 1 = 0.822(±0.04) (mean ± standard error) and C 2 = 0.015(±0.008). The obtained relationships are displayed for unstable conditions on a log-log plot (Fig. 3). As suggested by Tillman (1972), the freeconvection approximation can be assumed to be valid for values of −ζ < 0.05 and simplifies (8) to (11). The FVS constant C 1 differs greatly from previously reported values, which are Dimensionless standard deviation of air temperature as a function of ζ determined with the sonic. Data points in the near-neutral range (ζ < 0.02) and outliers (outside two standard deviations of the regression line), which are represented by the grey circles, were excluded from the analysis. The solid black line represents the experimental similarity function for the whole unstable range σ T /T * = C 1 (C 2 − ζ ) −1/3 with C 1 = 0.822 and C 2 = 0.015. The dashed black line denotes the free-convection approximation usually in the range between 0.9 and 1.1 (Wyngaard et al. 1971;Tillman 1972;Albertson et al. 1995;Cava et al. 2008;Zhao et al. 2010), which illustrates the importance of determining the FVS relationships for each study area. The large scatter in the data is typical for measurements in the surface layer (Zahn et al. 2016). Deviations from the relationships of Wyngaard et al. (1971) and Tillman (1972) may be attributed to the heterogeneity of sources and sinks inside the surface layer, violating the assumptions of MOST (Filho et al. 2008;Zahn et al. 2016).

Determination of the Correction Coefficient for the Alternative-Flux-Variance Method
To determine the magnitude of Q H during stable conditions, the AFV method is applied, and the constants C u and C T need to be estimated (16b). Therefore, the dimensionless standard deviations of temperature and the streamwise velocity component according to (14) and (13), respectively, are considered in terms of their dependence on the stability parameter ζ (see Fig. 4 including regression analyses). The influence of the magnitude of ζ is found to be not significant, with the constants determined to be C T = e 1.029 = 2.797(±0.2) and C u = e 0.865 = 2.376(±0.2), which agree well with the values of 3.0 and 2.3, respectively, found by Pahlow et al. (2001). Thus, the correction coefficient for (16b) results in r corr = 1/(C u C T ) = 0.15(±0.06).
In contrast to the EC method, the applications of the FVS and AFV methods require knowledge concerning the present stratification in order to apply the correct formulae. Besides ζ , this distinction can be performed with the help of the vertical temperature gradient or by consideration of the net radiation (Wesson et al. 2001;Foken 2017). Here, the distinction between stable and unstable stratification is made based on the sign of ζ and the net radiation measurements using the sonic and Ly-ATOM device, respectively. A positive net radiation as a function of ζ determined with the sonic: data points in the near-neutral range (ζ < 0.02) and outliers (outside two standard deviations environment of the regression line), which are indicated by the grey circles, are excluded from the analysis. The solid red line depicts the linear-regression model corresponds to unstable stratification, while a negative net radiation indicates stable stratification.

Criteria for Comparison
In order to enable comparisons between the considered time series, two criteria were employed: the index of agreement (i A), which can detect additive and proportional differences in the observed and simulated means and variances, is a standardized measure of the degree of model prediction error. This parameter indicates the ratio of the mean-square error to the potential error by and varies between 0 and 1, where 0 indicates no match and 1 indicates a perfect agreement.
Here, O i is the observation value, S i is the reference value, and O and S are the respective average values (Willmott 1981). The root-mean-square error indicates the standard deviation of the model prediction error. Since the value of R M S E is ideally zero, a large value of R M S E suggests a poor agreement.

Footprints
Most green roofs offer only limited space and may possess obstacles, which perturb EC measurements. Since footprint models are based on the assumption of horizontal homogeneity, they can only be considered as a first approximation in complex terrain (Aubinet et al. 2012). The footprint climatologies of both measurement systems were determined using the flux-footprint model of Kljun et al. (2015) (Fig. 5). Comparing the respective 80% footprint source area of both devices shows a remarkably larger source area of the sonic, which can be attributed to the differences in measurement height. The 80 %-source area of the Ly-ATOM device is reduced by about 96 % compared with the sonic. The environment cannot be considered as homogeneous and the measurements were disturbed by the surrounding allotments and walls, which were 8 m (to the north) and 13 m (to the east) away from the sonic.

Measurements of Temperature and Velocity Components
In Fig. 6, time series of temperature, wind direction, and wind speed recorded by the Ly-ATOM, sonic, and LIM weather station are shown. All measurement devices detect similar values of air temperature, while wind speeds detected by the LIM weather station (10 m above the ground) are significantly higher compared with the sonic and Ly-ATOM due to the logarithmic wind speed profile.
Differences between the sonic and Ly-ATOM measurements of temperature and wind speed are comparatively low (R M S E = 0.67 K and 0.31 m s −1 and i A = 0.99 and 0.84).
Thus, the Ly-ATOM device can reliably detect area-averaged values of temperature and wind speed in the direct vicinity of the surface.

Sensible-Heat-Flux Estimations
Since the Ly-ATOM device yields feasible results concerning the temperature and the velocity fluctuations, their corresponding fluctuations can be investigated in order to determine Q H via the FVS and AFV methods. In a first step, these methods were tested with measurements of the sonic. Afterwards, the FVS and AFV methods are applied to measurements of the Ly-ATOM device. Since the EC method is the standard micrometeorological method for detecting turbulent surface heat fluxes, it serves as the reference method.  Kljun et al. (2015) In the case of sonic measurements, the FVS method in the near-neutral range (0 ≤ −ζ < 0.05) was calculated according to (10), while (12) was used for −ζ > 0.05.
In the case of the Ly-ATOM measurements, this modification of the FVS method in the near-neutral range was not applied, since there is no information concerning the magnitude of ζ without supplementing measurements of the sonic.

Sonic
In Fig. 7, time series of Q H-EC are compared to Q H-FV and Q H-AFV for the measurement period of 24-30 June 2019. The Q H-EC follows a diurnal pattern with positive peaks around noon, which were above 200 W m −2 at each day of the measurement period, and slightly negative values around −30 W m −2 during the night-time.
Using the FVS method, daytime Q H-EC values can be well reproduced during the measurement period. However, daily maximum values are slightly attenuated on 25 and 30 June.
As night-time absolute values of Q H are quite small, the significance in an appropriate detecting was minor in past research (De Bruin and Hartogensis 2005). Nevertheless, the surface energy balance during the night-time is meaningful for urban climate research. For stable stratification, the magnitude of Q H-AFV corresponds well to that of Q H-EC .  2.3 m), PT100-resistance thermometer (z = 2 m), and cup anemometer (z = 10 m), the latter two being recorded by the LIM weather station. The grey-shaded domains indicate distorted measurements of the Ly-ATOM device due to loose sensor mountings and were, therefore, excluded from the analysis Since a combination of the FVS and AFV methods is able to recover Q H-EC all day, these are applied to the Ly-ATOM measurements in a next step.

Ly-ATOM
The FVS and AFV methods were adopted to recordings of the Ly-ATOM device in order to measure inside the corresponding IBL of the surface. In Fig. 8, Q H-AFV and Q H-FV , determined with the Ly-ATOM device, are compared to Q H-EC . Again, a very good agreement with Q H-EC can be achieved using the Q H-AFV method during night-time. In contrast, daily maximums of Q H-FV are considerably reduced compared with Q H-EC .
Discrepancies between the applied methods can be observed around the zero line (when the sensible heat flux changes its sign), as well, since transitions from Q H-FV to Q H-AFV (and vice versa) are less smooth compared with the sonic measurements and discontinuities are present. These may be reduced with an improved method for the differentiation between stable and unstable stratification. Moreover, differences in terms of atmospheric stratification may exist due to the differences in height and underlying surface properties. Larger deviations from the EC method are observable on 26 and 30 June, which can be attributed to the surface manipulations. These are examined in more detail in Sect. 4.4.

Comparison
In order to evaluate the performance of both the AFV and FVS methods for the sonic and Ly-ATOM devices, respectively, they are compared with Q H-EC by using a Taylor diagram (Fig. 9). This comparison is not intended as a benchmarking procedure of the Ly-ATOM device against the sonic and we do not expect perfect agreement of the measured values. Instead the aim is to demonstrate that (i) the heat fluxes measured by the Ly-ATOM device are in a reasonable range, and (ii) the specific measurement conditions (sensor height, surface material, footprint area) of the Ly-ATOM device are properly reflected in the flux values. Overall, the applied methods show a reasonable agreement to the equivalent Q H-EC in terms of the standard deviation, correlation, and the magnitude of R M S E. For the sonic measurements, both AFV and FVS methods agree better with the EC data in terms of the magnitude of R M S E and the correlation coefficient than heat fluxes calculated from Ly-ATOM recordings. This is plausible since the Q H values were determined with another device, above a The EC method was applied to measurements of the sonic. The differentiation between Q H-AFV , and Q H-FV was performed on the basis of net radiation measurements of the CNR4 net radiometer located next to the Ly-ATOM device. The grey-shaded domains indicate distorted measurements of the Ly-ATOM device due to loose sensor mountings and were, therefore, excluded from the analysis  Table 2.
The measurements with the Ly-ATOM device were conducted in the roughness sublayer, while the EC measurements were performed in the inertial sublayer. In the upper part of the surface layer, where the turbulent fluxes are almost constant with height, MOST can be generally applied (Sfyri et al. 2018). Ullah et al. (2019) found no essential differences when comparing FV measurements at different heights. Indeed, measurements of sensible heat fluxes in the roughness sublayer have not been investigated in detail. In the roughness sublayer, turbulence cannot be considered as homogeneous, wherefore MOST is not valid to its full extent and problems in flux estimations may occur (Järvi et al. 2018). This may generally account for differences of sensible heat flux estimations between sonic and Ly-ATOM device. Fig. 9 Performance of different methods for the determination of Q H in a Taylor diagram. The EC method serves as a reference for both unstable (FVS method) and stable stratification (AFV method) and is represented by the green triangle. Standard deviation (radial distance from the origin) and RM SE values (represented by the concentric distances between the reference measurement and symbol) are normalized based on Q H-EC . Measurements of the sonic are represented by a circle, while the triangles indicate measurements carried out with the Ly-ATOM device. The days 26, 27, and 30 June are excluded in this plot since the measurements were distorted on these days due to loose sensor mountings

Detection of Surface Modifications
During the measurement campaign, the surface was modified multiple times by irrigation and a dark foil. These modifications of the surface are associated with changes of temperature and Q H . It might be anticipated that watering of the surface leads to an increased latent heat flux and the sensible heat flux should be lowered since net radiation remains constant. Due to evaporation, the temperature should decrease, too. The albedo lowered by the dark foil leads to an increase in net radiation (1c), thus, the temperature and sensible heat flux increase (Li et al. 2014).
Aiming to investigate the influences of the activities, the time series Y t , was split into its respective components, trend Y T ,t , seasonality Y S,t , and error t by For this, a dynamic linear model, containing a diurnal seasonal component and a local level trend (second-order polynomial model), was deployed. The related Kalman filter serves as a low-pass filter for the trend (cut-off period = 1 day) and as a band-pass filter for the seasonal The grey background indicates recordings above the concrete surface, while the green background marks measurements above turf. The blue stripes refer to watering of the soil and black stripes refer to a placed dark foil. Measurements of the sonic were neither performed directly above the concrete nor the turf, for which the background was kept blank component (period = 1 day) (Schlink et al. 1997). Finally, the random component is calculated by subtracting the trend and seasonal components from the original signal. Subsequently, a statistical intervention analysis examines whether there is an event over time that affects the level of the observed time series in the short or long term (Rainer Mean values correspond to the averaged level of the respective time series before the surface modification (Y t ) and give perspective on the magnitude of the effect, while the effects correspond to the regression coefficient b (with 95% confidence) in (21) Schlittgen 2012). Therefore, a dummy variable I t is generated, which encodes the intervention periods of the respective surface modifications t 0 by The influence of the modifications can be evaluated using a linear-regression analysis, which was applied to the intervention model, where b describes how the activity at time t 0 affects the time series.
The influence of the activities on the random components of temperatures as well as Q H measured with both devices is depicted in Fig. 10. In Table 3, the parameters of the intervention analysis are summarized. Watering of the surface led to a decrease of the temperatures of the Ly-ATOM device and the sonic by 0.15 K and 0.17 K, respectively. The dark foil increased the temperature by 0.65 K (Ly-ATOM) and 0.53 K (sonic). Considering the sensible heat fluxes, irrigation of the surface led to a reduction of sensible heat fluxes (see minima in Q H during blue stripes in lowermost plot of Fig. 10). For Q H-EC , the effect is least pronounced and is most uncertain. The dark foil leads to an increase in the magnitude of Q H (see peaks in Q H during grey stripes in the lowermost plot of Fig. 10), which is much more pronounced for Q H-AFV and Q H-FV compared with Q H-EC .
All in all, the conducted modifications fulfilled prior expectations. The determined effects of irrigation and the dark foil are more pronounced for the sensible heat fluxes compared with the temperature due to the extremely sensitive character of the turbulent heat fluxes (Raabe et al. 2002). Due to the underestimation of Q H-FV with the Ly-ATOM device, intervention effects of Q H-FV are minor compared with Q H-AFV . For irrigation periods, however, the air humidity rises and the approximation T ≈ T av for determination of temperature, and Q H becomes more imprecise (Schotanus et al. 1983). Since the sonic is situated outside the IBL, while the Ly-ATOM device detects surface characteristics inside the IBL, intervention analysis result in more pronounced effects for sensible heat fluxes of the Ly-ATOM device compared with the sonic. Therefore, it could be demonstrated that the Ly-ATOM device is able to detect changes in surface characteristics, and is most suitable for investigating surfaces with rather restricted spatial dimensions. At this point, it should be noted that the precise estimation of the magnitude by the respective effects is not the aim of this analysis.

Conclusions
This work compared different approaches (EC, FVS, AFV techniques) to determine the sensible heat flux above green roofs. Besides a three-dimensional sonic anemometer, a measurement system consisting of four horizontally arranged acoustic sensors (Ly-ATOM) was utilized.
Considering the sonic measurements, AFV and FVS methods were tested and a combination of both methods result in good agreement with the EC method. Sonic measurements are not appropriate for detecting the surface energy balance of surfaces with rather restricted spatial dimensions, as it is usually the case for green roofs, since the source area of the EC method (sonic) is too large. Therefore, with the help of the Ly-ATOM device, acoustic measurements in the proximity of the surface were employed. Thus, the corresponding footprint could be significantly decreased and the impact of obstacles on the measurements were reduced in comparison to an EC measurement set-up. It was demonstrated that the Ly-ATOM device is able to detect reliably the temperature and horizontal velocity components. The vertical velocity component cannot be measured with the Ly-ATOM device, wherefore the EC method cannot be employed. Hence, a combination of the FVS and AFV methods was applied to the Ly-ATOM measurements and compared to the eddy-covariance measurements of the sonic. Despite distinctly lower daytime Q H-FV values, the estimated heat fluxes of the Ly-ATOM device were generally still consistent with the EC measurements.
Adjustments regarding the applicability of the FVS and AFV methods, in particular the FVS constants C 1 and C 2 as well as the correction coefficient r corr , were derived on the basis of the sonic measurements. The validity of adopting these findings to the Ly-ATOM device is questionable. Nevertheless, it could be proven that a horizontal acoustic anemometer is able to determine the sensible heat flux of the underlying and very near surface, as could be demonstrated by the greater effects of the surface modifications on the Ly-ATOM measurements compared with those of the sonic.
By combining the Ly-ATOM device with ground-heat-flux sensors, lysimeters and radiation balance measurements, all components of the energy balance can be measured. Since each measurement device detects only a small source area, statements concerning the energy balance of a particular location are enabled. Thus, comparisons between various green-roof types regarding their cooling potential on the overlying air layers are possible. In this regard, based on the evaluation of the surface modifications, this study shows a more rapid and more pronounced response on differences in surface properties of the sensible heat flux than temperature. Thus, differences between various surfaces with different surface properties regarding the impact on the overlying air can be reliably observed by the Ly-ATOM technique. By reason of a reduced sampling frequency of the Ly-ATOM technique, spectral losses in the high-frequency range occur. Furthermore, the block averaging time span was assumed to be 10 min, for which losses in the low-frequency range may appear as well. In order to minimize the residuals of the energy balance closure, these spectral corrections should be included in subsequent studies. appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.