1 Introduction

Since its inception, numerical weather prediction (NWP) has always been governed by a tight balance; on the one hand using the most up-to-date science in representing atmospheric processes, and on the other, the best use of available computational resources. Ever since the earliest days of operational NWP in the 1950s, parametrization has been key in maintaining this balance, such that subgrid-scale physical processes could be represented and their influence be passed back to the grid. Increasing model resolution through the years has allowed for more advanced dynamics (e.g. non-hydrostatic modelling) and finer detail (e.g. mesoscale systems and orography), with a marked increase in forecast accuracy (Carpenter 1979; Simmons et al. 1989; Mass et al. 2002; Lean et al. 2008). However, with many NWP centres now running limited-area models (LAMs) at grid spacings on the kilometric or even sub-kilometric scales, the time has come to once again ask the question – is higher resolution always better?

The grey zone of turbulence (or terra incognita, Wyngaard 2004) refers to resolutions at which turbulent eddies in the atmospheric boundary layer are partially resolved and partially parametrized, a regime that is now emerging in the highest resolution mesoscale models. Similarity functions derived by Honnert et al. (2011) and a length scale analysis from Beare (2014) imply that the limits of this regime can vary for different model fields and configurations, with a strong sensitivity to the scale of the features being modelled. Eddies that are represented within the grey zone often appear to evolve in a grid-dependent way (Ching et al. 2014; Zhou et al. 2014), and since neither one-dimensional (1D) planetary boundary-layer (PBL) schemes (e.g. Lock et al. 2000) nor three-dimensional (3D) large-eddy simulation (LES, Lilly 1967) formulations are designed to work within this regime, a necessity is growing to find new ways to represent these processes.

It is not self-evident that resolved turbulent motions should be permitted within the grey zone (Zhou et al. 2014). Ching et al. (2014) investigate possible methods for suppressing this resolved motion, so that the model physics relies solely on the 1D PBL scheme throughout. This would negate some of the issues arising from representing each turbulent eddy with as little as one or two grid points. However, as the authors note, a drawback of this approach is that the assumptions used in Reynolds-averaged Navier–Stokes (RANS) modelling may no longer be valid at such small grid spacings. Additionally, there is evidence to suggest that resolving turbulent motions in the PBL might have succeeding effects on how larger-scale features develop, such as deep convection (Petch et al. 2002; Efstathiou et al. 2016).

A key consideration in grey-zone modelling is the scale of the atmospheric processes with respect to the grid spacing (along with the effective resolution needed to represent them). Many atmospheric processes may change in scale as a consequence of diurnal variation, large-scale forcing, thermal advection, etc., whereas the grid size tends to remain static. This implies that processes beginning their life cycle within a PBL scheme can enter the grey zone as they develop.

One such feature is the convective boundary layer (CBL) over land. The CBL has been the focus of many recent grey-zone studies (Beare 2014; Zhou et al. 2014; Ito et al. 2015; Shin and Hong 2015; Efstathiou et al. 2016; Honnert et al. 2016), and is of particular interest because the scale of the boundary-layer depth (\(z_{i}\)) changes during the transition from the shallow, stable, and fully-parametrized boundary layer (\(\textit{O}{[}10^{1}\,{\mathrm{m}}{]}\)) of the night-time (Beare et al. 2006a, b) to the deeper (\(\textit{O}{[}10^{3}\,{\mathrm{m}}{]}\)), thermally-driven CBL in the afternoon. Efstathiou et al. (2016) investigated the morning transition of the CBL for the Wangara experiment in Australia (Clarke et al. 1971) and also for a case from the Met Office site in Cardington, UK. Using horizontal resolutions in the range 50–800 m, they showed that mean profiles of potential temperature (\(\theta \)) had a tendency to become overly superadiabatic, with vertical mixing being increasingly delayed as resolution decreased. The time series of horizontally-averaged turbulence kinetic energy (TKE, Sect. 2.2) for these runs also showed an associated delay in the transition into resolved motion, with the coarser resolution runs being the last to exhibit resolved convection. (The onset of resolved convection in this context is often referred to as ‘spin-up’.)

Spin-up has also been linked to Rayleigh–Bénard thermal instability theory (Zhou et al. 2014). Simulations within the grey zone appear to need larger values of the Rayleigh number (Ra) as grid spacing coarsens before spin-up occurs, but there does exist a critical Rayleigh number (\(Ra_{c}\)) that is theoretically achievable in the grey zone, marking the onset of resolved turbulence. It follows from this that the \(\theta \) profiles might become more superadiabatic before \(Ra_{c}\) is reached, as with the cases in Efstathiou et al. (2016). However, the question still remains: what exactly governs the timing of spin-up, and can the transition be accelerated?

In this paper, we investigate the factors that influence the spin-up of CBL turbulence within the grey zone. The primary aim is to categorize and test model sensitivities, but we also offer some practical techniques that, as will be shown, have a marked influence on how early in the simulation the resolved turbulence can be encouraged to appear. Introducing pseudo-random perturbations into the \(\theta \) field is the key to this approach, and the impacts of arranging such perturbations in both time and space are investigated. In LES modelling, the choices of perturbation structure can affect certain fields, such as the initiation of deep convection (Stirling and Petch 2004). We investigate here how such perturbations affect the grey-zone boundary layer, and what implications this has. An important aspect of this approach is that the modifications are applied only to the resolved fields, allowing use of the method without forcing constraints on the choice of subgrid scheme.

We also make our own adjustments to the parametrization itself, in this case the 3D Smagorinsky scheme, highlighting the need to modify standard LES or PBL parametrizations when attempting to create an optimal grey-zone model configuration. The methods employed in designing the \(\theta \) perturbations and the subgrid-scheme modifications are described in Sect. 2. In Sect. 3, we investigate several approaches for applying the perturbations, and also the sensitivity of our modified parametrization. We make the argument that the most promising avenue towards finding the optimal method of inducing grey-zone spin-up, while preserving or even improving the overall evolution of the simulated CBL, is to customize both the resolved fields and the subgrid scheme in combination. We discuss our findings in Sect. 4.

2 Methods

2.1 Model Set-Up

We make use of the UK Met Office’s large-eddy simulation model, known as the Met-Office NERC Cloud-resolving model (MONC, Brown et al. 2015). The MONC model employs an identical scientific formulation to its predecessor, the Large-eddy model (Gray et al. 2001), with improved parallel capability. We have here configured the MONC model to simulate day 33 of the Wangara experiment, and since this is a widely studied CBL case (Yamada and Mellor 1975; André et al. 1978; Nakanishi et al. 2014), we do not repeat the evaluation of model performance at LES resolutions against observations taken during the campaign.

As a benchmark simulation, we use a LES run with a horizontal grid spacing of \(\varDelta x=\varDelta y=50\,\mathrm{m}\) and a vertical grid spacing of \(\varDelta z=20\,\mathrm{m}\) across a \(9.6\times 9.6\, {\mathrm{km}}^{2}\) domain. The simulation employs a Boussinesq approximation with the advection of momentum following the scheme of Piacsek and Williams (1970), and advection of heat by the total variation diminishing scheme of Leonard et al. (1993). The lower boundary is forced using time-varying sensible-heat-flux data from the campaign, with periodic flow in the horizontal, and an upper boundary set at a height of \(z=2500\,\mathrm{m}\). A relaxation term is applied above \(2000\,\mathrm{m}\) to inhibit gravity-wave formation. All simulations begin at 0900 local time (LT) and run until \(1800\,{\mathrm{LT}}\), with output taken at intervals of approximately \(100\,{\mathrm{s}}\). The initial profile of \(\theta \) is prescribed, and indicates the presence of an inversion at \(z_{i}=100\,\mathrm{m}\), which rises throughout the simulation. Horizontal wind-speed profiles have also been prescribed as per the observations, but wind speeds are generally low (\(< 5\,\mathrm{m}\,\mathrm{s}^{-1}\)), and so we concentrate here on thermally-driven circulations.

2.2 Grey-Zone Simulations

For the grey-zone runs, we use horizontal resolutions ranging from \(\varDelta x=200\,\mathrm{m}\) through to \(\varDelta x=800\,\mathrm{m}\), with at least \(24\times 24\) grid points in the horizontal and a grid spacing of approximately \(40\,\mathrm{m}\) in the vertical (63 levels). The vertical levels remain fixed, with slightly higher vertical resolution near the surface boundary, in common with many NWP model configurations.

Our grey-zone simulations make use of the 3D static Smagorinsky scheme (see Sect. 2.4). Countergradient correction terms are sometimes added in the grey zone to provide extra non-local transport (e.g. Boutle et al. 2014), but we do not make use of such terms here because we wish to measure the effect our methods have on the non-local transport provided to the system by the resolved fields. Another key limitation when using a 3D turbulence scheme in the grey zone is the absence of a cascade of turbulent energy through the inertial subrange. These limitations should always be kept in mind when studying grey-zone modelling.

Honnert et al. (2011) describe a method of ‘coarse-graining’ the LES model solution to approximate the desired grey-zone variances, leading to a useful benchmark in each TKE time series to compare with the grey-zone runs. The horizontally-averaged TKE, denoted here as \(\left\langle e \right\rangle \), is defined by:

$$\begin{aligned} {\left\langle e \right\rangle }=\frac{1}{2}\left( \overline{u^{\prime ^{2}}}+\overline{v^{\prime ^{2}}}+\overline{w^{\prime ^{2}}}\right) , \end{aligned}$$
(1)

where \(\left( u^{\prime },v^{\prime },w^{\prime }\right) \) are deviations from the mean state of the velocity components. The coarse-graining approach is based on taking a horizontal mean across bi-dimensional cells of several grid points in width, such that the variances in the LES model at 50-m resolution are reduced to match the desired grey-zone resolution. Although this approach is not without flaw, it is shown by Honnert et al. (2011) to be very useful in estimating the expected magnitude of resolved TKE in grey-zone runs. We therefore allow the coarse-grained fields to serve as an idealized result for such comparisons. Here we have created four coarse-grained time series of \(\left\langle e \right\rangle \) from the 50-m LES model: \(\varDelta x=200\,\mathrm{m}\), \(\varDelta x=400\,\mathrm{m}\), \(\varDelta x=600\,\mathrm{m}\) and \(\varDelta x=800\,\mathrm{m}\).

2.3 Structuring Perturbations in the Initial Fields

The traditional method of encouraging resolved turbulence to spin-up in a LES model is to impose artificial pseudo-random perturbations to the initial potential temperature and/or momentum fields. This is generally done at the lowest model levels, though some disparity exists in the literature as to the choices of perturbation. The consequences of selecting certain perturbation amplitudes and structures begin to have more relevance in the grey zone, and we will show that the effects of choosing such perturbations carefully have significant impacts on the resolved fields, at minimal computational expense. While many studies focus on the search for a parametrization scheme appropriate for the grey zone, here we investigate whether or not sensitivities exist in the resolved fields that may be useful in combination with an appropriate parametrization.

Some authors (Shin and Hong 2015; Shin and Dudhia 2016) choose the initially-imposed \(\theta \) perturbations (\(\theta ^{\prime }\)) from the range \({{-\,0.05}\,{\mathrm{K}}<\theta ^{\prime }<{0.05}\,{\mathrm{K}}}\) in the lowest four model levels. Others (Petch et al. 2002; Beare 2008) choose the range \({{-\,0.1}\,{\mathrm{K}}<\theta ^{\prime }<{0.1}\,{\mathrm{K}}}\) in the lowest 100–200 m of the domain. The largest range we find in the literature, \({{-\,0.5}\,{\mathrm{K}}<\theta ^{\prime }<{0.5}\,{\mathrm{K}}},\) has been applied up to various heights above ground level, including \(50\,\mathrm{m}\) (Nakanishi et al. 2014), \(200\,\mathrm{m}\) (Stirling and Petch 2004), \(500\,\mathrm{m}\) (Mirocha et al. 2014), the lowest model level only (Mason and Brown 1999; Brown et al. 2000), and \(2z_{i}/3\) (Muñoz-Esparza et al. 2014). So far, these authors have not explicitly explained their choice of perturbation structure, primarily because it is generally accepted that this choice does not seem to have much significance in the LES regime. These numbers apply to the CBL for the most part, though the perturbations are typically included for simulations of the stable boundary layer (SBL) as well (Beare et al. 2006b).

Muñoz-Esparza et al. (2014) have developed methods to impose these pseudo-random \(\theta ^{\prime }\) values in a structured way. Their best performing method, which they refer to as ‘cell perturbation’, was shown to decrease the necessary fetches from the edges of the domain for spin-up to occur when nesting high-resolution domains that pass through the grey zone to reach the inner nests. We have applied a similar principle to this approach by adding organization in the horizontal by means of cells, in which each cell contains a unique perturbation common to all grid points that lie within that cell. In testing the method, we focus on the time series of \(\left\langle e \right\rangle \) rather than the spatial approach taken by Muñoz-Esparza et al. (2014), and stress that our method differs from that of Muñoz-Esparza et al. (2014) in many respects.

We also test three other hypotheses regarding the structuring of the initial perturbations, to show their effect (if any) on spin-up in the grey zone. The four methods are:

  1. (1)

    Increasing the amplitude of the initial perturbation of potential temperature (Fig. 1a).

  2. (2)

    Our variation of the ‘cell perturbation’ method (hereafter denoted as CELL) using cells of \(4\times 4\) (CELL-4) and \(8\times 8\) (CELL-8) grid points (Fig. 1b).

  3. (3)

    Applying coherence to the vertical dimension, such that each grid-point perturbation is unique in the horizontal, but identical in the vertical (Fig. 1c).

  4. (4)

    Using mixed-layer scaling theory to structure the perturbation range (Fig. 1d).

Fig. 1
figure 1

3D visualization of the various methods of perturbing \(\theta \), representing adjacent model levels (at some arbitrary level n) in the lower boundary layer at \(t+0\). The methods are: (a) the default set-up (the SMAG2K and CNTL simulations differ only in the amplitude of \(\theta ^{\prime }\)); (b) the CELL method with cell sizes of \(4\times 4\) grid boxes; (c) the vertical coherence method; (d) the mixed-layer scaling method

Method 4 above is the most complex and requires further calculation, but we show below that this method may well be worth the extra computation. According to mixed-layer scaling theory, the values of \(\theta ^{\prime }\) in the CBL are thought to be largest at the surface and at the inversion, and Garratt (1994) explains how this can be quantified using the empirical relationship

$$\begin{aligned} \sigma _{\theta }^{2}=T_{*}^{2}\left[ 2\left( \dfrac{z}{z_{i}}\right) ^{\nicefrac {-2}{3}}\left( 1-\dfrac{z}{z_{i}}\right) ^{\nicefrac {4}{3}}+0.94\left( \dfrac{z}{z_{i}}\right) ^{\nicefrac {4}{3}}\left( 1-\dfrac{z}{z_{i}}\right) ^{\nicefrac {-2}{3}}\right] \end{aligned}$$
(2)

\(\text {for } z<z_i \text {,}\) where \(\sigma _{\theta }^{2}\) is the naturally occurring variance of \(\theta ^{\prime }\) as a function of \(z/z_{i}\). Here, \(T_{*}\) is defined by

$$\begin{aligned} T_{*}=\left[ \dfrac{\overline{\theta }\left( \overline{w^{\prime }\theta ^{\prime }}\right) _{0}^{2}}{gz_{i}}\right] ^{\nicefrac {1}{3}}, \end{aligned}$$
(3)

where \(\overline{\theta }\) is a reference temperature, \(\left( \overline{w^{\prime }\theta ^{\prime }}\right) _{0}\) is the kinematic heat flux at the surface, and g is the acceleration due to gravity. Knowledge of \(\sigma ^{2}\) in a uniform distribution allows for the calculation of its upper and lower limits \(\left[ a,b\right] \), since \(\sigma ^{2}=\left( b-a\right) ^{2}/12\), and therefore if we assume that \(\overline{\theta ^{\prime }}=0\), we can calculate the limits \(\left[ a,b\right] \) from which to source the random numbers at each vertical model level,

$$\begin{aligned} a= & {} -\sqrt{3\sigma _{\theta }^{2}} \end{aligned}$$
(4a)
$$\begin{aligned} b= & {} \sqrt{3\sigma _{\theta }^{2}}. \end{aligned}$$
(4b)

We have elected to assume a uniform distribution for \(\theta ^{\prime }\), rather than a Gaussian distribution, based on tests performed by Muñoz-Esparza et al. (2014). Preliminary runs have been performed to test how the system responds to perturbations in the vertical-velocity field (w), rather than \(\theta \), and these have shown comparable results. We have therefore chosen to perturb \(\theta \) only, given the dependence of w on \(\theta \) in the basic equation set.

We have also investigated the use of applying the perturbations in time. At pre-defined intervals, new random numbers are chosen and the perturbations are reapplied. The mixed-layer time scale (\(t_{*}\)) is our favoured choice of interval in which to inject the perturbations, because it is both physically tenable and responds well to testing. Here, \(t_{*}\) represents the time scale for the life cycle of a single CBL eddy and follows the relation

$$\begin{aligned} t_{*}=\dfrac{z_{i}}{w_{*}}, \end{aligned}$$
(5)

where \(w_{*}\) is the convective velocity scale, defined by

$$\begin{aligned} w_{*}=\left[ \dfrac{gz_{i}}{\overline{\theta }}\left( \overline{w^{\prime }\theta ^{\prime }}\right) _{0}\right] ^{\nicefrac {1}{3}}. \end{aligned}$$
(6)

Hence \(t_{*}\) increases as \(z_{i}\) increases, and the perturbations are implied less frequently as the CBL develops; here, \(t_{*}\) is employed as a domain-averaged value.

2.4 Diffusion in the Subgrid Scheme

The subgrid scheme in use with the MONC model is the 3D Smagorinsky scheme (Smagorinsky 1963; Brown et al. 1994), a parametrization that fundamentally relies on a constant to control the level of diffusion of the eddy energy. This Smagorinsky constant, usually denoted \(C_{S}\), is used to calculate the basic mixing length (\(\lambda \)), such that

$$\begin{aligned} \lambda =C_{S}\max (\varDelta x, \varDelta y). \end{aligned}$$
(7)

The basic mixing length undergoes a modification to accommodate near surface effects, which produces the neutral mixing length (l),

$$\begin{aligned} \dfrac{1}{l^{2}}=\dfrac{1}{\left( kz\right) ^{2}}+\dfrac{1}{\lambda ^{2}}, \end{aligned}$$
(8)

where k is the von Kármán constant. Eddy viscosity (\(\nu \)) and diffusivity (\(\nu _{h}\)) are then calculated using

$$\begin{aligned} \nu= & {} l^{2}\left| S_{ij}\right| f_{M}(Ri) \end{aligned}$$
(9a)
$$\begin{aligned} \nu _{h}= & {} l^{2}\left| S_{ij}\right| f_{H}(Ri), \end{aligned}$$
(9b)

where \(f_{M}\) and \(f_{H}\) are stability functions for momentum and heat, which depend on the Richardson number (Ri), and \({{\left| S_{ij}\right| }}\) is the modulus of the strain tensor defined by

$$\begin{aligned} S_{ij}=\dfrac{\partial u_{i}}{\partial x_{j}}+\dfrac{\partial u_{j}}{\partial x_{i}}. \end{aligned}$$
(10)

Finally, the subgrid-stress and buoyancy-flux terms in the full transport equations are calculated using

$$\begin{aligned} \overline{u_{i}^{\prime }u_{j}^{\prime }}= & {} -\nu S_{ij} \end{aligned}$$
(11a)
$$\begin{aligned} \overline{u_{j}^{\prime }\theta ^{\prime }}= & {} -{\nu _{h}\dfrac{\partial \overline{\theta }}{\partial x_{j}}}, \end{aligned}$$
(11b)

where the subscripts \(\left[ i, j\right] \) imply tensor notation.

Lilly (1967) originally defined the value of the Smagorinsky constant to be \(C_{S}\approx 0.17\) (assuming a value of \(\alpha =1.5\) for the Kolmogorov constant). However, this value is often adjusted for use in atmospheric models, typically requiring some tuning to best balance the signal-to-noise ratio after considering dissipation effects from the numerical discretization scheme and other model components. Both the MONC model and the Met Office Unified Model (MetUM) use a value of \(C_{S}=0.23\) by default, while the LES configuration of the Weather Research and Forecast (WRF) model has a default of \(C_{S}=0.18\). Beare (2014) and Efstathiou and Beare (2015) investigate some of the effects of changing \(C_{S}\) in grey-zone simulations, one of which was the dampening of resolved turbulence when \(C_{S}\) is increased. Lower values of \(C_{S}\) tended to allow for faster spin-up, but introduced noise in the resolved fields that inhibited the development of coherent structures in the fully-developed CBL, as well as allowing for excessive resolved eddy energy.

The value of \(C_S\) does not necessarily need to be constant throughout a simulation. Investigations into scale-dependence in LES modelling have lead to the development of dynamic formulations (Germano et al. 1991; Porté-Agel et al. 2000; Meyers and Sagaut 2006; Efstathiou et al. 2018). These dynamic models are based on the concept that, as a coefficient, \(C_{S}\) should exhibit a dependency on the quantity \(L/\varDelta \), where L is the dominant length scale and \(\varDelta \) is the filter width. For the case of the atmospheric CBL, this quantity is analogous to \(z_{i}/\varDelta x\) , which also happens to be the reciprocal of the grey-zone similarity variable proposed by Honnert et al. (2011). Wyngaard (2004) also explains the importance of the variable \(L/\varDelta \), stressing how fundamental this ratio is to grey-zone modelling. Also related is the similarity variable \(z_{i}/l_{d}\) from Beare (2014), where \(l_{d}\) is the dissipation length scale. Recent work by Efstathiou et al. (2018) has shown the utility of dynamic models in simulating the CBL, although this approach can be somewhat demanding computationally.

With these considerations in mind, we propose a coefficient \(C_{S}\) that adapts to the evolving CBL from the fully-parametrized state into that of partially-resolved turbulence during the morning transition. Our \(C_{S}\) value is constant across the domain but evolves in time alongside \(z_{i}\) such that

$$\begin{aligned} C_{S}=C_{S0} \tanh \left( {{\beta }}\dfrac{z_{i}}{\varDelta x}\right) , \end{aligned}$$
(12)

where \(C_{S0}\) is an equilibrium value in the interior of the fully-developed CBL, and \(\beta \) is some constant in the unit interval. The ideal value of \(\beta \) is likely to be model dependent (i.e. requires some tuning), again due to varying dissipation sources from other model components. We justify the hyperbolic tangent form of Eq. 12 by comparing it with \(C_{S}\) data taken from the Lagrangian-averaged scale-dependent dynamic (LASD) Smagorinsky model of Efstathiou et al. (2018) during the morning CBL development. Figure 2 shows how the vertical maximum of horizontally-averaged \(C_{S}\) data from the LASD model approximately traces a tanh shape, and we find that a best fit of these data using the functional form of Eq. 12 employs the parameters \(\beta =0.29\) and \(C_{S0}=0.19\). Since we are seeking a single, optimal value of \(C_{S}\) for the entire domain, the influence of the upper and lower boundaries must be minimized. This is the reason why the form of Eq. 12 has been based on the maximum \(C_{S}\) values from the LASD model. The figure also shows the form of Eq. 12 with \(\beta \) set to 1.0 and \(C_{S0}=0.23\), which we have found in our simulations to be a better choice when combining the method with those of Sect. 2.3. Also shown is the curve following \(\beta =0.6\), which we discuss in Sect. 3.3. In other models (e.g., the WRF model), we suggest that also using the default value of \(C_{S}\) for setting \(C_{S0}\) is probably the best starting point. The scope of our study includes the introduction of Eq. 12 as a proof of concept, but we acknowledge that more work needs to be done before claiming that the method is fully robust in its present form.

Fig. 2
figure 2

Prognostic values for \(C_{S}\) as described by Eq. 12. Also shown are \(C_{S}\) data taken from the LASD model (Efstathiou et al. 2018) as a basis for the tanh shape of the equation

Fig. 3
figure 3

Time series of \(\left\langle e \right\rangle \) at \(\varDelta x=400\,\mathrm{m}\) for (a) the CNTL400 run (solid red) with the spread of possibilities across a further 12 runs using altered seeds (grey shading). Also shown are the the coarse-grained fields (dashed green) and the outcome of not using any initial perturbations (solid blue); (b) SMAG2K run (solid black) showing the effect of imposing perturbations in the larger range of \({{-\,2}\,{\mathrm{K}}<\theta ^{\prime }<{2}\,{\mathrm{K}}}\) at \(t+0\). Note how the initially high energy value of the SMAG2K run is dissipated away almost immediately

3 Results

3.1 Initial-Condition Perturbations

During the simulated development of a nocturnal boundary layer, the variances in the resolved momentum fields inevitably tend to zero. This is because negative thermodynamic fluxes at the lower boundary no longer promote convective activity, and the model tends to a more laminar flow overnight (in the absence of mechanically-driven turbulence). The following morning, when positive surface heat fluxes return, the resolved fields have little small-scale variation. This lack of variation is not a problem in mesoscale models since all subsequent CBL turbulent transfer is represented by the parametrization scheme, as in the night-time SBL. In the grey zone, however, a lack of this variation produces difficulties in the transition from subgrid to partially-resolved flow. Figure 3 shows how this affects the evolution of the Wangara CBL. We will focus on a horizontal grid spacing of \(400\,\mathrm{m}\) throughout this section, since this resolution lies firmly within the grey zone, and exhibits a significant resistance to spin-up. In the figure, \(\left\langle e \right\rangle \) is calculated at the closest model level to \(z/z_{i}=0.5\).

The difference between omitting and including random perturbations at time \(t+0\) is immediately apparent, and it is clear from Fig. 3a that no resolved motion develops during the 9-h period in the unperturbed case. The perturbed case in Fig. 3a is based on a commonly-used LES set-up at the UK Met Office, in which the pseudo-random perturbations are drawn from the range \({{-\,0.1}\,{\mathrm{K}}<\theta ^{\prime }<{0.1}\,{\mathrm{K}}}\) up to a height of \(250\,\mathrm{m}\) above ground level. We use this as the point of reference (a control, hereafter CNTL) for subsequent configurations.

Pseudo-random numbers are typically generated using a seeding function, which for the MONC model is loosely based on the grid-point location and the processor allocation. We have run 12 additional unique simulations using the CNTL400 configuration, altering the random number seed in each, to examine when and by how much each run diverges when the random perturbations are different (grey shading in Fig. 3a). This technique has a similar principle to ensemble modelling. The spin-up times for each of these members are within 5 min of one another, implying that the random numbers themselves have little effect on the actual timing of convective onset. The runs then diverge after resolved convection is initiated, showing a spread of approximately \({0.2}\,\mathrm{m}^{2}\,\mathrm{s}^{-2}\) in the TKE amplitude. Each run does, however, follow the general pattern of the coarse-grained fields once a steady state is reached.

A strong peak in \(\left\langle e \right\rangle \) is apparent in the CNTL400 simulation, following the abrupt onset of resolved convection (at \(1330\,{\mathrm{LT}}\) in Fig. 3a). As the surface heat flux increases at the lower boundary, the lack of resolved motion causes a build-up of energy preceding convective onset, which corresponds with the achievement of the critical Rayleigh number (Zhou et al. 2014). At the point of spin-up, the TKE values tend to overshoot so that this energy can be released, sometimes followed by a slight oscillation before settling into the same pattern as the coarse-grained fields. This effect has probably been present in previous studies, but is only apparent here because we use a very high temporal resolution in our output (\({\approx }100\,\mathrm{s}\)). For CNTL400, the model takes more than half of the simulation time before the resolved turbulence can reach a steady state. It is therefore desirable to make modifications to induce spin-up at an earlier time, allowing the energy entering the system to be properly transported and diffused.

3.2 Inducing Spin-Up

An important practical consideration is to determine whether or not spin-up can be accelerated, thereby encouraging non-local transport and relaxing the excessively superadiabatic profiles of the CNTL run during the mid- to late-morning. In the previous section, we discussed the effects of varying the seed of the initial pseudo-random numbers, noting a negligible change in spin-up time. We now apply the methods outlined in Sect. 2.3, and analyze their effects on the timing of spin-up. For the purpose of comparison, we shall here define ‘spin-up’ as the point in time at which the magnitude of \(\left\langle e \right\rangle \) reaches \({0.1}\,\mathrm{m}^{2}\,\mathrm{s}^{-2}\). This is an arbitrary choice, but nonetheless useful for the comparison. A similar definition is used by Zhou et al. (2014), who define spin-up using \(\langle u_{i}^{\prime }u_{i}^{\prime }\rangle /w_{*}^{2}=0.1\).

Many of our early tests with the methods of vertical coherence, CELL, mixed-layer scaling, and customization of perturbation altitude/amplitude yielded quite similar results. Although initially organized, the variations in the resolved fields would tend to dissipate away, returning the \(\left\langle e \right\rangle \) field to near-zero despite the positive gradient in surface heat flux. This leads to our first key finding: perturbations imposed at the initial time have a tendency to be damped away by the subgrid scheme in the grey zone, in contrast with the LES model behaviour. We can visualize this by showing how even an unrealistically large perturbation range of \({{-\,2}\,{\mathrm{K}}<\theta ^{\prime }<{2}\,{\mathrm{K}}}\) behaves in the grey zone (SMAG2K, Fig. 3b). The resolved TKE in the SMAG2K run is initially very large, but this energy is quickly removed by the subgrid scheme within the first hour of the simulation. It follows from this that specific structures such as those of the CELL method lose their organization early on if the perturbations are only applied at the \(t+0\) timestep.

Although much of the organization is lost, the modification of the initial state of \(\theta \) does affect the resolved fields to a certain extent. We have found differences in spin-up time of up to 40 min in our preliminary tests, implying that despite the \(\left\langle e \right\rangle \) field tending to near-zero (\(\textit{O}{[}10^{-4}\,\mathrm{m}^{2}\,\mathrm{s}^{-2}{]}\) in Fig. 3b), there still exists a ‘memory’ between \(t+0\) and the time of spin-up. This is evident in Fig. 3b, in which the modified SMAG2K run exhibits a spin-up that is \({\approx }\,30\) min earlier than the CNTL400 run, despite a negligible amount of resolved energy being present just before spin-up.

Because of this, we find that accelerating the transition to resolved turbulence in the grey zone requires the implementation of the pseudo-random perturbation structures at intervals of time. The perturbations are applied every \(t_{*}\) and are then ceased entirely at \(t=15{,}000\,\mathrm{s}\), when the CBL is fully developed. A preliminary run using the same settings as the CNTL400 run (but applied every \(t_*\)) produced an acceleration in spin-up of 44 min, and following on from this, we sought to improve the result by implementing the methods from Sect. 2.3. Table 1 shows how each of these methods affects the timing of spin-up. Spin-up times shown in the table reflect how much earlier the resolved motions appear with respect to the CNTL400 simulation.

Table 1 Key simulations performed in this study

The CELL method (Fig. 1b) was applied for bi-dimensional cells of \(4\times 4\) (CELL-4) and \(8\times 8\) (CELL-8) grid points, with an acceleration in spin-up of 37 min and 24 min respectively. Overall, we find that perturbing \(\theta \) every \(t_{*}\) shows no improvement over imposing a unique \(\theta ^{\prime }\) value to each grid point every \(t_{*}\) (CELL-1). It is possible that numerical considerations are working against the physical basis of the method here, since the technique was originally designed for grid spacings of no more than \(\varDelta x=100\,\mathrm{m}\). The horizontal scale of the perturbations in the grey zone may simply be too large when multiple grid points are assigned to each cell, but using multiple grid points is generally necessary for satisfying the effective-resolution requirements of the model. On balance, it seems that in the grey zone, perturbing by cells is not effective; however, we have not been able to test every combination of cell size, domain size, and resolution, and therefore we cannot make this claim explicitly. We stress that this result does not call into question the underlying viability of Muñoz-Esparza’s approach, but for now, it does appear that perturbing \(\theta \) by cells is not a defensible approach at grey-zone resolutions.

The vertical coherence method (hereafter denoted VERTCOH, Fig. 1c) is based on the idea of creating disturbances in \(\theta ^{\prime }\) that are complementary along each vertical column, preventing perturbations at adjacent levels from cancelling each other, and establishing organization in the vertical. This method has also been employed by Stirling and Petch (2004). When the technique is applied up to a height of 250 m (for comparison with the CNTL400 simulation), spin-up occurs 1.4 h before the CNTL400 run, which is very favourable. We see no major drawback to this method, particularly since the amplitude of perturbations is so small (\(0.1\,\mathrm{K}\)), and the method is computationally inexpensive.

The earliest to spin-up of all the methods tested was the mixed-layer scaling method (hereafter MLS400, Fig. 1d), which allowed spin-up to occur 1.8 h ahead of the CNTL400 simulation. Although this method is slightly more complex than for the VERTCOH simulation, it is still relatively inexpensive since \(\sigma _{\theta }^{2}\) is calculated as a horizontal-average value at each level. MLS400 is also arguably more physical, since it follows mixed-layer theory closely, and uses \(z_{i}\) for the depth through which the perturbations are applied (rather than up to a fixed height, as with the other methods). We seek to generalize this approach in future work, but when doing so, we must stress the importance of the limits \(z\rightarrow 0\) and \(z\rightarrow z_{i}\) in Eq. 2, since \(\sigma _{\theta }^{2}\rightarrow \infty \) at these limits. The elevations of the grid points closest to \(z=0\) and \(z=z_{i}\) therefore become important with respect to the perturbation amplitude.

The question of the depth through which the perturbations should be applied is still an open one. We use a height of \(250\,\mathrm{m}\) in most of our simulations because this is a commonly used value in the MONC model (e.g. Efstathiou et al. 2016), but this height is somewhat arbitrary. As we have discussed in Sect. 2.3, it is clear that a consensus does not exist as to the choice of optimal depth, and it is unclear whether this depth should remain fixed or should evolve alongside the CBL. Nakanishi et al. (2014) show that there does exist TKE in the residual layer above \(z_{i}\) during Wangara (\(\textit{O}{[}10^{-2}\,\mathrm{m}^{2}\,\mathrm{s}^{-2}{]}\)). Some of our testing has revealed that adding perturbations up to \(2000\,\mathrm{m}\) above the surface (well above the inversion) has a significant impact on the time series of \(\left\langle e \right\rangle \). It is the authors’ opinion, however, that such an approach does not have enough physical grounds. This is another reason why we advocate the use of the mixed-layer scaling method; because the depth below which the perturbations should be applied is clear and physically based. We believe that the grey zone is inherently a problem that will always rely to some extent on numerical considerations and tuning, and that is exactly why we wish to take a physical approach wherever possible.

3.3 Modification of the Smagorinsky Scheme

We now show the results of applying Eq. 12 to the Wangara grey-zone simulations. These runs maintain the perturbation structure of the CNTL400 run, while introducing the new domain-wide \(C_{S}\) coefficient at each timestep. Overall, the runs appear to be strongly sensitive to, (i) the initial perturbation amplitude at \(t+0\), and (ii) the constant \(\beta \) in Eq. 12. Figure 4a shows the model’s reaction to using the values \(\beta =0.6\) and \(\beta =0.7\). Using \(\beta =0.7\) encourages spin-up to appear \({\approx }\,40\) min ahead of the CNTL400 simulation, with a further increase of \({\approx }\,40\) min for \(\beta =0.6\) (the \(\beta =0.6\) run is denoted by CsCo400 in Fig. 4a and Table 1). Tests at \(\beta <0.6\) (not shown) reveal an evolution that is more energetic than the coarse-grained fields, while \(\beta >0.7\) bears a strong similarity to the CNTL400 run. These values of \(\beta \) are probably specific to Wangara, and are unlikely to apply directly to the general case. However, the results do show the strong sensitivity of the model to this parameter.

Sensitivity tests of our simulations show that our \(C_{S}\) coefficient requires higher values of both \(\beta \) and \(C_{S0}\) than do the LASD model data shown in Fig. 2. This identifies a key difference between using a single value for \(C_{S}\) throughout the domain, rather than a dynamically calculated \(C_{S}\) at each grid point. We have performed a simulation using the parameters \(\beta =0.29\) and \(C_{S0}=0.19\) as in the LASD model data in Fig. 2 (not shown), but find the evolution to be noisy and overly energetic. Although it may be possible to generalize the calculation of \(\beta \) and \(C_{S0}\) in a future study, it is also very possible that these constants must simply be tuned to certain model configurations. A comprehensive investigation into the effects of such tuning is beyond the scope of this paper; and at this stage we offer these results as a proof of concept, rather than as a fully generalized technique in itself. However, dynamically adjusting \(C_{S}\) using our simple tanh relationship may be preferable from a pragmatic viewpoint because of the simplicity and faster run time the method affords.

Figure 4b presents a similar result to using \(\beta =0.6\) if the initial perturbation amplitude of the \(t+0\) initial state is increased to \({{-\,0.5}\,{\mathrm{K}}<\theta ^{\prime }<{0.5}\,{\mathrm{K}}}\) and \(\beta \) is set to 0.7. This highlights the interplay between the state of the resolved fields and the new \(C_{S}\) coefficient, even when \(\theta \) is modified only at the first timestep. We explore such combinations further in Sect. 3.4.

Fig. 4
figure 4

Time series of \(\left\langle e \right\rangle \) evolution for the new \(C_{S}\) coefficient, showing the effects of altering both the limits of \(\theta ^{\prime }\) at \(t+0\) and \(\beta \). The solid magenta in (a) is the CsCo400 run

Fig. 5
figure 5

Horizontal cross-sections of w at \(z/z_{i}=0.5\) for time \(t=3.5\,\mathrm{h}\): (a) CNTL400; (b) CsCo400; (c) CsCoMLS400; and (d) LES50 coarse-grained to \(400\,\mathrm{m}\)

Figure 5b shows how w responds to the changes in \(C_{S}\), compared to the CNTL400 run (Fig. 5a), visualized as a horizontal cross-section in the mid-CBL at \(t=3.5\,\mathrm{h}\). It is clear from Fig. 5b that although spin-up has occurred, the resolved fields are somewhat noisy and lacking in organized structures (Efstathiou and Beare 2015). Although these structures are somewhat unphysical, they do provide non-local transport to the system, improving the overall evolution of the mean \(\theta \) profiles (Fig. 6a). The \(\theta \) profiles in CsCo400 quickly become less superadiabatic after spin-up at around \(1200\,{\mathrm{LT}}\), which implies that the resolved fields are providing improved mixing than in the CNTL400 run. By 1315 LT, the CBL is well mixed and matches well with the LES model profile. Finally, in the fully-developed CBL at \(1500\,{\mathrm{LT}}\) and \(1730\,{\mathrm{LT}}\), both the CNTL400 and CsCo400 simulations are well mixed, but it appears that the earlier spin-up time of the CsCo400 run has allowed a better match with the LES model, as the CBL in CsCo400 is slightly warmer in the mid-CBL than in the CNTL400 run, and has a slightly larger value of \(z_{i}\). Overall, this highlights two important outcomes: firstly, resolved motions appear to be a very important component of the CBL system in the grey zone – indeed it is entirely necessary when used alongside a 3D scheme such as the static Smagorinsky – and we consider its presence to be preferable over damping away the resolved fields (Ching et al. 2014) and relying on a non-local countergradient term for non-local transport. Secondly, the benefits of encouraging spin-up are not limited to the hours preceding the resolved motion, but in fact, it appears that the entire system benefits, even in the later hours just ahead of the evening transition.

Fig. 6
figure 6

Mean vertical profiles of \(\theta \): (a) CsCo400 before spin-up (\(1100\,{\mathrm{LT}}\), \(1200\,{\mathrm{LT}}\)), just after spin-up (\(1315\,{\mathrm{LT}}\)), and for the fully-developed CBL (\(1500\,{\mathrm{LT}}\)). Using the modified \(C_{S}\) method (solid black) appears to accelerate mixing, allowing the inversion height to match the LES model profiles (dashed green) more closely than the CNTL400 run (dot-dashed red) for the fully-developed CBL. (b) Same as (a) but for CsCoMLS400, showing relaxation of the superadiabats/spin-up by \(1100\,{\mathrm{LT}}\), while remaining almost identical to CsCo400 by \(1500\,{\mathrm{LT}}\)

3.4 Implications Across the Grey Zone

Since the grey zone is a regime in which both subgrid schemes and resolved motions play a significant role, one might argue that finding an optimal grey-zone configuration should logically require a customization to both. This is indeed our finding. Combining the best performing of our perturbation structures, the mixed-layer scaling method, with the use of the new \(C_{S}\) coefficient gives the \(\left\langle e \right\rangle \) field that most closely matches (r.m.s. \(\hbox {error} = {0.052}\,\mathrm{m}^{2}\,\mathrm{s}^{-2}\), see Table 1) the coarse-grained fields at \(\varDelta x=400\,\mathrm{m}\) (designated CsCoMLS). Figure 7 shows the time series of \(\left\langle e\right\rangle \) using this combination across the grey zone from \(\varDelta x=200\,\mathrm{m}\) to \(\varDelta x=800\,\mathrm{m}\). At \(200\,\mathrm{m}\) (CNTL200), spin-up is delayed until \(t=2.6\,\mathrm{h}\), while the new method (CsCoMLS200) gives a similar spin-up timing and structure as the coarse-grained fields.

The \(400\,\mathrm{m}\) run also follows the coarse-grained fields well, particularly at first, with an acceleration in spin-up of \({\approx }\,2.6\,\mathrm{h}\). The large peak in \(\left\langle e \right\rangle \) present in CNTL400 at \(t\approx 4.5\,\mathrm{h}\) is absent, although the eddies in CsCoMLS400 do become slightly over-energetic around this time. Two noteworthy features are present at this point in the time series: firstly, every simulation, including the LES model, shows a sharp increase in TKE at this time. This is coincident with the time at which the developing CBL penetrates the residual layer formed the night before. This may have implications for factors like convective inhibition, and we speculate that there may be an influence (either directly or indirectly) of this on the TKE at \(z/z_i=0.5\). Secondly, the larger vertical grid spacing of the grey-zone run (compared to the LES model) could also have implications for the behaviour of the TKE profile, since grey-zone runs have been shown to be somewhat unpredictable in these profiles, particularly near the inversion (Beare 2014). Some combination of these effects could reasonably explain the excessive TKE at this time in the CsCoMLS400 run. In addition to a well-matched \(\left\langle e \right\rangle \) time series at \(\varDelta x=400\,\mathrm{m}\), we also find that CsCoMLS400 exhibits organized structures, and their development and evolution appears physical from the beginning (Fig. 5c). The cross-section shown in Fig. 5c also compares well to the structures and amplitude of the coarse-grained fields shown in Fig. 5d. Mean \(\theta \) profiles of CsCoMLS400 (Fig. 6b) are very similar to CsCo400, implying that the advantages of the modified \(C_{S}\) coefficient are still present, but with the added advantage of earlier spin-up.

Fig. 7
figure 7

Time series of \(\left\langle e \right\rangle \) for the resolutions: (a) \(\varDelta x=200\,\mathrm{m}\), (b) \(\varDelta x=400\,\mathrm{m}\), (c) \(\varDelta x=600\,\mathrm{m}\), and (d) \(\varDelta x=800\,\mathrm{m}\), using a combination of the mixed-layer scaling method and our modified \(C_{S}\) coefficient. In each plot, \(\beta =1.0\). Also shown are the 50-m LES run (coarse-grained to each of the resolutions) and the time series for each CNTL simulation. Note the differences in scale for each y-axis

At \(\varDelta x=600\,\mathrm{m}\) and \(\varDelta x=800\,\mathrm{m}\) (the CsCoMLS600 and CsCoMLS800 runs), a very significant change occurs in the resolved TKE time series compared to the CNTL600 and CNTL800 simulations. From approximately \(1330\,{\mathrm{LT}}\) onward, the \(\left\langle e \right\rangle \) values match well with the coarse-grained fields, exhibiting spin-up well in advance of the CNTL runs. In contrast, spin-up in the CNTL runs is delayed for the majority of the simulations, producing an insufficient amount of non-local transport.

We do note that the CsCoMLS600 and CsCoMLS800 runs are overly energetic in the first 2–3 hours. This would appear to be a by-product of the forcing we have applied, and indeed some of this TKE may well be a direct result of the imposed perturbations themselves, rather than naturally occurring turbulence. This begs the question of whether the amplitude of the \(\theta ^{\prime }\) perturbations should be scaled with the resolution. Although there is certainly a theoretical argument for such scaling, our testing of this has shown that the resolved fields do not evolve as favourably as in the CsCoMLS runs if the perturbations are scaled down. We conclude that allowing the full range of \(\theta ^{\prime }\) from Eq. 2 is a pragmatically preferable solution for achieving an optimal fit to the coarse-grained fields later in the simulation.

As a final note, we have recorded some run times to test the computational burden of the new methods, and find an increase in run time of \({\approx }\,\)10% for the CsCoMLS400 run compared to the CNTL400 run. However, this increase also includes the calculation of \(z_{i}\) at each timestep, which is not calculated by default in the MONC model (unlike other models, e.g. the WRF model).

3.5 Further Analysis of CsCoMLS400

The resolution \(\varDelta x=400\,\mathrm{m}\) is a useful test-bed for the techniques presented here since it lies firmly within the grey zone, which is why it has been a focus in previous sections. This sub-section presents a deeper analysis of the best performing of our methods in terms of improved spin-up time and agreement with the coarse-grained fields. CsCoMLS400 combines the use of our modified Smagorinsky coefficient with the mixed-layer scaling method of perturbing \(\theta \).

Two-dimensional spectra comparing the CNTL400 simulation with CsCoMLS400 are shown in Fig. 8. Since the CNTL400 run is unable to spin-up any turbulence in the resolved fields until near the halfway point of the run, the result is near-negligible values of \(S_{\overline{w^{\prime 2}}}\) at 1230 LT (Fig. 8a), while the CsCoMLS400 simulation is capable of developing spectra that are much closer to the LES model in magnitude. Naturally, much of the inertial subrange present in the LES model is absent in the grey-zone runs, and the point of departure from the ideal Kolmogorov \(k^{-5/3}\) law is quite close to the peak wavelength. Since the CsCoMLS400 simulation exhibits the correct shape and magnitude in the spectra, this strengthens our argument that the manner in which these fields are induced does indeed give rise to physically legitimate structures. Later, at \(t=6\,\mathrm{h}\) (Fig. 8b), the two simulations show little difference in shape and magnitude.

Figure 9 shows the partitioning of kinematic heat flux (\(\overline{w^{\prime }\theta ^{\prime }}\)) into resolved and subgrid components for the CNTL400 and CsCoMLS400 simulations, where the subgrid component is calculated using the buoyancy-flux term shown in Eq. 11b. Early in the simulation at \(t=3.5\,\mathrm{h}\), the Smagorinsky scheme in the CNTL400 run is not resolving any motion and there is no entrainment present, leading to a CBL depth that is \({\approx }\,{350}\,\mathrm{m}\) lower than that of LES50. The CsCoMLS400 run improves upon this by inducing resolved motion earlier, thus generating entrainment and non-local transport. The inversion height of CsCoMLS400 matches that of the LES50 run to within \({\approx }{30}\,{\mathrm{m}}\). Later, the CNTL400 run adjusts towards the LES model solution, after the resolved fields have spun-up, again highlighting the importance of inducing resolved turbulence, and doing so as early as possible. By 1500 LT, the LES model run (Fig. 9b), CNTL400 run (Fig. 9d) and CsCoMLS400 run (Fig. 9f) all agree well with each other, with the subgrid and resolved components of the grey-zone runs giving rise to a total-flux profile that matches well with the LES model.

Fig. 8
figure 8

2D normalized power spectra of vertical velocity as a function of the normalized horizontal wave number in the mid-CBL for (a) the CsCoMLS400 and CNTL400 simulations at \(t=3.5\,\mathrm{h}\) (1230 LT), and (b) the same at \(t=6\,\mathrm{h}\) (1500 LT). The LES50 run is also plotted in each frame. The \(k^{-5/3}\) Kolmogorov power law is plotted in grey

Fig. 9
figure 9

Vertical profiles of total kinematic heat flux (solid black), partitioned into subgrid (dot-dashed blue) and resolved (dashed red) components for, (a) LES50 at \(t=3.5\,\mathrm{h}\); (b) LES50 at \(t=6\,\mathrm{h}\); (c) CNTL400 at \(t=3.5\,\mathrm{h}\); (d) CNTL400 at \(t=6\,\mathrm{h}\); (e) CsCoMLS400 at \(t=3.5\,\mathrm{h}\); and (f) CsCoMLS400 at \(t=6\,\mathrm{h}\)

4 Discussion and Conclusions

Several grey-zone simulations using the UK Met Office MONC model have been performed to better understand the spin-up of partially-resolved turbulent motions. Firstly, to address the question of whether or not this resolved turbulence should be allowed in the grey zone at all, our results show that allowing such resolved turbulence provides much of the necessary non-local transport to complement the 3D static Smagorinsky scheme, even in the absence of the countergradient correction term that is characteristic of 1D PBL schemes. This is true for a range of \(z_{i}/\varDelta x\), although as the model resolution approaches the mesoscale limit, a need for such a non-local transport term does begin to arise. It is here that one might benefit most from blending schemes such as that of Boutle et al. (2014). Nonetheless, we have shown that the sooner the resolved motion can be established, the more non-local transport is provided to the system, and the closer the results become to those of a coarse-grained LES model. Based on this, we have investigated why there exists a delay before resolved turbulence appears and whether this delay can be reduced.

Our results suggest that the grey-zone CBL is highly sensitive to structured, pseudo-random perturbations applied to the \(\theta \) field. Without these perturbations, we observe no resolved turbulence in the model output at all. When applied to the initial state, the perturbations allow turbulence to spin-up with a delay that is proportional to the grid spacing. Since grey-zone grids are prone to producing grid-scale convection, the application of these perturbations (which is also done at the grid scale) becomes paramount to the development of a well-mixed and well-behaved CBL.

We suggest that the optimal grey-zone configuration consists not only of a customized parametrization, but in fact a customization to both the resolved and subgrid fields simultaneously. The logic in doing so lies in the fact that the grey zone itself can be defined by the coexistence of both resolved and subgrid components. We modify the Smagorinsky turbulence closure by reassigning the Smagorinsky constant as a coefficient with dependency on the variable \(z_{i}/\varDelta x\), so that the eddy diffusivity in the newly formed CBL becomes more sensitive to the sensible heat fluxes at the lower boundary. The importance of the relationship between \(z_{i}\) and \(\varDelta x\) is becoming increasingly apparent in grey-zone modelling, and here we base our modifications of the Smagorinsky scheme on this relationship.

At the same time, we apply pseudo-random perturbations to the \(\theta \) field at intervals of \(t_{*}\), thereby encouraging the natural variation of \(\theta \) within the CBL. We have also carried out trial simulations with the perturbations applied to w instead of \(\theta \), but preliminary results of these tests have shown that the impact of perturbing w is similar to perturbing \(\theta \). Various methods of organizing the perturbations have been tested, including increasing the perturbation amplitude, applying coherence in the vertical dimension, applying uniform perturbations to bi-dimensional cells in the horizontal, and employing mixed-layer scaling theory to selecting the perturbation amplitude at each vertical model level. The mixed-layer scaling method is shown to provide the best result, with spin-up greatly enhanced, even at resolutions approaching the mesoscale limit. Of the two modifications (i.e. to the parametrization and to the resolved fields), it is our opinion that modifying the resolved fields tends to play a more significant role in allowing faster spin-up and establishing a well-mixed CBL.

Although our method is shown to be useful at resolutions approaching the mesoscale limit, the question arises as to when one should assume that RANS modelling becomes valid, and switch from a 3D scheme to a 1D scheme. There do exist methods that blend 1D and 3D schemes (Boutle et al. 2014), but here we have not yet supplied a means of determining at what resolution the imposed \(\theta \) perturbations should cease. For Wangara, this limit would be in the vicinity of \(\varDelta x\approx 1000\,\mathrm{m}\), but generalizing the approach presented here to apply to any situation is beyond the scope of our work. In fact, it is as yet unclear whether a working generalized method of inducing spin-up (which applies to any CBL) is possible. We have shown that the grey zone exhibits strong sensitivities in how the resolved fields behave, and documenting every one of these sensitivities is simply not possible. However, in future we hope to apply our methods to real cases, such that inciting spin-up at an earlier time might have a positive impact on the entire atmospheric system in high-resolution NWP simulations. One such impact might be to encourage deep convection (Stirling and Petch 2004), since this process is fundamentally driven by the CBL.

Although operational NWP is, in general, still at the point of partially-resolving deep convection only (and not boundary-layer turbulence), there are regions of the world where these approaches may be practically useful. One such example is the Saharan atmospheric boundary layer, where the CBL grey zone can exist even at grid spacings as large as \(4\,{\mathrm{km}}\). In the future, as the realm of sub-kilometric resolutions becomes more commonplace in operational NWP, it will be more important than ever to understand the grey zone, and to identify which modifications can be applied to help with adapting to this partially-resolved regime.