Broadly speaking, there are two distincts, high-level classes of time-integration schemes for IBVP (4) such as those arising in NWP and climate models:
-
A.
Eulerian-based time-integration (EBTI), where spatial and temporal discretizations are viewed as independent from each other; and
-
B.
Lagrangian or path-based time-integration (PBTI), where space and time are solved together, or where temporal derivatives may be expressed as spatial derivatives.
EBTI and PBTI have different characteristics in terms of stability properties, accuracy, suitability to emerging hardware, etc. and their application in the context of NWP has been rather different in the past few decades, with PBTI methods, especially semi-implicit semi-Lagrangian schemes, as the method of choice in operational global NWP. Both EBTI and PBTI methods can be explicit (current time-level is calculated using information coming from the previous time-steps only) or implicit (current time-level is obtained by solving a nonlinear problem that uses information from the current time-step). In the following, we describe in more detail each of the two classes and, for each class, we outline the prevailing operational time-integration practices adopted by the main weather and climate centers (cf. Table 1). In particular, we focus on two EBTI time-integration methods, namely (i) split-explicit (SE) schemes and (ii) horizontally-explicit vertically-implicit (HEVI) schemes, and on the PBTI-based family of methods referred to as (iii) semi-implicit semi-Lagrangian schemes.
Eulerian-Based Time-Integration (EBTI)
EBTI schemes [84], also referred to as Method Of Lines-based schemes (MOL), recast the IBVP constituted by the set of prognostic PDEs describing the physical model (4), into two sequential problems, a semi-discrete boundary value problem (BVP), where the equations are discretized in space, and an initial value problem (IVP), where the spatially discretized equations are discretized in time via a suitable time-integration scheme, as depicted in Fig. 2.
In fact, the continuous system of PDEs (4), after being spatially-discretized, is reduced to a semi-discrete BVP that is formally a system of ordinary differential equations (ODEs)
$$ \frac{\text {d}\textsf {y}_{h}}{\text {d}t} = {\mathcal {R}}_{h}(\textsf {y}_{h}, t)$$
(5)
where \(\textsf {y}_{h}\) is the vector of spatially discretized prognostic variables and \({\mathcal {R}}_{h}\) is the spatially-discretized right-hand side. Equation (5) can be solved to a desired time accuracy at all spatial locations of the model domain
$$ \frac{\textsf {y}_{h}^{n+1} - \textsf {y}_{h}^{*}}{\alpha \varDelta t} = {\mathcal {R}}_{h}(\widetilde{\textsf {y}}_{h}),$$
(6)
where n indicates time-level \(t^{n}\), \(\varDelta t = (t^{n+1} - t^{n})\) is the time-step and the factor \(\alpha \) signifies the time-interval over which the temporal approximation is made. In addition, \(\textsf {y}_{h}^{*}\) and \(\widetilde{\textsf {y}}_{h}\) are combinations of model solutions. In particular, the first contains only known quantities (i.e., quantities from previous time-steps), while the second contains either quantities from previous time-steps only—explicit time-integration—or includes also future quantities—implicit time-integration. The right-hand side \({\mathcal {R}}_{h}(\textsf {y}_{h}, t)\) of Eq. (5), includes the spatially-discretized (nonlinear) advection term \((\textsf {u}_{h}\cdot \nabla _{h})\textsf {y}_{h}\) and terms describing wave propagation. The fastest of these terms cast the most severe restrictions regarding the time-step that can be adopted for explicit time-integration schemes. In particular, fast gravity and acoustic waves need to be handled appropriately to avoid time-steps that will be otherwise too small to be used in the context of operational global NWP and climate simulations.
EBTI schemes are conceptually simpler than PBTI and encapsulate two subcategories [67],
-
i.
multistage Runge–Kutta methods, that use multiple stages between two consecutive time-levels (or time-steps), discarding information from earlier time-steps; and
-
ii.
linear multistep methods, that use information from mulitple earlier time-steps.
These two subcategories can be effectively represented within the General Linear (GL) method proposed by Butcher in 1987 [15], a unifying framework for which there exist many reviews in the literature—see for instance [16, 51]—as well as several implementation strategies—see for example [102]. Both subcategories, multistage Runge–Kutta and linear multistep methods, can be fully-implicit (the current time-level is obtained by solving a nonlinear problem that uses information from the current time-step) and fully-explicit (the current time-level is calculated using information coming from the previous time-steps only). Representative classes of time-integration schemes embedded in the GL method consist of implicit multistep methods such as Adams–Moulton (AM) [22] and backward differentiation (BDF) methods [13, 20, 21], implicit multistage Runge–Kutta schemes such as diagonally (DIRK) and singly-diagonally (SDIRK) implicit Runge–Kutta schemes [3, 19, 59], explicit multistep methods, such as leapfrog and Adams–Bashforth methods [28, 43], explicit Runge–Kutta schemes, such as the fourth-order Runge–Kutta scheme [55] and partitioned methods, such as Implicit–Explicit (IMEX) schemes, whereby the operators are linearized in some fashion with—e.g., two Butcher tableaux, one explicit and one implicit [5, 40, 106].
While EBTI schemes are widely used in computational fluid dynamics, especially in the engineering sector [18, 52], their adoption in the weather and climate communities has been less widespread, with SE schemes [54, 88, 107] and horizontally-explicit vertically-implicit schemes [8, 40, 63]—i.e., schemes where the horizontal direction is treated explicitly and the vertical is treated implicitly–becoming more prominent but still confined mainly to research and limited-area models (with very few exceptions—see Table 1). Within this context, Eq. (5) is further expressed as
$$ \frac{\partial {\mathbf {y}}}{\partial t} = {\mathcal {R}}_{f}({\mathbf {y}}, t) + {\mathcal {R}}_{g}({\mathbf {y}}, t),$$
(7)
where \(\left| \left| {\mathcal {R}}_{f}\right| \right| \gg \left| \left| {\mathcal {R}}_{g}\right| \right| \) (by some norm). The difference in magnitude of \({\mathcal {R}}_{f}\) and \({\mathcal {R}}_{g}\) comes from two aspects:
-
1.
solutions to the continuous model (7) comprise fast and slow modes, i.e. \({\mathcal {R}}_{f}\) (for the fast modes) and \({\mathcal {R}}_{g}\) (for the slow modes) can describe processes that differ by orders of magnitude with respect to the time-scale of their propagation; and in addition,
-
2.
in the discretized model, as already highlighted, the grid-spacings used to resolve the horizontal and vertical directions are highly anisotropic (\(z_h \ll s_h\)), reflecting the different scales that characterize the important processes in each direction. Since the terms on the right-hand side of (7) include spatial gradients, then, if \({\mathcal {R}}_{f}\) represents contributions from the vertical direction and \({\mathcal {R}}_{g}\) from the horizontal, the mesh-anisotropy leads to a separation of scales between the two terms.
The separation of scales that arises between the vertical and horizontal directions is a key attribute for EBTI approaches in global NWP and climate simulations, as it motivates the use of different solution methods in the two directions. The “special-ness” of the vertical direction is further enhanced by the typical method of domain decomposition for parallelization of modern atmospheric models. The domain decomposition is limited to the horizontal dimension, with entire vertical columns preserved on each processor (based on the principle that important physical processes, such as radiative balance, act through the entire atmospheric depth, more or less in a pure vertical direction). EBTI approaches exploit the locality of the model data in the vertical direction in their choice of solution methods.
In the rest of this subsection, two EBTI approaches will be discussed—both are formally horizontally-explicit, vertically-implicit approaches, but the “split-explicit” approach, discussed in Sect. 3.1.1, adds an additional level of complexity in its use of sub-steps to handle the integration of fast processes. The “HEVI” approaches discussed in Sect. 3.1.2 highlight more recent developments, motivated by high-resolution global atmospheric models, which do not use sub-stepping.
Split-Explicit Schemes
Split-explicit schemes have been mainly used for high-resolution atmospheric simulations, where the horizontal mesh spacing is \(\le \mathcal {O}(10)\) km, thus being confined to local area models in the past few decades. One of the first SE (or “time-splitting”) approaches presented in the literature is in fact [57] (hereafter “KW78”). This was a limited-area cloud-resolving (grid-spacing of 1 km) atmospheric dynamical model, where advection, mixing and buoyancy were identified as the physically important processes in the system. The choice of compressible equations adopted however, meant that relatively fast acoustic waves were also present. Their system could be characterized by further extending (7) to
$$ \frac{\partial {\mathbf {y}}}{\partial t} = {\mathcal {R}}_{f_s}({\mathbf {y}}, t) + {\mathcal {R}}_{f_z}({\mathbf {y}}, t) + {\mathcal {R}}_{g}({\mathbf {y}}, t),$$
(8)
where \({\mathcal {R}}_{f_s}\) and \({\mathcal {R}}_{f_z}\) represent horizontally- and vertically-propagating fast waves respectively, and \({\mathcal {R}}_{g}\) represents the slower modes.
Under the SE approach, a scheme with the required accuracy over an appropriate time-step, \(\varDelta t\), is chosen to solve the physically important terms in \({\mathcal {R}}_{g}\)—KW78 used the 3-time-level leapfrog scheme. The fast terms are advanced on short sub-steps, \(\varDelta \tau \), such that \(\varDelta t=M\varDelta \tau \) (\(M>1\)), using simpler and cheaper schemes that guarantee stability, but sacrifice accuracy. For terms in \({\mathcal {R}}_{f_s}\), the single-step 2nd-order forward-backward scheme [68] was used, where the horizontally-propagating acoustic wave terms of the continuity equation are integrated forward in time and those of the momentum equation are integrated backward in time. For \({\mathcal {R}}_{f_z}\), the implicit trapezoidal (or Crank–Nicholson) scheme was used. This yields a tridiagonal system of equations for each vertical model column, which is computationally simple to compute. A pictorial representation of a time-integration step with the leapfrog-based SE approach used in KW78 is presented in Fig. 3a.
The SE approach gains efficiency by computing the contributions from \({\mathcal {R}}_{g}\) only on the longer time-step, \(\varDelta t\). The terms in \({\mathcal {R}}_{g}\) include advection and mixing terms that require a relatively large stencil of data and are therefore more computationally expensive. Meanwhile, the contributions from \({\mathcal {R}}_{f}\) (due to the fast waves) are computed every sub-step, \(\varDelta \tau \), but involve only immediate neighbour data-points to calculate local gradients. The latter aspect is particularly attractive for emerging computing technologies, given the reduced communication-to-flop ratio required, thus favoring co-processors, accelerators and many-core architectures.
Based on a stability analysis of the KW78 SE approach, [87] argued that greater efficiency could be gained by handling the buoyancy terms with the implicit scheme on the sub-step alongside the vertically-propagating acoustic waves. Under this approach, the longer time-step \(\varDelta t\) is limited by the maximum speed of advection. Meanwhile, the sub-step \(\varDelta \tau \) continues to be limited by the horizontally-propagating acoustic waves. In addition, it was found that the SE approach needs some damping in its formulation to ensure an acceptable stability region. Skamarock and Klemp [87] proposed a “divergence damping” term to filter the acoustic modes in their analysis of the leapfrog-based KW78 approach. More recently, [35] has demonstrated the importance of an isotropic application of the divergence damping to the acoustic waves. Alternatively, they proposed using simple off-centering for the sub-step implicit solver. Baldauf [6] includes a comprehensive stability analysis of RK-based SE approaches, where the free parameters associated with the various components of the method are optimised, in terms of accuracy and stability. In particular, optimal values are proposed for the off-centering of the implicit solution of the acoustic and buoyancy terms, and the magnitude of the divergence damping applied to the acoustic waves.
The precise integration schemes used in a SE approach are open to choice: simple, efficient methods for the fast components; and a method with good accuracy and an acceptable window of stability (in terms of time-step length) for the slow components. A SE approach has been adopted in a number of active research and operational limited-area atmospheric models: the JMA’s non-hydrostatic Mesoscale Model (JMA-NHM) continues to use a leapfrog-based approach for the long time-step integrations, but includes some low-order advection components in the short time-step computations to improve the computational stability [79, 80]. Other groups have moved towards single-time-level, multistage explicit Runge–Kutta schemes to integrate the slow components—both the COSMO [6, 7] and WRF [56, 88] models use a 3-stage 3rd-order Runge–Kutta (RK) scheme for integrating the slow components, retaining the forward–backward and trapezoidal schemes (previously described) for the fast components (following the analyses of RK methods for time-splitting in [108, 109]). Figure 3b illustrates the 3-stage RK-based approach.
With the recent developments of global models with “very” high resolution (grid-spacings \(\le \mathcal {O}\left( 10\right)\, \mathrm {km}\)), SE methods are now being used also for global NWP: the MPAS model [89] has adopted the SE approach (as described in [56]) based directly on the successful experiences with the 3-stage 3rd-order RK-based SE approach in WRF. The high resolution global non-hydrostatic model, NICAM, also uses a RK-based SE approach (with options for 2nd- or 3rd-order RK schemes) [82, 83].
Horizontally-Explicit Vertically-Implicit (HEVI) Schemes
Similarly to SE approaches, HEVI schemes are becoming more and more attractive due to the latest advancements in computing that are driving the development of “very” high-resolution global NWP models. For global NWP, the stratosphere plays a significant role in the global circulation [48, 50, 71]. Inclusion of a well-represented stratosphere has implications for the chosen time-integration methods, since the stratospheric polar jet (which contributes via the advection term) reaches speeds exceeding \(100\;\mathrm {m\;s}^{-1}\), i.e., the advective Courant number approaches the acoustic one. As highlighted in [34], in this context the efficiency gains from the SE approach become less clear: the horizontal splitting (which defines the sub-stepping) in SE schemes is only relevant when there is a scale-separation between fast insignificant and slow significant processes. With the acoustic and advective Courant numbers being similar, the sub-step \(\varDelta \tau \) and the model time-step \(\varDelta t\) are constrained by similar stability limits and little efficiency can be gained from sub-stepping. In addition, as already noted, SE models require artificial damping to ensure stabilization, with atmospheric models typically employing divergence damping (see, e.g., [6]). HEVI-based alternatives can be efficiently used for global non-hydrostatic equation sets and do not have the drawbacks affecting SE schemes.
Consider again (7), describing the atmosphere as containing contributions from two scale-separated processes:
$$\begin{aligned} \frac{\partial {\mathbf {y}}}{\partial t}={\mathcal {R}}_{g}(t,{\mathbf {y}})+ {\mathcal {R}}_{f}(t,{\mathbf {y}}), \;\; \left| \left| {\mathcal {R}}_{f}\right| \right| \gg \left| \left| {\mathcal {R}}_{g}\right| \right| , \end{aligned}$$
where, for global atmospheric models, the scale-separation occurs due to the order-of-magnitude difference in grid-spacings in the horizontal and vertical directions, that is \(z_{h} \ll s_{h}\). In this context, \({\mathcal {R}}_{f}\) contains vertically-propagating processes and \({\mathcal {R}}_{g}\) contains horizontally-propagating terms. The problem naturally lends itself to an IMEX (Implicit–Explicit) approach, examples of which have been widely analyzed in the literature for use in very stiff diffusion-dominated (parabolic) systems. For the atmospheric (hyperbolic) system, IMEX schemes have only recently been analyzed in the context of HEVI solutions. The analyses have tended to focus on (single time-step) Runge–Kutta (RK) based approaches [8, 23, 40, 63, 100, 106], which inherently avoid any problems associated with the computational modes that derive from multi-step methods.
A \(\nu \)-stage IMEX-RK method can be expressed by a so-called “double Butcher tableau”:
where \(\tilde{c}_i=\sum _{j=1}^{\nu }\tilde{\alpha }_{ij}\) and \(c_i=\sum _{j=1}^{\nu }{\alpha }_{ij}\), and \(\sum \tilde{b}_j =1\) and \(\sum b_j = 1\). Applied to the atmospheric system of interest, subject to a HEVI-based discretization, this notation leads to:
$$\begin{aligned} {\mathbf {Y}}^{(j)}=&\;{\mathbf {y}}^n+\varDelta t \sum _{\ell =1}^{j-1}\tilde{\alpha }_{j\ell }{\mathcal {R}}_{g}\left( t^n+\tilde{c}_{j} \varDelta t,{\mathbf {Y}}^{(\ell )}\right) + \sum _{\ell =1}^{j}\alpha _{j\ell }{\mathcal {R}}_{f}\left( t^n+ c_{j}\varDelta t,{\mathbf {Y}}^{(\ell )}\right) , \end{aligned}$$
(9)
$$\begin{aligned} {\mathbf {y}}^{n+1}=&\;{\mathbf {y}}^{n} +\varDelta t \sum _{j=1}^{\nu }\tilde{b}_{j}{\mathcal {R}}_{g}\left( t^{n}+\tilde{c}_j\varDelta t, {\mathbf {Y}}^{(j)}\right) + \sum _{j=1}^{\nu }b_{j}{\mathcal {R}}_{f}\left( t^n + c_j\varDelta t,{\mathbf {Y}}^{(j)}\right) , \end{aligned}$$
(10)
where the explicit scheme is used to integrate the horizontally-propagating terms in \({\mathcal {R}}_{g}\), and the implicit scheme is used for the vertically-propagating terms in \({\mathcal {R}}_{f}\).
Similar to the SE approach, the aim is to optimize the computational cost by selecting a relatively cheap (due to its local nature) but appropriately accurate (say, 3rd order) conditionally stable explicit scheme, which places a limit on the time-step \(\varDelta t\); and a less accurate (say, 2nd order) unconditionally stable implicit scheme to handle the large Courant number vertical processes. The loss of accuracy in computing the vertical processes is offset by their lesser physical importance. As for the SE case, the implicit problem is also cheap (e.g., a tridiagonal system), since vertical grid columns remain complete on each compute node.
Expressing the RK-HEVI approach as a double Butcher tableau makes it efficient to explore many alternative combinations of schemes—both through linear analyses [23, 40, 63] and numerical simulations [23, 40, 106]. The analyses focus on the accuracy and stability implied for components of simple linear systems (acoustic and gravity waves, and advection); and the performance of numerical implementations for idealised (dry) atmospheric tests, often compared to solutions from a very high-resolution high-order explicit RK method. Substantially different approaches have been recommended: some completely splitting the vertical and horizontal integrations using Strang-type splitting [8, 97, 100]; others proposing schemes that keep the vertical and horizontal solutions balanced in time, by integrating over the same time-interval, at each predictor-stage [40, 63, 106]. In keeping with the semi-implicit approach (Sect. 3.2), [106] stresses the importance of ending the integration with a stage that includes an implicit integration, thereby ensuring a balanced final solution.
Colavolpe et al. [23] proposes a further extension to the double Butcher tableau approach—a quadruple Butcher tableau, whereby the horizontal pressure-gradient and divergence terms are treated separately to the horizontal advection. Under their scheme, all four solutions are balanced in time at each predictor-stage, but in addition, a forward–backward type operation is introduced for the pressure-gradient and divergence terms (based on [68]) that alternates the forward/backward operations for the windspeed/pressure solutions at two predictor stages. They demonstrate that the additional splitting brings greater stability and accuracy at no extra computational cost.
Only one operational global NWP model but a number of global non-hydrostatic research models (cf. Table 1) have adopted HEVI time-integration methods. This trend seems to indicate that HEVI schemes are being considered as a valuable alternative to more commonly adopted time-integration strategies (such as the semi-implicit semi-Lagrangian method), especially for high-resolution models.
Path-based time-integration (PBTI)
PBTI schemes known also as Lagrangian methods, solve the original IBVP simultaneously in space and time without separating the BVP from the IVP. The PDEs in this case are seen as physical constraints on the path that can be followed to connect two states in the four-dimensional time-space continuum [93], as depicted in Fig. 4. The connection between two arbitrary states is obtained through a trajectory integral applied to the equation
$$\begin{aligned} \overbrace{\frac{\partial {\mathbf {y}}}{\partial t} + ({\mathbf {u}}\cdot \nabla ) {\mathbf {y}}}^{\frac{\text {D}{\mathbf {y}}}{\text {D}t}} = {\mathcal {R}}({\mathbf {y}}, t). \end{aligned}$$
(11)
In Eq. (11), the advective term (where \({\mathbf {u}}\) is the advection velocity that can eventually depend on \({\mathbf {y}}\)) on the left-hand side is absorbed into the material (or path) derivative \(\frac{\text {D}{\mathbf {y}}}{\text {D}t}\) and the right-hand side \({\mathcal {R}}\) consists of forcing terms, e.g., the pressure-gradient term, the Coriolis terms and the source terms arising from the parametrization of the sub-grid physical processes in an NWP model.
The trajectory integral of \({\mathcal {R}}\) can be approximated using a weighted average of the integrand values from the physical space at a past time \(t_{\text {D}}\), the “departure time” where the state of the system is known, and the values from the physical space at a future time \(t_{\text {A}}\), the “arrival time”, where the solution is sought. In the PBTI approach when the approximation solely relies on the past values, the integration scheme is explicit, while when the approximation depends on the future (unknown) values of the integrand \({\mathcal {R}}\), the integration scheme is implicit. In the case of explicit integration schemes, the solution of the system is fairly straightforward but the approximation can become numerically unstable, for time-steps exceeding the Eulerian CFL condition of the fast processes, leading to failure of the simulation. On the other hand, in the case of implicit integration schemes, such as the commonly used trapezoidal rule (semi-implicit Crank–Nicholson), the approximation is guaranteed to be unconditionally stable but the resulting system of equations, usually in the form of a BVP, becomes more complicated and its numerical solution more difficult.
PBTI techniques have been very successful in NWP (as indicated in Table 1 with adoption of the technique as late as 2014). The most common PBTI strategy employed in NWP is the Semi-Lagrangian (SL) scheme that revolutionized the field two decades ago [94, 98]. Pure Lagrangian approaches, where the exact solution at the next time step is sought by translating the flow information on the mesh at the current time level along the trajectory integrals, with remapping only for postprocessing purposes, have never been adopted into operational NWP models. This has been mainly due to the initial mesh being significantly deformed in a few time-steps which results in large spatial truncation errors. Having a mesh with vertically aligned grid-points is essential for resolving complex diabatic processes and vertically-propagating gravity waves in a NWP model. In contrast to the pure Lagrangian approach, there is no mesh deformation in a semi-Lagrangian scheme. The reason is that backward trajectories are calculated at each time-step; these end at the model grid-points while they start from locations between mesh grid-points that must be determined.
More recently, forward-in-time finite volume (FTFV) integrators, that can be written in a congruent manner as the SL scheme, have also emerged [91] with applications in NWP and climate. Furthermore, vertical Lagrangian coordinates have been successfully applied in hydrostatic models [46].
The semi-implicit semi-Lagrangian scheme (SISL)
The semi-Lagrangian (SL) method [74, 94] is an unconditionally stable scheme for solving the generic transport equation
$$\begin{aligned} \frac{\text {D}y}{\text {D}t} = \text {S}, \qquad \frac{\text {D}}{\text {D}t}=\frac{\partial }{\partial t} + {\mathbf {u}} \cdot \nabla ,\quad {\mathbf {u}}=(u,v,w) \end{aligned}$$
(12)
where \({\mathbf {u}}\) denotes a wind vector, y a transported variable and S a source term. Beyond stability, an additional strength of the SL numerical technique is that it exhibits very good phase speeds and little numerical dispersion (see, e.g., [38, 94]). Because of these properties, SL solvers can integrate the prognostic equation sets of atmospheric models stably with long time-steps at Courant numbers much larger than unity, without distorting the important atmospheric Rossby waves. When a SL scheme is coupled with a semi-implicit (SI) time discretization, long time-steps can be used in realistic atmospheric flow conditions where a multitude of fast and slow processes coexist. In semi-implicit semi-Lagrangian (SISL) schemes the high-speed gravity waves associated with high-frequency fluctuations in the wind divergence are mitigated, where the terms responsible for the gravity waves are identified and treated in an implicit manner, thereby slowing down the fastest gravity waves.
The SISL approach is currently the most popular option for operational global NWP models while it is also often used in limited area modeling. As shown in Table 1 the vast majority of the listed global NWP centers are using a model with a SISL dynamical core. A typical example is the ECMWF forecast model IFS, which has used a SISL approach since 1991. As discussed in [86], the change from Eulerian to semi-Lagrangian numerics improved the efficiency of IFS by a factor of six thus enabling a significant resolution upgrade at that time. Since 1991, further successful upgrades followed and currently the (high resolution) global forecast model is run at 9 km resolution in grid-point space, to this date the highest in the world.
To explain how a SISL method works we shall write the prognostic equations of the atmosphere in the compact form:
$$ \frac{\text {D}{\mathbf {y}}}{\text {D}t}={\mathcal {R}}({\mathbf {y}}),$$
(13)
where \({\mathbf {y}}=(y_{i})\), \(i=1,2,\ldots ,N\) is a vector of N three-dimensional prognostic scalar fields \(y_i\) (such as the wind components, temperature, density, water vapour and other tracers) and \({\mathcal {R}}=(R_{i})\) is the corresponding forcing term. Integrating (13) along a trajectory, which starts at a point in space D, the departure point, and terminates at a point in space A, the arrival point
$$\begin{aligned} \frac{{\mathbf {y}}^{t+\varDelta t}_{\text {A}}-{\mathbf {y}}^{t}_{\text {D}}}{\varDelta t}=\int _{t}^{t+\varDelta t}{\mathcal {R}}\left( {\mathbf {y}}(t) \right) \text {d}t \end{aligned}$$
(14)
and approximating the right-hand side integral using the second order trapezoidal scheme yields the following SISL discretization
$$\begin{aligned} \frac{{\mathbf {y}}^{t+\varDelta t}_{\text {A}} - {\mathbf {y}}^t_{\text {D}}}{\varDelta t} = \frac{1}{2} \left( {\mathcal {R}}^{t}_{\text {D}} + {\mathcal {R}}^{t+\varDelta t}_{\text {A}} \right) . \end{aligned}$$
(15)
In any SISL scheme there are three crucial steps that influence the numerical properties of the discretization, namely (i) the calculation of the departure point locations and the related interpolation of the prognostic variables at these points, (ii) the semi-implicit time discretization of the nonlinear forcing terms and (iii) the solution of the final semi-implicit system reduced in the form of a Helmholtz elliptic equation. We detail each of these steps in the following.
- (i):
-
SL advection and calculation of the departure points All operational SL codes work “backwards” in the sense that at a given discrete point in time t and with a model time-step of \(\varDelta t\) an air-pracel will start from a point in space between grid-points and will terminate at a given mesh grid-point. The latter are called “arrival points” and coincide with the model mesh grid points while the former are called “departure points” and they must be found as they are not known a priori. There is a unique departure point associated with each grid-point to be computed (this assumes that characteristics do not intersect, i.e., no discontinuities are permitted). Therefore, for a simple passive scalar advection of a generic field y without forcing, the solution at a new time step is:
$$ y^{t+\varDelta t}_{\text {A}} = y_{\text {D}}^{t}.$$
(16)
This means that to compute the field y values at the new time-step \(t+\varDelta t\), it suffices to compute a departure point “D” for each model grid point and then interpolate the transported field y at these departure points. The interpolation method uses the known y-values at time t, at a set of grid points nearest to “D”; the number and location of these grid points depend on the order of interpolation method used. For the more general problem (13), the forcing terms should also be interpolated at the departure point. To compute the location of the departure points the following trajectory equation must be solved:
$$ \frac{\text {D}\mathbf {r}}{\text {D}t}={\mathbf {u}}(\mathbf {r},t), $$
(17)
where \(\mathbf {r}\) denotes the coordinates of a moving fluid parcel, for example \(\mathbf {r}=(x,y,z)\) if a Cartesian system is used. By integrating Eq. (17), we obtain:
$$ \mathbf {r}_{\text {A}} - \mathbf {r}_{\text {D}} = \int _{t}^{t+\varDelta t}{\mathbf {u}}(\mathbf {r},t)\text {d}t.$$
(18)
The right-hand side integral of (18) is usually approximated using a 2nd order scheme such as the midpoint rule, resulting in an implicit equation of the form
$$ \mathbf {r}-\mathbf {r}_{\text {D}} = \varDelta t \,{\mathbf {u}}\left( \frac{\mathbf {r}+\mathbf {r}_{\text {D}}}{2},t+\frac{\varDelta t}{2}\right),$$
(19)
which is solved iteratively (for details see [27]). The accuracy with which the departure points are computed influences greatly the overall accuracy of the model as shown in [27].
In addition, the method employed to interpolate the terms of Eq. (13) to the departure points has also important implications in the model accuracy. From this perspective, it is common practice in operational SISL models to use a cubic interpolation formula most often based on tri-cubic Lagrange interpolation followed by formulae based on cubic Hermite or cubic spline polynomials. The interpolation is directional, i.e., it is performed separately in each of the three spatial coordinates. There is an intriguing interplay between the spatial and time truncation error in the SL advection method. Following the convergence analysis in [31], verified experimentally in [112] using the Navier–Stokes system, the leading order truncation error term for a SL method solving a 1D constant wind advection equation with an interpolation formula of order p on a grid with constant spacing \(\varDelta x\) and a time-integration method for the departure point of order k with time-step \(\varDelta t\) is \(\mathcal {O}(\varDelta t^k + \varDelta x^{p+1}/\varDelta t)\). This suggests that reducing the time-step only without refining the mesh resolution may not improve the overall solution accuracy as it increases the contribution from the error term which has \(\varDelta t\) in the denominator. However, with a shorter time-step the accuracy of the departure point calculation improves and a higher order interpolation scheme improves the accuracy of spatial structures such as waves [29].
- (ii):
-
Semi-implicit time-discretization of forcing terms Eq. (15) is expensive and complex to solve due to its large dimension, its implicitness and in general its nonlinear form (right hand-side \({\mathcal {R}}\) includes nonlinear terms). For this reason, an approach commonly used in NWP is to extract fast terms from the right-hand side and linearise them around a constant reference profile. For example, in the IFS model the right-hand forcing term is split as follows:
$$ {\mathcal {R}}=\mathcal {N}+\mathcal {L} $$
where \(\mathcal {L}\) contains the linear and linearised fast terms which are integrated implicitly and \(\mathcal {N}\) the remaining nonlinear terms \(\mathcal {N}={\mathcal {R}} - \mathcal {L}\) which are integrated explicitly. A two-time-level second order SISL discretization of (15) can be written as follows:
$$\begin{aligned} \frac{{\mathbf {y}}^{t+\varDelta t}_{\text {A}}-{\mathbf {y}}^t_{\text {D}}}{\varDelta t}=\frac{1}{2}\left( \mathcal {L}^t_{\text {D}} + \mathcal {L}^{t+\varDelta t}_{\text {A}}\right) + \frac{1}{2} \left( \mathcal {N}^{t+\varDelta t/2}_{\text {D}} + \mathcal {N}^{t+\varDelta t/2}_{\text {A}} \right) . \end{aligned}$$
(20)
The slowly varying nonlinear terms at \(t+\varDelta t/2\) can be “safely” approximated by a second order extrapolation formula such as
$$ \mathcal {N}^{t+\varDelta t/2} =\frac{3}{2} \mathcal {N}^t - \frac{1}{2}\mathcal {N}^{t-\varDelta t}$$
or alternatives such as SETTLS [49], which are less prone to generate numerical noise. The latter aspect—i.e., the numerical noise issue—is particularly relevant in the stratosphere where large vertically stable areas occur and any small scale oscillations appearing due to the three time-level form of the extrapolation formula used may be amplified. Using “iterative semi-implicit” schemes [11, 26, 111], in which a future model state is predicted with 2-iterations (or more) with the first serving as a predictor and the second as a corrector, is the most effective method for solving the noise issue. However, it is more costly due to its iterative nature.
There is considerable variation in the implementation of the SI time-stepping by different models. An alternative, iterative, approach to the SI method (20) is followed by the UK Met Office (UKMO) Unified Model, where there is no separate treatment between linear and nonlinear terms. Here, the standard off-centerd semi-implicit discretization is used:
$$\begin{aligned} \frac{{\mathbf {y}}^{t+\varDelta t}_{\text {A}}-{\mathbf {y}}^t_{\text {D}}}{\varDelta t}=(1-\alpha ){\mathcal {R}}^t_{\text {D}} +\alpha {\mathcal {R}}^{t+\varDelta t}_{\text {A}}. \end{aligned}$$
(21)
The weight \(\alpha \) is either 0.5 or slightly larger to avoid non-physical numerical oscillations (noise) which may arise due to spurious orographic resonance [76]. To tackle the implicitness of Eq. (21) an iterative method with an outer and inner loop is used. This functions as a predictor-corrector two-time-level scheme. As stated in [66], in the outer iteration loop, the departure-point locations are updated using the latest available estimates of the winds at the next time step. In the inner loop the nonlinear terms, together with the Coriolis terms, are evaluated using estimates of the prognostic variables obtained at the previous iteration. Further details on the iterative approach followed by the UM can be found in [66, 111]. Iterative SI schemes are expensive algorithms, however, they are used by most non-hydrostatic SISL global models as in practice the cheaper non-iterative schemes based on time-extrapolation become unstable when long time steps are used.
- (iii):
-
Helmholtz solver Once the right-hand side of (20) has been evaluated the semi-implicit system can be solved. To avoid solving simultaneously all implicit equations in (20), it is common practice to derive a Helmholtz equation from these. The form of the Helmholtz equation depends on the type of space discretization. In spectral transform methods, such as the one used in IFS [104], the specific form of the semi-implicit system is derived from subtracting a system of equations linearised around a horizontally homogeneous reference state. The solution of this system is greatly accelerated by the separation of the horizontal and the vertical part, which matches the large anisotropy of horizontal to vertical grid dimensions prevalent in atmospheric models. In spectral transform methods, one uses the special property of the horizontal Laplacian operator in spectral space on the sphere
$$ \nabla ^2\psi _n^m = {n(n+1) \over a^2} \psi _n^m,$$
(22)
where \(\psi \) symbolises a prognostic variable, a is the Earth radius, and (n, m) are the total and zonal wavenumbers of the spectral discretization [104]. This conveniently transforms the 3D Helmholtz problem into a 2D matrix operator inversion with dimension of the vertical levels only, resulting in a very cheap direct solve [75]. Even in the non-hydrostatic context, formulated in mass-based vertical coordinates [60], only the solution of essentially two coupled Helmholtz problems allow the reduction of the system in a similar way [12, 14, 115]. This technique requires a transformation from grid-point space to spectral space and vice versa at each time-step, an aspect that increases the associated computational cost although the spectral space computations are based on FFTs and matrix–matrix multiplications that are well suited for modern computing architectures. One disadvantage of this technique is the need for a somewhat simple reference state that does not allow, by definition, the inclusion of horizontal variability (as would be desirable for terms involving orography). The relaxation of this constraint and some alternatives are discussed, for example, in [17].
For grid point models using finite differences, such as the UKMO Unified Model, a variable coefficient 3D Helmholtz problem is solved using an iterative Krylov subspace linear solver (e.g., BiGCstab, GCR(k), GMRES etc.) [111]. This type of solver is generally more expensive despite grid point models not requiring transformations from spectral to grid point space and vice versa, which offsets some of the extra cost. However, typically up to 80 percent of computations are spent in the solver in grid-point based semi-implicit methods, compared to 10–40% in spectral transforms (depending on the resolution and the number of MPI communications involved) on today’s high performance computing (HPC) architectures. For emerging and future architectures, that may heavily penalize global communication patterns moving to high throughput capabilities, this is a serious concern that needs to be properly investigated and addressed.
Summary
In this section, we categorized time-integration schemes into two classes Eulerian-based, EBTI, and path-based, PBTI. The former discretizes the original PDE problem in space first, thus obtaining an ODE, and subsequently in time through a suitable time-integration strategy. The PBTI class, instead, solves the PDE problem in a single step, where the advection term is adsorbed into the path (or material) derivative and the right-hand side is formed by forcing terms only. In this case, the system of PDEs can be seen as a physical constraint on the path that can be followed to link two states in the four-dimensional continuum constituted by space and time.
For each of the two categories, EBTI and PBTI, we outlined the most prominent time-integration schemes adopted in the NWP and climate communities. In particular, SE and HEVI for the EBTI class, and the SISL approach for the PBTI class. The latter was the most widely adopted in the past few decades thanks to the hydrostatic approximation ubiquitously used in global weather and climate models. Indeed, PBTI strategies and SISL in particular, were extremely competitive given the large time-steps they allow and their extremely favorable dispersion properties, which yield the correct representation of wave-like solutions—e.g., Rossby and gravity waves. EBTI strategies are instead now emerging as a potential alternative to PBTI in non-hydrostatic models. This is mainly because they can be constructed to ‘filter’ the fast and atmospherically irrelevant acoustic modes that propagate vertically. In fact, SE and HEVI schemes that had been mainly used in the context of limited-area models are now being taken into consideration in global weather models, as they seem to represent an attractive compromise between solution accuracy, time- and energy-to-solution and reliability. In addition, they can address the non-conservation issues typical of PBTI-based approaches, a feature that is particularly relevant for climate simulations. Some additional strategies, beyond HEVI, SE and SISL methods are also under investigation within global NWP and climate simulations, namely IMEX schemes, fully implicit methods and conservative semi-Lagrangian schemes—the latter addressing the conservation issues of traditional SISL schemes. These additional strategies will be briefly discussed in Sect. 5.
Note that while the time-integration approaches described in this section might differ in their implementation details from one model to another, the general concepts and properties, which motivate their use, hold true across all the models adopting each strategy. In the next Sect. 4, we will highlight the time-stepping strategies employed by the main operational global NWP and climate centres and emphasize in more detail why these were selected. In addition, we will outline the implications these choices may have in the context of the changing hardware and weather modeling landscape. Finally, we will introduce some of the (several) projects undertaken within the weather and climate industry to address the computational challenges of the incoming decades.