
Continental scale spatial temporal interpolation of near-surface air temperature: do 1 km hourly grids for Australia outperform regional and global reanalysis outputs?

Original Article
Open access
Published: 13 August 2024


Climate Dynamics


A Correction to this article was published on 03 September 2024

This article has been updated

Abstract

Near-surface air temperature is an essential climate variable for the study of many biophysical phenomena, yet is often only available as a daily mean or extrema (minimum, maximum). While many applications require sub-diurnal dynamics, temporal interpolation methods have substantial limitations and atmospheric reanalyses are complex models that typically have coarse spatial resolution and may only be periodically updated. To overcome these issues, we developed an hourly air temperature product for Australia with spatial interpolation of hourly observations from 621 stations between 1990 and 2019. The model was validated with hourly observations from 28 independent stations, compared against empirical temporal interpolation methods, and both regional (BARRA-R) and global (ERA5-Land) reanalysis outputs. We developed a time-varying (i.e., time-of-day and day-of-year) coastal distance index that corresponds to the known dynamics of sea breeze systems, improving interpolation performance by up to 22.4% during spring and summer in the afternoon and evening hours. Cross-validation and independent validation (n = 24/4 OzFlux/CosmOz field stations) statistics of our hourly output showed performance that was comparable with contemporary Australian interpolations of daily air temperature extrema (climatology/hourly/validation: R² = 0.99/0.96/0.92, RMSE = 0.75/1.56/1.78 °C, Bias = -0.00/0.00/-0.03 °C). Our analyses demonstrate the limitations of temporal interpolation of daily air temperature extrema, which can be biased due to the inability to represent frontal systems and assumptions regarding rates of temperature change and the timing of minimum and maximum air temperature. Spatially interpolated hourly air temperature compared well against both BARRA-R and ERA5-Land, and performed better than both reanalyses when evaluated against the 28 independent validation stations. 
Our research demonstrates that spatial interpolation of sub-diurnal meteorological fields, such as air temperature, can mitigate the limitations of alternative data sources for studies of near-surface phenomena and plays an important ongoing role in supporting numerous scientific applications.


1 Introduction

Quantifying spatial and temporal variability in near-surface air temperature is essential to the study of many biophysical phenomena, including energy (McVicar and Jupp 1999; Parton and Logan 1981) and water balances (Guerschman et al. 2022; McVicar and Jupp 2002; Vaze et al. 2013), wildfire (Brown et al. 2016), agricultural productivity (Holzworth et al. 2018; Walter 1967) and biogeographic distributions (Kearney and Porter 2009). Often there are trade-offs between spatial resolutions and temporal frequencies in climatological datasets, and therefore the availability of suitable products to study sub-diurnal (i.e., time-steps shorter than a 24-hour period) environmental dynamics can be limited. Quantifying air temperature at sub-diurnal time-steps enables better representation of diurnal processes (Dai 2023), whereas finer spatial resolutions can better represent the effect of lapse rates on temperature in topographically variable regions (Hutchinson 1991; McVicar et al. 2007). This increased spatial and temporal granularity can also enable better linkages with secondary data sources, such as remote sensing analyses where data are acquired at specific times.

Several post-processing techniques can be applied to achieve the desired spatial and sub-diurnal precision. For example, spatial downscaling (e.g., Wang et al. 2016) or temporal interpolation (e.g., Parton and Logan 1981) can be used to modify the spatial and temporal characteristics of existing products. Despite the availability of such methods, finding an adequate balance between the spatial resolution, temporal frequency, currency, and accuracy of air temperature datasets can be challenging (e.g., Kettle and Thompson 2004; Li et al. 2020; Pan et al. 2012).

Empirical temporal interpolation using daily air temperature extrema (minimum, maximum) as input has been used for over 130 years to model sub-diurnal air temperature (Strachey 1886). These models typically apply harmonic functions to daily (or longer) air temperature observations (i.e., measured once or summarised to one value per time-step) and measures of solar geometry (e.g., timing of sunrise and sunset), and assume that the timing of minimum and maximum air temperatures is constant with respect to local solar time (Table 1). The relatively simple input data can be easily computed or accessed from meteorological databases (e.g., daily air temperature extrema from gridded datasets, station observations), making these methods practical to implement. Despite this accessibility, these models require strong assumptions about the timing of minimum and maximum air temperatures, need calibration for best results, and cannot represent frontal systems (Cesaraccio et al. 2001; Parton and Logan 1981). These models are also usually calibrated for a single location or a small set of locations (e.g., Gholamnia et al. 2019) despite being applied more broadly in practice, and do not explicitly consider spatial relationships between observations.

Table 1 Summary of studies using spatial (S) and/or temporal (T) methods for interpolating sub-diurnal near-surface air temperature (Ta). The studies are ordered by temporal interpolation then spatial interpolation, then within each interpolation type they are ordered chronologically by publication year. Temporal interpolation methods require minimum (Ta_min) and maximum (Ta_max) daily air temperature. Spatial interpolation methods include numerical weather prediction (NWP) and inverse distance weighting (IDW). The interpolated area is reported as “not applicable” (n/a) for site-based analyses


Spatial interpolation is widely used for the development of climate datasets from regional to global scales (Cornes et al. 2018; Harris et al. 2020; Jeffrey et al. 2001; Jones et al. 2009; McVicar et al. 2008; Thornton et al. 2021). Variables are commonly available at daily or longer time-steps; however, there have been applications at sub-diurnal time-steps (see Table 1). Air temperature is often provided as extrema at daily or longer time-steps (e.g., monthly), and as such is modelled as a mosaic of observations that likely occurred at different times. Models are calibrated using high-quality observations, ideally with sufficient density and representation to characterise important climatic gradients (e.g., lapse rates, sea breezes, synoptic patterns). The most common spatial interpolation methods include thin plate splines and kriging (Hutchinson 1991; Matheron 1962), though many others have been developed (Hengl et al. 2018; Li and Heap 2014; Sekulić et al. 2020). Spatial interpolation draws on observations from surrounding locations and typically shows strong statistical performance in climate applications (Hutchinson et al. 2009; Jones et al. 2009), though model skill is dependent upon the quality and density of input data (e.g., Stewart et al. 2017), in addition to appropriate representation of process gradients (e.g., elevation for estimating lapse rates, distance from coast for sea breeze modelling). Historically, the limited availability of high-frequency climate observations (relative to daily or monthly) has been one barrier to applying these methods at sub-diurnal time-steps over many years at continental extents.

Sub-diurnal meteorological fields can be accessed via climate reanalysis products, which are developed using physical models in conjunction with assimilated observations (e.g., Gelaro et al. 2017; Hersbach et al. 2020; Kobayashi et al. 2015; Muñoz-Sabater et al. 2021; Su et al. 2019). These datasets quantify many atmospheric variables, are regional to global in extent, range from decades to over a century in length, and often provide sub-diurnal time-steps. Reanalyses preserve the physical relationships between variables and include multiple vertical layers representing increasing altitudes, but are sensitive to spatial-temporal patterns in assimilated observations (as with spatial interpolation) and often have very coarse spatial resolution (0.1° to 2.5° grid cells, equivalent to ~ 11 km to 277 km at the equator). The coarse spatial resolution means that many fine-scale features are not well represented in the reanalysis output without post-processing (e.g., Guo et al. 2022; Karger et al. 2017; Politi et al. 2021), which can be limiting when analysing with remotely sensed data and/or studying near-surface processes in regions of highly variable topography. Furthermore, the delay between the current date and reanalysis data availability can range from several months to years (e.g., Muñoz-Sabater et al. 2021; Saha et al. 2014; Su et al. 2019), limiting suitability for studying recent events.

The increased availability of hourly observations across Australia in recent decades (Australian Bureau of Meteorology 2023; Trewin 2012) provides an opportunity to demonstrate spatial interpolation of near-surface air temperature at sub-diurnal time-steps. This enables the representation of frontal systems, avoiding many of the assumptions associated with temporal interpolation of air temperature, such as the (consistent) timing of daily minimum and maximum air temperatures. Spatial interpolation also allows for the development of high-resolution surfaces better suited for use with remote sensing data, and can be rapidly updated, enabling the study of recent events (e.g., extreme weather). Our aim was to develop and evaluate a high-resolution spatial dataset of hourly air temperature calibrated using weather station observations distributed across Australia. Our specific objectives, which form the basis for sub-headings in the following Methods, Results and Discussion sections, were to:

i) describe the development and statistical performance of spatially interpolated hourly air temperature surfaces for Australia;
ii) evaluate the performance of our method against temporal interpolation using daily extrema air temperature observations; and
iii) compare our spatially interpolated hourly product against high-quality, contemporary regional and global reanalysis products.

Many studies have investigated spatial and temporal air temperature interpolation in isolation (Table 1), but few, if any, have analysed the relative benefits of using one approach over another. These analyses, in conjunction with validation against independent observations and comparisons with reanalysis products, demonstrate the feasibility and suitability of spatial interpolation for generating high quality hourly air temperature surfaces.

2 Study region and materials

Hourly air temperature (at ~ 1.5 m height) observations recorded across Australia between January 1990 and November 2019 were obtained from the Australian Bureau of Meteorology (BoM). The dataset included records for 621 stations. The number of these stations where meteorological parameters are available at hourly time-steps has steadily increased over recent decades (Figure S1; Australian Bureau of Meteorology 2023; Trewin 2012). Hourly air temperature observations were available for just 35 of these stations in 1990, increasing to 328 stations in 2000 and 568 stations in 2019. All records were converted from their respective Australian time zones, accounting for daylight saving time as needed, to Coordinated Universal Time (UTC) to ensure they were temporally aligned.

For independent validation, hourly air temperature observations from 24 OzFlux weather stations (see Beringer et al. 2016; https://www.ozflux.org.au/) and 4 CosmOz weather stations (see Hawdon et al. 2014; https://cosmoz.csiro.au/) were compiled. The OzFlux stations monitor energy, carbon, and water fluxes across Australian ecosystems and contribute to the global FLUXNET network. The CosmOz stations are a network of cosmic ray probes used to measure average soil moisture over large areas (~ 30 ha footprint). Observations at the CosmOz stations were in some cases recorded at irregular time-steps and were rounded to the closest hour. The OzFlux observations are often made at multiple heights up an instrumentation mast. In all cases, we took the measurements closest to standard station height (i.e., 1.5 m). The difference between observation height and standard station height was recorded and is presented alongside the validation results. The spatial distribution of the weather stations used herein is illustrated in Fig. 1.

Fig. 1

Surveyed site elevation data, sourced from station metadata, were used for all model cross-validation, validation site predictions and model fitting. The GEODATA 9-second Digital Elevation Model (DEM; Hutchinson et al. 2008), reprojected to Australian Albers (EPSG: 3577) equal area projection (with 1 km resolution) was used for gridded spatial interpolation. Distance to the generalised coast data was obtained from ANUClimate v2.0 (Hutchinson et al. 2021).

Two contemporary reanalysis products were used to evaluate our spatially interpolated surfaces: (i) the 12 km Bureau of Meteorology Atmospheric high-resolution Regional Reanalysis for Australia (BARRA-R; Su et al. 2021); and (ii) ERA5-Land (Muñoz-Sabater et al. 2021). Screen temperature and 2 m air temperature were obtained from BARRA-R and ERA5-Land, respectively, at four times of the day from 01/Jan/2015 to 31/Dec/2018. The analyses were restricted to this period to ensure consistent seasonal summaries.

Minimum and maximum daily air temperature observations were obtained from the SILO patched point dataset (https://www.longpaddock.qld.gov.au/silo/point-data/; Jeffrey et al. 2001). The SILO database provides quality controlled and gap-filled daily records of meteorological variables across Australia. Daily data were acquired for 387 stations located across Australia (see Figure S2) that could be matched by station identifier to the co-occurring hourly records.

3 Methods

3.1 Spatial interpolation of hourly air temperature

Two distinct approaches to spatial interpolation of hourly near-surface air temperature were evaluated: (i) climatologically aided spatial interpolation (CASI); and (ii) direct spatial interpolation (DSI). CASI (Willmott and Robeson 1995) involves the separate interpolation of a stable long-term base climatology and corresponding anomalies. For DSI, models are fitted using all available station observations and covariates at each time-step (i.e., allowing model responses to vary with specific weather conditions).

We implemented both the CASI and DSI methods for analyses at sub-diurnal time-steps using hourly records. We produced and statistically evaluated interpolated hourly air temperature grids following eight key steps for CASI, and two key steps for DSI. For CASI, these steps were to: (C1) gap fill observations to mitigate potential bias introduced from temporally incomplete records; (C2) calculate hourly air temperature climatologies for every 5 day-of-year (DOY) period; (C3) fit and evaluate spline models for climatologies; (C4) interpolate every 5th DOY hourly climatologies with quart-variate thin plate splines; (C5) calculate hourly anomalies (i.e., hourly deviations from climatology); (C6) fit and evaluate spline models for anomalies; (C7) interpolate anomalies with bi-variate thin plate splines; and (C8) add the interpolated climatologies to the interpolated anomalies to produce hourly surfaces. For DSI, these steps were to: (D1) fit and evaluate quart-variate spline models; and (D2) interpolate hourly air temperature. Each step is discussed in detail below.

Gaps in the hourly observations were filled (Step C1) using a regression patching procedure (based on Hopkinson et al. 2012; Stewart and Nitschke 2017) to mitigate potential bias in climatologies calculated with temporally incomplete records. Stations with at least 220 observations for any specific hour ± 5 DOYs (20 years by 11 days) were considered as “long-term stations” and used as reference points for gap-filling. Linear regression was used to model missing observations from the closest 10 long-term stations where at least 44 observations (4 years by 11 days) co-occurred. Only models with statistically significant (F-test; α ≤ 0.01) linear relationships were retained. Gaps were filled using the model achieving the highest F-score for each time-step. This procedure was iteratively applied across each station, until all possible gaps were filled, to estimate as many missing observations as possible.
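The regression patching step can be illustrated with a minimal sketch. It assumes that each station is a NumPy array aligned on the same time-steps, with NaN marking gaps. The function names and the `f_crit` argument (a caller-supplied critical F value standing in for the F-test at α = 0.01) are hypothetical, and the procedure used in the study may differ in detail.

```python
import numpy as np

def fit_reference(target, ref, min_pairs=44):
    """Fit target ~ ref on co-occurring values; return (slope, intercept, F) or None."""
    ok = ~np.isnan(target) & ~np.isnan(ref)
    n = int(ok.sum())
    if n < min_pairs:
        return None
    x, y = ref[ok], target[ok]
    slope, intercept = np.polyfit(x, y, 1)
    r2 = np.corrcoef(x, y)[0, 1] ** 2
    # F statistic of a simple linear regression with (1, n - 2) degrees of freedom
    f_stat = np.inf if r2 >= 1.0 else r2 * (n - 2) / (1.0 - r2)
    return slope, intercept, f_stat

def patch_gaps(target, refs, f_crit, min_pairs=44):
    """Fill NaNs in target from reference stations, best-fitting model first."""
    fitted = [(fit_reference(target, r, min_pairs), r) for r in refs]
    kept = [(m, r) for m, r in fitted if m is not None and m[2] >= f_crit]
    kept.sort(key=lambda mr: mr[0][2], reverse=True)  # highest F first
    filled = target.copy()
    for (slope, intercept, _), r in kept:
        gap = np.isnan(filled) & ~np.isnan(r)
        filled[gap] = intercept + slope * r[gap]
    return filled

# Synthetic demonstration (illustrative values only)
ref_good = np.linspace(0.0, 30.0, 100)
ref_poor = np.cos(np.arange(100) * 1.7)
truth = 1.0 + 2.0 * ref_good + 0.1 * np.sin(np.arange(100))
target = truth.copy()
target[::10] = np.nan
filled = patch_gaps(target, [ref_poor, ref_good], f_crit=7.3)
```

Because models are applied in descending order of F, each gap is filled by the best-fitting reference station that has an observation at that time-step. Repeating the call across stations would approximate the iterative application described above.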

Climatologies were calculated for each station to quantify long-term average air temperatures (Step C2). Climatologies were developed for each hour, centred on every 5th DOY (n = 73, i.e., every 5th day for 365 days) between 01/Jan/1990 and 31/Dec/2019 (i.e., 30 years) to represent shifting solar geometry and seasonal changes in temperature (n climatologies = 1,752, calculated as 73 by 24 h). For each hour we included all available observations within +/- 5 DOYs (including the central day there are 11 days summarised) across all available years. This maximised the number of data points available to build reliable climatologies (i.e., n = 330, 11 days by 30 years). This 11-day aggregation window was chosen to minimise the effect of changes in solar geometry on air temperature for specific times of the day. The time of sunrise and sunset (when air temperatures may vary rapidly) varies by less than 10 min anywhere in Australia for all 11-day periods, well within the hourly interpolation time-step.

Climatologies were calculated for each hour and 5th DOY period at every weather station, where at least 110 (10 years by 11 days) observed or gap-filled values were available (mean = 313.2, standard deviation = 20.4 values) and where no more than 90% of the values were estimated (mean = 30.5%, standard deviation = 20.1%). Gap-filled values were only used to calculate climatologies. Note: climatologies were only retained for stations that met the above criteria for all time periods (i.e., n = 1,752 values per station, calculated as 73 by 24 h) to ensure consistency across space and time for the subsequent spatial interpolations. A total of 505 stations met these criteria for building the climatologies (see Fig. 1). Records at an additional 116 locations that did not meet these criteria were retained to calibrate and cross-validate the hourly interpolations (Fig. 1).
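The 11-day aggregation window can be sketched as follows for a single station. This is a minimal illustration under stated assumptions: the period centres (DOY 3, 8, ..., 363) are hypothetical, leap days are ignored, the window wraps around the end of the year, and the limit on the share of gap-filled values is not implemented.

```python
import numpy as np

# Hypothetical centres of the 73 five-day periods
CENTRES = np.arange(3, 366, 5)

def circular_doy_distance(doy, centre, year_len=365):
    """Distance in days between DOYs, wrapping across the new year."""
    diff = np.abs(np.asarray(doy) - centre)
    return np.minimum(diff, year_len - diff)

def hourly_climatology(doy, hour, temp, centre, target_hour,
                       half_window=5, min_count=110):
    """Mean temperature for one hour-of-day within +/- half_window DOYs of centre."""
    sel = ((hour == target_hour)
           & (circular_doy_distance(doy, centre) <= half_window)
           & ~np.isnan(temp))
    n = int(sel.sum())
    mean = float(temp[sel].mean()) if n >= min_count else float("nan")
    return mean, n

# Synthetic demonstration: 30 years of noon observations where temperature equals DOY
doy = np.tile(np.arange(1, 366), 30)
hour = np.full(doy.shape, 12)
temp = doy.astype(float)
clim, n_used = hourly_climatology(doy, hour, temp, centre=103, target_hour=12)
```

With complete records, each climatology draws on 330 values (11 days by 30 years), matching the maximum sample size stated above.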

Hourly air temperature climatologies for every 5th DOY were fitted (Step C3) and spatially interpolated (Step C4) with quart-variate thin plate smoothing splines (easting, northing, elevation, and a coastal distance index as independent spline variables) using ANUSPLIN v4.4 (Hutchinson and Xu 2013). The smoothing parameters for each surface were selected via Generalised Cross Validation. We included 95% of data points as knots for all climatologies (see Figure S3 for results identifying the optimal % of data points included as knots). Elevation was exaggerated by a factor of 100 relative to the coordinate system to represent the differences in horizontal and vertical synoptic scales as is typical for spline-based climate interpolation (see Hutchinson et al. 2009). The coastal distance index (CDI) was calculated as:

$$CDI = e^{-D2C/d} \qquad (1)$$

where D2C is distance to the generalised coast (in units of km; Hutchinson et al. 2021) and d is the parameter controlling the rate at which the index decays (see Figure S4). Higher values of d cause the coastal distance index to decay more slowly with distance from the coast. The CDI was included to represent the effect of coastal weather (e.g., sea breeze) on air temperature (Abbs and Physick 1992; Daly et al. 2002; Hutchinson et al. 2021; Miller et al. 2003).
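The behaviour of Eq. 1 can be checked numerically. The sketch below is illustrative only; the function name and the example distances are assumptions, while the two values of d correspond to the ends of the range evaluated in this study.

```python
import math

def coastal_distance_index(d2c_km: float, d: float) -> float:
    """Eq. 1: CDI = exp(-D2C / d), with D2C in km and d the decay parameter."""
    if d <= 0:
        raise ValueError("d must be positive")
    return math.exp(-d2c_km / d)

# CDI at the coast, 10 km inland and 100 km inland for a small and a large d
for d in (3.0, 50.0):
    row = [round(coastal_distance_index(x, d), 3) for x in (0.0, 10.0, 100.0)]
    print(f"d = {d}: {row}")
```

The index equals 1 at the coast and approaches 0 inland, and a larger d extends the coastal influence further from the shoreline.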

Climatologies for each time-of-day and 5th DOY were iteratively re-fitted with 20 potential values of d between 3 and 50 (see Figure S4 for specific values) to determine the optimal decay rate for the coastal distance index (n climatologies = 35,040, calculated as 73 5th DOYs by 24 h by 20 d values). The optimal d was evaluated using two methods. The first approach was to simply pool all cross-validation results by each unique value of d and empirically determine the best performing value for the full set of climatologies. The second approach was to pool cross-validation results by each time-of-day, 5th DOY, and unique value of d. The optimal d value that minimised root mean squared error (RMSE) for each combination of time-of-day and 5th DOY was then selected to enable a more specific time-varying response to coastal weather conditions to be fitted. Filtering processes were then implemented to ensure the resulting interpolations did not contain step-changes that could negatively affect model predictions. Time-varying d climatologies were first filtered so that only those values that reduced RMSE by at least 3% relative to the optimal fixed value (determined by the first, simpler approach) were considered for a specific time-of-day and 5th DOY. These filtered d values were then smoothed using a focal mean across all times-of-day ± 1 h and all 5th DOYs ± 5 DOYs. The filtered and smoothed values of d were used to generate a CDI for each specific time-of-day and DOY combination.
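The selection, filtering and smoothing of d can be sketched with NumPy. The sketch assumes that cross-validated RMSE is held in an array of shape (number of d values, 24 hours, 73 periods), that the pooled RMSE can be approximated by weighting every cell equally, and that the focal mean wraps around midnight and the new year. These choices and all names are assumptions and are not taken from the study.

```python
import numpy as np

def select_time_varying_d(rmse, d_values, min_gain=0.03):
    """Return the best fixed d and a (24, 73) grid of filtered time-varying d."""
    pooled = np.sqrt((rmse ** 2).mean(axis=(1, 2)))   # one RMSE per candidate d
    fixed_idx = int(pooled.argmin())
    best_idx = rmse.argmin(axis=0)                    # best d per hour and period
    gain = 1.0 - rmse.min(axis=0) / rmse[fixed_idx]   # relative RMSE reduction
    d_grid = np.where(gain >= min_gain, d_values[best_idx], d_values[fixed_idx])
    return d_values[fixed_idx], d_grid

def focal_mean(grid):
    """3 x 3 moving average over hours (+/- 1 h) and periods (+/- 1 period)."""
    total = np.zeros_like(grid, dtype=float)
    for dh in (-1, 0, 1):
        for dp in (-1, 0, 1):
            total += np.roll(grid, shift=(dh, dp), axis=(0, 1))
    return total / 9.0

# Synthetic demonstration: d = 5 is best overall, d = 50 is much better in one cell
d_values = np.array([5.0, 20.0, 50.0])
rmse = np.stack([np.full((24, 73), v) for v in (1.0, 1.1, 1.2)])
rmse[2, 10, 20] = 0.5
fixed_d, d_grid = select_time_varying_d(rmse, d_values)
d_smooth = focal_mean(d_grid)
```

In the demonstration, the isolated optimum of 50 is retained by the 3% filter and then spread across its neighbours by the focal mean, which is how step-changes between adjacent time periods are avoided.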

Hourly air temperature anomalies were then calculated (Step C5) as:

$$A_{h} = Ta_{h} - Ta_{c} \qquad (2)$$

where h is the hour of the observation, c is the climatology for the same hour-of-day and closest 5th DOY and A is the anomaly (i.e., difference between the observation and corresponding climatology). Point interpolations for each of the 1,752 climatologies were performed at each of the 116 stations (n = 203,232) not meeting the inclusion criteria for climatologies to enable calculation of anomalies. Anomaly models were fitted (Step C6) and spatially interpolated (Step C7) using bi-variate thin plate splines as a function of easting and northing and 80% of data points as knots (see Figure S3) to mitigate the potential for model instability and exact interpolation. The final hourly air temperature surfaces were calculated by adding the spatially interpolated climatology (for the same hour, and closest 5th DOY period) and anomaly (Step C8).
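Eq. 2 and Step C8 amount to a subtraction at the stations followed by an addition on the interpolated surfaces. A minimal sketch follows, assuming hypothetical period centres at DOY 3, 8, ..., 363 and a circular search for the closest centre; all function names are illustrative.

```python
def nearest_centre(doy, centres, year_len=365):
    """Closest 5th-DOY centre to a given DOY, wrapping across the new year."""
    def dist(c):
        diff = abs(doy - c)
        return min(diff, year_len - diff)
    return min(centres, key=dist)

def anomaly(ta_obs, ta_clim):
    """Eq. 2: hourly observation minus the matching climatology."""
    return ta_obs - ta_clim

def casi_prediction(clim_interp, anom_interp):
    """Step C8: interpolated climatology plus interpolated anomaly."""
    return clim_interp + anom_interp

centres = list(range(3, 366, 5))          # hypothetical centres, 73 periods
period = nearest_centre(365, centres)     # DOY 365 maps to the period centred on 363
a_h = anomaly(ta_obs=27.4, ta_clim=25.0)  # illustrative values in degrees C
ta_hat = casi_prediction(clim_interp=24.6, anom_interp=2.1)
```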

Hourly air temperature was ‘directly’ (DSI) fitted (Step D1) and spatially interpolated (Step D2) using quart-variate thin plate splines (easting, northing, elevation, and a coastal distance index as independent spline variables) with hourly observations as input (i.e., without any anomaly calculation). As with CASI, 80% of the hourly observations were used as knots (see Figure S3). The optimal d values for DSI were assessed using a similar process as with the climatologies (see Step C3/C4 above) but using hourly cross-validation statistics between 01/Jan/2015 and 31/Dec/2018 in place of the climatologies. Hourly cross-validated predictions were aggregated to the closest 5th DOY when assessing d, both for consistency with the climatologies and to increase the number of available samples for calculating reliable cross-validation statistics. There was considerable variability but fewer large outliers in d when evaluated for DSI and therefore values were not filtered for % change in RMSE, but the focal mean step was applied (i.e., across all times-of-day ± 1 h and all 5th DOYs ± 5 DOYs). DSI was additionally modelled with d values obtained from the analysis of climatologies to determine the effectiveness of using more temporally stable coastal proximity indices.

Statistical performance was evaluated for each of the climatologies (representative of 1990–2019) and all hours available between 01/Jan/2000 and 14/Nov/2019 (174,064 h over 7,258 days). The number of hourly observations across Australia has steadily increased over time (Figure S1), therefore this period was chosen to provide conservative error estimates that reflect our ability to interpolate historical data. Hourly CASI and DSI were compared against one another; however, only the best performing model was retained for the remainder of the analyses. Hourly air temperature surfaces (1 km resolution) and point predictions at validation sites were interpolated for each hour between 01/Jan/2015 and 14/Nov/2019 (1,779 days = 42,696 h). This shorter period was chosen for validation against independent OzFlux and CosmOz station observations, and comparison with alternative methods (i.e., empirical models, reanalysis data) to better represent the higher station density that is currently available (and likely available on an ongoing basis).

Model performance was evaluated on leave-one-out cross-validated predictions generated by ANUSPLIN during the model fitting procedure (Fig. 2, orange symbols). Hourly CASI was evaluated using both the cross-validated climatology and cross-validated anomaly together to ensure a conservative estimate of error. Independent observations at the OzFlux and CosmOz stations were evaluated against point interpolations for each hour (i.e., using the fitted model from the best performing of the CASI and DSI workflows). The coefficient of determination (R²), root mean squared error/deviation (RMSE/RMSD) and mean error (bias) were used to quantify agreement between observed and cross-validated values (Willmott 1982). The RMSE was used to indicate where statistical comparisons are made between observations (i.e., ground truth) and modelled estimates, and the RMSD was used to indicate statistical comparisons between two modelled estimates. Changes in the absolute value of bias are reported when directly comparing different analyses (e.g., CASI versus DSI). The same metrics were used for all subsequent comparisons. Confidence intervals were given to 1 standard deviation and all results were presented in UTC + 9 unless otherwise specified. This offset was selected to optimally align with the time zones used across Australia, which vary from UTC + 8 in the west to UTC + 11 in the east during daylight savings (in the austral summer).
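The three agreement metrics can be written out as follows. This is a sketch under two assumptions that are not confirmed by the text: R² is taken as the squared Pearson correlation, and bias is taken as the mean of predicted minus observed values.

```python
import math

def evaluate(obs, pred):
    """Return R2, RMSE and bias for paired observed and predicted values."""
    n = len(obs)
    errors = [p - o for o, p in zip(obs, pred)]
    bias = sum(errors) / n
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mean_o = sum(obs) / n
    mean_p = sum(pred) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(obs, pred))
    var_o = sum((o - mean_o) ** 2 for o in obs)
    var_p = sum((p - mean_p) ** 2 for p in pred)
    r2 = cov ** 2 / (var_o * var_p)
    return {"r2": r2, "rmse": rmse, "bias": bias}

# Illustrative values: predictions that are uniformly 1 degree C too warm
stats = evaluate(obs=[10.0, 12.0, 14.0, 16.0], pred=[11.0, 13.0, 15.0, 17.0])
```

A constant offset leaves R² at 1 while RMSE and bias both equal the offset, which is why the three metrics are reported together. The same calculation yields the RMSD when both inputs are modelled estimates.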

Fig. 2

3.2 Comparing hourly spatial interpolation with temporal interpolation of daily air temperature

Empirical estimates of hourly air temperature were modelled using cross-validation predictions of daily minimum and maximum air temperature following Parton and Logan (1981), which has been previously used in Australia (Holzworth et al. 2014; McVicar and Jupp 1999). The model (herein denoted PL81) uses a truncated sine function to model daytime temperature and an exponential decay function to model night-time air temperature. We parameterised the model by empirical analysis of the time lag between solar noon and maximum air temperature (Figure S5), and sunrise and minimum air temperature (Figure S6) for each calibration and validation station. These parameters were determined seasonally, and typically varied by an hour or less at any specific site (Figure S7). Local solar time (LST) was calculated based on hourly time-steps in UTC to ensure air temperatures produced by PL81 were temporally aligned with the available observations.
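The general shape of a truncated sine and exponential decay model can be sketched as below. This follows the commonly cited form of the Parton and Logan (1981) equations as an assumption rather than the exact implementation used here. The parameters a (lag of the maximum after solar noon), b (night-time decay coefficient) and c (lag of the minimum relative to sunrise) are illustrative placeholders, not the seasonally fitted values from this study.

```python
import math

def pl81_temperature(hour, t_min, t_max, t_min_next, sunrise, sunset, a, b, c):
    """Air temperature at `hour` in local solar time.

    Hours above 24 denote the pre-dawn period of the following day.
    """
    day_len = sunset - sunrise
    night_len = 24.0 - day_len
    t_of_min = sunrise + c  # time at which the daily minimum occurs

    def daytime(h):
        m = h - t_of_min  # hours since the minimum
        return (t_max - t_min) * math.sin(math.pi * m / (day_len + 2.0 * a)) + t_min

    if not t_of_min <= hour < t_of_min + 24.0:
        raise ValueError("hour must lie within 24 h of the time of minimum")
    if hour <= sunset:
        return daytime(hour)
    n = hour - sunset  # hours since sunset
    t_sunset = daytime(sunset)
    # Exponential decay from the sunset temperature towards the next day's minimum
    return t_min_next + (t_sunset - t_min_next) * math.exp(-b * n / night_len)

# Illustrative day: sunrise 06:00, sunset 18:00, minimum 10, maximum 25, next minimum 12
profile = [
    pl81_temperature(h, 10.0, 25.0, 12.0, 6.0, 18.0, a=2.0, b=2.2, c=0.0)
    for h in range(6, 30)
]
```

With these placeholder values the modelled maximum falls at 14:00, two hours after solar noon, regardless of the weather on the day. This fixed timing is the assumption that prevents such models from representing frontal systems.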

The PL81 analyses used cross-validated and point predictions of daily minimum and maximum air temperature, as the daily temperature extrema are not necessarily well captured by data at (relatively) infrequent time-steps and therefore additional biases may be introduced by using hourly records. Daily minimum and maximum air temperatures were also interpolated using quart-variate thin plate splines (full spline dependence on easting, northing, elevation, and coastal distance index) using 80% of the observations as knots. The d value used for transforming the coastal distance index was independently assessed for daily air temperature (as described for the hourly climatologies and DSI), though with the time-varying analysis aggregated by the closest 5th DOY only. PL81 modelling used cross-validated predictions of daily air temperature from calibration stations (n = 391 stations) and point predictions for all remaining stations (n = 231 stations), including the validation stations (Figure S2).

Hourly air temperature was modelled with PL81 iteratively for each day in the comparison period (01/Jan/2015 to 14/Nov/2019), using cross-validated minimum air temperature of both the current and next day in separate runs to ensure a smooth transition in the diurnal air temperature profile between days. Hourly predictions modelled using PL81 were compared against the air temperature observations (at both calibration and validation sites) and performance statistics were compared against cross-validation predictions from hourly spatial interpolations for the same comparison period. Hourly observations were first converted to LST to align timestamps. Statistical comparisons of PL81 and observed air temperature were calculated for each hour, the daily mean, at the time of sunrise − 1 h, and at the time of solar noon + 1.5 h. The latter two times were selected to explore the potential biases present at the typical time of daily minimum and maximum air temperature, respectively. These periods were evaluated using the closest time available per day, as UTC time will vary with respect to a fixed solar time (e.g., changes in sunrise and sunset times as a function of DOY and latitude). Cross-validation predictions from the hourly spatial interpolations were compared by converting them to LST and evaluating the difference in statistical performance, relative to PL81, by hour-of-day and day-of-year.

3.3 Comparing hourly spatial interpolation with reanalysis products

The spatially interpolated air temperature surfaces were compared against two contemporary reanalysis products: (i) BARRA-R (regional; Su et al. 2021); and (ii) ERA5-Land (global; Muñoz-Sabater et al. 2021). Each of the corresponding spatially interpolated surfaces was reprojected to the native resolution of BARRA-R (0.11°) and ERA5-Land (0.10°) for subsequent analyses. Statistical comparisons of both pairs of products (i.e., [i] CASI/DSI and BARRA-R; and [ii] CASI/DSI and ERA5-Land) were calculated for each pixel through the available paired time points, and for the seasonal means across all pixels at four times-of-day (03:00, 09:00, 15:00, 21:00 UTC + 9). Hourly records from the calibration and validation (OzFlux and CosmOz) stations were then compared against both reanalysis outputs and the spatial interpolations to assess the accuracy of each dataset relative to the observations.
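The per-pixel comparison through time can be sketched with NumPy. The sketch assumes that both products have already been resampled to a common grid and stacked as arrays of shape (time, y, x) with NaN for missing cells; the function name and the sign convention (interpolation minus reanalysis) are assumptions.

```python
import numpy as np

def pixelwise_comparison(interp, reanalysis):
    """RMSD and mean difference for each pixel across paired time points."""
    diff = interp - reanalysis
    rmsd = np.sqrt(np.nanmean(diff ** 2, axis=0))
    bias = np.nanmean(diff, axis=0)
    return rmsd, bias

# Synthetic demonstration: four time points on a 2 x 3 grid
interp = np.full((4, 2, 3), 21.0)
reanalysis = np.full((4, 2, 3), 20.0)
reanalysis[0, 0, 0] = np.nan  # a missing reanalysis value is skipped for that pixel
rmsd, bias = pixelwise_comparison(interp, reanalysis)
```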

4 Results

4.1 Spatial interpolation of hourly air temperature

Hourly air temperature and climatologies were best interpolated using time-varying estimates of d for calculating the coastal distance index (Fig. 3e, f). Time-varying d values improved interpolation performance for hourly climatologies by up to 22.4% (Fig. 3c, i.e., DOY 306, 17:00) in comparison to fixing d at 5 (Fig. 3a), and were most effective in the afternoon and evening of the warmer DOYs (i.e., DOYs 250–50, 15:00–21:00, Δ RMSE = -10.4% ± 5.9%). Hourly DSI showed similar patterns in d when evaluated over 4 years; however, the values were more variable than those for the climatologies (Fig. 3d). Separate evaluation of DSI with the climatological d and hourly d revealed little difference in statistical performance (Fig. 3, Figure S8, Δ RMSE = -0.06% ± 0.17%), and therefore the climatological d was applied to the DSI for all subsequent analyses. As with the climatologies, the DSI improved most in the afternoon and evening of the warmer DOYs (i.e., DOYs 250–50, 15:00–21:00, Δ RMSE = -2.1% ± 1.5%). Further performance improvements were found for DSI throughout the year in the early morning (Fig. 3f, i.e., 07:00–08:00, Δ RMSE = -0.6% ± 0.3%), reflecting the lower optimal d values at these times (Fig. 3d).

Fig. 3

Hourly air temperature climatologies (1990–2019; see Fig. 4) were best interpolated at daytime during the cooler DOYs (i.e., DOYs 150–230, 08:00–16:00, mean = 15.43 °C, R² = 0.99, RMSE = 0.60 °C); however, they also performed worst during the early hours of the morning at the same time of year (01:00–07:00, mean = 9.57 °C, R² = 0.95, RMSE = 1.08 °C, Fig. 5). Cross-validation statistics show a bimodal pattern in diurnal performance during the warmer DOYs (i.e., DOY 340–055; Fig. 5). During these warmer DOYs, performance was best when temperatures increase in the hours after sunrise (i.e., 06:00–09:00, mean = 21.9 °C, R² = 0.99, RMSE = 0.51 °C) and in the evening (i.e., 17:00–23:00, mean = 22.5 °C, R² = 0.99, RMSE = 0.66 °C), with performance being weaker in the early hours of the morning (i.e., 00:00–05:00, mean = 19.0 °C, R² = 0.98, RMSE = 0.65 °C) and during the afternoon (i.e., 12:00–16:00, mean = 26.9 °C, R² = 0.98, RMSE = 0.76 °C). There was a small positive bias during the hours prior to sunrise on the cooler DOYs (i.e., DOY 100–300) and a marginal negative bias during the daytime hours throughout the year, but the overall magnitude of mean error was very small (< 0.03 °C; Fig. 5).

Fig. 4

Fig. 5

Pooled statistics across all stations and hours show strong performance for air temperature climatologies interpolated for seasonal (R² = 0.98 to 0.99, RMSE = 0.66 °C to 0.86 °C, Bias = -0.01 °C to -0.00 °C) and annual (R² = 0.99, RMSE = 0.75 °C, Bias = -0.00 °C) aggregation periods (Table 2). Cross-validation performance remained strong when pooled annually for each station (Fig. 6). Statistical performance was best at a daily time-step (Fig. 6), where observations and predictions were aggregated to a daily mean value prior to calculating each metric (R² = 0.99 ± 0.02, RMSE = 0.43 °C ± 0.27 °C, Bias = -0.00 °C ± 0.43 °C). Performance when hourly observations were diurnally maximum (R² = 0.99 ± 0.05, RMSE = 0.50 °C ± 0.39 °C, Bias = 0.00 °C ± 0.52 °C) was typically more reliable than at the diurnal minimum (R² = 0.99 ± 0.03, RMSE = 0.75 °C ± 0.52 °C, Bias = -0.02 °C ± 0.82 °C) across all stations (Fig. 6). Hourly statistics were within the range of performance at diurnal minima/maxima (R² = 0.99 ± 0.05, RMSE = 0.66 °C ± 0.35 °C, Bias = -0.00 °C ± 0.43 °C; Fig. 6).
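The four per-station views reported above (hourly, daily mean, hour of observed diurnal maximum, hour of observed diurnal minimum) can be sketched as below. This is an illustration under stated assumptions: bias is taken as prediction minus observation, each day is a complete list of hourly values, and the function names are hypothetical.

```python
import math

def error_stats(obs, pred):
    """RMSE and bias (prediction minus observation) for paired values."""
    errors = [p - o for o, p in zip(obs, pred)]
    n = len(errors)
    return {"rmse": math.sqrt(sum(e * e for e in errors) / n),
            "bias": sum(errors) / n}

def flatten(days):
    return [value for day in days for value in day]

def daily_means(days):
    return [sum(day) / len(day) for day in days]

def pick(days, indices):
    return [day[i] for day, i in zip(days, indices)]

def station_summaries(obs_days, pred_days):
    """obs_days and pred_days hold one list of hourly values per day, same shape."""
    # Hour index of the observed maximum and minimum within each day.
    i_max = [max(range(len(d)), key=d.__getitem__) for d in obs_days]
    i_min = [min(range(len(d)), key=d.__getitem__) for d in obs_days]
    return {
        "hourly": error_stats(flatten(obs_days), flatten(pred_days)),
        # Aggregate to daily means before computing the metrics.
        "daily_mean": error_stats(daily_means(obs_days), daily_means(pred_days)),
        "at_obs_max": error_stats(pick(obs_days, i_max), pick(pred_days, i_max)),
        "at_obs_min": error_stats(pick(obs_days, i_min), pick(pred_days, i_min)),
    }

# Illustrative values only (degrees C), four "hours" per day for brevity.
obs = [[10.0, 15.0, 20.0, 12.0], [8.0, 14.0, 18.0, 11.0]]
pred = [[11.0, 15.0, 19.0, 12.0], [9.0, 14.0, 17.0, 11.0]]
summary = station_summaries(obs, pred)
```

In this toy case, errors of opposite sign at the extremes cancel in the daily mean, which is one reason daily aggregates can score better than hourly values.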

Table 2 Pooled seasonal and annual cross-validation statistics for hourly air temperature interpolated across Australia


Fig. 6

Seasonally, DSI of hourly air temperature achieved marginally higher R² (up to 0.01) and lower RMSE (0.03 °C to 0.04 °C) than CASI (Table 2). Time-series of cross-validation statistics pooled by month show an increase in the performance of DSI over time (Fig. 7a, c, e; mean R² = 0.94 / 0.96, RMSE = 1.67 °C / 1.52 °C for 2000–2003 / 2015–2018). DSI consistently performed better than CASI after January 2004 (Fig. 7b, d, f) when the number of hourly observations reached ~ 400 per hour (Figure S1), though overall the differences were small (mean Δ R² < 0.01, Δ RMSE < 0.08 °C). Statistical performance of DSI tended to decrease with lower observation density (Fig. 8a, c, e; mean R² = 0.94 / 0.88, RMSE = 1.42 °C / 1.91 °C, n = 497 / 103, where mean distance to the closest 10 stations is ≤ 200 km / > 200 km), and CASI trended towards slightly better performance on average where observation density was low (mean Δ R² = 0.01, Δ RMSE = -0.09 °C, where mean distance to the closest 10 stations is > 200 km). DSI outperformed CASI at weather stations located at elevations exceeding 800 m (Fig. 8b, d, f; n = 26, mean Δ R² = 0.04, Δ RMSE = -0.40 °C). The spatial distribution of error for each station using DSI, and comparison with CASI, are illustrated in Figure S9.
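The observation density measure used above (mean distance to the closest 10 stations) can be sketched as follows. The distance metric is not stated in this section, so great-circle (haversine) distance on a spherical Earth is assumed here, as is the exclusion of the target station from its own neighbours; the function names are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; spherical approximation

def haversine_km(lon1, lat1, lon2, lat2):
    """Great-circle distance in km between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def mean_knn_distance(target, stations, k=10):
    """Mean distance (km) from target (lon, lat) to its k nearest other stations."""
    dists = sorted(haversine_km(*target, *s) for s in stations if s != target)
    nearest = dists[:k]
    if not nearest:
        raise ValueError("no neighbouring stations supplied")
    return sum(nearest) / len(nearest)

def is_sparse(target, stations, threshold_km=200.0, k=10):
    """Flag a location as low density using the 200 km threshold quoted above."""
    return mean_knn_distance(target, stations, k) > threshold_km

# Illustrative coordinates only: stations spaced one degree apart on the equator.
stations = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
density = mean_knn_distance((0.0, 0.0), stations, k=2)
```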

Fig. 7

Fig. 8

DSI of hourly air temperature achieved lower R² and higher RMSE than the climatologies (Δ R² = -0.03 and Δ RMSE = 0.81 °C on an annual basis, respectively; Table 2); however, there were similar trends in temporal patterns (Fig. 9) and distribution of per-station performance statistics (Fig. 10). The overall magnitude of hourly bias was negligible when pooled by time-of-day and day-of-year (< |0.03| °C; Fig. 9d). As with the climatologies, statistical performance per station was best when aggregating air temperature to daily values (R² = 0.96 ± 0.06, RMSE = 0.91 °C ± 0.35 °C, Bias = -0.01 °C ± 0.47 °C). Performance when hourly observations were diurnally maximum (R² = 0.92 ± 0.10, RMSE = 1.25 °C ± 0.52 °C, Bias = -0.18 °C ± 0.56 °C) was also better than the diurnal minimum (R² = 0.90 ± 0.10, RMSE = 1.72 °C ± 0.70 °C, Bias = 0.21 °C ± 0.96 °C). Statistical performance was strong for individual stations at an hourly time-step (R² = 0.93 ± 0.08, RMSE = 1.51 °C ± 0.42 °C, Bias = -0.01 °C ± 0.47 °C).

Fig. 9

Fig. 10

Independent validation showed that hourly DSI of air temperature was reliable at most locations when compared against independent observations from the 24 OzFlux and 4 CosmOz stations (R² = 0.92 ± 0.07, RMSE = 1.78 °C ± 0.62 °C, Bias = -0.03 °C ± 0.93 °C; Table 3; see Fig. 1 for station locations). Validation measurements taken above 13.8 m in height (n = 8) achieved lower R² (0.88 ± 0.09) and higher RMSE (1.88 °C ± 0.33 °C) on average than those at lower heights (R² = 0.94 ± 0.06, RMSE = 1.74 °C ± 0.71 °C; n = 20). Cape Tribulation in far north Queensland (145.38 °E, 16.11 °S) gave the worst validation performance (R² = 0.68, RMSE = 2.41 °C, Bias = 0.83 °C); however, the difference in measurement height at this site was large (43.5 m). Seasonal summaries of validation performance for each station (Table S1) show the weakest performance from June to August (in the austral winter), consistent with the cross-validation statistics (Table 2).

Table 3 Validation error statistics for direct spatial interpolation (DSI) of hourly air temperature compared with field-based observations at 28 stations between 01/Jan/2015 and 14/Nov/2019. Note the “Δ height ^a” column refers to the difference in instrumentation height relative to the calibration stations used in this study (1.5 m). The terms “height” and “elevation” are used as per McVicar and Körner (2013)


Validation statistics across each station (Fig. 11) were best when first aggregating air temperature to daily values (R² = 0.95 ± 0.08, RMSE = 0.99 °C ± 0.67 °C, Bias = -0.03 °C ± 0.93 °C). Spatial interpolations frequently demonstrated negative bias with poorer performance at the time of minimum temperature (R² = 0.90 ± 0.09, RMSE = 1.61 °C ± 0.51 °C, Bias = -0.31 °C ± 0.97 °C), and positive bias with better performance at the hour of maximum temperature (R² = 0.94 ± 0.08, RMSE = 1.50 °C ± 1.13 °C, Bias = 0.54 °C ± 1.52 °C). As with the cross-validation results, hourly interpolations were associated with low bias and intermediate performance (R² = 0.92 ± 0.07, RMSE = 1.78 °C ± 0.62 °C, Bias = -0.03 °C ± 0.93 °C) in comparison to hourly observations at the diurnal maximum and minimum.

Fig. 11

4.2 Comparing hourly spatial interpolation with temporal interpolation of daily air temperature

Spatial interpolation of daily minimum and maximum air temperature, a key input to the PL81 model, showed reliable cross-validation performance on an annual basis (Table S2; R² = 0.94 / 0.98, RMSE = 1.81 °C / 1.29 °C, Bias = -0.00 °C / 0.00 °C). Optimised selection of the d parameter (used to transform the coastal distance index) showed little improvement relative to a fixed value (Figure S10c, d; Δ RMSE < 0.5%), and high variability across DOY (Figure S10, f). The d parameter was therefore fixed at 11 and 23 for minimum and maximum temperature, respectively (Figure S10a, b).

Empirical time-of-day interpolation using PL81 typically performed best near solar noon (i.e., 12:00–14:00 LST, mean = 23.21 °C, R² = 0.94, RMSE = 2.19 °C, Bias = 1.37 °C) and in the early hours of the morning (0:00–05:00 LST, mean = 15.45 °C, R² = 0.89, RMSE = 2.54 °C, Bias = -1.42 °C) across all days-of-year (Fig. 12). PL81 showed strong positive biases during the day (i.e., 07:00–14:00 LST, mean = 20.73 °C, R² = 0.92, RMSE = 2.62 °C, Bias = 1.82 °C) and strong negative biases in the evening and early morning (i.e., 18:00–04:00 LST, mean = 17.06 °C, R² = 0.89, RMSE = 2.77 °C, Bias = -1.74 °C). Cross-validated DSI consistently performed better than PL81 (i.e., when subtracting statistics for PL81 from those for DSI), particularly during post-sunrise warming (07:00–11:00 LST, mean Δ R² = 0.03, Δ RMSE = -1.42 °C) and afternoon cooling (16:00–19:00 LST, mean Δ R² = 0.07, Δ RMSE = -1.36 °C) cycles (Fig. 12).

Fig. 12

PL81 performance across each of the calibration stations (n = 566; Fig. 13) was limited in the hours following solar noon (i.e., solar noon + 1.5 h, R² = 0.89 ± 0.13, RMSE = 2.25 °C ± 0.54 °C, Bias = 1.61 °C ± 0.76 °C) and prior to sunrise (sunrise − 1 h, R² = 0.85 ± 0.11, RMSE = 2.14 °C ± 0.58 °C, Bias = -0.64 °C ± 1.01 °C), despite the temporal proximity to the timing of minimum and maximum air temperature (Figure S5, Figure S6). Hourly PL81 predictions (R² = 0.83 ± 0.13, RMSE = 2.68 °C ± 0.69 °C, Bias = -0.21 °C ± 0.95 °C) performed less effectively than the corresponding DSI statistics (Δ R² = -0.10 ± 0.07, Δ RMSE = -1.16 °C ± 0.54 °C, Δ Abs. Bias = -0.16 °C ± 0.47 °C) for calibration stations. Statistical performance of PL81 at each of the validation stations (Table 4; mean R² = 0.85 ± 0.10, RMSE = 2.80 °C ± 0.65 °C, Bias = -0.00 °C ± 0.98 °C), with the exception of bias, was consistently better with DSI (mean Δ R² = 0.07 ± 0.06, Δ RMSE = -1.02 °C ± 0.43 °C, Δ Abs. Bias = -0.04 °C ± 0.27 °C).

Fig. 13

Table 4 Validation error statistics for temporal interpolation of hourly air temperature using PL81 when compared with independent field-based observations at 28 stations between 01/Jan/2015 and 14/Nov/2019, and the difference in performance when compared against direct spatial interpolation (DSI). Differences between DSI and PL81 (denoted by Δ) are calculated by subtracting PL81 statistics from DSI statistics (i.e., DSI minus PL81, where positive values indicate DSI performs better for Δ R², and negative values indicate DSI performs better for Δ RMSE and Δ |Bias|)


4.3 Comparing hourly spatial interpolation with reanalysis products

Comparisons of mean air temperature, aggregated to seasonal and annual summaries at each of four times-of-day (i.e., 03:00, 09:00, 15:00 and 21:00 UTC + 9), showed strong agreement with the coarsened DSI surfaces and both BARRA-R (Table 5; R² ≥ 0.93, RMSD ≤ 1.41 °C, Bias ≤ |0.92| °C) and ERA5-Land (Table 6; R² ≥ 0.94, RMSD ≤ 1.62 °C, Bias ≤ |1.29| °C). Deviations were typically lowest during the daylight hours (09:00–15:00 UTC + 9) for both reanalyses. DSI showed positive biases relative to BARRA-R that were most pronounced during the austral warmer months (September – February; Table 5) after sunset (21:00 UTC + 9, Bias = 0.55 °C to 0.92 °C) and in the early hours of the morning (03:00 UTC + 9, Bias = 0.55 °C to 0.78 °C). When compared against ERA5-Land, DSI showed negative biases most pronounced during the austral cooler months (March to August; Table 6), also after sunset (21:00 UTC + 9, Bias = -1.29 °C to -1.26 °C) and in the early hours of the morning (03:00 UTC + 9, Bias = -1.24 °C to -1.15 °C).

Table 5 Statistical comparison of mean air temperature across all available grids (01/Jan/2015 to 31/Dec/2018) for coarsened DSI surfaces (0.11°) assessed against BARRA-R during each season at four times of the day (n = 57,818 pixels). Bias is calculated by subtracting BARRA-R from DSI (i.e., DSI minus BARRA-R, values > 0 °C indicate higher air temperatures for DSI)


Table 6 Statistical comparison of mean air temperature across all available grids (01/Jan/2015 to 31/Dec/2018) for coarsened DSI surfaces (0.10°) assessed against ERA5-Land for each season at four times of the day (n = 69,286 pixels). Bias is calculated by subtracting ERA5-Land from DSI (i.e., DSI minus ERA5-Land, values > 0 °C indicate higher air temperatures for DSI)


The spatial distribution of comparative statistics between coarsened DSI assessed against both BARRA-R and ERA5-Land, calculated on a pixel-by-pixel basis through time and by season, are illustrated in Figs. 14 and 15, respectively. Spatially autocorrelated patterns in bias and statistical deviations in many coastal regions are present in each comparison. DSI resulted in higher air temperature relative to BARRA-R (0.14 °C to 0.38 °C) and lower air temperature relative to ERA5-Land (-0.44 °C to -0.18 °C) per season across all pixels.

Fig. 14

Fig. 15

Statistics comparing the agreement between calibration and validation observations, and the spatial interpolations, BARRA-R and ERA5-Land are illustrated in Fig. 16. Spatially interpolated data showed a stronger fit to the observations used in model calibration (for those that could be compared; n = 424 stations; see Figure S11) on average than BARRA-R (i.e., when subtracting statistics for BARRA-R from those for DSI, Δ R² = 0.05 ± 0.05, Δ RMSE = -1.06 ± 0.70 °C, Δ Abs. Bias = -0.38 ± 0.70 °C) and ERA5-Land (Δ R² = 0.07 ± 0.04, Δ RMSE = -1.22 °C ± 0.57 °C, Δ Abs. Bias = -0.41 °C ± 0.59 °C). Marginal improvements were found for the validation observations (n = 28 stations) when subtracting performance statistics for BARRA-R (Δ R² = 0.01 ± 0.04, Δ RMSE = -0.27 °C ± 0.45 °C, Δ Abs. Bias = -0.13 °C ± 0.47 °C) and ERA5-Land (Δ R² = 0.00 ± 0.05, Δ RMSE = -0.12 °C ± 0.54 °C, Δ Abs. Bias = -0.23 °C ± 0.47 °C) from DSI. The spatial distribution of pooled error statistics for BARRA-R and ERA5-Land at each of the calibration and validation sites are mapped in Figures S12 and S13. The differences in RMSE between DSI and both BARRA-R and ERA5-Land are reported seasonally at four times-of-day (i.e., 03:00, 09:00, 15:00 and 21:00 UTC + 9) in Figures S14 and S15.

Fig. 16

Statistical performance of cross-validated DSI was higher overall than both BARRA-R (Δ R² = 0.02 ± 0.05, Δ RMSE = -0.37 °C ± 0.70 °C, Δ Abs. Bias = -0.23 °C ± 0.78 °C) and ERA5-Land (Δ R² = 0.03 ± 0.03, Δ RMSE = -0.53 °C ± 0.60 °C, Δ Abs. Bias = -0.25 °C ± 0.70 °C) at the Bureau of Meteorology calibration stations (Fig. 17; n = 424). High elevation stations (> 800 m, n = 24) validated poorly overall (Figure S16) and in comparison to cross-validated predictions from DSI (Fig. 17) for both BARRA-R (Δ R² = -0.05 ± 0.06, Δ RMSE = 1.69 °C ± 1.84 °C, Δ Abs. Bias = 1.78 °C ± 2.06 °C) and ERA5-Land (Δ R² = -0.05 ± 0.04, Δ RMSE = 1.69 °C ± 1.49 °C, Δ Abs. Bias = 1.82 °C ± 1.74 °C). Statistical performance of reanalysis products did not show clear improvements in regions of lower observation density (i.e., mean distance to closest 10 stations > 200 km) when compared to cross-validated DSI (Fig. 17; BARRA-R, Δ R² = -0.01 ± 0.06, Δ RMSE = 0.17 °C ± 0.57 °C, Δ Abs. Bias = 0.09 °C ± 0.54 °C; ERA5-Land, Δ R² = -0.02 ± 0.03, Δ RMSD = 0.16 °C ± 0.45 °C, Δ Abs. Bias = -0.01 °C ± 0.43 °C). The relationship between statistical performance and observation density at the validation sites was variable (Fig. 18) and sample sizes were limited when the mean distance to the closest 10 stations was > 200 km (n = 4).

Fig. 17

Fig. 18

5 Discussion

5.1 Spatial interpolation of hourly air temperature

Direct spatial interpolation (DSI), using an optimal number of observation points as knots (Hutchinson 1995; Hutchinson et al. 2009; Johnson et al. 2016; Price et al. 2000), was most effective in generating high quality predictions of hourly air temperature across Australia. Our pooled hourly cross-validation results (R² = 0.96, RMSE = 1.56 °C) compared well against those reported by Webb and Minasny (2020; R² = 0.89 to 0.91, RMSE = 1.6 °C to 1.7 °C), who spatially interpolated air temperature across Australia at 30 min time-steps between 01/Jan/2019 and 31/Dec/2020. Our study built upon this previous research, as we: (i) evaluated a longer analysis period (i.e., 01/Jan/2000 to 31/Dec/2019); (ii) compared both CASI and DSI to evaluate the relative strengths of each approach (e.g. Table 2; Figs. 7 and 8; Figure S9); (iii) designed the study for the development of stable long-term climatologies and multi-year historical datasets; (iv) validated DSI with independent station observations; and (v) used a time-varying coastal distance index to represent the effects of continentality. Coastal weather can have large impacts on the quality of spatial climate data products (Daly 2006; Daly et al. 2002, 2003; Hutchinson et al. 2021; Jones et al. 2009), and coastal proximity metrics have been reported to increase statistical performance of monthly mean minimum and maximum air temperature interpolation by up to 25% (Hutchinson et al. 2021).

Sea breeze systems, which develop when temperature (and associated pressure) gradients at the land-water interface cause cool air over the ocean (or water bodies) to move inland, can exert a strong influence on coastal weather (Abbs and Physick 1992; Miller et al. 2003; Simpson 1994). They typically begin early in the day, when air temperatures over land exceed those over water, and can persist well into the night under suitable conditions (Miller et al. 2003). Sea breezes can bring cool, moist air up to several hundred km inland (Abbs and Physick 1992; Clarke 1955; Simpson et al. 1977), and tend to be stronger in sub-tropical and tropical climates than in mid-latitudes, and in the afternoon and evening during warm months (Abbs and Physick 1992; Azorin-Molina et al. 2011; Miller et al. 2003). These systems can be difficult to model due to interacting factors such as synoptic scale wind and cold fronts, coastline morphology, and topographic features (Abbs and Physick 1992; Azorin-Molina and Chen 2009; Miller et al. 2003).

Temporal patterns in the coastal distance indices developed herein reflect the timing and expected behaviour of sea breeze systems, with the strongest inland propagation of cool air inferred (i.e., with high values of d) during the afternoon and evening in spring and summer. Interpolation performance improved most (up to 22.4% for climatologies and 7.5% for DSI) in late spring, when the coastal distance index decays very slowly with distance from the coast (Fig. 3; Figure S4) and air temperature decreases over time more rapidly than further inland (see Figure S17). These same periods correspond to times when Webb and Minasny (2020) reported large errors in coastal regions, indicating that coastal distance metrics often play a key role in improving interpolation performance for meteorological variables. The variability in d found when calibrating the coastal distance index using DSI (e.g., Fig. 3d) reflects the difficulty in predicting sea breeze systems (Miller et al. 2003). Calibrating the coastal distance index with long-term stable climatologies was essential in identifying a generalizable temporal structure that was performant even when applied to DSI (e.g. see Fig. 3 and Figure S10). The low optimal values of d (restricting coastal influences) in the morning hours are potentially associated with the convergence of land and sea breezes and provided a marginal but consistent improvement in DSI (Fig. 3f). Our time-varying coastal distance index provides a parsimonious method for capturing sea breeze dynamics in coastal regions, and further improvements (e.g., varying d in both space and time) may be possible with further research.

Hourly air temperature performed best overall with DSI; however, CASI can improve stability for interpolating some climate time-series (Hutchinson et al. 2021; Jeffrey et al. 2001), and enables blending of different datasets (Funk et al. 2015; Harris et al. 2020; Karger and Zimmermann 2018) or interpolation methods (Jones et al. 2009; Raupach et al. 2012). We found that CASI tended to perform better in data sparse times (Fig. 7) and locations (Fig. 8); however, this finding depends on the specific methods applied (e.g., independent spline variables used for modelling anomalies), and the density and spatial autocorrelation structure of the observations (Hofstra et al. 2008, 2010; Jeffrey et al. 2001). A considerable limitation of CASI however is that model responses to environmental gradients (e.g., environmental lapse rates with elevation, influence of coastal proximity) are fixed according to the climatology (e.g., identical lapse rates at the same time of year in a specific location) when interpolating anomalies using positional coordinates only. While we interpolated anomalies as a function of positional coordinates only, the large difference in error between DSI and CASI for high elevation stations suggests that our anomaly interpolation did not adequately represent variability of environmental lapse rates for air temperature, and may benefit further from incorporating additional independent spline variables (i.e., elevation, coastal distance indices when sea breezes are more likely; Hutchinson et al. 2021). This limitation can be addressed with DSI, but can come at the cost of reduced model stability and increased sensitivity to poor quality data (e.g., Jeffrey et al. 2001).

We found error was more pronounced during the coolest parts of the day during winter. This is consistent with our daily air temperature extrema analyses (Table S2) and many interpolation studies, where minimum temperature performs poorly in comparison to maximum temperature (Hutchinson et al. 2009; Jeffrey et al. 2001; Jones et al. 2009; Mark et al. 2002; Webb and Minasny 2020). There are several potential reasons why hourly air temperature was least performant during night-time in winter. These include: (i) lower spatial autocorrelation ranges for minimum temperature (Jones and Trewin 2000); (ii) the occurrence of temperature inversions (e.g., driven by katabatic winds and cold air pools) that commonly develop under clear, calm conditions and can confound air temperature lapse rate estimates (Stewart et al. 2017; Trewin 2005; Whiteman et al. 1999); (iii) the latent heat of condensation (Hutchinson et al. 2009); and (iv) associated humidity dynamics where saturated air lowers the (wet) adiabatic lapse rate. While it is difficult to attribute the change in performance to any one factor, the patterns identified in our cross-validation statistics provide insights into specific times when further improvements in spatial interpolation performance may be achieved.

Independent validation using the OzFlux and CosmOz stations further supported the use of spatial interpolation as a viable option for generating air temperature surfaces at sub-diurnal time-steps. Overall, the statistical performance at these validation stations was strong, despite differing site conditions (i.e., different types of forested and agricultural ecosystems; Beringer et al. 2016; Hawdon et al. 2014) and the differing heights at which observations were made (Table 3). This indicates that DSI can also play a role in gap-filling sub-diurnal air temperature at field sites. The resultant hourly 1 km near-surface air temperature grids can be used in numerous applications, such as being coupled with Himawari geostationary remotely sensed imagery (Bessho et al. 2016) to monitor sub-diurnal processes such as cloud presence and cloud type (Qin et al. 2019), incoming shortwave radiation (Qin et al. 2021), land surface temperature (Yu et al. 2024) and vegetation dynamics. With Himawari being launched in July 2015 there are adequate numbers of hourly stations to support the generation of hourly air temperature grids for Australia; see Table S3.

5.2 Comparing hourly spatial interpolation with temporal interpolation of daily air temperature

Temporal interpolation of daily air temperature consistently performed poorly in comparison to DSI. The differences in statistical performance between PL81 and DSI were lowest around the time of sunrise and after solar noon (Fig. 12b, d), when minimum and maximum air temperatures typically occur. This is unsurprising given that cross-validated predictions of daily minimum and maximum air temperatures performed well statistically (see Table S2) when compared against previous studies for Australia (RMSE = 1.7 °C to 2.0 °C and RMSE = 1.2 °C to 1.7 °C, respectively; Jeffrey et al. 2001; Jones et al. 2009) and were used as a key input to the PL81 model. It also demonstrates that the empirical parameterisation of PL81 was effective for accurately estimating the timing of minimum and maximum temperature (see Figure S5, Figure S6). PL81 performed poorly by comparison at all other times, showing strong positive biases during the day and strong negative biases overnight (Fig. 12f). This pattern of bias indicates that PL81 does not accurately represent the rate of sub-diurnal temperature change. While further improvements may be possible by tuning the exponential decay parameter, this would only reduce error overnight. The errors accumulated due to the rate of air temperature change given by the truncated sine curve and exponential decay curve were a key source of poor model performance. Our findings are supported by Reicosky et al. (1989), who noted that temporal interpolation of daily air temperature extrema is useful for many (not all) applications, but it is unlikely to be appropriate when accurate air temperatures are required for specific times.

These findings were expected given at least two limitations of PL81: (i) the inability of daily air temperature extrema to represent natural variability in sub-diurnal air temperature; and (ii) functional form (i.e., truncated sine curve and exponential decay curve) and model parameterisation. The former limitation was addressed with spatial interpolation, where frontal systems and sub-diurnal variability can be represented by geostatistical modelling using hourly observations. The latter limitation was mitigated in part by station-specific calibration of the PL81 parameters. Here we have empirically determined the time-lag between solar noon and maximum air temperature (i.e., PL81 parameter ‘a’, Figure S5), and sunrise and minimum air temperature (i.e., PL81 parameter ‘c’, Figure S6), but recognize that some performance improvement may be possible with station-specific calibration of the exponential decay rate (i.e., PL81 parameter ‘b’). Given the inability of PL81 to characterise frontal systems, and the truncated sine curve to accurately represent post-sunrise warming, this would not otherwise alter our conclusions. The climatologies produced as part of this study can, however, provide opportunities to develop and parameterise temporal interpolation models.
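To make the functional form discussed above concrete, the sketch below builds a diurnal curve from daily extrema using a truncated sine by day and an exponential decay by night. It is a simplified illustration in the spirit of PL81, not the published PL81 equations and not the authors' implementation: the parameterisation is chosen so that `a`, `b` and `c` carry the meanings given in the text (lag of maximum after solar noon, night-time decay rate, lag of minimum after sunrise), the same daily minimum is assumed for the preceding and following night, and all numeric values in the usage lines are hypothetical.

```python
import math

def diurnal_curve(hour, tmin, tmax, sunrise, sunset, a, b, c):
    """Temperature at a local hour (0-24) from daily extrema.

    Daytime: truncated sine rising from tmin at (sunrise + c) to tmax at
    (solar noon + a), then falling until sunset.
    Night: exponential decay from the sunset temperature towards tmin.
    """
    noon = 0.5 * (sunrise + sunset)
    t_min = sunrise + c   # assumed time of minimum temperature
    t_max = noon + a      # assumed time of maximum temperature

    def daytime(t):
        phase = 0.5 * math.pi * (t - t_min) / (t_max - t_min)
        return tmin + (tmax - tmin) * math.sin(phase)

    if t_min <= hour <= sunset:
        return daytime(hour)

    temp_at_sunset = daytime(sunset)
    night_length = 24.0 - (sunset - t_min)
    # Hours elapsed since the most recent sunset (wraps past midnight).
    n = hour - sunset if hour > sunset else hour + 24.0 - sunset
    return tmin + (temp_at_sunset - tmin) * math.exp(-b * n / night_length)

# Hypothetical inputs: extrema in degrees C, times in local hours.
params = dict(tmin=10.0, tmax=30.0, sunrise=6.0, sunset=18.0, a=2.0, b=2.0, c=0.0)
curve = [diurnal_curve(h, **params) for h in range(24)]
```

Because the shape between the two extrema is fixed by the curve, any real departure from it (for example a frontal passage, or faster post-sunrise warming) appears as error at intermediate hours, which is consistent with the limitation described above.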

5.3 Comparing hourly spatial interpolation with reanalysis products

Overall, there was good agreement between spatially interpolated air temperature and both reanalysis products. Except for bias, pooled statistical metrics typically showed lower errors relative to the point-based cross-validation; however, these analyses were conducted at coarser spatial scales and for only a subset of our analysis period (i.e., 01/Jan/2015 to 31/Dec/2018). Interpolated surfaces were slightly warmer on average (i.e., ≤ 0.5 °C) than BARRA-R in the evening and early hours of the morning (Table 5), in contrast to previous analyses (Fig. 5 of Su et al. 2019) showing BARRA-R produced marginally warmer minimum air temperatures on average (i.e., < 0.2 °C) than interpolated daily minimum air temperature. While direct comparisons are difficult to make, these differences may be in part explained by: (i) differences in the analysis period; (ii) inability of observations at regular time-steps to capture daily air temperature extrema; and (iii) the (6-hourly) data assimilation mechanisms used in BARRA-R. The interpolated surfaces were on average cooler than ERA5-Land (-0.44 °C to -0.18 °C; see Table 6; Fig. 15), consistent with previous analyses of ERA5 products (Dee et al. 2011; Hersbach et al. 2020) that showed positive biases relative to Australian climate products (Su et al. 2021; Su et al. 2019). Spatial analyses showed autocorrelated biases that are expected given the differences between modelling techniques. For example, interpolated surfaces are driven by digital elevation models, whereas atmospheric models are sensitive to land characteristics and assimilate many data sources. This was clearly demonstrated in ERA5-Land, where the higher RMSD across Australia’s largest salt lakes (Fig. 15), was otherwise absent from BARRA-R (Fig. 14) where they were treated as a bare soil surface (Su et al. 2019).

Point-based analyses using both the calibration and independent validation datasets (Fig. 16) showed that the DSI surfaces better represented the observations used for model calibration than either BARRA-R or ERA5-Land. Overall, this finding was the same for validation stations, although there was greater variability across locations. Each gridded product was evaluated at its native resolution, and therefore this analysis represented how well each product reproduced ground-based measurements in the absence of any downscaling. Several high elevation stations were among the worst performing when compared against both reanalyses (Fig. 17); however, this is likely a result of the comparatively coarse spatial resolution of BARRA-R (0.11°) and ERA5-Land (0.10°). BARRA-C (~ 1.5 km resolution), a regionally downscaled version of BARRA-R, has been shown to better represent these observations for stations at elevations above 500 m and/or proximal to the coast (i.e., within ~ 150 km). We did not perform a direct comparison for two key reasons: (i) BARRA-C does not cover our whole study extent; and (ii) the reported magnitude of improvement relative to BARRA-R (Fig. 2 of Su et al. 2021) is unlikely to substantially impact upon our findings.

Despite the tendency for spatial interpolation performance to decrease with decreasing observation density (e.g., Fig. 8), we did not find a clear relationship with observation density when comparing the statistical performance of BARRA-R and ERA5-Land with (cross-validated) DSI (see Fig. 17). This suggests that DSI still performs well for interpolating hourly air temperature in sparser regions of the network, and part of the trend towards decreased performance may be an artefact of very high-quality predictions when dense observations are available. Similar patterns are found in the validation analyses, where relative performance of DSI increases with station density (mean distance to closest 10 stations < 100 km) and then levels off; however, the number of samples in low density regions is limited. Our results demonstrate that spatial interpolation can provide substantial accuracy advantages in situations where hourly, high spatial resolution air temperature data are required to support analyses (Fig. 16). Overall, spatial interpolation remains a parsimonious and computationally efficient method for accurately quantifying sub-diurnal near-surface air temperature dynamics.

6 Conclusion

Direct spatial interpolation (with an appropriate knot parameter selection) was effective for modelling near-surface hourly air temperature spatio-temporal dynamics over a continental scale (i.e., Australia). This was demonstrated by strong statistical performance achieved by cross-validation, at independent validation stations, against temporal interpolation techniques, in comparison with two atmospheric reanalyses, and when evaluated against similar interpolation studies. The methods developed herein: (i) improved model performance with time-varying coastal distance indices; (ii) avoided the limitations of temporal interpolation; (iii) were efficient in comparison to computationally expensive and data intensive reanalyses; and (iv) maximised preservation of information contained in the observational record as evidenced by the point-based analyses. Future work could use more complex models (e.g., machine learning) to incorporate land surface processes into spatially interpolated datasets, and/or downscale existing reanalysis products for further study. The density and frequency of observations that are currently available enable the development of historical and future hourly air temperature surfaces, which will support numerous scientific applications.

Data availability

The air temperature grids produced as part of this study (i.e., both hourly direct spatial interpolation, and hourly climatologies) are openly available on the CSIRO Data Access Portal (https://data.csiro.au/collection/csiro:60405). This collection also includes an animation of the hourly air temperature climatologies, and 1,752 hourly station (n = 505) climatologies (1990–2019) to enable future comparative analysis of statistical performance.

Change history

03 September 2024
A Correction to this paper has been published: https://doi.org/10.1007/s00382-024-07420-x

References

Abbs DJ, Physick WL (1992) Sea-breeze observations and modelling: a review. Aust Meteorol Mag 41:7–19
Google Scholar
Australian Bureau of Meteorology (2023) Annual Report 2022-23, 276 pp
Azorin-Molina C, Chen D (2009) A climatological study of the influence of synoptic-scale flows on sea breeze evolution in the Bay of Alicante (Spain). Theoret Appl Climatol 96(3):249–260. https://doi.org/10.1007/s00704-008-0028-2
Article Google Scholar
Azorin-Molina C, Chen D, Tijm S, Baldi M (2011) A multi-year study of sea breezes in a Mediterranean coastal site: Alicante (Spain). Int J Climatol 31(3):468–486. https://doi.org/10.1002/joc.2064
Article Google Scholar
Beringer J, Coauthors (2016) An introduction to the Australian and New Zealand flux tower network – OzFlux. Biogeosciences 13(21):5895–5916. https://doi.org/10.5194/bg-13-5895-2016
Article CAS Google Scholar
Bessho K, Coauthors (2016) An introduction to Himawari-8/9— Japan’s New-Generation Geostationary Meteorological satellites. J Meteorological Soc Japan Ser II 94(2):151–183. https://doi.org/10.2151/jmsj.2016-009
Article Google Scholar
Brown T, Mills G, Harris S, Podnar D, Reinbold H, Fearon M (2016) A bias corrected WRF mesoscale fire weather dataset for Victoria, Australia 1972–2012. J South Hemisphere Earth Syst Sci 66(3):281–313
Article Google Scholar
Casellas E, Bech J, Veciana R, Miró JR, Sairouni A, Pineda N (2020) A meteorological analysis interpolation scheme for high spatial-temporal resolution in complex terrain. Atmos Res 246:1–11. https://doi.org/10.1016/j.atmosres.2020.105103
Article Google Scholar
Cesaraccio C, Spano D, Duce P, Snyder RL (2001) An improved model for determining degree-day values from daily temperature data. Int J Biometeorol 45(4):161–169. https://doi.org/10.1007/s004840100104
Article CAS Google Scholar
Chen F, Yang X, Ji C, Li Y, Deng F, Dong M (2019) Establishment and assessment of hourly high-resolution gridded air temperature data sets in Zhejiang, China. Meteorol Appl 26(3):396–408. https://doi.org/10.1002/met.1770
Article Google Scholar
Chung U, Yun JI (2004) Solar irradiance-corrected spatial interpolation of hourly temperature in complex terrain. Agric for Meteorol 126(1):129–139. https://doi.org/10.1016/j.agrformet.2004.06.006
Article Google Scholar
Clarke RH (1955) Some observations and comments on the sea breeze. Aust Meteorol Mag 11:47–68
Google Scholar
Cornes RC, van der Schrier G, van den Besselaar EJM, Jones PD (2018) An Ensemble Version of the E-OBS Temperature and Precipitation Data Sets. J Geophys Research: Atmos 123(17):9391–9409. https://doi.org/10.1029/2017JD028200
Article Google Scholar
Dai A (2023) The diurnal cycle from observations and ERA5 in surface pressure, temperature, humidity, and winds. Clim Dyn 61(5):2965–2990. https://doi.org/10.1007/s00382-023-06721-x
Article Google Scholar
Daly C (2006) Guidelines for assessing the suitability of spatial climate data sets. Int J Climatol 26(6):707–721. https://doi.org/10.1002/joc.1322
Article Google Scholar
Daly C, Gibson W, Taylor PG, Johnson HG, L., and, Pasteris P (2002) A knowledge-based approach to the statistical mapping of climate. Climate Res 22(2):99–113
Article Google Scholar
Daly C, Helmer EH, Quiñones M (2003) Mapping the climate of Puerto Rico, Vieques and Culebra. Int J Climatol 23(11):1359–1381. https://doi.org/10.1002/joc.937
Article Google Scholar
Dee DP, Coauthors (2011) The ERA-Interim reanalysis: configuration and performance of the data assimilation system. Q J R Meteorol Soc 137(656):553–597. https://doi.org/10.1002/qj.828
Funk C, Coauthors (2015) The climate hazards infrared precipitation with stations—a new environmental record for monitoring extremes. Sci Data 2(1):1–21. https://doi.org/10.1038/sdata.2015.66
Gelaro R, Coauthors (2017) The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2). J Clim 30(14):5419–5454. https://doi.org/10.1175/JCLI-D-16-0758.1
Gholamnia M, Alavipanah SK, Boloorani AD, Hamzeh S, Kiavarz M (2019) A new method to model diurnal air temperature cycle. Theoret Appl Climatol 137(1):229–238. https://doi.org/10.1007/s00704-018-2587-1
Article Google Scholar
Guerschman JP, McVicar TR, Vleeshower J, Van Niel TG, Peña-Arancibia JL, Chen Y (2022) Estimating actual evapotranspiration at field-to-continent scales by calibrating the CMRSET algorithm with MODIS, VIIRS, Landsat and Sentinel-2 data. J Hydrol 605:1–18. https://doi.org/10.1016/j.jhydrol.2021.127318
Article Google Scholar
Guo J, Wang X, Xiao C, Liu L, Wang T, Shen C (2022) Evaluation of the temperature downscaling performance of PRECIS to the BCC-CSM2-MR model over China. Clim Dyn 59(3):1143–1159. https://doi.org/10.1007/s00382-022-06177-5
Article Google Scholar
Harris I, Osborn TJ, Jones P, Lister D (2020) Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset. Sci Data 7(1):1–18. https://doi.org/10.1038/s41597-020-0453-3
Article Google Scholar
Hawdon A, McJannet D, Wallace J (2014) Calibration and correction procedures for cosmic-ray neutron soil moisture probes located across Australia. Water Resour Res 50(6):5029–5043. https://doi.org/10.1002/2013WR015138
Article Google Scholar
Hengl T, Nussbaum M, Wright MN, Heuvelink GBM, Gräler B (2018) Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables. PeerJ 6:1–49. https://doi.org/10.7717/peerj.5518
Article Google Scholar
Hersbach H, Coauthors (2020) The ERA5 global reanalysis. Q J R Meteorol Soc 146(730):1999–2049. https://doi.org/10.1002/qj.3803
Hofstra N, Haylock M, New M, Jones P, Frei C (2008) Comparison of six methods for the interpolation of daily, European climate data. J Geophys Research: Atmos 113(D21). https://doi.org/10.1029/2008JD010100
Hofstra N, New M, McSweeney C (2010) The influence of interpolation and station network density on the distributions and trends of climate variables in gridded daily data. Clim Dyn 35(5):841–858. https://doi.org/10.1007/s00382-009-0698-1
Article Google Scholar
Holzworth DP, Coauthors (2014) APSIM – Evolution towards a new generation of agricultural systems simulation. Environ Model Softw 62:327–350. https://doi.org/10.1016/j.envsoft.2014.07.009
Holzworth D, Coauthors (2018) APSIM Next Generation: overcoming challenges in modernising a farming systems model. Environ Model Softw 103:43–51. https://doi.org/10.1016/j.envsoft.2018.02.002
Hopkinson RF, Hutchinson MF, McKenney DW, Milewska EJ, Papadopol P (2012) Optimizing Input Data for Gridding Climate normals for Canada. J Appl Meteorol Climatology 51(8):1508–1518. https://doi.org/10.1175/JAMC-D-12-018.1
Article Google Scholar
Hutchinson MF (1991) The application of thin plate smoothing splines to continent-wide data assimilation. Bureau Meteorol Res Rep 27:104–113
Google Scholar
Hutchinson MF (1995) Interpolating mean rainfall using thin plate smoothing splines. Int J Geographical Inform Syst 9(4):385–403. https://doi.org/10.1080/02693799508902045
Article Google Scholar
Hutchinson MF, Xu T (2013): ANUSPLIN Version 4.4. Fenner School of Environment and Society, Australian National University. https://fennerschool.anu.edu.au/research/products/anusplin
Hutchinson MF, Stein JL, Stein JA, Anderson H, Tickle PK (2008): GEODATA 9 second DEM and D8: Digital Elevation Model Version 3 and Flow Direction Grid 2008. 3 ed., Australia G. http://pid.geoscience.gov.au/dataset/ga/66006
Hutchinson MF, McKenney DW, Lawrence K, Pedlar JH, Hopkinson RF, Milewska E, Papadopol P (2009) Development and testing of Canada-wide interpolated spatial models of Daily Minimum–Maximum temperature and precipitation for 1961–2003. J Appl Meteorol Climatology 48(4):725–741. https://doi.org/10.1175/2008JAMC1979.1
Article Google Scholar
Hutchinson MF, Xu T, Kesteven JL, Marang IJ, Evans BJ (2021) ANUClimate v2.0. NCI Australia
Jabot E, Zin I, Lebel T, Gautheron A, Obled C (2012) Spatial interpolation of sub-daily air temperatures for snow and hydrologic applications in mesoscale Alpine catchments. Hydrol Process 26(17):2618–2630. https://doi.org/10.1002/hyp.9423
Article Google Scholar
Jeffrey SJ, Carter JO, Moodie KB, Beswick AR (2001) Using spatial interpolation to construct a comprehensive archive of Australian climate data. Environ Model Softw 16(4):309–330. https://doi.org/10.1016/S1364-8152(01)00008-1
Article Google Scholar
Johnson F, Hutchinson MF, The C, Beesley C, Green J (2016) Topographic relationships for design rainfalls over Australia. J Hydrol 533:439–451. https://doi.org/10.1016/j.jhydrol.2015.12.035
Article Google Scholar
Jones DA, Trewin B (2000) The spatial structure of monthly temperature anomalies over Australia. Aust Meteorol Mag 49:261–276
Google Scholar
Jones DA, Wang W, Fawcett R (2009) High-quality spatial climate data-sets for Australia. Aust Meteorol Oceanogr J 58:233–248. https://doi.org/10.22499/2.5804.003
Article Google Scholar
Karger DN, Coauthors (2017) Climatologies at high resolution for the earth’s land surface areas. Sci Data 4(1):1–20. https://doi.org/10.1038/sdata.2017.122
Karger DN, Zimmermann NE,CHELSAcruts - High resolution temperature and precipitation timeseries for the 20th century and beyond., EnviDat (2018) https://www.envidat.ch/dataset/chelsacruts
Kearney M, Porter W (2009) Mechanistic niche modelling: combining physiological and spatial data to predict species’ ranges. Ecol Lett 12(4):334–350. https://doi.org/10.1111/j.1461-0248.2008.01277.x
Article Google Scholar
Kettle H, Thompson R (2004) Statistical downscaling in European mountains: verification of reconstructed air temperature. Climate Res 26(2):97–112
Article Google Scholar
Kobayashi S, Coauthors (2015) The JRA-55 reanalysis: general specifications and basic characteristics. J Meteorol Soc Jpn 93(1):5–48. https://doi.org/10.2151/jmsj.2015-001
Krähenmann S, Walter A, Brienen S, Imbery F, Matzarakis A (2018) High-resolution grids of hourly meteorological variables for Germany. Theoret Appl Climatol 131(3):899–926. https://doi.org/10.1007/s00704-016-2003-7
Article Google Scholar
Li J, Heap AD (2014) Spatial interpolation methods applied in the environmental sciences: a review. Environ Model Softw 53:173–189. https://doi.org/10.1016/j.envsoft.2013.12.008
Article Google Scholar
Li X, Li Z, Huang W, Zhou P (2020) Performance of statistical and machine learning ensembles for daily temperature downscaling. Theoret Appl Climatol 140(1):571–588. https://doi.org/10.1007/s00704-020-03098-3
Article Google Scholar
Lussana C, Tveito OE, Uboldi F (2018) Three-dimensional spatial interpolation of 2 m temperature over Norway. Q J R Meteorol Soc 144(711):344–364. https://doi.org/10.1002/qj.3208
Article Google Scholar
Lussana C, Seierstad IA, Nipen TN, Cantarello L (2019) Spatial interpolation of two-metre temperature over Norway based on the combination of numerical weather prediction ensembles and in situ observations. Q J R Meteorol Soc 145(725):3626–3643. https://doi.org/10.1002/qj.3646
Article Google Scholar
Mark N, David L, Mike H, Ian M (2002) A high-resolution data set of surface climate over global land areas. Climate Res 21(1):1–25
Google Scholar
Matheron G (1962): Traité De géostatistique appliquée. Technip, 333 pp
McVicar TR, Jupp DLB (1999) Estimating one-time-of-day meteorological data from standard daily data as inputs to thermal remote sensing based energy balance models. Agric for Meteorol 96(4):219–238. https://doi.org/10.1016/S0168-1923(99)00052-0
Article Google Scholar
McVicar TR, Jupp DLB (2002) Using covariates to spatially interpolate moisture availability in the Murray–Darling Basin: a novel use of remotely sensed data. Remote Sens Environ 79(2):199–212. https://doi.org/10.1016/S0034-4257(01)00273-5
Article Google Scholar
McVicar TR, Körner C (2013) On the use of elevation, altitude, and height in the ecological and climatological literature. Oecologia 171(2):335–337. https://doi.org/10.1007/s00442-012-2416-7
Article Google Scholar
McVicar TR, Van Niel TG, Li L, Hutchinson MF, Mu X, Liu Z (2007) Spatially distributing monthly reference evapotranspiration and pan evaporation considering topographic influences. J Hydrol 338(3):196–220. https://doi.org/10.1016/j.jhydrol.2007.02.018
Article Google Scholar
McVicar TR, Van Niel TG, Li LT, Roderick ML, Rayner DP, Ricciardulli L, Donohue RJ (2008) Wind speed climatology and trends for Australia, 1975–2006: capturing the stilling phenomenon and comparison with near-surface reanalysis output. Geophys Res Lett 35(20):L20403. https://doi.org/10.1029/2008gl035627
Article Google Scholar
Miller STK, Keim BD, Talbot RW, Mao H (2003) Sea breeze: structure, forecasting, and impacts. Rev Geophys 41(3). https://doi.org/10.1029/2003RG000124
Muñoz-Sabater J, Coauthors (2021) ERA5-Land: a state-of-the-art global reanalysis dataset for land applications. Earth Syst Sci Data 13(9):4349–4383. https://doi.org/10.5194/essd-13-4349-2021
Pan X, Li X, Shi X, Han X, Luo L, Wang L (2012) Dynamic downscaling of near-surface air temperature at the basin scale using WRF-a case study in the Heihe River Basin, China. Front Earth Sci 6(3):314–323. https://doi.org/10.1007/s11707-012-0306-2
Article Google Scholar
Parton WJ, Logan JA (1981) A model for diurnal variation in soil and air temperature. Agric Meteorol 23:205–216. https://doi.org/10.1016/0002-1571(81)90105-9
Article Google Scholar
Politi N, Vlachogiannis D, Sfetsos A, Nastos PT (2021) High-resolution dynamical downscaling of ERA-Interim temperature and precipitation using WRF model for Greece. Clim Dyn 57(3):799–825. https://doi.org/10.1007/s00382-021-05741-9
Article Google Scholar
Price DT, McKenney DW, Nalder IA, Hutchinson MF, Kesteven JL (2000) A comparison of two statistical methods for spatial interpolation of Canadian monthly mean climate data. Agric for Meteorol 101(2):81–94. https://doi.org/10.1016/S0168-1923(99)00169-0
Article Google Scholar
Qin Y, Steven ADL, Schroeder T, McVicar TR, Huang J, Cope M, Zhou S (2019) Cloud cover in the Australian region: development and validation of a cloud masking, classification and optical depth Retrieval Algorithm for the Advanced Himawari Imager. 7(20). https://doi.org/10.3389/fenvs.2019.00020
Qin Y, Huang J, McVicar TR, West S, Khan M, Steven ADL (2021) Estimating surface solar irradiance from geostationary Himawari-8 over Australia: a physics-based method with calibration. Sol Energy 220:119–129. https://doi.org/10.1016/j.solener.2021.03.029
Article Google Scholar
Raupach MR, Briggs PR, Haverd V, King EA, Paget M, Trudinger CM (2012): Australian Water Availability Project (AWAP): CSIRO Marine and Atmospheric Research Component: Final Report for Phase 3. CAWCR Technical Report No. 013
Reicosky DC, Winkelman LJ, Baker JM, Baker DG (1989) Accuracy of hourly air temperatures calculated from daily minima and maxima. Agric for Meteorol 46(3):193–209. https://doi.org/10.1016/0168-1923(89)90064-6
Article Google Scholar
Safeeq M, Fares A (2011) Accuracy evaluation of ClimGen weather generator and daily to hourly disaggregation methods in tropical conditions. Theoret Appl Climatol 106(3):321–341. https://doi.org/10.1007/s00704-011-0438-4
Article Google Scholar
Saha S, Coauthors (2014) The NCEP Climate Forecast System Version 2. J Clim 27(6):2185–2208. https://doi.org/10.1175/JCLI-D-12-00823.1
Article Google Scholar
Sekulić A, Kilibarda M, Heuvelink GBM, Nikolić M, Bajat B (2020) Random Forest Spatial Interpolation. Remote Sens-Basel 12(10):1–29. https://doi.org/10.3390/rs12101687
Article Google Scholar
Simpson JE (1994) Sea breeze and local winds. Cambridge University Press
Simpson JE, Mansfield DA, Milford JR (1977) Inland penetration of sea-breeze fronts. Q J R Meteorol Soc 103(435):47–76. https://doi.org/10.1002/qj.49710343504
Article Google Scholar
Stewart SB, Nitschke CR (2017) Improving temperature interpolation using MODIS LST and local topography: a comparison of methods in South East Australia. Int J Climatol 37(7):3098–3110. https://doi.org/10.1002/joc.4902
Article Google Scholar
Stewart SB, Choden K, Fedrigo M, Roxburgh SH, Keenan RJ, Nitschke CR (2017) The role of topography and the north Indian monsoon on mean monthly climate interpolation within the Himalayan Kingdom of Bhutan. Int J Climatol 37(S1):897–909. https://doi.org/10.1002/joc.5045
Article Google Scholar
Strachey R (1886) II. On the computation of the harmonic components. Proc Royal Soc Lond 40(242–245):367–368. https://doi.org/10.1098/rspl.1886.0052
Article Google Scholar
Su CH, Coauthors (2019) BARRA v1.0: the Bureau of Meteorology Atmospheric high-resolution Regional Reanalysis for Australia. Geosci Model Dev 12(5):2049–2068. https://doi.org/10.5194/gmd-12-2049-2019
Su CH, Eizenberg N, Jakob D, Fox-Hughes P, Steinle P, White CJ, Franklin C (2021) BARRA v1.0: kilometre-scale downscaling of an Australian regional atmospheric reanalysis over four midlatitude domains. Geosci Model Dev 14(7):4357–4378. https://doi.org/10.5194/gmd-14-4357-2021
Article Google Scholar
Thornton PE, Shrestha R, Thornton M, Kao S-C, Wei Y, Wilson BE (2021) Gridded daily weather data for North America with comprehensive uncertainty quantification. Sci Data 8(1):1–17. https://doi.org/10.1038/s41597-021-00973-0
Article Google Scholar
Trewin B (2005) A notable frost hollow at Coonabarabran, New South Wales. Aust Meteorol Mag 54:15–21
Google Scholar
Trewin B (2012) Techniques involved in developing the Australian climate observations Reference Network – Surface Air Temperature (ACORN-SAT) dataset. Bureau of Meteorology and CSIRO, Ed
Google Scholar
Uboldi F, Lussana C, Salvati M (2008) Three-dimensional spatial interpolation of surface meteorological observations from high-resolution local networks. Meteorol Appl 15(3):331–345. https://doi.org/10.1002/met.76
Article Google Scholar
Vaze J, and Coauthors (2013) The Australian Water Resource Assessment Modelling System (AWRA). 20th International Congress on ModellingSimulation
Walter A (1967) Notes on the utilization of records from third order climatological stations for agricultural purposes. Agric Meteorol 4(2):137–143. https://doi.org/10.1016/0002-1571(67)90017-9
Article Google Scholar
Wang T, Hamann A, Spittlehouse D, Carroll C (2016) Locally downscaled and spatially customizable Climate Data for historical and future periods for North America. PLoS ONE 11(6):e0156720. https://doi.org/10.1371/journal.pone.0156720
Article CAS Google Scholar
Webb M, Minasny B (2020) A digital mapping application for quantifying and displaying air temperatures at high spatiotemporal resolutions in near real-time across Australia. PeerJ 8:1–23. https://doi.org/10.7717/peerj.10106
Article Google Scholar
Webb MA, Kidd D, Minasny B (2020) Near real-time mapping of air temperature at high spatiotemporal resolutions in Tasmania, Australia. Theoret Appl Climatol 141(3):1181–1201. https://doi.org/10.1007/s00704-020-03259-4
Article Google Scholar
Whiteman CD, Bian X, Zhong S (1999) Wintertime Evolution of the Temperature Inversion in the Colorado Plateau Basin. J Appl Meteorol 38(8):1103–1117.
Article Google Scholar
Willmott CJ (1982) Some comments on the evaluation of Model Performance. Bull Am Meteorol Soc 63(11):1309–1313.
Article Google Scholar
Willmott CJ, Robeson SM (1995) Climatologically aided interpolation (CAI) of terrestrial air temperature. Int J Climatol 15(2):221–229. https://doi.org/10.1002/joc.3370150207
Article Google Scholar
Yu Y, Renzullo LJ, McVicar TR, Van Niel TG, Cai D, Tian S, Ma Y (2024) Solar zenith angle-based calibration of Himawari-8 land surface temperature for correcting diurnal retrieval error characteristics. Remote Sens Environ 308:114176. https://doi.org/10.1016/j.rse.2024.114176
Article Google Scholar

Download references

Acknowledgements

We thank Dr Andrew Frost and Dr Blair Trewin from the Australian Bureau of Meteorology for providing the air temperature observations that made this study possible, and for providing comprehensive answers to our questions. We acknowledge the continued support of the TERN Landscapes Observatory (https://www.tern.org.au/tern-observatory/tern-landscapes/), which also supports the CosmOz program, a sensing platform of the Terrestrial Ecosystem Research Network (TERN; https://www.tern.org.au/), and the OzFlux network. TERN is supported and enabled by the Australian Government through the National Collaborative Research Infrastructure Strategy (NCRIS). Thanks to Dr David McJannet (CSIRO Environment, Brisbane) for providing a total of 763,347 air temperature observations that were vital for our independent validation. Thanks to the Climate Dynamics editorial team and the two anonymous reviewers for constructively critical comments that prompted further analyses and numerous improvements.

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Open access funding provided by CSIRO Library Services.

Author information

Authors and Affiliations

CSIRO Environment, Private Bag No. 5, GPO Box 1700, Hobart, TAS, 7005, Australia
Stephen B. Stewart
CSIRO Environment, GPO Box 1700, Canberra, ACT, 2601, Australia
Tim R. McVicar & Dejun Cai
Australian Research Council Centre of Excellence for Climate Extremes, Canberra, ACT, 2609, Australia
Tim R. McVicar
CSIRO Environment, Private Bag No. 5, Waterford, WA, 6913, Australia
Thomas G. Van Niel

Authors

Stephen B. Stewart
Tim R. McVicar
Thomas G. Van Niel
Dejun Cai

Contributions

Stephen B Stewart, Tim R McVicar and Thomas G Van Niel contributed to the study conception and design. Material preparation, data collection and analysis were performed by Stephen B Stewart and Dejun Cai. The first draft of the manuscript was written by Stephen B Stewart. All authors contributed critically to the draft, then read and approved the final manuscript.

Corresponding author

Correspondence to Stephen B. Stewart.

Ethics declarations

Competing interests

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Stewart, S.B., McVicar, T.R., Van Niel, T.G. et al. Continental scale spatial temporal interpolation of near-surface air temperature: do 1 km hourly grids for Australia outperform regional and global reanalysis outputs? Clim Dyn (2024). https://doi.org/10.1007/s00382-024-07340-w

Received: 18 September 2023
Accepted: 08 July 2024
Published: 13 August 2024
DOI: https://doi.org/10.1007/s00382-024-07340-w
