1 Introduction

Murphy et al. (2018) have recently created a continuous 305-year (1711–2016) monthly rainfall series for the island of Ireland, known as IoI_1711. They have also provided detailed descriptive statistical analysis of the series but have not attempted any stochastic time series modelling of the type undertaken by Mills (2005, 2015, 2017) for the U.K. and its regions. The purpose of this paper is to undertake, such an analysis on IoI_1711 to enable a wider perspective on the evolution of the series to be obtained.

To this end, Sect. 2 outlines the model used for the analysis of the monthly IoI_1711 series and discusses how deterministic and stochastic trends and seasonal patterns may be identified, along with model estimation and testing. Section 3 provides a complementary analysis of the seasonal and annual series obtained from the monthly data. Given the long span of the data, Sect. 4 investigates the possibility of breaks and shifts in the model and in its predictability. Section 5 completes the paper by providing a summary, conclusions, and a comparison with the findings of Murphy et al. (2018).

2 A model for monthly rainfall

2.1 The basic model

Following Mills (2005, 2015, 2017), a basic model for a monthly rainfall series observed from time t = 1 to time t = T has been found to be

$$ {x}_{\mathrm{t}}^{\left(\uplambda \right)}={\sum}_{\mathrm{i}=1}^{12}\left({\alpha}_{\mathrm{i}}{s}_{\mathrm{i},t}+{\beta}_{\mathrm{i}}{s}_{\mathrm{i},t}t\right)+{u}_{\mathrm{t}}\kern0.5em t=1,2,\cdots, T $$
(1)

In Eq. (1), rainfall xt is transformed by the Box and Cox (1964) power transformation defined as

$$ {x}_{\mathrm{t}}^{\left(\uplambda \right)}=\frac{x_{\mathrm{t}}^{\uplambda}-1}{\lambda}\kern0.5em {x}_{\mathrm{t}}^{(0)}=\log\ {x}_{\mathrm{t}} $$

This has been applied to ameliorate the skewness found in the raw data, a consequence of xt being bounded below at zero and possibly having a long right (positive) tail. Through its scaling property, the transformation helps to induce normality, linearity, and constancy of variance into the model. It is used here in its simplest form, as it may be generalised in a variety of directions to deal, for example, with negative values and heteroskedasticity, neither of which are needed here. The si, tt = 1, 2, ⋯, T, are “dummy” variables defined to take the value 1 in month i and 0 elsewhere (where i = 1 signifies January, etc.). Their inclusion allows a deterministic monthly pattern to be modelled. The presence of the si, tt “interaction” variables allows for the possibility of different monthly linear time trends. The αi and βi parameters measure the intercept and slope of these trends, so that if βi ≠ 0 then the seasonal pattern for month i evolves linearly over time.

The error ut can, in general, follow a seasonal autoregressive-moving average (ARMA) process: see for example, Mills (2019), chapter 8) for technical details and Mills (2014) for a discussion of such models in a meteorological context:

$$ \phi (B)\Phi \left({B}^{12}\right){u}_{\mathrm{t}}=\theta (B)\Theta \left({B}^{12}\right){a}_{\mathrm{t}} $$
(2)

where

$$ \phi (B)=1-{\phi}_1B-\cdots -{\phi}_{\mathrm{p}}{B}^{\mathrm{p}} $$
$$ \theta (B)=1+{\theta}_1B+\cdots +{\theta}_{\mathrm{q}}{B}^{\mathrm{q}} $$

are “non-seasonal” polynomials of orders p and q in the lag operator B, defined such that Bjat ≡ at − j, at being zero mean white noise (E(at) = 0, E(atat − j) = 0 for all j ≠ 0) with variance \( E\left({a}_{\mathrm{t}}^2\right)={\sigma}_{\mathrm{a}}^2 \). The “seasonal” polynomials

$$ \Phi \left({B}^{12}\right)=1-{\Phi}_1{B}^{12}-\cdots -{\Phi}_{\mathrm{P}}{B}^{12\times \mathrm{P}} $$
$$ \Theta \left({B}^{12}\right)=1+{\Theta}_1{B}^{12}+\cdots +{\Theta}_{\mathrm{Q}}{B}^{12\times \mathrm{Q}} $$

are of orders P and Q, their presence allowing the error to be autocorrelated at seasonal lags, such as 12, 24, ⋯, as well as being autocorrelated at non-seasonal lags.

2.2 Deterministic and stochastic trends and seasonality

More general models result if unit roots are allowed in the ϕ(B) and Φ(B12) polynomials. If the non-seasonal autoregressive polynomial contains a unit root, i.e., the characteristic equation associated with ϕ(B) contains a root of unity, then ϕ(B) can be factorised as

$$ \phi (B)=\left(1-B\right)\left(1-{\phi}_1^{\ast }B--{\phi}_{\mathrm{p}-1}^{\ast }{B}^{\mathrm{p}-1}\right)=\left(1-B\right){\phi}^{\ast }(B) $$

where ϕ(B) is a polynomial of order p − 1. Equation (1) then becomes, with ∇ = 1 − B signifying the first-difference operator and \( {u}_{\mathrm{t}}^{\ast }=\nabla {u}_{\mathrm{t}} \),

$$ \nabla {x}_{\mathrm{t}}^{\left(\uplambda \right)}={\sum}_{\mathrm{i}=1}^{12}\left({\alpha}_{\mathrm{i}}\nabla {s}_{\mathrm{i},t}+{\beta}_{\mathrm{i}}\nabla {s}_{\mathrm{i},t}t\right)+{u}_{\mathrm{t}}^{\ast}\kern0.5em {\phi}^{\ast }(B)\Phi \left({B}^{12}\right){u}_{\mathrm{t}}^{\ast }=\theta (B)\Phi \left({B}^{12}\right){a}_{\mathrm{t}} $$
(3)

Noting that ∇si, t = si, t − si + 1, t and ∇si, tt = (12 + i)(si, t − si + 1, t), where it is taken that ∇s12, t = s12, t − s1, t + 1, Eq. (3) in turn becomes

$$ \nabla {x}_{\mathrm{t}}^{\left(\uplambda \right)}=\sum \limits_{\mathrm{i}=1}^{12}\left({\alpha}_{\mathrm{i}}+\left(12+i\right){\beta}_{\mathrm{i}}\right)\left({s}_{\mathrm{i},\mathrm{t}}-{s}_{\mathrm{i}+1,\mathrm{t}}\right)+{u}_{\mathrm{t}}^{\ast } $$
(4)

Since \( \nabla {x}_{\mathrm{t}}^{\left(\uplambda \right)}=\alpha +{u}_{\mathrm{t}}^{\ast } \) would depict a random walk with drift α, Eq. (4) may be interpreted as implying that \( {x}_{\mathrm{t}}^{\left(\uplambda \right)} \) contains a stochastic, random walk, trend with differing seasonal drifts, i.e. each month evolves as a random walk with its own drift.

Alternatively, suppose that the seasonal autoregressive polynomial contains a (seasonal) unit root:

$$ \Phi \left({B}^{12}\right)=\left(1-{B}^{12}\right)\left(1-{\Phi}_1^{\ast }{B}^{12}--{\Phi}_{\mathrm{P}-1}^{\ast }{B}^{12\left(\mathrm{P}-1\right)}\right)=\left(1-{B}^{12}\right){\Phi}^{\ast}\left({B}^{12}\right). $$

Equation (1) then becomes, with ∇12 = (1 − B12) and \( {u}_{\mathrm{t}}^{\dagger }={\nabla}_{12}{u}_{\mathrm{t}} \)

$$ {\nabla}_{12}{x}_{\mathrm{t}}^{\left(\uplambda \right)}={\sum}_{\mathrm{i}=1}^{12}\left({\alpha}_{\mathrm{i}}{\nabla}_{12}{s}_{\mathrm{i},t}+{\beta}_{\mathrm{i}}{\nabla}_{12}{s}_{\mathrm{i},t}t\right)+{u}_{\mathrm{t}}^{\dagger}\kern0.5em \phi (B){\Phi}^{\ast}\left({B}^{12}\right){u}_{\mathrm{t}}^{\dagger }=\theta (B)\Phi \left({B}^{12}\right){a}_{\mathrm{t}} $$
(5)

Since ∇12si, t = 0 and ∇12si, tt = 12si, t, Eq. (5) becomes

$$ {\nabla}_{12}{x}_{\mathrm{t}}^{\left(\uplambda \right)}=12{\sum}_{\mathrm{i}=1}^{12}{\beta}_{\mathrm{i},\mathrm{t}}+{u}_{\mathrm{t}}^{\dagger } $$
(6)

and \( {x}_{\mathrm{t}}^{\left(\uplambda \right)} \) now contains a stochastic seasonal random walk with differing seasonal drifts.

If Φ(B12) = Θ(B12), then there is only deterministic seasonality (see Pierce 1978, and Mills and Mills 1992, for similar set-ups and additional analysis).

2.3 Estimating and testing the model

The data provided by Murphy et al. (2018) are T = 3672 monthly observations on rainfall for the island of Ireland for the 305 years from 1711 to 2016, known as the IoI_1711 series. Figure 1 displays the histogram and empirical kernel density of the series, superimposed on which is a normal distribution with the same mean and standard deviation as IoI_1711. Although the distribution is not excessively kurtotic (the kurtosis measure is only 3.12), it is highly skewed to the right (the skewness measure is 0.52), as might be expected. Figure 2 shows the plot of the log-likelihood function for the Box-Cox transformation parameter λ in Eq. (1). The maximum likelihood (ML) estimate is \( \widehat{\lambda}=0.58 \) with a 95% confidence interval running from 0.53 to 0.63. (ML estimation of and the construction of a confidence interval for the Box-Cox transformation parameter in models, such as (1) is conveniently discussed in Mills 2019, chapter 2). For convenience, λ was thus set at the value of 0.6, and the histogram and empirical kernel density of the transformed series are shown in Fig. 3. Skewness has been eradicated (it is now just 0.03), and the distribution is close to the superimposed normal distribution.

Fig. 1
figure 1

Histogram and empirical kernel density of IoI_1711 with a normal distribution with same mean and standard deviation superimposed

Fig. 2
figure 2

Log-likelihood function for the Box-Cox transformation parameter λ and 95% confidence interval

Fig. 3
figure 3

Histogram and empirical kernel density of Box-Cox transformed IoI_1711 with a normal distribution with same mean and standard deviation superimposed

To determine the most appropriate form of the combined model given by Eqs. (1) and (2), initial analysis using the information from the sample autocorrelation and partial autocorrelation functions, along with residual diagnostic checks from fitted models, established that the polynomial orders could be set at p = 3, q = 0, and P = Q = 1 (at most), leading to the model

$$ {x}_{\mathrm{t}}^{(0.6)}=\sum \limits_{\mathrm{i}=1}^{12}\left({\alpha}_{\mathrm{i}}{s}_{\mathrm{i},\mathrm{t}}+{\beta}_{\mathrm{i}}{s}_{\mathrm{i},\mathrm{t}}t\right)+{u}_{\mathrm{t}} $$
(7)
$$ \left(1-{\phi}_1B-{\phi}_2{B}^2-{\phi}_3{B}^3\right)\left(1-\Phi {B}^{12}\right){u}_{\mathrm{t}}=\left(1+\Theta {B}^{12}\right){a}_{\mathrm{t}} $$

The estimates of this model are reported in the first column of Table 1. For there to be a random walk trend in rainfall, ϕ1 + ϕ2 + ϕ3 would have to be unity (the unit root condition). The estimates show clearly the absence of such a (stochastic) trend since \( {\widehat{\phi}}_1+{\widehat{\phi}}_2+{\widehat{\phi}}_3=-0.195 \), with a standard error of 0.0285. There is, though, evidence of non-seasonal autocorrelation as both \( {\widehat{\phi}}_1 \) and \( {\widehat{\phi}}_3 \) are significantly different from zero. The estimates of both Φ and Θ are insignificantly different from zero, so that there is, in fact, no evidence of stochastic seasonality, so that the seasonal pattern does not evolve over time. However, the hypothesis of no deterministic monthly trends (β1 = β2 = ⋯ = β12 = 0) may be conclusively rejected, although only four of the months have trends that are individually significant.

Table 1 Estimates of Eq. (7)

The residuals from this model exhibit no autocorrelation. The question of whether the deterministic seasonal model might contain a non-linear component was addressed by including additional quadratic and cubic trends, taking the form si, tt2 and si, tt3, but these were found to be insignificant (an F test for their inclusion has a marginal significance level of just 0.65 when just quadratic trends are included and 0.60 when both quadratic and cubic trends are included).

The second column of Table 1 provides estimates of a restricted version of Eq. (7) in which all individually insignificant coefficients have been set to 0. These restrictions are clearly acceptable as the fit of the equation remains the same. The monthly trends from this restricted model, calculated by “inverting” the Box-Cox transformation, are shown in Fig. 4: i.e., if the predicted transformed rainfall for month i in year y, where y = 1 corresponds to 1711, etc., is given by

Fig. 4
figure 4

Monthly trends estimated from the restricted version of Eq. (7)

$$ {\widehat{x}}_{\mathrm{i},\mathrm{y}}^{(0.6)}={\widehat{\alpha}}_{\mathrm{i}}+{\widehat{\beta}}_{\mathrm{i}}\left(y-1+i\right) $$

then the predicted rainfall itself is given by the inverted value

$$ {\widehat{x}}_{\mathrm{i},\mathrm{y}}={\left({\widehat{x}}_{\mathrm{i},\mathrm{y}}^{(0.6)}\right)}^{-1}={\left(1+0.6{\widehat{x}}_{\mathrm{i},\mathrm{y}}^{(0.6)}\right)}^{5/3} $$

January, March, and December exhibit positive trends, so that rainfall in these months has increased over the three centuries, while the trend for July is negative, indicating that this month has become progressively drier. The slopes of these (non-linear) trends are rather small; however, January rainfall is predicted to have increased from 81 to 118 mm between 1711 and 2016, March rainfall from 64 to 81 mm, and December rainfall from 91 to 121 mm. July rainfall is predicted to have declined from 99 to 74 mm over the three centuries. The remaining 8 months show constant seasonal factors.

3 Modelling the seasonal and annual rainfall data

Following Murphy et al. (2018), the IoI_1711 monthly series may be aggregated to the four seasons: winter (the sum of December of year y − 1, January of year y, and February of year y, i.e. winy = x12, y − 1 + x1, y + x2, y), spring (spry = x3, y + x4, y + x5, y), summer (sumy = x6, y + x7, y + x8, y) and autumn (auty = x9, y + x10, y + x11, y). An annual series may then be defined as anny = x1, y + x2, y + ⋯ + x12, y. These series are displayed in Fig. 5, and obviously, such annual series display no seasonality. Interestingly, these series do not require transformation, since at this level of aggregation, no significant departures from normality are found in any of them, presumably because aggregation “averages out” many of the more extreme rainfall fluctuations observed at monthly frequencies.Footnote 1 Fitted trend lines are also shown in Fig. 5, obtained from the following models

Fig. 5
figure 5

Seasonal and annual rainfall, 1712–2016, with trends superimposed

$$ wi{n}_{\mathrm{y}}={\displaystyle \begin{array}{c}254.88\\ {}(8.33)\end{array}}+{\displaystyle \begin{array}{c}0.2514\\ {}(0.0422)\end{array}}y+{a}_{1,y}\kern0.5em {\widehat{\sigma}}_1=72.38 $$
$$ sp{r}_{\mathrm{y}}={\displaystyle \begin{array}{c}215.30\\ {}(3.04)\end{array}}+{a}_{2,y}\kern0.5em {\widehat{a}}_2=53.24 $$
$$ su{m}_{\mathrm{y}}={\displaystyle \begin{array}{c}281.55\\ {}(8.16)\end{array}}-{\displaystyle \begin{array}{c}0.1322\\ {}(0.0476)\end{array}}y+{a}_{3,y}+{\displaystyle \begin{array}{c}0.133\\ {}(0.054)\end{array}}{a}_{3,\mathrm{y}-1}\kern0.5em {\widehat{\sigma}}_3=65.77 $$
$$ au{t}_{\mathrm{y}}={\displaystyle \begin{array}{c}310.96\\ {}(3.86)\end{array}}+{a}_{4,\mathrm{y}}\kern0.5em {\widehat{\sigma}}_4=67.57 $$
$$ an{n}_{\mathrm{y}}={\displaystyle \begin{array}{c}1052.61\\ {}(13.69)\end{array}}+{\displaystyle \begin{array}{c}0.1840\\ {}(0.0777)\end{array}}y+{a}_{5,\mathrm{y}}\kern0.5em {\widehat{a}}_5=120.06 $$

These models are consistent with the findings from the monthly IoI_1711 series. Only the summer series exhibits any autocorrelation, and this is of just 1-year duration. Winter exhibits a positive trend in rainfall, which is approximately twice the size of the negative trend for summer. Both spring and autumn have no trends in rainfall, the positive March trend being dissipated in significance by the lack of trends in April and May rainfall. Consequently, annual rainfall has a positive trend, being approximately the average of the (absolute) winter and summer trends.

4 Breaks and changing predictability

Over such a long sample period, somewhat in excess of three centuries, it is quite conceivable that the model may have undergone one or more shifts over time. To investigate this possibility, a model closely related to Eq. (7),

$$ {x}_{\mathrm{t}}^{(0.6)}=\sum \limits_{\mathrm{i}=1}^{12}\left({\alpha}_{\mathrm{i}}{s}_{\mathrm{i},\mathrm{t}}+{\beta}_{\mathrm{i}}{s}_{\mathrm{i},\mathrm{t}}t\right)+{\delta}_1{x}_{\mathrm{t}-1}^{(0.6)}+{\delta}_3{x}_{\mathrm{t}-3}^{(0.6)}+{a}_{\mathrm{t}} $$
(8)

was investigated for structural shifts using the Bai (1997) and Bai and Perron (1998, 2003a, b) “sequentially determined” break test.Footnote 2 The statistics from this test, shown in Table 2, identify five breaks, at July 1739, December 1765, December 1786, February 1843, and August 1976. Interestingly, three of these breaks occur during the eighteenth century, a period for which Murphy et al. (2018) have “low confidence” in the reliability of the data. The seasonal trends estimated from this five-break model are shown in Fig. 6. The trends are quite volatile across the three breaks during the eighteenth century, but from the mid-1800s, the seasonal trends are rather stable within subsamples, with only one significantly negative linear trend (for September) during the fifth subsample from 1843 to 1976 and one significantly positive linear trend (for July) during the last subsample from 1976.

Table 2 Break test statistics
Fig. 6
figure 6

Seasonal trends across the identified five breaks

Break tests were also performed on the seasonal and annual series. The only break that could be identified was for the winter series with a break at 1790:

$$ wi{n}_{\mathrm{y}}={\displaystyle \begin{array}{c}304.81\\ {}(15.83)\end{array}}-{\displaystyle \begin{array}{c}1.1043\\ {}(0.0348)\end{array}}y+{a}_{1,y}\kern0.5em y\le 79\ (1789) $$
$$ wi{n}_{\mathrm{y}}={\displaystyle \begin{array}{c}254.05\\ {}(14.74)\end{array}}+{\displaystyle \begin{array}{c}0.2622\\ {}(0.0727)\end{array}}y+{a}_{1,\mathrm{y}}\kern0.5em y\ge 80\ (1790) $$

There is thus a trend towards drier winters in the years up to 1790, with the trend then being reversed towards wetter winters after this break point.

The potential for changes in predictability was also investigated.Footnote 3 To assess whether the pattern of rainfall has altered in predictability over time relative to the fitted models, moving residual standard deviations were computed for both the model fitted assuming no breaks and for the model with five breaks. The n period moving residual standard deviation at time t, \( {\widehat{\sigma}}_{\mathrm{a},\mathrm{t},\mathrm{n}} \) is defined from

$$ {\widehat{\sigma}}_{\mathrm{a},t,n}^2={\left(n-1\right)}^{-1}{\sum}_{\mathrm{j}=0}^{\mathrm{n}-1}{\widehat{a}}_{\mathrm{t}-\mathrm{i}}^2\kern0.5em t=n,n+1,\cdots, T $$

so that the conventional residual standard deviation \( {\widehat{\sigma}}_{\mathrm{a}} \) reported in Table 1 implicitly sets the moving window size n equal to the sample size T.

Figure 7 plots these moving residual standard deviations for n = 120, i.e. for a 10-year (decadal) moving window. Variation is much more pronounced during the eighteenth century, the first half of which exhibits less unpredictability than the second half, during which unpredictability was at its greatest. The period since 1976 has also exhibited a tendency towards greater unpredictability.

Fig. 7
figure 7

Moving residual standard deviations using a decadal window

5 Summary and concluding comments

Using a stochastic model that has already been successfully employed to model monthly rainfall series for the U.K. and its regions, this paper has demonstrated that this model can also be successfully fitted to the IoI_1711 series for the island of Ireland. The excess skewness in the monthly IoI_1711 data may be eradicated by using a Box-Cox transformation with parameter equal to 0.6, a value very similar to that found for the U.K. and its regions. There is no evidence of either an overall stochastic trend or of evolving monthly seasonal patterns, but positive linear trends are found for January, March, and December and a negative linear trend found for July. Analysis of the seasonal and annual series (which require no transformation) confirms the implication from the monthly data that winters have become progressively wetter and summers progressively drier, with the positive linear trend for winter being approximately twice the size of the negative summer trend. Since there is no trend in either spring or autumn rainfall, annual rainfall shows an overall positive linear trend.

Given that the IoI_1711 series exists for over three centuries, breaks in trend and structural shifts in the model were investigated. Five breaks were identified, three of which occurred in the early portion of the series during the eighteenth century. However, trends were found to be much more stable from the middle of the nineteenth century. For the seasonal series, only a single break, at 1790 for winter, was found, so that it was only after this break that winters became wetter; as before then winter rainfall had a negative trend. In terms of predictability, predictions from the model were found to be more volatile during the second half of the eighteenth century and again from 1976 onwards.

The formal modelling results presented in this paper may also be compared with the essentially descriptive findings of Murphy et al. (2018). The overall finding of increasingly wetter winters and dryer summers complements their conclusions, as does the finding that most of the eighteenth century was characterised by dryer winters, which Murphy et al. suggest may be a consequence of the under-catch of snowfall. They also point out that before 1790, confidence in the data is low and this may have led to the finding of multiple breaks and more volatile predictions in the models during the eighteenth century. The break analysis also complements their conclusions that trends were less significant from 1850 onwards and that trends computed from recent data are not necessarily representative of long-term trends. Of course, a perennial problem with all trend fitting techniques is their projection into the future. Given the results of the break analysis one should be wary of projecting current trends too far!

Comparisons with the findings for the U.K. regions given in Mills (2017) are necessarily limited by the much shorter sample periods available for the U.K. and the rather different focus of that paper. Perhaps, the most noticeable difference is that the annual IoI_1711 series, which contains a positive trend, stands in contrast to all the U.K. regions, for which trends in rainfall are conspicuously absent.

The above findings, which complement and enhance the descriptive analysis of Murphy et al. (2018), thus conclusively demonstrate the importance and usefulness of the formal modelling of rainfall series undertaken in this paper.