1 Introduction

Polar motion is Earth’s rotation axis movement with respect to its crust. It is a significant component of Earth orientation parameters (EOPs). In general, EOPs are available with a delay of hours to days because of complex data processing procedures. The growing demand from spacecraft tracking and navigation has intensified the researches on EOP predictions (Kalarus et al. 2010; Xu et al. 2012).

Various techniques have also been developed and applied to the polar motion prediction, including those presented by Petrov et al. (1995), Kosek et al. (1998), Kosek (2002), Schuh et al. (2002), Akulenko et al. (2002), Akyilmaz and Kutterer (2004), Akyilmaz et al. (2011), and Liao et al. (2012). These methods either estimate the parameters of harmonic functions and extrapolate them into the future or use stochastic models such as autoregressive integrated moving average processes and Kalman filter with atmospheric angular momentum forecast. A combination of these methods is used for the polar motion prediction (Xu et al. 2012) and the comparison between these methods has also been conducted (Kalarus et al. 2010; Kosek et al. 2007).

These methods work well for short-term prediction. However, their performance is not satisfactory in mid-term and long-term predictions because of increasing prediction error with lead time. On one hand, the limited prediction accuracy is caused by the variable amplitude of Chandler and annual wobbles and short-period oscillations (which have amplitudes up to several mass and even tens of them) (Zhu 1982). On the other hand, in these methods, the instantaneous harmonic information (including frequency, amplitude and phase) of the polar motion is not taken into account although such information is necessary for the polar motion prediction.

The polar motion is the sum of two statically independent parts: trend and undulation. Thus, polar motion prediction consists of trend prediction and undulation prediction. A trend item can be obtained via linear least squares adjustment. The acquired fitting parameter can be applied to trend prediction as well.

However, no tool as simple as the linear least squares adjustment exists for undulation prediction which is the focus of this study. The instantaneous frequency, phase and amplitude of the undulation can be obtained via time–frequency analysis. Normal time–frequency transform (NTFT) proposed by Liu and Hsu (2009); Liu and Hsu (2012) is designed for unbiased measurement of the instantaneous frequency, phase and amplitude of a time series. Its rescaling index \(\mu (\varpi )\) can be a constant as in Gabor transform or linear as in wavelet transform, but even quadratic, cubic or any other function can be used to construct other unnamed transforms. The NTFT is used to construct the prediction model that considers the information coming from both time and frequency domains rather than the information coming from pure time or frequency domain. For the first time, this study uses the time–frequency analysis method (i.e., the NTFT) to predict the polar motion.

In practice, the closer to the current time the time series data are, the more useful they are for prediction. However, the edge effect existing in the time–frequency analysis method makes the time–frequency analysis results at the current time (i.e., the end of the observation) implausible. In order to obtain the instantaneous frequency, amplitude and phase at the current time, half-window can be used to construct the kernel function of the NTFT.

This paper is organized as follows. Section 2 introduces the NTFT concept and explains how it works for time–frequency analysis and prediction. In Sect. 3, based on the normal wavelet transform (a special NTFT), we present the time–frequency analysis of the polar motion and illuminate the predictability of long-term polar motion. In Sect. 4, the NTFT with its kernel function using the half-window is applied to the long-term polar motion predictions. Section 5 provides the conclusion.

2 Normal time–frequency transform for prediction

The NTFT has been proposed by Liu and Hsu (2009); Liu and Hsu (2012); refer to these studies for detailed proof and properties of the NTFT) and is defined as follows.

Definition 2.1

A local time function \(w( t)\in L^1( R)\) is called a normal window if

$$\begin{aligned} \begin{array}{lll} {\hat{w}( \omega )=\left| {\hat{w}( \omega )} \right| =\mathbf Maximum =1} &{} \Leftrightarrow &{} {\omega =0} \\ \end{array} \end{aligned}$$
(1)

where \(\hat{w}( \omega )=\mathrm{F}\left[ {w( t)} \right] _\omega , \mathrm{F}\left[ \cdot \right] _\omega \) is Fourier transform (FT) operator, “\(\left| \cdot \right| \)” denotes modulus operator, and “\(\Leftrightarrow \)” means “if and only if”. \(\hat{w}( \omega )\)is a function of \(\omega \). Equation (1) means that if and only if \(\omega =0\), function \(\hat{w}( \omega )=\left| {\hat{w}( \omega )} \right| \) and meanwhile \(\vert \hat{w}( \omega )\vert \) gets its maximum with the value of 1. We suppose that \({\varvec{\Omega }}( R)\) is being used to denote the set of all normal windows.

Definition 2.2

For a time series \(f( t)\), its NTFT \({\varvec{\Psi }}\) is defined as

$$\begin{aligned} {\varvec{\Psi }}f( {\tau ,\varpi })=\int \limits _R {f( t)\psi ^*( {t-\tau ,\varpi })\mathrm{d}t},\quad \tau ,\varpi \in R \end{aligned}$$
(2)

where \(\tau \) is the time index, \(\varpi \) is the frequency index, ‘*’ denotes the complex conjugation operator, and transform kernel \(\psi ( {t,\varpi })\) satisfies

$$\begin{aligned} \begin{array}{lll} {\hat{\psi }( {\omega ,\varpi })=\left| {\hat{\psi }( {\omega ,\varpi })} \right| =\mathbf{Maximum }=1} &{} \Leftrightarrow &{} \omega \\ \end{array} =\varpi \end{aligned}$$
(3)

in which \(\hat{\psi }( {\omega ,\varpi })=F\left[ {\psi ( {t,\varpi })} \right] _\omega \) and \(\hat{\psi }( {\omega ,\varpi })\) is function of \(\omega \) and \(\varpi \). The meaning of relation (3) is similar to relation (1).

A typical NTFT kernel can be constructed as

$$\begin{aligned} \psi ( {t,\varpi })&= \left| {\mu ( \varpi )} \right| w( {\mu ( \varpi )t})\exp ( {i\varpi t}),w( t)\in {\varvec{\Omega }}( R),\nonumber \\&( {\mu ( \varpi )\in R})\dot{\ne }0 \end{aligned}$$
(4)

where dot means “almost anywhere”. The rescaling index \(\mu (\varpi )\) can be almost anything other than zero, for instance, \(\varpi ,\,\varpi ^{2/3},\,\varpi ^5\) and so on. Letting \(\mu ( \varpi )=1\) in Eq. (4) yields a phase-updated GT and letting \(\mu ( \varpi )=\varpi \) in Eq. (4) yields a normal wavelet transform. In this study, the latter is used for the time–frequency analysis and for the long-term polar motion prediction.

For a multi-component signal \(f( t)\) that consists of \(n\) sub-signals \(h_1 ( t),\,h_2 ( t),\ldots ,\,h_n ( t)\) with frequencies of \(\varpi _1 ,\varpi _2 ,\dots ,\varpi _n \), respectively,

$$\begin{aligned} f( t)&= h_1 ( t)\!+\!h_2 ( t)\!+\!\cdots \!+\!h_n ( t)=A_1 \exp ( {i\varpi _1 t})\nonumber \\&+ A_2 \exp ( {i\varpi _2 t})\!+\!\cdots \!+\!A_n \exp ( {i\varpi _n t}) \end{aligned}$$
(5)

where \(A_1 ,A_2 ,\dots ,A_n \) are complex, its NTFT is

$$\begin{aligned} {\varvec{\Psi }}f( {\tau ,\varpi })&= h_1 ( \tau )\hat{w}\left( {\frac{\varpi \!-\!\varpi _1 }{\mu ( \varpi )}}\right) \!+\!h_2 ( \tau )\hat{w}\left( {\frac{\varpi \!-\!\varpi _2 }{\mu ( \varpi )}}\right) \nonumber \\&+\cdots +h_n ( \tau )\hat{w}\left( {\frac{\varpi -\varpi _n }{\mu ( \varpi )}}\right) \end{aligned}$$
(6)

We fix the parameter \(\tau _k \) as the current observation time. Letting \(\tau =\tau _k \) and \(\varpi =\varpi _1 ,\varpi _2 ,\dots ,\varpi _n\), respectively, one has

$$\begin{aligned} \left\{ {{ \begin{array}{ll} {{\varvec{\Psi }}f( {\tau _k ,\varpi _1 })\!=\!h_1 ( {\tau _k })\!+\!h_2 ( {\tau _k })\hat{w}\left( {\frac{\varpi _1 \!-\!\varpi _2 }{\mu ( {\varpi _1 })}}\right) \!+\!\cdots \!+\!h_n ( {\tau _k })\hat{w}\left( {\frac{\varpi _1 \!-\!\varpi _n }{\mu ( {\varpi _1 })}}\right) }\\ {{\varvec{\Psi }}f( {\tau _k ,\varpi _2 })\!=\!h_1 ( {\tau _k })\hat{w}\left( {\frac{\varpi _2\! -\!\varpi _1 }{\mu ( {\varpi _2 })}}\right) \!+\!h_2 ( {\tau _k })\!+\!\cdots \!+\!h_n ( {\tau _k })\hat{w}\left( {\frac{\varpi _2 \!-\!\varpi _n }{\mu ( {\varpi _2 })}}\right) } \\ \cdots \\ {{\varvec{\Psi }}f( {\tau _k ,\varpi _n })\!=\!h_1 ( {\tau _k })\hat{w}\left( {\frac{\varpi _n \!-\!\varpi _1 }{\mu ( {\varpi _n })}}\right) \!+\!h_2 ( {\tau _k })\hat{w}\left( {\frac{\varpi _n \!-\!\varpi _2 }{\mu ( {\varpi _n })}}\right) \!+\!\cdots \!+\!h_n ( {\tau _k })} \\ \end{array} }} \right. \nonumber \\ \end{aligned}$$
(7)

Equation groups (7) can be rewritten as

$$\begin{aligned} \mathbf{T }=\mathbf{WE } \end{aligned}$$
(8)

where

$$\begin{aligned}&\mathbf{T }=\left[ {{ \begin{array}{l} {{\varvec{\Psi }}f( {\tau _k ,\varpi _1 })} \\ {{\varvec{\Psi }}f( {\tau _k ,\varpi _2 })} \\ \vdots \\ {{\varvec{\Psi }}f( {\tau _k ,\varpi _n })} \\ \end{array} }} \right] ,\nonumber \\&\mathbf{W }=\left[ {{ \begin{array}{l@{\quad }l@{\quad }l@{\quad }l} 1 &{} {\hat{w}\left( {\frac{\varpi _1\! -\!\varpi _2 }{\mu ( {\varpi _1 })}}\right) } &{} \cdots &{} {\hat{w}\left( {\frac{\varpi _1 \!-\!\varpi _n }{\mu ({\varpi _1 })}}\right) } \\ {\hat{w}\left( {\frac{\varpi _2 \!-\!\varpi _1 }{\mu ( {\varpi _2 })}}\right) } &{} 1 &{} \cdots &{} {\hat{w}\left( {\frac{\varpi _2 \!-\!\varpi _n }{\mu ( {\varpi _2 })}}\right) } \\ \vdots &{} \vdots &{} \vdots &{} \vdots \\ {\hat{w}\left( {\frac{\varpi _n \!-\!\varpi _1 }{\mu ( {\varpi _n })}}\right) } &{} {\hat{w}\left( {\frac{\varpi _n \!-\!\varpi _2 }{\mu ( {\varpi _n })}}\right) } &{} \cdots &{} 1 \\ \end{array} }} \right] ,\quad \hbox {and}\nonumber \\&\mathbf{E }= \left[ {{ \begin{array}{l} {h_1 ( {\tau _k })} \\ {h_2 ( {\tau _k })} \\ \vdots \\ {h_n ( {\tau _k })} \\ \end{array} }} \right] \end{aligned}$$
(9)

Vector \(T\) represents NTFT coefficients, matrix \(W\) represents the frequency resolution (Liu and Hsu 2009) or harmonic amplitude weight (Liu and Hsu 2012), and vector \(E\) denotes the sub-signals at \(\tau _k\). The matrix \(W\) comes from the normal window and is independent of sub-signals, providing the basis for sub-signal component separation.

Note that the width of the normal window must not be extremely narrow to ensure the matrix \(W\) invertible. The detailed width depends on the distance between \(\varpi _1 ,\varpi _2 ,\dots ,\varpi _n\). Then, sub-signals at \(\tau _k\) can be obtained as

$$\begin{aligned} \mathbf{E }=\mathbf{W }^{-1}\mathbf{T } \end{aligned}$$
(10)

where the \(\mathbf{W }^{-1}\) means the inverse matrix of the matrix \(W\).

Thus, the signal \(f(t)\) can be extrapolated by

$$\begin{aligned} f( {\tau _k +M})=\mathbf E \left[ {{ \begin{array}{l} {\exp ( {i\varpi _1 M})}\\ {\exp ( {i\varpi _2 M})}\\ \vdots \\ {\exp ( {i\varpi _n M})}\\ \end{array} }} \right] \end{aligned}$$
(11)

where \(M\) is the extrapolation step, i.e., lead time. We use the instantaneous frequency, amplitude and phase at the current observation time \(\tau _k \) to extrapolate the signal. So the current instantaneous information is important for prediction.

In practice, the signals are usually quasi-harmonic. If the frequency, amplitude and phase of the quasi-harmonic signal vary slowly, the prediction method mentioned above is also suitable.

3 Time–frequency analysis of the polar motion

3.1 Data source and preparation

In this study, the polar motion (PMx and PMy) time series covering the period from January 1, 1962 to December 31, 2012 is downloaded from the website of IERS Earth Orientation Center (ftp://hpiers.obspm.fr/iers/eop/eopc04-05). The sampling interval is 1 day. We constructed the complex polar motion series, which has a real part of PMx and an imaginary part of PMy.

We use the linear least squares adjustment to get the trend model of the polar motion. Thus, the detrended polar motion can be obtained by subtracting the trend model from the polar motion. The detrended polar motion is used to analyze the time–frequency characteristics of the polar motion.

3.2 Spectrum analysis of the polar motion

We obtained the normalized spectrum of the detrended polar motion using fast Fourier transform (FFT), as shown in Fig. 1. Chandler wobble, prograde and retrograde annual components can be seen with periods of approximately 432, 365, and \(-\)366 days, respectively. Excitations to the latter two are almost identical. The response of the prograde annual wobble to the excitations is larger than the retrograde annual wobble, because the prograde annual wobble is rather closer in frequency to the Chandler wobble than the retrograde annual wobble.

Fig. 1
figure 1

Normalized spectrum of the detrended polar motion by FFT

The spectrum reflects only the averaged frequency. It cannot reflect the varying of frequency and amplitude with time. The instantaneous time–frequency parameters can be captured only through the time–frequency analysis.

3.3 Time–frequency analysis of the polar motion

The normal wavelet transform is used to analyze the detrended polar motion. Its kernel function is constructed as \(\psi ( {t,\varpi })=\left| \varpi \right| w( {\varpi t})\exp ( {i\varpi t})\) and \(w( t)\) is the normal Hamming window, i.e.,

$$\begin{aligned} w\left( t \right) = \left( {1.08l} \right) ^{{ - 1}} \left\{ {\begin{array}{l@{\quad }l} {\begin{array}{l} {0.54 + 0.46\cos \left( \pi t /l \right) } \\ 0 \\ \end{array} } &{} {\begin{array}{l} {\left| t \right| \le l}\\ {{\text {otherwise}}}\\ \end{array}}\\ \end{array}} \right. \nonumber \\ \end{aligned}$$
(12)

In order to separate the three wobbles, \(l=100\) according to the time–frequency resolution (Liu and Hsu 2009). The unit of \(t\) is day.

There are three maximum ridges along the time–frequency direction in the normal wavelet transform spectrum. The normal wavelet transform coefficients along the three ridges are nearly the three wobbles, respectively. Then we can extract the three wobbles at different times. The instantaneous frequency and amplitude of the three wobbles are computed and are shown in Fig. 2. We can also get fitting model of the detrended polar motion by adding the three main wobbles together, as shown in Fig. 3. Only the wobbles without edge effect are shown in Figs. 2 and 3.

Fig. 2
figure 2

The instantaneous frequency (a) and amplitude (b) of Chandler wobble (I), prograde (II) and retrograde (III) annual wobbles obtained via the normal wavelet transform

Fig. 3
figure 3

The detrended polar motion (the gray line) and its corresponding fitting model (the black line)

It can be seen from Fig. 2 that the frequency and amplitude of the Chandler wobble, prograde and retrograde annual wobbles of the polar motion are slowly varying with time. So they can be considered as quasi-harmonic processes. Thus, instantaneous frequency and amplitude are necessary to depict the time–frequency characteristics of these wobbles. These factors are also the basis for the normal wavelet transform to predict the long-term polar motion.

The correlation between actual and fitting values is 0.9896 and 0.9907 for the PMx and for the PMy, respectively. The Root-mean-square error (RMSE) between them is 19.73 and 21.4 mas for the PMx and for the PMy, respectively. Both correlation coefficient and RMSE indicate the feasibility of the normal wavelet transform for long-term polar motion prediction.

4 Prediction of the polar motion

The detrended polar motion is quasi-harmonic. Therefore, the instantaneous time–frequency information of the polar motion at the current time should be the most useful for predicting its future. However, a natural problem in time–frequency analysis methods is edge effect. The edge effect makes the time–frequency analysis results at the current time implausible.

In order to obtain the accurate time–frequency information at the current time, we use the normal half-window to construct the kernel function. In the rest of this paper, numerical examples use the normal half-Hamming window, i.e., if \(\varpi \ge 0\),

$$\begin{aligned} w\left( t \right) \!=\! 2\left( {1.08l} \right) ^{{ - 1}} \left\{ {\begin{array}{l@{\quad }l} {\begin{array}{l} {\left( {0.54 \!+\! 0.46\cos \left( {{{\pi t}/l}} \right) } \right) } \\ 0\\ \end{array} } &{} {\begin{array}{l} { - l \le t \le 0}\\ {{\text {otherwise}}} \\ \end{array} } \\ \end{array} } \right. \nonumber \\ \end{aligned}$$
(13)

and if \(\varpi \le 0\)

$$\begin{aligned} w\left( t \right) \!=\! 2\left( {1.08l} \right) ^{{ - 1}} \left\{ {\begin{array}{l@{\quad }l} {\begin{array}{l} {\left( {0.54 \!+\! 0.46\cos \left( {{{\pi t}/l}} \right) } \right) } \\ 0\\ \end{array} } &{} {\begin{array}{l} { 0 \le t \le l}\\ {{\text {otherwise}}} \\ \end{array} } \\ \end{array} } \right. \nonumber \\ \end{aligned}$$
(14)

In Eqs. (13) and (14), \( l({>}0)\) denotes the original width of the window. The unit of \(t\) is day. We can get the instantaneous frequency, amplitude and phase at the current time using only the history data. It is the accurate instantaneous information at the current time, especially the phase information, that makes the NTFT particularly suitable for the long-term prediction.

The detailed steps are as follows: (1) The detrended polar motion is calculated by subtracting the linear trend from the polar motion series; (2) The instantaneous frequencies of sub-signals at the current time are determined through its normal wavelet transform spectrum; (3) Take the instantaneous frequencies into Eq. (7), then we can get the instantaneous time–frequency information at current time by Eq. (10); and (4) Extrapolate the polar motion series according to Eq. (11).

4.1 One-year-lead prediction

One-year-lead predictions from 2003 to 2012 are obtained with the steps described above. The results are compared with the IERS “Bulletin A” predictions that have the comparable lead time. The prediction accuracies are measured by the correlation coefficient and RMSE as well. The results are shown in Fig. 4 and the statistical results are listed in Tables 1 and 2.

Fig. 4
figure 4figure 4

The polar motion series (the gray line), the IERS “Bulletin A” predictions (the black dash line) and the NTFT predictions (the black line) from 2003 to 2012. The kernel function of NTFT is constructed as \(\psi ( {t,\varpi })=\left| \varpi \right| w( {\varpi t})\exp ( {i\varpi t})\), in which \(w(t)\) is chosen as Eqs. (13) and (14) with \(l=12\pi \) days or higher.

Table 1 Correlation coefficient between the polar motion series and predictions
Table 2 RMSE (mas) of the predictions

Table 1 shows that the correlations in the predictions of 2006 are particularly smaller compared with others in the NTFT method. This result is caused by the abnormality during the second half of 2005 and first half of 2006, as indicated in the Sect. 4.2. The statistical results of 10 predictions show that the observations and predictions are highly correlated with the average degree of more than 0.98 for both PMx and PMy.

The mean RMSE of PMx and PMy is 18.49 and 19.26 mas in the NTFT method, respectively, which is less than that of IERS “Bulletin A”. This mean value is significantly less than that of other methods listed in Table 3. Furthermore, the RMSE of other methods (Table 3) increases with lead time, while the RMSE of the NTFT method changes a little (not more than 2 mas) with the lead time.

Table 3 RMSE (mas) of the predictions by other methods

4.2 Five-year-lead prediction

Our method can also be used for considerably longer-term prediction. From 1997, 5-year-lead predictions are made. The results are shown in Fig. 5. Correlation coefficient and RMSE are also used to assess prediction accuracy. Both are listed in Table 4.

Fig. 5
figure 5figure 5

The polar motion series (the gray line) and the NTFT predictions (the black line) from 1997. The kernel function of NTFT is constructed as \(\psi ( {t,\varpi })=\left| \varpi \right| w( {\varpi t})\exp ( {i\varpi t})\), in which \(w( t)\) is chosen as Eqs. (13) and (14) with \(l=20\pi \) days or higher

Table 4 Correlation coefficient and RMSE (mas) of 5-year-lead predictions

An abnormality during the second half of 2005 and first half of 2006 can be seen in Fig. 5. The abnormality causes a phase delay of approximately 40 days. This delay is significant in predictions for 2007, 2008, and 2009 as shown in Fig. 5g–j. In Fig. 5k–l, the predictions starting from 2007 and 2008 return to the normal state. It indicates that the non-stationary behavior of the polar motion process during 2005–2006 causes the poor 5-year-lead predictions from 2003 to 2008, 2004 to 2009, 2005 to 2010 and 2006 to 2011.

The observations and 5-year-lead predictions are highly correlated with the correlation coefficient larger than 0.93 for both PMx and PMy. The average RMSEs for the PMx and the PMy are 48.00 and 45.17 mas, respectively.

4.3 Predictions after 2013

The 1-year-lead predictions for 2013 and 5-year-lead predictions from 2013 to 2017 are reported in Fig. 6. Parameters associated with the algorithm are selected as shown in previous figures. The predictions can be used to verify the effectiveness of the algorithm.

Fig. 6
figure 6

The 1-year-lead predictions obtained through IERS “Bulletin A” (the dash line) and NTFT method (the black line) for 2013 (a) and 5-year-lead predictions from 2013 to 2017 (b) by NTFT.

5 Conclusion

In this paper, we explain how the NTFT can be used for time–frequency analysis and long-term prediction. The Chandler wobble, prograde and retrograde annual wobbles of the polar motion can be considered as quasi-harmonic processes, as indicated by the NTFT. Based on these factors, a prediction method based on the NTFT is proposed. The NTFT with a half-window captures the instantaneous frequency, amplitude and phase accurately at the current time. It enables our method particularly suitable for long-term prediction.

In the numerical examples, the mean correlation between the observations and 1-year-lead predictions are higher than 0.98, and the correlation is higher than 0.93 in the case of the 5-year-lead prediction. The mean RMSE of the 1-year-lead prediction is less than that of other methods listed in Table 3. The comparison between the IERS “Bulletin A” and the NTFT predictions shows that our approach performs better in 1-year-lead prediction. Furthermore, the 5-year-lead predictions also can be made.

The difficult period for the 5-year-lead predictions is the second half of 2005 and first half of 2006. There is an abnormal phase delay of approximately 40 days. Finally, 1-year-lead predictions for 2013 and 5-year-lead predictions from 2013 to 2017 are provided.