1 Introduction

Long memory refers to statistical dependence between observations far apart in time. Indications of long memory have been found in various fields such as hydrology, meteorology, geophysics, psychology, economics, and finance (for surveys see, e.g., Beran 1994; Graves et al. 2017). A popular measure of the degree of long-range dependence is the memory parameter d, which can be defined for a given stationary time series in terms of the rate of decay of its autocorrelation function or the steepness of its spectral density near frequency zero. The most widely used long memory models are the autoregressive fractionally integrated moving average (ARFIMA) models. Pötscher (2002) showed that drawing inferences about d is only possible when severely restricted subsets of ARFIMA models are used. Unfortunately, restrictions of this type are not plausible in most applications, with series of stock returns being notable exceptions because of the absence of significant short-term autocorrelation.

A popular semiparametric method of estimating d, which allows to estimate d without simultaneously estimating the other model parameters, is based on the log periodogram. Since the periodogram is a very erratic estimator for the spectral density, it seems natural to smooth it prior to its use in the estimation of the memory parameter d (Hassler 1993; Peiris and Court 1993; Reisen 1994). While smoothing has the expected positive effect on the variance of the estimator, it may also increase the bias (Chen et al. 1994; Reschenhofer et al. 2020), which is especially problematic when individual estimates are first obtained with a rolling estimation window and then combined by averaging. In this paper, we take an opposite approach. Instead of reducing the resolution through smoothing, we increase it by doubling the number of frequencies. However, there are two problems. First, the Fast Fourier Transform allows only the fast computation of the periodogram at the Fourier frequencies. Second, the periodogram ordinates can in the case of non-Fourier frequencies no longer be interpreted as the squared amplitudes of simple sinusoids. We address the latter problem by modifying the periodogram and the former by introducing a two-stage algorithm for the fast computation of the modified periodogram. In the first stage, a modified version of the Fast Fourier Transform is used for the computation of the standard periodogram at non-standard frequencies, which is subsequently subjected to a simple and efficient transformation in order to rectify the distortions caused by the violation of the standard orthogonality conditions in the case of non-Fourier frequencies. Both a simulation study and an empirical study with financial high-frequency data are carried out to verify the relevance of our approach.

The paper is structured as follows. Section 2 briefly reviews various frequency-domain estimators for the memory parameter d and explains how they can be modified by the inclusion of non-Fourier frequencies. In Sect. 3, an efficient algorithm for the computation of the modified estimators is described. Sections 4 and 5 present the results of the simulation study and the empirical study, respectively. Section 6 concludes.

2 Frequency-domain estimation of the memory parameter

A parametric approach (see Fox and Taqqu 1986) to estimate the memory parameter (or fractional differencing parameter) d of the autoregressive fractionally integrated moving average (ARFIMA) model

$$ y_{t} = \left( {1 - \phi_{1} L - \cdots - \phi_{p} L^{p} } \right)^{ - 1} \left( {1 - L} \right)^{ - d} \left( {1 - \theta_{1} L - \cdots - \theta_{q} L^{q} } \right)u_{t} $$
(1)

(Granger and Joyeux 1980; Hosking 1981) with spectral density

$$ f\left( \omega \right) = \frac{{\sigma^{2} }}{2\pi }\left| {1 - e^{ - i\omega } } \right|^{ - 2d} \left| {1 - \mathop \sum \limits_{j = 1}^{q} \theta_{j} e^{ - i\omega j} } \right|^{2} \left| {1 - \mathop \sum \limits_{j = 1}^{p} \phi_{j} e^{ - i\omega j} } \right|^{ - 2} $$
(2)

only makes sense when we are sure that the parametric model is correctly specified (see, e.g., Robinson 1995; Reschenhofer 2013). The uncertainty regarding the order \( \left( {p,q} \right) \) of the ARMA model can be avoided by adopting Geweke and Porter-Hudak’s (1983) approach. Their semiparametric estimator \( \hat{d}_{GPH} \) is obtained as the slope of a simple linear regression which explains the log periodogram

$$ \log I\left( {\omega_{k} } \right) = \log \frac{1}{2\pi n}\left| {\mathop \sum \limits_{t = 1}^{n} y_{t} e^{{ - i\omega_{k} t}} } \right|^{2} $$
(3)

of the observations \( y_{1} , \ldots ,y_{n} \) by the deterministic regressor

$$ - \log \left| {1 - e^{{ - i\omega_{j} }} } \right|^{2} = - 2 \log \left( {2\sin \left( {\omega_{j} /2} \right)} \right), $$
(4)

where

$$ \omega_{j} = 2\pi j/n, \;j = 1, \ldots ,K \ll \left[ {n/2} \right], $$
(5)

are the first \( K \) Fourier frequencies. Hurvich et al. (1998) showed that this estimator is consistent when only Fourier frequencies in the neighborhood of frequency zero are used, more precisely when \( K = o\left( {n^{4/5} } \right) \) and \( \log^{2} \left( n \right) = o\left( K \right) \).

Because of the irregular behavior of the spectral density (2) in the neighborhood of frequency zero, the standard asymptotic results for the normalized periodogram ordinates \( J\left( {\omega_{j} } \right) = I\left( {\omega_{j} } \right)/f\left( {\omega_{j} } \right) \) do not hold when the frequencies \( \omega_{j} \) are too low. However, Künsch (1986) was able to show that \( J\left( {\omega_{j} } \right),\;j = H + 1, \ldots ,H + K \), are still asymptotically i.i.d. standard exponential provided that \( \left( {H + 1} \right)/\sqrt n \to \infty \) and \( \left( {H + K} \right)/n \to 0 \). Robinson (1995) therefore proposed to remove the very lowest Fourier frequencies from the log periodogram regression. The most conservative variant \( \hat{d}_{tr} \) of Robinson’s estimator is obtained by trimming out only the contribution from the lowest frequency. Clearly, removing further frequencies will only be an option if the removal of the first does not already lead to a deterioration in the estimation properties.

Another modification of the log periodogram regression is based on smoothing. Hassler (1993), Peiris and Court (1993), Reisen (1994) proposed to smooth the periodogram before it is used for the estimation of the memory parameter d. This can be achieved by cutting off the sum in the representation

$$ I\left( {\omega_{j} } \right) = \frac{1}{2\pi }\mathop \sum \limits_{{s = - \left( {n - 1} \right)}}^{n - 1} \hat{\gamma }\left( s \right)e^{{ - i\omega_{j} s}} $$
(6)

at a much smaller value than \( n - 1 \) and downweighting the higher-order autocovariances \( \hat{\gamma }\left( s \right) \) with a suitable lag window such as the Bartlett window or the Parzen window. In general, the resulting estimators have typically a smaller variance and a larger bias than those based on the raw periodogram (Chen et al. 1994; Reschenhofer et al. 2020). However, the squared bias is not always small relative to the variance. For example, in the case of high-frequency financial data, preliminary estimates of the memory parameter d must first be obtained separately from the different trading sessions because of the overnight gaps and then be combined by averaging. Thus, the variance of the global estimator decreases with the number of trading sessions but the bias remains fixed. Estimators with a large bias are therefore of no use. We may keep the bias small by using an estimator based on minimal smoothing, e.g.,

$$ {{\tilde{I}\left( {\omega_{j} } \right) = \left( {I\left( {\omega_{j - 1} } \right) + I\left( {\omega_{j} } \right) + I\left( {\omega_{j + 1} } \right)} \right)} \mathord{\left/ {\vphantom {{\tilde{I}\left( {\omega_{j} } \right) = \left( {I\left( {\omega_{j - 1} } \right) + I\left( {\omega_{j} } \right) + I\left( {\omega_{j + 1} } \right)} \right)} 3}} \right. \kern-0pt} 3}, $$
(7)

or by using a more sophisticated smoothing method that does not systematically increase the bias. Reschenhofer and Mangat (2020) managed to achieve a reduction in the variance which does not come at the expense of an increase in the bias. Their estimator is based on running a log periodogram regression repeatedly for different partitions of the data. In the simplest case, only the whole sample and the first and second halves are used but the frequency range \( (0,\omega_{K} ] \), is kept constant, which implies that reducing the sample size by half results in half the number of included Fourier frequencies. The simple estimator

$$ \hat{d}_{2} = \frac{1}{2}\hat{d}_{1} + \frac{1}{4}\hat{d}_{21} + \frac{1}{4}\hat{d}_{22} , $$
(8)

where \( \hat{d}_{1} ,\hat{d}_{21} ,\hat{d}_{22} \) are the OLS estimators for d based on the log periodograms \( L^{1} ,L^{21} , L^{22} \) of the whole sample and the first and second halves, respectively, can easily be extended to the more general estimator

$$ \hat{d}_{k} = \frac{1}{k}\left( {\hat{d}_{1} + \mathop \sum \limits_{j = 2}^{k} \frac{1}{j}(\hat{d}_{j1} + \cdots + \hat{d}_{jj} )} \right), $$
(9)

which is based on k partitions.

Reschenhofer (2013) observed that the root mean square error (RMSE), which is defined as the square root of the average of squared errors, of Geweke and Porter-Hudak’s (1983) semiparametric estimator \( \hat{d}_{GPH} \) can be reduced significantly by including additional frequencies. However, he did not examine the effect on other estimators and he also did not address the problem of efficiently computing the periodogram at non-Fourier frequencies.

For an arbitrary frequency \( \omega_{k} \), where k is not necessarily an integer, it is more appropriate to define the periodogram by

$$ I_{k} = \frac{n}{8\pi }\left( {\hat{A}_{k}^{2} + \hat{B}_{k}^{2} } \right), $$
(10)

where

$$ \hat{R}_{k}^{2} = \hat{A}_{k}^{2} + \hat{B}_{k}^{2} $$
(11)

is the least squares (LS) estimate of the squared amplitude of a (normalized) sinusoid

$$ \begin{aligned} y_{t} & = \mu + R_{k} \sin \left( {\omega_{k} t + \varphi } \right) + u_{t} \\ & = \mu + \underbrace {{R_{k} \sin \left( \varphi \right)}}_{{A_{k} }}\cos \left( {\omega_{k} t} \right) + \underbrace {{R_{k} \cos \left( \varphi \right)}}_{{B_{k} }}\sin \left( {\omega_{k} t} \right) + u_{t} \\ \end{aligned} $$
(12)

fitted to the data. If

$$ \mathop \sum \limits_{t = 1}^{n} \cos \left( {\omega_{k} t} \right) = \mathop \sum \limits_{t = 1}^{n} \sin \left( {\omega_{k} t} \right) = 0, $$
(13)
$$ \mathop \sum \limits_{t = 1}^{n} \cos \left( {\omega_{k} t} \right)\sin \left( {\omega_{k} t} \right) = 0, $$
(14)

and

$$ \mathop \sum \limits_{t = 1}^{n} \cos^{2} \left( {\omega_{k} t} \right) = \mathop \sum \limits_{t = 1}^{n} \sin^{2} \left( {\omega_{k} t} \right) = n/2, $$
(15)

then the LS estimates \( \hat{A}_{k} \), \( \hat{B}_{k} \) simplify to

$$ \tilde{A}_{k} = \frac{2}{n}\mathop \sum \limits_{t = 1}^{n} y_{t} \cos \left( {\omega_{k} t} \right),\quad \tilde{B}_{k} = \frac{2}{n}\mathop \sum \limits_{t = 1}^{n} y_{t} \sin \left( {\omega_{k} t} \right) $$
(16)

or, because of (13), to

$$ \underline{{\tilde{A}}}_{k} = \frac{2}{n}\mathop \sum \limits_{t = 1}^{n} \underbrace {{\left( {y_{t} - \bar{y}} \right)}}_{{\underline{y}_{t} }} {\text{cos}}\left( {\omega_{k} t} \right),\quad \underline{{\tilde{B}}}_{k} = \frac{2}{n}\mathop \sum \limits_{t = 1}^{n} \underline{y}_{t} {\text{sin}}\left( {\omega_{k} t} \right). $$
(17)

The definition (10) therefore matches the conventional definition of the periodogram. The conditions (13), (14), (15) are satisfied for the regular Fourier frequencies \( \omega_{k} = 2\pi k/n, \;1 \le k \le m = [n/2] \) (except the last one if \( n \) is an even number), hence \( \hat{A}_{k} \) and \( \hat{B}_{k} \) can easily be obtained from

$$ \underline{{\tilde{A}}}_{k} = \frac{2}{n}\text{Re} \left( { \underline{{\tilde{C}}}_{k} } \right),\quad \underline{{\tilde{B}}}_{k} = - \frac{2}{n}\text{Im} \left( { \underline{{\tilde{C}}}_{k} } \right), $$
(18)

where

$$ \underline{{\tilde{C}}}_{k} = \mathop \sum \limits_{t = 1}^{n} \underline{y}_{t} \exp \left( { - i\omega_{k} t} \right),\quad k = 1, \ldots ,m = \left[ {n/2} \right], $$
(19)

is the Discrete Fourier Transform (DFT) of \( y_{1} , \ldots ,y_{n} \), which can be efficiently computed with the Fast Fourier Transform (FFT; see Cooley and Tukey 1965).

Since condition (13) is not satisfied for the fractional Fourier frequencies \( \omega_{k} = 2\pi k/n, \;k = 1/2, 3/2, \ldots ,m - 1/2 \), we introduce a two-stage algorithm for the efficient computation of the modified periodogram (10) at these frequencies in the next section. This algorithm allows to carry out an extensive simulation study in order to examine the effect of including additional frequencies in the log periodogram regression on the performance of various estimators. In Sect. 5, we compare the standard estimator \( \hat{d}_{GPH} \) based on \( I_{1} ,I_{2} , \ldots ,I_{K} \) with the analogous estimator \( \hat{d}_{GPH}^{ + } \) based on \( I_{1} ,I_{1.5} , \ldots ,I_{K} \), the estimator \( \hat{d}_{tr} \) based on \( I_{2} ,I_{3} , \ldots ,I_{K} \) with the estimator \( \hat{d}_{tr}^{ + } \) based on \( I_{2} ,I_{2.5} , \ldots ,I_{K} \), the estimator \( \hat{d}_{sm} \) based on \( (I_{1} + I_{2} + I_{3} )/3,(I_{2} + I_{3} + I_{4} )/3, \ldots ,\left( {I_{K - 2} + I_{K - 1} + I_{K} } \right)/3 \) with the estimator \( \hat{d}_{sm}^{ + } \) based on \( (I_{1} + I_{1.5} + I_{2} )/3,(I_{1.5} + I_{2} + I_{2.5} )/3, \ldots ,\left( {I_{K - 1} + I_{K - 0.5} + I_{K} } \right)/3 \), and the estimator \( \hat{d}_{2} \) with the estimator

$$ \hat{d}_{2}^{ + } = \frac{1}{2}\hat{d}_{1}^{ + } + \frac{1}{4}\hat{d}_{21}^{ + } + \frac{1}{4}\hat{d}_{22}^{ + } . $$
(20)

3 Algorithm

In case of the regular Fourier frequencies \( \omega_{k} = 2\pi k/n, \;1 \le k \le m = [n/2] \), \( \hat{A}_{k} \) and \( \hat{B}_{k} \) are identical to \( \underline{{\tilde{A}}}_{k} \) and \( \underline{{\tilde{B}}}_{k} \) and can therefore be computed with the Fast Fourier Transform. For the fractional Fourier frequencies \( \omega_{k} = 2\pi k/n, \) \( k = 1/2, 3/2, \ldots ,m - 1/2 \), an analogous approach can be used for the computation of \( \underline{{\tilde{A}}}_{k} \) and \( \underline{{\tilde{B}}}_{k} \). Suppose that \( n \) is even. Then

$$ \begin{aligned} \underline{{\tilde{C}}}_{k} & = \mathop \sum \limits_{t = 1}^{n} \underline{y}_{t} \exp \left( { - i\omega_{k} t} \right) \\ & = \mathop \sum \limits_{t = 1}^{m} \underline{y}_{2t - 1} \exp \left( { - i\omega_{k} \left( {2t - 1} \right)} \right) + \mathop \sum \limits_{t = 1}^{m} \underline{y}_{2t} \exp \left( { - i\omega_{k} 2t} \right) \\ & = \exp \left( {i\omega_{k} } \right)\underbrace {{\sum\nolimits_{t = 1}^{m} {\underline{y}_{2t - 1} } \exp \left( { - i\omega_{2k} t} \right)}}_{{ \underline{{\tilde{C}}}_{k} \left( 1 \right)}} + \underbrace {{\sum\nolimits_{t = 1}^{m} {\underline{y}_{2t} } \exp \left( { - i\omega_{2k} t} \right)}}_{{ \underline{{\tilde{C}}}_{k} \left( 2 \right)}}, \\ \end{aligned} $$
(21)

where \( \underline{{\tilde{C}}}_{k} \left( 1 \right) \) and \( \underline{{\tilde{C}}}_{k} \left( 2 \right) \) need to be computed only for \( 2k \le m \) because

$$ \exp \left( { - i\omega_{n - 2k} t} \right) = \exp \left( {2\pi - i\omega_{2k} t} \right) = \exp \left( { - i\omega_{2k} t} \right). $$
(22)

Thus, the number of required calculations is approximately reduced by half. If \( n \) is divisible by 4 or, in the best case, is even a power of 2, this technique can be applied repeatedly. Fortunately, by appending zeroes to the end of the time series (zero padding), it can always be achieved that \( n \) is a power of 2.

However, in case of the fractional Fourier frequencies, we obtain with the help of standard formulas for sums of trigonometric functions (see Gradshteyn and Ryzhik 2007, p. 37)

$$ \mathop \sum \limits_{t = 1}^{n} \cos \left( {\omega_{k} t} \right) = \cos \left( {\omega_{k} \left( {n + 1} \right)/2} \right)\sin \left( {\omega_{k} n/2} \right)/\sin \left( {\omega_{k} /2} \right) = - 1 $$
(23)

and

$$ \mathop \sum \limits_{t = 1}^{n} \sin \left( {\omega_{k} t} \right) = \sin \left( {\omega_{k} \left( {n + 1} \right)/2} \right)\sin \left( {\omega_{k} n/2} \right)/\sin \left( {\omega_{k} /2} \right) = \cot \left( {\omega_{k} /2} \right), $$
(24)

hence condition (13) is violated and we must therefore still try to get the values of \( \hat{A}_{k} \) and \( \hat{B}_{k} \) in an efficient manner from \( \underline{{\tilde{C}}}_{k} \) or, equivalently, from \( \underline{{\tilde{A}}}_{k} \) and \( \underline{{\tilde{B}}}_{k} \). Eliminating the constant from (12) gives

$$ \underline{y}_{t} = A_{k} \underbrace {{\left( {\cos \left( {\omega_{k} t} \right) + \frac{1}{n}} \right)}}_{{\underline{\cos } \left( {\omega_{k} t} \right)}} + B_{k} \underbrace {{\left( {\sin \left( {\omega_{k} t} \right) - \overbrace {{\cot \left( {\omega_{k} /2} \right)/n}}^{{\tau_{k} }}} \right)}}_{{\underline{\sin } \left( {\omega_{k} t} \right)}} + u_{t} . $$
(25)

Thus,

$$ \begin{aligned} \hat{A}_{k} & = {{\left( {\mathop \sum \limits_{t = 1}^{n} \underline{\sin }^{2} \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{y}_{t} - \mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{\sin } \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\sin } \left( {\omega_{k} t} \right)\underline{y}_{t} } \right)} \mathord{\left/ {\vphantom {{\left( {\mathop \sum \limits_{t = 1}^{n} \underline{\sin }^{2} \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{y}_{t} - \mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{\sin } \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\sin } \left( {\omega_{k} t} \right)\underline{y}_{t} } \right)} {D_{k} }}} \right. \kern-0pt} {D_{k} }} \\ & = {{\left( { \frac{{n^{2} }}{2}\left( { \frac{1}{2} - \tau_{k}^{2} } \right) \underline{{\tilde{A}}}_{k} - \frac{{n^{2} }}{2}\frac{{\tau_{k} }}{n}\underline{{\tilde{B}}}_{k} } \right)} \mathord{\left/ {\vphantom {{\left( { \frac{{n^{2} }}{2}\left( { \frac{1}{2} - \tau_{k}^{2} } \right) \underline{{\tilde{A}}}_{k} - \frac{{n^{2} }}{2}\frac{{\tau_{k} }}{n}\underline{{\tilde{B}}}_{k} } \right)} {D_{k} }}} \right. \kern-0pt} {D_{k} }} \\ & = \underline{{\tilde{A}}}_{k} + \frac{2}{{n^{2} \left( {1 - 2\tau_{k}^{2} } \right) - 2}}\left( {\underline{{\tilde{A}}}_{k} - n\tau_{k} \underline{{\tilde{B}}}_{k} } \right) \\ \end{aligned} $$
(26)

and

$$ \begin{aligned} \hat{B}_{k} & = {{\left( {\mathop \sum \limits_{t = 1}^{n} \underline{\cos }^{2} \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\sin } \left( {\omega_{k} t} \right)\underline{y}_{t} - \mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{\sin } \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{y}_{t} } \right)} \mathord{\left/ {\vphantom {{\left( {\mathop \sum \limits_{t = 1}^{n} \underline{\cos }^{2} \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\sin } \left( {\omega_{k} t} \right)\underline{y}_{t} - \mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{\sin } \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{y}_{t} } \right)} {D_{k} }}} \right. \kern-0pt} {D_{k} }} \\ & = {{\left( { \frac{{n^{2} }}{2}\left( { \frac{1}{2} - \frac{1}{{n^{2} }} } \right) \underline{{\tilde{B}}}_{k} - \frac{{n^{2} }}{2}\frac{{\tau_{k} }}{n} \underline{{\tilde{A}}}_{k} } \right)} \mathord{\left/ {\vphantom {{\left( { \frac{{n^{2} }}{2}\left( { \frac{1}{2} - \frac{1}{{n^{2} }} } \right) \underline{{\tilde{B}}}_{k} - \frac{{n^{2} }}{2}\frac{{\tau_{k} }}{n} \underline{{\tilde{A}}}_{k} } \right)} {D_{k} }}} \right. \kern-0pt} {D_{k} }} \\ & = \underline{{\tilde{B}}}_{k} + \frac{2}{{n^{2} \left( {1 - 2\tau_{k}^{2} } \right) - 2}}\left( {n^{2} \tau_{k}^{2} \underline{{\tilde{B}}}_{k} - n\tau_{k} \underline{{\tilde{A}}}_{k} } \right), \\ \end{aligned} $$
(27)

where the auxiliary quantities

$$ \begin{aligned} D_{k} & = \mathop \sum \limits_{t = 1}^{n} \underline{\cos }^{2} \left( {\omega_{k} t} \right)\mathop \sum \limits_{t = 1}^{n} \underline{\sin }^{2} \left( {\omega_{k} t} \right)\underline{y}_{t} - \left( { \mathop \sum \limits_{t = 1}^{n} \underline{\cos } \left( {\omega_{k} t} \right)\underline{\sin } \left( {\omega_{k} t} \right)} \right)^{2} \\ & = \frac{{n^{2} }}{2}\left( { \frac{1}{2} - \tau_{k}^{2} - \frac{1}{{n^{2} }}} \right) \\ \end{aligned} $$
(28)

and

$$ \tau_{k}^{2} = \left( {\cot \left( {\pi k/n} \right)/n} \right)^{2} \;\sim\; \left( {\pi k} \right)^{ - 2} $$
(29)

do not depend on the data and therefore only need to be computed once. After the computation of \( \underline{{\tilde{C}}}_{k} \), \( \underline{{\tilde{A}}}_{k} \), and \( \underline{{\tilde{B}}}_{k} \) with the modified Fast Fourier Transform in the first stage, the quantities of interest, \( \hat{A}_{k} \) and \( \hat{B}_{k} \), can easily be obtained with the formulas (26) and (27) in the second stage.

4 Simulations

To investigate the bias-variance tradeoff for different estimators of the memory parameter d, we carry out an extensive simulation study. In general, one would expect that the bias decreases and the variance increases as the complexity of the estimation procedure increases. However, things get more complicated when more sophisticated procedures are involved. E.g., Geman et al. (1992) found evidence for the bias-variance tradeoff in nonparametric procedures such as the k-nearest neighbors algorithm and kernel regression whereas Neal et al. (2020) found in the context of neural networks that both bias and variance can decrease as the number of parameters grows. In the case of the log periodogram regression, the number of parameters is of dubious relevance. The performance of this procedure critically depends on the highest frequency used. Smoothing the periodogram before carrying out the regression requires the specification of an additional parameter, namely the cut-off point of the lag window. Thus, smoothing increases the number of parameters but at the same time also reduces the spectral resolution and thereby increases the bias and decreases the variance rather than the other way round. When dealing with the bias-variance tradeoff, the complexity of a procedure should therefore not be judged solely on the basis of the number of parameters. More important is the flexibility to fit the data. Overfitting typically tends to increase the variance while underfitting tends to increase the bias. Since the periodogram may be regarded as a representation of the raw data in the frequency domain, it provides a perfect fit. Clearly, smoothing compromises this fit. On the other hand, it is also not possible to improve the perfect fit by the introduction of additional periodogram ordinates at non-Fourier frequencies. These new ordinates are, in a certain sense, just interpolations of the original periodogram ordinates. Their inclusion therefore brings about some kind of minimal local smoothing which has the big advantage that it does not systematically have a negative impact on the bias. This advantage is particularly important when estimates obtained from separate samples of fixed size are combined by averaging because in this case only the variance decreases as the number of samples increases but the bias remains fixed.

With the help of the R-package ‘fracdiff’, 40,000 realizations of length \( n = 390, 3000 \) of ARFIMA(1,d,0) processes with standard normal innovations and parameter values \( d = - 0.25, - 0.1, 0, 0.1, 0.25 \) and \( \phi_{1} = - 0.25, - 0.1, 0, 0.1, 0.25 \), respectively, are generated using a burn-in period of 40,000. For each realization, the memory parameter d is estimated with the estimators \( \hat{d}_{GPH} \), \( \hat{d}_{tr} \), \( \hat{d}_{sm} \), and \( \hat{d}_{2} \), which use only the conventional periodogram, and their counterparts \( \hat{d}_{GPH}^{ + } \), \( \hat{d}_{tr}^{ + } \), \( \hat{d}_{sm}^{ + } \), and \( \hat{d}_{2}^{ + } \), which use also the modified periodogram at fractional Fourier frequencies. Tables 1 and 2, 3 and 4, 5 and 6 give the sample bias, the sample variance, and the root mean square error for the sample sizes \( n = 390 \) and \( n = 3000 \), respectively. The smaller sample size corresponds to the number of minutes in a regular trading session for U.S. stocks, which starts at 9:30 a.m. and ends at 4:00 p.m. The parameter K, which determines the highest frequency to be used for estimation, is set to \( K = n^{\alpha } \), where \( \alpha = 0.5 \).

Table 1 Sample bias of the estimators \( \hat{d}_{GPH} \) (log periodogram regression), \( \hat{d}_{GPH}^{ + } \) (log periodogram regression additionally with non-Fourier frequencies), \( \hat{d}_{tr} \) (trimming), \( \hat{d}_{tr}^{ + } \) (trimming additionally with non-Fourier frequencies), \( \hat{d}_{sm} \) (simple smoothing), \( \hat{d}_{sm}^{ + } \) (smoothing additionally with non-Fourier frequencies), \( \hat{d}_{2} \) (2 partitions) and \( \hat{d}_{2}^{ + } \) (2 partitions additionally with non-Fourier frequencies) obtained from 40,000 realizations (length: \( n = 390 \), number of used Fourier frequencies: \( K = 20 \)) of Gaussian ARFIMA(1,d,0) with \( d = - 0.25, - 0.1, 0, 0.1, 0.25 \) and \( \phi_{1} = - 0.25, - 0.1, 0, 0.1, 0.25 \)
Table 2 Sample bias of the estimators \( \hat{d}_{GPH} \) (log periodogram regression), \( \hat{d}_{GPH}^{ + } \) (log periodogram regression additionally with non-Fourier frequencies), \( \hat{d}_{tr} \) (trimming), \( \hat{d}_{tr}^{ + } \) (trimming additionally with non-Fourier frequencies), \( \hat{d}_{sm} \) (simple smoothing), \( \hat{d}_{sm}^{ + } \) (smoothing additionally with non-Fourier frequencies), \( \hat{d}_{2} \) (2 partitions) and \( \hat{d}_{2}^{ + } \) (2 partitions additionally with non-Fourier frequencies) obtained from 40,000 realizations (length: \( n = 3000 \), number of used Fourier frequencies: \( K = 55 \)) of Gaussian ARFIMA(1,d,0) with \( d = - 0.25, - 0.1, 0, 0.1, 0.25 \) and \( \phi_{1} = - 0.25, - 0.1, 0, 0.1, 0.25 \)
Table 3 Sample variance of the estimators \( \hat{d}_{GPH} \) (log periodogram regression), \( \hat{d}_{GPH}^{ + } \) (log periodogram regression additionally with non-Fourier frequencies), \( \hat{d}_{tr} \) (trimming), \( \hat{d}_{tr}^{ + } \) (trimming additionally with non-Fourier frequencies), \( \hat{d}_{sm} \) (simple smoothing), \( \hat{d}_{sm}^{ + } \) (smoothing additionally with non-Fourier frequencies), \( \hat{d}_{2} \) (2 partitions) and \( \hat{d}_{2}^{ + } \) (2 partitions additionally with non-Fourier frequencies) obtained from 40,000 realizations (length: \( n = 390 \), number of used Fourier frequencies: \( K = 20 \)) of Gaussian ARFIMA(1,d,0) with \( d = - 0.25, - 0.1, 0, 0.1, 0.25 \) and \( \phi_{1} = - 0.25, - 0.1, 0, 0.1, 0.25 \)
Table 4 Sample variance of the estimators \( \hat{d}_{GPH} \) (log periodogram regression), \( \hat{d}_{GPH}^{ + } \) (log periodogram regression additionally with non-Fourier frequencies), \( \hat{d}_{tr} \) (trimming), \( \hat{d}_{tr}^{ + } \) (trimming additionally with non-Fourier frequencies), \( \hat{d}_{sm} \) (simple smoothing), \( \hat{d}_{sm}^{ + } \) (smoothing additionally with non-Fourier frequencies), \( \hat{d}_{2} \) (2 partitions) and \( \hat{d}_{2}^{ + } \) (2 partitions additionally with non-Fourier frequencies) obtained from 40,000 realizations (length: \( n = 3000 \), number of used Fourier frequencies: \( K = 55 \)) of Gaussian ARFIMA(1,d,0) with \( d = - 0.25, - 0.1, 0, 0.1, 0.25 \) and \( \phi_{1} = - 0.25, - 0.1, 0, 0.1, 0.25 \)
Table 5 RMSE of the estimators \( \hat{d}_{GPH} \) (log periodogram regression), \( \hat{d}_{GPH}^{ + } \) (log periodogram regression additionally with non-Fourier frequencies), \( \hat{d}_{tr} \) (trimming), \( \hat{d}_{tr}^{ + } \) (trimming additionally with non-Fourier frequencies), \( \hat{d}_{sm} \) (simple smoothing), \( \hat{d}_{sm}^{ + } \) (smoothing additionally with non-Fourier frequencies), \( \hat{d}_{2} \) (2 partitions) and \( \hat{d}_{2}^{ + } \) (2 partitions additionally with non-Fourier frequencies) obtained from 40,000 realizations (length: \( n = 390 \), number of used Fourier frequencies: \( K = 20 \)) of Gaussian ARFIMA(1,d,0) with \( d = - 0.25, - 0.1, 0, 0.1, 0.25 \) and \( \phi_{1} = - 0.25, - 0.1, 0, 0.1, 0.25 \)
Table 6 RMSE of the estimators \( \hat{d}_{GPH} \) (log periodogram regression), \( \hat{d}_{GPH}^{ + } \) (log periodogram regression additionally with non-Fourier frequencies), \( \hat{d}_{tr} \) (trimming), \( \hat{d}_{tr}^{ + } \) (trimming additionally with non-Fourier frequencies), \( \hat{d}_{sm} \) (simple smoothing), \( \hat{d}_{sm}^{ + } \) (smoothing additionally with non-Fourier frequencies), \( \hat{d}_{2} \) (2 partitions) and \( \hat{d}_{2}^{ + } \) (2 partitions additionally with non-Fourier frequencies) obtained from 40,000 realizations (length: \( n = 3000 \), number of used Fourier frequencies: \( K = 55 \)) of Gaussian ARFIMA(1,d,0) with \( d = - 0.25, - 0.1, 0, 0.1, 0.25 \) and \( \phi_{1} = - 0.25, - 0.1, 0, 0.1, 0.25 \)

The positive effect of including the fractional Fourier frequencies is obvious. In each of the 200 cases (25 models, 4 types of estimators, 2 sample sizes), the sample variance of the modified estimator is less than the sample variance of the original estimator (see Tables 3, 4). This improvement comes at the cost of a slightly increased sample bias (see Tables 1, 2). When both the sample bias and the sample variance are taken into account, the effect is still positive. In all 200 cases, the RMSE of the original estimator is larger than that of its modified counterpart. Clearly, the latter estimator will become less competitive when averages of several estimates are considered because the variance term in the RMSE will become less important as the number of individual estimates increases.

5 Empirical results

Greene and Fielitz (1977) used the R/S statistic (see Hurst 1951; Mandelbrot and Wallis 1969; Mandelbrot 1972, 1975), which is defined as the ratio of the range (R) of all partial sums of a time series to its standard deviation (S), in order to verify Mandelbrot’s (1971) suspicion that the strength of the statistical dependence of stock prices may decrease very slowly. Indeed, they found indications of long-range dependence when they investigated daily stock return series. However, Lo (1991) argued that the R/S statistic cannot distinguish between short-range dependence and long-range dependence and proposed therefore a simple generalization of the R/S statistic that is robust to general forms of short-range dependence. Using this modified R/S statistic, he found little evidence in favor of long-range dependence in daily and monthly index returns. Similarly, Barkoulas and Baum (1996) found no convincing evidence of long-range dependence when they applied the log periodogram regression method of Geweke and Porter-Hudak (1983) to daily U.S. stock returns and index returns. Negative results were also obtained for international data by Cheung and Lai (1995) with both modified R/S analysis and log periodogram regression and by Crato (1994) with parametric methods (ARFIMA models). In contrast to signed returns, absolute returns and squared returns actually do appear to exhibit significant long-range dependence (see Crato and de Lima 1994; Lobato and Savin 1998; Grau-Carles 2000).

When high-frequency data are available, realized variances (sums of squared intraday returns) can be used instead of squared daily returns. Alternatively, the unaggregated squared intraday returns can be investigated right away. The latter approach has the advantage that the sample size is many times greater than the number of trading days. A disadvantage is that there are usually gaps between the individual trading sessions, which make it necessary to estimate the memory parameter d separately for each trading session and compute the final estimate by averaging the individual estimates (see Reschenhofer and Mangat 2020). In this case, it is important to use an estimator with a small bias because the variance decreases with the number of trading sessions but the bias remains fixed. Of course, it might also be interesting to adopt the same approach for the signed intraday returns. In Fig. 1, the estimates obtained from the individual trading sessions are plotted cumulatively. By dividing each estimate by the total number of trading sessions, we make sure that the final value corresponds to the mean of all estimates. Each subsample consists of 390 1-min intraday returns of the General Electric Company (GE) stock, which are obtained from the first mid-quotes (midpoints of the best bid and ask quotes) in each minute. The limit order book data from 2007-06-27 to 2019-04-30 have been downloaded from Lobster (https://lobsterdata.com).

Fig. 1
figure 1

Cumulative plots of the estimates obtained by applying \( \hat{d}_{GPH} \) (pink), \( \hat{d}_{GPH}^{ + } \) (green), \( \hat{d}_{tr} \) (gold), \( \hat{d}_{tr}^{ + } \) (red), \( \hat{d}_{sm} \) (magenta), \( \hat{d}_{sm}^{ + } \) (blue), \( \hat{d}_{2} \) (yellowgreen) and \( \hat{d}_{2}^{ + } \) (orange) to the a 1-min intraday log returns \( r_{t} \left( s \right), s = 1, \ldots ,390 \), b transformed absolute 1-min intraday log returns \( \log \left( {\varepsilon + \left| {r_{t} \left( s \right)} \right|} \right) \) with \( \varepsilon = 10^{ - 6} \) of GE stock

Figure 1a, b suggests that the memory parameter is close to zero for the signed returns and somewhere between 0.2 and 0.3 for the transformed absolute returns. The discrepancies between the results obtained with the original estimators and the modified estimators, respectively, are relatively small. In case of the GPH-estimator, the difference between the means is − 0.0034 (see Table 7) and the difference between the biases is − 0.0027 for \( d = 0 \), \( \phi_{1} = 0 \) (see Table 1). The agreement is less good in case of the other estimators. For the transformed absolute returns, the estimates produced by the modified estimators are generally greater than those produced by the original estimators, which is inconsistent with the results of the simulation study. This discrepancy may indicate that the transformation of the absolute returns does not achieve approximate normality. However, Table 7 shows that the inclusion of the fractional Fourier frequencies always leads to a reduction in the variance, which is of crucial importance in any conventional study with only a single sample of observations.

Table 7 Sample mean and sample variance of the daily estimates of the memory parameter d obtained for the signed 1-min returns and the transformed absolute 1-min returns of the GE stock with the estimators \( \hat{d}_{GPH} \) (log periodogram regression), \( \hat{d}_{GPH}^{ + } \) (log periodogram regression additionally with non-Fourier frequencies), \( \hat{d}_{tr} \) (trimming), \( \hat{d}_{tr}^{ + } \) (trimming additionally with non-Fourier frequencies), \( \hat{d}_{sm} \) (simple smoothing), \( \hat{d}_{sm}^{ + } \) (smoothing additionally with non-Fourier frequencies), \( \hat{d}_{2} \) (2 partitions) and \( \hat{d}_{2}^{ + } \) (2 partitions additionally with non-Fourier frequencies)

6 Discussion

A standard approach to improve the performance of frequency-domain methods for the estimation of the memory parameter d is to smooth the periodogram before it is put to use. While this approach usually leads to a reduction of the variance of the estimator, it has the opposite effect on the size of the bias. Alternatively, a reduction in the variance can also be achieved by additionally including fractional Fourier frequencies. Unfortunately, there are two problems. Firstly, the Fast Fourier Transform cannot be used for the calculation of the periodogram at these frequencies. Secondly, the usual orthogonality conditions are only satisfied for the regular Fourier frequencies, which raises the question whether the standard definition of the periodogram is still meaningful in the general case. In this article, we address both problems. We propose to first compute the naïve sine and cosine transforms for the fractional Fourier frequencies with a modified version of the Fast Fourier Transform and then to amend these transforms by taking the violation of the standard orthogonality conditions into account. The computational efficiency of the second stage is due to the fact that a large number of auxiliary quantities only need to be computed once because they do not depend on the data.

We have carried out an extensive simulation study in order to investigate the effect of the inclusion of the fractional Fourier frequencies on the performance of various estimators. The results show that the overall effect in terms of RMSE is always positive which is due to a significant reduction in the variance and only a slight increase in the bias. These findings are corroborated by the results of an empirical study of financial high-frequency data. In this study, the variances of the modified estimators, which use also fractional Fourier frequencies, are always smaller than those of their conventional counterparts. Clearly, little can be said about the size of the bias because the true value of the memory parameter is unknown in practice. However, all competing estimators, regardless which frequencies are used, unanimously agree that significant long-range dependence is present only in the intraday volatility but not in the signed intraday returns.