Introduction

Analyses of response time series have given insight into cognitive processes and mental organization in a wide variety of tasks such as simple reaction, word naming, choice decision, visual search, memory search, and lexical decision. Contemporary response time analyses have been developed within two paradigms: analyses of the response time distribution and analyses of the trial dependency of response time dynamics. Response time distributions are typically unimodal and positively skewed toward long response latencies. Consequently, distributions like the ex-Gaussian (Burbeck & Luce, 1982), shifted Wald (Schwarz, 2001), lognormal (Ulrich & Miller, 1993), Gamma (Van Zandt & Ratcliff, 1995), Weibull (Logan, 1992), and power law (Holden, Van Orden, & Turvey, 2009; Moscoso del Prado Martin 2009) distributions have been suggested as models to parameterize the response time distribution. However, all models assume that the response times are trial-independent random variables. In contrast, several studies conducted through the last decades have found that response times possess long-range trial dependency (Gilden, 2001; Gilden, Thornton, & Mallon, 1995; Kello, Beltz, Holden, & Van Orden, 2007; Kello et al., 2010; Van Orden, Holden, & Turvey, 2003). The long-range trial dependency of response times is numerically defined by a scaling exponent obtained by monofractal analyses such as detrended fluctuation analysis, scaled window variance analysis, or rescaled range analysis, to mention but a few (cf. Delignières et al., 2006). All these analyses numerically define long-range trial dependency of response time series by a single exponent that is estimated by the scaling of the root-mean square, variance, or range. This scaling exponent is found to decrease with increased difficulty and an increased level of external perturbation of the response time task (cf. Diniz et al., 2011) and has been suggested to reflect the self-organization of cognitive processes (e.g., Van Orden et al., 2003).

The conventional numerical definition of a single scaling exponent assumes that response times are Gaussian distributed such that the response time variation is described by the second-order statistical moment (i.e., variance) alone. However, response time distributions are found to be unimodal and right-skewed toward long response latencies across a range of cognitive tasks (Luce, 1986). The presence of a non-Gaussian response time distribution has two fundamental implications for the analyses of long-range trial dependency of the response time series. First, the presence of a non-Gaussian response time distribution indicates that the response time variation cannot be exclusively described by the scaling of variance alone but that the scaling of higher order statistical moments such as skewness and kurtosis has to be considered. Thus, the single scaling exponent defined by conventional monofractal analyses will define only the fraction of the trial dependency related to the variance of the response time series. Second, the presence of a non-Gaussian response time distribution indicates intermittent changes in the magnitude of response time variation. These intermittent changes might be due to modulation of the task difficulty, sensory feedback, or changes in participants’ intention to act according to the task instruction. Furthermore, intermittent changes in variability have been shown to be a central feature for phase transition in a bimanual coordination task (Kelso, 1995) and other complex systems (Schaffer et al., 2009). Intermittent changes in the response time variation will provide temporal modulation of both the width and shape of the response time distribution and, consequently, temporal modulation of the scaling exponent.

Ihlen and Vereijken (2010) introduced multifractal analyses that are able to numerically define the scaling exponents of higher order statistical moments and the temporal modulation of a local scaling exponent. They suggested that the multifractal structure of response time series reflected phase transitions in the mental organization driven by multiplicative interactions across the temporal scales of the response time dynamics. Later, multifractal analyses of response times revealed temporal changes in the trial-dependent structure of the response series by modulation in task difficulty and performance feedback (Kuznetsov & Wallot, 2011) and the presence of strong anticipation (Stephen & Dixon, 2011). The presence of multifractal variation is also a central feature of healthy regulation of physiological processes like heart rate dynamics (Ivanov et al., 1999), and the absence of multifractal variation is related to several pathological conditions (Wang, Huang, Xie, Wang, & Hu, 2007). However, none of the studies within behavioral research have compared the performance of the multifractal analyses a priori with their employment for behavioral variables.

The aim of the present study is to compare the performance of seven multifractal analyses and, consequently, to provide a guideline for the choice of multifractal analyses of response time series and other behavioral variables. The article describes and compares multifractal analyses and assumes that the reader is familiar with basic statistical concepts of fractal analyses. Readers unfamiliar with the basic concepts of fractal analyses are referred to Ihlen and Vereijken (2010) and Ihlen (2012) for an introduction to the topic.

Method

Multifractal analyses of a response time series are all conducted by a step-wise procedure illustrated in Fig. 1. First, the response time series is decomposed into a scale-dependent measure μ s,t (see the top panel in Fig. 1). Second, the multifractal spectrum is computed either directly from μ s,t (right panels along B arrows in Fig. 1) or indirectly by the q-order statistics of μ s,t (left panels along A arrows in Fig. 1). The scale-dependent measure μ s,t and q-order statistics differentiate between the different types of multifractal analyses and, consequently, between their potential pros and cons when applied to response time series. The seven multifractal analyses compared in the present study are introduced in the next three subsections according to their scale-dependent measure μ s,t and their q-order statistics.

Fig. 1
figure 1

A flow chart of the estimation of the multifractal spectrum D h directly from the scale-dependent measure \( {\mu_{{s,{t_0}}}} \) (B arrows) and indirectly by the q-order statistics of μ s,v (A arrows). The basis for all multifractal analyses within both formalisms is the scale-dependent measure (upper contour plot) that decomposes the response time series or behavioral variable into both the time and scale domains. The red contours indicate large scale-dependent measures of the response time series that coincide with the time periods of intermittent large variations. In contrast, the blue contours indicate small scale-dependent measures that coincide with the time periods of intermittent small variations. The panel below the top arrow A indicates that the scale-dependent measure is summarized by its q-order statistical moment (see Eq. 5). The statistical moments with positive qs amplifies the large μ s,v (i.e., red contours), whereas the statistical moments negative qs amplifies the small μ s,v (i.e., blue contours). The scaling exponent ζ q numerical defines the power law relation of the intermittent periods with large (i.e., positive qs) and small variations (i.e., negative qs). The panel below the bottom arrow A illustrates a multifractal spectrum D h estimated by the Legendre transformation of ζ q (i.e., Eq. 6). The multifractal spectrum D h is the plot of D q versus h q where the mode \( \bar{h}={h_{q=0 }} \) is defined at the maxima D q=0 = 1. The panel below the top arrow B illustrates the direct estimation of the local singularity exponent \( {h_{{{t_0}}}} \) as the local slope of \( \log \left( {{\mu_{{s,{t_0}}}}} \right) \) versus log(s) for each time instant t 0. The panel below the bottom arrow B illustrates the multifractal spectrum D h defined by Eq. 9 in the main text

Scale-dependent measure

Response time series are decomposed into both time and scale domains by (1) computing a scale-dependent measure \( {\mu_{{s,{t_0}}}} \) in a floating trial interval [t 0s/2, t 0 + s/2] centered at trial t 0 with sample size s (i.e., scale) or by (2) computing a scale-dependent measure μ s,v in the nonoverlapping trial interval [(v − 1)s + 1, vs] with interval number v and sample size s. The analyses that estimate the multifractal spectrum directly from μ s,t use method 1, whereas the analyses based on q-order statistics of μ s,t use method 2. The scale-dependent measure of the seven multifractal analyses compared in the present study is defined below.

Multifractal detrended fluctuation analysis

Multifractal detrended fluctuation analysis (MFDFA; Kantelhardt et al., 2002) defines the scale-dependent measure μ s,v as a root-mean square fluctuation of the integrated profile \( {y_t}=\sum\limits_{i=1}^t {\left( {{x_i}-\bar{x}} \right)} \) of the response time series x i around a scale-dependent trend \( {{\mathop{y}\limits^{\frown } }_{{s,t}}} \) within a nonoverlapping time interval [(v − 1)s + 1, vs]:

(1)

.

The present study compares the performance of MFDFA with linear (MFDFAlin), moving average (MFDFAma), and empirical mode decomposition (MFDFAemd) detrending procedures (see technical details in Appendix 1).

Entropy analysis

Entropy analysis (Chhabra & Jensen, 1989) defines the scale-dependent measure as a probability measure of the total magnitude of the response time fluctuation (i.e., sum of x t ) within the nonoverlapping time interval [(v − 1)s + 1, vs]:

$$ {\mu_{s,v }}=\frac{{\sum\limits_{{t=\left( {v-1} \right)s+1}}^{vs } {{x_t}} }}{{\sum\limits_{t=1}^N {{x_t}} }} $$
(2)

The definition of μ s,v in entropy analysis contains no detrending procedure and, therefore, assumes that the response time series is a stationary process.

Leader wavelet transformation

Leader wavelet transformation (LWT; Wendt, 2008) defines the scale-dependent measure μ s,v by a recursive high-pass and low-pass filtering procedure of the response time series x t illustrated in Fig. 2a. The scale-dependent measure μ s=2,v on the smallest scale s = 2 is the convolution product ⨂ of response time series x t and the high-pass filter \( \psi_t^{hi } \) subsampled by 2. The subsampled convolution product ⨂ of the response time series x t and low-pass filter \( \psi_t^{lo } \) are used in another convolution and subsample procedure to define the statistical measure μ s,v on the coarser scales s = 4. The detrending of the response time series in LWT is dependent on the choice of \( \psi_t^{hi } \) and \( \psi_t^{lo } \) where the present study uses a third-order Daubechies filter (Daubechies, 1992). The subsampling by 2 leads to dyadic intervals λ s,v that double the sample sizes s = 2 j for the nonoverlapping intervals (width of the boxes in Fig. 2b) for each new level j, but halves the frequency band f = 1/2 j (height of the boxes in Fig. 2b). The local maximum of μ s,v (i.e., wavelet leader illustrated in Fig. 2b) is used as a scale-dependent measure to prevent a degeneration of the multifractal spectrum by μ s,v → 0.

Fig. 2
figure 2

Schematic representation of the leader wavelet transformation. a The discrete wavelet transformation computes the statistical measure μ s,v by recursive high-pass and low-pass filtering procedure of the response time series x t . b Twenty-eight samples of a time series (upper panel) and the dyadic intervals μ s,v of the statistical measure μ s,v (lower panel) defined by the recursive filtering procedure. The wavelet leaders is defined as the local maxima of three subsequent intervals λ s, v on scale s and for all s < s′ (gray colored boxes)

Gradient modulus wavelet projection

Gradient modulus wavelet projection (GMWP; Turiel, Perez-Vicente, & Grazzini, 2006) defines a scale-dependent measure by a continuous wavelet transformation:

$$ {\mu_{{s,{t_0}}}}=\sum\limits_{{t={t_0}-{s \left/ {2} \right.}}}^{{{t_0}+{s \left/ {2} \right.}}} {\left| {{x_t}} \right|{\psi_{{s,{t_0}-t}}}} $$
(3)

The continuous wavelet transformation is the convolution of the response time series and a waveform \( {\psi_{{s,{t_0}-t}}} \) scaled to the floating time interval [t 0s/2, t 0 + s/2] (Mallat, 1999). The detrending of the response series is dependent on the shape of the waveform. The present study used a Laplacian wavelet \( {\psi_{{s,{t_0}-t}}}={{\left( {1+\left( {{{{{t^2}}} \left/ {s} \right.}} \right)} \right)}^{-2 }} \) that performs well on the smallest scales (Turiel & Perez-Vicente, 2003).

Local detrended fluctuation analyses

Local detrended fluctuation analyses (DFAloc; Ihlen, 2012) defines the scale-dependent measure \( {\mu_{{s,{t_0}}}} \) as a root-mean square fluctuation of the integrated response time series \( {y_t}=\sum\limits_{i=1}^t {\left( {{x_i}-\bar{x}} \right)} \) around a polynomial trend of order m within a floating trial interval [t 0s/2, t 0 + s/2]:

(4)

The present study used a second-order polynomial detrending (i.e., m = 2) of the response time series.

Indirect estimation from q-order statistics of μ s,v

The q-order statistical moments \( E\left( {\mu_{s,v}^q} \right) \) are defined for the scale-dependent measure of MFDFA and LWT by the following equation (Parisi & Frisch, 1985):

$$ E\left( {\mu_{s,v}^q} \right)=\frac{1}{{{N_s}}}\sum\limits_{v=1}^{{{N_s}}} {\mu_{s,v}^q} \propto {s^{{{\zeta_q}}}} $$
(5)

where N s is the number of nonoverlapping intervals at each scale s. \( E\left( {\mu_{s,v}^q} \right) \) satisfies the power law on the right-hand side when the response time series has a multifractal structure. The q-order scaling exponent ζ q is estimated as the linear regression slope of \( E\left( {\mu_{s,v}^q} \right) \) versus s in log-log coordinates (see middle left panel in Fig. 1). Next, the q-order singularity exponent h q and singularity dimension D q are defined by the following Legendre transformation of ζ q (Parisi & Frisch, 1985):

$$ \begin{array}{*{20}c} {{h_q}=\frac{{d{\zeta_q}}}{dq }} \\ {{D_q}=1+q{h_q}-{\zeta_q}} \\ \end{array} $$
(6)

Finally, the multifractal spectrum D h is defined by plotting h q versus D q (see lower left panel in Fig. 1).

In entropy analysis, the scale-dependent measure of Eq. 2 is used to estimate h q and D q directly from the following q-order entropies \( {F_h}\left( {\mu_{s,v}^q} \right)\mathrm{and}\;{F_D}\left( {\mu_{s,v}^q} \right) \), respectively, without the Legendre transformation of Eq. 6 (Chhabra & Jensen, 1989):

$$ \begin{array}{*{20}c} {{F_h}\left( {\mu_{s,v}^q} \right)=\sum\limits_{v=1}^{{{N_s}}} {P_{s,v}^q\log \left( {{\mu_{s,v }}} \right)\propto {s^{{{h_q}}}}} } \\ {{F_D}\left( {\mu_{s,v}^q} \right)=\sum\limits_{v=1}^{{{N_s}}} {P_{s,v}^q\log \left( {P_{s,v}^q} \right)\propto {s^{{{D_q}}}}} } \\ \end{array} $$
(7a)

where

$$ P_{s,v}^q=\frac{{\mu_{s,v}^q}}{{\sum\limits_{v=1}^{{{N_s}}} {\mu_{s,v}^q} }} $$
(7b)

Similar to the estimation of ζ q in MFDFA and LWT, h q and D q are estimated as the linear regression slopes of the q-order entropies \( {F_h}\left( {\mu_{s,v}^q} \right)\mathrm{and}\;{F_D}\left( {\mu_{s,v}^q} \right) \), respectively, and scale s in log-log coordinates.

Direct estimation from \( {\mu_{{s,{t_0}}}} \)

GMWP and DFAloc estimate the multifractal spectrum directly from the scale-dependent measure of Eqs. 3 and 4, respectively. The temporal variation in the trial dependency of a response time series can be defined by the local singularity exponents \( {h_{{{t_0}}}} \) when scale-dependent measure \( {\mu_{{s,{t_0}}}} \) satisfies the following power law (Canus, Lévy Véhel, & Tricot, 1998, Turiel, Yahia, & Pérez-Vicente, 2008):

$$ {\mu_{{s,{t_0}}}}\propto \mathop{\lim}\limits_{{s\to 0}}\ {s^{{{h_{{{t_0}}}}}}} $$
(8)

The local scaling exponent h t is estimated as the linear regression slope of log-log plot of the scale-dependent measure \( {\mu_{{s,{t_0}}}} \) versus scale s (see the middle right panel in Fig. 1), but the present study uses a method suggested by Struzik (2000) that provides more numerically stable results (see technical details in Appendix 2). The multifractal spectrum D h is defined by the normalized distribution P h of \( {h_{{{t_0}}}} \) in log-coordinates by the following equation (Turiel et al., 2008):

$$ {D_h}=\mathop{\lim}\limits_{{\varepsilon \to 0}}\left( {1-\frac{{\log \left( {{{{{P_h}}} \left/ {{P_h^{\max }}} \right.}} \right)}}{{\log \left( \varepsilon \right)}}} \right) $$
(9)

where ε is the bin size of the histogram used to define P h and \( P_h^{\max } \) is the maximum probability at the mode \( \bar{h} \) of \( {h_{{{t_0}}}} \) (see the lower right panel in Fig. 1). The most common trial dependency in the response series is the mode \( \bar{h} \) and is assigned D h = 1 by Eq. 9.

In summary, the present article compares the performance of the seven multifractal analyses represented in Table 1 and defined by the choice of scale-dependent measure and q-order statistics. Table 1 also summarizes the parameter settings used to compare the performance of the multifractal analyses. All multifractal analyses in Table 1 were conducted in MATLAB and are available at www.ntnu.edu/inm/geri/software.

Table 1 Summary of definitions and parameter settings for the seven multifractal analyses compared in the present study

Multiplicative cascading noise

The seven multifractal analyses represented in Table 1 were tested on the stationary increments ∆B H (A(t)) = B H (A(t +1)) – B H (A(t)) of a multifractal random walk B H (A(t)) (Bacry, Muzy, & Delour, 2001; Ihlen & Vereijken, 2010). The main feature of ∆B H (A(t)) is intermittent periods with variation of large and small magnitudes (see red and blue intervals, respectively, in Fig. 3). The increasing width of the multifractal spectrum D h of ∆B H (A(t)) reflects increasing differences in the magnitudes of intermittent small and large variations (compare the variations within the red and blue intervals for light to dark blue traces in Fig. 3). The central tendency \( \bar{h} \) reflects the long-range dependency H of the fractional Brownian motion B H (t). The present performance comparison utilized an algorithm developed by Chainais, Riedi, and Abry (2005) to generate 100 realizations of ∆B H (A(t)) of 1,024 and 4,096 samples with H = 0.5, 0.6, 0.7, 0.8, and 0.9 and with a multifractal spectrum width h maxh min = 0, 0.1, 0.2, 0.3, 0.4, 0.5, and 0.6. A symmetric shape of the multifractal spectrum was provided by a lognormal cascade A(t) constructed by random multipliers with a lognormal distribution.

Fig. 3
figure 3

Intermittent variations in the multifractal process ∆B H (A(t)) with H = 0.9 for the seven multifractal spectrum widths h maxh min used in the present comparative study of multifractal analyses (traces from light to dark blue in the left panel with corresponding colored spectrum in the right panel). h maxh min are directly related to the width of the distribution of interaction multipliers used to generate the cascade A(t) within ∆B H (A(t)). The blue interval and arrow indicate the intermittent period of small variation with local scaling exponents in the right tail of D h close to h max. The red interval and arrow indicate intermittent period of large variation with local exponent in the left tail of D h close to h max. The difference between the magnitude of small and large variations within ∆B H (A(t)) increases with its multifractal spectrum width h maxh min indicated by an increase in the large variation within the red intervals and a decrease in variation within the blue intervals

Response time series are influenced by noise and nonstationary trends caused by differences in task difficulty, external perturbations, or internal mental strategies. Thus, Gaussian-distributed random noise with a signal-to-noise ratio of either 0.33 or 1 was added to ∆B H (A(t)) to test for the robustness of the seven multifractal analyses shown in Table 1 to additive noise (Delignières et al., 2006). Furthermore, a complex multiharmonic trend T(t) given by the following equation was added to ∆B H (A(t)) to test the robustness of the multifractal analyses to complex additive trends:

$$ T(t)=C\sin \left( {0.1250\pi t} \right)+C\sin \left( {0.0313\pi t} \right)+C\sin \left( {0.0078\pi t} \right)+C\sin \left( {0.0020\pi t} \right) $$
(10)

where t is the sample number of ∆B H (A(t)). The amplitudes C of oscillations were 0.33 and 1 times the root-mean square variation of ∆B H (A(t)) (i.e., signal-to-trend ratios of 0.33 and 1). Figure 4 illustrates ∆B H (A(t)) with H = 0.9 (see top panel) with additive white noise (see red trace in left panels) and multiharmonic trend (see red trace in right panels). The white noise and multiharmonic trend generate minor distortions of the intermittent variation in ∆B H (A(t)) for the signal-to-noise/trend ratio of 0.33 (see middle panels in Fig. 4), whereas major distortion of the intermittent variation in ∆B H (A(t)) is generated for the signal-to-noise/trend ratio of 1 (see bottom panels in Fig. 4).

Fig. 4
figure 4

Multiplicative cascading noise ∆B H (A(t)) with H = 0.9 (top panel) with additive white noise and multiharmonic trend (red traces). Right panels show ∆B H (A(t)) with white noise with a moderate signal-to-noise ratio of 0.33 (middle panel) and a high signal-to-noise ratio of 1 (bottom panel). Left panels show ∆B H (A(t)) with multiharmonic trends with a moderate signal-to-trend ratio of 0.33 (middle panel) and a high signal-to-trend ratio of 1 (bottom panel)

Performance comparison of the multifractal analyses

The performance of the multifractal analyses in Table 1 was evaluated by two error measures. The following error measure defines the deviations in the width and shape of the estimated multifractal spectrum \( D_h^e \) and the predefined D h of ∆B H (A(t)) (Turiel et al., 2006):

$$ {E_D}=\frac{{{{{\left\langle {\left| {{D_h}-D_h^e} \right|} \right\rangle}}_h}}}{{{h_{\max }}-{h_{\min }}}} $$
(11)

where \( {{\left\langle {\left| {\ldots } \right|} \right\rangle}_h} \) defines the mean absolute difference between D h and \( D_h^e \) and is taken over all values h of D h and normalized to the width h maxh min of D h . \( D_h^e \) was set to zero for the h of the model where the estimated \( D_h^e \) has no defined value (Turiel et al., 2006). Furthermore, the estimated central tendency \( {{\bar{h}}^e} \) and the central tendency of the model \( \bar{h} \) were aligned (i.e., \( {{\bar{h}}^e}=\bar{h} \)) to exclude the influence of errors in estimated \( {{\bar{h}}^e} \) on the error measure in Eq. 11. The root-mean square error of the estimated central tendency \( {{\bar{h}}^e} \) was defined by the following equation:

$$ {E_{\bar{h}}}=\sqrt{{\left\langle {{{{\left( {\overline{h}-{{\overline{h}}^e}} \right)}}^2}} \right\rangle }} $$
(12)

where \( \bar{h} \) is the central tendency of model D h and \( {{\bar{h}}^e} \) is the estimated central tendency by multifractal analyses in Table 1. The mean \( \left\langle {\ldots } \right\rangle \) was computed for the entire ensemble of 100 model series of ∆B H (A(t)) for each parameter setting. The following color convention was used to improve the readability of the tabulated results in the Results section below (Turiel et al., 2006):

figure a

A paired sample t-test was employed to identify a significantly different E D . If one or two analyses had a significantly smaller or larger E D (p < .05), as compared with the rest of the analyses, then an asterisk * and a crucifix †, respectively, was marked behind value. Significant differences for two analyses were also considered because the performance of MFDFAlin and MFDFAma together with DFAloc and GMWP were expected to be more related than the other analyses. The error measure E D was also visually evaluated with the mean and standard deviation of D h from the ensemble of 100 series of ∆B H (A(t)) where D h was aligned at their central tendency.

Results

Comparison of the estimated D h

Figure 5 shows the mean D h and ±1 standard deviation (see bars) estimated by the seven multifractal analyses in Table 1 for the ensemble of ∆B H (A(t)) with H = 0.5 and h maxh min = 0 (see red bars) and h maxh min = 0.6 (see blue bars). The best performance of analyses based on q-order statistics (i.e., MFDFA, LWT, and entropy analysis) was provided by MFDFAlin and MFDFAma when the multifractal analyses were employed to ∆B H (A(t)) with H = 0.5 (see Table 2). However, the LWT and entropy analyses performed more consistently across H and were superior to the MFDFAs for H = 0.9, as indicated by error measures in Table 3. The LWT and entropy analyses were also superior to the GMWP and DFAloc that showed similar inconsistency across H as the MFDFAs. Furthermore, LWT had more consistent errors (i.e., systematic underestimation of D h ), as compared with the other analyses, based on the q-order statistics indicated in Fig. 5 by the smaller standard deviation for the left tail of the multifractal spectrum D h.

Fig. 5
figure 5

Mean D h and ± 1 SD (bars) estimated by the seven multifractal analyses in Table 1 for the ensemble of ∆B H (A(t)) with 4,096 samples and h maxh min = 0 (red bars) and h maxh min = 0.6 (blue bars)

Table 2 Mean error E D for six different multifractal spectrum widths h maxh min for ∆B H (A(t)) with H = 0.5 and with no noise or trends present
Table 3 Mean error E D for five different H of ∆B H (A(t)) with multifractal spectrum width h maxh min = 0.6 and simple sizes 1,024 (upper panel) and 4,096 (lower panel)

Comparison of the estimated central tendency \( \bar{h} \) of D h

All multifractal analyses estimated the central tendency \( \bar{h} \) of the multifractal spectrum with high precision. LWT had the largest estimation error \( {E_{\bar{h}}} \), whereas the GMWP had the smallest \( {E_{\bar{h}}} \) for both sample sizes, as indicated in Table 4. However, GMWP had larger variation in \( \bar{h} \), as compared with DFAloc. The estimation of central tendency was consistent across H of ∆B H (A(t)) but was influenced by the presence of white noise and multiharmonic trends.

Table 4 Error \( {E_{\bar{h}}} \) of the position of the central tendency for ∆B H (A(t)) with five different H and multifractal spectrum width h maxh min = 0.6 and sample sizes 1,024 (upper part) and 4,096 (lower part)

Influence of sample size

The error measure E D decreased in magnitude and was more consistent with an increased sample size (i.e., N = 4,096) for all multifractal analyses. In addition, all analyses performed more consistently across H of ∆B H (A(t)), and the differences between the performance of multifractal analyses decreased. The largest decrease in magnitude of E D was observed for MFDFAemd, where the performance became comparable to those for MFDFAlin and MFDFAma (compare upper and lower parts of Table 2). Nevertheless, MFDFAemd had less consistent error E D, as compared with the other analyses, indicated by large standard deviations in the tails of D h in Figs. 5 and 6. The largest increase in consistency of E D with sample size was observed in DFAloc that had a twofold lower standard deviation of E D , as compared with the analyses based on the q-order statistics (i.e., MFDFA, LWT, and entropy analysis).

Fig. 6
figure 6

Mean D h and ±1 SD (bars) estimated by the seven multifractal analyses in Table 1 for the ensemble of ∆B H (A(t)) with white noise. The estimated D h represented for no additive noise (blue bars), a signal-to-noise ratio of 0.33 (green bars), and a signal-to-noise ratio of 1 (red bars). The parameter settings for ∆B H (A(t)) are h maxh min = 0.6 and H = 0.9, and the sample size is 4,096

Influence of additive noise

Figure 6 shows the mean D h and ±1 SD (see bars) estimated by the seven multifractal analyses in Table 1 for the ensemble of ∆B H (A(t)) with additive white noise. The blue, green, and red markers in Fig. 6 are the estimated D h for ∆B H (A(t)) with signal-to-noise ratios of 0, 0.33, and 1, respectively. Comparison of blue, green, and red markers in Fig. 6 shows that the increasing magnitude of additive white noise truncates the estimated multifractal spectrum D h and translates its central tendency toward \( \bar{h}=0.5 \). This result was especially apparent for the MFDFA, LWT, and entropy analyses.

All multifractal analyses were quite robust to additive white noise in the case of the moderate signal-to-noise ratio of 0.33, as is shown in Table 5. The robustness to noise appeared visually as the preservation of the left side (i.e., positive qs) of the multifractal spectra for the MFDFA, LWT, and entropy analyses (compare blue and green D h in Fig. 6). However, the same analyses yield the above-mentioned changes for both the left and right sides of D h for large noise levels (i.e., signal-to-noise ratio of 1), as is shown by the increased error measure E D in the upper part of Table 6 and the truncated red multifractal spectra in Fig. 6. GMWP and DFAloc had a more noise-resistant performance, as compared with the analyses based on the q-order statistics, for both the small N = 1,024 and large N = 4,096 sample sizes, especially for large multifractal spectrum widths h maxh min = 0.5 and 0.6. However, the red traces in Fig. 6 for GMWP and DFAloc are comparable to the red monofractal traces in Fig. 5 and indicate that the superior performance of GMWP and DFAloc is related to their overestimation of the monofractal spectrum width.

Table 5 Mean error E D for six different multifractal spectrum widths h maxh min for ∆B H (A(t)) with H = 0.5 and with signal-to-noise ratio = 0.33, but with no trends present
Table 6 Mean error E D for seven different multifractal spectrum widths h maxh min for ∆B H (A(t)) with H = 0.5 and with signal-to-noise ratio = 1 (upper part) and signal-to-trend ratio = 1 (lower part)

The estimation of the central tendency \( \bar{h} \) of D h was quite robust to the moderate signal-to-noise ratio of 0.33, but the MFDFA, LWT, and entropy analyses showed less consistency across the different H of ∆B H (A(t)), as compared with GMWP and DFAloc, for the large signal-to-noise ratio of 1 (see red traces in Fig. 6 and comparison of MFDFAemd and GMWP in the upper part of Table 7). GMWP was the most robust analysis to the presence of additive white noise when estimating the central tendency \( \bar{h} \) of D h .

Table 7 Comparison of the error \( {E_{\bar{h}}} \) of the position of the central tendency of MFDFAemd and GMWP for ∆B H (A(t)) with sample size 4,096 for five different Hs and two different signal-to-noise ratios (upper part) and two different signal-to-trend ratios (lower part)

Influence of additive multiharmonic trends

Figure 7 shows the mean D h and ±1 SD (see bars) estimated by the seven multifractal analyses in Table 1 for the ensemble of ∆B H (A(t)) with additive multiharmonic trends. The blue, green, and red markers in Fig. 6 are the estimated D h for ∆B H (A(t)) with signal-to-trend ratios of 0, 0.33, and 1, respectively. Comparison of blue, green, and red markers in Fig. 7 indicates that the increasing magnitude of multiharmonic trends truncates the estimated multifractal spectrum D h and translates its central tendency towards \( \bar{h}=1 \).

Fig. 7
figure 7

Mean D h and ±1 SD (bars) estimated by the seven multifractal analyses in Table 1 for the ensemble of ∆B H (A(t)) with multiharmonic trends. The estimated D h represented for no multiharmonic trend (blue bars), a signal-to-trend ratio of 0.33 (green bars), and a signal-to-trend ratio of 1 (red bars). The parameter settings for ∆B H (A(t)) are h maxh min = 0.6 and H = 0.9, and the sample size is 4,096

Most analyses were quite robust to the moderate signal-to-trend ratio of 0.33, as is indicated by less truncation in the green spectra, as compared with the red spectra, in Fig. 7. MFDFAlin and MFDFAma had the best performance when estimating the multifractal spectra with narrow spectrum widths (see stars in Table 8), but the performance declined for the large spectrum width by a truncation of the right side of D h (compare blue and green spectra in Fig. 7). GMWP and DFAloc had the best performance when estimating multifractal spectra with wider spectrum widths (see stars in Table 8 and the lack of truncation in the green spectra in Fig. 7). However, entropy analysis had the most consistent performance across the range of multifractal spectrum widths, especially for the larger sample size of 4,096. The performance declined for the MFDFA, LWT, and entropy analyses for multiharmonic trends with the larger signal-to-trend ratio of 1. In this case, the MFDFA, LWT, and entropy analyses underestimated the multifractal spectrum width, especially for the larger spectra indicated by a larger estimation error for h maxh min = 0.5 and 0.6 in the lower part of Table 6. This increased estimation error was caused by a truncation of the right side (i.e., negative qs) of the multifractal spectra (see red spectra in Fig. 7). MFDFAemd had an improved performance, as compared with the other analyses based on q-order statistics, for H = 0.5 (see red trace in Fig. 7), but this result was not consistent across other Hs. In contrast, GMWP and DFAloc were quite robust to the presence of large multiharmonic trends across Hs because of a narrow scaling range close to the limit s → 0 (see the scaling range in Table 1).

Table 8 Mean error E D for six different multifractal spectrum widths h maxh min for ∆B H (A(t)) with H = 0.5 and with signal-to-trend ratio = 0.33, but with no noise present

The central tendencies of the multifractal spectra were highly sensitive to the presence of multiharmonic trends with the low and high signal-to-trend ratios. The central tendency \( \bar{h} \) for all H of ∆B H (A(t)) was estimated close to \( \bar{h}=1 \), as indicated by large estimation errors \( {E_{\bar{h}}} \) in the lower part of Table 7 for H = 0.5 and 0.6 (see also the top of the red spectra in Fig. 7). Thus, multifractal analyses were not able to detect differences in the central tendency \( \bar{h} \) when strong multiharmonic trends were present. Furthermore, the detrending procedures of both MFDFAs and wavelet-based analyses were not able remove the influence of the multiharmonic trends. As an example, MFDFAemd, which contains the most adaptive detrending procedure, performed less precisely and less consistently than MFDFA with other detrending procedures when multiharmonic trends were present for the small sample size of N = 1,024 (see crucifix in the upper part of Table 8 and lower part of Table 7).

Discussion

The purpose of the present study was to compare the performance of seven multifractal analyses to provide a guide for the choice of multifractal analyses of response time series and other behavioral variables. In the discussion, the present comparative study of multifractal analyses is compared with the results in previous comparative studies. In addition, the implications of the sensitivity of the multifractal analyses to additive noise and multiharmonic trends are discussed in the last subsection.

MFDFAlin versus MFDFAma and MFDFAemd

The performance of MFDFA was found in previous studies to be dependent on the detrending procedures. MFDFAma and MFDFAemd have been shown to outperform the original MFDFAlin when employed to a simple binominal cascade (Gu & Zhou, 2010; Manimaran, Panigrahi, & Parikh, 2009; Qian, Zhou, & Gu, 2009). In the present study, MFDFAma and MFDFAemd had similar or larger estimation errors, as compared with the original MFDFAlin. The deviation of the performance of the detrending procedures in the present study, as compared with previous studies, is probably caused by smaller sample sizes and a more advanced multiplicative cascading model used in the present study.

The poor performance of the detrending procedures of the MFDFA and wavelet-based analyses indicates that the detrending procedure has to be exactly matched to the shape of a complex multiharmonic trend in order for these procedures to remove the influence of trends on the estimated D h . The exact tuning of detrending procedures will be extremely difficult in multifractal analyses of response time series and other behavioral variables where a priori theoretical argumentations for a particular shape of a complex trend do not exist. Thus, the comparison of the performances of the original MFDFAlin versus MFDFAma and MFDFAemd suggests that the original MFDFAlin should be used unless theoretical argumentation exists for the exact shape of the trend.

LWT versus MFDFA

Serrano and Figliola (2009) concluded that LWT gave an improved estimation of D h , as compared with MFDFA, when employed to binominal and random wavelet cascades. In contrast, X. Y. Huang et al. (2011) found that MFDFA performed better than LWT considering only positive qs when employed on canonical Mandelbrot cascades. The present study confirms the findings by X. Y. Huang et al. that MFDFA had better performance than LWT for ∆B H (A(t)) with H = 0.5. However, LWT had a more consistent error E D than the MFDFA and entropy analyses across the range of Hs of ∆B H (A(t)) and had significantly better performance for H = 0.8 and 0.9. Furthermore, LWT seems to have a more consistent performance, as compared with both the MFDFA and entropy analyses, when estimating the left side (i.e., positive qs) of the multifractal spectrum D h . Thus, LWT is a good choice of multifractal analyses of response time series, especially for long sample sizes of 4,096 and when positive qs are considered.

Entropy analysis versus MFDFA and LWT

Entropy analysis has recently been suggested as a superior estimation procedure of D h of response time series, as compared with the multifractal analyses based on the Legendre transformation in Eq. 6 (Stephen & Dixon, 2011). This suggestion is based on the original argumentation by Chhabra and Jensen (1989) in their development of entropy analysis. However, Chhabra and Jensen compared entropy analysis with a basic histogram analysis based on Eqs. 8 and 9. The present study shows only minor differences between the entropy analysis of Chhabra and Jensen and MFDFA and LWT. Furthermore, the probability measure used in entropy analysis (i.e., Eq. 2) can be constructed by dividing other choices of scale-dependent measure μ s,v by the total sum of \( {\mu_{{s,{t_0}}}} \). Substituting the original scale-dependent measure in entropy analysis (i.e., Eq. 7a) with these probability measures will yield the exact same D h as the Legendre transformation of ζ q in Eq. 6. Thus, the performance of multifractal analyses with similar μ s,v is insensitive to the choice of estimating D h directly by entropy functions or estimating D h indirectly by the Legendre transformation of ζ q .

GMWP and DFAloc versus MFDFA, LWT, and entropy analysis

The benchmark test performed by Turiel et al. (2008) concluded superior performance by GMWP, especially for multifractal signals with large spectrum width h maxh min and small sample sizes (i.e., N = 1,024). The present study also found that both GMWP and DFAloc performed better than the multifractal analyses based on q-order statistics (i.e., MFDFA, LWT, and entropy analysis) for the larger spectrum width. However, GMWP and DFAloc both had a larger overestimation of the spectrum width of monofractal signals, as compared with MFDFA, LWT, and entropy analysis. A shortcoming of the benchmark test conducted by Turiel et al. (2008) was the inclusion of other multifractal analyses based on q-order statistics, which have inferior performance, as compared with analyses included in the present study (Jaffard, Lashermes, & Abry, 2006; Oświęcimka, Kwapien, & Drozdz, 2006; Serrano & Figliola, 2009). Another shortcoming of the benchmark test conducted by Turiel et al. (2008) was the use of random cascades that are not appropriate candidates to mimic the intermittent variation in behavioral variables by their scale-discrete nature and strictly positive values. Furthermore, Pont, Turiel, and Perez-Vicente (2006) have shown that the GMWP overestimates the spectrum widths for fractional Gaussian noise (i.e., ∆B H (A(t)) with h maxh min = 0) confirmed in the present result. However, the same authors showed that the performance of both GMWP and DFAloc can be improved by modifications of the scale-dependent measure \( {\mu_{{s,{t_0}}}} \) when employed to monofractal ∆B H (A(t)). In summary, the present study is less conclusive regarding the performance of both GMWP and DFAloc, as compared with the benchmark test conducted by Turiel et al. (2008). The obtained D h from GMWP and DFAloc of response time series should be compared with D h of monofractal surrogate series to prevent false positive presence of multifractality.

Sensitivity to additive noise and multiharmonic trends

The presence of both trends and noise yields a false negative result for the presence of multifractal response time variation, a false positive result for the presence of long-range trial dependency when trends dominate, and a false negative result for long-range trial dependency when noise dominates. The sensitivity of multifractal analyses to noise and trends has two important implications for the employment of multifractal analyses to response time series and the interpretation of the estimated D h .

First, a narrow D h reflects the lack of a multiplicative interaction or phase coupling between temporal scales of response time series when no noise or trends are present. Ihlen and Vereijken (2010) reported multifractal spectra with smaller widths and central tendencies closer to 1 for the response series of the interval estimation task, as compared with other cognitive tasks. Furthermore, Kuznetsov and Wallot (2011) reported that the local singularity exponent h t of a response series in an interval estimation task had less variation (i.e., smaller spectrum width) and an average value (i.e., central tendency) closer to 0.5 in the presence of performance feedback. The sensitivity of multifractal analyses to noise and trends suggests that both these results can be created by task- and feedback-dependent modulation of additive noise and trends. Thus, noise and trends will disguise the task and feedback modulation of multifractal spectra caused by modulation of the intrinsic multiplicative interactions between temporal scales. However, both the Ihlen and Vereijken (2010) and Kuznetsov and Wallot studies found significant differences between the multifractal spectrum of the response time series and multifractal spectra of synthesis surrogate series that eliminate multiplicative interactions between the temporal scales within the response time series. Furthermore, Kuznetsov and Wallot found no correlation between the multifractal spectrum width and the central tendency of the spectrum that would have been expected in the presence of a high level of noise or strong trends. In summary, surrogate tests and correlation analyses of multifractal spectrum widths and the central tendencies should be conducted together with the multifractal analyses to control for the presence of additive noise and trends.

Second, the presence of noise and trends leads to a degeneration of D h and deviation from linearity in the log-scaling plot of the q-order statistics for the multifractal analyses based on the q-order statistics. The degeneration of D h appeared in the right end of D h and was caused by a nonmonotonic decreasing h q for negative qs (see red trace estimated by LWT in Fig. 7). The deviation from linearity in the log-scaling plot was also most apparent for negative qs and suggests that noise and trends with large magnitudes will influence the scale invariance of response time series, especially for the intermittent periods with response time variation of small magnitudes. Consequently, the employment of multifractal analyses based on q-order statistics should interpret the right end of the estimated D h with caution when noise and trends are expected.

Conclusion

Table 9 summarizes the pros and cons of the seven multifractal analyses compared in the present study. All multifractal analyses were able to differentiate between monofractal and multifractal time series, but GMWP and DFAloc showed a larger overestimation of the multifractal spectrum size for the monofractal time series. However, GMWP and DFAloc were more robust to multiharmonic trends of large magnitudes, and GMWP had the most precise estimation of the central tendency of the multifractal spectrum. MFDFAlin and MFDFAma had a superior performance for H = 0.5, as compared with both LWT and entropy analysis. However, the LWT and entropy analyses had more consistent performances across H and had superior performances, as compared with MFDFA, for H = 0.9. The detrending procedures of MFDFA and the wavelet-based analyses were unable to remove the influence of a multiharmonic trend on the estimated multifractal spectra. The present study concludes that the estimated multifractal spectrum from two or more multifractal analyses should be compared with response time series and other behavioral data when these analyses are employed. In addition, the estimated multifractal spectra should be compared with the estimated spectra from monofractal surrogate series to reveal a false positive presence of multifractality.

Table 9 Summary of all pros and cons of the seven multifractal analyses in Table 1