Introduction

Temporal expectation refers to the process of focusing attention on a specific point in time to optimize behaviour, which is a fundamental survival ability in our daily lives. Humans rely on two sources of information to generate temporal expectations: rhythmic cues, if available, and symbolic cues. Rhythmically defined temporal expectation has been suggested to be a stimulus-driven orientation of attention in time (Breska & Deouell, 2014; Rohenkohl, Coull, & Nobre, 2011). When participants experience a non-random temporal structure, they can spontaneously adapt to the temporal regularity and build a temporal pattern of the fixed interval, which can be used to predict the likely point in time when the next event will occur in order to optimize perceptual or motor performance (Barnes & Jones, 2000; Jones & Boltz, 1989; Large & Jones, 1999). Common findings across studies of temporal expectation driven by rhythmic cues are that, by matching the rhythm’s pace, better detection and discrimination occur when an event appears at the expected interval rather than early or late (Doherty, Rao, Mesulam, & Nobre, 2005; Jones, Moynihan, MacKenzie, & Puente, 2002; McAuley & Jones, 2003; Sanabria, Capizzi, & Correa, 2011).

Conversely, temporal expectation driven by symbolic cues (such as the presentation of a digit or a coloured circle) directs attention through a goal-directed voluntary mechanism (Capizzi, Sanabria, & Correa, 2012). Depending on the mnemonic representations of the learned cue-target interval, the symbolic, informative cue informs participants whether an upcoming target would likely appear after a short or long delay. Research on temporal expectations triggered by symbolic cues has revealed that compared to misleading (invalid) and non-informative (neutral) cues, valid symbolic cue presentation results in faster and more accurate performance (Correa, Lupianez, Milliken, & Tudela, 2004; Coull, Frith, Buchel, & Nobre, 2000; Coull & Nobre, 1998).

Temporal expectations driven by both rhythmic and symbolic cues have been proven effective in directing attention to a particular point in time for improving performance. However, whether the attentional precision of the two forms of temporal expectation is the same remains unexplored. However, Trivino et al. (2011) have directly compared the two forms of temporal expectation in which a rhythmic cue task and a symbolic cue task were administered to a frontal brain lesion group and a control group. In the rhythmic cue task, the predictive temporal information regarding the target onset time was provided by a fast rhythm or a slow rhythm. In the symbolic cue task, the predictive information was provided by a symbolic cue (short line or long line). In their studies, Trivino et al. did not find a significant difference in the speed-up reaction time (RT) between the two tasks in the control group, even though it seemed a difference could be seen in reducing false alarms between the two tasks from their results (as they used a go/no-go task, in which behavioural performance could be improved through both speeding up RTs and reducing false alarms); however, because they did not perform a statistical analysis of the false alarms, the behavioural differences between the two types of temporal expectation require further investigation (Trivino, Arnedo, Lupianez, Chirivella, & Correa, 2011). Moreover, Correa et al. (2014) utilized similar tasks and showed that transcranial magnetic stimulation (TMS) on either left or right frontal sites (vs. sham) increased the temporal orienting effect in the symbolic cue task, whereas frontal TMS did not influence performance in the rhythmic cue task. There appeared to be a greater cueing effect (i.e., the difference between the RT for invalid and valid cues) for the rhythmic cue task than for the symbolic cue task under the sham TMS condition. Nonetheless, it is not possible to draw conclusions because of the lack of a statistical comparison (Correa, Cona, Arbula, Vallesi, & Bisiacchi, 2014). Thus, in the current study, we adapted the two temporal expectation tasks and used a detection task to compare the two forms of temporal expectation directly.

Methods

Participants

A total of 17 right-handed volunteer participants (15 males and two females) with a mean age of 23.3 years (range 21–28) from Okayama University participated in the study. The participants had normal or corrected-to-normal vision and no record of psychiatric disorders. The study was approved by the institutional ethics committee, and all participants gave written informed consent before being enrolled.

Experimental task

We used E-prime software (Schneider et al., 2002) to present stimuli and to record behavioural data, and all stimuli were presented in the centre of a 27-in. monitor with a resolution of 1,280 × 720 pixels over a grey background (RGB = 180, 180, 180). The participants were comfortably seated in a quiet room with their head fixed on a chin rest approximately 60 cm from the centre of the monitor. Two temporal expectation tasks were performed by participants: a rhythmic cue task and a symbolic cue task, which were each performed separately, and the initial task was counter-balanced across participants. Both written and verbal instructions for the rhythmic and symbolic cue tasks (which are included in Fig. 1) were provided to the participants.

Fig. 1
figure 1

Experimental task. Each trial started with the presentation of the fixation point for 500 ms followed by the presentation of a rhythmic or a symbolic cue. (a) The rhythmic cue consisted of four or five regular circle stimuli that appeared for 100 ms and disappeared every 500 or 1,500 ms. (b) The symbolic cue was a digital visual ‘500’ or ‘1,500’ stimulus presented for 100 ms. Following either temporal cue, a black circle target appeared for 100 ms after a variable interval. The participants were instructed to respond as quickly as possible once the target appeared by pressing the left button with their index fingers while avoiding anticipation

In both tasks, each trial started with the presentation of the fixation point for 500 ms, which consisted of a black ‘+’ symbol (0.6°× 0.6° of visual angle), followed by the presentation of a rhythmic or a symbolic cue (Fig. 1). The rhythmic cue consisted of an isochronous sequence of stimuli consisting of four or five successive circles (at a uniform probability that was randomly generated during the trial), which appeared for 100 ms and disappeared every 500 or 1,500 ms; the time interval changed randomly from trial to trial (depending on the interval condition; see Table 1). All stimuli were solid grey circles (diameter = 1.2°; RGB = 100, 100, 100), except the final stimulus, which was a solid white circle (diameter = 1.2°) and served as the signal of an impending target that delivered the onset point of the following, preparatory, foreperiod (i.e., the interval between the cue and the target). This variation in the number of stimulus presentations prevented the white circle from being fully predictable. However, the symbolic cue was a digital presentation of ‘500’ or ‘1,500’ (Courier New 18 point) with a duration of 100 ms that randomly appeared following the preceding fixation point. The digital presentation of ‘500’ indicated that the target would appear after 500 ms, whereas the digital presentation of ‘1,500’ indicated that the target would appear after 1,500 ms. Following both temporal cues, a black circle target (diameter = 1.2°) appeared for 100 ms after a variable interval (described below; see Table 1); then, the screen remained blank until the participant responded or for 1,200 ms. The participants were instructed to respond as quickly as possible after the presentation of the target by pressing the left button with their index fingers while avoiding anticipation. The inter-trial interval was 1,200 ms.

Table 1 Summary of all the cue-target intervals (ms) for the short- and long-interval conditions

Both the rhythmic and symbolic cue tasks included 440 trials, which were equally distributed in 220 short-interval trials and 220 long-interval trials. The probability that the target appeared at the expected interval (i.e., 500 ms or 1,500 ms) was set to 54% (240 trials). In the other 36% of trials (160 trials), the target appeared at an unexpected interval, and these times were distributed across four earlier intervals (at a proportion of 0.4, 0.55, 0.7, or 0.85 of the expected interval) and four later intervals (at a proportion of 1.15, 1.3, 1.45, or 1.6 of the expected interval). See Table 1, for the short-interval conditions, for which the expected interval was 500 ms and the unexpected interval was 200, 275, 350, 425, 575, 650, 725 or 800 ms. For the long-interval conditions, the expected interval was 1,500 ms and the unexpected interval was 600, 825, 1,050, 1,275, 1,725, 1,950, 2,175 or 2,400 ms. In 10% of the total trials (40 trials), the target was not presented (catch trials); this manipulation was included to prevent the target from being confidently expected at longer intervals, thus helping attenuate the effects of the ‘hazard function’, which is defined as the increasing conditional probability over time that the target is going to appear if it has not already occurred (Correa, Lupianez, & Tudela, 2006; Nobrel, Correa, & Coull, 2007). At the beginning of the experiment, the participants were instructed about the task, the catch trials and the final white circle in the rhythmic sequence that marked the beginning of the foreperiod preparatory interval. The participants were also encouraged to use the pace of the rhythmic or the symbolic cue to prepare for upcoming targets but to respond only when the target actually appeared while avoiding anticipation. Prior to each symbolic cue task, all the participants performed a training session with 24 trials. In the training phase, the foreperiod was always expected (500 ms or 1,500 ms in two different blocks) to ensure a stable reference memory, thus allowing the participants to learn the association between the digital cues and the foreperiod to establish a temporal expectation about the point in time of target onset.

Data analysis

The RT in both tasks refers to the time between the onset of the target and the motor response. The practice trials and catch trials were eliminated from the analyses. Anticipated responses (participants responding before the target appeared) and omission errors (participants did not respond after the target appeared) were excluded from the analysis (1.92% of trials rejected). For each participant, trials were also discarded if the RT was larger than three standard deviations from the mean RT, which was separately calculated for each dependent variable (task, interval and temporal expectation) or if the RT was smaller than 50 ms (1.6% rejected). The remaining correct mean RTs were modelled as a function of a number of variables, and mixed-effects linear regression analyses (Baayen, Davidson, & Bates, 2008; Jc & Bates, 2000) were conducted using the lmer function from the lme4 package (Bates et al. 2015) in R (vision 4.0.2). The set of fixed-effect predictors is summarized in Table 2, and their interactions are included in all models.

Table 2 List of fixed-effect predictors

Previous studies (Griffin, Miniussi, & Nobre, 2001; Piras & Coull, 2011) have reported some nonlinear effects of temporal expectation (TE) with a U-shaped temporal expectation curve; thus, a quadratic term (TE2) was also included to model potential nonlinear effects in our data. TE were centred to minimize collinearity. For Task and Interval, simple coding (Rhythmic Cue: -.5; Symbolic Cue: .5; Short-Interval: -.5; Long-Interval: .5) was used.

To examine whether the temporal expectation curve differed by task and interval, the model also included a two-way interaction of temporal expectation (TE + TE2) with task and interval. The interaction between task and interval was also included to capture how the contrast between the two tasks differed by interval. The models also included a three-way interaction of temporal expectation (TE + TE2), task and interval to examine whether and how the interaction of temporal expectation and task varied by interval. The model included by-participant random intercepts as well as all possible random slopes to allow for variability among participants. The correlations between random effects were omitted to facilitate model convergence.

Results

The fixed effects for the statistical models of RT are summarized in Table 3; each fixed effect coefficient is shown with its standard error, test statistic and significance level, which were calculated using the Satterthwaite approximation as implemented in the ImerTest package (Kuznetsova, Brockhoff, & Christensen, 2015).

Table 3 Summary of all the fixed effect coefficients for the models of reaction time: coefficient estimates, standard errors, t-values and significance levels

First, compared with the symbolic cue task, the main effect of task is significant(β = 21.99, p < 0.001)with a faster RT for the rhythmic cue task. A significant main effect of interval (β = 30.32, p < 0.001) is also observed, which shows that the RT for the short-interval condition was faster than that for the long-interval condition. Temporal expectation (TE) (β = -20.82, p < 0.001) and the quadratic term (TE2) (β = 186.39, p < 0.001) are confirmed to be significant and well suited to capturing the effect of temporal expectation, with slower RTs for the earlier and later unexpected intervals and faster RTs for intervals approaching the expected point in time, which are apparent as U-shaped RT curves in Fig. 2.

Fig. 2
figure 2

Actual plots (a) and model prediction plots (b) for the reaction times (RTs) of the rhythmic and symbolic cue tasks for the short- and long-interval conditions as a function of temporal expectation. The 95% confidence intervals are shown in grey shading

Additionally, the interaction between Task × TE2 is significant (β = -160.17, p < 0.001), indicating that the U-shaped RT curve for the rhythmic cue task is steeper than that for the symbolic cue task. The interaction between Task × Interval is also significant (β = -23.31, p < 0.001), showing that the RT for the rhythmic cue task is faster than that for the symbolic cue task during the short-interval conditions; however, during the long-interval condition, the RT for the symbolic cue task is faster. The Interval × TE interaction was also revealed to be significant (β = 32.3, p < 0.001), where the difference of the RT between the short- and long-interval increases over the temporal expectation. The three-way interactions of Task × Interval × TE (β = 11.01, p = 0.285) and Task × Interval × TE2 (β = -40.54, p = 0.08) were not significant.

Discussion

The aim of the present experiments was to investigate the attentional precision of temporal expectations driven by rhythmic versus symbolic cues. By directly comparing rhythmic and symbolic cue tasks, the results showed a temporal expectation effect for both tasks: slower RTs for the earlier and later unexpected intervals and faster RTs for intervals approaching the expected point in time. Moreover, we found a significant interaction between task and the quadratic term of temporal expectation, indicating that the U-shaped RT curve for the rhythmic cue task is steeper than that for the symbolic cue task. This result revealed that compared with symbolic cues, temporal expectation driven by rhythmic cues provides a more precise attentional focus in time.

The responses to targets that appeared at the expected interval were observed to be faster in the rhythmic cue task than in the symbolic cue task under the short-interval conditions. However, no differences were observed under the long-interval conditions (see Fig. 2). A possible conjecture for this phenomenon was that a relatively short, constant foreperiod allows accurate and confident prediction of the target onset time, whereas a longer, constant interval results in predictions that are more variable and less accurate with higher uncertainty, hence resulting in a slower RT (Gibbon, Church, & Meck, 1984). Moreover, a previous study found that when participants were asked to continue isochronous sequences by tapping, the variability in these sequences was minimal around an interval of 600 ms; the variability increased for shorter or longer intervals (Fraisse, 1956). Furthermore, if the next stimulus lies too far beyond 2 s or 3 s, then it becomes impossible to synchronize a regular sequence precisely, and the two stimuli are suggested to no longer to be perceptually linked: the rhythm disappears and the two stimuli appear as independent events (Fraisse, 1982; Mates, Müller, Radil, & Pöppel, 1994; Poppel, 1997). Thus, as the prediction becomes less accurate during long intervals, the difference between the performance at the expected interval in the rhythmic and symbolic cue tasks was lessened.

U-shaped RT curves were found in both the rhythmic and symbolic cue tasks with a faster response to the target appearing around the expected relative to the unexpected intervals. However, under the short-interval conditions, we observed asymmetrical curves in both the rhythmic and symbolic cue tasks. These data indicated that although catch trials were included to prevent the effects of the ‘hazard function’, it was clear that the participants still used the temporal probability information inherent in the elapse of time itself to reorient their attention to the later interval if the target did not appear at the expected interval. Thus, we suggest that catch trials could only modulate the temporal probabilities and could not completely inhibit the hazard function. Consistent with our suggestion, previous studies that included 10% and 20% catch trials have found that the early and late unexpected targets were processed differently (Correa & Nobre, 2008; Jones, Hsu, Granjon, & Waszak, 2017).

Our important result is that the U-shaped RT curves for the rhythmic cue task are steeper than those for the symbolic cue task. However, previous studies have observed U-shaped RT curves in both rhythmic (Jones et al., 2017; Sanabria et al., 2011) and symbolic cue tasks (Griffin et al., 2001). Nonetheless, the curves for the two forms of temporal expectation have rarely been compared directly. Our current study provides novel evidence that the U-shaped RT curves for the rhythmic cue task are steeper than those for the symbolic cue task, revealing that rhythmic cues allowed the participants to allocate more focal attention around the cued expected point in time. One conceivable interpretation of this result may be that temporal expectations driven by symbolic cues are based on a remembered standard code in the reference memory (Church, 1984; John Gibbon, 1977; Treisman, 1963). However, temporal expectation, as driven by the relative timing of onsets of successive symbolic cues, has been related to the self-sustaining entrainment model (Jones & Boltz, 1989; Large & Jones, 1999). Thus, the isochronous sequence afforded a more robust, stable and accurate temporal expectation against a single memorized interval.

Conclusion

The results of this study showed a significant interaction between task and the quadratic term of temporal expectation, indicating that the U-shaped RT curve for the rhythmic cue task is steeper than that for the symbolic cue task. These outcomes revealed that compared with symbolic cues, temporal expectation driven by rhythmic cues provides a more precise attentional focus in time.