Introduction

Attention deficit/(hyperactivity) disorder (AD(H)D) is a multi-faceted developmental disorder [1]. A hallmark of this disorder is an increased intra-individual variability (IIV) in behavioral performance, e.g., during reaction time (RT) tasks examining various cognitive functions [2,3,4,5,6,7]. In day-to-day life and in clinical practice, this becomes apparent as frequent and rapid attentional fluctuations which are characteristic of many children and adults with AD(H)D. From a scientific point of view, the phenomenon of IIV describes fluctuations in performance occurring after mastery (i.e., an expected level of performance) has been achieved. Increases within IIV, such as those frequently occurring within AD(H)D, indicate deficits in processing robustness, i.e., a reduced stability of performance across a period of time [8]. Since these aspects seem to be associated with symptom severity [9, 10], correlate with standardized measures of general cognitive skills [11] and may resemble an endophenotype of AD(H)D [2,3,4, 7, 12, 13], they are of considerable clinical importance.

Notably, even though it is the performance profile across time that constitutes IIV (i.e., how performance evolves on a trial-by-trial basis), this profile has not been considered in detail and cannot be captured using commonly applied measures of IIV (e.g., standard deviation of RTs) or by approaches examining different aspects of the overall distribution of RT data [3, 14]. In other words, established measures of IIV cannot depict whether IIV is constantly increased in AD(H)D or whether there are phases of increased IIV, followed by phases of non-increased IIV. In this case, only the phases of increased IIV would then constitute the generally increased IIV across time, which is captured by measures of overall RT distribution. Such a variable RT pattern would have important clinical implications for the treatment of patients with AD(H)D (see discussion section for examples). As exemplified in Fig. 1, fluctuations in performance could show various temporal profiles which implicate various patterns in the underlying information processing.

Fig. 1
figure 1

Simulation of different hypothetical performance profiles. All four simulated groups have the same standard deviation (SD) values. However, SD is not a sufficient indicator of performance fluctuations. Instead, the patterns underlying these differences in variability need to be considered and possible patterns within it need to be analyzed

For example, it is possible that patients show high IIV because performance variations (i.e., fluctuations in RTs across time) are uniformly present throughout (refer Fig. 1, green line). Alternatively, it is possible that patients only show intermittent, yet very strong performance variations [15] (refer Fig. 1, black, red and blue lines). Crucially, these different patterns are not noticeable when using common measures to examine IIV (e.g., the SD of performance) as actually, all simulated examples shown in Fig. 1 have identical SDs (also see: [15]). However, the trial-by-trial performance profile carries important implications for clinical practice, as different patterns may require different behavioral adaptation strategies in daily life. Equally, the identification of any AD(H)D-specific differences in the performance pattern may provide highly relevant insights into the neurobiological foundations of this disorder. Here, it has been shown that deficiencies in the dopaminergic system underlie the emergence of a reduced stability of performance across a period of time/across trials [16]. Regarding neurobiology, especially striatal dopaminergic mechanisms are important for such processes [7, 17,18,19]. The likely reason is that reduced dopaminergic activity increases neuronal noise [20, 21], which leads to less distinct and unstable cortical representations resulting in decreases and variability in cognitive performance [16]. Theoretical [20, 22,23,24] and empirical research [24,25,26,27,28,29] underline the fact that dopaminergic modulation regulates the signal-to-noise ratio (SNR) of neural information processing. Based on the close relation between performance fluctuations and dopaminergic functions, the consideration of the trial-by-trial performance profile in patients with AD(H)D allows a closer insight into the nature of dopaminergic dysfunction in this patient group. If performance differences compared to healthy controls only occur intermittently, this would suggest that the dopaminergic system in AD(H)D fluctuates in its efficacy in reducing neuronal noise. However, this would also suggest that there is no general inability in AD(H)D to suppress neuronal noise and to perform on a stable level in cognitive tasks. Rather, this would suggest that there are transitory (phasic) dysfunctions in performance and therefore also likely in (phasic) dopaminergic neural transmission [15, 23].

Since time estimation processes are based upon mechanisms that are also central for the emergence of fluctuations in behavioral performance, they seem well suited for the examination of the trial-by-trial performance pattern in AD(H)D in comparison to healthy controls [30]. Overall, various aspects of timing including time estimation [31, 32], time discrimination [33], interval (re)production [3, 33], duration judgment and temporal set shifting [11] have previously been linked to ADHD (for overview, please see: [30]). It is especially interesting that time estimation processes have been shown to dissociate between patients with ADHD and patients with attention problems occurring due to the presence of other psychiatric disorders [11]. Conceptually, the time processing difficulties occurring in individuals with ADHD have been explained by pacemaker-counter models [34, 35], which suggest that time perception is driven by an opening/closing “switch” which allows for pulses generated by an “oscillating clock” to be counted [36]. A faster running “clock” would make time intervals seem longer and it has been suggested that this is the case in ADHD [37]. Timing problems in general have been closely linked to impulsivity, with neuroanatomical substrates substantially overlapping [17, 38,39,40]. Importantly, specifically task performance concerning shorter time spans (< 5 s) has been shown to be independent of attentional capacity [3]. Therefore, it is particularly useful to examine time estimation abilities in the second to millisecond time range in ADHD if aiming to obtain information not directly influenced by the level of attentional difficulties.

Since there have been some previous suggestions that time estimation processes [41] differ between the combined (ADHD) and inattentive (ADD) subtypes of AD(H)D [30], we subdivided the patient sample into two groups based on AD(H)D subtype. Here, we hypothesize finding significant group differences in the distribution of IIV across time when comparing either of the two patient groups to healthy controls. It is, however, unclear whether patients with ADD and ADHD will differ from each other.

Interestingly, recommended first-line pharmacological interventions using methylphenidate (MPH) [42,43,44] predominantly enhance tonic dopaminergic activity [45, 46]. Further, MPH may also somewhat suppress phasic aspects of dopaminergic neurotransmission [23, 29, 46,47,48,49]. MPH acts as a dopamine transporter (DAT) blocker [50, 51] which is highly expressed in the nigro-striatal and meso-corticolimbic pathways [52]. Because time estimation processes depend on striatal dopaminergic mechanisms [17,18,19, 30], MPH may be expected to modulate these processes [53]. Yet, as outlined above, it is possible that differences in performance between AD(H)D and healthy controls only occur intermittently. As MPH predominantly affects tonic dopaminergic activity [45,46,47, 54], it is possible that the trial-by-trial performance profile is only modulated to a limited extent by MPH treatment.

Importantly so far, the examination of the trial-by-trial performance has been hampered by insufficient methods to reliably examine how performance evolves on a trial-by-trial basis. To address this, we use a new methodological approach and apply a “trial-frequency of trials (TFT) decomposition” on the behavioral time series data; i.e., the series of all successive RTs. This approach is very similar to the approach taken by the widely used time–frequency analysis of neurophysiological data [55] and makes it possible to identify oscillatory patterns in behavior as it is usually done for EEG data (see “Methods” for details). Using this approach, it is possible to distinguish phases where IIV is increased as compared to healthy controls, and phases where IIV does not differ [15]. In the current study, we examined the performance of children with AD(H)D and healthy controls in a time estimation task, which has previously been shown to distinguish well between children with AD(H)D and healthy controls [41, 56]. Specifically, we compare healthy control children (i) to patients diagnosed with the inattentive subtype (ADD group) and (ii) to those diagnosed with the combined subtype (ADHD group). Here, we hypothesize that both patient groups will present with significantly lower accuracy rates than healthy controls. Further, groups are expected to present with different patterns of performance stability/reaction time variability in the TFT decomposition. Further, (iii) we conduct a comparison between the ADD group and the ADHD group. Here, some previous findings suggest that subgroup differences in time estimation and IIV may indeed occur, whereas others suggest no such differences (see above). Lastly, we examine (iv) the effects of MPH treatment on the performance profile across time and expect any differences between the control and patients groups to be reduced by treatment.

Materials and methods

Study design

In a cross-sectional case–control study, we examined the trial-by-trial performance profiles of reaction times in a large sample of patients with AD(H)D compared to healthy controls. To achieve this, we used a novel analysis of RT data collected in a time estimation task. We compared healthy control children to patients diagnosed with the inattentive subtype (ADD group; Fig. 2a) and to those diagnosed with the combined subtype (ADHD group; Fig. 2b). Further, we conducted a comparison between the ADHD group and the ADD group (Fig. 2c). Lastly, we examined the effects of MPH treatment on the performance profile across time (Fig. 2d). Please not that an a priori sample size estimation was not possible since we did not focus on the use of traditional inferential statistics (incl. effect sizes).

Fig. 2
figure 2

Results from the trial-by-trial analysis of reaction times plotted in 2D color-coded format where the power of RT variation is color coded, and vertical and horizontal axes represent frequency of trial and trials, respectively. RTs (black) are plotted on top or bottom of these 2D color plots. Results of comparison using cluster-based permutation tests are plotted in the center. White boundaries in these plots denote the clusters of trials in which significant group differences were evident. a The trial frequency of trials decomposed data is shown for the ADD (left) and the control group (top). The comparison of these groups is shown in the middle. b The ADHD (right) and the control group (top). The comparison of ADHD and control group is shown in the middle. c The ADD (left) and ADHD group (bottom). The comparison of ADD and ADHD is shown in the middle. d Data from the AD(H)D group prior (right) and post-MPH treatment (bottom). The comparison of pre-MPH and post-MPH treatment is shown in the middle

Participants

Only patients in whom a clinical AD(H)D diagnosis had been determined according to standard clinical guidelines by a team of experienced child and adolescent psychiatrists and psychologists in an outpatient clinic setting in the years 2015–2017 were considered in the study. The diagnostic procedure included family and school interviews, questionnaires, IQ (WISC-IV) and attention testing and the exclusion of possible somatic differential diagnoses via blood analyses, EEG, audiometry and vision testing. Following this extensive diagnostic procedure in the clinical setting, patient families were asked about their interest in taking part in this research study. Due to the recruitment context (outpatient clinic setting), it was not possible to record the number of participants who were not interested or able to take part in the study for various reasons. In this way, 143 patients in total could be recruited to take part in this study. N = 76 of them fulfilled clinical criteria for ADD according to ICD-10 (F98.8, subsequently referred to as the ADD sample), while the remaining n = 67 had been clinically diagnosed with the combined subtype (ADHD; ICD-10 F90.0 or F90.1, subsequently referred to as the ADHD sample). The patients were not affected by any further psychiatric comorbidity (e.g., ICD-10 codes F0-F7, F84 and F92-F95). Patients with additional conduct disorder or oppositional defiant disorder were only included if these symptoms were diagnostically seen as secondary to the ADHD core symptoms. 23 of the patients with ADD and 29 of those with ADHD were medicated with methylphenidate (range: 10–50 mg), but underwent a minimum of 24-h medication washout before study procedures. The minimum 24-h washout period was determined based on parent report and is the standard washout period used for stimulants given their short half-life [57, 58]. Medication status did not differ between the ADD and the ADHD group (χ2(1) = 2.6, p = 0.1). The healthy control sample consisted of n = 45 participants. Healthy controls were recruited from an internal participant database and by advertisements. The presence of AD(H)D was excluded through the use of child and parent questionnaires and an interview concerning ICD-10 diagnostic criteria. For an initial screening, telephone interviews were conducted 1–2 weeks before the appointment. During this interview, parents were interviewed about the diagnostic criteria of ADHD and were asked about the presence of any psychiatric symptoms of confirmed diagnoses of their child. In case of a good suitability for the study (i.e., agreement to participate by child and parent, fulfillment of all inclusion and of none of the exclusion criteria (see below)), questionnaires (ADHD symptom checklist) were sent to the families beforehand to be completed at home. Any items marked by parents as applicable to their child were discussed with the parents at the beginning of the appointment. A number of a priori inclusion and exclusion criteria were defined and applied to all participants. Participants were not included in the study if symptoms (or the presence of confirmed diagnoses) of severe or acute psychiatric disorders were reported during the telephone interview or within the symptom questionnaires (except for AD(H)D in the AD(H)D group). Further, they were excluded from the study if they reached an IQ score below 85 points (assessed by a short form of the WIC-IV, [59]), did not fall within the required age range of 8–15 years or had performed the time estimation task before. Please see Table 1 for demographic information.

Table 1 Demographics

This sample was compared to the ADD and the ADHD groups. Using the AD(H)D Symptom Checklist [60], parents rated their children on a scale of 0 (no problems) to 3 (severe problems) in regards to AD(H)D core symptoms (Table 1). Mean values above 1.5 indicate clinically severe symptoms [60]. Healthy controls had significantly lower scores than the two patient groups on all three subscales (all F > 139.8, all p < 0.001). Patients with ADHD and ADD did not differ significantly with regard to the degree of inattention (p = 0.12). As expected based on the different disorder characteristics, hyperactivity (p < 0.001) and impulsivity (p < 0.001) were significantly more pronounced in patients with ADHD than in those with ADD. Groups did not differ in age, IQ or gender distribution (all F < 1.6, all p > 0.4) (see Table 1).

Further, for n = 29 of the patients diagnosed with AD(H)D (see Table 1 for descriptives), performance on the time estimation task was examined a second time after treatment with MPH had been initiated (all drug-naïve beforehand). Initially, all patients received a low dose of immediate-release MPH and switched to extended-release MPH during the course of treatment. According to clinical guidelines, this dose was increased until (i) a significant and satisfactory symptom reduction was reported by parents or (ii) the target dose of 1 mg/kg body weight had been reached. Final doses ranged from 10 mg to 40 mg extended-release MPH per day (20.95 ± 8.6 mg/day). The initiation of MPH treatment in this group led to significant improvements in the domain of inattention (t(20) = 2.2; p = 0.04) and hyperactivity (t(20) = 2.1; p = 0.05), but not in regard to impulsivity (t(20) = 1.3; p = 0.2). Questionnaire data for either the pre- or the post-MPH time point was unfortunately missing. The study was approved by the local ethics committee. Informed written assent/consent was obtained from all participants/their legal guardians in accordance with the Declaration of Helsinki. The ethics committee of the Technical University of Dresden approved the study. Data can be made available upon reasonable request to the corresponding author.

Task

Healthy controls and patients were asked to estimate a time of 1200 ms following a visual stimulus (white square on black background) [61]. They were asked to press a button whenever they thought that this time had elapsed. Responses given within 200 ms around this target time (i.e., between 1000 and 1400 ms) were accepted as correct. Responses given between 400 and 1000 ms after cue onset were classified as early responses. Responses occurring between 1400 and 2000 ms were classified as late answers. Any key press before 400 ms or after 2000 ms was classified as a missed response and was not included in further analyses. To have trade-off between ecological validity (e.g., such as demands in school environment) on the one hand, and isolating the processes of behavioral IIV on the other hand, visual feedback was given after every key press which requires using feedback learning to optimize performance. In the case of correct trials, a green happy smiley and the word “correct” were presented. In the case of early/late trials (including those later characterized as misses), a red sad smiley and the word “too early”/”too late” was presented. The words “did not react” were displayed if no response had occurred within 3000 ms after stimulus onset. Altogether, participants performed three blocks of 100 trials each. The intertrial interval was randomized between 800 and 2200 ms. Experimental performance was closely monitored online via an observation of the trigger codes occurring in the EEG recording software during the task.

Statistical analysis of task performance

For the statistical analysis of task performance, response accuracy (i.e., the number of trials in which responses occurred within 200 ms around the target time of 1200 ms) and reaction times were compared between the groups. To analyze differences between healthy controls and the two AD(H)D subtype groups, we used a univariate analysis of variance (ANOVA) with the between-subject factor Group (healthy controls, ADD, ADHD). This was followed by Bonferroni-corrected post hoc tests where necessary. A paired samples t test was used to compare accuracy between testings with and without MPH.

Statistical analysis of trial-by-trial performance profile

Gabor (Morlet) wavelets were used to analyze the performance (RT) profile across trials/time:

$$G\left( {t,f_{G} ,\sigma_{G} } \right) = \left( {1/\sqrt {2\pi } \sigma G} \right)\exp \left( {{{ - \Delta t^{2} } \mathord{\left/ {\vphantom {{ - \Delta t^{2} } {\left( {2\sigma_{G}^{2} } \right)}}} \right. \kern-0pt} {\left( {2\sigma_{G}^{2} } \right)}} + i2\pi f_{G} \Delta t} \right),$$

where fG is its frequency, σG its amplitude standard deviation, and Δt the “trial” deviation from the centre of the wavelet. The “size” of the wavelet is defined as its length in trial number and is usually represented in multiples of σG; unless otherwise specified in the text, all wavelet values were σG= 0.5/fG; the wavelet window was 4σG; fG is discretized in 1/T = 1/300 = 0.0033 steps where T is the length of all trials (here 300). The result of this procedure is plotted in 2D color-coded format where the power of RT variation is color-coded and vertical and horizontal axes represent the frequency of trial and trials, respectively (cf. Fig. 2). Because there is no a priori assumption concerning the trials at which the groups differ from each other (or MPH may affect performance), a data-driven strategy needed to be employed and corrected for multiple comparisons. This involves non-parametric cluster-based permutation testing [62] that is accomplished in two steps [63, 64] and requires two reference distributions: In the first step, t tests are calculated independently for each bin in the obtained map to test whether the measures in each group (patients, controls) could have been obtained from the same reference distribution (i.e., are not different from each other). For this, a distribution for each bin in the map is obtained by shuffling the data between the two conditions/groups Nshuffle = 5000 times. All significant time bins are then marked. The significance was set to p < 0.001. In the second step, clusters of contiguous significant bins are created and these data clusters are then compared to a reference distribution of clusters using a Monte Carlo randomization procedure of the original data (i.e., permutation testing). For this, a distribution for the cluster variable is computed that uses the result from the clusters of significant bins in each shuffled map and thus permits to create the reference distribution for clusters’ statistics. The significance level was set to p < 0.05. This procedure is used to compare the trial-by-trial performance profile across time between (i) the ADHD sample and the control sample, (ii) the ADD sample and the control sample, and (iii) the AD(H)D sample prior to MPH medication against this sample after 8 weeks of MPH medication had elapsed. These procedures were implemented in MATLAB using custom code [63] which can be downloaded at http://vision.ustc.edu.cn/packages/TutorialDataSetFunctions_TFanalysis.zip.

Results

Accuracy

Regarding the number of correct responses, the univariate ANOVA revealed a significant main effect of Group (F(2,185) = 21.78, p < 0.001, η 2p = 0.19). Bonferroni-corrected post hoc tests showed that healthy executed significantly more responses in the target time interval than patients with ADD (p < 0.001) and those with ADHD (p < 0.001). Performance did not differ between the two AD(H)D subtypes (p > 0.99). Concerning the effect of MPH on correct responses, the applied paired t test revealed a significant improvement in the number of responses executed in the target time interval (t(28) = −5.2, p < 0.001).

Regarding the number of too early responses, a significant main effect of Group was revealed F(2,185) = 14.88, p < 0.001, η 2p  = 0.14). Bonferroni-corrected post hoc tests showed that healthy controls executed significantly less responses prior to the target time interval than patients with ADD (p < 0.001) and those with ADHD (p < 0.001). Performance did not differ between the two AD(H)D subtypes (p > 0.99). Concerning the effect of MPH on the number of too early responses, paired t tests revealed a significant decrease in the number of too early responses (t(28) = 2.52, p = 0.018). As to the number of too late responses, the univariate ANOVA revealed a significant main effect of Group (F(2,185) = 3.80, p = 0.024, η 2p = 0.39). Bonferroni-corrected post hoc tests showed that healthy controls executed significantly less responses after the target time interval than patients with ADD (p = 0.028). Yet, the number of too late responses did not differ between healthy controls and patients with ADHD (p = 0.075). Performance did not differ between the two AD(H)D subtypes (p > 0.99). Concerning the effect of MPH on the number of too late responses, paired t tests revealed a significant decrease in the number of too late responses (t(28) = 3.14, p = 0.004). The descriptive values can be found in Table 2.

Table 2 Behavioural data

Reaction times

Regarding RTs, the univariate ANOVA revealed no significant main effect of Group (F(2,185) = 2.176, p = 0.116). However, the analysis of the standard deviation of the RTs (RTSD) showed a significant main effect of Group (F(2,185) = 18.55, p < 0.001, η 2p  = 0.17). Further Bonferroni-corrected post hoc tests for RTSD showed that patients with ADD (p < 0.001) and those with ADHD (p < 0.001) had significantly higher RTSD as compared to healthy controls. Yet, no differences in RTSD were found between ADD and ADHD patients (t(141) = −0.043, p = 0.966). Concerning the effect of MPH on accuracy, the applied paired t test revealed no difference in RTs (t(28) = 0.375, p = 0.713), but a significant difference in RTSD (t(28) = −5.2, p < 0.001). Regarding the AD(H)D Symptom Checklist, RTSD correlated with inattention (r = 0.309, p < 0.0001), moderately with hyperactivity (r = 0.192, p = 0.014), but not with impulsivity (r = 0.128, p = 0.105). The descriptive values can be found in Table 2.

However, the trial-to-trial performance across the entire task is of main interest for the current study and shown in Fig. 2.

In the center of Fig. 2, the results of the statistical comparison using cluster-based permutation tests are shown. Figure 2a shows the comparison between ADD patients and healthy controls. The comparison of the trial-by-trial performance (RT) profile across time between the ADD group and the control group revealed eleven clusters (significance level set to p < 0.05, see step 2 in “Methods”) in which trials showed significant differences in RTs between the ADD group and the control group (significance level set to p < 0.001, see step 2 in “Methods”). The comparison of ADHD patients and healthy controls is shown in Fig. 2b. The cluster-based permutation analysis revealed seven clusters (significance level set to p < 0.05, see step 2 in “Methods” section) in which RTs differed between ADHD group and healthy controls (significance level set to p < 0.001, see step 2 in “Methods” section). As can be seen in Fig. 2a, b, differences between healthy controls and AD(H)D subgroups do not occur continuously throughout the entire task. Rather, periods during which no differences in performance between healthy controls and patients were evident alternate with periods in which significant differences between groups were evident. For the ADD subgroup, 11 clusters containing a total of 68.7% of all trials showed differences as compared to the control group. For the ADHD subgroup, seven clusters comprising 40.7% of all trials showed differences as compared to the control group. The results of the cluster-based permutation analysis comparing ADD and ADHD subjects are shown in Fig. 2c. As can be seen, the performance profile comparison between ADD and ADHD group revealed that no clusters and no trials significantly differed between these groups. This means that there are no substantial differences in the pattern of trial-by-trial performance between ADD and ADHD patient groups. Figure 2d shows the comparison of ADD/ADHD patients before (pre-MPH) and after MPH medication (post-MPH). There were only two very small clusters in which trials differed (significance level set to p < 0.05, see step 2 in “Methods” section). Since this cluster only comprised six trials (2% of all trials), this suggests that no substantial differences are induced by MPH administration.

In sum, patients made less correct responses, which was due to a larger number of too early (ADD and ADHD) and too late (ADD) responses when compared with healthy controls. This response pattern is also reflected by a larger response time variability in both ADHD groups, although their mean response times were equal to the control group.

Discussion

In the current study, we examined trial-by-trial performance profiles of reaction times in a large sample of patients with AD(H)D compared to healthy controls. To achieve this, we used a novel analysis of RT data collected in a time estimation task. We compared healthy control children to patients diagnosed with the inattentive subtype (ADD group; Fig. 2a) and to those diagnosed with the combined subtype (ADHD group; Fig. 2b). Further, we conducted a comparison between the ADD group and the ADHD group (Fig. 2c). Lastly, we examined the effects of MPH treatment on the performance profile across time in a subgroup of AD(H)D patients (Fig. 2d).

An analysis of the accuracy to respond within the target time interval revealed significant differences between healthy controls and patients with both AD(H)D subtypes. No differences were found between the inattentive (ADD) and combined subtype (ADHD) of AD(H)D. This pattern has previously been shown in examinations of reaction time variability [65]. Yet, the trial-to-trial performance profile was of main interest here. To aid the discussion, we kindly refer the reader to Fig. 3 which shows a simplified representation of the study results. Note that this figure does not contain the actually measured/decomposed data. The finding that patients with AD(H)D showed lower accuracy rates compared to healthy controls is shown in a general upward shift along the y-axis in Fig. 3a.

Fig. 3
figure 3

Simplified representation of the study results. a Healthy controls (black line) and AD(H)D patients (red line). b AD(H)D patients before (red line) and after treatment with methylphenidate (MPH) had been initiated (green line). Compared to performance without MPH treatment, only the overall level along the y-axis (downward shift) is changed compared to performance with MPH treatment. The amplitude of the fluctuations remains the same after MPH treatment

The analysis of the trial-by-trial performance profile shows that both patient groups differ significantly from the healthy controls (Fig. 2a, b). However, patients with ADD and ADHD did not differ from each other concerning the performance profile across trials in the time estimation task, which is in line with the overall accuracy data (Fig. 2c). This may be due to the fact that IIV and behavioral performance fluctuations are closely linked to inattention [65, 66], which patients of both subtypes have in common. Patients with AD(H)D generally showed lower accuracy rates compared to healthy controls. This is in line with previous findings [17, 31,32,33, 37, 53, 67].

Overall, the results show that differences between patients with AD(H)D and healthy controls did not occur continuously throughout the entire task. Instead, the data show that performance was only different in some of the trials. Importantly, the results show a more fluctuating pattern of performance than it is the case in healthy controls (for an example illustration refer to Fig. 3a). That is, there are some periods during which no differences in performance between healthy controls and patients were evident. Importantly, these periods alternate with periods in which significant reaction time differences occur compared to healthy controls. These findings carry important clinical and scientific implications (see Fig. 4 for overview). From a clinical point of view, the results show that attention in children with AD(H)D may fluctuate rapidly across short spaces of time. Consequently, it may be useful to allow for “microbreaks” (i.e., off-task periods in the range of a few seconds) rather than aiming to achieve full on-task attention for prolonged periods of time. Such an approach has previously been studied in healthy adults, in whom microbreaks (e.g., lasting for 5 s [68] or 40 s [69]) have been shown to significantly improve sustained attention [68, 69], with this even being reflected in its neurophysiological correlates [68]. Further, breaking down longer tasks into microtasks has been shown to lead to increased task accuracy and more resilience to interruptions, albeit at the price of somewhat longer overall completion time [70]. Based on the current findings, such an approach may also prove useful in the case of AD(H)D, as it may match the fluctuating nature of attention very well. In fact, it may be possible that patients with AD(H)D use microbreaks (i.e., brief attentional lapses) as a self-regulatory strategy in order to reach required task demands [11]. To enhance its efficacy, such an approach may be combined with a token system to support the development of appropriate behavioral strategies. Importantly, this may be useful independent of AD(H)D subtype and also in patients already receiving treatment with MPH. Since recent evidence has shown divergent findings on periodicity of attentional fluctuations in patients with AD(H)D which do no not support the claim that that IIV in AD(H)D is the product of spontaneous periodic lapses of attention [71], more well-designed studies are needed to thoroughly examine the usefulness of such an approach in children with AD(H)D.

Fig. 4
figure 4

Clinical and scientific implications derived from the application of the presented novel methodological approach to analyze the behavioral performance profile in patients with AD(H)D

Furthermore, the current findings are crucial when considering the neurobiological basis of AD(H)D. Increased intra-subject variability of reaction times in AD(H)D has been hypothesized to reflect interference from the default mode network (DMN) [72,73,74,75,76,77] and dopaminergic modulation [78,79,80,81,82,83]. Deficiencies in the dopaminergic system are important for the emergence of a reduced stability of performance across a period of time [16]. The possible reason is that reduced dopamine activity increases neuronal noise [16, 20], which leads to less distinct and less stable cortical representations resulting in decreases in cognitive performance and increases in IIV [16]. Several lines of evidence suggest that dopaminergic modulation is a mechanism regulating the signal-to-noise ratio (SNR) of neural information processing [20, 22, 24, 29]. Considering this evidence, the current results suggest that the dopaminergic system in AD(H)D is just partly able to suppress neuronal noise. This seems to occur independent of AD(H)D subtype. It seems that periods during which neuronal noise is successfully suppressed alternate with periods during which this does not occur. Consequently, performance becomes different to that of healthy controls. It therefore seems that there are fluctuations in the efficacy of the dopaminergic system to suppress neuronal noise in patients with AD(H)D.

Regarding the above interpretation that the observed differences between AD(H)D and healthy controls may reflect dysfunctions in dopamine neurotransmission, the observed findings on MPH effects are important. Overall, patients performed more accurately after 8 weeks of MPH administration. Yet, there were no systematic and significant differences in the trial-by-trial performance profile across the duration of the task. Indeed, a previous study also failed to show significant effects of MPH on a variability measure in a sustained attention task [84]. Crucially, MPH mainly affects tonic dopaminergic activity [45, 46]. The obtained data show that only overall accuracy, but not trial-by-trial fluctuations, is affected by that. This result is reflected in the general downward shift along the y-axis together with the constantly high amplitudes after initiating MPH treatment in the hypothetical model shown in Fig. 3b. The finding of increased overall performance accuracy, yet unchanged trial-by-trial fluctuations patterns may seem paradoxical but can be explained as follows (refer Fig. 3b).

MPH acts as a dopamine transporter (DAT) blocker and steadily modulates the reuptake of dopamine [50, 51]. This is why MPH predominantly affects tonic dopaminergic activity [45]. The task used in the current study required participants to estimate a time interval of 1200 ms. Responses were considered as in time when they were executed within 200 ms of this target time (Fig. 3). If the RT curve across trials is only slightly and monotonically shifted downward the y-axis (green line in Fig. 3b), significantly more responses occur within the required time range, leading to a larger number of correct (in-time) responses. Importantly, and as can be seen in Fig. 3b, this is possible without affecting the degree of fluctuation or the RT profile across trials (i.e., the amplitude modulation across trials). It is therefore possible that MPH may affect overall task performance (i.e., the number of in-time reactions) without affecting the degree of trial-by-trial fluctuations. The interpretation of the monotonic shift of the RT profile is in line with neurobiological processes associated with MPH effects [85]. The pattern of results suggests that trial-by-trial fluctuations in performance are largely independent of tonic modulations of dopaminergic activity as induced using the first-line pharmacological treatment in ADHD (i.e., MPH). This interpretation is in line with several findings suggesting that MPH predominantly affects tonic dopaminergic activity [45, 46]. Findings concerning the effects of MPH on phasic dopamine release are mixed [46, 47, 54] and the effects of MPH on phasic release seem to depend on the administered dose [23]. Especially therapeutic doses of MPH as used in the treatment of AD(H)D have been suggested to decrease the ratio between phasic and tonic dopamine [23]. Since MPH medication emphasizes tonic dopaminergic turnover, general performance levels are increased [23]. However, as the current study shows, this is not sufficient, as fluctuations in performance, which also represent a particular challenge in everyday functioning of AD(H)D patients [9, 10], are not improved by MPH. Such improvements in phasic dopaminergic neurotransmission may be achieved through other approaches to AD(H)D treatment. Phasic effects of dopaminergic activity are known to be mediated via (prefrontal) dopamine D1 receptors [86, 87]. Interestingly, atomoxetine, which is currently used as a second-line pharmacological approach to AD(H)D, has been shown to decrease “noise” in neuronal signaling by indirectly increasing dopamine stimulation of D1 and norepinephrine receptors [88, 89]. Since it is the modulation of neuronal noise that is central for the stability/variability of performance [16], it is possible that atomoxetine, which is equally potent as MPH to treat ADHD [90, 91], also affects trial-by-trial variability in ADHD. However, this clinical implication remains to be tested in future studies, since MPH administration was not counterbalanced and all subjects were on medication at the second time of measurement, MPH effects cannot be disentangled from effects of practice. Similarly, it will be important to examine trial-by-trial fluctuations in other cognitive domains and in other disorders associated with deficits in dopaminergic neurotransmission. Also, the analysis of possible corresponding patterns on the (trial-by-trial) neurophysiological level would be an important next step. Moreover, the current findings may be limited, since the task was adapted from Beste et al. [61], who conducted a study in patients with a neurological disorder (Huntington’s Disease) in adults rather than a child and adolescent sample with cognitive alterations related to AD(H)D. Future studies should also address the differential effects of response feedback on the attentional deficit subtypes (ADD vs ADHD).

Further, the generalizability of the results from the experimental to the clinical context remains to be elucidated. In this regard, it will be important for future studies to further tease apart the contributions of time estimation and feedback learning to timing skills in patients with ADHD.

In summary, the study examined IIV with a novel focus on the precise trial-by-trial performance profile. This has not been considered in detail until now. A new method/tool to examine the trial-by-trial behavioral performance profile in patients with AD(H)D in comparison to healthy controls and its relation to methylphenidate (MPH) treatment is presented. Findings show that there are periods during which no differences in performance between healthy controls and patients are present. These periods alternate with periods in which significant reaction time differences are found. Overall, however, the results point to phasic dopaminergic dysfunctions in AD(H)D that need to be more considered in treatment. They also carry possible important clinical implications which remain to be addressed in further studies.