Study design
This was a double-blind, sham-controlled study with a single session of bilateral anodal tDCS chosen as the method of intervention. Patients visited the laboratory on three separate occasions, with tDCS applied only on the first visit. The second visit took place 1 week later with the third visit taking place one month after visit two and will be termed week and month throughout the manuscript (Fig. 1). The primary outcome measure was a change in trait fatigue. Secondary outcome measures included state fatigue, explicit and implicit measures of PE and motor cortex physiology measures of resting motor thresholds (RMT) and slope of recruitment curves (IOSlope) of the affected and un-affected hemisphere assessed using TMS. The primary outcome measure was recorded at three distinct time points (pre stimulation, week and month). All other outcome measures were recorded at four distinct time points (pre stimulation, immediately post stimulation, week and month).
Subjects
The study was approved by London Bromley Research Ethics Committee (16/LO/0714). Stroke survivors were recruited via Clinical Research Network at University College NHS Trust Hospital, a departmental Stroke Database and from the community.
Inclusion criteria: Date of stroke > 3 months, first-time stroke, age ≥ 18 years; Fatigue Severity Scale (FSS-7) ≥ 4. A score of ≥ 4 on FSS-7 indicates the presence of clinically significant fatigue [39]. Exclusion criteria: use of centrally-acting medication which may affect the level of fatigue, depression, and anxiety; depression scores ≥ 11 (Hospital Anxiety and Depression Scale—HADS); grip strength and manual dexterity (nine-hole peg test) ≤ 60% of unaffected hand.
The minimal clinically important difference on the FSS-7 is 0.45, with differences greater than 0.45 predicting a significant effect on the quality of life [40, 41]. To detect the minimal clinically important difference in fatigue with 80% power (0.80) and a significance level alpha of 0.05, a sample size of 11 subjects per group is needed. Twice the number of patients were allocated to the real stimulation group than necessary, as previous studies using tDCS in multiple sclerosis fatigue and healthy individuals showed that approximately 50% of patients respond to tDCS [31, 42]. Thirty-three patients were recruited into the study and were randomly allocated to the real (n = 22) or sham (n = 11) stimulation groups (Fig. 2). All patients gave written informed consent in accordance with the Declaration of Helsinki. Patient demographics for both groups are found in Table 1.
Table 1 Patient demographics and clinical data for the real and sham stimulation groups Questionnaires
Trait and state measures of fatigue were captured during the study. Trait fatigue represents the experience and impact of fatigue on day to day living for a pre-determined time period leading up to the day of testing, whereas state fatigue characterizes fatigue at a given moment in time. Trait fatigue was quantified using the FSS-7, a seven-item questionnaire asking for ratings of fatigue ranging from one to seven (strongly disagree to strongly agree) over the preceding week from the day of administration [43]. An average score of one indicates no fatigue while an average score of seven indicates very severe fatigue. State fatigue was quantified using a visual analogue scale (VAS) ranging from zero to ten (Not at all tired to extremely tired). Patients also completed the HADS, a 14-item questionnaire with a depression and anxiety subscale, prior to the stimulation. A score of 0 to 7 for either subscale could be regarded as being in the normal range, with a score of 11 or higher indicating the probable presence of the mood disorder [44].
Stimulation
tDCS was applied using two battery-driven stimulators (DC-Stimulator Plus, NeuroConn, Germany) while patients were awake and at rest. Four 35 cm2 rubber electrodes coated with conductive paste were secured with self-adhesive bandages. The anode of each stimulator was placed over the left and right M1 (C3 and C4 according to the 10–20 EEG system), while the cathode of each stimulator was placed over the ipsilateral left and right shoulders. This tDCS montage has previously been shown to reduce the perception of effort and increase corticospinal excitability in healthy individuals[38]. Real tDCS involved two 20-min sessions of stimulation at 2 mA separated by a 10-min break in between. The current was ramped up for 30 s until reaching 2 mA and ramped down for 30 s at the end of the stimulation period. Stimulation intensity and duration complied with current safety recommendations [45]. For sham stimulation, the current was ramped down immediately after ramping up, providing effective blinding [46]. The patient and researchers were blind to the applied stimulation (real or sham). At the end of stimulation, patients were explicitly asked whether they thought they received real or sham stimulation.
Perceived effort
PE was measured in an isometric handgrip task with a hand-held dynamometer (Biometrics Ltd, Newport, UK) performed using the dominant hand [21]. Force data from the dynamometer were acquired at 500 Hz via a data acquisition interface (Power1401, CED) and recorded in MATLAB (2016b, MathWorks). Each trial was 5 s long, in which patients were required to sustain a grip force for 3 s at 20%, 40%, or 60% of their maximum voluntary force. Immediate force feedback was shown on the monitor as a filling of a red bar, which turned green once the minimal required target force, indicated by a cross on the screen, was reached. The grip force–visual feedback relationship was individually adjusted for every patient to eliminate potential influence on PE. Before the experiment, patients practiced each force level with their dominant hand to familiarize themselves with the effort required and performed a line familiarization. In the line familiarization, patients were shown 3 “short’ lines (1, 2, and 3 cm), and 3 “long” lines (10, 11, and 12 cm). After the presentation of the 6 lines, patients were shown each of the learned lines without information about the category it belonged to and were asked to judge the line length. Patients responded using the keyboard: left arrow key for “short” and right arrow key for “long”. They were then asked to rate their confidence in their response using a VAS. If patients’ response was < 100% correct, the procedure was repeated until they were able to distinguish between short and long lines.
During the PE task, each grip was followed by a line length estimation. The line presented could have a length of 3.5–8.5 cm with a total of 24 different line lengths, 12 short and 12 long. Twenty-four lines presented under the 3 force conditions resulted in a total of 72 trials divided into 3 blocks. The order of forces and line lengths was randomized with equal numbers of the 3 different force levels in each block. Participants reported if the presented line was short or long based on the length of lines presented during the familiarization phase. If they determined the presented line to be shorter than half the length of the longest line presented during the familiarization (12 cm), they reported short; otherwise, they reported long. These blocks were used as an implicit measure of PE.
After 3 blocks, participants performed a final block of 9 trials. This block was used as an explicit measure of PE. Each trial consisted of a 5-s grip with visual feedback at the 3 different force levels, 20%, 40%, or 60% of maximum voluntary force, with 3 trials for each force level. This was followed by the question, “How effortful was the squeeze?” Patients had to respond using a VAS ranging from “not at all” to “very hard.”
Surface electromyogram and TMS
Electromyogram (EMG) recordings were obtained from the first dorsal interosseous (FDI) muscle using surface electrodes (1041PTS Neonatal Electrode, Kendell) in a belly-tendon montage with the ground positioned over the flexor retinaculum of the hand. The signal was amplified with a gain of 1000 (D360, Digitmer, Welwyn Garden City, UK), bandpass filtered (100–1000 Hz), digitized at 10 kHz (Power1401, CED, Cambridge, UK) and recorded with Signal version 6.04 software (CED, Cambridge, UK).
TMS (figure-of-eight coil with wing diameter, 70 mm; Magstim 2002, Magstim, Whitland, UK) was used to stimulate the hand area of the M1. The coil was held tangentially on the scalp at 45° to the mid-sagittal plane to induce a posterior-anterior current across the central sulcus. The subjects were instructed to stay relaxed with their eyes open and their legs uncrossed. The motor ‘hotspot’ of the FDI muscle was determined as described previously [7]. RMT was defined as the lowest intensity required to evoke a motor evoked potential (MEP) at the hotspot of at least 50 μV in a minimum of 5 of 10 consecutive trials while subjects were at rest. IO curves were acquired at rest at the hotspot using TMS intensities set at 100, 110, 120, 130, 140 and 150% of RMT. Six pulses at each of the 6 intensities were delivered in a randomized order with an inter-trial interval of 4 s, giving thirty-six trials in total. This procedure was repeated for both the affected (RMT-A, IOSlope-A) and unaffected (RMT-U, IOSlope-U) hemispheres.
Analysis of questionnaires
The FSS-7 score was calculated by averaging all items for each of the three-time points.
The total score was taken for the anxiety and depression subscales of HADS, HADS-Anxiety and HADS-Depression respectively, and were considered as independent measures.
TMS analysis
The data were analysed using custom-written routines in Matlab (2018a, Mathworks). Peak-to-peak MEP amplitudes for each condition were estimated from the EMG recordings. All trials were visually inspected and approximately 7% of trials with pre-contraction and size ≤ 0.025 mV were excluded across all participants. A linear fit was applied to all the MEP data across the six conditions (100–150% RMT) for each participant at each session. The quality of the linear fit was evaluated by calculating the r-squared value for each participant across each session. The grand mean r-squared value across all individuals and all session and the resulting standard deviation (R2 = 0.86 ± 0.12) demonstrate an overall good fit of the MEP data with little variability within conditions. The gradient of the linear fit was subsequently calculated for each participant in each of the four sessions and for each hemisphere (affected and un-affected hemisphere) giving us the slope of the recruitment curve.
PE analysis
To obtain a measure of explicit PE, VAS scores were averaged across all trials in each force level in each individual. As there was no difference across force level, the average VAS score across all force levels was used as an explicit measure of PE for each of the four-time points. To obtain a measure of implicit PE, the sum of the number of lines reported as long for each individual in each force level was calculated. As there was no difference across force level, the average number of lines across all force levels was used as an implicit measure of PE for each of the four-time points.
Statistical analysis
All statistical analysis was performed using R (RStudio Version 1.2.5033). Assumptions of a normal distribution of the primary and all secondary outcome variables were assessed using the Shapiro–Wilk test. All data were non-normally distributed (p < 0.05). To test for changes over time, a non-parametric Friedman test was performed for the primary outcome variable (trait fatigue) and all secondary outcome variables (state fatigue, RMT-A, RMT-U, IOSlope-A, IOSlope-U, PE-implicit and PE-explicit), separately for the sham and real intervention groups as in Saiote et al. [31]. When significant results were found, pairwise comparison between baseline and each post-measurement day were performed using Wilcoxon signed-rank test. To analyse the effect of real stimulation versus sham stimulation across both primary and secondary outcome measures, the changes in scores were calculated by normalising each day to baseline (pre stimulation) and then compared within each day using a Wilcoxon signed-rank test. Adjustment for multiple comparisons was performed using Bonferroni correction.
A spearman correlation was used to examine the association between baseline trait fatigue scores and the change in trait fatigue a week after stimulation in both the sham and real stimulation groups. To identify the potential mechanisms that drive the change in trait fatigue in the real stimulation group a multiple linear regression was used with demographic data and secondary outcome variables that were significantly different between the real and sham stimulation groups used as predictors. Collinearity amongst the predictors used in the multiple regression model was assessed by computing the variance inflation factor (VIF). No VIF value exceeded a score of 5, demonstrating that there was no collinearity amongst the predictors used. Goodness of fit was assessed using the BIC (lower BIC indicates a better fitting model) to identify the combination of variables that best predicted the outcome variable, the change in trait fatigue. Assumptions of normality and homoscedasticity of the residuals for each model were assessed visually using quantile–quantile normal plots and fitted- versus residual-value plots.