Introduction

Vagus nerve stimulation (VNS) with implantable devices is currently used to treat drug-resistant epilepsy and treatment-resistant depression. However the cost and complications associated with implantation have limited the application of invasive VNS to a broader range of health conditions1,2,3,4 and spurred growing interest in non-invasive approaches, such as transcutaneous auricular vagus nerve stimulation (taVNS)4,5,6,7. In taVNS, an electrode on the surface of the ear delivers a small electrical current across the skin to the auricular branch of the vagus nerve. Stimulation of the auricular branch in turn activates other areas of the brain through connected pathways8,9, including regions involved in the regulation of the autonomic nervous system (ANS). The theory behind the therapeutic efficacy of taVNS is that stimulating the vagus nerve can correct maladaptive processing within the nervous system as well as neuro-immune communications that are associated with negative health outcomes. Building upon prior work that suggests a link between ANS dysfunction and inflammation10, a recent open-label study of subjects with rheumatoid arthritis demonstrated clinically relevant changes in both disease biomarkers and symptoms after 12 weeks of sub-threshold taVNS treatment11. The therapeutic benefits of taVNS have also been reported in the context of depression5,10,12 and migraine13, with mounting evidence linking these clinical effects to neural responses4,6,14.

While the mechanisms through which taVNS causes changes are not completely understood, a number of recent studies have used blood oxygen-level dependent (BOLD) functional MRI (fMRI) to characterize the acute brain response to taVNS1,4,6,14,15,16. While BOLD fMRI is an effective approach for measuring evoked brain responses, its susceptibility to low-frequency noise sources makes it less effective for measuring changes that occur with time periods greater than several minutes17. In contrast, measures of cerebral blood flow (CBF), a well-defined physiological quantity, can be used to characterize changes in brain physiology occurring over periods ranging from seconds to years18, and therefore constitute a promising metric for the assessment of the sustained effects of VNS.

In prior work, both SPECT and PET have been used to characterize the CBF response to VNS applied using implantable devices in patients with either major depression (MD) or epilepsy. In MD patients, SPECT has been used to demonstrate regionally dependent increases and decreases in CBF due to 4 to 10 weeks of chronic VNS19,20. In a PET study with MD patients, Conway et al.21,22 reported regionally dependent increases and decreases in the evoked CBF response to short duration (90 s) VNS. For epilepsy patients, Henry et al.23,24 reported PET-based CBF increases and decreases in the evoked CBF reponse to 30 s of VNS, with the effects diminishing after 12 weeks of chronic VNS. In addition, two earlier pilot studies25,26 reported VNS evoked changes in PET CBF measures.

To the best of our knowledge, no studies to date have examined the CBF response to taVNS. In this study, we used arterial spin labeling (ASL) MRI to measure changes in CBF induced by 15 min of taVNS in healthy subjects, employing a high-frequency 20 kHz stimulus similar to that employed by Marsal et al.11. In contrast to SPECT and PET, ASL MRI does not require the injection of radioactive tracers, making it especially suitable for repeated measures in a healthy population27,28. Furthermore, in this study the taVNS is applied at a sub-threshold level, such that subjects cannot readily determine if they are receiving the treatment or a placebo. In contrast to prior studies where the perceptible sensations associated with VNS may have complicated the interpretation of the results, the sub-threshold stimulus used in this study removes somatosensory confounds and is therefore compatible with a single-blind placebo-controlled design. The goal of this study was to determine whether a single course of taVNS with duration similar to that of the daily dose employed in11 could lead to detectable changes in CBF.

Material and methods

Overview

We used a single-blind placebo-controlled experimental design. Twenty seven healthy subjects were enrolled in this study after providing informed consent (14 females and 13 males, aged 19–40 years). The study was approved by the UCSD Institutional Review Board (IRB), and informed consent was obtained from all participants. All experiments were performed in accordance with relevant guidelines and regulations. Figure 1 shows the overall experiment structure. Each subject participated in a control session and a treatment session, with the order randomized across subjects. The sessions were separated by at least one day to avoid taVNS carryover effects.

Figure 1
figure 1

Diagram of experimental design. Each subject undergoes a control session and a treatment session. Each session consists of pre scans, a stimulation (sham for control session) section, and post scans.

All scanning visits consisted of a pre scan section, a stimulation (or sham) section, and a post scan section. For the first 6 subjects, the subjects wore the stimulation earpieces throughout the entire session, and the stimulation section took place while they remained lying inside the scanner. To improve the subject experience, the protocol was slightly modified for the subsequent 21 subjects. These 21 subjects wore regular earplugs during the MRI scans, came out of the scanner for the stimulation section, and then returned to the scanner with regular earplugs to finish the post scan section.

MRI scans

Scans were acquired on a GE Discovery MR750 3.0 T system. Pre and post scan sections consisted of (1) a high-resolution anatomical scan (MPRAGE, resolution = 1mm3, 208 slices, FOV = 25.6 cm, TR = 2500 ms, TE = 2.92 ms, FA = 8 deg), (2) resting-state BOLD fMRI scans, and (3) arterial spin labeling (ASL) scans (2D pseudo-continuous ASL (PCASL), single-shot spiral, 100 repetitions, resolution = 3.75 mm × 3.75 mm, slice thickness = 6 mm, matrix size = 64 × 64 × 24, FOV = 24 cm, TE = 3.2 ms, TR = 4300 ms, labeling duration = 1800 ms and post-labeling delay = 1800 ms)28. Only the anatomical and ASL scans are analyzed here. The resting-state BOLD scans will be analyzed in a separate paper. During all ASL scans, subjects rested with their eyes open and gently focused on a fixation point.

taVNS

taVNS was delivered using custom-fit ear pieces designed by Nēsos® (Redwood City, CA). A bipolar configuration was used and two circular (radius 0.3 cm) hydrogel electrodes were placed on the ear piece surface interfacing with the cymba concha, with a contact area of 0.28 cm2 for each electrode.

Alcohol wipes were used to clean the cymba concha prior to inserting the ear pieces. Stimulation was applied to the left ear, targeting vagal afferents in the cymba concha based on known vagus nerve anatomy8,29. For all visits, the stimulation (or sham) section took place between the pre and post scan sections.

On both scan days, a sensation test was performed to determine a subject’s threshold for perceiving the stimulation, which entailed stimulating at 1.0 mA and increasing by increments of 0.5 mA (up to a maximum of 5.0 mA). Criteria for perceiving the stimulation include sensations of stinging, pricking, or warmth. This procedure was repeated until the subject reported feeling the stimulation at the same level three times. The sensation test was followed by a 15-min stimulation period. The stimulation was delivered continuously at 20 kHz, using biphasic square waves with pulse width of 20us. The stimulation amplitude was set at 75% of a subject’s perceptual threshold (1.5–3.8 mA; impedance 3–9.5 kΩ at 20 kHz) on the "active" stimulation day and was set to zero on the control day (“sham” condition). The order of “active” and “sham” stimulation days was randomized across subjects. To verify that the subthreshold stimulation did not cause any local heating, we used a DTT-1372 Thermometer (Ames InstrumentsTM, Harbor Freight Tools USA. Inc., Calabasas, CA) to measure temperature changes over a 15-min period in both the active and sham conditions for one of the subjects. The probe was placed on the medial surface of the external ear, directly across from the earpiece that was positioned at the cymba concha on the lateral surface of the external ear. Minimal temperature increases were observed for active (0.20 °C) and sham (0.25 °C) conditions with no significant difference (p = 0.46) in the slope of the changes over the 15-min period.

Since the stimulation was delivered at 75% of perceptual threshold, subjects were blinded to whether they were undergoing the sham or stimulus treatment. At the end of their second session, all subjects were asked whether they could identify their control and treatment sessions across their two visits.

Data processing and analysis

Data from the ASL scan were volume registered and quantified into physiological CBF units (mL/100 g/min) via local tissue correction with a proton-density scan acquired during the same scan section30. The SCORE algorithm31 was used on a per-slice basis to censor outlier timepoints in the CBF time series before computing the average CBF map. Using AFNI software32, the high-resolution anatomical scans were transformed into MNI space, and the CBF volumes were then warped to MNI space using the transformation matrices from the anatomical transformation, with a final voxel resolution of 4mm3.

For group analysis in MNI space, Post minus Pre CBF difference maps (termed “Post–Pre” for the remainder of the manuscript) were computed for control and treatment sessions. A two-sided paired t-test was performed for the main contrast (Post–Pre)treatment − (Post–Pre)control on a per-voxel basis using AFNI 3dttest++. This contrast is equivalent to the interaction effect for a two-way (Post, Pre by Treatment, Control) repeated measures ANOVA. After setting a primary voxelwise threshold, spatial clustering thresholds were used to control the family-wise error rate (FWER) associated with detecting clusters of supra-threshold t-values across the 27,168 voxels in the MNI brain mask33. For a specified voxelwise threshold, a permutation-based method (as implemented in AFNI 3dttest++ with the -Clustsim option) was used to estimate the false positive rates associated with detecting clusters with varying cluster-size thresholds. Additional details and background on this approach are provided in Cox et al. 201733. A voxelwise threshold of p < 0.01 and a cluster size threshold of 22 voxels (clusters defined with voxel faces touching) were used to assess results at a FWE-corrected p < 0.05 level.

Post-hoc two-sided paired t-tests were also performed for each session (control and treatment), and were assessed at a voxelwise threshold p < 0.01 and a FWE-corrected p < 0.05 level (using cluster size thresholds of 35 and 51 voxels for the control and treatment contrasts, respectively). In addition, post-hoc analyses were performed using the mean CBF values of any significant clusters found from the main contrast of (Post–Pre)treatment − (Post–Pre)control.

Finally, subjects that were randomized to a session order of control followed by treatment were compared to subjects randomized to treatment then control. A post-hoc two-sided two-sample t-test was performed between these two groups using the mean CBF values of any significant clusters found from the main contrast.

Results

Participants and stimulation

Out of twenty-seven total subjects, two dropped out after their first session, due in part to the COVID-19 related suspension of research activities during the course of the study. The remaining twenty-five subjects completed both the control and treatment sessions. Five subjects were excluded from the study due to technical errors or severe image artifacts in at least one of their CBF scans, leaving twenty total subjects included in the analysis. These twenty subjects received a mean stimulation amplitude of 3.24 ± 0.56 mA (min = 1.5 mA, max = 3.8 mA). The control and treatment sessions were matched to take place at the same time each day whenever possible, with an average time-of-day difference of 0.9 ± 1.32 h (min = 0 h, max = 4 h).

When asked whether they could identify the control and treatment sessions across their two visits, sixteen of the twenty subjects responded that they could not guess. Of the remaining four subjects who indicated they could identify their sessions, three guessed correctly and one guessed incorrectly.

Overall, an average of 24.81 ± 6.93 min elapsed from the end of the stimulation to the start of the post section ASL scan for these subjects.

Group-level whole-brain analysis

For the main contrast shown in Fig. 2a, application of clustering analysis at voxelwise p < 0.01 and FWE-corrected p < 0.05 revealed one significant cluster spanning both hemispheres of the posterior cerebellum (71 voxels, peak MNI coordinate [−6, 78, −36]). Figure 2b,c show session-level contrasts that support the main cluster result. Figure 2b shows no significant CBF changes in the control session, while Fig. 2c shows a left hemispheric cerebellar cluster (69 voxels) and a right hemispheric cerebellar cluster (74 voxels) with significant CBF decreases in the treatment session.

Figure 2
figure 2

Maps showing clusters of voxels with significant effects, FWE-corrected at p < 0.05 with voxelwise threshold p < 0.01. Contrast-dependent cluster size thresholds based on permutation testing are indicated below. Surviving voxels are color-coded by their t-statistic value and overlaid on anatomical images, with the same slices shown for all three contrasts. No significant clusters were observed in any of the other slices. (a) For the main (Post–Pre)treatment − (Post–Pre)control contrast , one cluster with 71 voxels survived with a cluster size threshold of 22 voxels. (b) For the Postcontrol – Precontrol contrast, no clusters survived with a cluster size threshold of 35 voxels. (c) For the Posttreatment – Pretreatment contrast, two clusters survived with a cluster size threshold of 51 voxels.

We used the probabilistic atlas of the cerebellar lobules as defined in34,35 and the AFNI whereami function to determine the overlap of the significant clusters with the cerebellar lobules. For the main contrast, the single cluster showed 96.3% overlap with the cerebellar lobules, with 47.2% of the cluster overlapping with left and right Crus I, 44.8% with left and right Crus II, 3.5% with Vermis Crus II, and 0.8% with left and right lobules VIIb. For the treatment contrast, the left hemispheric cluster had 94% overlap with the cerebellar lobules, with 75.5% overlap with left Crus I, 15.8% with left Crus II, and 2.6% with left lobule VI. The right hemispheric cluster had 89.7% overall overlap consisting of 45.4% overlap with right Crus I, 28.5% with right Crus II, 8% with right lobule VI, 3.3% with right dentate, 2.7% with Vermis Crus II, 1.8% with left Crus II, and 0.1% with right VIIb.

Post-hoc analyses of the mean CBF values from the main contrast cluster are shown in Fig. 3. The cluster shows a significant decrease in CBF related to treatment (p < 0.001, −15.90%) alongside a statistically insignificant increase in the control session (p = 0.19, + 4.02%). Consistent with the maps shown in Fig. 2, the post-hoc analyses indicate that the contrast of interest mainly reflects significant treatment-related decreases in cerebellar CBF.

Figure 3
figure 3

Post-hoc analysis with mean per-subject CBF values from the cluster shown in Fig. 2a. The Pre and Post values from the control and treatment sessions are shown in the leftmost and center pairs, respectively. The (Post–Pre) CBF differences from the control and treatments sessions are shown in the rightmost pair. There was a significant treatment-related decrease in CBF (p < 0.001, −15.90%) alongside a weak, statistically insignificant increase in the control session (p = 0.19, + 4.02%), resulting in a significant main effect (p < 0.001).

Post-hoc analysis of session-ordering effects in the main cluster is shown in Fig. 4. There was not a significant difference in the main contrast values between the two session orders (p = 0.54), indicating that the main contrast effect is not driven by either randomization order.

Figure 4
figure 4

The main effect for any significant differences in effect between the subjects randomized to A: Control then Treatment vs B: Treatment then Control. The plot below shows the main contrast values of (Post–Pre)treatment − (Post–Pre)control within the main contrast cluster reported in Fig. 3. On the left are subjects randomized to group A; on the right are subjects randomized to group B. A two-sided two-sample t-test was performed between the two groups, indicating no significant difference in main effect (p = 0.54) between the two session orders.

Discussion

Our findings reveal a significant bilateral decrease in cerebellar CBF as a result of 15 min of sub-threshold taVNS, reflecting physiological changes that persisted for many minutes after the treatment. These effects were observed using a stimulus paradigm and duration similar to those of the daily dose employed in the recent open label clinical trial of11.

In contrast to the current findings, prior studies have reported both unilateral26 and bilateral21,23,24 increases in cerebellar CBF. However, there are several key methodological differences that are worth noting. First and foremost, the prior studies examined evoked CBF changes in response to the concurrent application of 30 to 90 s of VNS, whereas the current study assessed resting-state CBF measures obtained before and after the application of taVNS. Thus, the prior findings are limited to characterizing the acute cerebellar CBF response to VNS, while the current findings offer evidence of a sustained decrease in cerebellar CBF in reponse to taVNS. Second, VNS consists of an implantable device targeting the cervical vagus nerve, while taVNS is non-invasive and applied to a branch originating from the superior ganglion of the vagus nerve. This difference in neural targets may be expected to produce different physiological responses. Third, the use of a sub-threshold high frequency (20 kHz) stimulus in the current study enabled the use of a single-blind placebo controlled design, in contrast to the open-label design with the super-threshold low frequency (20 to 30 Hz) stimuli employed in prior studies. The placebo control in the current study enabled a clean isolation of the direct vagal stimulation effects from those associated with stimulus-associated sensations and responses, a separation that was not possible with prior designs. In addition, there are likely to be frequency dependent neurophysiological mechanisms that lead to differences in the effects between the high (20 kHz) versus low (20 to 30 Hz) frequency stimulus paradigms. Finally, the current findings were obtained in healthy subjects who were naïve to taVNS, whereas the prior results characterized the responses in MD or epilepsy patients at a time point that was hours to months after the commencement of continuous VNS therapy. Thus, the current study provides critical information about the effect of taVNS on the healthy brain that can serve as a baseline for the future assessment of effects in clinical populations.

Stimulus-induced changes in cerebellar activity have also been observed with BOLD fMRI, but the experimental methods and results greatly vary across studies with reports of both evoked activations in healthy subjects1,15,16 and epileptic patients36 and deactivations in healthy subjects2 and MD subjects37,38. As the BOLD signal exhibits a complex dependence on neural, vascular, and metabolic factors, a definitive link between observed BOLD changes and underlying CBF changes cannot be established without additional experimental measures39. However for most conditions, task-evoked increases and decreases in the BOLD signal are generally thought to reflect concomitant CBF increases and decreases. Thus, while the fMRI findings cannot resolve whether VNS induces increases or decreases in cerebellar CBF, they offer general support for the potential of VNS to modulate CBF levels in the cerebellum.

In this study, significant CBF changes were limited to the cerebellum. The prior PET and SPECT studies reported CBF changes in additional brain regions, although the results were highly variable across studies. As noted above, a number of these studies considered the evoked CBF response as compared to the sustained response reported here. Two SPECT studies examined the effects of 4 to 10 weeks of chronic VNS19,20 and reported increases and decreases in CBF in multiple brain regions, but there were differences between the two studies in the regions that were identified and neither study reported cerebellar CBF changes. In light of the variability observed in prior studies and the significant methodological differences discussed above, discrepancies between the prior and current findings are to be expected. In particular, the interpretation of the changes observed in the prior studies typically discussed the potential modulation of activity in regions that were thought to be associated with the disease state. These brain regions would not necessarily show an effect in healthy subjects. Nevertheless, a future study examining the effects of sub-threshold taVNS over a longer period (e.g. weeks) would be of interest to determine if CBF changes can be detected in areas beyond the cerebellum.

While the cerebellum has traditionally been viewed as a motor control region, a wealth of evidence suggests that it also plays important roles in cognitive, affective, and social processing40,41,42. In particular, the cerebellar regions Crus I and Crus II, which exhibited the greatest overlap with significant CBF changes in this study, are considered to be primarily associated with non-motor functions, such as working memory, executive function, language processing, social mentalizing, and emotional cognition41,43,44. Interestingly, the cerebellum has a purported role in conditions for which taVNS has beneficial effects. For example, cerebellar dysfunction has been implicated in major depressive disorder (MDD)45, and a preliminary clinical study found that a multi-week regime of taVNS (relative to sham) can alleviate depression severity as assessed via the Hamilton Depression Scale in MDD patients5. TaVNS-induced cerebellar activation has also been reported in an fMRI study of MDD patients, though individual differences in activation were not related to clinical outcomes46. Furthermore, taVNS has shown promise as a treatment for migraine13, another disorder that has been linked to functional and structural alterations within the posterior cerebellum, namely Crus I and Crus II47,48. Finally, a recent study found that a 12-week regime of taVNS treatment (with parameters similar to those used in this study) improved symptoms for patients with rheumatoid arthritis (RA)11. Reduced gray matter of the cerebellum has been associated with higher levels of peripheral inflammation in RA patients49. Together, these findings suggest that the link between taVNS therapeutic efficacy and cerebellar function should be further explored.

There are limitations to the study that would be important to address in future follow-up work. First, the analysis required that all four ASL scans (Precontrol, Postcontrol, Pretreatment, and Posttreatment) were free from artifacts, as issues in any run could affect the main contrast. However, this strict approach makes the study sensitive to any issues like subject motion that can cause artifacts in the CBF maps, leading us to exclude five out of twenty-five subjects, even though a much lower fraction (10 out of 100) of the scans exhibited artifacts. For future work using ASL to probe taVNS effects, one improvement would be to use ASL sequences with higher SNR, such as 3D PCASL with background suppression to reduce motion sensitivity50,51. The increased SNR can be used to shorten the scan lengths while maintaining similar image quality, mitigating the risk of motion associated with longer scan times. The higher SNR also increases the ability to detect CBF changes in a group analysis, so that fewer subjects would be needed to detect the same effects, increasing the study robustness to subject exclusions. Second, there is a small possibility of a thermo-regulatory vascular confound in response to local energy deposition from taVNS. We believe this to be unlikely, as no subjects reported sensations of heating at the stimulation site. Furthermore, an in-situ thermal test of the device found minimal (< 0.3 °C) changes in temperature over a 15-min period in both active and sham conditions. However, we cannot completely rule out a vascular response that quickly dissipates heat and could prevent any detection of warmth despite an underlying change in neural response.

In this study, the taVNS was applied at the cymba concha, which is 100% innervated by the auricular branch of the vagus nerve8. However, given the overlapping innervation by the greater auricular nerve and the auriculotemporal nerve of auricular regions proximal to the cymba concha8, it is not possible to completely rule out the effects on CBF through these pathways. For example, the auriculotemporal nerve lies within the mandibular branch of the trigeminal nerve, and prior work indicates that stimulation of the trigeminal nerve can cause changes in CBF52.

In conclusion, the present results indicate that 15 min of high-frequency sub-threshold taVNS can induce sustained CBF decreases in the bilateral posterior cerebellum in a sample of healthy subjects. Further work is needed to determine if similar changes can be observed in clinical populations and to assess if the measures can provide greater insight into the mechanisms underlying treatment outcomes. In addition, future research should also address how stimulation parameters such as frequency, duration, and electrode location impact the response.