Introduction

The vagus nerve is a mixed nerve composed of 20% efferent fibers and 80% afferent fibers1. It plays a major role in maintaining autonomic tone throughout in the brain, the thorax, and the abdomen. The vagus nerve stimulation (VNS), first approved by the US Food and Drug Association (FDA) in 1997 in the form of a cervical implantable device, is currently approved for drug-resistant epilepsy, depression, and morbid obesity2,3. Further studies have shown promising results that VNS can treat disorders such as anxiety disorder, Alzheimer’s disease, cluster headache, heart failure, sepsis, lung injury, rheumatoid arthritis, inflammatory bowel disease, obesity, and diabetes4,5,6.

However, this classic approach requires an invasive procedure that involves cervical dissection and foreign body implantation. Due to the potential risks of surgical complications, hospitalization cost, and accessibility, the application of the intervention is limited despite its demonstrated benefits. Thin-myelinated afferent fibers of the cervical vagus nerve are targets for clinically effective VNS6. However, it is impossible to selectively stimulate certain fibers through conventional methods, and efferent branches are unintentionally stimulated together. Efferent branches send parasympathetic cholinergic signals to target organs, including the lungs, digestive tract, and heart3. The most critical reports include cardiac events, such as bradycardia and asystole, occurring in approximately 1 per 1000 cases as a result of direct stimulation of the cardiac branches of the vagus nerve7. Injury to the efferent branches may also cause hoarseness, dyspnea, and dysphagia related to left vocal cord paralysis8. Therefore, there has been an increasing demand for a non-invasive alternative method to avoid side effects but achieve similar effects9.

Transcutaneous VNS can be approached using two methods, stimulation of either the cervical bundle or the auricular branch10,11. Because there is no need for surgical procedures, clinicians in many fields suggest that transcutaneous VNS can be applied to a wider range of patients and is even tolerable for patients with disorders that are not life-threatening. Transcutaneous auricular vagus nerve stimulation (taVNS) is a newer delivery system that uses an electrical transcutaneous stimulation device placed at the concha or tragus of the ear. fMRI studies have shown that when these sites are stimulated, the ipsilateral nucleus of tractus solitarius (NTS) is activated through the vagal projections in the brainstem and forebrain12. Opposing to cervical VNS, taVNS uses a physiological pathway to activate the NTS and dorsal motor nucleus, which subsequently sends impulses to the heart surface bilaterally via efferent cervical vagus nerves13. Therefore, this technique avoids the possibility of directly and asymmetrically stimulating cardiac motor efferent fibers, which could result in unfavorable cardiac events. The taVNS combines advantageous qualities including non-invasiveness, cost, easiness, and efficacy to make it an appealing intervention for neuropsychiatric illnesses13.

Many authors consider taVNS a safer substitute for the previous surgical device4,14,15,16,17. Thus far, taVNS use has been studied in many conditions such as epilepsy, tinnitus, depression, stroke, Alzheimer’s disease18.

Therefore, it is important to assess the safety of taVNS. Although the general impression is that taVNS is a safe technique with only mild and transient adverse effects (AEs), human data on safety and tolerability are largely based on clinical trials with small study populations. A review study in 2018 addressed this issue and was the first to systematically report transcutaneous vagus nerve stimulation treatment harms11. The author concluded that transcutaneous vagus nerve stimulation is well tolerated and safe, with only mild side effects such as local skin irritation due to electrode placement, headache, and nasopharyngitis11. Overall, 30 serious adverse events were reported (22 participants from 7 studies), but none of them were confirmed to be caused by transcutaneous VNS11. However, this study suggests a comprehensive perspective, including transcutaneous vagus nerve stimulation techniques such as that applied to the neck. Adverse effects specifically caused by taVNS have not yet been investigated. In addition, because of the relatively high interest in taVNS, many human studies have been further reported since this publication and require systematic and meta-analytic analysis for possible adverse events. To assess the safety of taVNS under different conditions and study designs, we performed this systematic review and meta-analysis of clinical reports on taVNS.

Another goal of our study was to explore the relationship between the study characteristics and the frequency of adverse events. Because there are no guidelines for taVNS clinical trials, there is a large interstudy heterogeneity in study methodology, device type, electrode design, and stimulation parameters17. Stimulation parameters vary among studies, with different stimulation frequencies, pulse widths, polarities, stimulation times, and utilized pulse types13. However, no standardized stimulation parameters elicit optimum efficacy and safety18. Therefore, regarding the safety aspect of taVNS, the method of monitoring and assessment methods, the frequency of adverse events, and their relationship with clinical variables have not been established. This review aimed to (1) systematically analyze study characteristics and AE assessment, (2) characterize and analyze possible AEs and their incidence, (3) search for predictable risk factors, (4) analyze AEs by severity, and (5) suggest an evidence-based taVNS adverse events questionnaire for safety monitoring during taVNS.

Methods and analysis

Literature search

This systematic review was registered at PROSPERO (registration number 340559), and it was conducted according to the guidelines of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses–PRISMA19 and the Cochrane Handbook for Systematic Reviews20. We searched for articles published through April 7, 2022, in Medline, Embase, Web of Science, Cochrane, and Lilacs databases, using the following search terms: (("Transcutaneous vagal nerve stimulation") OR ("Transcutaneous vagus nerve stimulation") OR ("Auricular vagal nerve stimulation") OR ("Auricular vagus nerve stimulation") OR (tVNS) OR (taVNS) OR (aVNS)).

Inclusion and exclusion criteria

The following criteria were adopted for eligibility: (1) articles published in English; and (2) articles that reported the effects of taVNS in humans. Therefore, we excluded the following articles: (1) animal studies; (2) review articles; (3) articles reporting duplicate data or data extracted from original articles; (4) abstracts lacking full text; and (5) articles addressing other brain stimulation techniques.

Data extraction

Search findings were imported into Endnote Version X9, and uploaded to the Covidence system, where duplicate articles were removed. To independent reviewers (AM and AYK) performed the title and abstract screening process. Two reviewers then read the full text to assess their eligibility for inclusion. During the review process, studies were excluded under consensus if they did not meet our criteria, and a third reviewer was consulted to resolve any conflicts (FF). Out of 824 articles, 175 articles were included for review, and data from each article were sorted out for analysis. Four reviewers independently (ACLG, AM, AYK, and PdSM) extracted data using a predefined data extraction template in Excel. Any discrepancies were resolved by consensus, with the corresponding author (FF) consulted when necessary. The reviewers developed a structured checklist that included the following items: (1) Study characteristics: first author, title, journal, year of publication, country, etc.; (2) Trial characteristics: design, randomization method, blinding, etc.; (3) Participant characteristics: sample size, age, sex, comorbidity (healthy vs. disease), etc.; (4) Intervention characteristics: device type, interventional protocol (number of sessions, stimulus frequency, intensity, pulse, target area, duration, etc.), and control group (no vs. yes); (5) Outcomes: primary outcome, secondary outcome, and main conclusions; (6) Assessment method of adverse events; (7) Adverse and severe adverse events. The original authors were contacted for any missing data. Discrepancies were resolved through them through team discussions.

Quantitative analysis

Comparison of study characteristics

For this analysis, we divided the studies into three groups: (1) studies not reporting AEs; (2) studies reporting no AEs, and (3) studies reporting at least 1 AE. Continuous variables were reported as mean and standard deviations and compared using one-way analysis of variance. Categorical variables were reported as absolute numbers and percentages and were compared by Fisher's exact test. We performed Bonferroni corrections for multiple comparisons to interpret the results.

Analysis of the effects of active taVNS on adverse events

Furthermore, a quantitative analysis was performed on the clinical trials reporting the number of subjects with at least one adverse event or the intensity of the adverse events in the active taVNS and the control groups.

For the studies with the number of subjects who reported at least one adverse event, we performed a random-effects meta-analysis to calculate the pooled risk differences and the 95% confidence interval (CI) based on the Mantel–Haenszel method. We calculated specifically the risk differences developing in ear pain, dizziness, skin redness, and headache using the same approach. In this analysis, the command "metabin" from the "meta" package in RStudio was used21.

In the studies reporting the intensity of adverse events in both groups, a random-effects meta-analysis was performed to calculate the pooled standard mean difference and 95% CI, based on the inverse variance method, of intensity in headaches, nausea, unpleasant feelings, neck pain, changes in concentration, stinging sensation, muscle contraction, burning sensation, dizziness, skin irritation, and fluctuation feeling. For this analysis, the command "metacont" from the "meta" package in RStudio was used21.

Overall incidence in the population exposed to active taVNS

Aiming to detect the most common adverse events related to taVNS, we analyzed all the studies that documented the number of subjects exposed to active taVNS, the number of subjects that reported at least one event, the duration of the taVNS sessions, and the number of days that a session was delivered. The product of the duration of the session and the number of days that the session was delivered was called cumulative exposure time (if there were more than one session per day, the time was multiplied by the number of sessions). The overall incidence rate was calculated for the total number of adverse events and for each type of adverse event reported on a 100,000 person-minutes-days scale. To perform the meta-analytic incidence, random-effects and the inverse variance method were used to the pooled incidence and the 95% confidence intervals. We used the command “metarate” from the “meta” package in RStudio in this analysis21.

Analysis of stimulation parameters effects on the adverse events per person

We modeled a linear regression to identify associated the factors with the development of adverse events in subjects who underwent active taVNS. To control for the number of subjects in each study, we created a variable, the number of adverse events per person, which consisted of the number of patients experiencing adverse events receiving active taVNS in each study over the total number of subjects receiving active taVNS per study. To select the covariates for the model, we performed univariate regressions, accepting a significance of p < 0.25 for their inclusion in the multivariate model. A "purposeful selection" method was used to create the multivariate model; covariates that were not statistically or clinically significant as well as confounders were removed from the model22. Covariates were considered confounders if they changed the beta coefficient of the dependent variable by ≥ 10%. Moreover, biologically plausible variables, such as age and the prevalence of female subjects, were included in the multivariate model as well.

Studies were included in the linear regression analysis if they had an active taVNS group (i.e., one arm or two-arm studies) and reported the number of patients experiencing adverse events in said group. All analyses were performed using RStudio version 2022.2.2.48523.

Reporting of the severe adverse events and the relatedness with taVNS

All the adverse events reported as severe by the authors were analyzed individually for their severity using the Common Terminology Criteria for Adverse Events (CTCAE). The grades were 1–5, being in general (1) mild symptoms; (2) moderate; (3) severe but not life-threatening; (4) life-threatening consequences; and (5) death related to adverse events. For this review, we considered grades 3–5 as severe24.

To determine if the adverse events classified as severe were related to the taVNS, we used the nine criteria of causality proposed by Bradford Hill. According to these criteria, a relationship between two variables, to be defined as causation, must be analyzed regarding the: (1) association strength; (2) consistency; (3) specificity; (4) temporality; (5) biological gradient; (6) plausibility; (7) coherence; (8) experiment; and (9) analogy25.

Quality assessment

The risk of bias was assessed in controlled studies, either parallel arm or cross-over randomized controlled trials (RCTs), using Cochrane Collaboration’s tool (Risk of Bias Tool 2.0)26 concerning its specific domains.

The non-randomized studies (case reports and one arm studies) were inherit classified as a high risk of bias by our research team, since these studies do not have randomization, blinding, and a control group.

Results

We retrieved 824 articles using the search term previously mentioned, and after excluding studies according to our selection criteria, 174 articles remained with a total of 177 experiments (seven articles presented two experiments and two articles presented duplicated data) remained. Throughout this paper, the term “study” refers to the experiment and not the article. Seventy-nine (44.63%) studies reported the presence or absence of adverse events, while 98 (55.37%) studies did not. Only 44 (24.86%) studies reported at least one adverse event occurred. The flowchart detailing the study selection is provided in supplementary material 1.

Studies that did versus did not report the presence/absence of AEs

We evaluated 177 studies that assessed 6322 subjects with a mean age of 34.99 (S.D. = 15.15) years and 53.51% of the subjects were female. Most studies were RCT with either a cross-over or parallel design. Of them, 55.37% (98/177) of studies did not mention the presence or absence of any AEs. Most studies also included a control group. Regarding taVNS parameters, most studies in all three groups reported their characteristics (Table 1).

Table 1 Study characteristics and taVNS parameters of studies not reporting AEs, studies reporting no AEs, and studies reporting at least one AE.

When comparing studies reporting no AEs, or at least one AE vs. non-reporting AEs, we observed significant differences in population type, number of sessions, and stimulation side. A significantly higher number of studies evaluated taVNS in healthy populations and unilaterally without mentioning AEs, versus reporting no AEs or at least one AE. Moreover, the study group not reporting AE had significantly more studies with 1–2 sessions when compared with the studies reporting no AE or at least one AE.

We contacted the corresponding authors of the studies that did not report AEs to inquire about any relevant information from their trials; a total of 14 authors replied. Of them, 11 answered that their participants did not report any AEs and all the information was included in the published articles, while four studies reported mild adverse events, including discomfort with the ear clip (n = 1), skin irritation (n = 1), headache (n = 1), and annoying sensation of the stimulation (n = 1). These studies were included in their respective groups in the supplementary material 2; the results were similar, and these data were not used in the subsequent analyses.

Studies that reported AE, device analysis, and assessment method

A total of 76 articles (79 studies) that reported adverse events were used for the data extraction. The characteristics of these studies are summarized in Table 2. The most common study populations were healthy (40.51%), epilepsy (15.19%), tinnitus (7.60%), depression (6.33%), stroke (6.33%), and migraine (2.53%). Three studies were of pediatric populations, one each of epilepsy, nephrotic syndrome, and infant oral feeding dysfunction. There were 33 parallel, 24 cross-over, 19 one arm, one pilot study, and two case studies.

Table 2 Characteristics of included studies that report adverse events according to study population.

The types of devices and electrodes used for the stimulation varied significantly. Of the 79 studies that were reviewed, 41 studies (51.3%) used specific commercial devices designed for taVNS. The most common was NEMOS by Cerbomed (Erlangen, Germany) used in 22 studies (27.5%), which consisted of two titanium electrodes positioned a top of a silicon earplug designed to stimulate the left cymba concha. Seven studies (8.8%) use a specific clip electrode inserted into the left tragus27. The NET-1000 and NET-2000 (Auri-Stim Medical, CO, USA) used four 3 mm electrodes placed crosswise to stimulate the bilateral outer ear canals28, while 35 studies (43.8%) used off-label custom devices, and four studies (5.0%) did not specify the equipment used. The majority of the studies (64 studies, 81.01%) stimulated the left side, while five studies (6.3%) stimulated the right side and 10 studies (12.6%) stimulated both sides. Most studies used stimulation sites within the cymba concha, cavum concha, tragus, and external acoustic meatus. The earlobe was the most common site for sham stimulation. Two studies used 1 Hz stimulation as an active control for comparison with 25 Hz taVNS stimulation, both at the left cymba concha29.

The methods of AEs assessment varied among the studies. Thirty-four studies (43.03%) did not specify the method of assessment but reported only AEs results. Other studies used qualitative assessment methods such as patient interviews, diaries, self-reports, and clinician objective monitoring. Notably, 18 studies used a quantitative scale in which participants were asked to rate potential side effects such as headache, neck pain, dizziness, tiredness, nausea, tingling sensations, skin irritations at the ear, and concentration, and mood changes on a point scale30. All of these studies had a control group, and 6 studies (33.3%) reported that there was no significant difference versus the sham group participants (Table 2).

Effects of active taVNS on the development of adverse events

To assess the effects of active taVNS on the development of adverse events, we performed a quantitative analysis on the clinical trials that reported this information in their active and control groups.

Analysis of subjects reporting at least one adverse event

A total of 35 studies included in this review reported the number of subjects who developed at least one adverse event. This analysis gathered all available data of this review regarding the number of patients reporting AEs in the active and in the control groups: pooled data were obtained from 1708 patients.

Risk differences in development of any adverse events versus control

There were no differences in the risk of developing an adverse event between the active taVNS and controls (RD 0.013, 95% CI − 0.003 to 0.028, I2 = 45%). A forest plot is shown in Fig. 1.

Figure 1
figure 1

Forest plot of the studies that reported the number of subjects reporting at least one adverse event in the active and sham groups.

Differences in develop ear pain, dizziness, headache, and skin redness in comparison to control

Our analysis showed no risk differences in developing specific AE as ear pain, dizziness, skin redness, and headache. All of them presented a pooled risk difference of 0.00 (95% CI − 0.01 to 0.01, I2 = 0%), detailed information of these analyzes is provided in the supplementary material 3.

Analysis of differences in intensity of the adverse events

Nine studies included in our review reported the adverse events on different scales with different ranges representing the intensity of each symptom. Depending on the adverse event reported, there was a different number of studies and subjects included in the analysis because of the availability of the data.

From the adverse events available to analyze, our review did not detect intensity differences in any of them. Headaches (n = 567, SMD − 0.21, 95% CI -0.46 to 0.05, I2 = 56%), nausea (n = 567, SMD 0.14, 95% CI -0.08 to 0.36, I2 = 40%), unpleasant feeling (n = 567, SMD 0.05, 95% CI -0.20 to 0.29, I2 = 50%), neck pain (n = 481, SMD 0.06, 95% CI − 0.52 to 0.64, I2 = 89%), changes in concentration (n = 391, SMD 0.17, 95% CI − 0.33 to 0.68, I2 = 81%), stinging sensation (n = 368, SMD 0.66, 95% CI − 0.08 to 1.39, I2 = 91%), muscle contraction (n = 368, SMD 0.14, 95% CI − 0.09 to 0.37, I2 = 25%), burning sensation (n = 365, SMD 0.93, 95% CI − 0.16 to 2.02, I2 = 95%), dizziness (n = 192, SMD − 0.28, 95% CI − 0.61 to 0.05, I2 = 25%), skin irritation (n = 192, SMD 0.82, 95% CI -0.38 to 2.03, I2 = 91%), and fluctuation feeling (n = 144, SMD 0.43, 95% CI − 0.35 to 1.20, I2 = 80%). The forest plots and further information is provided in the supplementary material 4.

Overall incidence of adverse events in taVNS

From the 80 studies reporting the presence or absence of adverse events, 56 studies were included to calculate the cumulative incidence of the events described in the literature and define the most common adverse events related to taVNS. The 24 studies not included in this analysis did not clearly report the cumulative time of the stimulation, the total number of subjects exposed to taVNS, or did not report the number of subjects that developed the events. From 1330 patients analyzed, there were 167 that reported at least one adverse event. The total cumulative exposure time was 484,812 min-days of stimulation. The total cumulative incidence of adverse events was 12.84 events/ 100,000 person-minutes-days (95% CI 6.65 to 24.80). Based on our analysis, the three most common adverse events in consideration to the cumulative exposure time to taVNS were ear pain, headache, and tingling. The full order of the most common adverse events and the detailed information of the cumulative incidences can be found on Table 3.

Table 3 Meta-analytic pooled incidence rates (events/100,000 person-minutes-days of stimulation) for the total and each adverse event in patients exposed to taVNS.

Effects of parameters of stimulation on the AE

When analyzing for the relationships between taVNS parameters and the number of AEs per person receiving taVNS in the included studies, univariate analyses conveyed a significant association between the number of adverse events per person and duration of taVNS, number of sessions, and type of devices performing taVNS. Specific taVNS devices were shown to be significantly more likely to lead to AEs than off-label devices (ß: 0.149, 95% CI: − 0.010 to 0.289, p-value: 0.036). Repeated sessions and sessions lasting 60 min or more are also shown to be more likely to lead to AEs (ß: 0.126, 95% CI: 0.019 to 0.232, p-value: 0.022; and ß:0.151, 95% CI: 0.049 to 0.252, p-value: 0.004, respectively).

Furthermore, in the multivariate analysis, age, and higher prevalence of female subjects, proved not to be significant influences on the likelihood of a subject having an AE after taVNS across the assessed studies (ß: − 4.73, 95% CI: − 0.00003 to 0.00002, p-value: 0.727; and ß: 0.053, 95% CI: − 0.113 to 0.220, p-value: 0.520, respectively). When evaluating the inclusion of taVNS parameter variables, two of them proved to be significant on their own; duration of stimulation (ß: 0.285, 95% CI: 0.122 to 0.448, p-value: 0.001) and type of devices (ß: 0.061, 95% CI: 0.045 to 0.381, p-value: 0.014). However, when both were included in the model together, none of them proved to be significant. Nonetheless, these two variables proved to be colinear, thus the final multivariate analysis conveys the best fit model between the two, the one in which duration of stimulation was significant (Table 4).

Table 4 Univariate and multivariate regression analysis of taVNS parameters associated with the number of AEs reported by patients undergoing active taVNS in the included studies.

Severe adverse events

The authors of 10 studies reported 20 “severe” adverse events. One study used the International Conference on Harmonization of Technical Requirements for Registration of Pharmaceuticals for Human Use: Guidance for Good Clinical Practice to define adverse event severity, while two studies from the same author defined an adverse event as severe if the patient was hospitalized31,32. Seven studies did not mention the criteria used to define severity. In our review, adverse event severity was determined using the Common Terminology Criteria for Adverse Events (CTCAE). We applied the criteria for those adverse events that the authors identified as "severe" in their studies. Accordingly, of the 20 severe adverse events reported, only 15 were classified as severe based on this classification.

No severe cardiac AEs probably or possibly caused by taVNS were reported. In a randomized, double-blind controlled trial assessing the efficacy and safety of taVNS versus sham stimulation in 39 epilepsy patients, there was one report of palpitation, which was classified as “severe” because the patient was monitored and treated in an intensive care unit after the administration of nitroglycerine, but they appeared unlikely to be caused by the device31,33. Further exploration revealed that the patient had experienced comparable events earlier in the past, and she was dismissed from the hospital without complaint 1 day after normal results were noted in echocardiography, duplex sonography of the carotids, 24 h electrocardiography (ECG) monitoring, and thoracal radiography31. Cardiac arrhythmias may occur during taVNS due to parasympathetic innervation of the heart caused by the vagus nerve; however, no relevant ECG alterations were observed in any of the subjects enrolled in this study31. In a sham-controlled, double-blind, randomized clinical trial of 52 heart failure patients, two (one from the active taVNS group, one from the sham stimulation group) were hospitalized for persistent atrial fibrillation and underwent cardioversion, and three patients (one from active taVNS group, two from sham stimulation group) were hospitalized for heart failure34. The authors of this study reported no device-related side effects34 (Table 5).

Table 5 Adverse events reported as "severe" by the authors versus our classification based on the Common Terminology Criteria for Adverse Events (CTCAE).

Among the 15 severe adverse events identified by us (CTCAE grade ≥ 3), of which none were from healthy subject studies, only two were classified by the authors as "possibly” or “probably" related to taVNS. Therefore, we used the classic Bradford Hill criteria to analyze the relationship between these two severe adverse events and active taVNS. Our analysis shows that the event of anxiety and depression worsening leading to hospitalization could only address the temporality and plausibility criteria, while the suspected basal cell carcinoma addressed only the temporality criteria based on the information provided by the authors. Therefore, our team concluded that there was no causal relationship between the severe adverse events detected in this review and the taVNS (Table 6).

Table 6 Severe adverse events classified by the Common Terminology Criteria for Adverse Events reported to be "probably” or “possibly" related to taVNS by the authors’ versus our classification of relatedness based on the Bradford Hill criteria.

Quality assessment

In our review, most of the studies that we assessed the risk of bias were randomized clinical trials (n = 55). The overall evaluation consisted in 13 studies with a low risk of bias, 15 studies with some concerns, and 27 studies with a high risk of bias. The selective reporting (D5) was the domain with less problems detected, while the deviation from the intended interventions (D2), missing outcome data (D3), and outcomes measurement (D4) were the domains with major problems. Most of the studies did not provide enough or clear information on these domains (Supplementary material 5).

Discussion

We conducted the first systematic review and meta-analysis to evaluate specifically taVNS safety. Our study reviewed 167 articles describing taVNS use in human subjects and aimed to assess its safety and identify associated AEs. To analyze our data at different stages due to the significant discrepancy in reporting techniques, we categorized studies that (1) failed to report the absence or presence of AEs, (2) reported no AEs, and (3) reported at least one AE. Our main finding was that taVNS is a safe technique and does not increase the risk of developing and AE when compared to controls. Considering all the included studies, we detected that no severe adverse events were related to the stimulation according to the CTCAE classification and Bradford Hill’s criteria. We also found that more than half of the studies did not mention the presence or absence of AEs, and these studies did not differ from those that reported no AE or the presence of at least one AE.

Studies not reporting adverse events

More than half of the studies included in this review did not report any type of assessment of AEs, and it is evident that the inconsistency in AE reporting seen throughout many clinical trials translates into the taVNS field35. Interestingly, it has been shown that when studies do not publish AE data, a larger number of adverse events are seen in unpublished versus reported data36. In our review, however, evidence points to the opposite trend; trials on taVNS that did not report AE data were, for the most part, performed in healthy populations and undergoing 1–2 sessions. Thus, it is more likely that the taVNS trials not reporting AE data, did not do so because no AEs were experienced. Moreover, when we contacted the authors of trials that did not publish any AE data, the majority of those who replied stated that no AEs were experienced by the subjects in their trials, thus bolstering the assumption that the lack of reporting of AEs is more likely related to a lack of AEs rather than the omission of experienced AEs. Nonetheless, a significant reporting bias and inconsistencies were seen within taVNS studies, suggesting the need for a more standardized format for reporting AEs in this field.

Based on our incidence analysis per cumulative exposure time. The most frequently reported adverse event in the included studies were ear pain, headache, and tingling. Therefore, local AEs were the most frequent type, and most of them were thought to be related to the intervention delivery, specifically to the type of electrodes used for stimulation.

Safety assessment methods

Many taVNS trials did not specify their safety assessment methods. Among the reported, there were qualitative and quantitative methods. Quantitative scales were based on a patient's numerical rating of common side effects.

Fewer studies mentioned the criteria used to define the AEs severity. A review article published by Redgrave in 2018 defined “serious adverse events'' according to the original authors’ reports. In this review, among the 10 studies that reported “serious adverse events,” only two used the International Conference on Harmonization of Technical Requirements for Registration of Pharmaceuticals for Human Use: Guidance for Good Clinical Practice)32,37. One of these studies by Hasan et al. was also included in our analysis, whereas another study using cervical tVNS was excluded. Two studies from the same author defined an adverse event to be “severe” if the patient was hospitalized31,32. Otherwise, most studies (7/10) reported at least one “severe AE,” but did not mention their criteria of defining severity. In our review, the severity of the adverse events was determined according to the CTCAE. For future advancement in the field of taVNS to guarantee safety, a consistent definition and practice guideline for AE monitoring must be developed under consensus. Our analysis indicates a low incidence of any type of adverse events in patients exposed to taVNS. The 10 most frequent types were ear pain, headache, tingling, dizziness, skin redness, fatigue, prickling, pressure, itching, and unpleasant feeling. According to our incidence findings, this study is the first to suggest an evidence-based taVNS adverse events questionnaire to actively monitoring patient safety during taVNS. Our questionnaire is based on asking actively the most common types of adverse events accordingly to our review, the patient’s opinion about their relatedness with the treatment, and the classification of severity based on subjects’ opinion guided by the Common Terminology Criteria for Adverse Events (CTCAE). Furthermore, an open question for any additional adverse event experienced by the subject is applied (see supplementary material 6).

Effects of active taVNS on AE development

Our meta-analysis of 35 studies identified no pooled risk difference between active taVNS and control groups. Only two studies individually showed that taVNS had a small statistically significant effect on AE development. Busch et al. (2012) reported only cases of slight pain, pressure, prickling, itching, trickling, strange feelings, and irritation on swallowing, mostly in both groups. Song et al. detected events such as headache and nausea with only one case each of vomiting, ear pain, dizziness, vertigo, and fatigue, most of which were present in both groups. The two studies did not report severe AEs. When analyzed individually, no specific adverse event subgroup was more likely to appear due to active taVNS. Moreover, regarding the intensity of AE, our pooled analysis showed no differences in AE intensities between active taVNS and controls. Therefore, our analysis conveyed that active taVNS and controls have no differences in the AE development risk and in the AE intensities.

Factors associated to the development of adverse events

Our multivariate regression analysis revealed a significant association between duration of taVNS stimulation and the number of adverse events experienced per person. This association was not determined by the previous systematic review due to underreporting of tVNS doses11. However, considering the additional studies included in our review, more data was able to be gathered regarding duration. Moreover, it is reasonable to assume that the longer the exposure to taVNS, the higher likelihood of a subject developing an adverse event, something that is shown through our analysis. Interestingly, clinically important covariates such as age of participants in the included studies and the prevalence of more female subjects, did not also show a strong or significant relationship with adverse events per person undergoing active taVNS. This result suggests that taVNS can potentially be equally safe for all age ranges, with no difference in gender. It is worth mentioning, however, most studies in our sample included subjects between the ages of 20 and 60 years old, so this suggestion cannot be extrapolated to all age groups.

Regarding transcutaneous cervical VNS, concern may persist about stimulation of the neck. Because transcutaneous vagus nerve stimulation must pass through the skin barrier, relatively strong electrical currents are required, thus nearby non-vagal nerves in the neck may be co-stimulated, possibly including efferent cervical fibers10. Similar to the invasive method, transcutaneous cervical VNS stimulates the cervical trunk, thereby activating both efferent and afferent vagus nerve fibers in the carotid sheath38. Because it is possible that motor efferents innervating the sinoatrial node are stimulated, there are questions as to whether the cervical approach would be truly safe, and further investigation are needed in this aspect13. Because taVNS does not stimulate the neck directly, this approach may have a different safety profile from tcVNS, however its safety had not been reviewed independently until the current study.

Transcutaneous cervical VNS is most delivered via a handheld portable device positioned over the vagus nerve located adjacent to the carotid artery10. The most common device (GammaCore®, electroCore, Inc.) is FDA-approved for the treatment of migraine and episodic cluster headache10. The device must be held by the patient over the target position, during the entire stimulation. Because the device is self-administered, patients may not correctly stimulate the vagus nerve, by placing the device in the wrong position. Patients also cannot hold a device for long periods of time, repetitively, hence, there is a limitation to the amount of stimulation that can be delivered. In our analysis, 30 studies described only one session of taVNS, and 28 studies applied them to healthy subjects. Noting that most studies aiming to treat certain diseases used multiple stimulation sessions, future devices are more likely to be designed as a convenient take-home device, once proved that the device is safe to use. Therefore, easy and accurate application would become more important when devices are widely applied to patients, and taVNS has advantages in this regard.

According to our analysis, the most common stimulation site was the cymba concha, known as the only site, that is exclusively innervated by the auricular branch of the vagus nerve12. Other studies used either commercial or off-label custom devices to stimulate the cavum concha, external auditory meatus, or tragus. These regions are not only supplied by the auricular branch of the vagus nerve, but also by branches of the auriculotemporal nerve, glossopharyngeal nerve, facial nerve, and cervical nerves39. We established that regardless of the stimulation site, only non-severe side effects such as local skin problems, headache, and dizziness were observed.

As shown in our analysis, many different electrode forms (ear inserts, ear clips, headphones, etc.) have been introduced for taVNS. According to a previous study on earbud electrodes, given their safety, preliminary efficacy, and comfort results, they warrant further development and investigation. This is because mechanical pinching from electrode clips might cause discomfort, and small stainless steel ball electrodes can cause discomfort due to high current densities at the electrode–skin interface40. In our review, there was one case of hand pain related to device application, and minor reports of earclip discomfort. Univariate regression analysis showed that specific taVNS devices were significantly more likely to lead to AEs than off-label devices, but multivariate regression adjusted for age, sex, and duration showed no significant influence. Our review found that taVNS methods up to date are safe and tolerable, leaving room for future developers to improve the reported patient discomforts.

It is generally believed that taVNS must not be administered to the right ear, since efferent vagal fibers on the right side regulate heart rate41,42. Previous animal studies reported that right side stimulation more often resulted in bradycardia38,43. Therefore, most studies simply avoid the right side for safety concerns; concordantly, most of the studies (71.3%) examined here used the left side. Although few studies used the right or bilateral sides, our analysis revealed that they carry no additional risk. Previous studies reported that stimulation of the right side does not cause severe adverse events31. It is worth mentioning that studies applying bilateral cervical tVNS show cardiovascular side effects, which may explain why the preference for unilateral tVNS has been translated to taVNS44. However, our analysis and review revealed that bilateral taVNS did not significantly increase the likelihood of cardiovascular events. In fact, no cardiac issues were reported, which has traditionally been a concern for many researchers. This indicates that stimulation of the auricular branch may not directly affect the heart but rather have more afferent fibers that may break this projection unlike the cervical branch of the vagus nerve.

Severe adverse events

Of the 167 studies reviewed, we detected only two adverse events that we classified as severe in two different studies. Evensen et al. evaluated the safety and feasibility of taVNS in 15 patients with major depression from an inpatient psychiatric center and an outpatient psychiatric clinic. They observed an anxiety attack leading to hospitalization in the active taVNS group that only followed the temporality and partially followed the plausibility of the nine criteria proposed by Bradford Hill25. Another severe adverse event was reported by Bauer et al. The authors assessed the efficacy and safety of taVNS in patients with drug-resistant epilepsy. During the stimulation period, they detected in one patient a skin lesion that was suspected to be a basal cell carcinoma. The event only followed the temporality of the 9 criteria and the authors stated that the histology was not confirmed. Our team contacted the authors for further information and histology results with no answers until the submission of this review. Therefore, we conclude that both adverse events cited, besides severe, were not caused by the intervention based on those criteria.

Unlike the traditional concerns that researchers have with taVNS, there have been no reports of severe cardiac AEs that were associated with taVNS. Except for very few patients with underlying heart diseases and recurrent symptoms, we were unable to identify any severe cardiac issues in on this review. Our results confirm previous findings that instantaneous heart rate (HR), and systolic, diastolic, and mean blood pressures are not significantly modified by taVNS, therefore taVNS is safe and tolerable45.

Limitations

As most studies included in our analysis did not distinguish between adverse effect and adverse event, it was difficult to determine the causality of each side effect. Although our attempt to validate casualty of severe adverse events used a scale of relatedness, it is still difficult to determine the true relationship, since only limited information was provided by the original authors. In future studies not employing a sham group, another useful approach would be to correlate AEs with the intervention dosage.

Another major limitation of the literature was the heterogeneity among the included studies. Most studies were excluded from the analysis because they did not report any data regarding safety. The device, electrode type, and stimulation parameters were not specified by some reports. In addition, those studies included had varied assessment and reporting methods. Some studies reported the total frequency, events, or patients with AEs, while others reported each AE type. However, some studies reported no results. Also, some studies may have had a risk of publication or reporting bias. Therefore, although many studies were screened in our initial search, many did not report analyzable data and were excluded from the analysis. Despite these limitations, our findings indicate that AEs related to taVNS were mild and their frequency was low.

One implication of our review is that future taVNS studies should collect more data on AEs. The following three aspects are important:

  1. (1)

    Safety/toxicity AEs should be monitored actively, using structured questionnaires in which the rater should ask about each specific AE. In this article, we include a proposed questionnaire for AEs based on our findings (see supplementary material 6) that might be useful as it actively assesses AEs and asks subjects to relate the AE with the effects of taVNS.

  2. (2)

    Blinding Reporting AEs is necessary to develop better blinding techniques. In fact, most studies allowed patients to adjust the stimulation intensity under their subjective pain threshold. For example, in studies that used low-intensity stimulation as a sham control, it was possible that patients could discern about their group allocation. Researchers and clinicians should therefore further discuss methods of blinded sham stimulation based on future studies.

  3. (3)

    Precisely identify the duration and the number of sessions that the patients were exposed to develop the AE. Therefore, it will be easier to assess the incidence of the AE due to the cumulative exposure to the intervention.

Conclusion

Our review of AEs associated with taVNS indicated a selective reporting bias as almost half of the studies did not report the presence or absence of any AEs. We attribute this to the absence of AE on those studies. Furthermore, our meta-analysis conveyed that active taVNS and controls were not different in the risk of developing an AE and in the intensity of the events developed. In general, the incidence of AE was low, being ear pain, headache, and tingling the most frequent ones. No severe adverse event was shown to be caused by taVNS. Thus, we postulate that taVNS is a safe and feasible treatment option. However, while taVNS research moves from bench to bedside, it is essential that future studies explore AEs in an active, systematic, and standardized format.