Background

Navigated transcranial magnetic stimulation (nTMS) nowadays plays a crucial role in presurgical planning, as it can be used to map cortical areas associated with motor or language function [15]. When applied with high frequency during an object-naming task (repetitive nTMS = rTMS), this technique is able to elicit a transient impairment of language or speech performance within the scope of cortical mapping [69].

Although language mapping based on rTMS is frequently used for preoperative neurosurgical diagnostics as well as neuroscientific trials, the spatial resolution of this method is largely unknown [1, 46, 10]. In this context, the spatial resolution of rTMS is regarded as the average of distances between language-positive and language-negative stimulation points. Since direct comparison to intraoperative stimulation by direct cortical stimulation (DCS), which is able to differentiate between essential, language-positive and language-negative areas, is not possible in healthy subjects, the present study was designed to assess the minimum spatial resolution of rTMS for language mapping. This was achieved by differentiating between rTMS hotspots (stimulation points with high naming error rates due to rTMS) and adjacent rTMS mapping points (stimulation points with low naming error rates), followed by discussion of results with regard to previous data derived from studies that used pre- or intraoperative mapping techniques. Regarding the stimulation approach, rTMS was carried out over predefined spots distributed across the left hemisphere, which follows the most common approach of rTMS-based language mapping to date.

Methods

Volunteers

All mapping sessions were originally performed in 19 healthy, right-handed (as indicated by the Edinburgh Handedness Inventory = EHI) subjects in 2013 to investigate different research questions of rTMS-based language mapping [11, 12]. The enrolled volunteers were German native speakers without general transcranial magnetic stimulation (TMS) exclusion criteria (e.g., metal implants).

Magnetic resonance imaging

All imaging was performed on a magnetic resonance scanner (Achieva 3T, Philips Medical Systems, The Netherlands B.V.) by the use of an eight-channel phased-array head coil. Our scanning protocol consisted of a three-dimensional (3-D) gradient echo sequence (TR/TE 9/4 ms, 1 mm3 isovoxel covering the whole head, 6 min and 58 s acquisition time) without intravenous contrast administration. Subsequent to imaging, the 3-D magnetic resonance imaging (MRI) dataset of each subject was transferred to the rTMS system via DICOM standard.

Transcranial magnetic stimulation

In all volunteers, language mapping by rTMS was performed using the Nexstim eXimia NBS, version 4.3, combined with a NexSpeech® module and a biphasic figure-of-eight stimulation coil (Nexstim Oy, Helsinki, Finland).

Task

During baseline testing and rTMS-based language mapping, volunteers participated in an object-naming task, which has been frequently used in previous rTMS investigations [2, 68, 1012]. The task consisted of a total amount of 100 colored photographs, similar to the objects of the Snodgrass and Vanderwart picture set (1980) [1113]. The photographs portrayed familiar living as well as non-living objects (e.g., banana, chair, snake), and had to be named in German as quickly and precisely as possible.

For baseline testing, the volunteers had to name all objects, and any delayed or misnamed objects were discarded. Baseline testing was carried out twice, meaning that the second run was performed with the remaining object stack immediately after the first testing, and the total number of naming errors was documented in the end. Consequently, only objects that were correctly named twice were used during language mapping. Thus, baseline testing was performed to be able to remove objects the individual participants were not familiar with, resulting in an individualized stack of images. During language mapping, incorporation of objects that the participants are not acquainted with could cause incorrect attribution of naming errors to stimulation effects rather than to object unfamiliarity, leading to imprecise results.

Procedure

The examination started with the determination of the individual resting motor threshold (RMT) by motor mapping of the cortical representation of the hand area [2, 6, 7, 10]. The volunteers sat in a comfortable chair with armrests, and electrodes (Neuroline 720, Ambu, Ballerup, Denmark) were placed over the abductor pollicis brevis (APB) and abductor digiti minimi (ADM) muscle of the right hand to be able to detect motor evoked potentials (MEPs) during continuous electromyography (EMG) recording. Furthermore, the reference electrode was placed at the elbow. Then, a coarse round of single-pulse stimulations was conducted to localize the spot with the highest MEP amplitude, the motor hotspot, which is usually found within the area of the hand knob [14]. During pulse application, the induced electrical field was oriented perpendicular to the precentral gyrus, and the RMT was then determined at the motor hotspot. In this context, the RMT was defined as the lowest stimulation intensity that elicits MEPs over 50 µV in amplitude in at least 50% of stimulation trials in a relaxed muscle [15]. According to the stimulation protocol, the exact stimulation intensity for later language mapping was adjusted with respect to the RMT.

During stimulation, the subjects had to name their individualized sets of objects according to previous baseline testing. Thus, all objects that were correctly named twice according to baseline testing were displayed in randomized order while rTMS was applied in a time-locked fashion. Overall, 46 left-hemispheric cortical spots, which were manually tagged on the 3-D MRI scan of the volunteer prior to mapping, were stimulated three times each in a row without any particular order (Fig. 1), as reported previously [11, 12]. Thus, mapping was carried out over predefined spots.

Fig. 1
figure 1

Cortical parcellation system (CPS). This figure visualizes the cortical stimulation targets (white spots, n = 46) within a template of the left hemisphere. In addition, the cortical surface is divided into subregions, and the numbers refer to the anatomical names of the stimulated subregions (see Table 1 for anatomical names and abbreviations of the CPS)

Due to unacceptable stimulation-related discomfort, none of the 46 points were located within the following regions: orbital part of the inferior frontal gyrus (orIFG), polar superior temporal gyrus (polSTG), polar middle temporal gyrus (polMTG), anterior middle temporal gyrus (aMTG), polar superior frontal gyrus (polSFG), polar middle frontal gyrus (polMFG), and polar inferior frontal gyrus (polIFG). In addition, the inferior temporal gyrus (ITG) was also not mapped because of the increased coil-cortex distance. Parcellation of the anatomical structures was performed according to the cortical parcellation system (CPS, Table 1; Fig. 1) [2, 6, 16].

Table 1 Cortical parcellation system (CPS)

For transcranial stimulation, the magnetic coil was placed tangential to the subject’s skull, and the induced electrical field was oriented in strict anterior–posterior direction during language mapping [2, 6, 8, 17, 18]. Both the coil angulation (tangential to the subject’s skull) and electrical field orientation (anterior–posterior) were displayed by the help of the neuronavigation unit. Stimulation trials with incorrect coil angulation or orientation were repeated and not taken into account during analysis. All points of stimulation and the electrical field strength at the stimulation spots were automatically saved for later analysis. Moreover, a video of the naming performance during baseline testing as well as during stimulation was recorded for post hoc analysis.

Protocol

For each mapping session, the following stimulation parameters were chosen, as they (a) have shown to be efficient in terms of eliciting reproducible naming errors during object naming, and (b) have been proven to be well tolerable and safe for the individual subject [1, 2, 68, 1012]:

  • Stimulation intensity: 100% of the RMT

  • Stimulation frequency: 7 Hz

  • Number of pulses: 10

  • Duration of each stimulation burst (7 Hz/10 pulses): 1430 ms

  • Picture-to-trigger interval (=PTI, time between the presentation of an object on the screen and the beginning of the rTMS pulse): 0 ms

  • Display time (=DT, duration of the screening of an object): 700 ms

  • Inter-picture interval (=IPI, time between the screening of two objects): 3000 ms

Data analysis

Error maps

All videos were analyzed for no-response errors strictly blinded to the stimulated cortical spots by the same person who had already conducted rTMS [68, 19]. A no-response error was defined as a complete lack of naming response within the duration of the stimulation. Other error types, like hesitations or performance errors, were not taken into account in the present investigation. Although other error types represent stimulation-induced disruption of important language subfunctions as well, we decided to not take these categories into account since no responses have proven to be among the most frequent error types whilst being easy to detect during video analysis [6, 20].

The total numbers of no-response errors as well as the numbers of stimulation bursts were pooled across all volunteers. Then, mapping results of the 46 stimulated cortical spots were projected into the CPS by putting pooled error rates (=number of induced no responses at one of the 46 cortical spots divided by the total number of stimulation bursts applied to this spot) on a brain template of the left hemisphere (Fig. 2).

Fig. 2
figure 2

Distribution of no-response errors. This figure shows the no-response error rates (=number of induced no responses at a certain stimulation spot divided by the total number of stimulation bursts applied to this spot) as a percentage projected into the cortical parcellation system (CPS) including all stimulated spots (n = 46). Additionally, the four identified hotspots are marked

Resolution measurement

Cortical spots that were prone to comparatively high no-response error rates (=hotspots) and were surrounded by four spots with lower error rates were identified from the raw data of mapping results and the error map (Fig. 2). Hence, hotspot definition was based on visual inspection of error distributions (Fig. 2) without additional statistical testing between single stimulation points. The four adjacent spots were separated into one ventral, one dorsal, one apical, and one caudal point with respect to the localization of the hotspot. If there were more than four points matching the inclusion criteria for being adequate adjacent spots, the closest one to the hotspot was chosen. Applying the described rule to Fig. 2, a number of four stimulated points fulfilled the hotspot criteria (hotspots 1, 2, 3, and 4). Hotspot 1, for example, shows a no-response error rate of 11%, and is surrounded by values of only 7% (ventral spot within the mMFG), 5% (apical spot within the mMFG), 0% (dorsal spot within the pMFG), and 5% (caudal spot within the opIFG). Each of these adjacent spots shows an error rate of less than 11% and can clearly be characterized as ventral, dorsal, apical, or caudal with respect to the hotspot.

After identification of the four suitable hotspots and their adjacent points on the template (Fig. 2), these spots were exported from each volunteer’s individual rTMS mapping session via DICOM standard to an external working station. Then, measurement of a minimum spatial resolution for rTMS-based language mapping was performed using OsiriX imaging software (OsiriX version 5.8.5, Pixmeo SARL, Bernex, Switzerland). Therefore, the two-dimensional (2-D) distances parallel to the cortical surface between each hotspot and its corresponding adjacent points were measured separately on the individual coronal MRI slices for the apical and caudal points to the hotspot (Fig. 3a), and on the sagittal MRI slices for the distance between the hotspot and the ventral as well as the dorsal spots (Fig. 3b). Since measurement was done between hotspots and adjacent mapping points belonging to the CPS and not on a finer-grained basis including cortical spots lying in-between, we refer to a minimum spatial resolution.

Fig. 3
figure 3

Spatial resolution measurement. Illustration of the distance measurement procedure on a subject’s individual coronal magnetic resonance imaging (MRI) slice for the apical and caudal points to the hotspot (a), and on a sagittal MRI slice for the distance between the hotspot and the ventral and dorsal spots (b). The black lines represent the distance between the hotspot (H) and the corresponding apical (a) or dorsal point (b) parallel to the cortical surface

Statistics

Mean values ± standard deviation (SD), medians, minimum and maximum values of subject-related characteristics, mapping parameters, and distances between mapping spots were calculated by using GraphPad Prism (GraphPad Prism 6, La Jolla, CA, USA). Furthermore, a one-way analysis of variance (ANOVA) between the individual electrical field strengths of the enrolled subjects at the hotspots was performed, and a one-way ANOVA followed by Tukey’s multiple comparisons test, including calculation of 95% confidence intervals (CIs), was carried out to compare distance measurements between hotspots and adjacent mapping points. For all statistical calculations, a p value of <0.05 was considered statistically significant.

Results

Subject and mapping characteristics

Mappings were successfully performed in 19 healthy, right-handed volunteers, which were already analyzed for different purposes in previous investigations [11, 12]. Table 2 gives an overview of subject-related characteristics and mapping parameters. Regarding the comparison of electrical field strengths at the hotspots (Table 2), there was no statistically significant difference between the four hotspots (p = 0.3445).

Table 2 Subject and mapping characteristics

Spatial resolution

Overall, mean distance values of 13.8 ± 6.4 mm (average of all hotspots to ventral points, range 0.7–30.7 mm), 10.8 ± 4.8 mm (average of all hotspots to dorsal points, range 2.0–26.5 mm), 16.6 ± 4.8 mm (average of all hotspots to apical points, range 0.9–27.5 mm), and 13.8 ± 4.3 mm (average of all hotspots to caudal points, range 2.0–24.2 mm) were measured. More detailed, Table 3 provides information about the mean ± SD, minimum, and maximum distance values of the measurements between each particular hotspot and the corresponding adjacent points. Furthermore, the results of measurements between all hotspots taken together and adjacent points are illustrated in Fig. 4.

Table 3 Measurement results
Fig. 4
figure 4

Measurement results. Boxplots with median, min-, and max-whiskers, and quartile-boxes for the hotspot to ventral, dorsal, apical, and caudal distance measurements (in mm)

According to ANOVA, statistically significant differences regarding the measurements between the hotspots and adjacent points were revealed with respect to hotspot 1, 2, and 3 (F = 16.5, F = 11.7, F = 8.8, p < 0.0001, Table 3), whereas no statistically significant difference was revealed concerning hotspot 4 (F = 2.1, p = 0.1111, Table 3). Regarding hotspot 1, there was a statistically significant difference in measurements for the hotspot to ventral versus apical points (CI −9.6 to −3.7), to dorsal versus apical points (CI −10.0 to −4.1), and to apical versus caudal points (CI 1.2 to 7.2), whereas statistically significant differences were revealed for the hotspot to ventral versus dorsal (CI 1.0 to 6.9), to dorsal versus apical (CI −9.5 to −3.6), and to dorsal versus caudal points (CI −6.6 to −0.7) for hotspot 2. Concerning hotspot 3, there was a statistically significant difference in measurements for the hotspot to dorsal versus apical points (CI −11.8 to −3.6) and to dorsal versus caudal points (CI −9.5 to −1.3).

Discussion

Current knowledge

Only a few publications investigated the resolution of TMS up to now [2124]. The publications of Opitz et al. [21] and Thielscher et al. [22] calculated the electrical field induced by TMS in the human brain and came to the conclusion that the field strength is significantly enhanced when the currents run perpendicular to the stimulated gyrus. Moreover, this effect was demonstrated to be primarily restricted to the gyral lips and crowns, but it did not extend into the walls of the sulci, which means that the focality of the electrical field is increased [22]. The authors Bijsterbosch et al. [23] systematically investigated the impact of various TMS coil positions on electrical field shaping, finding that, at least for most coil positions, the induced field includes the target region of stimulation but is not distinctly restricted to it. According to their publication, the distribution of subarachnoid cerebral fluid and gyral geometry predominantly influence the modelling of the electrical field in the human brain [23].

Although the described publications represent extensive and valuable contributions to the current knowledge about the spatial resolution of TMS, they neither explicitly focused on cortical language mapping, nor did they examine the spatial resolution from a functional point of view, meaning that they did not particularly aim for a differentiation of functionally relevant from irrelevant cortical spots [2123]. Under these premises, examination of the spatial resolution of rTMS-based language mapping seemed mandatory.

Gyrus identification

The average width of a cortical gyrus in the adult human brain varies between 10 and 20 mm, although there can be distinct inter-individual variations. Therefore, when the central point of a figure-of-eight TMS coil is placed perpendicular to the subject’s skull and in the middle of a certain gyrus, the center of stimulation should primarily hit the stimulated gyrus, as our mean spatial resolution measurements for each of the four directions accounted for values clearly under 20 mm (Table 3; Fig. 4). As an important result, we can state that specific gyri can be targeted and identified by rTMS, at least when stimulation is applied to the gyrus center with the protocol used in the present study.

Furthermore, because the spatial resolution of rTMS does not exceed the average gyrus width and a particular gyrus can be targeted by this modality, the common practice of parcellating the cortical surface into subregions according to its gyral structure (e.g., as is done for the CPS) seems to be a reasonable approach for rTMS research.

Electrical field strength

In general, a higher electrical field strength induced by rTMS should principally cause a lower spatial resolution and vice versa. The mean electrical field strengths at the four hotspots were comparable, and there was no statistically significant difference revealed. Thus, our approach including a comparison of hotspots being located within different remote cortical gyri seems reasonable because the applied electrical field does not significantly vary between these spots, and therefore, minimum measured distances seem comparable. However, statistical analysis of the four hotspots revealed a statistically significant difference of the mean distances between the hotspots and adjacent stimulation spots (Table 3), highlighting that the mean differences in distance measurements to the hotspot were different for the respective adjacent mapping points. This might show that spatial resolution of rTMS for language mapping—at least within a certain range—depends on the localization of cortical stimulation, although the applied electrical field strength does not significantly change during mapping. Yet, this effect might also be due to the chosen distribution of our cortical stimulation spots, and we can state that the minimally measured spatial resolution of rTMS was 11.1 ± 5.5 mm (Table 3).

Comparison to other modalities

Overall, cortical language distribution has been under extensive investigation during the last decades. In this context, one frequently used technique for the identification of cortical sites related to language function is functional magnetic resonance imaging (fMRI). In its most common form, this approach compares the blood oxygenation level dependent (BOLD) signal during a language-related task performance to the signal obtained during resting-state measurement within the same subject [25, 26]. Then, the systematic comparison of both signal datasets allows drawing conclusions about the localization of individual language-related brain areas. Overall, fMRI is characterized by comparatively good spatial resolution, and a recent study on nTMS-based motor mapping has demonstrated that the spatial differences between nTMS- and DCS-positive cortical spots and fMRI- and DCS-positive points are both in the range of millimeters, although better results were observed for nTMS- versus DCS-positive spots (10.5 ± 5.7 vs. 15.0 ± 7.6 mm) [27].

Another neuroimaging modality for the investigation of cortical language representation is magnetoencephalography (MEG), which has been particularly used for determination of language lateralization over the last years [28, 29]. In this context, cortical activation in response to a language-related task performance goes along with a local rise in neuronal signaling, which is characterized by an increased flow of intracellular ions mediating associated magnetic fields that can be measured at the scalp surface in the form of event-related potentials [30]. However MEG has failed to show sufficient correlation with language maps generated by rTMS or DCS, at least partly due to its lower spatial resolution, which is typically limited to several centimeters [1, 31].

When it comes to DCS, which is regarded as the current gold standard for mapping human cortex function, language performance has been repeatedly tested during awake surgery [3234]. Interestingly, Haglund et al. [33] showed that radical brain tumor resection without postoperative permanent language deficits can only be achieved reliably when the resection border is at least 10 mm away from the nearest language site determined by DCS. Thus, 10 mm seems to be the spatial resolution of DCS-based language mapping during awake surgery. As our findings show, the spatial resolution of rTMS-based language mapping is slightly above 10 mm (Table 3; Fig. 4), which is comparable to the results of DCS [33].

Further considerations

Regarding the distribution of no responses across the hemisphere, the opIFG and trIFG, which should roughly overlap with the classical Broca’s area, were not characterized by a high rate of errors during rTMS. However, these regions showed error rates to a higher extent when it comes to other error categories, which were incorporated into analysis of the same dataset in other publications focusing on different purposes [11, 12]. Hence, considerable error rates were achieved during mapping of the opIFG and trIFG, but belonging to other types than no responses. As a possible explanation for the low rate of no responses in these regions when compared to previous investigations [68, 20], the stimulation frequency of 7 Hz (instead of 5 Hz) has to be considered, but the set of objects, which was different when compared to these previous publications, might have also played a role. However, the distinct cause for the specific pattern of no responses observed in the present study cannot be fully assessed within the scope of our approach.

Although the resolution measurements of the present study can be discussed in the light of previous literature on DCS-based language mapping, there is no opportunity to directly compare rTMS to DCS results on an intra-individual basis in healthy subjects due to the highly invasive character of DCS. As a consequence, the language-positive areas mapped in the present study cannot be verified by the gold standard method, and a final decision about the possible essential character of a particular language-positive site cannot be made due to the comparatively high sensitivity and low specificity of rTMS [10]. However, this limitation can at least partly be overcome by the results of recent studies that showed a much higher specificity and positive predictive value between language maps generated by both mapping techniques using the CPS, which again provides spatial resolution data on a gyri level [1, 2]. To draw more definite conclusions concerning resolution, a prospective study including preoperative rTMS and intraoperative DCS mapping among patients should be the next step.

In the recent years, the differentiation of naming errors into various predefined error categories has been demonstrated repeatedly within the scope of rTMS and DCS trials [4, 6, 8, 16, 20, 34]. Although other error categories except no responses are based on stimulation-induced disruption of important language subfunctions as well, we decided to not take these error types into account since no responses have proven to be among the most frequent error types [6, 20]. Incorporation of other error types could have led to different hotspots, but potentially on the basis of more unclear language disruption. Therefore, our current approach seems to be reasonable within the scope of one of the first studies regarding the exploration of rTMS-based language mapping resolution. However, we have to be aware of the fact that upcoming studies systematically analyzing other categories might further refine our minimum resolution measurements. Additionally, the spatial resolution of rTMS should also be evaluated for mapping of other cognitive functions since such approaches are currently emerging. In this context, non-invasive assessment of calculation functions by rTMS has been successfully performed recently, but spatial resolution for this purpose is vastly unknown [35].

Furthermore, it is already known that rTMS mapping results and electrical field shaping depend on various mapping parameters, like the stimulation frequency or the angulation and shape of the magnetic coil, for example [1, 11, 17, 36]. As a consequence, it cannot be the claim of the present study to define the definite spatial resolution of rTMS-based language mapping independently from setup factors. Instead, this study at least provides some practical orientation on the minimum spatial resolution for a commonly used mapping protocol, which has revealed to be reliable in terms of inducing transient errors during object naming while being well tolerable for the individual subject.

Another potential limitation could be the fact that distance measurement was only performed for predefined cortical points. In this context, the rTMS resolution measurement was limited by the distance between single stimulation points of the CPS, meaning that areas in-between the different stimulation spots were not systematically examined. Thus, the results presented in the current approach reflect a minimum resolution measurement. Consequently, it might be possible that the definite resolution of rTMS is finer-grained. Yet, all cortical points of stimulation were placed manually at the center of the gyri, meaning that finer-grained mapping might also have targeted sulci instead of cortical tissue, which is generally not regarded as adequate for the rTMS-based language mapping approach per se [6, 8]. We decided to follow the presented kind of approach because one major objective was to investigate the required cortical distance to distinguish between hotspots (stimulation points with a high error rate) and adjacent mapping points (stimulation points with low error rates) in a practical setup. Basically, this method is similar to the measurement presented in Haglund et al., which represents a key reference for the accuracy of DCS-based language mapping during awake surgery [33].

Conclusions

The present study examined the spatial resolution of rTMS-based language mapping via distance measurement. In this context, rTMS is able to differentiate between hotspots (stimulation points with a high error rate) and adjacent mapping points (stimulation points with low error rates) at a minimum distance of 11.1 ± 5.5 mm as measured for a parietal hotspot. According to these distance measurement results, the spatial resolution of rTMS should principally allow for the identification of a particular gyrus as it is the case for DCS-based language mapping during awake surgery as the current gold standard.