Keywords

1 Introduction

Embodied cognition theories propose that modal brain regions involved in perception and action are likewise involved in the processing and storage of semantic memory traces (Barsalou, 2008) as opposed to the classical view of amodal brain regions storing semantic memory traces in the form of symbols (for a review see Meteyard, Rodriguez Cuadrado, Bahrami, & Vigliocco, 2012). Specifically, the motor component inherent to language containing action concepts triggers a simulation of the implied movement in sensorimotor areas which is likely reflected in increased activation (Kiefer & Pulvermüller, 2011). The association between perceptual-motor and more cognitive brain areas can be explained through learning experiences, but the precise role of the re-activation of these areas during semantic processing is still under debate (Pulvermüller, 2018). For instance, it is important to elucidate in which detail semantic processing is supported by sensory-motor areas. This issue can be operationalized experimentally in different ways. The current study used adverb-verb combinations to modify the underlying action concept during verbal processing. Behavioral and neurophysiological data were examined in order to find possible interactions of adverbial context with verb processing. These interactions would argue for the contribution of motor cortical areas to language processing being specific and detailed, going beyond superficial epiphenomena.

In turn, theories of language and semantic processing are continuously revised to accommodate empirical findings concerning the functional and neuroanatomical grounding of semantic memory in modality-specific areas (Barsalou, 2008; Binder & Desai, 2011; Pulvermüller, 2018). The current study was aimed at contributing to this discussion by investigating one aspect of embodied cognition, namely the potential interaction between action verb processing and a modifying adverb.

Previous studies reported motor activation either during language processing of action sentences (Aziz-Zadeh, Wilson, Rizzolatti, & Iacobini, 2006; Boulenger, Hauk, & Pulvermüller, 2009; de Vega, Léon, Hernández, Valdés, Padrón, & Ferstl, 2014; Tettamanti et al., 2005) or action verbs (Hauk, Johnsrude, & Pulvermüller, 2004; Kemmerer, Castillo, Talavage, Patterson, & Wiley, 2008; Yang & Shu, 2011). Motor activation seems to be somatotopical in the sense that action verbs implying the movement of a specific extremity elicit activation in corresponding cortical motor areas (Hauk et al., 2004; Pulvermüller, 2005). On the other hand, another study reported no somatotopical activation in primary and premotor areas but rather found action-related activation in the pre-SMA potentially holding an abstract representation of the action verbs in the form of instructional cues (Postle, McMahon, Ashton, Meredith, & de Zubicaray, 2008).

On behavioral level, an interaction between action-related language processing and motor execution emerged in altered kinematic measures (Boulenger, Roy, Paulignan, Deprez, Jeannerod, & Nazir, 2006; Dalla Volta, Gianelli, Campione, & Gentilucci, 2009) and in reaction time (Buccino, Riggio, Melli, Binkofski, Gallese, & Rizzolatti, 2005; Sato, Mengarelli, Riggio, Gallese, & Buccino, 2008). Conversely, motor output was shown to affect action language processing (Rüschemeyer, Lindemann, van Rooij, van Dam, & Bekkering, 2010). Depending on the task and stimulus timing both facilitation (Andres, Finocchiaro, Buiatti, & Piazza, 2015; Glenberg & Kaschak, 2002; Klepp et al., 2017; Scorolli & Borghi, 2007) and interference or prolongation (Boulenger et al., 2006; Klepp, Niccolai, Buccino, Schnitzler, & Biermann-Ruben, 2015; Sato et al., 2008) of motor behavior (i.e. response times) were found.

The demonstrable engagement of motor areas is also evident in studies showing an impairment of action-related language processing in patients suffering from Parkinson’s Disease (Fernandino et al., 2013; Herrera, Rodríguez-Ferreiro, & Cuetos, 2011) and Amyotrophic Lateral Sclerosis (Bak, O’Donovan, Xuereb, Boniface, & Hodges, 2001; Grossmann et al., 2008). These impairments indicate an initially important role of motor areas in the ontogenesis of language acquisition (Perniss & Vigliocco, 2014), while neurological disorders affecting motor areas seem to impede an efficient and complete access to semantic memory traces in later life. This can, however, be still partly compensated for by other brain areas (Pulvermüller, 2018). Yet, these results suggest a substantial contribution of motor areas to action-related language processing.

Aside from somatotopy, there is further evidence that sensorimotor involvement in language processing is specific and detailed. For instance, it may reflect semantic features of verbal material: The amount of effector-specific movement affected verb-motor priming (Klepp et al., 2017). Additionally, functional magnetic resonance imaging (fMRI) activity in parietal areas within the motor network was modulated by the specificity of action plans described by verbs (van Dam, Rüschemeyer, & Bekkering, 2010). Activity in pre-motor areas can reflect motor features described in action sentences, e.g. the degree of physical effort the described action requires as determined by a verb-object combination (Moody & Gennari, 2010).

In natural language, however, important cues about the precise implied action may not only come from the verb itself, but from other sources such as objects and adverbial constructions. The linguistic focus hypothesis (LFH) postulated by Taylor and Zwaan (2008) suggests that motor simulation, which is assumed in the theoretical framework of embodied cognition theories regarding action-related language processing, is dependent on the linguistic focus. If the described action is maintained within the linguistic focus, e.g. through action-modifying adverbs, motor simulation of the action occurs beyond the action verb itself and continuation of motor activity should be observed; if the linguistic focus is shifted away from the action, e.g. through agent-modifying adverbs, no motor simulation occurs and termination of motor activity should be observed. The study conducted by Taylor and Zwaan (2008) demonstrated that adverbs can influence reading times. Participants were instructed to read a sentence frame by frame by turning a knob either in clockwise or counter-clockwise direction. The sentences contained hand action verbs depicting either clockwise or anticlockwise movements followed by an adverb either modifying the action (e.g. quickly) or the agent (e.g. happily). Facilitation of reading times occurred in direction-matching versus non-matching verb-response conditions and when adverbs modified the action instead of the agent.

Yet it remains unclear, if the motor simulation of the action verb is differentially affected by different kinds of adverbs. For example, despite relating to a bodily action, action verbs might not contain lexically specific information about the amount of force with which the action is executed (Goldschmidt, Gamerschlag, Petersen, Gabrovska, & Geuder, 2017). This questionnaire study examined whether the force component in the German action verb “schlagen” (to hit) could be modulated in sentences containing adverbs. Crucially, force-denoting manner adverbs (lightly/hard) directly modified the action’s force component in the direction suggested by the adverb. Moreover, force modification may also be achieved by inferences through agent-oriented adverbs (Goldschmidt et al., 2017). The current study uses manner adverbs denoting a clear attenuation or intensification of either the force component or the speed component, thus expected to directly modify the action described by verbs. Note that the term “force” is used as a synonymous expression for “intensity” and is not used in terms of causality, as it is in the linguistic field of “force semantics” and “force dynamics”.

Based on these findings we investigated in two separate experiments if adverbs further influence reaction times in a well-established priming paradigm containing hand/foot action verbs and hand/foot responses (Klepp et al., 2017). In both experiments, the interaction of verb type and response effector was anticipated as a priming effect resulting in faster response times for congruent verb-response conditions. We furthermore introduced intensifying and attenuating manner adverbs as an additional factor. The previously observed interaction of verb type and response effector was hypothesized to be more pronounced when the action verb was combined with an intensifying compared to an attenuating adverb resulting in even faster response times in congruent verb-response conditions. Adverb-verb (Experiment 1) and verb-adverb (Experiment 2) order of presentation realized in two separate experiments additionally allowed investigating the influence of the time point of adverb presentation with respect to the action verb processing stream.

A useful technique to investigate the time course of activation with respect to action verb processing is EEG. Increased activity in motor areas is typically accompanied by increased desynchronization in the mu (10–15 Hz) and the beta band (15–25 Hz). This oscillatory pattern is generally associated with motor preparation and execution (Pfurtscheller & Lopes da Silva, 1999; Pfurtscheller, Neuper, Andrew, & Edlinger, 1997). Typically, desynchronization increases, reaching a peak during response execution, while a rebound consisting in increased synchronization is found about a second after movement offset (Pfurtscheller, & Lopes da Silva, 1999). A similar pattern has also been observed during action verb processing (van Elk, van Schie, Zwaan, & Bekkering, 2010; Moreno, de Vega, Léon, Bastiaansen, Lewis, & Magyari, 2015; Niccolai, Klepp, Weissler, Hoogenboom, Schnitzler, & Biermann-Ruben, 2014). Neural oscillatory and event-related potential (ERP) effects related to the presentation of action verbs have been reported as early as 170–250 ms after word onset (Pulvermüller, Härle, & Hummel, 2000, 2001; van Elk et al., 2010), displaying somatotopy (Hauk et al., 2004; Pulvermüller et al., 2001). This pattern of results suggests that motor areas contribute to the processing of verbal linguistic action stimuli.

We therefore conducted EEG measurements in Experiment 2 to gain further insights in the processing of the action verb and its possible interaction with the adverb. According to the somatotopic organization of the motor cortex, differential effects for hand and foot responses were hypothesized in electrode sites C3 and Cz, respectively. We expected stronger hand-related activity at electrode site C3 and stronger foot-related activity at electrode site Cz. This approach has been successfully used regarding evoked EEG activity during action verb reading before (Hauk & Pulvermüller, 2004; Pulvermüller et al., 2000, 2001). We specifically focused on the mu and beta band in our study due to their role in motor processes (Pfurtscheller & Lopes da Silva, 1999) and action-related language processing (Klepp et al., 2015, Moreno et al., 2015; van Elk et al., 2010). Thus, we expected the mu and beta desynchronization around the onset of the response to be reduced in congruent conditions due to the priming effect of the action verbs (Grisoni, Dreyer, & Pulvermüller, 2016; Schacter, Wig, & Stevens, 2007).

As the manner adverb directly modified the action verb which elicits motor activity if semantically processed, we expected the adverb to further modulate the language-motor interaction. The interaction of verb type and response effector was hypothesized to be more pronounced when the action verb was combined with an intensifying compared to an attenuating adverb, resulting in reduced mu and beta desynchronization in congruent verb-response conditions.

Methods and Results of Experiments 1 and 2 will be reported separately followed by a joint discussion.

2 Experiment 1

2.1 Methods

2.1.1 Participants

Thirty-two participants (eleven male; mean age = 24.97 years, SD = 6.71) were included into the study. Exclusion criteria were academic linguistic expertise, history of prior neurological or psychiatric disorders and medication affecting the central nervous system. Participants were monolingual German native speakers with normal or corrected-to-normal vision. Right-handedness and footedness was assessed and confirmed. Hand dominance was assessed with the Hand Dominance Test (HDT, Steingrüber, 2011), as well as with the German version of the Edinburgh Handedness Questionnaire (EHI, Oldfield, 1971). Right footedness was tested with a self-report questionnaire extracted from the Lateral Preference Inventory (LPI, Ehrenstein & Arnold-Schulz-Gahmen, 1997).

This experiment is in accordance with the Declaration of Helsinki and was approved by the ethics committee of the Medical Faculty at Heinrich-Heine-University Düsseldorf (study number: 3400). All subjects gave written informed consent before the beginning of the experiment and received course credit or financial reimbursement.

Two subjects were excluded due to data loss. Three subjects exceeded our criterion of at most 10% incorrect responses during the experiment. After the experimental session, one participant reported taking medication affecting the central nervous system and was also excluded. The final set of participants in Experiment 1 therefore consisted of 26 subjects (nine male, mean age = 25.28 years, SD = 7.39).

2.1.2 Stimuli

We used a total of 36 disyllabic German verbs and eight German adverbs (for an overview see Table 1). The verb set consisted of three categories with 12 verbs each: manual actions, e.g. “klatschen” (to clap), foot actions, e.g. “rennen” (to run) and abstract actions, e.g. “denken” (to think). We used a subset of verbs out of a previous selection which had been based on successive rating and matching procedures (details see Klepp et al., 2014) including familiarity, imageability and movement energy, as well as word length and verb frequency (Leipzig Corpora Collection, LCC, Biemann, Heyer, Quasthoff, & Richter, 2007, available at http://wortschatz.uni-leipzig.de). As indicated by a multivariate ANOVA with verb category as an independent variable the final set of verbs differed with regard to some of these variables, but only in comparison to the abstract verb category. This category served as the No Go-condition, however and was not further analyzed. Importantly, hand and foot verbs did not differ significantly (all p > 0.087).

Table 1 The stimulus set consisted of 36 verbs and 8 adverbs. All adverbs and verbs are reported in German and English, respectively, although only the German version was used in the study

Twenty-five adverbs entered a rating process (n = 4) serving to redefine stimuli selection. Participants were asked to evaluate the probability of verb and adverb going together (from “not at all” to “absolutely”). This was termed as the semantic fit of adverb-verb combinations. Adverb selection was based on their semantic fit with the previously selected set of action verbs, as well as the possibility to define opposed pairs of intensifying and attenuating adverbs, e.g. “kräftig”—“kraftlos” (stronglyfeebly). This resulted in the subsequent inclusion of eight out of the initially selected 25 adverbs, four of which strengthening or accelerating the movement implied by the verb (intensifying adverbs) and four weakening or slowing the implied action (attenuating adverbs) . Exact Mann-Whitney-U-Test revealed no significant differences in the frequency of intensifying and attenuating adverbs (U = 14.50, p = 0.343) nor differences in their semantic fit to hand verbs (U = 6.50, p = 0.645) or foot verbs (U = 5.00, p = 0.381), respectively. Please note that linguistically all adverbs are adjectives applied in an adverbial manner of use. For the sake of simplification we will refer to them as “manner adverbs” in this article.

2.1.3 Procedure

Subjects were seated at a distance of 95 cm from a computer screen (ASUS VG248, ASUS Computer International, Fremont, California, USA) with a keyboard in front of them and a foot pedal (USB Triple Foot Switch II; Scythe, Tokyo, Japan) positioned under the table. All trials started with a black background screen containing a white fixation cross at the center presented for 1200 ms. This was followed by the presentation of a mask consisting of two horizontal lines of seven white ‘X’ for a jittered interval between 400 and 700 ms. Thereafter, an adverb was presented in white letters pseudorandomly above or below the fixation cross for 400 ms together with the remaining upper or lower line of seven ‘X’. The verb followed in white letters replacing the latter seven ‘X’ and the adverb-verb combination was displayed together for another 400 ms. Then, the stimuli turned either blue or yellow. Subjects were instructed to respond as fast and accurately as possible with the hand or the foot according to the color of the adverb-verb combination, but only if the verb expressed a concrete bodily action. Participants were pseudorandomly assigned to one of two groups of color change instructions: color change to blue required a hand response by pressing the ‘space’-key on a keyboard while a color change to yellow required a foot response by pressing down a foot pedal for 50% of the subjects. For the other 50%, the assignment was reversed. The experimental procedure of each trial is shown in Fig. 1A.

Fig. 1
figure 1

Experimental procedure. A Experiment 1. B Experiment 2

Pseudorandomization of the spatial position of verb and adverb was introduced to prevent participants from adopting a strategy to solely attend to the stimulus relevant for solving the experimental task, i.e. the verb in Experiment 1 and the adverb in Experiment 2, respectively. The spatial predictability of these stimuli could have resulted in impaired semantic processing of adverb-verb-combinations, which we tried to preclude by variation of the spatial positions.

Each adverb was combined with each verb and presented once with each type of color change resulting in a total of 576 trials per subject. The experiment lasted about 50 min. Stimuli were presented using Presentation 14.9 software (Neurobehavioral Systems, Albany, California, USA).

2.1.4 Statistical Analysis

We computed a linear mixed effect model using the package lme4 (version 1.1–13, Bates, Maechler, Bolker, & Walker, 2015) for R (version 3.3.3) including crossed random effects for subjects and items. This method is especially advantageous for studies incorporating psycholinguistic stimuli since it is assumed that not only participants but also the items are randomly drawn from a population (Baayen, Davidson, & Bates, 2008). Linear mixed effect models allowed the inclusion of the two-level factor verb (hand, foot), the two-level factor adverb type (intensifying, attenuating) and the two-level factor response effector (hand response, foot response). Thus, the fixed effects included the factors verb, adverb type and response effector, as well as their two-way and three-way interactions. Random effects for participants included random intercepts with random slopes for the factors verb, adverb type and response effector. Random effects for items only included random intercepts. All analyses used logarithmically transformed reaction times of correct responses within 150 to 1500 ms. T-values below −2 or above 2 are considered to represent significant effects. Post hoc tests were calculated using the package lsmeans (version 2.25–5, Lenth, 2016).

2.2 Results

2.2.1 Behavioral Data

Errors and responses faster than 150 ms or slower than 1500 ms after the Go-signal onset were excluded resulting in the exclusion of 358 trials (3.59%). The Go-signal is defined as the cue stimulus prompting a response, i.e. here the color change of the adverb-verb-combination. Raw data are shown in Fig. 2A. We observed a significant main effect of response effector (t = 4.86) with faster hand responses than foot responses. The main effect of adverb type was significant (t = 4.01) as well, with faster responses following intensifying adverbs compared to attenuating adverbs. Furthermore, the interaction between verb and response effector was significant (t = −11.20). Post hoc tests indicated significantly faster (z = 3.25, p = 0.001) hand responses following hand verbs compared to foot verbs and significantly faster (z = −3.76, p < 0.001) foot responses following foot verbs compared to hand verbs. The hypothesized three-way interaction of verb, response effector and adverb type was not significant (t = −0.32). All model estimates are given in Table 2. Fitted model parameters (±SD) for verb x adverb type x response effector are depicted in Fig. 3A.

Fig. 2
figure 2

Data distribution in Experiment 1 (A) and Experiment 2 (B). Raw data is split for verb conditions (hand verb, foot verb) on the x-axis, y-axis indicates the response times in milliseconds. Data is furthermore split according to the response effector (hand responses, foot responses) following intensifying (blue) and attenuating (red) adverbs. Red and blue lines indicate mean values in the respective conditions

Table 2 Results of statistical analyses of behavioral data. Model estimates (β), standard error (SE) and t-values are reported for Experiment 1 (left) and Experiment 2 (right). Significant effects are bold
Fig. 3
figure 3

Effects of Experiment 1 (A) and Experiment 2 (B). The 2-level factor verb (hand verb, foot verb) is denoted on the x-axis, y-axis indicates log-transformed response times. Data are split according to adverb type (intensifying adverb, attenuating adverb) and response effector. Circles, squares and error bars indicate fitted model parameters with standard deviation

3 Experiment 2

3.1 Methods

3.1.1 Participants

Seventeen participants (six male; mean age = 24.82 years, SD = 4.90) were included into the study. Exclusion criteria were the same as in Experiment 1. Handedness and footedness was assessed as in Experiment 1. The experiment is in accordance with the Declaration of Helsinki and was approved by the ethics committee of the Medical Faculty at Heinrich-Heine-University Düsseldorf (study number: 3400). All subjects gave written informed consent before the beginning of the experiment and received course credit or financial reimbursement.

Two subjects were excluded due to unidentifiable artifacts in the EEG recordings and excessive eye blinks during stimulus presentation. The final set of participants in Experiment 2 consisted of 15 subjects (six male, mean age = 24.93 years, SD = 5.23).

3.1.2 Stimuli

The stimulus set included the same adverbs and concrete verbs as in Experiment 1, i.e. 24 concrete verbs (twelve hand verbs and twelve foot verbs) and eight adverbs (four intensifying adverbs, four attenuating adverbs).

3.1.3 Procedure

All subjects participated in two separate experimental sessions at least seven days apart. Sessions differed regarding the instructions: in one session subjects had to react with their right hand to intensifying adverbs and with their right foot to attenuating adverbs. Reaction times were recorded. The response effector-adverb type relationship was reversed in the other session. The order of sessions was counterbalanced across subjects. Experimental sessions were conducted in an electrically shielded room. The experimental setup was the same as described in Experiment 1 with only few changes. First, the verb preceded the adverb. Second, the onset of the adverb instantaneously cued the response effector according to the respective instructions of the current session, i.e. there was no color change. Third, the verb-adverb combination was presented until response onset or maximally 1600 ms. The trial design is depicted in Fig. 1B. Each adverb was paired with each verb and each combination was shown twice, thus resulting in a total of 384 trials per subject. Each experimental session lasted about 30 min. Stimuli were presented using Presentation 14.9 software (Neurobehavioral Systems, Albany, California, USA) in white font on a black background.

3.1.4 EEG Data Acquisition

We recorded the EEG signal with 29 Ag–AgCl electrodes mounted in an elastic cap (EASYCAP GmbH, Herrsching, Germany), according to the 10/20 system. The average of right and left mastoid was used as reference and electrode position AFz as ground. Vertical EOG was recorded using bipolar electrodes. EEG-signals were amplified with a BrainAmp MR Plus amplifier (Brain Products, Munich, Germany). A sampling rate of 1000 Hz and an online high-pass filter of 0.3 Hz were applied. Impedance of all electrodes was kept below 10 kΩ. EEG and EOG signals were registered with BrainVision Recorder (Brain Products GmBH, Munich, Germany).

3.1.5 EEG Data Processing

Neurophysiological data were analyzed with Fieldtrip (version 20160629, Oostenveld, Fries, Maris, & Schoffelen, 2011), an open source toolbox for Matlab (version R2016a, Mathworks, Natick, Massachusetts, USA). Episodes include time-windows from 2 s before verb onset to 0.5 s after response onset. A semi-automatic artifact detection routine was applied to identify electrode jumps and muscle artifacts. A lowpass filter at 120 Hz was applied and line noise at 50 and 100 Hz filtered out. Trials were visually inspected for blink artifacts in the critical time window between verb and response onset as well as for non-EOG artifacts in the whole trial. Trials containing artifacts were rejected. Remaining blink artifacts in the baseline or post-response period were removed using independent component analysis (ICA).

Data were subsequently split into eight conditions defined by adverb type (intensifying vs. attenuating), verb (hand vs. foot) and response effector (hand vs. foot) and entered a time-frequency analysis. To discern and investigate semantic processing around adverb onset and motor processes during response execution more closely, we conducted two analyses locked to adverb and response onset, respectively. In both analyses data were aligned to the respective event, with 0 either denoting the onset of adverb or response. Time-frequency representations (TFRs) were computed in steps of 2 Hz from 2 to 30 Hz using a Fourier transformation. We applied a single Hanning taper with a width of 5 cycles, sliding in steps of 40 ms. Data were baseline-corrected using a time window of ta = −1.3 to −0.8 s for the adverb-locked analysis and tr = −1.5 to −1.0 s for the response-locked analysis.

3.1.6 Statistical Analysis

3.1.6.1 Behavioral Data

The linear mixed effect model contained the two-level factors verb (hand, foot), adverb type (intensifying, attenuating) and response effector (hand response, foot response). Fixed effects included the factors verb, adverb and response and their interactions. Random effects for participants included random intercepts for subjects and random slopes for the factors verb, adverb and response. Random effects for items only included random intercepts. Logarithmically transformed reaction times of correct responses within 150 to 1500 ms entered the analysis. T-values below −2 or above 2 are considered to represent significant effects. Post hoc tests were carried out using the package lsmeans (version 2.25-5, Lenth, 2016).

3.1.6.2 EEG Data

To statistically analyze the EEG data we computed pseudo-t-values for each participant to normalize individual differences (compare Lange, Oostenveld, & Fries, 2011). These t-values were then transformed into z-values to account for different number of trials in each condition (Klepp et al., 2015; see van Dijk, Nieuwenhuis, & Jensen, 2010). Then we applied a non-parametric statistical procedure to assess significant differences on the group level. This non-parametric randomization approach identifies clusters containing neighboring timepoints and frequencies while simultaneously correcting for multiple comparisons (Maris & Oostenveld, 2007).

Conditions were considered significantly different if the test statistic obtained from 5000 permutations resulted in an alpha-level below 0.05. We defined the relevant contrasts of conditions based on the resulting significant behavioral effects. In addition, we investigated if semantic priming is mirrored in reduced desynchronization in congruent verb-response conditions, as stated in our hypothesis. Electrodes C3 as proxy of the right hand and Cz as proxy of the right foot were analyzed separately.

3.2 Results

3.2.1 Behavioral Data

Errors and responses faster than 150 ms or slower than 1500 ms after adverb onset were excluded from further analysis resulting in the exclusion of 411 trials (3.57%). Raw data are shown in Fig. 2B. The mixed model analysis showed a significant main effect for response effector (t = 5.65) with faster hand responses than foot responses. A significant interaction between the factors adverb type and response effector emerged (t = −4.68). Post hoc tests revealed significant differences for hand responses (z = 2.448, p = 0.014) with faster hand responses following intensifying compared with attenuating adverbs. No difference emerged in the case of foot responses (z = 0.820, p = 0.412). The interaction between verb and response effector was not significant (t = −1.70). The three-way interaction of verb, response effector and adverb type was not significant (t = 0.82). All values are given in Table 2. Fitted model parameters (±SD) for verb x adverb type x response effector are depicted in Fig. 3B.

3.2.2 EEG Data

3.2.2.1 Adverb-Locked Analysis

A cluster between ta = 0.48 and 1 s after adverb onset (p = 0.003) indicated a significant difference between response effectors in the electrode Cz ranging from 17 to 30 Hz, i.e. stronger beta desynchronization for foot responses (Fig. 4). No effect was found in C3 (all p > 0.201). Neither adverbs nor any interaction with response effector or verb showed significant effects in electrodes C3 nor Cz (all p > 0.110).

Fig. 4
figure 4

Grandaverage EEG data and statistical contrasts relating to significant effects in the adverb-locked analysis of Experiment 2. The time in seconds is depicted on the x-axis with 0 denoting the onset of adverb, the frequency in Hertz is shown on the y-axis. Data is furthermore color-coded according to the power relative to baseline (left and middle column) or according to the z-value (right column) of the respective statistical comparison. The contrast shows hand- and foot response-related activity in electrode Cz with the significant cluster outlined in black.

3.2.2.2 Response-Locked Analysis

A cluster at tr = −0.16 to 0.32 s (p < 0.001) indicated a significant difference between hand and foot responses in electrode C3 ranging from 9 to 30 Hz showing stronger mu and beta desynchronization for hand responses (Fig. 5A). Complementarily, in electrode Cz, a cluster indicated a significant difference between the hand and foot condition at tr = 0–0.4 s after response onset (p = 0.002) ranging from 15 to 30 Hz, i.e. stronger beta desynchronization for foot responses (Fig. 5B). A significant cluster (p = 0.019) in electrode C3 showed that in the case of foot responses, the hand verb condition showed significantly more beta desynchronization than the foot verb condition at tr = −0.64 to 0.16 s ranging from 12 to 18 Hz (Fig. 5C). No effect was observed in electrode Cz (p > 0.082). No corresponding effect emerged for hand responses (all p > 0.116).

Fig. 5
figure 5

Grandaverage EEG data and statistical contrasts relating to significant effects in the response-locked analysis of Experiment 2. The time in s is depicted on the x-axis with 0 denoting the onset of response, the frequency in Hz is shown on the y-axis. Data is furthermore color-coded according to the power relative to baseline (left and middle column) or according to the z-value (right column) of the respective statistical comparison. A: Hand response- and foot response-related activity in electrode C3 with the significant cluster outlined in black. B: Hand response- and foot response-related activity in electrode Cz with the significant cluster outlined in black. C: Foot response-related activity following hand verbs and foot verbs with the significant cluster outlined in black.

3.3 Discussion

Our results show an influence of manner adverbs on motor behavior. In Experiment 1, we found a significant main effect of adverb indicating faster responses following intensifying compared with attenuating adverbs. This effect might depend on the direct relation between the force component of the action verb and the manner adverb specifying the amount of force implied in the movement (Goldschmidt et al., 2017). Action verbs are reported to elicit motor activation (Hauk et al., 2004; Pulvermüller, 2005), especially when processed semantically (Klepp et al., 2017; Sato et al., 2008). Motor output interacts with action-related language processing because of shared neuronal circuits (Boulenger et al., 2006; Dalla Volta et al., 2009). Manner adverbs modifying an action verb might therefore modulate its elicited motor activation by modulating the amount of force implied in the action. As was shown in the case of imageability (Klepp et al., 2015) and effector-specific movement (Klepp et al., 2017), semantic features of the action verb might influence motor behavior. Cortical motor areas might therefore also be involved in the processing of semantic features of stimuli relating to the action verb. This is furthermore corroborated by the complementary results of Experiment 2. Here, though no significant main effect of adverb type emerged, manner adverbs interacted with the response effector. Hand responses following intensifying adverbs were significantly faster than hand responses following attenuating adverbs. Participants had to respond depending on the adverb type.

Comparing Experiments 1 and 2 it seems that the main effect of adverb type in Experiment 1 switched to an interaction between adverb type and response effector in Experiment 2. The main difference between these two experiments is the order of adverb-verb (Experiment 1) and verb-adverb (Experiment 2) presentation combined with the instruction cues color change (Experiment 1) and adverb type (Experiment 2). The relevance and subsequent psycho-linguistic processing of the adverb for successfully operating on the tasks hence was different in Experiments 1 and 2: In Experiment 1, priming of force components could have taken place resulting in a main effect of adverb type even though the semantics of the adverbs were of minor relevance. In Experiment 2 on the other hand, the semantics of the adverb were indicative for the required response potentially resulting in simulation processes directly interacting with response preparation. In addition, while both tasks prompted semantic processing of the verbal material, Experiment 2 might have increased the participants’ awareness of the semantic features of the manner by making them task-relevant. Studies concerning the mental timeline argued that mental simulations only occur during language processing, if the semantic features of the verbal material is task-relevant and the processor is aware of these features (Maienborn, Alex-Ruf, Eikmeier, & Ulrich, 2015; Ulrich & Maienborn, 2010). The increased awareness of the semantic features may have resulted in the more specific interaction between adverb and response. That the effect was only found for hand responses and not for foot responses might be attributable to the closer connection of the hand with language (Rizzolatti & Arbib, 1998). Another explanation could be that due to longer response times for foot responses the interaction with adverbs might fade with processing time. Still, foot responses to intensifying adverbs were numerically faster than to attenuating adverbs. Arguably, intensifying adverbs might increase motor activation per se thereby increasing the motor contribution to the processing of the action verb. This might reflect a semantic priming effect even for manner adverbs. This remains elusive, however, since no corresponding effect of manner adverbs was found in the neurophysiological data of Experiment 2. Differential motor activation relating to intensifying and attenuating adverbs could arise in studies focusing solely on the semantic processing of manner adverbs. Yet both experiments reported in this study incorporated manner adverb-action verb combinations which might have limited our ability to discern adverb- and verb-related processes regarding brain oscillations.

In addition to the effects of manner adverbs, action verbs interacted with motor behavior. In Experiment 1, facilitated hand and foot responses in congruent compared with incongruent verb-response combinations revealed a semantic priming effect, which is in line with previous findings (Scorolli & Borghi, 2007; Klepp et al., 2017). This is corroborated by the neurophysiological data recorded in Experiment 2. Results showed reduced beta desynchronization in electrode C3 for foot responses following foot verbs compared to hand verbs. As expected, the congruent condition presented with reduced motor activation (Grisoni et al., 2016; Schacter et al., 2007). The onset of the effect was about 600 ms before response onset, which would have allowed the action verb to be processed semantically and subsequently interacting with response execution (Kutas & Hillyard, 1984). However, the effect emerged in electrode C3 only, which was located approximately above the cortical motor hand area, whereas no complementary effect for hand responses emerged in electrode Cz. Furthermore, no verb x response effector interaction was visible in the behavioral data of Experiment 2. Hence, the observed differences might alternatively be accounted for by significant differences between hand and foot verbs, respectively. This might be due to the limited set of action verbs employed in this study. To reduce confounding effects of imageability, familiarity and frequency, we matched our verb set very carefully. However, this might have prevented us from mapping a wider range of possible differences in the action verbs, for instance with regard to their movement pattern as well as other linguistic features. Differences in brain oscillations for foot responses following hand and foot verbs might therefore alternatively be unrelated to a semantic priming effect but merely reflect an overall stronger desynchronization following hand verbs compared with foot verbs independent of the response effector.

Two further aspects should be discussed, namely the somatotopy of response effectors and timing. Hand and foot responses were required in a double-dissociation paradigm. As visible in the behavioral data of both experiments, hand responses were overall faster than foot responses, which is in line with previous findings (Buccino et al., 2005; Gianelli & Dalla Volta, 2015; Klepp et al., 2017). Experiment 2 indicated differential motor activation for hand and foot responses in electrodes C3 and Cz in analyses around adverb and response onset. Oscillatory differences arose predominantly in the beta frequency range in a time window relating to response execution. Crucially, stronger desynchronization for hand responses was observed in electrode C3, while stronger desynchronization for foot responses emerged in electrode Cz. Our results thus demonstrate somatotopical activity differences related to the respective response effector, as hypothesized. This should have allowed for the detection of differential EEG effects of verbs and adverbs for the two response effectors. Indeed, stronger desynchronization for hand than foot verbs preceding foot responses in electrode C3 was found, but not the full pattern of effects expected from the double dissociation paradigm. One straightforward explanation may be that when no behavioral effects were found, there simply might have been no differences in neurophysiological processing to be measured by EEG. Note, however, that neurophysiological effects are sometimes reported in the absence of behavioral differences (Mollo, Pulvermüller, & Hauk, 2016). Nevertheless, the paradigm of Experiment 2 may be not optimally suited for the detection of language-motor priming effects. More specifically, the temporal proximity between manner adverb and hand/foot responses onset in Experiment 2 might have been too close to discern oscillatory differences in the semantic processing of manner adverbs. Instead, potentially subtle activity differences relating to the processing of the manner adverb might have been overshadowed by motor activation induced during response execution processes. In addition, potential activation differences might have been observable in other electrodes, especially located above other language-related brain areas, e.g. temporal regions; these regions might also reflect differences based on the type of manner adverb or its interaction with action verbs.

An important concern in the comparison between the effects of Experiment 1 and Experiment 2 is the temporal structure of stimulus presentation and the average response time. In Experiment 1, the average time interval between adverb and response onset was 1300 ms while the average time interval between action verb and response, on the other hand, was only 900 ms. There was a priming effect of verb effector, but only an unspecific effect of adverb type. Thus, Experiment 2 was designed to induce more semantic interaction with the hypothesis to find an interaction of priming and adverb type, reflected in neurophysiological data. The reversal of verb and adverb presentation order also implied that the average time interval between verb presentation and response was 1150 ms, with 750 ms between adverb and response onset. Action verbs did not influence response times possibly due to the prolonged interval between verb and response onset. Accordingly, stimulus-response intervals likely modulated the effects observed in the two experiments.

Further, a relatively small sample size and the inclusion of only two EEG electrodes in the statistical analyses might have limited the power of our results. Inclusion of a greater sample to increase effect sizes and a greater number of EEG electrodes could lead to a more detailed picture regarding the interplay of action-related language processing and motor activity and its modulation by manner adverbs.

Future studies should furthermore investigate which semantic aspects of manner adverbs potentially elicit motor activation, e.g. differentiating between force and velocity, providing closer insights into the extent of motor involvement in language processing. The small number of force- and velocity-modulating adverbs (two each for intensifying and attenuating adverbs) prevented us from validly deducing differential effects on the verb-response interaction. On the other hand, some action verbs, e.g. “boxen” (to box), might be predominately modulated by adverbs defining the amount of force, while others, e.g. “tippen” (to type), might be more susceptible to an adverbial modulation relating to the velocity of the action. This raises the important question whether such differences are mirrored in the overt motor behavior or neurophysiological activity. Additionally, previous studies suggested an influence of various movement-dependent factors on beta-desynchronization in motor areas (Tan et al., 2013; Nakayashiki, Saeki, Takata, Hayashi, & Kondo, 2014). A consecutive study might therefore also be concerned with the influence of manner adverb on the motor response by taking various movement-related parameters into consideration. Furthermore, adverb-verb combinations should be included in natural sentences to shed more light on the influence of grammatical constructions on action-related language processing in sensorimotor areas.

Taken together, our study provides an indication that manner adverbs influence motor behavior while corroborating the already existing data concerning the interaction between action verb processing and motor output. These findings are in line with assumptions made by embodied cognition theories proposing an essential role of sensorimotor areas in the processing and storage of action concepts inherent in action-related language. The adverbial modulation of motor behavior might reflect a certain variation of motor involvement in language processing. This involvement could be susceptible to grammatical constructions modifying the action component of action verbs. Yet, effects of the verb material in a closely matched verb set and influences of timing have to be taken into account.