Learning Foreign Language Vocabulary with Gestures and Pictures Enhances Vocabulary Memory for Several Months Post-Learning in Eight-Year-Old School Children

Andrä, Christian; Mathias, Brian; Schwager, Anika; Macedonia, Manuela; von Kriegstein, Katharina

doi:10.1007/s10648-020-09527-z

Intervention Study
Open access
Published: 18 April 2020

Volume 32, pages 815–850 (2020)

Educational Psychology Review

Abstract

The integration of gestures and pictures into pedagogy has demonstrated potential for improving adults’ learning of foreign language (L2) vocabulary. However, the relative benefits of gestures and pictures on children’s L2 vocabulary learning have not been formally evaluated. In three experiments, we investigated the effects of gesture-based and picture-based learning on 8-year-old primary school children’s acquisition of novel L2 vocabulary. In each experiment, German children were trained over 5 consecutive days on auditorily presented, concrete and abstract, English vocabulary. In Experiments 1 and 2, gesture enrichment (auditorily presented L2 words accompanied with self-performed gestures) was compared with a non-enriched baseline condition. In Experiment 3, gesture enrichment was compared with picture enrichment (auditorily presented words accompanied with pictures). Children performed vocabulary recall and translation tests at 3 days, 2 months, and 6 months post-learning. Both gesture and picture enrichment enhanced children’s test performance compared with non-enriched learning. Benefits of gesture and picture enrichment persisted up to 6 months after training and occurred for both concrete and abstract words. Gesture-enriched learning was hypothesized to boost learning outcomes more than picture-enriched learning on the basis of previous findings in adults. Unexpectedly, however, we observed similar benefits of gesture and picture enrichment on children’s L2 learning. These findings suggest that both gestures and pictures enhance children’s L2 learning and that performance benefits are robust over long timescales.

Introduction

Multisensory and Sensorimotor Enrichment

Learning in natural environments is multisensory: Information arising from multiple sensory modalities is integrated when we acquire knowledge or skills. Learning to recognize the voice of a new acquaintance, for example, takes into account both the sight of the individual and their speech characteristics (von Kriegstein et al. 2008; Sheffert and Olson 2004). Interactions between sensory and motor modalities may also be essential for the learning of complex skills such as reading, writing, and arithmetic and are therefore highly relevant for many issues associated with education (Kiefer and Trumpp 2012).

The presence of complementary information across multiple sensory modalities during learning has been referred to as multisensory enrichment (Mayer et al. 2015; Repetto et al. 2017). Current pedagogical and neurocognitive theories propose that multisensory input, compared with unisensory input, is beneficial for learning outcomes (Mahmoudi et al. 2012; Sadoski and Paivio 2013; Shams and Seitz 2008; von Kriegstein and Giraud 2006).

In the classroom, multisensory enrichment can take many forms: flash cards (Wissman et al. 2012); videos (Tan and Pearce 2011; for review see Snelson 2011); video games (Annetta et al. 2009; Hsu 2011); songs and poetry (Foster and Freeman 2008; Millington 2011); and interactive activities that make use of mobile phones, computers, and tablets (Ehret and Hollett 2014; for a review see Herodotou 2018; Volk et al. 2017). These enrichment strategies have all been studied as means of enhancing learning outcomes. While some of these pedagogical tools are already widely in use in educational settings, others have scarcely been adopted.

There is mounting evidence that sensorimotor enrichment also increases learning efficiency and memory performance. We use the term sensorimotor enrichment to indicate the presence of movements such as gestures during learning that are semantically congruent with information presented in another sensory modality (for a review see Macedonia 2014). Note that sensorimotor enrichment involves information presented across more than one sensory modality, because movements induce somatosensory feedback. Sensorimotor enrichment can also involve the presence of further sensory modalities, e.g., due to seeing the self-generated movements. Like multisensory enrichment, sensorimotor enrichment benefits learning (MacLeod et al. 2010). The pairing of auditory stimuli such as novel spoken words with semantically related gestures, for example, enhances subsequent auditory stimulus recognition (Mayer et al. 2015) and retrieval (Macedonia et al. 2011). In most studies involving the performance of gestures, an individual who models the gestures for the participants is integrated into the participants’ learning experience. Also, the performance of pantomimes during the learning of pseudowords and associated visual objects can enhance subsequent visual object recognition compared with the performance of simple pointing movements during learning (v. Soden-Fraunhofen et al. 2008). Memory enhancements following sensorimotor enrichment have variously been termed enactment effects (Engelkamp and Zimmer 1985), production effects (Dodson and Schacter 2001), and subject-performed task effects (Cohen 1981).

Sensorimotor enrichment is of interest to educators as principles of active learning have moved from the periphery of education to its center over the past decades (Lewis and Williams 1994; Michael 2006). During active learning, students engage in activity that encourages them to reflect upon the learning material (Collins and O’Brien 2003). Active learning techniques contrast with other pedagogical approaches that focus on only reading, watching, or listening to learning material and may facilitate children’s learning because they complement the use of more cognitive learning strategies (Cook et al. 2012; Goldin-Meadow 2003; Gullberg et al. 2008). Student engagement and active participation are considered to be among the most critical factors driving their persistence to learn (Braxton et al. 2008).

Cognitive and Neuroscientific Theories of Multisensory and Sensorimotor Enrichment

Benefits of multisensory and sensorimotor enrichment for learning are accounted for by several cognitive and neuroscientific theories. Theories of embodiment argue that concepts are mentally represented in terms of their perceptual features, motor features, and other aspects of one’s personal experience (for a review see Barsalou 2008). According to an embodied perspective, multisensory and sensorimotor enrichment may enhance memory by grounding the remembered material in multisensory and sensorimotor experiences. For example, training studies have demonstrated that, in preschool children, writing letters by hand enhances subsequent recognition of those letters compared with typing them (Kiefer et al. 2015). Such studies hint at why multisensory and sensorimotor experience may aid children’s education: When meaningful interactions with to-be-learned material are lacking and are replaced by verbal descriptions, children may rely simply on verbal associations for learning, which may be less durable than sensorimotor associations (Kiefer and Trumpp 2012; see also DeLoache et al. 2010).

Nested within an embodiment framework are theories of dual coding (Engelkamp and Zimmer 1984; Hommel et al. 2001; Paivio 1991; Paivio and Csapo 1969) and simulation or imagery accounts (Jeannerod 1995; Kosslyn et al. 2006; Saltz and Dixon 1982). Dual coding theory proposes that stimuli presented in various sensory and sensorimotor modalities are coded either verbally or nonverbally. For example, vocabulary words that are heard and then pronounced are coded verbally, whereas related gestures that are seen and then performed are coded nonverbally. Benefits of multisensory- and sensorimotor-enriched learning can be attributed to the encoding of learned material both verbally and nonverbally in one or more sensory modalities, with the nonverbal code contributing more to memory than the verbal code (Sadoski and Paivio 2013).

How beneficial effects of enrichment are instantiated in the human brain is to date unknown. One overarching mechanistic account of brain function is the Bayesian brain hypothesis. It assumes that the brain represents information probabilistically and uses an internal generative model and predictive coding to most effectively process sensory input (Friston 2005; Friston and Kiebel 2009; Kiebel et al. 2008; Knill and Pouget 2004). In this view, simply listening to a stimulus that has been encoded both in terms of its auditory and visual features, for example, may trigger an internal dynamic generative model that reconstructs its stored visual features (implemented in visual cortices) and thereby helps to recognize the perceptual input (for a review see von Kriegstein 2012; Mayer et al. 2015; Yildirim and Jacobs 2012). These internal generative models of the enriched learning material could explain enhanced learning outcomes (von Kriegstein and Giraud 2006; Yildirim and Jacobs 2012).

Effects of Enrichment on Native Language (L1) Vocabulary Learning

A quintessential example of the benefits of multisensory and sensorimotor enrichment can be found in the acquisition of novel vocabulary. When acquiring their native language, individuals accumulate extensive multisensory and sensorimotor experience with caregivers and the environment (Kuhl 2010). Over time, specific sensory experiences and motor responses become associated with each other and labeled with a sequence of phonemes, i.e., a word (Lupyan and Thompson-Schill 2012; Macedonia 2015).

Multisensory-enriched learning of native language words aids L1 acquisition (Hadley et al. 2016). Many parents begin reading picture books to their children shortly after birth; the reading of picture books prior to age 2 correlates with pre-literate children’s oral L1 skills (DeBaryshe 1993). However, quantifying the benefits of picture book reading has been difficult due to the lexical diversity of picture books, as well as the lack of adequate control learning conditions (Montag et al. 2015; Sénéchal and Cornell 1993). In one study, 135 third- and fourth-grade children showed improved memory for L1 words accompanied by pictures compared with words with no visual enrichment (Acha 2009). The inclusion of sign language curriculum in preschool, in which words are presented visually, kinesthetically, and verbally, may also speed up children’s acquisition of L1 vocabulary (Daniels 1997).

Sensorimotor-enriched learning of native language words also contributes to L1 vocabulary knowledge. For example, speaking native language words can aid their acquisition: Five-year-olds who are taught novel L1 words by speaking them aloud show greater memory for those words compared with words taught only by listening, a sensorimotor enrichment benefit (Icht and Mama 2015). Several studies have suggested that the performance of gestures and other movement-based interventions during learning facilitates the comprehension of novel L1 sentences (for a review see Sadoski 2018). For example, 6- and 8-year-olds remember action phrases in L1 such as “lift the bottle” better following enactment using gestures compared to verbal repetition (Mecklenbräuker et al. 2011). Gestures produced early in language development provide a way for children to communicate information that they cannot yet express verbally and predict the future acquisition of corresponding L1 vocabulary (Iverson and Goldin-Meadow 2005; Caselli et al. 2012). Cross-cultural studies (Bavin et al. 2008; Eriksson and Berglund 1999) suggest a common cultural and biological basis for the role of gestures in the development of spoken language.

Effects of Enrichment on Vocabulary Learning in the Foreign Language (L2) Classroom

Only a few studies have examined the effects of multisensory and sensorimotor enrichment on vocabulary learning in applied educational contexts. These studies have mostly investigated the learning of L2 vocabulary, as young children typically spend less than 15% of classroom time on direct L1 vocabulary instruction depending on the curricula (McGill-Franzen et al. 2006; Scott et al. 2003).

Multisensory enrichment for the learning of L2 vocabulary is commonly used by teachers of foreign languages. However, empirical investigations of the use of visual materials for L2 learning in young children are sparse. Silverman and Hines (2009) found that the viewing of short video clips that supplemented teachers’ regular instruction increased kindergartners’ through second graders’ knowledge of L2 vocabulary words. This multimedia enrichment also narrowed the gap in vocabulary knowledge between L2 learners and native speakers. The use of images, animations, and videos in the teaching of English as an L2 has been suggested as a means of improving L2 learning outcomes in university-age students (for a review see Gilakjani 2012; Konomi 2014). Video, in particular, provides a rich semantic and pragmatic context to which lexical phrases can be linked for the teaching of foreign languages (Tschirner 2001).

In the case of sensorimotor enrichment, one recent study conducted with 111 preschool children (5-year-olds) in an educational context found that the performance of physical exercise and iconic gestures while listening to foreign language vocabulary increased recall compared with exercising without gestures (Mavilidi et al. 2015). In another study, a teacher of English as a foreign language enacted gestures, repeated utterances, and checked children’s comprehension while telling a simple story. Together, these modifications increased Spanish primary school children’s (10-year-olds’) comprehension of the story compared with non-modified storytelling (Cabrera and Martínez 2001). Finally, Macedonia et al. (2014; see also de Wit et al. 2018) demonstrated that 11-year-old children’s L2 vocabulary learning outcomes were boosted more by performing related gestures themselves during learning than by viewing a pedagogical agent who performs the gestures.

Comparing the Effectiveness of Multisensory and Sensorimotor Enrichment on L2 Vocabulary Learning

The quest for optimal L2 teaching strategies generates a key question: Is the use of gesture-enriched learning more beneficial than the use of more commonly practiced picture-enriched learning? Studies on young adults have suggested that gesture-enriched learning can enhance cued memory recall of L2 vocabulary even more than picture-enriched learning (Mayer et al. 2015; Repetto et al. 2017): In one recent study, adults learned L2 vocabulary by reading an L2 word aloud while viewing its written L1 translation and performing a gesture (gesture-enriched learning) or viewing a picture (picture-enriched learning; Repetto et al. 2017). After 35 min of training, participants produced fewer translation errors for gesture-enriched L2 words compared with picture-enriched L2 words in a multiple choice task. In another study, gesture-enriched learning yielded more accurate L2 translation performance compared with picture-enriched learning 6 months following a week-long L2 training period (15 h of training; Mayer et al. 2015).

Whether these findings in adults (Mayer et al. 2015; Repetto et al. 2017) can be translated to children is an open question. Adults and children differ in many ways with regard to learning mechanisms, such as working memory abilities (Luna et al. 2004) and use of visual and motor imagery (Frick et al. 2009; Funk et al. 2005; for a review see Gabbard 2009). However, some studies show similar learning mechanisms in adults and children (Raviv and Arnon 2018; Saffran et al. 1999). It may also be the case for children’s L2 learning in the classroom that gesture-enriched learning is more effective than picture-enriched learning. The potentially greater effectiveness of gesture-enriched compared with picture-enriched learning would have consequences for the development of evidence-based teaching strategies. Additionally, to our knowledge, no previous studies on children’s L2 vocabulary learning have directly compared sensorimotor enrichment strategies with a unisensory baseline learning condition. This leaves open the possibility that facilitative effects of sensorimotor enrichment could merely be an artifact of impaired learning in one of the enriched control conditions, as enriched learning can create a larger cognitive load than unisensory learning (Mayer and Moreno 2003).

Two studies have attempted to clarify whether gesture enrichment and picture enrichment differ in their effects on children’s L2 vocabulary acquisition (Porter 2016; Tellier 2008). A study conducted by Tellier (2008) with 4- to 5-year-olds demonstrated that gesture enrichment may yield larger gains in children’s L2 vocabulary learning compared with picture enrichment (viewing pictures while hearing novel words). Porter (2016) additionally showed that combined gesture and picture enrichment may yield larger gains for 5- to 6-year-olds than picture enrichment alone. However, these studies possess some limitations. Besides the use of small sample sizes (10 children per condition in one study in a between-subjects design; Tellier 2008), one study did not control the number of L2 word repetitions that occurred within each enrichment condition or the classes of words that were taught (Porter 2016), so that learning effects could be attributed to participants’ amounts of practice rather than enrichment types. Also, the studies examined enrichment effects over limited time periods (i.e., up to 2 weeks after instruction had stopped; Porter 2016).

Aims of the Current Study

In adults, benefits of gesture enrichment are known to exceed those of picture enrichment (Mayer et al. 2015). It is, however, unclear whether similar outcomes would be shown by children in naturalistic classroom environments. Evaluating learning outcomes in children is important as it would be a first step toward answering the question of whether educators should integrate both forms of enrichment in classrooms for school children, or whether one type of enrichment should be preferred due to its greater effectiveness. The approach of the current study was to follow a design that was used previously in laboratory-based tests in adults (Mayer et al. 2015) and translate it into a primary school classroom setting.

Our primary aim was to compare the effects of gesture and picture enrichment on L2 vocabulary learning. In order to evaluate enrichment effects, we first compared benefits of gesture enrichment with a non-enriched baseline learning condition. Comparison of gesture-enriched learning and non-enriched learning is necessary for quantifying the extent to which gesture enrichment enhances non-enriched learning, as well as addressing the alternative explanation for differences between learning conditions that enrichment may under some circumstances impede learning (Mayer and Moreno 2003).

Gesture and picture enrichment have yielded similar effects for both concrete (e.g., tent) and abstract (e.g., thought) L2 vocabulary in adults (Macedonia and Knösche 2011; Mayer et al. 2017). However, abstract vocabulary learning typically lags behind concrete vocabulary learning during development (McFalls et al. 1996). Our second aim was to investigate whether gesture or picture enrichment has the potential to boost children’s learning of both concrete and abstract word types.

An important aspect of developing L2 proficiency is the retention of new vocabulary over long time periods, even when words are not retrieved from memory on a regular basis. Therefore, our third aim was to investigate long-term influences of gesture and picture enrichment on school children’s L2 vocabulary retention by comparing the effects of the two types of enrichment up to 6 months post-learning.

Summary of the Experimental Approach and Hypotheses

We conducted three experiments that all included school children enrolled in grade three of primary school. Three main hypotheses were tested. First, on the basis of studies performed in young adults (Mayer et al. 2015), we expected that pairing self-performed gestures with L2 vocabulary would enhance learning outcomes compared with non-enriched learning (Experiments 1 and 2). In adults, gesture enrichment has been shown to improve performance on both cued translation (Mayer et al. 2015) and free recall (Macedonia and Knösche 2011; Zimmer et al. 2000) tests. Cued translation refers to a learner’s translation of a written or spoken L1 or L2 word that serves as a memory cue, and free recall refers to a learner’s remembering an L1 word and its L2 translation in the absence of any written or spoken cue. Gesture-enriched learning was expected to enhance children’s cued L1 and L2 translation, as well as free recall of L1-L2 translations, compared with non-enriched learning.

Second, though the learning of concrete words was expected to exceed the learning of abstract words, enrichment effects of a similar magnitude were expected for both concrete and abstract L2 words based on studies in adults (Macedonia and Knösche 2011; Mayer et al. 2017) (Experiments 1 and 2).

Third, as adults show greater memory for L2 words learned using gestures compared with pictures over the long term (6 months following learning; Mayer et al. 2015) on cued translation tasks, gesture enrichment was expected to benefit post-learning cued L1 and L2 translation more than picture enrichment at later time points following learning (Experiment 3). We also explored whether gesture-enriched learning benefitted children’s free recall performance more than picture-enriched learning; this particular effect was not observed in adults (Mayer et al. 2015).

Experiment 1

Experiment 1 served to test the hypotheses that pairing self-performed gestures with auditorily presented L2 vocabulary would enhance learning outcomes compared with non-enriched learning and that the gesture-based learning benefits would occur for both concrete and abstract nouns.

Methods

Participants

Participants were school children enrolled in grade three at a primary school in Leipzig, Germany. The school is classified within the German education system as a bewegte Schule (“movement school”; http://www.bewegte-schule-und-kita.de/konzept/bewegteSchule/english/html/konzept.html). This type of school emphasizes the role of movement in student learning and development (Breithecker and Dordel 2003). Seventy-one children participated in Experiment 1. The investigators briefed all children and teachers on the study procedures in an introductory session that took place prior to the experiment. Children who were absent from at least one training or test session were excluded from the analyses. Therefore, 54 children were included in the analyses. Written informed consent was obtained from the legal guardians of all individual school children who participated. None of the children were reported by the school principal or teachers to possess learning disabilities. All of the children possessed normal or corrected-to-normal vision. Demographic information can be found in Table 1. Experiment 1 was reviewed and approved by the Education Department of the state of Saxony, Germany.

Table 1 Participant demographic information for Experiments 1, 2, and 3

Stimulus Materials

Forty English words were used in Experiment 1 (Table 2). Half of the words were concrete nouns, and the other half were abstract nouns. The concrete and abstract nouns significantly differed in terms of concreteness on the basis of ratings from a corpus of 350,000 German lemmas, t (38) = 13.06, p < 0.001 (concrete nouns, M = 6.58, SD = 0.98, range, 4.07–7.99; abstract nouns, M = 2.95, SD = 0.76, range, 1.80–4.61) (Köper and Schulte Im Walde 2016). Words were selected from the “Vimmi” language corpus (Macedonia et al. 2010, 2011). Word selection was based on recommendations of the children’s schoolteachers and was constrained by two factors. First, children had not yet encountered the words in lessons, and the words were not anticipated to be included in teaching curriculum for the 6-month duration of the investigation. Second, the words were considered by the teacher to be relevant for future use by the children. Word frequencies in written German (http://wortschatz.uni-leipzig.de/en), as well as the numbers of syllables and letters contained in the English translations, were counterbalanced between learning conditions. Word frequencies and lengths were also counterbalanced between sets of concrete and abstract words included in the two learning conditions.
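As a rough plausibility check on the reported concreteness comparison, the t statistic can be recomputed from the summary statistics given above. The sketch below assumes 20 words per group (half of the 40 words) and a pooled-variance Student's t-test, which is consistent with the reported 38 degrees of freedom but is not stated explicitly in the text. The function name `pooled_t` is illustrative only.

```python
import math

# Summary statistics as reported in the text (concreteness ratings).
# Group sizes of 20 are an assumption: half of the 40 words per word type.
n_concrete, mean_concrete, sd_concrete = 20, 6.58, 0.98
n_abstract, mean_abstract, sd_abstract = 20, 2.95, 0.76


def pooled_t(n1, m1, s1, n2, m2, s2):
    """Student's independent-samples t from group sizes, means, and SDs."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / df
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / standard_error, df


t_value, df = pooled_t(
    n_concrete, mean_concrete, sd_concrete,
    n_abstract, mean_abstract, sd_abstract,
)
print(f"t({df}) = {t_value:.2f}")
```

Under these assumptions the result is about 13.1 with 38 degrees of freedom, close to the reported value of 13.06; the small gap would be expected if the published means and standard deviations were rounded.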

Table 2 German and English words used in Experiments 1, 2, and 3

Experiment 1 made use of two stimulus types: audio recordings of English words and their German translations and videos of an actress performing gestures that were semantically related to word meanings (iconic gestures). Videos and pictures were adopted from the “Vimmi” corpus (Macedonia et al. 2010, 2011; Mayer et al. 2015).

We recorded German stimuli with a bilingual Italian-German speaker (female, age 44) and English translations with a native speaker of British English (female, age 21). Recordings were made using a RØDE NT55 microphone (RØDE Microphones, Silverwater, Australia) in a sound-dampened room. The 40 German word recordings used in Experiment 1 were M = 0.86 s (SD = 0.18 s) in length, and English word recordings were M = 0.84 s (SD = 0.15 s) in length. The 24 German word recordings used in Experiments 2 and 3 were M = 0.81 s (SD = 0.16 s) in length, and English word recordings were M = 0.83 s (SD = 0.15 s) in length.

Videos were recorded using a Canon Legria HF S10 camcorder (Canon Inc., Tokyo, Japan). Each video was 4 s long and shot in color. The actress shown in the videos began and ended each video by standing motionless with her arms by her sides. During the videos, she used head movements, movements of one or both arms or legs, fingers, or combinations of these body parts to convey the meaning of the foreign language word through the movement. For example, the word tent was conveyed by moving the arms and fingers together to form an upside-down “V” shape, and the word thought was conveyed by touching the head with one hand and subsequently pointing upward with the same hand above the head (Fig. 1). The actress always maintained a neutral facial expression. Gestures selected for abstract nouns were previously agreed upon by three independent raters (Macedonia et al. 2011; Mayer et al. 2015).

Design and Procedure

Experiment 1 had a 2 × 2 × 3 factorial within-subjects design with the factors learning condition (gesture enrichment, no enrichment), word type (concrete, abstract), and testing time point (3 days, 2 months, and 6 months post-learning).
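To make the factorial structure concrete, the following minimal sketch enumerates the cells of the design. The factor names and level labels are taken from the description above; the dictionary layout is simply one convenient representation, not the authors' analysis code.

```python
from itertools import product

# Factors and levels of the within-subjects design described in the text.
factors = {
    "learning_condition": ["gesture enrichment", "no enrichment"],
    "word_type": ["concrete", "abstract"],
    "time_point": ["3 days", "2 months", "6 months"],
}

# Every combination of one level per factor is one cell of the design.
cells = [dict(zip(factors, levels)) for levels in product(*factors.values())]

print(len(cells))  # 2 x 2 x 3 = 12 cells
for cell in cells:
    print(cell)
```

Because the design is within-subjects, each child contributes data to all 12 cells.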

Learning Phase

Children completed 5 consecutive days of L2 vocabulary training (Fig. 2). Training occurred over four 5-min blocks per day. Rest activities occurred for 5 min between each of the blocks. Thus, each daily training session had a total duration of approximately 35 min.
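The stated session length follows from simple arithmetic, sketched here under the assumption that rest periods occur only between blocks (so three rests for four blocks). The function name `session_minutes` is illustrative.

```python
BLOCKS_PER_DAY = 4
BLOCK_MINUTES = 5
REST_MINUTES = 5
TRAINING_DAYS = 5


def session_minutes(blocks, block_minutes, rest_minutes):
    """Total session length when rests occur only between blocks."""
    return blocks * block_minutes + (blocks - 1) * rest_minutes


daily_total = session_minutes(BLOCKS_PER_DAY, BLOCK_MINUTES, REST_MINUTES)
weekly_training = TRAINING_DAYS * BLOCKS_PER_DAY * BLOCK_MINUTES

print(daily_total)      # 35 minutes per daily session, including rests
print(weekly_training)  # 100 minutes of training blocks over the 5 days
```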

L2 words and their L1 translations were presented in non-enriched and gesture-enriched trials (Fig. 3). In both trial types, children first heard an English word, which was followed by its auditorily presented German translation and then by a repetition of the English word. The children’s teacher then cued the children to recite the English word aloud with the words “all together.” The teacher stood at the front of the classroom during the entire training period. In the gesture enrichment condition, recorded English words were accompanied by videos of an actress performing an iconic gesture. At the end of the trial, children performed the gesture along with the teacher. The time interval between English and German word onsets in the non-enriched learning condition was 2.5 s. A shorter time interval was used for the presentation of the English words in the non-enriched learning condition, compared with the videos in the gesture condition, in order to avoid a large difference in inter-stimulus intervals between the two conditions. Long time periods during which no sensory information is presented could potentially decrease attention, motivation, or stimulus-driven arousal during the non-enriched trials compared with the gesture-enriched trials, which would favor higher performance outcomes for the gesture learning condition compared with the non-enriched learning condition. The time interval between the onset of the German word’s presentation and the onset of the English word’s repetition was 2.5 s in both conditions. Children’s locations in the classroom were randomly assigned for each training block. Children sat at desks during no enrichment trials and stood next to desks during the gesture trials. One of the investigators monitored the testing equipment and initiated each trial as soon as the children were ready.

Learning phase trials were blocked by learning condition. In Experiment 1, children completed 4 blocks containing 10 trials (5 concrete word trials and 5 abstract word trials) per day. Half of the blocks comprised gesture enrichment, and the other half comprised non-enriched learning. Each German word and its English translation were presented in only a single trial each day. Word orders within each block and orders of enrichment condition blocks were counterbalanced across learning days.
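The block structure described above can be sketched as follows. This is an illustration only: the word labels are placeholders rather than the study's word list, and seeded random shuffling stands in for the counterbalancing of word and block orders, whose exact scheme is not specified here. The function name `day_schedule` is hypothetical.

```python
import random

CONDITIONS = ["gesture", "non-enriched"]

# Placeholder labels, not the study's actual words:
# 10 concrete and 10 abstract words per learning condition (40 in total).
words = {
    cond: {
        wtype: [f"{cond}-{wtype}-{i}" for i in range(10)]
        for wtype in ("concrete", "abstract")
    }
    for cond in CONDITIONS
}


def day_schedule(words, day, seed=0):
    """Build 4 blocks of 10 trials (5 concrete, 5 abstract) for one day.

    Blocks are pure with respect to learning condition, and every word
    appears in exactly one trial. Shuffling is an assumed stand-in for
    the counterbalancing scheme.
    """
    rng = random.Random(seed * 100 + day)
    blocks = []
    for cond in CONDITIONS:
        concrete = rng.sample(words[cond]["concrete"], 10)
        abstract = rng.sample(words[cond]["abstract"], 10)
        for half in (0, 1):
            start = 5 * half
            trials = concrete[start:start + 5] + abstract[start:start + 5]
            rng.shuffle(trials)
            blocks.append({"condition": cond, "trials": trials})
    rng.shuffle(blocks)  # vary the order of condition blocks across days
    return blocks


schedule = day_schedule(words, day=1)
for block in schedule:
    print(block["condition"], len(block["trials"]))
```

Each day's schedule therefore contains two gesture blocks and two non-enriched blocks, covering all 40 words once.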

Five-minute rest activities occurred between learning blocks. The activities were intended to promote cognitive recovery and relaxation and consisted of skipping rope, ball-throwing, and partner massage in groups of 2 to 4 individuals. These activities were familiar to the children as they were sometimes integrated into the School in Motion curriculum outside of the context of the study. The order of rest activities was counterbalanced across learning days.

Children completed training sessions in groups of up to 13 to ensure adequate space to perform the gestures and minimization of distraction. Stimuli were counterbalanced across groups of children: Some children learned one set of words in the gesture condition and another set in the non-enriched learning condition, and the remaining children received the reverse assignment of words to the enrichment conditions. Children’s positions within the classroom were counterbalanced across 5 days of training.
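The stimulus counterbalancing across groups can be illustrated with a small sketch. The set labels "set A" and "set B" and the rule of alternating by group index are assumptions made for illustration; the text states only that some children received one assignment and the remaining children the reverse.

```python
def assign_word_sets(group_index):
    """Alternate which word set is learned with gestures across groups.

    'set A' and 'set B' are hypothetical labels for the two word sets.
    """
    if group_index % 2 == 0:
        return {"gesture": "set A", "non-enriched": "set B"}
    return {"gesture": "set B", "non-enriched": "set A"}


for group in range(4):
    print(group, assign_word_sets(group))
```

With this kind of assignment, each word is learned under gesture enrichment by some children and without enrichment by others, so differences between conditions are not tied to particular words.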

Test Phase

Children completed vocabulary tests at three time points: 3 days, 2 months, and 6 months following the completion of the learning phase. Free recall, German-English, and English-German translation tests were conducted orally at each time point, since none of the children possessed adequate writing skills in English as a foreign language.

Native German-speaking examiners conducted the test sessions individually at the same school where the learning phase took place. The examiners were university students enrolled in teaching certification programs at the University of Leipzig. Examiners were blind with respect to which words had been learned in each enrichment condition. Further, they had no knowledge of the gestures or pictures paired with individual words in the experiment.

During each test session, one of the school children sat at a desk opposite one of the examiners. In the free recall test, children were asked to verbalize as many German-English or English-German translations, individual German words, or individual English words as they could remember from the training. A time limit of 5 min was imposed; children were not instructed about this time limit, and no child in any experiment exceeded 5 min. Following the free recall test, the children completed the two translation tests. The free recall test was always administered prior to the translation tests to eliminate influences of memory cues present in the translation tests.

During the German-English translation test, the examiner spoke the German words one at a time, and the children were asked each time to give the correct English translation. During the English-German translation test, the examiner spoke the English words one at a time, and the children were asked each time to give the correct German translation. The German-English translation test was always administered prior to the English-German test, as translation from one’s native to a foreign language has been shown to be a more difficult task than the translation from a foreign language into one’s native language (Kroll and Stewart 1994). Examiners were instructed to speak the English words with a pronunciation based on the recordings used in the experiments. Children were given 5 s to state their answers before moving to the next word. Test word orders in the two translation tests were randomized for each testing time point (3 days, 2 months, and 6 months post-learning).

Examiners recorded test sessions as an audio file for subsequent analysis using a personal recording device such as a mobile phone. The children did not receive any feedback regarding the correctness of their answers. Children were instructed not to discuss the tests with their classmates. Each test session lasted approximately 10–15 min.

Data Analysis

Audio files from individual student test sessions were independently scored for accuracy by two raters. The two raters had not conducted any of the test sessions and were also blind with respect to which words had been learned in each enrichment condition. In cases of disagreement, a third independent rater was employed, and the majority decision was adopted.

German-English translation and English-German translation tests were scored in terms of the total number of correct translations recalled in each test (one point for each correct translation). One point was given for each correct translation (German-English or English-German word pair) provided during the free recall test. No points were given for a German word that was missing a corresponding English translation or vice versa.
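The scoring rule and the reconciliation of rater disagreements can be illustrated with a short sketch. It assumes that each response is judged as correct or incorrect by two raters and, only where they disagree, by a third; the function names and the data layout are hypothetical and do not reproduce the authors' actual scoring procedure.

```python
from typing import Optional, Sequence, Tuple

# One entry per response: (rater 1, rater 2, optional rater 3).
Rating = Tuple[bool, ...]


def adopted_judgment(rater_1: bool, rater_2: bool, rater_3: Optional[bool] = None) -> bool:
    """Return the judgment adopted for one response."""
    if rater_1 == rater_2:
        return rater_1
    if rater_3 is None:
        raise ValueError("raters disagree, so a third rating is required")
    # Majority decision among the three raters.
    return sum([rater_1, rater_2, rater_3]) >= 2


def total_score(ratings: Sequence[Rating]) -> int:
    """One point for each response judged to be a correct translation."""
    return sum(adopted_judgment(*rating) for rating in ratings)
```

Under this sketch, a free recall response consisting of a German word without its English translation (or vice versa) would simply be rated as incorrect by the raters and would contribute no point.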

In order to evaluate effects of enrichment and vocabulary type on learning, three-way analyses of variance (ANOVAs) with the factors enrichment condition (gesture enrichment, no enrichment), word type (concrete and abstract), and time point (3 days post-learning, 2 months post-learning, and 6 months post-learning) were conducted for each learning outcome (L1-L2 translation, L2-L1 translation, and free recall). All post hoc comparisons were evaluated with Tukey’s HSD tests. The significance threshold was set to α = 0.05, and partial eta-squared and Hedge’s g were computed as measures of effect size (Greenland et al. 2016).
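For readers who wish to check the reported effect sizes, the sketch below shows how partial eta-squared can be recovered from an F statistic and its degrees of freedom, and how a bias-corrected standardized mean difference can be computed from summary statistics. The pooled-standard-deviation form of Hedges' g used here is an assumption, since the exact variant applied to the within-participant comparisons is not stated; the function names are hypothetical.

```python
import math


def partial_eta_squared(f_value: float, df_effect: int, df_error: int) -> float:
    """Recover partial eta-squared from an F statistic and its degrees of freedom."""
    effect_term = f_value * df_effect
    return effect_term / (effect_term + df_error)


def hedges_g(mean_1: float, mean_2: float, sd_1: float, sd_2: float, n_1: int, n_2: int) -> float:
    """Bias-corrected standardized mean difference (pooled-SD variant, assumed here)."""
    pooled_var = ((n_1 - 1) * sd_1**2 + (n_2 - 1) * sd_2**2) / (n_1 + n_2 - 2)
    cohens_d = (mean_1 - mean_2) / math.sqrt(pooled_var)
    correction = 1 - 3 / (4 * (n_1 + n_2) - 9)  # small-sample correction factor
    return cohens_d * correction


# An F(1, 53) of 6.29 corresponds to a partial eta-squared of about 0.11.
print(round(partial_eta_squared(6.29, 1, 53), 2))
```

The same conversion can be applied to any F value together with its degrees of freedom as a consistency check on the reported partial eta-squared.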

Results

Table 3 shows children’s mean test scores at 3 days, 2 months, and 6 months post-learning for the L1-L2 translation, L2-L1 translation, and free recall tests.

Table 3 Mean scores on German-English translation, English-German translation, and free recall tests in Experiments 1 and 2


L1-L2 Translation

We first tested the hypothesis that gesture enrichment would benefit children’s L1-L2 translation test scores compared with the non-enriched learning condition. A three-way ANOVA with the factors learning condition, word type, and time point yielded a significant main effect of learning condition [F (1, 53) = 6.29, p < 0.05, \( {\eta}_p^2 \) = 0.11]. Children’s L1-L2 translation test scores were significantly higher for words learned with gesture enrichment compared with words learned without enrichment (Fig. 4a).

We next examined whether gesture enrichment benefits were influenced by the class of vocabulary that children had learned. The ANOVA yielded a significant interaction between learning condition and word type factors [F (1, 53) = 22.57, p < 0.001, \( {\eta}_p^2 \) = 0.30], as well as a significant three-way learning condition × word type × time point interaction [F (2, 106) = 3.51, p < 0.05, \( {\eta}_p^2 \) = 0.06]. Contrary to our hypothesis (see hypothesis two in the introduction), significantly higher L1-L2 translation test scores were observed across time points for abstract nouns learned with gesture enrichment compared with abstract nouns learned without enrichment (p < 0.01, Hedge’s g = 0.69). The three-way interaction revealed higher L1-L2 translation scores for abstract nouns learned with gesture enrichment compared with no enrichment at 3 days post-learning (p < 0.01, Hedge’s g = 0.80), 2 months post-learning (p < 0.01, Hedge’s g = 0.65), and 6 months post-learning (p < 0.01, Hedge’s g = 0.68) (Fig. 4b).

The ANOVA additionally revealed a significant main effect of time point [F (2, 106) = 43.89, p < 0.001, \( {\eta}_p^2 \) = 0.45]. Tukey’s post hoc comparisons revealed that L1-L2 translation scores were significantly higher at 3 days post-learning compared with both 2 months post-learning (p < 0.01, Hedge’s g = 0.43) and 6 months post-learning (p < 0.01, Hedge’s g = 0.41). There was also a significant main effect of word type [F (1, 53) = 163.70, p < 0.001, \( {\eta}_p^2 \) = 0.76]: Children’s L1-L2 translation scores were higher for concrete nouns compared with abstract nouns (Fig. 4b). These main effects were expected due to memory decay over time (Caramelli et al. 2004; Howe and Brainerd 1989) and previous reports of children’s greater performance for concrete than abstract nouns (Schwanenflugel 1991). These main effects were qualified by a significant word type × time point interaction [F (2, 106) = 10.00, p < 0.001, \( {\eta}_p^2 \) = 0.16], which revealed that L1-L2 translation scores were significantly higher for concrete words—but not abstract words—at 6 months post-learning compared with 2 months post-learning (p < 0.05, Hedge’s g = 0.13). There were no other significant main effects or interactions.

L2-L1 Translation

We next tested whether gesture enrichment benefitted children’s L2-L1 translation test scores compared with the non-enriched learning condition. A three-way ANOVA with the factors learning condition, word type, and time point yielded a significant main effect of learning condition [F (1, 53) = 12.81, p < 0.01, \( {\eta}_p^2 \) = 0.19]. Children’s L2-L1 translation test scores were significantly higher for words learned with gesture enrichment compared with words learned without enrichment (Fig. 4a).

We examined whether gesture enrichment benefits on the L2-L1 translation test were influenced by the class of vocabulary that children had learned. The ANOVA yielded a significant interaction between learning condition and word type factors [F (1, 53) = 22.57, p < 0.001, \( {\eta}_p^2 \) = 0.30] and a significant three-way learning condition × word type × time point interaction [F (2, 106) = 3.51, p < 0.05, \( {\eta}_p^2 \) = 0.06]. Contrary to our hypothesis, significantly higher test scores were observed across time points for abstract nouns learned with gesture enrichment compared with abstract nouns learned without enrichment (p < 0.01, Hedge’s g = 0.73). The three-way interaction revealed higher L2-L1 translation scores for abstract nouns learned with gesture enrichment compared with no enrichment at 3 days post-learning (p < 0.01, Hedge’s g = 0.86), 2 months post-learning (p < 0.01, Hedge’s g = 0.75), and 6 months post-learning (p < 0.01, Hedge’s g = 0.66) (Fig. 4b).

The ANOVA additionally revealed a significant main effect of time point [F (2, 106) = 41.19, p < 0.001, \( {\eta}_p^2 \) = 0.44]. Tukey’s post hoc comparisons revealed that L2-L1 translation scores were significantly higher at 3 days post-learning compared with both 2 months post-learning (p < 0.01, Hedge’s g = 0.30) and 6 months post-learning (p < 0.01, Hedge’s g = 0.34). There was also a significant main effect of word type [F (1, 53) = 235.58, p < 0.001, \( {\eta}_p^2 \) = 0.82]: Children’s L2-L1 translation scores were higher for concrete nouns compared with abstract nouns. These main effects were qualified by a significant word type × time point interaction [F (2, 106) = 5.64, p < 0.01, \( {\eta}_p^2 \) = 0.10], which revealed that L2-L1 translation scores were significantly higher for abstract words—but not concrete words—at 3 days post-learning compared with 6 months post-learning (p < 0.01, Hedge’s g = 0.63). There were no other significant main effects or interactions.

Free Recall

We also tested whether gesture enrichment benefitted children’s free recall test scores compared with the non-enriched learning condition. A three-way ANOVA with the factors learning condition, word type, and time point yielded a significant main effect of learning condition [F (1, 53) = 8.19, p < 0.01, \( {\eta}_p^2 \) = 0.13]. Similar to both of the translation test scores, children’s free recall test scores were significantly higher for words learned with gesture enrichment compared with words learned without enrichment (Fig. 4a).

Lastly, we examined whether gesture enrichment benefits on the free recall test were influenced by the class of vocabulary that children had learned. The learning condition × word type interaction did not reach significance (p = 0.07).

The ANOVA additionally revealed a significant main effect of time point [F (2, 106) = 5.07, p < 0.01, \( {\eta}_p^2 \) = 0.09]. Tukey’s post hoc comparisons revealed that free recall scores were significantly higher at 6 months post-learning compared with 3 days post-learning (p < 0.01, Hedge’s g = 0.21) and 2 months post-learning (p < 0.01, Hedge’s g = 0.22). This main effect was qualified by a significant word type × time point interaction [F (2, 106) = 19.15, p < 0.001, \( {\eta}_p^2 \) = 0.27], which revealed that free recall scores were significantly higher for concrete words—but not abstract words—at 6 months post-learning compared with 3 days post-learning (p < 0.01, Hedge’s g = 0.52). The increase in free recall performance at 6 months post-learning compared with 3 days post-learning is difficult to explain, as we expected memory for the translations to decay over time.

There was also a significant main effect of word type [F (1, 53) = 60.42, p < 0.001, \( {\eta}_p^2 \) = 0.53]: Children’s free recall scores were higher for concrete nouns compared with abstract nouns (Fig. 4b). There were no other significant main effects or interactions.

Analysis of Moderating Effects of Age on Observed Gesture Enrichment Benefits

In an exploratory analysis, we tested whether children’s ages moderated the observed learning effects. Children in Experiment 1 ranged in age from 8.5 to 10.2 years (M = 8.7 years, SD = 0.5 years). For each dependent variable (free recall scores, L1-L2 translation scores, and L2-L1 translation scores), we used multiple linear regression to generate both a null model and an alternative model of learning outcomes. The null model predicted test scores from children’s ages and the learning condition factor (dummy-coded). The alternative model predicted test scores from children’s ages, learning condition (dummy-coded), and a moderator term (age multiplied by learning condition). Variance accounted for by the two models was compared using chi-square tests, which revealed no significant differences between the null and alternative models (free recall, p(χ2) = 0.46; L1-L2 translation, p(χ2) = 0.82; L2-L1 translation, p(χ2) = 0.92). These nonsignificant comparisons suggest that children’s ages did not moderate the effects of gesture enrichment on vocabulary test performance.
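The logic of this moderation analysis can be sketched as a comparison of two nested regression models. The sketch below is not the analysis code used in the study: the function names are hypothetical, age is mean-centered purely for numerical stability (which leaves the model comparison unchanged because condition is included in both models), and the likelihood-ratio form of the chi-square test with one degree of freedom is an assumption, as the exact test statistic is not specified.

```python
import math
from typing import List, Sequence


def residual_sum_of_squares(rows: List[List[float]], y: Sequence[float]) -> float:
    """Fit ordinary least squares via the normal equations and return the RSS."""
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    a = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * t for r, t in zip(rows, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for row in range(k):
            if row != col:
                factor = a[row][col] / a[col][col]
                a[row] = [x - factor * z for x, z in zip(a[row], a[col])]
    beta = [a[i][k] / a[i][i] for i in range(k)]
    return sum((t - sum(b * x for b, x in zip(beta, r))) ** 2 for r, t in zip(rows, y))


def moderation_p_value(
    ages: Sequence[float], conditions: Sequence[int], scores: Sequence[float]
) -> float:
    """Compare the null model with a model that adds an age x condition term."""
    mean_age = sum(ages) / len(ages)
    centered = [age - mean_age for age in ages]
    null_rows = [[1.0, a, float(c)] for a, c in zip(centered, conditions)]
    alt_rows = [[1.0, a, float(c), a * c] for a, c in zip(centered, conditions)]
    rss_null = residual_sum_of_squares(null_rows, scores)
    rss_alt = residual_sum_of_squares(alt_rows, scores)
    # Likelihood-ratio statistic for Gaussian errors; clipped at zero against rounding.
    statistic = max(0.0, len(scores) * math.log(rss_null / rss_alt))
    # Upper-tail probability of a chi-square distribution with 1 degree of freedom.
    return math.erfc(math.sqrt(statistic / 2))
```

Here `conditions` holds the dummy-coded learning condition (0 = no enrichment, 1 = gesture enrichment). A large p value indicates that adding the moderator term does not improve the fit, which is the pattern reported for all three test types.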

Discussion

The results of Experiment 1 confirmed our first hypothesis that children would demonstrate a significant benefit of gesture enrichment on vocabulary learning in comparison to non-enriched learning. This occurred for both the L1-L2 translation and L2-L1 translation tests, as well as the free recall test. Children also demonstrated higher overall performance for concrete vocabulary compared with abstract vocabulary for all three test types, consistent with previous reports (Schwanenflugel 1991). Unexpectedly in relation to the previous findings in adults (Mayer et al. 2015), enrichment benefits were not equivalent across classes of foreign vocabulary: The performance of gestures during learning significantly aided the subsequent L1-L2 and L2-L1 translation of abstract words, but not concrete words, at all time points. The greater effectiveness of gesture enrichment for abstract words compared with concrete words occurred for both the L1-L2 and L2-L1 translation tests, but not for the free recall test. We performed Experiment 2 to test the robustness of these effects.

Experiment 2

The relative superiority of the benefits of gesture enrichment for the learning of abstract words as opposed to concrete words observed in Experiment 1 was not predicted based on the previous results in adults (Mayer et al. 2015), and overall test accuracy in Experiment 1 was low considering the total number of stimuli included in the learning phase (Table 2). We therefore sought to replicate the findings of Experiment 1 in Experiment 2 using a subset of the stimuli used in Experiment 1 (24 German words and 24 English translations). We assumed that the presentation of fewer words overall would lead to higher performance (Dennis et al. 2008).