1 Introduction

National and international studies have repeatedly demonstrated that mathematics skills are strongly correlated with students’ skills in the language of instruction. In particular regarding students with a migrant background (Tarelli et al. 2012) and students whose family language differs from the language of instruction (Heinze et al. 2007), language skills have been identified as an important explanatory variable for their disadvantages in mathematics learning, above and beyond measures of socio-economic status (Tarelli et al. 2012). Current research from different perspectives strives to understand this relation between language skills and mathematics learning. General education research has adopted a differentiated view on language skills to identify those facets of language skills which are important for learning processes, including mathematics learning (e.g. grammar, vocabulary, and listening comprehension: Paetsch et al. 2015). From a subject-specific perspective, the role of subject-specific language registers has been discussed (e.g. Prediger and Wessel 2013), which must be mastered to construct meaning about mathematical concepts during instruction. This raises the question on the role of subject-specific language skills that go beyond mastering everyday language registers or more sophisticated demands of general academic language registers (Wessel and Erath 2018). While first promising attempts to conceptualise and operationalise mathematical language skills exist (Bochnik and Ufer 2016a; Schindler et al. 2019), our knowledge about the relation of such subject-specific language skills to the learning of mathematics skills remains limited.

Furthermore, in prior research, mathematics skills have often been conceptualised as a one-dimensional construct (Prediger et al. 2018). This contrasts with claims that language skills should be of particular importance for certain, more conceptual facets of mathematics skills (Heinze et al. 2007). First attempts to differentiate these more conceptual facets from other, more schematic skills, however, were mostly explorative and based on instruments that were not designed for such a differentiation (e.g. Ufer et al. 2013). On the other hand, models to conceptualise multidimensional mathematics assessments have been proposed in the past (Suurtamm et al. 2016). Thus, studying the role of general and subject-specific language skills when learning different dimensions of mathematics skills is a desideratum that can extend our understanding of the complex interaction between language skills and mathematics learning.

Accordingly, the goal of this contribution is to investigate the relation between general and subject-specific language skills and interindividual differences in the development of mathematics skills, applying a one-dimensional as well as a multidimensional conceptualisation of mathematics skills.

Our research interest in the role of language is from the perspective of migration-related disparities in mathematics skills. Thus, we will introduce control variables that proved relevant in this field to frame our study. Based on this, we present the theoretical mechanisms from the literature that explain the well-established finding that the development of mathematics skills is linked to students’ language skills (Sect. 1.1). Subsequently, we present theoretical approaches on the role of subject-specific language skills, and introduce our understanding of this construct (Sect. 1.2). Prior findings on varying relations of language skills to different subdimensions of mathematics skills are presented in Sect. 1.3. Here, we connect these findings to the SPUR framework, which we used to conceptualise subdimensions of mathematics skills in arithmetic.

1.1 Language Skills in the Language of Instruction, and the Development of Mathematics Skills in Elementary School

Much interest in the role of language for mathematics learning in the literature stems from results on the comparably lower mathematics skills of students who come from families with a migrant background (Tarelli et al. 2012), or speak different languages at home than the language of instruction (Prediger et al. 2018; Ufer et al. 2013; Heinze et al. 2007). Two explanatory mechanisms for these disparities are discussed in the literature.

First, the families’ socio-economic status, in particular the economic and cultural resources that a family can draw upon to support the students’ learning, are discussed as a cause of migration-related disparities. Apart from the Highest International Socio-Economic Index of Occupational Status (HISEI; e.g. Gebhardt et al. 2013), the Books-at-home Index (Paulus 2009) has frequently been used as a proxy-measure of socio-economic status, which takes the approximate number of books in the students’ home as a measure of the cultural capital of the family. From this perspective, it was shown that, in Germany, socio-economic status is particularly strongly related to mathematics skills (Baumert and Schümer 2001; Ehmke and Jude 2010) and that socio-economic status partially explains migration-related disparities in cross-sectional studies (Haag et al. 2012). Some longitudinal studies have demonstrated the relations of similar strength socio-economic status and language skills to mathematics learning gain (Paetsch et al. 2016) in school age. Morgan et al. (2011) found that only socio-economic status, and not reading skills, predicted mathematical learning gain in Kindergarten children. However, other longitudinal studies have not identified a relation of socio-economic status to mathematical learning gain in secondary school (Ehmke et al. 2006) and primary school (Ufer et al. 2013) when controlling for language skills. To summarise, existing evidence indicates that language skills do predict mathematical learning gain in school age beyond parents’ socio-economic status, but it also warrants controlling measures of socio-economic status when studying the relation between language skills and mathematics learning.

As a second mechanism, students’ language skills have been studied in the past. Indeed, migration-related differences in reading skills have been documented by several large-scale studies (Tarelli et al. 2012; Haag et al. 2012). A positive relation between mathematics skills and language skills has been demonstrated for reading comprehension at the end of grade 4 in Germany (Pietsch and Krauthausen 2006), but also as early as pre-school age (Grüßing and Schmitman gen. Pothmann 2007). Several cross-sectional studies have revealed that language skills have explanatory power regarding mathematics skills beyond measures of general cognitive ability (Heinze et al. 2007) or socio-economic status (e.g., Prediger et al. 2018; Ufer et al. 2013). In longitudinal analyses, not only overarching measures of language skills (Ufer et al. 2013), but also specific measures of reading comprehension (Paetsch et al. 2016) and vocabulary (Paetsch et al. 2015) could be identified as significant predictors of mathematics skills in elementary school. In contrast, grammatical skills could not explain a significant share of variance beyond these other two language measures (Paetsch et al. 2015). Different explanations have been proposed in the past for these effects of language skills on mathematics learning (Bochnik 2017).

1.1.1 Explanation by Text-based Test Instruments

A first explanation starts from the text-based test instruments used in most of the studies reported above. It might be that a certain share of the relation between language skills and measured mathematical learning gain is actually caused within the test situation itself, since students with lower language skills might have problems reading the test items. Along this line, prior research has identified features of mathematical test items, which specifically disadvantage learners with lower language skills (e.g. Haag et al. 2013; Shaftel et al. 2006). These studies indicate that it is not technical mathematical vocabulary which poses problems for learners with low language skills, but complex grammatical constructions such as noun constructions, unknown or ambiguous terms, or the length of the text in the tasks (Haag et al. 2013; Martiniello 2008). Research on language accommodations of mathematics tests has studied the effects of eliminating or reducing language-related difficulties in tests by splitting long sentences, paraphrasing uncommon phrases or words, providing additional time, or implementing other kinds of support for learners with a home language different from the language of instruction (Abedi et al. 2004). In their meta-analysis of studies on these language accommodations, Kieffer et al. (2009) found only weak positive effects. Providing English-Language-Learners with dictionaries had the relatively largest, but still small effect. These results indicate that difficulties in test comprehension cannot plausibly be seen as the only explanation for the observed language-related disparities in mathematics skills, and other explanations must be taken into consideration.

1.1.2 Learning-related Explanations

These explanations of language-based disparities have, for example, focused on students’ possibility of using language-based opportunities to learn during mathematics instruction. This comprises (a) comprehension of explanations by the teacher or fellow students (Erath et al. 2018), participation in the classroom discourse (Bochnik and Ufer 2016b), and access to written materials in mathematics instruction, such as textbooks (Österholm and Bergqvist 2013). Research on multilingual classrooms has illustrated that students are less involved in the classroom discourse and contribute their ideas less frequently, if their home language differs from the language of instruction (Civil 2008; Gorgorió and Planas 2001). This is particularly important since school-based mathematics learning is conceptualised as a discursive practice in research on language in mathematics learning (e.g. Moschkovich and Zahner 2018). For example, Steenpaß and Steinbring (2014) point out that even though concrete manipulatives are used to support mathematical communication in classrooms, the structure of these materials and the way mathematical concepts are encoded in their use is mediated verbally during instruction. Moreover, (b) research from very different perspectives has pointed out that weak language skills might exclude students from a meaningful participation in this discourse (Civil and Planas 2004; Abedi and Herman 2010; Heinze et al. 2007). Finally (c), other authors have described the role of language during mathematics learning and construction of mathematical meaning itself. Such approaches conceptualize mathematical thinking and knowing as a completely (Sfard 2008) or at least partially language-based process. This includes not only using language as one among several possible representations of mathematical concepts (e.g. Lesh et al. 2003), but also private or mental speech, such as during mathematical calculation (e.g. Ostad 2013), or self-explanations and verbalisations of mathematical operations based on symbols or concrete manipulatives (Sarama and Clements 2016).

To describe students’ difficulties and instructional support regarding these three uses of language in mathematics learning, the concept of language register has been used successfully in the past (Prediger and Wessel 2018). Halliday (1978) defines a register as a ‘configuration of semantic resources that the member of the culture associates with a situation type and is the meaning potential that is accessible in a given social context’. Registers are usually associated with specific linguistic features such as lexical features (e.g. vocabulary used), grammatical constructions, and text types, but also certain styles and forms of discourse. In this vein, past research has made the distinction between everyday registers (relating to Basic Interpersonal Communications Skills, BICS; Cummins 2008) and academic language registers (relating to Cognitive Academic Language Proficiency, CALP; Cummins 2008). The latter register is mostly associated with lexical features such as specific vocabulary or complex word constructions, and grammatical features such as complex sentences, passive voice, or noun phrases (e.g. Cummins 2008). The results described above on the explanatory power of skills in the language of instruction for the development of mathematics skills illustrate that mastering this academic register is an important prerequisite for benefitting from mathematical learning opportunities in school (Schleppegrell 2001). However, apart from these mostly subject-independent conceptualisations of language skills, the role of subject-specific language registers in mathematics learning has also been discussed.

1.2 Conceptualising and Measuring Subject-specific Language Skills

From a subject-related perspective, language serves as one of multiple, interlinked representations that constitute the meaning and content of mathematical concepts. Among symbolic, graphical, and concrete representations (e.g. in the form of manipulatives), verbal language is considered one central form of representation (Lesh et al. 2003). Prediger and Wessel (2013) characterise the use of language to describe mathematical concepts during classroom discourse by a specific register. This verbal school register is considered a part of the academic language register described by Schleppegrell (2001). Prediger and Wessel (2013) situate this register between an everyday register, which relates to Cummins BICS concept (Cummins 2008), and a technical register, which describes the disciplinary use of language in mathematics as a school subject. They hypothesise that the verbal school register is at least partially subject-specific, that it is crucial for an epistemic use of language during mathematics learning, but that usual classroom instruction offers few learning opportunities in school to acquire this register. Outside school, learning opportunities regarding this register are considered to vary with the degree of support offered by the students’ families, resulting in restricted access to this register for learners, for example, from families who speak languages other than the language of instruction (e.g. Schleppegrell et al. 2004).

Related but not identical to the differentiation between school and technical register, some authors differentiate between formal and meaning-related registers (Prediger and Pöhler 2015; Prediger and Wessel 2018). In this perspective, the formal register is restricted to elaborating symbolic representations and techniques (Prediger and Wessel 2018), including technical terms which should be mastered after learning a mathematical concept (Prediger and Pöhler 2015). The meaning-related register, in contrast, is connected to the description of phenomena underlying mathematical concepts (e.g. bundling quantities as a basis of the decimal system). This register overlaps with the everyday register, the school register, but also technical terms for concepts learned earlier (Prediger and Wessel 2018). Thus, what was part of the formal register while learning one concept seems to be seen as part of the meaning-related register that is necessary to acquire more advanced concepts. Based on qualitative analyses, the meaning-related register is assumed to form a central basis for the acquisition of conceptual understanding (Götze 2018). In this sense, formal registers relate to outcomes of mathematics learning, while meaning-related registers relate to prerequisites as well as outcomes of mathematics learning.

1.2.1 Subject-specific Language Skills

In this contribution, we conceptualise subject-specific language skills as a construct describing how individual students can master formal and meaning-related aspects of the mathematical verbal school register. Considering the role of meaning-related language as a prerequisite of mathematics learning, we include technical language here to the extent that it is used to describe representations, examples, or phenomena related to previously learned mathematical concepts. Such language comprises the command of vocabulary specific to the school subject, but also more complex skills such as extracting mathematical structures described verbally, for example, in word problems (cf. Gabler and Ufer n.d.). Due to its close connection to knowledge of other representations of mathematical concepts, this construct overlaps naturally with mathematical knowledge and skills (Härtig et al. 2015), as well as subject-overarching (general) language skills (Dyrvold et al. 2015). Past research, however, has shown that it is possible to conceptualise and measure such subject-specific language skills sufficiently, independently from mathematics skills as well as general language skills (Bochnik and Ufer 2017). Different approaches have been developed in mathematics education but also for other subjects, to operationalise these subject-specific language skills.

1.2.2 Subject-specific Vocabulary

The specific vocabulary of mathematics, in particular technical terms, is a central tool for mathematical precision and accuracy (Maier and Schweiger 1999). The terms used in school to discuss mathematical concepts are part of the lexical level of the formal register. A particular difficulty is assumed in technical terms which have a different or more specific meaning in the context of (school) mathematics than in everyday language (e.g. ‘function’ or ‘bundling’; Heinze et al. 2011). Based on interaction analyses of classroom transcripts, Schütte (2009) hypothesises that even though subject-specific vocabulary is often explicitly introduced in elementary school mathematics instruction, the terms are often automatized with little embedding into concrete discursive practices. When conceptualising students’ subject-specific skills in terms of vocabulary, a consensus seems to be that ‘knowing’ a mathematical term cannot be completely independent of understanding the related mathematical concept to some extent (Dyrvold et al. 2015). However, while understanding that a concept relates to (potentially) deep connections between different representations as well as real-world phenomena (Lesh et al. 2003), knowing a specific term may describe the ability to match the term to one or more examples or (alternative) representations of the concept, in contrast to examples or representations of other concepts. In our understanding, knowledge of subject-specific vocabulary goes beyond merely knowing technical terms. Subject-specific language skills in our sense include being able to connect vocabulary from the formal register to corresponding (acquired) mathematical concepts (e.g. ‘increasing’ or ‘decreasing’ with additive-subtractive structures). In this sense, knowledge of mathematical vocabulary requires some (perhaps shallow) understanding of the corresponding concept, but this should of course not be misunderstood as an indicator of deep understanding of the concept.

Schindler et al. (2019) report that over 60% of elementary school students in their samples from Switzerland and Germany could correctly identify examples for technical terms for arithmetic concepts such as ‘addition’ or ‘difference’. Lower solution rates were found for geometric terms. Bochnik and Ufer (2017) report comparable solution rates (59% on average) for items similar to those used by Schindler et al. (2019), but higher solution rates (83% on average) when third-graders were asked to name mathematical concepts such as ‘remainder’. Moreover, they found a significant correlation between knowledge of mathematical vocabulary and mathematics skills measured by a separate test. Bochnik and Ufer (2017) report that knowledge of subject-specific vocabulary cross-sectionally explained differences in mathematics skills beyond socio-economic status, general cognitive skills, and general language skills. In a study on science classrooms, Taboada (2012) demonstrates that knowledge of subject-specific vocabulary predicts science reading comprehension beyond knowledge of more general vocabulary. However, even in our broad conceptualisation, a sole focus on subject-specific vocabulary can be regarded as a reductionist perspective on subject-specific language skills, leading to the demand for alternative measures that go beyond knowledge of subject-specific vocabulary.

1.2.3 Subject-specific Text Comprehension

Current works on language in mathematics learning argue for the role of a meaning-related register of subject-specific language skills (Götze 2018). As indicated above, knowledge of mathematical vocabulary might be shallow in the sense that it does not connect to a deeper understanding of the underlying concepts. Götze (2018) reports that a substantial amount of vocabulary-related teaching is not connected to the conceptual meaning of the vocabulary. Thus, more comprehensive measures of subject-specific language skills are required, which cover meaning-related facets. However, such measures have rarely been proposed in the literature, so far. Most approaches start from the concept of C‑Tests (Eckes and Grotjahn 2006). These consist of a set of short texts, in which parts of certain words have been erased. To complete these words, it is necessary to reconstruct the meaning of the whole text from the available parts using local and global coherences in the original text. As Schütte (2009) illustrates in his case studies, much of the contents of classroom discourse remain implicit. Inferring missing information in a C-Test from the available information may be seen as parallel to reconstructing implicit or missed information when following a verbal classroom discourse. For this reconstruction, a merely shallow understanding of subject-specific language, in the sense of a formal register, is most likely not sufficient, but the meaning of the underlying concepts and their relations are required to infer the missing information. C‑Tests are considered effective, valid, and reliable measures covering a broad range of components of language skills (Eckes and Grotjahn 2006). Özcan (2013) constructed an adapted version of C‑Tests to measure subject-related language skills in chemistry, and found moderate positive correlations to subject-matter knowledge. Höttecke et al. (2017) studied C‑Tests to measure domain-independent (general) language skills as well as subject-specific language skills in physics and sports. Based on moderate latent correlations between the three constructs, they could separate the two subject-specific language skill measures from the general language skill measure empirically. While the subject-related language skills in physics correlated substantially with the physics grade, the respective correlation was not significantly different from zero for the sports test. To explain the latter finding, the authors argue that subject-related language skills might play a minor role when assigning grades in sports classes. Bochnik and Ufer (2016a) developed an instrument for mathematical text comprehension based on C‑Tests, and report that it explains differences in mathematics skills beyond socio-economic status, general cognitive skills, general language skills, and knowledge of subject-specific vocabulary cross-sectionally.

A major challenge in conceptualising and measuring subject-specific language skills is to separate the construct from general language skills and measures of mathematics skills. For example, Schindler et al. (2019) tried to reduce the linguistic and mathematical demands of their mathematical vocabulary items as much as possible by using a multiple-choice format with very brief and focused instructions. In contrast, a study by Bae et al. (2015) operationalised knowledge of mathematical vocabulary by asking the students to provide definitions for mathematical terms. It can be assumed that such measures placed much broader demands on students’ mathematical knowledge than selecting from pre-defined alternatives relating to the corresponding concepts. Solving the items of Bae et al. (2015) requires much deeper knowledge of the concepts behind the technical terms to construct the required definitions. Results on the separability of the subject-specific and general language skills are scarce. However, using confirmatory factor analyses, Bochnik (2017) could demonstrate that subject-specific language skills as measured by her instruments can be separated psychometrically from general language skills and mathematics skills.

To summarise, first approaches to measure subject-related language skills have been developed in the past, including approaches that go beyond subject-specific vocabulary. First studies have provided evidence that subject-specific language skills can be separated empirically from subject-specific knowledge with respect to skills as well as general language skills (Özcan 2013; Höttecke et al. 2017). For example, in a latent variable analysis of data from a cross-sectional study with third-graders, Bochnik and Ufer (2016a) provide evidence that subject-specific language skills partially mediate the relation between general language skills and mathematics skills. Moreover, training studies from secondary school indicate a possible causal connection between subject-specific language skills and mathematics skills (e.g. Prediger and Wessel 2018). However, it remains an open question to what extent subject-specific language skills are of particular importance for mathematics learning beyond other factors. For example, it might be argued that knowledge of mathematical vocabulary is of little relevance to explain mathematical learning progress over time, because this vocabulary is not connected to a deep conceptual understanding in the sense of a meaning-related register (Moschkovich 2002), or because it is explicitly treated in mathematics instruction anyway (Schütte 2009; Schindler et al. 2019). On the other hand, if a considerable amount of content, apart from technical vocabulary, remains implicit in classroom discourse (Schütte 2009), this might speak for the necessity to reconstruct subject-specific coherences when attending to classroom discourse, as it is measured by instruments based on C‑Tests.

1.3 A Multi-dimensional Perspective on Mathematics Skills

Much research on the role of language skills for mathematics learning has conceptualised mathematics skills as a one-dimensional construct, but there are calls for a more differentiated perspective (Suurtamm et al. 2016). Based on results from the SOKKE study, for example, Heinze et al. (2007) and Ufer et al. (2013) argue that language skills might be more important to gain deep conceptual insights into mathematics, and of less importance to acquire more or less schematic calculation skills. Paetsch and Felbrich (2015) proposed that language skills should be particularly important when learning to solve mathematical word problems. Furthermore, educational standards worldwide proposed a multi-facetted view of mathematics skills, focusing on a variety of conceptual fields, mathematical practices, and levels of conceptual demand (NCTM 2000; CCSSI 2010; KMK 2004). A prominent differentiation has been between knowledge of mathematical concepts versus the ability to perform mathematical operations efficiently and correctly. Extending such works, Bleiler and Thompson (2013) describe a so-called SPUR approach towards mathematics assessment that differentiates the four dimensions performing (schematic) operations (S), using properties to solve mathematical problems (P), using mathematical concepts as a functional tool outside mathematics (U), and dealing with representations of mathematical concepts (R). While the framework originates from a curriculum development approach, these task types pose different demands on students’ mathematical skills and they can be assumed to involve language skills in different ways to decode the task, to solve the task, and to acquire the knowledge necessary to solve the task.

Tasks of dimension S require that operations be performed, e.g. symbolic arithmetic procedures. They usually only contain a few words in the task text (e.g. ‘Calculate!’). The necessary operations can—in principle—be solved using explicitly learned solution strategies, which can be applied on a purely symbolic level without necessarily making cognitive use of (meaning-related) language. Moreover, it has been argued that symbolic schemata might be easier to grasp in classroom instruction than a deeper understanding of the properties and relations of mathematical concepts (Heinze et al. 2007; Ufer et al. 2013). All this speaks against a strong role of language skills in performing such operations and learning the corresponding skills. Indeed, only weak language-related differences were found in grades 1 and 2 for performance on tasks of this type (Heinze et al. 2007; Ufer et al. 2013). However, results on the relation between procedural and conceptual knowledge indicate that knowledge of procedures is also acquired in interaction with conceptual knowledge (Schneider 2006; Rittle-Johnson et al. 2015). Learning to cope with more conceptual demands has, in turn, been connected to language skills in past studies (Heinze et al. 2007; Ufer et al. 2013). Thus, it remains an open question as to whether relations between language and mathematics skills might not be more pronounced in later grades of primary school.

For the other three types of assessment tasks, students’ performance can be assumed to rely strongly on their conceptual knowledge: Rich knowledge of the concepts underlying a problem has been found to be a major predictor of mathematical problem-solving skills (Ufer et al. 2008). Knowledge about the meanings of mathematical concepts in real-world situations is considered an important prerequisite to apply mathematics to the problems in the real world (Stern 1994). Finally, it has been argued that the transfer between semantic representations such as visualisations and manipulatives of mathematical concepts and their symbolic representation is central to conceptual understanding in mathematics (Lesh et al. 2003). As noted above, in prior studies, the acquisition of conceptual knowledge has been considered to be specifically sensitive to students’ language skills (Heinze et al. 2007; Ufer et al. 2013), so that it can be expected that these three dimensions, P, U, and R, are particularly sensitive to students’ general and subject-specific language skills.

Among these three dimensions, tasks to use concept properties to solve mathematical problems (P) represent conceptual understanding of mathematical concepts most broadly. Even if language demands within the text can be limited for problem-solving items, the strong connection of problem-solving skills to conceptual understanding speaks for a close relation between performance on these tasks and subject-specific language skills. Using mathematical concepts in real-world situations (U) may be assumed to demonstrate the strongest relation to mathematical text comprehension skills, since reconstructing verbal descriptions of mathematical structures from the problem text is seen as a central barrier in word-problems (Paetsch and Felbrich 2015). This holds for solving the test items, but also during classroom instruction, because world-problems can be assumed to be mainly presented in text form also during instruction. In contrast, mathematical representations are usually explained verbally during classroom discourse (Steenpaß and Steinbring 2014). Connecting these representations to their mathematical meaning during instruction will most likely rely strongly on the availability of subject-specific vocabulary. To summarise, if there is any inter-individual relation between subject-specific language skills and mathematics skills beyond general language skills, performance in and development of the three conceptual dimensions (P, U, R) can be expected to correlate with subject-specific language skills. Among these, a pronounced connection can be expected between subject-specific text comprehension with skills and using mathematical concepts to solve real-world problems (U) as well as between subject-specific vocabulary with skills and dealing with mathematical representations (R).

Although these first hypotheses can be proposed in relation to the specific importance of general language skills to acquire different facets of mathematics skills, their differential dependency on general and subject-specific language skills has not yet been approached empirically in the literature. The most common hypothesis is that the acquisition of conceptual understanding (mirrored specifically in the P, U, and R facets) will demonstrate a stronger link to language skills than skills relating to the execution of more or less schematic procedures.

2 A Longitudinal Study on the Role of Subject-specific Language Skills

2.1 Goals and Research Questions

Based on prior findings, the role of general skills in the language of instruction when learning mathematics can be seen as mostly established for the elementary school age (Heinze et al. 2007; Ufer et al. 2013) as well as for secondary school (Prediger et al. 2018). However, open questions remain. Firstly, several authors claim that language skills are particularly important for more conceptual demands such as problem-solving based on concept properties, functional use of mathematics, and representation of mathematical concepts, as compared to more schematic, technical demands (Heinze et al. 2007). However, the most evidence for this hypothesis stems from research in the first year of school. Its generalisability to later years of (elementary) schooling is not straightforward, since technical aspects of mathematics, such as calculation strategies or algorithms, also build on sophisticated mathematical ideas such as place value, or properties of arithmetic operations. Thus, one goal of the current study was to investigate the role of language skills for mathematics learning with grade 3 learners, and to analyse the effects of language skills on a variety of dimensions of mathematics skills. Our specific focus was on arithmetic, which forms a central part of the contents of elementary school mathematics instruction.

Q1

To what extent do general language skills predict mathematics skills at the end of grade 3, beyond prior mathematics skills, cultural capital, and general cognitive skills?

Theoretical and prior research indicates that general language skills in the language of instruction are related to the development of mathematics skills. Thus, we expected a significant contribution of general language skills above prior mathematics skills and other control variables when predicting overall mathematics skills at the end of grade 3. However, based on prior results from studies in earlier grades (Heinze et al. 2007), we expected that this would hold for the three subdimensions that reflect conceptual understanding (P, U, R), but not for schematic skills (S).

Secondly, prior research has highlighted the potential role of subject-specific language skills for mathematics learning. Although there are promising results of first intervention studies establishing learning opportunities for mathematics-related language skills, research explicitly measuring such skills and investigating their role for learning mathematics is scarce. This holds in particular for the elementary school age, longitudinal studies, and a nuanced view on the broad concept of subject-related language skills, ranging from having a command of mathematical vocabulary to the comprehension of mathematical structures presented in verbal form. To contribute to filling this research gap, we included different measures of subject-related language skills.

Q2

To what extent can prior subject-specific language skills predict mathematics skills at the end of grade 3, above and beyond the general language skills, prior mathematics skills, cultural capital, and general cognitive skills?

Based on prior statements on the role of subject-specific language skills, we expected that they would predict overall mathematics skills at the end of grade 3 beyond the variables analysed for question Q1. Again, we expected a significant relation for the three dimensions that involve conceptual understanding, but not for schematic skills.

Q3

Which of the two facets of subject-specific language skills (mathematical vocabulary and mathematical text comprehension) contributes more strongly when predicting mathematics skills at the end of grade 3?

Little prior research has studied different facets of subject-specific language skills. Based on prior research, both facets can be expected to relate to learning gain regarding understanding of mathematical concepts. The first hypotheses described in 1.3 lead to the expectation that subject-specific text comprehension will be linked to learning to use concepts to solve real-world problems (U), while knowledge of subject-specific vocabulary will relate to learning to interpret and use mathematical representations (R).

2.2 Method: Sample, Instruments, Analysis Strategy

To answer the questions raised above, we conducted a longitudinal study with grade 3 students from eight elementary schools in southern Germany. Out of an initial sample of 383 students, N = 237 (111 female, 126 male) remained in the longitudinal sample after 8 of 24 classes did not participate in the second measurement. The number of students who did not speak German at home differed only marginally between the whole sample (39.8%) and the selected subsample (39.7%). This indicates that drop-out from the study was not connected to students’ language background. The students participated voluntarily and with informed parental consent after the research team had approached their teachers. The study consisted of two measurement times, one in autumn 2013 (T1, see also Bochnik and Ufer 2016a) and one in summer 2014 (T2), each consisting of two sessions on different days within two weeks.

2.2.1 Instruments—Mathematics Skills (T1 and T2)

For each of the two measurements, four scales of mathematics tasks were developed focusing on different dimensions of mathematics skills. These dimensions were based on an adaptation of the SPUR framework to the regional elementary school curriculum. Items for the dimension schematic skills (T1: 9 items; T2: 10 items) focused on symbolic two-digit (T1) and three-digit (T2) addition and subtraction respectively as well as single-digit multiplication and division. Items for the dimension problem-solving based on properties (T1: 15 items; T2: 16 items) of mathematical concepts involved non-standard tasks that were posed in an intra-mathematical context (e.g. write down all multiplications that have 18 as a result). The dimension using mathematical concepts (T1: 6 items; T2: 8 items) involved single- and multistep word problems. Finally, representing mathematical concepts (T1: 18 items; T2: 18 items) focused on representing numbers and operations using mathematical visualisations such as the place-value table, base-ten-blocks, the number line, or a 10 × 10 point grid for multiplications. Figs. 123 and 4 in the appendix provide example items for each scale. All items were piloted and adapted repeatedly to include only those language demands which were necessary to communicate the task requirements. Separate mean scores were calculated for each of the four dimensions at each measurement. Moreover, we generated an overall score of mathematics skills for each of the two measurements by averaging the z‑standardised scores from the four dimensions. The test scores were not linked between the two measurements, so that the scores cannot be compared directly between measurements of the same scale at different times.

2.2.2 Instruments—Subject-specific Language Skills (T1)

Three scales of items to measure subject-specific language skills at T1 were developed to study their relation to the development of mathematics skills. These scales mirror the differentiation of knowledge of subject-specific vocabulary and text comprehension.

Regarding mathematical vocabulary, two scales were developed. One scale (7 items) presented the students with tasks or pictures, in which a mathematical symbol or a part of the picture are marked with an arrow. Children were asked to describe the meaning of this symbol with an appropriate term. In some cases, a few written words provided the students with the start of a short sentence that had to be completed with the respective term (cf. Fig. 5 for an example item). Students’ answers were coded as correct if the correct word could be identified. Thus, correct spelling was not necessary for a correct answer. Since students were asked to actively produce technical terms in these items, we refer to this scale as knowledge of active subject-specific vocabulary. The second subject-specific vocabulary scale (7 items) consisted of items that presented students with one single subject-specific term each. These terms referred to a mathematical concept (e.g. ‘Verdopplungsaufgabe’—doubling task) or concrete operations that can be described arithmetically (e.g. ‘dazukommen’—increasing). Below each term, five different possible descriptions of this operation with mathematical symbols were provided (cf. Fig. 6). The students were asked to select the alternative that best fitted the given term. Since students were provided with subject-specific vocabulary and had to select an appropriate example, we refer to this scale as knowledge of passive subject-specific vocabulary. Five alternative options were provided to reduce the chance of guessing the correct solution, while not overstraining students with too many alternatives. The terms used in both scales comprised typical technical terms from elementary school instruction (e.g. smaller number, neighbour number, remaining the same). Both scales did not presuppose deep conceptual knowledge of the related concepts, but required students to provide (active scale) a mathematical term for examples or representations of a mathematical concept, or to identify (passive scale) the correct example or representation for a given mathematical term. For both scales of mathematical vocabulary, students had 10 min to complete all items. The ratio of correctly solved items per scale was calculated for each scale. As a measure of subject-specific vocabulary, we calculated the mean of the scores for the two scales.

To measure mathematical text comprehension, three short texts were developed that referred to mathematical concepts. One text provided a story from everyday life (cf. Fig. 7), two texts resembled written adaptations of texts that could arise in typical interactions in the mathematics classroom (e.g. describing a strategy to calculate 34 + 7). The texts consisted of 43–67 words. In each text, we removed 7–10 words, leaving only the first one to four letters. To create these gaps, we deleted parts of words that carry the central mathematical meaning of the text, but which can be reconstructed from the mathematical meaning of the rest of the text. In particular, it was not possible to reconstruct the words using only the surface structure of the text. Students had 15 min to complete all three texts. We coded each completed word independently as correct or incorrect. Again, correct spelling was not necessary as long as a word could be identified that fitted the meaning of the text. For each text, the ratio of correctly filled-in gaps was calculated and the mean of these scores from the three texts was calculated.

To generate an overall indicator for subject-specific mathematics skills, we calculated the mean of the z‑standardised scores for active mathematical vocabulary, passive mathematical vocabulary, and mathematical text comprehension.

2.2.3 Instruments—Control Variables (T1)

As a control variable, general language skills were measured using the SFD 3–4 (Hobusch et al. 2002), an approved instrument which has been used in prior research. It covers general vocabulary, prepositions, articles, and listening comprehension. General cognitive skills were surveyed with the Culture Fair Test, Scale 1 (CFT 1; Cattell et al. 1997). As a rough proxy measure of cultural capital, we asked students to estimate the number of books in their homes on a scale from ‘very few (0–10)’, ‘one book shelf (11–25)’, ‘one bookcase (26–100)’, and ‘two bookcases (101–200)’ up to ‘more than 200 books’. Each answer choice was accompanied by a picture showing a number of books in the respective range arranged as in the answer option. This is an approved instrument to measure cultural capital using student reports at the elementary school level (Paulus 2009). To survey the language spoken at home, we asked each student to report the languages she or he can speak, which languages she or he most often uses to communicate with mother, father, and siblings (if available) and which language their parents use to communicate with each other. For every question, a range of typical languages was provided for selection. Children could select multiple languages, and add other languages which were not on the list, and were asked to underline the language they speak most. Based on these data, a language spoken at home different from the language of instruction was coded if the underlined language spoken with both parents was not German.

Table 1 provides an overview of descriptive statistics and internal consistencies for the instruments used in the analyses. For some scales, the number of items differs from the number of developed items, as some items did not fit quality criteria in the evaluation process (Bochnik 2017)Footnote 1. Table 11 in the appendix contains information about the language-related features of the test items in this study and, for comparison, in the studies by Haag et al. (2013) and Paetsch et al. (2015).

Table 1 Mean solution rates, standard deviations, and internal consistencies for the instruments used in the analyses

2.2.4 Data Analysis

Multiple linear regression analyses were used to analyse the relations between the different constructs. The overall mathematics skill score at T2 was used as a dependent variable, and also the scores at T2 relating to each subdimension of the SPUR model. To describe inter-individual differences in students’ development of mathematics skills and to model learning gain, mathematics skills in the first measurement were included as a covariate in all regression analyses. In a first baseline model (model 0), language spoken at home (German vs. non-German), cultural capital, and general cognitive skills were included as predictors. Subsequently, general language skills (model 1) and the overall score for subject-specific language skills (model 2) were added to this initial model. In models 3a and 3b, the overall score for subject-specific language skills was replaced by the separate scores for mathematical vocabulary and mathematical text comprehension. Dependencies between students from the same class were controlled using the TYPE=COMPLEX option in the MPLUS software (Muthén and Muthén 2017).

3 Results

Tables 2345 and 6 show the results of planned stepwise regression analyses to investigate questions Q1–Q3.

Table 2 Regression models for the dependent variable overall mathematics score (T2)
Table 3 Regression models for the dependent variable schematic skills (T2)
Table 4 Regression models for the dependent variable problem solving with concept properties (T2)
Table 5 Regression models for the dependent variable using concepts (T2)
Table 6 Regression models for the dependent variable representing concepts (T2)

3.1 Preliminary Analyses: Control Variables

In a first step, regression analyses were conducted for the overall mathematics skill score and for each of the four subdimension scores at T2, using the respective T1 measures as well as language spoken at home, cultural capital, and general cognitive skills as independent variables (Tables 2345 and 6, models 0, 0.S, 0.P, 0.U, and 0.R). In all models, the respective measure of mathematics skills from T1 was a significant predictor (e.g. Table 2, model 0, β = 0.74, p < 0.001). Students with better skills at T1 demonstrated better skills at T2, as expected. As prior research has found (Ufer et al. 2013), the language spoken at home was not a significant predictor of learning gain, when prior skills, cultural capital, and general cognitive skills were controlled (e.g. Table 2, model 0, β = 0.02, p = 0.59). For the overall score, cultural capital was the only additional significant predictor (Table 2, model 0, β = 0.10, p = 0.01), and this relation was significant for the dimensions problem-solving with conceptual properties (Table 4, model 0.P, β = 0.16, p < 0.01), using concepts (Table 5, model 0.U, β = 0.14, p < 0.001), and representing concepts (Table 6, model 0.R, β = 0.11, p = 0.01), while it was slightly weaker for schematic skills (Table 3, model 0.S, β = 0.08, p = 0.07). As for general cognitive skills, its relation to overall learning gain was only a tendency (Table 2, model 0, β = 0.07, p = 0.07), although it demonstrated a significant relation to learning gain for each of the four separate dimensions.

3.2 Prediction by General Language Skills (Question Q1)

When including general German language skills in the models, they significantly predicted mathematics skills at the end of grade 3 beyond prior mathematics skills (Table 2, model 1, β = 0.15, p < 0.001). The same pattern could be observed for schematic skills (Table 3, model 1.S, β = 0.17, p < 0.01), and was substantially stronger for using concepts (Table 5, model 1.U, β = 0.30, p < 0.001) and representing concepts (Table 6, model 1.R, β = 0.29, p < 0.001). For problem-solving (Table 4, model 1.P, β = 0.06, p = 0.48), general language skills showed no relation to learning gain.

To summarise, the results regarding question Q1 for the overall score are in line with prior findings in the literature: General language skills relate to mathematical learning gain (i.e. predicted mathematics skills in grade 3, controlling for prior mathematics skills). That this relation is significant not only for using concepts (U) and representing concepts (R), but with a slightly weaker relation also for schematic skills (S), differs from prior cross-sectional findings (Heinze et al. 2007) as well as longitudinal findings for grade 1 (Ufer et al. 2013). Finally, also not in line with the hypotheses found in prior research, general language skills did not relate to learning gain regarding problem solving based on concept properties (P).

3.3 Prediction by Subject-specific Language Skills (Questions Q2 and Q3)

Regarding subject-specific language skills (question Q2), the predictivity of an overall score combining all three indicators (active vocabulary, passive vocabulary, mathematical text comprehension) was first examined. For the overall score (Table 2, model 2, β = 0.10, p = 0.046), as well as for the single subdimensions schematic skills (Table 3, model 2.S, β = 0.19, p < 0.01), using concepts (Table 5, model 2.U, β = 0.17, p < 0.01), and representing concepts (Table 6, model 2.R, β = 0.14, p < 0.01), subject-specific language skills significantly predicted mathematics skills in grade 3 beyond general language skills and the respective prior mathematics skills. While the strengths of these relations were similar, subject-specific language skills did not significantly predict problem solving based on concept properties (Table 4, model 2.P, β = 0.03, p = 0.58).

In two final models, the overall score for subject-specific language skills was replaced by the scores for mathematical vocabulary (models 3a) with respect to mathematical text comprehension (models 3b) to analyse the relative predictivity of these two measures (question Q3). Due to the substantial correlation between the two measures (r = 0.72, p < 0.001), we did not analyse them in a joint regression model. For the overall score, mathematical text comprehension demonstrated a similar relation to learning gain as the overall measure of subject-specific language skills (Table 2, model 3b, β = 0.11, p = 0.04), but the respective relation did not reach significance for mathematical vocabulary (Table 2, model 3a, β = 0.09, p = 0.05). For the subdimensions schematic skills (Table 3, model 3a.S, 3b.S), using concepts (Table 4, model 3a.U, 3b.U), and representing concepts (Table 6, model 3a.R, 3b.R), both indicators showed similar predictivity as the overall score for subject-specific language skills. Descriptively, these relations seemed to be slightly stronger for mathematical text comprehension than for mathematical vocabulary, in particular when predicting learning gain in representing concepts (Table 6, models 3a.R, 3b.R). To test this difference statistically, we compared the fit of two models including both predictors, mathematical text comprehension, and mathematical vocabulary: One model with the two standardised regression coefficients constrained to be equal, and one with freely estimated standardised regression coefficients. A Chi-square test of model fit did not indicate a significantly worse fit of the constrained model compared to the model with free standardised regression coefficients (χ2 (1) = 1.13, p = 0.29). The same pattern was observed for the overall score (χ2 (1) = 0.01, p = 0.92), schematic skills (χ2 (1) = 0.01, p = 0.93), using problem solving based on concept properties (χ2 (1) = 0.05, p = 0.82), and using concepts (χ2 (1) = 0.23, p = 0.63).

4 Discussion

The main goal of the current contribution was to study the predictive value of mathematics-specific language skills beyond prior mathematics skills, general language skills, general cognitive abilities, and cultural capital on mathematics learning during grade 3. Furthermore, apart from a one-dimensional conceptualisation of mathematics skills, a differentiated perspective was adopted in accordance with the SPUR-approach (Bleiler and Thompson 2013). We will first discuss the results for the one-dimensional conceptualisation, and then for the four subdimensions of mathematics skills. We will derive final conclusions in light of the study’s limitations and highlight central implications for research and development.

4.1 Results from a One-dimensional Perspective on Mathematics Skills

Regarding the variables which predict overall mathematical learning gain, our results point to a central role of language skills above related measures, such as general cognitive abilities, language spoke at home, and cultural capital. These results on general language skills replicate prior results from the primary (Heinze et al. 2007; Ufer et al. 2013) and secondary level (Prediger et al. 2018) and underpin the well-established relevance of general language skills for mathematics learning.

Our results on subject-specific language skills extend these results. In our study, subject-specific language skills demonstrated a significant relation to mathematical learning gain beyond general language skills and the other control variables (e.g. Table 2, model 2). This result indicates that it is not only important to master aspects of the academic language register in general. Having a certain command of a subject-specific school register as conceptualised, for example, by Prediger and Wessel (2013) has additional impact on elementary school students’ mathematical learning progress. Intervention studies in secondary school have provided first evidence (Prediger and Wessel 2018) that a causal effect might stand behind this relation. This underpins the importance of subject-specific language support which is closely connected to the concepts at hand during mathematics instructions.

Beyond intervention studies, in which usually only a few, quite broad dimensions of language skills and support can be studied systematically, our longitudinal, correlational approach allows us to explore the importance of different facets of subject-specific language skills separately. Of course, the inclusion of relevant competing explanatory variables, such as cultural capital, general cognitive skills, and of course general language skills, is vital to arrive at valid conclusions. Controlling for these variables, our results indicate that in particular mathematical text comprehension relates to the acquisition of mathematics skills. Knowledge of mathematical vocabulary demonstrated a descriptively positive, but not significant relation to learning gain for the overall measure of mathematics skills. This underpins hypotheses from qualitative research and theoretical assumptions on the role of skills relating to meaning-related registers that go beyond isolated knowledge of technical terms (Moschkovich 2002; Götze 2018). Skills in mathematical text comprehension were measured using an instrument based on the idea of C‑Tests. Similar instruments of subject-specific language skills have also been developed for other school subjects in the past (Höttecke et al. 2017; Özcan 2013), but results on their longitudinal predictivity have been rare, up to now. The current study is the first to provide direct longitudinal evidence regarding the importance of the two facets of subject-specific language skills for the acquisition of mathematics skills.

4.2 Results from a Multi-dimensional Perspective on Mathematics Skills

Following hypotheses on diverging effects of language skills on different dimensions of mathematics skills, we applied an instrument to measure mathematics skills that was conceptualised to provide separate scores for each of the four SPUR dimensions (Bleiler and Thompson 2013). The results here largely reproduce the effects of language skills observed for the overall learning gain: Subject-specific language skills predict learning beyond general language skills and the other control variables. In these cases, the relations can be traced back to both indicators, mathematical vocabulary and mathematical text comprehension, with a slightly stronger relation for mathematical text comprehension.

Studying the effects of subject-specific text comprehension was a central innovation in our study. As expected, this measure demonstrated significant relations to learning gain in representing mathematical concepts (R) and in using mathematical concepts (U). Both dimensions are typical content of elementary school mathematics instruction, and both dimensions have been hypothesised to be strongly related to language skills before (Steenpaß and Steinbring 2014 for representations; Paetsch and Felbrich 2015 for word problems), because the meaning of mathematical representations is negotiated mostly verbally in the classroom. Similarly, mastering the different verbal means to represent mathematical concepts in word problems (cf. Gabler and Ufer n.d.) seems not only related to mathematical vocabulary, but to mathematical text comprehension. Thus, these results illustrate the importance of subject-specific text comprehension beyond other measures such as general language skills for some dimensions of mathematics skills, and they extend prior research on approaches to measure such skills (Höttecke et al. 2017; Özcan 2013).

Moreover, the relation to mathematical vocabulary is interesting, since Haag et al. (2013) find that mathematical vocabulary is not a reason for second language learners’ difficulties in mathematics assessments. Although it has been pointed out that mathematical vocabulary is usually explained explicitly in the classroom (Haag et al. 2013), it seems plausible that its predictive power in our study mirrors the role of subject-specific vocabulary when learning and teaching mathematics. The descriptively slightly stronger relation of mathematical text comprehension with learning gain is, again, in line with theoretical assumptions and hypotheses from qualitative research on the role of meaning-related registers (Moschkovich 2002; Götze 2018).

Due to the restricted sample size and the strong correlation between the two measures of subject-specific language skills, we could not find any indications that one of the two measures is a stronger predictor of mathematics skills than the other. This is particularly interesting since prior analysis of data from the first measurement has demonstrated that knowledge of mathematical vocabulary can be differentiated from mathematical text comprehension (manifest correlation r = 0.73; cf. Table 8 in the appendix). It remains an important direction for further research to examine the relation between and the effect of these two measures in detail and empirically test assumptions on the specific role of skills regarding meaning-related registers. Our test of knowledge of mathematical vocabulary only requires a relatively shallow connection between mathematical vocabulary and conceptual knowledge. Of course, it is a desirable goal of mathematics instruction in general that students make deep connections with the meanings of these terms (Götze 2018). However, it remains an open question as to whether these differences in deep connections can further explain learning, or whether knowledge of vocabulary, as we operationalised it, might not be sufficient to support further learning. This would be of specific interest since such understanding might be easier to acquire in initial learning, and could form a basis for deeper elaboration in subsequent learning.

Beyond these commonalities to the one-dimensional analysis, there are interesting deviations from the picture observed there. Firstly, both general as well as subject-specific language skills did not explain learning gain regarding problem solving based on concept properties (dimension P) beyond the control variables. For this measure, learning gain was best explained by general cognitive abilities and the families’ cultural capital. This result contradicts expectations from the literature that language skills are most central for more conceptual dimensions of mathematics skills. One reason for this result could be that we deliberately selected non-routine tasks to measure subdimension P. These tasks require students to develop their own solution approaches, based on their knowledge of the involved concepts. At least analyses from secondary school have demonstrated that such tasks are quite rarely implemented in classroom instruction (Jordan et al. 2008). Thus, being able to develop strategies to approach these tasks might be less a matter of school-based learning processes and perhaps more one of informal learning opportunities outside school. Individually, the use of such informal learning processes can plausibly be assumed to be related to general cognitive skills, and it is also plausible that they are more intensively initiated in families that have more cultural capital at their disposal. A similar explanation would not apply to representing and using mathematical concepts (which showed significant relations to language skills), since the representations and word problems used in our study are typical content of elementary school mathematics instruction. However, further research is necessary to investigate this result in more detail.

Secondly, regarding the hypothesis that general and subject-specific language skills are of lower importance for the acquisition of more schematic aspects of mathematics skills (dimension S) than of more conceptual dimensions, our study could not replicate the cross-sectional results (Heinze et al. 2007) and longitudinal results (Ufer et al. 2013) from grade 1. Although the relation of general language skills was descriptively weaker for schematic skills than for representing and using concepts, schematic skill related significantly to general and subject-specific language skills. One explanation might be the difference in mathematical content between grade 1 and grade 3: While numbers until 20 in grade 1 can, in principle, be represented and discussed as compositions of single objects, this is no longer possible for the numbers up to 1000 in grade 3. Here, handling these numbers, also in terms of calculation techniques and procedures, requires substantial conceptual discourse regarding the place-value system (Fuson et al. 1997), as well as calculation strategies (Heinze et al. 2018). Thus, the increasingly complex conceptual background of seemingly schematic calculation procedures in grade 3 might lead to a stronger impact of language skills on the acquisition of calculation procedures when progressing through the elementary school grades. This interpretation would be in line with findings indicating bidirectional relations between conceptual and procedural knowledge when learning mathematics (Schneider 2006; Rittle-Johnson et al. 2015).

Finally, there is an interesting difference between our results on using mathematical concepts and Paetsch and Felbrich’s (2015) analysis of learning gain in mathematical word problems during grade 3. While we find a relation between initial language skills and mathematics skills at T2, when controlling for mathematics skills at T1, Paetsch and Felbrich found that initial language skills do not predict learning gain in word problems. In their study, the same students who progressed in language skills during grade 3 also improved in solving word problems. On the one hand, this result might be due to methodological differences: While our study applied autoregressive models to investigate learning gain, Paetsch and Felbrich explicitly modelled learning gain as a latent IRT variable. A different explanation might lie in the different samples: While our study did not select students with a specific profile, Paetsch and Felbrich’s analysis only included students who spoke German as a second language. In combination, both results might indicate that language skills are important to make progress in terms of word problems (our study), but that this relation is not observable if language skills are particularly low at the start of grade 3. In the latter case, e.g. for many students with German as a second language, it might be more important to reach a sufficient level of language skills during grade 3, even if their skills were comparably low initially.

4.3 Limitations

Although the presented study can add new insights on the role of general and subject-specific language skills in mathematics learning, the results should be taken with some qualifications. Naturally, the design of our study did not allow us to describe the development of mathematics skills in absolute terms, but only to relate interindividual differences in this development to other variables. Since the latter was the main purpose of our study, we assume that this is a minor limitation. Further, we decided, firstly, to focus on arithmetic which represents only a part of the contents of elementary mathematics instruction (KMK 2004). The results of Schindler et al. (2019) indicate that subject-specific vocabulary might involve more nuanced complexities in geometry, for example in terms of mathematical class inclusion (e.g. every square is a rectangle). Thus, the effects of subject-specific vocabulary might be even stronger in geometry if handling geometric terms indeed turns out to be more difficult. As regards mathematical text comprehension, studying this construct in other mathematical content areas will be an important step for future research. Although arithmetic skills form a central goal of elementary school instruction, an extension of this line of research to geometry and, possibly, chance and data might be of interest.

Secondly, the longitudinal, correlational design of our study poses some fundamental restrictions. For example, although a substantial number of relevant control variables were taken into account, strong causal conclusions cannot be drawn from this kind of study. Experimental intervention studies are necessary to provide such causal evidence. First studies which show an effect of interventions focusing on subject-specific language on mathematics skills exist for secondary school level (Prediger and Wessel 2018). The study by Schindler et al. (2019) at the elementary school level did not report effects on mathematical skills beyond mathematical vocabulary. Although such intervention studies are of vital interest to establish causal connections between general and subject-specific language skills and mathematics skills, they are restricted in the resolution of different dimensions of general or subject-specific language skills. Correlational studies, such as the one presented here, can provide more differentiated insights into the complex relation between different facets of language skills and mathematics skills.

Thirdly, we decided to use self-developed instruments in our study. This restricts comparability to other studies (which also use a variety of instruments). The ‘schematics skills’ scale, however, contains similar tasks as can be found in most standardised tests of elementary school mathematics (e.g. Roick et al. 2004). The final word on the validity of the SPUR framework to describe mathematics skills broadly has not yet been spoken. However, applying this framework connects our work to current assessment and teaching approaches, which emphasise the mathematical practices students conduct with a given concept, i.e. how students make use of a concept (e.g. CCSSI 2010; KMK 2004). Concerning reliability, in particular the instruments on subject-specific language skills warrant further optimisation. In particular, the reliability coefficient for active vocabulary is low. Unless this is resolved, the findings might be taken with a grain of salt and require further studies with improved instruments.

Finally, our study with N = 237 grade 3 students has, of course, a restricted power to identify relations between students’ learning prerequisites and their learning gain. In particular, the large number of control variables makes it difficult to identify cumulative effects beyond their explanatory power. For example, comparing the predictive strength of mathematical vocabulary and mathematical text comprehension with sufficient power would require a larger sample size, due to the high correlation between the two measures. Our study could reliably detect some relations between subject-specific language skills and mathematics skills. When considering the size of the regression coefficients descriptively, it seems plausible that studies with larger samples will be able to establish more of the detailed relations. For the current study, this means that insignificant coefficients should not be taken as evidence for strong claims that certain learning prerequisites are unimportant. However, for educational practice, it is primarily vital to identify those prerequisites that have the strongest relation to learning gain. Although larger studies might provide an even more nuanced picture in the future, the results obtained with our sample still have important implications.

4.4 Conclusions and Implications

To our knowledge, the present study is the first to measure different dimensions of mathematics-specific language skills explicitly and to relate them to mathematical learning gain longitudinally. Our results underpin the importance of general and subject-specific language skills for mathematics learning already in elementary school. This indicates that approaches to support students’ language acquisition within subject-related instruction (e.g. Schindler et al. 2019; Wessel and Erath 2018) may provide effective means to foster mathematical learning beyond the acquisition of subject-specific language skills. Accordingly, developing such approaches based on educational theories of the role of language in mathematics learning as well as empirical evidence on the mechanisms assumed in these theories remains a central desideratum (Gabler and Ufer n.d.). In this context, it will be of interest if migration- or language-related disparities in mathematics achievement can be reduced by a stronger focus on subject-specific language skills in classroom education.

Our study integrates two perspectives on subject-specific language skills: A perspective focusing on subject-specific vocabulary (Schindler et al. 2019; Taboada 2012) and a perspective focusing on measures of subject-specific text comprehension, often with instruments based on the idea of C‑Tests (Höttecke et al. 2017; Özcan 2013). On the one hand, it was found that both types of measures can reliably be combined into a comprehensive measure of subject-specific language skills. Moreover, the effects of these two indicators seemed to be similar in our study. Future research is necessary to investigate if a focus on only one of the two facets, mathematical vocabulary or mathematical text comprehension, is suited to support students’ learning.

Finally, our study aimed to investigate hypotheses regarding the varying role of general and subject-specific language skills on the acquisition of deep knowledge about mathematical concepts vs. of more schematic calculation procedures. This differentiation was made in an ad-hoc manner in prior studies, e.g. based on subscale analyses (Heinze et al. 2007) or factor analyses (Ufer et al. 2013) of existing tests. For this study, we designed an instrument to measure mathematics skills based on a multi-dimensional conceptualisation of mathematics skills (SPUR approach: Bleiler and Thompson 2013). The results point towards a more pronounced role of language skills also for the acquisition of calculation procedures in grade 3 than observed on the prior studies in grade 1. If this pattern of results can be sustained, it would indicate a danger that language-related problems in mathematics might be difficult to observe based on calculation skills in lower school grades, but accumulate and extend to all dimensions of mathematics skills with increasing complexity of school mathematics. This warrants an early diagnosis of schematic as well as conceptual mathematics skills, but also of general and subject-specific language skills already at early levels of elementary school to provide instructional support as early as possible. Instruments such as those developed for this study might be adapted to these earlier levels and serve as a basis for these diagnostics.