Introduction

One stated goal of teacher education at university level is to equip future teachers with the competencies needed to plan lessons as well as to teach and reflect upon the teaching–learning processes professionally (Carlson & Daehler, 2019). The knowledge, skills and motivational orientations which pre-service science teachers are asked to develop during teacher education are suggested to encompass professional knowledge, motivational orientations and self-regulation, as well as beliefs, values and goals related to teaching and learning (Baumert & Kunter, 2013). With reference to the seminal work by Shulman (1986), professional knowledge (including practical knowledge and skills) can be further subdivided into content knowledge (CK), pedagogical content knowledge (PCK), pedagogical knowledge (PK) and further knowledge dimensions (e.g. curricular knowledge). For pre-service science teachers, CK includes science content knowledge, procedural knowledge and epistemic knowledge related to science (Bybee, 2014; Großschedl et al., 2015). The knowledge and skills needed for scientific problem-solving and reasoning in science are seen as ‘[a] necessary element of any competent teacher of science’ (Osborne, 2014, 189).

Several studies investigate the development of pre-service science teachers’ knowledge, skills or competencies related to scientific reasoning (e.g. Hartmann et al., 2015; Mahler et al., 2021; Stammen et al., 2018). These studies provide suggestions about ways to foster scientific reasoning competencies (SRC) in science teacher education and identify specific factors that significantly contribute to competence development in this domain; however, most of the studies available so far investigate the contribution of such factors within specific contexts, such as single universities with specific study regulation programmes and curricula, or specific interventions (e.g. Khan & Krell, 2019; Stammen et al., 2018). The specificities of programme regulations and the resulting confounding of variables—such as pre-service science teachers’ age and cumulative learning opportunities over time—make analyses of the contribution of specific factors challenging to generalise beyond the specific contexts of the studies. For example, university teacher education programmes can be organised concurrently (with disciplinary and pedagogical studies within the same programme) or consecutively (with pedagogical studies following a pure disciplinary study programme) (Cofré et al., 2022; Zuzovsky & Donitsa-Schmidt, 2017).

This study examines the contribution of three of the proposed factors suggested in the literature (Hartmann et al., 2015; Limueco & Prudente, 2018; Schwichow & Nehring, 2018) on the development of pre-service science teachers’ SRC: the amount of science education classes (as an indicator for explicit reflections on scientific reasoning), the amount of science classes (as an indicator for learning science content knowledge) and the pre-service science teachers’ age. Other factors, such as brain growth (e.g. Kwon & Lawson, 2000) or general cognitive abilities (e.g. Göhner & Krell, 2022; Mayer et al., 2014) are not addressed in this study. Hereby, this study makes use of a sample of pre-service science teachers from six different universities and three different countries to assist in identifying the contribution of the three factors independent from specific study programmes.

Conceptualising Science Teachers’ Scientific Reasoning Competencies

It is desirable that science teachers develop their competencies at reasoning scientifically (Bybee, 2014; Osborne, 2014). In line with this contention, science teacher education standard documents in various countries require that pre-service science teachers should have—next to an advanced content knowledge of their subject domain—elaborate procedural and epistemic scientific knowledge in order to reason scientifically (e.g. Chile: Mineduc, 2012; Germany: KMK, 2019). Other science education documents require science teachers to implement scientific inquiry approaches in their lessons, including formulating research questions, developing models, designing experiments and evaluating evidence as tasks for students, in order to develop students’ competencies in reasoning well about scientific problems (e.g. BCMOE, 2019; NGSS Lead States, 2013).

In science education research, SRC are defined as the dispositions to be able to solve a scientific problem in a certain situation by applying a set of scientific skills and knowledge. SRC are further understood as a latent, complex construct that also encompasses the capacity to reflect about the problem-solving process at a meta-level (Krell et al., 2018; Lawson, 2004; Morris et al., 2012). This conceptualization of SRC is quite established in science education research. In a review study, Opitz et al. (2017) identified three aspects in which conceptualizations of SRC may differ: (1) the specific skills they include; (2) if scientific reasoning is conceptualised as a general, uniform competence or rather as a more differentiated set of skills and abilities; and (3) if scientific reasoning is assumed to be domain-general or domain-specific.

Different generic, cognitive operations involved in scientific reasoning have been identified, including encoding (process of representing information and its context in memory), retrieval of information and strategy development (e.g. Kuhn, 1989; Kuhn & Pearsall, 2000; Morris et al., 2012). Science content knowledge, procedural and epistemic knowledge related to specific ‘styles of scientific reasoning’ (e.g. experimentation and modelling) have also been suggested as another important element of SRC (Osborne, 2013). Hence, the dispositions to be able to solve a scientific problem in a certain situation (i.e. SRC) include ‘not just a knowledge of its [scientific] domain-specific constructs but also a knowledge of a set of procedural and epistemic constructs’ (Kind & Osborne, 2017, 10). As shown above, these dispositions—although not always explicitly named so—are suggested in many science teacher education standard documents as an outcome of a good science teacher education (e.g. KMK, 2019; Mineduc, 2012). Empirical studies suggest that pre-service science teachers’ SRC develop over their university studies (e.g. Krüger et al., 2020) or as a result of specifically designed interventions (e.g. Stammen et al., 2018). However, the studies conducted so far have concentrated on the development of pre-service science teachers’ SRC within individual study programmes.

For the operationalisation of SRC in this study, a theoretical framework is employed which covers two dimensions named conducting scientific investigations and using scientific models that are subdivided into seven related skills of scientific reasoning (Table 1). In this framework, the dimension conducting scientific investigations covers processes commonly associated with a hypothetic-deductive approach of experimentation, whilst using scientific models is related to modelling or the use of models as epistemic tools (Hartmann et al., 2015).

Table 1 Scientific reasoning competencies and associated dimensions of conducting scientific investigations and using scientific models, suggested by Hartmann et al. (2015)

Factors Contributing to the Development of Science Teachers’ SRC

The development of SRC has been conceptualised as a stage-like process, including ‘[a] growing awareness (i.e., consciousness) of one’s reasoning patterns and one’s reflectivity as well increases in the contexts to which the patterns can be applied’ (Lawson, 2004, 323). Other perspectives emphasise the situated nature of scientific reasoning more strongly. For example, the ‘epistemology in practice’ perspective assumes that people have various resources (e.g. knowledge, skills), which are combined for sense-making depending on the specific context (Berland et al., 2016). Independent from the theoretical perspective, there is a consensus that both domain-specific knowledge and reasoning skills are needed for successful scientific reasoning (Shavelson, 2018). Hence, content knowledge, procedural knowledge and epistemic knowledge are conceptualised as the three constituent dimensions of SRC (Osborne, 2013).

In most science teacher education programmes, mainly two kinds of classes exist: discipline-specific science classes and science education classes (the latter also called science methods classes/courses). Discipline-specific science classes typically aim at developing pre-service science teachers’ content knowledge and procedural knowledge, for example in biology: knowledge of microbiology, physiology and behavioural biology as well as knowledge of how to use a microscope, how to plan and conduct a laboratory experiment and how to scientifically observe animals’ behaviour. Science education classes aim to foster pre-service science teachers’ PCK, including, for example, knowledge of students’ understanding in science and knowledge of instructional strategies for teaching science (Park & Oliver, 2008). As many science teacher education standard documents include SRC, science education classes typically also address pre-service science teachers’ procedural and epistemic knowledge as important parts of CK (Neumann et al., 20172019; Stammen et al., 2018). It is acknowledged that often all three types of knowledge—content knowledge, procedural knowledge and epistemic knowledge—are intertwined and addressed together. It has been suggested, however, that a specific contribution of science classes is the development of pre-service science teachers’ content knowledge, whilst epistemic knowledge is emphasised more within science education classes (Hartmann et al., 2015; Neumann et al., 2019). This focus on specific knowledge types in science and science education classes is illustrated in Fig. 1. Based on this, it is clear that both kinds of classes make valuable contributions to the development of pre-service science teachers’ SRC. Psychological and science education research has proposed several factors that might specifically contribute to the development of pre-service science teachers’ SRC (e.g. Engelmann et al., 2016; Hartmann et al., 2015; Lawson, 2004). In the following sections, three factors that have been suggested in the literature as contributing to the development of pre-service science teachers’ SRC will be explained in more detail: (A) the amount of science education classes, (B) the amount of science classes and (C) the age of pre-service science teachers.

Fig. 1
figure 1

Scientific reasoning competencies (SRC) are illustrated as an amalgam of content knowledge, procedural knowledge and epistemic knowledge (Osborne, 2018). As emphasised in the text, science classes in teacher education programmes typically aim to develop content and procedural knowledge whilst science education classes target on procedural and epistemic knowledge (and further dimensions of teachers’ professional knowledge such as PCK)

A) Science education classes are likely to contribute to the development of SRC

Many authors emphasise that explicit reflections about science significantly contribute to the development of SRC (e.g. Hartmann et al., 2015; Khan & Krell, 2019). Explicit reflection can be provided by, for example, reflecting upon basic features of science (e.g. intersubjectivity and justification in science), discussing basic science terminology (e.g. hypothesis, experiment and theory) or analysing current as well as historical approaches of scientists (e.g. Copernicus, Curie and Darwin) (Krell et al., 2015). Explicit reflection is especially discussed in research on the development of epistemic knowledge (e.g. knowledge about nature of science; Duschl & Grandy, 2013), where various studies reveal that explicit-, argumentative- and inquiry-based approaches are effective in promoting participants’ epistemic knowledge of science (e.g. Krell et al., 2015). Bruckermann et al. (2018) show that the amount of such opportunities to reflect about science positively contributes to pre-service science teachers’ epistemic knowledge.

Other authors further suggest that explicit reflections that are helpful in developing epistemic knowledge about science especially take place within science teacher education classes, where pre-service science teachers focus on both scientific reasoning and how to teach reasoning to students (e.g. Hartmann et al., 2015; Mathesius et al., 2016). For example, Hartmann et al. (2015) used the instrument to assess SRC, which is also applied in this study (see below), and show in a cross-sectional comparative study that pre-service science teachers outperform even science students on this instrument. These findings were hypothetically explained with ‘the positive effect of learning opportunities to explicitly discuss and reflect on the inquiry process’ (Hartmann et al., 2015, 49), that took place in science teacher education classes. Stammen et al. (2018) propose that science teacher education, combining content (i.e. scientific reasoning) and pedagogy (i.e. teaching SRC), is effective in fostering pre-service science teachers’ competencies at scientific reasoning. Relatedly, it can be assumed that the amount of science education classes (e.g. quantified in terms of ECTS credits, see below) during teacher education is a potential predictor of science teachers’ SRC.

B) Science classes are likely to contribute to the development of SRC

Several scholars also emphasise the importance of science content knowledge as a prerequisite for developing competencies of scientific reasoning (e.g. Fischer et al., 2014). For example, Ruppert et al. (2017) discuss that content knowledge plays a significant role in students’ modelling. The authors show that domain-specific content knowledge about a phenomenon can be a prerequisite for the extent to which middle school students’ models accounted for the provided evidence. They propose, for example, that students may be better able to apply a domain-general heuristic (i.e. accounting for evidence), if they have the necessary content knowledge to identify the relevant evidence for the problem at hand. Samarapungavan (2018) emphasises that correctly analysing scientific evidence requires a solid understanding of both theoretical and methodological concepts as well as science practices. For the case of argumentation in science education, Sadler and Fowler (2006) propose the ‘Threshold Model of Content Knowledge Transfer’. This model assumes a non-linear relationship between argumentation and the level of content knowledge, with specific thresholds allowing people to argue in a more advanced way. Similarly, Schwichow and Nehring (2018) suggest, with a focus on secondary school students’ experimental competencies, that science content knowledge might even be a prerequisite for the development of SRC. Based on these propositions, it can be assumed that the amount of science classes (e.g. quantified in terms of ECTS credits, see below) during teacher education is a potential predictor of science teachers’ SRC.

C) Age and intellectual development are likely to contribute to the development of SRC

Several authors relate SRC development to stages of intellectual development and age. For example, Lawson (2004) proposes with his focus on hypothetic-deductive reasoning that intellectual development involves a growing awareness of reasoning patterns and reflectivity. For this development to occur, an individual needs to be confronted with situations in which the reasoning is not fruitful and, hence, causes the development of new rules or strategies to guide one’s future behaviour (‘challenging situations’). Depending on the external stimulation, that is, the nature of situations and feedback an individual is confronted with, development of SRC may also be fostered purposefully by designed teaching approaches and interventions (Lawson, 2004). Similarly, limiting the development thesis, Kuhn and Pearsall (2000) state that scientific thinking skills have their origins in young children’s abilities to distinguish between theory and evidence, but these skills may not even be fully developed by adulthood (Zimmermann, 2007). Woolley et al. (2018) identified ten common false strategies in solving scientific reasoning problems (e.g. confusing the independent with the dependent variable in an experiment) in a sample of undergraduates. These problems are similar to problems that have been identified for younger students and children (Hammann et al., 2008; Zimmermann, 2007). Kuhn (1989) emphasised that for students from 9th grade up to adults, the level of SRC is strongly influenced by educational level.

In line with these assumptions, several other scholars suggest that university students’ SRC increase over their course of studies due to the students’ increasing age and related intellectual development (e.g. Limueco & Prudente, 2018); however, other studies do not affirm this finding and have found no effect of formal education on scientific reasoning (e.g. Ding, 2017; Ding et al., 2016). These ambiguous findings suggest that it may depend variably on specific emphases in the curricula (e.g. amount of science classes and amount of science education classes) or other factors, for tertiary education to positively contribute to the development of students’ SRC. One challenge of assessing the contribution of age and intellectual development on SRC is that, typically, other important factors (e.g. amount of science classes and amount of science education classes) increase with increasing age. Hence, it is often difficult to determine the impact of age due to its correlation with other factors. Alternative perspectives on the development of SRC, such as epistemologies in practice (Berland et al., 2016), might also explain why there is no linear relationship between pre-service science teachers’ age or educational level and their SRC.

Aim of the Study and Hypotheses

The prior research on SRC summarised above informs about ways to foster pre-service science teachers’ SRC (e.g. focusing on science content and science education/pedagogy subsequently or concurrently in their programme) and provides avenues for further research. The aim of the present study is to evaluate three of the proposed factors (amount of science education classes, amount of sciences classes and age) as predictors of pre-service science teachers’ SRC. As suggested in the literature, the amount of science education classes is used as an indicator for explicit reflections on scientific reasoning (i.e. fostering epistemic knowledge; Hartmann et al., 2015). The amount of science classes is used as an indicator for learning content knowledge (Neumann et al., 2017, 2019). The following hypotheses are tested and discussed within this study:

  • H1: The amount of science education classes during higher education positively predicts pre-service science teachers’ SRC (see Bruckermann et al., 2018; Hartmann et al., 2015).

  • H2: The amount of science classes during higher education positively predicts pre-service science teachers’ SRC (see Ruppert et al., 2017; Schwichow & Nehring, 2018).

  • H3: The pre-service science teachers’ age positively predicts their SRC (see Kuhn & Pearsall, 2000; Limueco & Prudente, 2018; Lawson, 2004).

Methods

The hypotheses are tested within a quantitative methodological framework, applying item response theory models (Bond & Fox, 2001) and classical test theory as well (Field, 2009). Data on pre-service science teachers’ SRC are collected by means of a paper–pencil instrument and the proposed factors (H1–H3) are tested as predictor variables in multiple linear regression analyses to explain differences in pre-service science teachers’ SRC.

Sample

The sample includes N = 438 pre-service science teachers from Germany (n = 219; freshmen to 10th semester; MAGE = 24; SDAGE = 4.37; from one university), Chile (n = 118; 1st to 9th semester; MAGE = 22; SDAGE = 3.87; from four different universities) and Canada (n = 101; 8th and 9th semester; MAGE = 27; SDAGE = 5.67; from one university).

The German and Chilean samples stem from concurrent programmes: Bachelor of Science/Arts with a subsequent Master of Education programme (Germany) and Bachelor of Education programme (Chile), respectively. The Canadian sample is drawn from a 1-year post-graduate Bachelor of Education programme (i.e. consecutive programme).

Only pre-service science teachers without missing responses are considered in the sample. The data partly comes from existing studies (Khan & Krell, 2019; Krell et al., 2018, 2020; Krüger et al., 2020) and is secondarily analysed for this study.

Variables

A multiple-choice instrument to assess SRC, available in German (original version), Spanish and English (translated versions), was applied in this study (Krell et al., 2018, 2020; Mathesius et al., 2016). The instrument is based on a theoretical framework covering seven skills of reasoning in science (Table 1). For each skill, the instrument includes three multiple-choice items (i.e. 21 items in total). Each item is contextualised within an authentic scientific context and the respondents must apply their procedural and epistemic knowledge within the context to identify the attractor (= correct answering option). For example, related to the skill formulating questions, respondents need to know that scientific research questions are related to a phenomenon, empirically testable, intersubjectively comprehensible, unambiguous, principally answerable and internally and externally consistent (Mathesius et al., 2014). For sample items see Krell et al. (2018, 2020); the full instrument is available upon request to the first author.

The original German version of the instrument was extensively evaluated following the recommendations in the Standards for Educational and Psychological Testing (AERA et al., 2014), that is, considering various sources of evidence for the valid interpretation of the test scores as measures of SRC. This process of instrument development and evaluation is summarised by Krüger et al. (2020). The quality and equivalence of the three language versions were evaluated by applying the translation, review, adjudication, pretesting and documentation- approach, an established approach for questionnaire translation (Harkness, 2003). This systematic translation, including a pretesting of the translated instruments using qualitative and quantitative methods, contributed to test equivalence (Krell et al., 2018, 2020).

To be able to test H1 and H2, the factors amount of science education classes (H1) and amount of science classes (H2) have been operationalised as workload, in terms of ECTS credits as prescribed in the respective study regulation documents, where ‘one credit corresponds to 25 to 30 h of work’ (EU, 2015, 10). These credits provide a sound basis that allows a cross-country comparison. Table 2 provides this data for the sample analysed in the present study. Note that the German pre-service science teachers are studying two subjects and that the workload of science education classes and science classes differs depending on these study subjects. The Chilean universities in the sample have different study programmes.

Table 2 Workload of science (Sc.) and science education (Sc.Ed.) classes for each semester during the course of studies in the analysed samples (in terms of ECTS credits) and the respective group sizes (n) in each semester

In Table 2, the different study programmes are organised along the distinction between concurrent and subsequent study programmes. In the present sample, five study programmes have been identified, in which pre-service teachers are enrolled in science education classes within the first two years already (i.e. as freshmen or sophomores), and four study programmes, in which science education classes are part of the curriculum not earlier than in the third year (Table 2).

The sampling of different universities is advantageous having a variety of combinations of the respective variables in the dataset, whilst for single universities, the considered factors might be highly confounded.

Data analysis

Person abilities and item difficulties at the latent level have been estimated on the One-Parameter Logistic Model (1PLM) for dichotomous items (‘Rasch model’) using the software ACER Conquest (Wu et al., 2007). Weighted maximum likelihood estimates (WLE) were used as point estimates for person abilities (Wu et al., 2007). Higher WLE indicate better performance on the test. As done in the related previous studies, a one-dimensional Rasch model was specified, reflecting the view that the described skills are mainly used in both considered processes of scientific reasoning and, hence, the two processes conducting scientific investigations and using scientific models (Table 1) are not clearly separable.

For the evaluation of model fit, the sum of squared standardised residuals (MNSQ) is proposed (Wu et al., 2007), which has an expected value of 1 with acceptable values ranging from 0.8 to 1.2 (Bond & Fox, 2001). Because the unweighted MNSQ is more sensitive to outliers than the weighted MNSQ, both statistics should be considered. In addition, t-standardised fit statistics based on the MNSQ are provided, which should range from − 2 to 2 (Wu et al., 2007).

Next to the fit indices, differential item functioning (DIF) for the three language versions of the instrument was evaluated. In this study, the Mantel–Haenszel (MH) statistic was computed for the analysis of DIF, which is appropriate to compare the probability of a correct answer on an item between different groups, depending on their overall test performance (Zwick et al., 1999).

Given that the fit between data and model could be shown as satisfactory and that there is no substantial DIF, the relation between the pre-service science teachers’ WLE (as estimates for SRC) and the proposed factors (H1–H3) can be statistically analysed to test the hypotheses. For this reason, correlational and multiple linear regression analyses have been conducted. Furthermore, the kind of study programme (i.e. science education classes starting early or late; Table 2) was also considered as a predictor variable to account for the hierarchical data structure with pre-service science teachers enrolled in different programmes. The more common approach to analyse hierarchical data, which is hierarchical linear modelling, was not applicable in this study since it demands at least five random-effects levels for each factor (Hodges, 2016). Correlational and multiple linear regression analyses were done with IBM SPSS Statistics, Version 26.

Results

Evaluating Model Fit of the Rasch Model

The fit statistics suggest an acceptable fit between the data and the model for most items; however, one item related to the skill generating hypotheses shows a weighted t-statistic of 3.5. This item has been excluded from the analysis. The DIF analysis reveals a ‘moderate to large’ DIF for two more items (MH statistic > 1.5), which, therefore, have been excluded as well. Hence, the present study is based on 18 of the instrument’s 21 items. For these 18 items, the fit statistics propose a good fit. The reliability measure (rel.EAP/PV = 0.51) is acceptable (Table 3) and similar to what has been reported in previous studies with the instrument (e.g. rel.EAP/PV = 0.55, Krell et al., 2018).

Table 3 Reliability coefficients and item fit statistics for the full and the reduced instrument

Basic Data Analysis

The mean estimated person ability of the total sample is MWLE =  − 0.12 (SDWLE = 0.74), with the Chilean pre-service science teachers’ scoring significantly lower (MWLE =  − 0.34; SDWLE = 0.66) than the Canadian (MWLE = 0.18; SDWLE = 0.64; p < 0.001) and the German (MWLE =  − 0.13; SDWLE = 0.77; p < 0.01) pre-service science teachers. There is a significant (p = 0.001) difference between the German and the Canadian pre-service science teachers’ WLE as well (ANOVA with Bonferroni post-hoc test). The relatively high WLE of the Canadian pre-service science teachers might be a result of their rather high study semesters (> 7 semesters) and related opportunities to reflect and learn the respective content (Table 2). There is no significant difference in the WLE of pre-service science teachers from study programmes with science education classes starting early (MWLE =  − 0.17; SDWLE = 0.76) or late in the curriculum (MWLE =  − 0.07; SDWLE = 0.72; p = 0.12; independent t-test).

For the total sample, there is a significant bivariate (Pearson) correlation between the pre-service science teachers’ semester and the estimated WLE (r = 0.31; p < 0.001); this was also found for the German subsample (r = 0.28; p < 0.001) but not for the Canadian and Chilean subsamples. This indicates, for the total sample and the German subsample, a significant linear progression of SRC over the course of the pre-service science teachers’ studies. The Chilean pre-service science teachers show very similar WLE in the different semesters with a peak in the 7th semester, which is, however, only based on n = 3. The Canadian pre-service science teachers’ SRC slightly decrease (though not significantly) from semester 8 to 9. This non-significant decrease in the tail-end can also be found for the German and Chilean pre-service science teachers (Table 4; Fig. 2).

Table 4 Estimated WLE (MWLE ± 2∙SE) of the pre-service science teachers in the single semesters, for the German (GER), Chilean (CHI) and Canadian (CAN) subsamples. Note that the data is cross-sectional and does not provide information about individual pre-service science teachers’ development of SRC. See Table 2 for further information about the semester groups. SEM: semester
Fig. 2
figure 2

Estimated WLE (MWLE ± 2∙SE) of the pre-service science teachers in the single semesters, for the German, Chilean and Canadian sub-samples. Subgroups with n < 15 are not shown (see Table 4). Note that the data is cross-sectional and does not provide information about individual pre-service science teachers’ development of SRC. See Table 2 for further information about the semester groups

Testing the Hypotheses

For the total sample, there are significant positive (bivariate) correlations between the estimated WLE and the amount of science education classes, the amount of science classes and the pre-service science teachers’ age, and very similar values could be found when distinguishing between the two kinds of study programmes as well (Table 5).

Table 5 Pearson correlations between the estimated WLE and workload in science classes and science education classes and the pre-service science teachers’ age, specific for the two kinds of study programmes and the total sample

These findings suggest a positive relation between each of the three factors and the science teachers’ SRC. This interpretation is limited by the fact that there are significant correlations between these variables as well (Table 6), due to the respective study programme regulations (Table 2). Consequently, for the total sample, the partial correlations between the WLE and the amount of science education classes (rp = 0.04; p = 0.419) and the amount of science classes (rp = 0.27; p < 0.001) are smaller than the bivariate correlations shown in Table 5. These findings suggest only the factor amount of science classes (H2) as positively contributing to the pre-service science teachers’ WLE.

Table 6 Pearson correlations between the workload in science classes and science education classes and the pre-service science teachers’ age , specific for the two kinds of study programmes and the total sample

A multiple linear regression analysis with the pre-service science teachers’ WLE and the proposed factors (H1–H3) shows no violations of assumptions; the Durban-Watson-statistic is 1.89, indicating that there is no considerable autocorrelation, and VIF is < 10 for all items, indicating no serious multicollinearity (Field, 2009). The adjusted R2 illustrates that the three factors explain a proportion of about 9–10% of the pre-service science teachers’ SRC. Only the amount of science classes was found to be a statistically significant predictor, with a standardised β = 0.277 and a partial r2 = 0.057 (Table 7; model 1).

Table 7 Coefficients of the multiple linear regression analyses

If the kind of study programme (Table 2) is included as an additional predictor variable in the regression model (Durban-Watson-statistic = 1.88), the adjusted R2 stays almost the same (Table 7; model 2). In this analysis, the significant positive effect of the amount of science classes can still be found with almost the same size of effect (i.e. standardised β) and a slightly smaller r2p (Table 7; model 2).

Summarising, the findings of the multiple linear regression analyses supported the conclusion that the amount of science classes positively contributes to the pre-service science teachers’ SRC, even if the kind of study programme is considered (Table 7).

Discussion

This study investigated the amount of science education classes during higher education (see Bruckermann et al., 2018; Hartmann et al., 2015), the amount of science classes during higher education (see Ruppert et al., 2017; Schwichow & Nehring, 2018) and the pre-service science teachers’ age (see Kuhn & Pearsall, 2000; Lawson, 2004; Limueco & Prudente, 2018) as predictors of their SRC.

Due to the theoretical framework applied for item development in this study (Table 1), the SRC items are related to the dimensions conducting scientific investigations and using scientific models and their associated skills (Hartmann et al., 2015). The assessment instrument includes multiple-choice items requiring respondents to identify the attractor out of four answering options and various sources of validity evidence have been considered during test development that support the proposed test score interpretation (Krüger et al., 2020). In addition, this study employed a testing instrument which was systematically translated and evaluated and is available in three languages (Krell et al., 2018, 2020).

In this study, the amount of science education classes was used as an indicator for explicit reflections on scientific reasoning (specifically emphasising epistemic knowledge), the amount of science classes was used as an indicator for learning science content knowledge and the workload in both kinds of classes was measured in terms of ECTS credits (EU, 2015). These credits provided a sound basis that allowed a cross-country comparison, although not all countries follow this approach. On the other hand, ECTS credits are proposed to reflect typical workload necessary to achieve the curriculum goals, but not the exact workload of each individual student, and no information about the students’ individual workload, the teaching methods or specific course content could be considered.

For the first time, a statistical analysis of SRC at six universities in three different countries was undertaken. The findings propose specific relations between the considered factors and the pre-service science teachers’ SRC for the different subsamples. The basic data analysis revealed that the Canadian pre-service science teachers scored higher on the instrument than the Chilean and German ones. Whilst all three subsamples scored similarly on the items related to conducting scientific investigations (correct responses: GER: 53%, CHI: 46% and CAN: 53%), substantial differences exist for the items related to using scientific models (GER: 39%, CHI: 35% and CAN: 48%). These findings indicate rather basic competencies for all three subsamples, with the Canadian pre-service science teachers possessing superior competencies related to scientific modelling compared to the German and Chilean participants. This finding might be explained with the Canadian consecutive programme, which–on average–recruited older, more educated candidates than the Chilean and German programmes (Zuzovsky & Donitsa-Schmidt, 2017). We also found, as previous literature suggests (Hartmann et al., 2015; Limueco & Prudente, 2018; Schwichow & Nehring, 2018), a significant positive correlation between the pre-service science teachers’ WLE and the amount of science education classes, the amount science classes and their age (with small to medium effect sizes; Fritz et al., 2012); however, only the correlation between the amount science classes and the WLE could be observed anymore if the other variables were controlled (i.e. partial correlation).

Generally, it is plausible that the identified relations are the result of different study regulations in the three countries, resulting, for example, in high correlations between the amount of science education and the amount of science classes in concurrent programmes (e.g. Germany) but not in consecutive programmes (e.g. Canada). The specificities of study regulations and the resulting confounding or constancy of variables (Table 6) make analyses of the impact of contributing factors on the development of SRC, focusing on single countries or universities (e.g. Khan & Krell, 2019), difficult and potentially limit their generalizability.

The first multiple linear regression analysis reveals that the inclusion of the proposed factors explained about 9–10% of the total variance in the pre-service science teachers’ SRC (medium effect size; Fritz et al., 2012). The only significant factor in the regression analysis was amount of science classes, with standardised β = 0.277 (medium effect size; Acock, 2014), that is, pre-service science teachers with more ECTS credits in science classes significantly outperformed their peers on the SRC instrument. This finding remains if the two kinds of study programmes are considered as well (Table 7).

Based on these findings, H2 can be confirmed but H1 and H3 must be rejected. Hence, the present study supports the assumption of science content knowledge being a specific prerequisite for SRC (e.g. Fischer et al., 2014). The participants in the present study may have benefited from higher levels of scientific content knowledge to better understand the authentic scientific contexts in the items and to apply their procedural and epistemic knowledge within the respective context to identify the attractor. Related to H1, the findings indicate that the amount of science education classes did not necessarily impact pre-service science teachers’ SRC. This contradicts the assumption the amount of science education classes (as indicator for explicit reflections on scientific reasoning) positively contributes to pre-service science teachers’ SRC (e.g. Bruckermann et al., 2018; Hartmann et al., 2015). The present findings could be explained with insufficient science education classes (Cofré et al., 2015). Another explanation—at least partly—might be the present sample groups, which do not fully cover all semesters of teacher education at university level in all six universities (Table 2). H3 has also to be rejected based on the present findings. Hence, the pre-service science teachers’ age did not significantly predict their SRC. This is less surprising as there are ambiguous findings related to age and formal education as predictors of SRC (Ding, 2017; Ding et al., 2016; Limueco & Prudente, 2018). Most studies emphasising the importance of age and intellectual development as positively contributing to scientific reasoning focus on younger children (Kuhn, 1989; Zimmermann, 2007). Hence, the impact of age might be less significant for pre-service science teachers, as processes of cognitive development already occurred in these age groups. Also, the present findings support perspectives more strongly emphasising the situated nature of scientific reasoning, such as the ‘epistemology in practice’ perspective (Berland et al., 2016).

Conclusion

SRC are defined as a latent, complex construct that encompasses the skills and knowledge needed for scientific problem solving and the capacity to seriously reflect about the problem-solving process on a meta-level (Lawson, 2004; Morris et al., 2012). It is proposed, in general, that competence development in teacher education might be best supported with a sequence of authentic and complex contexts for problem solving, accompanied by explicit reflections about the problem-solving processes carried out (Khan & Krell, 2019; Max, 1999). In terms of the development of SRC, pre-service science teachers might benefit when challenged with multiple situations, in which their recent reasoning is not fruitful and, hence, new rules or strategies have to be applied (Lawson, 2004). Based on the present findings, it can be concluded, unlike other studies focusing on single countries or universities (e.g. Hartmann et al., 2015), that pre-service science teachers benefit from strong science content knowledge which supports them to competently reason scientifically. For science teacher education programmes to be more efficient in fostering pre-service science teachers’ SRC, this would suggest to create a stronger coherence between what is taught in different kind of classes and within each kind of class. For example, pre-service science teachers could benefit if procedural and epistemic knowledge is fostered and discussed in science education classes within contexts they already learned about in their science classes and, hence, possess the relevant content knowledge (i.e. coherence between different kinds of classes). Furthermore, and as suggested for competence development in general (Max, 1999), a coherent instructional sequence within science education classes should provide pre-service teachers with learning opportunities with increasing complexity to support the transferability of procedural and epistemic knowledge to various contexts and the development of new strategies.

In general, it is proposed that a wide range of high impact course activities are needed for the development of pre-service science teachers’ SRC and that the interplay of various learning opportunities, including appropriately designed interventions, might positively contribute to the development of SRC (Krüger et al., 2020; Lawson, 2004; Stammen et al., 2018). Hence, as the present study did not evaluate interaction effects and delayed effects, it might still be the case that the combination of the amount of science classes and the amount of science education classes in all study programmes considered in this study positively contributed to the development of the pre-service science teachers’ SRC. For example, pre-service science teachers visiting science classes may have benefited from having visited science education classes earlier—or vice versa.

The inclusion of pre-service science teachers from different universities in this study, a first of its kind, makes the present findings more robust. Future studies can build on the findings and replicate them based on larger samples, including additional countries, and conduct hierarchical linear modelling, a more common approach to analysing hierarchical data (Hodges, 2016). Future studies can also build on the present findings with a more in-depth analysis of the two factors: explicit reflections on scientific reasoning and science content knowledge and their contribution to the development of pre-service science teachers’ SRC. For example, by considering the students’ individual workload, the teaching methods and specific course content and teacher education activities, we might be able to ascertain further how and where science teacher education can best promote scientific reasoning. This type of analysis might contribute to learning more about the complex nature of competence development related to scientific reasoning, for example by testing different assumptions about the way content knowledge is used for scientific reasoning and which quantity and quality of content knowledge are needed to allow scientific reasoning on a specific level (Ruppert et al., 2017; Sadler & Fowler, 2006). Finally, future studies should extend their focus, consider further dimensions of teachers’ professional competencies (e.g. PCK; Baumert & Kunter, 2013) and investigate how the level of science teachers’ SRC impacts their teaching practices and—ultimately—the development of their students’ science competencies.