Research Aims

It is assumed that the ability to reason well in science enables some participation in those complex issues facing society and involving science, such as climate change (e.g. Hodson, 2014; Ford, 2008). Especially in “post-truth times”, where personal beliefs appear to be highly influential in shaping people’s opinions, an understanding of science and its procedures, capabilities, and limitations has become increasingly important (Lubchenco, 2017). In addition to scientific reasoning, these issues require an understanding of socio-cultural contexts of science, public policy, and individual decision-making (Kelly et al., 1993). Osborne (2013) further suggests that economic needs appear to be another driving force for the integration of scientific reasoning competencies in science education curricula and documents. Scientific reasoning competencies are seen, for example, in the PISA studies, as an indicator for the societies’ future economic competitiveness (cf. OECD, 2010). Being competent at reasoning scientifically is often considered globally as one outcome of a good science education (e.g. Hodson, 2011; KMK, 2005; NGSS Lead States, 2013).

In this study, researchers investigated whether secondary preservice science teachers’ ability to reason in science changes in teacher education. Science teacher education commonly focuses on how to teach particular topics in science. With the increasing emphasis on the practices and the nature of science, the researchers wished to learn whether preservice science teachers also are becoming more competent at several aspects of scientific reasoning. Two research questions guided this study: first, to what extent do secondary preservice science teachers possess scientific reasoning competencies; second, to what extent does a secondary science teacher education methods course influence secondary preservice science teachers’ scientific reasoning competencies?

Prior longitudinal research has shown that over the years, future teachers in university tend to become more competent in their ability to reason in science (Hartmann et al., 2015); however, this study attempts to investigate these findings in the context of an intensive science teacher education course and further hypothesises the possible role of a science teacher program in fostering scientific reasoning competencies.

To investigate the two aforementioned questions, a cohort of Canadian secondary preservice science teachers were assessed with multiple-choice problems in science before and after their science teacher education methods course. The preservice teacher assessment involved a validated multiple-choice questionnaire (Krell et al., 2018; Hartmann et al., 2015). British Columbia (BC) was considered an internationally relevant case study for research on competencies and the role of science teacher education in the development of competencies because (1) a new competency-focused curriculum was introduced in 2018 that is now mandated for teachers; and (2) some universities in BC offer an intensive after-degree teacher education program, where future secondary science teachers engage in concepts related to science teaching formally in a single semester. This type of program also differs from a concurrent program where exposure to teacher education concepts occurs over the duration of a longer 3- to 5-year degree period, focusing the present study on a delineated period where preservice teachers engage with science methods in their first semester in the program. Given the wider curricular emphases on complex problem solving in science and developing “thinking” competencies among students (cf. Quebec Education Program, 2001; Alberta Program of Studies, 2016; BC Curriculum, 2019), this study aims to suggest several recommendations on how science teacher education more globally can support future science teachers in this area.

Theoretical Background on Scientific Reasoning Competencies

Competencies

Competencies can be defined broadly as, “Context-specific cognitive dispositions that are acquired by learning and needed to successfully cope with certain situations or tasks in specific domains” (Klieme et al., 2008, p. 9). In science curricula, the term skills has often been employed pertaining to scientific inquiry skills (e.g. Nehring et al., 2015); however, Weinert (2001) acknowledges that, “the boundary between skill and competencies is fuzzy” (p. 62). He attempts to distinguish between competencies and skills by proposing to use the term competencies to describe dispositions needed to become good at tasks with a sufficient degree of complexity, whereas the term skills, he suggests, relates to less complex dispositions, such as critical thinking skills. Hence, competencies are, “[I]nterpreted as a roughly specialized system of individual skills that are necessary or sufficient to reach a specific goal” (Weinert, 2001, p. 45).

Some of the scholarly discussion about the term competencies includes to what extent competencies should be understood as purely cognitive or including motivational and affective components. While Klieme et al. (2008) prefer a parsimonious approach to competencies as purely cognitive dispositions for assessment purposes, other researchers propose to understand competencies more holistically as, “[C]omprised of cognitive and (in many cases) motivational, ethical, volitional, and/ or social components” (Weinert, 2001, p. 62). Notwithstanding the inclusion of motivational, volitional, and social components in a definition of competencies, the core concept of competencies as a disposition and complex system (of skills) has some salience. This concept of competency has been applied in several studies related to assessment in educational contexts (e.g. Krell, 2018; Shavelson, 2013); however, it has been open to scrutiny (Rychen & Salganik, 2003). In our study, we support the broader notion that competencies are complex, context-specific dispositions that encompass learnable skills and knowledge while empirically examining a component of it.

Scientific Reasoning Competencies

We suggest that scientific reasoning competencies are a complex construct as well, encompassing the abilities needed for scientific problem solving as well as the capacity to reflect on problem-solving at a meta-level (Krell et al., 2018; Lawson, 2004). Several general cognitive processes have also been suggested as important to reasoning, including, encoding (process of representing information and its context in memory) and retrieval of information, and strategy development (Morris et al., 2012). Knowledge is also intertwined with reasoning as it is important to identify key features of a problem at hand, including content knowledge (knowing that), procedural knowledge (knowing how), and epistemic knowledge (knowing why) (Kind & Osborne, 2017). In science education literature, content knowledge has been understood as knowledge of domain-specific concepts in science, procedural knowledge as understanding of scientific procedures and strategies, and epistemic knowledge as knowledge about science and how it proceeds (Kind & Osborne, 2017; Lawson, 2004; Morris et al., 2012). These knowledge types correspond with three possible goals of science education suggested by Canadian science educator, Hodson (2014): learning science (content knowledge), doing science (procedural knowledge), and learning about science (epistemic knowledge). While reasoning in science encompasses a myriad of epistemic, social, and moral dimensions (Samarapungavan, 2018; Schauble, 2018; Allchin, 2013), the focus of this study is only on two common processes associated with science that involve reasoning: investigating and modeling. These processes alone do not represent, or gauge to a great extent, the context-specific cognitive dispositions that are acquired by learning and needed to successfully cope with certain authentic situations in science. While the focus of the study precludes the ethical or affective components of being competent in a domain, we, however, suggest that being competent at investigating and modeling begins to help us understand reasoning in science.

For the operationalisation of scientific reasoning competencies in this study, a theoretical framework is employed which covers the two sub-competencies conducting scientific investigations and using scientific models (Table 1). The theoretical distinction between these two sub-competencies and associated seven skills of formulating questions, generating hypotheses, planning investigations, analysing data and drawing conclusions, judging the purpose of models, testing models, and changing models was useful for the purposes of systematically developing an instrument used to assess some scientific reasoning competencies in the present study; however, it is recognised that the same processes are used in both investigations and modeling. For example, hypotheses are generated when testing models and evidence is often used to evaluate hypotheses underlying the model (Khan, 2007; Krell et al., 2017).

Table 1 Scientific reasoning competencies and associated sub-competencies of conducting scientific investigations and using scientific models suggested by Hartmann et al. (2015)

Scientific reasoning competencies are emphasised in standards documents for science teacher education in many countries around the world (cf. Pedersen et al., 2017). As a result, preservice science teachers are asked to be competent themselves at the same processes stated in curricula for students; for example, working with models, and designing empirical approaches to test hypotheses (Khan, 2018a). It has been put forward that scientific reasoning competencies be emphasised as a part of science teachers’ overall professional competency (Mathesius et al., 2016). This study attempts to investigate reasoning competencies for the first time in the context of an intensive science teacher education course and further hypothesise the possible role of a science teacher education program in fostering these scientific reasoning competencies.

Context of the Study: Science Teacher Education in British Columbia, Canada

Curriculum Development and Competencies in Canadian Curricula

Curriculum construction is under the purview of provincial ministries in Canada. Canadian science curricula have undergone a recent renewal across the provinces. Like other provinces, the BC Ministry of Education has reconceptualised its curriculum and the concept of competencies has (re-)emerged prominently as part of its curriculum structure of “big ideas”, “competencies”, and “elaborations.” Rather than competencies, historically, “critical thinking skills” were stressed by the BC secondary science curriculum in as early as 1936; however, more recent BC curricula have “[T] ranscended the idea of critical thinking skills and put forward creative thinking competency” (Sun et al., 2015; p. 16). Indeed, the term “competency” was virtually absent in BC’s 2005 K-7 science curriculum, and is now present in the 2016 version of the elementary science-10 curriculum 286 times (Khan, 2018). By 2020, what might be termed as a “competency-driven curriculum” will have fully emerged in the province of BC (https://curriculum.gov.bc.ca/curriculum/overview).

The (re-)introduction of the term competencies in the curriculum has historical vestiges; problematically, “competence” has been associated with a technocratic ideology in an occupational culture. Competence and its derivative terms (competencies, performance) in education, furthermore, have been critiqued as being limited in their conceptual reach (Norris, 1991; Spady, 1977). For example, the BC Ministry describes “competency” and “competencies” as a combination of skills, processes, behaviours, proficiencies, and habits of mind (BCMOE, 2019). According to the BC Ministry, students are deemed competent in an area of learning to the extent that they understand and apply knowledge to new contexts. Although the historic, and sometimes unfavourable, association of competency with the assessment of students is not the subject of our paper, we suggest that the resurgence of competencies in Canadian curricula prompt further examination by scholars. We use the term operationally in this study to acknowledge its symbolic adoption in new science education curricula and the need for examining its use critically among stakeholders.

Core Competencies in the BC Curriculum

BC’s redesigned curriculum appears to define competencies at two scales: core competencies develop across the curriculum, and curricular competencies are explicit statements of what is expected at each grade level in each area of learning. The core competencies are throughout all of the subject areas. “[C]ore competencies are sets of intellectual, personal, and social and emotional proficiencies that all students need to develop in order to engage in deep learning and lifelong learning” (BCMOE, 2019). Three domains of core competencies are identified across the curriculum: (1) communication, (2) thinking, (3) personal and social. These competencies are said to come into play when students are engaged in “doing” in any area of learning (BCMOE, 2019). The “doing” includes activities where students use thinking, collaboration, and communication to solve problems, address issues, or make decisions. The ultimate goal is for learners to employ the core competencies every day in school and in life.Footnote 1

The thinking core competency is subdivided into “creative” and “critical” thinking. For critical thinking, it involves making judgments based on reasoning: students consider options; analyse these using specific criteria; and draw conclusions and make judgments. The critical thinking competency encompasses a set of abilities that students use to examine their own thinking, and that of others, about information that they receive through observation, experience, and various forms of communication. The BC science curriculum is organised with a “know-do-understand” model of learning. The elements in the science curriculum are content (“know”), big ideas (“understand”; emphasising key concepts of the science disciplines), and curricular competencies (“do”). It is suggested that the three elements work together to support competence development and deeper learning in the subject domain (BCMOE, 2019). The curricular competencies in the BC science curriculum are intended to focus on students’ explorations of the subject matter using processes that bear some similarity to those processes in Table 1, such as questioning and predicting, planning and conducting, processing and analysing data and information, evaluating, applying and innovation, and communicating. For example, the Grade 2 BC science 2016 curriculum topic on metamorphic life cycles is: “Students are expected to be able to do the following: questioning and predicting, make simple predictions about familiar objects and events,” and the Grade 10 BC science curriculum topic on the universe elaborates: how could you model the formation of the universe (BCMOE, 2019)? The BC Ministry of Education has produced what might be acknowledged as a ‘competency-based curriculum’ for students and their teachers.

Methodology

The following research questions (RQ) are addressed in this paper:

  • RQ1. To what extent do secondary preservice science teachers possess scientific reasoning competencies?

  • RQ2. To what extent does a secondary science teacher education methods course influence preservice science teachers’ scientific reasoning competencies?

A previously developed, validated, multiple-choice questionnaire (Krell et al., 2018; Hartmann et al., 2015) was employed to address these questions. This multiple-choice questionnaire assisted the researchers in exploring the extent to which preservice teachers develop their scientific reasoning competencies in an intensive science teacher education course and generating several hypotheses of the role of science teacher education, using the BC case. Case study can be considered a transparadigmatic heuristic that enables the circumscription of the unit of analysis (VanWynsberghe & Khan, 2007) and is especially useful in research when focusing on a specific context (intensive science teacher education) where a unit of analysis (scientific reasoning competencies) is being uncovered, circumscribed, or explored more fully (e.g. development in the context of BC programs). To identify a baseline of scientific reasoning competencies and changes after a science teacher education course, a one-group pre- and post-test research design was pursued. This research design involved establishing a baseline measurement for the preservice science teachers and assessing if it changed following a science teacher methods course.

Science Teacher Education Secondary Methods Course

To investigate preservice science teachers’ competencies, participants were sought from a BC secondary teacher education program where the preservice teachers followed a general syllabus for secondary science teacher education. Prior to joining the program, the secondary preservice science teachers were required to have at least one 4-year BSc. degree (or 3-year but an equivalent) in a science area, with no requirement to study education. These degrees usually entail a requirement to do 10 courses or 30 credits of instructional time per year in science, where each credit generally is considered 1 h per week of instruction. All preservice science teachers must have also completed within this degree, 6 credits of introductory courses in each of chemistry (at least one course with a lab component), mathematics (one or preferably two courses in calculus), physics (at least one course with a lab component), biology, and geology. Furthermore, before enrollment, preservice science teachers must have completed an additional of 6 to 12 credits of first- and second-year-level university science courses and 18 credits of third- and fourth-year-level courses in the selected science. The set of courses for each selected science must have included both lecture and laboratory studies. Preferences were also indicated in terms of applications to the program, for example for a biology teachable subject domain, preference was given to preservice biology teacher applicants who completed courses covering a minimum of 4 of the following areas: ecology, genetics, human physiology, evolutionary biology, invertebrate and vertebrate zoology, non-vascular and vascular plant biology, and microbiology. Specialism courses in these areas must have been completed at the second, third, or fourth year level.

This study was carried out in a general science methods course for all science subjects that were delivered in the first term in the secondary program and was accompanied by a specialism that reinforced the concepts from the methods course. The science methods course was 13 weeks or 39 h of science methods instruction, in addition to a 2-week “short practicum” that occurred within the course. The science methods course was generally framed as, “Teacher candidates will be introduced to the concept that science teaching is the act of promoting student learning of science through inquiry and are given the opportunity to model, engage in, and reflect upon inquiry teaching in the science classroom.” The course objectives combined content (e.g. nature of science, scientific reasoning), pedagogy (e.g. how to foster process skills in science), and curriculum (e.g. what are competencies). Course objectives included inquiring into the nature of science and its implications for the teaching of science; articulating and examining their personal philosophy and rationale for teaching science and the role of science in the secondary school curriculum; developing strategies for assessing understanding of science content, scientific reasoning, laboratory processes, and problem solving; acquiring skills in promoting student interest in science and understanding of the role of science in society; recognising the diversity of students’ needs, perspectives, and beliefs present in science classrooms, and developing strategies for addressing this diversity in the teaching and learning of science. The course syllabus was comprised of key topics, posed as questions. In this case, a department-wide course syllabus in Table 2 was generally distributed over many years at the university.

Table 2 Sample topics taught in a general secondary science education methods course

An example of a typical course activity was the “Theory Activity Box” otherwise commonly known as a “black box activity” (cf. Krell et al., 2017, in press; Günther et al., 2019). This activity required preservice science teachers to analyse the outcome of pouring a fluid into a black box and observing the quantity and colour of each successive sample. Secondary preservice science teachers would then be commonly asked to reason about why the quantity and colour of the fluid changed for each sample, generate hypotheses about what is happening, make predictions, and eventually, attempt to model the hidden mechanics of the box. The model was then tested against new data. Models were shared in the class, without the box being opened nor the model revealed. Discussions on how science proceeds and how we can reason about a problem regularly ensued.

Participants

A full cohort of secondary preservice science teachers from a university in British Columbia, consented to participate in this study (N = 56). Their mean age was 27 years (SD = 6.34; mode = 23). Table 3 provides further information about the sample size by major. Data collection was done in their science teacher education secondary methods course, within the Bachelor of Education after degree secondary program.

Table 3 Sample size and demographics of BC preservice teacher participants

As mentioned previously, to enroll in the secondary science program, all preservice science teachers had at least one prior degree (usually 4 years of Science or more) as tabulated above.

Pre–post-questionnaire

In order to respond to the research questions, a multiple-choice questionnaire was administered to assess some of the preservice science teachers’ scientific reasoning competencies. Scientific reasoning competencies included being competent at skills such as formulating hypotheses regarding models, analysing evidence, and evaluating models (Table 1). As acknowledged, scientific reasoning competencies encompass much more than the aforementioned skills that were tested. The questionnaire was originally developed in Germany in order to monitor the development of scientific reasoning competencies throughout undergraduate and graduate studies at participating universities (Hartmann et al., 2015; Mathesius et al., 2016). The questionnaire included 21 multiple choice items (i.e. three for each skill) related to seven skills identified with aspects of reasoning in science (Table 1). The items were intended to be contextualised in authentic scientific problems. Each item included one attractor (i.e. correct answer) and three distractors (i.e. wrong answers). For the correct identification of the attractor, the preservice teachers need to apply their procedural and epistemic knowledge, for example related to the skill formulating questions, the preservice science teachers needed to know that scientific questions are related to a phenomenon, are empirically testable, intersubjectively comprehensible, unambiguous, principally answerable, and internally and externally consistent (Mathesius et al., 2014).

Previous studies provided different sources of validity evidence that supported the plausibility of the test score interpretations as measures of possible preservice teachers’ scientific reasoning competencies. For example, response processes were examined qualitatively (by conducting think-aloud and eye tracking studies), and these studies confirmed that respondents apply procedural and epistemic knowledge to solve the multiple-choice problems (e.g. Mathesius et al., 2018). Furthermore, statistical analyses revealed good psychometric properties and a one-dimensional data structure representing one overall latent dimension scientific reasoning competencies (Hartmann et al., 2015). As some authors critique a lack of validity evidence for instruments to assess scientific reasoning competencies (e.g. Osborne, 2013), the consideration of this comprehensive validity evidence can be seen as a distinguishing quality criterion of the instrument. A sample investigation type question from the questionnaire is shown in Fig. 1 below.

Fig. 1
figure 1

Investigation-type question from the questionnaire used in this study (Krell et al., 2018). The full questionnaire is available upon request to the second author.

The questionnaire was initially translated into English and Spanish in order to enable international comparisons. Since the English version was developed outside of Canada (based on a sample of N = 105 Australian pre-service science teachers; Krell et al., 2018), it was first given to Canadian researchers from education, science, and the social sciences from British Columbia (“critical experts”; N = 5), who were asked to provide feedback on the questionnaire to identify the level of clarity of the task in English. Based on the experts’ feedback, several adjustments to English sentence construction, semantics, spelling, and grammar were made to adapt the questionnaire to the Canadian context. The questionnaire was administered to the preservice science teachers in their methods courses in a university in BC, at the end and at the beginning of the semester (pre–post-assessment). All procedures performed in this study involving preservice teacher participants were in accordance with the ethical standards of the university and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. There were 56 pre-service teachers who completed the pre- and post-questionnaire. Therefore, each question had a maximum number of N = 112 responses in general. Missing responses did occur in a few of the single items.

Data Analysis

Data were analysed within the frameworks of classical test theory (CTT) and item response theory (IRT; Bond & Fox, 2001; Field, 2009). Next to obtaining classical mean scores (M), person abilities (θs) and item difficulties (βi) at the latent level were estimated on the one-dimensional One-Parameter Logistic Model (1PLM) for dichotomous items (“Rasch Model”) using the software ACER Conquest (Wu et al., 2007). Weighted maximum likelihood estimates (WLE) were used as point estimates for θs (cf. Wu et al., 2007), that is, as indicators for the preservice science teachers’ scientific reasoning competencies. IRT has widely been applied in science education research (e.g. Krell et al., 2018; Neumann et al., 2011). One major advantage of IRT models, such as the 1PLM, is the estimation of derived parameters for person abilities (θs) and item difficulties (βi) on a ratio scale—as opposed to raw scores within CTT (Boone et al., 2014; Neumann et al., 2011). Furthermore, person abilities can be estimated even in the case of missing responses to single items. Different fit-indices have been proposed to evaluate the appropriateness of the estimated parameters θs and βi and to ensure methodological accuracy (Bond & Fox, 2001; Boone et al., 2014). One established fit-index is the sum of the squared standardised residuals (MNSQ), which has an expected value of 1 with acceptable values ranging from 0.8 to 1.2 (Bond & Fox, 2001). ACER Conquest computes a weighted and an unweighted MNSQ. Because the unweighted MNSQ is more sensitive to outliers than the weighted MNSQ, both statistics should be considered. In addition, t-standardised fit statistics based on the MNSQ are provided, which should range from − 2 to 2 (Wu et al., 2007).

To evaluate to what extent the preservice science teachers possessed an adequate level of scientific reasoning competencies (RQ 1), the relation of person ability (θs) and item difficulty (βi) within the 1PLM was employed (cf. Bond & Fox, 2001):

$$ P\left({X}_{is}\right)=\frac{\exp \left({\uptheta}_s-{\upbeta}_i\right)}{1+\exp \left({\uptheta}_s-{\upbeta}_i\right)} $$

As a consequence of the 1PLM’s equation, θs and βi are estimated on a one common logit-scale and can be illustrated accordingly in so-called Wright Maps (Bond & Fox, 2001). Further, a person with θs = βi has a 50% probability to answer the respective item correctly; if θs > βi, this probability is more than 50%, and if θs < βi, this probability is less than 50% (Bond & Fox, 2001). In this study, the sample’s mean ability (MWLE) was centred at MWLE=0 to allow for a free estimation of the items’ difficulties (“constraints = cases”; Wu et al., 2007); however, this constraint does not affect the relation between person ability and item difficulty described above.

Three subgroups of the Canadian sample were distinguished for each skill associated with scientific reasoning, the two sub-competencies, and for scientific reasoning competencies overall (as per Table 1). Respondents with WLE > Mβ + 1 SDβ were defined as having advanced competencies, whereas respondents with WLE < Mβ − 1 0β were labelled as having basic competencies. Respondents who had somewhere between advanced and basic competencies (i.e. WLE = Mβ ± 1 SDβ) were referred to as transitional (cf. Krell et al., 2018). For discussing RQ2, the effect of teacher education on the preservice science teachers’ test scores (i.e. WLE) was estimated statistically (t tests, ANOVA).

Methodological Limitations

There are several limitations acknowledged at the outset of the study. Conceptually, the researchers adopted a definition of competencies as a disposition and complex system of skills that is established in science education research (Klieme et al., 2008; Krell, 2018), but competency might be considered even more broadly (Rychen & Salganik, 2003). Furthermore, a multiple-choice questionnaire was used to assess scientific reasoning competencies, for which various evidence are available to support the test score interpretation as measures of preservice teachers’ scientific reasoning competencies (e.g. Hartmann et al., 2015; Mathesius et al., 2018). Compared to “authentic” scientific problems, however, paper-based assessments necessarily lack complexity (Shavelson, 2013). The researchers, nonetheless, embarked upon the study as a measure of potential scientific reasoning competencies that could assist in generating hypotheses about the role of teacher education and the facets that might help to develop these competencies.

Results

This study analysed to what extent secondary preservice science teachers possess scientific reasoning competencies (RQ 1), using BC teacher education as an idealised case of an intensive science education period. To perform the analysis, the researchers employed a questionnaire (or test) at the beginning and end of a science teacher education course in this discrete and intensive program of study. This design permitted the researchers to generate hypotheses about to what extent the course influence secondary preservice science teachers’ scientific reasoning competencies (RQ 2).

In examining preservice science teachers’ competencies in general at scientific reasoning, the researchers analysed pre- and post-questionnaire data at first taken together. The total response sample for each item was thus N = npre + npost or N = 112. Table 4 below illustrates the proportion of correct answers (i.e. raw data) and the estimated item difficulty parameter (i.e. βi) for the items related to the seven skills of scientific reasoning (mentioned previously in Table 1). Table 4 shows, for example, that in general, a substantial majority of items related to planning investigations were answered correctly among preservice science teachers (about 74% correct answers), whereas the items related to generating hypotheses were the most challenging to answer (about 36%). Over half of the preservice science teachers responded correctly on items related to the larger categories of both conducting scientific investigations and using scientific models. In terms of modeling, more items involving testing models were correctly responded to than other items related to modeling. Overall, on both questionnaires, the items were answered correctly by just over 50% of the preservice science teachers.

Table 4 Proportion of correct answers (i.e. 1.0 = 100 % correct answers) and estimated item difficulty parameters βi

To help further assess what this finding means in terms of difficulty, the researchers reviewed the item difficulty parameters of the questions estimated within the one-dimensional 1PLM. In the 1PLM, the EAP/PV reliability is rel. = 0.48, which is in the accepted range of previous studies (e.g. rel. = 0.44 in Mathesius et al., 2016; rel. = 0.55 in Krell et al., 2018). The item separation reliability is rel. = 0.95. MNSQ- and corresponding t values indicate a good fit between data and model (unweighted: 0.82 ≤ MNSQ ≤ 1.10, |t| ≤ 1.40; weighted: 0.93 ≤ MNSQ ≤ 1.07, |t| ≤ 1.60).

Figure 2 below depicts the person ability and item difficulty parameters within the Wright Map. The mean score of the estimated person abilities is centred at MWLE=0 (“constraints = cases”; see above); however, the relation between person ability (i.e. WLE) and item difficulty (i.e. βi) reveals some interesting patterns. In the Wright Map, rather easy items are located at the bottom (i.e. low item difficulty parameters βi) and harder items at the top (i.e. higher βi). For example, Fig. 2 reveals that the skills formulating questions and generating hypotheses were comparatively more difficult (i.e. high item difficulty parameters of 0.37 and 0.63 respectively; Table 4), than planning investigations, the easiest category of scientific reasoning competencies (i.e. low item difficulty parameters of − 1.28; Table 4). It should be noted that for planning investigations, the relative difficulty of the items within this sub-competency varied considerably more (cf. standard deviations in Table 4), with planning investigations item 9 being far more difficult than planning investigations items 7 and 8 (Fig. 2).

Fig. 2
figure 2

Wright map illustrating case distribution and item difficulty within one-logit scale; triangles, items; connected circles, case distribution. Item numbers match with Table 4; 1–3 formulating questions, 4–6 generating hypotheses, 7–9 planning investigations, 10–12 analysing data and drawing conclusions, 13–15 judging the purpose of models, 16–18 testing models, and 19–21: changing models

The researchers also utilised a definition for basic, transitional, and advanced competencies described in the “Data Analysis” section, in order to distinguish between different subgroups within the sample, each with a specific level of competency in scientific reasoning. Table 5 below introduces the total number of responses from preservice teacher on both tests who were considered to be within a particular subgroup or level of competency, as well as the number of preservice teacher responses within these specific subgroups from the pre- and post-questionnaires combined (referred to as the total sample).

Table 5 Number of preservice science teacher responses within the three subgroups in each skill, the two sub-competencies, and scientific reasoning overall

Based on the above procedures, it was concluded that the majority of the participating preservice science teachers had transitional level competencies for both investigating and modeling. As shown in Table 5, a noteworthy proportion of preservice science teachers were advanced at planning investigations and testing models. Also, more preservice teachers performed at a basic level for the skill of generating hypotheses than other skills. Interestingly, no preservice teachers performed at a basic level in planning investigations nor analysing data and drawing conclusions. In examining responses associated with modeling more closely, more responses were deemed to be at advanced or transitional levels for judging the purpose of models than testing models or changing models. Collectively, almost all preservice science teachers performed solidly at a transitional level, which is true for the total sample as well as for the pre- and the post-tests individually.

In exploring pre- and post-course differences, on average and based on the estimated WLE, there was no significant difference found between the pre- (MWLE = 0.04, SD = 0.62) and the post-test score (MWLE = − 0.05, SD = 0.71, p = 0.470). In the pre-test, however the preservice science teachers with a major in biology (n = 30) achieved significantly higher test scores (MWLE = 0.23, SD = 0.46) than those with a non-biology major (n = 26, MWLE = − 0.18, SD = 0.72, p = 0.015, d = 0.68; medium effect size). This difference is only marginally significant and with a lower effect size measure (d) in the post-test (p = 0.065; d = 0.50; medium effect size). This might suggest that the course contributed to diminish differences in scientific reasoning competencies between preservice teachers with and without biology as a major. Interestingly, it was also found that those preservice teachers with more than one previous degree (n = 0.7) performed significantly better in the post-test (MWLE = 0.41, SD = 0.73) than their colleagues with only one previous degree (MWLE = − 0.13, SD = 0.69, p = 0.064, d = 0.74; medium effect size). There was no significant difference between these groups in the pre-test (p = 0.613).

The latter finding hints to a group-specific influence of the science teacher education methods course on the preservice science teachers’ scientific reasoning competencies. In line with this assumption, a two-way ANOVA with the sum of previous degrees (dichotomous variable with 0 = one, 1 = two or more) and the time of assessment (0 = pre-test, 1 = post-test) was performed and suggests an interaction effect sum of previous degrees × time of assessment. Due to the small sample size, however, this interaction was not found to be statistically significant (Table 6). A simple effects analysis (cf. Field, 2009) reveals a marginally significant difference between the test scores of both groups for the post-test (p = 0.05) that suggests that, in terms of scientific reasoning competencies, those preservice teachers with more than one previous degree might benefit more from the course than their colleagues with only one previous degree.

Table 6 Two-way ANOVA with the sum of previous degrees (degrees) and the time of assessment (time) as independent variables (R2 = 0.04)

The interaction graph (Fig. 3) above suggests that for both the pre- and the post-tests, the preservice teachers with more than one previous degree outperform their colleagues with only one previous degree. Additionally, the test scores of those with more than one previous degree increased from pre to post (but not statistically significant), and a non-significant decrease could be identified for those with one previous degree (Fig. 3).

Fig. 3
figure 3

Interaction graph sum of previous degrees × time of assessment

Discussion

Related to RQ1, the overall findings suggest that the BC preservice science teachers mainly possess scientific reasoning competencies on what could be termed a transitional level. As shown in Table 5, however, two exceptions are the skills of planning investigations and testing models, where about 40% of the sample were identified as having advanced competencies. Comparatively, for the other investigative dimensions of scientific reasoning, formulating questions and generating hypotheses, less than 10% of the total sample were in the advanced subgroup with the vast majority being solidly at the transitional level or transitional to basic. For the modeling dimensions, few preservice teachers (about 20%) were identified as having transitional competencies in testing out a given model (Table 5), with the majority of preservice teachers being at either advanced or a basic level of competency related to this skill. Krell and Krüger (2016) show in a study with in-service biology teachers that those teachers with a better understanding of testing models also more intensively conduct sophisticated model-based teaching activities. In terms of judging the purpose of models and changing models, preservice science teachers were performing predominantly at advanced or transitional levels of competency. Independent of these two differences in investigatory and modeling skills, the overall findings suggest that BC preservice science teachers possess a satisfactory (i.e. transitional) overall level of scientific reasoning competencies, yet might be challenged to address certain aspects involving modeling in the newly created, competency-based science curriculum (BCMOE, 2019). This curriculum emphasises students’ competencies at developing inquiry questions and generating hypotheses as well as skills associated with testing and changing models in science. In an international context, it should be duly noted that, in a similar study from Australia (M = 0.45, N = 105 preservice science teachers enrolled in a Master of Teaching; Krell et al., 2018), the proportion of correct answers in the present Canadian sample was slightly higher (M = 0.50; Table 4). This finding suggests that the need for developing reasoning competencies in science teacher education supersedes this case and is of potentially international import to science teacher education programs.

Related to RQ2, the findings reveal that there is no significant influence of a science education methods course on preservice science teachers’ scientific reasoning competencies. This finding contradicts other science education research that suggest that science teacher education courses that reflect upon scientific reasoning positively impact preservice science teachers’ scientific reasoning competencies within a concurrent teacher education program (e.g. Hartmann et al., 2015). Others assert, however, that it is a complex and time-consuming task to foster preservice science teachers’ scientific reasoning competencies (e.g. Kenyon et al., 2011). This research, on the other hand, might explain why an intensive after-degree teacher education program within one semester of science education does not appear, at the outset, to be sufficient to reach this goal. Beyond this, there are several other alternative hypotheses that could explain this result. For example, another hypothesis is that the questionnaire measured something other than scientific reasoning competencies. This alternative hypothesis does not seem initially probable as the questionnaire has been used internationally and was evaluated in various studies providing evidence for the valid interpretation of the test scores (e.g. Krell et al., 2018; Hartmann et al., 2015; Mathesius et al., 2016). Preservice teachers also did not take the questionnaire home.

Secondly, as with pre–post-test designs, it should be acknowledged that there are other viable probes that might further reveal what the preservice teachers did learn, such as observable evidence of reasoning during microteaching activities, scenario-based qualitative assessments, or analyses of philosophy statements about the nature of science and modeling (Khan, 2018). Thirdly, while the purpose of the study was not to examine particular activities within the curriculum in relationship to the outcomes on the questionnaires, the study does generate several hypotheses about the types of activities employed in science teacher education that might impact teachers’ scientific reasoning. Activities like the black box activity (Krell et al., 2017, in press; Günther et al., 2019), in addition to those mentioned in other studies, such as contemporary and historical case studies (Allchin et al., 2014), explicit instruction (Khishfe, 2014), and authentic and structured research partnerships (Sadler et al., 2010), have been proposed as supporting the development of future teachers’ learning of science and scientific reasoning. The teacher education curriculum plays a role in the development of these competencies.

The present research points to several other factors that could conceivably contribute to the development of scientific reasoning competencies in teacher education. For example, although there was no statistically significant interaction effect sum of previous degrees × time of assessment in the two-way ANOVA (Table 6), there was a medium and significant difference in the post-test between preservice teachers with more than one and only one previous degree. This finding suggests a possible influence of background knowledge on the development of scientific reasoning competencies within science teacher education. It is plausible, for instance, that preservice teachers with more than one previous degree are better able to relate teacher education activities like the black box activity to already known scientific contexts they have encountered and, thus, might benefit more from this activity than their peers with one degree might. This study puts forward that there might be an additional benefit to having at least one degree as a pre-requisite for science teacher education. The need exists for ample future research that examines and compares the development of preservice science teachers’ scientific reasoning competencies within different teacher education systems.

Concluding Remarks

Internationally, competency-based curricula support the idea that teachers foster competencies of their students. Science teacher education can play a role in supporting the development of preservice teachers’ own science competencies in these areas. Orienting science teacher education to support the development of preservice teachers’ competencies at reasoning is a laudable goal and one that is enhanced when cognitive processes are paired with necessary epistemic, social, and moral dimensions of problem-solving associated with scientific problems. Researchers investigated the effect of an intensive after-degree teacher education program on the development of preservice science teachers’ scientific reasoning competencies. This setting departs from research on a longer, concurrent program of teacher education (Hartmann et al., 2015). The study here uniquely reveals that the science methods course was effective in fostering scientific reasoning competencies of those preservice teachers with more than one previous degree in science. This finding further suggests that research on competency development be considered for both longitudinal and intensive measures and that substantial background knowledge supports competence development.

For teacher education, it can be recommended that science teacher educators consider a range of high-impact course activities to foster teachers’ own reasoning in science alongside how to teach students how to reason (cf. Khan, 2018). Several examples of how to enhance student competencies in educational environments have been suggested in research, including providing complex, authentic problems, that are not manageable by applying only single skills or automatised procedures (Max, 1999; Shavelson, 2013). In terms of science teacher education, competence development might be supported by providing a sequence of contextualisation (i.e. problem solving in an authentic scientific context), decontextualisation (i.e. explicit reflection), and recontextualisation (i.e. problem solving in another or an analogous authentic context). Such a strategy for science teacher education is supported by data from a longitudinal study on the development of preservice science teachers’ scientific reasoning competencies that further suggest that explicit reflections about scientific reasoning (i.e. learning about science; Hodson, 2014) contributes more to the development of scientific reasoning competencies, than only doing science without reflecting about it (Hartmann et al., 2015; Mathesius et al., 2016). Similarly, it has recently been demonstrated that a problem-based teacher education program, including modeling activities (e.g. black box activity) and explicit reflections of these activities, successfully fostered preservice biology teachers modeling competencies (Günther et al., 2019) and in other studies of science preservice teachers (Khan, 2018). Kenyon et al. (2011) also suggest five design principles to successfully foster teachers’ epistemic knowledge and their pedagogical content knowledge related to scientific modeling: (1) Teachers should engage in the full range of scientific modeling, including (2) developing models for explanations and predictions (3) and the revision of models based on evidence; (4) teachers need to develop epistemic knowledge, subject matter knowledge, and pedagogical content knowledge related to scientific modeling; and finally, (5) modeling activities should be considered in multiple content areas. The author’s (Khan, 2018b) research also suggests that science teacher educators consider ways for preservice science teachers to engage in similar activities on practicum, the point when preservice science teachers are especially challenged to foster modeling and investigatory competencies with students. Clearly, our overarching research findings suggest that part of the consideration of designing a science teacher education curriculum should include preservice science teacher’s background knowledge entering the program and the benefits of a prior degree or degrees in science in terms of reasoning. Finally, this study supports the potential for intensive science teacher education programs to foster teacher competency with more high-impact activities. It holds potential significance as well to those internationally who have developed or enacted competency-oriented science curricula, as science teachers will need to be competent at reasoning in order to assist their future students to solve problems in science.