Introduction

Efficiency studies in the domain of science teaching have traditionally focused solely on the cognitive achievement aspect of students’ learning. Studies focusing on affective domains have frequently addressed only positive emotions (in particular, interest), thus neglecting negative emotions (besides anxiety) (Randler et al. 2015). Today, however, it is unquestioned that intervention studies must consider multiple dimensions and aims of high-quality teaching, including cognitive, social, emotional, and motivational aims.

Over the past several decades, empirical support has been accumulated for inquiry-based forms of science and technology teaching as potentially appropriate ways to teach science topics (e.g., Alfieri et al. 2011). Critical voices, however, also exist concerning inquiry-based instruction: Open, that is, minimally guided, teaching forms might be ineffective, while fully unguided instructional methods may even have negative effects on students, particularly for novices in a specific field, due to cognitive overload (Kirschner et al. 2006, p. 76). Möller (2016, p. 44) argues that socioeconomically “disadvantaged” students in particular might experience difficulties due to instruction based on low(er) guidance.

Following this argumentation, the present study aimed to test whether a specific inquiry-based teaching method that is more strongly guided, the so-called learning cycle approach (Karplus and Thier 1967; Whitehead 1929/1967), simultaneously supports students’ achievement, academic emotions, academic self-concept, and engagement in science classrooms. The intervention study was conducted in lower secondary education in Austria, addressing students at low-track schools. This student group was intentionally chosen, as Austrian students in secondary education, particularly those attending low-track schools, exhibit lower interest in science, which ultimately endangers the successful acquisition of scientific literacy. The present study aimed to test whether so-called disadvantaged students can benefit from a more guided inquiry-based teaching method.

Theoretical Background

Control-Value Theory

To theoretically frame students’ learning experiences in the science classroom, this study relied on control-value theory (CVT) (Pekrun 2006), which was developed to explain students’ academic emotions in the classroom.

Based on an appraisal-theoretical approach to emotions (Ellsworth 2013), CVT states that emotions are triggered by a cognitive evaluation of the learning situation. In this appraisal process, control appraisals and value appraisals are of core importance. In the present study, students’ appraisals were accounted for through the assessment of students’ academic self-concept as an indicator of students’ control appraisals. Typically, students experience more positive and less negative emotions if they exhibit high academic self-concept, which reflects high(er) perceived controllability of the learning situation (e.g., Goetz et al. 2010). Furthermore, CVT proposes that emotions trigger students’ motivation and engagement in the classroom, thus positively affecting students’ achievement (Pekrun 2006).

The logical expectation that accompanies reliance on this theoretically claimed chain of relations is that a positively perceived and experienced instructional approach positively affects students’ cognition, emotions, behavior, and achievement, as these factors are all positively and reciprocally intertwined in CVT.

Emotions in Science Education

Since the cognitive, motivational, and behavioral dimensions have been the focus of research on science teaching and learning for decades, their inclusion in this study hardly needs justification. Although research has been conducted on students’ interest in science (van Griethuijsen et al. 2015), academic emotions (enjoyment, boredom, pride, anger, etc.) in science education have been far less frequently explored thus far (Sinatra et al. 2014). Therefore, this dimension requires some special attention, particularly with regard to its definition.

Students’ academic emotions are a type of emotion that is directly linked to academic learning, achievement activities, and achievement-related outcomes (Pekrun 2006). Furthermore, for a precise analysis of academic emotions, the difference between state and trait emotions must be considered. While trait emotions refer to habitualized emotional reactions to learning in science in general, state emotions are situation-specific emotional reactions (Frenzel et al. 2009).

Moreover, research has shown that students’ academic emotions must be classified domain-specifically: Students report different emotions and different levels of emotional intensity in different school subjects (Goetz et al. 2006).

Generally, Sinatra et al. (2014) argue that little is known about the features of science instruction and their relation to students’ academic emotions (see also Tomas and Ritchie 2012). It is clear, however, that many emotions accompany learning in science classes. These emotions are triggered by characteristics of the instruction and the content (e.g., Randler et al. 2015; Tomas and Ritchie 2012), and they are linked to students’ learning (e.g., conceptual change), students’ behavior and behavioral intentions, respectively (Fröhlich et al. 2013), and students’ achievement (e.g., Hong et al. 2017; Liu et al. 2014). A number of experimental and quasi-experimental studies aiming to enhance students’ emotions in science instruction have demonstrated that students’ emotions can be fostered by the application of specific instructional strategies, e.g., by integrating students’ alternative conceptions in biology lessons (see Franke and Bogner 2013) or by using expressive writing in biology lessons (see Randler et al. 2015).

Gläser-Zikuda (2010), however, argues that, although the empirical basis for the relation between emotion and student learning has been steadily increasing, there is a pronounced absence of long-term intervention studies aiming to foster students’ positive emotional experiences in school in general and in science instruction specifically. While it is possible to affect students’ state emotions with relatively short-lasting interventions (e.g., Randler et al. 2015; Fröhlich et al. 2013; for a short scale to measure situational emotions, see Randler et al. 2011), it seems more difficult to induce changes in students’ trait emotions. For instance, Gläser-Zikuda et al. (2005) applied the so-called Emotional and Cognitive Aspects of Learning (ECOLE) approach in several school subjects, including physics. The intervention had a relatively long duration of between 12 and 18 lessons and aimed at fostering students’ cognitions and emotions. Against expectations, their results revealed no significant treatment effect for students’ emotions (interest, anxiety, and boredom). Only a small positive effect for instruction-related well-being was found. The authors conclude that the intervention might have been too short to affect emotions sustainably, as trait emotions are relatively stable and, thus, difficult to influence. Therefore, they suggest that “more extensive interventions” (Gläser-Zikuda et al. 2005, p. 492) should be applied. The present study responds to this call with a long-term intervention.

The Learning Cycle Approach

Different teaching approaches in science teaching can foster cognitive development, positive emotional experiences, and engaged student behavior. The intervention in this study relied on the classical three-phase learning cycle (LC) instruction approach, which was introduced by the physicist Robert Karplus (1979) and was expected to achieve success in terms of supporting these aspects of student learning. The origin of this approach dates back to John Dewey (1916) and Alfred North Whitehead (1917) (see Lawson 2002). Based on a rhythmic understanding of learning processes, it follows the way in which people naturally learn (Lawson et al. 1989). In its classical form, the LC approach consists of a sequence of three phases: romance, precision, and generalization.

In the first phase, romance (exploration, according to Karplus), the aim is to induce the student to engage emotionally with a presented problem. It is a phase of free exploration that is not constrained in any respect. The student should connect with the given problem in whatever respects he or she wishes. Therefore, the focus in this phase resides on the pre-knowledge of the students and their free exploration of the presented problem.

In the second phase, precision (concept introduction, according to Karplus), the teacher changes his or her role: He or she is now active in introducing the basic scientific ideas necessary for solving the given problem. He or she presents the scientific concepts and theoretical underpinnings by connecting them to the experiences the students used to connect with the problem in phase one. The students play a more passive role in this phase: They listen, read prepared short input texts or web pages, watch video clips, or write down information.

Finally, in phase three, generalization (application, according to Karplus), the information input from phase two has to be applied to a new problem. The students are now once again free to undertake whatever activities they think will lead them to a solution. Therefore, this phase marks the “return to romance”: A smooth transition to a new LC takes place (Whitehead 1929/1967, p. 20). Typical student and teacher activities are the same as in the romance phase; the only difference is that the students are now supposed to solve the problem using the newly acquired ideas that were presented in the precision phase.

The LC approach can be termed a moderate approach, rather than a radical constructivist approach, to learning and instruction. To secure the best effects, the sequence of phases should remain unchanged, and no phase should be bypassed (Abraham and Renner 1986; Renner et al. 1988). The teacher and student activities that characterize the different phases of the LC approach are summarized in Fig. 1. For further clarification, an example of a particular LC is described in the appendix (“Is water an element or a compound?”).

Fig. 1 Teacher and student activities in the three phases of a learning cycle

The LC approach can be subsumed under the inquiry teaching procedures and is well aligned with the inquiry nature of science. It aims to promote inquiry-based learning, curiosity, and the enjoyment of inquiry. In LC-based environments, students actively construct their knowledge: They learn in a self-regulated, creative, and active manner based on their intellectual stage of development. The LC approach is in accordance with Piaget’s mental functioning model (1952). New insights are expected to be included in the existing cognitive structures (assimilation); this can eventually lead to an adjustment of mental structures during the construction of the concepts (accommodation). The acquisition of new concepts through assimilation and accommodation is described as adaptation (Piaget 1952).

The LC approach is expected to be effective with regard to emotional and cognitive outcomes. Several studies, especially studies conducted in anglophone countries, have already tested the effectiveness of the LC approach. For example, studies have revealed that instruction based on LC fosters higher cognitive skills, that is, it is effective with regard to developing students’ understanding of scientific concepts (e.g., Musheno and Lawson 1999), addressing students’ misconceptions and fostering conceptual change (e.g., Balci et al. 2006), and supporting students’ reasoning skills (e.g., Marek and Methven 1991). Furthermore, studies have shown that teaching based on LC facilitates students’ critical and creative thinking and positive attitudes toward science (Lawson 2002). Empirical evidence on the effectiveness of the LC approach with regard to students’ specific distinct positive (e.g., pride) and negative (e.g., shame, anger) academic emotions, however, is missing, especially evidence based on long-term interventions for low-achieving students. In addition, empirical evidence on the effectiveness of the LC approach in German-speaking countries is rare (for a pilot study, see Riffert et al. 2009). Although inquiry-based approaches to teaching and learning are increasingly receiving emphasis in teacher education curricula in Austria, teachers do not typically receive specific training in the LC approach.

Research Aims and Hypotheses

The intervention study tested whether the LC approach achieved effective outcomes in terms of students’ academic emotions (H1), academic self-concept (H2), behavior (H3), and cognitive development (H4) in science instruction in low-track Austrian science classrooms. Students who received the intervention were compared with students in a control group in terms of their academic emotions, academic self-concept, behavior, and cognitive development.

In addition, this study tested whether students’ emotional experiences differed in connection with the specific phase in the LC. It was expected that students’ positive emotions would be higher in the romance phase (H5a) and in the generalization phase (H5b) of an LC than the emotions of students in the control group. No difference in the emotional experiences was expected for the precision phase (H5c).

This study adds to the literature in the field, as (1) multiple aims are considered simultaneously; (2) the long-neglected topic of emotion (positive and negative) is explicitly addressed; (3) the trait and state dimensions of academic emotions are measured; (4) a long-term intervention is applied; and (5) students from low-track schools participated in the study. The effectiveness of the LC will be tested by means of a program evaluation. Thus, the aim was not to evaluate how single activities in the LC worked out (e.g., experimentation in the romance phase) but to evaluate the overall effects of the program (i.e., the LC approach) as a specific form of cyclic teaching in science instruction.

Design

A quasi-experimental treatment control group design with pre-experiment and post-experiment measurements was chosen. The pre-experiment measurement took place at the beginning of the school year, before the treatment started. The post-experiment measurement took place after 1 school year for all intervention groups and after 2 school years for the 2-year intervention groups. In addition, to check whether the teachers implemented the LC appropriately and to explore students’ state emotions, two randomly selected LCs were evaluated during the school year (see Fig. 2).

Fig. 2 Research design

The students in the treatment group received LC instruction for 1 or 2 consecutive school years, reflecting a constructivist inquiry-based teaching approach (see the section entitled “The Learning Cycle Approach”), while the students in the control group were taught according to so-called expository teaching. In an expository teaching and learning environment, students are not as actively involved in scientific activities (e.g., in collecting data and experimenting) but are rather predominantly informed about the learning content, and the environment is more strongly teacher-centered. Moreover, the students are less active, and the instruction is not rhythmically based on different phases that interdigitate. If experiments are conducted (mainly by the teacher), they are carried out after the basic concepts have been introduced as a validation of these concepts (inform/verify/practice as the basic processes of expository teaching; see Renner 1982).

Participants

A total of 280 Austrian high school students and their science teachers (N = 7) at a “Neue Mittelschule” participated in the study. Participation was voluntary. Austria has a tracked school system. After elementary school (grades 1–4), students can attend either a “Gymnasium,” which is the high-track school type, or a “Neue Mittelschule,” which is the low-track school type.

The students in the present study were in grades 6 to 8 (18.8% were sixth graders, 64.5% were seventh graders, and 16.7% were eighth graders). Of the participants, 142 (40.9% boys, 59.1% girls) were in the treatment group and received the learning-cycle-based instruction, and 138 (50.7% boys, 49.3% girls) were in the control group.

The mean age of the students was M = 12.60 years (SD = 0.96, range = 11–14). Of the students, 63.9% spoke German as their mother language, and 88.1% were born in Austria.

The participating teachers had several years of teaching experience. Since it is well documented that the teacher plays a decisive role in the achievement of the students, this central variable was controlled for by having each participating teacher teach both a treatment and a control class of the same year level.

Teacher Training

The participating teachers received one full day and three half-days of training in teaching according to the LC approach; this training included an introduction to the philosophy of science (Bunge 2017) and an introduction to Piaget’s genetic structuralism, focusing on his concepts of equilibration-disequilibration and re-equilibration (Piaget 1985).

To become familiar with the instruction concept of LC teaching and the role the teacher plays in the different phases (including the central role of stating questions) (Lawson 2002, pp. 182–184), the teachers progressed through three LCs dealing with electricity. Staver and Shroyer (1994) elaborated these three learning cycles. Progressing through these LCs gave the teachers an opportunity to experience the phases of LC instruction. In addition, some selected LCs presented in Lawson (2002, Appendix G, pp. 447–571) were used as LC models. Finally, the teacher training followed “A Teacher’s Guide to the Learning Cycle” by Campbell and Fuller (1982, Chapter 4).

The teachers teaching the same year level formed work groups in which they collaboratively developed LCs according to the content demands of their respective curricula. The teachers had to teach the same content (albeit using different instruction methods) in both classes (treatment and control).

Instruments

Academic Trait Emotions (Pre-Post Measurement)

Students’ trait emotions were measured by means of a short version of the Achievement Emotions Questionnaire (AEQ) (Pekrun et al. 2005), a valid and reliable measure of academic emotions (see Fig. 2: A). Overall, seven emotions were differentiated:

  • Joy (six items; Cronbach’s αt1/t2 = .87/.87), for example, “I get excited about going to class”;

  • Pride (four items; Cronbach’s αt1/t2 = .72/.79), for example, “I take pride in being able to keep up with the material”;

  • Anger (four items; Cronbach’s αt1/t2 = .76/.74), for example, “I feel frustrated in class”;

  • Anxiety (three items; Cronbach’s αt1/t2 = .68/.69), for example, “Thinking about class makes me feel uneasy”;

  • Shame (four items; Cronbach’s αt1/t2 = .77/.80), for example, “When I say something in class, I feel I am making a fool of myself”;

  • Hopelessness (four items; Cronbach’s αt1/t2 = .74/.78), for example, “The thought of this class makes me feel hopeless”;

  • Boredom (four items; Cronbach’s αt1/t2 = .75/.80), for example, “I get bored in class.”

The items were answered on a Likert scale ranging from “not true at all” (1) to “true” (5).
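The reliability coefficients reported for each scale can be reproduced from item-level data. The following is a minimal sketch of the standard Cronbach's alpha formula, not the authors' analysis script; the function name `cronbach_alpha` and the small response matrix are hypothetical and serve only as illustration.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of rows (one row of item scores per student).

    Uses alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score)),
    with sample variances (n - 1 in the denominator).
    """
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    item_columns = list(zip(*responses))            # one tuple of scores per item
    item_variances = [variance(col) for col in item_columns]
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical answers of three students to a two-item scale (1-5 Likert).
example = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(example)
```

With real data, each row would hold one student's answers to the items of a single emotion scale (e.g., the six joy items) at one measurement point.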

Academic State Emotions (Short Questionnaire)

State emotions were measured after each LC phase of two randomly selected LCs during the first intervention year, resulting in six measurement points (see Fig. 2: E). The students rated the intensity (1 = little intensity; 5 = high intensity) of their state emotions on a dartboard-like figure consisting of joy, interest, pride, and boredom (reverse-coded). The internal consistency of this positive emotion scale was high across all six measurement points (Cronbach’s α LC1 = .80/.81/.85; Cronbach’s α LC2 = .82/.85/.87).
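How the four ratings were combined into one positive emotion score is not spelled out here. The sketch below assumes a simple mean of joy, interest, pride, and reverse-coded boredom on the 1 to 5 range; the aggregation by mean and the function names are assumptions made for illustration.

```python
def reverse_code(score, low=1, high=5):
    """Mirror a rating on its scale, so 1 becomes 5 and 5 becomes 1."""
    if not low <= score <= high:
        raise ValueError("score outside the rating range")
    return low + high - score


def positive_state_score(joy, interest, pride, boredom):
    """Mean of the three positive ratings and reverse-coded boredom.

    Assumption: the scale score is an unweighted mean of the four ratings.
    """
    ratings = [joy, interest, pride, reverse_code(boredom)]
    return sum(ratings) / len(ratings)


# Hypothetical student: fairly high joy and interest, little boredom.
score = positive_state_score(joy=4, interest=5, pride=3, boredom=2)
```

Reverse-coding boredom first ensures that a higher score always means a more positive emotional state, which is what makes a common internal consistency estimate across the four ratings meaningful.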

Academic Self-Concept

Academic self-concept was measured with six items on a scale ranging from 1 = “do not agree at all” to 4 = “fully agree” (e.g., “It is easy for me to understand new ideas in science instruction”) (Cronbach’s αt1/t2 = .88/.91) (Frey et al. 2009; PISA 2006) (see Fig. 2: B).

Behavioral Engagement

Behavioral engagement was assessed with a scale developed by Skinner et al. (2008). This scale was translated into German (see Fig. 2: C). An example item (out of five items) is “In class, I work as hard as I can” (Cronbach’s αt1 = .77; Cronbach’s αt2 = .82; from 1 = “do not agree at all” to 4 = “fully agree”).

Cognitive Dimension

To grasp the effect of the intervention on the students’ cognitions, the so-called Science Reasoning Tasks (SRT) were used (see Fig. 2: D). This instrument is a group test developed for application in school settings (Adey et al. 2001; Shayer et al. 1981). It is based on Piaget’s so-called clinical interviews. These tasks “are a well-documented, validated measure to gauge the cognitive level of students” (Venville and Oliver 2015, p. 53). The instrument was Rasch-scaled so that a fairly sensitive attribution of one of seven Piagetian cognitive sub-stages (ranging from early concrete to mid concrete, mature concrete, concrete generalization, early formal, mature formal, and formal generalization) to each student would be possible (Shayer and Adhami 2007). From the seven tasks, two were selected: task 4, “Equilibrium and Balance,” was used in the pre-test, while task 2, “Volume and Heaviness,” was implemented in the post-test in both treatment and control groups. These two tasks were selected because they had been successfully used in studies on the efficiency of the Cognitive Acceleration through Science Education program (Oliver and Venville 2016).

Intercoder reliability (Cohen’s kappa; see Lombard 2010) based on two independent raters (who were both physicists and physics trainers) was calculated and revealed highly satisfactory interrater agreement: kappa was .98 for both tasks (“Volume and Heaviness” and “Equilibrium and Balance”) (n = 736 double codings for “Volume and Heaviness” and n = 700 double codings for “Equilibrium and Balance”).
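Cohen's kappa corrects the raw agreement between two raters for the agreement expected by chance. The sketch below implements the standard unweighted formula and is not the authors' coding procedure; the function name and the toy codings are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equally long lists of category codes."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must code the same non-empty set of answers")
    n = len(rater_a)
    # Proportion of answers on which both raters chose the same category.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from the marginal category frequencies of each rater.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / n ** 2
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1 - expected)


# Hypothetical double coding of four answers (1 = correct, 0 = incorrect).
kappa = cohens_kappa([1, 1, 0, 0], [1, 0, 0, 0])
```

In this toy example the raters agree on three of four answers (0.75), chance agreement is 0.50, and kappa is therefore 0.5. A value of .98, as reported above, indicates agreement far beyond chance.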

Treatment Validity

To secure correct treatment implementation, two measures were taken (see Fig. 2: F). First, training in LC instruction was designed and given to the participating teachers to ensure that they would be able to implement the treatment correctly. Second, in two selected learning cycles, a treatment check was conducted to determine whether and to what extent the treatment was in fact realized. Altogether, six items were presented to the students of all groups after each of the three phases in the treatment classes; these items were simultaneously presented to students in the control classes. Three of the items described typical student and teacher activities in the romance and the generalization phases, and three of the items represented activities in the precision phase and typical activities of students and teachers in the traditional, more teacher-centered teaching design.

Overall, the treatment was successfully implemented: Levels of group work and independent work, for example, were significantly higher in the romance and generalization phases, while levels of teacher-centered instruction were higher in the precision phase (see Table 1).

Table 1 Treatment check, learning cycle 1, and learning cycle 2 for all three phases for the treatment groups (TG) and control groups (CG)

Results

Descriptive Statistics and Intercorrelations

The intercorrelations reveal positive associations between students’ positive academic emotions, academic self-concept, and behavioral engagement, while these relations were negative with students’ negative academic emotions (see Table 2).

Table 2 Means, standard deviations, and intercorrelations

Changes in Students’ Emotions, Academic Self-Concept, and Behavioral Engagement After 1 School Year

As there were significant differences in some of the dependent variables between the treatment and control groups at t1 (students in the treatment groups expressed higher enjoyment, less anger, and less boredom, p < .05), a MANCOVA controlling for the pre-test variables as well as grade point average (GPA) for the main subjects mathematics, German, and second language was conducted to test for intervention effects.

The results demonstrated that positive emotions, academic self-concept, and behavioral engagement decreased from t1 to t2, while negative emotions increased. This effect can be observed in both groups, with one exception: Students’ reported shame in the intervention group decreased from t1 to t2, while it increased in the control group. It has to be noted that t1 was realized at the beginning of the school year and t2 at the end of the school year; thus, possible effects of the school year have to be taken into account (e.g., increased tiredness at the end of the school year).

To test for intervention effects, the effect of group affiliation on the change in the dependent variables was of interest. The results revealed a significant group effect (Pillai’s trace = .094, F(9, 260) = 2.99, p < .001, partial η2 = .094). Concretely, there was more positive development in students’ joy, pride, academic self-concept, and behavioral engagement in the LC group than in the control group. Furthermore, the change in students’ experiences of shame, boredom, and anger was more positive in the intervention group. No intervention effects could be detected for anxiety and hopelessness (see Table 3).
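When two groups are compared, the multivariate effect has one hypothesis degree of freedom. In that case Pillai's trace equals the multivariate partial η2, and the F statistic follows directly from the trace. The sketch below illustrates this relation as a plausibility check on the reported values; the function name is hypothetical, and the formula applies only to this two-group case.

```python
def f_from_pillai(trace, df_effect, df_error):
    """F statistic implied by Pillai's trace for a two-group comparison.

    Assumption: a single hypothesis degree of freedom (two groups), for which
    F = trace / (1 - trace) * df_error / df_effect holds exactly.
    """
    if not 0 <= trace < 1:
        raise ValueError("Pillai's trace must lie in [0, 1) for this case")
    return trace / (1 - trace) * df_error / df_effect


# Values reported above for the group effect after 1 school year.
f_value = f_from_pillai(trace=0.094, df_effect=9, df_error=260)
```

The result is close to 3.00, which agrees with the reported F(9, 260) = 2.99 up to rounding of the trace.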

Table 3 Intervention effects: results of the MANCOVA and effect sizes of change within groups (Cohen’s d)

Changes in Students’ Scientific Reasoning Skills After 1 and 2 School Year(s)

An ANCOVA was calculated to test for the change in students’ scientific reasoning skills from pre-measurement to post-measurement. Some students participated in the post-SRT after 1 year of treatment (NTG = 49; NCG = 45), and the other subsample participated in the post-SRT after 2 years of treatment (NTG = 93; NCG = 93). As the students in the two groups differed in relevant characteristics in the pre-measurement, enjoyment, anger, boredom, and academic self-concept were controlled for (as well as the pre-score of SRT).

The results indicated that there was an increase in students’ scientific reasoning skills after 1 year of treatment in both groups; however, this increase was not significantly higher in the treatment group, although the effect size points to the expected direction, reflecting more pronounced development in the treatment group (group effect: F(1, 87) = 1.36, p = .246, partial η2 = .015). The gain in scientific reasoning skills was clearly higher and approaching significance for students in both groups after 2 years of treatment, and, in line with our hypothesis, the treatment group outperformed the control group (group effect: F(1, 179) = 5.18, p = .024, partial η2 = .028) (see Table 4 and Fig. 3).
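The partial η2 values reported with each F test can be recovered from the F value and its degrees of freedom. The following is a minimal sketch of that standard conversion, intended for checking reported effect sizes; the function name is hypothetical.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared implied by an F test.

    Uses eta_p^2 = (F * df_effect) / (F * df_effect + df_error).
    """
    if f_value < 0 or df_effect <= 0 or df_error <= 0:
        raise ValueError("F must be non-negative and both df positive")
    return f_value * df_effect / (f_value * df_effect + df_error)


# Group effects on scientific reasoning reported above.
eta_one_year = partial_eta_squared(1.36, 1, 87)    # F(1, 87) = 1.36
eta_two_years = partial_eta_squared(5.18, 1, 179)  # F(1, 179) = 5.18
```

Rounded to three decimals, the two values are .015 and .028, matching the effect sizes reported for the 1-year and 2-year groups.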

Table 4 Within-group effects regarding the change in students’ scientific reasoning skills from t1 to t2

Fig. 3 The change in students’ scientific reasoning skills from pre- to post-measurement

Students’ Positive State Emotions During the Phases of a Learning Cycle

A MANCOVA was conducted to test for differences in the positive state emotional patterns referring to the previous science class, controlling for students’ trait characteristics.

Learning cycle 1: The results indicated that there was a significant effect of the instructional approach (treatment) on students’ positive emotional experiences in science when controlling for students’ trait characteristics at t1 (trait emotions and trait academic self-concept) (N = 280, Pillai’s trace = 0.056, F(3, 271) = 5.69, p = .001; partial η2 = .059). The results revealed a significantly more intensive positive emotional pattern in the romance phase (F(1, 273) = 13.25, p < .001, partial η2 = .046) and in the generalization phase (F(1, 273) = 10.55, p = .001, partial η2 = .037); meanwhile, no significant differences were found in the precision phase (F(1, 273) = 1.66, p = .199, partial η2 = .001) (see Fig. 4).

Fig. 4 Students’ state emotions during the three phases of learning cycle 1

Learning cycle 2: Two control classes did not fill in the short questionnaires. Thus, the evaluation of the second LC was based on N = 191 (N = 99 treatment group; N = 92 control group) students, excluding the treatment and the control group of these two schools (to keep possible teacher effects constant). Again, the results demonstrated that the treatment group experienced a more intensive positive emotional pattern than the control group, but the difference was weaker in comparison to that observed in the first LC (N = 191, Pillai’s trace = 0.041, F(3, 182) = 2.62, p = .052, partial η2 = .041). In contrast to the first LC, a significant difference could be observed in all three phases of the LC (romance: F(1, 184) = 5.17, p = .024, partial η2 = .027; precision: F(1, 184) = 6.53, p = .011, partial η2 = .034; generalization: F(1, 184) = 5.10, p = .025, partial η2 = .027) (see Fig. 5).

Fig. 5 Students’ state emotions during the three phases of learning cycle 2

Discussion

The present study aimed to investigate the effects of the LC approach on students’ trait and state academic emotions, academic self-concept, behavioral engagement, and cognitive development.

Starting with the long-neglected topic of emotions in science education (Sinatra et al. 2014), the data revealed that, in line with our expectations, the trait emotional pattern of students developed more positively in the LC group than in the control group. These positive effects, however, were not able to counterbalance an overall negative emotional trend as reflected by a decrease in students’ positive trait emotions and an increase in students’ negative trait emotions during the intervention periods. Nevertheless, this general trend was significantly weaker for students in the LC group. This indicates that the LC approach can significantly reduce negative trends in the domain of emotions. The intervention effect was particularly high for students’ enjoyment in science, and it was also substantial for students’ pride, anger, shame, and boredom. Therefore, Hypothesis 1 was supported, except for when considering students’ levels of anxiety and hopelessness, which did not differ between the two conditions.

The missing effect on students’ anxiety and hopelessness might be explained by the fact that the LC instruction did not specifically target test situations in science classes. According to Pekrun et al. (2014), anxiety and hopelessness can be regarded as outcome emotions related to experiences of failure and are thus strongly bound to performance situations at school. Therefore, future interventions under the LC approach might explicitly consider how to design performance situations appropriately to positively support a reduction in anxiety and hopelessness.

Concerning students’ academic self-concept and behavioral engagement, the data revealed a more positive development of both factors in the LC group than in the control group, which supports Hypotheses 2 and 3.

As far as the effects on the cognitive dimension are concerned, the results indicated that the 2-year intervention groups benefited significantly more than the 1-year groups from the LC instruction. This is in accordance with Michael Shayer’s warning (personal communication) not to expect any measurable effect from the demanding LC intervention that is applied no longer than a single year. If this temporal constraint is taken into account, it is possible to assert that Hypothesis 4 is supported. Since no empirical evidence showing that the LC approach leads to cognitive gains in low-achieving students has been available thus far, this result is particularly interesting.

Concerning students’ state emotions, the expected results were obtained; Hypothesis 5 is thus supported. Differences in teaching behavior were directly reflected in students’ academic emotions. This finding is in line with previous studies, which have shown the effects of different forms of instruction on students’ state emotions (e.g., Franke and Bogner 2013). The open forms of instruction (romance and generalization) triggered stronger positive emotional responses in students, while strongly teacher-led forms of instruction decreased students’ positive emotional experiences. Learning, however, is not only associated with positive emotional experiences: it also evokes negative emotions. Recently, researchers’ interest in students’ epistemic emotions (e.g., curiosity, frustration, surprise; Muis et al. 2018) has been steadily increasing. It will be of core interest in future research not only to address students’ academic emotions but also to explore epistemic emotions in order to capture more completely students’ emotional experiences when learning science.

Although our study provided evidence of the beneficial effects of the LC approach on students’ emotions, academic self-concept, behavioral engagement, and cognitive development, several limitations should be acknowledged. First, the teachers and their students were not randomly assigned to either the intervention or control condition. Furthermore, the sample size was not very large, which might have limited statistical power; an attempt was made to account for this by reporting effect sizes in the analyses. In addition, the teachers developed the LCs themselves; thus, the intervention was not standardized in terms of the use of specific instructional material but “only” in terms of the basic principles regarding the use of the LC approach. Closely related to this issue, the treatment check used in this study only covered the formal activities of teachers and students during the lessons; content aspects, such as the quality of questions and motivating feedback, were not measured, nor was the quality of the problems used in the romance and generalization phases. In future studies, an observation tool developed for the LC instruction approach (Riffert 2008) would be helpful for an in-depth investigation of the quality of the treatment implementation.

It is also important to note that the LC approach was implemented in 50-minute and 100-minute units throughout the school year. This raises the question of whether all students had sufficient time for free inquiry during the romance and generalization phases. To mitigate this problem, the LC instruction approach could be implemented in science projects lasting 2–3 days without any artificial interruptions.

Finally, since it has been demonstrated that self-efficacy has positive effects on students’ academic motivation, learning, and achievement (Schunk and DiBenedetto 2016) and that inquiry-based instruction can improve students’ self-efficacy (Jansen et al. 2015), it is a limitation that this study did not measure students’ self-efficacy. Future research should include this variable. Its measurement, however, would need to be planned carefully: simply asking the students in the treatment and control groups the same questions about their self-efficacy convictions concerning their physics/chemistry classes would measure different constructs, because the two groups experience different forms of instruction. Thus, measurement of self-efficacy should be closely tied to the form of instruction.

Outlook, Conclusion, and Implications

Beyond the suggestions outlined above, two further directions for future research appear promising.

First, it would be interesting to apply the test battery to students attending high-track schools in Austria (“Gymnasium”). Since these schools usually offer students different foci on specific subjects—such as the arts, languages, and the sciences—it would also be fascinating to test the impact of LC teaching on students who have chosen a science focus.

Second, it remains an open question whether the LC approach could be generalized to other subjects, such as geography, history, or languages. Such generalization presupposes careful theoretical preparation.

In conclusion, the findings of this study indicated that the application of the LC approach is effective in lower secondary education physics/chemistry classes. The results revealed that the effectiveness of this approach encompasses the cognitive, emotional, and behavioral dimensions of students’ learning. It was also demonstrated that students who attend low-track schools in Austria are likely to benefit from this specific type of moderate constructivist teaching, suggesting that this approach, although it is rather demanding, does not disadvantage learners with (on average) low(er) achievement (and motivation) levels. There is, however, reason to believe that students’ adaptation to the LC approach needs sufficient time; this informed the decision to conduct a long-term intervention study.

With regard to practical implications, the following aspects must be considered if the LC approach is applied to regular school science instruction: teachers need to be appropriately trained in this method to avoid the implementation of low-quality LC teaching (see, e.g., Lindgren and Bleicher 2005); the application must be monitored (e.g., through classroom observations and continuous reflection); and teachers must be made aware that they should not expect instant effects, particularly with regard to cognitive development and the change in students’ trait emotions.