Theoretical background

Adolescence presents a particularly challenging developmental period for all students. It is notorious for declines in academic self-concept (e.g., Nagy et al., 2010; Pesu et al., 2016), academic motivation (e.g., Gnambs & Hanfstingl, 2016), and reduced performance in educational settings (e.g., Wijsman et al., 2016). The Stage-Environment Fit Theory (Eccles & Midgey, 1989) attributes the origins of these negative trends to a mismatch between students’ developmental needs and their learning environments. In other words, the changes in classrooms settings and new grading standards, as well as increased academic demands from multiple teachers, do not go together with adolescent students’ neurobiological, cognitive, and social needs (Steinberg & Morris, 2001). Empirical research on the interplay of adolescent students’ self-concepts, achievement, and motivation has traditionally been focused on teacher-directed learning (TDL) environments (e.g., Bakadorova & Raufelder, 2020; Marsh & Craven, 2006; Marsh & Martin, 2011), while student-directed learning environments (SDL) remain less explored so far. While in TDL a teacher is responsible for students’ progress, SDL persuades with an individually tailored program, when a student takes over the responsibility for the learning goals, educational needs, and outcomes, while teachers facilitate the progress. As in general, individualized instruction is most beneficial for each particular learner (Watts-Taffe et al., 2012), student-directed learning per se is designed to better meet students’ needs. At the same time, SDL may theoretically be challenging at this age, as adolescents may not yet have fully developed the cognition-, emotion-, and behavior-regulating skills (Smith et al., 2013; Steinberg, 2005). The existing empirical research however taps at positive associations of SDL and motivation, mastery goals, and positive emotions in secondary school settings (Schweder & Raufelder, 20232024).

The present study explores the interplay between academic self-concept, achievement, and goal orientations in both TDL and SDL. The overarching goal is to test (a) whether there are reciprocal associations between domain-specific academic self-concepts and grades over time in both learning environments, in line with the Reciprocal Effects Model (Marsh, 1990; Marsh & Craven, 2006; Marsh & Martin, 2011), and (b) whether mastery and performance goal orientations would mediate these associations, extending the Reciprocal Effects Model.

Academic self-concepts, achievement, and goal orientations

Academic self-concept refers to a student’s perception of their own school-related abilities (Shavelson et al., 1976). It is a complex domain-specific construct with multiple dimensions for comparison (e.g., individual or criterial (Spinath et al., 2012)). In this study, criterion-unsensitive, domain-specific academic self-concepts have been explored that depict students’ perceptions of their own competence in academic settings. Empirical studies show that academic self-concept is a significant predictor of educational outcomes, including goal orientations, academic achievement, and career aspirations (e.g., Barker et al., 2005; Denissen et al., 2007; Marsh & Craven, 2006). While in pre-adolescence, academic self-concept and achievement-related outcomes exhibit weak correlations, during adolescence, these correlations strengthen and become more stable (Marsh, 1989). Age-related differences are relevant not only for the association between these variables but also for their magnitude (Marsh, 1989; Perinelli et al., 2022). Gender-related findings tap at domain-specific effects: girls tend to report higher verbal self-concepts, whereas boys tend to report higher mathematical self-concepts (Heyder et al., 2017; Mejía-Rodríguez et al., 2020; Skaalvik & Skaalvik, 2004).

The Reciprocal Effects Model (Marsh, 1990; Marsh & Craven, 2006; Marsh & Martin, 2011) suggests that students’ academic self-concept and achievement are interdependent. In other words, the higher a student’s academic self-concept, the higher their achievement is, and vice versa. This model is supported by empirical research across all stages of schooling process in TDL environments (e.g., Guay et al., 2003; Marsh & Craven, 2006; Marsh & Martin, 2011) in mathematic and verbal domains (Jacobs et al., 2002; Pesu et al., 2016). While the mathematics domain is characterized by a student’s beliefs about their own abilities in mathematics (Opacic & Kadijevic, 1997), verbal self-concept refers to a student’s beliefs about their own written and oral language skills, including reading comprehension (Locher et al., 2021). The domain-specific research within the internal/external frame of reference model (Marsh, 1986) suggests that positive effects between academic self-concept and achievement could be confirmed within, but not across the verbal and mathematic domains; and therefore, the domains should be studied separately.

The existing domain-specific findings on the associations of academic self-concepts and achievement within the domains suggest differentiated results for verbal and mathematic models: while Retelsdorf and colleagues (Retelsdorf et al., 2014) found stronger effects from reading achievement to reading self-concept that vice versa, the longitudinal five-wave model for mathematics (Arens et al., 2017) showed robustness of the reciprocal effects in both directions. A meta-analysis of longitudinal studies (Wu et al., 2021) suggests students’ age is a statistically significant moderator of the effect of achievement on academic self-concept, while both age, achievement level, and type of achievement measurements are important to consider for the effect of academic self-concept on achievement.

Various approaches have been employed to integrate academic motivation into the Reciprocal Effects Model (see Green et al., 2006). It has been directly integrated (see Green et al., 2006), tested as a mediator (e.g., Bakadorova & Raufelder, 2020), or considered a moderator variable (Valentine & DuBois, 2005). Despite the encouragement by Valentine and DuBois (2005) almost 20 years ago to “include these [motivation] variables more frequently when studying the relationship between self-beliefs and achievement” (p. 72), only a few longitudinal studies follow this recommendation. This may be attributed to theoretical diversity, differing operationalizations of motivational constructs (see Green et al., 2006), and the distinction of domain-specific and domain-unspecific studies. Building on a recent domain-unspecific study that demonstrated mastery goal orientation partially mediates the association between academic self-concept and achievement (Bakadorova & Raufelder, 2020), the present study aims to replicate this finding in a sample of secondary school students, taking a domain-specific approach and exploring potential differences between TDL and SDL.

In academic settings, achievement goals are often regarded as tendencies for approach or avoidance orientations within achievement motivation (Weiner, 1990). As such, goal orientation theory is a social-cognitive theory of achievement motivation that investigates why students engage in their academic work. There exist several frameworks that differentiate between approach and avoidance goal orientations. Thus, Elliot and MacGregor (2001) differentiate between mastery approach, mastery avoidance, performance approach, and performance avoidance goal orientations. Niemivirta et al. (2019), in turn, regard mastery goal orientation as either extrinsic or intrinsic, and add work avoidance orientation. While differentiation between different orientations of mastery goals should be treated carefully and needs further exploration (see Bong, 2009), decades of research agree on the differentiation between performance approach and performance avoidance orientation. Following the trichotomous framework of achievement goal theory (Elliott & Church, 1997), empirically supported by many works (e.g., Hulleman et al., 2010; Pekrun et al., 2006), this study differentiates among mastery, and performance-approach and performance-avoidance goal orientations. In a school setting, mastery goals aim at increasing competence (Ames, 1992) and directly relate to self-regulated learning strategies (Pintrich, 2000). Performance goals, in turn, foster comparison and competition to increase individual ability perception (Covington, 2000). While performance-approach goal orientations aim to demonstrate competence and outperform others, performance-avoidance goal orientations relate to the wish to perform not worse than others and hide incompetence (e.g., Darnon et al., 2007; Pintrich, 2000).

A variety of studies shows that academic self-concept and goal orientations are highly interdependent constructs that are rooted in social comparison processes (e.g., Niepel et al., 2014; Huguet et al., 2009; Middleton & Midgey, 1997). However, academic self-concept presents the more stable variable that further defines students’ motivational orientations (Yeung et al., 2012). In the present study, the assumption by Barker et al. (2005) has been followed, that “variables drawn from self-concept and goal theories taken together will provide a fuller explanation of academic achievement than is possible with either self-concept or motivational goal variables alone” (p. 1).

The existing findings on the associations of verbal and mathematics academic self-concepts, goal orientations, and achievement in adolescence are inconsistent: Niepel et al. (2014) examine the relationship between academic self-concept and goals by cross-lagged panel models with achievement as an outcome variable. The results suggest academic self-concept in mathematics (grade 8) is positively predicted by performance-approach goals (grade 6), and is negatively predicted by performance-avoidance goals (grade 6), but not mastery goals. In addition, no goal orientation reveals a direct association with achievement. A study of Preckel and Brunner (2015), in turn, shows that mathematic self-concept and performance goals are reciprocally related; however, only mastery goals reciprocally associate with academic achievement. In contrast, the research of Seaton et al. (2014) reports positive reciprocal relations between performance approach goals and achievement in mathematics. The mastery goal orientation was reinforced by mathematic achievement, but not at every time point. Yet another study by Paulick and colleagues (Paulick et al., 2013) finds positive longitudinal reciprocal relationships between mastery approach goals and achievement. In addition, a domain-specific study on causality among academic self-concept, achievement, and goal orientations in mathematics and verbal domains shows that the causal relation may be domain-specific (Barker et al., 2005). Specifically, in verbal domain, goals affect achievement through the perception of self, while in mathematics domain, self-concept affects achievement through goal orientations.

Overall, the existing research shows no unity on the causal ordering of academic self-concepts, goal orientations, and achievement, and suggests possible domain-specific differences. There are more studies in the domain of mathematics than in the verbal domain. In addition, all studies so far have been conducted in TDL, while students’ academic self-concepts and goal orientations in TDL and SDL may differ.

Learning environment as a potential moderator

While some research focuses on differential roles of learning environments, as linked to teachers’ instruction (e.g., Trautwein et al., 2006), there are only few studies (e.g., Ryser et al., 1995) that contrast academic self-concepts of students from different learning settings. This may be due to the fact that TDL approach is widely practiced in schools. TDL is traditionally characterized by teacher-directed systematic instruction of students, aimed at transmission of basic skills, facts, and information (Rosenshine & Stevens, 1984).

However, nowadays SDL gains in importance in school practice, as either a supplement or a substitute to TDL. The term SDL stems from self-directed learning, originally used in adult education (Knowles, 1975), and refers to self-directed learning in school-related contexts. The central aim of SDL is the development of learner autonomy (Armstrong, 2010); and therefore, SDL embraces students’ both self-regulated and self-determined learning within and beyond a certain educational situation. SDL is also a broader construct in terms of a learner’s degree of control and ability to choose a learning activity, even though in some papers the terms “SDL” and “self-regulated learning” are used interchangeably (for an overview, see e.g., Saks & Leijen, 2014). Several studies show that SDL is associated with academic achievement (Chou & Chen, 2008; Lounsbury et al., 2009), and students who use and choose activities that match their interest and abilities (Hussain et al., 2011) tend to show higher achievement scores as compared to students in traditional TDL settings, even though SDL rather focuses mastery of competencies rather than grade improvement.

In school practice, SDL is often addressed by use of competency grids (Kulakow, 2020). Competency grids help students self-diagnose their learning needs, identify their goals and resources for learning, set own learning strategies, and evaluate the results. Empirical studies with adolescents that contrast SDL and TDL show higher autonomy, mastery goals, and academic self-concepts among SDL students (Kulakow, 2020; Schweder et al., 2019, Schweder & Raufelder, 2021), and also higher achievement (Orawiwatnakul & Wichadee, 2017). Longitudinal research shows that students from SDL have more positive motivation development in secondary school (Raufelder & Kulakow, 2021) due to individual control of own learning progress, continuous self-reflection, individualized feedback from teachers, less social comparison to peers, and, generally, better need satisfaction as stated in the Stage-Environment Fit Theory (e.g., O’Mara et al., 2006; Schweder et al., 2019; Schweder & Raufelder, 2021).

By combining the Stage-Environment Fit Theory and Reciprocal Effects Model, the present study has the goal of testing whether (a) consistent with the Reciprocal Effects Model (Marsh & Craven, 2006; Marsh & Martin, 2011; Marsh, 1990) there is a reciprocal association between domain-specific academic self-concepts and grades over time in both verbal and mathematics domains in both SDL and TDL, and whether (b) Reciprocal Effects Model may be extended by goal orientations as mediators.

Hypotheses

Previous studies revealed higher academic self-concept (Kulakow, 2020), mastery goal orientation (Schweder, 2020; Schweder et al., 2019), and achievement results (Orawiwatnakul & Wichadee, 2017) for students in SDL as compared to students in TDL. As such, learning environments have shown to be a distinctive factor not only in mean differences of variables, but also in the associations of variables. Accordingly, it was hypothesized (H1a), that there are substantial differences in the longitudinal associations of mathematic and verbal self-concepts and grades between students from TDL and SDL environments. In detail, we expect students in SDL to score higher on all variables. We also expect mastery goals to mediate the relation between academic self-concepts and grades in SDL as compared to TDL. The Reciprocal Effects are expected to be stronger in mathematics than in verbal model. In extension of the Reciprocal Effects Model, (H1b) goal orientations were tested as possible mediators. Since there is only one study so far (Bakadorova & Raufelder, 2020) that regards mastery goals as a mediator in the Reciprocal Effects Model, and, according to other studies, a different arrangement of these variables may be possible (e.g., Valentine & Dubois, 2005), H1b is exploratory in nature. As there might be domain-specific differences in the interplay of academic self-concept, goal orientations, and achievement (Barker et al., 2005), the hypotheses are tested for the mathematics and verbal domains separately.

Method

Participants and procedure

In total, 1153 students (Mage = 13.97; SD = 1.37, 49% girls; grades 7–10) participated in the current study. The students attended 57 classrooms in six secondary schools in the north of Germany (place of data collection was removed in the review process for anonymization). They were first surveyed during the winter term 2015/2016 (t1) and approached again 0.5 years later (t2). Of the six schools, three used SDL, based on competency grids; the other three followed TDL. The schools of the TDL group were randomly sampled. The SDL schools were chosen according to the following criteria: (1) use of competency matrices comprised part of the curriculum and was consequently applied; and (2) practice of SDL for at least 6 years, so that, besides occasional school changes, students would know only this instructional approach in secondary school. To ensure the ethical standards of the 1964 Declaration of Helsinki and of the German Psychological Society, all participants and their parents provided written informed consent.

After all necessary permissions were signed, two trained research assistants approached the students, distributed the questionnaires and reiterated the goal of the study and the anonymity of data collection, and explained the use of the Likert scales. They remained present throughout the data collection.

Measures

Domain-specific academic self-concepts

Self-concepts for mathematics (mathematics domain) and German (native language, verbal domain) were assessed with two scales from the PISA 2000 questionnaire (Artelt et al., 2004). Both scales consist of three items, each ranging from “1” (“not true at all”) to “4” (“completely true”). The mathematics self-concept scale (e.g., “I have always been good at math”) achieved good reliability at both measurement points (ωT1 = 0.87, ωT2 = 0.87). The verbal self-concept scale (e.g., “I learn fast in my German class”) achieved adequate reliability (ωT1 = 0.67, ωT2 = 0.73).

Goal orientations

To capture students’ goal orientations, the scales by Spinath and colleagues (Spinath et al., 2012) have been used. Answers were rated on a 5-point Likert scale, from “1” (“not true at all”) to “5” (“completely true”). The subscale mastery goal orientation featured eight items (e.g., “At school, I like to learn as much as possible”), with reliability of ωT1 = 0.82. The subscale approach-performance goal orientation consisted of seven items (e.g., “In school, I want to get better grades and feedback than others”) and achieved proper internal consistency (ωT1 = 0.79). The avoidance-performance goals subscale consisted of eight items (e.g., “In school, I try to avoid that other students think that I was stupid”) and exhibited good reliability (ωT1 = 0.84). Goal orientations were tested as potential mediators, so only students’ ratings on these variables at t1 have been used.

Grades

At both waves, students’ self-reported grades in the respective subjects from their certificates at the end of the previous school term have been collected. German school grades range from “1” (best possible outcome) to “6” (worst possible outcome). Accordingly, in the Math model the corresponding Math grade was utilized, whereas the German grade was used in the German model. The grades were reverse recoded in the process of data analyses for better interpretation of the results, so that higher scores reflect greater achievement. Additionally, the German grades with the LGVT 6–12 (Schneider et al., 2007) have been considered in the verbal model as covariates to address potential bias of self-report.

Covariates

The existing studies show that girls tend to report higher verbal self-concepts, whereas boys tend to report higher mathematical self-concepts (Heyder et al., 2017; Mejía-Rodríguez et al., 2020; Skaalvik & Skaalvik, 2004). Therefore, students’ gender was included as a covariate. Students’ age was also incorporated into the model as a control variable, as the existing studies show age-related differences in the development of academic self-concepts, goal orientations, and achievement (Marsh, 1989; Perinelli et al., 2022).

Missing data

The present study was subject to a certain degree of missingness. Across all observed values, 6.94% were missing, as a result of 427 incomplete cases. Missingness across all 41 observed variables varied between 0 and 33.22%. To investigate whether missingness would lead to a substantial bias, missing data patterns were examined. The most prominent missing data pattern was caused by dropping out of the study (n = 306) at t2.

A series of Bonferroni corrected t tests was run on the manifest variables to examine whether the students who dropped out of the study differed significantly in the t1 variables from those who remained. Students who dropped out had significantly worse grades in mathematics (t(693.23) = –4.54, padj < 0.01). In the models, full information maximum likelihood (FIML) estimation was used to compensate all missing data. In the presence of missingness, FIML estimation has proved to be superior to other missing data techniques (e.g., listwise deletion), and leads to less biased estimates while retaining statistical power (Schafer & Graham, 2002).

Statistical analyses

To examine the hypotheses, multi-group cross-lagged panel analyses with additional indirect effects in Mplus 8.5 (Muthén & Muthén, 19982017) have been conducted. The models examine reciprocal effects from one construct at one point of measurement on another construct (i.e., cross-lagged effect) and simultaneously to the same construct at a later point of measurement (i.e., autoregressive effect) (Geiser, 2013; Kearney, 2017). The (1) verbal model and (2) mathematics model included autoregressive paths (i.e., stability of self-concept and grades) and cross-lagged paths (i.e., reciprocal effects of self-concept on grades). Additionally, the t1 achievement goal orientations were included as mediators. The indirect effects were estimated using the delta method with symmetric confidence intervals (MacKinnon, 2008). The effects were considered as significant if the 95% confidence interval did not include 0. For direct effects, the conventional p < 0.05 criterion was applied. The cross-lagged effect sizes were estimated as recommended by Orth and colleagues (Orth et al., 2022).

A multiple-group approach was used to determine whether there are any significant differences in the autoregressive and lagged regression coefficients between students from SDL and TDL. With this method, group variations in regression coefficients can be identified as moderating or interaction effects (Mulder & Hamaker, 2021). More specifically, a model in which all regression coefficients and covariates are constrained to be the same across the groups is contrasted with a multiple group CLPM with no constraints across the groups. It is possible to determine whether or not (some of) the lagged coefficients differ between the groups using the χ2-difference test (Self & Liang, 1987).

Model fit was evaluated by χ2, RMSEA, SRMR, and CFI (Hu & Bentler, 1999). Typically, a good fit to the data is indicated by CFI higher than 0.90, RMSEA less than 0.05, and SRMR less than 0.08. The prevailing consensus is that CFI values higher than 0.95 and RMSEA and SRMR values lower than 0.05 or 0.06 indicate a strong fit to the data.

All models were specified using the MLR estimator. In order to address the multilevel nature of the dataset (i.e., students nested in classes: nclasses SDL = 36; nclasses TDL = 21; average number of students in class = 18.14), type is complex that adjusts standard errors by adding sampling weights to the estimates in relation to the clusters has been used (Asparouhov, 2005).

Results

Descriptive statistics and measurement invariance

Table 1 shows the descriptive statistics, while Table 2 demonstrates bivariate correlations between all variables.

Table 1 Descriptive statistics
Table 2 Intercorrelations for SDL and TDL

Before the scales were summarized into variables, we conducted confirmatory factor analyses to examine whether the underlying constructs were adequately measured by the scales indicators. Subsequently, we added parameter restrictions in a step-wise manner following the approach by Little (2013) to ensure measurement invariance across groups and time. The corresponding analyses are provided in the Electronic Supplement 1.

Mathematic self-concept

A multi-group cross-lagged panel model was run for both learning environments. This model fit the data well (χ2(12) = 24.071, p = 0.02, CFI = 0.99, RMSEA [90% CI] = 0.042 [0.016–0.066], SRMR = 0.02). Subsequently, a model in which all paths were invariant across groups was run, which also showed acceptable fit indices: χ2(40) = 63.96, p < 0.05; CFI = 0.99, RMSEA = 0.032 (0.016–0.046); SRMR = 0.05. The χ2-difference test of these two nested models yields Δχ2(28) = 41.69, p < 0.05, which implies that the autoregressive and lagged effects of self-concept and achievement as well as the covariances appear not to be the same for students from TDL and SDL.

SDL—direct and indirect effects

Figure 1 depicts direct and indirect effects and covariates between the variables in SDL. The Reciprocal Effects Model was confirmed, as academic self-concept and achievement in math were reciprocally interwoven over time. The effect sizes can be interpreted as large as standardized coefficients are 0.12 and higher (Orth et al., 2022). However, no statistically significant indirect path could be found. The model explained 43.8% of variance in grades (R2 = 0.49) and 53.3% of variance in academic self-concept (R2 = 0.53) at t2. Interestingly, mathematic academic self-concept at t1 was significantly associated with performance-approach goals, whereas grades in general were negatively associated with performance-avoidance goals; and grades at t1 positively associated with mastery goals. Boys reported higher academic self-concept at t1 and t2, whereas older students reported lower academic self-concept at t1.

Fig. 1
figure 1

Multi-group cross-lagged panel model with indirect effects for students in SDL environments in mathematics. Note: Estimates are shown as unstandardized (first position) and standardized (second position) values; only significant effects are displayed; non-significant path are shown in dotted lines; *p < 0.05, **p < 0.01, ***p < 0.001

TDL—direct and indirect effects

Figure 2 demonstrates direct and indirect effects and covariates between the variables for TDL. The Reciprocal Effects Model was not confirmed, as academic self-concept is associated with achievement in mathematics over time but not vice versa. The effect size of academic self-concept and achievement over time can be interpreted as large  = 0.33) (Orth et al., 2022). However, no statistically significant indirect path could be found. Students with higher academic self-concept tend to follow a mastery or performance-approach goal orientation, whereas grades negatively associate with the performance-avoidance goal orientation. The model explained 45.0% of variance in grades (R2 = 0.45) and 52.7% of variance in academic self-concept (R2 = 0.53) at t2. Boys reported higher academic self-concept at t1, whereas older students reported lower academic self-concept at t2.

Fig. 2
figure 2

Multi-group cross-lagged panel model with indirect effects for students in TDL environments in mathematics. Note: Estimates are shown as unstandardized (first position) and standardized (second position) values; only significant effects are displayed; non-significant paths are shown in dotted lines; *p < 0.05, **p < 0.01, ***p < 0.001

Verbal self-concept

A multi-group cross-lagged panel model was run and fit the data well: χ2 (24) = 51.22, p < 0.05; CFI = 0.98, RMSEA = 0.044 (0.027–0.061); SRMR = 0.26). Subsequently, a model in which all paths are invariant across groups was run, which also showed acceptable fit indices: (χ2(64) = 92.192, p < 0.05, CFI = 0.98, RMSEA [90% CI] = 0.028 [0.013–0.040], SRMR = 0.05). The χ2-difference test of these two nested models yields Δχ2(41) = 44.59, p > 0.05, which implies that statistically both models fit equally well (Werner & Schermelleh-Engel, 2010). In the next step, all paths and covariances, which were statistically non-significant in both groups, have been set to be free across groups. This model with all statistically significant paths invariant across groups and all non-significant paths free across groups showed acceptable fit indices: χ2 (51) = 89.32, p < 0.05; CFI = 0.97, RMSEA = 0.036 (0.023–0.048); SRMR = 0.55). The χ2-difference test between this model and the prior one yields Δχ2(33) = 48.02, p < 0.05, which implies that the statistically significant autoregressive and lagged effects of self-concept and achievement as well as the covariances appear not to be the same between students from TDL and SDL.

SDL—direct and indirect effects

Figure 3 depicts all direct and indirect effects as well as the covariates between the variables for students in SDL. The Reciprocal Effects Model was confirmed for the verbal domain. The effect sizes can be interpreted as large (β = 0.16) (Orth et al., 2022). However, no statistically significant indirect paths could be found. Students with higher verbal self-concept tend to follow a mastery or performance-approach goal orientation, whereas grades negatively associated with the performance-avoidance goal orientation. The model explained 38.2% of variance in grades (R2 = 0.38) and 30.9% of variance in academic self-concept (R2 = 0.31) at t2. Boys reported lower academic self-concept and poorer grades at t2. The better the students’ results in the reading comprehension and speed test, the better their grades and the higher their academic self-concept at t1 were.

Fig. 3
figure 3

Multi-group cross-lagged panel model with indirect effects for students in SDL environments in German (verbal domain). Note: Estimates are shown as unstandardized (first position) and standardized (second position) values; only significant effects are displayed; non-significant paths are shown in dotted lines; *p < 0.05, **p < 0.01, ***p < 0.001

TDL—direct and indirect effects

Figure 4 depicts all direct and indirect effects as well as the covariates between the variables for students in TDL. The Reciprocal Effects Model could not be confirmed for students in TDL. The effect size from verbal self-concept to subsequent achievement can be estimated as large (β = 0.23) (Orth et al., 2022). Additionally, no statistically significant indirect path could be found. Students with higher academic self-concept tend to follow a mastery or performance-approach goal orientation, whereas students with better grades, are likely to follow a mastery goal orientation. The model explained 37.3% of variance in grades (R2 = 0.37) and 37.6% of variance in academic self-concept (R2 = 0.38) at t2. Boys reported lower academic self-concept at t1, whereas older students tend to report poorer grades and lower academic self-concept at t2. Interestingly, students’ results in the reading comprehension and speed test had no association to their grades or verbal self-concepts.

Fig. 4
figure 4

Multi-group cross-lagged panel model with indirect effects for students in TDL environments in German (verbal domain). Note: Estimates are shown as unstandardized (first position) and standardized (second position) values; only significant effects are displayed; non-significant paths are shown in dotted lines; *p < 0.05, **p < 0.01, ***p < 0.001

Discussion

Based on the Stage-Environment Fit Theory and the Reciprocal Effects Model, the major aim of the current study was to test whether (a) there is a reciprocal association between domain-specific academic self-concepts and grades over time in both SDL and TDL and whether (b)—in extension of the Reciprocal Effects Model—students’ goal orientations would mediate this association.

In line with H1a, substantial differences between students from TDL and SDL environments could be found for both mathematics and verbal domains. The Reciprocal Effects Model could only be confirmed for SDL environment for verbal and mathematics models alike. It may be explained by a suggestion, that in SDL, due to large degree of self-determination and self-regulation students may perceive their achievements as a part of their academic self-concepts, so these concepts are better interrelated over time (in this study: 0.5 years of secondary school). There may be several reasons why the Reciprocal Effects Model did not function in the TDL. First, in TDL grades are externally assigned by teachers and might be less associated with students’ academic self-concepts over time. Second, as students in the study attended to “intermediate” track (see Arens et al., 2018 for more details on German tracking), their academic self-concepts may be vulnerable and rather associated with previously obtained last years’ grades (e.g., Kastens & van Wickeren, 2023). This finding shows that more research on the longitudinal interplay of self-concepts and grades especially in “intermediate” track is needed, as previous research found domain-specific associations between academic self-concepts and grades in both academic and vocational tracks (however, not over time) (Arens et al., 2018). The findings of the study show in TDL/”intermediate” track, the longitudinal association between self-concept and achievement may be unidirectional, supporting the self-enhancement model (Jones & Grieneeks, 1970) that postulates that prior school self-concept leads to later academic achievement, for both verbal and mathematic domains. In addition, (a) there exist other studies (e.g., Skaalvik & Hagtvet, 1990) that do not confirm the reciprocal effects, possibly due to developmental changes and educational demands in secondary schools or statistical differences between the model(s) that different author(s) use (see more in Burns et al. (2020)). In detail, the between-person differences in self-concept may have caused the reciprocal effects in “traditional” CLPM analyses (Burns et al., 2020; Hübner et al., 2023).

In contrast to H1b, none of the goal orientations functioned as a mediator between academic self-concept and achievement in both domains. This finding stands in line with some of the existing research (e.g., Seaton et al., 2014; Steinmayr et al., 2019) that presents academic self-concept as a more important predictor of academic success than goal orientations. The direct effects between the constructs in both models support previous research (e.g., Marsh et al., 2015; Retelsdorf et al., 2014), such as both verbal and mathematics self-concepts significantly associated with achievement in each subject within time at both measurement points. Even though goal orientations did not mediate the association between academic self-concept and achievement, both verbal and mathematics models for SDL and TDL showed domain-specific differences in the interplay of academic self-concepts, grades, and goal orientations. For SDL model in mathematics, the mathematic academic self-concept was positively associated with performance-approach, but not mastery or performance-avoidance goal orientations. In contrast, for the SDL verbal model, the verbal academic self-concept positively associated with both mastery and performance-approach goal orientations. While the result for the verbal model stands in line with the existing learning environment unspecific research (e.g., Jiang et al., 2014; Lee et al., 2014), and underline the importance of a positive academic self-concept in practical terms, the results for the mathematics model are somewhat surprising. According to the properties and characteristics of SDL, one would rather expect mastery than performance goal orientations to associate with the academic self-concept. This finding suggests that, first, social comparison processes (e.g., comparison of own results on competency grids to the progress of peers) in SDL may be underestimated. Second, it might mean that there might be substantial differences in SDL environments, depending on how they are established in practice (in thus study SDL in class was addressed by use of competency grids). Third, as the verbal model included reading speed and comprehension as covariates, and mathematics did not, that could indicate that more complex assessment of achievement, than grades, is warranted in future studies. In turn, the grades at t1 negatively associated in SDL with performance-avoidance goals in verbal and mathematics models alike. In other words, the better grades the students reported, the less performance-avoidance goals they revealed. One possible explanation in school environment could lie in the nature of feedback and the absence of expectation of grading for task the students get in SDL, as some studies (e.g., Pulfrey et al., 2011) show that in particular that students’ expectation of a grade for task accomplishment consistently led to greater adoption of performance-avoidance goal orientations.

The TDL models in mathematics and verbal domains alike show academic self-concept is positively associated with mastery goals and performance-approach goals, but not performance-avoidance goals, which stands in line with the existing research (e.g., Jiang et al., 2014; Lee, et al., 2014). Interestingly, grades in both domains are positively associated with mastery goal orientations, which also some other existing studies report (e.g., Sparfeldt et al., 2015), but were not related to performance-approach goal orientations. This finding might indicate that while performance-approach goals relate to one’s wish to outperform others, grades might be not a reliable indicator of competence demonstration in school context. Thus, students might rather exhibit their skills in participating in group discussions or engaging in extracurricular activities that do not directly impact their grades.

Gender played an important role for both verbal and mathematical self-concepts for SDL and TDL students. This finding is supported by previous TDL-based studies that state that while girls report higher verbal self-concepts, boys tend to report higher mathematical self-concepts (e.g., Heyder et al., 2017; Mejía-Rodríguez et al., 2020; Skaalvik & Skaalvik, 2004). Age-related differences were, however, only relevant for TDL students, suggested by previous research: older students tend to have lower academic self-concepts and worse grades (Perinelli et al., 2022). This finding may suggest that academic self-concepts of students in SDL are less sensitive to (or exhibit slower changes) in the process of maturation, which means use of SDL practices among adolescents may present a working practical intervention aimed at diminishing the age-specific academic self-concept decline.

Overall, the results show further research on the reciprocal associations between domain-specific academic self-concepts and achievement in “intermediate track” is needed, using statistical models that consider between-person differences. In the “traditional” TDL setting, academic self-concept might be a stronger component than achievement. In contrast, to that, for students in SDL, the Reciprocal Effects Model was confirmed, i.e., that academic self-concept and grades were balanced variables. Goal orientations did not prove to be mediators for both learning environments alike.

Strengths and limitations

One of the limitations of the study is the use of self-reported data, even though criticism towards self-report data may as well address non-self-report data (Chan, 2009). Future studies should include multiple sources of data, such as teachers, parents, or peers to gather multiple perspectives. Also, the data was first collected in 2015, while school environments may have further developed since then. Furthermore, there is substantial criticism with regard to the use of self-reported grades as students’ achievement indicator (Kuncel et al., 2005) as self-reported grades tend to overestimate the actual grades. However, previous research has shown that self-report of grades (in the German school system) is sufficiently reliable (e.g., Dickhäuser & Plenter, 2005), and even though there are slight variations in longitudinal research within domains, self-reported grades are an adequate measure to address achievement in school context (Sticca et al., 2017). In addition, this study controlled for reading speed and comprehension in the verbal model. Unfortunately, no supplementary achievement tests in mathematics were conducted in this study.

Another limitation is that while the study uses a domain-specific approach, the achievement goal orientations were assessed domain-unspecifically. However, achievement goals have been found to be highly domain-unspecific (e.g., Hornstra et al., 2016). Recent research on the role of conceptualizing items that measure achievement motivation in data collection process (Michel et al., 2020) shows that domain-specific variance can be explained by self-concept and self-esteem, but not by domain-specific achievement motivation. In other words, goal orientations are either domain-unspecific in general or hard to grasp by domain-specification of items in the process of data collection, which should definitely be addressed in further studies. Furthermore, the independent and mediation variables were measured both at T1; and therefore, the directionality of the paths in the SEM is presumed to be conceptual rather than causal. Finally, the study design fails to disentangle trait from state variance, as recommended in the revised latent state trait theory (Steyer et al., 2015); and therefore, the findings should be treated with caution and further tested in a random-intercept cross-lagged model.

A clear strength of the study is the domain-specific exploration of the effects of mastery, and performance-approach and performance-avoidance goal orientations in the Reciprocal Effects Model in two learning environments, which has not been done before. Another important strength is clarity in definitions of academic self-concept and SDL, as the existing literature shows (a) considerable confusion and interchangeable use of the terms “self-concept” and “self-efficacy,” as well as (b) interchangeable use of the terms “self-directed learning” and “self-regulated learning,” even in definition of the term itself (for an overview, see Saks & Leijen, 2014). Considering this fact, future studies on both academic self-concept and SDL should focus on clear definitions of terms under research and interpret existing results with caution.

Conclusions

The present study tested an extension of the Reciprocal Effects Model by including the three different goal orientations as possible mediators, and explore possible differences from students from SDL and TDL. The Reciprocal Effects Model effects could only be confirmed for SDL, and the extension by goal orientations as mediators could not be confirmed. In practical terms, this means that while in SDL academic self-concept and achievement are interwoven, in TDL settings more attention should be paid to the formation of positive and realistic academic self-concepts of students through helpful teacher feedback (e.g., Burnett, 2003) rather than grades, as it would improve not only current but also future achievement.

In sum, the results expand the existing findings: while for general academic self-concept mastery goal orientation functions as a mediator in the Reciprocal Effects Model (Bakadorova & Raufelder, 2020), this finding could not be confirmed for the verbal/mathematics domains. In addition, the present findings uncover differences in the interplay of academic self-concepts, grades, and goal orientations between SDL and TDL which underlines the importance of learning environments for both theoretical and practical implications.