Metacognitive Confidence Can Increase but Also Decrease Performance in Academic Settings

The present research examined the role of metacognitive confidence in understanding to what extent people’s valenced thoughts guide their performance in academic settings. First, students were asked to engage in positive or negative thinking about exams in their major area of study (Study 1) or about themselves (Studies 2 and 3). The valence of these primary cognitions was manipulated to be positive or negative. Furthermore, a metacognitive variable, the perceived validity of the primary cognitions, was measured or varied to be relatively high or low. Finally, performance was assessed using a knowledge test (Study 1), a geometric shapes task (Study 2) or a selection of questions from the Graduate Record Examination (Study 3). In accordance with self-validation theory, we predicted and found that metacognitive confidence (relative to doubt) increased the impact of primary cognitions on performance. When thoughts were positive, increased confidence in the primary cognitions improved performance. However, when thoughts were negative, the same confidence validated the negative primary cognitions and reduced performance. Thus, metacognitive confidence can lead to opposite findings on performance depending on whether it validates performance-relevant positive thoughts or negative thoughts. Variations in the perceived validity of thoughts mediated the obtained effects. Therefore, we conclude that understanding the process of thought validation can help in specifying why and when metacognitive confidence is likely to work or to backfire in producing the desired performance effects.

Performance in academic settings is guided in part by accessible primary cognitions about oneself or the task at hand. These primary cognitions include thoughts and beliefs about one's skills (e.g., I can do this well; I am a good student) as well as the nature of the task (e.g., the test will be very difficult; this exam is useless), and many other relevant ideas. Such cognitions that pop into peoples' heads have the potential to impact performance. A relevant dimension of primary cognition that has been heavily studied in academic domains and beyond is the number or amount of performance-relevant thoughts. For example, Elliot et al. (1999) found that how much people thought about the test while taking it contributed to explaining the number of correct responses (Grass et al., 2017;Preckel et al., 2006;Strobel et al., 2019;von Stumm & Ackerman, 2013).
Beyond the amount of thinking, another important dimension that influences performance is the direction or valence of primary cognitions. For example, participants who have positive performance-relevant thoughts (e.g., yes, I can; this test is well suited for me) are more likely to do well on a variety of academic tasks than those who have neutral or negative thoughts (e.g., Burgmer & Englich, 2012;Gilman et al., 2006;Hertwig & Grüne-Yanoff, 2017;Snyder et al., 2002;Taylor & Walton, 2011). In one illustration, Martens et al. (2006) showed that women who were asked to generate positive thoughts about the self before taking a spatial rotation test and a math test performed better on those tasks than those asked to generate more neutral, control thoughts (see Miyake et al., 2010 for similar findings). Generating positive thoughts about oneself or the task has been associated not only with better performance at school but also with subsequent professional success and wellbeing (Lange et al., 1998;McGuire & McGuire, 1996;Seligman, 2018;Taylor et al., 2003;Merlo et al., 2018).
While having positive thoughts is associated with desired outcomes, having negative thoughts (e.g., focusing on worries, self-handicapping, etc.) is often associated with reduced academic performance compared to neutral or positive thoughts (Caraway et al., 2003;Davis & Nolen-Hoeksema, 2000;De Castella et al., 2013;DeCaro et al., 2010;Hunsley, 1985;von Hippel et al., 2019). For example, De Castella et al. (2013) classified students depending on the valence of their primary cognitions as measured with inventories such as success orientation and fear of failure. Higher levels of fear of failure were associated with less academic achievement and with more use of several maladaptive strategies such as self-handicapping behavior and school disengagement (Au et al., 2009;Martin, 2012).
The current research examines the role of a third, metacognitive, dimension of thinking -the perceived validity of one's thoughts (metacognitive confidence). As described in more detail shortly, the current research examines how perceived thought validity can interact with the valence of primary cognitions to determine performance in an academic context. Before getting to our core hypotheses and outline of the current research, we next elaborate upon the role of metacognitive confidence in educational settings.

Metacognition
Metacognition refers to the distinction between primary and secondary thoughts. Primary thoughts involve people's initial associations of some object with some attribute, or a projection of some object onto some dimension of judgment such as "I am a good student" (McGuire & McGuire, 1996). These primary cognitions can be focused on oneself, other people, the environment, or at any object (e.g., exams). As noted, many thoughts can serve as primary cognitions, including beliefs about self, one's classroom, the teacher, and many other ideas, and cognitions. Critical to the current research is that following a primary thought or cognition, people can also generate other thoughts that occur at a secondary level which involve reflections on the first level thoughts (e.g., Am I confident that I am a good student?). Metacognition refers to these second order thoughts, or thoughts about one's primary cognitions or thought processes (Briñol & DeMarree, 2012;Dunlosky & Metcalfe, 2009;Goupil & Kouider, 2019). In educational settings, metacognition is considered one of the most relevant predictors of performance (Metcalfe, 2009;Rawson & Dunlosky, 2007;Veenman & Alexander, 2011;Wang et al., 1990;see Hacker et al., 2009, for a review). Previous research in this domain has mainly focused on two separate but related aspects of metacognition: metacognitive knowledge and metacognitive control (Hacker et al., 2009;Mayer & Alexander, 2011).
Metacognitive knowledge refers to people's naïve theories about their knowledge. For example, Flavell et al. (1970) showed that how many hours students dedicated to studying depended on how much they thought they knew (subjective knowledge) regardless of how much they actually knew (objective knowledge; see Metcalfe, 2009, for additional examples). Also, the stronger one's feeling of knowing about a piece of information, the more time one is willing to spend searching for it (e.g., Costermans et al., 1992). The second aspect, metacognitive control, is also important for performance and refers to the regulation of cognitions to improve effectiveness (Brown, 1987;Hacker et al., 2000;Roebers et al., 2009;Schraw & Dennison, 1994). In prior research, accuracy is the key feature emphasized regarding these metacognitive aspects (Koriat, 2019;Nietfeld et al., 2005;Stankov & Lee, 2008;Stankov et al., 2012).
Instead of focusing on the relationship between metacognitive knowledge and control with accuracy as most prior research on metacognition does (e.g., Hacker et al., 2000;Koriat, 2011Koriat, , 2019Metcalfe, 2009;Nietfeld et al., 2005;Roebers et al., 2009;Stankov & Lee, 2008), the present research focuses on a sense that one's thoughts are valid or appropriate to use in guiding performance regardless of accuracy. As we describe next, the perceived validity of primary cognitions is one of the most essential dimensions of metacognition that determines whether thoughts are translated into judgments and action. Indeed, metacognitive confidence can enhance, attenuate, or even reverse the impact of primary cognitions .

Metacognitive Confidence
Previous literature on metacognition focused on accuracy has typically measured metacognitive confidence by asking participants about correctness (e.g., Are you sure your answer is correct?) in domains in which there is an objectively correct response (Ais et al., 2016;Efklides, 2011;Finn, 2008;Fleming et al., 2010;Hacker et al., 2000;Koriat, 2011Koriat, , 2019Maclellan, 2013;Nietfeld et al., 2005;Stankov & Lee, 2008;Stankov et al., 2012; for a review Brewer & Sampaio, 2006;Fleming & Daw, 2017;Fleming & Lau, 2014;Pleskac & Busemeyer, 2010). Instead of examining accuracy, the present research concerns people's confidence in their opinion-based thoughts, where there is no objectively correct answer. Specifically, we focus on subjective perceptions that one's thoughts are valid or appropriate to use in guiding performance. The key idea is that, although there is no actual accuracy, people still can feel that their thoughts (e.g., about their own strengths and weaknesses, or about exams, etc.) are the valid ones to use in guiding their performance. Therefore, in the present studies, metacognitive confidence is defined as a subjective belief regarding the validity of any primary cognition. Regardless of the accuracy of the initial belief, this subjective feeling of confidence about a thought is important because it can determine the extent to which people`s thoughts are used in forming judgments and, ultimately, in guiding their behavior.
Self-validation theory is a conceptualization that focuses on this essential aspect of metacognition: perceived thought validity. Unlike previous work on metacognitive accuracy, research on self-validation has shown that people's perceptions of their thought validity can stem from different antecedents, including a large list of incidental inductions such as recently recalling past instances of confidence  or having a momentary feeling of being powerful Durso et al., 2016). Also, people infer the validity of their thoughts based on the ease with which those thoughts come to mind. In self-validation studies, ease is manipulated to come from many different sources completely unrelated to the availability of information, including writing thoughts with a dominant hand (Briñol & Petty, 2003) or increasing the contrast between font and background (Briñol et al., 2006). Therefore, this self-validation approach is similar to other metacognitive accounts in highlighting the importance of perceptions of thought validity but differs from other metacognitive approaches in identifying sources of thought validity independent of accuracy, availability, skills for signal detection, memory abilities, etc.
Research on self-validation has shown that, when being deliberative, the confidence that people have in their thoughts is a critical determinant of whether those thoughts lead to more moderate or extreme judgments and action. Most of the prior self-validation work has been focused on examining specific variables of social influence that could affect perceived thought validity (e.g., credible persuasive sources; Briñol & Petty, 2009a) and domains within social psychology in which validation processes operate (e.g., attitude change; Briñol & Petty, 2009b). The goal of the current research is to introduce self-validation as a useful approach that can extend the utility of metacognitive validation processes to the domain of performance in academic settings.
To illustrate the utility of the self-validation approach, consider one study in the domain of attitude change. In this research (Petty et al., 2002, Study 2), undergraduate students received a message on the topic of instituting comprehensive exams for seniors. The students were randomly assigned to generate either positive or negative thoughts about the proposal (primary cognitions) and then they rated the validity of each thought listed (metacognitive confidence). Finally, participants provided their attitudes toward the proposal. The predicted interaction between valence of thoughts and thought validity on attitudes toward the proposal emerged. This interaction showed that the valence of participants' thoughts had an increasingly larger impact on judgments as perceived thought validity increased. The same occurred when the validity of thoughts was manipulated (by asking participants to recall past instances of confidence vs. doubt) rather than measured (Petty et al., 2002, Experiment 3).
In another set of examples, consider the metacognitive confidence that comes from feelings of power following thinking. The confidence that comes from feeling powerful can magnify whatever mental content is accessible increasing and decreasing cooperation or competition (Decelles et al., 2012;DeMarree et al., 2014;Hirsh et al., 2011), and leading to action or inaction (Durso et al., 2016), depending on what the primary cognitions are to begin with (e.g., positive or negative thoughts; thoughts about cooperation or competition).
In a recent self-validation work illustrating how thought confidence can moderate the impact of self-relevant thoughts (or self-talk) on behavior , cross fit athletes were randomly assigned to generate either positive or negative statements about themselves. In this paradigm, self-talk (i.e., generating positive or negative thoughts about oneself) served as the primary cognition induction. This initial induction was based on extensive literature on sport performance showing that what athletes say to themselves prior to or during performance can influence their performance outcome (e.g., Hatzigeorgiadis et al., 2011;Tod et al., 2011;Van Raalte et al., 2016). After generating positive or negative thoughts, the athletes were randomly assigned to a thought validation induction (nodding or shaking their heads) while listening over headphones to the self-statements they had recorded. Specifically, nodding one's head in a vertical manner while thinking leads to higher perceived thought validity than shaking one's head in a horizontal manner while thinking (Briñol & Petty, 2003). Finally, physical performance was assessed in tasks such as vertical jump. Consistent with the self-validation hypothesis, athletes' self-talk was significantly more impactful on their physical performance in the head nodding (validation) than in the head shaking (invalidation) condition. That is, when self-thoughts were positive, nodding led to better performance than shaking, but when self-thoughts were negative, the opposite resulted. In the present research, we examine for the first time to what extent metacognitive confidence (validity of thoughts) can also validate positive and negative thoughts affecting performance in an academic context.

Overview
The present research examines whether metacognitive confidence in performance-relevant thoughts can play a role in determining actual performance on academic tasks. All of our studies include a variable of performance-relevant primary cognition (i.e., positive or negative thoughts about exams or about the self) and a variable of secondary thinking (i.e., metacognition involving the perceived validity of thoughts). In the first study, validity of thoughts was simply measured by asking participants how confident they were in their thoughts, and in the second and third studies validity of thoughts was experimentally manipulated. Importantly, Study 3 included both a manipulation of thought validity and a subsequent assessment of that induction on perceived thought validity as the proposed mediator (i.e., a metacognitive assessment).
According to the self-validation hypothesis, we expected that the effect of the primary cognitions on performance would be greater for participants with higher levels of metacognitive confidence in their cognitions compared to those with relatively lower levels of thought confidence. Thus, we expected an interaction between thought valence and their perceived validity in predicting performance in academic settings. Thought valence would have a larger impact on performance under high rather than low metacognitive confidence conditions. Stated differently, higher confidence in positive thoughts would produce enhanced performance, but higher confidence in negative thoughts would produce diminished performance. This interaction pattern would extend prior work on self-validation to the domain of performance in academic settings.

Study 1
The primary goal of Study 1 was to provide an initial examination of our interaction hypothesis that metacognitive thought confidence would moderate the impact of primary cognitions on academic performance. To manipulate the valence of primary cognitions, undergraduates received information about a persuasive proposal advocating in favor of or against implementing senior comprehensive exams at their own university. Following this induction, participants reported the extent to which they had confidence in the thoughts they just listed. Finally, performance was measured with the scores obtained by the students in a brief exam about knowledge in the students' major -psychology.

Participants and Design
One hundred and twenty-seven undergraduate psychology majors (115 females, 11 males, and 1 unidentified) participated anonymously in this study. The age of the participants ranged from 18 to 29 (M age = 20, SD = 1.61). Participants were randomly assigned to one of two thought valence (Positive vs. Negative) conditions and the perceived validity of those thoughts (metacognitive confidence) was an additional measured predictor variable. Exam performance was assessed as the dependent variable. A power analyses was conducted using G*Power (Faul et al., 2007). After obtaining the effect in a pilot study briefly described in footnote 1 (i.e., Cohen's f 2 = .043), we planned for a similar effect size in this study. 1 The desired sample size for a two-tailed test (α = .05) of the predicted interaction with .80 power was a total of N = 186 participants. Our final sample was determined based on the number of participants that could be collected during the week in which the study was posted, resulting in a relatively smaller number of participants (127 participants) compared to the estimated one. The final sample results in power of .64 to detect the anticipated effect size based on pilot study.

Procedure
Permission to conduct the study was provided by the university institutional ethics committee before the study began. Upon arrival, participants were led to believe that their university had recently begun to consider a new policy requiring seniors to pass comprehensive exams in their major areas to graduate (see Petty & Cacioppo, 1986, for further details on this particular academic topic). Participants were told that if implemented, the exam policy would begin in the next academic year and would apply to all students currently enrolled at their university. The topic was framed to be of high personal relevance to all participants (Petty & Cacioppo, 1979;Tormala et al., 2002) because metacognitive processes of validation are more likely to occur when students are motivated to think. Then, the students were told that before implementing this exam policy, their university wanted to assess their reactions and so they would be asked to read about the policy and report their thoughts and opinions about it. After that, participants were randomly assigned to receive a persuasive message that presented either positive (i.e., benefits) or negative (i.e., problems) information about these comprehensive exams. Additionally, participants listed either positive or negative thoughts towards the exams to match the received message (primary cognitions). Next, participants indicated the perceived validity of the thoughts (metacognitive confidence) they generated about the exam proposal. Finally, participants' performance was assessed by asking them to do a brief psychology exam. After reporting demographic and ancillary questions, participants were thanked and debriefed.

Thought Valence
Participants were randomly assigned to generate either positive or negative thoughts about the exams (primary cognitions). As noted above, this manipulation was accomplished by first exposing the participants to a one-sided message in favor or against the exams and by asking them to generate thoughts only in the same direction as the message. The messages were taken from prior research (e.g., see Briñol et al., 2006;Tormala et al., 2007).
In the positive thoughts condition, the message emphasized various benefits of comprehensive exams. For example, the message said that "students and faculty would work harder. The prospect of a comprehensive exam would increase the grade point average of undergraduates significantly," as well as "students from institutions with comprehensive exams are more likely to be accepted into good graduate programs," and "in recent years, starting salaries of students from other institutions with comprehensive exams have been, on average, three thousand to four thousand dollars higher than starting salaries for students graduating from comparable institutions without the exams." Additionally, after reading this message presenting exclusively arguments in favor of the proposal, participants in this condition were asked to list their own positive thoughts about it.
In contrast, in the negative thoughts condition, the message emphasized various risks of comprehensive exams. For example, the message stated that "the prospect of a comprehensive exam clearly appears to be stressful and overwhelming, leading students to achieve less and faculty to be less effective," as well as "students from institutions with comprehensive exams are less likely to be accepted into good graduate programs," and " in recent years, starting salaries of students from other institutions with comprehensive exams have been, on average, three thousand to four thousand dollars lower than starting salaries for students graduating from comparable institutions without the exams." After reading this negative message, participants in this condition were asked to list their thoughts about instituting the exams. Previous research in persuasion has shown that this induction is an effective procedure to ensure participants generate relatively positive or negative evaluative responses toward topics like comprehensive exams Killeya & Johnson, 1998). 2

Perceived Validity of Thoughts
Following the task of listing thoughts, participants indicated the validity of their thoughts (metacognitive confidence) about comprehensive exams by answering the two following questions: "How confident are you about your thoughts regarding comprehensive exams?" (1 = "Not at all confident" to 7 = "Extremely confident"), and "How certain are you in the thoughts that you have just generated?" (1 = "Not certain at all" to 7 = "Extremely certain"). The two measures were significantly correlated (r = .68, p < 0.001) and were thus averaged to form one single measure of perceived validity. Similar items have been used to measure thought confidence in other self-validation studies revealing that perceived thought validity can be measured by merely asking participants to rate their thought confidence (e.g., Petty et al., 2002;Horcajo et al., 2020;Requero et al., 2020; see also Graft et al., 2017). Responses to this composite index were scored so that higher values represented the perception of more validity of thoughts.

Dependent Variable: Exam Performance
Participants were asked to do a knowledge test on psychology. This dependent measure was selected because of its relevance for the object of thought in this study (i.e., thoughts about comprehensive exams in one's major). This test included twelve questions about psychology. Participants' correct responses to these questions were summed to create the exam performance index. That is, participants' performance was assessed by counting the number of questions that they correctly solved, ranging from 0 to 12 correct answers (for similar procedures of assessing performance based on standardized tests, see Callender et al., 2016;Elliot et al., 1999;Hacker et al., 2000;Veenman et al., 1997).

Results
The number of questions correctly answered in the exam was submitted to a hierarchical regression analysis, with thought valence (dichotomous), perceived validity of thoughts (continuous), and the interaction term (i.e., Thought Valence × Perceived Validity of Thoughts) as the predictor variables. Main effects were interpreted in the first step of the regressions and the two-way interaction in the final step (Cohen & Cohen, 1983). Results showed a significant interaction between thought valence and perceived validity of thoughts on participants' psychology test performance, B = 0.785 (95% CI: 0.299, 1.270), t(123) = 3.200, p = .002, Cohen's f 2 = .083. As illustrated in Fig. 1, participants' thoughts predicted the number of questions correctly solved when they perceived those thoughts to have high validity (+ 1SD), B = 0.712 (95% CI: 0.054, 1.370), t(123) = 2.141, p = .034, but the effect was reversed when they reported low thought validity (-1SD), B = -0.796 (95% CI: -1.454, -0.138), t(123) = -2.394, p = .018.

Discussion
The results of this study are consistent with the self-validation hypothesis, and showed that the extent to which people see their primary cognitions (thoughts about exams) as valid affects the impact of those thoughts on performance. Specifically, the valence of students' thoughts had a greater impact on the number of questions about psychology correctly answered when the validity of thoughts (metacognitive confidence) was perceived to be relatively high. In contrast, when perceived validity was relatively low, the effect was reversed. Viewing this interaction differently, generating positive thoughts improved performance as the perceived validity of those positive thoughts increased. However, increasing the perceived validity (metacognitive confidence) associated with negative thoughts reduced performance. In sum, this study was informative in affirming the idea that measuring the validity with which thoughts are held (metacognitive confidence) can lead to increased predictability of the effects of manipulated thoughts on performance in an academic context. Nevertheless, because participants' validity of thoughts was measured, it is possible that other, unmeasured factors (e.g., self-esteem, intelligence, etc.) may have been confounded with reported metacognitive confidence. Therefore, in the next study, we manipulated both thought validity and thought valence in order to more accurately infer the causal role of both variables. Additionally, in the next study we varied the object of thought and the type of performance assessed to gain generalization across topics, materials, and inductions. Finally, we included a neutral control group to examine the extent to which the obtained finding was mostly due to the attenuation effect of low perceived validity (invalidation) or the magnifying effect of high perceived validity (validation).

Study 2
Study 2 aimed to replicate and extend the findings of Study 1, examining the causal role of thought-validation (metacognitive confidence). For that purpose, Study 2 used a full experimental design by manipulating thought validation as well as thought valence. Furthermore, this study introduced a number of other important changes designed to generalize across procedures, and to gain further understanding of the obtained effects.
In this study, participants were first asked to think about their best or worst qualities as future job candidates (primary cognitions). Therefore, instead of generating thoughts about exams, in this study participants thoughts were about themselves. After the thought valence induction, perceived validity of their thoughts was manipulated (rather than measured as in the previous study) to be relatively high or low. Specifically, participants were randomly assigned to one of the three conditions designed to vary the perceived validity of their thoughts: validation, invalidation, or control. Finally, instead of measuring knowledge as in the previous study with a psychology exam, participants in this experiment were asked to solve problems with geometric figures (shapes task) that were designed to simulate a job-relevant screening test (see Martens et al., 2006, Preckel et al., 2006 for similar measure of performance).
Despite all the variations in inductions, we predicted the same interaction between thought valence and thought validation in affecting performance. Specifically, it was expected that the valenced thoughts of participants assigned to the high confidence condition (validation) would have more impact on performance compared to the thoughts of those assigned to the low confidence (invalidation) condition. Control condition participants were expected to perform in between the validation and invalidation conditions suggesting a role both for validity and invalidity in affecting thought use (i.e., validation increasing it and invalidation decreasing it).

Participants and Design
Two hundred and forty-six undergraduates in psychology (211 females, and 35 males) participated anonymously in this study. The age of the participants ranged from 17 to 29 (M age = 19, SD = 1.51). All of them were randomly assigned to a 2 Thought Valence (Positive vs. Negative self-relevant thoughts) × 3 Validity of Thoughts (Validation vs. Invalidation vs. Control) factorial design. Performance was measured as the dependent variable. A power analyses was conducted using G*Power (Faul et al., 2007). Based on the average effect size for the interaction in the pilot study (Cohen's f 2 = .043) and Study 1 (Cohen's f 2 = .083), results indicated that the desired sample size with .80 power was N = 157 participants. Our final sample size approximately doubled the estimated one (246 participants) because we wanted to be sure we had enough participants to detect the effect even if it was smaller than anticipated given this first manipulation in this context of thought confidence. 3

Procedure
Permission to conduct the study was provided by the university institutional ethics committee before the study began. Upon arrival, participants were told that the study was about self-evaluation in professional contexts. The self is a topic of high personal relevance to participants (Petty & Cacioppo, 1979), and even more so in the context of students facing future prospects in the job market (Briñol et al., 2009). This is important because, as noted, self-validation processes are more likely to operate when thinking is relatively high. In this study, participants were asked to think about their best or worst qualities as future job candidates. These self-relevant thoughts served as the primary cognitions. Following this thought valence manipulation, participants were randomly assigned to one of the three conditions designed to vary the perceived validity of their primary cognitions: validation, invalidation, or control. Finally, all participants were asked to complete up to four problems as the performance measure. After these problems and some ancillary questions were completed, participants were thanked, debriefed, and dismissed.

Thought Valence
Participants were asked to list either three positive or three negative personal characteristics they possessed as future job candidates. In the positive thought condition, they were asked to list the strengths, skills, and abilities they expected to have in facing the job market. In the negative thought condition, they were asked to list the weaknesses, flaws, and limitations they anticipated having when facing the job market. All participants were told that this was an important task and were asked to think carefully as they listed their characteristics. As noted, previous research has shown that self-evaluations can vary as a result of thinking about one's strengths or weaknesses (e.g., Tice, 1992;Vohs et al., 2005), and this particular procedure has been used successfully in previous self-validation studies to influence self-evaluations (e.g., Briñol & Petty, 2003;Briñol et al., , 2009Briñol et al., , , 2013Hermann et al., 2002). 4

Validity of Thoughts
The validity of primary cognitions served as the secondary, metacognitive variable. Participants were randomly assigned to one of the three conditions designed to vary perceived validity of the thoughts generated. In each of these conditions, participants received different instructions and had to answer different questions. The instructions and questions were designed to lead participants to think that the validity of their thoughts was high or low or were irrelevant to validity.
In the validation condition, the title of the instructions was "confidence in your thoughts." Participants in this condition were asked to rate the validity of their thoughts using a series of questions in which the responses were framed to be high in all cases. That is, the responses provided to answer questions in this condition were framed to always imply high degrees of validity, confidence, and certainty. For example, one question was "How certain are you in the thoughts that you have just generated?" The answer to this question was anchored from 1 = "Somewhat certain" to 9 = "Extremely certain." Therefore, the minimum score that participants could answer already implied some degree of certainty. Given that the minimum anchor was already relatively high, responding to these questions was expected to make participants appear as if they had by default relatively high levels of confidence, certainty, rightness, and validity. This validation procedure is a novel, subtle induction of thought confidence and is based on similar inductions in which the question wording or anchors presented to participants are varied to imply the presence or absence of a psychological state (e.g., Salancik & Conway, 1975). In this case, the induction was designed to increase perceived thought validity and subsequent thought reliance by leading participants to think that they had high levels of thought confidence.
In the invalidation condition, "separate from your thoughts" was the highlighted title of the instructions and participants were asked to rate the extent to which they were separated from the self-relevant thoughts they just generated. Participants in this condition also answered a series of questions but this time responses implied that the participants were distant, separated, and disengaged from their thoughts. For example, one question was "How distanced are you from the thoughts that you have just generated?" (1 = "Somewhat distanced" to 9 = "Extremely distanced"). Therefore, regardless of the scores provided by participants, all responses implied some level of distance from their thoughts. This invalidation condition was designed to reduce the perceived validity of thoughts and subsequent thought usage. Although this induction was also rather subtle, it was designed in accord with previous research revealing that creating distance from thoughts (either through mindfulness, cleansing, or other means) decreases the use of thoughts (Bernstein et al., 2015;Lee & Schwarz, 2021). Also, research on self-persuasion has demonstrated that physical distance (Briñol et al., 2013), and other forms of separation from one's thoughts, can decrease the use of both positive and negative thoughts affecting self-evaluation (Gascó et al., 2018;Paredes et al., 2019;Santos et al., 2019).
Footnote 4 (continued) characteristics as a job candidate?" (1 = Negative, 9 = Positive). One participant did not complete this measure, so it was excluded from the analysis. The 2 Thought Valence (Positive vs. Negative) × 3 Thought Validity (Validation vs. Invalidation vs. Control) ANOVA revealed a main effect for the thought valence manipulation, F (2, 239) = 120.315; p < .001, η ρ 2 = .335, such that participants in the positive thought condition perceived their thoughts to be more positive than those in the negative thought condition. No other effects reached significance (ps > .140).
Finally, in the control condition, participants were instructed to "read the following statements" and were merely asked to read a few neutral statements unrelated to distance or validity. For example, one of the statements was "you will find the instructions regarding what to do for each task."

Dependent Variable: Performance
Participants had to solve four problems that consisted of identifying the missing element that completed a pattern of geometric figures (shapes task). For example, one of the problems presented a combination of triangles and squares which changed its position and color following a pattern. To choose the correct item that completed the pattern, students had to guess the underlying logic in the sequence of shapes presented (for a similar assessment of performance see Martens et al., 2006;Preckel et al., 2006). The number of problems correctly solved was the dependent measure of the study, ranging from 0 to 4. Importantly, this task was presented as one that was relevant to potential job performance.

A 2 Thought Valence (Positive vs. Negative) × 3 Validity of Thoughts (Validation vs.
Invalidation vs. Control) ANOVA was conducted on participants' performance. The analysis revealed that thought valance did not affect performance significantly as a main effect (p ≥ .174). There was no main effect for the thought validation manipulation either (p ≥ .713). Most importantly, the results showed a significant interaction between thought valence and validity of thoughts, F (2, 240) = 5.929, p = .003, η ρ 2 = .047 (see Fig. 2). To fully interpret this interaction, further contrasts were conducted. We first examined only the validation and invalidation conditions along with thought valence and results revealed a significant interaction between thought valence and validity of thoughts, t(240) = 3.385, p = .001. This interaction conceptually replicated the two-way interaction pattern observed in Study 1. This interaction showed that the effect of thought valence on performance was greater for participants in the validation condition than for those in the invalidation condition. For thought-validation participants, those who generated positive thoughts solved more problems correctly (M = 2.40, SD = 0.90, 95% CI = [2.11, 2.68]) than did those who generated negative thoughts (M = 1.75, SD = 0.79, 95% CI = [1.43, 2.09]), F(1, 240) = 8.263, p = .004, η p 2 = .033. However, for the invalidation condition a non-significant trend in the opposite direction emerged: participants who generated positive thoughts tended to solve fewer problems correctly (M = 1.84, SD = 0.96, 95% CI = [1.56, 2.13]) than those who generated negative thoughts (M = 2.22, SD = 1.10, 95% CI = [1.94, 2.50]), F(1, 240) = 3.519, p = .062, η p 2 = .014. The interaction between thought valence and validity of thoughts was also significant when comparing control and invalidation conditions. This interaction revealed that the effect of thought valence on performance was greater for participants in the control condition than for those in the invalidation condition, t(240) = 2.133, p = .034. However, the interaction between thought valence and validity of thoughts was not significant when control conditions were compared to validation conditions, t(240) = 1.273, p = .204.
Analyzing the overall Thought Valence × Validity of Thoughts interaction differently, for the participants who generated positive thoughts, those who received the validation induction 001. Participants assigned to the invalidation condition tended to solve fewer problems correctly than did those from the control group, F(1, 240) = 3.604, p = .059, η p 2 = .015.

Discussion
The results of this study are consistent with the self-validation hypothesis and provide a conceptual replication and extension of the findings obtained in Study 1. This study provided additional support for a differential effect of thought validity as a function of the valence of thoughts that were accessible. We showed that the validation condition affected reliance on previously generated thoughts more than the invalidation condition, influencing subsequent performance. The control condition tended to fall in between validation and invalidation as expected. Viewing this interaction differently, generating positive thoughts (self-strengths) improved performance when reliance on those positive thoughts was manipulated to be high (validation) rather than low (invalidation). However, increasing the validity associated with negative thoughts (self-weaknesses) tended to reduce performance compared to invalidation. Again, the control condition fell in between.
Most importantly, this study used an experimental induction of thought validation rather than relying on a measurement approach thereby providing some evidence for a causal role of validation processes in affecting the extent to which primary cognitions determine performance. In sum, despite changing the way that positive and negative thoughts were salient, the approach to validating thoughts, and the object of thought, the validation induction affected the validity of previously activated thoughts, affecting subsequent performance.

Study 3
Study 3 was designed to replicate and extend the prior findings by including a more complete and validated measure of performance as the dependent variable, as well as adding a measure of the proposed mediator (perceived thought validity). In this study, participants were first asked to think about their best or worst qualities as students. Therefore, instead of focusing on thoughts about exams, or self-relevant thoughts as potential job candidates, in this experiment we asked participants to think about their qualities as students as a manipulation of their primary cognitions. After this thought valence induction, the perceived validity of the primary cognitions listed (metacognitive confidence) was manipulated to be relatively high or low. Specifically, participants were randomly assigned to either recall past episodes of confidence or past episodes of doubt Requero et al., 2020). As a dependent measure, participants in this experiment were asked to solve a test consisting of 30 multiple-choice questions from the Graduate Record Examination (Jamieson et al., 2010). By having a more reliable and balanced measure of performance we expected to avoid possible floor and ceiling effects that could result if a small number of measures are too easy or difficult. Finally, this study also included a new measure of the proposed metacognitive mediator (perceived thought validity) to examine the role of this metacognitive assessment in performance.
Despite all the variations, we predicted that participants assigned to the validation condition should magnify the effects of valenced thoughts on performance relative to those assigned to the invalidation condition. Furthermore, we expected that these effects of validity of thoughts on performance would be mediated by the overall perceived validity with which participants reported holding the thoughts they had listed.

Participants and Design
Three hundred and four undergraduates in psychology (271 females, 30 males, and 3 unidentified) participated anonymously in this study. The age of the participants ranged from 17 to 36 (M age = 20, SD = 1.68). Participants were randomly assigned to a 2 Thought Valence (Positive vs. Negative) × 2 Validity of Thoughts (Validation vs. Invalidation) factorial design. Performance in responding to math and verbal GRE questions was measured as the key dependent variable. A power analysis was conducted based on the average interaction effect obtained in the prior studies. Analyses were conducted using G*Power (Faul et al., 2007) entering the interaction effect size across studies (Cohen's f 2 = .057). Results on this analysis suggested that the desired sample size with .80 power is N = 140. We aimed to double this estimate and somewhat exceeded this goal because data collection continued until the end of the week when the sample size was expected to be reached. 5

Procedure
Permission to conduct the study was provided by the university institutional ethics committee before the study began. Upon arrival, participants were told that the study was about self-evaluation. First, participants were asked to think about their best or worst qualities as students (primary cognition). Following this thought valence induction, participants were randomly assigned to recall past episodes of confidence (validation condition) or doubt (invalidation condition). This manipulation was designed to vary the perceived validity of the thoughts previously listed (metacognitive confidence). After these two inductions, participants were asked to complete a test consisting of 30 multiple-choice questions from the Graduate Record Examination (Educational Testing Service, 1999) and to report their perceived thought validity (measure of the proposed metacognitive mediator). The number of questions correctly answered served as the dependent measure. Participants completed ancillary questions, and were thanked, debriefed, and dismissed.

Thought Valence
Participants were asked to list either three positive or three negative personal characteristics they possessed as students, an induction similar to that employed in Study 2 which had participants list their strengths and weaknesses as job candidates. These self-relevant thoughts were the primary cognitions of this study. Specifically, in the positive thought condition, participants were asked to list the strengths, skills, and abilities they have regarding their academic career. In the negative thought condition, they were asked to list the weaknesses, flaws, and limitations they have when considering their academic career. All participants were told that this was an important task and were asked to think carefully as they listed their characteristics. 6 5 Analyzing the data with only the first 140 participants, the two-way interaction between thought valence and validity of thoughts on performance was already significant, F(1, 136) = 4.878, p = .029, η ρ 2 = .035. For participants assigned to the thought-validation condition, those who generated positive thoughts solved correctly more questions than did those who generated negative thoughts, F (1, 136) = 5.762, p = .018, η ρ 2 = .041. Within the invalidation condition there were no differences between those who generated positive and those who generated negative thoughts, F (1, 136) = 0.614, p = .435, η ρ 2 = .004. This pattern replicated the results reported in the main text for the full sample of this study. 6 At the end of the study, participants completed a manipulation check to ensure the manipulation created the intended pattern of thought valence. Specifically, participants had to answer a question about the perceived valence of their thoughts using the scale: "How would you rate the thoughts you listed as a job candidate?" (1 = Negative, 9 = Positive). The 2 Thought Valence (Positive vs. Negative) × 2 Validity of Thoughts (Validation vs. Invalidation) ANOVA revealed a main effect for the thought valence manipulation, F (1, 300) = 54.310; p < .001, η ρ 2 = .153, such that participants in the positive thought condition perceived their thoughts to be more positive than those in the negative thought condition. No other effects reached significance (ps > .204).

Validity of Thoughts
Participants were randomly assigned to either validity or invalidity conditions. In the validation condition, participants were asked to describe a past personal episode in which they felt confidence in their thoughts. In the invalidation condition, participants were asked to describe a past personal episode in which they felt doubt. Prior research has shown that this manipulation following a thought listing can lead people to misattribute the high or low confidence induced by the writing task to the thoughts listed previously (metacognitive confidence), even though these thoughts are unrelated to the episodes described Requero et al., 2020).

Performance
Participants in this study were asked to solve 30 multiple-choice questions from the standardized math and a verbal performance test of the Graduate Record Examination (Educational Testing Service, 1999). The math test we used consisted of 20 multiple-choice problems taken from the quantitative reasoning section of practice GREs (Jamieson & Harkins, 2009). The verbal performance test we used consisted of 10 multiple-choice word problems taken from the verbal reasoning section of practice GREs (e.g., Croizet & Claire, 1998;Mrazek et al., 2013). This selection of 30 items including both math and verbal questions was used previously by Jamieson et colleagues (2010). The order of presentation of these questions was randomized and all participants responded to the same questions. Responses were coded as correct or incorrect (wrong answers, or no response), giving a minimum total correct score of 0 and a maximum of 30. The average number of questions solved was close to 13 (M = 12.71, SD = 3.52).

Perceived Thought Validity
To assess overall perceived thought validity, participants answered the two items previously used by Briñol et colleagues (2007). In particular, we asked participants, "How valid do you believe the thoughts you listed are (1 = not at all valid, 9 = extremely valid)?" and "How confident are you in the thoughts that you listed (1 = not at all confident, 9 = extremely confident)?" We averaged these ratings to form a single index of overall perceived thought validity for each participant, r(302) = .378, p < .001. Answering these questions requires participants to reflect on their own thoughts and these questions are considered well-established measures of metacognitive perceptions (Fleming & Lau, 2014;Rucker et al., 2011). Higher values on this index indicated that participants perceived their thoughts to be more valid.

Mediation Analysis
To examine whether the extent of perceived thought validity (metacognitive confidence) mediated the effect of the key theorized interaction on the performance measure, we conducted a mediated moderation test using bootstrapping methods (Muller et al., 2005). In this procedure, manipulated thought valence and validity of thoughts were both contrast coded (i.e., negative thoughts / low validity = -1, positive thoughts / high validity = 1) and perceived thought validity was mean-centered. To test the hypothesized mediation by perceived thought validity (metacognitive confidence), we conducted a biased corrected bootstrapping procedure with 10,000 bootstrap re-samples using Hayes process macro (Model 4; Preacher & Hayes, 2004;Shrout & Bolger, 2002). In this analysis, Thought Valence × Validity of Thoughts was an independent variable, performance was a dependent variable, and Thought Valence × Perceived Thought Validity was a mediating variable (see Fig. 4). Thought valence, validity of thoughts and perceived thought validity were included as covariates. This approach includes procedures that compute a 95% CI around the indirect effect and mediation is indicated if this CI does not include zero. As predicted, the result of this bootstrapping procedure revealed that the 95% CI of the indirect effect (i.e., the path through the mediator) did not include zero (Indirect Effect a × b = .082, 95% CI = [0.016, 0.205]). Therefore, the mediation by metacognitive confidence is supported as plausible. 7

Discussion
Despite changing the content of primary cognitions (self as student) and the precise procedure to manipulate secondary cognitions, the results again supported the self-validation predictions. As expected, the validation induction affected reliance on previously activated primary cognitions (thoughts about the self as a student), influencing subsequent performance. Regarding the invalidation condition, results showed that previously activated thoughts did not have any effect on performance. Viewing this interaction differently, generating positive thoughts improved performance when reliance on those primary cognitions was manipulated to be high rather than low. However, the same induction of metacognitive validity decreased performance for negative thoughts. That is, the effect of validity of thoughts on performance was significant for both positive and negative thoughts. This final experiment also showed that changes in the metacognitive measure of perceived thought validity served as a plausible mediator of the impact of thought valence and the validity of thoughts manipulations on participants' performance. Therefore, this study provided convergent evidence for the key interaction and extended the contribution by providing meditational evidence consistent with the proposed process.

General Discussion
Taken together, these three studies are consistent with the idea that metacognitive thought confidence can lead to opposite academic performance findings depending on whether it validates positive or negative performance-relevant thoughts. The pattern across all studies was that when primary cognitions were positive (positive thoughts about exams, selfstrengths as a job candidate or as a student), metacognitive confidence (whether measured or manipulated) improved performance. However, when thoughts were negative (negative thoughts about exams, self-weaknesses), the same metacognitive confidence reduced performance. 8 Thus, promoting feelings of confidence in students can produce the desired outcomes on performance or backfire depending on whether it validates positive or negative previously generated thoughts. Importantly, this research also provided meditational evidence of the proposed mechanism. Specifically, Study 3 showed that changes in perceived thought validity mediated the impact of the interaction of the two manipulations on participants' performance. Based on these findings, we argue that thought validation can help in specifying why and when metacognitive confidence would increase or decrease performance in academic settings. 8 To provide maximum power, we combined the data across all studies. First, although our samples were composed of more women than men in each study, we did not expect nor find any gender differences in this set of studies. That is, when collapsing the data of the three studies (dichotomizing the continuous variables of Study 1 at the median, eliminating participants assigned to the control group in Study 2, and including type of study and gender as additional predictors), the results revealed that the predicted two-way interaction between thought valence and validity of thoughts was significant overall, F (1, 568) = 16.909, p ≤ .001, η ρ 2 = .029, and also for both women, F (1, 514) = 22.585, p ≤ .001, η ρ 2 = .042, and men, F (1, 54) = 11.999, p = .001, η ρ 2 = .182. Decomposing the overall two-way interaction by thought valence, we observe that the effect of validity of thoughts was significant both for positive, F (1, 568) = 8.753, p = .003, η ρ 2 = .015; and negative thoughts, F (1, 568) = 8.171, p = .004, η ρ 2 = .014. Finally, this overall two-way interaction was not further qualified by either type of study, p = .502, nor gender p = .303.
The findings were robust across topics, including thoughts about exams in one's major in the first study, thoughts about the self as a future job candidate in the second study, and thoughts about the self as a student in the third. Furthermore, the same outcome was observed regardless of the type of test employed (e.g., knowledge, geometric shapes problems, or math and verbal GRE questions). In sum, this research showed the same pattern of results using different thought objects, different inductions, and different measures of performance. These findings demonstrate that there is considerable value in differentiating between merely having thoughts and engaging in metacognitive assessment of those thoughts. Although promoting confidence to becoming more successful and productive has become a popular approach (Burgmer & Englich, 2012;Körner et al., 2020;Lammers et al., 2013;Peper et al., 2018) the present studies suggest that increasing confidence will not be beneficial universally.
Recognizing that promoting feelings of confidence in students can have opposite effects on performance depending on the salience of the thoughts (positive versus negative) in a person's mind is important for designing future interventions and it is also potentially relevant for explaining some inconsistent effects in the previous literatures. In fact, rather than being inherently beneficial, variables associated with confidence used in prior research have led to positive outcomes in some cases but have also led to negative outcomes in other cases. For example, in a classic review, Hansford and Hattie (1982) concluded that the relationship between self-esteem (a variable often associated with confidence; DeMarree et al., 2020) and student's performance was limited and mixed. These authors found that the average correlation between self-esteem and academic performance was relatively low (in the range of 0.21 to 0.26). Furthermore, many efforts designed to improving students' self-esteem turned out to be potentially counterproductive (see, Baumeister et al., 2003, for a review). Although speculative, it is possible that when confidence increased via selfesteem or any other confidence-related variable (such as self-affirmation, empowerment, positive emotions, etc.) failed to produce desired positive outcomes in previous research it was due, at least in part, to the presence of negative thoughts (e.g., I am feeling anxious about this test) that were being validated by the confidence induction. As demonstrated in the current studies, increasing feelings of confidence can be detrimental for performance when the thoughts validated are negative. Just as metacognitive confidence was able to devalue but also enhance performance in the current research, future research should examine whether feelings of confidence can also either increase or decrease other important outcomes such as desirable career choices, or teamwork. Given the uniqueness of separating between thought-content and thoughtvalidity, the present approach has the potential to break new ground in other areas relevant to education.
In closing, we recommend the use of validity-related measures and inductions as being useful tools in any domain in which thoughts can affect performance. As shown, questions about thought confidence are easy for researchers and practitioners to use, and students find them easy to answer. Beyond their ease of use and efficiency, we suggest including confidence measures as a potential moderator of the effects of any other mental constructs. In accord with prior research (Paredes et al., , 2021Shoots-Reinhard et al., 2015) metacognitive confidence measures, which are relevant with regards to performance in academic settings (Stankov & Lee, 2008;Stankov et al., 2012), can increase the predictive validity of any mental content, including not only thoughts about exams and self-relevant thoughts, but also any other mental construct potentially examined in future studies.

Conclusion
This research finds that performance in academic settings is affected by an interaction between the valence of student's thoughts (primary cognition) and the metacognitive confidence with which those thoughts are held (secondary cognition). Metacognitive processes of validation can apply to any content of primary cognitions, including thoughts about exams and self-relevant thoughts. In particular, this research showed that measuring the spontaneous confidence with which primary cognitions are held can lead to increased predictability of the effects of those cognitions on performance (Study 1). Beyond being measured, metacognitive confidence can also be manipulated to be high or low showing the same pattern of results (Studies 2 and 3). That is, metacognitive thought confidence, whether it is measured or manipulated, can lead to opposite findings on academic performance depending on the valence of the thoughts in a person's mind. Thus, rather than being inherently beneficial, metacognitive confidence can increase performance in some cases (e.g., when people are thinking about positive performance-relevant thoughts), but decrease performance in other cases (e.g., for negative thoughts).

Data Availability
Data for all studies can be found at https:// osf. io/ bzymh.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.