Introduction

Self-efficacy is conceptualized as one’s beliefs in being able to act on a certain situation to achieve specific goals (Bandura, 1977, 1997). Research demonstrates that teachers’ self-efficacy significantly contributes to their persistence, commitment, job satisfaction and instructional quality (Holzberger et al., 2013; Klassen & Chiu, 2011; Tschannen-Moran & Hoy, 2001; Zakariya, 2020). The self-efficacy scale can be used to measure “generalized self-efficacy expectations dependent on past experiences and on tendencies to attribute success to skill as opposed to chance” (Sherer et al., 1982, p. 671), which is crucial to understanding and predicting individuals’ behaviors (Bandura, 2006). In the disciplines of science, technology, engineering, and mathematics (STEM) which received increased attention in schools, self-efficacy is a strong indicator of teachers’ confidence and skills to teach STEM (Geng et al., 2019; Gunning & Mensah, 2011; Jaipal-Jamani & Angeli, 2017; Zakariya, 2020).

Recently, early childhood STEM education has become an important agenda for research and practice, as it can significantly predict children’s increased school readiness, academic outcomes, problem-solving skills, scientific literacy, and employment in STEM fields in the long run (Chesloff, 2013; Early Childhood STEM Working Group, 2017; McClure et al., 2017). The implementation of high-quality STEM education in early childhood settings requires teachers’ confidence and pedagogical beliefs in this regard (Chen et al., 2020), so as to meet the learning interests of young children who need to live and thrive in a digital and uncertain world. However, there is a lack of research on early childhood teachers’ self-efficacy in STEM, leading to the relatively weak support for early childhood educators in teaching STEM. To address this knowledge gap, this study aims to develop and validate a STEM teaching self-efficacy scale for early childhood teachers.

Literature Review

STEM as an acronym can be used differently in different research. In general, STEM education refers to an integrated and interdisciplinary approach to real-world problem solving in science, technology, engineering, and mathematics learning (Lantz, 2009; Wang et al., 2011). Sometimes STEM may intentionally refer to any of the four individual disciplines (Maher et al., 2013; National Academy of Engineering and National Research Council, 2014). In our literature review, we use both perspectives, as there is a scarcity of research on integrated STEM in early childhood education (ECE).

Prior research has explored early childhood teachers’ beliefs toward integrating STEM in ECE. For instance, Tippett and Milford (2017) reported that early childhood educators considered STEM to be an important part of ECE. Park et al. (2017) further investigated early childhood teachers’ beliefs about their readiness for teaching STEM and found that teachers’ teaching experience and awareness about STEM would lead to different levels of perceived readiness for teaching STEM. In their scale used to evaluate teachers’ beliefs in readiness for teaching STEM, items about teachers’ self-efficacy in teaching STEM were included, such as “I know how to teach developmentally appropriate STEM subjects in my classroom.” “I know the teaching strategies of STEM education.” and “I have pedagogical knowledge about STEM teaching” (Park et al., 2017, p. 291). Simoncini and Lasen (2018) also examined Australian early childhood educators’ beliefs toward STEM teaching and revealed that play-based and hands-on learning experiences were considered to be important for early childhood STEM education. Furthermore, Jamil et al. (2018) examined how early childhood educators perceived the STEAM (an extension of STEM to include “arts”) approach to learning. They showed that teachers had a wide range of endorsement levels for early STEAM education. Their preliminary findings indicated that teachers’ beliefs about STEAM education were related to their age, training experience, and teaching experience. This supports that early childhood teachers have varying levels of self-efficacy in facilitating early STEM learning. Although research has been conducted to examine early childhood teachers’ beliefs or confidence in STEM teaching, there is a lack of instruments developed for measuring their self-efficacy in teaching STEM. Recently, Chen et al. (2020) developed a Teacher STEM Self-efficacy Scale in Taiwan for evaluating teachers’ STEM competencies in the cognitive, affective, and skill aspects. However, the scale was only used in a sample of pre-service early childhood teachers.

In the field of mainstream education (i.e., elementary, middle and high schools), teacher efficacy in teaching STEM-related subjects (e.g., mathematics and science) has been surveyed by the William and Ida Friday Institute for Educational Innovation (2012), with a scale of “Personal Teaching Efficacy and Beliefs”. Likewise, Lee et al. (2019) developed a scale for evaluating high school teachers’ self-efficacy in STEM knowledge, which consisted of six factors: science, technology, engineering, mathematics, overall knowledge, and attitudes toward STEM education. However, the teacher efficacy and attitude scales used in these two studies are subject-based rather than integrated and cross-domain. It is important to promote the integrated approach to STEM education which encourages the use of knowledge across disciplines to solve real-world, complex problems among learners. According to Nadelson and Seifert (2017), it is crucial to engage students in integrated STEM learning opportunities in an era of synthesis, so as to enable them to connect and apply multiple STEM disciplines. The integration of STEM disciplines should start from teacher training and evaluation to ensure the authentic implementation of integrated STEM curriculum. To measure teachers’ self-efficacy in teaching integrated STEM, Mobley (2015) has developed a scale of Self-Efficacy for Teaching Integrated STEM (SETIS), which consists of the social, personal, and material factors of self-efficacy. However, both scales were developed for mainstream education instead of ECE. To the best of our knowledge, there is a lack of measurement for evaluating early childhood teachers’ self-efficacy in teaching STEM from an integrated perspective. Therefore, there is an urgent need to develop an integrated STEM teaching self-efficacy scale for early childhood teachers.

Although there are some studies investigating teachers’ self-efficacy and beliefs in STEM domains (e.g., Chen et al., 2020; Çiftçi et al., 2020; Jamil et al., 2018; Park et al., 2017; Simoncini & Lasen, 2018), none of them was conducted in mainland China. We thus know very little about Chinese early childhood teachers’ self-efficacy in teaching STEM. To address this gap, this exploratory study developed and validated a scale for evaluating pre-service and in-service early childhood teachers’ self-efficacy in teaching STEM in China.

Early Childhood STEM Education in China

China is running the world’s largest ECE system, which provides services to 47 million children below 6 years in 2019 (Ministry of Education of China, 2020; Yang et al., 2020). In the national Guidelines for Kindergarten Education (Trial Version) (Ministry of Education of China, 2001), there are five suggested domains for ECE, including health, language, society, science (with mathematics included), and art (Yang, 2018). However, technology and engineering related learning is not included in this national ECE curriculum framework (Tao, 2019; Yang et al., 2020).

In practice, Chinese early childhood teachers tend to set goals for mathematics teaching, and would intentionally use age-specific curriculum resources (Li et al., 2015). It is also found that mathematics teaching in Chinese early childhood classrooms usually follows the processes of teacher-led introduction of a topic, children’s participation in topic-related games/ activities, and group sharing as a sum-up, which reflect a fusion of constructivism and instructivism (Huang et al., 2019; Li et al., 2019). In terms of early childhood science education, child-led inquiry has been promoted to replace teacher-directed instruction, with specific goals and educational suggestions given in the Guide for 3- to 6-Year-Old Children’s Learning and Development (Gao & Zhang, 2017; Ministry of Education of China, 2012). Chinese educational authorities have also advocated “Learning by Doing” as a notion for early childhood science education since 2001 (Gao & Zhang, 2017). However, due to the regional differences and the inadequate access to teacher training, there exist individual differences in mathematics and science teaching among Chinese early childhood teachers. Moreover, teachers’ self-efficacy in teaching STEM in an integrated approach remains understudied.

Accordingly, the present study investigated pre-service and in-service early childhood teachers’ STEM teaching self-efficacy in China using a newly developed scale. This is important as individuals’ self-efficacy is context and situation specific (Bandura, 1997). We firstly validated this scale using factor analysis, and then examined how the teacher-related factors would be associated with their STEM teaching self-efficacy to verify the criterion validity of the scale. The following questions guided this study:

  1. (1)

    Is the STEM Teaching Self-efficacy Scale (STSS) valid and reliable for measuring early childhood teachers’ self-efficacy in teaching STEM?

  2. (2)

    How is the heterogeneity of STEM teaching self-efficacy as related to teacher-related factors, such as gender, age, grade of study, the working status (pre-service versus in-service), level of study (junior college versus bachelor), occupational commitment, professional interest, and working experience?

Method

Participants

A survey was administered to 418 pre-service and in-service early childhood teachers in Zhejiang Province, China after obtaining their informed consent. The pre-service teachers were enrolled in the Bachelor of Early Childhood Education program in a public university. In-service teachers were sampled from seven kindergartens in Zhejiang Province. Participants completed the survey about their self-efficacy in teaching STEM as well as the basic demographic items (398 female and 19 male teachers, one with unknown gender information but all the other items were completed) with ages ranging from 17 to 23 years old for the pre-service teachers (N = 225), and 20 to 48 years old for the in-service teachers (N = 193). The demographic characteristics of participants are shown in Table 1.

Table 1 Demographic characteristics of participants

Instruments

The original STEM Teaching Self-efficacy Scale (STSS) was developed based on a thorough search of literature statements related to early childhood teachers’ beliefs about STEM teaching. We adjusted the STEM Pedagogical Content Knowledge Scale (STEMPCK; Yıldırım & Topalcengiz, 2019) to adapt to the early childhood context. The STSS initially included 40 items that were divided into five subcategories: Pedagogy, Science, Mathematics, Technology, and Engineering. Items were rated on a 5-point Likert scale ranging from 1 (not at all like me) through 5 (very much like me). Items were reviewed by two experts in ECE back-to-back to ensure the clearness and practical relevance of statements before sending out for the survey.

Participants were also asked to complete a teacher survey in which they reported their demographic characteristics and professional background, which include their gender, age, grade of study, the working status (pre-service versus in-service), level of study (secondary school, junior college, bachelor, and more), occupational commitment, professional interest, and working experience (years of teaching).

Statistical Analysis

Data cleaning and descriptive analyses were performed in SPSS Version 25 (IBM Corp, 2017). We then conducted factor analysis and correlation analysis as follows:

  • Exploratory Factor Analysis. Due to the fact that STSS was initially self-developed based on the literature, initial dimensionality assessment was performed using an exploratory factor analysis (EFA) of the 40-item STSS with principal components analysis (PCA) as the method of factor extraction, to identify the underlying factor structure of the scale. The PCA, developed by Pearson (1901) and adapted for use in factor analysis by Hotelling (1933), summarizes the interrelationships among the original variables in terms of a smaller set of underlying dimensions. It is used when the researcher has no pre-existing knowledge about the factors that may explain the interrelationships between a set of variables (Pett et al., 2003). The Kaiser-Meyer-Olkin test and Bartlett test of Sphericity were undertaken, both of which were used to establish the adequacy of the item correlation matrix upon which factor analysis was based. Criteria used for determining the number of factors to retain are parallel analysis, theoretical expectations, and interpretability. Factor loadings of 0.30 are considered as the minimum level for saliency, with loadings greater than 0.40 preferred (Hair et al., 2006). Items are considered as cross-loading if they load saliently on one factor and have a loading of above 0.30 on another factor. Poorly performing items based on a low loading on the primary factor or cross-loading were removed sequentially until all items fitted the criteria. EFA was performed in Mplus Version 8.1 (Muthén & Muthén, 2018).

  • Correlation analysis. To examine criterion validity of the STSS, we performed Pearson correlation to examine the association between STSS scores and teacher-related variables (e.g., demographic characteristics) via SPSS Version 25 (IBM Corp, 2017).

Results

Exploratory Factor Analysis

The Kaiser-Meyer-Olkin coefficient for the original 40-item dataset was 0.97 and the Bartlett’s Test of Sphericity was statistically significant (χ2 = 12,081.10, df = 780, p < .01) indicating that properties of the correlation matrix justified factor analysis being carried out. With an oblique rotation, the factors were permitted to be correlated with one another.

Communality

The extraction number for all the 40 items was higher than 0.60, indicating that more than 60% of each variable’s variance can be explained by the retained factors. From this aspect, these variables were well represented in the common factor space.

Parallel Analysis

Parallel analysis indicated there were five sub-factors in total for the 40-item scale.

Pattern Matrix

However, by inspection of the pattern matrix of the original 40 items, item 10, 18, 25, 27, 30, 31, 35, and 37 were all found to be cross-loaded and their secondary factor loadings were higher than 0.30. Item 17 and 26 were not found to be saliently loaded onto any factors. Therefore, these ten items were flagged as problematic ones in the initial inspection and were removed sequentially until all fitted the criteria.

Item 37 was deleted first since it had the highest cross-loading value into another factor. Based on this rule, items were continued to be removed if cross-loading or low factor loadings for both primary and secondary factors existed. Therefore, item 10, 21, 35, 32, 16, 26, 17, 15, 33, 25, 27, 14, 22, 28 and 34 were deleted sequentially because of its cross-loading or low factor loading. After these steps, item 13 was deleted since its communality was 0.487, which was less than 0.50.

Repeating the process, item 24, 29 and 30 were removed because of cross-loading and item 34 was deleted again because of its low communality (< 0.50).

We reran the analysis. The Kaiser-Meyer-Olkin coefficient for this dataset was 0.96 and the Bartlett’s Test of Sphericity was statistically significant (χ2 = 5501.24, df = 190, p < .01). Oblique factor rotation with the Direct Oblimin method, identified three latent factors. The three factors identified, comprising 20 of the original items, accounted for 66.90% of the total variance within the data. The results presented in Table 2 relate to factor analysis conducted using the 20 remaining items.

Table 2 Standardized factor loadings for items in the specified two-factor model of STSS

Lastly, we double checked the theoretical meaning for each item. Item 19 and item 23 were found to be repetitive in the meaning which was asked about science and mathematics; therefore, we decided to remove item 19 in the final scale.

For the final scale, The Kaiser-Meyer-Olkin coefficient for this dataset was 0.96 and the Bartlett’s Test of Sphericity was statistically significant (χ2 = 5157.54, df = 171, p < .01).

The communalities were in general higher than 0.50.

Two factors were identified which comprised 19 of the original items. The first factor, which was labeled Pedagogy Self-efficacy, describes the teachers’ self-efficacy in facilitating children’s learning in early childhood settings. This factor accounted for 52.33% of the total variance and had an eigenvalue of 9.94. The second factor was labeled Content Self-efficacy to describe the teachers’ self-efficacy in content knowledge related to the STEM domains. This factor accounted for 10.02% of the total variance and had an eigenvalue of 1.91.

Internal Reliability

We then calculated McDonald’s omega (ω) (Hayes & Coutts, 2020) to evaluate the internal reliability of the subscales. Internal consistencies were ω = 0.94 for the Pedagogy Self-efficacy subscale (items A1-A9, A11, A12), and ω = 0.90 for the left 8 items (Content Self-efficacy).

Criterion Validity

The correlations obtained between the teacher-related variables and the two subscales of STSS are presented in Table 3. Participants with higher scores on the scale were more likely to believe in their capacity to facilitate children’s learning in STEM. The Pedagogy Self-efficacy scores correlated positively with working status, age, and interest in ECE. This can be inferred that scores on Pedagogy Self-efficacy predicted working experience and professional interest. However, the Pedagogical Self-efficacy scores did not correlate with gender, grade of study, level of study (secondary school, junior college, bachelor, and more), occupational commitment, and years of teaching.

Table 3 Pearson correlation of STSS scores and teacher-related variables (N = 418)

The results also provide some evidence of criterion validity for the subscale of Content Self-efficacy. The Content Self-efficacy scores were positively correlated with working status, age, occupational commitment, and interest in ECE. Hence, pre-service teachers who had commitment to being an early childhood teacher also had higher Content Self-efficacy. Gender, grade of study, level of study, and years of teaching did not correlate with the Content Self-efficacy scores.

Discussion

This study presented the development and model test of the STEM Teaching Self-efficacy Scale (STSS) for early childhood teachers. Its results thus provided preliminary evidence of reliability and validity of the two factors of STSS: Pedagogy Self-efficacy and Content Self-efficacy. Items were constructed based on literature and selected based on the evaluation of their contribution to psychologically important factors (through factor analysis) to cover the scope of self-efficacy on the behaviors of STEM teaching and ensure the content and construct validity of the scale.

The self-developed STSS has a similar purpose with the existing scale “Personal Teaching Efficacy and Beliefs” (The William and Ida Friday Institute for Educational Innovation, 2012) which was designed to evaluate elementary teachers’ self-efficacy and confidence for teaching specific STEM-related subjects. However, the STSS is particularly designed for the population of early childhood teachers and can be used as a tool for monitoring the effects of early childhood teacher education and in-service training. Moreover, through an integrated, comprehensive assessment of teachers’ self-efficacy in STEM teaching, the STSS moves beyond the traditional view of STEM as a collection of independent disciplines. In other words, the factor structure of STSS does not correspond to the four disciplines reflected in STEM. This is important as teachers’ self-efficacy in teaching STEM subjects such as science and mathematics could be different from their self-efficacy in teaching integrated STEM. As noted by Bandura (1997), self-efficacy is situation-specific, so it may differ between the situations of subject-based and integrated STEM teaching experiences. This is also confirmed by Tao (2019) that although they have many years of teaching experience in traditional mathematics and science fields, Chinese kindergarten teachers have relatively low confidence in implementing integrated STEM education. More importantly, an integrated perspective for developing the STEM teaching self-efficacy assessment tool is in line with the international efforts in making STEM education more inclusive and interconnected across disciplines (Stremmel et al., 2015) to foster learners’ twenty-first century skills (Hussin et al., 2019; Jamil et al., 2018; Johnson et al., 2015; Khalil & Osman, 2017). Stohlmann et al. (2012) highlighted that integrated STEM education is a promising approach to more connected and relevant learning experiences for learners. It is especially urgent to investigate and enhance teachers’ self-efficacy towards integrated STEM education when we consider the challenges facing teachers to connect across the STEM disciplines and provide STEM learning opportunities in authentic contexts (Kelley & Knowles, 2016).

Other results of this study revealed the relationships between some teacher-related factors and scores on the STSS subscales. Scores on the STSS subscales were positively correlated with early childhood teachers’ working status, age, occupational commitment, and interest in ECE. These results provided preliminary evidence of the criterion validity of STSS. It would be better if some other constructs such as teachers’ STEM training backgrounds and their science instruction can be linked to the responses on the STSS. In terms of factors influencing teachers’ self-efficacy in teaching STEM, we found that the more years the (student-)teachers had experienced in the field of ECE, they were more likely to have higher levels of self-efficacy in exploring STEM with young children. This is confirmed by Bandura’s (1997) self-efficacy theory which argues that various factors such as individuals’ mastery experience would play a significant role in their self-efficacy. For instance, as compared to pre-service teachers, in-service teachers may have more effective experiences in working with young children, which could have led to positive psychological change in terms of attempting something new (e.g., integrated STEM pedagogy) in early childhood settings. This finding is also consistent with Jamil et al.’s (2018) study which showed that early childhood teachers’ age could predict their professional beliefs about integrated STEAM education, which adds “Arts” into STEM education. Meanwhile, high self-efficacy may lead to positive change of self-perception (Bandura, 1997; Kung, 2009). For example, early childhood teachers who have higher self-efficacy tend to have a higher self-perception of competence to work with young children, and thus produce higher levels of occupational commitment and professional interest in ECE. In contrast, individuals with low self-efficacy tend to have higher anxiety in related work and poor attitudes toward early STEM education. However, in the present study, evidence revealed that gender, grade of study, level of study, and years of teaching did not correlate with scores on the STSS subscales. This should be interpreted with caution due to a skewed distribution of participants in terms of gender, with only 4.5% of the participants being male. Also, the grade of study, level of study, and years of teaching were only applicable to part of the participants (either pre-service or in-service teachers), leading to the difficulty for interpreting the corresponding correlational findings.

Limitations and Future Research

Although this study has developed and validated STSS for measuring early childhood teachers’ self-efficacy in teaching integrated STEM, it has a few limitations. First, due to a relatively small sample size, we only used EFA, internal consistency, and preliminary correlational evidence to support the reliability and validity of STSS. Some other techniques for validating the scale can be used in future research, including the confirmatory factor analysis (CFA) and concurrent validity. Second, the data source of this study was drawn from teacher-report questionnaires. Given that some research has found that there is a gap between teachers’ self-perception of integrated STEM teaching and the authentic instructional practice (DeCoito & Myszkal, 2018; Settlage et al., 2009), assessing teachers’ pedagogical content knowledge and instructional practices is also of great importance for enhancing their STEM teaching experiences. Scenario-based measures can be used to supplement the reliance on teacher-report questionnaires (Gropen et al., 2017) for measuring early childhood teachers’ beliefs and practices of STEM education. In-depth, qualitative studies such as teacher interviews and classroom observations can be conducted to understand how and why early childhood teachers may play out different levels of self-efficacy and competence in teaching STEM or implementing an integrated STEM curriculum. Third, we only validated the STSS in the Chinese cultural context, which may limit its validity in other countries. Therefore, it is necessary for future research to explore the validity of this scale in other cultural contexts, which will further support cross-cultural comparison. Last but not the least, we did not collect data for evaluating teachers’ practices and child outcomes for this study. Future research is needed to further examine the relationships between early childhood teachers’ STEM teaching self-efficacy, their instructional quality in STEM, and children’s domain-specific (e.g., numeracy, science knowledge) and domain-general (e.g., approaches to learning, executive function) developmental outcomes.

Conclusion

Teacher efficacy can lead to changes in professional beliefs and even teaching practices (Hamre et al., 2012; Jamil et al., 2018; Tschannen-Moran & Hoy, 2001). This study developed and validated a scale for evaluating early childhood teachers’ self-efficacy in teaching STEM. It steps forward in the field through the employment of an integrated perspective for evaluating teachers’ perceptions of early STEM education, explicitly deriving two important factors: Pedagogy Self-efficacy and Content Self-efficacy related to STEM teaching in early childhood settings. STSS thus provides a professionally and psychologically meaningful tool for evaluating early childhood teachers’ beliefs about teaching STEM, which would help obtain evidence for the optimal design of STEM teacher training programs. The two factors of STSS also guide policymakers and teacher educators to focus on enhancing early childhood teachers’ STEM pedagogical skills and content knowledge, which would allow them to cope with future challenges on the road to the provision of high-quality integrated STEM education for young children.