1 Introduction

Technologies used in education refer to those concepts, tools, innovations, and advancements that are applied for various purposes in education settings to enhance teachers’ duties while also assisting students in learning effectively and improving their achievements [1]. Recently, the world has witnessed an advancing array of modern and emerging educational technologies which are fast gaining attention all over the world. One of these emerging technologies is artificial intelligence (AI). AI has been conceptualized as a machine's capacity to think and behave like a human, or better still, are computerized systems programmed to imitate and behave in humanlike manners [2]. AI, one of the vital driving forces of the 21st Century, is speedily bringing transformational changes to almost all human endeavours, including the educational field. While there are several educational applications of AI in education, its full potential is yet to be fully harnessed in instructional settings, unlike in the business domains [3, 4]. Given the speed at which AI is advancing, it would be illogical to conclude that it will not significantly impact the education sector in a few years to come, based on the several possibilities of the technology and its mind-blowing advancements in education [5, 6].

As AI rapidly advances, it finds numerous applications in the field of education. Researchers, particularly those in instructional design and computer sciences, are actively investigating the optimal ways in which AI can support students and teachers [7]. AI-based edtech, for instance, offers the potential for a student-centred approach [8], personalized learning experiences [9, 10], and the ability to identify students' affective and cognitive needs while delivering tailored support in response to these needs [11, 12]. Beyond these possibilities, teachers also benefit from the capability to monitor their students' learning progress [13]. Teacher dashboards, for example, provide real-time notifications to teachers about their students [14, 15]. Integration of AI in education empowers teachers to assess their pedagogies and effectively plan and implement lessons [16, 17]. However, as noted by [18] in [19], there is a neglect of the role teachers play in incorporating AI-based edtech, particularly concerning teachers' trust in technology.

Trust in technology is a significant predictor of the extent to which teachers rely on technology [20]. Since interaction (calibration) between teachers’ trust and AI-based edtech may impact the outcomes of technology utilization, the need arises to examine teachers’ trust (TT) in AI-based edtech. This examination is essential for understanding the dynamics involving the trustor (teachers), the referent of trust (AI-based edtech), and the nature of trusting (the risks or vulnerabilities associated with trust or dependence on AI-based edtech) [21, 22]. Given that the trust developed by teachers in AI-based edtech plays a vital role in determining the functions of technology in instructional settings, there is a demand to scrutinize and model the factors that predict their trust in AI, particularly among STEAM teachers in Nigeria. Our observations reveal a dearth of studies detailing trust in AI edtech among STEAM teachers in Nigeria. Despite the considerable potential of AI-based edtech to revolutionize STEAM education, research investigating STEAM TT in AI is notably scarce in Nigeria. Acquisition of scientific skills in the contemporary world underscores the importance of STEAM teachers utilizing technologies, including emerging ones like AI, robotics, and AR, among others, in their instructional processes. This becomes essential to prepare the next generation of students for the lives and jobs of the future, a future predicated to be AI-dominated. Unfortunately, the prevailing situation in the majority of Nigerian secondary schools is contrary, as STEAM teaching and learning primarily rely, in most cases, on conventional methods of teaching. While the government advocates the integration of technology in schools, its implementation in the National Policy on Education faces challenges, including those related to teacher variables [23]. In Nigeria, the effectiveness of STEAM education hinges on the holistic resolution of the challenge bedeviling effective STEAM education in the country [24]. In this context, the role of technology, particularly emerging technologies like AI in education, cannot be overstated [25].

Numerous studies conducted on AI-based edtech in education in Nigeria primarily focused on teachers' perceptions, perceived utility, ease of use, opportunities, advantages, and challenges associated with AI implementation and use [17, 26,27,28,29]. Also, the majority of these studies have centred on teachers in a general sense, neglecting the specific context of STEAM teachers. While established theories like the Technology Acceptance Model (TAM) [30] and the Theory of Academic Resistance [30] have been employed to underpin the factors influencing teachers' acceptance and adoption of new technology, none of these theories have explored TT in AI-based edtech or the distinctive characteristics of such technologies within an educational framework [31]. Although TAM and the Theory of Reasoned Action (TRA) possess significant behavioural components accurately predicting the intention to accept or use technology, their explanatory power is limited, lacking consideration for additional factors impacting users' trust in technology [32,33,34,35]. This study builds upon the TAM as its theoretical foundation while extending its constructs by incorporating external factors such as AI anxieties, the absence of human characteristics in AI, preferred strategies to enhance trust in AI, and the level of technology literacy. Also, we introduced gender as a moderating factor to enhance the research's robustness [36, 37]. Our contribution extends to empirical insights into AI in Nigeria by investigating the relationship between STEAM TT in AI-based edtech and the extended external constructs explored in this study, an aspect often overlooked by previous research. According to our observation, this study probably represents the first empirical endeavour to illuminate the connections between the considered constructs. Structurally, this paper begins with an introduction, followed by a comprehensive literature review and hypotheses formulation, methods, results, discussion, conclusion and limitation and future work.

2 Artificial intelligence in education

AI, being one of the emerging technologies of the fourth industrial revolution, is increasingly gaining popularity in education globally, especially in the areas of intelligent tutoring systems, automatic scoring of essays, gaining insights into learning analytics, smart assistive technologies, and autonomous pedagogical agents like teacher bots and robots that support social-emotional development, amongst others. This has paved the way for the application of intelligent tutoring systems (ITSs), have been described as the instructional model of the 21st Century [38]. As a result, teachers’ professional works are now being challenged on multiple fronts by AI-based tools which can automate pedagogical decision-making and teaching activities in schools [39]. Aside from the threat of job loss as a result of work automation, emerging technologies such as AI-based systems have initiated a host of new ethical concerns, including but not limited to data insecurity, racial bias, and issues around trust, among others, which in turn is currently driving cross-sectorial development of policies both nationally and internationally including in education [40].

The primary goal of applying AI to education is to improve student's learning experiences [41], and this is because the system has powerful pedagogical tools that can bring about effective instruction [42]. The tools of AI in education such as simulation-based methods, virtual, augmented realities, and 3-D technology, among others, according to [2, 41, 43], assist students to get practical and experimental learning experiences in and outside of the classroom. AI-based educational technologies such as robots or cobots working along with teachers are applied in education to teach students routine tasks such as spelling and pronunciation [41, 44, 45]. Aside from using them in instructional settings of teaching and learning, AI-based educational technologies are also been used in school administration [46]. Additionally, AI-based educational tools could assist teachers in personalizing instruction for their learners and give isolated and children with disabilities access to better and more efficient learning possibilities [47, 48]. Research has demonstrated that using AI-based educational tools to offer personalized training for students in a dynamic and sophisticated learning environment is possible [49]. While qualitative education requires human teachers’ active involvement, AI-based educational technologies promise additional quality support at all levels of education [50].

2.1 Teachers’ trust (TT) in educational technologies

Teachers’ perspectives on AI’s adoption in education, especially in teaching and learning are vital since they are the direct stakeholders in charge of bringing AI-based educational technologies into the classrooms [51]. This confirms the need to examine TT and the factors responsible for predicting it. In AI-assisted decision-making and studies in educational contexts, trust is rarely defined [52]. While trust in human beings generally increases with time as a result of frequent interactions, the reverse is the case with technologies where constant errors and malfunctions over time decrease trust [53]. However, in the case of AI systems, the opposite may also be true [54] since a direct connection may lead the initial low degree of trust to rise [55]. For AI to maintain its social license, especially in the context of education, the question of trust is crucial. The AI High-Level Expert Group (AI HLEG) of the European Commission claims that if an AI-based system does not demonstrate to users that it is trustworthy, then its widespread acceptance and adoption will be seriously hampered, and its numerous potential and advantages will go unrealized [56]. While trust is still important for a variety of technology adoptions [57], the problems with AI also present a variety of qualitatively different trust concerns in comparison to previous technologies [58]. In the case of trust in technology, there is neither volition nor a moral agency. Therefore, trust in technologies is based on beliefs about the features of the technologies rather than will or motives as technologies have none [59].

Studies that itemize the factors contributing to teachers’ adoption of technologies, especially those relating to the factors predicting their trust, particularly in emerging technologies such as AI are very scarce. Most studies have centred mainly on factors such as experience [60], teachers’ readiness [61], school’s technology policies [62], pressure to use technology [63, 64], and school variables [65], among others. Because teachers are critical stakeholders in the integration and use of technologies, especially AI-based educational technologies in an emerging world [66], we are not satisfied with the current scarcity of studies on the factors predicting TT in AI-based educational technologies in science (STEAM) education in the Nigerian context.

Given the crucial role that trust plays in technology adoption and use, it is necessary to understand critically what elements influence TT in AI-based educational technologies. Trust is a key predictor of the desire to embrace AI-based systems [67]. This is because an examination of TT in AI is crucial. After all, it can inspire the creation of pertinent policies and also result in regulatory actions with potentially grave repercussions. The findings of this kind of study would help policy and decision-makers in education, especially in STEAM education, craft and implement pertinent policies for the adoption and use of AI-based educational technologies [68, 69]. This study, therefore, explored STEAM TT in AI-based educational technologies using a structural equation modelling approach.

3 Literature review and hypotheses development

Structural equation modelling, according to [70], is useful when complex datasets are been analyzed, and also when the direct and indirect relationships between variables are been examined. It is also useful in the identification of the causes or consequences previously existing among individuals or groups of variables [71]. The need for this study arises from the need to investigate the relationships between STEAM TT in AI-based educational systems and AI anxieties (AN), perceived benefits (PB), AI’s lack of human characteristics (LC), preferred methods to increase AI trust (PI), level of technology literacy (ICT skill), and the moderating effect of gender. The conceptual model is shown in Fig. 1.

Fig. 1
figure 1

Research conceptual model

3.1 AI anxieties

Issues with computer usage, a lack of proficiency with new technologies, and protracted technology use, to mention a few, have all been associated with technology-related anxiety [72]. Teachers’ use of diverse technological tools in the educational environment may lead to anxiety which in turn leads to frustration and confusion, the consequences of which are noticeable during classroom interactions [73]. The issues surrounding the adoption and utilization of AI-based educational technologies and the novelty of the technology may generate a feeling of fear and apprehension among teachers. This further condition their views of the complexity related to the use of the technology [74].

Anxiety towards the adoption and use of AI-based technology can occur due to the confused attitude of teachers toward technological improvements, confusion around technology autonomy, and ignorance relating to the socio-technicalities of the technology [75]. Therefore, anxieties related to the adoption and use, or trust in AI-based educational technologies can be expressed as apprehension or panic nervousness arising from the unknown directions of AI-based technology developments [75]. High levels of technology anxiety are associated with negative attitudes toward technology, whereas positive experiences with technology use are associated with extremely positive attitudes toward technology [76,77,78]. A high level of technology anxiety may result in trouble using technology, according to research that has linked technology anxiety to actual technology use [73]. Additionally, [79] and [80] have documented a direct correlation between technological anxiety and a number of other factors, such as age, frequency of technology use, past experiences using technology, and neuroticism. Concerning trust in AI-based systems in education, the study hypothesized that:

H1. AI anxieties predict TT in AI-based educational technologies.

3.2 Perceived benefits

According to [81], for teachers to utilize the potential of AI-based educational technologies, they must be aware of the instructional contributions of the technology. In order words, the level of belief or trust that a teacher has that technology will increase his performance and drastically reduce his efforts refers to perceived usefulness or benefits [82]. This concept was made popular by [83], who claimed that perceived advantages are related to how strongly users of technology feel that utilizing it will improve their ability to execute their jobs. The use of AI in educational activities entails some potential benefits, and teachers’ awareness of these potential benefits may affect their perceptions of the usefulness of AI in education [74, 84]. This position is corroborated by [85] that AI-based systems can be effectively adopted in education when teachers are sufficiently aware of its pedagogical benefits and are knowledgeable enough to use the system. In order words, the more teachers are aware of the benefits of using AI-based systems in education, the more they will use the system to improve motivation and engagement among their students [86]. In the same vein, [87] report that teachers who are knowledgeable about AI-based system’s use in education are better positioned to select relevant AI-based systems for instructional purposes, hence teachers’ knowledge of the role of technology is proportional to the successful integration and technology in an educational setting [88]. Based on this literature, the author propose that:

H2. Perceived benefits of AI predict TT in AI-based educational technologies.

3.3 Lack of human characteristics by AI

According to [31], humans possess several distinctive qualities, such as the capacity for perception, emotion, and cognition, which AI cannot replicate [89]. Humans can maintain numerous conflicting mentalities at once because they have free will, consciousness, and emotions that occasionally entail irrational conflicts. This is not true for machines, whose mental processes are limited to logical progression. Although AI algorithms have developed to mimic human behavior, it is still challenging for a short-term machine to replicate human features [90]. Despite the huge potential of AI-based educational technologies in instructional settings, there are also ethical issues surrounding the validity of the decisions taken by the system [91, 92] such as race and cultural discriminations [93, 94], and concerns related to fairness [95]. These, among others, have created challenges for educators in understanding the rationales for unpinning AI-based systems’ decisions [96]. These are observed situations arising from LC. Therefore, it becomes crucial for people working with sophisticated AI-based systems to create accurate mental models of how these systems' various cognitive capabilities relate to human cognition. When AI becomes more autonomous, the human vs. machine conflict is likely to take on a new shape. [97]. Consequently, this can aggravate teachers' resistance or distrust of AI-based educational technologies. Given the literature, the author hypothesized that:

H3. AI’s lack of human characteristics predicts TT in AI-based educational technologies.

3.4 Preferred means to increase trust (PI) in AI

Teachers play significant roles in the preparation of the next generation of students, especially in AI [98], and therefore, many of the reasons for the training of teachers to have a working knowledge of AI in education are very congruent to those recommended to prepare them for digital skills [99]. A crucial element that is closely related to integrating technology in the classroom is teachers' professional development. This is because instructors' knowledge of their area of expertise and their understanding of how to effectively incorporate technology to assist students' learning and achievement work together to raise their level of technological knowledge, confidence in it, and attitudes toward it [100]. The amount of instruction teachers receive in using technology directly relates to how well it is incorporated into the classroom. This result was reached when it was found that among the top factors of successful implementation of technology in education are ongoing professional development programs for teachers and the provision of ongoing support for effective practice [100]. Consequently, technology-related pieces of training foster teachers’ recognition of the roles being played by emerging technologies in students’ learning [101], with regards to this, emerging evidence suggests that training programmes and interventions are germane to mitigating AI-related biases, and therefore help at improving the processes of decision-making [102, 103] as the users’ knowledge of AI increases. Providing pieces of training to teachers while working with data provided by AI-based educational technologies helps to improve their trust in building pedagogical decisions based on data [104]. Given the foregoing, the author propose that:

H4: Preferred means to increase trust in AI-based edtech predict TT in AI-based educational technologies.

3.5 Level of technology literacy

Technology competence or literacy is conceptualized as the ability to effectively use diverse technologies for various objectives [62]. Teachers’ technology literacy or competence is an important predictor of technology integration in instructional settings [105]. This is because teachers’ understanding of educational technologies and how best to blend them with field knowledge for productivity is very important [106]. Teachers’ technological competencies are also expected to be high to be able to beneficially use technologies in instructional environments [107]. Skills level and experience can positively influence technology use [108], and this has been verified empirically [109, 110]. According to [111], the Internet, especially educational tools, is expanding at an exponential rate, and the abilities needed to thrive in technologically based societies are likely to overlap with those needed to study in technologically enhanced classrooms. There is evidence that most educators who expressed negative or indifferent opinions about incorporating technology into their teaching practices lack the necessary background information and expertise to do so in a way that allows them to make "informed decisions" about technology integration [105]. Studies have therefore shown that while there is a high usage of technology among teachers, however, their level of integrating educational technologies in their classrooms is lower [112,113,114]. Given the reviewed literature, the study proposes that:

H5: Level of technology literacy predicts TT in AI-based educational technologies.

3.6 Moderating effect of gender

Gender has been reported to influence teacher beliefs and behaviours [115, 116]. Several studies that examined teachers’ ICT skills based on gender reported that significant differences exist between male and female teachers, while some did not find any gender-based influence. With regards to the integration of technology in instructional settings [117] report that male teachers frequently integrate technologies compared with their female counterparts. The acceptance, integration, and use of technologies by teachers in the classroom, according to several studies [e.g. 117–119], has not been significantly influenced by gender. Additionally, according to studies, female teachers are more anxious about using technology in their lessons [79, 118,119,120,121]. However [122], female instructors incorporate technology more than male teachers do, and their perceptions of their proficiency in technology have improved in comparison to those of their male colleagues, who have maintained the same perceptions. Regarding the importance of gender as a determining factor in predicting teachers' adoption and integration of technology in teaching and learning processes, literature has continued to demonstrate discrepancies in conclusions. However, this has not been investigated in terms of STEAM TT in AI. To fill this gap in the literature, the author propose the following hypotheses:

H6. Significant differences exist between male and female teachers regarding the relationship between anxieties related to using AI-based edtech and trust in AI-based educational technologies.

H7. Significant differences exist between male and female teachers regarding the relationship between perceived benefits of AI-based edtech and trust in AI-based educational technologies.

H8. Significant differences exist between male and female teachers regarding the relationship between AI-based edtech’s lack of human characteristics and trust in AI-based educational technologies.

H9. Significant differences exist between male and female teachers regarding the relationship between preferred means to increase trust in AI-based edtech and in AI-based educational technologies.

H10. Significant differences exist between male and female teachers regarding the relationship between the Level of ICT literacy and trust in AI-based educational technologies.

4 Methods

4.1 Participants

Six hundred and sixty-seven (677) in-service STEAM teachers from three states in Nigeria made up the study's participants. Males make up 74.6% of the sample while females make up 25.4%. The majority of STEAM educators have at least had some training on the value and use of technology in the classroom. Due to the hype around the use of AI in education in Nigeria, the majority of teachers are aware of the technology, even though a sizable portion have not used it. The survey's aim and objectives were explained to the STEAM teachers who willingly participated, and they were also assured of their anonymity for the duration of the study for ethical reasons. In Table 1 below, their demographic profiles are further displayed.

Table 1 Demographic profile of the participants in the study (n = 677)

4.2 Instrument for data collection

The data used in the study were collected across three states in Nigeria, using a Google form-based questionnaire. The instrument used in the study was adapted [67] to suit the purpose of the study. The constructs adapted are anxieties connected with the use of AI-based edtech (AN, 3 items), perceived benefits (PB, 7 items), LC in AI (LC, 4 items), TT (PI, 3 items), and preferred methods of increasing trust in AI (PI, 3 items). A total of 20 items were adapted from [67]. We developed and included an item on the level of technology literacy (ICT skill) and gender (see Table 2 for Cronbach’s alpha; CR—composite reliability, and AVE—average variance extracted). The final instrument, which was divided into two sections, was validated by academic experts in Test and Measurement and Educational Technology. The STEAM teachers' demographic information, including gender, age, subject taught, type of school, location of the school, and degree of ICT literacy, was gathered in the first section of the instrument used. The 20 items they responded to were included in the second section. The options given range from strongly disagreed (1) to strongly agree (7) on a Likert scale. The survey was shared via the in-service teachers' school and other online professional platforms across Nigeria. The Google form-based survey was left open for three-month after which it was shut down from data collection (February to April 2023). During the three months that it was opened, daily prompts were sent out across the platforms on which the instrument was shared to remind the teachers.

Table 2 Result of the measurement model

4.3 Data analysis

The study examined the proposed relationships between AI-based educational technologies constructs and the moderating effects of gender using structural equation modelling (SEM), a method of evaluating and modifying conceptual models, including their relationships among variables, at the same time. Using WarpPLS 7.0 [123] to analyze the data and to perform the partial least squares-structural equation modelling (PLS–SEM) analysis, this study used PLS–SEM method to facilitate theory building (125, 126). As a first step in the analysis, we determine if the sample size for this study is adequate or not. Two methods are suggested to estimate the minimum sample size required for a PLS-SEM study: the inverse square root method and the gamma-exponential method [123, 124]. By simulating Monte Carlo experiments, these methods produce estimates that are similar to Monte Carlo estimates. As a result of the inverse square root method, the minimum sample size tends to be overestimated. However, a better and more precise estimate is provided by the gamma-exponential approach [124]. Since inverse square root methods are more conservative and ensure a significant power level, researchers should report estimates from both methods [123]. This study required a minimum sample size based on both methods, as shown in Fig. 2.

Fig. 2
figure 2

Minimum sample size and statistical power

The study determined that a suitable and sufficient sample size for the research is 677 in-service teachers who have trust in AI-based educational technologies. This determination was made using two different methods, namely the inverse square root and gamma-exponential methods, which estimated minimum required sample sizes of 366 and 353, respectively (see Fig. 3). The significance level (alpha) used in the study was 0.05, the minimum absolute significant path coefficient was set at 0.130, and the desired power level was 0.80. To assess the validity of the measurement model, the researchers employed convergent and discriminant validity indices. Convergent validity was established by evaluating various measures, including the CR, Cronbach alpha, and Dijkstra's PLSc reliability index. These measures were required to exceed a threshold of 0.70. Additionally, the AVE needed to be greater than 0.50 to indicate convergent validity. Discriminant validity was assessed by examining the squared AVE for each latent variable, which should be higher than the correlation between that variable and all other latent variables [125]. Furthermore, cross-loading and the Hetero-trait mono-trait ratio correlation (HTMT) were considered, and a value of less than 0.85 was required to establish discriminant validity. The researchers also evaluated the structural model using several criteria. Stone Gassier's Q2 was employed to determine the predictive relevance of the model, with a value greater than 0 indicating its relevance. The significance of paths in the model was assessed using stable3, which required a test statistic (T) value higher than 1.645 in a one-tail test. The variance inflated factor (VIF) was examined, and values below 3.3 were considered acceptable. Effect sizes (f2 values) were calculated to measure the impact of exogenous variables on the endogenous variable, with values of 0.35, 0.15, and 0.02 indicating large, medium, and small effects, respectively [126,127,128]. The adjusted R-square coefficient (R2) was used to determine the amount of variance explained by the exogenous variables with the endogenous variables. Finally, a two-stage approach was employed to test for moderating effects. In this approach, factor scores were calculated and then used to construct interactions or products. This method was chosen over variable orthogonalization and indicator products as the preferred approach for the study [129].

Fig. 3
figure 3

Path coefficients of the structural model analysis

5 Results

5.1 Measurement model validity assessment

Before conducting a detailed analysis of the structural relationships, a measurement model was carried out to validate the constructs. In this analysis, we compared the correlation matrices implied by the model with the empirical indicators using both existing and new indices [123]. The new indices used in the analysis included the standardized root mean squared residual (SRMR), standardized mean absolute residual (SMAR), standardized chi-square (SChS), standardized threshold difference count ratio (STDCR), and standardized threshold difference sum ratio (STDSR). These new indices were complemented by existing indices such as Tenenhaus GoF (GOF—small >  = 0.1, medium >  = 0.25, large >  = 0.36), Sympson's paradox ratio (SPR—acceptable if >  = 0.7, ideally = 1), R-squared contribution ratio (RSCR—acceptable if >  = 0.9, ideally = 1), nonlinear bivariate causality direction ratio (NLBCDR—acceptable if >  = 0.7), and statistical suppression ratio (SSR—acceptable if >  = 0.7). The acceptable fit of the measurement model was indicated by SRMR and SMAR values lower than 0.1. For SChS, a p-value associated with the SChS equal to or lower than 0.05 indicated a normally acceptable fit at the 0.05 level of significance. The acceptable fit was indicated by STDCR and STDSR values equal to or greater than 0.7, which referred to the modified p-value. Overall, the model fit, and quality indices demonstrated a good fit to the data. This was evidenced by the following values: SRMR = 0.09 (less than 0.10), SMAR = 0.08 (less than 0.10), SChS = 8.912 (p < 0.05), STDCR = 0.87 (greater than 0.70), STDSR = 0.73 (greater than 0.70), GOF = 0.47 (greater than 0.36), SPR = 1.00 (greater than 0.70), RSCR = 1.00 (greater than 0.90), NLBCDR = 0.80 (greater than 0.70), and SSR = 1.00 (greater than 0.70). In summary, the measurement model demonstrated an acceptable fit based on the various indices, indicating that the model adequately captured the relationships between the constructs being studied.

In addition to assessing the validity and reliability of the measurement model, we evaluated the link between the latent variables and their manifest variables. In the research model, six reflective constructs were included, including gender, anxiety when using AI-based education technology, PB of AI-based education technology, the lack of human characteristics in AI-based education technology, preferred methods of increasing trust, level of ICT literacy, and TT in AI-based education technology (Fig. 1). These constructs were classified as reflective due to high correlations between measurement items. We used AVE, CR, and a coefficient to measure the scale's reliability. A convergent validity assessment was conducted as well using AVE. The convergent validity of a measurement instrument is determined by whether respondents can understand the question statements (or other measures) associated with each latent variable. In this regard, the following criteria are recommended for determining whether a measurement model is valid in terms of convergence: the p-value of the loadings should be 0.05 or smaller; the loadings should be equal to or greater than 0.50, and the AVE score should be greater than 0.50 for each dimension [125, 128, 130,131,132,133]; Kock, 2014a). As a result, Table 1 shows convergent validity with all dimensions having an AVE of greater than 0.50 and significant item-to-factor loadings.

Construct reliability is assessed using Cronbach's alpha coefficients and CR. While some evidence suggests that Cronbach's alpha [98, 134,135,136]; may have weak psychometric properties [137], researchers are advised to consider the CR coefficient as a more reliable measure [138]. CR coefficients are also referred to as Dillon-Goldstein reliability coefficients and congeneric reliability coefficients [139, 140]. Reliable measurement instruments are those in which respondents understand the question statements or measures associated with each latent variable similarly. According to various researchers, including Fornell and Larcker [125], Hair et al. [127, 130, 131], Kock [133] and Kock and Lynn [141], both the CR and Cronbach's alpha coefficient should exceed 0.7. The CR coefficient is generally considered to be more precise than Cronbach's alpha [123, 125, 141]. In this study, the CR value exceeded 0.70, indicating good construct reliability [128, 130, 131, 142, 143]. All factors had Cronbach's alpha values ranging from 0.700 to 0.840, and Dijkstra's PLSc ranged from 0.639 to 0.926. Table 1 demonstrates that all AVE values were greater than 0.50, while the CR, Cronbach's alpha, and Dijkstra's PLSc were higher than 0.70.

Measurement instruments like question statements are typically used to test discriminant validity. An instrument with good discriminant validity prevents respondents from confusing the associated measures with those for other latent variables. WarpPLS includes HTMT ratios and other coefficients as part of its outputs [123, 125, 130, 132] that can provide useful information for assessing discriminant validity. There are also correlation coefficients among latent variables, square roots of AVEs, and loadings and cross-loadings among structures (see Table 3 and 4). We also provide p-values, a 90% confidence interval, and the HTMT ratios (see Table 2). AVEs are squared for each construct, and correlation coefficients are calculated using their square roots. As demonstrated in Table 3 and 4, all AVE values were larger than correlations in every case, and cross-loadings were greater than the correlation values for all indicators associated with their highest constructs. HTMT has been shown to perform better than the Fornell-Larcker criterion and cross-loading assessment based on heterotrait-monotrait correlations (HMC). Discriminant validity of reflective measurement models must be established by HTMT values of not more than 0.85 [144]. As shown in Table 5, the model has discriminant validity.

Table 3 Discriminant validity-correlations among constructs with the square root of AVEs
Table 4 Discriminant validity- structure loadings and cross-loading
Table 5 Discriminant validity- HeteroTrait-MonoTrait ratio of correlations

The HTMT ratio values, as presented in Table 5, were found to be below the benchmark of 0.85, thus confirming the discriminant validity of the model. Additionally, Table 6 displays the confidence intervals for the HTMT ratios. The 90 per cent confidence interval is considered substantial and acceptable when one estimate is excluded. In Table 2, the lower and upper limit values are each excluded once. This indicates that the variables in the model exhibit both convergent and discriminant validity and are reliable. Based on these findings, the structural model was assessed to examine the relationships between variables.

Table 6 Confidence interval for HTMT ratios

5.2 Structural model assessment

Table 3 provides the VIFs for all the latent variables and p-values. A redundancy assessment was carried out using this method. A reflective latent variable should have redundant indicators [133]. As a rule of thumb, full collinearity VIFs of 3.3 or less suggest the absence of multicollinearity in the model and no common method bias [133, 145]. For PLS-based SEM using latent variables, this is also the threshold for VIFs [145]. Hence, all VIFs in the model are below the 3.3 threshold, indicating that this study does not have a multicollinearity issue. Nevertheless, it is worth noting that categorical nominal variables/categorical predictor variables such as level of ICT skills cannot be used directly in warpPLS in its present categories unless they are converted into dummy variables before they can be analyzed. The result will also be compared against one of these groups as a reference. To avoid errors caused by zero variance and multicollinearity, one of the categories must be retained.

After establishing the reliability and validity of the measurement model, we proceeded to examine the structural model to assess the direct effects of the latent variables and the amount of variance predicted by the model, as depicted in Fig. 3. The model demonstrated that it predicts more than thirty per cent of the variance in TT in AI-based educational technologies, as indicated in Table 2. The model's predictive capability was further confirmed by Stone-Geisser's test, where the Q2 value (shown in Table 2) exceeded 0 for the construct, indicating its ability to make reliable predictions. The next step involved determining the magnitude of the proposed relationships between the latent variables, as illustrated in Fig. 4. Each relational hypothesis in the model had a distinct path coefficient value. Among them, the relationships proposed between PI—> TT (H4) and PB—> TT (H2) exhibited some of the strongest coefficients. However, hypotheses 3 and 4 indicated relatively weaker relationships between AN and TT, as well as between ICT Skills and TT.

Fig. 4
figure 4

Model indicating the moderating effects of gender

Table 7 presents the t-values of the path coefficients and the f2 statistic calculated to confirm the significance of the proposed relational hypotheses and the size of the latent variable's effect. Based on the results, we conclude that the PLS analysis supports all the proposed relational hypotheses within the model with a significance level of 0.05, except for H3. Usually, Cohen's size effect is recommended at 0.02, 0.15, and 0.35 [127, 128]. Even when the corresponding p-values are statistically significant, values below 0.02 suggest that the effects are too weak to be relevant from a practical standpoint. For reflective latent variables, all indicators' effect sizes should be equal to or greater than 0.02 [127]. Our study shows that H3 has an f2 of 0.001, which makes its inclusion in the model questionable since we cannot ensure that the effect size would be sufficient. As calculated by Cohen and Kock, H1, H2, and H5 have small effects because their values are under 0.15, whereas H4 has a medium effect size of under 0.35.

Table 7 Results of the paths coefficient

Nonetheless, as a complement to the analysis of direct effects, we conducted a two-stage moderating analysis effects of gender across all the paths (see Fig. 4).

In Table 8, Fig. 5a, there is a significant difference between male and female teachers regarding the relationship between anxieties related to using AI-based educational technologies and trust in them, thus supporting H6. In addition, there is a negative interaction effect. Accordingly, teachers with a higher level of gender (i.e., female) are less likely to report anxieties about using AI-based edtech while those with a lower level of gender (i.e., male) are more likely to report anxieties about using AI-based edtech. Moreover, males and females differ in their preferred means of increasing trust in AI-based education technology, level of ICT literacy, and use of AI-based educational technologies, which supports H9 and H10 (see Table 8, Fig. 5d and e). However, no significant differences were found between male and female teachers concerning the PB of AI-based edtech, lack of human characteristics, and trust in AI-based educational technologies; therefore, H7 and H8 were not supported (see Table 8, Fig. 5b and c).

Table 8 Results of the moderating effects analysis
Fig. 5
figure 5figure 5figure 5

a Interaction effect of gender between AN and TT. b Interaction effect of gender between PB and TT. c Interaction effect of gender between LC and TT. d Interaction effect of gender between PI and TT. e Interaction effect of gender between ICT skills and TT

6 Discussion

Exploring STEAM TT in AI-based systems with regards to its adoption and use in teaching and learning of STEAM is very vital. Our study therefore, examines the influence of AN, PB, LC, PI, ICT skill, and the moderating effect of gender on STEAM TT in AI-based educational technologies in K-12 setting, using a structural equation modelling approach. Our proposed model predicts over thirty per cent of the variance TT in AI-based systems in education. Each relational hypothesis has a different path coefficient value, and one of the strongest relationships is that proposed between PI—> TT in AI-based educational technologies  [99, 100], and PB—> TT in AI-based educational technologies [19, 81, 134, 146]. However, AI’s LC and PI indicate the weakest relationship between AN and TT in AI-based educational technologies, as well as between ICT Skills and TT in AI-based systems. Our study also shows that LC has an f2 of 0.001, which makes its inclusion in the model questionable since we cannot ensure that the effect size would be sufficient. The finding additionally shows that AN, PB, and LC have small effects because their values are under 0.15, whereas PI in AI have a medium effect size of under 0.35.

Our findings also show a significant difference between male and female teachers [147] regarding the relationship between anxieties related to using AI-based educational technologies and trust in them. However, there is a negative interaction effect. Accordingly,  female teachers are less likely to report anxieties about using AI-based edtech while male teachers are more likely to report anxieties about using AI-based edtech. This is an interesting finding as most studies' reports tend to favour the males in terms of technology usage [119, 120, 146]. Moreover, males and females differ in their preferred means of increasing trust in AI-based education technology, level of ICT literacy, and use of AI-based educational technologies [115, 116]. However, no significant differences were found between male and female teachers concerning the PB of AI-based edtech, lack of human characteristics, and trust in AI-based educational technologies [148,149,150].

7 Conclusion

In conclusion, our results show that anxiety and preferred methods to increase trust and PB influence trust in AI-based edtech while LC does not have a strong influence. Therefore, by implication, the findings of this study are expected to be an eye-opener to all stakeholders and policymakers in education, especially science education on the need to begin an evaluation of any existing frameworks on technology integration, especially emerging technologies in education to incorporate the adoption processes, integration procedures, and effective use of AI-based educational technologies in STEAM classrooms in Nigeria. Also, as a result of the implications of the findings for STEAM educators and STEAM education in the era of AI, it is expected that policymakers make informed decisions on what must be done to ensure that STEAM teachers trust and use AI-based educational technologies effectively and efficiently in their pedagogical processes. In this regard, the finding of the study might prompt the Ministry of Education officials and other relevant stakeholders in the education sector on the need to design effective professional development programs which will ensure that STEAM teachers in the K-12 section  are well-trained  and grounded in  the integration and use of  emerging technologies in their pedagogical processes. a. Finally, since STEAM teachers have important roles to play in the way their students learn in and outside of the classrooms, it becomes necessary to ensure that their trust in AI-based educational technologies is strategically improved since AI technology is the future of work and life of the fourth industrial revolution and students must not only be taught using the technology, they must equally be prepared to embrace and use the technology.

8 Limitations and future work

We propose that subsequent researchers should consider using a larger sample of participants even though the sample size of this study is sufficient for inference purposes. A larger sample size is suggested because the findings from the sample size of the present study may not represent the opinions of a majority of the STEAM teachers in Nigeria. Therefore, there is a need for a larger sample size to further validate the findings of this study. Also, the study was conducted among secondary school STEAM teachers in Nigeria. The opinions of these level of education teachers may not represent the views of the STEAM teachers in higher institutions in Nigeria. Hence, extending this study to the STEAM lecturers in the higher institution might be worthwhile for further validation of the results of the present study. In addition to this, we suggest that subsequent studies try to streamline the gender gap as observed in this study. Ensuring that the gender gap is balanced in further studies might bring up new discussions on the moderating effects of gender in a study of this nature.