Data mining of scientific research on artificial intelligence in teaching and administration in higher education institutions: a bibliometrics analysis and recommendation for future research

Teaching and learning as well as administrative processes are still experiencing intensive changes with the rise of artificial intelligence (AI) technologies and their diverse application opportunities in the context of higher education. As a result, scientific interest in the topic in general, and in specific focal points, has risen as well. However, there is no structured overview of AI in teaching and administration processes in higher education institutions that allows researchers to identify major research topics and trends, concretize peculiarities, and develop recommendations for further action. To close this gap, this study seeks to systematize the current scientific discourse on AI in teaching and administration in higher education institutions. This study identified (1) an imbalance in research on AI in educational and administrative contexts, (2) an imbalance in disciplines and a lack of interdisciplinary research, (3) inequalities in cross-national research activities, and (4) neglected research topics and paths. In this way, the study contributes a comparative analysis of AI usage in administration and in teaching and learning processes, a systematization of the state of research, an identification of research gaps, and further research paths on AI in higher education institutions.


Introduction
The digitization of business, societal, and educational processes, as well as global events, have influenced the dynamics, application, and development of educational technology (EdTech). For instance, the COVID pandemic created a new and urgent situation for education, forcing a shift towards EdTech [1]. Simultaneously, the amount of scientific literature on artificial intelligence (AI) in education has increased rapidly since its emergence, enhancing both theoretical understanding and practical usage. Opportunities for the application of AI are manifold, especially when integrated into EdTech. EdTech can be summarized as all measures that aim to facilitate learning and improve learning performance through the creation, use, and management of appropriate technological processes and resources [2]. Gao et al. [3], among others, divide EdTech into pedagogical and operational technologies. The first category is directly included in teaching-learning processes, whereby the use of EdTech creates learning environments that place the learner at the center of the learning experience. The second category (operational technology) refers to administrative or operational parts of teaching-learning processes.

Literature identification
A broad search query was developed to ensure a representative coverage of the existing research fields (Table 1). The search query combines different keywords from the field of AI and ML with keywords from the field of higher education and the focal areas of teaching and administration. For the query, the database Web of Science Core Collection was used, which covers a large number of publications from different research areas and a variety of top-ranked journals and conferences relevant to the fields of AI and processes in higher education institutions (HEI). When querying, we focused on the areas of Business, Computer Science, Education, Engineering, Social Science, and Psychology.
A further refinement of the literature basket was conducted using different inclusion and exclusion criteria (Table 2). Thereby, the only further constraint in the query was date of publication, which was set between 01.01.2011 and 15.09.2021, since relevant content was expected to be published in this time range.
The query was conducted in September 2021 and resulted in 2333 hits for query 1 and 2289 hits for query 2. These initial hits underwent a prescreening of title, keywords, and abstracts to assess the relevance of each hit according to the inclusion and exclusion criteria; each hit was prescreened by two researchers. If the researchers disagreed, a third researcher checked the data and added their judgement. Then, the candidate hit in question was discussed until consensus was reached. Prescreening of the query 1 data set led to 1590 removals, leaving 743 hits. Prescreening of the query 2 data set led to 2236 removals, leaving 53 hits. These remaining hits were used as the data source for further analysis.
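The reported hit counts can be checked with a short sketch. The class and function names below are hypothetical and only illustrate the bookkeeping; the disagreement rule mirrors the two-researcher prescreening described above, while the subsequent consensus discussion is not modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prescreening:
    """Hit counts for one search query, before and after prescreening."""
    initial_hits: int
    removed: int

    @property
    def remaining(self) -> int:
        # Hits that pass prescreening and enter the bibliometric analysis.
        return self.initial_hits - self.removed


def needs_third_reviewer(vote_a: bool, vote_b: bool) -> bool:
    """True when the two prescreening votes (include = True) disagree."""
    return vote_a != vote_b


# Counts as reported in the text.
query_1 = Prescreening(initial_hits=2333, removed=1590)
query_2 = Prescreening(initial_hits=2289, removed=2236)

for name, q in (("query 1", query_1), ("query 2", query_2)):
    share = q.remaining / q.initial_hits
    print(f"{name}: {q.remaining} of {q.initial_hits} hits retained ({share:.1%})")
```

Running the sketch reproduces the 743 and 53 remaining hits and shows how much more restrictive the prescreening was for query 2.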

Bibliometric analysis
Based on the Web of Science Analysis Results Tool and the R package "bibliometrix" [17], a bibliometric analysis was performed to screen and classify the field. This includes the analysis of scientific research records of countries and collaborations and the analysis of the most frequently used keywords in order to group topics and themes and identify trends through factorial analysis. This bibliometric analysis focuses on creating conceptual maps of keywords, synonyms, and related concepts through factorial analysis, providing collaboration network maps of countries or organizations, categorizing the keywords, synonyms, and related concepts, and then relating these groupings to countries or organizations (e.g., in a three-fields plot) [18]. The number of data sets in each section corresponds to the total number of published articles. To analyze the datasets more deeply, we used RStudio. We created a co-occurrence network of the keywords from the datasets. We did not use the keywords from the articles, as they were often too generic. Instead, we   [20]; isolated nodes are deleted and a minimum of two edges to a node was set as the default criterion to be incorporated into the network.
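The construction of a keyword co-occurrence network with a minimum-edge criterion can be illustrated with a minimal sketch. It is written in plain Python rather than with the R tooling used in the study, and it is not the authors' implementation: the function names, the toy records, and the single-pass reading of the "minimum of two edges" rule are assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_edges(keyword_lists):
    """Count how often each unordered keyword pair appears in the same record."""
    edges = Counter()
    for keywords in keyword_lists:
        # Deduplicate within a record so a repeated keyword is counted once.
        for a, b in combinations(sorted(set(keywords)), 2):
            edges[(a, b)] += 1
    return edges


def filter_by_degree(edges, min_degree=2):
    """Keep only edges whose endpoints both have at least `min_degree` edges.

    Assumption of this sketch: degrees are computed once on the unfiltered
    network and are not recomputed after nodes have been removed.
    """
    degree = Counter()
    for a, b in edges:
        degree[a] += 1
        degree[b] += 1
    keep = {node for node, d in degree.items() if d >= min_degree}
    return {pair: w for pair, w in edges.items()
            if pair[0] in keep and pair[1] in keep}


# Hypothetical toy records, not taken from the study's data set.
records = [
    ["performance", "students", "online"],
    ["performance", "students", "engagement"],
    ["model", "framework"],
    ["isolated"],
]
edges = cooccurrence_edges(records)
network = filter_by_degree(edges, min_degree=2)
```

In this toy example, the keyword that never co-occurs with another keyword never enters the network, and the pair that is connected only to each other is dropped by the degree threshold.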

Results
First, we present the results on AI for teaching in HEI, and afterwards, the results on AI for administration in HEI. Descriptive information regarding the development of the number of publications, involved countries, and collaborations, as well as topics, their development over time, and trends, is presented in turn.

AI for teaching in HEI
The development of the publications per year (Fig. 1) shows the growing importance of AI in the context of teaching in higher education within the research landscape. In the ten-year period studied, the number of published papers per year increased more than tenfold: in 2011, fewer than 20 papers including these terms were published in the database, while in 2021 (until September), more than 200 papers were published. The upward trend in the number of papers was not interrupted in any of the years studied. Chinese institutions are leading the research effort on the topic of AI in teaching and learning in HEI (Fig. 2). They strongly dominate the sector, followed by the USA, Spain, and Australia. Other countries with a minimum of 10 publications in this area are England, Taiwan, Canada, Scotland, Germany, Brazil, the Netherlands, Italy, Saudi Arabia, Japan, Chile, Greece, Malaysia, Russia, France, and India.
Strong collaborations between institutions from the United States and those from Australia, Canada, China, and Europe in general can be identified. Furthermore, strong collaboration between Australia and England specifically, but also Europe generally, can be seen on the country collaboration map (Fig. 3). The analysis shows that there are almost no collaborations between Africa and other countries. South American countries collaborate mainly with European countries. Researchers from institutions in New Zealand and Russia do not collaborate on this topic with scientists based abroad.
Figure 4 shows the 50 most used words from the KeyWords Plus analysis. The size of the individual words indicates the frequency of use. In addition to more general keywords, such as education or students, there is also a more specific term, performance, among the three most frequently used keywords. With regard to the word performance, we can also identify an accumulation of other typical narratives of the scientific discourse, such as impact, outcomes, self-efficacy, support, and success, among the 50 most frequent words targeting the effectiveness or promise of AI-based technologies in teaching. Semantically, a higher frequency of technical terms, such as learning analytics, model, big data, system, and algorithm, can be identified. In contrast, there are no terms from the field of pedagogy among the 50 most frequently used terms. This could indicate that the discourse is more situated in the field of computer science/informatics. Word duplications like student(s) and system(s) are not deliberate; they result from the statistical software extracting keywords without taking grammatical number into account.
Figure 5 shows the correlations between three fields of analysis: the KeyWords Plus analysis, the countries, and the specific affiliations of the authors. It can be seen directly that aspects such as performance, motivation, design, engagement, and model are particularly represented in the body of literature. General keywords, such as students and education, can also be identified. Publications from China and Australia in particular address a wide range of the keywords. These countries also host most of the affiliations that contribute to the topic investigated. Although slightly fewer in number, a strongly diverse set of keywords can also be found in the contributions from the USA, UK, and Spain. Leading by number of contributions are the University of Edinburgh, University of Iowa, and Taipei Medical University.

Considering the co-occurrences of keywords, five different clusters can be identified (Fig. 6). The largest cluster (green) of keyword co-occurrences comprises aspects such as performance, students, online, participation, engagement, higher education, and achievements. The size of a presented keyword correlates with its frequency of occurrence. The lilac cluster relates the teacher keyword with aspects such as learning analytics, technology in general, acceptance, perceptions, support, and language. Another cluster (red) centers around the aspect of analytics. Co-occurring keywords are big data, success, patterns, and networks. Another cluster of keywords (blue) comprises education, design, system, impact, and knowledge. The fifth cluster (orange) consists of co-occurring keywords such as model, framework, motivation, student, classroom, beliefs, self-efficacy, and feedback.
Figure 7 shows the development of the 10 most used keywords in the period of 2011 to 2021 for the selected publication strand. Generally, the frequency of keywords has increased significantly as the number of publications has increased, with the largest increase occurring in 2017. The words performance, model, and impact show the greatest annual growth. The keywords design and higher education have also increased, especially since 2017, although at a more moderate rate than the former. According to keyword occurrences, the focus on students has decreased since its peak in the middle of 2019. The same applies to technology, learning analytics, and online, which have decreased since the beginning of 2020. The keywords AI and machine learning, on the other hand, are not among the 10 relevant words.
Figure 8 illustrates the trending topics in the publications according to the frequency of their use and with a chronological assignment on the axis of the years studied. We focused on the last five years, since the number of papers rapidly increased in this period. Motivation was used most frequently, with a peak in 2019. In addition to the terms university, curriculum, and classroom, which are emblematic of the context of this study, technology was also used. It must be noted that the general technological progress is reflected in the terms used: in 2017, the most frequently used technical term is computer; in 2018, it is networks; in 2020, many papers report on science, classroom, skills, technologies (a much broader term), and simulation as a concrete application. In 2021, many papers referenced recognition in general, which was the most frequently used term.

AI for administration in HEI
There was a steady increase in the number of publications per year with a focus on the usage of AI in HEI administration (Fig. 9). As with the teaching search string, in a 10-year period, the number of publications increased tenfold. There was an especially marked increase in 2018, and from then onwards, the numbers increased significantly, although they remained at a moderate level. It should be noted, however, that from 2011 to 2017 the number of publications on AI in HEI administration processes was very low or nonexistent. Figure 10 shows the countries with the largest output of papers on AI and HEI administration. Three countries produced about 30% of the papers; these were China, Spain, and the USA. Of the European countries, Greece, England, and France led in terms of quantitative output. South America participates relatively significantly in this publication context compared to African and other Asian countries (China excluded).
As shown in the country collaboration map (Fig. 11), there is only one collaboration, which is between Italy and Romania. There are no other cross-national cooperation activities with a focus on the usage of AI in administration in the HEI context. A possible reason for this may be the low overall number of publications on this specific topic.
Figure 12 shows the 50 most frequently used words from the KeyWords Plus analysis, where the size of each word in the figure is representative of the frequency of use. Education is the most used word. Of the technology-linked words, learning analytics is the leader in the administrative field, followed by system and online, which occur slightly less frequently. Of the terms describing the context of use, emphasis is on performance and environment, followed by participation. Terms that are distant from the topic, such as customer churn prediction, citizenship, and assurance, are also represented, even though they are used relatively rarely. Considering the relationship between KeyWords Plus results, countries, and author affiliation, the USA seems to focus mainly on performance and prediction, while Spain has the broadest variety of different keywords (Fig. 13). Generally, there is a broad variety (e.g., the keyword system is addressed by research institutions from many different countries). Furthermore, it must be noted that there are some missing values in the data. For example, the University of La Frontera in Chile is not related to specific keywords.
In order to understand in more detail the relationships between the identified keywords, we created a structure that represents the co-occurrence network between the 10 most used keywords (Fig. 14). By analyzing the individual keywords that are of higher relevance in the research topic of AI in HEI administration, the co-occurrence network becomes more detailed, and we can identify three different clusters (blue, red, and green). The clusters show possible causalities between the keywords. Interestingly, the blue and green clusters show overlaps. Thematically, both clusters can be located in the area of data-based analysis or evaluation of learning performance. For the red cluster, we cannot analyze the relationships in more detail based on the structure of the co-occurrence network.
Figure 15 illustrates the variations in the use of the most-used terms in the context of administration. In 2012, all the terms are used quite infrequently. Similar to the development in the context of teaching, the use of the terms increases strongly from 2017. One exception is participation, which increased until 2017 and has been stable from then onwards. All the other terms (learning analytics, online, system, environment, education, and performance) increased constantly, with performance showing the steepest increase.
As there were either no or not enough articles focusing on AI in HEI administration in the previous years, trend topics can first be identified from 2018 onwards (Fig. 16). The chosen logarithmic frequency shows performance as a trend topic in 2018. Environment and system are identified as trend topics in the following year.

Future research paths
This study applied bibliometric analysis as a comprehensive and structured tool to organize and analyze the research on AI in administration and teaching in HEI. Among other things, this allowed us to find gaps in the literature and identify areas that have not been adequately studied but warrant further attention [21]. Below, we explain the findings that emerge directly from our literature review and point to topics that shape the current discourse on AI in HEI but have not been adequately addressed in the literature thus far.

Imbalance in research on AI in educational and administrative contexts
The most important finding of this analysis is the imbalance in research on the two different contexts. In 2021, the year with the highest number of publications, the research output on the application of AI in teaching and learning contexts is 10 times larger than that on the application of AI in administrative HEI processes (see Figs. 1, 9). This imbalance needs to be addressed in further research, as theoretical and empirical work is the foundation for the future implementation of AI in HEI processes [22]. Studies highlight the potential of AI and educational data mining for administrative operations (e.g., student admissions) [10], answering service requests or inquiries from students to the secretariat [23], future course selection and appropriate student administration [24], student retention and increasing student enrollment [25], administrative support (e.g., assignment submission, course registration, examination schedule, scoring, graduation) [26], and supporting administrative staff by answering students' FAQs [27].
Furthermore, there is a minimal number of collaborations in research on AI in administrative processes; only Italy and Romania have collaborated (Fig. 11). Thus, research cooperation should be intensified to enable cross-organizational benefits and experience sharing.

In addition to this identified need for more research in the administrative domain, the results point to growing research interest in both fields and research needs that affect both application contexts. These are discussed below.

Imbalance in disciplines and lack of interdisciplinary research
In both administration and teaching, the lack of diversity in the applied research approaches is evident. This can be deduced from the result of the keyword analysis (Figs. 4, 12). Most papers are predominantly technical, and nontechnology-related research is underrepresented. Further interdisciplinary research could provide insights from other disciplines. In general, the bias in study results has resulted in a lack of validity, as well as bias effects and vertical scoping problems. The importance of interdisciplinary research in the context of digitization, and particularly in the development and implementation of AI technologies, is undisputed [28]. Essential insights from philosophy, for example, could increasingly bring to the fore the limitations of using AI as a substitute for a human teacher [29], as well as provide insights from ethical and epistemological perspectives [30]. Also important are psychological and cognitive aspects and issues, such as acceptance and decision making. These themes are already apparent (see Figs. 4, 12), and their importance will grow in the future. Interdisciplinary studies that link computer science with humanities and social sciences will shape research on explainable and ethical AI to address challenges such as transparency and trust, enabling testing of AI systems for regulatory reasons, and adapting AI systems in response to unexpected behavior [31]. The interdependencies and the dynamic market and technological developments in the fields of AI for administration and teaching in HEI cause a fundamental methodological change resulting from the interdisciplinary perspective. Conceptual development and determination of appropriate characteristics of AI for administration and education in HEI are necessary. Furthermore, innovative and cross-disciplinary approaches are required to address this issue [16].
The imbalance in the diverse disciplines that focus on research on AI in university educational processes also leads to an imbalance in the keywords used to describe the literature. A strong focus is on primary topics (e.g., technology and students) in the classroom context, neglecting important accompanying issues, such as teachers or competencies in the digital world. The cluster analysis of keywords (Fig. 6) similarly shows that in the fifth cluster, the identified outcomes are fewer than those in other clusters, and psychological (motivation, beliefs, self-efficacy) or pedagogical (feedback) keywords are not explored in direct relation to terms like big data or learning analytics. Therefore, there is a risk that the complexity of teaching and learning processes is neglected, limiting the generalizability of the results. At the same time, these competencies are relevant, especially during and after the COVID crisis [32][33][34]. Although the development of the technology is very important and research-intensive, learning remains a human-centered process. For this reason, there should not simply be a transfer of the real world into the virtual world, but the specifics of the interaction between people and technology should also be taken into account. In the literature, factors such as learner characteristics [35,36] and teacher competencies [37], which are crucial for the long-term use and acceptance of the technology, have already been investigated. Further research should therefore include all aspects and stakeholders of teaching and learning processes.

Discover Artificial Intelligence (2022) 2:16 | https://doi.org/10.1007/s44163-022-00031-7

Inequalities in cross-national research
The data reveal inequalities in collaboration across countries and institutions (Figs. 2, 11). The literature analyzed is heavily dominated by certain countries that lead the discourse on the use of AI in higher education teaching, which implies the risk of research monopolies and thus bias effects. Moreover, studies on cultural aspects of AI application seem promising to mitigate this problem and create different cultural approaches to AI use. We point out the need to establish platforms and formats for the exchange of ideas and experiences and the promotion and funding of international research collaborations. Furthermore, regulations on handling data are often the decisive factor for the further development of AI applications that require large amounts of real data. Regulations at the national level are important in this regard. Since ensuring access to personal data is particularly critical, there is a need for research on ways to enable secure, ethical, and socially acceptable access to data [9,10,38].

Neglected research topics and paths
To our surprise, the analysis results showed that essential topics are neglected in the papers studied. These include AI-related ethics, fairness, and privacy issues [39][40][41]. This is consistent with the findings of other extensive studies [31]. In the university sector, the issues include how to connect sustainability and AI systems, discrimination against students, and overall transparency. Reconsidering the data and analyzing the respective shares of sustainability-, discrimination-, and transparency-related aspects in the articles reveals that these aspects are underrepresented. In the teaching data set, roughly one percent of the articles are concerned with sustainability-related aspects, and less than one percent each are concerned with discrimination- or transparency-related aspects (Fig. 17). In the data set resulting from the administration search string, none of these topics is explicitly mentioned in the abstracts of the articles. Thus, we call for more research on these societally relevant topics in the context of AI in HEI.
Fig. 17 Share of sustainability, discrimination, and transparency-related aspects in the data
Furthermore, comparative studies are largely lacking, both between countries and between the two sectors (education and administration). There is a need to study different target groups in the context of AI in administration and teaching for higher education institutions, as comparison or generalization of existing results is impossible or possible only to a limited extent. Comparison of research results on AI in universities is generally difficult, as the complexity of AI applications for higher education administration is increased by institutional or context-specific characteristics (as well as by technological developments and market dynamics). Therefore, embedded case study approaches that consider a wide range of common AI applications for higher education administration are needed. By comparing such cases, (at least) the internal validity of corresponding studies can be improved.
There is a lack of long-term observations of AI-enabled teaching in HEI [42]. Short-term observations do not lead to a holistic understanding of the long-term effects of the use of AI for teaching in HEI. Therefore, it seems necessary to investigate the usage and effects of AI on students, their learning behavior and competencies, the lecturer, applied didactical approaches, and the teaching and learning content. This would allow for a better understanding of the framework conditions and of how AI can be leveraged to enhance the teaching and learning experience.

Discussion and conclusions
In this study, a bibliometric analysis of the body of literature on AI in administrative and teaching processes in HEI was presented. On this basis, future research paths were derived. This study identified (1) an imbalance in research on AI in educational and administrative contexts, (2) an imbalance in disciplines and a lack of interdisciplinary research, (3) inequalities in cross-national research activities, and (4) neglected research topics and paths. Specifically, it reveals that the emphasis of AI research in the context of EdTech in HEI lies on the teaching aspect. The number of outputs on AI in administrative processes is less than one-tenth that of AI in teaching processes (53 in comparison to 743 final database hits). This may roughly represent the ratio of expenditure and staffing of HEI in these areas. However, the two areas also need to be considered separately, given the potential of AI in, e.g., learner profiling, tracking patterns in student outcomes, or staffing of courses.
The number of publications significantly increased from 2017. On the one hand, this goes hand in hand with a general increase in the number of research outputs due to the increasing number of outlets [43,44]. On the other hand, due to specific funding schemes, such as the Artificial Intelligence Funding Initiative from the German Research Foundation, the Horizon Europe scheme, the European Research Council, and the European Innovation Council from the European Union, which all explicitly address AI research, research activities have already increased. Furthermore, the COVID crisis forced the usage of EdTech and, therewith, increased the research opportunities in this area.
A very low number of African institutions conduct research on AI in HEI. In both sectors, China and North American countries have produced the main share of contributions. This is not surprising since China has, in recent years, attributed very high importance to AI research and provides governmental support [45], with attention to AI in education [46,47]. The analyzed papers show different tendencies in using AI, especially towards an evaluation of teaching and performance, including prediction (e.g., [48][49][50]). The use of AI for learning analytics (e.g., [51][52][53]), but also for the development and further development of algorithms for AI application in teaching (e.g., [54][55][56][57]), has been intensively researched. North American and Spanish researchers are also very active on both topics. The frequency of trending-topic keywords grows with the number of publications on this topic. The word performance leads in both areas. Technical aspects, such as performance, framework, model, and learning analytics in general, are in focus, while soft aspects, such as skills and acceptance, seem not to be in primary focus.
Surprisingly, neither AI nor machine learning is mentioned in the keywords in the teaching data subset. In the administration subset, they are present but rarely mentioned. This might be explained by the recent inception of the explicit hype on AI in research. Technology, students, and online higher education are the keywords that have decreased at a rising rate since 2017. It seems that they are being replaced by the technical terms model and impact, which have increased exponentially.
This research has some limitations. First, the decision to focus solely on WoS as the one database for data extraction limits the results. Although adopting a narrow focus is common practice and consistent with recent work ([18,31]; see also [58]), future research should collect data from different databases. Second, only inter-country collaborations, and no intra-country collaborations, were investigated. Furthermore, although applying inclusion and exclusion criteria and mutually discussing questionable hits limits the risk of bias, bias and subjectivity cannot be excluded entirely. The research paths are primarily based on quantitative measures of the research output, namely the number of keywords describing the actual content. Assuming that the keywords describe the content of the papers correctly, this approach suffices. However, it must be questioned whether this relationship holds for every item. Furthermore, only contributions in English were collected for the data set. Considering, for example, the European countries, French and Spanish researchers contribute a large share of research, but some of this is likely published in their national languages. It therefore cannot be ensured that all relevant research results are included in the data set underlying the results of this study.