1 Introduction

Educational Data Mining (EDM) involves the use of data mining (DM) techniques on data collected via various educational systems (Berland et al., 2014) to improve educational quality (Baker & Yacef, 2009). Recent EDM studies primarily focus on analyzing students’ learning and behavior, analyzing educational strategies and interventions, predicting students’ performance and dropout, and providing personalized recommendations to students (Romero & Ventura, 2020).

Students’ academic performance is one of the main priorities for educators in determining educational success at all educational levels (Tan et al., 2019). Predicting students’ performance enables early interventions and support to be devised and implemented by educators to enhance their performance, particularly for students who are at risk of failure (Lu et al., 2018; Wakelam et al., 2020). Despite the increasingly growing popularity of DM in various educational settings, EDM studies are still rather minimal and are not discussed in-depth, particularly in developing countries such as Malaysia (Jamil et al., 2018; Shahiri & Husain, 2015). The handful of prediction-related EDM studies done in the Malaysian school context includes Khairy et al. (2018), Kiu (2018), and Makhtar et al. (2017). Hence, very minimal is known about the academic performance predictors for students of this region.

The most common DM methods used in EDM research are classification and clustering (Mohamad & Tasir, 2013). In previous EDM studies, the two types of data modeling techniques used are descriptive and predictive models (Dutt et al., 2017). The common techniques used in the classification method are Decision Tree (DT), Neural Networks (NN), Naïve Bayes (NB), and Support Vector Machine (SVM) while the techniques used in the clustering method are fuzzy clustering, K-means algorithms, and discrimination analysis (Salem & Parusheva, 2018). Saa et al. (2019) discovered that DT is the most utilized technique in EDM to predict students’ performance. DT is often used because it is easy to understand and has high predictive accuracy (Sathe & Adamuthe, 2021).

MCE is a compulsory national examination that is equivalent to the International General Certificate of Secondary Education (IGSCE). This examination is meant for 17 years old, Form Five students in Malaysia. In recent years, students’ performance in English and Mathematics has been an issue of concern in Malaysia. According to statistics released by the Malaysia Ministry of Education, in 2018 and 2019, approximately one out of five students failed these subjects (Kementerian Pendidikan Malaysia, 2020). Although studies on the relationship between English and Mathematics have emerged in literature over the years such as Awan et al. (2011), Bagceci et al. (2014), Perez and Alieto (2018), and Stoffelsma and Spooren (2019), no known study is harnessing on data mining techniques to gain insights into it. Hence, this study aims to identify factors that predict students’ MCE performance in English and Mathematics subjects, identify characteristics of students at different levels of performance, and gain more insights into the relationship between these subjects using DM techniques.

1.1 Factors that predict students’ performance

Dutt et al. (2017) and Saa et al. (2019) conducted reviews of EDM literature particularly on predicting students’ performance. These reviews reported that numerous studies identify factors that predict students’ academic performance and the best DM method to be utilized in each context. The recent literature review by Alyahyan and Düştegör (2020) concluded that there are five factors that researchers frequently examined when predicting students’ performance. These include past academic performance (Adekitan & Salau, 2019), demographics (Atlay et al., 2019), learning environment (Mueen et al., 2016), psychological attributes (Khairy et al., 2018), and e-learning activities (Almeda et al., 2018).

Past academic performance is the most used factor to predict students’ performance (Saa et al., 2019). Shahiri and Husain (2015) noted that final grades or GPA is the most significant attribute used to predict students’ performance. Based on 30 studies from 2003 to 2014, Shahiri and Husain (2015) reported that one-third of them used GPA as the key factor in student performance prediction. Similarly, based on the review of 36 research papers from 2009 to 2018, Saa et al. (2019) identified the two most common factors in predicting students’ performance as their previous grades and internal assessments. Therefore, there is a notable justification to include past academic performance in the present study.

Next, demographic factors are identified to be important elements for predicting students’ performance (Alyahyan & Düştegör, 2020). Hoe et al. (2013) reported a strong connection between students’ demographics and their academic performance. Other studies have shown that gender is one of the student demographics that significantly affect students’ performance (Saa et al., 2019). Nevertheless, some studies reported no critical effect of gender (Martínez & Gil, 2019) and students’ social condition (Maghari & Mousa, 2017) on academic performance. In short, it is crucial to further examine demographic factors as there is a gap in existing knowledge on how demographics affect students’ performance.

Prior studies have shown that psychological attributes are significant predictors of students’ performance (Gray et al., 2014). Khairy et al. (2018) found that students’ cognitive abilities influence their academic performance during secondary school. Other studies (Garg, 2018; Putpuek et al., 2018) focus on psychological attributes such as students’ interest, study behavior, stress, anxiety, time concern, self-regulation, and motivation. These studies found that students’ performance is affected by these factors. Although some attempts have been made to deal with these attributes, this psychological dimension has received limited attention in the literature (Saa et al., 2019). Hence, it is important to include psychological attributes in the present study to discover more about how this factor affects students’ performance.

Many EDM studies examine the influence of one single factor such as past examination marks studies (Asif et al., 2017) or demographics (Atlay et al., 2019) on students’ academic performance (Saa et al., 2019). Although combining many factors in a study has the potential to increase the predictive accuracy of DM algorithms (Mishra et al., 2014), the review study by Shahiri and Husain (2015) discovered very limited studies that examine how the combination of such factors affects the prediction of students’ academic performance. Moreover, most past studies are limited to factors that are frequently studied (i.e., past academic performance, demographics) (Namoun & Alshanqiti, 2021). This creates an interesting gap to be examined in this study in which past academic performance, demographics, and psychological attributes are examined simultaneously to gain insights into their relevance to students’ performance prediction. However, this study excludes the learning environment factor. According to Alyahyan and Düştegör (2020), some attributes related to this factor include program type, class type, and semester period. In this regard, the present study’s sample only involved some Malaysian public schools of similar locality and physical classroom settings. Moreover, the students involved were all in the same grade and took the MCE national examination in the same year. Hence, the present study did not examine the learning environment factor. It also excludes the e-Learning activities factor as e-Learning is not compulsory in Malaysian secondary school education before the Covid-19 pandemic struck.

1.2 Research framework

Figure 1 shows the research framework of this study. Malaysian upper secondary schools employ a grading system that comprises 10 different grades (A + , A, A-, B + , B, C + , C, D, E, and G). In this study, students’ results were divided into three performance categories: high (A + , A, A-), moderate (B + , B, C + , C), and low (D, E, G). This study examined past academic performance, demographics, and psychological attributes of Form Four students to identify prominent DT rules that define the characteristics of students with low, moderate, and high MCE performance in each of the examined subjects. The study also examined the main predictors of their MCE performance in English and Mathematics subjects respectively and determined the algorithm with the highest predictive accuracy. The study then examined the relationship between English and Mathematics performance through correlation analysis as well as examining whether students’ past performance in one subject serves as a predictor for their performance in another subject.

Fig. 1
figure 1

Research framework

The following lists the research questions of this study:

  • RQ1: What are the DT rules that define the characteristics of students with low, moderate, and high performance in the English subject respectively?

  • RQ2: What are the DT rules that define the characteristics of students with low, moderate, and high performance in the Mathematics subject respectively?

  • RQ3: What are the main predictors of students’ English subject performance using DT, NN, SVM, and NB classification techniques?

  • RQ4: What are the main predictors of students’ Mathematics subject performance using DT, NN, SVM, and NB classification techniques?

  • RQ5: Which predictive model has the highest predictive accuracy for students’ performance in English and Mathematics subjects respectively?

  • RQ6: What is the relationship between students’ performance in English and Mathematics subjects?

2 Methods & materials

2.1 Research design

Adapting to the Knowledge Discovery in Database (KDD) process proposed by Fayyad et al. (1996), the research process was divided into five main phases which are (i) problem identification and data selection; (ii) data preparation and pre-processing; (iii) data transformation; (iv) DM model, and (v) data interpretation.

Firstly, the problem identification was accomplished through the review of EDM literature. The researcher attempted to predict students’ performance in English and Mathematics subjects as there is only a handful of EDM-based predictive studies on both subjects, specifically, and in this region, generally. For example, Lau et al. (2019) employed artificial neural networks, a DM technique, to predict students’ academic performance in a Chinese university based on their entrance examination results, which include English and Mathematics. Saleh et al. (2021) used DM techniques to predict Libyan high school students' performance based on their grades in Mathematics and English subjects. Moreover, no known EDM study examines the relationship between both subjects.

The data collected were cleaned and pre-processed as there is a possibility that the data contained errors, such as missing values. This process ensured that students’ information was complete. Then, students’ data were transformed. In the data transformation phase, the non-numerical data was converted to the numerical format. This is to improve the prediction effectiveness (Bergquist et al., 2019).

The fourth phase is creating a DM model. This phase explains how to predict student performance using DM techniques. The DM model was developed using training and testing with a ratio of 8:2, which means 80% of the data was used for training while 20% was used for testing. The training and testing data are divided into an 8:2 ratio. By doing so, the predictive accuracy of the algorithms reaches the highest value. If the ratio is changed to 7:3 or 6:4, the predictive accuracy will decrease. The 8:2 split draws its justification from the well-known Pareto principle (Joseph, 2022). The training set was what the DM algorithms trained on, and the test set was used to see how well classifiers performed on unseen data. Figure 2 illustrates the process of developing the DM model in Orange software to implement the prediction.

Fig. 2
figure 2

DM model development process

Then, in the data interpretation phase, the confusion matrix as shown in Table 1 was generated to show how the data was classified, either correctly or incorrectly (Sokolova & Lapalme, 2009). The matrix contains information about actual and predicted values. Mueen et al. (2016) define the terms as follows:

  • TP: Number of True Positives (the number of instances that are positive and predicted to be positive)

  • FP: Number of False Positives (the number of instances that are negative but predicted to be positive)

  • FN: Number of False Negatives (the number of instances that are positive but predicted to be negative)

  • TN: Number of True Negatives (the number of instances that are negative and predicted to be negative)

Table 1 Confusion matrix

Based on the confusion matrix, the following measures (Goutte & Gaussier, 2005) were used to evaluate the prediction performance of each algorithm.

  • Area Under Curve expresses how well the classifier distinguishes between positive and negative samples.

  • Accuracy is the overall proportion of the total number of correct predictions.

  • Accuracy (%) = (TP+TN)/(TP+FP+FN+TN) x 100

  • Precision is the proportion of correct positive observations.

  • Precision (%) = TP/(TP+FP) x 100

  • Recall is the proposition of actual positive that is correctly predicted as positive.

  • Recall (%) = TP/(TP+FN) x 100

  • F-score is a measure of a test’s accuracy. F-score is the weighted harmonic mean of the precision and recall.

  • F-Score (%) = 2*(precision * recall)/(precision + recall) X 100

DT algorithm in Orange was used to identify prominent DT rules that define the characteristics of students with low, moderate, or high performance in English and Mathematics respectively. DT algorithm was used because it provides a tree presentation that is easy to read and interpret without requiring statistical knowledge (Berhanu & Abera, 2015). The tree diagram was confined to only four levels to incorporate only the most significant attributes affecting students’ performance. Figure 3 shows the process of predicting students’ performance using DT.

Fig. 3
figure 3

Predicting students’ performance process using DT

Orange was employed to identify the main predictors of students’ performance in English and Mathematics using different DM techniques. To find out the impact of attributes on the prediction of English and Mathematics performance using different DM techniques, the ‘information gain’ scoring method in Orange was used. This method shows the rank of attributes that influence students’ performance. The rank of the attributes provided insight into the relationship between students’ performance in English and Mathematics subjects. In addition, Spearman’s rank correlation, rs was used to measure the strength of the linear relationship between these subjects using SPSS software (Bakhtiar et al., 2021).

2.2 Sample

This study involved the use of archival data of Form Four students from four different urban secondary schools after obtaining consent from the Ministry of Education and respective school principals. These schools were purposively chosen and fulfilled four criteria. First, all four schools are public schools. Second, these schools are within the accessible population, which are schools located in the center of Georgetown in Penang, Malaysia. Third, these schools kept the needed archival data, and fourth, the school principals granted permission to use the archival data of their respective schools. Initially, the data of all 202 Form Four students in 2019 were gathered from all schools. These students sat for their SPM examination in early 2021. However, some of the data contained missing values. Hence, only 159 Form Four students’ data were retained for analysis after data cleaning.

DM techniques are often used to extract implicit, hidden, and potentially useful information from a large database. The use of relatively small sample size is due to the limited data available from the accessible population. Hence, the results of this study are not meant to be generalized to other populations. The study by Sordo and Zeng (2005) that compared the performance of three learning algorithms (NB, SVM, and DT) using training sets with sizes ranging from 150 to 8500 cases shows that the three classifiers perform reasonably well with a small number of cases, although NB tends to perform better than SVM and DT with small datasets. Studies such as Chapman and Haug (1999) who used a dataset size of 150, Meng and Zhao (2015) who used a dataset size of 11, and Wilcox and Hripcsak (1999) who used a dataset size of 200 are evidence of the feasibility to apply DM techniques on small dataset sizes. Algarni (2016) also pointed out that as opposed to traditional research methods that require a large dataset size, DM can use a limited dataset size to build a prediction model. The data of Form Four students in 2019 were employed and these students sat for their MCE examination in early 2021.

2.3 Data collection

Table 2 shows the data required for each student.

Table 2 List of data required

3 Results & discussion

3.1 Rules defining students’ of different performance levels in English and Mathematics

Referring to RQ1 and RQ2, the DT algorithm has produced key rules that define the characteristics of low, moderate, and high-performing students in English and Mathematics respectively. DT algorithm was chosen because it is more powerful than the other DM algorithms in terms of classification (Charbuty & Abdulazeez, 2021). Furthermore, it can handle big datasets, and the visual representation of DT makes it easy for the researcher to grasp the results which are vital to determining students’ characteristics (Mühlbacher et al., 2017). These rules can enlighten stakeholders, notably teachers, on the appropriate teaching approaches or interventions particularly to increase the performance of at-risk students (García-Jiménez et al., 2020).

  • RQ1: What are the DT rules that define the characteristics of students with low, moderate, and high performance in the English subject respectively?

Figure 4 shows the tree diagram of students who are expected to have low performance in English, involving 24 out of 159 students (15.1%). From the figure, two prominent rules that define students with the highest probability of low performance in English are identified. These rules are depicted as below:

  • IF (English Final Exam = Low) AND (English Mid-term Exam = Low) AND (Diversity ≤ 7) THEN MCE English = Low.

  • IF (English Final Exam = Low) AND (English Mid-term Exam = Low) AND (Diversity > 7) THEN MCE English = Low.

Fig. 4
figure 4

Tree diagram of students with low English performance

Based on the above DT rules, three dominant attributes are involved, which are Form Four final and mid-term examinations and diversity. In the first rule with the highest probability, students who had low performance in English and did not receive a high score of diversity (below 7 points) are more likely to have low performance in English for MCE. According to Yung and Cai (2020), past English examination results can generally predict students’ future performance in the subject. Krupat et al. (2017) posited that students with low performance in previous academic examinations are at risk as there is a strong relationship between past and future academic performance. Moreover, students with low or moderate diversity levels indicate their preference toward routines, are less flexible, moderately elastic, and are not interested in trying new things. Such preference may affect their ability to answer higher-order thinking MCE questions correctly. This shows that personality traits (diversity) can be used to predict students’ performance. This is supported by Miller-Matero et al. (2018) who posited a relationship between personality traits and students’ performance.

Figure 5 shows the tree diagram of students who are expected to have moderate performance in English, involving 70 out of 159 students (44.0%). From the figure, three prominent rules that define students with the highest probability of moderate performance in English are identified. These rules are depicted as below:

  • IF (English Final Exam = Moderate OR High) AND (Mathematics Final Exam = Low OR Moderate) AND (Mathematics Mid-term Exam = Low) THEN MCE English = Moderate.

  • IF (English Final Exam = Moderate OR High) AND (Mathematics Final Exam = Low OR Moderate) AND (Mathematics Mid-term Exam = Moderate OR High) THEN MCE English = Moderate.

  • IF (English Final Exam = Low) AND (English Mid-term Exam = Moderate OR High) AND (Diversity > 8) THEN MCE English = Moderate.

Fig. 5
figure 5

Tree diagram of students with moderate English performance

Five attributes are involved in the rules for moderate MCE English performance: Form Four English final and mid-term examinations, Form Four Mathematics final and mid-term examinations, and diversity. According to the rule with the highest probability, students who have a moderate or high performance in the English final examination, low or moderate performance in the Form Four Mathematics final examination, and low performance in the Form Four Mathematics mid-term examination, are likely to perform moderately in English for MCE. The rule with the highest probability for moderate performance contains only past academic performance. This finding is aligned with Makhtar et al. (2017) who found that past academic performance solely can predict secondary school students’ performance.

Figure 6 shows the tree diagram of students who are expected to have high performance in English, involving 65 out of 159 students (40.9%). From the figure, two prominent rules that define students with the highest probability of high performance in English are identified. These rules are depicted as below:

  • IF (English Final Exam = Moderate OR High) AND (Mathematics Final Exam = High) AND (Mother’s educational level = Bachelor OR Master OR Ph.D.) THEN MCE English = High.

  • IF (English Final Exam = Moderate OR High) AND (Mathematics Final Exam = High) AND (Mother’s educational level = MCE OR Diploma) THEN MCE English = High.

Fig. 6
figure 6

Tree diagram of students with high English performance

Three attributes are influential in the rules for high MCE English performance: Form Four English and Mathematics final examinations and the mother’s educational level. Based on the rule with the highest probability, students who had moderate or high performance in Form Four English final examination, high performance in Form Four Mathematics final examination, and whose mothers have a Bachelor, Master, or Ph.D. degree are probably going to perform highly in MCE English. Based on Sulphey et al. (2018), students who had excellent grades in previous examinations are more likely to perform well in future examinations. Idris et al. (2020) revealed that mothers with high educational levels positively contribute to their children’s academic performance.

  • RQ2: What are the DT rules that define the characteristics of students with low, moderate, and high performance in the Mathematics subject respectively?

Figure 7 shows the tree diagram of students who are expected to have low performance in Mathematics, involving 24 out of 159 students (15.1%). From the figure, one prominent rule (see below) that defines students with the highest probability of low Mathematics performance is identified.

  • IF (Mathematics Final Exam = Low) AND (Mathematics Mid-term Exam = Low) AND (Father’s educational level = MCE OR Bachelor OR Master OR Ph.D.) THEN MCE Mathematics = Low.

Fig. 7
figure 7

Tree diagram of students with low Mathematics performance

The three dominant attributes involved in the rules are Form Four Mathematics final and mid-term examination, and the father’s educational level. Based on the rule, students who had low performance in Mathematics examinations and their father’s educational level is MCE, Bachelor, Master, or Ph.D. degree, indicating that either they either had an MCE certificate or studied post-secondary education, are more likely to have low performance in MCE Mathematics. A range of fathers’ educational levels is identified in this rule, and this may indicate the lack of involvement of fathers in their children’s education that contributes to low performance. Wong and Wong (2019) discovered that there is a significant relationship between low Mathematics performance and later examination performance. They argued that students have low performance due to a lack of enthusiasm for the subject. Muller (2018) also reported a substantial association between parents’ educational levels and their children’s academic performance.

Figure 8 shows the tree diagram of students who are expected to have moderate Mathematics performance, involving 36 out of 159 students (22.6%). From the figure, three prominent rules that define students with the highest probability of moderate performance in Mathematics are identified. These rules are depicted as below:

  • IF (Mathematics Final Exam = Low) AND (Father’s educational level = MCE OR Bachelor OR Master OR Ph.D.) AND (Mathematics Mid-term Exam = Moderate or High) THEN MCE Mathematics = Moderate.

  • IF (Mathematics Final Exam = Low) AND (Father’s educational level = Diploma) AND (Self-criticism ≤ 3) THEN MCE Mathematics = Moderate.

  • IF (Mathematics Final Exam = Moderate) AND (Diversity > 9) THEN MCE Mathematics = Moderate.

Fig. 8
figure 8

Tree diagram of students with moderate Mathematics performance

The rules involve five attributes: Form Four Mathematics final and mid-term examinations, the father’s educational level, diversity, and self-criticism. Based on the rule with the highest probability, if students have low performance in Form Four Mathematics final examination, their fathers’ educational level is MCE, Bachelor, Master, or Ph.D. degree, and have moderate or high performance in Form Four Mathematics mid-term examination, they are probably going to perform moderately in Mathematics for MCE. This finding aligns with the study by Chand et al. (2021) who discovered that past Mathematics performance of upper secondary school students can be used to predict future Mathematics performance. Li and Qiu (2018) reported that parents’ educational level affects students’ performance. However, the range of fathers’ educational levels discovered in this rule implies equal or lack of fathers’ involvement in educating their children.

Figure 9 shows the tree diagram of students who are expected to have high performance in Mathematics, involving 109 out of 159 students (68.6%). From the figure, three prominent rules are identified. These rules are depicted as below:

  • IF (Mathematics Final Exam = High) THEN MCE Mathematics = High.

  • IF (Mathematics Final Exam = Moderate) AND (Diversity ≤ 9) THEN MCE Mathematics = High.

  • IF (Mathematics Final Exam = Low) AND (Father’s educational level = Diploma) AND (Self-criticism > 3) THEN MCE Mathematics = High.

Fig. 9
figure 9

Tree diagram of students with high Mathematics performance

Four attributes are dominant in the above rules: Form Four Mathematics final examination, diversity, the father’s educational level, and self-criticism. Based on the rule with the highest probability, students with high performance in the Form Four Mathematics final examination most probably will achieve high performance in MCE Mathematics. This finding is supported by Lipnevich et al. (2016) who found that previous Mathematics performance is substantially correlated with present Mathematics performance. Moreover, McKee and Caldarella (2016) concluded that early Mathematics performance greatly predicts later performance.

3.1.1 Predictors of students’ performance in English and Mathematics

  • RQ3: What are the main predictors of students’ English subject performance using DT, NN, SVM, and NB classification techniques?

Figures 10, 11, 12, and 13 show the process of identifying the main predictors of English performance using DT, NN, SVM, and NB respectively in Orange.

Fig. 10
figure 10

Process of identifying main predictors using DT in orange

Fig. 11
figure 11

Process of identifying main predictors using NN in orange

Fig. 12
figure 12

Process of identifying main predictors using SVM in orange

Fig. 13
figure 13

Process of identifying main predictors using NB in orange

Table 3 shows the impact of the top five attributes on the prediction of English performance using DT, NN, SVM, and NB.

Table 3 Top five attributes on the prediction of English performance using DT, NN, SVM, and NB

Referring to Table 3, for DT, the Form Four English final examination has the most impact on the prediction with an impact of 0.225, followed by the Form Four Mathematics final examination (0.202) and the Form Four English mid-term examination (0.165). For NN, the Form Four English final examination is the most dominant in the prediction with an impact of 0.168, followed by the Form Four Mathematics final examination (0.132) and the Form Four English mid-term examination (0.117). For SVM, the Form Four English final examination is the most influential in the prediction with an impact of 0.152, followed by the Form Four Mathematics final examination (0.121) and the Form Four English mid-term examination (0.104). For NB, the Form Four English final examination is the most prominent in the prediction with an impact of 0.168, followed by the Form Four Mathematics final examination (0.132) and the Form Four English mid-term examination (0.117).

  • RQ4: What are the main predictors of students’ Mathematics subject performance using DT, NN, SVM, and NB classification techniques?

Similar procedures described in RQ3 were repeated for RQ4 to identify the main predictors of Mathematics performance using DT, NN, SVM, and NB in Orange. Table 4 shows the top five attributes derived from each classifier.

Table 4 Top five attributes on the prediction of Mathematics performance using DT, NN, SVM, and NB

Referring to Table 4, for DT, the Form Four Mathematics final examination has the most impact on the prediction with an impact of 0.341, followed by the Form Four Mathematics mid-term examination (0.235) and Form Four English final examination (0.124). For NN, the Form Four Mathematics final examination is the most dominant in the prediction with an impact of 0.223, followed by the Form Four Mathematics mid-term examination (0.151) and Form Four English final examination (0.093). For SVM, the Form Four Mathematics final examination is the most influential in the prediction with an impact of 0.205, followed by the Form Four Mathematics mid-term examination (0.135) and the Form Four English final examination (0.082). For NB, the Form Four Mathematics final examination is the most prominent in the prediction with an impact of 0.223, followed by the Form Four Mathematics mid-term examination (0.151) and the Form Four English final examination (0.093).

For RQ3 and RQ4, it has been discovered that past academic performance is the most influential attribute to predict students’ performance in English and Mathematics. The top three attributes that predict English performance were consistently derived from the four classification techniques, which are Form Four English final examination, Form Four Mathematics final examination, and Form Four English mid-term examination. Similarly, Form Four Mathematics final examination, Form Four Mathematics mid-term examination, and Form Four English final examination are the top three attributes for predicting Mathematics performance based on the four classification techniques.

This is in line with the review by Shahiri and Husain (2015) who found that academic results are the most important attribute to predict students’ performance. The findings are supported by Saa et al. (2019) as they identified that the most common aspects in predicting students’ performance are students’ previous grades and internal assessments. Next, Makhtar et al. (2017) postulated that students’ performance can be solely predicted using academic records. Furthermore, this attribute is the most employed in predicting students’ performance since it has a quantitative value that is both tangible and appropriate for predicting students’ performance (López-Zambrano et al., 2021).

Referring to RQ1 to RQ4, the trees and predictors of students’ performance in English and Mathematics incorporate some demographic attributes such as the parents’ educational levels. This finding demonstrates that past academic performance is not the only attribute that influences students’ performance. This finding is backed up by Kiu (2018) who concludes that parents’ educational level is significant as it impacts their children’s performance. The parents’ marital status and ethnicity did not appear in any tree diagrams and were only derived once among the eight predictions. Hence, these attributes are considered insignificant. The parents’ occupational status is found to have no bearing on the predictions. This is supported by findings by Etemadpour et al. (2011) which posited that parents’ educational attainment is more pertinent than their occupational status in determining students’ academic performance.

Gender is another demographic attribute that is recognized as one of the top five predictors of English performance by the NN classifier. This finding is in line with Parajuli and Thapa’s findings (2017). They discovered that there were considerable gender differences in students’ academic performance, with female students outperforming their male counterparts. Ioannis and Maria (2019) show that female students are more likely to score better in several subjects than male students.

Psychological attributes such as diversity and self-criticism are found in the trees and predictors. Khairy et al. (2018) utilized a similar method (the results of a psychometric test) to predict secondary school students’ performance. However, according to the findings, most psychological attributes play a minor role in predicting students’ performance (Verma & Yadav, 2020). In short, past academic performance is crucial to predicting students’ performance in English and Mathematics and certain demographics and psychological attributes are also influencing the predictions.

3.2 Predictive model with the highest accuracy

  • RQ5: Which predictive model has the highest predictive accuracy for students’ performance in English and Mathematics subjects respectively?

Table 5 shows the summary of measures on DM classifiers for English performance prediction.

Table 5 Summary of measures on DM classifiers for English performance prediction

Based on Table 5, DT outperforms other DM classifiers for all measures in terms of predicting students’ performance in English. DT shows the highest predictive accuracy among the DM models, with an accuracy of 87.1%, followed by SVM, NB, and NN. This finding is consistent with findings by Nawai et al. (2021) who found that DT surpassed other DM classifiers in terms of predictive performance. It may be argued that DT is the most dominant classifier since it creates classification rules that apply to both nominal and categorical data (Shetu et al., 2021). In addition, DT more closely resembles human decision-making than other DM approaches (Burkart & Huber, 2021).

Table 6 shows the summary of measures on DM classifiers for Mathematics performance prediction.

Table 6 Summary of measures on DM classifiers in Mathematics performance prediction

Referring to Table 6, although DT and NB have similar highest predictive accuracy (83.9%), a further examination into the Area Under Curve, Precision, and F1-Score shows NB has surpassed DT and other DM classifiers in the prediction of students’ Mathematics performance. Saritas and Yasar (2019) have discovered that the NB classifier is highly effective in some applications. In Chandrasekar and Qian (2016), the prediction accuracy of the NB classifier improved by using and combining appropriate data. In comparison with other DM algorithms, Shinde and Prasad (2017) posited that NB has tremendous performance in real-time predictions, and it can also perform well in multi-class predictions (Ahuja et al., 2020). In short, NB has distinct advantages in the prediction task (Amra & Maghari, 2017).

This study reveals DT and NB as the two top classifiers while NN and SVM are the two classifiers with the lowest accuracy in predicting students’ English and Mathematics performance respectively. Compared with the meta-analysis on the accuracy of classification techniques that were used to predict students’ performance in studies from the year 2002 to 2015 by Shahiri and Husain (2015), the analysis reveals slightly different results. Shahiri and Husain (2015) reported that NN shows the highest prediction accuracy at 98% and DT is second at 91%. SVM and KNN show similar accuracy, which is 83%, while NB has the lowest prediction accuracy at 76%. Hence, the recent research provides new insight into the prediction accuracy of DM classifiers.

3.3 Relationship between students’ English and Mathematics performance

  • RQ6: What is the relationship between students’ performance in English and Mathematics subjects?

For RQ6, it can be concluded that there is a relationship between students’ performance in English and Mathematics. Form Four Mathematics final and mid-term examinations are among the most important attributes when employing four algorithms to predict students’ performance in English. The Form Four English final examination is also among the top attributes in predicting students’ performance in Mathematics using DT, NN, SVM, and NB. However, Form Four English mid-term examination has less impact on students’ performance in Mathematics. To measure the relationship between these subjects, SPSS software was used to apply Spearman’s rank correlation, rs. Table 7 shows the correlation between English and Mathematics by using Spearman’s rank correlation, rs. There is a low positive correlation between Form Four Mathematics mid-term (0.345) and final examination (0.391) with MCE English performance. There is also a low positive correlation between the Form Four English mid-term (0.285) and final examination (0.367) with MCE Mathematics performance. The MCE English and MCE Mathematics performances are moderately correlated (0.433).

Table 7 Results of Spearman’s rank correlation

Based on the weight of attributes identified by DM classifiers and Spearman’s rank correlation, it can be deduced that English and Mathematics performance are interrelated. The findings are consistent with Henry et al. (2014), which also investigated the relationship between English and Mathematics performance. They discovered that English is a statistically significant predictor of Mathematics performance, and Mathematics performance improves simultaneously with English performance. Next, Rudd and Honkiss (2020) discovered that there is an indirect mediation relationship between English proficiency and Mathematics performance. The examination of the predictors of the four DM classifiers provides interesting insights into the relationship between these two subjects in addition to insights gained via the conventional statistical methods.

4 Conclusion

The study provides a deeper understanding of attributes that influence the prediction of students’ performance in English and Mathematics. It is critical to include past academic performance because it substantially impacts the prediction. Besides past academic performance, certain students’ demographics such as parents’ educational level and gender as well as psychological factors such as diversity and self-criticism also contribute to the prediction, at least in the context of this study.

This study could help academics determine which DM algorithm is most effective in predicting students’ performance. According to the findings, DT and NB algorithms are the best in predicting students’ performance in English and Mathematics, respectively. DT can reveal the characteristics of students with low, moderate, and high performance and provides good predictive performance for both subjects. Prediction of students’ academic performance and the specific characteristics or attributes that define students of different levels of performance in these subjects enables early interventions and personalized support to be given to students for optimizing their performance thus, alleviating the issue of the high failure rate of both subjects in MCE.

The performance in English and Mathematics is found to be interrelated. This finding provides additional insights as over the decade, researchers attempt to gain insights into the relationship between English and Mathematics (Bagceci et al., 2014; Rambely et al., 2013). Limitations of this study include the small sample size employed which limits the generalizability of the results. There is also a lack of diversity in terms of chosen students’ religions and ethnicities as well as the type and locality of chosen schools as all chosen schools are urban public schools. In addition, this study is confined to four DM techniques and two subjects. Future studies may include a larger number of students from various types of schools of diverse localities. Future studies may also involve other types of data mining techniques and explore other subjects as these may reveal previously undiscovered information.