Skip to main content

A Multiple Linear Regression-Based Approach to Predict Student Performance

  • Conference paper
  • First Online:
Advanced Intelligent Systems for Sustainable Development (AI2SD’2019) (AI2SD 2019)

Abstract

Predicting students’ academic outcome is useful for any educational institution that aims to ameliorate students’ performance. Based on the resulted predictions, educators can provide support to students at risk of failure. Data mining and machine learning techniques were widely used to predict students’ performance. This process called Educational data mining. In this work, we have proposed a methodology to build a student’ performance prediction model using a supervised machine learning technique which is the multiple linear regression (MLR). Our methodology consists of three major steps, the first step aims to analyze and preprocess the students’ attributes/variables using a set of statistical analysis methods, and then the second step consists in selecting the most important variables using different methods. The third step aims to construct different MLR models based on the selected variables and compare their performance using the k-fold cross-validation technique. The obtained results show that the model built using the variables selected from the Multivariate Adaptive Regression Splines method (MARS), outperforms the other constructed models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bakhshinategh, B., Zaiane, O.R., ElAtia, S., Ipperciel, D.: Educational data mining applications and tasks: a survey of the last 10 years. Educ. Inf. Technol. 23(1), 537–553 (2018)

    Google Scholar 

  2. Grömping, U.: Relative importance for linear regression in R: the package relaimpo. J. Stat. Softw. 17(1), 1–27 (2006)

    Google Scholar 

  3. Cortez, P., Silva, A.: Using data mining to predict secondary school student performance. In: Proceedings of 5th Annual Future Business Technology Conference (FUBUTEC 2008), pp. 5–12 (2008)

    Google Scholar 

  4. El Aissaoui, O., El Alami El Madani, Y., Oughdir, L., El Allioui, Y.: A fuzzy classification approach for learning style prediction based on web mining technique in e-learning environments. Educ. Inf. Technol. 1–17 (2018)

    Google Scholar 

  5. El Aissaoui, O., El Alami El Madani, Y., Oughdir, L., El Allioui, Y.: Combining supervised and unsupervised machine learning algorithms to predict the learners’ learning styles. Procedia Comput. Sci. 148, 87–96 (2019)

    Google Scholar 

  6. El Aissaoui, O., El Madani El Alami, Y., Oughdir, L., El Allioui, Y.: Integrating web usage mining for an automatic learner profile detection: a learning styles-based approach. In: 2018 International Conference on Intelligent Systems and Computer Vision (ISCV), pp. 1–6 (2018)

    Google Scholar 

  7. El Aissaoui, O., El Madani El Alami, Y., Oughdir, L., El Allioui, Y.: A hybrid machine learning approach to predict learning styles in adaptive E-learning system. Adv. Intell. Syst. Comput. 915, 772–786 (2019)

    Google Scholar 

  8. Shahiri, A.M., Husain, W., Rashid, N.A.: A review on predicting student’s performance using data mining techniques. Procedia Comput. Sci. 72, 414–422 (2015)

    Google Scholar 

  9. Aldowah, H., Al-Samarraie, H., Fauzy, W.M.: Educational data mining and learning analytics for 21st century higher education: a review and synthesis. Telemat. Inf. 37, 13–49 (2019)

    Google Scholar 

  10. Quadri1, M.M., Kalyankar, N.V.: Drop out feature of student data for academic performance using decision tree techniques. Glob. J. Comput. Sci. Technol. 10(2), 2–5 (2010)

    Google Scholar 

  11. Mishra, T., Kumar, D., Gupta, S.: Mining students’ data for prediction performance. In: 2014 Fourth International Conference on Advanced Computing & Communication Technologies, pp. 255–262 (2014)

    Google Scholar 

  12. Parack, S., Zahid, Z., Merchant, F.: Application of data mining in educational databases for predicting academic trends and patterns. In: 2012 IEEE International Conference on Technology Enhanced Education (ICTEE), pp. 1–4 (2012)

    Google Scholar 

  13. Helal, S., et al.: Predicting academic performance by considering student heterogeneity. Knowledge-Based Syst. 161, 134–146 (2018)

    Google Scholar 

  14. Osmanbegovic, E., Suljic, M.: Data mining approach for predicting student performance. Econ. Rev. J. Econ. Bus. 10(1), 3–12 (2012)

    Google Scholar 

  15. Jishan, S.T., Rashu, R.I., Haque, N., Rahman, R.M.: Improving accuracy of students’ final grade prediction model using optimal equal width binning and synthetic minority over-sampling technique. Decis. Anal. 2(1), 1 (2015)

    Google Scholar 

  16. Mayilvaganan, M., Kalpanadevi, D.: Comparison of classification techniques for predicting the performance of students academic environment. In: 2014 International Conference on Communication and Network Technologies, pp. 113–118 (2014)

    Google Scholar 

  17. Arsad, P.M., Buniyamin, N., Manan, J.A.: Prediction of engineering students’ academic performance using artificial neural network and linear regression: a comparison. In: 2013 IEEE 5th Conference on Engineering Education (ICEED), pp. 43–48 (2013)

    Google Scholar 

  18. Yang, S.J.H., Lu, O.H.T., Huang, A.Y.Q., Huang, J.C.H., Ogata, H., Lin, A.J.Q.: Predicting students’ academic performance using multiple linear regression and principal component analysis. J. Inf. Process. 26, 170–176 (2018)

    Google Scholar 

  19. Widyahastuti, F., Tjhin, V.U.: Predicting students performance in final examination using linear regression and multilayer perceptron. In: 2017 10th International Conference on Human System Interactions (HSI), pp. 188–192 (2017)

    Google Scholar 

  20. Dietrich, D., Heller, R., Yang, B., EMC Education Services: Data science and big data analytics : discovering, analyzing, visualizing and presenting data

    Google Scholar 

  21. Hoffman, J.I.E., Hoffman, J.I.E.: Multiple regression. In: Basic Biostatistics for Medical and Biomedical Practitioners, 2nd edn. pp. 525–560. Academic Press, Cambridge (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ouafae El Aissaoui .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

El Aissaoui, O., El Alami El Madani, Y., Oughdir, L., Dakkak, A., El Allioui, Y. (2020). A Multiple Linear Regression-Based Approach to Predict Student Performance. In: Ezziyyani, M. (eds) Advanced Intelligent Systems for Sustainable Development (AI2SD’2019). AI2SD 2019. Advances in Intelligent Systems and Computing, vol 1102. Springer, Cham. https://doi.org/10.1007/978-3-030-36653-7_2

Download citation

Publish with us

Policies and ethics