Attempts Prediction by Missing Data Imputation in Engineering Degree

  • Esteban Jove
  • Patricia Blanco-Rodríguez
  • José Luis Casteleiro-Roca
  • Javier Moreno-Arboleda
  • José Antonio López-Vázquez
  • Francisco Javier de Cos Juez
  • José Luis Calvo-Rolle
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 649)


Student performance and its evaluation are important challenges in education. Students often need several attempts to pass a given subject in the curriculum, for a variety of reasons, and in this context missing data hampers later analysis and the conclusions that can be drawn from it. Consequently, data imputation must be performed to replace the missing data with estimated values. This paper presents a comparison between two data imputation methods developed by the authors in previous research: the Adaptive Assignation Algorithm (AAA), based on Multivariate Adaptive Regression Splines (MARS), and the Multivariate Imputation by Chained Equations (MICE) methodology. The results show that both methods perform well, especially the AAA algorithm.
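As a rough illustration of what replacing missing data with estimated values can look like, the sketch below cycles through the columns of a table and regresses each incomplete column on the others, in the spirit of chained equations. It is not the authors' AAA or MICE implementation: it swaps in plain least-squares regression (no MARS, no random draws, no multiple imputations), and the function name `chained_regression_impute` and the toy data are assumptions made only for this example.

```python
import numpy as np

def chained_regression_impute(X, n_iter=10):
    """Fill NaNs by cycling per-column linear regressions on the other columns.

    Simplified, deterministic stand-in for chained-equations imputation.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    filled = X.copy()
    # Initial guess: mean of the observed entries in each column.
    col_means = np.nanmean(X, axis=0)
    filled[missing] = np.take(col_means, np.where(missing)[1])
    for _ in range(n_iter):
        for j in range(X.shape[1]):
            miss_j = missing[:, j]
            if not miss_j.any():
                continue  # nothing to impute in this column
            # Predictors: intercept plus all other columns (current estimates).
            others = np.delete(filled, j, axis=1)
            A = np.column_stack([np.ones(len(others)), others])
            # Fit only on rows where column j was actually observed.
            coef, *_ = np.linalg.lstsq(A[~miss_j], filled[~miss_j, j], rcond=None)
            # Overwrite only the entries that were missing.
            filled[miss_j, j] = A[miss_j] @ coef
    return filled

# Hypothetical toy table: two numeric variables per student, one value missing.
toy = np.array([[1.0, 3.0],
                [2.0, 5.0],
                [3.0, 7.0],
                [4.0, np.nan]])
print(chained_regression_impute(toy))
```

Observed entries are never modified; only the positions that were originally missing are re-estimated on each pass.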


Keywords: Student performance, Data imputation, MARS, MICE, AAA



The authors greatly appreciate the support of the Spanish Ministry of Economy and Competitiveness through grant AYA2014-57648-P, and of the regional Ministry of Economy and Employment through grant FC-15-GRUPIN14-017.



Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Esteban Jove (1)
  • Patricia Blanco-Rodríguez (2)
  • José Luis Casteleiro-Roca (1)
  • Javier Moreno-Arboleda (3)
  • José Antonio López-Vázquez (1)
  • Francisco Javier de Cos Juez (4)
  • José Luis Calvo-Rolle (1)

  1. Department of Industrial Engineering, University of A Coruña, Ferrol, Spain
  2. Department of Construction and Manufacturing Engineering, University of Oviedo, Gijón, Spain
  3. Faculty of Mines, University of Colombia, Bogotá, Colombia
  4. Department of Mining Exploitation, University of Oviedo, Oviedo, Spain
