Springer Nature is making SARS-CoV-2 and COVID-19 research free. View research | View latest news | Sign up for updates

A methodology for detecting relevant single nucleotide polymorphism in prostate cancer with multivariate adaptive regression splines and backpropagation artificial neural networks

  • 172 Accesses

  • 2 Citations


The objective of the present paper is to model the genetic influence on prostate cancer with multivariate adaptive regression splines (MARS) and artificial neural networks (ANNs) techniques for classification. These models will be able to classify subjects that have cancer according to the values of the selected proteins from the genes selected with the models as most relevant. Subjects are selected as cases and controls from the MCC-Spain database and represent a heterogeneous group. Multivariate adaptive regression splines models allow to select a set of the most relevant proteins from the database. These models were trained in nine different degrees and chosen regarding its performance and complexity. Artificial neural networks models were trained on with data restricted to the most significant variables. The performance of both types of models was analyzed in terms of the area under the curve of the receiver operating characteristics curve. The ANN technique resulted in a model with AUC of 0.62006, while for MARS technique, the value was 0.569312 in the best situation. Then, the artificial neural network model obtained can determine whether a patient suffers prostate cancer significantly better than MARS models and with high rate of success. The best model presented was based on support vector machines, reaching values of AUC of 0.65212.

This is a preview of subscription content, log in to check access.

Fig. 1
Fig. 2
Fig. 3


  1. 1.

    Center MM, Jemal A, Lortet-Tieulent J, Ward E, Ferlay J, Brawley O, Bray F (2012) International variation in prostate cancer incidence and mortality rates. Eur Urol 61:1079–1092

  2. 2.

    Ferlay J, Soerjomataram I, Ervik M, Dikshit R, Eser S, Mathers C, Rebelo M, Parkin DM, Forman D, Bray F (2013) Cancer incidence and mortality worldwide: IARC Cancer Base No. 11. International Agency for Research on Cancer, Lyon. GLOBOCAN 2012 v1. 0, 2013

  3. 3.

    Di Sebastiano KM, Mourtzakis M (2014) The role of dietary fat throughout the prostate cancer trajectory. Nutrients 6:6095–6109

  4. 4.

    Allott EH, Masko EM, Freedland SJ (2013) Obesity and prostate cancer: weighing the evidence. Eur Urol 63:800–809

  5. 5.

    Huncharek M, Haddock KS, Reid R, Kupelnick B (2010) Smoking as a risk factor for prostate cancer: a meta-analysis of 24 prospective cohort studies. Am J Public Health 100:693–701

  6. 6.

    Liu Y, Hu F, Li D, Wang F, Zhu L, Chen W, Ge J, An R, Zhao Y (2011) Does physical activity reduce the risk of prostate cancer? A systematic review and meta-analysis. Eur Urol 60:1029–1044

  7. 7.

    Discacciati A, Wolk A (2014) Lifestyle and dietary factors in prostate cancer prevention. In: Cuzick J, Thorat M (eds) prostate cancer prevention. Springer, Berlin, pp 27–37

  8. 8.

    Gong Z, Neuhouser ML, Goodman PJ, Albanes D, Chi C, Hsing AW, Lippman SM, Platz EA, Pollak MN, Thompson IM et al (2006) Obesity, diabetes, and risk of prostate cancer: results from the prostate cancer prevention trial. Cancer Epidemiol Prev Biomark 15:1977–1983

  9. 9.

    Bosetti C, Rosato V, Gallus S, Cuzick J, La Vecchia C (2012) Aspirin and cancer risk: a quantitative review to 2011. Ann Oncol 23:1403–1415

  10. 10.

    Thompson IM, Goodman PJ, Tangen CM, Lucia MS, Miller GJ, Ford LG, Lieber MM, Cespedes RD, Atkins JN, Lippman SM (2003) others: The influence of finasteride on the development of prostate cancer. N Engl J Med 349:215–224

  11. 11.

    Andriole GL, Bostwick DG, Brawley OW, Gomella LG, Marberger M, Montorsi F, Pettaway CA, Tammela TL, Teloken C, Tindall DJ et al (2010) Effect of dutasteride on the risk of prostate cancer. N Engl J Med 362:1192–1202

  12. 12.

    Hamilton RJ, Freedland SJ (2008) Review of recent evidence in support of a role for statins in the prevention of prostate cancer. Curr Opin Urol 18:333–339

  13. 13.

    Jalving M, Gietema JA, Lefrandt JD, de Jong S, Reyners AKL, Gans ROB, de Vries EGE (2010) Metformin: Taking away the candy for cancer? Eur J Cancer 46:2369–2380

  14. 14.

    De Andrés J, Sánchez-Lasheras F, Lorca P, de Cos Juez FJ (2011) A hybrid device of Self Organizing Maps (SOM) and Multivariate Adaptive Regression Splines (MARS) for the forecasting of firms’ bankruptcy. Account Manag Inf Syst 10:351

  15. 15.

    Fernández JRA, Muñiz CD, Nieto PJG, de Cos Juez FJ, Lasheras FS, Roqueñí MN (2013) Forecasting the cyanotoxins presence in fresh waters: a new model based on genetic algorithms combined with the MARS technique. Ecol Eng 53:68–78

  16. 16.

    Friedman JH (1991) Multivariate adaptive regression splines. Ann Stat 19:1–67

  17. 17.

    Sekulic S, Kowalski BR (1992) MARS: a tutorial. J Chemom 6:199–216

  18. 18.

    Breiman L, Friedman JH, Olshen RA, Stone CJ (1993) Classification and Regression Trees, Wadsworth International Group, Belmon, CA, 1984. Case Descr Featur Subset Correct Missed FA Misclass 1:1–3

  19. 19.

    Antón JCÁ, Nieto PJG, de Cos Juez FJ, Lasheras FS, Viejo CB, Gutiérrez NR (2013) Battery state-of-charge estimator using the MARS technique. IEEE Trans Power Electron 28:3798–3805

  20. 20.

    Guzmán D, de Cos Juez FJ, Lasheras FS, Myers R, Young L (2010) Deformable mirror model for open-loop adaptive optics using multivariate adaptive regression splines. Opt Express 18:6492–6505

  21. 21.

    Nieto PJG, Torres JM, de Cos Juez FJ, Lasheras FS (2012) Using multivariate adaptive regression splines and multilayer perceptron networks to evaluate paper manufactured using Eucalyptus globulus. Appl Math Comput 219:755–763

  22. 22.

    Nieto PJG, Lasheras FS, de Cos Juez FJ, Fernández JRA (2011) Study of cyanotoxins presence from experimental cyanobacteria concentrations using a new data mining methodology based on multivariate adaptive regression splines in Trasona reservoir (Northern Spain). J Hazard Mater 195:414–421

  23. 23.

    Friedman JH, Roosen CB (1995) An introduction to multivariate adaptive regression splines. Springer, Heidelberg

  24. 24.

    Suárez Gómez SL, Gutiérrez CG, Rodríguez JDS, Rodríguez MLS, Lasheras FS, de Cos Juez FJ (2016) Analysing the performance of a tomographic reconstructor with different neural networks frameworks. In: International conference on intelligent systems design and applications, pp 1051–1060

  25. 25.

    Suárez Gómez SL, Santos Rodríguez JD, Iglesias Rodríguez FJ, de Cos Juez FJ (2017) Analysis of the temporal structure evolution of physical systems with the self-organising tree algorithm (SOTA): application for validating neural network systems on adaptive optics data before on-sky implementation. Entropy 19:103

  26. 26.

    Sánchez AS, Fernández PR, Lasheras FS, de Cos Juez FJ, Nieto PJG (2011) Prediction of work-related accidents according to working conditions using support vector machines. Appl Math Comput 218:3539–3552

  27. 27.

    Vilán JAV, Fernández JRA, Nieto PJG, Lasheras FS, de Cos Juez FJ, Muñiz CD (2013) Support vector machines and multilayer perceptron networks used to evaluate the cyanotoxins presence from experimental cyanobacteria concentrations in the Trasona reservoir (Northern Spain). Water Resour Manag 27:3457–3476

  28. 28.

    Basden AG, Atkinson D, Bharmal NA, Bitenc U, Brangier M, Buey T, Butterley T, Cano D, Chemla F, Clark P (2016) others: experience with wavefront sensor and deformable mirror interfaces for wide-field adaptive optics systems. Mon Not R Astron Soc 459:1350–1359

  29. 29.

    de Cos Juez FJ, Lasheras FS, Roqueñí N, Osborn J (2012) An ANN-based smart tomographic reconstructor in a dynamic environment. Sensors (Switzerland) 12:8895–8911.

  30. 30.

    Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105

  31. 31.

    Gardner MW, Dorling SR (1998) Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmos Environ 32:2627–2636

  32. 32.

    Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2:359–366

  33. 33.

    González-Gutiérrez C, Santos-Rodríguez JD, Díaz RÁF, Rolle JLC, Gutiérrez NR, de Cos Juez FJ (2016) Using GPUs to speed up a tomographic reconstructor based on machine learning. In: International conference on european transnational education, pp 279–289

  34. 34.

    Haykin S (1999) Neural networks: a comprehensive foundation. Prentice-Hall, Upper Saddle River

  35. 35.

    Rumelhart DE, Hinton GE, Williams RJ (1988) Learning representations by back-propagating errors. Cogn Model 5:1

  36. 36.

    Lasheras, F.S., Gómez, S.L.S., Garc’\ia, M.V.R., Krzemień, A., Sánchez, A.S.: Time series and artificial intelligence with a genetic algorithm hybrid approach for rare earth price prediction. (2017)

  37. 37.

    Ríos EMA, Crespo MMS, Sánchez AS, Gómez SLS, Lasheras FS (2017) Genetic algorithm based on support vector machines for computer vision syndrome classification. In: International Joint Conference SOCO’17-CISIS’17-ICEUTE’17 Le{ó}n, Spain, September 6–8, 2017, Proceeding, pp 381–390

  38. 38.

    Castaño-Vinyals G, Aragonés N, Pérez-Gómez B, Martin V, Llorca J, Moreno V, Altzibar JM, Ardanaz E, De Sanjosé S, Jiménez-Moleón JJ et al (2015) Population-based multicase-control study in common tumors in Spain (MCC-Spain): rationale and study design. Gac. Sanit. 29:308–315

Download references

Author information

Correspondence to Francisco Javier de Cos Juez.

Ethics declarations

Conflict of interest

The authors report no conflicts of interest in this work.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Sánchez Lasheras, J.E., González Donquiles, C., García Nieto, P.J. et al. A methodology for detecting relevant single nucleotide polymorphism in prostate cancer with multivariate adaptive regression splines and backpropagation artificial neural networks. Neural Comput & Applic 32, 1231–1238 (2020).

Download citation


  • Artificial neural networks
  • Multivariate adaptive regression splines
  • Prostate cancer
  • Single nucleotide polymorphism