Abstract
Software fault prediction is a process, which helps to identify fault prone modules in early stages of software development. It also helps in improving the software quality with optimized effort and cost. Least Square Support Vector Machines (LSSVM) have been explored in problems related to classification. The aim of this paper is to develop and compare, software fault prediction models using LSSVM with Linear, Polynomial and Radial Basis Function (RBF) kernels. The proposed models classify a software module as faulty or non faulty by taking software metrics such as Halstead software metrics as input. Experiments on fifteen open source projects are performed to study the impact of the proposed models. The models are evaluated using Accuracy, F-measure and ROC AUC as the performance measures. The experimental results shows that, LSSVM with polynomial kernel perform better than LSSVM with linear kernel and similar to RBF kernel, and the models developed using LSSVM improve the prediction accuracy of software fault prediction, compared to the most frequently used models.
Similar content being viewed by others
References
Bisi, M.; Goyal, N.K.: Early prediction of software fault-prone module using artificial neural network. Int. J. Perform. Eng. 11(1), 43–52 (2015)
Burges, C.J.: A tutorial on support vector machines for pattern recognition. Data Min. Knowl. Disc. 2(2), 121–167 (1998)
Capgemini. Sogeti: world quality report 2015–16. Technical report, Capgemini (2015)
El Emam, K.; Benlarbi, S.; Goel, N.; Rai, S.N.: Comparing case-based reasoning classifiers for predicting high risk software components. J. Syst. Softw. 55(3), 301–320 (2001)
Elish, K.O.; Elish, M.O.: Predicting defect-prone software modules using support vector machines. J. Syst. Softw. 81(5), 649–660 (2008)
Houthuys, L.; Langone, R.; Suykens, J.A.: Multi-view least squares support vector machines classification. Neurocomputing 282, 78–88 (2018)
Jiang, Y.; Cuki, B.; Menzies, T.; Bartlow, N.: Comparing design and code metrics for software quality prediction. In Proceedings of the 4th International Workshop on Predictor Models in Software Engineering, pp. 11–18 (2008).
Kamei, Y.; Monden, A.; Matsumoto, K.-i.:Empirical evaluation of svm-based software reliability model. In Proceedings of 5th ACM-IEEE International Symposium on Empirical Software Engineering (ISESE2006), volume 2, pp. 39–41. (2006).
Kohavi, R.; et al.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Ijcai, vol. 14, pp. 1137–1145. Montreal, Canada (1995)
Koru, A.G.; Tian, J.: An empirical comparison and characterization of high defect and high complexity modules. J. Syst. Softw. 67(3), 153–163 (2003)
Kumar, L.; Sripada, S.K.; Sureka, A.; Rath, S.K.: Effective fault prediction model developed using least square support vector machine (lssvm). J. Syst. Softw. 137, 686–712 (2018)
Lessmann, S.; Baesens, B.; Mues, C.; Pietsch, S.: Benchmarking classification models for software defect prediction: A proposed framework and novel findings. IEEE Trans. Software Eng. 34(4), 485–496 (2008)
Malhotra, R.: A systematic review of machine learning techniques for software fault prediction. Appl. Soft Comput. 27, 504–518 (2015)
Menzies, T.; DiStefano, J.; Orrego, A.; Chapman, R.: Assessing predictors of software defects. In Proceedings of Workshop Predictive Software Models (2004)
Nadeau, C.; Bengio, Y.: Inference for the generalization error. Adv. Neural Inform. Process. Syst. pp 307–313 (2000)
Rathore, S.S.; Kumar, S.: A study on software fault prediction techniques. Artif. Intell. Rev. 51(2), 255–327 (2019)
Sayyad Shirabad, J.; Menzies, T.: The PROMISE Repository of Software Engineering Databases. University of Ottawa, Canada, School of Information Technology and Engineering (2005)
Sharma, D.; Chandra, P.: Software fault prediction using machine-learning techniques. In Smart Computing and Informatics, pp. 541–549. Springer. (2018).
Sharma, D.; Chandra, P.: A comparative analysis of soft computing techniques in software fault prediction model development. Int. J. Inf. Technol. 11(1), 37–46 (2019)
Sharma, D.; Chandra, P.: Linear regression with factor analysis in fault prediction of software. J. Interdiscip. Math. 23(1), 11–19 (2020)
Sharma, P.; Sangal, A.L.: Building and testing a fuzzy linguistic assessment framework for defect prediction in asd environment using process-based software metrics. Arab. J. Sci. Eng. 1–25 (2020)
Suykens, J.A.; Vandewalle, J.: Least squares support vector machine classifiers. Neural Process. Lett. 9(3), 293–300 (1999)
Turhan, B.; Bener, A.: Analysis of naive bayes’ assumptions on software fault data: An empirical study. Data Knowl. Eng. 68(2), 278–290 (2009)
Wang, J.; Peng, H.: Keyphrases extraction from web document by the least squares support vector machine. In The 2005 IEEE/WIC/ACM International Conference on Web Intelligence, 2005, pp 293–296. IEEE. (2005).
Xing, F.; Guo, P.; Lyu, M.R.: A novel method for early software quality prediction based on support vector machine. In 16th IEEE International Symposium on Software Reliability Engineering (ISSRE’05). (2005).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kulamala, V.K., Kumar, L. & Mohapatra, D.P. Software Fault Prediction Using LSSVM with Different Kernel Functions. Arab J Sci Eng 46, 8655–8664 (2021). https://doi.org/10.1007/s13369-021-05643-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13369-021-05643-2