Abstract
In the testing phase of Software Development Life Cycle (SDLC), Software Defect Prediction (SDP) is one of the pivotal task which finds the modules that are more vulnerable to defects and therefore need substantial testing for the early identification of these defects. A lot of work has been done on Cross - Project Defect Prediction (CPDP) that aims to predict defects in the target project lacking in historical defect prediction data or having limited defect data to build an effective generalized defect prediction model. Mostly, CPDP approaches predict the defects in target project on the basis of similar metrics found between source and target project. This paper focuses on the prediction of defects using a Heterogeneous metric set such that no common metrics exist between the source and the target projects. In this paper, a systematic literature study has been done to quote the main findings about CPDP from year 2002 to 2019. The main purpose of this survey is to put forward the adequate content in front of computer science researchers for exploring the specific area and to provide various future directions in this field.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
J. Adv. Res. Comput. Sci. Softw. Eng. 3(8) (2013). ISSN: 2277128X
Han, D., Hoh, I. P., Kim, S., Lee, T., Nam, J.: Micro interaction metrics for defect prediction. In: Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering, New York, USA. ACM (2011)
D’Ambros, M., Lanza, M., Robbes, R.: Evaluating defect prediction approaches: a bench-mark and an extensive comparison. Empirical Softw. Eng. 17(4–5), 531–577 (2012)
Latih, R., Mulla, R., Na, L., Ahmed, P., Zarina, S., Christopher, W.: A study of mashup as a software application development technique with examples from an end-user programming perspective. J. Comput. Sci. 12, 1406–1415 (2010)
Bener, A.B., Menzies, T., Di Stefano, J., Turhan, B.: On the relative value of cross-company and within-company data for defect prediction. Empirical Softw. Eng. 14, 540–578 (2009)
Fu, W., Kim, S., Menzies, T., Nam, J., Tan, L.: Heterogeneous defect prediction. In: Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, ser. ESEC/FSE, pp. 508–519, New York, NY, USA. ACM (2015)
Baesens, B., Lessmann, S., Mues, C., Pietsch, S.: Benchmarking classification models for software defect prediction: a proposed framework and novel findings. IEEE Trans. Softw. Eng. 34(4), 485–496 (2008)
Devanbu, P., Posnett, D., Rahman, F.: Recalling the imprecision of cross-project defect prediction. In: Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering, New York, NY, USA. ACM (2018)
Mosavi, A., Ruiz, L., Vargas, R.: Deep learning- a review, Adv. Intell. Syst. Comput. (2017). Series Ed.: Kacprzyk, Janusz, Springer
http://www.cs.put.poznan.pl/jstefanowski/aed/DMmultipleclassifiers.pdf
Kanellopoulos, D., Kotsiantis, S.B., Pintelas, P.E.: Data preprocessing for supervised leaning. Int. J. Comput. Electr. Autom. Control Inf. Eng. 1(12), 234–245 (2007)
Mwadulo, M.W.: A review on feature selection methods for classification tasks. Int. J. Comput. Appl. Technol. Res. 5(6), 395–402 (2015)
Massey, F.J.: The Kolmogorov-Smirnov test for goodness of fit. J. Am. Stat. Assoc. 46(253), 68–78 (1951)
Spearman, C.: The proof and measurement of association between two things. Int. J. Epidemiol. 39(5), 1137–1150 (2010)
Keivanloo, I., Mockus, A., Zhang, F., Zou, Y.: Towards building a universal defect prediction model. In: Proceedings of the 11th Working Conference on Mining Software Repositories, ser. MSR, New York, NY, USA, pp. 182–191. ACM (2014)
He, P., Li, B., Ma, Y.: Towards cross-project defect prediction with imbalanced feature sets, CoRR, vol.abs/1411.4228 (2014)
Dong, X., Jing, X., Qi, F., Wu, F., Xu, B.: Heterogeneous cross company defect prediction by unified metric representation and CCA-based transfer learning. In: Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, ser. ESEC/FSE 2015, New York, NY, USA, pp. 496–507. ACM (2015)
Canfora, G., De Lucia, A., Oliveto, R., Panichella, A., Di Penta, M., Panichella, S.: Multi-objective cross-project defect prediction. In: IEEE Sixth International Conference on Verification and Validation in Software Testing, Luxembourg, Luxembourg. IEEE(2013). ISSN 2159-4848
Briand, L.C., Melo, W.L., Wurst, J.: Assessing the applicability of fault-proneness models across object-oriented software projects. IEEE Trans. Softw. Eng. 28, 706–720 (2002)
Bener, A.B., Menzies, T., Di Stefano, J.S., Turhan, B.: On the relative value of cross-company and within-company data for defect prediction. Empirical Softw. Eng. 14(5), 540–578 (2009)
Gall, H., Giger, E., Murphy, B., Nagappan, N., Zimmermann, K.: Cross- project defect prediction: a large scale experiment on data vs. domain vs. process. In: Proceedings of the 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT International Symposium on Foundations of Software Engineering. ACM, pp. 91–100 (2009)
Camargo Cruz, A.E., Ochimizu, K.: Towards logistic regression models for predicting fault-prone code across software projects. In: Proceedings of the Third International Symposium on Empirical Software Engineering and Measurement (ESEM), Lake Buena Vista, Florida, USA, pp. 460– 463 (2009)
Butcher, A., Cok, D.R., Marcus, A., Menzies, T., Zimmermann, T.: Local vs. global models for effort estimation and defect prediction. In: 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), Lawrence, KS, USA, pp. 343–351. IEEE (2011)
Bettenburg, N., Hassan, A. E., Nagappan, M.: Think locally, act globally: improving defect and effort prediction models. In: 9th IEEE Working Conference on Mining Software Repositories, MSR 2012, Zurich, Switzerland pp. 60–69. IEEE (2012)
Devanbu, P., Posnett, D., Rahman, F.: Recalling the imprecision of cross- project defect prediction. In: Proceedings of the ACM-Sigsoft 20th International Symposium on the Foundations of Software Engineering (FSE - 20), Research Triangle Park, NC, USA, pp. 61–65. ACM (2012)
Xu, Z., Yuan, P., Zhang, T., Tang, Y., Li, S., Xia, Z.: HDA: cross project defect prediction via heterogeneous domain adaptation with dictionary learning. IEEE Access 6, 57597–57613 (2018)
Li, Y., Huang, Z., Wang, Y., Fang, B.: Evaluating data filter on cross-project defect prediction: comparison and improvements. IEEE Access 5, 25646–25656 (2017)
Porto, F., Minku, L., Mendes, E., Simao, A.: A Systematic study of cross-project defect prediction with meta- learning, IEEE Trans. Softw. Eng. (2018)
Gong, L., Jiang, S., Yu, Q., Jiang, L.: Unsupervised deep domain adaptation for heterogeneous defect prediction. IEICE Trans. Info. Syst. 102(3), 537–549 (2019)
Li, Z., Jing, X., Zhiu, X., Zhang, H., Xu, B., Ying, S.: Heterogeneous defect prediction with two-stages ensemble learning. Autom. Softw. Eng. 26(2), 187–201 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Vashisht, R., Rizvi, S.A.M. (2020). Heterogeneous Cross Project Defect Prediction – A Survey. In: Singh, P., Sood, S., Kumar, Y., Paprzycki, M., Pljonkin, A., Hong, WC. (eds) Futuristic Trends in Networks and Computing Technologies. FTNCT 2019. Communications in Computer and Information Science, vol 1206. Springer, Singapore. https://doi.org/10.1007/978-981-15-4451-4_22
Download citation
DOI: https://doi.org/10.1007/978-981-15-4451-4_22
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4450-7
Online ISBN: 978-981-15-4451-4
eBook Packages: Computer ScienceComputer Science (R0)