Heterogeneous Cross Project Defect Prediction – A Survey

Vashisht, Rohit; Rizvi, Syed Afzal Murtaza

doi:10.1007/978-981-15-4451-4_22

Rohit Vashisht¹³ &
Syed Afzal Murtaza Rizvi¹³

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1206))

Included in the following conference series:

International Conference on Futuristic Trends in Networks and Computing Technologies

689 Accesses
1 Citations

Abstract

In the testing phase of Software Development Life Cycle (SDLC), Software Defect Prediction (SDP) is one of the pivotal task which finds the modules that are more vulnerable to defects and therefore need substantial testing for the early identification of these defects. A lot of work has been done on Cross - Project Defect Prediction (CPDP) that aims to predict defects in the target project lacking in historical defect prediction data or having limited defect data to build an effective generalized defect prediction model. Mostly, CPDP approaches predict the defects in target project on the basis of similar metrics found between source and target project. This paper focuses on the prediction of defects using a Heterogeneous metric set such that no common metrics exist between the source and the target projects. In this paper, a systematic literature study has been done to quote the main findings about CPDP from year 2002 to 2019. The main purpose of this survey is to put forward the adequate content in front of computer science researchers for exploring the specific area and to provide various future directions in this field.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

J. Adv. Res. Comput. Sci. Softw. Eng. 3(8) (2013). ISSN: 2277128X
Google Scholar
https://en.wikipedia.org/wiki/Software_metric
Han, D., Hoh, I. P., Kim, S., Lee, T., Nam, J.: Micro interaction metrics for defect prediction. In: Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering, New York, USA. ACM (2011)
Google Scholar
D’Ambros, M., Lanza, M., Robbes, R.: Evaluating defect prediction approaches: a bench-mark and an extensive comparison. Empirical Softw. Eng. 17(4–5), 531–577 (2012)
Article Google Scholar
Latih, R., Mulla, R., Na, L., Ahmed, P., Zarina, S., Christopher, W.: A study of mashup as a software application development technique with examples from an end-user programming perspective. J. Comput. Sci. 12, 1406–1415 (2010)
Google Scholar
Bener, A.B., Menzies, T., Di Stefano, J., Turhan, B.: On the relative value of cross-company and within-company data for defect prediction. Empirical Softw. Eng. 14, 540–578 (2009)
Article Google Scholar
Fu, W., Kim, S., Menzies, T., Nam, J., Tan, L.: Heterogeneous defect prediction. In: Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, ser. ESEC/FSE, pp. 508–519, New York, NY, USA. ACM (2015)
Google Scholar
Baesens, B., Lessmann, S., Mues, C., Pietsch, S.: Benchmarking classification models for software defect prediction: a proposed framework and novel findings. IEEE Trans. Softw. Eng. 34(4), 485–496 (2008)
Article Google Scholar
Devanbu, P., Posnett, D., Rahman, F.: Recalling the imprecision of cross-project defect prediction. In: Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering, New York, NY, USA. ACM (2018)
Google Scholar
Mosavi, A., Ruiz, L., Vargas, R.: Deep learning- a review, Adv. Intell. Syst. Comput. (2017). Series Ed.: Kacprzyk, Janusz, Springer
Google Scholar
http://www.cs.put.poznan.pl/jstefanowski/aed/DMmultipleclassifiers.pdf
Kanellopoulos, D., Kotsiantis, S.B., Pintelas, P.E.: Data preprocessing for supervised leaning. Int. J. Comput. Electr. Autom. Control Inf. Eng. 1(12), 234–245 (2007)
Google Scholar
Mwadulo, M.W.: A review on feature selection methods for classification tasks. Int. J. Comput. Appl. Technol. Res. 5(6), 395–402 (2015)
Google Scholar
Massey, F.J.: The Kolmogorov-Smirnov test for goodness of fit. J. Am. Stat. Assoc. 46(253), 68–78 (1951)
Article Google Scholar
Spearman, C.: The proof and measurement of association between two things. Int. J. Epidemiol. 39(5), 1137–1150 (2010)
Article Google Scholar
Keivanloo, I., Mockus, A., Zhang, F., Zou, Y.: Towards building a universal defect prediction model. In: Proceedings of the 11th Working Conference on Mining Software Repositories, ser. MSR, New York, NY, USA, pp. 182–191. ACM (2014)
Google Scholar
He, P., Li, B., Ma, Y.: Towards cross-project defect prediction with imbalanced feature sets, CoRR, vol.abs/1411.4228 (2014)
Google Scholar
Dong, X., Jing, X., Qi, F., Wu, F., Xu, B.: Heterogeneous cross company defect prediction by unified metric representation and CCA-based transfer learning. In: Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, ser. ESEC/FSE 2015, New York, NY, USA, pp. 496–507. ACM (2015)
Google Scholar
Canfora, G., De Lucia, A., Oliveto, R., Panichella, A., Di Penta, M., Panichella, S.: Multi-objective cross-project defect prediction. In: IEEE Sixth International Conference on Verification and Validation in Software Testing, Luxembourg, Luxembourg. IEEE(2013). ISSN 2159-4848
Google Scholar
Briand, L.C., Melo, W.L., Wurst, J.: Assessing the applicability of fault-proneness models across object-oriented software projects. IEEE Trans. Softw. Eng. 28, 706–720 (2002)
Article Google Scholar
Bener, A.B., Menzies, T., Di Stefano, J.S., Turhan, B.: On the relative value of cross-company and within-company data for defect prediction. Empirical Softw. Eng. 14(5), 540–578 (2009)
Article Google Scholar
Gall, H., Giger, E., Murphy, B., Nagappan, N., Zimmermann, K.: Cross- project defect prediction: a large scale experiment on data vs. domain vs. process. In: Proceedings of the 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT International Symposium on Foundations of Software Engineering. ACM, pp. 91–100 (2009)
Google Scholar
Camargo Cruz, A.E., Ochimizu, K.: Towards logistic regression models for predicting fault-prone code across software projects. In: Proceedings of the Third International Symposium on Empirical Software Engineering and Measurement (ESEM), Lake Buena Vista, Florida, USA, pp. 460– 463 (2009)
Google Scholar
Butcher, A., Cok, D.R., Marcus, A., Menzies, T., Zimmermann, T.: Local vs. global models for effort estimation and defect prediction. In: 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), Lawrence, KS, USA, pp. 343–351. IEEE (2011)
Google Scholar
Bettenburg, N., Hassan, A. E., Nagappan, M.: Think locally, act globally: improving defect and effort prediction models. In: 9th IEEE Working Conference on Mining Software Repositories, MSR 2012, Zurich, Switzerland pp. 60–69. IEEE (2012)
Google Scholar
Devanbu, P., Posnett, D., Rahman, F.: Recalling the imprecision of cross- project defect prediction. In: Proceedings of the ACM-Sigsoft 20th International Symposium on the Foundations of Software Engineering (FSE - 20), Research Triangle Park, NC, USA, pp. 61–65. ACM (2012)
Google Scholar
Xu, Z., Yuan, P., Zhang, T., Tang, Y., Li, S., Xia, Z.: HDA: cross project defect prediction via heterogeneous domain adaptation with dictionary learning. IEEE Access 6, 57597–57613 (2018)
Article Google Scholar
Li, Y., Huang, Z., Wang, Y., Fang, B.: Evaluating data filter on cross-project defect prediction: comparison and improvements. IEEE Access 5, 25646–25656 (2017)
Article Google Scholar
Porto, F., Minku, L., Mendes, E., Simao, A.: A Systematic study of cross-project defect prediction with meta- learning, IEEE Trans. Softw. Eng. (2018)
Google Scholar
Gong, L., Jiang, S., Yu, Q., Jiang, L.: Unsupervised deep domain adaptation for heterogeneous defect prediction. IEICE Trans. Info. Syst. 102(3), 537–549 (2019)
Article Google Scholar
Li, Z., Jing, X., Zhiu, X., Zhang, H., Xu, B., Ying, S.: Heterogeneous defect prediction with two-stages ensemble learning. Autom. Softw. Eng. 26(2), 187–201 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

CSE Department, Jamia Millia Islamia, Delhi, India
Rohit Vashisht & Syed Afzal Murtaza Rizvi

Authors

Rohit Vashisht
View author publications
You can also search for this author in PubMed Google Scholar
Syed Afzal Murtaza Rizvi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rohit Vashisht .

Editor information

Editors and Affiliations

Jaypee University of Information Technology, Waknaghat, Himachal Pradesh, India
Pradeep Kumar Singh
CDAC, Mohali, India
Sanjay Sood
Jaypee University of Information Technology, Solan, Himachal Pradesh, India
Yugal Kumar
Polish Academy of Sciences, Warsaw, Poland
Marcin Paprzycki
Southern Federal University, Rostov-on-Don, Russia
Anton Pljonkin
Jiangsu Normal University, Xuzhou, China
Wei-Chiang Hong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vashisht, R., Rizvi, S.A.M. (2020). Heterogeneous Cross Project Defect Prediction – A Survey. In: Singh, P., Sood, S., Kumar, Y., Paprzycki, M., Pljonkin, A., Hong, WC. (eds) Futuristic Trends in Networks and Computing Technologies. FTNCT 2019. Communications in Computer and Information Science, vol 1206. Springer, Singapore. https://doi.org/10.1007/978-981-15-4451-4_22

Download citation

DOI: https://doi.org/10.1007/978-981-15-4451-4_22
Published: 22 April 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4450-7
Online ISBN: 978-981-15-4451-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics