Skip to main content

Heterogeneous Cross Project Defect Prediction – A Survey

  • Conference paper
  • First Online:
Futuristic Trends in Networks and Computing Technologies (FTNCT 2019)

Abstract

In the testing phase of Software Development Life Cycle (SDLC), Software Defect Prediction (SDP) is one of the pivotal task which finds the modules that are more vulnerable to defects and therefore need substantial testing for the early identification of these defects. A lot of work has been done on Cross - Project Defect Prediction (CPDP) that aims to predict defects in the target project lacking in historical defect prediction data or having limited defect data to build an effective generalized defect prediction model. Mostly, CPDP approaches predict the defects in target project on the basis of similar metrics found between source and target project. This paper focuses on the prediction of defects using a Heterogeneous metric set such that no common metrics exist between the source and the target projects. In this paper, a systematic literature study has been done to quote the main findings about CPDP from year 2002 to 2019. The main purpose of this survey is to put forward the adequate content in front of computer science researchers for exploring the specific area and to provide various future directions in this field.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. J. Adv. Res. Comput. Sci. Softw. Eng. 3(8) (2013). ISSN: 2277128X

    Google Scholar 

  2. https://en.wikipedia.org/wiki/Software_metric

  3. Han, D., Hoh, I. P., Kim, S., Lee, T., Nam, J.: Micro interaction metrics for defect prediction. In: Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering, New York, USA. ACM (2011)

    Google Scholar 

  4. D’Ambros, M., Lanza, M., Robbes, R.: Evaluating defect prediction approaches: a bench-mark and an extensive comparison. Empirical Softw. Eng. 17(4–5), 531–577 (2012)

    Article  Google Scholar 

  5. Latih, R., Mulla, R., Na, L., Ahmed, P., Zarina, S., Christopher, W.: A study of mashup as a software application development technique with examples from an end-user programming perspective. J. Comput. Sci. 12, 1406–1415 (2010)

    Google Scholar 

  6. Bener, A.B., Menzies, T., Di Stefano, J., Turhan, B.: On the relative value of cross-company and within-company data for defect prediction. Empirical Softw. Eng. 14, 540–578 (2009)

    Article  Google Scholar 

  7. Fu, W., Kim, S., Menzies, T., Nam, J., Tan, L.: Heterogeneous defect prediction. In: Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, ser. ESEC/FSE, pp. 508–519, New York, NY, USA. ACM (2015)

    Google Scholar 

  8. Baesens, B., Lessmann, S., Mues, C., Pietsch, S.: Benchmarking classification models for software defect prediction: a proposed framework and novel findings. IEEE Trans. Softw. Eng. 34(4), 485–496 (2008)

    Article  Google Scholar 

  9. Devanbu, P., Posnett, D., Rahman, F.: Recalling the imprecision of cross-project defect prediction. In: Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering, New York, NY, USA. ACM (2018)

    Google Scholar 

  10. Mosavi, A., Ruiz, L., Vargas, R.: Deep learning- a review, Adv. Intell. Syst. Comput. (2017). Series Ed.: Kacprzyk, Janusz, Springer

    Google Scholar 

  11. http://www.cs.put.poznan.pl/jstefanowski/aed/DMmultipleclassifiers.pdf

  12. Kanellopoulos, D., Kotsiantis, S.B., Pintelas, P.E.: Data preprocessing for supervised leaning. Int. J. Comput. Electr. Autom. Control Inf. Eng. 1(12), 234–245 (2007)

    Google Scholar 

  13. Mwadulo, M.W.: A review on feature selection methods for classification tasks. Int. J. Comput. Appl. Technol. Res. 5(6), 395–402 (2015)

    Google Scholar 

  14. Massey, F.J.: The Kolmogorov-Smirnov test for goodness of fit. J. Am. Stat. Assoc. 46(253), 68–78 (1951)

    Article  Google Scholar 

  15. Spearman, C.: The proof and measurement of association between two things. Int. J. Epidemiol. 39(5), 1137–1150 (2010)

    Article  Google Scholar 

  16. Keivanloo, I., Mockus, A., Zhang, F., Zou, Y.: Towards building a universal defect prediction model. In: Proceedings of the 11th Working Conference on Mining Software Repositories, ser. MSR, New York, NY, USA, pp. 182–191. ACM (2014)

    Google Scholar 

  17. He, P., Li, B., Ma, Y.: Towards cross-project defect prediction with imbalanced feature sets, CoRR, vol.abs/1411.4228 (2014)

    Google Scholar 

  18. Dong, X., Jing, X., Qi, F., Wu, F., Xu, B.: Heterogeneous cross company defect prediction by unified metric representation and CCA-based transfer learning. In: Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, ser. ESEC/FSE 2015, New York, NY, USA, pp. 496–507. ACM (2015)

    Google Scholar 

  19. Canfora, G., De Lucia, A., Oliveto, R., Panichella, A., Di Penta, M., Panichella, S.: Multi-objective cross-project defect prediction. In: IEEE Sixth International Conference on Verification and Validation in Software Testing, Luxembourg, Luxembourg. IEEE(2013). ISSN 2159-4848

    Google Scholar 

  20. Briand, L.C., Melo, W.L., Wurst, J.: Assessing the applicability of fault-proneness models across object-oriented software projects. IEEE Trans. Softw. Eng. 28, 706–720 (2002)

    Article  Google Scholar 

  21. Bener, A.B., Menzies, T., Di Stefano, J.S., Turhan, B.: On the relative value of cross-company and within-company data for defect prediction. Empirical Softw. Eng. 14(5), 540–578 (2009)

    Article  Google Scholar 

  22. Gall, H., Giger, E., Murphy, B., Nagappan, N., Zimmermann, K.: Cross- project defect prediction: a large scale experiment on data vs. domain vs. process. In: Proceedings of the 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT International Symposium on Foundations of Software Engineering. ACM, pp. 91–100 (2009)

    Google Scholar 

  23. Camargo Cruz, A.E., Ochimizu, K.: Towards logistic regression models for predicting fault-prone code across software projects. In: Proceedings of the Third International Symposium on Empirical Software Engineering and Measurement (ESEM), Lake Buena Vista, Florida, USA, pp. 460– 463 (2009)

    Google Scholar 

  24. Butcher, A., Cok, D.R., Marcus, A., Menzies, T., Zimmermann, T.: Local vs. global models for effort estimation and defect prediction. In: 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), Lawrence, KS, USA, pp. 343–351. IEEE (2011)

    Google Scholar 

  25. Bettenburg, N., Hassan, A. E., Nagappan, M.: Think locally, act globally: improving defect and effort prediction models. In: 9th IEEE Working Conference on Mining Software Repositories, MSR 2012, Zurich, Switzerland pp. 60–69. IEEE (2012)

    Google Scholar 

  26. Devanbu, P., Posnett, D., Rahman, F.: Recalling the imprecision of cross- project defect prediction. In: Proceedings of the ACM-Sigsoft 20th International Symposium on the Foundations of Software Engineering (FSE - 20), Research Triangle Park, NC, USA, pp. 61–65. ACM (2012)

    Google Scholar 

  27. Xu, Z., Yuan, P., Zhang, T., Tang, Y., Li, S., Xia, Z.: HDA: cross project defect prediction via heterogeneous domain adaptation with dictionary learning. IEEE Access 6, 57597–57613 (2018)

    Article  Google Scholar 

  28. Li, Y., Huang, Z., Wang, Y., Fang, B.: Evaluating data filter on cross-project defect prediction: comparison and improvements. IEEE Access 5, 25646–25656 (2017)

    Article  Google Scholar 

  29. Porto, F., Minku, L., Mendes, E., Simao, A.: A Systematic study of cross-project defect prediction with meta- learning, IEEE Trans. Softw. Eng. (2018)

    Google Scholar 

  30. Gong, L., Jiang, S., Yu, Q., Jiang, L.: Unsupervised deep domain adaptation for heterogeneous defect prediction. IEICE Trans. Info. Syst. 102(3), 537–549 (2019)

    Article  Google Scholar 

  31. Li, Z., Jing, X., Zhiu, X., Zhang, H., Xu, B., Ying, S.: Heterogeneous defect prediction with two-stages ensemble learning. Autom. Softw. Eng. 26(2), 187–201 (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rohit Vashisht .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vashisht, R., Rizvi, S.A.M. (2020). Heterogeneous Cross Project Defect Prediction – A Survey. In: Singh, P., Sood, S., Kumar, Y., Paprzycki, M., Pljonkin, A., Hong, WC. (eds) Futuristic Trends in Networks and Computing Technologies. FTNCT 2019. Communications in Computer and Information Science, vol 1206. Springer, Singapore. https://doi.org/10.1007/978-981-15-4451-4_22

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-4451-4_22

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-4450-7

  • Online ISBN: 978-981-15-4451-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics