Neighborhood Approximate Reducts-Based Ensemble Learning Algorithm and Its Application in Software Defect Prediction

Yang, Zhiyong; Du, Junwei; Hu, Qiang; Jiang, Feng

doi:10.1007/978-3-031-21244-4_8

Zhiyong Yang¹³,
Junwei Du¹³,
Qiang Hu¹³ &
…
Feng Jiang¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13633))

Included in the following conference series:

International Joint Conference on Rough Sets

651 Accesses

Abstract

Ensemble learning is a machine learning paradigm that integrates the results of multiple base learners according to a certain rule to obtain a better classification result. Ensemble learning has been widely used in many fields, but the existing methods still have the problems of difficult to guarantee the diversity of base learners and low prediction accuracy. In order to overcome the above problems, we considered ensemble learning from the perspective of attribute space division, defined the concept of neighborhood approximate reduction through neighborhood rough set theory, and further proposed an ensemble learning algorithm based on neighborhood approximate reduction, called ELNAR. ELNAR algorithm divides the attribute space of the data set into multiple subspaces. The basic learners trained based on the data sets corresponding to different subspaces have great differences, so as to ensure the strong generalization performance of the ensemble learner. In order to verify the effectiveness of ELNAR algorithm, we applied ELNAR algorithm to software defect prediction. Experiments on 20 NASA MDP data sets show that ELNAR algorithm can better improve the performance of software defect prediction compared with the existing ensemble learning algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Software defect prediction ensemble learning algorithm based on adaptive variable sparrow search algorithm

Article 06 January 2023

Imbalanced Data Processing Model for Software Defect Prediction

Article 14 December 2017

Heterogeneous defect prediction with two-stage ensemble learning

Article 04 June 2019

References

Rajadurai, H., Gandhi, U.D.: A stacked ensemble learning model for intrusion detection in wireless network. In: Neural Computing and Applications 34, 15387–15395 (2020)
Google Scholar
Luo, S.Y., Gu, Y.J., Yao, X.X., Wei, F.: Research on text sentiment analysis based on neural network and ensemble learning. Revue d’Intelligence Artificielle 35(1), 63–70 (2021)
Article Google Scholar
Jabbar, M.A.: Breast cancer data classification using ensemble machine learning. Eng. Appl. Sci. Res. 48(1), 65–72 (2021)
Google Scholar
Ali, U., Aftab, S., Iqbal, A., Nawaz, Z., Bashir, M.S., Saeed, M.A.: Software defect prediction using variant based ensemble learning and feature selection techniques. Int. J. Modern Educ. Comput. Sci. 12(5), 29–40 (2020)
Article Google Scholar
Bühlmann, P., Yu, B.: Analyzing bagging. Ann. Stat. 30(4), 927–961 (2002)
Google Scholar
Ho, T.K.: The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Mach. Intell. 20(8), 832–844 (1998)
Article Google Scholar
Liu, Z.N., et al.: Self-paced ensemble for highly imbalanced massive data classification. In: 9th International Proceedings on Data Engineering, pp. 841–852. IEEE, NY (2020)
Google Scholar
García, S., Zhang, Z.L., Altalhi, A., Alshomrani, S., Herrera, F.: Dynamic ensemble selection for multi-class imbalanced datasets. Inf. Sci. 445–456, 22–37 (2018)
Article MathSciNet Google Scholar
Liu, Z.N., et al.: Towards inter-class and intra-class imbalance in class-imbalanced learning. arXiv preprint arXiv:2111.12791 (2021)
Jiang, F., Yu, X., Zhao, H.B., Gong, D.W., Du, J.W.: Ensemble learning based on random super-reduct and resampling. Artif. Intell. Rev. 54(4), 3115–3140 (2021)
Article Google Scholar
Chen, L., Fang, B., Shang, Z.W., Tang, Y.Y.: Tackling class overlap and imbalance problems in software defect prediction. Software Qual. J. 26(1), 97–125 (2018)
Article Google Scholar
Abuqaddom, I., Hudaib, A.: Cost-sensitive learner on hybrid smote-ensemble approach to predict software defects. In: Silhavy, R., Silhavy, P., Prokopova, Z. (eds.) CoMeSySo 2018. AISC, vol. 859, pp. 12–21. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-00211-4_2
Chapter Google Scholar
Balogun, A.O., et al.: SMOTE-based homogeneous ensemble methods for software defect prediction. In: Gervasi, O., et al. (eds.) ICCSA 2020. LNCS, vol. 12254, pp. 615–631. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58817-5_45
Chapter Google Scholar
Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13(1), 21–27 (1967)
Article MATH Google Scholar
MDP Data Repository. http://nasa-softwaredefectdatasets.wikispaces.com/. Accessed 11 Mar 2022
PROMISE Data Repository. https://code.google.com/p/promisedata/. Accessed 11 Mar 2022
Hu, Q.H., Yu, D.R., Xie, Z.X.: Neighborhood classifiers. Expert Syst. Appl. 34(2), 866–876 (2008)
Article Google Scholar
Hu, Q.H., Yu, D.R., Liu, J.F., Wu, C.X.: Neighborhood rough set based heterogeneous feature subset selection. Inf. Sci. 178(18), 3577–3594 (2008)
Article MathSciNet MATH Google Scholar
Hu, Q.H., Liu, J.F., Yu, D.R.: Mixed feature selection based on granulation and approximation. Knowl.-Based Syst. 21(4), 294–304 (2008)
Article Google Scholar
Dolatshah, M., Hadian, A., Minaei-Bidgoli, B.: Ball*-tree: Efficient spatial indexing for constrained nearest-neighbor search in metric spaces. arXiv preprint arXiv:1511.00628 (2015)
Marqués, A.I., García, V., Sánchez, J.S.: Two-level classifier ensembles for credit risk assessment. Expert Syst. Appl. 39(12), 10916–10922 (2012)
Article Google Scholar
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
MathSciNet MATH Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (Grant Nos. 61973180, 62172249, U1806201), and the Shandong Provincial Natural Science Foundation, China (Grant Nos. ZR2022MF326, ZR2021QF074, ZR2018MF007).

Author information

Authors and Affiliations

Qingdao University of Science and Technology, Qingdao, 266100, Shandong, China
Zhiyong Yang, Junwei Du, Qiang Hu & Feng Jiang

Authors

Zhiyong Yang
View author publications
You can also search for this author in PubMed Google Scholar
Junwei Du
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Hu
View author publications
You can also search for this author in PubMed Google Scholar
Feng Jiang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Feng Jiang .

Editor information

Editors and Affiliations

University of Regina, Regina, SK, Canada
JingTao Yao
Iwate Prefectural University, Takizawa, Iwate, Japan
Hamido Fujita
Shanghai University, Shanghai, China
Xiaodong Yue
Tongji University, Shanghai, China
Duoqian Miao
University of Kansas, Lawrence, KS, USA
Jerzy Grzymala-Busse
Soochow University, Suzhou, Jiangsu, China
Fanzhang Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yang, Z., Du, J., Hu, Q., Jiang, F. (2022). Neighborhood Approximate Reducts-Based Ensemble Learning Algorithm and Its Application in Software Defect Prediction. In: Yao, J., Fujita, H., Yue, X., Miao, D., Grzymala-Busse, J., Li, F. (eds) Rough Sets. IJCRS 2022. Lecture Notes in Computer Science(), vol 13633. Springer, Cham. https://doi.org/10.1007/978-3-031-21244-4_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-21244-4_8
Published: 11 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21243-7
Online ISBN: 978-3-031-21244-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Neighborhood Approximate Reducts-Based Ensemble Learning Algorithm and Its Application in Software Defect Prediction

Abstract

Access this chapter

Similar content being viewed by others

Software defect prediction ensemble learning algorithm based on adaptive variable sparrow search algorithm

Imbalanced Data Processing Model for Software Defect Prediction

Heterogeneous defect prediction with two-stage ensemble learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Neighborhood Approximate Reducts-Based Ensemble Learning Algorithm and Its Application in Software Defect Prediction

Abstract

Access this chapter

Similar content being viewed by others

Software defect prediction ensemble learning algorithm based on adaptive variable sparrow search algorithm

Imbalanced Data Processing Model for Software Defect Prediction

Heterogeneous defect prediction with two-stage ensemble learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation