Skip to main content

Improving Unbalanced Security X-Ray Image Classification Using VGG16 and AlexNet with Z-Score Normalization and Augmentation

  • Conference paper
  • First Online:
Proceedings of the 4th International Conference on Electronics, Biomedical Engineering, and Health Informatics (ICEBEHI 2023)

Abstract

Addressing the challenge of unbalanced data sets in convolutional neural network (CNN) models for image recognition, this study aims to investigate the impact of data augmentation and normalization. The problem lies in the limited generalization of the model due to class-based data differences, hence the need for several techniques in data preprocessing such as data augmentation and normalization. The main contribution of this research is a comprehensive analysis of the effectiveness of data augmentation and normalization techniques in improving model performance. The research utilized the AlexNet and VGG16 architectures and conducted extensive experiments on data sets with varying degrees of imbalance. Data augmentation generates additional examples, while normalization reduces convergence issues. The results show that training the AlexNet model without augmentation results in a low accuracy of 0.24, underscoring the challenges posed by skewed data distributions. In contrast, augmented data substantially improves performance, with AlexNet achieving an accuracy of 0.91 and VGG16 achieving 0.84. In addition, normalized data also made a positive contribution, showing an accuracy of 0.74 for AlexNet and 0.67 for VGG16. In conclusion, data augmentation and normalization techniques are essential in reducing the effects of data imbalance, thereby improving the generalizability of the models. The improved accuracy of the data using normalization techniques indicates the ability of the model to read the data after normalization. This study underscores the importance of preprocessing strategies in optimizing model performance and advancing the field of deep learning in image recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  1. Kumar P, Bhatnagar R, Gaur K, Bhatnagar A (2021) Classification of imbalanced data: review of methods and applications. IOP Conf Ser Mater Sci Eng 1099:012077. https://doi.org/10.1088/1757-899x/1099/1/012077

    Article  Google Scholar 

  2. Rekha G, Tyagi AK, Sreenath N, Mishra S (2021) Class imbalanced data: open issues and future research directions. In: 2021 International conference on computer communication and informatics, ICCCI 2021. Institute of Electrical and Electronics Engineers Inc. (2021). https://doi.org/10.1109/ICCCI50826.2021.9402272

  3. Komori O, Eguchi S (2019) Introduction to imbalanced data. Presented at the (2019).https://doi.org/10.1007/978-4-431-55570-4_1

  4. Gusmão G, Raposo A, Oliveira R, de, Barbosa C (2022) Treating dataset imbalance in fetal echocardiography classification. In: Communication papers of the 17th conference on computer science and intelligence systems, pp 3–9. PTI. https://doi.org/10.15439/2022f56

  5. Rani S, Ahmad T, Masood S (2023) Handling class imbalance problem using oversampling techniques for breast cancer prediction. Presented at the June 16 (2023). https://doi.org/10.1109/reedcon57544.2023.10150702

  6. Cai W, Ning X, Zhou G, Bai X, Jiang Y, Li W, Qian P (2023) A novel hyperspectral image classification model using bole convolution with three-direction attention mechanism: small sample and unbalanced learning. IEEE Trans Geosci Remote Sens, 61. https://doi.org/10.1109/TGRS.2022.3201056

  7. Liu J, Guo F, Gao H, Huang Z, Zhang Y, Zhou H (2021) Image classification method on class imbalance datasets using multi-scale CNN and two-stage transfer learning. Neural Comput Appl. https://doi.org/10.1007/s00521-021-06066-8

  8. Johnson JM, Khoshgoftaar TM (2019) Survey on deep learning with class imbalance. J Big Data. 6. https://doi.org/10.1186/s40537-019-0192-5

  9. Hasanin T, Khoshgoftaar TM, Leevy JL, Bauder RA (2020) Investigating class rarity in big data. J Big Data, 7. https://doi.org/10.1186/s40537-020-00301-0

  10. Sadhukhan P, Pakrashi A, Palit S, Mac Namee B (2021) Integrating unsupervised clustering and label-specific oversampling to tackle imbalanced multi-label data

    Google Scholar 

  11. Alaba SY, Nabi MM, Shah C, Prior J, Campbell MD, Wallace F, Ball JE, Moorhead R (2022) Class-aware fish species recognition using deep learning for an imbalanced dataset. Sensors, 22. https://doi.org/10.3390/s22218268

  12. Guo Y, Fang Z, Zhang S, Dong H (2021) Classification of potato early blight with unbalanced data based on GhostNet. In: 2021 3rd International academic exchange conference on science and technology innovation, IAECST 2021, pp 559–563. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IAECST54258.2021.9695532

  13. Sun M, Jiang A, Li Z (2021) data enhancement for melanoma classification. In: Proceedings—2021 2nd international conference on artificial intelligence and computer engineering, ICAICE 2021, pp 149–155. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICAICE54393.2021.00037

  14. Shafay M, Hassan T, Ahmed A, Velayudhan D, Dias J, Werghi N (2022) Programmable broad learning system to detect concealed and imbalanced baggage threats. In: 2022 2nd international conference on digital futures and transformative technologies, ICoDT2 2022. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICoDT255437.2022.9787420

  15. Xu X, Li W, Duan Q (2021) Transfer learning and SE-ResNet152 networks-based for small-scale unbalanced fish species identification. Comput Electron Agric, 180. https://doi.org/10.1016/j.compag.2020.105878

  16. Sambasivam G, Opiyo GD (2021) A predictive machine learning application in agriculture: Cassava disease detection and classification with imbalanced dataset using convolutional neural networks. Egyptian Inf J 22:27–34. https://doi.org/10.1016/j.eij.2020.02.007

    Article  Google Scholar 

  17. Airport Security Scanning: Xray Dataset, https://universe.roboflow.com/airport-security-scanning/xray-upzdb. Accessed 27 August 2023

  18. Qi S, He M, Hoang R, Zhou Y, Namadi P, Tom B, Sandhu P, Bai Z, Chung F, Ding Z, Anderson J, Roh DM, Huynh V (2023) Salinity modeling using deep learning with data augmentation and transfer learning. Water (Switzerland), 15. https://doi.org/10.3390/w15132482

  19. Junaidi A, Tanjung NAF, Wijayanto S, Lasama J, Iskandar AR (2021) Overfitting problem in images classification for egg incubator using convolutional neural network. In: 2021 9th international conference on cyber and IT service management, CITSM 2021. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CITSM52892.2021.9588815

  20. Zhang F, Zhang Z (2023) Information transfer in multitask learning. Data Augment Beyond 37:16220–16221

    Google Scholar 

  21. Rihal S, Assal H (2023) Machine learning for the documentation, prediction, and augmentation of heritage structure data. The international archives of the photogrammetry, remote sensing and spatial information sciences. XLVIII-M-2–2023, 1301–1307. https://doi.org/10.5194/isprs-archives-xlviii-m-2-2023-1301-2023

  22. Liu X, He J, Liu M, Yin Z, Yin L, Zheng W (2023) A scenario-generic neural machine translation data augmentation method. Electronics (Switzerland), 12. https://doi.org/10.3390/electronics12102320

  23. Keller K (2019) Entropy measures for data analysis: theory, algorithms and applications. https://doi.org/10.3390/e21100935.

  24. K B (2020) data dimensionality reduction techniques: review. Int J Eng Technol Manag Sci 4:62–65. https://doi.org/10.46647/ijetms.2020.v04i04.010

  25. Krizhevsky A, Sutskever I, Hinton GE (2021) Imagenet classification with deep convolutional neural networks

    Google Scholar 

  26. Chen F, Tsou JY (2021) DRSNet: Novel architecture for small patch and low-resolution remote sensing image scene classification. Int J Appl Earth Observ Geoinf, 104. https://doi.org/10.1016/j.jag.2021.102577

  27. Smith M, Li Z, Landry L, Merz KM, Li P (2023) Consequences of overfitting the van der Waals Radii of ions

    Google Scholar 

  28. Baskin C, Liss N, Zheltonozhskii E, Bronshtein AM (2017) Mendelson, a.: streaming architecture for large-scale quantized neural networks on an FPGA-based dataflow platform.https://doi.org/10.1109/IPDPSW.2018.00032

  29. Junaidi A, Lasama J, Adhinata FD, Iskandar AR (2021) Image classification for egg incubator using transfer learning of VGG16 and VGG19. In: 10th IEEE international conference on communication, networks and satellite, comnetsat 2021—Proceedings. pp. 324–328. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/COMNETSAT53002.2021.9530826

  30. Ruvinga C, Malathi D, Dorathi Jayaseeli JD (2020) Human concentration level recognition based on vgg16 cnn architecture. Int J Adv Sci Technol 29:1364–1373

    Google Scholar 

  31. Mungloo-Dilmohamud Z, Khan MHM, Jhumka K, Beedassy BN, Mungloo NZ, Peña-Reyes C (2022) Balancing data through data augmentation improves the generality of transfer learning for diabetic retinopathy classification. Appl Sci (Switzerland) 12. https://doi.org/10.3390/app12115363

  32. Zhang Y, Yue J, Song A, Jia S, Li Z (2023) A High-similarity shellfish recognition method based on convolutional neural network. Inf Process Agric 10:149–163. https://doi.org/10.1016/j.inpa.2022.05.009

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Apri Junaidi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Qi, D., Junaidi, A., Howe, C.W., Zain, A.M. (2024). Improving Unbalanced Security X-Ray Image Classification Using VGG16 and AlexNet with Z-Score Normalization and Augmentation. In: Triwiyanto, T., Rizal, A., Caesarendra, W. (eds) Proceedings of the 4th International Conference on Electronics, Biomedical Engineering, and Health Informatics. ICEBEHI 2023. Lecture Notes in Electrical Engineering, vol 1182. Springer, Singapore. https://doi.org/10.1007/978-981-97-1463-6_14

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-1463-6_14

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-1462-9

  • Online ISBN: 978-981-97-1463-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics