Skip to main content

Digit Image Recognition Using an Ensemble of One-Versus-All Deep Network Classifiers

  • Conference paper
  • First Online:
Information and Communication Technology for Competitive Strategies (ICTCS 2020)

Abstract

In multiclass deep network classifiers, the burden of classifying samples of different classes is put on a single classifier. As the result, the optimum classification accuracy is not obtained. Also, training times are large due to running the CNN training on single CPU/GPU. However, it is known that using ensembles of classifiers increases the performance. Also, the training times can be reduced by running each member of the ensemble on a separate processor. Ensemble learning has been used in the past for traditional methods to a varying extent and is a hot topic. With the advent of deep learning, ensemble learning has been applied to the former as well. However, an area which is unexplored and has potential is one-versus-all (OVA) deep ensemble learning. In this paper, we explore it and show that by using OVA ensembles of deep networks, improvements in performance of deep networks can be obtained. As shown in this paper, the classification capability of deep networks can be further increased by using an ensemble of binary classification (OVA) deep networks. We implement a novel technique for the case of digit image recognition and test and evaluate it on the same. In the proposed approach, a single OVA deep network classifier is dedicated to each category. Subsequently, OVA deep network ensembles have been investigated. Every network in an ensemble has been trained by an OVA training technique using the stochastic gradient descent with momentum algorithm (SGDMA). For classification of a test sample, the sample is presented to each network in the ensemble. After prediction score voting, the network with the largest score is assumed to have classified the sample. The experimentation has been done on the MNIST digit dataset, the USPS + digit dataset, and MATLAB digit image dataset. Our proposed technique outperforms the baseline on digit image recognition for all datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. X. Dong, Z. Yu, W. Cao, Y. Shi, Q. Ma, A survey on ensemble learning. Frontiers Comput. Sci. 14(2), 241–258 (2020). https://doi.org/10.1007/s11704-019-8208-z

    Article  Google Scholar 

  2. C. Kandaswamy, L.M. Silva, L.A. Alexandre, J.M. Santos, Deep transfer learning ensemble for classification, in Advances in Computational Intelligence (Springer International Publishing, Cham, 2015), pp. 335–348

    Google Scholar 

  3. D. Nozza, E. Fersini, E. Messina, Deep learning and ensemble methods for domain adaptation, in 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI), 6–8 November 2016, pp. 184-189. https://doi.org/10.1109/ictai.2016.0037

  4. X. Liu, Z. Liu, G. Wang, Z. Cai, H. Zhang, Ensemble transfer learning algorithm. IEEE Access 6, 2389–2396 (2018). https://doi.org/10.1109/ACCESS.2017.2782884

    Article  Google Scholar 

  5. Y. Freund, R. Schapire, N. Abe, A short introduction to boosting. J. Jpn. Soc. Artif. Intell. 14(771–780), 1612 (1999)

    Google Scholar 

  6. E. Dikici, L.M. Prevedello, M. Bigelow, R.D. White, B.S. Erdal, Constrained generative adversarial network ensembles for sharable synthetic data generation. arXiv preprint arXiv:200300086 (2020)

  7. Z. Yu, Y. Zhang, C.L.P. Chen, J. You, H. Wong, D. Dai, S. Wu, J. Zhang, Multiobjective semisupervised classifier ensemble. IEEE Trans. Cybern. 49(6), 2280–2293 (2019). https://doi.org/10.1109/TCYB.2018.2824299

    Article  Google Scholar 

  8. Z. Yu, Y. Zhang, J. You, C.L.P. Chen, H. Wong, G. Han, J. Zhang, Adaptive semi-supervised classifier ensemble for high dimensional data classification. IEEE Trans. Cybern. 49(2), 366–379 (2019). https://doi.org/10.1109/TCYB.2017.2761908

    Article  Google Scholar 

  9. H.I. Fawaz, G. Forestier, J. Weber, L. Idoumghar, P. Muller, Deep neural network ensembles for time series classification, in 2019 International Joint Conference on Neural Networks (IJCNN), 14–19 July 2019, pp. 1-6. https://doi.org/10.1109/ijcnn.2019.8852316

  10. S. Tao, Deep neural network ensembles, in Machine Learning, Optimization, and Data Science (Springer International Publishing, Cham, 2019), pp. 1–12

    Google Scholar 

  11. S. Sun, S. Wang, Y. Wei, G. Zhang, A clustering-based nonlinear ensemble approach for exchange rates forecasting. IEEE Trans. Syst. Man Cybern. Syst. (2018)

    Google Scholar 

  12. O. Sagi, L. Rokach, Ensemble learning: a survey. WIREs Data Min. Knowl. Discov. 8(4), (2018). https://doi.org/10.1002/widm.1249

    Article  Google Scholar 

  13. K. Yang, Z. Yu, X. Wen, W. Cao, C.L.P. Chen, H. Wong, J. You, Hybrid classifier ensemble for imbalanced data. IEEE Trans. Neural Netw. Learn. Syst. 31(4), 1387–1400 (2020). https://doi.org/10.1109/TNNLS.2019.2920246

    Article  MathSciNet  Google Scholar 

  14. J. Zheng, X. Cao, B. Zhang, X. Zhen, X. Su, Deep ensemble machine for video classification. IEEE Trans. Neural Netw. Learn. Syst. 30(2), 553–565 (2019). https://doi.org/10.1109/TNNLS.2018.2844464

    Article  MathSciNet  Google Scholar 

  15. A.M. Hafiz, G.M. Bhat, A survey of deep learning techniques for medical diagnosis, in Information and Communication Technology for Sustainable Development (Springer, 2020), pp. 161–170

    Google Scholar 

  16. A. Madakannu, A. Selvaraj, DIGI-Net: a deep convolutional neural network for multi-format digit recognition. Neural Comput. Appl. 32(15), 11373–11383 (2020). https://doi.org/10.1007/s00521-019-04632-9

    Article  Google Scholar 

  17. D. Mellouli, T.M. Hamdani, J.J. Sanchez-Medina, M.B. Ayed, A.M. Alimi, Morphological convolutional neural network architecture for digit recognition. IEEE Trans. Neural Netw. Learn. Syst. 30(9), 2876–2885 (2019). https://doi.org/10.1109/TNNLS.2018.2890334

    Article  Google Scholar 

  18. S. Ali, Z. Shaukat, M. Azeem, Z. Sakhawat, T. Mahmood, K. ur Rehman, An efficient and improved scheme for handwritten digit recognition based on convolutional neural network. SN Appl. Sci. 1(9), 1125 (2019). https://doi.org/10.1007/s42452-019-1161-5

    Article  Google Scholar 

  19. J. Qiao, G. Wang, W. Li, M. Chen, An adaptive deep Q-learning strategy for handwritten digit recognition. Neural Netw. 107, 61–71 (2018). https://doi.org/10.1016/j.neunet.2018.02.010

    Article  MATH  Google Scholar 

  20. M. Mozafari, M. Ganjtabesh, A. Nowzari-Dalini, S.J. Thorpe, T. Masquelier, Bio-inspired digit recognition using reward-modulated spike-timing-dependent plasticity in deep convolutional networks. Pattern Recogn. 94, 87–95 (2019). https://doi.org/10.1016/j.patcog.2019.05.015

    Article  Google Scholar 

  21. S.R. Kulkarni, B. Rajendran, Spiking neural networks for handwritten digit recognition—supervised learning and network optimization. Neural Netw. 103, 118–127 (2018). https://doi.org/10.1016/j.neunet.2018.03.019

    Article  Google Scholar 

  22. X.-X. Niu, C.Y. Suen, A novel hybrid CNN–SVM classifier for recognizing handwritten digits. Pattern Recogn. 45(4), 1318–1325 (2012). https://doi.org/10.1016/j.patcog.2011.09.021

    Article  Google Scholar 

  23. A. Bellili, M. Gilloux, P. Gallinari, An MLP-SVM combination architecture for offline handwritten digit recognition. Int. J. Doc. Anal. Recogn. 5(4), 244–252 (2003). https://doi.org/10.1007/s10032-002-0094-4

    Article  Google Scholar 

  24. MNIST Dataset, http://yann.lecun.com/exdb/mnist/

  25. USPS Dataset, https://cs.nyu.edu/~roweis/data.html

  26. Y. Chen, E. Keogh, B. Hu, N. Begum, A. Bagnall, A. Mueen, G. Batista, The UCR time series classification archive (2015)

    Google Scholar 

  27. S. Xie, R. Girshick, P. Dollár, Z. Tu, K. He, Aggregated residual transformations for deep neural networks, in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 21–26 July 2017, pp. 5987-5995. https://doi.org/10.1109/cvpr.2017.634

  28. K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2016), pp. 770–778

    Google Scholar 

  29. K. He, X. Zhang, S. Ren, J. Sun, Identity mappings in deep residual networks, in European Conference on Computer Vision (Springer, 2016), pp. 630–645

    Google Scholar 

  30. A. Krizhevsky, G. Hinton, Learning multiple layers of features from tiny images (2009)

    Google Scholar 

  31. X. Wei, H. Yu, Y. Hu, Y. Zhang, R. Weng, W. Luo, Multiscale collaborative deep models for neural machine translation. arXiv preprint arXiv:200414021 (2020)

  32. A.M. Hafiz, G.M. Bhat, A survey on instance segmentation: state of the art. Int. J. Multimedia Inform. Retrieval 9(3), 171–189 (2020). https://doi.org/10.1007/s13735-020-00195-x

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abdul Mueed Hafiz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hafiz, A.M., Hassaballah, M. (2021). Digit Image Recognition Using an Ensemble of One-Versus-All Deep Network Classifiers. In: Kaiser, M.S., Xie, J., Rathore, V.S. (eds) Information and Communication Technology for Competitive Strategies (ICTCS 2020). Lecture Notes in Networks and Systems, vol 190. Springer, Singapore. https://doi.org/10.1007/978-981-16-0882-7_38

Download citation

Publish with us

Policies and ethics