An efficient indoor scene character recognition using Bayesian interactive search algorithm-based adaboost-CNN classifier

  • Original Article
  • Published in: Neural Computing and Applications

Abstract

Text or character recognition in scenes plays a primary role in many computer vision applications. Under generic conditions, scene text recognition remains a complicated and open research challenge, and numerous techniques have been proposed to address it. Existing methods encounter a number of difficulties during scene character recognition, including complex backgrounds, noise, blur, non-uniform lighting, local distortion, and varying fonts. To tackle these issues, we present a Bayesian interactive search algorithm (BISA) combined with an AdaBoost-based convolutional neural network (BISA with AdaBoost-CNN) for scene character recognition. The proposed work has two key components: word to consecutive conversion and scene character recognition. First, HOG and SIFT feature descriptors are extracted during word to consecutive conversion. Next, BISA is used to enhance the performance of the AdaBoost-based convolutional neural network for scene character recognition. The experiments use several evaluation measures, and the implementation is carried out in MATLAB. The proposed BISA with AdaBoost-CNN achieves higher recognition accuracy than the existing approaches it is compared against.
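To make the boosting component more concrete, the following is a minimal sketch of a single multi-class AdaBoost round, assuming the SAMME formulation. The abstract does not state which boosting variant, loss, or weighting scheme the authors use, so this is an illustration rather than their method. It is written in Python instead of the MATLAB environment mentioned above; the function name `samme_round` and the toy labels are hypothetical, and the weak learner's predictions are passed in as a plain list rather than produced by a CNN.

```python
import math

def samme_round(weights, y_true, y_pred, n_classes):
    """One SAMME boosting round: return (alpha, renormalized sample weights).

    Hypothetical helper for illustration; y_pred stands in for the output
    of whatever weak learner (e.g. a CNN) was trained in this round.
    """
    # Weighted error rate of the current weak learner.
    total = sum(weights)
    err = sum(w for w, t, p in zip(weights, y_true, y_pred) if t != p) / total

    # Clamp the error so the logarithm below stays finite.
    eps = 1e-10
    err = min(max(err, eps), 1 - eps)

    # Learner weight. The log(K - 1) term is the multi-class correction:
    # a learner only needs to beat random guessing over K classes.
    alpha = math.log((1 - err) / err) + math.log(n_classes - 1)

    # Up-weight the misclassified samples, leave the rest unchanged.
    updated = [
        w * math.exp(alpha) if t != p else w
        for w, t, p in zip(weights, y_true, y_pred)
    ]

    # Renormalize so the weights form a distribution again.
    norm = sum(updated)
    return alpha, [w / norm for w in updated]

# Toy example: four samples, three classes, one misclassification.
weights = [0.25, 0.25, 0.25, 0.25]
y_true = [0, 1, 2, 1]
y_pred = [0, 1, 2, 0]
alpha, new_weights = samme_round(weights, y_true, y_pred, n_classes=3)
```

In this toy run the single misclassified sample ends up carrying most of the weight, which is what steers the next weak learner toward the hard examples. How BISA interacts with this loop (for instance, which hyper-parameters it searches over) is not specified in the abstract and is left out here.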

Author information

Corresponding author

Correspondence to L. T. Akin Sherly.

Ethics declarations

Conflict of interest

The author(s) declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Sherly, L.T.A., Jaya, T. An efficient indoor scene character recognition using Bayesian interactive search algorithm-based adaboost-CNN classifier. Neural Comput & Applic 33, 15345–15356 (2021). https://doi.org/10.1007/s00521-021-06161-w

  • DOI: https://doi.org/10.1007/s00521-021-06161-w
