Decade research on text detection in images/videos: a review

  • Special Issue
  • Journal: Evolutionary Intelligence

Abstract

Text present in an image or a video is a strong cue to its content, because it carries semantic information about the image or video frame. Detecting textual information in videos is currently a challenging and active research area in video processing and machine learning. Text detection plays a vital role in applications such as indexing, efficient retrieval, keyword-based image search, and event identification. However, detecting text regions in video faces several challenges, including low resolution, complex backgrounds, text alignment, and variation in size, color, and style. A large body of work exists on text detection, and these methods rely on different properties to distinguish a text region from its background in a video frame. The main aim of this paper is to present a comprehensive study of a decade of research on video text detection methods, which are categorized into horizontal text detection, arbitrarily oriented text detection, and multilingual text detection (Indian and non-Indian scenarios). The different challenges are explained with examples, and various applications are discussed to show the importance of the text detection process. Tables are provided for all categories to summarize useful information for readers. Finally, possible future directions are discussed for all categories, and methods are evaluated using datasets such as ICDAR 2003, ICDAR 2013, ICDAR 2015, the NUS dataset, TRECVID, YVT, MSRRC, SVT, MSRA, KAIST, Hau's dataset, the NEOCR dataset, an oriented scene text dataset, and an artificial text dataset, as well as self-collected horizontal, arbitrarily oriented, and multilingual text datasets.

Author information

Corresponding author

Correspondence to V. N. Manjunath Aradhya.

About this article

Cite this article

Manjunath Aradhya, V.N., Basavaraju, H.T. & Guru, D.S. Decade research on text detection in images/videos: a review. Evol. Intel. 14, 405–431 (2021). https://doi.org/10.1007/s12065-019-00248-z

  • DOI: https://doi.org/10.1007/s12065-019-00248-z
