Enhanced Characterness for Text Detection in the Wild

  • Aarushi AgrawalEmail author
  • Prerana Mukherjee
  • Siddharth Srivastava
  • Brejesh Lall
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 703)


Text spotting is an interesting research problem as text may appear at any random place and may occur in various forms. Moreover, ability to detect text opens the horizons for improving many advanced computer vision problems. In this paper, we propose a novel language agnostic text detection method utilizing edge-enhanced maximally stable extremal regions (MSERs) in natural scenes by defining strong characterness measures. We show that a simple combination of characterness cues helps in rejecting the non-text regions. These regions are further fine-tuned for rejecting the non-textual neighbor regions. Comprehensive evaluation of the proposed scheme shows that it provides comparative to better generalization performance to the traditional methods for this task.


Text detection HOG Enhanced MSER Stroke width 


  1. 1.
    Minetto, R., Thome, N., Cord, M., Leite, N.J., Stolfi, J.: Snoopertext: A text detection system for automatic indexing of urban scenes. Computer Vision and Image Understanding 122, 92–104 (2014)Google Scholar
  2. 2.
    Ye, Q., Doermann, D.: Text detection and recognition in imagery: A survey. IEEE transactions on pattern analysis and machine intelligence 37(7), 1480–1500 (2015)Google Scholar
  3. 3.
    Chen, H., Tsai, S.S., Schroth, G., Chen, D.M., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International Conference on Image Processing. pp. 2609–2612. IEEE (2011)Google Scholar
  4. 4.
    Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. pp. 3538–3545. IEEE (2012)Google Scholar
  5. 5.
    Huang, W., Lin, Z., Yang, J., Wang, J.: Text localization in natural images using stroke feature transform and text covariance descriptors. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 1241–1248 (2013)Google Scholar
  6. 6.
    Yi, C., Tian, Y.: Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification. IEEE Transactions on Image Processing 21(9), 4256–4268 (2012)Google Scholar
  7. 7.
    Yao, C., Bai, X., Shi, B., Liu, W.: Strokelets: A learned multi-scale representation for scene text recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 4042–4049 (2014)Google Scholar
  8. 8.
    Hanif, S.M., Prevost, L.: Text detection and localization in complex scene images using constrained adaboost algorithm. In: 2009 10th International Conference on Document Analysis and Recognition. pp. 1–5. IEEE (2009)Google Scholar
  9. 9.
    Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: Computer Vision, 2003. Proceedings. Nineth IEEE International Conference on. pp. 1470–1477. IEEE (2003)Google Scholar
  10. 10.
    De, S., Stanley, R.J., Cheng, B., Antani, S., Long, R., Thoma, G.: Automated text detection and recognition in annotated biomedical publication images. International Journal of Healthcare Information Systems and Informatics (IJHISI) 9(2), 34–63 (2014)Google Scholar
  11. 11.
    Fabrizio, J., Marcotegui, B., Cord, M.: Text detection in street level images. Pattern Analysis and Applications 16(4), 519–533 (2013)Google Scholar
  12. 12.
    Kan, C., Srinath, M.D.: Invariant character recognition with zernike and orthogonal fourier–mellin moments. Pattern recognition 35(1), 143–154 (2002)Google Scholar
  13. 13.
    He, T., Huang, W., Qiao, Y., Yao, J.: Text-attentional convolutional neural network for scene text detection. IEEE Transactions on Image Processing 25(6), 2529–2541 (2016)Google Scholar
  14. 14.
    Jaderberg, M., Simonyan, K., Vedaldi, A., Zisserman, A.: Reading text in the wild with convolutional neural networks. International Journal of Computer Vision 116(1), 1–20 (2016)Google Scholar
  15. 15.
    Li, Y., Jia, W., Shen, C., van den Hengel, A.: Characterness: An indicator of text in the wild. IEEE Transactions on Image Processing 23(4), 1666–1677 (2014)Google Scholar
  16. 16.
    Mukherjee, P., Lall, B., Shah, A.: Saliency map based improved segmentation. In: Image Processing (ICIP), 2015 IEEE International Conference on. pp. 1290–1294. IEEE (2015)Google Scholar
  17. 17.
    Otsu, N.: A threshold selection method from gray-level histograms. Automatica 11(285-296), 23–27 (1975)Google Scholar
  18. 18.
    Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. pp. 2963–2970. IEEE (2010)Google Scholar
  19. 19.
    Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. pp. 1083–1090. IEEE (2012)Google Scholar
  20. 20.
    Lee, S., Cho, M.S., Jung, K., Kim, J.H.: Scene text extraction with edge constraint and text collinearity. In: Pattern Recognition (ICPR), 2010 20th International Conference on. pp. 3983–3986. IEEE (2010)Google Scholar
  21. 21.
    Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., i Bigorda, L.G., Mestre, S.R., Mas, J., Mota, D.F., Almazan, J.A., de las Heras, L.P.: Icdar 2013 robust reading competition. In: Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. pp. 1484–1493. IEEE (2013)Google Scholar
  22. 22.
    Jahangiri, M., Petrou, M.: An attention model for extracting components that merit identification. In: 2009 16th IEEE International Conference on Image Processing (ICIP). pp. 965–968. IEEE (2009)Google Scholar
  23. 23.
    Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on. vol. 2, pp. II–II. IEEE (2004)Google Scholar
  24. 24.
    Gomez, L., Karatzas, D.: Multi-script text extraction from natural scenes. In: Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. pp. 467–471. IEEE (2013)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  • Aarushi Agrawal
    • 1
    Email author
  • Prerana Mukherjee
    • 2
  • Siddharth Srivastava
    • 2
  • Brejesh Lall
    • 2
  1. 1.Department of Electrical EngineeringIndian Institute of Technology KharagpurKharagpurIndia
  2. 2.Department of Electrical EngineeringIndian Institute of Technology DelhiDelhiIndia

Personalised recommendations