Text Localization in Born-Digital Images of Advertisements

  • Dirk Siegmund
  • Aidmar Wainakh
  • Tina Ebert
  • Andreas Braun
  • Arjan Kuijper
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10657)


Localizing text in images is an important step in many applications and is fundamental to optical character recognition. While text localization in born-digital images may look similar to other complex tasks in this field, it has certain distinct characteristics. Our novel approach combines the individual strengths of two commonly used methods, stroke width transform and extremal regions, with a method based on edge-based morphological growing. We present a parameter-free method that adapts flexibly to varying text sizes and colorful image elements. We evaluate our method on a novel image database of different retail brochures containing textual product information. Our results show a higher F-score than competing methods on this particular task.
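As a reminder of how the reported metric is defined, the following minimal Python sketch computes an F-score from detection counts. It is illustrative only: the function name `f_score` and its parameters are assumptions made for this example, and the abstract does not state the paper's matching protocol (how a detected text region is judged to be a true positive), so that step is not shown.

```python
def f_score(true_positives: int, false_positives: int,
            false_negatives: int, beta: float = 1.0) -> float:
    """Return the F-beta score for a set of detections.

    Counts are assumed to come from some region-matching step that is
    not shown here. Returns 0.0 when the score is undefined.
    """
    detected = true_positives + false_positives   # all regions the method reported
    actual = true_positives + false_negatives     # all ground-truth regions
    if true_positives == 0 or detected == 0 or actual == 0:
        return 0.0
    precision = true_positives / detected
    recall = true_positives / actual
    b2 = beta * beta
    # Weighted harmonic mean of precision and recall; beta = 1 gives F1.
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    # Hypothetical counts, not results from the paper.
    print(f_score(true_positives=6, false_positives=2, false_negatives=6))
```

With `beta` left at 1.0 the score weights precision and recall equally; a localization method that reports many spurious regions is penalized as much as one that misses text.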



This work was supported by the German Federal Ministry of Education and Research (BMBF) as well as by the Hessen State Ministry for Higher Education, Research and the Arts (HMWK) within CRISP.



Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. Fraunhofer Institute for Computer Graphics Research (IGD), Darmstadt, Germany
