Abstract
Characters are the basic components of text. Accurate character detection plays an important role in text detection and recognition. Previous character detectors tackle characters as independent objects, without considering the meaningful context information among them. In this paper, we propose a new module named constrained relation module which utilizes both the geometric and contextual information to exploit the strong relationship between characters. With this module, we build a new network named constrained relation network for character detection and recognition. To the best of our knowledge it is the first work to utilize contextual information among texts for character detection in scene images. The module can improve the detection results by suppressing the confusing text-like regions and recalling the hard examples. Experiments on SynthText, ICDAR2013 and SCUT-FORU demonstrate the effectiveness of our method on both detection and recognition tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bai, F., Cheng, Z., Niu, Y., Pu, S., Zhou, S.: Edit probability for scene text recognition. In: CVPR, pp. 1508–1516 (2018)
Cheng, Z., Bai, F., Xu, Y., Zheng, G., Pu, S., Zhou, S.: Focusing attention: towards accurate text recognition in natural images. In: ICCV. pp. 5076–5084 (2017)
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: CVPR, pp. 2963–2970 (2010)
Girshick, R.: Fast R-CNN. In: ICCV, pp. 1440–1448 (2015)
Gupta, A., Vedaldi, A., Zisserman, A.: Synthetic data for text localisation in natural images. In: CVPR, pp. 2315–2324 (2016)
He, P., Huang, W., He, T., Zhu, Q., Qiao, Y., Li, X.: Single shot text detector with regional attention. In: ICCV, pp. 3047–3055 (2017)
He, T., Tian, Z., Huang, W., Shen, C., Qiao, Y., Sun, C.: An end-to-end textspotter with explicit alignment and attention. In: CVPR, pp. 5020–5029 (2018)
He, W., Zhang, X.Y., Yin, F., Liu, C.L.: Deep direct regression for multi-oriented scene text detection. In: ICCV, pp. 745–753 (2017)
Hu, H., Gu, J., Zhang, Z., Dai, J., Wei, Y.: Relation networks for object detection. In: CVPR, pp. 3588–3597 (2018)
Hu, H., Zhang, C., Luo, Y., Wang, Y., Han, J., Ding, E.: WordSup: exploiting word annotations for character based text detection. In: ICCV, pp. 4940–4949 (2017)
Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: ICDAR, pp. 1484–1493 (2013)
Li, X., Wang, W., Hou, W., Liu, R.Z., Lu, T., Yang, J.: Shape robust text detection with progressive scale expansion network. arXiv:1806.02559 (2018)
Liao, M., Shi, B., Bai, X.: TextBoxes++: a single-shot oriented scene text detector. TIP 27(8), 3676–3690 (2018)
Liu, Y., Wang, R., Shan, S., Chen, X.: Structure inference net: object detection using scene-level context and instance-level relationships. In: CVPR, pp. 6985–6994 (2018)
Liu, Y., Jin, L.: Deep matching prior network: Toward tighter multi-oriented text detection. In: CVPR, pp. 1962–1969 (2017)
Nistér, D., Stewénius, H.: Linear time maximally stable extremal regions. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5303, pp. 183–196. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-88688-4_14
Prasad, S., Wai Kin Kong, A.: Using object information for spotting text. In: ECCV, pp. 540–557 (2018)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS, pp. 91–99 (2015)
Rosenfeld, A., Thurston, M.: Edge and curve detection for visual scene analysis. IEEE Trans. Comput. 20(5), 562–569 (1971)
Shi, B., Bai, X., Belongie, S.: Detecting oriented text in natural images by linking segments. In: CVPR, pp. 2550–2558 (2017)
Sung, M.C., Jun, B., Cho, H., Kim, D.: Scene text detection with robust character candidate extraction method. In: ICDAR, pp. 426–430. IEEE (2015)
Tian, S., Lu, S., Li, C.: WeText: scene text detection under weak supervision. In: ICCV, pp. 1492–1500 (2017)
Tian, Z., Huang, W., He, T., He, P., Qiao, Y.: Detecting text in natural image with connectionist text proposal network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 56–72. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_4
Vaswani, A., et al.: Attention is all you need. In: NIPS, pp. 5998–6008 (2017)
Xie, E., Zang, Y., Shao, S., Yu, G., Yao, C., Li, G.: Scene text detection with supervised pyramid context network. arXiv preprint arXiv:1811.08605 (2018)
Yao, C., Bai, X., Sang, N., Zhou, X., Zhou, S., Cao, Z.: Scene text detection via holistic, multi-channel prediction. arXiv:1606.09002 (2016)
Yao, C., Wu, W.: Mask TextSpotter: an end-to-end trainable neural network for spotting text with arbitrary shapes. In: ECCV, pp. 67–83 (2018)
Ye, Q., Doermann, D.: Text detection and recognition in imagery: a survey. TPAMI 37(7), 1480–1500 (2014)
Yin, X.C., Yin, X., Huang, K., Hao, H.W.: Robust text detection in natural scene images. TPAMI 36(5), 970–983 (2014)
Zhang, S., Lin, M., Chen, T., Jin, L., Lin, L.: Character proposal network for robust text extraction. In: ICASSP, pp. 2633–2637 (2016)
Zhou, X., Yao, C., Wen, H., Wang, Y., Zhou, S., He, W., Liang, J.: East: an efficient and accurate scene text detector. In: CVPR, pp. 5551–5560 (2017)
Zhu, A., Gao, R., Uchida, S.: Could scene context be beneficial for scene text detection? PR 58, 204–215 (2016)
Acknowledgments
This work is supported by the National Key R&D Program of China (2017YFB1002400) and the Strategic Priority Research Program of Chinese Academy of Sciences (XDC02000000).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, Y., Zhou, Y., Yang, D., Wang, W. (2019). Constrained Relation Network for Character Detection in Scene Images. In: Nayak, A., Sharma, A. (eds) PRICAI 2019: Trends in Artificial Intelligence. PRICAI 2019. Lecture Notes in Computer Science(), vol 11672. Springer, Cham. https://doi.org/10.1007/978-3-030-29894-4_11
Download citation
DOI: https://doi.org/10.1007/978-3-030-29894-4_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29893-7
Online ISBN: 978-3-030-29894-4
eBook Packages: Computer ScienceComputer Science (R0)