A Stackable Attention-Guided Multi-scale CNN for Number Plate Detection

  • Yixuan Wang
  • Shangdong Zheng
  • Wei Xu
  • Yang Xu
  • Tianming Zhan
  • Peng Zheng
  • Zhihui Wei
  • Zebin WuEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11901)


High-speed railway transportation faces various risks due to its increasing laying area. A key to find the abnormal conditions is to locate the position of a pillar nearby. The complex background and different size of number plate (NP) makes the detection of number plate in overhead catenary system (OCS) a hard work. In this paper, we propose a novel framework and solution with two main advances: (1) a stackable attention model, which can improve the robustness of the system in complex scenarios; (2) multi-scale feature fusion stage, improve detection accuracy of distance pillar number plate. Both of them can be integrated into any CNN architectures seamlessly with negligible overheads and are end-to-end trainable along with base CNNs. We demonstrate the effectiveness of our method with experiments on different high-speed train lines and one benchmark dataset – Pascal VOC [4].


Number plate detection Attention model Multi-scale 


  1. 1.
    Bulan, O., Kozitsky, V., Ramesh, P., Shreve, M.: Segmentation-and annotation-free license plate recognition with deep localization and failure identification. IEEE Trans. Intell. Transp. Syst. 18(9), 2351–2363 (2017)CrossRefGoogle Scholar
  2. 2.
    Deb, K., Hossen, M.K., Khan, M.I., Alam, M.R.: Bangladeshi vehicle license plate detection method based on HSI color model and geometrical properties. In: 2012 7th International Forum on Strategic Technology (IFOST), pp. 1–5. IEEE (2012)Google Scholar
  3. 3.
    Deb, K., Jo, K.H.: HSI color based vehicle license plate detection. In: 2008 International Conference on Control, Automation and Systems, pp. 687–691. IEEE (2008)Google Scholar
  4. 4.
    Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The Pascal visual object classes (VOC) challenge. Int. J. Comput. Vision 88(2), 303–338 (2010)CrossRefGoogle Scholar
  5. 5.
    Hsu, G.S., Ambikapathi, A., Chung, S.L., Su, C.P.: Robust license plate detection in the wild. In: 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 1–6. IEEE (2017)Google Scholar
  6. 6.
    Lalimi, M.A., Ghofrani, S., McLernon, D.: A vehicle license plate detection method using region and edge based methods. Comput. Electr. Eng. 39(3), 834–845 (2013)CrossRefGoogle Scholar
  7. 7.
    Li, H., Wang, P., Shen, C.: Towards end-to-end car license plates detection and recognition with deep neural networks. Corr abs/1709.08828 (2017)Google Scholar
  8. 8.
    Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). Scholar
  9. 9.
    Long, S., He, X., Ya, C.: Scene text detection and recognition: the deep learning era. arXiv preprint arXiv:1811.04256 (2018)
  10. 10.
    Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)Google Scholar
  11. 11.
    Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)Google Scholar
  12. 12.
    Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
  13. 13.
    Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)Google Scholar
  14. 14.
    Silva, S.M., Jung, C.R.: Real-time Brazilian license plate detection and recognition using deep convolutional neural networks. In: 2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), pp. 55–62. IEEE (2017)Google Scholar
  15. 15.
    Wang, F., et al.: Residual attention network for image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156–3164 (2017)Google Scholar
  16. 16.
    Wu, X., Yuan, P., Peng, Q., Ngo, C.W., He, J.Y.: Detection of bird nests in overhead catenary system images for high-speed rail. Pattern Recogn. 51, 242–254 (2016)CrossRefGoogle Scholar
  17. 17.
    Xie, L., Ahmad, T., Jin, L., Liu, Y., Zhang, S.: A new CNN-based method for multi-directional car license plate detection. IEEE Trans. Intell. Transp. Syst. 19(2), 507–517 (2018)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Yixuan Wang
    • 1
  • Shangdong Zheng
    • 1
  • Wei Xu
    • 2
  • Yang Xu
    • 1
  • Tianming Zhan
    • 1
  • Peng Zheng
    • 1
  • Zhihui Wei
    • 1
  • Zebin Wu
    • 1
    Email author
  1. 1.School of Computer Science and EngineeringNanjing University of Science and TechnologyNanjingChina
  2. 2.China Railway Shanghai Group Co., Ltd., Nanjing Power Supply SectionNangJingChina

Personalised recommendations