Skip to main content
Log in

Distracted driving detection based on the improved CenterNet with attention mechanism

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Distracted driving detection has many significant application scenarios in intelligent transportation, driver assistance, and other fields. However, these distracted behaviors are difficult to be recognized due to the variable background and different scale targets. To solve these problems, a distracted driving detection scheme is proposed based on the improved CenterNet with attention mechanism in this paper. Given the complexity of driving environments, an image classification method was first designed to divide the images into person and unmanned areas, which can reduce the interference in unmanned situations. And then a novel attention mechanism module was introduced into CenterNet to improve its detection ability for small targets. Numerous experiments were conducted with a public dataset and newly built targeted dataset that included three categories of distracted driving behaviors with 6481 pictures. The results demonstrated that, the proposed scheme can detect distracted behaviors in real time while driving with a mean average precision (mAP) of 97.0%, which outperforms some representative detection methods, such as CornerNet, YOLO v3 and YOLO v4.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

Not applicable.

Code availability

Not applicable.

References

  1. Alex K, Sutskever I, Hinton GE (2017) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 60(6):84–90

    Google Scholar 

  2. Alexey B, Wang CY, Liao HYM (2020) YOLOv4: Optimal Speed and precision of Object Detection. arXiv:2004.10934.

  3. Alotaibi M, Alotaibi B (2020) Distracted driver classification using deep learning. Signal Image Video Proces 14:617–624. https://doi.org/10.1007/s11760-019-01589-z

  4. Cui Z et al (2020) Ship detection in large-scale SAR images via spatial shuffle-group enhance attention. IEEE Trans Geosci Remote Sens (99):1–13

  5. Cai Z, Fan Q, Feris RS, Vasconcelos N (2016) A unified multi-scale deep convolutional neural network for fast object detection. Proceedings of the Eurpean Conference on Computer Vision (ECCV), 2016, Lecture Notes in Computer Science, Springer, Cham. https://doi.org/10.1007/978-3-319-46493-0_22

  6. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), 2005, pp 886–893. https://doi.org/10.1109/CVPR.2005.177

  7. Dhillon A, Verma GK (2020) Convolutional neural network: a review of models, methodologies and applications to object detection. In: Progress in Artificial Intelligence, pp 85–112. https://doi.org/10.1007/s13748-019-00203-0

  8. Duan K, Bai S, Xie L, Qi H, Huang Q, Tian Q (2019) CenterNet: Keypoint triplets for object detection. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp 6569–6578.

  9. Fisher Y, Dequan W, Evan S, Trevor D (2018) Deep layer aggregation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 2403–2412

  10. Girshick R (2015) Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision, pp 1440–1448

  11. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation.2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, pp 580–587. https://doi.org/10.1109/CVPR.2014.81

  12. Gu J, Wang Z, Kuen J, Ma L, Shahroudy A, Shuai B et al (2018) Recent advances in convolutional neural networks. Pattern Recogn 77:354–377

    Article  Google Scholar 

  13. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770–778. https://doi.org/10.1109/CVPR.2016.90

  14. Hei L, Jia D (2018) CornerNet: detecting objects as paired keypoints. Proceedings of the European Conference on Computer Vision (ECCV), pp 734–750. arXiv:1808.01244

  15. Hei L, Yun T, Olga R, Jia D (2019) CornerNet-Lite: efficient keypoint based object detection. arXiv:1904.08900.

  16. Hesham ME, Yehya A, Mohamed HS, Mohamed NM (2019) Driver Distraction Identification with an Ensemble of Convolutional Neural Networks. J Adv Transp Mach Learn Transp (MLT) Issue. https://doi.org/10.1155/2019/4125865

  17. Jia S, Zhang Y (2018)Saliency-based deep convolutional neural network for no-reference image quality assessment. Multimed Tools Appl 77:14859–14872

    Article  Google Scholar 

  18. Jie H, Li S, Samuel A, Gang S, Enhua W (2018)Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 7132–7141. arXiv:1709.01507v4

  19. Karen S, Andrew Z (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556

  20. Koesdwiady A, Bedawi SM, Ou C, Karray F (2017)End-to-end deep learning for driver distraction recognition. Image Analysis and Recognition, ICIAR, pp 11–18. https://doi.org/10.1007/978-3-319-59876-5_2

  21. Li Q, Hu R, Wang Z, Ding Z (2021) Driving behavior-aware network for 3D object tracking in complex traffic scenes. IEEE Access 9:51550–51560

    Article  Google Scholar 

  22. Liu L, Ouyang W, Wang X et al (2020) Deep learning for generic object detection: a survey. Int J Comput Vis 128:261–318. https://doi.org/10.1007/s11263-019-01247-4

    Article  MATH  Google Scholar 

  23. Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 7263–7271

  24. Redmon J, Farhadi A (2018) YOLOv3: An Incremental Improvement. arXiv:1804.02767

  25. Roy AG, Navab N, Wachinger C (2018) Concurrent spatial and channel ‘Squeeze & Excitation’ in fully convolutional networks. Medical Image Computing and Computer Assisted Intervention (MICCAI). Lecture Notes in Computer Science, vol 11070. Springer, Cham

    Google Scholar 

  26. Tran D, Ha Manh D, Weihua S, He B, Girish C (2018)Real-time detection of distracted driving based on deep learning. IET Intell Transp Syst 12(10):1210–1219. https://doi.org/10.1049/iet-its.2018.5172

  27. Yehya A, Hesham ME, Mohamed NM (2018)Real-time distracted driver posture classification. Machine Learning for Intelligent Transportation Systems Workshop in the 32nd Conference on Neural Information Processing Systems, Montréal, Canada. arXiv:1706.09498v3

  28. Yi P, Wang Z, Jiang K, Jiang J, Lu T, Ma J (2020) A progressive fusion generative adversarial network for realistic and consistent video super-resolution. In IEEE Transactions on Pattern Analysis and Machine Intelligence. https://doi.org/10.1109/TPAMI.2020.3042298

  29. Zhou X, Wang D, Krhenbühl P (2019) Objects as points. arXiv:1904.07850

Download references

Acknowledgements

This work was supported in part by the National Natural Science Foundation of China (Grant No. 61671412), Zhejiang Provincial Natural Science Foundation of China (Grant No. LY19F010002, LY21F010014), Natural Science Foundation of Ningbo, China (Grant No. 2018A610053, 202003N4323), Ningbo Municipal Projects for Leading and Top Talents (Grant No. NBLJ201801006), General Scientific Research Project of Zhejiang Education Department (Grant No. Y201941122), School level scientific research and innovation team project, and Fundamental Research Funds for Zhejiang Provincial Colleges and Universities.

Author information

Authors and Affiliations

Authors

Contributions

Qingqing Zhang: Ideas; Creation of models; Software; Zhongjie Zhu: Conceptualization; Investigation; Review & Editing; Funding acquisition; Yongqiang Bai: Methodology; Formal analysis; Funding acquisition; Validation; Review & Editing; Guanglong Liao: Evidence collection; Original Draft; Evidence collection; Tingna Liu: Visualization; Data Curation.

Corresponding authors

Correspondence to Zhongjie Zhu or Yongqiang Bai.

Ethics declarations

Conflict of interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Q., Zhu, Z., Bai, Y. et al. Distracted driving detection based on the improved CenterNet with attention mechanism. Multimed Tools Appl 81, 7993–8005 (2022). https://doi.org/10.1007/s11042-022-12128-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-12128-3

Keywords

Navigation