Abstract
Distracted driving detection has many significant application scenarios in intelligent transportation, driver assistance, and other fields. However, these distracted behaviors are difficult to be recognized due to the variable background and different scale targets. To solve these problems, a distracted driving detection scheme is proposed based on the improved CenterNet with attention mechanism in this paper. Given the complexity of driving environments, an image classification method was first designed to divide the images into person and unmanned areas, which can reduce the interference in unmanned situations. And then a novel attention mechanism module was introduced into CenterNet to improve its detection ability for small targets. Numerous experiments were conducted with a public dataset and newly built targeted dataset that included three categories of distracted driving behaviors with 6481 pictures. The results demonstrated that, the proposed scheme can detect distracted behaviors in real time while driving with a mean average precision (mAP) of 97.0%, which outperforms some representative detection methods, such as CornerNet, YOLO v3 and YOLO v4.
Similar content being viewed by others
Data availability
Not applicable.
Code availability
Not applicable.
References
Alex K, Sutskever I, Hinton GE (2017) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 60(6):84–90
Alexey B, Wang CY, Liao HYM (2020) YOLOv4: Optimal Speed and precision of Object Detection. arXiv:2004.10934.
Alotaibi M, Alotaibi B (2020) Distracted driver classification using deep learning. Signal Image Video Proces 14:617–624. https://doi.org/10.1007/s11760-019-01589-z
Cui Z et al (2020) Ship detection in large-scale SAR images via spatial shuffle-group enhance attention. IEEE Trans Geosci Remote Sens (99):1–13
Cai Z, Fan Q, Feris RS, Vasconcelos N (2016) A unified multi-scale deep convolutional neural network for fast object detection. Proceedings of the Eurpean Conference on Computer Vision (ECCV), 2016, Lecture Notes in Computer Science, Springer, Cham. https://doi.org/10.1007/978-3-319-46493-0_22
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), 2005, pp 886–893. https://doi.org/10.1109/CVPR.2005.177
Dhillon A, Verma GK (2020) Convolutional neural network: a review of models, methodologies and applications to object detection. In: Progress in Artificial Intelligence, pp 85–112. https://doi.org/10.1007/s13748-019-00203-0
Duan K, Bai S, Xie L, Qi H, Huang Q, Tian Q (2019) CenterNet: Keypoint triplets for object detection. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp 6569–6578.
Fisher Y, Dequan W, Evan S, Trevor D (2018) Deep layer aggregation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 2403–2412
Girshick R (2015) Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision, pp 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation.2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, pp 580–587. https://doi.org/10.1109/CVPR.2014.81
Gu J, Wang Z, Kuen J, Ma L, Shahroudy A, Shuai B et al (2018) Recent advances in convolutional neural networks. Pattern Recogn 77:354–377
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770–778. https://doi.org/10.1109/CVPR.2016.90
Hei L, Jia D (2018) CornerNet: detecting objects as paired keypoints. Proceedings of the European Conference on Computer Vision (ECCV), pp 734–750. arXiv:1808.01244
Hei L, Yun T, Olga R, Jia D (2019) CornerNet-Lite: efficient keypoint based object detection. arXiv:1904.08900.
Hesham ME, Yehya A, Mohamed HS, Mohamed NM (2019) Driver Distraction Identification with an Ensemble of Convolutional Neural Networks. J Adv Transp Mach Learn Transp (MLT) Issue. https://doi.org/10.1155/2019/4125865
Jia S, Zhang Y (2018)Saliency-based deep convolutional neural network for no-reference image quality assessment. Multimed Tools Appl 77:14859–14872
Jie H, Li S, Samuel A, Gang S, Enhua W (2018)Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 7132–7141. arXiv:1709.01507v4
Karen S, Andrew Z (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556
Koesdwiady A, Bedawi SM, Ou C, Karray F (2017)End-to-end deep learning for driver distraction recognition. Image Analysis and Recognition, ICIAR, pp 11–18. https://doi.org/10.1007/978-3-319-59876-5_2
Li Q, Hu R, Wang Z, Ding Z (2021) Driving behavior-aware network for 3D object tracking in complex traffic scenes. IEEE Access 9:51550–51560
Liu L, Ouyang W, Wang X et al (2020) Deep learning for generic object detection: a survey. Int J Comput Vis 128:261–318. https://doi.org/10.1007/s11263-019-01247-4
Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 7263–7271
Redmon J, Farhadi A (2018) YOLOv3: An Incremental Improvement. arXiv:1804.02767
Roy AG, Navab N, Wachinger C (2018) Concurrent spatial and channel ‘Squeeze & Excitation’ in fully convolutional networks. Medical Image Computing and Computer Assisted Intervention (MICCAI). Lecture Notes in Computer Science, vol 11070. Springer, Cham
Tran D, Ha Manh D, Weihua S, He B, Girish C (2018)Real-time detection of distracted driving based on deep learning. IET Intell Transp Syst 12(10):1210–1219. https://doi.org/10.1049/iet-its.2018.5172
Yehya A, Hesham ME, Mohamed NM (2018)Real-time distracted driver posture classification. Machine Learning for Intelligent Transportation Systems Workshop in the 32nd Conference on Neural Information Processing Systems, Montréal, Canada. arXiv:1706.09498v3
Yi P, Wang Z, Jiang K, Jiang J, Lu T, Ma J (2020) A progressive fusion generative adversarial network for realistic and consistent video super-resolution. In IEEE Transactions on Pattern Analysis and Machine Intelligence. https://doi.org/10.1109/TPAMI.2020.3042298
Zhou X, Wang D, Krhenbühl P (2019) Objects as points. arXiv:1904.07850
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China (Grant No. 61671412), Zhejiang Provincial Natural Science Foundation of China (Grant No. LY19F010002, LY21F010014), Natural Science Foundation of Ningbo, China (Grant No. 2018A610053, 202003N4323), Ningbo Municipal Projects for Leading and Top Talents (Grant No. NBLJ201801006), General Scientific Research Project of Zhejiang Education Department (Grant No. Y201941122), School level scientific research and innovation team project, and Fundamental Research Funds for Zhejiang Provincial Colleges and Universities.
Author information
Authors and Affiliations
Contributions
Qingqing Zhang: Ideas; Creation of models; Software; Zhongjie Zhu: Conceptualization; Investigation; Review & Editing; Funding acquisition; Yongqiang Bai: Methodology; Formal analysis; Funding acquisition; Validation; Review & Editing; Guanglong Liao: Evidence collection; Original Draft; Evidence collection; Tingna Liu: Visualization; Data Curation.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, Q., Zhu, Z., Bai, Y. et al. Distracted driving detection based on the improved CenterNet with attention mechanism. Multimed Tools Appl 81, 7993–8005 (2022). https://doi.org/10.1007/s11042-022-12128-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12128-3