Abstract
In order to realize real-time 3D environment perception of UAVs and autopilot in low-altitude complex road scenes, a neural network model RobNet based on SqueezeNet and cyclic CRF for real-time 3D point cloud segmentation is proposed to segment the road objects in real time. Firstly, the unordered, scattered 3D point cloud data are preprocessed into a standard data format similar to an image by a spherical mapping method. Then, at the macro-level of the model design, the SqueezeNet network with the residual connection optimization is selected as the basic network of the model, and then, the conditional random field (CRF) algorithm which is processed into the cyclic network structure is used to refine the segmentation result. Finally, the construction of the basic network, the cyclic network and the network parameter settings in the model is elaborated at the micro-level. The experimental results show that the RobNet model proposed in this paper can segment the target in the road scene better. The segmentation callback rate of the three types of vehicles, pedestrians and cyclists is increased by 28, 2 and 17%, respectively, compared with the VoxelNet network. The higher callback rate is in line with the safe movement specifications for drones and autonomous driving. At the same time, the proposed model parameters are small, 98.5% smaller than the classic network AlexNet, and are easy to deploy on a platform with limited computing resources. The RobNet model in the Robot Operating System (ROS) framework engineering deployment and implementation experimental data shows that the model meets the real-time and stability requirements of the drone and automatic driving application, engineering code can run in real time at 12 Hz, the standard deviation of each frame’s running time is around 4.5 ms.
Similar content being viewed by others
References
Bhushan K, Gupta BB (2017) Security challenges in cloud computing: state-of-art. Int J Big Data Intell 4(2):81–107
Chen LC, Papandreou G, Kokkinos I et al (2016) DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848
Chen L, Hu X, Xu T et al (2017a) Turn signal detection during nighttime by CNN detector and perceptual hashing tracking. IEEE Trans Intell Transp Syst 18:1–12
Chen X, Kundu K, Zhu Y et al (2017b) 3D object proposals using stereo imagery for accurate object class detection. IEEE Trans Pattern Anal Mach Intell 40:1
Engelmann F, Kontogianni T, Hermans A, et al. (2017) Exploring spatial context for 3D semantic segmentation of point clouds. In: 2017 IEEE international conference on computer vision workshop (ICCVW). IEEE
Hayashi A, Shirako J, Tiotto E, Ho R, Sarkar V (2019) Performance evaluation of OpenMP’s target construct on GPUs-exploring compiler optimisations. IJHPCN 13(1):54–69
Hussain K, Mohd Salleh MN, Leman AM (2016) Optimization of ANFIS using mine blast algorithm for predicting strength of Malaysian small medium enterprises. In: International conference on fuzzy systems and knowledge discovery. IEEE
Jie L, Jian C, Lei W (2018) Design of multi-mode UAV human–computer interaction system. In: 2017 IEEE international conference on unmanned systems (ICUS)
Li L, Zhu H, Yang G, Qian J (2014) Referenceless measure of blocking artifacts by Tchebichef kernel analysis. IEEE Signal Process Lett 21(1):122–125
Li L, Wu D, Wu J, Li H, Lin W, Kot AC (2016a) Image sharpness assessment by sparse representation. IEEE Trans Multimed 18(6):1085–1097
Li L, Lin W, Wang X, Yang G, Bahrami K, Kot AC (2016b) No-reference image blur assessment based on discrete orthogonal moments. IEEE Trans Cybern 46(1):39–50
Li L, Xia W, Lin W, Fang Y, Wang S (2017) No-reference and robust image sharpness evaluation based on multiscale spatial and spectral features. IEEE Trans Multimed 19(5):1030–1040
Liao Z, Gao L, Zhou T et al (2019) An oil painters recognition method based on cluster multiple kernel learning algorithm. IEEE Access 7:26842–26854
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: CVPR
Marmanis D, Datcu M, Esch T et al (2016) Deep learning earth observation classification using ImageNet pretrained networks. IEEE Geosci Remote Sens Lett 13(1):105–109
Qi CR, Su H, Mo K, et al. (2017) PointNet: deep learning on point sets for 3D classification and segmentation. In: IEEE conference on computer vision and pattern recognition (CVPR)
Pan Jeng-Shyang, Kong Lingping, Sung Tien-Wen, Tsai Pei-Wei, Snasel Waclav (2018) α-fraction first strategy for hierarchical wireless sensor networks. J Internet Technol 19(6):1717–1726
Pani D, Barabino G, Citi L et al (2016) Real-time neural signals decoding onto off-the-shelf DSP processors for neuroprosthetic applications. IEEE Trans Neural Syst Rehabilit Eng 24:1
Rubino C, Crocco M, Bue AD (2018) 3D object localisation from multi-view image detections. IEEE Trans Pattern Anal Mach Intell 40(99):1
Schlosser J, Chow CK, Kira Z (2016) Fusing lidar and images for pedestrian detection using convolutional neural networks. In: 2016 IEEE international conference on robotics and automation (ICRA). IEEE, pp 2198–2205
Shin MO, Oh GM, Kim SW et al (2017) Real-time and accurate segmentation of 3-D point clouds based on Gaussian process regression. IEEE Trans Intell Transp Syst 18:1–15
Szegedy C, Liu W, Jia Y, et al. (2015) Going deeper with convolutions. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR). IEEE
Wang DZ, Posner I, Newman P (2012) What could move? Finding cars, pedestrians and bicyclists in 3d laser data. In: Robotics and automation (ICRA)
Wen H, Wu J, Pan F et al (2019) Deep-learning-based physical layer authentication for industrial wireless sensor networks. Sensors 19(11):2440
Wolf D, Prankl J, Vincze M (2015) Fast semantic segmentation of 3D point clouds using a dense CRF with learned parameters. In: IEEE international conference on robotics and automation
Wu J, Guo S, Huang H et al (2018) Information and communications technologies for sustainable development goals: state-of-the-art. Needs Perspect IEEE Commun Surv Tutor 20(3):2389–2406
Xiong T et al (2018) Robust student’s-t mixture modelling via Markov random field and its application in image segmentation. High Perform Comput Netw 11(4):342–350
Xu L, Wan P, Wang Y, Liang T (2019) A similarity algorithm based on hamming distance used to detect malicious users in cooperative spectrum sensing. Int J High Perform Comput Netw 14(1):112–119
Yi L, Kunya G, Zhuo D et al (2019) Design and implementation of an Openflow SDN controller in NS-3 discrete-event network simulator. J High Perform Computing and Networking 14(1):17–29
Yu Y, Li J, Guan H et al (2015a) Semiautomated extraction of street light poles from mobile LiDAR point-clouds. IEEE Trans Geosci Remote Sens 53(3):1374–1386
Yu Y, Li J, Guan H et al (2015b) Learning hierarchical features for automated extraction of road markings from 3-D mobile LiDAR point clouds. IEEE J Sel Topics Appl Earth Obs Remote Sens 8(2):709–726
Zhang Y, Wu J, Cai J (2016) Compact representation of high-dimensional feature vectors for large-scale image recognition and retrieval. IEEE Trans Image Process A Publ IEEE Signal Process Soc 25(5):2407–2419
Zheng S, Jayasumana S, Romera-Paredes B, Vineet V, Su Z, Du D, Huang C, Torr PH (2015) Conditional random fields as recurrent neural networks. In: Proceedings of the IEEE international conference on computer vision, pp 1529–1537
Zhou Y, Tuzel O (2018) VoxelNet: end-to-end learning for point cloud based 3D object detection. In: IEEE/CVF conference on computer vision and pattern recognition
Acknowledgements
We would like to thank the anonymous reviewers and the associate editor for their valuable comments and suggestions to improve the quality of the manuscript. This work was supported by National Nature Science Foundation of China (NSFC) under Grants 61671356, 61703403, 61601352.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declared that they have no conflicts of interest to this work. We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.
Additional information
Communicated by B. B. Gupta.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sun, W., Zhang, Z. & Huang, J. RobNet: real-time road-object 3D point cloud segmentation based on SqueezeNet and cyclic CRF. Soft Comput 24, 5805–5818 (2020). https://doi.org/10.1007/s00500-019-04355-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-019-04355-y