Abstract
Automatic target recognition is critical in infrared imaging guidance. However, since the diversity of the environment, the infrared data are often complex and difficult to analyze accurately. We proposed a deep learning infrared target detection framework based on transposed convolution and fusion modules (TF-SSD). Compared with one-stage detector YOLOv5 and two-stage detector Faster R-CNN, TF-SSD has three highlights: (1) using the visualization method to revise the network structure and improve the training efficiency; (2) using transposed convolution operation to increase feature extraction ability and detection efficiency; (3) using multi-scale feature fusion models to realize the skip connection between the high-level network and the low-level network. Experimental results on our dataset of six common flight attitudes demonstrate that the maximum average precision (AP) is 90.9%, the minimum average precision is 79.8%, and the overall mean average precision (mAP) is 85.7%. It is confirmed that our proposed TF-SSD system can effectively recognize infrared targets.
Similar content being viewed by others
Availability of data and materials
Not applicable.
References
Chen, Z., Shi, K., Chen, N., et al.: The experimental study about laser induced dizziness effect of medium-wave infrared seeker which based on image processing. J. Vis. Commun. Image Represent. 59, 401–406 (2019)
Li, X., Jiang, H., Niu, M., et al.: An enhanced selective ensemble deep learning method for rolling bearing fault diagnosis with beetle antennae search algorithm. Mech. Syst. Signal Process. 142, 1–20 (2020)
Zhang, Z., Liu, Y., Chen, X., et al.: Sequential optimization for efficient high-quality object proposal generation. IEEE Trans. Pattern Anal. Mach. Intell. 40(5), 1209–1223 (2018)
Shen, F., Deng, H., Yu, L., et al.: Open-source mobile multispectral imaging system and its applications in biological sample sensing. Spectrochim Acta A 280, 121504 (2022)
Divya, S., Paul, S., Pati, U.C.: Structure tensor-based SIFT algorithm for SAR image registration. IET Image Process. 14(5), 929–938 (2020)
Alahmadi, A., Hussain, M., Aboalsamh, H.A., et al.: PCAPooL: unsupervised feature learning for face recognition using PCA, LBP, and pyramid pooling. Pattern Anal. Appl. 23(2), 673–682 (2020)
Huang, H., Liu, J.H., Liu, S.L., et al.: A method for classifying tube structures based on shape descriptors and a random forest classifier. Measurement 158, 107705 (2020)
Xu, X., Duan, H., Guo, Y., et al.: A cascade adaboost and CNN algorithm for drogue detection in UAV autonomous aerial refueling. Neurocomputing 408, 121–134 (2020)
Wang, T., Shen, F., Deng, H., et al.: Smartphone imaging spectrometer for egg/meat freshness monitoring, Anal. Methods-UK 14(5), 508–517 (2022)
Shen, C.: A transdisciplinary review of deep learning research and its relevance for water resource scientists. Water Resour. Res. 54(11), 8558–8593 (2019)
Luo, Y., Irisson, J.O., Graham, B., Guigand, C.: Automated plankton image analysis using convolutional neural networks. Limnol. Oceanogr. Methods 16(12), 814–827 (2018)
Kousik, N., Natarajan, Y., Raja, R., et al.: Improved salient object detection using hybrid convolution recurrent neural network. Expert Syst. Appl. 166, 1–26 (2020)
Gan, B.: Stacked contractive auto-encoders application in identification of pharmaceuticals. Spectrosc. Spectr. Anal. 39(1), 96–102 (2019)
Wang, R., Xu, J.: Object instance detection with pruned Alexnet and extended training data. Signal Process. Image Commun. 70, 145–156 (2019)
Xu, X., Xie, M., Miao, P., et al.: Perceptual-aware sketch simplification based on integrated VGG layers. IEEE Trans. Vis. Comput. Gr. 27(1), 178–189 (2020)
Yang, N., Zhang, Z., Yang, J., et al.: A convolutional neural network of GoogLeNet applied in mineral prospectivity prediction based on multi-source geoinformation. Nat. Resour. Res. 30(6), 3905–3923 (2021)
Wu, Z., Shen, C., Hengel, A.: Revisiting the ResNet model for visual recognition. Pattern Recogn. 90, 119–133 (2019)
Ju, M., Luo, J., Liu, G., et al.: ISTDet: an efficient end-to-end neural network for infrared small target detection. Infrared Phys. Technol. 114, 1–8 (2021)
Hou, Q., Wang, Z., Tan, F., et al.: RISTDnet: robust infrared small target detection network. IEEE Geosci. Remote Lett. 27(22), 61–64 (2021)
Ding, L., Xu, X., Cao, Y., et al.: Detection and tracking of infrared small target by jointly using SSD and pipeline filter. Digit. Signal Process. 110, 1–9 (2021)
Fang, F., Li, L., Zhu, H., et al.: Combining faster R-CNN and model-driven clustering for elongated object detection. IEEE Trans. Image Process. 29(1), 2052–2065 (2020)
Si, J., Lin, J., Jiang, F., et al.: Hand-raising gesture detection in real classrooms using improved R-FCN. Neyrocomputing 359, 69–76 (2019)
Shen, Z., Liu, Z., Li, J., et al.: Object detection from scratch with deep supervision. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 398–412 (2020)
Zhang, H., Tian, Y., Wang, K., et al.: Mask SSD: an effective single-stage approach to object instance segmentation. IEEE Trans. Image Process. 29(1), 2078–2093 (2020)
Xu, X., Zhang, X., Zhang, T.: Lite-YOLOv5: a lightweight deep learning detector for on-board ship detection in large-scene sentinel-1 SAR images. Remote Sens. 14(4), 1018 (2022)
Liang, X., Zhang, J., Zhuo, L., et al.: Small object detection in unmanned aerial vehicle images using feature fusion and scaling-based single shot detector with spatial context analysis. IEEE Trans. Circuits Syst. Video Technol. 30(6), 1758–1770 (2020)
Acknowledgements
We would like to thank the anonymous reviewers of this paper for their valuable and constructive comments.
Funding
This work is supported by Aeronautical Science Foundation of China (20170142002) and Natural Science Foundation of Henan Province (162300410095).
Author information
Authors and Affiliations
Contributions
LX and BC designed the framework of models. PX and LX studied conception. BC and LX established the experimental platform. LX collected experimental data. LX, FZ, and PX summarized and discussed experimental results. LX and BC wrote the main manuscript text. LX and FZ prepared Figs. 1–7 and Tables 1–2. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xu, L., Cao, B., Xu, P. et al. Infrared target detection using deep learning algorithms. SIViP 17, 3993–4000 (2023). https://doi.org/10.1007/s11760-023-02629-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-023-02629-5