Abstract
This paper describes a new method for moving object detection using an IMU sensor and instance image segmentation. In the proposed method, feature points are first extracted by a detector, and an initial fundamental matrix is calculated from the IMU data. The epipolar lines derived from this matrix are then used to classify the extracted feature points. From the matched background feature points, the fundamental matrix is recalculated iteratively to minimize the classification error. After the feature point classification, instance image segmentation is applied to refine the result. The proposed method is implemented, tested on real-world driving videos, and compared with previous works.
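The core classification step described above checks how far each matched feature point lies from the epipolar line induced by the current fundamental matrix: points that satisfy the epipolar constraint are treated as background, the rest as candidate moving points. A minimal sketch of that test, assuming NumPy and hypothetical function names (the paper's actual implementation details, thresholds, and IMU-based initialization are not shown here):

```python
import numpy as np

def epipolar_distance(F, pts1, pts2):
    """Distance of each point in pts2 to the epipolar line F @ x1.

    F: 3x3 fundamental matrix; pts1, pts2: (N, 2) matched pixel coordinates.
    """
    n = len(pts1)
    # Lift matches to homogeneous coordinates
    x1 = np.hstack([pts1, np.ones((n, 1))])
    x2 = np.hstack([pts2, np.ones((n, 1))])
    # Epipolar lines in image 2 induced by points in image 1; each row is (a, b, c)
    l2 = x1 @ F.T
    # Point-to-line distance |a*x + b*y + c| / sqrt(a^2 + b^2)
    num = np.abs(np.sum(l2 * x2, axis=1))
    den = np.sqrt(l2[:, 0] ** 2 + l2[:, 1] ** 2)
    return num / den

def classify_background(F, pts1, pts2, thresh=1.0):
    """True for matches consistent with the epipolar constraint (background)."""
    return epipolar_distance(F, pts1, pts2) < thresh
```

In the iterative step of the pipeline, the fundamental matrix would be re-estimated from only the points labeled `True`, and the classification repeated until it stabilizes; `thresh` here is an illustrative pixel tolerance, not a value from the paper.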
Acknowledgements
This work was supported by the IT R&D program of MSIT/IITP [R2020040040, Development of 5G-based 3D spatial scanning device technology for virtual space composition].
Cite this article
Jung, S., Cho, Y., Lee, K. et al. Moving Object Detection with Single Moving Camera and IMU Sensor using Mask R-CNN Instance Image Segmentation. Int. J. Precis. Eng. Manuf. 22, 1049–1059 (2021). https://doi.org/10.1007/s12541-021-00527-9