Skip to main content

Memory Consumption and Computation Efficiency Improvements of Viola-Jones Object Detection Method for UAVs

  • Conference paper
  • First Online:
Pattern Recognition. ICPR International Workshops and Challenges (ICPR 2021)

Abstract

In this paper, we consider object classification and detection problems for autonomous UAVs. We propose an algorithm that is effective from the point of view of computational complexity and memory consumption. The proposed algorithm can be successfully used as a basic tool for building an autonomous UAV control system. The algorithm is based on the Viola-Jones method. It is shown in the paper, that the Viola-Jones method is the most preferable approach to detect objects on-board UAVs because it needs the least amount of memory and the number of computational operations to solve the object detection problem. To ensure sufficient accuracy, we use a modified feature: rectangular Haar-like features, calculated over the magnitude of the image gradient. To increase computational efficiency, the L1 norm was used to calculate the magnitude of the image gradient. The PSN-10 inflatable life raft (an example of an object that is detected during rescue operations using UAVs) and oil tank storage (such kind of objects are usually detected during the inspection of industrial infrastructure) are considered as target objects in this work. The performance of the trained detectors was estimated on real data (including data obtained during the real rescue operation of the trawler “Dalniy Vostok” in 2015).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Yu, H., Li, G., Zhang, W., Huang, Q., Du, D., Tian, Q., Sebe, N.: The unmanned aerial vehicle benchmark: Object detection, tracking and baseline. Int. J. Comput. Vis. 128(5), 1141–1159 (2020)

    Article  Google Scholar 

  2. Leira, F.S., Johansen, T.A., Fossen, T.I.: Automatic detection, classification and tracking of objects in the ocean surface from UAVs using a thermal camera. In: 2015 IEEE Aerospace Conference, p. 10. IEEE (2015)

    Google Scholar 

  3. Du, D., et al.: The Unmanned aerial vehicle benchmark: object detection and tracking. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11214, pp. 375–391. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01249-6_23

    Chapter  Google Scholar 

  4. Muraviev, V.S., Smirnov, S.A., Strotov, V.V.: Aerial vehicles detection and recognition for UAV vision system. Comput. Opt. 41(4), 545 (2017)

    Article  Google Scholar 

  5. Granger, E., Kiran, M., Blais-Morin, L.A.: A comparison of CNN-based face and head detectors for real-time video surveillance applications. In: 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA), pp. 1–7. IEEE (2017)

    Google Scholar 

  6. Hulens, D., Goedemé, T., Verbeke, J.: How to choose the best embedded processing platform for on-board UAV image processing? In: The Proceedings VISAPP 2015, pp. 1–10 (2015)

    Google Scholar 

  7. Bertran, E., Sànchez-Cerdà, A.: On the tradeoff between electrical power consumption and flight performance in fixed-wing UAV autopilots. IEEE Trans. Veh. Technol. 65(11), 8832–8840 (2016)

    Article  Google Scholar 

  8. Jain, V., Learned-Miller, E.: Fddb: A benchmark for face detection in unconstrained settings. Technical report, University of Massachusetts, Amherst (2010)

    Google Scholar 

  9. Afanasyev, I.I., Laptev, V.N., Pirogov, V.P.: Analysis of the rescue assets range of the Russian navy. Sci. Bull. Volsk Mil. Inst. Mater. Support: Mil. Sci. J. 2, 150–154 (2015)

    Google Scholar 

  10. Usilin, S.A., Arlazarov, V.V., Rokhlin, N. S., Rudyka, S.A., Matveev, S.A., Zatsarinny A.A.: Training Viola-Jones detectors For 3D objects based on fully synthetic data for use in rescue missions with UAV. In: Bulletin of the South Ural State University. Ser. Mathematical Modelling, Programming and Computer Software (Bulletin SUSU MMCS), vol. 13(4), pp. 96–108 (2020). https://doi.org/10.14529/mmp200408

  11. Cho, J., Lim, G., Biobaku, T., Kim, S., Parsaei, H.: Safety and security management with unmanned aerial vehicle (UAV) in oil and gas industry. Procedia Manuf. 3, 1343–1349 (2015)

    Article  Google Scholar 

  12. Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), vol. 1, pp. 511–518. IEEE (2001)

    Google Scholar 

  13. Viola, P., Jones, M.: Robust real-time object detection. Proc. Int. J. Comput. Vis. 4, 34–47 (2001)

    Google Scholar 

  14. Papageorgiou, C.P., Oren, M., Poggio, T.: A general framework for object detection. In: Proceedings of the Sixth International Conference Computer Vision IEEE Cat No. 98CH36271, vol. 6, pp. 555–562, January 1998

    Google Scholar 

  15. Lewis, J.P.: Fast template matching. In: Proceedings of the Vision Interface, pp. 120–123 (1995)

    Google Scholar 

  16. Kotov, A.A., Usilin, S.A., Gladilin, S.A., Nikolaev, D.P.: Construction of robust features for detection and classification of objects without characteristic brightness contrasts. J. Inf. Technol. Comput. Syst. 1, 53–60 (2014)

    Google Scholar 

  17. Matalov, D.P., Usilin, S.A., Arlazarov, V.V.: Modification of the Viola-Jones approach for the detection of the government seal stamp of the Russian Federation. In: the Proceedings of Eleventh International Conference on Machine Vision (ICMV 2018): International Society for Optics and Photonics, p. 11041 (2019). https://doi.org/10.1117/12.2522793

  18. Gayer, A.V., Chernyshova, Y.S., Sheshkus, A.V.: Effective real-time augmentation of training dataset for the neural networks learning. In: Proceedings of the Eleventh International Conference on Machine Vision (2019). https://doi.org/10.1117/12.2522969

  19. Usilin, S.A., Bezmaternykh, P.V., Arlazarov, V.V.: Fast approach for QR code localization on images using Viola-Jones method. In: Proceedings of the Twelfth International Conference on Machine Vision (2020). https://doi.org/10.1117/12.2559386

  20. Lv, J.J., Shao, X.H., Huang, J.S., Zhou, X.D., Zhou, X.: Data augmentation for face recognition. Neurocomputing 230, 184–196 (2017)

    Article  Google Scholar 

  21. Akimov, A.V., Sirota, A.A.: Synthetic data generation models and algorithms for training image recognition algorithms using the Viola-Jones framework. Comput. Opt. 40(6), 911–918 (2016)

    Article  Google Scholar 

Download references

Acknowledgments

This work is partially supported by the Russian Foundation for Basic Research (projects 18–29-26022 and 18–29-2602).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sergey A. Usilin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Usilin, S.A., Slavin, O.A., Arlazarov, V.V. (2021). Memory Consumption and Computation Efficiency Improvements of Viola-Jones Object Detection Method for UAVs. In: Del Bimbo, A., et al. Pattern Recognition. ICPR International Workshops and Challenges. ICPR 2021. Lecture Notes in Computer Science(), vol 12665. Springer, Cham. https://doi.org/10.1007/978-3-030-68821-9_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-68821-9_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-68820-2

  • Online ISBN: 978-3-030-68821-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics