Skip to main content

Advertisement

Log in

PoseAnalyser: A Survey on Human Pose Estimation

  • Survey Article
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Human pose estimation is the process of detecting the body keypoints of a person and can be used to classify different poses. Many researchers have proposed various ways to get a perfect 2D as well as a 3D human pose estimator that could be applied for various types of applications. This paper is a review of all the state-of-the-art architectures based on human pose estimation, the papers referred were based on the types of computer vision and machine learning algorithms, such as feed-forward neural networks, convolutional neural networks (CNN), OpenPose, MediaPipe, and many more. These different approaches are compared on various parameters, like the type of dataset used, the evaluation metric, etc. Different human pose datasets, such as COCO and MPII activity datasets with keypoints, as well as specific application-based datasets, are reviewed in this survey paper. Researchers may use these architectures and datasets in a range of domains, which are also discussed. The paper analyzes several approaches and architectures that can be used as a guide for other researchers to assist them in developing better techniques to achieve high accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Data Availability

The datasets analysed during the current study are included in this published article. For eg. Section "Dataset" and Table 2 consist of relevant information.

References

  1. Ladicky L, Torr PHS, Zisserman A. Human pose estimation using a joint pixel-wise and part-wise formulation. In: 2013 IEEE conference on computer vision and pattern recognition. 2013.

  2. Nie BX, Xiong C, Zhu S-C. Joint action recognition and pose estimation from video. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR). 2015.

  3. Chen X, Zhou Z, Ying Y, Qi D. Real-time human segmentation using pose skeleton map. In: 2019 Chinese control conference (CCC). 2019.

  4. Andriluka M, Roth S, Schiele B. Pictorial structures revisited: people detection and articulated pose estimation. In: 2009 IEEE conference on computer vision and pattern recognition. 2009.

  5. Zhang F, Zhu X, Wang C. Single person pose estimation: a survey. In: 2021, computer vision and pattern recognition. 2021.

  6. Fieraru M, Khoreva A, Pishchulin L, Schiele B. Learning to refine human pose estimation. In: 2018 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW). 2018.

  7. Chen Y, Wang Z, Peng Y, Zhang Z, Yu G, Sun J. Cascaded pyramid network for multi-person pose estimation. 2018 IEEE/CVF conference on computer vision and pattern recognition. 2018.

  8. Li M, Zhou Z, Li J, Liu X. Bottom-up pose estimation of multiple person with bounding box constraint. In: 2018 24th international conference on pattern recognition (ICPR). 2018.

  9. Jingtian S, Xue C, Yanan L, Jianwen C. 2D human pose estimation from monocular images: a survey. In: 2020 IEEE 3rd international conference on computer and communication engineering technology (CCET). 2020.

  10. Wang S, Ai H, Yamashita T, Lao S. Combined top-down/bottom-up human articulated pose estimation using AdaBoost learning. In: 2010 20th international conference on pattern recognition. 2010.

  11. Deshmukh S, Fernandes F. Classification of asl alphabets and numbers using Orb and fast with brief feature extraction and dimensionality reduction techniques. In: 2021 IEEE Mysore sub section international conference (MysuruCon). 2021.

  12. Strzodka R, Ihrke I, Magnor M. A graphics hardware implementation of the generalized Hough Transform for fast object recognition, scale, and 3D pose detection. In: 12th international conference on image analysis and processing, 2003. Proceedings.

  13. Yuan S, Ping S, Da P, Shaojing Z. Human pose estimation combined with depth information. In: 2016 IEEE advanced information management, communicates, electronic and automation control conference (IMCEC). 2016.

  14. Nunez-Godoy S, Alvear-Puertas V, Realpe-Godoy S, Pujota-Cuascota E, Farinango-Endara H, Navarrete-Insuasti I, Vaca-Chapi F, Rosero-Montalvo P, Peluffo DH. Human-sitting-pose detection using data classification and dimensionality reduction. In: 2016 IEEE ecuador technical chapters meeting (ETCM). 2016.

  15. Bansal K, Gupta AK, Rai S, Bansal B. Pose estimation on 3-D models using ConvNets. In: 2020 6th international conference on signal processing and communication (ICSC). 2020.

  16. Vu HT, Wilkinson RH, Lech M, Cheng E. A hybrid neural network for graph-based human pose estimation from 2D images. IEEE Access. 2020;8:52830–40.

    Article  Google Scholar 

  17. Singh A, Agarwal S, Nagrath P, Saxena A, Thakur N. Human pose estimation using convolutional neural networks. In: 2019 amity international conference on artificial intelligence (AICAI). 2019.

  18. Choi S, Choi S, Kim C. MobileHumanPose: toward real-time 3D human pose estimation in mobile devices. In: 2021 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW). 2021.

  19. Jain A, Tompson J, Andriluka M, Taylor GW, Bregler C. Learning human pose estimation features with convolutional networks. In: Computer vision and pattern recognition. 2013.

  20. Guler RA, Neverova N, Kokkinos I. DensePose: dense human pose estimation in the wild. In: 2018 IEEE/CVF conference on computer vision and pattern recognition. 2018.

  21. Tang H, Wang Q, Chen H. Research on 3D human pose estimation using RGBD camera. In: 2019 IEEE 9th international conference on electronics information and emergency communication (ICEIEC). 2019.

  22. Toshev A, Szegedy C. DeepPose: human pose estimation via deep neural networks. In: 2014 IEEE conference on computer vision and pattern recognition. 2014.

  23. Ouyang W, Chu X, Wang X. Multi-source deep learning for human pose estimation. In: 2014 IEEE conference on computer vision and pattern recognition. 2014.

  24. Jessika HA, Amanda I, Auliya HM. A study on part affinity fields implementation for human pose estimation with deep neural network. In: 2019 international conference of artificial intelligence and information technology (ICAIIT). 2019.

  25. Xiao B, Wu H, Wei Y. Simple baselines for human pose estimation and tracking. In: Computer vision—ECCV 2018. pp. 472–87. 2018.

  26. Artacho B, Savakis A. UniPose: unified human pose estimation in single images and videos. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR). 2020.

  27. Chen L, Yu C, Chen L. A multi-person pose estimation with LSTM for video stream. In: 2019 3rd international conference on electronic information technology and computer engineering (EITCE). 2019.

  28. Li M, Zhou Z, Liu X. Multi-person pose estimation using bounding box constraint and LSTM. IEEE Trans Multim. 2019;21:2653–63.

    Article  Google Scholar 

  29. Luo Y, Ren J, Wang Z, Sun W, Pan J, Liu J, Pang J, Lin L. LSTM pose machines. In: 2018 IEEE/CVF conference on computer vision and pattern recognition. 2018.

  30. Cao Z, Hidalgo G, Simon T, Wei S-E, Sheikh Y. OpenPose: realtime multi-person 2D pose estimation using part affinity fields. IEEE Trans Pattern Anal Mach Intell. 2021;43:172–86.

    Article  Google Scholar 

  31. Cao Z, Simon T, Wei S-E, Sheikh Y. Realtime multi-person 2D pose estimation using part affinity fields. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR). 2017.

  32. Osokin D. Real-time 2D multi-person pose estimation on CPU: lightweight openpose. In: Proceedings of the 8th international conference on pattern recognition applications and methods. 2019.

  33. Bazarevsky V, Grishchenko I, Raveendran K, Zhu T, Zhang F, Grundmann M. BlazePose: on-device real-time body pose tracking. In: CVPR workshop on computer vision for augmented and virtual reality, Seattle, WA, USA, 2020, 2020. 2020.

  34. Guan C. Realtime multi-person 2D pose estimation using ShuffleNet. In: 2019 14th international conference on computer science & education (ICCSE). 2019.

  35. Zhang X, Zhou X, Lin M, Sun J. ShuffleNet: an extremely efficient convolutional neural network for mobile devices. In: 2018 IEEE/CVF conference on computer vision and pattern recognition. 2018.

  36. Sun E, Xiao B, Liu D, Wang J. In: IEEE 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)—Long Beach, CA, USA (2019.6.15–2019.6.20)] 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)—deep high-resolution representation learning for human pose estimation. pp. 5686–96. 2019.

  37. Wang M, Tighe J, Modolo D. Combining detection and tracking for human pose estimation in videos. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR). 2020.

  38. Munea TL, Jembre YZ, Weldegebriel HT, Chen L, Huang C, Yang C. The progress of human pose estimation: a survey and taxonomy of models applied in 2D human pose estimation. IEEE Access. 2020;8:133330–48.

    Article  Google Scholar 

  39. Golda T, Kalb T, Schumann A, Beyerer J. Human pose estimation for real-world crowded scenarios. In: 2019 16th IEEE international conference on advanced video and signal based surveillance (AVSS). 2019.

  40. Kilner J, Guillemaut J-Y, Hilton A. 3D action matching with key-pose detection. In: 2009 IEEE 12th international conference on computer vision workshops, ICCV workshops. 2009.

  41. Rogez G, Weinzaepfel P, Schmid C. LCR-net++: multi-person 2D and 3D pose detection in natural images. IEEE Trans Pattern Anal Mach Intell. 2019;1.

  42. Holte MB, Tran C, Trivedi MM, Moeslund TB. Human pose estimation and activity recognition from multi-view videos: comparative explorations of recent developments. IEEE J Sel Top Signal Process. 2012;6:538–52.

    Article  Google Scholar 

  43. Chaikaew A, Somkuan K, Yuyen T. Thai sign language recognition: an application of deep neural network. In: 2021 joint international conference on digital arts, media and technology with ECTI northern section conference on electrical, electronics, computer and telecommunication engineering. 2021.

  44. Chen Y, Du R, Luo K, Xiao Y. Fall detection system based on real-time pose estimation and SVM. In: 2021 IEEE 2nd international conference on big data, artificial intelligence and internet of things engineering (ICBAIE). 2021.

  45. Belagiannis V, Zisserman A. Recurrent human pose estimation. In: 2017 12th IEEE international conference on automatic face & gesture recognition (FG 2017). 2017.

  46. Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL. Microsoft Coco: common objects in context. In: Computer vision—ECCV 2014. pp. 740–55. 2014.

  47. Andriluka M, Pishchulin L, Gehler P, Schiele B. MPII human pose dataset. In: 2014 IEEE conference on computer vision and pattern recognition (CVPR) 2014.

  48. Sapp B, Taskar B. MODEC: multimodal decomposable models for human pose estimation. In: 2013 IEEE conference on computer vision and pattern recognition. 2013.

  49. Johnson S, Everingham M. Clustered pose and nonlinear appearance models for human pose estimation. In: Proceedings of the British machine vision conference 2010. 2010.

Download references

Acknowledgements

This research was supported by IFM Engineering Private Limited, in association with the Department of Electronics and Telecommunication at Vishwakarma Institute of Technology, Pune. A special thanks to Prof. Vaishali Jabade for her constant support and Mr. Aniket Patil, from the IFM team, for guiding us throughout the research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sakshi Kulkarni.

Ethics declarations

Conflict of Interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kulkarni, S., Deshmukh, S., Fernandes, F. et al. PoseAnalyser: A Survey on Human Pose Estimation. SN COMPUT. SCI. 4, 136 (2023). https://doi.org/10.1007/s42979-022-01567-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-022-01567-2

Keywords

Navigation