Estimating Human Poses Using Deep Learning Model
Year 2023,
Volume: 27 Issue: 5, 1079 - 1087, 18.10.2023
Firgat Muradli
,
Serap Çakar
,
Feyza Cerezci
,
Gülüzar Çit
Abstract
Over the past decade, extensive research has focused on the extraction of 3D human poses from images. The existing datasets must effectively address common challenges related to pose estimation. These datasets serve as valuable resources for evaluating, informing, and comparing different models. Deep learning models have gained widespread adoption and have demonstrated impressive performance across various domains of research and engineering. In this study, we employ these models, leveraging the open-source libraries OpenCV and Keras. To enhance the diversity and complexity of the training and testing process, we utilize the MPII Human Pose dataset. Specifically, we train and test the ResNet50 and VGG16 models using this dataset, resulting in significant improvements. The model's performance is evaluated based on the validation rate of the dataset and the accuracy of our model was 88.8 percent for VGG16 and 67 percent for ResNet50.
References
- G. Pavlakos, X. Zhou, K. G. Derpanis, K. Daniilidis, "Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose", Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, pp. 1263–1272, 2017.
- M. Andriluka, L. Pishchulin, P. Gehler, B. Schiele, “2D Human Pose Estimation: New Benchmark and State of the Art Analysis”, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3686–3693, 2014.
- H. Yasin, U. Iqbal, B. Kruger, A. Weber, J. Gall, "A Dual-Source Approach for 3D Pose Estimation from a Single Image", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 4948–4956, 2016.
- P. F. Felzenszwalb, D. P. Huttenlocher, “Pictorial Structures for Object Recognition,” International Journal of Computer Vision, vol. 61, no. 1, pp. 55–79, 2005.
- C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, "Going Deeper with Convolutions", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1–9, 2015.
- K. He, X. Zhang, S. Ren, J. Sun, "Deep Residual Learning For Image Recognition", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 770–778, 2016.
- B. Tekin, A. Rozantsev, V. Lepetit, P. Fua, "Direct Prediction of 3D Body Poses from Motion Compensated Sequences", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 991–1000, 2016.
- G. Pavlakos, X. Zhou, K. Daniilidis, "Ordinal Depth Supervision for 3D Human Pose Estimation", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7307-7316, 2018.
- H. Y. F. Tung, H. W. Tung, E. Yumer, K. Fragkiadaki, "Self-Supervised Learning of Motion Capture", Advances in Neural Information Processing Systems, pp. 5237–5247, 2017.
- X. Zhou, Q. Huang, X. Sun, X. Xue, Y. Wei, "Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach", Proceedings of the IEEE Conference on Computer Vision, pp. 398–407, 2017.
- A. Kanazawa, M. J. Black, D. W. Jacobs, J. Malik, "End-to-End Recovery of Human Shape and Pose", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7122–7131, 2018.
- T. L. Munea, Y. Jembre, H. Weldegebriel, L. Chen, C. Huang, C. Yang, "The Progress of Human Pose Estimation: A Survey and Taxonomy of Models Applied in 2D Human Pose Estimation", IEEE Access, vol. 8, pp. 133330-133348, 2020.
- H. Chen, X. Jiang, Y. Dai, "Shift Pose: A Lightweight Transformer-like Neural Network for Human Pose Estimation", Sensors 22, vol. 22, no. 19, pp. 7264, 2022.
- S. A. Runing, "An Evaluation of Human Pose Estimation Using a Deep Convolutional Neural Network", Master's Thesis, 2017.
- F. Zhang, X. Zhu, M. Ye, "Fast Human Pose Estimation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3517-3526, 2019.
- V. Belagiannis and A. Zisserman, "Recurrent Human Pose Estimation," 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), Washington, DC, USA, pp. 468-475, 2017.
- S. Xiao, J. Shang, S. Liang, Y. Wei, "Compositional Human Pose Regression", Proceedings of the IEEE International Conference on Computer Vision, pp. 2602-2611, 2017.
- J. Carreira, P. Agrawal, K. Fragkiadaki, J. Malik, "Human Pose Estimation with Iterative Error Feedback", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4733-4742. 2016.
- A. Newell, K. Yang, J. Deng, "Stacked Hourglass Networks For Human Pose Estimation", Lecture Notes Computer Science (Including Subseries Lecture Notes Artificial Intelligence Lecture Notes in Bioinformatics), vol. 9912 LNCS, pp. 483–499, 2016.
- W. Li, Z. Wang, B. Yin, Q. Peng, Y. Du, T. Xiao, G. Yu, H. Lu, Y. Wei, J. Sun, "Rethinking on Multi-Stage Networks for Human Pose Estimation", January 2019, [Online]. Available: http://arxiv.org/abs/1901.00148.
- H. Zhang, H. Ouyang, S. Liu, X. Qi, X. Shen, R. Yang, J. Jia, "Human Pose Estimation with Spatial Contextual Information", January 2019, [Online]. Available: http://arxiv.org/abs/1901.01760.
- K. Sun, B. Xiao, D. Liu, J. Wang, "Deep High-Resolution Representation Learning for Human Pose Estimation", [Online]. Available: https://github.com/leoxiaobin/, Last Accessed :05.11.2022
Year 2023,
Volume: 27 Issue: 5, 1079 - 1087, 18.10.2023
Firgat Muradli
,
Serap Çakar
,
Feyza Cerezci
,
Gülüzar Çit
References
- G. Pavlakos, X. Zhou, K. G. Derpanis, K. Daniilidis, "Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose", Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, pp. 1263–1272, 2017.
- M. Andriluka, L. Pishchulin, P. Gehler, B. Schiele, “2D Human Pose Estimation: New Benchmark and State of the Art Analysis”, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3686–3693, 2014.
- H. Yasin, U. Iqbal, B. Kruger, A. Weber, J. Gall, "A Dual-Source Approach for 3D Pose Estimation from a Single Image", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 4948–4956, 2016.
- P. F. Felzenszwalb, D. P. Huttenlocher, “Pictorial Structures for Object Recognition,” International Journal of Computer Vision, vol. 61, no. 1, pp. 55–79, 2005.
- C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, "Going Deeper with Convolutions", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1–9, 2015.
- K. He, X. Zhang, S. Ren, J. Sun, "Deep Residual Learning For Image Recognition", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 770–778, 2016.
- B. Tekin, A. Rozantsev, V. Lepetit, P. Fua, "Direct Prediction of 3D Body Poses from Motion Compensated Sequences", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 991–1000, 2016.
- G. Pavlakos, X. Zhou, K. Daniilidis, "Ordinal Depth Supervision for 3D Human Pose Estimation", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7307-7316, 2018.
- H. Y. F. Tung, H. W. Tung, E. Yumer, K. Fragkiadaki, "Self-Supervised Learning of Motion Capture", Advances in Neural Information Processing Systems, pp. 5237–5247, 2017.
- X. Zhou, Q. Huang, X. Sun, X. Xue, Y. Wei, "Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach", Proceedings of the IEEE Conference on Computer Vision, pp. 398–407, 2017.
- A. Kanazawa, M. J. Black, D. W. Jacobs, J. Malik, "End-to-End Recovery of Human Shape and Pose", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7122–7131, 2018.
- T. L. Munea, Y. Jembre, H. Weldegebriel, L. Chen, C. Huang, C. Yang, "The Progress of Human Pose Estimation: A Survey and Taxonomy of Models Applied in 2D Human Pose Estimation", IEEE Access, vol. 8, pp. 133330-133348, 2020.
- H. Chen, X. Jiang, Y. Dai, "Shift Pose: A Lightweight Transformer-like Neural Network for Human Pose Estimation", Sensors 22, vol. 22, no. 19, pp. 7264, 2022.
- S. A. Runing, "An Evaluation of Human Pose Estimation Using a Deep Convolutional Neural Network", Master's Thesis, 2017.
- F. Zhang, X. Zhu, M. Ye, "Fast Human Pose Estimation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3517-3526, 2019.
- V. Belagiannis and A. Zisserman, "Recurrent Human Pose Estimation," 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), Washington, DC, USA, pp. 468-475, 2017.
- S. Xiao, J. Shang, S. Liang, Y. Wei, "Compositional Human Pose Regression", Proceedings of the IEEE International Conference on Computer Vision, pp. 2602-2611, 2017.
- J. Carreira, P. Agrawal, K. Fragkiadaki, J. Malik, "Human Pose Estimation with Iterative Error Feedback", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4733-4742. 2016.
- A. Newell, K. Yang, J. Deng, "Stacked Hourglass Networks For Human Pose Estimation", Lecture Notes Computer Science (Including Subseries Lecture Notes Artificial Intelligence Lecture Notes in Bioinformatics), vol. 9912 LNCS, pp. 483–499, 2016.
- W. Li, Z. Wang, B. Yin, Q. Peng, Y. Du, T. Xiao, G. Yu, H. Lu, Y. Wei, J. Sun, "Rethinking on Multi-Stage Networks for Human Pose Estimation", January 2019, [Online]. Available: http://arxiv.org/abs/1901.00148.
- H. Zhang, H. Ouyang, S. Liu, X. Qi, X. Shen, R. Yang, J. Jia, "Human Pose Estimation with Spatial Contextual Information", January 2019, [Online]. Available: http://arxiv.org/abs/1901.01760.
- K. Sun, B. Xiao, D. Liu, J. Wang, "Deep High-Resolution Representation Learning for Human Pose Estimation", [Online]. Available: https://github.com/leoxiaobin/, Last Accessed :05.11.2022