激光与光电子学进展, 2020, 57 (2): 021006, 网络出版: 2020-01-03   

基于视频的实时多人姿态估计方法 下载: 1448次

Real-Time Multi-Person Video-Based Pose Estimation
作者单位
西安工业大学电子信息工程学院, 陕西 西安 710021
引用该论文

闫芬婷, 王鹏, 吕志刚, 丁哲, 乔梦雨. 基于视频的实时多人姿态估计方法[J]. 激光与光电子学进展, 2020, 57(2): 021006.

Yan Fenting, Wang Peng, Lü Zhigang, Ding Zhe, Qiao Mengyu. Real-Time Multi-Person Video-Based Pose Estimation[J]. Laser & Optoelectronics Progress, 2020, 57(2): 021006.

参考文献

[1] YanS, XiongY, LinD. Spatial temporal graph convolutional networks for skeleton-based action recognition[C]∥Thirty-Second AAAI Conference on Artificial Intelligence, February 2-7, 2018, Hilton New Orleans Riverside, New Orleans, Louisiana, USA. USA: AAAI, 2018: 7444- 7452.

[2] 姜明星, 胡敏, 王晓华, 等. 视频序列中表情和姿态的双模态情感识别[J]. 激光与光电子学进展, 2018, 55(7): 071004.

    Jiang M X, Hu M, Wang X H, et al. Dual-modal emotion recognition based on facial expression and body posture in video sequences[J]. Laser & Optoelectronics Progress, 2018, 55(7): 071004.

[3] ToshevA, SzegedyC. DeepPose: human pose estimation via deep neural networks[C]∥2014 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA. New York: IEEE, 2014: 1653- 1660.

[4] Fan XC, ZhengK, Lin YW, et al. Combining local appearance and holistic view: dual-source deep neural networks for human pose estimation[C]∥2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA. New York: IEEE, 2015: 1347- 1355.

[5] CarreiraJ, AgrawalP, FragkiadakiK, et al. Human pose estimation with iterative error feedback[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 2016: 4733- 4742.

[6] YangW, LiS, Ouyang WL, et al. Learning feature pyramids for human pose estimation[C]∥2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2017: 1290- 1299.

[7] NewellA, Yang KY, DengJ. Stacked hourglass networks for human pose estimation[M] ∥Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science. Cham: Springer, 2016, 9912: 483- 499.

[8] Tompson JJ, JainA, LeCun Y, et al. Joint training of a convolutional network and a graphical model for human pose estimation[C]∥Advances in Neural Information Processing Systems 27 (NIPS 2014), December 8-13, 2014, Montreal, Quebec, Canada. Canada: NIPS, 2014.

[9] YangW, Ouyang WL, Li HS, et al. End-to-end learning of deformable mixture of parts and deep convolutional neural networks for human pose estimation[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 2016: 3073- 3082.

[10] CaoZ, SimonT, Wei SE, et al. Realtime multi-person 2D pose estimation using part affinity fields[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA. New York: IEEE, 2017: 1302- 1310.

[11] He KM, GkioxariG, DollarP, et al. Mask R-CNN[C]∥2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2017: 2980- 2988.

[12] Ren S Q, He K M, Girshick R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.

[13] 冯小雨, 梅卫, 胡大帅. 基于改进Faster R-CNN的空中目标检测[J]. 光学学报, 2018, 38(6): 0615004.

    Feng X Y, Mei W, Hu D S. Aerial target detection based on improved Faster R-CNN[J]. Acta Optica Sinica, 2018, 38(6): 0615004.

[14] Fang HS, Xie SQ, Tai YW, et al. RMPE: regional multi-person pose estimation[C]∥2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2017: 2353- 2362.

[15] RedmonJ, Farhadi A. YOLOv3: an incremental improvement[J/OL]. ( 2018-04-08)[2019-05-16]. https:∥arxiv.org/abs/1804. 02767.

[16] 魏湧明, 全吉成, 侯宇青阳. 基于YOLO v2的无人机航拍图像定位研究[J]. 激光与光电子学进展, 2017, 54(11): 111002.

    Wei Y M, Quan J C, Houyu Q Y. Aerial image location of unmanned aerial vehicle based on YOLO v2[J]. Laser & Optoelectronics Progress, 2017, 54(11): 111002.

[17] Chen LC, Zhu YK, PapandreouG, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[M] ∥Ferrari V, Hebert M, Sminchisescu C, et al. Computer vision-ECCV 2018. Lecture notes in computer science. Cham: Springer, 2018, 11211: 833- 851.

[18] ShrivastavaA, GuptaA, GirshickR. Training region-based object detectors with online hard example mining[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 2016: 761- 769.

闫芬婷, 王鹏, 吕志刚, 丁哲, 乔梦雨. 基于视频的实时多人姿态估计方法[J]. 激光与光电子学进展, 2020, 57(2): 021006. Yan Fenting, Wang Peng, Lü Zhigang, Ding Zhe, Qiao Mengyu. Real-Time Multi-Person Video-Based Pose Estimation[J]. Laser & Optoelectronics Progress, 2020, 57(2): 021006.

本文已被 4 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!