基于视频的实时多人姿态估计方法 下载: 1448次
闫芬婷, 王鹏, 吕志刚, 丁哲, 乔梦雨. 基于视频的实时多人姿态估计方法[J]. 激光与光电子学进展, 2020, 57(2): 021006.
Yan Fenting, Wang Peng, Lü Zhigang, Ding Zhe, Qiao Mengyu. Real-Time Multi-Person Video-Based Pose Estimation[J]. Laser & Optoelectronics Progress, 2020, 57(2): 021006.
[1] YanS, XiongY, LinD. Spatial temporal graph convolutional networks for skeleton-based action recognition[C]∥Thirty-Second AAAI Conference on Artificial Intelligence, February 2-7, 2018, Hilton New Orleans Riverside, New Orleans, Louisiana, USA. USA: AAAI, 2018: 7444- 7452.
[2] 姜明星, 胡敏, 王晓华, 等. 视频序列中表情和姿态的双模态情感识别[J]. 激光与光电子学进展, 2018, 55(7): 071004.
[3] ToshevA, SzegedyC. DeepPose: human pose estimation via deep neural networks[C]∥2014 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA. New York: IEEE, 2014: 1653- 1660.
[4] Fan XC, ZhengK, Lin YW, et al. Combining local appearance and holistic view: dual-source deep neural networks for human pose estimation[C]∥2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA. New York: IEEE, 2015: 1347- 1355.
[5] CarreiraJ, AgrawalP, FragkiadakiK, et al. Human pose estimation with iterative error feedback[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 2016: 4733- 4742.
[6] YangW, LiS, Ouyang WL, et al. Learning feature pyramids for human pose estimation[C]∥2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2017: 1290- 1299.
[7] NewellA, Yang KY, DengJ. Stacked hourglass networks for human pose estimation[M] ∥Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science. Cham: Springer, 2016, 9912: 483- 499.
[8] Tompson JJ, JainA, LeCun Y, et al. Joint training of a convolutional network and a graphical model for human pose estimation[C]∥Advances in Neural Information Processing Systems 27 (NIPS 2014), December 8-13, 2014, Montreal, Quebec, Canada. Canada: NIPS, 2014.
[9] YangW, Ouyang WL, Li HS, et al. End-to-end learning of deformable mixture of parts and deep convolutional neural networks for human pose estimation[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 2016: 3073- 3082.
[10] CaoZ, SimonT, Wei SE, et al. Realtime multi-person 2D pose estimation using part affinity fields[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA. New York: IEEE, 2017: 1302- 1310.
[11] He KM, GkioxariG, DollarP, et al. Mask R-CNN[C]∥2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2017: 2980- 2988.
[12] Ren S Q, He K M, Girshick R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.
[13] 冯小雨, 梅卫, 胡大帅. 基于改进Faster R-CNN的空中目标检测[J]. 光学学报, 2018, 38(6): 0615004.
[14] Fang HS, Xie SQ, Tai YW, et al. RMPE: regional multi-person pose estimation[C]∥2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2017: 2353- 2362.
[15] RedmonJ, Farhadi A. YOLOv3: an incremental improvement[J/OL]. ( 2018-04-08)[2019-05-16]. https:∥arxiv.org/abs/1804. 02767.
[16] 魏湧明, 全吉成, 侯宇青阳. 基于YOLO v2的无人机航拍图像定位研究[J]. 激光与光电子学进展, 2017, 54(11): 111002.
[17] Chen LC, Zhu YK, PapandreouG, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[M] ∥Ferrari V, Hebert M, Sminchisescu C, et al. Computer vision-ECCV 2018. Lecture notes in computer science. Cham: Springer, 2018, 11211: 833- 851.
[18] ShrivastavaA, GuptaA, GirshickR. Training region-based object detectors with online hard example mining[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 2016: 761- 769.
闫芬婷, 王鹏, 吕志刚, 丁哲, 乔梦雨. 基于视频的实时多人姿态估计方法[J]. 激光与光电子学进展, 2020, 57(2): 021006. Yan Fenting, Wang Peng, Lü Zhigang, Ding Zhe, Qiao Mengyu. Real-Time Multi-Person Video-Based Pose Estimation[J]. Laser & Optoelectronics Progress, 2020, 57(2): 021006.