基于时空双流卷积神经网络的红外行为识别
吴雪平, 孙韶媛, 李佳豪, 李大威. 基于时空双流卷积神经网络的红外行为识别[J]. 应用光学, 2018, 39(5): 743.
Wu Xueping, Sun Shaoyuan, Li Jiahao, Li Dawei. Infrared behavior recognition based on spatio-temporal two-stream convolutional neural networks[J]. Journal of Applied Optics, 2018, 39(5): 743.
[1] LAPTEV I, MARSZALEK M, SCHMID C, et al. Learning realistic human actions from movies[C]// IEEE Conference on Computer Vision and Pattern Recognition.USA: IEEE, 2008:1-8.
[2] LAPTEV I. On space-time interest points[J]. International Journal of Computer Vision, 2005,64 (2/3): 107-123.
[3] DALAL N, TRIGGS B, SCHMID C. Human detection using oriented histograms of flow and appearance[C]. Berlin:Springer, 2006.
[4] KLSER A, MARSZALEK M, SCHMID C. A spatio-temporal descriptor based on 3D-Gradients[C].Leeds:DBLP, 2008.
[5] SCOVANNER P, ALI S, SHAH M. A 3-dimensional sift descriptor and its application to action recognition[C].Leeds:DBLP, 2007:357-360.
[6] WU D, SHAO L. Leveraging hierarchical parametric networks for skeletal joints based action segmentation and recognition[C].USA:IEEE, 2014: 724-731.
[7] DU Y, WANG W, WANG L. Hierarchical recurrent neural network for skeleton based action recognition[C]. USA:IEEE, 2015: 1110-1118.
[8] BOBICK A F, DAVIS J W. The recognition of human movement using temporal templates[J].IEEE Transactions on Pattem Analysis & Machine Intelligence, 2001, 23(3):257-267.
[9] WANG H, SCHMID C. Action recognition with improved trajectories[C].USA:IEEE, 2014:3551-3558.
[10] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[J].International Conference on Neural Information Processing Systems, 2012,60(2):1097-1105.
[11] 许路,赵海涛,孙韶媛. 基于深层卷积神经网络的单目红外图像深度估计[J]. 光学学报,2016,36(7):196-205.
XU Lu, ZHAO Haitao, SUN Shaoyuan. Monocular infrared image depth estimation based on deep convolutional neural network[J]. Acta Optica Sinica, 2016, 36(7): 196-205.
[12] 高凯珺,孙韶媛,姚广顺,赵海涛. 基于深度学习的无人车夜视图像语义分割[J]. 应用光学,2017,38(3):421-428.
[13] TRAN D, BOURDEV L D, FERGUS R, et al. Learning spatiotemporal features with 3D convolutional networks[C]//International Conference on Computer Vision.USA: IEEE, 2015: 4489-4497.
[14] VAROL G, LAPTEV I, SCHMID C. Long-term temporal convolutions for action recognition[J]. IEEE Trans. Pattern Anal. Mach. Intell., 2016, 99:1-1.
[15] SIMONYAN K, ZISSERMAN A. Two-stream convolutional networks for action recognition in videos[J]. Computational Linguistics, 2014, 1(4):568-576.
[16] WANG L, XIONG Y, WANG Z, et al. Temporal segment networks: towards good practices for deep action recognition[J].Acm Transactions on Information Systems, 2016, 22(1):20-36.
[17] FEICHTENHOFER C, PINZ A, WILDES R P, et al. Spatiotemporal residual networks for video action recognition[C]//Neural Information Processing Systems. USA: ARXIV, 2016: 3468-3476.
[18] DIBA A, SHARMA V, VAN GOOL L, et al. Deep temporal linear encoding networks[C]//IEEE Conference on Computer Vision and Pattern Recognition.USA:IEEE, 2017: 1541-1550.
[19] GAO C, DU Y, LIU J, et al. A New dataset and evaluation for infrared action recognition[C].Berlin:Springer, 2015:302-312.
[20] GAO Y, BEIJBOM O, ZHANG N, et al. Compact bilinear pooling[C]//IEEE Conference on Computer Vision and Pattern Recognition. USA:IEEE, 2016:317-326.
[21] IOFFE S, SZEGEDY C. Batch normalization: accelerating deep network training by reducing internal covariate shift[C]//International Conference on Machine Learning.USA:ARXIV, 2015: 448-456.
[22] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]// IEEE Conference on Computer Vision and Pattern Recognition.USA:IEEE, 2016: 770-778.
吴雪平, 孙韶媛, 李佳豪, 李大威. 基于时空双流卷积神经网络的红外行为识别[J]. 应用光学, 2018, 39(5): 743. Wu Xueping, Sun Shaoyuan, Li Jiahao, Li Dawei. Infrared behavior recognition based on spatio-temporal two-stream convolutional neural networks[J]. Journal of Applied Optics, 2018, 39(5): 743.