应用光学, 2018, 39 (5): 743, 网络出版: 2018-10-06  

基于时空双流卷积神经网络的红外行为识别

Infrared behavior recognition based on spatio-temporal two-stream convolutional neural networks
作者单位
1 东华大学 信息科学与技术学院, 上海 201620
2 东华大学 数字化纺织服装技术教育部工程研究中心, 上海 201620
引用该论文

吴雪平, 孙韶媛, 李佳豪, 李大威. 基于时空双流卷积神经网络的红外行为识别[J]. 应用光学, 2018, 39(5): 743.

Wu Xueping, Sun Shaoyuan, Li Jiahao, Li Dawei. Infrared behavior recognition based on spatio-temporal two-stream convolutional neural networks[J]. Journal of Applied Optics, 2018, 39(5): 743.

参考文献

[1] LAPTEV I, MARSZALEK M, SCHMID C, et al. Learning realistic human actions from movies[C]// IEEE Conference on Computer Vision and Pattern Recognition.USA: IEEE, 2008:1-8.

[2] LAPTEV I. On space-time interest points[J]. International Journal of Computer Vision, 2005,64 (2/3): 107-123.

[3] DALAL N, TRIGGS B, SCHMID C. Human detection using oriented histograms of flow and appearance[C]. Berlin:Springer, 2006.

[4] KLSER A, MARSZALEK M, SCHMID C. A spatio-temporal descriptor based on 3D-Gradients[C].Leeds:DBLP, 2008.

[5] SCOVANNER P, ALI S, SHAH M. A 3-dimensional sift descriptor and its application to action recognition[C].Leeds:DBLP, 2007:357-360.

[6] WU D, SHAO L. Leveraging hierarchical parametric networks for skeletal joints based action segmentation and recognition[C].USA:IEEE, 2014: 724-731.

[7] DU Y, WANG W, WANG L. Hierarchical recurrent neural network for skeleton based action recognition[C]. USA:IEEE, 2015: 1110-1118.

[8] BOBICK A F, DAVIS J W. The recognition of human movement using temporal templates[J].IEEE Transactions on Pattem Analysis & Machine Intelligence, 2001, 23(3):257-267.

[9] WANG H, SCHMID C. Action recognition with improved trajectories[C].USA:IEEE, 2014:3551-3558.

[10] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[J].International Conference on Neural Information Processing Systems, 2012,60(2):1097-1105.

[11] 许路,赵海涛,孙韶媛. 基于深层卷积神经网络的单目红外图像深度估计[J]. 光学学报,2016,36(7):196-205.

    XU Lu, ZHAO Haitao, SUN Shaoyuan. Monocular infrared image depth estimation based on deep convolutional neural network[J]. Acta Optica Sinica, 2016, 36(7): 196-205.

[12] 高凯珺,孙韶媛,姚广顺,赵海涛. 基于深度学习的无人车夜视图像语义分割[J]. 应用光学,2017,38(3):421-428.

    GAO Kaijun, SUN Shaoyuan,YAO Guangshun,et al. Semantic segmentation of night vision images for unmanned vehicles based on deep learning[J]. Journal of Applied Optics, 2017,38(3):421-428. .

[13] TRAN D, BOURDEV L D, FERGUS R, et al. Learning spatiotemporal features with 3D convolutional networks[C]//International Conference on Computer Vision.USA: IEEE, 2015: 4489-4497.

[14] VAROL G, LAPTEV I, SCHMID C. Long-term temporal convolutions for action recognition[J]. IEEE Trans. Pattern Anal. Mach. Intell., 2016, 99:1-1.

[15] SIMONYAN K, ZISSERMAN A. Two-stream convolutional networks for action recognition in videos[J]. Computational Linguistics, 2014, 1(4):568-576.

[16] WANG L, XIONG Y, WANG Z, et al. Temporal segment networks: towards good practices for deep action recognition[J].Acm Transactions on Information Systems, 2016, 22(1):20-36.

[17] FEICHTENHOFER C, PINZ A, WILDES R P, et al. Spatiotemporal residual networks for video action recognition[C]//Neural Information Processing Systems. USA: ARXIV, 2016: 3468-3476.

[18] DIBA A, SHARMA V, VAN GOOL L, et al. Deep temporal linear encoding networks[C]//IEEE Conference on Computer Vision and Pattern Recognition.USA:IEEE, 2017: 1541-1550.

[19] GAO C, DU Y, LIU J, et al. A New dataset and evaluation for infrared action recognition[C].Berlin:Springer, 2015:302-312.

[20] GAO Y, BEIJBOM O, ZHANG N, et al. Compact bilinear pooling[C]//IEEE Conference on Computer Vision and Pattern Recognition. USA:IEEE, 2016:317-326.

[21] IOFFE S, SZEGEDY C. Batch normalization: accelerating deep network training by reducing internal covariate shift[C]//International Conference on Machine Learning.USA:ARXIV, 2015: 448-456.

[22] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]// IEEE Conference on Computer Vision and Pattern Recognition.USA:IEEE, 2016: 770-778.

吴雪平, 孙韶媛, 李佳豪, 李大威. 基于时空双流卷积神经网络的红外行为识别[J]. 应用光学, 2018, 39(5): 743. Wu Xueping, Sun Shaoyuan, Li Jiahao, Li Dawei. Infrared behavior recognition based on spatio-temporal two-stream convolutional neural networks[J]. Journal of Applied Optics, 2018, 39(5): 743.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!