光学学报, 2014, 34 (12): 1215002, 网络出版: 2014-11-04
基于姿势字典学习的人体行为识别
Human Action Recognition by Leaning Pose Dictionary
图像处理 行为识别 Procrustes形状分析 局部保持投影 稀疏表示 字典学习 image processing behavior recognition procrustes shape analysis local preserving projection sparse representation dictionary learning
摘要
提出一种基于人体轮廓表达的姿势学习框架来进行人体行为识别。通过一种基于Procrustes形状分析和局部保持投影的姿势特征表示方法,从人体运动视频中提取具有平移、旋转和放缩不变性的姿势特征,在保留人体姿势的局部流形结构的同时尽量提取子空间上的判别信息。针对该特征还提出了一种基于姿势字典学习的人体行为识别框架,对每类行为分别学习一个对应于该类的字典,通过串联所有类的字典来得到整个姿势字典;并通过最小重构误差准则来分类测试视频。在Weizmann和MuHAVi-MAS14数据集上的实验结果证实了该方法的识别率高于大部分经典方法。特别是在MuHAVi-MAS14数据集上的识别率对比已有的结果上有巨大的提升。
Abstract
A framework for human action recognition by learning pose dictionary based on human contour representation is proposed. A new pose feature based on Procrustes analysis and local preserving projection is proposed, which can extract shape information from human motion video which is invariant to translation, scaling and rotation. Moreover, it can extract discriminative subspace information when preserving local manifold structure of human pose. After the pose feature is extracted, a human action recognition framework based on pose dictionary learning is proposed. Class-specific dictionaries are trained individually on all training frames of each class to build the whole pose dictionary by concatenating all class-specific dictionaries. The test video is classified with the minimum reconstruction error on the learned dictionary. Experimental results on Weizmann and MuHAVi-MAS14 dataset demonstrate proposed method outperforms most classical methods. Especially, classification rate of this method on MuHAVi-MAS14 dataset achieves a considerable boost compared with that of state-of-the-art approaches.
蔡加欣, 冯国灿, 汤鑫, 罗志宏. 基于姿势字典学习的人体行为识别[J]. 光学学报, 2014, 34(12): 1215002. Cai Jiaxin, Feng Guocan, Tang Xin, Luo Zhihong. Human Action Recognition by Leaning Pose Dictionary[J]. Acta Optica Sinica, 2014, 34(12): 1215002.