光电技术应用, 2019, 34 (6): 40, 网络出版: 2019-12-08  

基于多信息卷积神经网络的人群密度估计

Crowd Density Estimation Based on Multi-information via Convolutional Neural Network
作者单位
四川大学 电子信息学院, 成都 610065
摘要
针对现有人群密度估计方法在实际应用中容易受到环境干扰、适应性不强等限制, 提出一种基于多信息卷积神经网络的人群密度估计方法。首先, 根据样本数据的特点生成数据密度图标注与数据增强; 然后, 为适应不同真实场景的巨大差异, 提取图像的色调、色饱和度、灰度(HSG)信息作为训练数据的输入, 并利用共享卷积层、结合两个子网络不同卷积深度的特征构建网络模型; 最后, 对网络输出的密度图进行积分, 得到相应的人数。与主流方法对比, 在Shanghaitech数据集上进行的相关实验证明了所提方法的良好性能。
Abstract
In view of the limitations of existing crowd density estimation methods in practical application, such as environmental interference and weak adaptability, a crowd density estimation method based on multi-information via convolution neural network is proposed. Firstly, data density annotation and data augmentation are implemented according to the characteristics of samples. And then, in order to adapt to different real scenarios with large variations, the proposed method is used to extract the hue, saturation and gray level (HSG) information of images as the input of training data, and the shared convolutional layers are utilized, combination of features from two subnetworks with different convolution depth to construct the network model. Finally, the output density map is integrated to get the corresponding number of people. Compared with the typical methods, experiments on Shanhaitech datasets demonstrates the great performance of the proposed method.
参考文献

[1] Idrees H, Saleemi I, Seibert C, et al. Multi-source multi-scale counting in extremely dense crowd images[C]//Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE, 2013: 2547-2554.

[2] Chan A B, Liang Z S J, Vasconcelos N. Privacy preserving crowd monitoring: Counting people without people models or tracking[C]//Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Computer Society, 2008: 1-7.

[3] Brostow G J, Cipolla R. Unsupervised Bayesian Detection of Independent Motion in Crowds[C]//Proceedings of the 2006 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Computer Society, 2006: 594-601.

[4] Chen K, Loy C C, Gong S, et al. Feature mining for localised crowd counting[C]//Proceedings of the 2012 BMVC. Guildford: BMVA 2012, 1(2): 3.

[5] Dalal N,Triggs B. Histograms of oriented gradients for human detection[C]//Proceedings of the 2005 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Computer Society, 2005.

[6] Ojala T, Pietikainen M, Maenpaa T. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7): 971-987.

[7] Lowe D G. Object recognition from local scale-invariant features[C]//Proceedings of the Seventh IEEE International Conference on Computer Vision. New York: IEEE, 1999: 1150-1157.

[8] Zhang Y, Zhou D, Chen S, et al. Single-Image Crowd Counting via Multi-Column Convolutional Neural Network[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Computer Society, 2016: 589-597.

[9] Boominathan L, Kruthiventi S S, Babu R V. CrowdNet: A Deep Convolutional Network for Dense Crowd Counting[C]//Proceedings of the 24th ACM international conference on Multimedia. The Netherlands: ACM, 2016: 640-644.

[10] Sam D B, Surya S, Babu R V. Switching convolutional neural network for crowd counting[C]// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Computer Society, 2017, 1(3): 6.

[11] Sindagi V A, Patel V M. CNN-Based cascaded multi-task learning of high-level prior and density estimation for crowd counting[C]// Proceedings of the 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance. New York: IEEE, 2017: 1-6.

[12] Zhang C, Li H, Wang X, et al. Cross-scene crowd counting via deep convolutional neural networks[C]// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Computer Society, 2015: 833-841.

[13] Sheng B, Shen C, Lin G, et al. Crowd Counting via Weighted VLAD on a Dense Attribute Feature Map[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2018, 28(8): 1788-1797.

[14] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J]. arXiv, 2014: 1409.1556.

[15] Marsden M, McGuinness K, Little S, et al. Fully Convolutional Crowd Counting On Highly Congested Scenes[J]. arXiv, 2016: 1612.00220.

[16] Fan C, Tang J, Wang N, et al. Rich convolutional features fusion for crowd counting[C]// Proceedings of the 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition. New York: IEEE, 2018: 394-398.

赵威, 吴晓红, 刘文璨, 何小海, 卿粼波. 基于多信息卷积神经网络的人群密度估计[J]. 光电技术应用, 2019, 34(6): 40. ZHAO Wei, WU Xiao-hong, LIU Wen-can, HE Xiao-hai, QING Lin-bo. Crowd Density Estimation Based on Multi-information via Convolutional Neural Network[J]. Electro-Optic Technology Application, 2019, 34(6): 40.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!