Infrared and Laser Engineering, 2018, 47(1): 0126003, published online: 2018-01-30

Object detection method of multi-view SSD based on deep learning
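Tang Cong 1,2,3,*, Ling Yongshun 1,2,3, Zheng Kedong 4, Yang Xing 1,3, Zheng Chao 1,2,3, Yang Hua 1,2,3, Jin Wei 1,2,3
Author affiliations
1 National University of Defense Technology, Hefei 230037, Anhui, China
2 Key Laboratory of Infrared and Low Temperature Plasma of Anhui Province, Hefei 230037, Anhui, China
3 State Key Laboratory of Pulsed Power Laser Technology, Hefei 230037, Anhui, China
4 Unit 31101 of the Chinese People's Liberation Army, Nanjing 210018, Jiangsu, China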
Abstract
An object detection method based on deep learning, the multi-view Single Shot MultiBox Detector (SSD), is proposed. First, the model and working principle of the classical SSD are described. Based on the concept of the convolutional receptive field and the mapping between the feature maps and the original image, the receptive field size at each level and the size of the regions that the default boxes on each feature map cover in the original image are analyzed, which reveals why the classical SSD performs poorly on small objects. On this basis, a multi-view SSD model is proposed, and its architecture and working principle are described. The multi-view SSD and the classical SSD are then evaluated and compared on a dataset of 106 small-object images in terms of object retrieval ability and object detection precision. Experimental results show that, at a confidence threshold of 0.4, the multi-view SSD achieves an Average F-measure (AF) of 0.729 and a mean Average Precision (mAP) of 0.644, improvements of 0.169 and 0.131, respectively, over the classical SSD, which verifies the effectiveness of the proposed method.
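The receptive-field and default-box analysis mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function names are hypothetical, the receptive-field recurrence is the standard one for stacked convolution and pooling layers (ignoring dilation and padding), and the default-box scale rule with s_min = 0.2 and s_max = 0.9 comes from the original SSD formulation, which is assumed here to match the "classical SSD" the authors analyze.

```python
def receptive_field(layers):
    """Return (receptive_field, jump) for a stack of layers.

    `layers` is an iterable of (kernel_size, stride) pairs, ordered from
    the input image towards the feature map. Dilation and padding are
    ignored (assumption made for this sketch).
    """
    rf, jump = 1, 1  # a single input pixel sees itself; pixel spacing is 1
    for kernel, stride in layers:
        rf += (kernel - 1) * jump  # each layer widens the field by (k - 1) * jump
        jump *= stride             # strides multiply the spacing between outputs
    return rf, jump


def default_box_scales(num_maps, s_min=0.2, s_max=0.9):
    """Default-box scales (as fractions of the image side), one per feature map.

    Linear spacing between s_min and s_max, as in the original SSD
    formulation; the defaults are assumptions, not values from this paper.
    """
    if num_maps == 1:
        return [s_min]
    step = (s_max - s_min) / (num_maps - 1)
    return [s_min + step * k for k in range(num_maps)]
```

Under these assumptions, `default_box_scales(6)[0]` is 0.2, so on a 300 x 300 input the smallest default box would cover roughly 60 x 60 pixels. This is one way to see why objects much smaller than that are hard for the classical model to match.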

Tang Cong, Ling Yongshun, Zheng Kedong, Yang Xing, Zheng Chao, Yang Hua, Jin Wei. Object detection method of multi-view SSD based on deep learning[J]. Infrared and Laser Engineering, 2018, 47(1): 0126003.

