光学 精密工程, 2020, 28 (12): 2700, 网络出版: 2021-01-19   

模态自适应权值学习机制下的多光谱行人检测网络

M u ltisp ectral pedestrian d etection netw ork u n d er m od al ad aptive w eigh t learning m echan ism
作者单位
江南大学轻工过程先进控制教育部重点实验室, 江苏无锡 214122
摘要
针对目前基于红外与可见光模态融合的行人检测方法难以自适应外界环境变化的问题, 提出基于多模态信息融合权值学习的行人检测网络。首先, 区别于目前大多数研究采用的两模态直接堆叠融合方法, 权值学习融合网络考虑两种模态在不同环境条件下对行人检测任务的不同贡献比重, 通过双流交互学习二者差异, 然后根据各模态特征的当前特性自主获得各模态特征的相应权重, 进行加权融合得到融合特征, 最后基于融合特征生成新的特征金字塔, 并改变先验框的尺寸和密集度以丰富行人先验信息, 完成行人检测任务。实验结果表明: 在 Kaist多光谱行人检测数据集上获得 26. 96%的平均漏检率, 相比目前采用直接堆叠的最优方法以及 baseline方法分别降低了 2. 77%和 27. 84%, 因此自适应权值融合红外和可见光两种模态的信息可以有效获得互补的模态信息以自适应外界环境变化, 大幅提升行人检测的性能。
Abstract
A pedestrian detection network based on the weight learning of fusing multimodal information was developed to address the issues of the pedestrian detection method based on infrared and visible modal fusion in adapting to changes in the external environment. First, unlike the fusion method used in several recent studies in which two modalities are stacked directly, the weight learning fusion network reflects dif. ferent contributions of the modalities to the pedestrian detection task under different environmental condi. tions. The differences between the two modalities were determined through dual-stream interaction learn. ing. Next, based on the current characteristics of each modal feature, the weight learning fusion network assigned the corresponding weights to each modal feature to generate the fusion feature by performing weighted fusion autonomously. Finally, a new feature pyramid based on the fusion feature was generated, and previous information about the pedestrian was improved by changing the size and density of prior boxes to complete the pedestrian detection task. The experimental results indicated that the log-average miss rate of the Kaist multispectral pedestrian detection dataset reached 26. 96%, which was 2. 77% and 27. 84% lower than that of the direct stacking method and baseline method, respectively. The adaptive weight fu. sion of infrared and visible modal information could effectively be used to obtain complementary modal in. formation to adapt to external environmental changes and significantly improve pedestrian detection perfor. mance.

陈莹, 朱宇. 模态自适应权值学习机制下的多光谱行人检测网络[J]. 光学 精密工程, 2020, 28(12): 2700. CHEN Ying, ZHU Yu. M u ltisp ectral pedestrian d etection netw ork u n d er m od al ad aptive w eigh t learning m echan ism[J]. Optics and Precision Engineering, 2020, 28(12): 2700.

本文已被 3 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!