红外技术, 2023, 45 (12): 1304, 网络出版: 2024-01-17  

Infrared-PV: 面向监控应用的红外目标检测数据集

Infrared-PV: an Infrared Target Detection Dataset for Surveillance Application
作者单位
1 杭州电子科技大学自动化学院, 浙江杭州 310018
2 中国电子科技集团第 28研究所, 江苏南京 210007
摘要
红外摄像机虽然能够全天候 24 h工作, 但是相比于可见光摄像机, 其获得的红外图像分辨率和信杂比低, 目标纹理信息缺乏, 因此足够的标记图像和进行模型优化设计对于提高基于深度学习的红外目标检测性能具有重要意义。为解决面向监控应用场景的红外目标检测数据集缺乏的问题, 首先采用红外摄像机采集了不同极性的红外图像, 基于自研图像标注软件实现了 VOC格式的图像标注任务, 构建了一个包含行人和车辆两类目标的红外图像数据集( Infrared-PV), 并对数据集中的目标特性进行了统计分析。然后采用主流的基于深度学习的目标检测模型进行了模型训练与测试, 定性和定量分析了 YOLO系列和 Faster R-CNN系列等模型对于该数据集的目标检测性能。构建的红外目标数据集共包含图像 2138张, 场景中目标包含白热、黑热和热力图 3种模式。当采用各模型进行目标检测性能测试时, Cascade R-CNN模型性能最优, mAP0.5值达到了 82.3%, YOLO v5系列模型能够兼顾实时性和检测精度的平衡, 推理速度达到 175.4帧/s的同时 mAP0.5值仅降低 2.7%。构建的红外目标检测数据集能够为基于深度学习的红外图像目标检测模型优化研究提供一定的数据支撑, 同时也可以用于目标的红外特性分析。
Abstract
Although infrared cameras can operate day and night under all-weather conditions compared with visible cameras, the infrared images obtained by them have low resolution and signal-to-clutter ratio, lack of texture information, so enough labeled images and optimization model design have great influence on improving infrared target detection performance based on deep learning. First, to solve the lack of an infrared target detection dataset used for surveillance applications, an infrared camera was used to capture images with multiple polarities, and an image annotation task that outputted the VOC format was performed using our developed annotation software. An infrared image dataset containing two types of targets, person and vehicle, was constructed and named infrared-PV. The characteristics of the targets in this dataset were statistically analyzed. Second, state-of-the-art target detection models based on deep learning were adopted to perform model training and testing. Target detection performances for this dataset were qualitatively and quantitatively analyzed for the YOLO and Faster R-CNN series detection models. The constructed infrared dataset contained 2138 images, and the targets in this dataset included three types of modes: white hot, black hot, and heat map. In the benchmark test using several models, Cascade R-CNN achieves the best performance, where mean average precision when intersection over union exceeding 0.5 1304 (mAP0.5) reaches 82.3%, and YOLOv5 model can achieve the tradeoff between real-time performance and detection performance, where inference time achieves 175.4 frames per second and mAP0.5 drops only 2.7%. The constructed infrared target detection dataset can provide data support for research on infrared image target detection model optimization and can also be used to analyze infrared target characteristics.

陈旭, 吴蔚, 彭冬亮, 谷雨. Infrared-PV: 面向监控应用的红外目标检测数据集[J]. 红外技术, 2023, 45(12): 1304. CHEN Xu, WU Wei, PENG Dongliang, GU Yu. Infrared-PV: an Infrared Target Detection Dataset for Surveillance Application[J]. Infrared Technology, 2023, 45(12): 1304.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!