激光与光电子学进展, 2022, 59 (12): 1210016, 网络出版: 2022-05-23   

基于多尺度特征引导的细粒度野生菌图像识别

Fine-Grained Image Recognition of Wild Mushroom Based on Multiscale Feature Guide
作者单位
云南大学信息学院,云南 昆明 650500
摘要
鉴于国内野生菌中毒事件频发,拟采用深度学习技术来解决这一社会问题。但由于野生菌图像的类间差异较小,图像背景复杂,细粒度识别精度较低。针对这一问题,提出了一种改进的ResNeXt50网络。首先设计了一种多尺度特征引导(MSFG)模块,通过短连接引导网络充分学习和利用低级特征与高级特征;接着采用改进的注意力机制模块来减少网络对复杂背景的学习;最后对模型中的不同层级特征进行融合,利用得到的联合特征进行识别。实验结果表明,所提改进模型在测试集上的准确率可以达到96.47%,较未改进的ResNeXt50网络,在准确率上提升了2.64个百分点。对比结果显示,相较VGG19、DenseNet121、Inception_v3、ResNet50、ShuffleNet_v2这5个网络模型,所提改进模型在准确率上分别提升了8.10个百分点、5.13个百分点、3.24个百分点、3.30个百分点、4.25个百分点。
Abstract
Deep learning technology is proposed to solve the social problem of the frequent occurrences of wild mushroom poisoning in China. However, due to the small difference between classes and complex image backgrounds, fine-grained recognition accuracy is low. To solve this problem, this paper proposes an improved ResNeXt50 network. First, a multiscale feature guide (MSFG) module is designed, which guides the network to learn and use low and high-level features fully through short connections. Then, the improved attention mechanism module is used to reduce the network’s learning for complex backgrounds. Finally, the different hierarchical features in the model are fused, and the obtained joint features are used for recognition. Experimental results show that the accuracy of the proposed network on the test set can reach 96.47%, which is 2.64 percentage points higher than the unimproved ResNeXt50 network. Comparison results show that the accuracy of the improved network model is 8.10 percentage points, 5.13 percentage points, 3.24 percentage points, 3.30 percentage points, and 4.25 percentage points better than VGG19, DenseNet121, Inception_v3, ResNet50, and ShuffleNet_v2, respectively.

张志刚, 余鹏飞, 李海燕, 李红松. 基于多尺度特征引导的细粒度野生菌图像识别[J]. 激光与光电子学进展, 2022, 59(12): 1210016. Zhigang Zhang, Pengfei Yu, Haiyan Li, Hongsong Li. Fine-Grained Image Recognition of Wild Mushroom Based on Multiscale Feature Guide[J]. Laser & Optoelectronics Progress, 2022, 59(12): 1210016.

本文已被 3 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!