激光与光电子学进展, 2024, 61 (4): 0428005, 网络出版: 2024-02-26  

DeepLabV3_DHC:城市无人机遥感图像语义分割

DeepLabV3_DHC: Semantic Segmentation of Urban Unmanned Aerial Vehicle Remote Sensing Image
作者单位
1 重庆邮电大学计算机科学与技术学院重庆空间大数据智能技术工程研究中心,重庆 400065
2 昆明理工大学信息工程与自动化学院,云南 昆明 650500
摘要
高分辨率无人机遥感图像具有极为丰富的语义和地物特征,在语义分割中容易出现目标分割不全、边缘信息缺失、分割精度不足等问题。为了解决上述问题,基于DeepLabV3_plus模型提出改进的DeepLabV3_DHC。首先,利用多种主干网络进行下采样,采集图像的低级特征和高级特征。其次,将原模型的atrous spatial pyramid pooling(ASPP)全部替换成深度可分离混合空洞卷积,同时添加自适应系数,减弱网格效应。之后,抛弃传统上采样的双线性插值法,替换为可学习的密集上采样卷积。最后,在低级特征中串联注意机制。选用多种主干网络进行实验,数据集选用四川省隆昌市地区的部分图像,采用平均交并比和类别平均像素准确率作为评价指标。实验结果表明:所提方法不仅具有较高的分割精度,而且减少了计算量和参数量。
Abstract
High-resolution unmanned aerial vehicle remote sensing images have extremely rich semantic and ground feature features, which are prone to problems such as incomplete target segmentation, missing edge information, and insufficient segmentation accuracy in semantic segmentation. To solve the above problems, based on DeepLabV3_plus model, an improved DeepLabV3_ DHC is proposed. First of all, multiple backbone networks are used for down-sampling to collect low-level and high-level features of the image. Second, the atrous spatial pyramid pooling (ASPP) of the original model is replaced by a depthwise separable hybrid dilated convolution, and an adaptive coefficient is added to weaken the mesh effect. After that, the traditional sampling bilinear interpolation method is abandoned and replaced by the learnable dense upsampling convolution. Finally, cascade attention mechanism in low-level features. In this paper, a variety of backbone networks are selected for the experiment, and some images of Longchang City, Sichuan Province are selected for the dataset. The evaluation index uses the average intersection and combination ratio and the average pixel accuracy of the category as the reference basis. The experimental results show that the method in this paper not only has higher segmentation accuracy, but also reduces the amount of computation and parameters.

孙国文, 罗小波, 张坤强. DeepLabV3_DHC:城市无人机遥感图像语义分割[J]. 激光与光电子学进展, 2024, 61(4): 0428005. Guowen Sun, Xiaobo Luo, Kunqiang Zhang. DeepLabV3_DHC: Semantic Segmentation of Urban Unmanned Aerial Vehicle Remote Sensing Image[J]. Laser & Optoelectronics Progress, 2024, 61(4): 0428005.

引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!